Query psy7765
Match_columns 77
No_of_seqs 133 out of 1008
Neff 8.2
Searched_HMMs 46136
Date Fri Aug 16 18:15:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7765.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7765hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2100|consensus 99.6 1.6E-15 3.4E-20 111.2 5.5 73 2-76 646-718 (755)
2 KOG2281|consensus 99.5 2E-14 4.3E-19 103.6 5.7 73 2-76 764-838 (867)
3 COG1506 DAP2 Dipeptidyl aminop 99.4 8.9E-14 1.9E-18 100.1 4.7 74 2-76 509-587 (620)
4 PF00326 Peptidase_S9: Prolyl 99.4 8.3E-13 1.8E-17 83.2 4.4 59 15-76 120-180 (213)
5 PRK10115 protease 2; Provision 98.9 2.2E-09 4.8E-14 78.5 3.8 59 15-76 581-642 (686)
6 PLN02442 S-formylglutathione h 98.6 6.9E-08 1.5E-12 63.9 5.0 64 10-76 190-254 (283)
7 PRK13604 luxD acyl transferase 98.3 9.9E-07 2.2E-11 59.6 4.8 43 28-71 191-233 (307)
8 PRK10566 esterase; Provisional 98.3 1.9E-06 4.2E-11 55.0 5.6 43 30-72 176-218 (249)
9 PRK11460 putative hydrolase; P 98.3 1.8E-06 3.8E-11 55.7 5.0 37 40-76 148-184 (232)
10 COG0400 Predicted esterase [Ge 98.2 2.7E-06 5.9E-11 54.7 4.5 39 38-76 144-182 (207)
11 PRK05371 x-prolyl-dipeptidyl a 98.2 4.1E-06 8.8E-11 62.3 5.8 51 24-75 440-490 (767)
12 TIGR02821 fghA_ester_D S-formy 98.2 4.3E-06 9.2E-11 54.9 4.8 63 9-76 184-248 (275)
13 PF02230 Abhydrolase_2: Phosph 98.1 4.1E-06 8.9E-11 53.2 4.1 37 40-76 155-191 (216)
14 COG1505 Serine proteases of th 98.1 5.2E-06 1.1E-10 60.3 4.2 62 15-76 554-616 (648)
15 PF03583 LIP: Secretory lipase 98.0 1.2E-05 2.5E-10 53.8 4.9 37 40-76 219-256 (290)
16 PF07859 Abhydrolase_3: alpha/ 97.9 2.5E-05 5.4E-10 48.7 4.6 47 27-76 153-200 (211)
17 PF01738 DLH: Dienelactone hyd 97.7 0.00011 2.4E-09 46.4 5.4 45 31-76 137-181 (218)
18 PLN02298 hydrolase, alpha/beta 97.7 7.3E-05 1.6E-09 49.8 4.5 38 35-73 247-284 (330)
19 TIGR01840 esterase_phb esteras 97.6 5.9E-05 1.3E-09 47.7 3.4 30 40-69 168-197 (212)
20 KOG1552|consensus 97.6 2.5E-05 5.5E-10 51.7 1.3 37 32-69 185-221 (258)
21 PRK10749 lysophospholipase L2; 97.6 0.00016 3.6E-09 48.5 5.0 37 34-71 254-290 (330)
22 PRK10162 acetyl esterase; Prov 97.4 0.00067 1.5E-08 45.6 6.5 35 40-76 248-282 (318)
23 PLN02385 hydrolase; alpha/beta 97.4 0.00023 4.9E-09 48.0 4.2 39 35-74 275-313 (349)
24 PLN02652 hydrolase; alpha/beta 97.4 0.0003 6.5E-09 48.9 4.3 39 35-74 320-358 (395)
25 PHA02857 monoglyceride lipase; 97.4 0.00034 7.4E-09 45.2 4.3 34 34-68 204-237 (276)
26 COG0657 Aes Esterase/lipase [L 97.3 0.00075 1.6E-08 44.9 5.4 47 28-76 233-279 (312)
27 PRK11071 esterase YqiA; Provis 97.2 0.00035 7.7E-09 43.8 3.1 27 40-66 136-162 (190)
28 PF12695 Abhydrolase_5: Alpha/ 97.2 0.00073 1.6E-08 39.3 4.1 33 34-67 99-131 (145)
29 COG0429 Predicted hydrolase of 97.1 0.00027 5.8E-09 48.5 1.4 52 5-60 243-294 (345)
30 KOG1455|consensus 96.9 0.00088 1.9E-08 45.5 3.1 43 31-74 238-280 (313)
31 PRK06765 homoserine O-acetyltr 96.9 0.0027 5.9E-08 44.1 5.5 45 30-75 314-358 (389)
32 TIGR01738 bioH putative pimelo 96.9 0.0019 4.1E-08 39.8 4.1 34 33-67 182-215 (245)
33 COG1073 Hydrolases of the alph 96.9 0.0017 3.7E-08 41.4 3.9 44 26-69 218-261 (299)
34 TIGR03611 RutD pyrimidine util 96.8 0.0019 4.2E-08 40.2 3.9 34 33-67 192-225 (257)
35 KOG2112|consensus 96.8 0.0017 3.7E-08 41.9 3.2 37 40-76 144-180 (206)
36 TIGR01607 PST-A Plasmodium sub 96.7 0.0024 5.2E-08 43.2 4.0 33 40-72 270-302 (332)
37 TIGR01392 homoserO_Ac_trn homo 96.7 0.0042 9E-08 42.0 5.0 40 33-73 282-321 (351)
38 KOG1838|consensus 96.7 0.0014 3.1E-08 46.0 2.5 50 4-57 290-339 (409)
39 PF08840 BAAT_C: BAAT / Acyl-C 96.6 0.0039 8.4E-08 39.9 4.2 37 35-72 111-148 (213)
40 COG4099 Predicted peptidase [G 96.6 0.00096 2.1E-08 45.7 1.4 43 34-76 309-351 (387)
41 TIGR02427 protocat_pcaD 3-oxoa 96.6 0.0039 8.4E-08 38.4 3.8 35 32-67 186-220 (251)
42 TIGR02240 PHA_depoly_arom poly 96.5 0.0047 1E-07 40.1 4.0 34 33-67 201-234 (276)
43 TIGR01249 pro_imino_pep_1 prol 96.5 0.0046 9.9E-08 41.0 3.9 34 34-67 242-275 (306)
44 PF10503 Esterase_phd: Esteras 96.5 0.0038 8.2E-08 40.5 3.3 30 40-69 169-198 (220)
45 PRK05077 frsA fermentation/res 96.5 0.0045 9.6E-08 43.3 3.9 30 36-66 352-381 (414)
46 PRK00175 metX homoserine O-ace 96.5 0.0096 2.1E-07 40.9 5.4 40 34-74 304-343 (379)
47 TIGR03343 biphenyl_bphD 2-hydr 96.4 0.0049 1.1E-07 39.6 3.6 34 33-67 217-250 (282)
48 PF02129 Peptidase_S15: X-Pro 96.0 0.018 3.8E-07 37.7 4.8 51 23-75 212-263 (272)
49 PRK07581 hypothetical protein; 96.0 0.02 4.3E-07 38.3 5.1 34 33-67 269-302 (339)
50 TIGR03056 bchO_mg_che_rel puta 96.0 0.014 3E-07 37.1 4.2 32 35-67 216-247 (278)
51 PRK10349 carboxylesterase BioH 96.0 0.0066 1.4E-07 38.8 2.7 36 31-67 188-223 (256)
52 TIGR01836 PHA_synth_III_C poly 96.0 0.0088 1.9E-07 40.5 3.2 33 35-68 282-314 (350)
53 PRK10985 putative hydrolase; P 95.9 0.0072 1.6E-07 40.5 2.7 40 23-63 239-278 (324)
54 PF08386 Abhydrolase_4: TAP-li 95.8 0.014 3E-07 33.4 3.2 29 40-68 34-62 (103)
55 PLN02511 hydrolase 95.8 0.0066 1.4E-07 41.9 2.0 34 25-59 284-317 (388)
56 PLN03087 BODYGUARD 1 domain co 95.7 0.015 3.2E-07 41.7 3.5 30 37-67 416-445 (481)
57 PRK06489 hypothetical protein; 95.6 0.018 3.9E-07 39.1 3.7 35 32-67 285-321 (360)
58 COG2267 PldB Lysophospholipase 95.6 0.015 3.1E-07 39.1 3.3 35 36-71 225-260 (298)
59 PF07519 Tannase: Tannase and 95.6 0.011 2.5E-07 42.1 2.8 31 40-70 353-383 (474)
60 PRK03592 haloalkane dehalogena 95.6 0.027 5.8E-07 36.8 4.4 32 35-67 224-255 (295)
61 PLN02578 hydrolase 95.6 0.023 4.9E-07 38.5 4.1 33 34-67 291-323 (354)
62 PF00561 Abhydrolase_1: alpha/ 95.6 0.014 2.9E-07 35.9 2.8 38 28-66 164-201 (230)
63 PRK00870 haloalkane dehalogena 95.6 0.016 3.5E-07 38.1 3.2 32 35-68 235-266 (302)
64 PLN02965 Probable pheophorbida 95.4 0.023 5.1E-07 36.4 3.5 31 37-68 191-221 (255)
65 KOG4667|consensus 95.4 0.017 3.8E-07 38.1 2.8 29 40-68 199-227 (269)
66 PF12697 Abhydrolase_6: Alpha/ 95.3 0.025 5.4E-07 34.1 3.2 37 29-66 166-202 (228)
67 PRK08775 homoserine O-acetyltr 95.2 0.027 5.8E-07 38.0 3.3 32 35-67 273-304 (343)
68 KOG1454|consensus 95.1 0.048 1.1E-06 37.1 4.4 38 29-66 253-290 (326)
69 PLN02824 hydrolase, alpha/beta 95.0 0.039 8.4E-07 36.0 3.6 31 34-65 229-259 (294)
70 PLN02679 hydrolase, alpha/beta 95.0 0.07 1.5E-06 36.4 4.9 35 33-68 286-321 (360)
71 COG0412 Dienelactone hydrolase 94.8 0.07 1.5E-06 34.7 4.5 40 35-75 154-193 (236)
72 PRK14875 acetoin dehydrogenase 94.6 0.045 9.7E-07 36.6 3.2 31 32-63 307-337 (371)
73 PF03959 FSH1: Serine hydrolas 94.5 0.066 1.4E-06 34.1 3.7 34 35-69 157-190 (212)
74 PF05705 DUF829: Eukaryotic pr 94.1 0.15 3.4E-06 32.6 4.8 37 40-76 178-214 (240)
75 TIGR01250 pro_imino_pep_2 prol 94.0 0.13 2.8E-06 32.3 4.3 32 33-66 225-256 (288)
76 PRK10673 acyl-CoA esterase; Pr 93.8 0.095 2.1E-06 33.1 3.4 30 36-66 192-221 (255)
77 PF11144 DUF2920: Protein of u 93.7 0.18 4E-06 35.6 4.9 37 40-76 293-329 (403)
78 PLN02211 methyl indole-3-aceta 93.6 0.1 2.2E-06 34.2 3.4 28 40-67 211-238 (273)
79 PLN02872 triacylglycerol lipas 93.5 0.098 2.1E-06 36.6 3.3 46 23-68 306-353 (395)
80 COG1647 Esterase/lipase [Gener 93.3 0.092 2E-06 34.6 2.7 40 32-72 174-213 (243)
81 COG1770 PtrB Protease II [Amin 92.9 0.11 2.4E-06 38.8 2.9 58 15-72 584-641 (682)
82 TIGR01838 PHA_synth_I poly(R)- 92.8 0.15 3.2E-06 37.2 3.5 34 33-67 409-442 (532)
83 PRK07868 acyl-CoA synthetase; 92.7 0.15 3.2E-06 39.3 3.5 32 35-67 293-324 (994)
84 PLN03084 alpha/beta hydrolase 92.1 0.34 7.3E-06 33.8 4.4 28 37-65 323-350 (383)
85 PRK03204 haloalkane dehalogena 91.6 0.26 5.6E-06 32.4 3.3 28 40-67 227-255 (286)
86 PRK05855 short chain dehydroge 91.6 0.15 3.2E-06 36.0 2.2 27 38-65 232-258 (582)
87 PF06821 Ser_hydrolase: Serine 91.2 0.39 8.5E-06 29.8 3.7 28 41-68 115-142 (171)
88 KOG4391|consensus 90.3 0.39 8.5E-06 32.0 3.1 31 35-66 217-247 (300)
89 KOG2551|consensus 90.2 0.28 6E-06 32.3 2.4 34 35-69 159-192 (230)
90 TIGR03695 menH_SHCHC 2-succiny 90.0 0.68 1.5E-05 28.1 3.9 29 33-63 188-216 (251)
91 PF10605 3HBOH: 3HB-oligomer h 89.7 0.19 4E-06 37.5 1.3 32 37-68 552-583 (690)
92 PF05448 AXE1: Acetyl xylan es 88.8 0.48 1E-05 32.3 2.8 41 27-68 250-290 (320)
93 KOG2984|consensus 88.5 0.35 7.6E-06 31.9 1.9 26 35-61 212-237 (277)
94 PLN02894 hydrolase, alpha/beta 88.5 1.2 2.6E-05 31.0 4.7 30 34-65 320-349 (402)
95 COG2021 MET2 Homoserine acetyl 88.2 1.3 2.8E-05 31.1 4.5 42 29-71 296-337 (368)
96 PLN02606 palmitoyl-protein thi 88.1 0.98 2.1E-05 31.0 3.9 36 40-76 26-62 (306)
97 PF05728 UPF0227: Uncharacteri 87.4 0.79 1.7E-05 29.0 3.0 26 40-65 134-159 (187)
98 TIGR01839 PHA_synth_II poly(R) 87.1 0.89 1.9E-05 33.5 3.4 35 30-66 433-467 (560)
99 KOG2237|consensus 86.8 0.71 1.5E-05 34.7 2.8 44 26-69 617-661 (712)
100 PRK10439 enterobactin/ferric e 86.5 1.4 3.1E-05 31.0 4.1 36 41-77 350-385 (411)
101 COG3571 Predicted hydrolase of 84.4 0.8 1.7E-05 29.2 1.9 26 33-59 136-161 (213)
102 TIGR01849 PHB_depoly_PhaZ poly 84.1 1.4 3E-05 31.3 3.1 32 35-66 333-364 (406)
103 COG3545 Predicted esterase of 83.8 3 6.4E-05 26.6 4.2 41 25-67 104-144 (181)
104 KOG3043|consensus 83.0 3.2 6.8E-05 27.6 4.3 37 34-71 159-195 (242)
105 PRK11126 2-succinyl-6-hydroxy- 82.9 1.4 2.9E-05 27.6 2.6 20 34-54 183-202 (242)
106 PLN02633 palmitoyl protein thi 82.0 2.7 5.8E-05 29.0 3.8 33 40-73 25-58 (314)
107 PF02089 Palm_thioest: Palmito 81.7 1.3 2.9E-05 29.9 2.2 26 40-65 5-33 (279)
108 KOG1515|consensus 81.6 2.6 5.7E-05 29.1 3.7 34 40-75 268-301 (336)
109 COG2945 Predicted hydrolase of 81.2 1.6 3.4E-05 28.4 2.3 30 35-65 145-174 (210)
110 PF07755 DUF1611: Protein of u 80.9 5.1 0.00011 27.4 4.8 39 37-76 109-147 (301)
111 cd05560 Xcc1710_like Xcc1710_l 80.9 2.9 6.4E-05 24.2 3.3 35 41-77 52-86 (109)
112 PLN02980 2-oxoglutarate decarb 80.2 2.9 6.4E-05 34.4 4.0 33 34-68 1563-1595(1655)
113 PF14714 KH_dom-like: KH-domai 80.1 5.5 0.00012 21.8 4.0 36 40-75 38-77 (80)
114 COG3458 Acetyl esterase (deace 78.8 2.1 4.6E-05 29.3 2.5 39 30-69 250-288 (321)
115 COG0596 MhpC Predicted hydrola 78.2 4 8.6E-05 24.3 3.4 30 35-65 217-246 (282)
116 KOG2624|consensus 76.9 3.1 6.8E-05 29.5 3.0 49 22-71 314-363 (403)
117 TIGR03100 hydr1_PEP hydrolase, 75.7 8.3 0.00018 25.2 4.6 36 40-75 26-62 (274)
118 KOG2382|consensus 75.0 3.9 8.5E-05 28.2 3.0 32 36-68 250-281 (315)
119 PF02244 Propep_M14: Carboxype 74.0 1.4 3.1E-05 23.2 0.5 27 50-76 36-62 (74)
120 PF08357 SEFIR: SEFIR domain; 73.5 7.3 0.00016 23.1 3.6 34 41-75 1-35 (150)
121 PF11339 DUF3141: Protein of u 73.3 3.6 7.8E-05 30.5 2.6 29 34-63 292-320 (581)
122 cd00248 Mth938-like Mth938-lik 72.0 7.2 0.00016 22.5 3.2 35 41-77 52-86 (109)
123 PF00450 Peptidase_S10: Serine 71.8 4.5 9.7E-05 27.6 2.7 28 40-67 330-357 (415)
124 PF10142 PhoPQ_related: PhoPQ- 70.1 2.6 5.6E-05 29.6 1.2 45 23-68 246-290 (367)
125 PF13676 TIR_2: TIR domain; PD 69.9 5.7 0.00012 21.8 2.5 28 43-74 1-28 (102)
126 PF06500 DUF1100: Alpha/beta h 69.2 8.3 0.00018 27.6 3.6 26 40-65 352-377 (411)
127 TIGR03100 hydr1_PEP hydrolase, 69.1 6.8 0.00015 25.6 3.0 21 34-55 202-222 (274)
128 cd03409 Chelatase_Class_II Cla 68.7 12 0.00027 20.4 3.7 27 43-69 3-29 (101)
129 PLN02213 sinapoylglucose-malat 68.1 5.5 0.00012 27.0 2.5 31 40-70 233-263 (319)
130 KOG2541|consensus 67.6 12 0.00025 25.7 3.8 30 35-65 19-50 (296)
131 PF02273 Acyl_transf_2: Acyl t 66.5 8.1 0.00018 26.2 3.0 37 29-66 185-221 (294)
132 cd03413 CbiK_C Anaerobic cobal 65.2 15 0.00033 20.9 3.6 25 43-69 4-28 (103)
133 TIGR00976 /NonD putative hydro 63.2 17 0.00037 26.4 4.3 40 28-70 221-260 (550)
134 PTZ00472 serine carboxypeptida 62.8 8.8 0.00019 27.5 2.8 30 40-69 364-393 (462)
135 PF12146 Hydrolase_4: Putative 61.6 7.1 0.00015 21.1 1.7 34 40-76 16-49 (79)
136 TIGR00666 PBP4 D-alanyl-D-alan 61.4 19 0.0004 24.9 4.1 31 42-72 55-85 (345)
137 cd05125 Mth938_2P1-like Mth938 60.0 18 0.00039 21.1 3.4 35 41-77 54-88 (114)
138 PF04430 DUF498: Protein of un 59.9 4.1 8.8E-05 23.4 0.6 35 40-76 52-86 (110)
139 PF04083 Abhydro_lipase: Parti 59.7 6 0.00013 20.7 1.2 13 40-52 43-55 (63)
140 PLN03016 sinapoylglucose-malat 59.5 10 0.00022 27.0 2.6 31 40-70 347-377 (433)
141 KOG1282|consensus 59.5 9.5 0.00021 27.6 2.5 27 41-67 364-390 (454)
142 PF02113 Peptidase_S13: D-Ala- 59.5 20 0.00044 25.6 4.1 32 41-72 81-112 (444)
143 PRK11113 D-alanyl-D-alanine ca 58.8 20 0.00044 25.9 4.0 30 43-72 100-129 (477)
144 PRK10834 vancomycin high tempe 58.2 17 0.00037 24.1 3.3 34 40-74 81-114 (239)
145 KOG3363|consensus 58.1 20 0.00043 22.8 3.4 37 41-77 111-147 (196)
146 PLN02209 serine carboxypeptida 57.2 12 0.00026 26.8 2.6 32 40-71 351-382 (437)
147 PF12913 SH3_6: SH3 domain of 56.8 11 0.00024 19.3 1.9 14 40-53 26-39 (54)
148 COG0627 Predicted esterase [Ge 56.4 15 0.00033 25.2 3.0 63 10-76 208-285 (316)
149 PLN02872 triacylglycerol lipas 56.4 9 0.00019 26.9 1.9 36 40-75 74-112 (395)
150 COG3243 PhaC Poly(3-hydroxyalk 54.5 19 0.00042 26.0 3.3 32 35-67 326-357 (445)
151 TIGR02803 ExbD_1 TonB system t 54.4 22 0.00047 20.6 3.1 24 49-72 91-114 (122)
152 PF06342 DUF1057: Alpha/beta h 53.6 22 0.00047 24.5 3.3 33 35-68 208-240 (297)
153 TIGR02764 spore_ybaN_pdaB poly 53.5 28 0.0006 21.5 3.6 38 37-74 148-185 (191)
154 PRK00923 sirohydrochlorin coba 53.4 22 0.00047 20.5 3.0 27 43-70 5-31 (126)
155 PF12646 DUF3783: Domain of un 53.1 26 0.00057 17.8 2.9 28 42-74 2-29 (58)
156 cd03414 CbiX_SirB_C Sirohydroc 52.3 15 0.00033 20.8 2.2 27 43-70 4-30 (117)
157 TIGR02804 ExbD_2 TonB system t 52.2 22 0.00048 20.6 2.9 24 49-72 90-113 (121)
158 PRK08335 translation initiatio 52.0 18 0.00039 24.4 2.7 26 51-76 141-166 (275)
159 TIGR02801 tolR TolR protein. T 51.6 26 0.00055 20.4 3.1 28 43-72 95-122 (129)
160 PHA01474 nonstructural protein 51.0 3.5 7.6E-05 20.3 -0.6 14 1-14 1-14 (52)
161 COG4635 HemG Flavodoxin [Energ 50.7 33 0.00072 21.7 3.6 35 42-77 2-36 (175)
162 PF11823 DUF3343: Protein of u 50.3 31 0.00067 18.1 3.0 21 56-76 11-31 (73)
163 PF12531 DUF3731: DNA-K relate 49.2 19 0.00041 24.1 2.5 38 31-70 136-173 (249)
164 PF08538 DUF1749: Protein of u 47.2 15 0.00032 25.3 1.8 34 35-69 228-262 (303)
165 PF00756 Esterase: Putative es 46.8 17 0.00036 23.0 1.9 51 26-76 165-229 (251)
166 KOG2521|consensus 46.6 36 0.00077 23.9 3.6 36 41-76 226-261 (350)
167 PRK11024 colicin uptake protei 46.4 34 0.00073 20.4 3.1 27 44-72 106-132 (141)
168 smart00460 TGc Transglutaminas 46.2 36 0.00078 16.9 2.9 20 57-76 11-30 (68)
169 COG2949 SanA Uncharacterized m 46.0 26 0.00057 23.1 2.7 35 40-75 93-127 (235)
170 PLN02757 sirohydrochlorine fer 45.9 24 0.00053 21.6 2.5 28 41-69 15-42 (154)
171 PF07521 RMMBL: RNA-metabolisi 45.8 3.5 7.6E-05 19.8 -1.1 21 30-50 22-42 (43)
172 PF09198 T4-Gluco-transf: Bact 45.6 22 0.00048 16.6 1.7 22 52-73 16-37 (38)
173 cd03416 CbiX_SirB_N Sirohydroc 44.7 33 0.00071 18.8 2.7 27 43-70 3-29 (101)
174 PRK11267 biopolymer transport 44.7 36 0.00079 20.3 3.1 28 43-72 101-128 (141)
175 PRK13604 luxD acyl transferase 44.5 50 0.0011 22.7 4.0 32 40-74 37-68 (307)
176 COG0848 ExbD Biopolymer transp 43.9 35 0.00076 20.6 2.9 31 40-72 96-126 (137)
177 cd00858 GlyRS_anticodon GlyRS 43.7 62 0.0013 18.5 3.9 34 41-75 27-60 (121)
178 PF06713 bPH_4: Bacterial PH d 43.5 24 0.00052 18.8 1.9 31 40-70 43-74 (74)
179 cd00860 ThrRS_anticodon ThrRS 43.3 50 0.0011 17.3 3.6 21 55-75 13-33 (91)
180 COG3367 Uncharacterized conser 43.3 66 0.0014 22.6 4.4 37 40-76 147-183 (339)
181 PF01674 Lipase_2: Lipase (cla 42.6 25 0.00054 22.8 2.2 29 40-71 1-30 (219)
182 PF08532 Glyco_hydro_42M: Beta 42.3 32 0.00069 21.7 2.7 22 55-76 28-49 (207)
183 cd00861 ProRS_anticodon_short 42.3 54 0.0012 17.4 3.4 21 55-75 16-36 (94)
184 TIGR00668 apaH bis(5'-nucleosy 42.3 57 0.0012 22.1 4.0 28 41-68 114-141 (279)
185 KOG3253|consensus 42.1 45 0.00097 25.7 3.6 34 35-69 300-333 (784)
186 KOG4178|consensus 41.8 42 0.00091 23.3 3.3 22 35-57 254-275 (322)
187 TIGR01890 N-Ac-Glu-synth amino 41.0 58 0.0013 23.0 4.0 29 40-77 48-76 (429)
188 TIGR00250 RNAse_H_YqgF RNAse H 40.3 76 0.0016 18.8 3.9 37 41-77 48-90 (130)
189 COG4947 Uncharacterized protei 40.2 44 0.00096 21.7 3.0 58 13-76 141-207 (227)
190 COG1184 GCD2 Translation initi 39.0 31 0.00068 23.7 2.4 23 54-76 154-176 (301)
191 PRK00109 Holliday junction res 38.6 83 0.0018 18.8 4.0 37 41-77 54-96 (138)
192 PRK00093 GTP-binding protein D 38.4 96 0.0021 21.6 4.8 36 40-75 387-426 (435)
193 PRK06372 translation initiatio 38.3 42 0.0009 22.5 2.8 24 53-76 117-140 (253)
194 COG2344 AT-rich DNA-binding pr 38.3 37 0.0008 22.2 2.5 42 24-72 131-172 (211)
195 KOG3101|consensus 37.9 24 0.00051 23.7 1.6 40 13-55 191-230 (283)
196 PF02472 ExbD: Biopolymer tran 37.8 45 0.00097 19.0 2.6 28 43-72 94-121 (130)
197 TIGR03230 lipo_lipase lipoprot 37.7 79 0.0017 22.9 4.3 29 40-68 41-69 (442)
198 PF13344 Hydrolase_6: Haloacid 37.3 22 0.00047 20.0 1.2 22 55-76 15-36 (101)
199 PF01535 PPR: PPR repeat; Int 37.1 36 0.00078 13.8 2.1 16 56-71 16-31 (31)
200 cd00859 HisRS_anticodon HisRS 36.9 50 0.0011 16.9 2.6 21 55-75 13-33 (91)
201 PF07137 VDE: Violaxanthin de- 36.4 26 0.00057 22.7 1.6 26 49-74 154-179 (198)
202 PRK08535 translation initiatio 35.8 45 0.00098 22.7 2.8 25 52-76 153-177 (310)
203 PF13041 PPR_2: PPR repeat fam 35.7 54 0.0012 15.4 2.5 20 55-74 18-37 (50)
204 PF05902 4_1_CTD: 4.1 protein 35.6 47 0.001 19.7 2.4 24 42-69 71-94 (114)
205 TIGR00627 tfb4 transcription f 35.6 99 0.0022 21.0 4.3 39 37-76 144-182 (279)
206 PF03720 UDPG_MGDP_dh_C: UDP-g 35.4 40 0.00088 18.9 2.2 21 57-77 17-37 (106)
207 PF06441 EHN: Epoxide hydrolas 34.7 21 0.00046 20.7 0.9 10 40-49 92-101 (112)
208 TIGR00511 ribulose_e2b2 ribose 34.7 42 0.00092 22.8 2.5 25 52-76 148-172 (301)
209 TIGR01205 D_ala_D_alaTIGR D-al 34.6 63 0.0014 21.3 3.3 28 24-52 16-43 (315)
210 PF10230 DUF2305: Uncharacteri 34.4 90 0.0019 20.6 3.9 42 31-73 213-254 (266)
211 PRK08334 translation initiatio 34.3 60 0.0013 22.9 3.2 27 50-76 198-225 (356)
212 PLN02825 amino-acid N-acetyltr 33.7 74 0.0016 23.5 3.7 33 35-77 44-76 (515)
213 PF02698 DUF218: DUF218 domain 33.7 69 0.0015 18.8 3.1 34 41-74 38-71 (155)
214 PRK09912 L-glyceraldehyde 3-ph 33.6 84 0.0018 21.4 3.8 31 40-71 131-161 (346)
215 cd01786 STE50_RA Ubiquitin-lik 33.0 75 0.0016 18.2 2.9 34 41-74 58-92 (98)
216 PRK06371 translation initiatio 32.9 58 0.0013 22.7 2.9 26 51-76 176-202 (329)
217 cd04237 AAK_NAGS-ABP AAK_NAGS- 32.8 80 0.0017 21.1 3.5 32 35-76 45-76 (280)
218 PRK08057 cobalt-precorrin-6x r 32.7 60 0.0013 21.5 2.9 28 40-74 2-29 (248)
219 PF09587 PGA_cap: Bacterial ca 32.2 77 0.0017 20.5 3.3 33 41-73 186-219 (250)
220 TIGR00583 mre11 DNA repair pro 32.2 32 0.0007 24.5 1.6 15 40-54 110-124 (405)
221 PF03358 FMN_red: NADPH-depend 32.1 70 0.0015 18.7 2.9 35 42-76 2-37 (152)
222 PRK04531 acetylglutamate kinas 32.0 98 0.0021 22.0 4.0 35 33-77 60-94 (398)
223 PF07485 DUF1529: Domain of Un 31.8 57 0.0012 19.4 2.4 26 50-75 61-86 (123)
224 PF00462 Glutaredoxin: Glutare 31.3 48 0.0011 16.2 1.9 20 58-77 11-30 (60)
225 cd03027 GRX_DEP Glutaredoxin ( 31.3 67 0.0015 16.4 2.5 20 58-77 13-32 (73)
226 KOG4388|consensus 31.2 1.1E+02 0.0024 23.7 4.2 34 40-75 787-820 (880)
227 PRK10329 glutaredoxin-like pro 31.1 66 0.0014 17.2 2.5 20 58-77 13-32 (81)
228 COG2908 Uncharacterized protei 31.1 83 0.0018 21.0 3.3 9 40-48 105-113 (237)
229 TIGR02194 GlrX_NrdH Glutaredox 31.0 70 0.0015 16.3 2.5 20 58-77 11-30 (72)
230 cd06259 YdcF-like YdcF-like. Y 30.9 98 0.0021 18.0 3.4 34 40-73 34-67 (150)
231 PF01008 IF-2B: Initiation fac 30.8 31 0.00067 22.7 1.3 24 53-76 141-164 (282)
232 COG3618 Predicted metal-depend 30.7 67 0.0015 21.9 2.9 36 41-76 108-143 (279)
233 COG1535 EntB Isochorismate hyd 30.5 75 0.0016 20.7 2.9 36 40-75 29-76 (218)
234 PF13812 PPR_3: Pentatricopept 30.4 52 0.0011 13.6 2.4 17 56-72 17-33 (34)
235 PRK13398 3-deoxy-7-phosphohept 30.3 1.4E+02 0.0031 19.9 4.4 32 41-73 26-57 (266)
236 COG0816 Predicted endonuclease 30.1 64 0.0014 19.6 2.5 21 57-77 74-95 (141)
237 TIGR01293 Kv_beta voltage-depe 29.6 1.6E+02 0.0034 19.8 4.6 31 40-71 113-143 (317)
238 COG1447 CelC Phosphotransferas 29.5 55 0.0012 19.1 2.0 27 40-67 70-96 (105)
239 cd03415 CbiX_CbiC Archaeal sir 29.5 1.1E+02 0.0023 18.1 3.3 25 43-68 4-28 (125)
240 KOG1466|consensus 29.4 45 0.00097 22.9 1.8 22 55-76 166-187 (313)
241 PF10882 bPH_5: Bacterial PH d 29.3 74 0.0016 17.4 2.5 30 40-69 69-99 (100)
242 PLN03194 putative disease resi 29.1 1.1E+02 0.0023 19.7 3.4 34 40-75 26-59 (187)
243 cd07422 MPP_ApaH Escherichia c 28.7 1.1E+02 0.0025 20.2 3.7 28 41-68 112-139 (257)
244 PRK00166 apaH diadenosine tetr 28.4 1.1E+02 0.0024 20.5 3.6 28 41-68 114-141 (275)
245 COG2818 Tag 3-methyladenine DN 28.2 50 0.0011 21.2 1.9 18 56-73 140-157 (188)
246 TIGR03101 hydr2_PEP hydrolase, 28.1 1.3E+02 0.0027 20.1 3.8 35 41-75 26-61 (266)
247 cd04885 ACT_ThrD-I Tandem C-te 28.1 88 0.0019 15.9 2.6 31 40-74 37-67 (68)
248 PF13823 ADH_N_assoc: Alcohol 28.1 28 0.00061 14.6 0.5 12 43-54 3-14 (23)
249 COG3208 GrsT Predicted thioest 28.1 1.6E+02 0.0035 19.8 4.2 29 37-66 174-202 (244)
250 TIGR02544 III_secr_YscJ type I 27.8 89 0.0019 19.9 3.0 21 55-75 27-47 (193)
251 cd01458 vWA_ku Ku70/Ku80 N-ter 27.8 1.2E+02 0.0026 19.1 3.6 37 40-76 128-168 (218)
252 PF11950 DUF3467: Protein of u 27.8 59 0.0013 18.1 1.9 18 52-69 62-79 (92)
253 TIGR02884 spore_pdaA delta-lac 27.7 1.6E+02 0.0034 18.9 4.1 34 40-75 186-219 (224)
254 cd01452 VWA_26S_proteasome_sub 27.7 1.7E+02 0.0036 18.6 4.6 34 42-76 109-142 (187)
255 cd03418 GRX_GRXb_1_3_like Glut 27.5 83 0.0018 15.8 2.4 20 58-77 12-31 (75)
256 PRK06036 translation initiatio 27.5 93 0.002 21.7 3.2 42 31-76 171-213 (339)
257 COG0124 HisS Histidyl-tRNA syn 27.4 1.5E+02 0.0033 21.4 4.3 33 40-75 335-367 (429)
258 PRK09061 D-glutamate deacylase 27.4 2E+02 0.0043 20.9 5.0 35 40-74 245-279 (509)
259 PF01583 APS_kinase: Adenylyls 27.4 1.6E+02 0.0034 18.2 4.3 21 57-77 17-37 (156)
260 PLN03033 2-dehydro-3-deoxyphos 27.3 1.9E+02 0.004 20.0 4.5 42 30-73 5-50 (290)
261 PF00532 Peripla_BP_1: Peripla 27.2 1.4E+02 0.003 19.6 3.9 32 43-74 122-153 (279)
262 PRK02929 L-arabinose isomerase 27.1 1.1E+02 0.0024 22.5 3.6 41 35-76 93-147 (499)
263 cd00578 L-fuc_L-ara-isomerases 27.0 1.7E+02 0.0036 20.8 4.5 41 35-76 84-132 (452)
264 COG3411 Ferredoxin [Energy pro 27.0 98 0.0021 16.4 2.5 35 40-74 16-52 (64)
265 PRK12326 preprotein translocas 26.8 1.7E+02 0.0038 22.9 4.7 31 40-76 427-457 (764)
266 PF02629 CoA_binding: CoA bind 26.7 83 0.0018 17.2 2.4 20 54-73 70-89 (96)
267 COG0132 BioD Dethiobiotin synt 26.7 1.7E+02 0.0036 19.2 4.1 42 29-73 126-167 (223)
268 PF09413 DUF2007: Domain of un 26.5 99 0.0021 15.6 2.6 21 56-76 9-29 (67)
269 PF02525 Flavodoxin_2: Flavodo 26.5 97 0.0021 19.1 3.0 35 42-76 2-39 (199)
270 PTZ00062 glutaredoxin; Provisi 26.4 1.5E+02 0.0033 19.0 3.9 35 41-77 114-149 (204)
271 TIGR00624 tag DNA-3-methyladen 26.2 38 0.00082 21.5 1.0 24 50-73 128-153 (179)
272 PRK00939 translation initiatio 26.2 1.1E+02 0.0024 17.5 2.9 19 58-76 80-98 (99)
273 COG1834 N-Dimethylarginine dim 25.9 83 0.0018 21.3 2.7 23 55-77 36-58 (267)
274 cd03143 A4_beta-galactosidase_ 25.9 1.1E+02 0.0025 17.9 3.1 22 55-76 24-45 (154)
275 KOG4409|consensus 25.9 1.5E+02 0.0032 21.1 3.9 45 26-71 288-333 (365)
276 cd03557 L-arabinose_isomerase 25.8 97 0.0021 22.7 3.2 41 35-76 87-141 (484)
277 PF10137 TIR-like: Predicted n 25.7 1.5E+02 0.0033 17.5 3.8 6 43-48 2-7 (125)
278 TIGR00756 PPR pentatricopeptid 25.7 64 0.0014 13.1 2.5 18 56-73 16-33 (35)
279 PF04056 Ssl1: Ssl1-like; Int 25.7 1.5E+02 0.0031 19.1 3.6 45 31-76 92-136 (193)
280 cd04446 DEP_DEPDC4 DEP (Dishev 25.6 1.2E+02 0.0026 17.3 2.9 22 51-72 49-70 (95)
281 COG4545 Glutaredoxin-related p 25.6 76 0.0016 17.6 2.0 20 58-77 14-33 (85)
282 PF03129 HGTP_anticodon: Antic 25.3 1E+02 0.0023 16.3 2.7 20 56-75 15-34 (94)
283 PF13709 DUF4159: Domain of un 25.3 1.8E+02 0.0038 18.6 4.0 30 40-69 53-82 (207)
284 cd03060 GST_N_Omega_like GST_N 25.1 1E+02 0.0022 15.5 2.5 19 58-76 11-29 (71)
285 PF01841 Transglut_core: Trans 24.8 90 0.0019 17.0 2.4 21 56-76 55-75 (113)
286 PRK05720 mtnA methylthioribose 24.7 1.1E+02 0.0023 21.5 3.1 42 31-76 170-212 (344)
287 PF03960 ArsC: ArsC family; I 24.6 73 0.0016 17.9 2.0 20 58-77 8-27 (110)
288 PRK05279 N-acetylglutamate syn 24.6 1.2E+02 0.0027 21.4 3.5 28 40-76 56-83 (441)
289 TIGR01616 nitro_assoc nitrogen 24.6 95 0.0021 18.3 2.5 19 59-77 14-32 (126)
290 PF06028 DUF915: Alpha/beta hy 24.3 48 0.001 22.0 1.3 35 40-74 184-224 (255)
291 PRK12475 thiamine/molybdopteri 24.0 1.1E+02 0.0024 21.1 3.0 19 44-66 318-336 (338)
292 PF03446 NAD_binding_2: NAD bi 24.0 1E+02 0.0022 18.5 2.7 28 40-71 86-113 (163)
293 TIGR00524 eIF-2B_rel eIF-2B al 23.8 86 0.0019 21.4 2.5 25 52-76 159-184 (303)
294 PRK12902 secA preprotein trans 23.8 1.8E+02 0.0039 23.4 4.3 31 40-76 439-469 (939)
295 PRK01372 ddl D-alanine--D-alan 23.7 1.2E+02 0.0027 19.8 3.2 35 42-76 6-42 (304)
296 PF06850 PHB_depo_C: PHB de-po 23.5 1.3E+02 0.0029 19.6 3.1 30 38-68 132-161 (202)
297 cd03032 ArsC_Spx Arsenate Redu 23.5 1.2E+02 0.0025 17.3 2.7 20 58-77 12-31 (115)
298 cd02977 ArsC_family Arsenate R 23.3 1.2E+02 0.0026 16.7 2.7 19 59-77 12-30 (105)
299 PF07819 PGAP1: PGAP1-like pro 23.3 1.4E+02 0.0029 19.2 3.2 24 40-66 4-27 (225)
300 PRK11340 phosphodiesterase Yae 23.3 94 0.002 20.4 2.6 32 41-72 113-144 (271)
301 PF00070 Pyr_redox: Pyridine n 23.1 74 0.0016 16.5 1.7 19 58-76 10-28 (80)
302 TIGR00823 EIIA-LAC phosphotran 23.0 53 0.0011 18.7 1.2 28 40-68 68-95 (99)
303 cd00738 HGTP_anticodon HGTP an 23.0 1.3E+02 0.0028 15.7 3.3 20 56-75 17-36 (94)
304 cd03785 GT1_MurG MurG is an N- 22.9 1.6E+02 0.0034 19.4 3.6 22 54-75 11-32 (350)
305 PRK04169 geranylgeranylglycery 22.9 2.3E+02 0.005 18.7 4.3 38 34-74 27-64 (232)
306 PRK10853 putative reductase; P 22.9 1.2E+02 0.0027 17.5 2.8 19 59-77 13-31 (118)
307 PRK05772 translation initiatio 22.9 1E+02 0.0023 21.7 2.8 42 31-76 191-233 (363)
308 cd00215 PTS_IIA_lac PTS_IIA, P 22.9 56 0.0012 18.5 1.2 28 40-68 66-93 (97)
309 COG1509 KamA Lysine 2,3-aminom 22.8 68 0.0015 22.8 1.9 30 40-69 281-315 (369)
310 cd03034 ArsC_ArsC Arsenate Red 22.7 1.1E+02 0.0024 17.4 2.5 19 59-77 12-30 (112)
311 COG0859 RfaF ADP-heptose:LPS h 22.6 2.4E+02 0.0053 19.1 4.5 33 41-73 175-211 (334)
312 PF12010 DUF3502: Domain of un 22.5 39 0.00085 20.0 0.6 25 45-72 92-116 (134)
313 cd03033 ArsC_15kD Arsenate Red 22.4 1.3E+02 0.0028 17.3 2.8 19 59-77 13-31 (113)
314 cd03041 GST_N_2GST_N GST_N fam 22.4 1.1E+02 0.0025 15.7 2.4 19 58-76 12-30 (77)
315 PRK15324 type III secretion sy 22.4 1.3E+02 0.0028 20.1 3.1 22 55-76 28-49 (252)
316 TIGR00512 salvage_mtnA S-methy 22.3 1.4E+02 0.0029 20.9 3.2 42 31-76 170-212 (331)
317 PRK14105 selenophosphate synth 22.2 1.2E+02 0.0026 20.8 2.9 23 53-75 300-322 (345)
318 PRK00943 selenophosphate synth 22.1 1.2E+02 0.0026 20.8 3.0 24 52-75 307-330 (347)
319 cd03412 CbiK_N Anaerobic cobal 22.1 1.2E+02 0.0025 17.7 2.6 26 43-69 4-29 (127)
320 cd04252 AAK_NAGK-fArgBP AAK_NA 21.9 1.9E+02 0.0042 18.8 3.8 15 35-50 24-38 (248)
321 TIGR01768 GGGP-family geranylg 21.9 2.2E+02 0.0048 18.7 4.0 38 34-74 22-59 (223)
322 COG1334 FlaG Uncharacterized f 21.9 97 0.0021 18.5 2.1 17 53-69 95-111 (120)
323 PF12122 DUF3582: Protein of u 21.8 1.6E+02 0.0034 16.8 3.0 22 55-76 9-30 (101)
324 TIGR02873 spore_ylxY probable 21.7 2.6E+02 0.0056 18.6 4.6 54 21-76 55-117 (268)
325 PF02082 Rrf2: Transcriptional 21.6 1.4E+02 0.0029 15.9 2.6 20 52-71 35-54 (83)
326 cd04906 ACT_ThrD-I_1 First of 21.6 1.5E+02 0.0032 15.8 2.9 20 56-75 52-71 (85)
327 PRK10494 hypothetical protein; 21.6 1.8E+02 0.0038 19.3 3.6 34 41-74 121-154 (259)
328 PRK12464 1-deoxy-D-xylulose 5- 21.5 49 0.0011 23.6 1.0 65 1-66 209-273 (383)
329 cd03054 GST_N_Metaxin GST_N fa 21.5 1.3E+02 0.0028 15.1 3.9 20 57-76 17-36 (72)
330 PF04282 DUF438: Family of unk 21.5 1.3E+02 0.0029 16.1 2.5 22 53-74 28-49 (71)
331 PF02780 Transketolase_C: Tran 21.5 1E+02 0.0023 17.5 2.3 21 56-76 21-41 (124)
332 COG0529 CysC Adenylylsulfate k 21.4 2.4E+02 0.0053 18.3 4.7 38 40-77 21-58 (197)
333 PF00258 Flavodoxin_1: Flavodo 21.4 77 0.0017 18.2 1.7 18 59-76 14-31 (143)
334 PF13362 Toprim_3: Toprim doma 21.3 1.5E+02 0.0033 15.9 4.8 37 40-76 40-77 (96)
335 TIGR01133 murG undecaprenyldip 21.3 1.2E+02 0.0027 19.8 2.8 30 43-74 3-32 (348)
336 PRK07094 biotin synthase; Prov 21.0 2.7E+02 0.0058 18.6 4.9 34 40-73 86-120 (323)
337 PF03652 UPF0081: Uncharacteri 20.9 1.1E+02 0.0024 18.2 2.3 37 41-77 51-94 (135)
338 PRK03003 GTP-binding protein D 20.8 2.6E+02 0.0057 20.0 4.5 34 40-75 425-462 (472)
339 PRK07583 cytosine deaminase-li 20.8 2.3E+02 0.0051 19.9 4.3 36 40-75 256-291 (438)
340 PLN02696 1-deoxy-D-xylulose-5- 20.8 45 0.00098 24.3 0.7 65 1-66 272-336 (454)
341 TIGR03594 GTPase_EngA ribosome 20.7 2.3E+02 0.005 19.6 4.2 30 40-69 387-416 (429)
342 cd03036 ArsC_like Arsenate Red 20.6 1.6E+02 0.0034 16.7 2.9 20 58-77 11-30 (111)
343 TIGR03675 arCOG00543 arCOG0054 20.6 2.3E+02 0.0051 21.4 4.4 37 30-72 578-617 (630)
344 PF11426 Tn7_TnsC_Int: Tn7 tra 20.5 19 0.00041 18.0 -0.9 15 61-75 26-40 (48)
345 PF04244 DPRP: Deoxyribodipyri 20.5 1.4E+02 0.003 19.5 2.8 20 58-77 50-69 (224)
346 PRK13237 tyrosine phenol-lyase 20.3 1.1E+02 0.0025 22.3 2.6 19 56-74 322-340 (460)
347 PF13361 UvrD_C: UvrD-like hel 20.3 1.1E+02 0.0023 19.9 2.3 20 56-75 87-106 (351)
348 PF13721 SecD-TM1: SecD export 20.2 1.8E+02 0.004 16.4 3.3 32 42-75 32-64 (101)
349 PRK04940 hypothetical protein; 20.1 1.5E+02 0.0033 18.8 2.9 26 41-66 125-150 (180)
350 PF03979 Sigma70_r1_1: Sigma-7 20.1 1.2E+02 0.0026 16.3 2.2 22 52-73 34-55 (82)
351 COG1362 LAP4 Aspartyl aminopep 20.1 2.8E+02 0.006 20.3 4.4 40 37-76 337-376 (437)
352 PRK00819 RNA 2'-phosphotransfe 20.1 1.2E+02 0.0026 19.2 2.4 12 40-51 92-103 (179)
353 PF00542 Ribosomal_L12: Riboso 20.1 1.6E+02 0.0034 15.5 3.6 31 40-75 37-67 (68)
354 TIGR00243 Dxr 1-deoxy-D-xylulo 20.0 55 0.0012 23.4 1.0 65 1-66 217-281 (389)
No 1
>KOG2100|consensus
Probab=99.59 E-value=1.6e-15 Score=111.18 Aligned_cols=73 Identities=47% Similarity=0.740 Sum_probs=66.7
Q ss_pred ccchhhhhhhhhhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 2 FWITNSIYTERYMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 2 w~~~~~~~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
|++|++.|+||+||.| .++...|.+.++..++..++++-+||+||+.|++||++||.+++++|+.+|++++++
T Consensus 646 ~~~yds~~terymg~p--~~~~~~y~e~~~~~~~~~~~~~~~LliHGt~DdnVh~q~s~~~~~aL~~~gv~~~~~ 718 (755)
T KOG2100|consen 646 WLYYDSTYTERYMGLP--SENDKGYEESSVSSPANNIKTPKLLLIHGTEDDNVHFQQSAILIKALQNAGVPFRLL 718 (755)
T ss_pred eeeecccccHhhcCCC--ccccchhhhccccchhhhhccCCEEEEEcCCcCCcCHHHHHHHHHHHHHCCCceEEE
Confidence 8999999999999999 676667999999999999954556999999999999999999999999999998875
No 2
>KOG2281|consensus
Probab=99.52 E-value=2e-14 Score=103.55 Aligned_cols=73 Identities=37% Similarity=0.585 Sum_probs=68.0
Q ss_pred ccchhhhhhhhhhCCCCCCCCcccccccChhHHhccCCC--CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 2 FWITNSIYTERYMGLPEMYDNEVGYNWTDVTRKVGNFVN--KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 2 w~~~~~~~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~--~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
|+.|++.|+|||||.| +.+...|.+.|...++.++.. ..+||+||.-|.+|++.|+.+|+.+|.++|+++|++
T Consensus 764 W~~YDTgYTERYMg~P--~~nE~gY~agSV~~~VeklpdepnRLlLvHGliDENVHF~Hts~Lvs~lvkagKpyeL~ 838 (867)
T KOG2281|consen 764 WRLYDTGYTERYMGYP--DNNEHGYGAGSVAGHVEKLPDEPNRLLLVHGLIDENVHFAHTSRLVSALVKAGKPYELQ 838 (867)
T ss_pred eeeecccchhhhcCCC--ccchhcccchhHHHHHhhCCCCCceEEEEecccccchhhhhHHHHHHHHHhCCCceEEE
Confidence 9999999999999999 778889999999999998852 359999999999999999999999999999999986
No 3
>COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]
Probab=99.45 E-value=8.9e-14 Score=100.08 Aligned_cols=74 Identities=23% Similarity=0.181 Sum_probs=60.1
Q ss_pred ccchhhhhhhhhhCCCCC-----CCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 2 FWITNSIYTERYMGLPEM-----YDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 2 w~~~~~~~~e~~lG~p~~-----~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
|..+...+++.+.+.+.. ..+.+.|...||++++.+. ++|+||+||+.|.+||++|+++|+++|+.+|++++++
T Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sp~~~~~~i-~~P~LliHG~~D~~v~~~q~~~~~~aL~~~g~~~~~~ 587 (620)
T COG1506 509 WLLYFGESTEGLRFDPEENGGGPPEDREKYEDRSPIFYADNI-KTPLLLIHGEEDDRVPIEQAEQLVDALKRKGKPVELV 587 (620)
T ss_pred hhhhccccchhhcCCHHHhCCCcccChHHHHhcChhhhhccc-CCCEEEEeecCCccCChHHHHHHHHHHHHcCceEEEE
Confidence 444555555655543210 1156789999999999999 7999999999999999999999999999999999986
No 4
>PF00326 Peptidase_S9: Prolyl oligopeptidase family This family belongs to family S9 of the peptidase classification.; InterPro: IPR001375 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain covers the active site serine of the serine peptidases belonging to MEROPS peptidase family S9 (prolyl oligopeptidase family, clan SC). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Examples of protein families containing this domain are: Prolyl endopeptidase (3.4.21.26 from EC) (PE) (also called post-proline cleaving enzyme). PE is an enzyme that cleaves peptide bonds on the C-terminal side of prolyl residues. The sequence of PE has been obtained from a mammalian species (pig) and from bacteria (Flavobacterium meningosepticum and Aeromonas hydrophila); there is a high degree of sequence conservation between these sequences. Escherichia coli protease II (3.4.21.83 from EC) (oligopeptidase B) (gene prtB) which cleaves peptide bonds on the C-terminal side of lysyl and argininyl residues. Dipeptidyl peptidase IV (3.4.14.5 from EC) (DPP IV). DPP IV is an enzyme that removes N-terminal dipeptides sequentially from polypeptides having unsubstituted N-termini provided that the penultimate residue is proline. Saccharomyces cerevisiae (Baker's yeast) vacuolar dipeptidyl aminopeptidases A and B (DPAP A and DPAP B), encoded by the STE13 and DAP2 genes respectively. DPAP A is responsible for the proteolytic maturation of the alpha-factor precursor. Acylamino-acid-releasing enzyme (3.4.19.1 from EC) (acyl-peptide hydrolase). This enzyme catalyses the hydrolysis of the amino-terminal peptide bond of an N-acetylated protein to generate a N-acetylated amino acid and a protein with a free amino-terminus. These proteins belong to MEROPS peptidase families S9A, S9B and S9C.; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis; PDB: 2AJ8_D 1ORV_D 2AJB_C 2BUC_D 1ORW_D 2AJC_D 2AJD_C 2BUA_A 2HU8_B 3O4J_B ....
Probab=99.36 E-value=8.3e-13 Score=83.24 Aligned_cols=59 Identities=29% Similarity=0.292 Sum_probs=53.9
Q ss_pred CCCCCCCCcccccccChhHHhcc--CCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 15 GLPEMYDNEVGYNWTDVTRKVGN--FVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 15 G~p~~~~~~~~y~~~sp~~~~~~--~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
|.+ ...++.|+..||+..+.+ . ++|+||+||++|.+||++|+.+|+++|+++|++++++
T Consensus 120 ~~~--~~~~~~~~~~s~~~~~~~~~~-~~P~li~hG~~D~~Vp~~~s~~~~~~L~~~g~~~~~~ 180 (213)
T PF00326_consen 120 GDP--WDNPEFYRELSPISPADNVQI-KPPVLIIHGENDPRVPPSQSLRLYNALRKAGKPVELL 180 (213)
T ss_dssp SST--TTSHHHHHHHHHGGGGGGCGG-GSEEEEEEETTBSSSTTHHHHHHHHHHHHTTSSEEEE
T ss_pred Ccc--chhhhhhhhhccccccccccC-CCCEEEEccCCCCccCHHHHHHHHHHHHhcCCCEEEE
Confidence 445 568888999999999999 7 7999999999999999999999999999999998875
No 5
>PRK10115 protease 2; Provisional
Probab=98.86 E-value=2.2e-09 Score=78.46 Aligned_cols=59 Identities=17% Similarity=0.061 Sum_probs=48.7
Q ss_pred CCCCCCC-Cc-ccccccChhHHhccCCCCc-EEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 15 GLPEMYD-NE-VGYNWTDVTRKVGNFVNKT-FLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 15 G~p~~~~-~~-~~y~~~sp~~~~~~~~~~p-~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
|.| ++ .. +..++.||+.++.+. ..| +||+||..|.+||+.|+.+|+++|+++|++++++
T Consensus 581 G~p--~~~~~~~~l~~~SP~~~v~~~-~~P~lLi~~g~~D~RV~~~~~~k~~a~Lr~~~~~~~~v 642 (686)
T PRK10115 581 GNP--QDPQYYEYMKSYSPYDNVTAQ-AYPHLLVTTGLHDSQVQYWEPAKWVAKLRELKTDDHLL 642 (686)
T ss_pred CCC--CCHHHHHHHHHcCchhccCcc-CCCceeEEecCCCCCcCchHHHHHHHHHHhcCCCCceE
Confidence 888 43 22 333569999999998 577 7788999999999999999999999999986653
No 6
>PLN02442 S-formylglutathione hydrolase
Probab=98.62 E-value=6.9e-08 Score=63.86 Aligned_cols=64 Identities=20% Similarity=0.224 Sum_probs=52.1
Q ss_pred hhhhhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHH-HHHHHHHHHHhcCCCceee
Q psy7765 10 TERYMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQ-QSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 10 ~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~-~s~~l~~al~~~g~~~~~~ 76 (77)
...++|.+ .+..+.|...+|+..+... ++|++|+||++|..|+.. ++..|+++++++|.++++.
T Consensus 190 ~~~~~g~~--~~~~~~~d~~~~~~~~~~~-~~pvli~~G~~D~~v~~~~~s~~~~~~l~~~g~~~~~~ 254 (283)
T PLN02442 190 FTNYLGSD--KADWEEYDATELVSKFNDV-SATILIDQGEADKFLKEQLLPENFEEACKEAGAPVTLR 254 (283)
T ss_pred HHHHcCCC--hhhHHHcChhhhhhhcccc-CCCEEEEECCCCccccccccHHHHHHHHHHcCCCeEEE
Confidence 34566766 4556667778888777767 799999999999999974 7999999999999998875
No 7
>PRK13604 luxD acyl transferase; Provisional
Probab=98.33 E-value=9.9e-07 Score=59.63 Aligned_cols=43 Identities=12% Similarity=0.228 Sum_probs=37.7
Q ss_pred ccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 28 WTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 28 ~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
..+|+..++++ +.|+|++||++|..||++.+.+++++++....
T Consensus 191 ~~s~i~~~~~l-~~PvLiIHG~~D~lVp~~~s~~l~e~~~s~~k 233 (307)
T PRK13604 191 LDSTINKMKGL-DIPFIAFTANNDSWVKQSEVIDLLDSIRSEQC 233 (307)
T ss_pred cccHHHHHhhc-CCCEEEEEcCCCCccCHHHHHHHHHHhccCCc
Confidence 57888989999 69999999999999999999999998764333
No 8
>PRK10566 esterase; Provisional
Probab=98.31 E-value=1.9e-06 Score=54.96 Aligned_cols=43 Identities=33% Similarity=0.499 Sum_probs=35.6
Q ss_pred ChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 30 DVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 30 sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
++...+.++...|+|++||++|..||++++.+++++++++|.+
T Consensus 176 ~~~~~~~~i~~~P~Lii~G~~D~~v~~~~~~~l~~~l~~~g~~ 218 (249)
T PRK10566 176 EVTHQLEQLADRPLLLWHGLADDVVPAAESLRLQQALRERGLD 218 (249)
T ss_pred ChhhhhhhcCCCCEEEEEcCCCCcCCHHHHHHHHHHHHhcCCC
Confidence 3444455542489999999999999999999999999999875
No 9
>PRK11460 putative hydrolase; Provisional
Probab=98.28 E-value=1.8e-06 Score=55.71 Aligned_cols=37 Identities=19% Similarity=0.170 Sum_probs=34.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+|+|++||+.|..||++.+.+++++|++.|.+++++
T Consensus 148 ~~pvli~hG~~D~vvp~~~~~~~~~~L~~~g~~~~~~ 184 (232)
T PRK11460 148 ATTIHLIHGGEDPVIDVAHAVAAQEALISLGGDVTLD 184 (232)
T ss_pred CCcEEEEecCCCCccCHHHHHHHHHHHHHCCCCeEEE
Confidence 6899999999999999999999999999999988765
No 10
>COG0400 Predicted esterase [General function prediction only]
Probab=98.20 E-value=2.7e-06 Score=54.66 Aligned_cols=39 Identities=28% Similarity=0.267 Sum_probs=36.3
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 38 FVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 38 ~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+.+|+||+||+.|++||...+.++.+.|++.|..++..
T Consensus 144 ~~~~pill~hG~~Dpvvp~~~~~~l~~~l~~~g~~v~~~ 182 (207)
T COG0400 144 LAGTPILLSHGTEDPVVPLALAEALAEYLTASGADVEVR 182 (207)
T ss_pred cCCCeEEEeccCcCCccCHHHHHHHHHHHHHcCCCEEEE
Confidence 347999999999999999999999999999999999875
No 11
>PRK05371 x-prolyl-dipeptidyl aminopeptidase; Provisional
Probab=98.19 E-value=4.1e-06 Score=62.34 Aligned_cols=51 Identities=18% Similarity=0.124 Sum_probs=46.0
Q ss_pred ccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 24 VGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 24 ~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
+.+...|++.+++++ +.|+|++||..|..|++.++.+++++|+++|++.++
T Consensus 440 ~fW~~rn~~~~~~kI-kvPvLlIhGw~D~~V~~~~s~~ly~aL~~~g~pkkL 490 (767)
T PRK05371 440 DFWDDRNYLKDADKI-KASVLVVHGLNDWNVKPKQVYQWWDALPENGVPKKL 490 (767)
T ss_pred HHHHhCCHhhHhhCC-CCCEEEEeeCCCCCCChHHHHHHHHHHHhcCCCeEE
Confidence 346778999999999 799999999999999999999999999998887665
No 12
>TIGR02821 fghA_ester_D S-formylglutathione hydrolase. This model describes a protein family from bacteria, yeast, and human, with a conserved critical role in formaldehyde detoxification as S-formylglutathione hydrolase (EC 3.1.2.12). Members in eukaryotes such as the human protein are better known as esterase D (EC 3.1.1.1), an enzyme with broad specificity, although S-formylglutathione hydrolase has now been demonstrated as well.
Probab=98.15 E-value=4.3e-06 Score=54.94 Aligned_cols=63 Identities=21% Similarity=0.175 Sum_probs=47.4
Q ss_pred hhhhhhCCCCCCCCcccccccChhHHhccC-CCCcEEEEEeCCCCccCH-HHHHHHHHHHHhcCCCceee
Q psy7765 9 YTERYMGLPEMYDNEVGYNWTDVTRKVGNF-VNKTFLLMHGTADDNVHY-QQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 9 ~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~-~~~p~ll~hG~~D~~Vp~-~~s~~l~~al~~~g~~~~~~ 76 (77)
+...++|.+ .+. +...||...+... ..+|+++.||+.|..||. .++..+.++|+++|+++++.
T Consensus 184 ~~~~~l~~~--~~~---~~~~~~~~~~~~~~~~~plli~~G~~D~~v~~~~~~~~~~~~l~~~g~~v~~~ 248 (275)
T TIGR02821 184 AFSAYLGAD--EAA---WRSYDASLLVADGGRHSTILIDQGTADQFLDEQLRPDAFEQACRAAGQALTLR 248 (275)
T ss_pred HHHHHhccc--ccc---hhhcchHHHHhhcccCCCeeEeecCCCcccCccccHHHHHHHHHHcCCCeEEE
Confidence 345566655 222 3445677655542 158999999999999999 79999999999999999875
No 13
>PF02230 Abhydrolase_2: Phospholipase/Carboxylesterase; InterPro: IPR003140 This entry represents the alpha/beta hydrolase domain found in phospholipases [], carboxylesterases [] and thioesterases.; GO: 0016787 hydrolase activity; PDB: 3U0V_A 1AUR_A 1AUO_B 1FJ2_B 3CN9_A 3CN7_A.
Probab=98.12 E-value=4.1e-06 Score=53.15 Aligned_cols=37 Identities=24% Similarity=0.290 Sum_probs=32.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+|+|++||+.|..||++.+...++.|++.|.+++++
T Consensus 155 ~~pi~~~hG~~D~vvp~~~~~~~~~~L~~~~~~v~~~ 191 (216)
T PF02230_consen 155 KTPILIIHGDEDPVVPFEWAEKTAEFLKAAGANVEFH 191 (216)
T ss_dssp TS-EEEEEETT-SSSTHHHHHHHHHHHHCTT-GEEEE
T ss_pred CCcEEEEecCCCCcccHHHHHHHHHHHHhcCCCEEEE
Confidence 5899999999999999999999999999999988875
No 14
>COG1505 Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]
Probab=98.07 E-value=5.2e-06 Score=60.26 Aligned_cols=62 Identities=15% Similarity=0.073 Sum_probs=50.2
Q ss_pred CCCCCCCCcccccccChhHHhcc-CCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 15 GLPEMYDNEVGYNWTDVTRKVGN-FVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 15 G~p~~~~~~~~y~~~sp~~~~~~-~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
|.|..-++-....+.||.++++. .+-||+||--++.|++|+|.|+..|+.+|++.|.++-+.
T Consensus 554 G~Pd~P~d~~~l~~YSPy~nl~~g~kYP~~LITTs~~DDRVHPaHarKfaa~L~e~~~pv~~~ 616 (648)
T COG1505 554 GNPDDPEDRAFLLAYSPYHNLKPGQKYPPTLITTSLHDDRVHPAHARKFAAKLQEVGAPVLLR 616 (648)
T ss_pred CCCCCHHHHHHHHhcCchhcCCccccCCCeEEEcccccccccchHHHHHHHHHHhcCCceEEE
Confidence 66621233355667899999986 336999999999999999999999999999999887654
No 15
>PF03583 LIP: Secretory lipase ; InterPro: IPR005152 This entry represents a family of secreted lipases. Family members include the LIP lipases from Candida albicans, which are expressed and secreted during the infection cycle of these pathogens [].; GO: 0004806 triglyceride lipase activity, 0016042 lipid catabolic process
Probab=98.02 E-value=1.2e-05 Score=53.76 Aligned_cols=37 Identities=24% Similarity=0.134 Sum_probs=35.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcC-CCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAAD-IFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g-~~~~~~ 76 (77)
+.|++|.||..|..||+..+..+++.++++| .+++++
T Consensus 219 ~~Pv~i~~g~~D~vvP~~~~~~l~~~~c~~G~a~V~~~ 256 (290)
T PF03583_consen 219 TVPVLIYQGTADEVVPPADTDALVAKWCAAGGADVEYV 256 (290)
T ss_pred CCCEEEEecCCCCCCChHHHHHHHHHHHHcCCCCEEEE
Confidence 6899999999999999999999999999999 799876
No 16
>PF07859 Abhydrolase_3: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR013094 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents the catalytic domain fold-3 of alpha/beta hydrolase. ; GO: 0016787 hydrolase activity, 0008152 metabolic process; PDB: 3D7R_B 2C7B_B 3ZWQ_B 2YH2_B 3BXP_A 3D3N_A 1LZK_A 1LZL_A 2O7V_A 2O7R_A ....
Probab=97.90 E-value=2.5e-05 Score=48.75 Aligned_cols=47 Identities=19% Similarity=0.114 Sum_probs=37.3
Q ss_pred cccChhHH-hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 27 NWTDVTRK-VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 27 ~~~sp~~~-~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
...||+.. ..+. .+|++|++|+.|..+ .++..|+++|+++|++++++
T Consensus 153 ~~~sp~~~~~~~~-~Pp~~i~~g~~D~l~--~~~~~~~~~L~~~gv~v~~~ 200 (211)
T PF07859_consen 153 PLASPLNASDLKG-LPPTLIIHGEDDVLV--DDSLRFAEKLKKAGVDVELH 200 (211)
T ss_dssp TTTSGGGSSCCTT-CHEEEEEEETTSTTH--HHHHHHHHHHHHTT-EEEEE
T ss_pred ccccccccccccc-CCCeeeeccccccch--HHHHHHHHHHHHCCCCEEEE
Confidence 36788865 2222 589999999999875 58999999999999999875
No 17
>PF01738 DLH: Dienelactone hydrolase family; InterPro: IPR002925 Dienelactone hydrolases play a crucial role in chlorocatechol degradation via the modified ortho cleavage pathway. Enzymes induced in 4-fluorobenzoate-utilizing bacteria have been classified into three groups on the basis of their specificity towards cis- and trans-dienelactone []. Some proteins contain repeated small fragments of this domain (for example rat kan-1 protein).; GO: 0016787 hydrolase activity; PDB: 1GGV_A 1ZIY_A 1ZI6_A 1ZIC_A 1ZJ5_A 1ZI8_A 1ZJ4_A 1ZI9_A 1ZIX_A 3F67_A.
Probab=97.71 E-value=0.00011 Score=46.39 Aligned_cols=45 Identities=18% Similarity=0.182 Sum_probs=35.2
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
+....... +.|+++++|+.|..+|.+....+.++|+++|.+++++
T Consensus 137 ~~~~~~~~-~~P~l~~~g~~D~~~~~~~~~~~~~~l~~~~~~~~~~ 181 (218)
T PF01738_consen 137 PLEDAPKI-KAPVLILFGENDPFFPPEEVEALEEALKAAGVDVEVH 181 (218)
T ss_dssp HHHHGGG---S-EEEEEETT-TTS-HHHHHHHHHHHHCTTTTEEEE
T ss_pred chhhhccc-CCCEeecCccCCCCCChHHHHHHHHHHHhcCCcEEEE
Confidence 44456677 7999999999999999999999999999999998875
No 18
>PLN02298 hydrolase, alpha/beta fold family protein
Probab=97.70 E-value=7.3e-05 Score=49.80 Aligned_cols=38 Identities=21% Similarity=0.303 Sum_probs=31.8
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
+..+ +.|+||+||++|..||++++..+++++...+..+
T Consensus 247 l~~i-~~PvLii~G~~D~ivp~~~~~~l~~~i~~~~~~l 284 (330)
T PLN02298 247 LKDV-SIPFIVLHGSADVVTDPDVSRALYEEAKSEDKTI 284 (330)
T ss_pred hhhc-CCCEEEEecCCCCCCCHHHHHHHHHHhccCCceE
Confidence 4567 6999999999999999999999999876545443
No 19
>TIGR01840 esterase_phb esterase, PHB depolymerase family. This model describes a subfamily among lipases of the ab-hydrolase family. This subfamily includes bacterial depolymerases for poly(3-hydroxybutyrate) (PHB) and related polyhydroxyalkanoates (PHA), as well as acetyl xylan esterases, feruloyl esterases, and others from fungi.
Probab=97.64 E-value=5.9e-05 Score=47.66 Aligned_cols=30 Identities=20% Similarity=0.209 Sum_probs=27.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
.+|++|+||++|.+||++.+..++++|++.
T Consensus 168 ~p~~~i~hG~~D~vVp~~~~~~~~~~l~~~ 197 (212)
T TIGR01840 168 TPIMSVVHGDADYTVLPGNADEIRDAMLKV 197 (212)
T ss_pred CCeEEEEEcCCCceeCcchHHHHHHHHHHh
Confidence 466789999999999999999999999987
No 20
>KOG1552|consensus
Probab=97.61 E-value=2.5e-05 Score=51.67 Aligned_cols=37 Identities=22% Similarity=0.414 Sum_probs=32.4
Q ss_pred hHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 32 TRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 32 ~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
+..++.. +.|+|++||+.|.+||++|..+++++.++.
T Consensus 185 i~kI~~i-~~PVLiiHgtdDevv~~sHg~~Lye~~k~~ 221 (258)
T KOG1552|consen 185 IEKISKI-TCPVLIIHGTDDEVVDFSHGKALYERCKEK 221 (258)
T ss_pred cCcceec-cCCEEEEecccCceecccccHHHHHhcccc
Confidence 5556666 689999999999999999999999998774
No 21
>PRK10749 lysophospholipase L2; Provisional
Probab=97.59 E-value=0.00016 Score=48.46 Aligned_cols=37 Identities=19% Similarity=0.259 Sum_probs=32.3
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
.+.++ +.|+|++||++|..||++.+..+++++.+++.
T Consensus 254 ~~~~i-~~P~Lii~G~~D~vv~~~~~~~~~~~l~~~~~ 290 (330)
T PRK10749 254 GAGDI-TTPLLLLQAEEERVVDNRMHDRFCEARTAAGH 290 (330)
T ss_pred hccCC-CCCEEEEEeCCCeeeCHHHHHHHHHHHhhcCC
Confidence 45567 79999999999999999999999999987653
No 22
>PRK10162 acetyl esterase; Provisional
Probab=97.45 E-value=0.00067 Score=45.59 Aligned_cols=35 Identities=14% Similarity=0.048 Sum_probs=32.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+|++|++|+.|..++ ++..|+++|+++|++++++
T Consensus 248 lPp~~i~~g~~D~L~d--e~~~~~~~L~~aGv~v~~~ 282 (318)
T PRK10162 248 VPPCFIAGAEFDPLLD--DSRLLYQTLAAHQQPCEFK 282 (318)
T ss_pred CCCeEEEecCCCcCcC--hHHHHHHHHHHcCCCEEEE
Confidence 5899999999999875 8999999999999999875
No 23
>PLN02385 hydrolase; alpha/beta fold family protein
Probab=97.43 E-value=0.00023 Score=48.00 Aligned_cols=39 Identities=21% Similarity=0.273 Sum_probs=31.8
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
+.++ +.|+|++||++|..||+..+..+++.+...+..+.
T Consensus 275 l~~i-~~P~Lii~G~~D~vv~~~~~~~l~~~~~~~~~~l~ 313 (349)
T PLN02385 275 LEEV-SLPLLILHGEADKVTDPSVSKFLYEKASSSDKKLK 313 (349)
T ss_pred cccC-CCCEEEEEeCCCCccChHHHHHHHHHcCCCCceEE
Confidence 4567 79999999999999999999999988754444443
No 24
>PLN02652 hydrolase; alpha/beta fold family protein
Probab=97.37 E-value=0.0003 Score=48.88 Aligned_cols=39 Identities=21% Similarity=0.184 Sum_probs=31.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
+.++ +.|+|++||++|..||++++.++++.+...++.+.
T Consensus 320 L~~I-~vPvLIi~G~~D~vvp~~~a~~l~~~~~~~~k~l~ 358 (395)
T PLN02652 320 FKSV-TVPFMVLHGTADRVTDPLASQDLYNEAASRHKDIK 358 (395)
T ss_pred cccC-CCCEEEEEeCCCCCCCHHHHHHHHHhcCCCCceEE
Confidence 4566 69999999999999999999999998765444443
No 25
>PHA02857 monoglyceride lipase; Provisional
Probab=97.37 E-value=0.00034 Score=45.18 Aligned_cols=34 Identities=9% Similarity=0.250 Sum_probs=29.2
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
.+.++ +.|+|++||++|..||+..+.++.+.+..
T Consensus 204 ~l~~i-~~Pvliv~G~~D~i~~~~~~~~l~~~~~~ 237 (276)
T PHA02857 204 IIPKI-KTPILILQGTNNEISDVSGAYYFMQHANC 237 (276)
T ss_pred hcccC-CCCEEEEecCCCCcCChHHHHHHHHHccC
Confidence 34567 79999999999999999999999887644
No 26
>COG0657 Aes Esterase/lipase [Lipid metabolism]
Probab=97.30 E-value=0.00075 Score=44.86 Aligned_cols=47 Identities=13% Similarity=-0.090 Sum_probs=38.7
Q ss_pred ccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 28 WTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 28 ~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..||+....-...+|+||++|+.|..++ ++..++++|+++|++++++
T Consensus 233 ~~spl~~~~~~~lPP~~i~~a~~D~l~~--~~~~~a~~L~~agv~~~~~ 279 (312)
T COG0657 233 EASPLASDDLSGLPPTLIQTAEFDPLRD--EGEAYAERLRAAGVPVELR 279 (312)
T ss_pred ccCccccccccCCCCEEEEecCCCcchh--HHHHHHHHHHHcCCeEEEE
Confidence 4677764421114899999999999999 9999999999999999875
No 27
>PRK11071 esterase YqiA; Provisional
Probab=97.23 E-value=0.00035 Score=43.85 Aligned_cols=27 Identities=22% Similarity=0.351 Sum_probs=24.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
..|++++||++|+.||++++.+++++.
T Consensus 136 ~~~v~iihg~~De~V~~~~a~~~~~~~ 162 (190)
T PRK11071 136 PDLIWLLQQTGDEVLDYRQAVAYYAAC 162 (190)
T ss_pred hhhEEEEEeCCCCcCCHHHHHHHHHhc
Confidence 578899999999999999999999864
No 28
>PF12695 Abhydrolase_5: Alpha/beta hydrolase family; PDB: 3D0K_B 2I3D_B 3DOH_B 3DOI_B 3PFB_A 3S2Z_B 3PFC_A 3QM1_A 3PF8_B 3PF9_A ....
Probab=97.20 E-value=0.00073 Score=39.33 Aligned_cols=33 Identities=27% Similarity=0.333 Sum_probs=28.1
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
.+... +.|++++||+.|..+|++++.++++++.
T Consensus 99 ~~~~~-~~pv~~i~g~~D~~~~~~~~~~~~~~~~ 131 (145)
T PF12695_consen 99 DLAKI-RIPVLFIHGENDPLVPPEQVRRLYEALP 131 (145)
T ss_dssp HHTTT-TSEEEEEEETT-SSSHHHHHHHHHHHHC
T ss_pred hhhcc-CCcEEEEEECCCCcCCHHHHHHHHHHcC
Confidence 34456 6999999999999999999999999877
No 29
>COG0429 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=97.05 E-value=0.00027 Score=48.49 Aligned_cols=52 Identities=15% Similarity=0.151 Sum_probs=41.7
Q ss_pred hhhhhhhhhhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHH
Q psy7765 5 TNSIYTERYMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSM 60 (77)
Q Consensus 5 ~~~~~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~ 60 (77)
++..++-..+|-+ +..+-|+++|.++.+.++ .+|+||||..+|+.++++..-
T Consensus 243 FD~~~Tap~~Gf~---da~dYYr~aSs~~~L~~I-r~PtLii~A~DDP~~~~~~iP 294 (345)
T COG0429 243 FDDLLTAPLHGFA---DAEDYYRQASSLPLLPKI-RKPTLIINAKDDPFMPPEVIP 294 (345)
T ss_pred ccceeeecccCCC---cHHHHHHhcccccccccc-ccceEEEecCCCCCCChhhCC
Confidence 3445566677876 345669999999999999 799999999999999985543
No 30
>KOG1455|consensus
Probab=96.95 E-value=0.00088 Score=45.48 Aligned_cols=43 Identities=21% Similarity=0.326 Sum_probs=35.5
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
...++... +.|+|++||++|.+..+.-|..|++....+.+...
T Consensus 238 le~~l~~v-tvPflilHG~dD~VTDp~~Sk~Lye~A~S~DKTlK 280 (313)
T KOG1455|consen 238 LEKNLNEV-TVPFLILHGTDDKVTDPKVSKELYEKASSSDKTLK 280 (313)
T ss_pred HHHhcccc-cccEEEEecCCCcccCcHHHHHHHHhccCCCCcee
Confidence 33445566 79999999999999999999999998877776554
No 31
>PRK06765 homoserine O-acetyltransferase; Provisional
Probab=96.93 E-value=0.0027 Score=44.12 Aligned_cols=45 Identities=9% Similarity=0.090 Sum_probs=36.3
Q ss_pred ChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 30 DVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 30 sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
+....+..+ +.|+++++|+.|..+|+.++.++++.+...+..+++
T Consensus 314 dl~~~L~~I-~~PtLvI~G~~D~l~p~~~~~~la~~lp~~~~~a~l 358 (389)
T PRK06765 314 SLEEALSNI-EANVLMIPCKQDLLQPPRYNYKMVDILQKQGKYAEV 358 (389)
T ss_pred CHHHHHhcC-CCCEEEEEeCCCCCCCHHHHHHHHHHhhhcCCCeEE
Confidence 445566678 799999999999999999999999998765444443
No 32
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=96.90 E-value=0.0019 Score=39.75 Aligned_cols=34 Identities=24% Similarity=0.208 Sum_probs=28.5
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
..+.++ +.|+++++|+.|..||++.+..+.+.+.
T Consensus 182 ~~l~~i-~~Pvlii~g~~D~~~~~~~~~~~~~~~~ 215 (245)
T TIGR01738 182 QPLQNI-SVPFLRLYGYLDGLVPAKVVPYLDKLAP 215 (245)
T ss_pred HHHhcC-CCCEEEEeecCCcccCHHHHHHHHHhCC
Confidence 346677 7999999999999999999888877653
No 33
>COG1073 Hydrolases of the alpha/beta superfamily [General function prediction only]
Probab=96.87 E-value=0.0017 Score=41.41 Aligned_cols=44 Identities=20% Similarity=0.225 Sum_probs=36.6
Q ss_pred ccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 26 YNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 26 y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
+...++...+..+...|++++||..|..||..++..++.+.+..
T Consensus 218 ~~~~d~~~~~~~i~~~P~l~~~G~~D~~vp~~~~~~~~~~~~~~ 261 (299)
T COG1073 218 LLLLDPFDDAEKISPRPVLLVHGERDEVVPLRDAEDLYEAARER 261 (299)
T ss_pred hccCcchhhHhhcCCcceEEEecCCCcccchhhhHHHHhhhccC
Confidence 55566777777773369999999999999999999999998774
No 34
>TIGR03611 RutD pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067.
Probab=96.85 E-value=0.0019 Score=40.21 Aligned_cols=34 Identities=32% Similarity=0.424 Sum_probs=29.2
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
..+..+ +.|++++||++|..+|++++.++++.+.
T Consensus 192 ~~~~~i-~~P~l~i~g~~D~~~~~~~~~~~~~~~~ 225 (257)
T TIGR03611 192 ARLDRI-QHPVLLIANRDDMLVPYTQSLRLAAALP 225 (257)
T ss_pred HHhccc-CccEEEEecCcCcccCHHHHHHHHHhcC
Confidence 456677 7999999999999999999999888754
No 35
>KOG2112|consensus
Probab=96.77 E-value=0.0017 Score=41.89 Aligned_cols=37 Identities=27% Similarity=0.275 Sum_probs=33.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+|+++.||+.|+.||..-.....+.|+..|..+++.
T Consensus 144 ~~~i~~~Hg~~d~~vp~~~g~~s~~~l~~~~~~~~f~ 180 (206)
T KOG2112|consen 144 YTPILLCHGTADPLVPFRFGEKSAQFLKSLGVRVTFK 180 (206)
T ss_pred cchhheecccCCceeehHHHHHHHHHHHHcCCceeee
Confidence 5899999999999999999999999999999987663
No 36
>TIGR01607 PST-A Plasmodium subtelomeric family (PST-A). These genes are preferentially located in the subtelomeric regions of the chromosomes of both P. falciparum and P. yoelii.
Probab=96.74 E-value=0.0024 Score=43.18 Aligned_cols=33 Identities=15% Similarity=0.273 Sum_probs=27.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
+.|+|++||+.|..|+++.+..+++++...++.
T Consensus 270 ~~P~Lii~G~~D~vv~~~~~~~~~~~~~~~~~~ 302 (332)
T TIGR01607 270 DIPILFIHSKGDCVCSYEGTVSFYNKLSISNKE 302 (332)
T ss_pred CCCEEEEEeCCCCccCHHHHHHHHHhccCCCcE
Confidence 479999999999999999999998876544443
No 37
>TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase. This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. This model has been rebuilt to identify sequences more broadly, including a number of sequences suggested to be homoserine O-acetyltransferase based on proximity to other Met biosynthesis genes.
Probab=96.72 E-value=0.0042 Score=41.98 Aligned_cols=40 Identities=30% Similarity=0.250 Sum_probs=33.6
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
..+.++ +.|+|+++|+.|..+|+..+..+++.+......+
T Consensus 282 ~~l~~I-~~P~Lvi~G~~D~~~p~~~~~~~a~~i~~~~~~v 321 (351)
T TIGR01392 282 EALSRI-KAPFLVVSITSDWLFPPAESRELAKALPAAGLRV 321 (351)
T ss_pred HHHhhC-CCCEEEEEeCCccccCHHHHHHHHHHHhhcCCce
Confidence 345567 6999999999999999999999999988765543
No 38
>KOG1838|consensus
Probab=96.66 E-value=0.0014 Score=46.02 Aligned_cols=50 Identities=14% Similarity=0.151 Sum_probs=41.8
Q ss_pred chhhhhhhhhhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHH
Q psy7765 4 ITNSIYTERYMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQ 57 (77)
Q Consensus 4 ~~~~~~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~ 57 (77)
-+|..++...+|-+ . -.+-|+++|+.+.++++ ..|+++++..+|+.||..
T Consensus 290 eFD~~~t~~~~gf~--~-~deYY~~aSs~~~v~~I-~VP~L~ina~DDPv~p~~ 339 (409)
T KOG1838|consen 290 EFDEALTRPMFGFK--S-VDEYYKKASSSNYVDKI-KVPLLCINAADDPVVPEE 339 (409)
T ss_pred HHHhhhhhhhcCCC--c-HHHHHhhcchhhhcccc-cccEEEEecCCCCCCCcc
Confidence 35666777777877 2 45779999999999999 799999999999999974
No 39
>PF08840 BAAT_C: BAAT / Acyl-CoA thioester hydrolase C terminal; InterPro: IPR014940 Acyl-CoA thioesterases are a group of enzymes that catalyse the hydrolysis of acyl-CoAs to the free fatty acid and coenzyme A (CoASH), providing the potential to regulate intracellular levels of acyl-CoAs, free fatty acids and CoASH. Bile acid-CoA:amino acid N-acetyltransferase (BAAT) is involved in bile acid metabolism and may also act as an acyl-CoA thioesterase that regulates intracellular levels of free fatty acids []. This entry represents a catalytic domain is found at the C terminus of acyl-CoA thioester hydrolases and bile acid-CoA:amino acid N-acetyltransferases. ; PDB: 3K2I_B 3HLK_B.
Probab=96.64 E-value=0.0039 Score=39.89 Aligned_cols=37 Identities=22% Similarity=0.268 Sum_probs=26.7
Q ss_pred hccCCCCcEEEEEeCCCCccCHHH-HHHHHHHHHhcCCC
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQ-SMMLAKALEAADIF 72 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~-s~~l~~al~~~g~~ 72 (77)
++++ +.|+||+.|++|.+.|... +..+.++|+++|.+
T Consensus 111 vE~i-~~piLli~g~dD~~WpS~~~a~~i~~rL~~~~~~ 148 (213)
T PF08840_consen 111 VEKI-KGPILLISGEDDQIWPSSEMAEQIEERLKAAGFP 148 (213)
T ss_dssp GGG---SEEEEEEETT-SSS-HHHHHHHHHHHHHCTT--
T ss_pred HHHc-CCCEEEEEeCCCCccchHHHHHHHHHHHHHhCCC
Confidence 7788 7999999999999999765 55678889998877
No 40
>COG4099 Predicted peptidase [General function prediction only]
Probab=96.63 E-value=0.00096 Score=45.68 Aligned_cols=43 Identities=19% Similarity=0.193 Sum_probs=37.7
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
+++.+++.|+.++|+.+|..+|.++|.-+++.|++-+.++.+.
T Consensus 309 lv~~lk~~piWvfhs~dDkv~Pv~nSrv~y~~lk~~~~kv~Yt 351 (387)
T COG4099 309 LVRTLKKAPIWVFHSSDDKVIPVSNSRVLYERLKALDRKVNYT 351 (387)
T ss_pred hhhhhccCceEEEEecCCCccccCcceeehHHHHhhccccchh
Confidence 4666667999999999999999999999999999988876653
No 41
>TIGR02427 protocat_pcaD 3-oxoadipate enol-lactonase. Members of this family are 3-oxoadipate enol-lactonase. Note that the substrate is known as 3-oxoadipate enol-lactone, 2-oxo-2,3-dihydrofuran-5-acetate, 4,5-Dihydro-5-oxofuran-2-acetate, and 5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the catalyzes the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate all can be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this model were chosen from within protocatechuate degradation operons of at least three genes of the pathway, from genomes with the complete pathway through beta-ketoadipate.
Probab=96.57 E-value=0.0039 Score=38.39 Aligned_cols=35 Identities=14% Similarity=0.197 Sum_probs=28.5
Q ss_pred hHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 32 TRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 32 ~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
...+.+. +.|++++||+.|..+|.+.+..+.+.+.
T Consensus 186 ~~~~~~~-~~Pvlii~g~~D~~~~~~~~~~~~~~~~ 220 (251)
T TIGR02427 186 RDRLGAI-AVPTLCIAGDQDGSTPPELVREIADLVP 220 (251)
T ss_pred HHHhhhc-CCCeEEEEeccCCcCChHHHHHHHHhCC
Confidence 3445567 6999999999999999998888877653
No 42
>TIGR02240 PHA_depoly_arom poly(3-hydroxyalkanoate) depolymerase. This family consists of the polyhydroxyalkanoic acid (PHA) depolymerase of Pseudomonas oleovorans, Pseudomonas putida BM01, and related species. This enzyme is part of polyester storage and mobilization system as in many bacteria. However, species containing this enzyme are unusual in their capacity to produce aromatic polyesters when grown on carbon sources such as benzoic acid or phenylacetic acid.
Probab=96.50 E-value=0.0047 Score=40.06 Aligned_cols=34 Identities=15% Similarity=0.196 Sum_probs=28.9
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
..+.++ +.|+++++|+.|..||+.++.++.+.+.
T Consensus 201 ~~l~~i-~~P~lii~G~~D~~v~~~~~~~l~~~~~ 234 (276)
T TIGR02240 201 HWLHKI-QQPTLVLAGDDDPIIPLINMRLLAWRIP 234 (276)
T ss_pred hHhhcC-CCCEEEEEeCCCCcCCHHHHHHHHHhCC
Confidence 345677 6999999999999999999999987654
No 43
>TIGR01249 pro_imino_pep_1 proline iminopeptidase, Neisseria-type subfamily. This model represents one of two related families of proline iminopeptidase in the alpha/beta fold hydrolase family. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members.
Probab=96.47 E-value=0.0046 Score=40.95 Aligned_cols=34 Identities=24% Similarity=0.283 Sum_probs=27.8
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
.+..+...|++++||..|..||+..+.++++++.
T Consensus 242 ~~~~i~~~P~lii~g~~D~~~p~~~~~~~~~~~~ 275 (306)
T TIGR01249 242 NISKIRNIPTYIVHGRYDLCCPLQSAWALHKAFP 275 (306)
T ss_pred hhhhccCCCeEEEecCCCCCCCHHHHHHHHHhCC
Confidence 3445524799999999999999999999988754
No 44
>PF10503 Esterase_phd: Esterase PHB depolymerase
Probab=96.45 E-value=0.0038 Score=40.53 Aligned_cols=30 Identities=27% Similarity=0.289 Sum_probs=26.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
..|++++||+.|..|.+.+..++.+....-
T Consensus 169 ~~P~~v~hG~~D~tV~~~n~~~~~~q~~~~ 198 (220)
T PF10503_consen 169 GYPRIVFHGTADTTVNPQNADQLVAQWLNV 198 (220)
T ss_pred CCCEEEEecCCCCccCcchHHHHHHHHHHc
Confidence 469999999999999999999988887654
No 45
>PRK05077 frsA fermentation/respiration switch protein; Reviewed
Probab=96.45 E-value=0.0045 Score=43.29 Aligned_cols=30 Identities=13% Similarity=-0.023 Sum_probs=26.1
Q ss_pred ccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 36 GNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 36 ~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
..+ +.|+|++||+.|..||++++..++++.
T Consensus 352 ~~i-~~PvLiI~G~~D~ivP~~~a~~l~~~~ 381 (414)
T PRK05077 352 RRC-PTPMLSGYWKNDPFSPEEDSRLIASSS 381 (414)
T ss_pred cCC-CCcEEEEecCCCCCCCHHHHHHHHHhC
Confidence 356 699999999999999999999887654
No 46
>PRK00175 metX homoserine O-acetyltransferase; Provisional
Probab=96.45 E-value=0.0096 Score=40.86 Aligned_cols=40 Identities=23% Similarity=0.111 Sum_probs=33.6
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
.+.++ +.|+|+++|..|..+|+..+..+++.+...+..++
T Consensus 304 ~l~~I-~~PtLvI~G~~D~~~p~~~~~~la~~i~~a~~~~~ 343 (379)
T PRK00175 304 ALARI-KARFLVVSFTSDWLFPPARSREIVDALLAAGADVS 343 (379)
T ss_pred HHhcC-CCCEEEEEECCccccCHHHHHHHHHHHHhcCCCeE
Confidence 45567 68999999999999999999999999887765433
No 47
>TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself.
Probab=96.39 E-value=0.0049 Score=39.62 Aligned_cols=34 Identities=18% Similarity=0.296 Sum_probs=29.0
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
..+.++ +.|+++++|+.|..||+..+.++++.+.
T Consensus 217 ~~l~~i-~~Pvlli~G~~D~~v~~~~~~~~~~~~~ 250 (282)
T TIGR03343 217 ARLGEI-KAKTLVTWGRDDRFVPLDHGLKLLWNMP 250 (282)
T ss_pred HHHhhC-CCCEEEEEccCCCcCCchhHHHHHHhCC
Confidence 345677 7999999999999999999999888764
No 48
>PF02129 Peptidase_S15: X-Pro dipeptidyl-peptidase (S15 family); InterPro: IPR000383 This entry represents a domain found peptidases Xaa-Pro dipeptidyl-peptidase and glutaryl-7-aminocephalosporanic-acid acylase, which belong to MEROPS peptidase families S15 and S45 respectively []. It is also found in hydrolases from the CocE/NonD family. Cocaine esterase (CocE) hydrolyzes cocaine endowing the bacteria with the ability to utilise cocaine as a sole source of carbon and energy []. ; GO: 0004177 aminopeptidase activity, 0006508 proteolysis; PDB: 1LNS_A 3PUI_A 3PUH_B 1JU3_A 3I2I_A 3I2G_A 1JU4_A 3I2K_A 3IDA_A 3I2H_A ....
Probab=96.03 E-value=0.018 Score=37.72 Aligned_cols=51 Identities=12% Similarity=0.066 Sum_probs=41.0
Q ss_pred cccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcC-CCcee
Q psy7765 23 EVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAAD-IFFYQ 75 (77)
Q Consensus 23 ~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g-~~~~~ 75 (77)
.+.+.+.++...+.++ +.|+|+++|-.|.... ..+.+.+++|++.+ .+..+
T Consensus 212 ~~~w~~~~~~~~~~~i-~vP~l~v~Gw~D~~~~-~~~~~~~~~l~~~~~~~~~L 263 (272)
T PF02129_consen 212 DPFWQERSPSERLDKI-DVPVLIVGGWYDTLFL-RGALRAYEALRAPGSKPQRL 263 (272)
T ss_dssp SHHHHTTBHHHHHGG---SEEEEEEETTCSSTS-HHHHHHHHHHCTTSTC-EEE
T ss_pred CHHHHhCChHHHHhhC-CCCEEEecccCCcccc-hHHHHHHHHhhcCCCCCCEE
Confidence 4567788888888898 7999999999998888 89999999999988 54444
No 49
>PRK07581 hypothetical protein; Validated
Probab=96.01 E-value=0.02 Score=38.33 Aligned_cols=34 Identities=12% Similarity=0.075 Sum_probs=28.8
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
..+.++ +.|+|+++|+.|..+|+..+..+++.+.
T Consensus 269 ~~L~~I-~~PtLvI~G~~D~~~p~~~~~~l~~~ip 302 (339)
T PRK07581 269 AALGSI-TAKTFVMPISTDLYFPPEDCEAEAALIP 302 (339)
T ss_pred HHHhcC-CCCEEEEEeCCCCCCCHHHHHHHHHhCC
Confidence 345667 6999999999999999999998887653
No 50
>TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (PFAM model pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity.
Probab=96.00 E-value=0.014 Score=37.09 Aligned_cols=32 Identities=19% Similarity=0.211 Sum_probs=26.8
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
+.++ ..|+++++|++|..||.+.+..+.+.+.
T Consensus 216 ~~~i-~~P~lii~g~~D~~vp~~~~~~~~~~~~ 247 (278)
T TIGR03056 216 LPRI-TIPLHLIAGEEDKAVPPDESKRAATRVP 247 (278)
T ss_pred cccC-CCCEEEEEeCCCcccCHHHHHHHHHhcc
Confidence 4556 6899999999999999999888876653
No 51
>PRK10349 carboxylesterase BioH; Provisional
Probab=96.00 E-value=0.0066 Score=38.75 Aligned_cols=36 Identities=25% Similarity=0.255 Sum_probs=28.4
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
....+.++ +.|+++++|+.|..+|.+.+..+.+.+.
T Consensus 188 ~~~~l~~i-~~P~lii~G~~D~~~~~~~~~~~~~~i~ 223 (256)
T PRK10349 188 LRQPLQNV-SMPFLRLYGYLDGLVPRKVVPMLDKLWP 223 (256)
T ss_pred cHHHHhhc-CCCeEEEecCCCccCCHHHHHHHHHhCC
Confidence 33456677 7999999999999999988877766653
No 52
>TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit. This model represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (TIGR01834) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerize short-chain-length hydroxyalkanoates.
Probab=95.95 E-value=0.0088 Score=40.45 Aligned_cols=33 Identities=18% Similarity=0.157 Sum_probs=28.5
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
+.++ +.|+++++|+.|..+|++.+..+++.+..
T Consensus 282 l~~i-~~Pvliv~G~~D~i~~~~~~~~~~~~~~~ 314 (350)
T TIGR01836 282 LKNI-KMPILNIYAERDHLVPPDASKALNDLVSS 314 (350)
T ss_pred HHhC-CCCeEEEecCCCCcCCHHHHHHHHHHcCC
Confidence 5566 69999999999999999999999887653
No 53
>PRK10985 putative hydrolase; Provisional
Probab=95.94 E-value=0.0072 Score=40.52 Aligned_cols=40 Identities=13% Similarity=0.042 Sum_probs=32.6
Q ss_pred cccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHH
Q psy7765 23 EVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLA 63 (77)
Q Consensus 23 ~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~ 63 (77)
.+.|...++...+.++ ..|+++++|..|..++++....+.
T Consensus 239 ~~~y~~~~~~~~l~~i-~~P~lii~g~~D~~~~~~~~~~~~ 278 (324)
T PRK10985 239 IDYYRQCSALPLLNQI-RKPTLIIHAKDDPFMTHEVIPKPE 278 (324)
T ss_pred HHHHHHCChHHHHhCC-CCCEEEEecCCCCCCChhhChHHH
Confidence 3456677777788888 799999999999999987776654
No 54
>PF08386 Abhydrolase_4: TAP-like protein; InterPro: IPR013595 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents a C-terminal domain associated with putative hydrolases and bacterial peptidases that belong to MEROPS peptidase family S33 (clan SC). They are related to a tripeptidyl aminopeptidase from Streptomyces lividans (Q54410 from SWISSPROT). A member of this family (Q6E3K7 from SWISSPROT) is thought to be involved in the C-terminal processing of propionicin F, a bacteriocidin characterised from Propionibacterium freudenreichii []. ; GO: 0008233 peptidase activity
Probab=95.81 E-value=0.014 Score=33.42 Aligned_cols=29 Identities=24% Similarity=0.367 Sum_probs=27.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
..|+|++.|+.|...|++.+..+.++|..
T Consensus 34 ~~piL~l~~~~Dp~TP~~~a~~~~~~l~~ 62 (103)
T PF08386_consen 34 APPILVLGGTHDPVTPYEGARAMAARLPG 62 (103)
T ss_pred CCCEEEEecCcCCCCcHHHHHHHHHHCCC
Confidence 59999999999999999999999998865
No 55
>PLN02511 hydrolase
Probab=95.76 E-value=0.0066 Score=41.95 Aligned_cols=34 Identities=9% Similarity=0.111 Sum_probs=29.2
Q ss_pred cccccChhHHhccCCCCcEEEEEeCCCCccCHHHH
Q psy7765 25 GYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQS 59 (77)
Q Consensus 25 ~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s 59 (77)
-|...|+...+.++ +.|+|+++|.+|+.+|+...
T Consensus 284 yy~~~s~~~~L~~I-~vPtLiI~g~dDpi~p~~~~ 317 (388)
T PLN02511 284 YYSNSSSSDSIKHV-RVPLLCIQAANDPIAPARGI 317 (388)
T ss_pred HHHHcCchhhhccC-CCCeEEEEcCCCCcCCcccC
Confidence 36678888889999 79999999999999997654
No 56
>PLN03087 BODYGUARD 1 domain containing hydrolase; Provisional
Probab=95.66 E-value=0.015 Score=41.74 Aligned_cols=30 Identities=13% Similarity=0.257 Sum_probs=26.7
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
++ +.|++|+||.+|..+|++.+..+++.+.
T Consensus 416 ~I-~vPtLII~Ge~D~ivP~~~~~~la~~iP 445 (481)
T PLN03087 416 QL-KCDVAIFHGGDDELIPVECSYAVKAKVP 445 (481)
T ss_pred hC-CCCEEEEEECCCCCCCHHHHHHHHHhCC
Confidence 56 6999999999999999999999987763
No 57
>PRK06489 hypothetical protein; Provisional
Probab=95.64 E-value=0.018 Score=39.09 Aligned_cols=35 Identities=9% Similarity=0.151 Sum_probs=27.7
Q ss_pred hHHhccCCCCcEEEEEeCCCCccCHHHH--HHHHHHHH
Q psy7765 32 TRKVGNFVNKTFLLMHGTADDNVHYQQS--MMLAKALE 67 (77)
Q Consensus 32 ~~~~~~~~~~p~ll~hG~~D~~Vp~~~s--~~l~~al~ 67 (77)
...+.++ +.|+|+++|+.|..+|++.+ ..+++.+.
T Consensus 285 ~~~L~~I-~~PvLvI~G~~D~~~p~~~~~~~~la~~ip 321 (360)
T PRK06489 285 SPDLEKI-KAPVLAINSADDERNPPETGVMEAALKRVK 321 (360)
T ss_pred HHHHHhC-CCCEEEEecCCCcccChhhHHHHHHHHhCc
Confidence 3346677 79999999999999999876 66766653
No 58
>COG2267 PldB Lysophospholipase [Lipid metabolism]
Probab=95.64 E-value=0.015 Score=39.12 Aligned_cols=35 Identities=17% Similarity=0.169 Sum_probs=28.2
Q ss_pred ccCCCCcEEEEEeCCCCccC-HHHHHHHHHHHHhcCC
Q psy7765 36 GNFVNKTFLLMHGTADDNVH-YQQSMMLAKALEAADI 71 (77)
Q Consensus 36 ~~~~~~p~ll~hG~~D~~Vp-~~~s~~l~~al~~~g~ 71 (77)
... ..|+||+||++|.+|+ .+.+.++++++...++
T Consensus 225 ~~~-~~PvLll~g~~D~vv~~~~~~~~~~~~~~~~~~ 260 (298)
T COG2267 225 PAI-ALPVLLLQGGDDRVVDNVEGLARFFERAGSPDK 260 (298)
T ss_pred ccc-cCCEEEEecCCCccccCcHHHHHHHHhcCCCCc
Confidence 344 6999999999999999 7888888877655543
No 59
>PF07519 Tannase: Tannase and feruloyl esterase; InterPro: IPR011118 This family includes fungal tannase [] and feruloyl esterase [, ]. It also includes several bacterial homologues of unknown function.
Probab=95.62 E-value=0.011 Score=42.14 Aligned_cols=31 Identities=23% Similarity=0.208 Sum_probs=28.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
...+|++||..|..||+..++.+++++.+.-
T Consensus 353 GGKLI~~HG~aD~~I~p~~ti~YY~~V~~~~ 383 (474)
T PF07519_consen 353 GGKLILYHGWADPLIPPQGTIDYYERVVARM 383 (474)
T ss_pred CCeEEEEecCCCCccCCCcHHHHHHHHHHhc
Confidence 5899999999999999999999999998763
No 60
>PRK03592 haloalkane dehalogenase; Provisional
Probab=95.62 E-value=0.027 Score=36.77 Aligned_cols=32 Identities=6% Similarity=0.013 Sum_probs=23.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
+..+ +.|++++||..|..+++....++...+.
T Consensus 224 l~~i-~~P~lii~G~~D~~~~~~~~~~~~~~~~ 255 (295)
T PRK03592 224 LATS-DVPKLLINAEPGAILTTGAIRDWCRSWP 255 (295)
T ss_pred hccC-CCCeEEEeccCCcccCcHHHHHHHHHhh
Confidence 4456 7999999999999996666556555443
No 61
>PLN02578 hydrolase
Probab=95.60 E-value=0.023 Score=38.55 Aligned_cols=33 Identities=15% Similarity=0.164 Sum_probs=28.0
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
.+.++ +.|++++||+.|..||...+.++.+.+.
T Consensus 291 ~l~~i-~~PvLiI~G~~D~~v~~~~~~~l~~~~p 323 (354)
T PLN02578 291 LLSKL-SCPLLLLWGDLDPWVGPAKAEKIKAFYP 323 (354)
T ss_pred HhhcC-CCCEEEEEeCCCCCCCHHHHHHHHHhCC
Confidence 45667 7999999999999999999888877653
No 62
>PF00561 Abhydrolase_1: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR000073 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents fold-1 of alpha/beta hydrolase.; PDB: 2VAT_E 2VAX_C 2VAV_H 2PSJ_A 2PSH_B 2PSE_A 2PSF_A 2PSD_A 2EDA_A 1CIJ_A ....
Probab=95.58 E-value=0.014 Score=35.91 Aligned_cols=38 Identities=26% Similarity=0.249 Sum_probs=30.5
Q ss_pred ccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 28 WTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 28 ~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
..++...+... +.|+++++|+.|..+|+..+..+.+.+
T Consensus 164 ~~~~~~~l~~i-~~p~l~i~~~~D~~~p~~~~~~~~~~~ 201 (230)
T PF00561_consen 164 VWDPSPALSNI-KVPTLIIWGEDDPLVPPESSEQLAKLI 201 (230)
T ss_dssp HHHHHHHHTTT-TSEEEEEEETTCSSSHHHHHHHHHHHS
T ss_pred ccccccccccc-CCCeEEEEeCCCCCCCHHHHHHHHHhc
Confidence 34445556677 799999999999999999999976554
No 63
>PRK00870 haloalkane dehalogenase; Provisional
Probab=95.56 E-value=0.016 Score=38.07 Aligned_cols=32 Identities=19% Similarity=0.288 Sum_probs=26.1
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
+..+ +.|++++||+.|..+|... ..+.+.+..
T Consensus 235 l~~i-~~P~lii~G~~D~~~~~~~-~~~~~~~~~ 266 (302)
T PRK00870 235 LERW-DKPFLTAFSDSDPITGGGD-AILQKRIPG 266 (302)
T ss_pred hhcC-CCceEEEecCCCCcccCch-HHHHhhccc
Confidence 4567 7999999999999999866 777776654
No 64
>PLN02965 Probable pheophorbidase
Probab=95.40 E-value=0.023 Score=36.41 Aligned_cols=31 Identities=3% Similarity=-0.010 Sum_probs=26.7
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
.+ +.|+++++|++|..||+..+..+++.+..
T Consensus 191 ~i-~vP~lvi~g~~D~~~~~~~~~~~~~~~~~ 221 (255)
T PLN02965 191 AE-KVPRVYIKTAKDNLFDPVRQDVMVENWPP 221 (255)
T ss_pred cC-CCCEEEEEcCCCCCCCHHHHHHHHHhCCc
Confidence 46 79999999999999999999888876543
No 65
>KOG4667|consensus
Probab=95.38 E-value=0.017 Score=38.08 Aligned_cols=29 Identities=24% Similarity=0.359 Sum_probs=26.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
..|+|-+||..|.+||++.+.+|++.+..
T Consensus 199 ~C~VLTvhGs~D~IVPve~AkefAk~i~n 227 (269)
T KOG4667|consen 199 QCRVLTVHGSEDEIVPVEDAKEFAKIIPN 227 (269)
T ss_pred cCceEEEeccCCceeechhHHHHHHhccC
Confidence 68999999999999999999999997655
No 66
>PF12697 Abhydrolase_6: Alpha/beta hydrolase family; PDB: 3LLC_A 3A2N_E 3A2M_A 3A2L_A 3AFI_F 3C5V_A 3C5W_P 3E0X_A 2ZJF_A 3QYJ_A ....
Probab=95.27 E-value=0.025 Score=34.05 Aligned_cols=37 Identities=24% Similarity=0.360 Sum_probs=29.4
Q ss_pred cChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 29 TDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 29 ~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
.++...+... +.|+++++|+.|..++......+.+.+
T Consensus 166 ~~~~~~~~~~-~~pvl~i~g~~D~~~~~~~~~~~~~~~ 202 (228)
T PF12697_consen 166 ADLSEALPRI-KVPVLVIHGEDDPIVPPESAEELADKL 202 (228)
T ss_dssp HHHHHHHHGS-SSEEEEEEETTSSSSHHHHHHHHHHHS
T ss_pred cccccccccc-CCCeEEeecCCCCCCCHHHHHHHHHHC
Confidence 3444566777 799999999999999987777777654
No 67
>PRK08775 homoserine O-acetyltransferase; Provisional
Probab=95.17 E-value=0.027 Score=37.95 Aligned_cols=32 Identities=16% Similarity=0.099 Sum_probs=27.5
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
+.++ +.|+|+++|+.|..+|+..+.++++++.
T Consensus 273 l~~I-~~PtLvi~G~~D~~~p~~~~~~~~~~i~ 304 (343)
T PRK08775 273 PEAI-RVPTVVVAVEGDRLVPLADLVELAEGLG 304 (343)
T ss_pred hhcC-CCCeEEEEeCCCEeeCHHHHHHHHHHcC
Confidence 3456 6899999999999999999999888764
No 68
>KOG1454|consensus
Probab=95.10 E-value=0.048 Score=37.14 Aligned_cols=38 Identities=18% Similarity=0.240 Sum_probs=29.8
Q ss_pred cChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 29 TDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 29 ~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
.++...+.+..+.|++|++|..|..||.+.+..+.+.+
T Consensus 253 ~~~~~~~~~i~~~pvlii~G~~D~~~p~~~~~~~~~~~ 290 (326)
T KOG1454|consen 253 ENLLSLIKKIWKCPVLIIWGDKDQIVPLELAEELKKKL 290 (326)
T ss_pred chHHHhhccccCCceEEEEcCcCCccCHHHHHHHHhhC
Confidence 34555677773599999999999999999777776654
No 69
>PLN02824 hydrolase, alpha/beta fold family protein
Probab=94.96 E-value=0.039 Score=36.00 Aligned_cols=31 Identities=13% Similarity=0.033 Sum_probs=25.9
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
.+.++ +.|+++++|++|..+|...+..+.+.
T Consensus 229 ~l~~i-~~P~lvi~G~~D~~~~~~~~~~~~~~ 259 (294)
T PLN02824 229 LLPAV-KCPVLIAWGEKDPWEPVELGRAYANF 259 (294)
T ss_pred HHhhc-CCCeEEEEecCCCCCChHHHHHHHhc
Confidence 35667 79999999999999999988876554
No 70
>PLN02679 hydrolase, alpha/beta fold family protein
Probab=94.96 E-value=0.07 Score=36.36 Aligned_cols=35 Identities=11% Similarity=0.156 Sum_probs=25.0
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHH-HHHHHHHHh
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQS-MMLAKALEA 68 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s-~~l~~al~~ 68 (77)
..+..+ +.|+|+++|++|..+|+... .++.+.+.+
T Consensus 286 ~~l~~i-~~PtLii~G~~D~~~p~~~~~~~~~~~l~~ 321 (360)
T PLN02679 286 KLIPRI-SLPILVLWGDQDPFTPLDGPVGKYFSSLPS 321 (360)
T ss_pred HHhhhc-CCCEEEEEeCCCCCcCchhhHHHHHHhhhc
Confidence 345667 69999999999999998753 234444433
No 71
>COG0412 Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=94.84 E-value=0.07 Score=34.73 Aligned_cols=40 Identities=23% Similarity=0.135 Sum_probs=34.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
..+. +.|+++.+|+.|..+|......+.+++.++++.+++
T Consensus 154 ~~~~-~~pvl~~~~~~D~~~p~~~~~~~~~~~~~~~~~~~~ 193 (236)
T COG0412 154 APKI-KVPVLLHLAGEDPYIPAADVDALAAALEDAGVKVDL 193 (236)
T ss_pred cccc-cCcEEEEecccCCCCChhHHHHHHHHHHhcCCCeeE
Confidence 4466 799999999999999999999999999999765543
No 72
>PRK14875 acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional
Probab=94.59 E-value=0.045 Score=36.63 Aligned_cols=31 Identities=13% Similarity=0.241 Sum_probs=25.3
Q ss_pred hHHhccCCCCcEEEEEeCCCCccCHHHHHHHH
Q psy7765 32 TRKVGNFVNKTFLLMHGTADDNVHYQQSMMLA 63 (77)
Q Consensus 32 ~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~ 63 (77)
...+.+. +.|++++||+.|..||+.++..+.
T Consensus 307 ~~~l~~i-~~Pvlii~g~~D~~vp~~~~~~l~ 337 (371)
T PRK14875 307 RDRLASL-AIPVLVIWGEQDRIIPAAHAQGLP 337 (371)
T ss_pred HHHHhcC-CCCEEEEEECCCCccCHHHHhhcc
Confidence 3356677 699999999999999998876654
No 73
>PF03959 FSH1: Serine hydrolase (FSH1); InterPro: IPR005645 This entry represents proteins belonging to the AB hydrolase family. It consists of serine hydrolases of unknown specificity [, ] and includes uncharacterised proteins.; PDB: 1YCD_A.
Probab=94.49 E-value=0.066 Score=34.06 Aligned_cols=34 Identities=21% Similarity=0.178 Sum_probs=25.0
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
...+ ..|+|-++|.+|..|+.+.+..|++.....
T Consensus 157 ~~~i-~iPtlHv~G~~D~~~~~~~s~~L~~~~~~~ 190 (212)
T PF03959_consen 157 EPKI-SIPTLHVIGENDPVVPPERSEALAEMFDPD 190 (212)
T ss_dssp -TT----EEEEEEETT-SSS-HHHHHHHHHHHHHH
T ss_pred cccC-CCCeEEEEeCCCCCcchHHHHHHHHhccCC
Confidence 3456 699999999999999999999999988764
No 74
>PF05705 DUF829: Eukaryotic protein of unknown function (DUF829); InterPro: IPR008547 This signature identifies Transmembrane protein 53, that have no known function but are predicted to be integral membrane proteins.
Probab=94.06 E-value=0.15 Score=32.64 Aligned_cols=37 Identities=11% Similarity=0.002 Sum_probs=34.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..|.+.+.+++|..||.+..+++++..++.|.+++..
T Consensus 178 ~~p~lylYS~~D~l~~~~~ve~~~~~~~~~G~~V~~~ 214 (240)
T PF05705_consen 178 RCPRLYLYSKADPLIPWRDVEEHAEEARRKGWDVRAE 214 (240)
T ss_pred CCCeEEecCCCCcCcCHHHHHHHHHHHHHcCCeEEEe
Confidence 5899999999999999999999999999999987754
No 75
>TIGR01250 pro_imino_pep_2 proline-specific peptidases, Bacillus coagulans-type subfamily. This model describes a subfamily of the alpha/beta fold family of hydrolases. Characterized members include prolinases (Pro-Xaa dipeptidase, EC 3.4.13.8), prolyl aminopeptidases (EC 3.4.11.5), and a leucyl aminopeptidase
Probab=94.01 E-value=0.13 Score=32.28 Aligned_cols=32 Identities=16% Similarity=0.218 Sum_probs=24.7
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
..+.+. +.|+++++|+.|. +++..+..+.+.+
T Consensus 225 ~~l~~i-~~P~lii~G~~D~-~~~~~~~~~~~~~ 256 (288)
T TIGR01250 225 DKLSEI-KVPTLLTVGEFDT-MTPEAAREMQELI 256 (288)
T ss_pred HHhhcc-CCCEEEEecCCCc-cCHHHHHHHHHhc
Confidence 445667 6999999999998 5677777777654
No 76
>PRK10673 acyl-CoA esterase; Provisional
Probab=93.79 E-value=0.095 Score=33.07 Aligned_cols=30 Identities=13% Similarity=0.095 Sum_probs=24.9
Q ss_pred ccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 36 GNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 36 ~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
..+ ..|+++++|+.|..|+.+.+..+.+.+
T Consensus 192 ~~~-~~P~l~i~G~~D~~~~~~~~~~~~~~~ 221 (255)
T PRK10673 192 PAW-PHPALFIRGGNSPYVTEAYRDDLLAQF 221 (255)
T ss_pred CCC-CCCeEEEECCCCCCCCHHHHHHHHHhC
Confidence 345 589999999999999988888877654
No 77
>PF11144 DUF2920: Protein of unknown function (DUF2920); InterPro: IPR022605 This bacterial family of proteins has no known function.
Probab=93.74 E-value=0.18 Score=35.64 Aligned_cols=37 Identities=14% Similarity=0.074 Sum_probs=33.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
++-....|+..|..+|.++-.+|++.|++.|.+++++
T Consensus 293 ~~~yvsYHs~~D~~~p~~~K~~l~~~l~~lgfda~l~ 329 (403)
T PF11144_consen 293 KIIYVSYHSIKDDLAPAEDKEELYEILKNLGFDATLH 329 (403)
T ss_pred ceEEEEEeccCCCCCCHHHHHHHHHHHHHcCCCeEEE
Confidence 4667788999999999999999999999999999875
No 78
>PLN02211 methyl indole-3-acetate methyltransferase
Probab=93.57 E-value=0.1 Score=34.22 Aligned_cols=28 Identities=14% Similarity=0.083 Sum_probs=25.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
+.|+++|+|++|..+|++....|++.+.
T Consensus 211 ~vP~l~I~g~~D~~ip~~~~~~m~~~~~ 238 (273)
T PLN02211 211 KVPRVYIKTLHDHVVKPEQQEAMIKRWP 238 (273)
T ss_pred ccceEEEEeCCCCCCCHHHHHHHHHhCC
Confidence 4799999999999999999999987753
No 79
>PLN02872 triacylglycerol lipase
Probab=93.47 E-value=0.098 Score=36.60 Aligned_cols=46 Identities=9% Similarity=-0.025 Sum_probs=32.9
Q ss_pred cccccccC-hhHHhccCC-CCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 23 EVGYNWTD-VTRKVGNFV-NKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 23 ~~~y~~~s-p~~~~~~~~-~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
...|-... |...++++. +.|+++++|+.|..|++..+.++.+.|..
T Consensus 306 ~~~Yg~~~pP~Y~l~~i~~~~Pv~i~~G~~D~lv~~~dv~~l~~~Lp~ 353 (395)
T PLN02872 306 LKLYGQVNPPAFDLSLIPKSLPLWMGYGGTDGLADVTDVEHTLAELPS 353 (395)
T ss_pred HHHhCCCCCCCcCcccCCCCccEEEEEcCCCCCCCHHHHHHHHHHCCC
Confidence 33444433 333455552 37999999999999999999999888754
No 80
>COG1647 Esterase/lipase [General function prediction only]
Probab=93.32 E-value=0.092 Score=34.63 Aligned_cols=40 Identities=18% Similarity=0.273 Sum_probs=33.0
Q ss_pred hHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 32 TRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 32 ~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
...++.+ ..|+|+++|.+|..||.+.+.-+++........
T Consensus 174 ~~~~~~I-~~pt~vvq~~~D~mv~~~sA~~Iy~~v~s~~Ke 213 (243)
T COG1647 174 RRSLDKI-YSPTLVVQGRQDEMVPAESANFIYDHVESDDKE 213 (243)
T ss_pred Hhhhhhc-ccchhheecccCCCCCHHHHHHHHHhccCCcce
Confidence 4456677 799999999999999999999999887655443
No 81
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=92.90 E-value=0.11 Score=38.81 Aligned_cols=58 Identities=16% Similarity=0.030 Sum_probs=45.5
Q ss_pred CCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 15 GLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 15 G~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
|.|.+.+.-+-....||..++...+-||+|..-|..|.+|.+-+-.+.+++|+..+..
T Consensus 584 GNP~d~e~y~yikSYSPYdNV~a~~YP~ilv~~Gl~D~rV~YwEpAKWvAkLR~~~td 641 (682)
T COG1770 584 GNPLDPEYYDYIKSYSPYDNVEAQPYPAILVTTGLNDPRVQYWEPAKWVAKLRELKTD 641 (682)
T ss_pred CCcCCHHHHHHHhhcCchhccccCCCCceEEEccccCCccccchHHHHHHHHhhcccC
Confidence 7773112223334579999998766799999999999999999999999999987654
No 82
>TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I. This model represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=92.85 E-value=0.15 Score=37.15 Aligned_cols=34 Identities=12% Similarity=0.110 Sum_probs=28.7
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
..+..+ +.|+|+++|..|.+||+..+..+.+.+-
T Consensus 409 ~dL~~I-~vPvLvV~G~~D~IvP~~sa~~l~~~i~ 442 (532)
T TIGR01838 409 LDLSKV-KVPVYIIATREDHIAPWQSAYRGAALLG 442 (532)
T ss_pred cchhhC-CCCEEEEeeCCCCcCCHHHHHHHHHHCC
Confidence 357778 7999999999999999999988876543
No 83
>PRK07868 acyl-CoA synthetase; Validated
Probab=92.68 E-value=0.15 Score=39.29 Aligned_cols=32 Identities=16% Similarity=0.140 Sum_probs=28.0
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
+.++ +.|+|+++|..|..+|+..+..+.+.+-
T Consensus 293 L~~i-~~P~L~i~G~~D~ivp~~~~~~l~~~i~ 324 (994)
T PRK07868 293 LADI-TCPVLAFVGEVDDIGQPASVRGIRRAAP 324 (994)
T ss_pred hhhC-CCCEEEEEeCCCCCCCHHHHHHHHHhCC
Confidence 5677 6999999999999999999999977653
No 84
>PLN03084 alpha/beta hydrolase fold protein; Provisional
Probab=92.10 E-value=0.34 Score=33.78 Aligned_cols=28 Identities=18% Similarity=0.302 Sum_probs=24.3
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
++ +.|+++++|+.|..+++..+.++++.
T Consensus 323 ~i-~vPvLiI~G~~D~~v~~~~~~~~a~~ 350 (383)
T PLN03084 323 NW-KTPITVCWGLRDRWLNYDGVEDFCKS 350 (383)
T ss_pred cC-CCCEEEEeeCCCCCcCHHHHHHHHHh
Confidence 45 68999999999999999988887775
No 85
>PRK03204 haloalkane dehalogenase; Provisional
Probab=91.60 E-value=0.26 Score=32.37 Aligned_cols=28 Identities=21% Similarity=0.186 Sum_probs=21.4
Q ss_pred CCcEEEEEeCCCCccCHHH-HHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQ-SMMLAKALE 67 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~-s~~l~~al~ 67 (77)
+.|+++++|..|..+++.. ...+.+.+.
T Consensus 227 ~~PtliI~G~~D~~~~~~~~~~~~~~~ip 255 (286)
T PRK03204 227 TKPTLLVWGMKDVAFRPKTILPRLRATFP 255 (286)
T ss_pred CCCeEEEecCCCcccCcHHHHHHHHHhcC
Confidence 5999999999999987654 555665543
No 86
>PRK05855 short chain dehydrogenase; Validated
Probab=91.58 E-value=0.15 Score=36.05 Aligned_cols=27 Identities=19% Similarity=0.012 Sum_probs=23.7
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 38 FVNKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 38 ~~~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
. +.|+++++|.+|..||+.+...+.+.
T Consensus 232 ~-~~P~lii~G~~D~~v~~~~~~~~~~~ 258 (582)
T PRK05855 232 T-DVPVQLIVPTGDPYVRPALYDDLSRW 258 (582)
T ss_pred c-cCceEEEEeCCCcccCHHHhcccccc
Confidence 5 68999999999999999988887654
No 87
>PF06821 Ser_hydrolase: Serine hydrolase; InterPro: IPR010662 This family contains a number of hypothetical bacterial proteins of unknown function, which may be cytosolic. The Crystal Structure Of The Yden Gene Product Swiss:P96671 from B. Subtilis has been solved. The structure shows an alpha-beta hydrolase fold suggesting an enzymatic function for these proteins [].; GO: 0016787 hydrolase activity; PDB: 3BDV_B 2QS9_A 1UXO_A.
Probab=91.24 E-value=0.39 Score=29.82 Aligned_cols=28 Identities=21% Similarity=0.298 Sum_probs=23.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
.|.+++.+++|+.||+..+.++++++..
T Consensus 115 ~~~~viaS~nDp~vp~~~a~~~A~~l~a 142 (171)
T PF06821_consen 115 FPSIVIASDNDPYVPFERAQRLAQRLGA 142 (171)
T ss_dssp CCEEEEEETTBSSS-HHHHHHHHHHHT-
T ss_pred CCeEEEEcCCCCccCHHHHHHHHHHcCC
Confidence 5569999999999999999999998843
No 88
>KOG4391|consensus
Probab=90.31 E-value=0.39 Score=32.00 Aligned_cols=31 Identities=23% Similarity=0.282 Sum_probs=26.6
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
+... ..|.|++-|.+|..|||.+-.+++...
T Consensus 217 i~~~-~~P~LFiSGlkDelVPP~~Mr~Ly~~c 247 (300)
T KOG4391|consen 217 IGQC-RMPFLFISGLKDELVPPVMMRQLYELC 247 (300)
T ss_pred hccc-cCceEEeecCccccCCcHHHHHHHHhC
Confidence 3345 699999999999999999999998754
No 89
>KOG2551|consensus
Probab=90.24 E-value=0.28 Score=32.27 Aligned_cols=34 Identities=24% Similarity=0.266 Sum_probs=29.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
...+ ..|.|-+.|+.|++||..-|..|++...++
T Consensus 159 ~~~i-~~PSLHi~G~~D~iv~~~~s~~L~~~~~~a 192 (230)
T KOG2551|consen 159 KRPL-STPSLHIFGETDTIVPSERSEQLAESFKDA 192 (230)
T ss_pred ccCC-CCCeeEEecccceeecchHHHHHHHhcCCC
Confidence 4456 799999999999999999999999987665
No 90
>TIGR03695 menH_SHCHC 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase. This protein catalyzes the formation of SHCHC, or (1 R,6 R)-2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate, by elmination of pyruvate from 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate (SEPHCHC). Note that SHCHC synthase activity previously was attributed to MenD, which in fact is SEPHCHC synthase.
Probab=90.01 E-value=0.68 Score=28.11 Aligned_cols=29 Identities=17% Similarity=0.259 Sum_probs=20.2
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLA 63 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~ 63 (77)
..+... ..|+++++|..|..++ .....+.
T Consensus 188 ~~~~~~-~~P~l~i~g~~D~~~~-~~~~~~~ 216 (251)
T TIGR03695 188 PKLQAL-TIPVLYLCGEKDEKFV-QIAKEMQ 216 (251)
T ss_pred HHhhCC-CCceEEEeeCcchHHH-HHHHHHH
Confidence 345567 6999999999998764 3333333
No 91
>PF10605 3HBOH: 3HB-oligomer hydrolase (3HBOH) ; InterPro: IPR016582 This entry represents a group of predicted D-(-)-3-hydroxybutyrate oligomer hydrolases (also known as 3HB-oligomer hydrolase), which function in the degradation of poly-3-hydroxybutyrate (PHB). These enzymes catalyse the hydrolysis of D(-)-3-hydroxybutyrate oligomers (3HB-oligomers) into 3HB-monomers [, ].; GO: 0047989 hydroxybutyrate-dimer hydrolase activity, 0019605 butyrate metabolic process, 0005615 extracellular space
Probab=89.65 E-value=0.19 Score=37.48 Aligned_cols=32 Identities=19% Similarity=0.266 Sum_probs=26.9
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
++..+|++|+||..|..+|+.++-+-|-+|.+
T Consensus 552 ~L~GKPaIiVhGR~DaLlPvnh~Sr~Y~~ln~ 583 (690)
T PF10605_consen 552 NLHGKPAIIVHGRSDALLPVNHTSRPYLGLNR 583 (690)
T ss_pred CcCCCceEEEecccceecccCCCchHHHHHhh
Confidence 34468999999999999999998887777764
No 92
>PF05448 AXE1: Acetyl xylan esterase (AXE1); InterPro: IPR008391 This family consists of several bacterial acetyl xylan esterase proteins. Acetyl xylan esterases are enzymes that hydrolyse the ester linkages of the acetyl groups in position 2 and/or 3 of the xylose moieties of natural acetylated xylan from hardwood. These enzymes are one of the accessory enzymes which are part of the xylanolytic system, together with xylanases, beta-xylosidases, alpha-arabinofuranosidases and methylglucuronidases; these are all required for the complete hydrolysis of xylan [].; PDB: 1VLQ_H 3M81_E 3M82_D 3M83_C 3FCY_A 1ODS_F 1ODT_C 1L7A_A 3FYT_A 2XLB_F ....
Probab=88.77 E-value=0.48 Score=32.30 Aligned_cols=41 Identities=12% Similarity=0.103 Sum_probs=29.9
Q ss_pred cccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 27 NWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 27 ~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
.-.+..+.+..+ +.|+++.-|..|++|||+-....++++..
T Consensus 250 ~Y~D~~nfA~ri-~~pvl~~~gl~D~~cPP~t~fA~yN~i~~ 290 (320)
T PF05448_consen 250 SYFDAVNFARRI-KCPVLFSVGLQDPVCPPSTQFAAYNAIPG 290 (320)
T ss_dssp HTT-HHHHGGG---SEEEEEEETT-SSS-HHHHHHHHCC--S
T ss_pred hhhhHHHHHHHc-CCCEEEEEecCCCCCCchhHHHHHhccCC
Confidence 345778888889 79999999999999999999999988754
No 93
>KOG2984|consensus
Probab=88.54 E-value=0.35 Score=31.91 Aligned_cols=26 Identities=19% Similarity=0.213 Sum_probs=20.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMM 61 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~ 61 (77)
+.+. +.|+||+||..|+.|+-.+.--
T Consensus 212 lp~v-kcPtli~hG~kDp~~~~~hv~f 237 (277)
T KOG2984|consen 212 LPQV-KCPTLIMHGGKDPFCGDPHVCF 237 (277)
T ss_pred cccc-cCCeeEeeCCcCCCCCCCCccc
Confidence 4556 7999999999999998766543
No 94
>PLN02894 hydrolase, alpha/beta fold family protein
Probab=88.46 E-value=1.2 Score=30.99 Aligned_cols=30 Identities=10% Similarity=0.045 Sum_probs=21.9
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
.+..+ +.|+++++|..|..++ ....++.+.
T Consensus 320 ~l~~I-~vP~liI~G~~D~i~~-~~~~~~~~~ 349 (402)
T PLN02894 320 SASEW-KVPTTFIYGRHDWMNY-EGAVEARKR 349 (402)
T ss_pred hcccC-CCCEEEEEeCCCCCCc-HHHHHHHHH
Confidence 35667 6999999999998665 555555544
No 95
>COG2021 MET2 Homoserine acetyltransferase [Amino acid transport and metabolism]
Probab=88.16 E-value=1.3 Score=31.14 Aligned_cols=42 Identities=26% Similarity=0.193 Sum_probs=38.0
Q ss_pred cChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 29 TDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 29 ~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
-+...-+.++ +.|+|++-=+.|...|+++..+++++|..++.
T Consensus 296 ~~l~~al~~i-~~~~lv~gi~sD~lfp~~~~~~~~~~L~~~~~ 337 (368)
T COG2021 296 GDLTAALARI-KAPVLVVGITSDWLFPPELQRALAEALPAAGA 337 (368)
T ss_pred CcHHHHHhcC-ccCEEEEEecccccCCHHHHHHHHHhccccCc
Confidence 5666678888 69999999999999999999999999999886
No 96
>PLN02606 palmitoyl-protein thioesterase
Probab=88.09 E-value=0.98 Score=30.96 Aligned_cols=36 Identities=17% Similarity=0.087 Sum_probs=25.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHh-cCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEA-ADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~-~g~~~~~~ 76 (77)
..|++||||..|..... ....+.+.+.+ .|.+...+
T Consensus 26 ~~PvViwHGlgD~~~~~-~~~~~~~~i~~~~~~pg~~v 62 (306)
T PLN02606 26 SVPFVLFHGFGGECSNG-KVSNLTQFLINHSGYPGTCV 62 (306)
T ss_pred CCCEEEECCCCcccCCc-hHHHHHHHHHhCCCCCeEEE
Confidence 58999999999998875 44555555542 36665543
No 97
>PF05728 UPF0227: Uncharacterised protein family (UPF0227); InterPro: IPR008886 Despite being classed as uncharacterised proteins, the members of this family are almost certainly enzymes in that they contain a domain distantly related to IPR000073 from INTERPRO. One of the members of this family YqiA has been shown to be a esterase []. Other members, which include the Escherichia coli (strain K12) YcfP protein are uncharacterised.
Probab=87.40 E-value=0.79 Score=29.00 Aligned_cols=26 Identities=15% Similarity=0.427 Sum_probs=22.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
..++++++|++|.+++++.+...++.
T Consensus 134 ~~~~lvll~~~DEvLd~~~a~~~~~~ 159 (187)
T PF05728_consen 134 PERYLVLLQTGDEVLDYREAVAKYRG 159 (187)
T ss_pred CccEEEEEecCCcccCHHHHHHHhcC
Confidence 36999999999999999888777654
No 98
>TIGR01839 PHA_synth_II poly(R)-hydroxyalkanoic acid synthase, class II. This model represents the class II subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs, typically with six to fourteen carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=87.13 E-value=0.89 Score=33.55 Aligned_cols=35 Identities=14% Similarity=0.083 Sum_probs=30.2
Q ss_pred ChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 30 DVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 30 sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
-++ .++++ +.|+|.+-|..|.+||++.+..+.+.+
T Consensus 433 ~~i-dL~~I-~~Pvl~va~~~DHIvPw~s~~~~~~l~ 467 (560)
T TIGR01839 433 TPI-DLKKV-KCDSFSVAGTNDHITPWDAVYRSALLL 467 (560)
T ss_pred EEe-chhcC-CCCeEEEecCcCCcCCHHHHHHHHHHc
Confidence 344 57788 799999999999999999999988765
No 99
>KOG2237|consensus
Probab=86.80 E-value=0.71 Score=34.69 Aligned_cols=44 Identities=18% Similarity=0.150 Sum_probs=34.2
Q ss_pred ccccChhHHhccCC-CCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 26 YNWTDVTRKVGNFV-NKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 26 y~~~sp~~~~~~~~-~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
|...||...+.... -|.+|+.-+..|++|++-++..++++|+..
T Consensus 617 i~~y~pv~~i~~q~~YPS~lvtta~hD~RV~~~~~~K~vAklre~ 661 (712)
T KOG2237|consen 617 ISPYSPVDNIKKQVQYPSMLVTTADHDDRVGPLESLKWVAKLREA 661 (712)
T ss_pred ecccCccCCCchhccCcceEEeeccCCCcccccchHHHHHHHHHH
Confidence 44445555444432 478999999999999999999999999876
No 100
>PRK10439 enterobactin/ferric enterobactin esterase; Provisional
Probab=86.46 E-value=1.4 Score=31.04 Aligned_cols=36 Identities=14% Similarity=0.178 Sum_probs=29.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
..++|-+|+.|..+ +....+|++.|+++|.++++.+
T Consensus 350 lr~~i~~G~~E~~~-~~~~~~l~~~L~~~G~~~~~~~ 385 (411)
T PRK10439 350 LRIVLEAGRREPMI-MRANQALYAQLHPAGHSVFWRQ 385 (411)
T ss_pred ceEEEeCCCCCchH-HHHHHHHHHHHHHCCCcEEEEE
Confidence 47899999998654 5678999999999999988753
No 101
>COG3571 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=84.37 E-value=0.8 Score=29.23 Aligned_cols=26 Identities=15% Similarity=0.249 Sum_probs=21.1
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHH
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQS 59 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s 59 (77)
.|+..+ ++|++|.||+.|..=.-.+.
T Consensus 136 ~HL~gl-~tPtli~qGtrD~fGtr~~V 161 (213)
T COG3571 136 EHLTGL-KTPTLITQGTRDEFGTRDEV 161 (213)
T ss_pred hhccCC-CCCeEEeecccccccCHHHH
Confidence 478889 79999999999987665544
No 102
>TIGR01849 PHB_depoly_PhaZ polyhydroxyalkanoate depolymerase, intracellular. This model represents an intracellular depolymerase for polyhydroxyalkanoate (PHA), a carbon and energy storing polyester that accumulates in granules in many bacterial species when carbon sources are abundant but other nutrients are limiting. This family is named for PHAs generally, rather than polyhydroxybutyrate (PHB) specificially as in Ralstonia eutropha H16, to avoid overcalling chemical specificity in other species. Note that this family lacks the classic GXSXG lipase motif and instead shows weak similarity to some
Probab=84.14 E-value=1.4 Score=31.30 Aligned_cols=32 Identities=19% Similarity=0.215 Sum_probs=27.0
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
++.+.+.|+|.+-|+.|.+||+.|+..+.+..
T Consensus 333 l~~I~~~pll~V~ge~D~I~p~~qt~aa~~l~ 364 (406)
T TIGR01849 333 PGAITRVALLTVEGENDDISGLGQTKAALRLC 364 (406)
T ss_pred HHHCcccceEEEeccCCCcCCHHHhHHHHHHh
Confidence 55672399999999999999999999977753
No 103
>COG3545 Predicted esterase of the alpha/beta hydrolase fold [General function prediction only]
Probab=83.78 E-value=3 Score=26.61 Aligned_cols=41 Identities=17% Similarity=0.071 Sum_probs=31.4
Q ss_pred cccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 25 GYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 25 ~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
.....+|+. ..++ --|.++++..+|..|+++++..++++.-
T Consensus 104 ~~~tf~~~p-~~~l-pfps~vvaSrnDp~~~~~~a~~~a~~wg 144 (181)
T COG3545 104 HLMTFDPIP-REPL-PFPSVVVASRNDPYVSYEHAEDLANAWG 144 (181)
T ss_pred hccccCCCc-cccC-CCceeEEEecCCCCCCHHHHHHHHHhcc
Confidence 334455554 4445 4799999999999999999999998753
No 104
>KOG3043|consensus
Probab=83.02 E-value=3.2 Score=27.58 Aligned_cols=37 Identities=14% Similarity=0.263 Sum_probs=31.7
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
.+... +.|++++-|+.|..+|++...++-+.|+++-.
T Consensus 159 D~~~v-k~Pilfl~ae~D~~~p~~~v~~~ee~lk~~~~ 195 (242)
T KOG3043|consen 159 DIANV-KAPILFLFAELDEDVPPKDVKAWEEKLKENPA 195 (242)
T ss_pred HHhcC-CCCEEEEeecccccCCHHHHHHHHHHHhcCcc
Confidence 46667 69999999999999999999998888887644
No 105
>PRK11126 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase; Provisional
Probab=82.94 E-value=1.4 Score=27.56 Aligned_cols=20 Identities=15% Similarity=0.333 Sum_probs=16.5
Q ss_pred HhccCCCCcEEEEEeCCCCcc
Q psy7765 34 KVGNFVNKTFLLMHGTADDNV 54 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~V 54 (77)
.+.++ +.|++++||..|..+
T Consensus 183 ~l~~i-~~P~lii~G~~D~~~ 202 (242)
T PRK11126 183 ALQAL-TFPFYYLCGERDSKF 202 (242)
T ss_pred Hhhcc-CCCeEEEEeCCcchH
Confidence 45567 699999999999855
No 106
>PLN02633 palmitoyl protein thioesterase family protein
Probab=81.98 E-value=2.7 Score=28.98 Aligned_cols=33 Identities=12% Similarity=0.165 Sum_probs=22.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc-CCCc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA-DIFF 73 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~-g~~~ 73 (77)
..|+.||||..|....+ --..+.+.+.+. |.++
T Consensus 25 ~~P~ViwHG~GD~c~~~-g~~~~~~l~~~~~g~~~ 58 (314)
T PLN02633 25 SVPFIMLHGIGTQCSDA-TNANFTQLLTNLSGSPG 58 (314)
T ss_pred CCCeEEecCCCcccCCc-hHHHHHHHHHhCCCCce
Confidence 58999999999999875 333444444332 4444
No 107
>PF02089 Palm_thioest: Palmitoyl protein thioesterase; InterPro: IPR002472 Neuronal ceroid lipofuscinoses (NCL) represent a group of encephalopathies that occur in 1 in 12,500 children. Mutations in the palmitoyl protein thioesterase gene causing infantile neuronal ceroid lipofuscinosis []. The most common mutation results in intracellular accumulation of the polypeptide and undetectable enzyme activity in the brain. Direct sequencing of cDNAs derived from brain RNA of INCL patients has shown a mis-sense transversion of A to T at nucleotide position 364, which results in substitution of Trp for Arg at position 122 in the protein - Arg 122 is immediately adjacent to a lipase consensus sequence that contains the putative active site Ser of PPT. The occurrence of this and two other independent mutations in the PPT gene strongly suggests that defects in this gene cause INCL.; GO: 0008474 palmitoyl-(protein) hydrolase activity, 0006464 protein modification process; PDB: 3GRO_B 1PJA_A 1EXW_A 1EH5_A 1EI9_A.
Probab=81.65 E-value=1.3 Score=29.93 Aligned_cols=26 Identities=15% Similarity=0.208 Sum_probs=11.8
Q ss_pred CCcEEEEEeCCCCccC---HHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVH---YQQSMMLAKA 65 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp---~~~s~~l~~a 65 (77)
..|+.||||..|.... +....++.+.
T Consensus 5 ~~PvViwHGmGD~~~~~~~m~~i~~~i~~ 33 (279)
T PF02089_consen 5 PLPVVIWHGMGDSCCNPSSMGSIKELIEE 33 (279)
T ss_dssp S--EEEE--TT--S--TTTHHHHHHHHHH
T ss_pred CCcEEEEEcCccccCChhHHHHHHHHHHH
Confidence 5899999999999864 3444444443
No 108
>KOG1515|consensus
Probab=81.64 E-value=2.6 Score=29.14 Aligned_cols=34 Identities=15% Similarity=0.052 Sum_probs=30.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
-+|+|++-+..|... .+...++++|++.|+++++
T Consensus 268 lp~tlv~~ag~D~L~--D~~~~Y~~~Lkk~Gv~v~~ 301 (336)
T KOG1515|consen 268 LPPTLVVVAGYDVLR--DEGLAYAEKLKKAGVEVTL 301 (336)
T ss_pred CCceEEEEeCchhhh--hhhHHHHHHHHHcCCeEEE
Confidence 578999999998887 6999999999999999985
No 109
>COG2945 Predicted hydrolase of the alpha/beta superfamily [General function prediction only]
Probab=81.15 E-value=1.6 Score=28.40 Aligned_cols=30 Identities=20% Similarity=0.272 Sum_probs=23.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
+.+. -.|.+++||+.|++|.+....+.++.
T Consensus 145 l~P~-P~~~lvi~g~~Ddvv~l~~~l~~~~~ 174 (210)
T COG2945 145 LAPC-PSPGLVIQGDADDVVDLVAVLKWQES 174 (210)
T ss_pred ccCC-CCCceeEecChhhhhcHHHHHHhhcC
Confidence 4444 47999999999999998888777654
No 110
>PF07755 DUF1611: Protein of unknown function (DUF1611); InterPro: IPR011669 This entry contains a number of hypothetical bacterial and archaeal proteins. The region is approximately 350 residues long. A member of this family (Q6M063 from SWISSPROT) is thought to associate with another subunit to form an H+-transporting ATPase, but no evidence has been found to support this.; PDB: 2G0T_A 2OBN_A.
Probab=80.90 E-value=5.1 Score=27.44 Aligned_cols=39 Identities=18% Similarity=0.058 Sum_probs=32.4
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.. +.|.++.-||....=-+.-+..+.++|+++|+++.|+
T Consensus 109 ~~-~~~rv~~vGTDcavGK~tTal~L~~~l~~~G~~a~fv 147 (301)
T PF07755_consen 109 EV-KAKRVLTVGTDCAVGKMTTALELRRALRERGINAGFV 147 (301)
T ss_dssp G--SSEEEEEEESSSSSSHHHHHHHHHHHHHHTT--EEEE
T ss_pred cC-CCCEEEEEccCccccHHHHHHHHHHHHHHcCCCceEE
Confidence 44 6899999999888888999999999999999998875
No 111
>cd05560 Xcc1710_like Xcc1710_like family, specific to proteobacteria. Xcc1710 is a hypothetical protein from Xanthomonas campestris pv. campestris str. ATCC 33913, similar to Mth938, a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. Their three-dimensional structures have been determined, but their functions are unknown.
Probab=80.88 E-value=2.9 Score=24.16 Aligned_cols=35 Identities=20% Similarity=0.118 Sum_probs=27.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
.|=+|+-|+......+ ..++.+.|+++|+.+|..+
T Consensus 52 ~peiliiGTG~~~~~~--~~~~~~~l~~~gi~vE~m~ 86 (109)
T cd05560 52 QPEVILLGTGERQRFP--PPALLAPLLARGIGVEVMD 86 (109)
T ss_pred CCCEEEEecCCCCCcC--CHHHHHHHHHcCCeEEEEC
Confidence 4668888888887766 4667788999999988753
No 112
>PLN02980 2-oxoglutarate decarboxylase/ hydro-lyase/ magnesium ion binding / thiamin pyrophosphate binding
Probab=80.24 E-value=2.9 Score=34.40 Aligned_cols=33 Identities=15% Similarity=0.273 Sum_probs=25.9
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
.+.++ +.|+|+++|.+|..++ ..+.++.+.+.+
T Consensus 1563 ~L~~I-~~PtLlI~Ge~D~~~~-~~a~~~~~~i~~ 1595 (1655)
T PLN02980 1563 DLKQC-DTPLLLVVGEKDVKFK-QIAQKMYREIGK 1595 (1655)
T ss_pred HHhhC-CCCEEEEEECCCCccH-HHHHHHHHHccc
Confidence 46677 6999999999999875 677777776654
No 113
>PF14714 KH_dom-like: KH-domain-like of EngA bacterial GTPase enzymes, C-terminal; PDB: 2HJG_A 1MKY_A.
Probab=80.12 E-value=5.5 Score=21.83 Aligned_cols=36 Identities=8% Similarity=0.031 Sum_probs=25.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc----CCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA----DIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~----g~~~~~ 75 (77)
.+|+|++++.....++.+-..-|.+.|++. |+++.+
T Consensus 38 ~PPtFv~f~N~~~~~~~sY~ryL~n~lRe~f~f~G~Pi~l 77 (80)
T PF14714_consen 38 RPPTFVLFVNDPELLPESYKRYLENQLREAFGFEGVPIRL 77 (80)
T ss_dssp TTTEEEEEES-CCC--HHHHHHHHHHHHHHH--TTS--EE
T ss_pred CCCEEEEEeCCcccCCHHHHHHHHHHHHHHCCCCceeEEE
Confidence 699999999999999999988888888874 555443
No 114
>COG3458 Acetyl esterase (deacetylase) [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=78.80 E-value=2.1 Score=29.32 Aligned_cols=39 Identities=15% Similarity=0.169 Sum_probs=31.8
Q ss_pred ChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 30 DVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 30 sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
+..+.+..+ +.|+|+.-|.-|+++||+-....++++-..
T Consensus 250 D~~n~A~Ri-K~pvL~svgL~D~vcpPstqFA~yN~l~~~ 288 (321)
T COG3458 250 DIVNLAARI-KVPVLMSVGLMDPVCPPSTQFAAYNALTTS 288 (321)
T ss_pred hhhhHHHhh-ccceEEeecccCCCCCChhhHHHhhcccCC
Confidence 344556677 799999999999999999888888887543
No 115
>COG0596 MhpC Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=78.16 E-value=4 Score=24.34 Aligned_cols=30 Identities=27% Similarity=0.334 Sum_probs=21.3
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
.... ..|+++++|..|..+|......+.+.
T Consensus 217 ~~~~-~~P~l~i~g~~d~~~~~~~~~~~~~~ 246 (282)
T COG0596 217 LARI-TVPTLIIHGEDDPVVPAELARRLAAA 246 (282)
T ss_pred hccC-CCCeEEEecCCCCcCCHHHHHHHHhh
Confidence 4445 69999999999977776664444443
No 116
>KOG2624|consensus
Probab=76.93 E-value=3.1 Score=29.51 Aligned_cols=49 Identities=18% Similarity=0.119 Sum_probs=38.9
Q ss_pred Cccccccc-ChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 22 NEVGYNWT-DVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 22 ~~~~y~~~-sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
+...|.+. .|...+++. +.|+.|+.|..|..+.++....+...+....+
T Consensus 314 N~~~Y~q~~pP~Y~l~~i-~~P~~l~~g~~D~l~~~~DV~~~~~~~~~~~~ 363 (403)
T KOG2624|consen 314 NLKHYGQSTPPEYDLTNI-KVPTALYYGDNDWLADPEDVLILLLVLPNSVI 363 (403)
T ss_pred cHhhcCCCCCCCCCcccc-ccCEEEEecCCcccCCHHHHHHHHHhcccccc
Confidence 45556554 455567777 79999999999999999999988888776655
No 117
>TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily.
Probab=75.68 E-value=8.3 Score=25.20 Aligned_cols=36 Identities=22% Similarity=0.287 Sum_probs=26.8
Q ss_pred CCcEEEEEeCCCCcc-CHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNV-HYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~V-p~~~s~~l~~al~~~g~~~~~ 75 (77)
.++++++||..+..+ .......+++.|.++|..+-.
T Consensus 26 ~~~vv~i~gg~~~~~g~~~~~~~la~~l~~~G~~v~~ 62 (274)
T TIGR03100 26 TTGVLIVVGGPQYRVGSHRQFVLLARRLAEAGFPVLR 62 (274)
T ss_pred CCeEEEEeCCccccCCchhHHHHHHHHHHHCCCEEEE
Confidence 468999998887766 345567789999988876543
No 118
>KOG2382|consensus
Probab=75.04 E-value=3.9 Score=28.18 Aligned_cols=32 Identities=16% Similarity=0.265 Sum_probs=26.6
Q ss_pred ccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 36 GNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 36 ~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
... ..|+++++|+++..||..+-.++.+.+-.
T Consensus 250 ~~~-~~pvlfi~g~~S~fv~~~~~~~~~~~fp~ 281 (315)
T KOG2382|consen 250 GPY-TGPVLFIKGLQSKFVPDEHYPRMEKIFPN 281 (315)
T ss_pred ccc-ccceeEEecCCCCCcChhHHHHHHHhccc
Confidence 455 79999999999999999988887776543
No 119
>PF02244 Propep_M14: Carboxypeptidase activation peptide; InterPro: IPR003146 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The peptidases are synthesised as inactive molecules, zymogens, with propeptides that must be removed by proteolytic cleavage to activate the enzyme. Structural studies of carboxypeptidases A and B reveal the propeptide to exist as a globular domain, followed by an extended alpha-helix; this shields the catalytic site, without specifically binding to it, while the substrate-binding site is blocked by making specific contacts [, ]. Members of this propeptide family are found in the metallocarboxypeptidases: A1, A2 [], A3, A4, A5, A6, U, insect gut carboxypeptidase and B [], and and are associated with peptidases belonging to MEROPS peptidase family M14A. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase.; GO: 0004180 carboxypeptidase activity, 0006508 proteolysis; PDB: 3D68_C 3D67_B 3D66_A 1PBA_A 3GLJ_A 1NSA_A 1KWM_A 2BOA_B 1PYT_A 1JQG_A ....
Probab=74.01 E-value=1.4 Score=23.20 Aligned_cols=27 Identities=15% Similarity=0.026 Sum_probs=23.0
Q ss_pred CCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 50 ADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 50 ~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.|-.||+.+...|.+-|++.|+.++..
T Consensus 36 ~dv~V~p~~~~~f~~~L~~~~i~~~v~ 62 (74)
T PF02244_consen 36 VDVMVPPEKLEEFEELLKEHGIEYEVL 62 (74)
T ss_dssp EEEEEEGGGHHHHHHHHHHTT-EEEEE
T ss_pred EEEEECHHHHHHHHHHHHHCCCcEEEE
Confidence 577899999999999999999988753
No 120
>PF08357 SEFIR: SEFIR domain; InterPro: IPR013568 This domain is found in IL17 receptors (IL17Rs, e.g. Q60943 from SWISSPROT) and SEF proteins (e.g. Q8QHJ9 from SWISSPROT). The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (IPR000157 from INTERPRO), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways [].
Probab=73.52 E-value=7.3 Score=23.07 Aligned_cols=34 Identities=18% Similarity=0.188 Sum_probs=28.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhc-CCCcee
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAA-DIFFYQ 75 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~-g~~~~~ 75 (77)
+++||++.- |..-+.+-...|++.|++. |+++.+
T Consensus 1 ~kVfI~Ys~-d~~~h~~~V~~la~~L~~~~g~~V~l 35 (150)
T PF08357_consen 1 RKVFISYSH-DSEEHKEWVLALAEFLRQNCGIDVIL 35 (150)
T ss_pred CeEEEEeCC-CCHHHHHHHHHHHHHHHhccCCceee
Confidence 468888885 8888888899999999999 988753
No 121
>PF11339 DUF3141: Protein of unknown function (DUF3141); InterPro: IPR024501 This family of proteins appears to be predominantly expressed in Proteobacteria. Their function is unknown.
Probab=73.33 E-value=3.6 Score=30.48 Aligned_cols=29 Identities=14% Similarity=0.203 Sum_probs=25.1
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHH
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLA 63 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~ 63 (77)
.+.++ ..|++++-|..|.+.|++|+....
T Consensus 292 DLr~I-r~Piivfas~gDnITPP~QaL~WI 320 (581)
T PF11339_consen 292 DLRNI-RSPIIVFASYGDNITPPQQALNWI 320 (581)
T ss_pred ehhhC-CCCEEEEeccCCCCCChhHhccch
Confidence 57788 799999999999999999996543
No 122
>cd00248 Mth938-like Mth938-like domain. The members of this family include: Mth938, 2P1, Xcr35, Rpa2829, and several uncharacterized sequences. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer. 2P1 is a partially characterized nuclear protein which is homologous to E3-3 from rat and known to be alternately spliced. Xcr35 and Rpa2829 are hypothetical proteins of unknown function from the Xanthomonas campestris and Rhodopseudomonas palustris genomes, respectively, for which the crystal structures have been determined.
Probab=71.97 E-value=7.2 Score=22.45 Aligned_cols=35 Identities=26% Similarity=0.204 Sum_probs=26.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
+|=+|+-|+......+. .++.+.|+++|+.+|..+
T Consensus 52 ~peiliiGTG~~~~~~~--~~~~~~l~~~gI~vE~m~ 86 (109)
T cd00248 52 RPDILLIGTGAEIAFLP--RALRAALRAAGIGVEVMS 86 (109)
T ss_pred CCCEEEEcCCCCCCcCC--HHHHHHHHHcCCeEEEeC
Confidence 47788889888775543 446788999999988753
No 123
>PF00450 Peptidase_S10: Serine carboxypeptidase; InterPro: IPR001563 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to MEROPS peptidase family S10 (clan SC). The type example is carboxypeptidase Y from Saccharomyces cerevisiae (Baker's yeast) []. All known carboxypeptidases are either metallo carboxypeptidases or serine carboxypeptidases (3.4.16.5 from EC and 3.4.16.6 from EC). The catalytic activity of the serine carboxypeptidases, like that of the trypsin family serine proteases, is provided by a charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which is itself hydrogen-bonded to a serine []. The sequences surrounding the active site serine and histidine residues are highly conserved in all the serine carboxypeptidases.; GO: 0004185 serine-type carboxypeptidase activity, 0006508 proteolysis; PDB: 1AC5_A 1WHS_B 3SC2_B 1WHT_A 1BCR_A 1BCS_A 1GXS_A 1IVY_A 1WPX_A 1YSC_A ....
Probab=71.76 E-value=4.5 Score=27.62 Aligned_cols=28 Identities=14% Similarity=0.173 Sum_probs=22.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
+.++||.+|..|.+||...++...+.|.
T Consensus 330 ~irVLiy~Gd~D~i~n~~Gt~~~i~~L~ 357 (415)
T PF00450_consen 330 GIRVLIYNGDLDLICNFLGTERWIDNLN 357 (415)
T ss_dssp T-EEEEEEETT-SSS-HHHHHHHHHCTE
T ss_pred cceeEEeccCCCEEEEeccchhhhhccc
Confidence 4899999999999999999999887754
No 124
>PF10142 PhoPQ_related: PhoPQ-activated pathogenicity-related protein; InterPro: IPR009199 Proteins in this entry are believed to play a role in virulence/pathogenicity in Salmonella. Salmonella typhi PqaA has been shown to be activated by PhoP/Q two-component regulatory system, which regulates many virulence genes []. It has been also shown to confer resistance to antimicrobial peptides (melittin) []. Members of this family are predicted to belong to the alpha/beta hydrolase domain superfamily.
Probab=70.12 E-value=2.6 Score=29.57 Aligned_cols=45 Identities=11% Similarity=0.034 Sum_probs=39.8
Q ss_pred cccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 23 EVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 23 ~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
.+.-+..+|+.+.+++ +.|-+++-|+.|+--.+..+.-++..|..
T Consensus 246 ~~L~~ivDP~~Y~~rL-~~PK~ii~atgDeFf~pD~~~~y~d~L~G 290 (367)
T PF10142_consen 246 DKLMQIVDPYSYRDRL-TMPKYIINATGDEFFVPDSSNFYYDKLPG 290 (367)
T ss_pred HHHHHhcCHHHHHHhc-CccEEEEecCCCceeccCchHHHHhhCCC
Confidence 4555778999999999 89999999999999999999999988764
No 125
>PF13676 TIR_2: TIR domain; PDB: 3H16_B 3UB4_A 2Y92_A 3UB3_A 3UB2_A.
Probab=69.89 E-value=5.7 Score=21.80 Aligned_cols=28 Identities=25% Similarity=0.329 Sum_probs=21.1
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
+||.|...|.. .+..+.+.|++.|..+-
T Consensus 1 VFIS~~~~D~~----~a~~l~~~L~~~g~~v~ 28 (102)
T PF13676_consen 1 VFISYSSEDRE----FAERLAERLESAGIRVF 28 (102)
T ss_dssp EEEEEEGGGCC----CHHHHHHHHHHTT--EE
T ss_pred eEEEecCCcHH----HHHHHHHHHhhcCCEEE
Confidence 58899988844 68889999999988654
No 126
>PF06500 DUF1100: Alpha/beta hydrolase of unknown function (DUF1100); InterPro: IPR010520 Proteins in this entry display esterase activity toward pNP-butyrate []. This entry also includes 2,6-dihydropseudooxynicotine hydrolase which has a role in nicotine catabolism by cleaving a C-C bond in 2,6-dihydroxypseudooxyicotine [, ].; PDB: 3OUR_A 3MVE_B 2JBW_C.
Probab=69.22 E-value=8.3 Score=27.57 Aligned_cols=26 Identities=19% Similarity=0.340 Sum_probs=19.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKA 65 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~a 65 (77)
.+|+|.+-|..|+.+|.+.+.-++..
T Consensus 352 ~~plL~i~~~~D~v~P~eD~~lia~~ 377 (411)
T PF06500_consen 352 PTPLLAINGEDDPVSPIEDSRLIAES 377 (411)
T ss_dssp SS-EEEEEETT-SSS-HHHHHHHHHT
T ss_pred CcceEEeecCCCCCCCHHHHHHHHhc
Confidence 58999999999999999998877653
No 127
>TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily.
Probab=69.10 E-value=6.8 Score=25.59 Aligned_cols=21 Identities=14% Similarity=0.241 Sum_probs=17.2
Q ss_pred HhccCCCCcEEEEEeCCCCccC
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVH 55 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp 55 (77)
.+... ..|++|++|+.|..++
T Consensus 202 ~l~~~-~~P~ll~~g~~D~~~~ 222 (274)
T TIGR03100 202 GLERF-QGPVLFILSGNDLTAQ 222 (274)
T ss_pred HHHhc-CCcEEEEEcCcchhHH
Confidence 44566 6999999999998864
No 128
>cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III).
Probab=68.74 E-value=12 Score=20.36 Aligned_cols=27 Identities=15% Similarity=0.180 Sum_probs=13.1
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
+++.||+.+..--.....+++++|+++
T Consensus 3 llv~HGs~~~s~~~~~~~~~~~~l~~~ 29 (101)
T cd03409 3 LVVGHGSPYKDPYKKDIEAQAHNLAES 29 (101)
T ss_pred EEEECCCCCCccHHHHHHHHHHHHHHH
Confidence 345566655432223444455555443
No 129
>PLN02213 sinapoylglucose-malate O-sinapoyltransferase/ carboxypeptidase
Probab=68.09 E-value=5.5 Score=26.99 Aligned_cols=31 Identities=16% Similarity=0.205 Sum_probs=27.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
..++||..|..|.+||.--++.+.++|.=.+
T Consensus 233 ~i~VliY~Gd~D~icn~~g~~~wi~~L~w~~ 263 (319)
T PLN02213 233 GYRSLIYSGDHDIAVPFLATQAWIRSLNYSP 263 (319)
T ss_pred CceEEEEECCcCeeCCcHhHHHHHHhcCCCC
Confidence 4799999999999999999999999986433
No 130
>KOG2541|consensus
Probab=67.65 E-value=12 Score=25.68 Aligned_cols=30 Identities=17% Similarity=0.186 Sum_probs=22.3
Q ss_pred hccCCCCcEEEEEeCCCCccC--HHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVH--YQQSMMLAKA 65 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp--~~~s~~l~~a 65 (77)
++.. ..|+.++||..|.... +.+-.++.+.
T Consensus 19 ~s~s-~~P~ii~HGigd~c~~~~~~~~~q~l~~ 50 (296)
T KOG2541|consen 19 VSPS-PVPVIVWHGIGDSCSSLSMANLTQLLEE 50 (296)
T ss_pred cCcc-cCCEEEEeccCcccccchHHHHHHHHHh
Confidence 3343 3899999999999998 6666665554
No 131
>PF02273 Acyl_transf_2: Acyl transferase; InterPro: IPR003157 LuxD proteins are bacterial acyl transferases. Together with an acyl-protein synthetase (LuxE) and reductase (LuxC), they form a multienzyme complex. This complex channels activated fatty acids into the aldehyde substrate for the luciferase-catalyzed bacterial bioluminescence reaction [, ]. ; GO: 0016746 transferase activity, transferring acyl groups, 0006631 fatty acid metabolic process; PDB: 1THT_B.
Probab=66.54 E-value=8.1 Score=26.25 Aligned_cols=37 Identities=14% Similarity=0.216 Sum_probs=26.4
Q ss_pred cChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 29 TDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 29 ~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
.|.++.++.+ +.|++.+++.+|.=|......++...+
T Consensus 185 ~ST~~~~k~l-~iP~iaF~A~~D~WV~q~eV~~~~~~~ 221 (294)
T PF02273_consen 185 DSTINDMKRL-SIPFIAFTANDDDWVKQSEVEELLDNI 221 (294)
T ss_dssp HHHHHHHTT---S-EEEEEETT-TTS-HHHHHHHHTT-
T ss_pred hhHHHHHhhC-CCCEEEEEeCCCccccHHHHHHHHHhc
Confidence 4677788899 799999999999999998888876644
No 132
>cd03413 CbiK_C Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), C-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases, and is a homomeric enzyme that does not require ATP for its enzymatic activity.
Probab=65.15 E-value=15 Score=20.92 Aligned_cols=25 Identities=20% Similarity=0.226 Sum_probs=13.8
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
+|+-||+.|.. .+.-..|.+.+++.
T Consensus 4 llvgHGSr~~~--~~~~~~l~~~l~~~ 28 (103)
T cd03413 4 VFMGHGTDHPS--NAVYAALEYVLREE 28 (103)
T ss_pred EEEECCCCchh--hhHHHHHHHHHHhc
Confidence 34557777664 24445556665554
No 133
>TIGR00976 /NonD putative hydrolase, CocE/NonD family. This model represents a protein subfamily that includes the cocaine esterase CocE, several glutaryl-7-ACA acylases, and the putative diester hydrolase NonD of Streptomyces griseus (all hydrolases). This family shows extensive, low-level similarity to a family of xaa-pro dipeptidyl-peptidases, and local similarity by PSI-BLAST to many other hydrolases.
Probab=63.22 E-value=17 Score=26.35 Aligned_cols=40 Identities=23% Similarity=0.234 Sum_probs=30.4
Q ss_pred ccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 28 WTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 28 ~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
..+....+.++ +.|+|++.|=.|. ....+.+.++++.+.+
T Consensus 221 ~~~~~~~~~~i-~vP~l~~~gw~D~--~~~g~~~~~~~~~~~~ 260 (550)
T TIGR00976 221 SISLWRDLGGS-DVPTLVTGGWYDN--HSRGSIRLFLAVHRGG 260 (550)
T ss_pred cCchhhHhcCC-CCCEEEeCcccCC--CCchHHHHHHHHhhcC
Confidence 45556667788 7999999999993 5567788888877654
No 134
>PTZ00472 serine carboxypeptidase (CBP1); Provisional
Probab=62.85 E-value=8.8 Score=27.51 Aligned_cols=30 Identities=13% Similarity=0.117 Sum_probs=26.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
..++||..|..|.+||+.-++++.++|.=.
T Consensus 364 gikVLiYnGd~D~icn~~Gt~~wi~~L~w~ 393 (462)
T PTZ00472 364 GVRVMIYAGDMDFICNWIGNKAWTLALQWP 393 (462)
T ss_pred CceEEEEECCcCeecCcHhHHHHHHhCCCC
Confidence 479999999999999999999999988633
No 135
>PF12146 Hydrolase_4: Putative lysophospholipase; InterPro: IPR022742 This domain is found in bacteria and eukaryotes and is approximately 110 amino acids in length. Many members are annotated as being lysophospholipases, and others as alpha-beta hydrolase fold-containing proteins.
Probab=61.60 E-value=7.1 Score=21.09 Aligned_cols=34 Identities=15% Similarity=0.242 Sum_probs=23.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
+.-++|+||..+..-- =..+++.|.++|..+-..
T Consensus 16 k~~v~i~HG~~eh~~r---y~~~a~~L~~~G~~V~~~ 49 (79)
T PF12146_consen 16 KAVVVIVHGFGEHSGR---YAHLAEFLAEQGYAVFAY 49 (79)
T ss_pred CEEEEEeCCcHHHHHH---HHHHHHHHHhCCCEEEEE
Confidence 4679999998654443 344788888888876443
No 136
>TIGR00666 PBP4 D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family. In E. coli, this protein is known as penicillin binding protein 4 (dacB). A signal sequence is cleaved from a precursor form. The protein is described as periplasmic in E. coli (Gram-negative) and extracellular in Actinomadura R39 (Gram-positive). Unlike some other proteins with similar activity, it does not form transpeptidation. It is not essential for viability. This family is related to class A beta-lactamases.
Probab=61.40 E-value=19 Score=24.86 Aligned_cols=31 Identities=16% Similarity=0.134 Sum_probs=27.9
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 42 TFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 42 p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
=-|.+.|..|+..-.+.-..|+++|++.|+.
T Consensus 55 G~L~i~G~GDP~L~~~~L~~la~~l~~~Gi~ 85 (345)
T TIGR00666 55 GNLVLRFGGDPTLKRQDIRNLVATLKKSGVK 85 (345)
T ss_pred ccEEEEeecCCCcCHHHHHHHHHHHHHcCCc
Confidence 3688889999999999999999999999985
No 137
>cd05125 Mth938_2P1-like Mth938_2P1-like domain. This model contains sequences that are similar to 2P1, a partially characterized nuclear protein, which is homologous to E3-3 from rat and known to be alternatively spliced. Its function is unknown. This family is part of the Mth938 family, for which structures, but no functional data are available.
Probab=60.00 E-value=18 Score=21.14 Aligned_cols=35 Identities=20% Similarity=0.170 Sum_probs=25.0
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
.|=+|+-|+.....++.. ++.+.|+++|+.++..+
T Consensus 54 ~peiliiGtG~~~~~~~~--~~~~~l~~~gi~vevm~ 88 (114)
T cd05125 54 RPEILVIGTGRKSRPLSP--ELRKYFKKLGIAVEVVD 88 (114)
T ss_pred CCCEEEEccCCCCCcCCH--HHHHHHHHcCCEEEEEC
Confidence 566788888887655443 45688889999888653
No 138
>PF04430 DUF498: Protein of unknown function (DUF498/DUF598); InterPro: IPR007523 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This is entry represents an essential factor for the assembly of mitochondrial NADH:ubiquinone oxidoreductase complex (complex I) []. The crystal structure of this protein revealed a 3-layer beta+alpha/beta/alpha topology [].; PDB: 2K2E_A 2Q4Q_B 2AB1_A 2FVT_A 2CYJ_A 1IHN_B 2GM2_A 3CPK_A 2FI9_A.
Probab=59.92 E-value=4.1 Score=23.41 Aligned_cols=35 Identities=20% Similarity=0.122 Sum_probs=22.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..|=+|+-|+......+ ..++.+.|+++|+.++..
T Consensus 52 p~pe~liiGtG~~~~~~--~~~~~~~l~~~GI~ve~m 86 (110)
T PF04430_consen 52 PKPEVLIIGTGKRQLFL--PPELREYLRKKGIGVEVM 86 (110)
T ss_dssp CS-SEEEEEETTS-SEC--THHHHHHHHTTT-EEEEE
T ss_pred CCCcEEEEccCCccccC--CHHHHHHHHHcCCeEEEE
Confidence 36777777777765443 344677789999988864
No 139
>PF04083 Abhydro_lipase: Partial alpha/beta-hydrolase lipase region; InterPro: IPR006693 The alpha/beta hydrolase fold is common to several hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is similar: an alpha/beta sheet, not barrel, of eight beta-sheets connected by alpha-helices []. This entry represents the N-terminal part of an alpha/beta hydrolase domain found in a number of lipases.; GO: 0006629 lipid metabolic process; PDB: 1K8Q_B 1HLG_B.
Probab=59.72 E-value=6 Score=20.71 Aligned_cols=13 Identities=31% Similarity=0.485 Sum_probs=6.4
Q ss_pred CCcEEEEEeCCCC
Q psy7765 40 NKTFLLMHGTADD 52 (77)
Q Consensus 40 ~~p~ll~hG~~D~ 52 (77)
++|+||.||..+.
T Consensus 43 k~pVll~HGL~~s 55 (63)
T PF04083_consen 43 KPPVLLQHGLLQS 55 (63)
T ss_dssp --EEEEE--TT--
T ss_pred CCcEEEECCcccC
Confidence 6899999998654
No 140
>PLN03016 sinapoylglucose-malate O-sinapoyltransferase
Probab=59.48 E-value=10 Score=27.03 Aligned_cols=31 Identities=16% Similarity=0.205 Sum_probs=27.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
+-++||..|..|.+||+--++.+.++|.=.+
T Consensus 347 ~irVLiY~Gd~D~icn~~Gt~~wi~~L~w~~ 377 (433)
T PLN03016 347 GYRSLIYSGDHDIAVPFLATQAWIRSLNYSP 377 (433)
T ss_pred CceEEEEECCccccCCcHhHHHHHHhCCCCC
Confidence 4799999999999999999999999886443
No 141
>KOG1282|consensus
Probab=59.45 E-value=9.5 Score=27.58 Aligned_cols=27 Identities=22% Similarity=0.258 Sum_probs=25.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
-+++|.-|+.|.+||+.-++...++|.
T Consensus 364 ~rvliysGD~D~~~p~~gt~~~i~~L~ 390 (454)
T KOG1282|consen 364 YRVLIYSGDHDLVVPFLGTQAWIKSLN 390 (454)
T ss_pred eEEEEEeCCcceeCcchhhHHHHHhcc
Confidence 699999999999999999999988876
No 142
>PF02113 Peptidase_S13: D-Ala-D-Ala carboxypeptidase 3 (S13) family; InterPro: IPR000667 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This family of serine peptidases belong to MEROPS peptidase family S13 (D-Ala-D-Ala carboxypeptidase C, clan SE). The predicted active site residues for members of this family and family S12 occur in the motif SXXK. D-Ala-D-Ala carboxypeptidase C is involved in the metabolism of cell components []; it is synthesised with a leader peptide to target it to the cell membrane []. After cleavage of the leader peptide, the enzyme is retained in the membrane by a C-terminal anchor []. There are three families of serine-type D-Ala-D-Ala peptidase (designated S11, S12 and S13), which are also known as low molecular weight penicillin-binding proteins []. Family S13 comprises D-Ala-D-Ala peptidases that have sufficient sequence similarity around their active sites to assume a distant evolutionary relationship to other clan members; members of the S13 family also bind penicillin and have D-amino-peptidase activity. Proteases of family S11 have exclusive D-Ala-D-Ala peptidase activity, while some members of S12 are C beta-lactamases [].; GO: 0004185 serine-type carboxypeptidase activity, 0006508 proteolysis; PDB: 3A3F_B 3A3E_B 3A3D_A 3A3I_B 2Y59_C 1W8Q_A 3ZVT_B 3ZVW_B 2VGJ_B 1W79_D ....
Probab=59.45 E-value=20 Score=25.60 Aligned_cols=32 Identities=28% Similarity=0.298 Sum_probs=26.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
.--|++.|..|+..-.++...+++.|+++|+.
T Consensus 81 ~G~L~l~G~GDP~l~~~~l~~la~~l~~~Gi~ 112 (444)
T PF02113_consen 81 NGDLYLKGGGDPSLTSEDLWALAAQLKAAGIK 112 (444)
T ss_dssp ESEEEEEECSBTTBCHHHHHHHHHHHHHTT-S
T ss_pred CCcEEEEecCCCccCHHHHHHHHHHHHHcCCC
Confidence 34566779999999999999999999999974
No 143
>PRK11113 D-alanyl-D-alanine carboxypeptidase/endopeptidase; Provisional
Probab=58.80 E-value=20 Score=25.90 Aligned_cols=30 Identities=13% Similarity=0.081 Sum_probs=27.1
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
-|++.|..|+.--.+.-..++++|+++|+.
T Consensus 100 dL~i~G~GDPtL~~~~L~~la~~l~~~GI~ 129 (477)
T PRK11113 100 DLIARFGGDPTLTRQDLRNMVATLKKSGVK 129 (477)
T ss_pred eEEEEEecCCCCCHHHHHHHHHHHHHcCCc
Confidence 577799999999999999999999999985
No 144
>PRK10834 vancomycin high temperature exclusion protein; Provisional
Probab=58.16 E-value=17 Score=24.13 Aligned_cols=34 Identities=26% Similarity=0.336 Sum_probs=27.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
..+.+|+-|.... -.+.++..|.+.|.+.|++.+
T Consensus 81 k~~~ilvSGg~~~-~~~~Ea~~M~~yLi~~GVp~e 114 (239)
T PRK10834 81 KVNYLLLSGDNAL-QSYNEPMTMRKDLIAAGVDPS 114 (239)
T ss_pred CCCEEEEeCCCCC-CCCCHHHHHHHHHHHcCCCHH
Confidence 4677888887643 467899999999999999865
No 145
>KOG3363|consensus
Probab=58.14 E-value=20 Score=22.82 Aligned_cols=37 Identities=19% Similarity=0.135 Sum_probs=31.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
.|=+|+-|-.|..-|..-+.++.+-+++.|+..|.+|
T Consensus 111 kidlLIvG~Gd~~~p~~v~~~V~~F~k~~ki~lEi~d 147 (196)
T KOG3363|consen 111 KIDLLIVGCGDKKHPDKVRPSVRQFVKSHKIKLEIVD 147 (196)
T ss_pred CccEEEEecCCcCCchhcCHHHHHHHHHhCcceEEec
Confidence 5678888999999998888888888899999988765
No 146
>PLN02209 serine carboxypeptidase
Probab=57.22 E-value=12 Score=26.80 Aligned_cols=32 Identities=22% Similarity=0.266 Sum_probs=27.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
..++||..|..|.+||..-++.+.++|.=.+.
T Consensus 351 girVLiY~GD~D~icn~~Gte~wi~~L~w~~~ 382 (437)
T PLN02209 351 GYRSLIFSGDHDITMPFQATQAWIKSLNYSII 382 (437)
T ss_pred CceEEEEECCccccCCcHhHHHHHHhcCCccC
Confidence 47999999999999999999999999865443
No 147
>PF12913 SH3_6: SH3 domain of the SH3b1 type; PDB: 3M1U_B.
Probab=56.85 E-value=11 Score=19.27 Aligned_cols=14 Identities=14% Similarity=0.396 Sum_probs=10.8
Q ss_pred CCcEEEEEeCCCCc
Q psy7765 40 NKTFLLMHGTADDN 53 (77)
Q Consensus 40 ~~p~ll~hG~~D~~ 53 (77)
..|++|.|-..|..
T Consensus 26 gtPv~i~H~S~D~~ 39 (54)
T PF12913_consen 26 GTPVYILHTSRDGA 39 (54)
T ss_dssp T-EEEEEEE-TTSS
T ss_pred CCCEEEEEECCCCC
Confidence 69999999999975
No 148
>COG0627 Predicted esterase [General function prediction only]
Probab=56.41 E-value=15 Score=25.18 Aligned_cols=63 Identities=21% Similarity=0.131 Sum_probs=42.5
Q ss_pred hhhhhCCCCCCCCcccccccChhHHhcc---C----------CCCcEEEEEeCCCCccC--HHHHHHHHHHHHhcCCCce
Q psy7765 10 TERYMGLPEMYDNEVGYNWTDVTRKVGN---F----------VNKTFLLMHGTADDNVH--YQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 10 ~e~~lG~p~~~~~~~~y~~~sp~~~~~~---~----------~~~p~ll~hG~~D~~Vp--~~~s~~l~~al~~~g~~~~ 74 (77)
.+.++|.+ .. ...++-+|...+++ . ..+++++-.|..|.... ...+..|.+++++.|++.+
T Consensus 208 ~~~~~G~~--~~--~~w~~~D~~~~~~~l~~~~~~~~~~~~~~~~~~~~d~g~ad~~~~~~~~~~~~~~~a~~~~g~~~~ 283 (316)
T COG0627 208 FNAMLGPD--SD--PAWQENDPLSLIEKLVANANTRIWVYGGSPPELLIDNGPADFFLAANNLSTRAFAEALRAAGIPNG 283 (316)
T ss_pred HHHhcCCC--cc--ccccccCchhHHHHhhhcccccceecccCCCccccccccchhhhhhcccCHHHHHHHHHhcCCCce
Confidence 44566765 22 12444556655553 1 14888888999998885 3447889999999999877
Q ss_pred ee
Q psy7765 75 QQ 76 (77)
Q Consensus 75 ~~ 76 (77)
+.
T Consensus 284 ~~ 285 (316)
T COG0627 284 VR 285 (316)
T ss_pred ee
Confidence 54
No 149
>PLN02872 triacylglycerol lipase
Probab=56.36 E-value=9 Score=26.93 Aligned_cols=36 Identities=19% Similarity=0.132 Sum_probs=21.2
Q ss_pred CCcEEEEEeCC---CCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTA---DDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~---D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
.+|++++||.. |.-+.......++..|.++|-.+.+
T Consensus 74 ~~~Vll~HGl~~ss~~w~~~~~~~sla~~La~~GydV~l 112 (395)
T PLN02872 74 GPPVLLQHGLFMAGDAWFLNSPEQSLGFILADHGFDVWV 112 (395)
T ss_pred CCeEEEeCcccccccceeecCcccchHHHHHhCCCCccc
Confidence 57999999974 3322111123355567777876654
No 150
>COG3243 PhaC Poly(3-hydroxyalkanoate) synthetase [Lipid metabolism]
Probab=54.45 E-value=19 Score=26.03 Aligned_cols=32 Identities=13% Similarity=0.200 Sum_probs=26.5
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
+..+ +.|++.+.|..|.++|.+.....++.+.
T Consensus 326 L~~I-t~pvy~~a~~~DhI~P~~Sv~~g~~l~~ 357 (445)
T COG3243 326 LGDI-TCPVYNLAAEEDHIAPWSSVYLGARLLG 357 (445)
T ss_pred hhhc-ccceEEEeecccccCCHHHHHHHHHhcC
Confidence 5567 7999999999999999988777776543
No 151
>TIGR02803 ExbD_1 TonB system transport protein ExbD, group 1. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=54.40 E-value=22 Score=20.59 Aligned_cols=24 Identities=29% Similarity=0.403 Sum_probs=21.3
Q ss_pred CCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 49 TADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 49 ~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
..|..+|++....+.+.++++|+.
T Consensus 91 ~aD~~~~~~~vv~v~d~~~~aG~~ 114 (122)
T TIGR02803 91 RADKTVDYGDLMKVMNLLRQAGYL 114 (122)
T ss_pred EcCCCCCHHHHHHHHHHHHHcCCC
Confidence 368899999999999999999985
No 152
>PF06342 DUF1057: Alpha/beta hydrolase of unknown function (DUF1057); InterPro: IPR010463 This entry consists of proteins of unknown function which have an alpha/beta hydrolase fold.
Probab=53.61 E-value=22 Score=24.46 Aligned_cols=33 Identities=15% Similarity=0.171 Sum_probs=27.5
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
+... ..|+|+..|..|..|--+.+.+++..++.
T Consensus 208 ln~~-~ikvli~ygg~DhLIEeeI~~E~a~~f~~ 240 (297)
T PF06342_consen 208 LNKK-PIKVLIAYGGKDHLIEEEISFEFAMKFKG 240 (297)
T ss_pred hccC-CCcEEEEEcCcchhhHHHHHHHHHHHhCC
Confidence 3344 58999999999999999999999887643
No 153
>TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB. This model describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reducted sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterized polysaccharide deacetylase; the exact function this protein family is unknown.
Probab=53.52 E-value=28 Score=21.45 Aligned_cols=38 Identities=16% Similarity=0.190 Sum_probs=26.2
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
..+...++|+|...+...-.+....+...|+++|-.+-
T Consensus 148 ~~~~g~Iil~Hd~~~~~~t~~~l~~~i~~l~~~Gy~~v 185 (191)
T TIGR02764 148 NTKPGDIILLHASDSAKQTVKALPTIIKKLKEKGYEFV 185 (191)
T ss_pred cCCCCCEEEEeCCCCcHhHHHHHHHHHHHHHHCCCEEE
Confidence 34356799999644444445566778999999987654
No 154
>PRK00923 sirohydrochlorin cobaltochelatase; Reviewed
Probab=53.41 E-value=22 Score=20.54 Aligned_cols=27 Identities=15% Similarity=0.182 Sum_probs=17.9
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
+++-||+.|..- -.....+++.+++++
T Consensus 5 vlv~hGS~~~~~-~~~~~~~~~~l~~~~ 31 (126)
T PRK00923 5 LLVGHGSRLPYN-KEVVTKIAEKIKEKH 31 (126)
T ss_pred EEEeCCCCChHH-HHHHHHHHHHHHHhC
Confidence 567799887533 255666777777653
No 155
>PF12646 DUF3783: Domain of unknown function (DUF3783); InterPro: IPR016621 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=53.06 E-value=26 Score=17.82 Aligned_cols=28 Identities=29% Similarity=0.336 Sum_probs=20.7
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 42 TFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 42 p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
++++++|-.+. +-..+..++++.|++..
T Consensus 2 ~~ll~~g~~~~-----el~~~l~~~r~~~~~~~ 29 (58)
T PF12646_consen 2 EFLLFSGFSGE-----ELDKFLDALRKAGIPIP 29 (58)
T ss_pred CEEEECCCCHH-----HHHHHHHHHHHcCCCcc
Confidence 67888886554 56678888899888543
No 156
>cd03414 CbiX_SirB_C Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), C-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both CbiX and SirB are found in a wide range of bacteria.
Probab=52.32 E-value=15 Score=20.75 Aligned_cols=27 Identities=22% Similarity=0.398 Sum_probs=19.4
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
+++-||+.|.... ....++++.++++.
T Consensus 4 llv~HGS~~~~~~-~~~~~l~~~l~~~~ 30 (117)
T cd03414 4 VLVGRGSSDPDAN-ADVAKIARLLEEGT 30 (117)
T ss_pred EEEcCCCCCHHHH-HHHHHHHHHHHHhc
Confidence 5788999876443 56777888887653
No 157
>TIGR02804 ExbD_2 TonB system transport protein ExbD, group 2. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=52.23 E-value=22 Score=20.59 Aligned_cols=24 Identities=17% Similarity=0.306 Sum_probs=21.3
Q ss_pred CCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 49 TADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 49 ~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
..|..+++++...+-++++++|+.
T Consensus 90 ~aD~~~~~~~vv~v~d~~~~~G~~ 113 (121)
T TIGR02804 90 KSDKEAKFQDFVTITDMLKAKEHE 113 (121)
T ss_pred EeCCCCCHhHHHHHHHHHHHcCCC
Confidence 368899999999999999999975
No 158
>PRK08335 translation initiation factor IF-2B subunit alpha; Validated
Probab=52.01 E-value=18 Score=24.42 Aligned_cols=26 Identities=23% Similarity=0.190 Sum_probs=22.1
Q ss_pred CCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 51 DDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 51 D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
+..-|..|..+|++.|.+.|+++.++
T Consensus 141 ~EsrP~~qG~~la~eL~~~GI~vtlI 166 (275)
T PRK08335 141 TESAPDYEGLALANELEFLGIEFEVI 166 (275)
T ss_pred ecCCCchhHHHHHHHHHHCCCCEEEE
Confidence 35667789999999999999999875
No 159
>TIGR02801 tolR TolR protein. The model describes the inner membrane protein TolR, part of the TolR/TolQ complex that transduces energy from the proton-motive force, through TolA, to an outer membrane complex made up of TolB and Pal (peptidoglycan-associated lipoprotein). The complex is required to maintain outer membrane integrity, and defects may cause a defect in the import of some organic compounds in addition to the resulting morphologic. While several gene pairs homologous to talR and tolQ may be found in a single genome, but the scope of this model is set to favor finding only bone fide TolR, supported by operon structure as well as by score.
Probab=51.57 E-value=26 Score=20.40 Aligned_cols=28 Identities=29% Similarity=0.373 Sum_probs=23.1
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
.+++. .|..+++++...+.++++++|+.
T Consensus 95 ~v~i~--aD~~~~~~~vv~vmd~~~~~G~~ 122 (129)
T TIGR02801 95 PVLIR--ADKTVPYGEVIKVMALLKQAGIE 122 (129)
T ss_pred eEEEE--cCCCCCHHHHHHHHHHHHHcCCC
Confidence 34444 68899999999999999999984
No 160
>PHA01474 nonstructural protein
Probab=51.02 E-value=3.5 Score=20.30 Aligned_cols=14 Identities=36% Similarity=0.330 Sum_probs=10.7
Q ss_pred Cccchhhhhhhhhh
Q psy7765 1 MFWITNSIYTERYM 14 (77)
Q Consensus 1 ~w~~~~~~~~e~~l 14 (77)
|||.|++.-+..|.
T Consensus 1 mwriyesrttyeys 14 (52)
T PHA01474 1 MWRIYESRTTYEYS 14 (52)
T ss_pred CceEeeccceeeeh
Confidence 89999987766553
No 161
>COG4635 HemG Flavodoxin [Energy production and conversion / Coenzyme metabolism]
Probab=50.69 E-value=33 Score=21.73 Aligned_cols=35 Identities=23% Similarity=0.254 Sum_probs=27.4
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 42 TFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 42 p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
.+||..++.|...- +-+..++..|++.|..++++|
T Consensus 2 k~LIlYstr~GqT~-kIA~~iA~~L~e~g~qvdi~d 36 (175)
T COG4635 2 KTLILYSTRDGQTR-KIAEYIASHLRESGIQVDIQD 36 (175)
T ss_pred ceEEEEecCCCcHH-HHHHHHHHHhhhcCCeeeeee
Confidence 47889999988754 335668889999999998865
No 162
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=50.27 E-value=31 Score=18.14 Aligned_cols=21 Identities=19% Similarity=0.075 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHhcCCCceee
Q psy7765 56 YQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~~ 76 (77)
..++.++.+.|+++|++++++
T Consensus 11 t~~a~~~ek~lk~~gi~~~li 31 (73)
T PF11823_consen 11 THDAMKAEKLLKKNGIPVRLI 31 (73)
T ss_pred HHHHHHHHHHHHHCCCcEEEe
Confidence 457899999999999988764
No 163
>PF12531 DUF3731: DNA-K related protein ; InterPro: IPR021030 Proteins in this family are bacterial proteins of approximately 250 amino acids in length. There are two conserved sequence motifs: RPG and WRR. The proteins in this family are frequently annotated as DNA-K related proteins however there is little accompanying literature to confirm this.
Probab=49.18 E-value=19 Score=24.11 Aligned_cols=38 Identities=21% Similarity=0.133 Sum_probs=28.3
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
+.-.+..+ .---.+||..+.+||++.+....++|.+.+
T Consensus 136 ~wWAlGRl--gaR~p~yGs~h~Vvp~~~~~~wl~~ll~~d 173 (249)
T PF12531_consen 136 HWWALGRL--GARVPFYGSAHNVVPPEVAEQWLDALLALD 173 (249)
T ss_pred hHHHHHHH--HhcCcccCCcccccCHHHHHHHHHHHHhcC
Confidence 34445555 223357899999999999999999987765
No 164
>PF08538 DUF1749: Protein of unknown function (DUF1749); InterPro: IPR013744 This is a plant and fungal family of unknown function. This family contains many hypothetical proteins. ; PDB: 2Q0X_B.
Probab=47.16 E-value=15 Score=25.27 Aligned_cols=34 Identities=24% Similarity=0.259 Sum_probs=10.0
Q ss_pred hccCCCCcEEEEEeCCCCccCHHH-HHHHHHHHHhc
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQ-SMMLAKALEAA 69 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~-s~~l~~al~~~ 69 (77)
...+ ..|+|++.+.+|..||..- -..+.++.+++
T Consensus 228 fG~v-~~plLvl~Sg~DEyvP~~vdk~~Ll~rw~~a 262 (303)
T PF08538_consen 228 FGKV-SKPLLVLYSGKDEYVPPWVDKEALLERWKAA 262 (303)
T ss_dssp GGG---S-EEEEEE--TT------------------
T ss_pred hccC-CCceEEEecCCCceecccccccccccccccc
Confidence 3445 5899999999999999744 34466665543
No 165
>PF00756 Esterase: Putative esterase; InterPro: IPR000801 This family contains several seemingly unrelated proteins, including human esterase D; mycobacterial antigen 85, which is responsible for the high affinity of mycobacteria to fibronectin; Corynebacterium glutamicum major secreted protein PS1; and hypothetical proteins from Escherichia coli, yeast, mycobacteria and Haemophilus influenzae.; PDB: 3LS2_A 1VA5_B 1DQZ_B 3HRH_A 1DQY_A 2GZR_A 2GZS_A 3GFF_A 1R88_A 3E4D_D ....
Probab=46.77 E-value=17 Score=22.99 Aligned_cols=51 Identities=20% Similarity=0.079 Sum_probs=30.1
Q ss_pred ccccChhHHhc----cCCCCcEEEEEeCCCCccC----------HHHHHHHHHHHHhcCCCceee
Q psy7765 26 YNWTDVTRKVG----NFVNKTFLLMHGTADDNVH----------YQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 26 y~~~sp~~~~~----~~~~~p~ll~hG~~D~~Vp----------~~~s~~l~~al~~~g~~~~~~ 76 (77)
+...++...+. +....+++|..|+.|.... .....++.+.|+..|++..++
T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~i~l~~G~~d~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~ 229 (251)
T PF00756_consen 165 WKENDPFDLIKALSQKKKPLRIYLDVGTKDEFGGWEDSAQILQFLANNRELAQLLKAKGIPHTYH 229 (251)
T ss_dssp HGGCHHHHHHHHHHHTTSEEEEEEEEETTSTTHHCSHHHHHHHHHHHHHHHHHHCCCEECTTESE
T ss_pred hhhccHHHHhhhhhcccCCCeEEEEeCCCCcccccccCHHHHHHHHHhHhhHHHHHHcCCCceEE
Confidence 44455555442 2225788999999999443 233444555566666666543
No 166
>KOG2521|consensus
Probab=46.60 E-value=36 Score=23.91 Aligned_cols=36 Identities=8% Similarity=-0.031 Sum_probs=30.7
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+.+-+-|..|.++|..+.++|++..++.|+.+.-+
T Consensus 226 ~~~ly~~s~~d~v~~~~~ie~f~~~~~~~g~~v~s~ 261 (350)
T KOG2521|consen 226 WNQLYLYSDNDDVLPADEIEKFIALRREKGVNVKSV 261 (350)
T ss_pred ccceeecCCccccccHHHHHHHHHHHHhcCceEEEe
Confidence 566667799999999999999999999999877643
No 167
>PRK11024 colicin uptake protein TolR; Provisional
Probab=46.42 E-value=34 Score=20.41 Aligned_cols=27 Identities=19% Similarity=0.335 Sum_probs=22.5
Q ss_pred EEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 44 LLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 44 ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
+++. .|..+|++....+-++++++|+.
T Consensus 106 V~i~--aD~~~~~~~vv~vmd~~k~aG~~ 132 (141)
T PRK11024 106 FLIG--GAKDVPYDEIIKALNLLHSAGVK 132 (141)
T ss_pred EEEE--cCCCCCHHHHHHHHHHHHHcCCC
Confidence 4444 57889999999999999999984
No 168
>smart00460 TGc Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
Probab=46.17 E-value=36 Score=16.93 Aligned_cols=20 Identities=10% Similarity=0.008 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHhcCCCceee
Q psy7765 57 QQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 57 ~~s~~l~~al~~~g~~~~~~ 76 (77)
..+..+...|+..|++++++
T Consensus 11 ~~a~l~~~llr~~GIpar~v 30 (68)
T smart00460 11 EFAALFVALLRSLGIPARVV 30 (68)
T ss_pred HHHHHHHHHHHHCCCCeEEE
Confidence 45677788889999999875
No 169
>COG2949 SanA Uncharacterized membrane protein [Function unknown]
Probab=46.02 E-value=26 Score=23.15 Aligned_cols=35 Identities=26% Similarity=0.384 Sum_probs=30.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
+...||+-|++ ..+.+.+...|.+.|.+.|++.+.
T Consensus 93 KV~~LLlSGDN-~~~sYnEp~tM~kdL~~~GVp~~~ 127 (235)
T COG2949 93 KVNYLLLSGDN-ATVSYNEPRTMRKDLIAAGVPAKN 127 (235)
T ss_pred CeeEEEEecCC-CcccccchHHHHHHHHHcCCCHHH
Confidence 67889999954 788999999999999999998764
No 170
>PLN02757 sirohydrochlorine ferrochelatase
Probab=45.89 E-value=24 Score=21.59 Aligned_cols=28 Identities=7% Similarity=0.080 Sum_probs=21.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
.=+++-||+.|+. ......++++.+++.
T Consensus 15 ~lllvgHGSrd~~-a~~~~~~la~~l~~~ 42 (154)
T PLN02757 15 GVVIVDHGSRRKE-SNLMLEEFVAMYKQK 42 (154)
T ss_pred EEEEEeCCCCCHH-HHHHHHHHHHHHHhh
Confidence 5578899999997 446677788888764
No 171
>PF07521 RMMBL: RNA-metabolising metallo-beta-lactamase; InterPro: IPR011108 The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in IPR001279 from INTERPRO and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in RNA metabolism [].; PDB: 3ZQ4_D 2I7T_A 2I7V_A 2YCB_B 3BK1_A 3T3N_A 3BK2_A 3T3O_A 3AF5_A 3AF6_A ....
Probab=45.75 E-value=3.5 Score=19.79 Aligned_cols=21 Identities=14% Similarity=0.419 Sum_probs=13.7
Q ss_pred ChhHHhccCCCCcEEEEEeCC
Q psy7765 30 DVTRKVGNFVNKTFLLMHGTA 50 (77)
Q Consensus 30 sp~~~~~~~~~~p~ll~hG~~ 50 (77)
+....++.+....++++||+.
T Consensus 22 ~L~~~i~~~~p~~vilVHGe~ 42 (43)
T PF07521_consen 22 ELLEFIEQLNPRKVILVHGEP 42 (43)
T ss_dssp HHHHHHHHHCSSEEEEESSEH
T ss_pred HHHHHHHhcCCCEEEEecCCC
Confidence 345555555347889999964
No 172
>PF09198 T4-Gluco-transf: Bacteriophage T4 beta-glucosyltransferase; InterPro: IPR015281 Members of this family are DNA-modifying enzymes encoded by bacteriophage T4 that transfer glucose from uridine diphosphoglucose to 5-hydroxymethyl cytosine bases of phage T4 DNA []. ; PDB: 1J39_A 1SXQ_B 1NZF_A 1M5R_B 1JEJ_A 1JIV_A 1NZD_A 1NVK_A 2BGU_A 1JIU_A ....
Probab=45.58 E-value=22 Score=16.57 Aligned_cols=22 Identities=9% Similarity=0.169 Sum_probs=17.7
Q ss_pred CccCHHHHHHHHHHHHhcCCCc
Q psy7765 52 DNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 52 ~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
..||.+++.-+.+-+++.|.++
T Consensus 16 kt~p~setiyl~~~~~~mgl~v 37 (38)
T PF09198_consen 16 KTTPSSETIYLFKCISDMGLNV 37 (38)
T ss_dssp SSHHHHHHHHHHHHHHTTT-EE
T ss_pred eecCccceEeHHHHHHHhCCCC
Confidence 4678899999999999988765
No 173
>cd03416 CbiX_SirB_N Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), N-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both are found in a wide range of bacteria. This subgroup also contains single domain proteins from archaea and bacteria which may represent the ancestral form of class II chelatases before domain duplication occurred.
Probab=44.70 E-value=33 Score=18.84 Aligned_cols=27 Identities=26% Similarity=0.309 Sum_probs=19.0
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcC
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g 70 (77)
+++-||+.+.... .....+++.+++..
T Consensus 3 vlv~hGS~~~~~~-~~~~~l~~~l~~~~ 29 (101)
T cd03416 3 LLVGHGSRDPRAA-EALEALAERLRERL 29 (101)
T ss_pred EEEEcCCCCHHHH-HHHHHHHHHHHhhC
Confidence 5778999887332 45667888887764
No 174
>PRK11267 biopolymer transport protein ExbD; Provisional
Probab=44.66 E-value=36 Score=20.28 Aligned_cols=28 Identities=25% Similarity=0.444 Sum_probs=22.8
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
.+++. .|..+|++....+-++++++|+.
T Consensus 101 ~V~I~--aD~~~~~~~vv~vmd~l~~aG~~ 128 (141)
T PRK11267 101 TIFFR--ADKTVDYETLMKVMDTLHQAGYL 128 (141)
T ss_pred eEEEE--cCCCCCHHHHHHHHHHHHHcCCC
Confidence 34444 57889999999999999999984
No 175
>PRK13604 luxD acyl transferase; Provisional
Probab=44.50 E-value=50 Score=22.69 Aligned_cols=32 Identities=25% Similarity=0.251 Sum_probs=26.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
.+.+++.||..... .+...+++.|.++|..+=
T Consensus 37 ~~~vIi~HGf~~~~---~~~~~~A~~La~~G~~vL 68 (307)
T PRK13604 37 NNTILIASGFARRM---DHFAGLAEYLSSNGFHVI 68 (307)
T ss_pred CCEEEEeCCCCCCh---HHHHHHHHHHHHCCCEEE
Confidence 36688999999865 458999999999998653
No 176
>COG0848 ExbD Biopolymer transport protein [Intracellular trafficking and secretion]
Probab=43.87 E-value=35 Score=20.56 Aligned_cols=31 Identities=23% Similarity=0.350 Sum_probs=25.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
..|.+++. +|..|+++...+.-++|+++|..
T Consensus 96 ~~~~v~i~--aD~~v~y~~vv~vm~~l~~aG~~ 126 (137)
T COG0848 96 KNPRVVIR--ADKNVKYGTVVKVMDLLKEAGFK 126 (137)
T ss_pred CCceEEEE--eCCCCCHHHHHHHHHHHHHcCCc
Confidence 35555555 58899999999999999999874
No 177
>cd00858 GlyRS_anticodon GlyRS Glycyl-anticodon binding domain. GlyRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=43.74 E-value=62 Score=18.52 Aligned_cols=34 Identities=3% Similarity=-0.057 Sum_probs=23.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
.+++++.-+ |.......+.++++.|+++|+.+++
T Consensus 27 ~~v~Ii~~~-~~~~~~~~a~~la~~LR~~gi~v~~ 60 (121)
T cd00858 27 IKVAVLPLV-KRDELVEIAKEISEELRELGFSVKY 60 (121)
T ss_pred cEEEEEecC-CcHHHHHHHHHHHHHHHHCCCEEEE
Confidence 344555543 2233456889999999999998876
No 178
>PF06713 bPH_4: Bacterial PH domain; InterPro: IPR009589 This entry is represented by Bacteriophage SP-beta, YolF. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical proteins specific to Oceanobacillus and Bacillus species. Members of this family are typically around 130 residues in length. The function of this family is unknown.
Probab=43.49 E-value=24 Score=18.78 Aligned_cols=31 Identities=6% Similarity=0.060 Sum_probs=23.8
Q ss_pred CCcEEEEEeCCCC-ccCHHHHHHHHHHHHhcC
Q psy7765 40 NKTFLLMHGTADD-NVHYQQSMMLAKALEAAD 70 (77)
Q Consensus 40 ~~p~ll~hG~~D~-~Vp~~~s~~l~~al~~~g 70 (77)
...+.|-+|..+. .|.|++-.+|.++|++.+
T Consensus 43 ~~rl~I~y~~~~~i~IsP~~~~~FI~~L~k~n 74 (74)
T PF06713_consen 43 LDRLEIYYGKYKSILISPKDKEEFIAELQKRN 74 (74)
T ss_pred ccEEEEEECCCCEEEEECCCHHHHHHHHHhhC
Confidence 3567777775554 678999999999998864
No 179
>cd00860 ThrRS_anticodon ThrRS Threonyl-anticodon binding domain. ThrRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=43.30 E-value=50 Score=17.27 Aligned_cols=21 Identities=29% Similarity=0.208 Sum_probs=17.3
Q ss_pred CHHHHHHHHHHHHhcCCCcee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~ 75 (77)
....+..+++.|++.|..+++
T Consensus 13 ~~~~a~~~~~~Lr~~g~~v~~ 33 (91)
T cd00860 13 HLDYAKEVAKKLSDAGIRVEV 33 (91)
T ss_pred HHHHHHHHHHHHHHCCCEEEE
Confidence 466788999999999988775
No 180
>COG3367 Uncharacterized conserved protein [Function unknown]
Probab=43.29 E-value=66 Score=22.59 Aligned_cols=37 Identities=16% Similarity=0.043 Sum_probs=32.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..+..+.-||...+=-..-+..|.+++++.|+.+.++
T Consensus 147 ~a~~V~vvGTd~~vGKrTTa~~L~~~~~e~G~~a~fv 183 (339)
T COG3367 147 DAKVVLVVGTDCAVGKRTTALELREAAREEGIKAGFV 183 (339)
T ss_pred CCcEEEEeccccccchhHHHHHHHHHHHHhCCccceE
Confidence 5789999999888888999999999999999988764
No 181
>PF01674 Lipase_2: Lipase (class 2); InterPro: IPR002918 Lipases or triacylglycerol acylhydrolases hydrolyse ester bonds in triacylglycerol giving diacylglycerol, monoacylglycerol, glycerol and free fatty acids []. This group of lipases has been called class 2 as they are not clearly related to other lipase families, and includes LipA and LipB from Bacillus subtilis [] and uncharacterised proteins from Caenorhabditis.; PDB: 2VTV_B 2X76_A 2X5X_A 2QXU_A 3QMM_A 1I6W_A 3D2C_J 2QXT_B 1R50_A 1T2N_A ....
Probab=42.59 E-value=25 Score=22.84 Aligned_cols=29 Identities=34% Similarity=0.500 Sum_probs=15.4
Q ss_pred CCcEEEEEeCCC-CccCHHHHHHHHHHHHhcCC
Q psy7765 40 NKTFLLMHGTAD-DNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 40 ~~p~ll~hG~~D-~~Vp~~~s~~l~~al~~~g~ 71 (77)
+.|++|+||+.. ....+ ..++..|+++|-
T Consensus 1 ~~PVVlVHG~~~~~~~~w---~~~~~~l~~~GY 30 (219)
T PF01674_consen 1 NRPVVLVHGTGGNAYSNW---STLAPYLKAAGY 30 (219)
T ss_dssp S--EEEE--TTTTTCGGC---CHHHHHHHHTT-
T ss_pred CCCEEEECCCCcchhhCH---HHHHHHHHHcCC
Confidence 379999999996 33333 345566777774
No 182
>PF08532 Glyco_hydro_42M: Beta-galactosidase trimerisation domain; InterPro: IPR013738 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is non catalytic domain B of beta-galactosidase enzymes belong to the glycosyl hydrolase 42 family. This domain is related to glutamine amidotransferase enzymes, but the catalytic residues are replaced by non functional amino acids. This domain is involved in trimerisation. ; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process; PDB: 1KWK_A 1KWG_A.
Probab=42.34 E-value=32 Score=21.72 Aligned_cols=22 Identities=18% Similarity=-0.097 Sum_probs=16.9
Q ss_pred CHHHHHHHHHHHHhcCCCceee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~~ 76 (77)
...+...++++|.+.|++++++
T Consensus 28 y~~~~~~~y~al~~~gi~vDvv 49 (207)
T PF08532_consen 28 YRDQVRGWYRALRELGIPVDVV 49 (207)
T ss_dssp HHHHHHHHHHHHHTTT--EEEE
T ss_pred HHHHHHHHHHHHHHcCCceEEe
Confidence 4566788999999999999987
No 183
>cd00861 ProRS_anticodon_short ProRS Prolyl-anticodon binding domain, short version found predominantly in bacteria. ProRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=42.30 E-value=54 Score=17.38 Aligned_cols=21 Identities=19% Similarity=0.104 Sum_probs=16.6
Q ss_pred CHHHHHHHHHHHHhcCCCcee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~ 75 (77)
....+..+++.|+++|+.+++
T Consensus 16 ~~~~a~~la~~Lr~~g~~v~~ 36 (94)
T cd00861 16 QQELAEKLYAELQAAGVDVLL 36 (94)
T ss_pred HHHHHHHHHHHHHHCCCEEEE
Confidence 345688899999999988875
No 184
>TIGR00668 apaH bis(5'-nucleosyl)-tetraphosphatase (symmetrical). Alternate names include diadenosine-tetraphosphatase and Ap4A hydrolase.
Probab=42.30 E-value=57 Score=22.14 Aligned_cols=28 Identities=21% Similarity=0.277 Sum_probs=23.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
..++++||.-++..+.+++..+++.+.+
T Consensus 114 ~~~~lVHAGi~P~w~l~~a~~~a~eve~ 141 (279)
T TIGR00668 114 KKLVMAHAGITPQWDLQTAKECARDVEA 141 (279)
T ss_pred CCEEEEecCCCCCCcHHHHHHHHHHHHH
Confidence 4578999999999999999987776544
No 185
>KOG3253|consensus
Probab=42.06 E-value=45 Score=25.65 Aligned_cols=34 Identities=12% Similarity=0.207 Sum_probs=29.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
+..+ +.|+|++.|..|..+++..-+.+.+.+++.
T Consensus 300 Lldm-k~PVLFV~Gsnd~mcspn~ME~vreKMqA~ 333 (784)
T KOG3253|consen 300 LLDM-KQPVLFVIGSNDHMCSPNSMEEVREKMQAE 333 (784)
T ss_pred hHhc-CCceEEEecCCcccCCHHHHHHHHHHhhcc
Confidence 4456 699999999999999999999999988764
No 186
>KOG4178|consensus
Probab=41.76 E-value=42 Score=23.35 Aligned_cols=22 Identities=18% Similarity=0.394 Sum_probs=18.9
Q ss_pred hccCCCCcEEEEEeCCCCccCHH
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQ 57 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~ 57 (77)
..++ ..|++++.|..|..-++.
T Consensus 254 ~~~i-~iPv~fi~G~~D~v~~~p 275 (322)
T KOG4178|consen 254 LAKI-TIPVLFIWGDLDPVLPYP 275 (322)
T ss_pred cccc-ccceEEEEecCcccccch
Confidence 4456 799999999999999877
No 187
>TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase. This model represents a clade of amino-acid N-acetyltransferases acting mainly on glutamate in the first step of the "acetylated" ornithine biosynthesis pathway. For this reason it is also called N-acetylglutamate synthase. The enzyme may also act on aspartate.
Probab=41.04 E-value=58 Score=22.97 Aligned_cols=29 Identities=17% Similarity=0.189 Sum_probs=22.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
..++.|+||.. -++-+.+.+.|++.++++
T Consensus 48 g~~~vlVHGgg---------~~i~~~~~~~g~~~~~~~ 76 (429)
T TIGR01890 48 GVRLVLVHGAR---------PQIERILAARGRTPHYHR 76 (429)
T ss_pred CCcEEEEcCCC---------HHHHHHHHHcCCCceeeC
Confidence 46899999987 556667778899887763
No 188
>TIGR00250 RNAse_H_YqgF RNAse H-fold protein YqgF. This protein family, which exhibits an RNAse H fold in crystal structure, has been proposed as a putative Holliday junction resolvase, an alternate to RuvC.
Probab=40.26 E-value=76 Score=18.77 Aligned_cols=37 Identities=14% Similarity=0.123 Sum_probs=25.2
Q ss_pred CcEEEEEe-----CCCCccCHHHHHHHHHHHHhc-CCCceeeC
Q psy7765 41 KTFLLMHG-----TADDNVHYQQSMMLAKALEAA-DIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG-----~~D~~Vp~~~s~~l~~al~~~-g~~~~~~~ 77 (77)
.+-.++-| .....---..+..|++.|++. ++++.++|
T Consensus 48 ~~~~iVvGlP~~~dG~~~~~a~~v~~f~~~L~~~~~~~v~~~D 90 (130)
T TIGR00250 48 TPDKIVVGLPLNMDGTEGPLTERAQKFANRLEGRFGVPVVLWD 90 (130)
T ss_pred CCCEEEEeccCCCCcCcCHHHHHHHHHHHHHHHHhCCCEEEEc
Confidence 45667777 333333346677899999766 88988876
No 189
>COG4947 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.19 E-value=44 Score=21.71 Aligned_cols=58 Identities=17% Similarity=0.312 Sum_probs=38.2
Q ss_pred hhCCCCCCCCcccccccChhHHhccCC---------CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 13 YMGLPEMYDNEVGYNWTDVTRKVGNFV---------NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 13 ~lG~p~~~~~~~~y~~~sp~~~~~~~~---------~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
++|.- .++...| .||..++..+. ...+.+.-|.+|+..+ +..+|.+.|.+..++.++.
T Consensus 141 ffg~y--yddDv~y--nsP~dylpg~~dp~~l~rlr~~~~vfc~G~e~~~L~--~~~~L~~~l~dKqipaw~~ 207 (227)
T COG4947 141 FFGGY--YDDDVYY--NSPSDYLPGLADPFRLERLRRIDMVFCIGDEDPFLD--NNQHLSRLLSDKQIPAWMH 207 (227)
T ss_pred hcccc--ccCceee--cChhhhccCCcChHHHHHHhhccEEEEecCcccccc--chHHHHHHhccccccHHHH
Confidence 45655 3333333 68887776652 2346777888888777 4567778888888887653
No 190
>COG1184 GCD2 Translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family [Translation, ribosomal structure and biogenesis]
Probab=39.01 E-value=31 Score=23.71 Aligned_cols=23 Identities=17% Similarity=0.086 Sum_probs=20.1
Q ss_pred cCHHHHHHHHHHHHhcCCCceee
Q psy7765 54 VHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 54 Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
-|-.|...+++.|++.|+++.++
T Consensus 154 RP~~eG~~~ak~L~~~gI~~~~I 176 (301)
T COG1184 154 RPRGEGRIMAKELRQSGIPVTVI 176 (301)
T ss_pred CCcchHHHHHHHHHHcCCceEEE
Confidence 47778999999999999998875
No 191
>PRK00109 Holliday junction resolvase-like protein; Reviewed
Probab=38.59 E-value=83 Score=18.80 Aligned_cols=37 Identities=16% Similarity=0.150 Sum_probs=24.9
Q ss_pred CcEEEEEe-----CCCCccCHHHHHHHHHHHHhc-CCCceeeC
Q psy7765 41 KTFLLMHG-----TADDNVHYQQSMMLAKALEAA-DIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG-----~~D~~Vp~~~s~~l~~al~~~-g~~~~~~~ 77 (77)
.+..++-| .....---..+..|++.|++. ++++.++|
T Consensus 54 ~i~~iVvGlP~~~~G~~~~~~~~v~~f~~~L~~~~~~~v~~~D 96 (138)
T PRK00109 54 QPDGLVVGLPLNMDGTEGPRTERARKFANRLEGRFGLPVVLVD 96 (138)
T ss_pred CCCEEEEeccCCCCCCcCHHHHHHHHHHHHHHHHhCCCEEEEc
Confidence 55667777 333333346677899999775 88888876
No 192
>PRK00093 GTP-binding protein Der; Reviewed
Probab=38.40 E-value=96 Score=21.63 Aligned_cols=36 Identities=11% Similarity=0.101 Sum_probs=29.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc----CCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA----DIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~----g~~~~~ 75 (77)
.+|+|++++..-..+|...-.-|.+.+++. |+|..+
T Consensus 387 ~pp~f~~~~n~~~~~~~~y~~~l~~~~r~~~~~~g~p~~~ 426 (435)
T PRK00093 387 NPPTFVLFVNDPELLPFSYKRYLENQLREAFDFEGTPIRL 426 (435)
T ss_pred CCCEEEEEeCCcccCCHHHHHHHHHHHHHHcCCCcccEEE
Confidence 699999999988899999888888888874 555544
No 193
>PRK06372 translation initiation factor IF-2B subunit delta; Provisional
Probab=38.29 E-value=42 Score=22.46 Aligned_cols=24 Identities=17% Similarity=0.044 Sum_probs=20.4
Q ss_pred ccCHHHHHHHHHHHHhcCCCceee
Q psy7765 53 NVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 53 ~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.-|..+...+++.|.+.|++++++
T Consensus 117 SrP~~eG~~~a~~L~~~GI~vtli 140 (253)
T PRK06372 117 SRPMLEGIDMAKLLVKSGIDVVLL 140 (253)
T ss_pred CCCchHHHHHHHHHHHCCCCEEEE
Confidence 347778899999999999999875
No 194
>COG2344 AT-rich DNA-binding protein [General function prediction only]
Probab=38.28 E-value=37 Score=22.15 Aligned_cols=42 Identities=17% Similarity=0.204 Sum_probs=27.9
Q ss_pred ccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 24 VGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 24 ~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
..|.-.+....+.+. ..-+-++ .||-.++.+.++.|.++|+.
T Consensus 131 ~V~~~d~le~~v~~~-dv~iaiL------tVPa~~AQ~vad~Lv~aGVk 172 (211)
T COG2344 131 PVYDLDDLEKFVKKN-DVEIAIL------TVPAEHAQEVADRLVKAGVK 172 (211)
T ss_pred eeechHHHHHHHHhc-CccEEEE------EccHHHHHHHHHHHHHcCCc
Confidence 334444444455544 3444333 68999999999999999985
No 195
>KOG3101|consensus
Probab=37.94 E-value=24 Score=23.67 Aligned_cols=40 Identities=20% Similarity=0.323 Sum_probs=23.9
Q ss_pred hhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccC
Q psy7765 13 YMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVH 55 (77)
Q Consensus 13 ~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp 55 (77)
|||.. ...=+.|.+.-.++..... ..-+||=+|..|.-.+
T Consensus 191 YLG~~--ka~W~~yDat~lik~y~~~-~~~ilIdqG~~D~Fl~ 230 (283)
T KOG3101|consen 191 YLGDN--KAQWEAYDATHLIKNYRGV-GDDILIDQGAADNFLA 230 (283)
T ss_pred ccCCC--hHHHhhcchHHHHHhcCCC-CccEEEecCccchhhh
Confidence 45543 2222445555556555555 4568888888887666
No 196
>PF02472 ExbD: Biopolymer transport protein ExbD/TolR; InterPro: IPR003400 This group of proteins are membrane bound transport proteins essential for ferric ion uptake in bacteria []. The family consists of ExbD, and TolR which are involved in TonB-dependent transport of various receptor bound substrates including colicins [].; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane; PDB: 2JWL_A 2JWK_A 2PFU_A.
Probab=37.75 E-value=45 Score=19.02 Aligned_cols=28 Identities=25% Similarity=0.368 Sum_probs=20.1
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
.+++.+ |..++++....+.++++++|..
T Consensus 94 ~v~i~a--D~~~~y~~vv~vl~~l~~~g~~ 121 (130)
T PF02472_consen 94 RVLIRA--DKDAPYQDVVDVLDALREAGFT 121 (130)
T ss_dssp -EEEEE---TTS-HHHHHHHHHHHHHTT--
T ss_pred eEEEEe--CCCCCHHHHHHHHHHHHHcCCC
Confidence 566665 7889999999999999999974
No 197
>TIGR03230 lipo_lipase lipoprotein lipase. Members of this protein family are lipoprotein lipase (EC 3.1.1.34), a eukaryotic triacylglycerol lipase active in plasma and similar to pancreatic and hepatic triacylglycerol lipases (EC 3.1.1.3). It is also called clearing factor. It cleaves chylomicron and VLDL triacylglycerols; it also has phospholipase A-1 activity.
Probab=37.73 E-value=79 Score=22.87 Aligned_cols=29 Identities=28% Similarity=0.305 Sum_probs=18.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
.++++|+||-.+....-.....+.++|..
T Consensus 41 ~ptvIlIHG~~~s~~~~~w~~~l~~al~~ 69 (442)
T TIGR03230 41 TKTFIVIHGWTVTGMFESWVPKLVAALYE 69 (442)
T ss_pred CCeEEEECCCCcCCcchhhHHHHHHHHHh
Confidence 68999999988753222334456766653
No 198
>PF13344 Hydrolase_6: Haloacid dehalogenase-like hydrolase; PDB: 2HO4_B 1YV9_A 1WVI_B 3EPR_A 2P27_A 2OYC_A 2CFT_A 2P69_A 2CFS_A 2CFR_A ....
Probab=37.28 E-value=22 Score=19.97 Aligned_cols=22 Identities=9% Similarity=-0.133 Sum_probs=16.4
Q ss_pred CHHHHHHHHHHHHhcCCCceee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~~ 76 (77)
++..+.++.++|+++|+++-++
T Consensus 15 ~ipga~e~l~~L~~~g~~~~~l 36 (101)
T PF13344_consen 15 PIPGAVEALDALRERGKPVVFL 36 (101)
T ss_dssp E-TTHHHHHHHHHHTTSEEEEE
T ss_pred cCcCHHHHHHHHHHcCCCEEEE
Confidence 5557888888888888877664
No 199
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=37.09 E-value=36 Score=13.79 Aligned_cols=16 Identities=6% Similarity=0.358 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHHhcCC
Q psy7765 56 YQQSMMLAKALEAADI 71 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~ 71 (77)
+..+.++.+.+++.|+
T Consensus 16 ~~~a~~~~~~M~~~g~ 31 (31)
T PF01535_consen 16 FEEALEVFDEMRERGI 31 (31)
T ss_pred HHHHHHHHHHHhHCcC
Confidence 4567778887777764
No 200
>cd00859 HisRS_anticodon HisRS Histidyl-anticodon binding domain. HisRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=36.85 E-value=50 Score=16.92 Aligned_cols=21 Identities=24% Similarity=0.173 Sum_probs=17.0
Q ss_pred CHHHHHHHHHHHHhcCCCcee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~ 75 (77)
....+.++++.|++.|..+++
T Consensus 13 ~~~~a~~i~~~Lr~~g~~v~~ 33 (91)
T cd00859 13 ALSEALELAEQLRDAGIKAEI 33 (91)
T ss_pred HHHHHHHHHHHHHHCCCEEEE
Confidence 335688999999999988765
No 201
>PF07137 VDE: Violaxanthin de-epoxidase (VDE); InterPro: IPR010788 This family represents a conserved region approximately 350 residues long within plant violaxanthin de-epoxidase (VDE). In higher plants, violaxanthin de-epoxidase forms part of a conserved system that dissipates excess energy as heat in the light-harvesting complexes of photosystem II (PSII), thus protecting them from photo-inhibitory damage [].; GO: 0046422 violaxanthin de-epoxidase activity, 0055114 oxidation-reduction process, 0009507 chloroplast; PDB: 3CQN_B 3CQR_A.
Probab=36.37 E-value=26 Score=22.65 Aligned_cols=26 Identities=23% Similarity=0.109 Sum_probs=19.0
Q ss_pred CCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 49 TADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 49 ~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
+.+...|.+...++.+++++.|+.+.
T Consensus 154 trs~~lP~s~~p~l~~aa~k~G~d~~ 179 (198)
T PF07137_consen 154 TRSPTLPESIVPELRRAAKKAGIDFS 179 (198)
T ss_dssp ESSSS--GGGHHHHHHHHHHTT--GG
T ss_pred eCCCCCChHHhHHHHHHHHHhCCCHH
Confidence 35788899999999999999999763
No 202
>PRK08535 translation initiation factor IF-2B subunit delta; Provisional
Probab=35.83 E-value=45 Score=22.71 Aligned_cols=25 Identities=20% Similarity=0.061 Sum_probs=21.1
Q ss_pred CccCHHHHHHHHHHHHhcCCCceee
Q psy7765 52 DNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 52 ~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..-|..+...+++.|.+.|+++.++
T Consensus 153 EsrP~~~G~~~a~~L~~~GI~vtlI 177 (310)
T PRK08535 153 ETRPRNQGHITAKELAEYGIPVTLI 177 (310)
T ss_pred cCCchhhHHHHHHHHHHCCCCEEEE
Confidence 4457788899999999999999875
No 203
>PF13041 PPR_2: PPR repeat family
Probab=35.66 E-value=54 Score=15.40 Aligned_cols=20 Identities=15% Similarity=0.230 Sum_probs=16.3
Q ss_pred CHHHHHHHHHHHHhcCCCce
Q psy7765 55 HYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~ 74 (77)
.++++.++++.+++.|+...
T Consensus 18 ~~~~a~~l~~~M~~~g~~P~ 37 (50)
T PF13041_consen 18 KFEEALKLFKEMKKRGIKPD 37 (50)
T ss_pred CHHHHHHHHHHHHHcCCCCC
Confidence 46789999999999988643
No 204
>PF05902 4_1_CTD: 4.1 protein C-terminal domain (CTD); InterPro: IPR008379 There is a unique sequence domain at the C terminus of all known 4.1 proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24 kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates [].; GO: 0003779 actin binding, 0005198 structural molecule activity, 0005856 cytoskeleton
Probab=35.64 E-value=47 Score=19.69 Aligned_cols=24 Identities=29% Similarity=0.523 Sum_probs=16.0
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 42 TFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 42 p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
.-++|+|+.| |. |=..|+.|++++
T Consensus 71 KRIvITGD~D--ID--hDqaLa~aI~eA 94 (114)
T PF05902_consen 71 KRIVITGDAD--ID--HDQALAQAIKEA 94 (114)
T ss_pred EEEEEecCCC--cc--hHHHHHHHHHHH
Confidence 3478899888 44 455567776665
No 205
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=35.57 E-value=99 Score=20.98 Aligned_cols=39 Identities=15% Similarity=0.122 Sum_probs=31.9
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
+. +.++|++-+..|..-.+-.......++++.++.+..+
T Consensus 144 ~~-~~RIlii~~s~~~~~qYi~~mn~Ifaaqk~~I~Idv~ 182 (279)
T TIGR00627 144 KL-KSRILVISITPDMALQYIPLMNCIFSAQKQNIPIDVV 182 (279)
T ss_pred CC-cceEEEEECCCCchHHHHHHHHHHHHHHHcCceEEEE
Confidence 44 6899999999998877777788888889999987643
No 206
>PF03720 UDPG_MGDP_dh_C: UDP-glucose/GDP-mannose dehydrogenase family, UDP binding domain; InterPro: IPR014027 The UDP-glucose/GDP-mannose dehydrogenases are a small group of enzymes which possesses the ability to catalyse the NAD-dependent 2-fold oxidation of an alcohol to an acid without the release of an aldehyde intermediate [, ]. The enzymes have a wide range of functions. In plants UDP-glucose dehydrogenase, 1.1.1.22 from EC, is an important enzyme in the synthesis of hemicellulose and pectin [], which are the components of newly formed cell walls; while in zebrafish UDP-glucose dehydrogenase is required for cardiac valve formation []. In Xanthomonas campestris, a plant pathogen, UDP-glucose dehydrogenase is required for virulence []. GDP-mannose dehydrogenase, 1.1.1.132 from EC, catalyses the formation of GDP-mannuronic acid, which is the monomeric unit from which the exopolysaccharide alginate is formed. Alginate is secreted by a number of bacteria, which include Pseudomonas aeruginosa and Azotobacter vinelandii. In P. aeruginosa, alginate is believed to play an important role in the bacteria's resistance to antibiotics and the host immune response [], while in A. vinelandii it is essential for the encystment process []. This entry represents the C-terminal substrate-binding domain of these enzymes. Structural studies indicate that this domain forms an incomplete dinucleotide binding fold [, ].; GO: 0016616 oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor, 0051287 NAD binding, 0055114 oxidation-reduction process; PDB: 3GG2_D 1DLI_A 1DLJ_A 2Y0E_D 2Y0D_B 2Y0C_D 1MV8_B 1MUU_A 1MFZ_C 3TDK_B ....
Probab=35.42 E-value=40 Score=18.92 Aligned_cols=21 Identities=14% Similarity=0.097 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHhcCCCceeeC
Q psy7765 57 QQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 57 ~~s~~l~~al~~~g~~~~~~~ 77 (77)
+-+..+++.|++.|..+...|
T Consensus 17 Sp~~~l~~~L~~~g~~V~~~D 37 (106)
T PF03720_consen 17 SPALELIEELKERGAEVSVYD 37 (106)
T ss_dssp -HHHHHHHHHHHTT-EEEEE-
T ss_pred CHHHHHHHHHHHCCCEEEEEC
Confidence 357889999999999887764
No 207
>PF06441 EHN: Epoxide hydrolase N terminus; InterPro: IPR010497 This entry represents the N-terminal region of the eukaryotic epoxide hydrolase protein. Epoxide hydrolases (3.3.2.3 from EC) comprise a group of functionally related enzymes that catalyse the addition of water to oxirane compounds (epoxides), thereby usually generating vicinal trans-diols. EHs have been found in all types of living organisms, including mammals, invertebrates, plants, fungi and bacteria. In animals, the major interest in EH is directed towards their detoxification capacity for epoxides since they are important safeguards against the cytotoxic and genotoxic potential of oxirane derivatives that are often reactive electrophiles because of the high tension of the three-membered ring system and the strong polarisation of the C--O bonds. This is of significant relevance because epoxides are frequent intermediary metabolites, which arise during the biotransformation of foreign compounds []. This domain is often found in conjunction with IPR000073 from INTERPRO.; GO: 0004301 epoxide hydrolase activity, 0009636 response to toxin, 0016020 membrane; PDB: 3G0I_B 3G02_A 1QO7_A.
Probab=34.71 E-value=21 Score=20.74 Aligned_cols=10 Identities=40% Similarity=0.654 Sum_probs=6.2
Q ss_pred CCcEEEEEeC
Q psy7765 40 NKTFLLMHGT 49 (77)
Q Consensus 40 ~~p~ll~hG~ 49 (77)
..|++|+||=
T Consensus 92 aiPLll~HGW 101 (112)
T PF06441_consen 92 AIPLLLLHGW 101 (112)
T ss_dssp -EEEEEE--S
T ss_pred CeEEEEECCC
Confidence 5899999994
No 208
>TIGR00511 ribulose_e2b2 ribose-1,5-bisphosphate isomerase, e2b2 family. The delineation of this family was based originally, in part, on a discussion and neighbor-joining phylogenetic study by Kyrpides and Woese of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. Recently, Sato, et al. assigned the function ribulose-1,5 bisphosphate isomerase.
Probab=34.66 E-value=42 Score=22.77 Aligned_cols=25 Identities=20% Similarity=0.073 Sum_probs=20.7
Q ss_pred CccCHHHHHHHHHHHHhcCCCceee
Q psy7765 52 DNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 52 ~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..-|..+...+++.|.+.|+++.++
T Consensus 148 EsrP~~~G~~~a~~L~~~gI~vtlI 172 (301)
T TIGR00511 148 ETRPRKQGHITAKELRDYGIPVTLI 172 (301)
T ss_pred cCCCcchHHHHHHHHHHCCCCEEEE
Confidence 3447778899999999999999875
No 209
>TIGR01205 D_ala_D_alaTIGR D-alanine--D-alanine ligase. but a number of antibiotic resistance proteins score above the trusted cutoff of this model.
Probab=34.64 E-value=63 Score=21.31 Aligned_cols=28 Identities=4% Similarity=-0.107 Sum_probs=13.4
Q ss_pred ccccccChhHHhccCCCCcEEEEEeCCCC
Q psy7765 24 VGYNWTDVTRKVGNFVNKTFLLMHGTADD 52 (77)
Q Consensus 24 ~~y~~~sp~~~~~~~~~~p~ll~hG~~D~ 52 (77)
+.+.....+..+... .-.+.++....|.
T Consensus 16 s~~s~~~i~~al~~~-g~~v~~i~~~~~~ 43 (315)
T TIGR01205 16 SLVSAAAVLKALRDL-GYDVYPVDIDKMG 43 (315)
T ss_pred eHHHHHHHHHHHhhc-CCEEEEEeecCCc
Confidence 344444444455554 3445555555443
No 210
>PF10230 DUF2305: Uncharacterised conserved protein (DUF2305); InterPro: IPR019363 This entry contains proteins that have no known function.
Probab=34.37 E-value=90 Score=20.56 Aligned_cols=42 Identities=10% Similarity=0.071 Sum_probs=30.8
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
-+...... ..++.+..|..|.=||.+.-.++.++....+..+
T Consensus 213 ~~~~~~~~-~~kl~f~fg~~D~Wvp~~~~~~l~~~~~~~~~~~ 254 (266)
T PF10230_consen 213 LIKHHNEN-GDKLWFYFGQNDHWVPNETRDELIERYPGHEPDV 254 (266)
T ss_pred HHHHhccC-CCEEEEEEeCCCCCCCHHHHHHHHHHcCCCCCeE
Confidence 33333333 5899999999999999999999988776443333
No 211
>PRK08334 translation initiation factor IF-2B subunit beta; Validated
Probab=34.31 E-value=60 Score=22.87 Aligned_cols=27 Identities=19% Similarity=0.008 Sum_probs=22.7
Q ss_pred CCCccCHHHHHHH-HHHHHhcCCCceee
Q psy7765 50 ADDNVHYQQSMML-AKALEAADIFFYQQ 76 (77)
Q Consensus 50 ~D~~Vp~~~s~~l-~~al~~~g~~~~~~ 76 (77)
-|..-|..|..+| +..|.+.|+++.++
T Consensus 198 v~EtRP~~qG~rlta~eL~~~GI~vtlI 225 (356)
T PRK08334 198 VDETRPVLQGARLSAWEYHYDGIPLKLI 225 (356)
T ss_pred ECCCCchhhHHHHHHHHHHHCCCCEEEE
Confidence 4577899999988 57899999999875
No 212
>PLN02825 amino-acid N-acetyltransferase
Probab=33.73 E-value=74 Score=23.50 Aligned_cols=33 Identities=18% Similarity=0.171 Sum_probs=25.8
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
+... ..++.|+||. .-++-+.|++.|++.++++
T Consensus 44 L~~l-Gi~~VlVHGg---------gpqI~~~l~~~gi~~~f~~ 76 (515)
T PLN02825 44 LHGL-GIKFVLVPGT---------HVQIDKLLAERGREPKYVG 76 (515)
T ss_pred HHHC-CCCEEEEcCC---------CHHHHHHHHHcCCCceeeC
Confidence 4456 5899999997 4567788889999988864
No 213
>PF02698 DUF218: DUF218 domain; InterPro: IPR003848 This domain of unknown function is found in several uncharacterised proteins.; PDB: 3CA8_A.
Probab=33.67 E-value=69 Score=18.82 Aligned_cols=34 Identities=9% Similarity=-0.032 Sum_probs=19.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
.|.+++-|.....-...++..+.+.|.++|++.+
T Consensus 38 ~~~il~SGg~~~~~~~~ea~~~~~~l~~~gvp~~ 71 (155)
T PF02698_consen 38 APRILFSGGYGHGDGRSEAEAMRDYLIELGVPEE 71 (155)
T ss_dssp T--EEEE--SSTTHTS-HHHHHHHHHHHT---GG
T ss_pred CCeEEECCCCCCCCCCCHHHHHHHHHHhcccchh
Confidence 5667777755554477788888999998888754
No 214
>PRK09912 L-glyceraldehyde 3-phosphate reductase; Provisional
Probab=33.65 E-value=84 Score=21.45 Aligned_cols=31 Identities=10% Similarity=0.093 Sum_probs=25.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
..-++++|.- |..+|...+.+..+.|++.|+
T Consensus 131 ~iDl~~lH~~-~~~~~~~e~~~al~~l~~~Gk 161 (346)
T PRK09912 131 YVDIFYSHRV-DENTPMEETASALAHAVQSGK 161 (346)
T ss_pred cEEEEEeCCC-CCCCCHHHHHHHHHHHHHcCC
Confidence 4689999985 667888888888888898886
No 215
>cd01786 STE50_RA Ubiquitin-like domain of STE50_RA. STE50_RA The fungal adaptor protein STE50 is an essential component of three MAPK-mediated signalling pathways, which control the mating response, invasive/filamentous growth and osmotolerance (HOG pathway), respectively. STE50 functions in cell signalling between the activated G protein and STE11. The domain architecture of STE50 includes an amino-terminal SAM (sterile alpha motif) domain in addition to the carboxy-terminal ubiquitin-like RA (RAS-associated) domain. While the SAM domain interacts with STE11, the RA domain interacts with CDC42 and RAS. Modulation of signal transduction by STE50 specifically affects the pheromone-response pathway in yeast.
Probab=33.04 E-value=75 Score=18.25 Aligned_cols=34 Identities=9% Similarity=0.214 Sum_probs=26.6
Q ss_pred CcEEEEEeCCCCccCHHH-HHHHHHHHHhcCCCce
Q psy7765 41 KTFLLMHGTADDNVHYQQ-SMMLAKALEAADIFFY 74 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~-s~~l~~al~~~g~~~~ 74 (77)
--++|..|+...+.+..+ -..+.+.|++.|..--
T Consensus 58 Y~L~I~YgdqER~L~ldEKPl~lFk~L~~~g~~P~ 92 (98)
T cd01786 58 YVLVICYGDQERILDLDEKPVIIFKNLKQQGLHPA 92 (98)
T ss_pred eEEEEEeCCeeeeccccccHHHHHHHHHHcCCCCe
Confidence 357889999888888754 7778999999887543
No 216
>PRK06371 translation initiation factor IF-2B subunit alpha; Provisional
Probab=32.87 E-value=58 Score=22.65 Aligned_cols=26 Identities=23% Similarity=0.032 Sum_probs=21.7
Q ss_pred CCccCHHHHHHH-HHHHHhcCCCceee
Q psy7765 51 DDNVHYQQSMML-AKALEAADIFFYQQ 76 (77)
Q Consensus 51 D~~Vp~~~s~~l-~~al~~~g~~~~~~ 76 (77)
+..-|..|..+| +..|.+.|+++.++
T Consensus 176 ~EsRP~~qG~rlta~eL~~~GI~vtlI 202 (329)
T PRK06371 176 DETRPRLQGARLTAWELAQEGIDHAII 202 (329)
T ss_pred CCCCCcchHHHHHHHHHHHCCCCEEEE
Confidence 467788888875 89999999999875
No 217
>cd04237 AAK_NAGS-ABP AAK_NAGS-ABP: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the arginine-biosynthesis pathway (ABP) found in gamma- and beta-proteobacteria and higher plant chloroplasts. Domain architecture of these NAGS consisted of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal NAG synthase, acetyltransferase (ArgA) domain. Both bacterial and plant sequences in this CD have a conserved N-terminal extension; a similar sequence in the NAG kinases of the cyclic arginine-biosynthesis pathway has been implicated in feedback inhibition sensing. Plant sequences also have an N-terminal chloroplast transit peptide and an insert (approx. 70 residues) in the C-terminal region of ArgB. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK).
Probab=32.81 E-value=80 Score=21.08 Aligned_cols=32 Identities=13% Similarity=0.211 Sum_probs=21.8
Q ss_pred hccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
+... ..++.|+||. ..++-+.+.+.|++.+++
T Consensus 45 l~~~-g~~~ViVHGg---------gp~i~~~~~~~gi~~~~~ 76 (280)
T cd04237 45 LHSL-GIRLVLVHGA---------RPQIDQRLAERGLEPRYH 76 (280)
T ss_pred HHHC-CCcEEEEeCC---------CHHHHHHHHHcCCCcccc
Confidence 3344 4689999997 455566667777776654
No 218
>PRK08057 cobalt-precorrin-6x reductase; Reviewed
Probab=32.68 E-value=60 Score=21.47 Aligned_cols=28 Identities=36% Similarity=0.440 Sum_probs=22.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
...+|++-||. ++.++++.|.+.|.++-
T Consensus 2 ~~~IlvlgGT~-------egr~la~~L~~~g~~v~ 29 (248)
T PRK08057 2 MPRILLLGGTS-------EARALARALAAAGVDIV 29 (248)
T ss_pred CceEEEEechH-------HHHHHHHHHHhCCCeEE
Confidence 35688889986 58899999988887653
No 219
>PF09587 PGA_cap: Bacterial capsule synthesis protein PGA_cap; InterPro: IPR019079 CapA is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein [].
Probab=32.21 E-value=77 Score=20.48 Aligned_cols=33 Identities=21% Similarity=0.118 Sum_probs=25.2
Q ss_pred CcEEEEEe-CCCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 41 KTFLLMHG-TADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 41 ~p~ll~hG-~~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
.-++.+|. .....-|....++++++|.++|+++
T Consensus 186 ~vIv~~HwG~e~~~~p~~~q~~~a~~lidaGaDi 219 (250)
T PF09587_consen 186 VVIVSLHWGIEYENYPTPEQRELARALIDAGADI 219 (250)
T ss_pred EEEEEeccCCCCCCCCCHHHHHHHHHHHHcCCCE
Confidence 44555554 4677778888899999999999765
No 220
>TIGR00583 mre11 DNA repair protein (mre11). All proteins in this family for which functions are known are subunits of a nuclease complex made up of multiple proteins including MRE11 and RAD50 homologs. The functions of this nuclease complex include recombinational repair and non-homolgous end joining. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). The proteins in this family are distantly related to proteins in the SbcCD complex of bacteria.
Probab=32.16 E-value=32 Score=24.46 Aligned_cols=15 Identities=27% Similarity=0.445 Sum_probs=13.1
Q ss_pred CCcEEEEEeCCCCcc
Q psy7765 40 NKTFLLMHGTADDNV 54 (77)
Q Consensus 40 ~~p~ll~hG~~D~~V 54 (77)
..|+|+|||..|.-.
T Consensus 110 ~iPVf~I~GNHD~p~ 124 (405)
T TIGR00583 110 AIPVFSIHGNHDDPS 124 (405)
T ss_pred CCCEEEEcCCCCCcc
Confidence 589999999999854
No 221
>PF03358 FMN_red: NADPH-dependent FMN reductase; InterPro: IPR005025 NADPH-dependent FMN reductase (1.5.1.29 from EC) reduces FMN and also reduces riboflavin and FAD, although more slowly. Members of this entry catalyse the reaction NAD(P)H + FMN = NAD(P)(+) + FMNH(2).; PDB: 3SVL_B 3GFS_F 3GFQ_A 1NNI_1 2GSW_B 3GFR_D 1T0I_B 3D7N_A 2R97_A 3B6K_A ....
Probab=32.06 E-value=70 Score=18.65 Aligned_cols=35 Identities=20% Similarity=0.243 Sum_probs=19.8
Q ss_pred cEEEEEeCCCCccC-HHHHHHHHHHHHhcCCCceee
Q psy7765 42 TFLLMHGTADDNVH-YQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 42 p~ll~hG~~D~~Vp-~~~s~~l~~al~~~g~~~~~~ 76 (77)
.++++.|..-..-. -.-+..+.+.+.+.|..++++
T Consensus 2 kilii~gS~r~~~~t~~l~~~~~~~l~~~g~e~~~i 37 (152)
T PF03358_consen 2 KILIINGSPRKNSNTRKLAEAVAEQLEEAGAEVEVI 37 (152)
T ss_dssp EEEEEESSSSTTSHHHHHHHHHHHHHHHTTEEEEEE
T ss_pred EEEEEECcCCCCCHHHHHHHHHHHHHHHcCCEEEEE
Confidence 36778887653322 223444555566667776654
No 222
>PRK04531 acetylglutamate kinase; Provisional
Probab=32.05 E-value=98 Score=22.00 Aligned_cols=35 Identities=23% Similarity=0.207 Sum_probs=25.3
Q ss_pred HHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 33 RKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 33 ~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
..+... ..+..|+||.. -++-+.|.+.|++.++++
T Consensus 60 a~L~~~-G~~~VlVHGgg---------pqI~~~l~~~gie~~~v~ 94 (398)
T PRK04531 60 SFLQEV-GLTPIVVHGAG---------PQLDAELDAAGIEKETVN 94 (398)
T ss_pred HHHHHC-CCcEEEEECCC---------HHHHHHHHHcCCCcEEEC
Confidence 344555 57889999975 444577888899888764
No 223
>PF07485 DUF1529: Domain of Unknown Function (DUF1259); InterPro: IPR011094 This family is the lppY/lpqO homologue family. They are related to 'probable conserved lipoproteins' LppY and LpqO from Mycobacterium bovis.
Probab=31.83 E-value=57 Score=19.41 Aligned_cols=26 Identities=15% Similarity=-0.022 Sum_probs=21.7
Q ss_pred CCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 50 ADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 50 ~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
.|..+-.++...+.++|+++|+.+.-
T Consensus 61 Gd~vll~~EV~pvi~aL~~~GI~vtA 86 (123)
T PF07485_consen 61 GDFVLLEDEVNPVISALRKNGIEVTA 86 (123)
T ss_pred ecEEecHHHHHHHHHHHHHCCceEEE
Confidence 45677888899999999999998753
No 224
>PF00462 Glutaredoxin: Glutaredoxin; InterPro: IPR002109 Glutaredoxins [, , ], also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system []. Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond []. It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed [] that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This entry represents Glutaredoxin.; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity, 0045454 cell redox homeostasis; PDB: 1QFN_A 1GRX_A 1EGO_A 1EGR_A 3RHC_A 3RHB_A 3IPZ_A 1NHO_A 3GX8_A 3D5J_A ....
Probab=31.29 E-value=48 Score=16.19 Aligned_cols=20 Identities=15% Similarity=0.115 Sum_probs=14.9
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
.+...-+.|++.|++++.+|
T Consensus 11 ~C~~~~~~L~~~~i~y~~~d 30 (60)
T PF00462_consen 11 YCKKAKEFLDEKGIPYEEVD 30 (60)
T ss_dssp HHHHHHHHHHHTTBEEEEEE
T ss_pred CHHHHHHHHHHcCCeeeEcc
Confidence 56667777888888887653
No 225
>cd03027 GRX_DEP Glutaredoxin (GRX) family, Dishevelled, Egl-10, and Pleckstrin (DEP) subfamily; composed of uncharacterized proteins containing a GRX domain and additional domains DEP and DUF547, both of which have unknown functions. GRX is a glutathione (GSH) dependent reductase containing a redox active CXXC motif in a TRX fold. It has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. By altering the redox state of target proteins, GRX is involved in many cellular functions.
Probab=31.25 E-value=67 Score=16.36 Aligned_cols=20 Identities=5% Similarity=0.095 Sum_probs=15.5
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
.+....+.|.+.|++++.+|
T Consensus 13 ~C~ka~~~L~~~gi~~~~~d 32 (73)
T cd03027 13 DCTAVRLFLREKGLPYVEIN 32 (73)
T ss_pred hHHHHHHHHHHCCCceEEEE
Confidence 46677777889999988764
No 226
>KOG4388|consensus
Probab=31.23 E-value=1.1e+02 Score=23.69 Aligned_cols=34 Identities=15% Similarity=0.098 Sum_probs=29.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
.+|+.|+.-.-|+... .+..|++.|++.|.++.+
T Consensus 787 LPp~~i~ac~mDP~LD--D~vmfA~kLr~lG~~v~l 820 (880)
T KOG4388|consen 787 LPPVHIVACAMDPMLD--DSVMFARKLRNLGQPVTL 820 (880)
T ss_pred CCCceEEEeccCcchh--HHHHHHHHHHhcCCceee
Confidence 4899999998898876 899999999999987764
No 227
>PRK10329 glutaredoxin-like protein; Provisional
Probab=31.14 E-value=66 Score=17.22 Aligned_cols=20 Identities=20% Similarity=0.190 Sum_probs=14.9
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
++.+.-+.|.+.|++++.+|
T Consensus 13 ~C~~ak~~L~~~gI~~~~id 32 (81)
T PRK10329 13 QCHATKRAMESRGFDFEMIN 32 (81)
T ss_pred hHHHHHHHHHHCCCceEEEE
Confidence 45566777788999988764
No 228
>COG2908 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.13 E-value=83 Score=20.98 Aligned_cols=9 Identities=56% Similarity=0.899 Sum_probs=5.5
Q ss_pred CCcEEEEEe
Q psy7765 40 NKTFLLMHG 48 (77)
Q Consensus 40 ~~p~ll~hG 48 (77)
..+++++||
T Consensus 105 g~~~Ll~HG 113 (237)
T COG2908 105 GKRILLAHG 113 (237)
T ss_pred CcEEEEEeC
Confidence 456666666
No 229
>TIGR02194 GlrX_NrdH Glutaredoxin-like protein NrdH. NrdH-redoxin is a representative of a class of small redox proteins that contain a conserved CXXC motif and are characterized by a glutaredoxin-like amino acid sequence and thioredoxin-like activity profile. Unlike other the glutaredoxins to which it is most closely related, NrdH aparrently does not interact with glutathione/glutathione reductase, but rather with thioredoxin reductase to catalyze the reduction of ribonucleotide reductase.
Probab=30.95 E-value=70 Score=16.32 Aligned_cols=20 Identities=40% Similarity=0.409 Sum_probs=14.6
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
.+...-+.|.+.|++++.+|
T Consensus 11 ~C~~ak~~L~~~~i~~~~~d 30 (72)
T TIGR02194 11 QCKMTKKALEEHGIAFEEIN 30 (72)
T ss_pred HHHHHHHHHHHCCCceEEEE
Confidence 45566677888899888754
No 230
>cd06259 YdcF-like YdcF-like. YdcF-like is a large family of mainly bacterial proteins, with a few members found in fungi, plants, and archaea. Escherichia coli YdcF has been shown to bind S-adenosyl-L-methionine (AdoMet), but a biochemical function has not been idenitified. The family also includes Escherichia coli sanA and Salmonella typhimurium sfiX, which are involved in vancomycin resistance; sfiX may also be involved in murein synthesis.
Probab=30.95 E-value=98 Score=18.03 Aligned_cols=34 Identities=15% Similarity=0.164 Sum_probs=26.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
..+.+|+-|..-..-...++..+.+.|.+.|++.
T Consensus 34 ~~~~ii~sGg~~~~~~~~ea~~m~~~l~~~gv~~ 67 (150)
T cd06259 34 PAPKLIVSGGQGPGEGYSEAEAMARYLIELGVPA 67 (150)
T ss_pred CCCEEEEcCCCCCCCCCCHHHHHHHHHHHcCCCH
Confidence 4677777776655556778999999999999765
No 231
>PF01008 IF-2B: Initiation factor 2 subunit family; InterPro: IPR000649 Initiation factor 2 binds to Met-tRNA, GTP and the small ribosomal subunit. The eukaryotic translation initiation factor EIF-2B is a complex made up of five different subunits, alpha, beta, gamma, delta and epsilon, and catalyses the exchange of EIF-2-bound GDP for GTP. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes; related proteins from archaebacteria and IF-2 from prokaryotes and also contains a subfamily of proteins in eukaryotes, archaeae (e.g. Pyrococcus furiosus), or eubacteria such as Bacillus subtilis and Thermotoga maritima. Many of these proteins were initially annotated as putative translation initiation factors despite the fact that there is no evidence for the requirement of an IF2 recycling factor in prokaryotic translation initiation. Recently, one of these proteins from B. subtilis has been functionally characterised as a 5-methylthioribose-1-phosphate isomerase (MTNA) []. This enzyme participates in the methionine salvage pathway catalysing the isomerisation of 5-methylthioribose-1-phosphate to 5-methylthioribulose-1-phosphate []. The methionine salvage pathway leads to the synthesis of methionine from methylthioadenosine, the end product of the spermidine and spermine anabolism in many species.; GO: 0044237 cellular metabolic process; PDB: 1VB5_A 1T5O_D 3A11_E 3VM6_C 1W2W_A 1T9K_A 3ECS_B 2YRF_A 2YVK_B 2A0U_A ....
Probab=30.79 E-value=31 Score=22.69 Aligned_cols=24 Identities=25% Similarity=0.270 Sum_probs=18.0
Q ss_pred ccCHHHHHHHHHHHHhcCCCceee
Q psy7765 53 NVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 53 ~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.-|..+...+++.|.+.|++++++
T Consensus 141 s~P~~eG~~~a~~L~~~gi~v~~i 164 (282)
T PF01008_consen 141 SRPYNEGRLMAKELAEAGIPVTLI 164 (282)
T ss_dssp -TTTTHHHTHHHHHHHTT-EEEEE
T ss_pred CCcchhhhhHHHHhhhcceeEEEE
Confidence 345667788999999999998875
No 232
>COG3618 Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]
Probab=30.75 E-value=67 Score=21.92 Aligned_cols=36 Identities=14% Similarity=-0.011 Sum_probs=29.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
......|+..|...--..-.+.+++|.+.|..|++|
T Consensus 108 GvR~~l~~~p~~~~~a~~~r~~~~rL~~~gl~fdl~ 143 (279)
T COG3618 108 GVRRNLHVVPDGLFEAPAWRANVERLAKLGLHFDLQ 143 (279)
T ss_pred eeeehhhcCCccchhhHHHHHHHHHHHhcCCeEEEE
Confidence 344445999999988888889999999999999886
No 233
>COG1535 EntB Isochorismate hydrolase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=30.48 E-value=75 Score=20.70 Aligned_cols=36 Identities=19% Similarity=0.119 Sum_probs=25.5
Q ss_pred CCcEEEEEeCCCCccC------------HHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVH------------YQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp------------~~~s~~l~~al~~~g~~~~~ 75 (77)
++++||||-.++--|. +.+..+|-.-.+++|+++-+
T Consensus 29 ~RavLLIhDMQ~YFv~~~~~~~~~~~~li~Ni~~Lr~~~~~~giPVvy 76 (218)
T COG1535 29 KRAVLLIHDMQNYFVSPWGENCPLMEQLIANIAKLRIWCKQAGIPVVY 76 (218)
T ss_pred ccceeeeehhHHhhcCCCCCCCccHHHHHHHHHHHHHHHHHcCCcEEE
Confidence 6899999988876663 23445566666788888754
No 234
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=30.45 E-value=52 Score=13.65 Aligned_cols=17 Identities=6% Similarity=0.128 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHhcCCC
Q psy7765 56 YQQSMMLAKALEAADIF 72 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~ 72 (77)
+..+.++.+.+++.|+.
T Consensus 17 ~~~a~~~~~~M~~~gv~ 33 (34)
T PF13812_consen 17 PDAALQLFDEMKEQGVK 33 (34)
T ss_pred HHHHHHHHHHHHHhCCC
Confidence 55678888888888764
No 235
>PRK13398 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=30.29 E-value=1.4e+02 Score=19.93 Aligned_cols=32 Identities=13% Similarity=0.194 Sum_probs=25.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
.+++++-| -+..=..++..+++++|++.|+++
T Consensus 26 ~~~~~iaG-PCsie~~~~~~~~A~~lk~~g~~~ 57 (266)
T PRK13398 26 EEKIIIAG-PCAVESEEQMVKVAEKLKELGVHM 57 (266)
T ss_pred CCEEEEEe-CCcCCCHHHHHHHHHHHHHcCCCE
Confidence 47888888 455557899999999999998764
No 236
>COG0816 Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) [DNA replication, recombination, and repair]
Probab=30.05 E-value=64 Score=19.65 Aligned_cols=21 Identities=14% Similarity=0.127 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHhc-CCCceeeC
Q psy7765 57 QQSMMLAKALEAA-DIFFYQQD 77 (77)
Q Consensus 57 ~~s~~l~~al~~~-g~~~~~~~ 77 (77)
+.+..|++.|+++ ++++.++|
T Consensus 74 ~~~~~f~~~L~~r~~lpv~l~D 95 (141)
T COG0816 74 ELARKFAERLKKRFNLPVVLWD 95 (141)
T ss_pred HHHHHHHHHHHHhcCCCEEEEc
Confidence 3488899998876 68888775
No 237
>TIGR01293 Kv_beta voltage-dependent potassium channel beta subunit, animal. Plant beta subunits and their closely related bacterial homologs (in Deinococcus radiudurans, Xylella fastidiosa, etc.) appear more closely related to each other than to animal forms. However, the bacterial species lack convincing counterparts the Kv alpha subunit and the Kv beta homolog may serve as an enzyme. Cutoffs are set for this model such that yeast and plant forms and bacterial close homologs score between trusted and noise cutoffs.
Probab=29.64 E-value=1.6e+02 Score=19.76 Aligned_cols=31 Identities=6% Similarity=0.108 Sum_probs=25.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
...++++|.- |..+|...+.+-.+.|++.|+
T Consensus 113 ~iDl~~lH~~-~~~~~~~e~~~aL~~l~~~G~ 143 (317)
T TIGR01293 113 YVDIVFANRP-DPNTPMEETVRAMTYVINQGM 143 (317)
T ss_pred cEeEEEeccC-CCCCCHHHHHHHHHHHHHcCC
Confidence 4789999995 567888888888888888876
No 238
>COG1447 CelC Phosphotransferase system cellobiose-specific component IIA [Carbohydrate transport and metabolism]
Probab=29.53 E-value=55 Score=19.08 Aligned_cols=27 Identities=26% Similarity=0.269 Sum_probs=17.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALE 67 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~ 67 (77)
..+++++|| +|......-...+++.+.
T Consensus 70 ~~slllvHA-QDHLMtt~~~~~l~ke~i 96 (105)
T COG1447 70 EVSLLLVHA-QDHLMTTITEKDLIKELI 96 (105)
T ss_pred ceeeeeeeh-hHHHHHHHHHHHHHHHHH
Confidence 578999999 676665544555555443
No 239
>cd03415 CbiX_CbiC Archaeal sirohydrochlorin cobalt chelatase (CbiX) single domain. Proteins in this subgroup contain a single CbiX domain N-terminal to a precorrin-8X methylmutase (CbiC) domain. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, while CbiC catalyzes the conversion of cobalt-precorrin 8 to cobyrinic acid by methyl rearrangement. Both CbiX and CbiC are involved in vitamin B12 biosynthesis.
Probab=29.52 E-value=1.1e+02 Score=18.11 Aligned_cols=25 Identities=20% Similarity=0.355 Sum_probs=16.3
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
+++-||..|..-. +...++++.+++
T Consensus 4 llvgHGSR~~~~~-~~~~~la~~l~~ 28 (125)
T cd03415 4 IIITHGSRRNTFN-EDMEEWAAYLER 28 (125)
T ss_pred EEEecCCCChHHH-HHHHHHHHHHHh
Confidence 6778888887532 445566777654
No 240
>KOG1466|consensus
Probab=29.35 E-value=45 Score=22.91 Aligned_cols=22 Identities=18% Similarity=0.070 Sum_probs=18.3
Q ss_pred CHHHHHHHHHHHHhcCCCceee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~~ 76 (77)
|-.+...|++.|+++|+|+.++
T Consensus 166 Pd~sG~lm~~~L~~~~IPvtlv 187 (313)
T KOG1466|consen 166 PDGSGKLMAKELKKLGIPVTLV 187 (313)
T ss_pred CCCchhHHHHHHHhcCCCeEEE
Confidence 4456889999999999999875
No 241
>PF10882 bPH_5: Bacterial PH domain; InterPro: IPR020482 This entry contains membrane proteins with no known function.
Probab=29.30 E-value=74 Score=17.41 Aligned_cols=30 Identities=17% Similarity=0.250 Sum_probs=22.6
Q ss_pred CCcEEEEEeCCCC-ccCHHHHHHHHHHHHhc
Q psy7765 40 NKTFLLMHGTADD-NVHYQQSMMLAKALEAA 69 (77)
Q Consensus 40 ~~p~ll~hG~~D~-~Vp~~~s~~l~~al~~~ 69 (77)
..++++++..+.. .+.+++..+|+++|+++
T Consensus 69 ~~~~i~I~t~~~~y~isp~~~~~fi~~l~~r 99 (100)
T PF10882_consen 69 NKNVILIKTKDKTYVISPEDPEEFIEALKKR 99 (100)
T ss_pred CCCEEEEEECCceEEEcCCCHHHHHHHHHhc
Confidence 4788888876533 33788899999999876
No 242
>PLN03194 putative disease resistance protein; Provisional
Probab=29.15 E-value=1.1e+02 Score=19.73 Aligned_cols=34 Identities=12% Similarity=0.121 Sum_probs=27.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
..-+||-|-..|..-. -+..|+++|+++|+.+-+
T Consensus 26 ~yDVFISFrG~DtR~~--FvshL~~aL~~~GI~vF~ 59 (187)
T PLN03194 26 PCDVFINHRGIDTKRT--IATLLYDHLSRLNLRPFL 59 (187)
T ss_pred CCcEEEeCCCcccccc--HHHHHHHHHHHCCCEEEE
Confidence 5789999988887654 457899999999987643
No 243
>cd07422 MPP_ApaH Escherichia coli ApaH and related proteins, metallophosphatase domain. ApaH (also known as symmetrically cleaving Ap4A hydrolase and bis(5'nucleosyl)-tetraphosphatase) is a bacterial member of the PPP (phosphoprotein phosphatase) family of serine/threonine phosphatases that hydrolyzes the nucleotide-signaling molecule diadenosine tetraphosphate (Ap(4)A) into two ADP and also hydrolyzes Ap(5)A, Gp(4)G, and other extending compounds. Null mutations in apaH result in high intracellular levels of Ap(4)A which correlate with multiple phenotypes, including a decreased expression of catabolite-repressible genes, a reduction in the expression of flagellar operons, and an increased sensitivity to UV and heat. Ap4A hydrolase is important in responding to heat shock and oxidative stress via regulating the concentration of Ap4A in bacteria. Ap4A hydrolase is also thought to play a role in siderophore production, but the mechanism by which ApaH interacts with siderophore pathwa
Probab=28.67 E-value=1.1e+02 Score=20.25 Aligned_cols=28 Identities=25% Similarity=0.363 Sum_probs=23.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
..++++||.-++..+..++..+++.+.+
T Consensus 112 ~~~l~vHAGi~p~w~~~~~~~~a~eve~ 139 (257)
T cd07422 112 LGILMVHAGIPPQWSIEQALKLAREVEA 139 (257)
T ss_pred ccEEEEccCCCCCCCHHHHHHHHHHHHH
Confidence 3689999999999999998888877655
No 244
>PRK00166 apaH diadenosine tetraphosphatase; Reviewed
Probab=28.43 E-value=1.1e+02 Score=20.55 Aligned_cols=28 Identities=18% Similarity=0.318 Sum_probs=22.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
..++++||.-++.....++..+++.+.+
T Consensus 114 ~~~l~vHAGi~p~~~~~~~~~~a~eve~ 141 (275)
T PRK00166 114 LGLVMVHAGIPPQWDLATALALAREVEA 141 (275)
T ss_pred CCEEEEccCCCCCCCHHHHHHHHHHHHH
Confidence 4578999999999988888877776544
No 245
>COG2818 Tag 3-methyladenine DNA glycosylase [DNA replication, recombination, and repair]
Probab=28.19 E-value=50 Score=21.23 Aligned_cols=18 Identities=28% Similarity=0.283 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHhcCCCc
Q psy7765 56 YQQSMMLAKALEAADIFF 73 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~ 73 (77)
..-|..|+++|+++|..+
T Consensus 140 t~~S~~mskaLKkrGf~f 157 (188)
T COG2818 140 TELSDAMSKALKKRGFKF 157 (188)
T ss_pred chhHHHHHHHHHHccCee
Confidence 456899999999999644
No 246
>TIGR03101 hydr2_PEP hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
Probab=28.13 E-value=1.3e+02 Score=20.06 Aligned_cols=35 Identities=14% Similarity=0.126 Sum_probs=22.5
Q ss_pred CcEEEEEeCCCCccC-HHHHHHHHHHHHhcCCCcee
Q psy7765 41 KTFLLMHGTADDNVH-YQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp-~~~s~~l~~al~~~g~~~~~ 75 (77)
+.++++||-....-. ......+++.|.++|..+-.
T Consensus 26 ~~VlllHG~g~~~~~~~~~~~~la~~La~~Gy~Vl~ 61 (266)
T TIGR03101 26 GVVIYLPPFAEEMNKSRRMVALQARAFAAGGFGVLQ 61 (266)
T ss_pred eEEEEECCCcccccchhHHHHHHHHHHHHCCCEEEE
Confidence 568899997654332 23345578888888865533
No 247
>cd04885 ACT_ThrD-I Tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes each of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=28.13 E-value=88 Score=15.87 Aligned_cols=31 Identities=19% Similarity=0.143 Sum_probs=23.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
..++++.--+.| ..+..++.++|+++|..++
T Consensus 37 ~~~v~v~ie~~~----~~~~~~i~~~L~~~G~~~~ 67 (68)
T cd04885 37 EARVLVGIQVPD----REDLAELKERLEALGYPYV 67 (68)
T ss_pred ceEEEEEEEeCC----HHHHHHHHHHHHHcCCCcc
Confidence 466776665554 4899999999999997653
No 248
>PF13823 ADH_N_assoc: Alcohol dehydrogenase GroES-associated; PDB: 2DPH_B.
Probab=28.07 E-value=28 Score=14.64 Aligned_cols=12 Identities=33% Similarity=0.526 Sum_probs=8.8
Q ss_pred EEEEEeCCCCcc
Q psy7765 43 FLLMHGTADDNV 54 (77)
Q Consensus 43 ~ll~hG~~D~~V 54 (77)
-+.+||..|-.|
T Consensus 3 Av~y~G~~~v~v 14 (23)
T PF13823_consen 3 AVVYHGPKDVRV 14 (23)
T ss_dssp EEEEEETTEEEE
T ss_pred ceEEeCCCceEE
Confidence 367899887655
No 249
>COG3208 GrsT Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=28.05 E-value=1.6e+02 Score=19.75 Aligned_cols=29 Identities=10% Similarity=0.046 Sum_probs=23.8
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
.+ ..|+..+-|++|..|..++.....+..
T Consensus 174 pl-~~pi~~~~G~~D~~vs~~~~~~W~~~t 202 (244)
T COG3208 174 PL-ACPIHAFGGEKDHEVSRDELGAWREHT 202 (244)
T ss_pred Cc-CcceEEeccCcchhccHHHHHHHHHhh
Confidence 45 689999999999999988877666543
No 250
>TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family. All members of this protein family are predicted lipoproteins with a conserved Cys near the N-terminus for cleavage and modification, and are part of known or predicted type III secretion systems. Members are found in both plant and animal pathogens, including the obligately intracellular chlamydial species and (non-pathogenic) root nodule bacteria. The most closely related proteins outside this family are examples of the flagellar M-ring protein FliF.
Probab=27.84 E-value=89 Score=19.87 Aligned_cols=21 Identities=10% Similarity=-0.073 Sum_probs=18.6
Q ss_pred CHHHHHHHHHHHHhcCCCcee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~ 75 (77)
..+++-++...|.++|++++.
T Consensus 27 ~~~da~~I~~~L~~~gI~y~~ 47 (193)
T TIGR02544 27 SEREANEMLAVLMRHGIDAEK 47 (193)
T ss_pred CHHHHHHHHHHHHHCCCCeEE
Confidence 467899999999999999986
No 251
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=27.82 E-value=1.2e+02 Score=19.10 Aligned_cols=37 Identities=19% Similarity=0.228 Sum_probs=26.2
Q ss_pred CCcEEEEEeCCCCc----cCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDN----VHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~----Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
...++|+--..|+. +...++...++.|.+.|+.++.+
T Consensus 128 ~k~IvL~TDg~~p~~~~~~~~~~~~~~a~~l~~~gI~i~~i 168 (218)
T cd01458 128 HKRIFLFTNNDDPHGGDSIKDSQAAVKAEDLKDKGIELELF 168 (218)
T ss_pred ccEEEEECCCCCCCCCCHHHHHHHHHHHHHHHhCCcEEEEE
Confidence 45666666555543 55677888999999999887654
No 252
>PF11950 DUF3467: Protein of unknown function (DUF3467); InterPro: IPR021857 This entry is represented by Bacteriophage 92, 0rf53. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family of proteins are functionally uncharacterised. This protein is found in bacteria, archaea and viruses. Proteins in this family are typically between 101 to 118 amino acids in length.
Probab=27.76 E-value=59 Score=18.10 Aligned_cols=18 Identities=28% Similarity=0.152 Sum_probs=14.6
Q ss_pred CccCHHHHHHHHHHHHhc
Q psy7765 52 DNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 52 ~~Vp~~~s~~l~~al~~~ 69 (77)
-..+|.++.+|..+|.++
T Consensus 62 VimsP~~AKrL~~aL~~~ 79 (92)
T PF11950_consen 62 VIMSPQHAKRLLKALQQN 79 (92)
T ss_pred EEeCHHHHHHHHHHHHHH
Confidence 345899999999999764
No 253
>TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase. Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores in the peptidoglycan wall, or spore cortex. The proteins in this family are PdaA (yfjS), a member of a larger family of polysaccharide deacetylases, and are specificially involved in delta-lactam biosynthesis. PdaA acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the following transpeptidation for lactam ring formation, as heterologous expression in E. coli of CwlD and PdaA together is sufficient for delta-lactam production.
Probab=27.73 E-value=1.6e+02 Score=18.86 Aligned_cols=34 Identities=24% Similarity=0.315 Sum_probs=24.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
...++|+|...... .+...++...|+++|-.+--
T Consensus 186 ~g~IiLlHd~~~~t--~~aL~~ii~~lk~~Gy~fvt 219 (224)
T TIGR02884 186 PGAILLLHAVSKDN--AEALDKIIKDLKEQGYTFKS 219 (224)
T ss_pred CCcEEEEECCCCCH--HHHHHHHHHHHHHCCCEEEE
Confidence 46899999754433 45667799999999976643
No 254
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=27.70 E-value=1.7e+02 Score=18.58 Aligned_cols=34 Identities=15% Similarity=0.109 Sum_probs=24.8
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 42 TFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 42 p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.++++-| .|...+.......+++|++.|+.+..+
T Consensus 109 rivi~v~-S~~~~d~~~i~~~~~~lkk~~I~v~vI 142 (187)
T cd01452 109 RIVAFVG-SPIEEDEKDLVKLAKRLKKNNVSVDII 142 (187)
T ss_pred eEEEEEe-cCCcCCHHHHHHHHHHHHHcCCeEEEE
Confidence 5667766 344457778888999999999877543
No 255
>cd03418 GRX_GRXb_1_3_like Glutaredoxin (GRX) family, GRX bacterial class 1 and 3 (b_1_3)-like subfamily; composed of bacterial GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known i
Probab=27.53 E-value=83 Score=15.81 Aligned_cols=20 Identities=10% Similarity=0.112 Sum_probs=14.3
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
.+....+.|++.|++++.++
T Consensus 12 ~C~~ak~~L~~~~i~~~~i~ 31 (75)
T cd03418 12 YCVRAKALLDKKGVDYEEID 31 (75)
T ss_pred HHHHHHHHHHHCCCcEEEEE
Confidence 45666677788898887653
No 256
>PRK06036 translation initiation factor IF-2B subunit alpha; Provisional
Probab=27.49 E-value=93 Score=21.72 Aligned_cols=42 Identities=19% Similarity=0.075 Sum_probs=28.1
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHH-HHHHHhcCCCceee
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMML-AKALEAADIFFYQQ 76 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l-~~al~~~g~~~~~~ 76 (77)
.+..+... ...+-++ -+..-|..|..++ +..|.+.|+++.++
T Consensus 171 ~l~~A~~~-gk~~~V~---v~EtRP~~qG~rlta~eL~~~GI~vtlI 213 (339)
T PRK06036 171 VIRSAVEQ-GKEIKVI---ACETRPLNQGSRLTTWELMQDNIPVTLI 213 (339)
T ss_pred HHHHHHHc-CCceEEE---EcCCCchhhHHHHHHHHHHHcCCCEEEE
Confidence 34444333 3444444 3467788888876 68999999999875
No 257
>COG0124 HisS Histidyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]
Probab=27.45 E-value=1.5e+02 Score=21.40 Aligned_cols=33 Identities=21% Similarity=0.155 Sum_probs=27.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
.++++++.-..+. ...+..+++.|+++|+.+++
T Consensus 335 ~~~v~v~~~~~~~---~~~a~~la~~LR~~g~~~~~ 367 (429)
T COG0124 335 RVDVYVVPLGEDA---EPEALKLAQKLRAAGISVEV 367 (429)
T ss_pred CCCEEEEEcCchh---HHHHHHHHHHHHHcCCcEEE
Confidence 4778888776666 47899999999999998875
No 258
>PRK09061 D-glutamate deacylase; Validated
Probab=27.43 E-value=2e+02 Score=20.94 Aligned_cols=35 Identities=9% Similarity=0.094 Sum_probs=28.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
..|+.|.|-.....-+...+.++.+++++.|+++.
T Consensus 245 G~rv~IsHlss~g~~~~~~~le~I~~Ar~~Gi~Vt 279 (509)
T PRK09061 245 GAHMHICHVNSTSLRDIDRCLALVEKAQAQGLDVT 279 (509)
T ss_pred CCCEEEEeeccCCcccHHHHHHHHHHHHHcCCcEE
Confidence 57888888887776677888888888888888774
No 259
>PF01583 APS_kinase: Adenylylsulphate kinase; InterPro: IPR002891 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []: Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins) Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. This domain contains an ATP binding P-loop motif [].; GO: 0005524 ATP binding, 0016301 kinase activity, 0016772 transferase activity, transferring phosphorus-containing groups, 0000103 sulfate assimilation; PDB: 1M7H_B 1M7G_B 3CR7_B 1D6J_A 2OFW_G 1X6V_B 1XNJ_A 1XJQ_B 2PEY_A 2PEZ_B ....
Probab=27.36 E-value=1.6e+02 Score=18.17 Aligned_cols=21 Identities=29% Similarity=0.245 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHhcCCCceeeC
Q psy7765 57 QQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 57 ~~s~~l~~al~~~g~~~~~~~ 77 (77)
.-+..+.+.|.+.|.++.++|
T Consensus 17 TlA~~L~~~L~~~g~~~~~LD 37 (156)
T PF01583_consen 17 TLARALERRLFARGIKVYLLD 37 (156)
T ss_dssp HHHHHHHHHHHHTTS-EEEEE
T ss_pred HHHHHHHHHHHHcCCcEEEec
Confidence 445666777777777666653
No 260
>PLN03033 2-dehydro-3-deoxyphosphooctonate aldolase; Provisional
Probab=27.31 E-value=1.9e+02 Score=19.97 Aligned_cols=42 Identities=12% Similarity=0.169 Sum_probs=30.2
Q ss_pred ChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhc----CCCc
Q psy7765 30 DVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAA----DIFF 73 (77)
Q Consensus 30 sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~----g~~~ 73 (77)
|.+.+.-.. ..|+++|.| -+.+=..+|..+.++++++. |+++
T Consensus 5 ~~~~~~~~~-~~~~~lIAG-PC~iEs~e~~~~~A~~lk~~~~~~g~~~ 50 (290)
T PLN03033 5 SLLYNQLKA-AEPFFLLAG-PNVIESEEHILRMAKHIKDISTKLGLPL 50 (290)
T ss_pred hHHHHhcCC-CCCeEEEec-CChhcCHHHHHHHHHHHHHHHHhCCCcE
Confidence 444444444 578888888 56666889999999999986 6654
No 261
>PF00532 Peripla_BP_1: Periplasmic binding proteins and sugar binding domain of LacI family; InterPro: IPR001761 This family includes the periplasmic binding proteins, and the LacI family transcriptional regulators. The periplasmic binding proteins are the primary receptors for chemotaxis and transport of many sugar based solutes. The LacI family of proteins consist of transcriptional regulators related to the lac repressor. In this case, generally the sugar binding domain binds a sugar which changes the DNA binding activity of the repressor domain (lacI) [, ].; PDB: 1BAP_A 7ABP_A 6ABP_A 1ABF_A 5ABP_A 2WRZ_B 9ABP_A 1APB_A 1ABE_A 8ABP_A ....
Probab=27.22 E-value=1.4e+02 Score=19.60 Aligned_cols=32 Identities=16% Similarity=0.184 Sum_probs=21.3
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
+.++.|..+...-..-..-|-+||+++|.++.
T Consensus 122 I~~i~~~~~~~~~~~R~~Gy~~Al~~~Gl~~~ 153 (279)
T PF00532_consen 122 IAFIGGPEDSSTSRERLQGYRDALKEAGLPID 153 (279)
T ss_dssp EEEEEESTTTHHHHHHHHHHHHHHHHTTSCEE
T ss_pred EEEEecCcchHHHHHHHHHHHHHHHHcCCCCC
Confidence 77777777666444444457788888887543
No 262
>PRK02929 L-arabinose isomerase; Provisional
Probab=27.05 E-value=1.1e+02 Score=22.53 Aligned_cols=41 Identities=12% Similarity=0.033 Sum_probs=27.4
Q ss_pred hccCCCCcEEEEEeCCCCc----------cCHHHH----HHHHHHHHhcCCCceee
Q psy7765 35 VGNFVNKTFLLMHGTADDN----------VHYQQS----MMLAKALEAADIFFYQQ 76 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~----------Vp~~~s----~~l~~al~~~g~~~~~~ 76 (77)
++.+ +.|+|++|---.+. ....|| .++...|.+.|++++.+
T Consensus 93 ~~~l-~~PvL~~~~Q~~~e~p~~~id~d~m~lnqs~~G~~e~~~il~R~gi~~~~v 147 (499)
T PRK02929 93 LSAL-QKPLLHLHTQFNAEIPWDTIDMDFMNLNQSAHGDREFGFIGARLRKQRKVV 147 (499)
T ss_pred HHHc-CCCEEEEecCCCccCCCCCCCcchhhhhhcccChHHHHHHHHHcCCCeeEE
Confidence 5567 79999999922222 233333 36888899999988764
No 263
>cd00578 L-fuc_L-ara-isomerases L-fucose isomerase (FucIase) and L-arabinose isomerase (AI) family; composed of FucIase, AI and similar proteins. FucIase converts L-fucose, an aldohexose, to its ketose form, which prepares it for aldol cleavage (similar to the isomerization of glucose in glycolysis). L-fucose (or 6-deoxy-L-galactose) is found in various oligo- and polysaccharides in mammals, bacteria and plants. AI catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion to D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute.
Probab=27.03 E-value=1.7e+02 Score=20.80 Aligned_cols=41 Identities=15% Similarity=0.100 Sum_probs=25.5
Q ss_pred hccCCCCcEEEEEeCCCCc--------cCHHHHHHHHHHHHhcCCCceee
Q psy7765 35 VGNFVNKTFLLMHGTADDN--------VHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~--------Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
+... +.|+++++--.... -..--..++...|++.|+++..+
T Consensus 84 ~~~~-~~Pvll~a~~~~~~~~~~~~~~~s~~g~~~~~~~l~r~gi~~~~v 132 (452)
T cd00578 84 LSEL-RKPVLLLATQFNREIPDFMNLNQSACGLREFGNILARLGIPFKVV 132 (452)
T ss_pred HHhc-CCCEEEEeCCCCCCCCchhhhhcchhhhHHHHHHHHHcCCceeEE
Confidence 4456 68988777633211 12223566888888889888753
No 264
>COG3411 Ferredoxin [Energy production and conversion]
Probab=26.98 E-value=98 Score=16.44 Aligned_cols=35 Identities=6% Similarity=-0.043 Sum_probs=25.5
Q ss_pred CCcEEEEEeCC--CCccCHHHHHHHHHHHHhcCCCce
Q psy7765 40 NKTFLLMHGTA--DDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 40 ~~p~ll~hG~~--D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
..|+++..-+. =..|.++.+.+++++....|.+++
T Consensus 16 ~gPvl~vYpegvWY~~V~p~~a~rIv~~hl~~Gr~Ve 52 (64)
T COG3411 16 DGPVLVVYPEGVWYTRVDPEDARRIVQSHLLGGRPVE 52 (64)
T ss_pred cCCEEEEecCCeeEeccCHHHHHHHHHHHHhCCCcch
Confidence 36666665433 246889999999999988887765
No 265
>PRK12326 preprotein translocase subunit SecA; Reviewed
Probab=26.83 E-value=1.7e+02 Score=22.90 Aligned_cols=31 Identities=23% Similarity=0.112 Sum_probs=26.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..|+|+.- ..+++|+.+.+.|++.|++.+.+
T Consensus 427 GrPVLVgt------~sI~~SE~ls~~L~~~gI~h~vL 457 (764)
T PRK12326 427 GQPVLVGT------HDVAESEELAERLRAAGVPAVVL 457 (764)
T ss_pred CCCEEEEe------CCHHHHHHHHHHHHhCCCcceee
Confidence 68999875 45789999999999999998764
No 266
>PF02629 CoA_binding: CoA binding domain; InterPro: IPR003781 This domain has a Rossmann fold and is found in a number of proteins including succinyl CoA synthetases, malate and ATP-citrate ligases.; GO: 0005488 binding; PDB: 3IL2_B 3IKT_A 3IKV_B 2SCU_D 1JKJ_D 2NU7_A 1CQI_A 1JLL_A 2NU8_D 1SCU_D ....
Probab=26.70 E-value=83 Score=17.22 Aligned_cols=20 Identities=25% Similarity=0.077 Sum_probs=15.6
Q ss_pred cCHHHHHHHHHHHHhcCCCc
Q psy7765 54 VHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 54 Vp~~~s~~l~~al~~~g~~~ 73 (77)
||.+.+.+.+..+.++|+..
T Consensus 70 VP~~~a~~~~~~~~~~gIk~ 89 (96)
T PF02629_consen 70 VPAEAAQEVADELVEAGIKG 89 (96)
T ss_dssp S-HHHHHHHHHHHHHTT-SE
T ss_pred cCHHHHHHHHHHHHHcCCCE
Confidence 59999999999999998853
No 267
>COG0132 BioD Dethiobiotin synthetase [Coenzyme metabolism]
Probab=26.69 E-value=1.7e+02 Score=19.21 Aligned_cols=42 Identities=12% Similarity=0.205 Sum_probs=33.9
Q ss_pred cChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 29 TDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 29 ~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
..-+..+... +-|++|+-|.+-..++ |+.-=+++++++|+++
T Consensus 126 ~~~~D~~~~~-~lpvILV~~~~LGtIN--HtlLt~eal~~~gl~l 167 (223)
T COG0132 126 YTFADLAVQL-QLPVILVVGIKLGTIN--HTLLTVEALRARGLPL 167 (223)
T ss_pred ccHHHHHHHc-CCCEEEEecCCccHHH--HHHHHHHHHHHCCCCE
Confidence 3445566677 6999999999888876 8888899999999875
No 268
>PF09413 DUF2007: Domain of unknown function (DUF2007); InterPro: IPR018551 This is a family of proteins with unknown function. ; PDB: 2HFV_A.
Probab=26.54 E-value=99 Score=15.58 Aligned_cols=21 Identities=14% Similarity=0.146 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHhcCCCceee
Q psy7765 56 YQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~~ 76 (77)
+-++..+...|.++|+++...
T Consensus 9 ~~ea~~i~~~L~~~gI~~~v~ 29 (67)
T PF09413_consen 9 PIEAELIKGLLEENGIPAFVK 29 (67)
T ss_dssp HHHHHHHHHHHHHTT--EE--
T ss_pred HHHHHHHHHHHHhCCCcEEEE
Confidence 456777888999999988654
No 269
>PF02525 Flavodoxin_2: Flavodoxin-like fold; InterPro: IPR003680 This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) 1.6.99.2 from EC. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones and protect cells against damage by free radicals and reactive oxygen species []. This enzyme uses a FAD cofactor. The equation for this reaction is NAD(P)H + acceptor = NAD(P)(+) + reduced acceptor. This enzyme is also involved in the bioactivation of prodrugs used in chemotherapy []. The family also includes acyl carrier protein phosphodiesterase 3.1.4.14 from EC. This enzyme converts holo-ACP to apo-ACP by hydrolytic cleavage of the phosphopantetheine residue from ACP []. This family is related to FMN_red IPR005025 from INTERPRO and Flavodoxin_1 IPR008254 from INTERPRO.; GO: 0009055 electron carrier activity, 0016491 oxidoreductase activity, 0050662 coenzyme binding; PDB: 1T5B_B 1DXQ_B 2B3D_A 2Z9D_B 2Z9C_A 2Z98_A 2D5I_A 2Z9B_A 1TIK_A 1V4B_A ....
Probab=26.51 E-value=97 Score=19.09 Aligned_cols=35 Identities=9% Similarity=0.079 Sum_probs=22.3
Q ss_pred cEEEEEeCCCC--ccCHHHHHHHHHHHHhcC-CCceee
Q psy7765 42 TFLLMHGTADD--NVHYQQSMMLAKALEAAD-IFFYQQ 76 (77)
Q Consensus 42 p~ll~hG~~D~--~Vp~~~s~~l~~al~~~g-~~~~~~ 76 (77)
.+|++.|.-+. .+.-.-+..+.++++++| ..++..
T Consensus 2 kiLvI~asp~~~~S~s~~l~~~~~~~~~~~~~~~v~~~ 39 (199)
T PF02525_consen 2 KILVINASPRPEGSFSRALADAFLEGLQEAGPHEVEIR 39 (199)
T ss_dssp EEEEEE--SSTTTSHHHHHHHHHHHHHHHHTTSEEEEE
T ss_pred EEEEEEcCCCCccCHHHHHHHHHHHHHHHcCCCEEEEE
Confidence 36777777766 344455677888999988 556554
No 270
>PTZ00062 glutaredoxin; Provisional
Probab=26.40 E-value=1.5e+02 Score=19.04 Aligned_cols=35 Identities=17% Similarity=0.274 Sum_probs=23.8
Q ss_pred CcEEEEEeCCC-CccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 41 KTFLLMHGTAD-DNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG~~D-~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
+=++++.|+.+ +.+| .+.+..+-|++.|++++.+|
T Consensus 114 ~Vvvf~Kg~~~~p~C~--~C~~~k~~L~~~~i~y~~~D 149 (204)
T PTZ00062 114 KILLFMKGSKTFPFCR--FSNAVVNMLNSSGVKYETYN 149 (204)
T ss_pred CEEEEEccCCCCCCCh--hHHHHHHHHHHcCCCEEEEE
Confidence 55566677644 6677 45566777888899887654
No 271
>TIGR00624 tag DNA-3-methyladenine glycosylase I. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=26.23 E-value=38 Score=21.52 Aligned_cols=24 Identities=21% Similarity=0.325 Sum_probs=18.5
Q ss_pred CCCccCHHH--HHHHHHHHHhcCCCc
Q psy7765 50 ADDNVHYQQ--SMMLAKALEAADIFF 73 (77)
Q Consensus 50 ~D~~Vp~~~--s~~l~~al~~~g~~~ 73 (77)
....||.+. |..+.+.|+++|..+
T Consensus 128 ~~~~~p~~t~~S~~lskdLKkrGfkF 153 (179)
T TIGR00624 128 TDSEIPSSTPESKAMSKELKKRGFRF 153 (179)
T ss_pred ccccCCCCCHHHHHHHHHHHHcCCee
Confidence 445666655 999999999999754
No 272
>PRK00939 translation initiation factor Sui1; Reviewed
Probab=26.16 E-value=1.1e+02 Score=17.45 Aligned_cols=19 Identities=5% Similarity=-0.183 Sum_probs=10.6
Q ss_pred HHHHHHHHHHhcCCCceee
Q psy7765 58 QSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~ 76 (77)
|...+.+.|.+.|++...+
T Consensus 80 ~r~~v~~~L~~~G~~~~~i 98 (99)
T PRK00939 80 HRERVKELLIKMGFSEENI 98 (99)
T ss_pred HHHHHHHHHHHcCCChhhc
Confidence 4555555666666655443
No 273
>COG1834 N-Dimethylarginine dimethylaminohydrolase [Amino acid transport and metabolism]
Probab=25.93 E-value=83 Score=21.33 Aligned_cols=23 Identities=22% Similarity=0.083 Sum_probs=19.3
Q ss_pred CHHHHHHHHHHHHhcCCCceeeC
Q psy7765 55 HYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~~~ 77 (77)
=.+|=..|+++|+++|+.+.+++
T Consensus 36 A~aQh~~lve~l~~~gv~V~ll~ 58 (267)
T COG1834 36 AVAQHEALVEALEKNGVEVHLLP 58 (267)
T ss_pred HHHHHHHHHHHHHHCCCEEEEcC
Confidence 45777889999999999998763
No 274
>cd03143 A4_beta-galactosidase_middle_domain A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to beta-galactosidase from Thermus thermophilus. Beta-Galactosidase hydrolyzes the beta-1,4-D-galactosidic linkage of lactose, as well as those of related chromogens, o-nitrophenyl-beta-D-galactopyranoside (ONP-Gal) and 5-bromo-4-chloro-3-indolyl-beta-D-galactoside (X-gal). This A4 beta-galactosidase middle domain lacks the catalytic triad of typical GATase1 domains. The reactive Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in typical GATase1 domains is not conserved in this group.
Probab=25.87 E-value=1.1e+02 Score=17.94 Aligned_cols=22 Identities=18% Similarity=0.078 Sum_probs=18.4
Q ss_pred CHHHHHHHHHHHHhcCCCceee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~~ 76 (77)
.......++++|.+.|++++++
T Consensus 24 y~~~~~~~~~~l~~~gi~~d~v 45 (154)
T cd03143 24 YLDLALALYRALRELGIPVDVV 45 (154)
T ss_pred HHHHHHHHHHHHHHCCCCEEEE
Confidence 3456888999999999999875
No 275
>KOG4409|consensus
Probab=25.85 E-value=1.5e+02 Score=21.15 Aligned_cols=45 Identities=9% Similarity=0.206 Sum_probs=29.0
Q ss_pred ccccChhHHhccCC-CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 26 YNWTDVTRKVGNFV-NKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 26 y~~~sp~~~~~~~~-~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
+.....++.+..++ ..|+++|+|..| -+...-..++.+.|...++
T Consensus 288 ~Ar~Pm~~r~~~l~~~~pv~fiyG~~d-WmD~~~g~~~~~~~~~~~~ 333 (365)
T KOG4409|consen 288 WARRPMIQRLRELKKDVPVTFIYGDRD-WMDKNAGLEVTKSLMKEYV 333 (365)
T ss_pred hhhhhHHHHHHhhccCCCEEEEecCcc-cccchhHHHHHHHhhcccc
Confidence 34455566666652 499999999655 5566667777776644333
No 276
>cd03557 L-arabinose_isomerase L-Arabinose isomerase (AI) catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion into D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute.
Probab=25.81 E-value=97 Score=22.69 Aligned_cols=41 Identities=15% Similarity=0.093 Sum_probs=26.7
Q ss_pred hccCCCCcEEEEEeCCCCccCH----------HHH----HHHHHHHHhcCCCceee
Q psy7765 35 VGNFVNKTFLLMHGTADDNVHY----------QQS----MMLAKALEAADIFFYQQ 76 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~D~~Vp~----------~~s----~~l~~al~~~g~~~~~~ 76 (77)
++.+ +.|+|++|--.-..+|. .|+ .++...|.+.|++++++
T Consensus 87 ~~~l-~~PvL~~~~q~~~~l~~~sidmd~m~l~qaahG~~e~~~il~R~gi~~~~v 141 (484)
T cd03557 87 LTAL-QKPLLHLHTQFNREIPWDTIDMDFMNLNQSAHGDREFGFIGSRMRIPRKVV 141 (484)
T ss_pred HHHc-CCCEEEEccCCCccCCCCCccchHHhhhhhcCCcHHHHHHHHHcCCCeeEE
Confidence 4567 79999999864333333 222 24566888899988764
No 277
>PF10137 TIR-like: Predicted nucleotide-binding protein containing TIR-like domain; InterPro: IPR019302 This entry represents a TIR-like domain found in a family of prokaryotic predicted nucleotide-binding proteins. Their exact function has not, as yet, been defined.
Probab=25.72 E-value=1.5e+02 Score=17.52 Aligned_cols=6 Identities=33% Similarity=1.060 Sum_probs=4.1
Q ss_pred EEEEEe
Q psy7765 43 FLLMHG 48 (77)
Q Consensus 43 ~ll~hG 48 (77)
+||.||
T Consensus 2 VFIvhg 7 (125)
T PF10137_consen 2 VFIVHG 7 (125)
T ss_pred EEEEeC
Confidence 567777
No 278
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=25.72 E-value=64 Score=13.12 Aligned_cols=18 Identities=17% Similarity=0.093 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHhcCCCc
Q psy7765 56 YQQSMMLAKALEAADIFF 73 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~ 73 (77)
+..+.++.+.+++.|+..
T Consensus 16 ~~~a~~~~~~M~~~g~~p 33 (35)
T TIGR00756 16 VEEALELFKEMLERGIEP 33 (35)
T ss_pred HHHHHHHHHHHHHcCCCC
Confidence 467888888888888754
No 279
>PF04056 Ssl1: Ssl1-like; InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=25.66 E-value=1.5e+02 Score=19.07 Aligned_cols=45 Identities=9% Similarity=0.079 Sum_probs=32.7
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+.+..+...+-++++-| .-..+.+.......+.|++.++.+..+
T Consensus 92 ~L~~~p~~~srEIlvi~g-Sl~t~Dp~di~~ti~~l~~~~IrvsvI 136 (193)
T PF04056_consen 92 SLKHMPSHGSREILVIFG-SLTTCDPGDIHETIESLKKENIRVSVI 136 (193)
T ss_pred HHhhCccccceEEEEEEe-ecccCCchhHHHHHHHHHHcCCEEEEE
Confidence 344444443567777777 556888889999999999999977653
No 280
>cd04446 DEP_DEPDC4 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in DEPDC4-like proteins. DEPDC4 is a DEP domain containing protein of unknown function.
Probab=25.65 E-value=1.2e+02 Score=17.27 Aligned_cols=22 Identities=27% Similarity=0.413 Sum_probs=18.9
Q ss_pred CCccCHHHHHHHHHHHHhcCCC
Q psy7765 51 DDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 51 D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
|..++-.++.+|.++|.+.|+-
T Consensus 49 ~~~~tR~~Av~l~q~Ll~~gvi 70 (95)
T cd04446 49 DVDVPRAKAVRLCQALMDCRVF 70 (95)
T ss_pred cccCCHHHHHHHHHHHHHcCCe
Confidence 4577889999999999999874
No 281
>COG4545 Glutaredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
Probab=25.60 E-value=76 Score=17.63 Aligned_cols=20 Identities=10% Similarity=0.057 Sum_probs=16.5
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
++..+.+.|.+.+++++++|
T Consensus 14 dca~a~eyl~rl~v~yd~Ve 33 (85)
T COG4545 14 DCAPAVEYLERLNVDYDFVE 33 (85)
T ss_pred chHHHHHHHHHcCCCceeee
Confidence 56677888999999999875
No 282
>PF03129 HGTP_anticodon: Anticodon binding domain; InterPro: IPR004154 tRNA synthetases, or tRNA ligases are involved in protein synthesis. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases [] it is probably the anticodon binding domain [].; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding; PDB: 1KOG_B 1EVL_D 1EVK_B 1QF6_A 1FYF_B 2I4O_A 2I4M_A 2I4N_A 2I4L_A 1HC7_D ....
Probab=25.34 E-value=1e+02 Score=16.34 Aligned_cols=20 Identities=25% Similarity=0.124 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHhcCCCcee
Q psy7765 56 YQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~ 75 (77)
...+.++++.|+++|+.+++
T Consensus 15 ~~~a~~l~~~L~~~gi~v~~ 34 (94)
T PF03129_consen 15 IEYAQELANKLRKAGIRVEL 34 (94)
T ss_dssp HHHHHHHHHHHHHTTSEEEE
T ss_pred HHHHHHHHHHHHHCCCEEEE
Confidence 35678899999999987665
No 283
>PF13709 DUF4159: Domain of unknown function (DUF4159)
Probab=25.27 E-value=1.8e+02 Score=18.64 Aligned_cols=30 Identities=23% Similarity=0.214 Sum_probs=23.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
.-|++.++|..+-.-..++...|-+-|.+=
T Consensus 53 ~yP~ly~~g~~~~~~s~~e~~~Lr~Yl~~G 82 (207)
T PF13709_consen 53 FYPFLYWPGHGDFPLSDEEIANLRRYLENG 82 (207)
T ss_pred hCCEEEEeCCCCCCCCHHHHHHHHHHHHcC
Confidence 689999999999977777777777776553
No 284
>cd03060 GST_N_Omega_like GST_N family, Omega-like subfamily; composed of uncharacterized proteins with similarity to class Omega GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Omega GSTs show little or no GSH-conjugating activity towards standard GST substrates. Instead, they catalyze the GSH dependent reduction of protein disulfides, dehydroascorbate and monomethylarsonate, activities which are more characteristic of glutaredoxins. Like Omega enzymes, proteins in this subfamily contain a conserved cysteine equivalent to the first cysteine in the CXXC motif of glutaredoxins, which is a r
Probab=25.11 E-value=1e+02 Score=15.50 Aligned_cols=19 Identities=26% Similarity=0.051 Sum_probs=14.4
Q ss_pred HHHHHHHHHHhcCCCceee
Q psy7765 58 QSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~ 76 (77)
.+.+...+|...|++++.+
T Consensus 11 ~~~rv~~~L~~~gl~~e~~ 29 (71)
T cd03060 11 YAMRARMALLLAGITVELR 29 (71)
T ss_pred HHHHHHHHHHHcCCCcEEE
Confidence 4566677788889988865
No 285
>PF01841 Transglut_core: Transglutaminase-like superfamily; InterPro: IPR002931 This domain is found in many proteins known to have transglutaminase activity, i.e. which cross-link proteins through an acyl-transfer reaction between the gamma-carboxamide group of peptide-bound glutamine and the epsilon-amino group of peptide-bound lysine, resulting in a epsilon-(gamma-glutamyl)lysine isopeptide bond. Tranglutaminases have been found in a diverse range of species, from bacteria through to mammals. The enzymes require calcium binding and their activity leads to post-translational modification of proteins through acyl-transfer reactions, involving peptidyl glutamine residues as acyl donors and a variety of primary amines as acyl acceptors, with the generation of proteinase resistant isopeptide bonds []. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterised transglutaminase, the human blood clotting factor XIIIa' []. On the basis of the experimentally demonstrated activity of the Methanobacterium phage psiM2 pseudomurein endoisopeptidase [], it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease []. A subunit of plasma Factor XIII revealed that each Factor XIIIA subunit is composed of four domains (termed N-terminal beta-sandwich, core domain (containing the catalytic and the regulatory sites), and C-terminal beta-barrels 1 and 2) and that two monomers assemble into the native dimer through the surfaces in domains 1 and 2, in opposite orientation. This organisation in four domains is highly conserved during evolution among transglutaminase isoforms [].; PDB: 2F4M_A 2F4O_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B ....
Probab=24.76 E-value=90 Score=16.97 Aligned_cols=21 Identities=14% Similarity=-0.024 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHhcCCCceee
Q psy7765 56 YQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~~ 76 (77)
...+.-|+.-|+..|+++.++
T Consensus 55 ~~~a~l~~allr~~Gipar~v 75 (113)
T PF01841_consen 55 EDYASLFVALLRALGIPARVV 75 (113)
T ss_dssp HHHHHHHHHHHHHHT--EEEE
T ss_pred HHHHHHHHHHHhhCCCceEEE
Confidence 456777888889999998764
No 286
>PRK05720 mtnA methylthioribose-1-phosphate isomerase; Reviewed
Probab=24.72 E-value=1.1e+02 Score=21.47 Aligned_cols=42 Identities=14% Similarity=-0.094 Sum_probs=27.2
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHH-HHHHHHhcCCCceee
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMM-LAKALEAADIFFYQQ 76 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~-l~~al~~~g~~~~~~ 76 (77)
.+..+... ...+-++- +..-|..|..+ ++..|.+.|+++.++
T Consensus 170 ~i~~A~~~-gk~~~V~v---~EsRP~~qG~~lta~eL~~~GI~vtlI 212 (344)
T PRK05720 170 PIYAAKEK-GIDIHVYA---DETRPRLQGARLTAWELYQAGIDVTVI 212 (344)
T ss_pred HHHHHHHc-CCceEEEE---cCCCChhhhHHHHHHHHHHCCCCEEEE
Confidence 44444433 34444443 35567778775 589999999999875
No 287
>PF03960 ArsC: ArsC family; InterPro: IPR006660 Several bacterial taxon have a chromosomal resistance system, encoded by the ars operon, for the detoxification of arsenate, arsenite, and antimonite []. This system transports arsenite and antimonite out of the cell. The pump is composed of two polypeptides, the products of the arsA and arsB genes. This two-subunit enzyme produces resistance to arsenite and antimonite. Arsenate, however, must first be reduced to arsenite before it is extruded. A third gene, arsC, expands the substrate specificity to allow for arsenate pumping and resistance. ArsC is an approximately 150-residue arsenate reductase that uses reduced glutathione (GSH) to convert arsenate to arsenite with a redox active cysteine residue in the active site. ArsC forms an active quaternary complex with GSH, arsenate, and glutaredoxin 1 (Grx1). The three ligands must be present simultaneously for reduction to occur []. The arsC family also comprises the Spx proteins which are GRAM-positive bacterial transcription factors that regulate the transcription of multiple genes in response to disulphide stress []. The arsC protein structure has been solved []. It belongs to the thioredoxin superfamily fold which is defined by a beta-sheet core surrounded by alpha-helices. The active cysteine residue of ArsC is located in the loop between the first beta-strand and the first helix, which is also conserved in the Spx protein and its homologues.; PDB: 2KOK_A 1SK1_A 1SK2_A 1JZW_A 1J9B_A 1S3C_A 1SD8_A 1SD9_A 1I9D_A 1SK0_A ....
Probab=24.63 E-value=73 Score=17.91 Aligned_cols=20 Identities=25% Similarity=0.074 Sum_probs=13.4
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
-+....+-|.++|++++++|
T Consensus 8 t~rka~~~L~~~gi~~~~~d 27 (110)
T PF03960_consen 8 TCRKALKWLEENGIEYEFID 27 (110)
T ss_dssp HHHHHHHHHHHTT--EEEEE
T ss_pred HHHHHHHHHHHcCCCeEeeh
Confidence 35566777889999998864
No 288
>PRK05279 N-acetylglutamate synthase; Validated
Probab=24.60 E-value=1.2e+02 Score=21.37 Aligned_cols=28 Identities=21% Similarity=0.249 Sum_probs=20.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..++.|+||. .-++-+.+.+.|++.+++
T Consensus 56 g~~~VlVHGg---------g~~i~~~~~~~g~~~~~~ 83 (441)
T PRK05279 56 GIRLVLVHGA---------RPQIEEQLAARGIEPRYH 83 (441)
T ss_pred CCeEEEECCC---------CHHHHHHHHHcCCCceec
Confidence 4689999997 455566667778877664
No 289
>TIGR01616 nitro_assoc nitrogenase-associated protein. This model describes a small family of uncharacterized proteins found so far in alpha and gamma proteobacteria and in Nostoc sp. PCC 7120, a cyanobacterium. The gene for this protein is associated with nitrogenase genes. This family shows sequence similarity to TIGR00014, a glutaredoxin-dependent arsenate reductase that converts arsentate to arsenite for disposal. This family is one of several included in Pfam model pfam03960.
Probab=24.55 E-value=95 Score=18.32 Aligned_cols=19 Identities=26% Similarity=0.071 Sum_probs=15.0
Q ss_pred HHHHHHHHHhcCCCceeeC
Q psy7765 59 SMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 59 s~~l~~al~~~g~~~~~~~ 77 (77)
+..--+-|.++|++++++|
T Consensus 14 ~RKA~~~L~~~gi~~~~~d 32 (126)
T TIGR01616 14 NARQKAALKASGHDVEVQD 32 (126)
T ss_pred HHHHHHHHHHCCCCcEEEe
Confidence 5556677889999999875
No 290
>PF06028 DUF915: Alpha/beta hydrolase of unknown function (DUF915); InterPro: IPR010315 This family consists of bacterial proteins of unknown function, which are hydrolase-like.; PDB: 3LP5_A 3FLE_A 3DS8_A.
Probab=24.28 E-value=48 Score=22.01 Aligned_cols=35 Identities=23% Similarity=0.191 Sum_probs=27.1
Q ss_pred CCcEEEEEeC------CCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 40 NKTFLLMHGT------ADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 40 ~~p~ll~hG~------~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
...+|.|-|. .|..||...+..+.--++.+...++
T Consensus 184 ~i~VLnI~G~~~~g~~sDG~V~~~Ss~sl~~L~~~~~~~Y~ 224 (255)
T PF06028_consen 184 NIQVLNIYGDLEDGSNSDGIVPNASSLSLRYLLKNRAKSYQ 224 (255)
T ss_dssp T-EEEEEEEESBTTCSBTSSSBHHHHCTHHHHCTTTSSEEE
T ss_pred CeEEEEEecccCCCCCCCeEEeHHHHHHHHHHhhcccCceE
Confidence 5789999999 8999999999888776666554443
No 291
>PRK12475 thiamine/molybdopterin biosynthesis MoeB-like protein; Provisional
Probab=23.98 E-value=1.1e+02 Score=21.12 Aligned_cols=19 Identities=26% Similarity=0.420 Sum_probs=13.2
Q ss_pred EEEEeCCCCccCHHHHHHHHHHH
Q psy7765 44 LLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 44 ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
.++||+.| +..+..++..+
T Consensus 318 ~~~~g~~~----~~~a~~~~~~~ 336 (338)
T PRK12475 318 AFIHGTND----IKKAKRLYARY 336 (338)
T ss_pred EEEECCCC----HHHHHHHHHHh
Confidence 35889888 45677777654
No 292
>PF03446 NAD_binding_2: NAD binding domain of 6-phosphogluconate dehydrogenase; InterPro: IPR006115 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) [, ]. Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved []. The protein is a homodimer in which the monomers act independently []: each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet []. NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket []. This family represents the NADP binding domain of 6-phosphogluconate dehydrogenase which adopts a Rossman fold. The C-terminal domain is described in IPR006114 from INTERPRO.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0006098 pentose-phosphate shunt, 0055114 oxidation-reduction process; PDB: 3AX6_D 3PDU_G 3Q3C_A 3OBB_A 4DLL_B 1PGP_A 1PGN_A 2PGD_A 1PGQ_A 1PGO_A ....
Probab=23.95 E-value=1e+02 Score=18.48 Aligned_cols=28 Identities=18% Similarity=0.210 Sum_probs=19.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~ 71 (77)
...+++-++ .+.++.+.++.+.+.++|+
T Consensus 86 ~g~iiid~s----T~~p~~~~~~~~~~~~~g~ 113 (163)
T PF03446_consen 86 PGKIIIDMS----TISPETSRELAERLAAKGV 113 (163)
T ss_dssp TTEEEEE-S----S--HHHHHHHHHHHHHTTE
T ss_pred cceEEEecC----Ccchhhhhhhhhhhhhccc
Confidence 345555555 4678999999999999884
No 293
>TIGR00524 eIF-2B_rel eIF-2B alpha/beta/delta-related uncharacterized proteins. This model, eIF-2B_rel, describes half of a superfamily, where the other half consists of eukaryotic translation initiation factor 2B (eIF-2B) subunits alpha, beta, and delta. It is unclear whether the eIF-2B_rel set is monophyletic, or whether they are all more closely related to each other than to any eIF-2B subunit because the eIF-2B clade is highly derived. Members of this branch of the family are all uncharacterized with respect to function and are found in the Archaea, Bacteria, and Eukarya, although a number are described as putative translation intiation factor components. Proteins found by eIF-2B_rel include at least three clades, including a set of uncharacterized eukaryotic proteins, a set found in some but not all Archaea, and a set universal so far among the Archaea and closely related to several uncharacterized bacterial proteins.
Probab=23.78 E-value=86 Score=21.39 Aligned_cols=25 Identities=16% Similarity=-0.042 Sum_probs=19.7
Q ss_pred CccCHHHH-HHHHHHHHhcCCCceee
Q psy7765 52 DNVHYQQS-MMLAKALEAADIFFYQQ 76 (77)
Q Consensus 52 ~~Vp~~~s-~~l~~al~~~g~~~~~~ 76 (77)
..-|..+. ..+++.|.+.|++++++
T Consensus 159 EsrP~~~G~~~~a~~L~~~gI~vtlI 184 (303)
T TIGR00524 159 ETRPRNQGSRLTAWELMQDGIDVTLI 184 (303)
T ss_pred CCCCccchHHHHHHHHHHCCCCEEEE
Confidence 33466666 77899999999999875
No 294
>PRK12902 secA preprotein translocase subunit SecA; Reviewed
Probab=23.77 E-value=1.8e+02 Score=23.41 Aligned_cols=31 Identities=16% Similarity=0.149 Sum_probs=26.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
..|+|+.-. .++.|+.+.+.|.+.|++.+++
T Consensus 439 GrPVLIgT~------SVe~SE~ls~~L~~~gi~h~vL 469 (939)
T PRK12902 439 GRPVLVGTT------SVEKSELLSALLQEQGIPHNLL 469 (939)
T ss_pred CCCEEEeeC------CHHHHHHHHHHHHHcCCchhee
Confidence 689998755 4889999999999999998765
No 295
>PRK01372 ddl D-alanine--D-alanine ligase; Reviewed
Probab=23.73 E-value=1.2e+02 Score=19.85 Aligned_cols=35 Identities=17% Similarity=0.138 Sum_probs=20.9
Q ss_pred cEEEEEeC--CCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 42 TFLLMHGT--ADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 42 p~ll~hG~--~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
++-++.|. .+..+.......+.++|++.|..+..+
T Consensus 6 ~v~~~~g~~~~~~~~~~~s~~~i~~al~~~g~~v~~i 42 (304)
T PRK01372 6 KVAVLMGGTSAEREVSLNSGAAVLAALREAGYDAHPI 42 (304)
T ss_pred EEEEEeCCCCCCceEeHHhHHHHHHHHHHCCCEEEEE
Confidence 34444432 334455566677777887777776654
No 296
>PF06850 PHB_depo_C: PHB de-polymerase C-terminus; InterPro: IPR009656 This entry represents the C terminus of bacterial poly(3-hydroxybutyrate) (PHB) de-polymerase. This degrades PHB granules to oligomers and monomers of 3-hydroxy-butyric acid.
Probab=23.55 E-value=1.3e+02 Score=19.58 Aligned_cols=30 Identities=23% Similarity=0.215 Sum_probs=23.7
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 38 FVNKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 38 ~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
+..+++|-+-|+.|++..+.|+.. +..|+.
T Consensus 132 I~~taLlTVEGe~DDIsg~GQT~A-A~~LC~ 161 (202)
T PF06850_consen 132 IRRTALLTVEGERDDISGPGQTHA-AHDLCT 161 (202)
T ss_pred cccceeEEeecCcccCCcchHHHH-HHHHhc
Confidence 446889999999999999999876 444443
No 297
>cd03032 ArsC_Spx Arsenate Reductase (ArsC) family, Spx subfamily; Spx is a unique RNA polymerase (RNAP)-binding protein present in bacilli and some mollicutes. It inhibits transcription by binding to the C-terminal domain of the alpha subunit of RNAP, disrupting complex formation between RNAP and certain transcriptional activator proteins like ResD and ComA. In response to oxidative stress, Spx can also activate transcription, making it a general regulator that exerts both positive and negative control over transcription initiation. Spx has been shown to exert redox-sensitive transcriptional control over genes like trxA (TRX) and trxB (TRX reductase), genes that function in thiol homeostasis. This redox-sensitive activity is dependent on the presence of a CXXC motif, present in some members of the Spx subfamily, that acts as a thiol/disulfide switch. Spx has also been shown to repress genes in a sulfate-dependent manner independent of the presence of the CXXC motif.
Probab=23.51 E-value=1.2e+02 Score=17.27 Aligned_cols=20 Identities=20% Similarity=0.187 Sum_probs=14.9
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
.+....+.|.++|+++++++
T Consensus 12 ~c~ka~~~L~~~gi~~~~id 31 (115)
T cd03032 12 SCRKAKQWLEEHQIPFEERN 31 (115)
T ss_pred HHHHHHHHHHHCCCceEEEe
Confidence 35556677888999998874
No 298
>cd02977 ArsC_family Arsenate Reductase (ArsC) family; composed of TRX-fold arsenic reductases and similar proteins including the transcriptional regulator, Spx. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX), through a single catalytic cysteine. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Spx is a general regulator that exerts negative and positive control over transcription initiation by binding to the C-terminal domain of the alpha subunit of RNA polymerase.
Probab=23.35 E-value=1.2e+02 Score=16.71 Aligned_cols=19 Identities=26% Similarity=0.043 Sum_probs=14.0
Q ss_pred HHHHHHHHHhcCCCceeeC
Q psy7765 59 SMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 59 s~~l~~al~~~g~~~~~~~ 77 (77)
+....+-|+++|++++++|
T Consensus 12 c~ka~~~L~~~~i~~~~id 30 (105)
T cd02977 12 SRKALAWLEEHGIEYEFID 30 (105)
T ss_pred HHHHHHHHHHcCCCcEEEe
Confidence 4555667788999998764
No 299
>PF07819 PGAP1: PGAP1-like protein; InterPro: IPR012908 The sequences found in this family are similar to PGAP1 (Q765A7 from SWISSPROT). This is an endoplasmic reticulum membrane protein with a catalytic serine-containing motif that is conserved in a number of lipases. PGAP1 functions as a GPI inositol-deacylase; this deacylation is important for the efficient transport of GPI-anchored proteins from the endoplasmic reticulum to the Golgi body [].; GO: 0016788 hydrolase activity, acting on ester bonds, 0006505 GPI anchor metabolic process, 0006886 intracellular protein transport, 0031227 intrinsic to endoplasmic reticulum membrane
Probab=23.27 E-value=1.4e+02 Score=19.25 Aligned_cols=24 Identities=38% Similarity=0.520 Sum_probs=17.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
+.|++++||.. =.++|...++..+
T Consensus 4 g~pVlFIhG~~---Gs~~q~rsl~~~~ 27 (225)
T PF07819_consen 4 GIPVLFIHGNA---GSYKQVRSLASEL 27 (225)
T ss_pred CCEEEEECcCC---CCHhHHHHHHHHH
Confidence 68999999943 2366777777766
No 300
>PRK11340 phosphodiesterase YaeI; Provisional
Probab=23.27 E-value=94 Score=20.44 Aligned_cols=32 Identities=19% Similarity=0.298 Sum_probs=22.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
.|++.+.|..|..........+.+.|++.|+.
T Consensus 113 ~pv~~V~GNHD~~~~~~~~~~~~~~l~~~gi~ 144 (271)
T PRK11340 113 APTFACFGNHDRPVGTEKNHLIGETLKSAGIT 144 (271)
T ss_pred CCEEEecCCCCcccCccchHHHHHHHHhcCcE
Confidence 68999999999765443334566777777763
No 301
>PF00070 Pyr_redox: Pyridine nucleotide-disulphide oxidoreductase; InterPro: IPR001327 FAD flavoproteins belonging to the family of pyridine nucleotide-disulphide oxidoreductases (glutathione reductase, trypanothione reductase, lipoamide dehydrogenase, mercuric reductase, thioredoxin reductase, alkyl hydroperoxide reductase) share sequence similarity with a number of other flavoprotein oxidoreductases, in particular with ferredoxin-NAD+ reductases involved in oxidative metabolism of a variety of hydrocarbons (rubredoxin reductase, putidaredoxin reductase, terpredoxin reductase, ferredoxin-NAD+ reductase components of benzene 1,2-dioxygenase, toluene 1,2-dioxygenase, chlorobenzene dioxygenase, biphenyl dioxygenase), NADH oxidase and NADH peroxidase [, , ]. Comparison of the crystal structures of human glutathione reductase and Escherichia coli thioredoxin reductase reveals different locations of their active sites, suggesting that the enzymes diverged from an ancestral FAD/NAD(P)H reductase and acquired their disulphide reductase activities independently []. Despite functional similarities, oxidoreductases of this family show no sequence similarity with adrenodoxin reductases [] and flavoprotein pyridine nucleotide cytochrome reductases (FPNCR) []. Assuming that disulphide reductase activity emerged later, during divergent evolution, the family can be referred to as FAD-dependent pyridine nucleotide reductases, FADPNR. To date, 3D structures of glutathione reductase [], thioredoxin reductase [], mercuric reductase [], lipoamide dehydrogenase [], trypanothione reductase [] and NADH peroxidase [] have been solved. The enzymes share similar tertiary structures based on a doubly-wound alpha/beta fold, but the relative orientations of their FAD- and NAD(P)H-binding domains may vary significantly. By contrast with the FPNCR family, the folds of the FAD- and NAD(P)H-binding domains are similar, suggesting that the domains evolved by gene duplication []. This entry describes a small NADH binding domain within a larger FAD binding domain described by IPR023753 from INTERPRO. It is found in both class I and class II oxidoreductases. ; GO: 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 1ZKQ_A 3DGZ_A 1ZDL_A 2R9Z_B 2RAB_A 2A87_B 1M6I_A 2YVG_A 2GR1_A 2GQW_A ....
Probab=23.08 E-value=74 Score=16.54 Aligned_cols=19 Identities=21% Similarity=0.013 Sum_probs=13.8
Q ss_pred HHHHHHHHHHhcCCCceee
Q psy7765 58 QSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~ 76 (77)
-+.+++..|.+.|..+.++
T Consensus 10 ig~E~A~~l~~~g~~vtli 28 (80)
T PF00070_consen 10 IGIELAEALAELGKEVTLI 28 (80)
T ss_dssp HHHHHHHHHHHTTSEEEEE
T ss_pred HHHHHHHHHHHhCcEEEEE
Confidence 3667888888888777654
No 302
>TIGR00823 EIIA-LAC phosphotransferase system enzyme II, lactose-specific, factor III. operon. While the Lac permeases consist of two polypeptide chains (IIA and IICB), the Chb permease of E. coli consists of three (IIA, IIB and IIC). In B. subtilis, a PTS permease similar to the Chb permease of E. coli is believed to transport lichenan (a b-1,3;1,4 glucan) degradation products, oligosaccharides of 2-4 glucose units. This model is specific for the IIA subunit of the Lac PTS family.
Probab=23.00 E-value=53 Score=18.72 Aligned_cols=28 Identities=14% Similarity=0.105 Sum_probs=21.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
...+|++|+ .|......-...+++.+.+
T Consensus 68 ~~slLlvHA-QDhLMta~~~~~la~elI~ 95 (99)
T TIGR00823 68 EVSLTMVHA-QDHLMTTMLLKDLIAELIE 95 (99)
T ss_pred ccceeehhH-HHHHHHHHHHHHHHHHHHH
Confidence 477888998 7888877777777777654
No 303
>cd00738 HGTP_anticodon HGTP anticodon binding domain, as found at the C-terminus of histidyl, glycyl, threonyl and prolyl tRNA synthetases, which are classified as a group of class II aminoacyl-tRNA synthetases (aaRS). In aaRSs, the anticodon binding domain is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. This domain is also found in the accessory subunit of mitochondrial polymerase gamma (Pol gamma b).
Probab=22.97 E-value=1.3e+02 Score=15.67 Aligned_cols=20 Identities=25% Similarity=0.094 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHhcCCCcee
Q psy7765 56 YQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~ 75 (77)
...+..+++.|++.|+.+++
T Consensus 17 ~~~a~~~~~~Lr~~g~~v~~ 36 (94)
T cd00738 17 REYAQKLLNALLANGIRVLY 36 (94)
T ss_pred HHHHHHHHHHHHHCCCEEEe
Confidence 45677788888888887764
No 304
>cd03785 GT1_MurG MurG is an N-acetylglucosaminyltransferase, the last enzyme involved in the intracellular phase of peptidoglycan biosynthesis. It transfers N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the C4 hydroxyl of a lipid-linked N-acetylmuramoyl pentapeptide (NAM). The resulting disaccharide is then transported across the cell membrane, where it is polymerized into NAG-NAM cell-wall repeat structure. MurG belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains, each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility.
Probab=22.95 E-value=1.6e+02 Score=19.35 Aligned_cols=22 Identities=18% Similarity=0.104 Sum_probs=14.3
Q ss_pred cCHHHHHHHHHHHHhcCCCcee
Q psy7765 54 VHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 54 Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
-+...+..++++|.+.|-++.+
T Consensus 11 G~~~~~~~la~~l~~~G~ev~v 32 (350)
T cd03785 11 GHIFPALALAEELRERGAEVLF 32 (350)
T ss_pred hhhhHHHHHHHHHHhCCCEEEE
Confidence 5555666777777777665544
No 305
>PRK04169 geranylgeranylglyceryl phosphate synthase-like protein; Reviewed
Probab=22.92 E-value=2.3e+02 Score=18.66 Aligned_cols=38 Identities=13% Similarity=0.148 Sum_probs=26.7
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
.+... .+..+++-|+.+.. .++..++.+++++..+|+-
T Consensus 27 ~~~~~-gtdai~vGGS~~vt--~~~~~~~v~~ik~~~lPvi 64 (232)
T PRK04169 27 AICES-GTDAIIVGGSDGVT--EENVDELVKAIKEYDLPVI 64 (232)
T ss_pred HHHhc-CCCEEEEcCCCccc--hHHHHHHHHHHhcCCCCEE
Confidence 34445 57888888875433 6778888899888777654
No 306
>PRK10853 putative reductase; Provisional
Probab=22.92 E-value=1.2e+02 Score=17.54 Aligned_cols=19 Identities=26% Similarity=0.155 Sum_probs=14.4
Q ss_pred HHHHHHHHHhcCCCceeeC
Q psy7765 59 SMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 59 s~~l~~al~~~g~~~~~~~ 77 (77)
+..--+-|.++|++++++|
T Consensus 13 ~rkA~~~L~~~~i~~~~~d 31 (118)
T PRK10853 13 IKKARRWLEAQGIDYRFHD 31 (118)
T ss_pred HHHHHHHHHHcCCCcEEee
Confidence 4555566788999999875
No 307
>PRK05772 translation initiation factor IF-2B subunit alpha; Provisional
Probab=22.89 E-value=1e+02 Score=21.73 Aligned_cols=42 Identities=12% Similarity=0.051 Sum_probs=27.5
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHHH-HHHHHhcCCCceee
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMML-AKALEAADIFFYQQ 76 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l-~~al~~~g~~~~~~ 76 (77)
++..+... ...+-++ -+..-|..|..++ +..|.+.|+++.++
T Consensus 191 ~i~~a~~~-gk~f~V~---v~EsRP~~qG~rlta~eL~~~GIpvtlI 233 (363)
T PRK05772 191 PVKLAKAL-GMSVSVI---APETRPWLQGSRLTVYELMEEGIKVTLI 233 (363)
T ss_pred HHHHHHHC-CCeEEEE---ECCCCccchhHHHHHHHHHHCCCCEEEE
Confidence 34444444 3444444 3456777887765 78999999999875
No 308
>cd00215 PTS_IIA_lac PTS_IIA, PTS system, lactose/cellobiose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion. Separation into subunits is thought to occur after phosphorylation.
Probab=22.87 E-value=56 Score=18.55 Aligned_cols=28 Identities=25% Similarity=0.256 Sum_probs=21.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHh
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEA 68 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~ 68 (77)
...+|++|+ .|......-...+++.+.+
T Consensus 66 ~~slLlvHA-qDhLMta~~~~~la~e~I~ 93 (97)
T cd00215 66 EVSLLLVHA-QDHLMTTITLKDLAKEMIE 93 (97)
T ss_pred ccceeehhH-HHHHHHHHHHHHHHHHHHH
Confidence 478899998 7887777777777776654
No 309
>COG1509 KamA Lysine 2,3-aminomutase [Amino acid transport and metabolism]
Probab=22.81 E-value=68 Score=22.79 Aligned_cols=30 Identities=20% Similarity=0.330 Sum_probs=25.7
Q ss_pred CCcEEEEE-----eCCCCccCHHHHHHHHHHHHhc
Q psy7765 40 NKTFLLMH-----GTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 40 ~~p~ll~h-----G~~D~~Vp~~~s~~l~~al~~~ 69 (77)
-.|..+.| |+++-.||.+.+.++.++|+..
T Consensus 281 V~PYYl~~~D~~~G~~hfr~~i~~~~~i~~~lr~~ 315 (369)
T COG1509 281 VKPYYLHQLDLVQGAAHFRVPIAEGLQIVEELRGR 315 (369)
T ss_pred CcceEEeccCccCCccceeccHHHHHHHHHHHHHh
Confidence 36777765 8899999999999999999875
No 310
>cd03034 ArsC_ArsC Arsenate Reductase (ArsC) family, ArsC subfamily; arsenic reductases similar to that encoded by arsC on the R733 plasmid of Escherichia coli. E. coli ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], the first step in the detoxification of arsenic, using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX). ArsC contains a single catalytic cysteine, within a thioredoxin fold, that forms a covalent thiolate-As(V) intermediate, which is reduced by GRX through a mixed GSH-arsenate intermediate. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases.
Probab=22.65 E-value=1.1e+02 Score=17.39 Aligned_cols=19 Identities=26% Similarity=-0.045 Sum_probs=14.3
Q ss_pred HHHHHHHHHhcCCCceeeC
Q psy7765 59 SMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 59 s~~l~~al~~~g~~~~~~~ 77 (77)
+.+-.+-|+++|++++++|
T Consensus 12 ~rkA~~~L~~~~i~~~~~d 30 (112)
T cd03034 12 SRNALALLEEAGIEPEIVE 30 (112)
T ss_pred HHHHHHHHHHCCCCeEEEe
Confidence 4455667888999998875
No 311
>COG0859 RfaF ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane]
Probab=22.55 E-value=2.4e+02 Score=19.06 Aligned_cols=33 Identities=12% Similarity=0.167 Sum_probs=26.8
Q ss_pred CcEEEEEeC----CCCccCHHHHHHHHHHHHhcCCCc
Q psy7765 41 KTFLLMHGT----ADDNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 41 ~p~ll~hG~----~D~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
.|.++++.. .+.+-|.++-.++++.|.+.|..+
T Consensus 175 ~~~i~i~pg~s~~~~K~wp~e~~~~l~~~l~~~~~~V 211 (334)
T COG0859 175 RPYIVINPGASRGSAKRWPLEHYAELAELLIAKGYQV 211 (334)
T ss_pred CCeEEEeccccccccCCCCHHHHHHHHHHHHHCCCEE
Confidence 466666655 789999999999999999999443
No 312
>PF12010 DUF3502: Domain of unknown function (DUF3502); InterPro: IPR022627 This domain is about 140 amino acids in length and is functionally uncharacterised. It is found in bacteria C-terminal to PF01547 from PFAM.
Probab=22.48 E-value=39 Score=20.04 Aligned_cols=25 Identities=28% Similarity=0.203 Sum_probs=18.6
Q ss_pred EEEeCCCCccCHHHHHHHHHHHHhcCCC
Q psy7765 45 LMHGTADDNVHYQQSMMLAKALEAADIF 72 (77)
Q Consensus 45 l~hG~~D~~Vp~~~s~~l~~al~~~g~~ 72 (77)
|..|.-| |-....+|.+.|+++|++
T Consensus 92 L~~G~vd---~e~~~~~~~~kLk~AGid 116 (134)
T PF12010_consen 92 LETGLVD---PEEALPEFNEKLKAAGID 116 (134)
T ss_pred HHccCCC---HHHHHHHHHHHHHHhChH
Confidence 5567777 556677799999999863
No 313
>cd03033 ArsC_15kD Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC. It is encoded by an ORF present in a gene cluster associated with nitrogen fixation that also encodes dinitrogenase reductase ADP-ribosyltransferase (DRAT) and dinitrogenase reductase activating glycohydrolase (DRAG). ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via glutaredoxin, through a single catalytic cysteine.
Probab=22.44 E-value=1.3e+02 Score=17.31 Aligned_cols=19 Identities=26% Similarity=0.062 Sum_probs=14.3
Q ss_pred HHHHHHHHHhcCCCceeeC
Q psy7765 59 SMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 59 s~~l~~al~~~g~~~~~~~ 77 (77)
+..--+-|.++|++++++|
T Consensus 13 crkA~~~L~~~gi~~~~~d 31 (113)
T cd03033 13 NARQKALLEAAGHEVEVRD 31 (113)
T ss_pred HHHHHHHHHHcCCCcEEee
Confidence 4455566889999999875
No 314
>cd03041 GST_N_2GST_N GST_N family, 2 repeats of the N-terminal domain of soluble GSTs (2 GST_N) subfamily; composed of uncharacterized proteins. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains.
Probab=22.42 E-value=1.1e+02 Score=15.66 Aligned_cols=19 Identities=5% Similarity=0.018 Sum_probs=14.9
Q ss_pred HHHHHHHHHHhcCCCceee
Q psy7765 58 QSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~ 76 (77)
.+.+...+|...|++++.+
T Consensus 12 ~~~kv~~~L~~~gi~y~~~ 30 (77)
T cd03041 12 FCRLVREVLTELELDVILY 30 (77)
T ss_pred hHHHHHHHHHHcCCcEEEE
Confidence 4667777888999988765
No 315
>PRK15324 type III secretion system lipoprotein PrgK; Provisional
Probab=22.38 E-value=1.3e+02 Score=20.11 Aligned_cols=22 Identities=14% Similarity=0.049 Sum_probs=19.1
Q ss_pred CHHHHHHHHHHHHhcCCCceee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~~ 76 (77)
..+++-++...|.++|++++..
T Consensus 28 ~~~dAneIv~~L~~~gI~y~~~ 49 (252)
T PRK15324 28 DQEQANEVIAVLQMHNIEANKI 49 (252)
T ss_pred CHHHHHHHHHHHHHCCCCeEec
Confidence 4678999999999999999863
No 316
>TIGR00512 salvage_mtnA S-methyl-5-thioribose-1-phosphate isomerase. The delineation of this family was based in part on a discussion and neighbor-joining phylogenetic study, by Kyrpides and Woese, of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. This clade is now recognized to include the methionine salvage pathway enzyme MtnA.
Probab=22.26 E-value=1.4e+02 Score=20.85 Aligned_cols=42 Identities=14% Similarity=-0.074 Sum_probs=26.8
Q ss_pred hhHHhccCCCCcEEEEEeCCCCccCHHHHHH-HHHHHHhcCCCceee
Q psy7765 31 VTRKVGNFVNKTFLLMHGTADDNVHYQQSMM-LAKALEAADIFFYQQ 76 (77)
Q Consensus 31 p~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~-l~~al~~~g~~~~~~ 76 (77)
.+..+... ....-++-. ..-|..|..+ ++..|.+.|+++.++
T Consensus 170 ~l~~A~~~-g~~~~V~v~---EsrP~~qG~rlta~~L~~~GI~vtlI 212 (331)
T TIGR00512 170 VIRSAHEK-GRLEHVYAD---ETRPRLQGARLTAWELVQEGIPATLI 212 (331)
T ss_pred HHHHHHHc-CCceEEEEC---CCCchhhHHHHHHHHHHHCCCCEEEE
Confidence 44444433 344444433 4467778775 589999999999875
No 317
>PRK14105 selenophosphate synthetase; Provisional
Probab=22.22 E-value=1.2e+02 Score=20.78 Aligned_cols=23 Identities=17% Similarity=0.329 Sum_probs=19.4
Q ss_pred ccCHHHHHHHHHHHHhcCCCcee
Q psy7765 53 NVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 53 ~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
.||++++..+.+.|++.|+++..
T Consensus 300 tv~~e~~~~~~~~l~~~g~~~~~ 322 (345)
T PRK14105 300 SVKPEYKDKLIDKLEKNNVYAFE 322 (345)
T ss_pred EecHHHHHHHHHHHHhCCCCceE
Confidence 57889999999999999887654
No 318
>PRK00943 selenophosphate synthetase; Provisional
Probab=22.14 E-value=1.2e+02 Score=20.81 Aligned_cols=24 Identities=8% Similarity=-0.101 Sum_probs=20.3
Q ss_pred CccCHHHHHHHHHHHHhcCCCcee
Q psy7765 52 DNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 52 ~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
-.||++.+..+.++|++.|+++..
T Consensus 307 ~~v~~~~a~~v~~~l~~~g~~a~v 330 (347)
T PRK00943 307 VAVAPEAEAEVLAIAAEHGIELAA 330 (347)
T ss_pred EEeCHHHHHHHHHHHHhcCCCeEE
Confidence 457899999999999999987654
No 319
>cd03412 CbiK_N Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), N-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases and is a homomeric enzyme that does not require ATP for its enzymatic activity.
Probab=22.11 E-value=1.2e+02 Score=17.75 Aligned_cols=26 Identities=19% Similarity=0.199 Sum_probs=16.8
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
+++-||+.|+ -.-+.-..+.+.+++.
T Consensus 4 llv~fGS~~~-~~~~~~~~i~~~l~~~ 29 (127)
T cd03412 4 LLVSFGTSYP-TAEKTIDAIEDKVRAA 29 (127)
T ss_pred EEEeCCCCCH-HHHHHHHHHHHHHHHH
Confidence 6788999888 3333455566666554
No 320
>cd04252 AAK_NAGK-fArgBP AAK_NAGK-fArgBP: N-Acetyl-L-glutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (fArgBP). The nuclear-encoded, mitochondrial polyprotein precursor with an N-terminal NAGK (ArgB) domain (this CD), a central DUF619 domain, and a C-terminal reductase domain (ArgC, N-Acetylglutamate Phosphate Reductase, NAGPR). The precursor is cleaved in the mitochondria into two distinct enzymes (NAGK-DUF619 and NAGPR). Native molecular weights of these proteins indicate that the kinase is an octamer whereas the reductase is a dimer. This CD also includes some gamma-proteobacteria (Xanthomonas and Xylella) NAG kinases with an N-terminal NAGK (ArgB) domain (this CD) and a C-terminal DUF619 domain. The DUF619 domain is described as a putative distant homolog of the acetyltransferase, ArgA, predicted to function in NAG synthase association in fungi. Eukaryotic sequences have an N-terminal mitochondrial transit peptide. Members of this NAG kinase domain CD belong to th
Probab=21.94 E-value=1.9e+02 Score=18.79 Aligned_cols=15 Identities=13% Similarity=0.326 Sum_probs=9.4
Q ss_pred hccCCCCcEEEEEeCC
Q psy7765 35 VGNFVNKTFLLMHGTA 50 (77)
Q Consensus 35 ~~~~~~~p~ll~hG~~ 50 (77)
+.+. ..+..|+||..
T Consensus 24 l~~~-g~~~VlVHGgg 38 (248)
T cd04252 24 LQHV-GLYPIVVHGAG 38 (248)
T ss_pred HHHC-CCcEEEEeCCC
Confidence 3344 34558999974
No 321
>TIGR01768 GGGP-family geranylgeranylglyceryl phosphate synthase family protein. This model represents a family of sequences including geranylgeranylglyceryl phosphate synthase which catalyzes the first committed step in the synthesis of ether-linked membrane lipids in archaea. The clade of bacterial sequences may have the same function or a closely related function. This model supercedes TIGR00265, which has been retired.
Probab=21.94 E-value=2.2e+02 Score=18.67 Aligned_cols=38 Identities=18% Similarity=0.287 Sum_probs=27.0
Q ss_pred HhccCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 34 KVGNFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 34 ~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
.+... .+..+++-|+. .|..++..++.+++++..+|+-
T Consensus 22 ~~~~~-gtdai~vGGS~--~vt~~~~~~~v~~ik~~~lPvi 59 (223)
T TIGR01768 22 AAAES-GTDAILIGGSQ--GVTYEKTDTLIEALRRYGLPII 59 (223)
T ss_pred HHHhc-CCCEEEEcCCC--cccHHHHHHHHHHHhccCCCEE
Confidence 34445 57888888874 3555688888888888877764
No 322
>COG1334 FlaG Uncharacterized flagellar protein FlaG [Cell motility and secretion]
Probab=21.86 E-value=97 Score=18.49 Aligned_cols=17 Identities=12% Similarity=0.262 Sum_probs=15.0
Q ss_pred ccCHHHHHHHHHHHHhc
Q psy7765 53 NVHYQQSMMLAKALEAA 69 (77)
Q Consensus 53 ~Vp~~~s~~l~~al~~~ 69 (77)
.+|++...+|+++|++.
T Consensus 95 qIPpee~L~l~~r~~d~ 111 (120)
T COG1334 95 QIPPEEALELAARMRDV 111 (120)
T ss_pred hCChHHHHHHHHHHHHh
Confidence 46999999999999875
No 323
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=21.81 E-value=1.6e+02 Score=16.84 Aligned_cols=22 Identities=9% Similarity=0.055 Sum_probs=16.7
Q ss_pred CHHHHHHHHHHHHhcCCCceee
Q psy7765 55 HYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 55 p~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+..+..|++-|+..|+++++.
T Consensus 9 n~r~AqaF~DYl~sqgI~~~i~ 30 (101)
T PF12122_consen 9 NPRAAQAFIDYLASQGIELQIE 30 (101)
T ss_dssp SHHHHHHHHHHHHHTT--EEEE
T ss_pred CHHHHHHHHHHHHHCCCeEEEE
Confidence 4667889999999999988764
No 324
>TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family. Members of this protein family are most closely related to TIGR02764, a subset of polysaccharide deacetylase family proteins found in a species if and only if the species forms endospores like those of Bacillus subtilis or Clostridium tetani. This family is likewise restricted to spore-formers, but is not universal among them in having sequences with full-length matches to the model.
Probab=21.66 E-value=2.6e+02 Score=18.64 Aligned_cols=54 Identities=6% Similarity=0.047 Sum_probs=32.2
Q ss_pred CCcccccccChhHHhccCCCCcEEEEEeCCCCcc---------CHHHHHHHHHHHHhcCCCceee
Q psy7765 21 DNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNV---------HYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 21 ~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V---------p~~~s~~l~~al~~~g~~~~~~ 76 (77)
+..-.|++.+|.-+++.+ .|-.+..|..+..+ ....+.++...|++.|++++|.
T Consensus 55 ~~~~~~~~~~~~~~l~~l--~~~~~~~g~~~~k~VaLTFDdg~~~~~t~~iL~iLkk~~vkATFF 117 (268)
T TIGR02873 55 EKLLVFDEVSPSVHLDDL--PPSPIYRGHPEKPMVALLINVAWGNEYLPEILQILKKHDVKATFF 117 (268)
T ss_pred ccccceeccCCCcchhhc--CCccEEecCCCCCEEEEEEeCCCCcchHHHHHHHHHHCCCCEEEE
Confidence 333456666666666655 22223334333221 2356779999999999998874
No 325
>PF02082 Rrf2: Transcriptional regulator; InterPro: IPR000944 The following uncharacterised bacterial proteins have been shown to be evolutionary related, Desulfovibrio vulgaris protein Rrf2; Escherichia coli hypothetical proteins yfhP and yjeB; Bacillus subtilis hypothetical proteins yhdE, yrzC and ywgB; Mycobacterium tuberculosis hypothetical protein Rv1287; and Synechocystis sp. (strain PCC 6803) hypothetical protein slr0846. These are small proteins of 12 to 18kDa which seem to contain a signal sequence, and may represent a family of probable transcriptional regulators.; PDB: 3T8T_A 3T8R_A 3K69_A 3LWF_C 1XD7_A 2Y75_E 1YLF_C.
Probab=21.63 E-value=1.4e+02 Score=15.88 Aligned_cols=20 Identities=10% Similarity=0.041 Sum_probs=16.0
Q ss_pred CccCHHHHHHHHHHHHhcCC
Q psy7765 52 DNVHYQQSMMLAKALEAADI 71 (77)
Q Consensus 52 ~~Vp~~~s~~l~~al~~~g~ 71 (77)
..+|+....++.+.|+++|.
T Consensus 35 ~~i~~~~l~kil~~L~~~Gl 54 (83)
T PF02082_consen 35 LGISPSYLRKILQKLKKAGL 54 (83)
T ss_dssp HTS-HHHHHHHHHHHHHTTS
T ss_pred HCcCHHHHHHHHHHHhhCCe
Confidence 35788889999999999886
No 326
>cd04906 ACT_ThrD-I_1 First of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes the first of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=21.62 E-value=1.5e+02 Score=15.84 Aligned_cols=20 Identities=15% Similarity=0.072 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHhcCCCcee
Q psy7765 56 YQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~ 75 (77)
.++..++.++|+++|..+..
T Consensus 52 ~~~~~~i~~~L~~~G~~~~~ 71 (85)
T cd04906 52 AEELAELLEDLKSAGYEVVD 71 (85)
T ss_pred HHHHHHHHHHHHHCCCCeEE
Confidence 67889999999999988754
No 327
>PRK10494 hypothetical protein; Provisional
Probab=21.59 E-value=1.8e+02 Score=19.32 Aligned_cols=34 Identities=12% Similarity=0.050 Sum_probs=23.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
.+.+|+-|..-..-+..++..+.+.+.+.|++.+
T Consensus 121 ~~~ii~SGg~~~~~~~sEA~~~~~~l~~lGVp~~ 154 (259)
T PRK10494 121 GAKLIFTGGAAKTNTVSTAEVGARVAQSLGVPRE 154 (259)
T ss_pred CCEEEEECCCCCCCCCCHHHHHHHHHHHcCCCHH
Confidence 4555666643222256688999999999999864
No 328
>PRK12464 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional
Probab=21.54 E-value=49 Score=23.56 Aligned_cols=65 Identities=15% Similarity=0.135 Sum_probs=45.0
Q ss_pred CccchhhhhhhhhhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 1 MFWITNSIYTERYMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 1 ~w~~~~~~~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
|+|-+|-+-+-.++|.|+ .+-.-.....|.++..=.+...-++..-|.-|..+|++.+..+=+++
T Consensus 209 mNKglEvIEA~~LF~i~~-d~I~vvIHPqSiVHsmVef~DGSv~aql~~pDMr~PI~yAL~yP~R~ 273 (383)
T PRK12464 209 MNKGFEVIEAHWLFDIPY-EKIDVLIHKESIIHSLVEFIDGSVLAQLGAPDMRMPIQYAFHYPTRL 273 (383)
T ss_pred hhhhHHHHHHHHHcCCCH-HHeEEEECCCCceeEEEEEcCccEEEEeCCCCcHHHHHHHcCCcccc
Confidence 567778777777889874 33334444556665444444678888999999999999988655544
No 329
>cd03054 GST_N_Metaxin GST_N family, Metaxin subfamily; composed of metaxins and related proteins. Metaxin 1 is a component of a preprotein import complex of the mitochondrial outer membrane. It extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. In humans, alterations in the metaxin gene may be associated with Gaucher disease. Metaxin 2 binds to metaxin 1 and may also play a role in protein translocation into the mitochondria. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals. Sequence analysis suggests that all three metaxins share a common ancestry and that they possess similarity to GSTs. Also included in the subfamily are uncharacterized proteins with similarity to metaxins, including a novel GST from Rhodococcus with toluene o-monooxygenase and glutamylcysteine synthetase activities.
Probab=21.53 E-value=1.3e+02 Score=15.09 Aligned_cols=20 Identities=15% Similarity=0.144 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHhcCCCceee
Q psy7765 57 QQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 57 ~~s~~l~~al~~~g~~~~~~ 76 (77)
..+.++...|+..|++++.+
T Consensus 17 p~~~~v~~~L~~~~i~~~~~ 36 (72)
T cd03054 17 PECLKVETYLRMAGIPYEVV 36 (72)
T ss_pred HHHHHHHHHHHhCCCceEEE
Confidence 45666777778888888764
No 330
>PF04282 DUF438: Family of unknown function (DUF438); InterPro: IPR007380 This is a a group of uncharacterised proteins.
Probab=21.50 E-value=1.3e+02 Score=16.13 Aligned_cols=22 Identities=9% Similarity=-0.016 Sum_probs=16.2
Q ss_pred ccCHHHHHHHHHHHHhcCCCce
Q psy7765 53 NVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 53 ~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
.|.+.....+-+.|.+.|++++
T Consensus 28 ~Vs~~EI~~~Eq~Li~eG~~~e 49 (71)
T PF04282_consen 28 DVSASEISAAEQELIQEGMPVE 49 (71)
T ss_pred CCCHHHHHHHHHHHHHcCCCHH
Confidence 4667777777888888887764
No 331
>PF02780 Transketolase_C: Transketolase, C-terminal domain; InterPro: IPR005476 Transketolase 2.2.1.1 from EC (TK) catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor, such as ribose 5-phosphate, to form sedoheptulose 7-phosphate and glyceraldehyde 3- phosphate. This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. TK requires thiamine pyrophosphate as a cofactor. In most sources where TK has been purified, it is a homodimer of approximately 70 Kd subunits. TK sequences from a variety of eukaryotic and prokaryotic sources [, ] show that the enzyme has been evolutionarily conserved. In the peroxisomes of methylotrophic yeast Pichia angusta (Yeast) (Hansenula polymorpha), there is a highly related enzyme, dihydroxy-acetone synthase (DHAS) 2.2.1.3 from EC (also known as formaldehyde transketolase), which exhibits a very unusual specificity by including formaldehyde amongst its substrates. 1-deoxyxylulose-5-phosphate synthase (DXP synthase) [] is an enzyme so far found in bacteria (gene dxs) and plants (gene CLA1) which catalyzes the thiamine pyrophosphoate-dependent acyloin condensation reaction between carbon atoms 2 and 3 of pyruvate and glyceraldehyde 3-phosphate to yield 1-deoxy-D- xylulose-5-phosphate (dxp), a precursor in the biosynthetic pathway to isoprenoids, thiamine (vitamin B1), and pyridoxol (vitamin B6). DXP synthase is evolutionary related to TK. The N-terminal section, contains a histidine residue which appears to function in proton transfer during catalysis []. In the central section there are conserved acidic residues that are part of the active cleft and may participate in substrate-binding []. This family includes transketolase enzymes 2.2.1.1 from EC and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit P37941 from SWISSPROT 1.2.4.4 from EC. Both these enzymes utilise thiamine pyrophosphate as a cofactor, suggesting there may be common aspects in their mechanism of catalysis.; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 1NGS_B 1TKA_A 1TRK_B 1TKB_A 1GPU_B 1AY0_B 1TKC_B 2E6K_A 3JU3_A 2R8P_B ....
Probab=21.49 E-value=1e+02 Score=17.49 Aligned_cols=21 Identities=19% Similarity=0.086 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHhcCCCceee
Q psy7765 56 YQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~~ 76 (77)
...+.+.++.|++.|+.++.+
T Consensus 21 ~~~al~A~~~L~~~Gi~~~vi 41 (124)
T PF02780_consen 21 VEEALEAAEELEEEGIKAGVI 41 (124)
T ss_dssp HHHHHHHHHHHHHTTCEEEEE
T ss_pred HHHHHHHHHHHHHcCCceeEE
Confidence 467888999999999988765
No 332
>COG0529 CysC Adenylylsulfate kinase and related kinases [Inorganic ion transport and metabolism]
Probab=21.45 E-value=2.4e+02 Score=18.30 Aligned_cols=38 Identities=16% Similarity=0.085 Sum_probs=26.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceeeC
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~~ 77 (77)
..|..||-...=..=--.-+..+.+.|.+.|..+.++|
T Consensus 21 ~~~~viW~TGLSGsGKSTiA~ale~~L~~~G~~~y~LD 58 (197)
T COG0529 21 QKGAVIWFTGLSGSGKSTIANALEEKLFAKGYHVYLLD 58 (197)
T ss_pred CCCeEEEeecCCCCCHHHHHHHHHHHHHHcCCeEEEec
Confidence 45544444444444555667789999999999998876
No 333
>PF00258 Flavodoxin_1: Flavodoxin; InterPro: IPR008254 This domain is found in a number of proteins including flavodoxin and nitric-oxide synthase. Flavodoxins are electron-transfer proteins that function in various electron transport systems. They bind one FMN molecule, which serves as a redox-active prosthetic group [] and are functionally interchangeable with ferredoxins. They have been isolated from prokaryotes, cyanobacteria, and some eukaryotic algae. Nitric oxide synthase (1.14.13.39 from EC) produces nitric oxide from L-arginie and NADPH. Nitric oxide acts as a messenger molecule in the body.; GO: 0010181 FMN binding, 0016491 oxidoreductase activity; PDB: 2WC1_A 2FVX_A 2FOX_A 6NUL_A 1FVX_A 2FAX_A 1FLN_A 1FLA_A 4NLL_A 2FDX_A ....
Probab=21.41 E-value=77 Score=18.24 Aligned_cols=18 Identities=11% Similarity=0.071 Sum_probs=13.6
Q ss_pred HHHHHHHHHhcCCCceee
Q psy7765 59 SMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 59 s~~l~~al~~~g~~~~~~ 76 (77)
+..+++.|+++|++++.+
T Consensus 14 A~~ia~~l~~~g~~~~~~ 31 (143)
T PF00258_consen 14 AEAIAEGLRERGVEVRVV 31 (143)
T ss_dssp HHHHHHHHHHTTSEEEEE
T ss_pred HHHHHHHHHHcCCceeee
Confidence 566788888888877664
No 334
>PF13362 Toprim_3: Toprim domain
Probab=21.34 E-value=1.5e+02 Score=15.94 Aligned_cols=37 Identities=19% Similarity=0.113 Sum_probs=27.6
Q ss_pred CCcEEEEEeCCCCccC-HHHHHHHHHHHHhcCCCceee
Q psy7765 40 NKTFLLMHGTADDNVH-YQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp-~~~s~~l~~al~~~g~~~~~~ 76 (77)
..+-+++-++.|..-. ...+..+++++.+.|+.+..+
T Consensus 40 ~~~~vii~~D~D~~~~G~~~a~~~~~~~~~~g~~~~~~ 77 (96)
T PF13362_consen 40 PGRRVIIAADNDKANEGQKAAEKAAERLEAAGIAVSIV 77 (96)
T ss_pred CCCeEEEEECCCCchhhHHHHHHHHHHHHhCCCeEEEE
Confidence 3555688888887743 455777899999999988764
No 335
>TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase. RL J Bacteriol 1993 Mar;175(6):1841-3
Probab=21.29 E-value=1.2e+02 Score=19.84 Aligned_cols=30 Identities=13% Similarity=0.257 Sum_probs=16.0
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHhcCCCce
Q psy7765 43 FLLMHGTADDNVHYQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 43 ~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~ 74 (77)
++|+-|..-..+. .+..++++|.++|-++.
T Consensus 3 i~~~~g~~~g~~~--~~~~La~~L~~~g~eV~ 32 (348)
T TIGR01133 3 VVLAAGGTGGHIF--PALAVAEELIKRGVEVL 32 (348)
T ss_pred EEEEeCccHHHHh--HHHHHHHHHHhCCCEEE
Confidence 4555554433222 44566777776665544
No 336
>PRK07094 biotin synthase; Provisional
Probab=20.98 E-value=2.7e+02 Score=18.61 Aligned_cols=34 Identities=9% Similarity=0.151 Sum_probs=25.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc-CCCc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA-DIFF 73 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~-g~~~ 73 (77)
....+.+.|..|...+.....++.+++++. ++.+
T Consensus 86 g~~~i~l~gG~~~~~~~~~l~~l~~~i~~~~~l~i 120 (323)
T PRK07094 86 GYRTIVLQSGEDPYYTDEKIADIIKEIKKELDVAI 120 (323)
T ss_pred CCCEEEEecCCCCCCCHHHHHHHHHHHHccCCceE
Confidence 466777777777777888888999998884 5543
No 337
>PF03652 UPF0081: Uncharacterised protein family (UPF0081); InterPro: IPR005227 Holliday junction resolvases (HJRs) are key enzymes of DNA recombination. The principal HJRs are now known or confidently predicted for all bacteria and archaea whose genomes have been completely sequenced, with many species encoding multiple potential HJRs. Structural and evolutionary relationships of HJRs and related nucleases suggests that the HJR function has evolved independently from at least four distinct structural folds, namely RNase H, endonuclease, endonuclease VII-colicin E and RusA (IPR008822 from INTERPRO): The endonuclease fold, whose structural prototypes are the phage exonuclease, the very short patch repair nuclease (Vsr) and type II restriction enzymes, is shown to encompass by far a greater diversity of nucleases than previously suspected. This fold unifies archaeal HJRs (IPR002732 from INTERPRO), repair nucleases such as RecB (IPR004586 from INTERPRO) and Vsr (IPR004603 from INTERPRO), restriction enzymes and a variety of predicted nucleases whose specific activities remain to be determined. The RNase H fold characterises the RuvC family (IPR002176 from INTERPRO), which is nearly ubiquitous in bacteria, and in addition the YqgF family (IPR005227 from INTERPRO). The proteins of this family, typified by Escherichia coli YqgF, are likely to function as an alternative to RuvC in most bacteria, but could be the principal HJRs in low-GC Gram-positive bacteria and Aquifex. Endonuclease VII of phage T4 (IPR004211 from INTERPRO) is shown to serve as a structural template for many nucleases, including McrA and other type II restriction enzymes. Together with colicin E7, endonuclease VII defines a distinct metal-dependent nuclease fold. Horizontal gene transfer, lineage-specific gene loss and gene family expansion, and non-orthologous gene displacement seem to have been major forces in the evolution of HJRs and related nucleases. A remarkable case of displacement is seen in the Lyme disease spirochete Borrelia burgdorferi, which does not possess any of the typical HJRs, but instead encodes, in its chromosome and each of the linear plasmids, members of the exonuclease family predicted to function as HJRs. The diversity of HJRs and related nucleases in bacteria and archaea contrasts with their near absence in eukaryotes. The few detected eukaryotic representatives of the endonuclease fold and the RNase H fold have probably been acquired from bacteria via horizontal gene transfer. The identity of the principal HJR(s) involved in recombination in eukaryotes remains uncertain; this function could be performed by topoisomerase IB or by a novel, so far undetected, class of enzymes. Likely HJRs and related nucleases were identified in the genomes of numerous bacterial and eukaryotic DNA viruses. Gene flow between viral and cellular genomes has probably played a major role in the evolution of this class of enzymes. This family represents the YqgF family of putative Holliday junction resolvases. With the exception of the spirochetes, the YqgF family is represented in all bacterial lineages, including the mycoplasmas with their highly degenerate genomes. The RuvC resolvases are conspicuously absent in the low-GC Gram-positive bacterial lineage, with the exception of Ureaplasma parvum (Ureaplasma urealyticum biotype 1) (Q9PQY7 from SWISSPROT, []). Furthermore, loss of function ruvC mutants of E. coli show a residual HJR activity that cannot be ascribed to the prophage-encoded RusA resolvase []. This suggests that the YqgF family proteins could be alternative HJRs whose function partially overlaps with that of RuvC [].; GO: 0016788 hydrolase activity, acting on ester bonds, 0006281 DNA repair, 0006310 DNA recombination, 0006974 response to DNA damage stimulus, 0005737 cytoplasm; PDB: 1NU0_A 1OVQ_A 1NMN_B 1VHX_B 1IV0_A.
Probab=20.85 E-value=1.1e+02 Score=18.22 Aligned_cols=37 Identities=14% Similarity=0.105 Sum_probs=23.9
Q ss_pred CcEEEEEeC-----CCCccCHHHHHHHHHHHHhc--CCCceeeC
Q psy7765 41 KTFLLMHGT-----ADDNVHYQQSMMLAKALEAA--DIFFYQQD 77 (77)
Q Consensus 41 ~p~ll~hG~-----~D~~Vp~~~s~~l~~al~~~--g~~~~~~~ 77 (77)
.+..++-|. ....---..+.+|++.|++. ++++.++|
T Consensus 51 ~i~~iVvGlP~~~~G~~~~~~~~v~~f~~~L~~~~~~ipV~~~D 94 (135)
T PF03652_consen 51 QIDGIVVGLPLNMDGSESEQARRVRKFAEELKKRFPGIPVILVD 94 (135)
T ss_dssp CECEEEEEEEBBCTSSC-CCHHHHHHHHHHHHHHH-TSEEEEEE
T ss_pred CCCEEEEeCCcccCCCccHHHHHHHHHHHHHHHhcCCCcEEEEC
Confidence 344455443 23333456788899999887 88888765
No 338
>PRK03003 GTP-binding protein Der; Reviewed
Probab=20.85 E-value=2.6e+02 Score=20.01 Aligned_cols=34 Identities=9% Similarity=0.026 Sum_probs=24.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc----CCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA----DIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~----g~~~~~ 75 (77)
.||+|+++.. ..+|..--.-|.+.|+++ |+|+.+
T Consensus 425 ~pp~f~~~~~--~~~~~~y~~~l~~~~r~~~~~~g~pi~~ 462 (472)
T PRK03003 425 RPPTFVLFTT--GFLEAGYRRFLERRLRETFGFEGSPIRI 462 (472)
T ss_pred CCCEEEEEeC--CCCChHHHHHHHHHHHHHcCCCcceEEE
Confidence 6999999954 568877777777777764 455543
No 339
>PRK07583 cytosine deaminase-like protein; Validated
Probab=20.84 E-value=2.3e+02 Score=19.91 Aligned_cols=36 Identities=11% Similarity=-0.065 Sum_probs=28.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
..++.+.|+..-...+.....+..+.+++.|+.+-.
T Consensus 256 ~~~v~i~H~~~l~~~~~~~~~~~i~~la~~gv~vv~ 291 (438)
T PRK07583 256 EGKVTCGHCCSLAVQPEEQAQATIALVAEAGIAIVS 291 (438)
T ss_pred CCCEEEEeccchhcCCHHHHHHHHHHHHHcCCeEEE
Confidence 357899999876666777778888888998887654
No 340
>PLN02696 1-deoxy-D-xylulose-5-phosphate reductoisomerase
Probab=20.76 E-value=45 Score=24.28 Aligned_cols=65 Identities=12% Similarity=0.022 Sum_probs=45.3
Q ss_pred CccchhhhhhhhhhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 1 MFWITNSIYTERYMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 1 ~w~~~~~~~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
|+|.+|-+-+-.++|.|+ ++-.-.....|.++..=.+...-++.--|.-|.++|+..+..+=+++
T Consensus 272 mNKglEvIEA~~LF~~~~-d~I~vvIHPqSiIHsmVef~DGS~~Aql~~pDMrlPI~yAL~yP~R~ 336 (454)
T PLN02696 272 MNKGLEVIEAHYLFGADY-DDIDIVIHPQSIIHSMVETQDSSVLAQLGWPDMRLPILYTMSWPDRV 336 (454)
T ss_pred hhhhHHHHHHHHHcCCCH-HHeEEEECcCCeeeEEEEEcCCcEEEEecCCCcHHHHHHHcCCcccc
Confidence 567788777777889884 33333444456665433444688889999999999999998765554
No 341
>TIGR03594 GTPase_EngA ribosome-associated GTPase EngA. EngA (YfgK, Der) is a ribosome-associated essential GTPase with a duplication of its GTP-binding domain. It is broadly to universally distributed among bacteria. It appears to function in ribosome biogenesis or stability.
Probab=20.68 E-value=2.3e+02 Score=19.63 Aligned_cols=30 Identities=13% Similarity=0.279 Sum_probs=25.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhc
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAA 69 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~ 69 (77)
.||+|++++..=..+|..-..-|.+.+++.
T Consensus 387 ~pp~~~~~~n~~~~~~~~y~~~l~~~~~~~ 416 (429)
T TIGR03594 387 NPPTFVLFGNRPELLPFSYKRYLENQFREA 416 (429)
T ss_pred CCCEEEEEEcCcccCCHHHHHHHHHHHHHh
Confidence 699999999888889988888888888774
No 342
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=20.61 E-value=1.6e+02 Score=16.66 Aligned_cols=20 Identities=15% Similarity=0.092 Sum_probs=14.6
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
.+...-+.|.++|+++++++
T Consensus 11 ~c~ka~~~L~~~~i~~~~id 30 (111)
T cd03036 11 TCRKAKKWLDEHGVDYTAID 30 (111)
T ss_pred HHHHHHHHHHHcCCceEEec
Confidence 35555677888999998864
No 343
>TIGR03675 arCOG00543 arCOG00543 universal archaeal KH-domain/beta-lactamase-domain protein. This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782.
Probab=20.59 E-value=2.3e+02 Score=21.44 Aligned_cols=37 Identities=22% Similarity=0.290 Sum_probs=24.3
Q ss_pred ChhHHhccCCC--CcEEEEEeCCCCccCHHHHHHHHHHHHhc-CCC
Q psy7765 30 DVTRKVGNFVN--KTFLLMHGTADDNVHYQQSMMLAKALEAA-DIF 72 (77)
Q Consensus 30 sp~~~~~~~~~--~p~ll~hG~~D~~Vp~~~s~~l~~al~~~-g~~ 72 (77)
..+.++..+.. ..++++||+. ..+..|++.|++. |.+
T Consensus 578 ~L~~~v~~~~p~p~~v~lvHGe~------~~~~~la~~l~~~~~~~ 617 (630)
T TIGR03675 578 QLMNYVRRMQPKPEKILLNHGEP------SKILDLASSIYKKFNIE 617 (630)
T ss_pred HHHHHHHhcCCCCCEEEEEcCCH------HHHHHHHHHHHHHhCCc
Confidence 35666776622 3499999975 4577788887644 443
No 344
>PF11426 Tn7_TnsC_Int: Tn7 transposition regulator TnsC; InterPro: IPR021542 TnsC is a molecular switch that regulates transposition and interacts with TnsA which is a component of the transposase. The two proteins interact via the residues 504-555 on TnsC. The TnsA/TnsC interaction is very important in Tn7 transposition []. ; PDB: 1T0F_C.
Probab=20.54 E-value=19 Score=18.00 Aligned_cols=15 Identities=20% Similarity=0.290 Sum_probs=10.5
Q ss_pred HHHHHHHhcCCCcee
Q psy7765 61 MLAKALEAADIFFYQ 75 (77)
Q Consensus 61 ~l~~al~~~g~~~~~ 75 (77)
.++++|+.+|..+.+
T Consensus 26 ~~y~~Lk~~Glifdl 40 (48)
T PF11426_consen 26 SVYEALKQNGLIFDL 40 (48)
T ss_dssp -HHHHHHHTTSB--H
T ss_pred HHHHHHHHCCeeeeH
Confidence 599999999987654
No 345
>PF04244 DPRP: Deoxyribodipyrimidine photo-lyase-related protein; InterPro: IPR007357 This family appears to be related to DNA photolyases.; PDB: 3ZXS_A.
Probab=20.51 E-value=1.4e+02 Score=19.52 Aligned_cols=20 Identities=15% Similarity=0.029 Sum_probs=13.8
Q ss_pred HHHHHHHHHHhcCCCceeeC
Q psy7765 58 QSMMLAKALEAADIFFYQQD 77 (77)
Q Consensus 58 ~s~~l~~al~~~g~~~~~~~ 77 (77)
.-..|+++|+++|..|.+++
T Consensus 50 aMRhfa~~L~~~G~~V~Y~~ 69 (224)
T PF04244_consen 50 AMRHFADELRAKGFRVHYIE 69 (224)
T ss_dssp HHHHHHHHHHHTT--EEEE-
T ss_pred HHHHHHHHHHhCCCEEEEEe
Confidence 34568999999999998864
No 346
>PRK13237 tyrosine phenol-lyase; Provisional
Probab=20.35 E-value=1.1e+02 Score=22.28 Aligned_cols=19 Identities=26% Similarity=0.148 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHhcCCCce
Q psy7765 56 YQQSMMLAKALEAADIFFY 74 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~ 74 (77)
+.|+..|.++|.++|+++.
T Consensus 322 i~~~~~l~~~L~~~Gvpv~ 340 (460)
T PRK13237 322 VGQVRYLGEKLLAAGVPIV 340 (460)
T ss_pred HHHHHHHHHHHHHCCCcee
Confidence 5678888999999999875
No 347
>PF13361 UvrD_C: UvrD-like helicase C-terminal domain; PDB: 1UAA_B 3U4Q_A 3U44_A 1W36_B 3K70_B 2IS6_B 3LFU_A 2IS2_B 2IS1_B 2IS4_A ....
Probab=20.26 E-value=1.1e+02 Score=19.95 Aligned_cols=20 Identities=25% Similarity=0.190 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHhcCCCcee
Q psy7765 56 YQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 56 ~~~s~~l~~al~~~g~~~~~ 75 (77)
-.+...+.++|+++|+|+..
T Consensus 87 ~~~~~~i~~~L~~~gIp~~~ 106 (351)
T PF13361_consen 87 NSQIKEIEDALKEAGIPYRI 106 (351)
T ss_dssp GGHHHHHHHHHHHTTS-EEE
T ss_pred chhHHHHHHHHhhhcceeEe
Confidence 45677788888888888754
No 348
>PF13721 SecD-TM1: SecD export protein N-terminal TM region
Probab=20.17 E-value=1.8e+02 Score=16.41 Aligned_cols=32 Identities=19% Similarity=0.227 Sum_probs=20.0
Q ss_pred cEEEEEeC-CCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 42 TFLLMHGT-ADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 42 p~ll~hG~-~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
|.+-+.+. ....++ ...++.+.|.++|+.+.-
T Consensus 32 pAvqIs~~~~~~~~~--~~~~v~~~L~~~~I~~k~ 64 (101)
T PF13721_consen 32 PAVQISASSAGVQLP--DAFQVEQALKAAGIAVKS 64 (101)
T ss_pred CcEEEecCCCCccCC--hHHHHHHHHHHCCCCcce
Confidence 44444443 333333 346999999999998763
No 349
>PRK04940 hypothetical protein; Provisional
Probab=20.15 E-value=1.5e+02 Score=18.79 Aligned_cols=26 Identities=12% Similarity=0.146 Sum_probs=21.8
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 41 KTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 41 ~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
..++++..+.|.+.++.++.+.+...
T Consensus 125 ~r~~vllq~gDEvLDyr~a~~~y~~~ 150 (180)
T PRK04940 125 DRCLVILSRNDEVLDSQRTAEELHPY 150 (180)
T ss_pred ccEEEEEeCCCcccCHHHHHHHhccC
Confidence 45799999999999999998776543
No 350
>PF03979 Sigma70_r1_1: Sigma-70 factor, region 1.1; InterPro: IPR007127 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. This entry represents Region 1.1 which modulates DNA binding by region 2 and 4 when sigma is unbound by the core RNA polymerase [, ]. Region 1.1 is also involved in promoter binding.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2K6X_A.
Probab=20.14 E-value=1.2e+02 Score=16.32 Aligned_cols=22 Identities=23% Similarity=0.197 Sum_probs=13.2
Q ss_pred CccCHHHHHHHHHHHHhcCCCc
Q psy7765 52 DNVHYQQSMMLAKALEAADIFF 73 (77)
Q Consensus 52 ~~Vp~~~s~~l~~al~~~g~~~ 73 (77)
..+.+.+...++..|.+.|+.+
T Consensus 34 ~~~~~e~id~i~~~L~~~gI~V 55 (82)
T PF03979_consen 34 DDLDPEQIDEIYDTLEDEGIEV 55 (82)
T ss_dssp S---HHHHHHHHHHHHTT----
T ss_pred cCCCHHHHHHHHHHHHHCCCEE
Confidence 3467788999999999999865
No 351
>COG1362 LAP4 Aspartyl aminopeptidase [Amino acid transport and metabolism]
Probab=20.13 E-value=2.8e+02 Score=20.33 Aligned_cols=40 Identities=8% Similarity=0.052 Sum_probs=34.0
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCceee
Q psy7765 37 NFVNKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQQ 76 (77)
Q Consensus 37 ~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~~ 76 (77)
.+...|++-++...-..-...-...+-+.++++|++.|..
T Consensus 337 ~ln~G~vik~~an~ry~td~~~~a~~~~l~~~~~Vp~Q~f 376 (437)
T COG1362 337 KLNKGPVIKVNANQRYATDSEGIALLRKLAQKAGVPWQVF 376 (437)
T ss_pred ccCCCceEEecCCCCcccCchHHHHHHHHHHHcCCceEEE
Confidence 3436899999999988888888888999999999998863
No 352
>PRK00819 RNA 2'-phosphotransferase; Reviewed
Probab=20.11 E-value=1.2e+02 Score=19.16 Aligned_cols=12 Identities=33% Similarity=0.556 Sum_probs=9.7
Q ss_pred CCcEEEEEeCCC
Q psy7765 40 NKTFLLMHGTAD 51 (77)
Q Consensus 40 ~~p~ll~hG~~D 51 (77)
.+|-.|+|||.-
T Consensus 92 ~~P~~lyHGT~~ 103 (179)
T PRK00819 92 TPPAVLYHGTSS 103 (179)
T ss_pred CCCceeEeCCCH
Confidence 578899999953
No 353
>PF00542 Ribosomal_L12: Ribosomal protein L7/L12 C-terminal domain; InterPro: IPR013823 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This entry represents the C-terminal domain of the large subunit ribosomal proteins, known as the L7/L12 family. L7/L12 is present in each 50S subunit in four copies organised as two dimers. The L8 protein complex consisting of two dimers of L7/L12 and L10 in Escherichia coli ribosomes is assembled on the conserved region of 23 S rRNA termed the GTPase-associated domain []. The L7/L12 dimer probably interacts with EF-Tu. L7 and L12 only differ in a single post translational modification of the addition of an acetyl group to the N terminus of L7.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1DD4_B 1DD3_A 1RQU_B 2GYA_5 2GYC_5 1RQS_A 1RQV_A 1CTF_A 2XUX_L.
Probab=20.10 E-value=1.6e+02 Score=15.54 Aligned_cols=31 Identities=23% Similarity=0.183 Sum_probs=24.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHhcCCCcee
Q psy7765 40 NKTFLLMHGTADDNVHYQQSMMLAKALEAADIFFYQ 75 (77)
Q Consensus 40 ~~p~ll~hG~~D~~Vp~~~s~~l~~al~~~g~~~~~ 75 (77)
..|..|..| |+-.++..+.+.|.+.|..+++
T Consensus 37 ~~p~~ik~~-----v~keeAe~ik~~Le~aGa~v~l 67 (68)
T PF00542_consen 37 SLPKVIKEG-----VSKEEAEEIKKKLEAAGAKVEL 67 (68)
T ss_dssp TTTEEEEEE-----E-HHHHHHHHHHHHCCT-EEEE
T ss_pred hCCHHHHcC-----CCHHHHHHHHHHHHHcCCEEEe
Confidence 467777766 8889999999999999988775
No 354
>TIGR00243 Dxr 1-deoxy-D-xylulose 5-phosphate reductoisomerase. 1-deoxy-D-xylulose 5-phosphate is converted to 2-C-methyl-D-erythritol 4-phosphate in the presence of NADPH. It is involved in the synthesis of isopentenyl diphosphate (IPP), a basic building block in isoprenoid, thiamin, and pyridoxal biosynthesis.
Probab=20.03 E-value=55 Score=23.38 Aligned_cols=65 Identities=12% Similarity=0.077 Sum_probs=44.8
Q ss_pred CccchhhhhhhhhhCCCCCCCCcccccccChhHHhccCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy7765 1 MFWITNSIYTERYMGLPEMYDNEVGYNWTDVTRKVGNFVNKTFLLMHGTADDNVHYQQSMMLAKAL 66 (77)
Q Consensus 1 ~w~~~~~~~~e~~lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~Vp~~~s~~l~~al 66 (77)
|+|.+|-+-+-.++|.|+ .+-.-.....|.++..=.+...-++.--|.-|.++|++.+..+=+++
T Consensus 217 mNKglEvIEA~~LF~~~~-d~I~vvIHpqSiIHsmVef~DGSv~aql~~pDMrlPI~yAL~yP~R~ 281 (389)
T TIGR00243 217 MNKGLEYIEARWLFGASA-EQIDVLIHPQSIIHSMVEFQDGSVIAQLGEPDMRLPIAYAMAWPNRV 281 (389)
T ss_pred hhhhHHHHHHHHHcCCCH-HHeEEEECCCCceeEEEEEcCccEEEEeCCCCcHHHHHHHcCCcccc
Confidence 567778777777889884 33333444456665443444678888899999999999998755544
Done!