Query psy472
Match_columns 68
No_of_seqs 126 out of 1015
Neff 8.3
Searched_HMMs 46136
Date Sat Aug 17 00:04:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy472.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/472hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG1506 DAP2 Dipeptidyl aminop 99.6 7.7E-16 1.7E-20 106.1 7.7 59 9-68 534-592 (620)
2 PF00326 Peptidase_S9: Prolyl 99.6 8.7E-16 1.9E-20 93.1 6.9 61 7-68 123-185 (213)
3 KOG2100|consensus 99.6 6.1E-15 1.3E-19 103.8 6.9 66 1-68 658-723 (755)
4 KOG2281|consensus 99.4 4.6E-13 9.9E-18 93.0 6.0 66 1-68 776-843 (867)
5 PRK10115 protease 2; Provision 99.2 6.4E-11 1.4E-15 82.9 5.8 61 2-65 581-644 (686)
6 PLN02442 S-formylglutathione h 99.0 1E-09 2.3E-14 69.7 5.8 60 7-67 198-258 (283)
7 PRK10566 esterase; Provisional 98.8 2.6E-08 5.6E-13 61.1 7.1 52 17-68 176-229 (249)
8 PRK11460 putative hydrolase; P 98.8 1.7E-08 3.8E-13 62.5 6.1 42 27-68 148-189 (232)
9 PRK13604 luxD acyl transferase 98.7 5.4E-08 1.2E-12 63.2 5.8 51 15-68 191-241 (307)
10 PF02230 Abhydrolase_2: Phosph 98.7 6.4E-08 1.4E-12 59.1 5.6 41 27-67 155-195 (216)
11 PRK05371 x-prolyl-dipeptidyl a 98.7 1.4E-07 3.1E-12 67.1 7.4 57 11-68 440-496 (767)
12 TIGR02821 fghA_ester_D S-formy 98.6 1E-07 2.2E-12 60.2 5.7 55 13-67 196-252 (275)
13 COG0400 Predicted esterase [Ge 98.6 1.1E-07 2.4E-12 58.7 5.6 43 26-68 145-187 (207)
14 PF01738 DLH: Dienelactone hyd 98.6 3.2E-07 6.9E-12 55.8 6.3 50 18-68 137-186 (218)
15 PF07859 Abhydrolase_3: alpha/ 98.4 9.9E-07 2.2E-11 53.0 5.0 51 15-68 154-205 (211)
16 PF03583 LIP: Secretory lipase 98.3 1.6E-06 3.4E-11 55.7 5.8 41 27-67 219-260 (290)
17 PRK10749 lysophospholipase L2; 98.3 1.9E-06 4.1E-11 55.5 5.7 47 21-68 254-303 (330)
18 COG1505 Serine proteases of th 98.2 3.9E-06 8.4E-11 58.6 5.7 63 2-64 554-617 (648)
19 PF12695 Abhydrolase_5: Alpha/ 98.1 1.6E-05 3.5E-10 44.7 5.4 43 22-68 100-142 (145)
20 PLN02298 hydrolase, alpha/beta 98.0 1.4E-05 3E-10 51.2 5.1 43 22-67 247-289 (330)
21 PRK10162 acetyl esterase; Prov 98.0 1.9E-05 4E-10 51.0 5.6 39 28-68 249-287 (318)
22 PHA02857 monoglyceride lipase; 98.0 2.1E-05 4.5E-10 48.9 4.9 43 22-68 205-247 (276)
23 PF08840 BAAT_C: BAAT / Acyl-C 97.9 3.2E-05 7E-10 47.6 5.3 46 22-68 111-159 (213)
24 KOG1552|consensus 97.8 8.4E-06 1.8E-10 51.9 1.6 37 19-56 185-221 (258)
25 TIGR01738 bioH putative pimelo 97.8 9E-05 1.9E-09 43.9 5.7 43 20-67 182-224 (245)
26 PLN02385 hydrolase; alpha/beta 97.8 5.7E-05 1.2E-09 48.9 5.1 43 22-67 275-317 (349)
27 COG1073 Hydrolases of the alph 97.8 6E-05 1.3E-09 46.3 4.8 53 13-67 218-270 (299)
28 TIGR01840 esterase_phb esteras 97.8 3.1E-05 6.7E-10 47.1 3.2 31 27-57 168-198 (212)
29 PRK11071 esterase YqiA; Provis 97.7 6.1E-05 1.3E-09 45.5 4.3 26 27-52 136-161 (190)
30 COG0657 Aes Esterase/lipase [L 97.7 7.8E-05 1.7E-09 47.6 5.0 50 15-67 233-283 (312)
31 KOG2112|consensus 97.7 4.4E-05 9.6E-10 47.3 3.7 41 27-67 144-184 (206)
32 PLN02652 hydrolase; alpha/beta 97.7 0.00012 2.5E-09 49.0 5.6 44 22-68 320-363 (395)
33 TIGR03611 RutD pyrimidine util 97.6 0.00022 4.8E-09 42.8 5.4 44 19-67 191-234 (257)
34 PRK06765 homoserine O-acetyltr 97.6 0.00036 7.9E-09 46.6 6.8 49 18-67 315-363 (389)
35 TIGR02427 protocat_pcaD 3-oxoa 97.6 0.00027 5.9E-09 41.9 5.6 44 19-67 186-229 (251)
36 TIGR03343 biphenyl_bphD 2-hydr 97.6 0.00016 3.5E-09 44.8 4.5 42 21-67 218-259 (282)
37 COG4099 Predicted peptidase [G 97.5 3.5E-05 7.7E-10 50.6 1.1 47 20-66 308-354 (387)
38 TIGR01249 pro_imino_pep_1 prol 97.4 0.00041 8.9E-09 44.2 5.1 43 21-67 242-284 (306)
39 TIGR01607 PST-A Plasmodium sub 97.4 0.00044 9.6E-09 44.9 4.9 39 27-67 270-308 (332)
40 PRK05077 frsA fermentation/res 97.4 0.00057 1.2E-08 45.8 5.5 40 23-67 352-391 (414)
41 TIGR02240 PHA_depoly_arom poly 97.3 0.00083 1.8E-08 41.9 5.6 43 20-67 201-243 (276)
42 PRK10349 carboxylesterase BioH 97.3 0.00099 2.1E-08 40.9 5.8 46 17-67 187-232 (256)
43 COG0412 Dienelactone hydrolase 97.3 0.00088 1.9E-08 42.0 5.4 45 22-67 154-198 (236)
44 PRK00175 metX homoserine O-ace 97.3 0.0014 3.1E-08 43.1 6.4 45 21-66 304-348 (379)
45 PLN02511 hydrolase 97.3 0.00046 9.9E-09 45.8 4.1 53 11-67 283-335 (388)
46 PF12697 Abhydrolase_6: Alpha/ 97.2 0.0013 2.8E-08 38.2 5.4 48 16-68 166-213 (228)
47 TIGR01836 PHA_synth_III_C poly 97.2 0.00092 2E-08 43.4 4.8 43 22-67 282-324 (350)
48 TIGR03056 bchO_mg_che_rel puta 97.2 0.00098 2.1E-08 40.8 4.7 41 22-67 216-256 (278)
49 PF02129 Peptidase_S15: X-Pro 97.2 0.0024 5.3E-08 40.2 6.4 57 9-67 211-268 (272)
50 PLN02679 hydrolase, alpha/beta 97.1 0.0017 3.7E-08 42.5 5.8 47 20-67 286-333 (360)
51 TIGR01392 homoserO_Ac_trn homo 97.1 0.0027 5.9E-08 41.2 6.6 45 20-65 282-326 (351)
52 PLN03087 BODYGUARD 1 domain co 97.1 0.00096 2.1E-08 45.9 4.6 39 24-67 416-454 (481)
53 COG2267 PldB Lysophospholipase 97.1 0.00087 1.9E-08 43.3 3.9 43 22-68 224-268 (298)
54 PF00561 Abhydrolase_1: alpha/ 97.1 0.0022 4.8E-08 37.9 5.3 50 13-67 162-211 (230)
55 KOG4667|consensus 97.0 0.00074 1.6E-08 42.8 3.1 37 27-67 199-235 (269)
56 PRK06489 hypothetical protein; 97.0 0.0018 3.9E-08 42.2 4.8 43 20-67 286-330 (360)
57 PRK07581 hypothetical protein; 97.0 0.0024 5.2E-08 41.1 5.2 42 21-67 270-311 (339)
58 KOG1454|consensus 97.0 0.0024 5.1E-08 41.8 5.2 48 16-67 253-300 (326)
59 PRK10985 putative hydrolase; P 96.9 0.0034 7.4E-08 40.4 5.7 54 9-67 238-291 (324)
60 PF10503 Esterase_phd: Esteras 96.9 0.0013 2.9E-08 41.1 3.3 31 27-57 169-199 (220)
61 PRK03592 haloalkane dehalogena 96.9 0.0026 5.6E-08 40.0 4.6 42 22-67 224-265 (295)
62 COG0429 Predicted hydrolase of 96.8 0.0015 3.2E-08 43.3 3.4 41 8-49 256-296 (345)
63 KOG1455|consensus 96.8 0.0014 3.1E-08 42.9 3.1 46 19-67 239-284 (313)
64 PLN02824 hydrolase, alpha/beta 96.7 0.0027 5.8E-08 39.9 3.8 41 22-67 230-270 (294)
65 PF03959 FSH1: Serine hydrolas 96.7 0.0065 1.4E-07 37.2 5.4 42 23-68 158-199 (212)
66 PLN02965 Probable pheophorbida 96.7 0.0039 8.4E-08 38.5 4.4 39 24-67 191-229 (255)
67 PF08386 Abhydrolase_4: TAP-li 96.6 0.0047 1E-07 34.1 4.1 29 27-55 34-62 (103)
68 PRK14875 acetoin dehydrogenase 96.6 0.0041 9E-08 40.0 4.2 39 21-67 309-347 (371)
69 KOG1838|consensus 96.6 0.0025 5.3E-08 43.2 3.1 37 7-44 303-339 (409)
70 PRK10673 acyl-CoA esterase; Pr 96.6 0.0066 1.4E-07 36.9 4.7 40 23-67 192-231 (255)
71 PF05705 DUF829: Eukaryotic pr 96.6 0.013 2.8E-07 36.2 6.0 41 27-67 178-218 (240)
72 PRK00870 haloalkane dehalogena 96.5 0.0037 7.9E-08 39.5 3.6 43 22-67 235-277 (302)
73 PLN02578 hydrolase 96.5 0.0078 1.7E-07 39.2 5.2 33 21-54 291-323 (354)
74 PLN02872 triacylglycerol lipas 96.5 0.0057 1.2E-07 41.1 4.2 57 8-67 304-362 (395)
75 TIGR01250 pro_imino_pep_2 prol 96.4 0.013 2.8E-07 35.5 5.3 42 20-67 225-266 (288)
76 PRK08775 homoserine O-acetyltr 96.3 0.009 2E-07 38.7 4.2 41 22-66 273-313 (343)
77 PF11144 DUF2920: Protein of u 96.0 0.027 5.8E-07 38.3 5.5 41 27-67 293-333 (403)
78 PRK10439 enterobactin/ferric e 95.9 0.024 5.3E-07 38.2 5.0 40 28-68 350-389 (411)
79 PF07519 Tannase: Tannase and 95.9 0.01 2.3E-07 40.7 3.3 41 27-67 353-401 (474)
80 PRK03204 haloalkane dehalogena 95.7 0.024 5.1E-07 35.9 4.3 37 27-67 227-264 (286)
81 PLN03084 alpha/beta hydrolase 95.7 0.033 7.2E-07 37.2 5.1 38 24-67 323-360 (383)
82 PLN02894 hydrolase, alpha/beta 95.7 0.039 8.4E-07 36.9 5.4 43 21-68 320-362 (402)
83 TIGR03695 menH_SHCHC 2-succiny 95.7 0.04 8.6E-07 32.4 4.9 42 20-67 188-229 (251)
84 PLN02211 methyl indole-3-aceta 95.4 0.039 8.6E-07 34.8 4.4 28 27-54 211-238 (273)
85 KOG4391|consensus 95.3 0.021 4.5E-07 36.6 2.8 44 22-68 217-260 (300)
86 KOG3043|consensus 95.1 0.098 2.1E-06 33.3 5.4 47 20-67 158-205 (242)
87 PRK05855 short chain dehydroge 95.0 0.024 5.1E-07 38.4 2.7 37 25-66 232-268 (582)
88 PRK07868 acyl-CoA synthetase; 94.9 0.07 1.5E-06 39.4 5.1 31 22-53 293-323 (994)
89 PF10605 3HBOH: 3HB-oligomer h 94.6 0.033 7.2E-07 39.8 2.7 40 25-64 553-594 (690)
90 TIGR01839 PHA_synth_II poly(R) 94.6 0.11 2.4E-06 36.7 5.1 43 22-68 437-479 (560)
91 TIGR01838 PHA_synth_I poly(R)- 94.5 0.098 2.1E-06 36.7 4.8 33 21-54 410-442 (532)
92 PRK11126 2-succinyl-6-hydroxy- 94.5 0.13 2.8E-06 31.0 4.8 36 21-67 183-218 (242)
93 KOG1515|consensus 94.0 0.18 3.9E-06 33.5 5.0 39 28-68 269-307 (336)
94 PF05448 AXE1: Acetyl xylan es 93.5 0.22 4.9E-06 32.6 4.8 50 14-67 250-299 (320)
95 PF06821 Ser_hydrolase: Serine 93.3 0.17 3.6E-06 30.4 3.7 27 28-54 115-141 (171)
96 COG1647 Esterase/lipase [Gener 93.0 0.15 3.2E-06 32.5 3.2 45 19-66 174-218 (243)
97 KOG2551|consensus 92.9 0.17 3.8E-06 32.0 3.4 34 22-56 159-192 (230)
98 PF10142 PhoPQ_related: PhoPQ- 92.7 0.15 3.3E-06 34.2 3.2 55 9-67 245-299 (367)
99 COG1770 PtrB Protease II [Amin 92.7 0.27 5.9E-06 35.4 4.5 58 2-62 584-644 (682)
100 PLN02980 2-oxoglutarate decarb 92.6 0.35 7.5E-06 37.9 5.2 45 21-67 1563-1615(1655)
101 COG0596 MhpC Predicted hydrola 92.6 0.35 7.6E-06 27.9 4.3 42 22-67 217-258 (282)
102 TIGR01849 PHB_depoly_PhaZ poly 90.8 0.47 1E-05 32.3 3.9 32 22-53 333-364 (406)
103 COG2945 Predicted hydrolase of 90.3 0.37 8.1E-06 30.1 2.9 25 28-52 150-174 (210)
104 KOG2237|consensus 89.9 0.28 6E-06 35.4 2.3 48 11-58 612-663 (712)
105 KOG2984|consensus 89.8 0.26 5.7E-06 31.4 1.9 41 22-67 212-252 (277)
106 PF05728 UPF0227: Uncharacteri 89.7 0.48 1E-05 28.9 3.0 24 28-51 135-158 (187)
107 KOG2382|consensus 89.4 0.81 1.8E-05 30.3 4.0 39 23-66 250-288 (315)
108 COG3545 Predicted esterase of 88.4 1.1 2.3E-05 27.6 3.8 43 10-54 102-144 (181)
109 COG2021 MET2 Homoserine acetyl 88.3 1.7 3.6E-05 29.5 5.0 42 16-58 296-337 (368)
110 PLN02606 palmitoyl-protein thi 88.3 1.9 4.2E-05 28.5 5.2 37 27-64 26-63 (306)
111 PF00756 Esterase: Putative es 87.2 0.38 8.2E-06 29.4 1.4 57 11-67 163-233 (251)
112 COG3208 GrsT Predicted thioest 87.1 1.7 3.6E-05 27.9 4.3 41 24-68 174-214 (244)
113 COG3458 Acetyl esterase (deace 86.6 0.99 2.2E-05 29.8 3.1 40 18-58 251-290 (321)
114 COG3571 Predicted hydrolase of 85.0 0.74 1.6E-05 28.4 1.8 26 20-46 136-161 (213)
115 PF06028 DUF915: Alpha/beta hy 84.8 1.5 3.2E-05 28.1 3.2 41 27-67 184-230 (255)
116 PLN02633 palmitoyl protein thi 84.6 3.7 8.1E-05 27.3 5.1 38 27-65 25-63 (314)
117 KOG2624|consensus 84.0 1.6 3.4E-05 29.9 3.2 50 8-58 313-363 (403)
118 PF02244 Propep_M14: Carboxype 83.1 0.81 1.7E-05 23.3 1.3 30 35-64 34-63 (74)
119 TIGR03100 hydr1_PEP hydrolase, 83.1 2.5 5.4E-05 26.6 3.8 20 22-42 203-222 (274)
120 cd03409 Chelatase_Class_II Cla 83.1 3.8 8.3E-05 21.7 4.1 28 30-57 3-30 (101)
121 PF02089 Palm_thioest: Palmito 81.6 1.3 2.9E-05 28.8 2.1 26 27-52 5-33 (279)
122 KOG4409|consensus 81.6 4.8 0.0001 27.3 4.7 53 13-68 288-341 (365)
123 TIGR03100 hydr1_PEP hydrolase, 80.3 6.4 0.00014 24.8 4.9 37 27-63 26-63 (274)
124 PF08357 SEFIR: SEFIR domain; 80.2 4.3 9.3E-05 23.2 3.8 37 28-65 1-38 (150)
125 TIGR00976 /NonD putative hydro 79.5 6.1 0.00013 27.6 4.9 48 13-64 219-266 (550)
126 PF07755 DUF1611: Protein of u 79.3 8.3 0.00018 25.5 5.2 39 24-63 109-147 (301)
127 COG3243 PhaC Poly(3-hydroxyalk 77.9 6.1 0.00013 27.5 4.4 41 22-66 326-366 (445)
128 PRK10834 vancomycin high tempe 77.8 3.7 7.9E-05 26.3 3.2 39 27-66 81-119 (239)
129 cd05560 Xcc1710_like Xcc1710_l 76.0 6.7 0.00015 21.8 3.6 34 28-63 52-85 (109)
130 COG2382 Fes Enterochelin ester 75.3 9.8 0.00021 25.2 4.7 25 44-68 254-278 (299)
131 PF02273 Acyl_transf_2: Acyl t 75.0 3.9 8.5E-05 26.8 2.8 36 17-53 186-221 (294)
132 cd03413 CbiK_C Anaerobic cobal 74.8 12 0.00026 20.6 4.8 26 30-57 4-29 (103)
133 TIGR00666 PBP4 D-alanyl-D-alan 74.5 7.2 0.00016 25.9 4.0 31 29-59 55-85 (345)
134 PLN02213 sinapoylglucose-malat 73.6 3.2 6.9E-05 27.1 2.2 31 27-57 233-263 (319)
135 PF06500 DUF1100: Alpha/beta h 73.4 6.2 0.00013 27.2 3.6 25 27-51 352-376 (411)
136 PF02113 Peptidase_S13: D-Ala- 73.1 8.1 0.00018 26.6 4.1 32 29-60 82-113 (444)
137 PRK11113 D-alanyl-D-alanine ca 72.9 7.8 0.00017 26.9 4.0 30 30-59 100-129 (477)
138 KOG2521|consensus 72.5 11 0.00023 25.5 4.4 40 28-67 226-265 (350)
139 COG1647 Esterase/lipase [Gener 72.4 7 0.00015 25.1 3.4 38 27-67 15-52 (243)
140 PF11339 DUF3141: Protein of u 72.3 4.6 0.0001 28.9 2.8 26 22-48 293-318 (581)
141 PF00450 Peptidase_S10: Serine 72.1 5.1 0.00011 26.3 2.9 32 27-58 330-361 (415)
142 COG0627 Predicted esterase [Ge 72.0 4.2 9.2E-05 26.9 2.5 41 27-67 247-289 (316)
143 TIGR02764 spore_ybaN_pdaB poly 71.3 13 0.00027 22.2 4.3 37 24-60 148-184 (191)
144 TIGR03012 sulf_tusD_dsrE sulfu 71.3 16 0.00035 20.8 4.6 38 31-68 4-42 (127)
145 KOG2541|consensus 70.7 17 0.00036 24.1 4.9 25 28-52 24-50 (296)
146 PF13676 TIR_2: TIR domain; PD 70.5 5.7 0.00012 21.0 2.4 27 30-60 1-27 (102)
147 PF14714 KH_dom-like: KH-domai 68.7 15 0.00033 19.3 5.5 37 27-63 38-78 (80)
148 PF11823 DUF3343: Protein of u 68.0 12 0.00025 19.1 3.2 25 43-67 11-35 (73)
149 PRK00923 sirohydrochlorin coba 67.2 15 0.00033 20.4 3.8 27 30-57 5-31 (126)
150 KOG4178|consensus 66.8 15 0.00033 24.6 4.2 20 24-44 256-275 (322)
151 PLN03016 sinapoylglucose-malat 66.5 5.9 0.00013 27.2 2.3 32 27-58 347-378 (433)
152 cd03414 CbiX_SirB_C Sirohydroc 65.9 12 0.00026 20.4 3.2 27 30-57 4-30 (117)
153 KOG3253|consensus 65.9 19 0.00042 26.6 4.8 42 22-67 300-341 (784)
154 PRK00207 sulfur transfer compl 65.8 21 0.00045 20.4 4.2 38 31-68 5-43 (128)
155 PF12646 DUF3783: Domain of un 65.7 14 0.00031 18.2 3.1 28 29-61 2-29 (58)
156 COG2949 SanA Uncharacterized m 65.5 7 0.00015 24.9 2.3 38 27-65 93-130 (235)
157 PF08538 DUF1749: Protein of u 64.6 5.1 0.00011 26.5 1.7 33 23-56 229-262 (303)
158 KOG1282|consensus 63.9 7.2 0.00016 27.2 2.4 27 28-54 364-390 (454)
159 COG1553 DsrE Uncharacterized c 63.3 15 0.00033 21.3 3.3 26 43-68 17-43 (126)
160 cd03416 CbiX_SirB_N Sirohydroc 63.2 18 0.00039 19.2 3.5 27 30-57 3-29 (101)
161 PTZ00472 serine carboxypeptida 63.2 9.7 0.00021 26.3 2.9 31 27-57 364-394 (462)
162 cd00248 Mth938-like Mth938-lik 62.7 18 0.00039 20.1 3.5 34 28-63 52-85 (109)
163 COG1535 EntB Isochorismate hyd 62.1 18 0.00039 22.7 3.6 41 27-67 29-81 (218)
164 TIGR03230 lipo_lipase lipoprot 62.0 30 0.00065 24.1 5.1 40 27-66 41-80 (442)
165 PF12715 Abhydrolase_7: Abhydr 61.9 2.1 4.6E-05 29.2 -0.4 36 27-67 306-343 (390)
166 PF04083 Abhydro_lipase: Parti 61.5 5.6 0.00012 20.1 1.2 14 27-40 43-56 (63)
167 PLN02209 serine carboxypeptida 61.3 9.6 0.00021 26.3 2.6 32 27-58 351-382 (437)
168 PRK09061 D-glutamate deacylase 61.0 46 0.00099 23.3 5.9 44 22-66 241-284 (509)
169 TIGR00668 apaH bis(5'-nucleosy 60.5 22 0.00047 23.3 4.0 42 12-55 100-141 (279)
170 PLN02757 sirohydrochlorine fer 58.6 12 0.00026 22.1 2.4 29 28-57 15-43 (154)
171 COG0848 ExbD Biopolymer transp 57.6 36 0.00078 19.8 5.1 31 27-59 96-126 (137)
172 PF06649 DUF1161: Protein of u 57.4 23 0.0005 17.5 3.2 22 47-68 6-28 (52)
173 PF12146 Hydrolase_4: Putative 56.8 19 0.00042 18.7 2.8 31 27-60 16-46 (79)
174 PF04430 DUF498: Protein of un 56.7 6.5 0.00014 21.7 1.0 34 28-63 53-86 (110)
175 PRK13604 luxD acyl transferase 55.9 25 0.00054 23.3 3.8 30 28-60 38-67 (307)
176 PF02698 DUF218: DUF218 domain 55.6 16 0.00034 20.9 2.6 37 28-64 38-74 (155)
177 PRK08335 translation initiatio 55.5 20 0.00043 23.4 3.2 27 38-64 141-167 (275)
178 TIGR00627 tfb4 transcription f 54.7 55 0.0012 21.4 5.2 40 27-66 146-185 (279)
179 PF13721 SecD-TM1: SecD export 53.9 36 0.00078 18.7 4.4 40 27-67 30-69 (101)
180 PF06342 DUF1057: Alpha/beta h 53.4 25 0.00054 23.3 3.4 33 22-55 208-240 (297)
181 TIGR02803 ExbD_1 TonB system t 52.8 26 0.00056 19.6 3.1 24 36-59 91-114 (122)
182 cd06259 YdcF-like YdcF-like. Y 52.8 21 0.00045 20.2 2.8 37 28-64 35-71 (150)
183 TIGR02801 tolR TolR protein. T 52.7 26 0.00056 19.7 3.1 28 30-59 95-122 (129)
184 cd00858 GlyRS_anticodon GlyRS 52.3 39 0.00084 18.7 3.8 35 28-63 27-61 (121)
185 PF04056 Ssl1: Ssl1-like; Int 52.3 34 0.00074 21.2 3.8 46 20-66 94-139 (193)
186 TIGR02804 ExbD_2 TonB system t 52.2 24 0.00051 19.7 2.9 23 37-59 91-113 (121)
187 COG3618 Predicted metal-depend 51.8 25 0.00055 23.1 3.2 38 30-67 110-147 (279)
188 smart00460 TGc Transglutaminas 51.2 29 0.00062 16.7 3.4 22 43-64 10-31 (68)
189 COG0859 RfaF ADP-heptose:LPS h 50.8 68 0.0015 20.9 5.3 35 29-65 176-214 (334)
190 PF01674 Lipase_2: Lipase (cla 50.8 15 0.00033 23.0 2.1 29 28-59 2-31 (219)
191 PRK13973 thymidylate kinase; P 49.9 32 0.0007 20.9 3.5 31 37-67 11-41 (213)
192 cd05125 Mth938_2P1-like Mth938 49.7 41 0.00089 19.0 3.6 34 28-63 54-87 (114)
193 PF05853 DUF849: Prokaryotic p 49.2 29 0.00063 22.3 3.3 26 41-66 123-148 (272)
194 PF12531 DUF3731: DNA-K relate 49.1 21 0.00045 23.1 2.5 27 31-57 147-173 (249)
195 PF12913 SH3_6: SH3 domain of 48.4 18 0.0004 17.9 1.8 14 27-40 26-39 (54)
196 TIGR02884 spore_pdaA delta-lac 47.5 66 0.0014 19.8 4.6 36 24-61 183-218 (224)
197 COG4814 Uncharacterized protei 47.2 54 0.0012 21.7 4.2 42 27-68 216-263 (288)
198 PRK15478 cbiH cobalt-precorrin 47.0 71 0.0015 20.1 4.7 38 28-67 71-108 (241)
199 PRK11024 colicin uptake protei 46.7 35 0.00077 19.6 3.1 28 30-59 105-132 (141)
200 cd01786 STE50_RA Ubiquitin-lik 46.6 48 0.001 18.4 3.4 36 28-63 58-94 (98)
201 PF08904 DUF1849: Domain of un 46.4 40 0.00086 21.8 3.5 31 37-67 127-157 (252)
202 PRK10494 hypothetical protein; 46.0 33 0.00072 21.9 3.1 38 28-65 121-158 (259)
203 COG3411 Ferredoxin [Energy pro 45.6 33 0.00073 17.6 2.5 39 28-66 17-57 (64)
204 cd00861 ProRS_anticodon_short 45.6 43 0.00094 17.2 3.7 22 42-63 16-37 (94)
205 PRK11267 biopolymer transport 45.4 37 0.00081 19.5 3.1 29 29-59 100-128 (141)
206 cd03557 L-arabinose_isomerase 45.3 44 0.00095 23.6 3.8 42 22-64 87-142 (484)
207 COG2908 Uncharacterized protei 45.2 39 0.00084 21.7 3.3 22 27-50 105-126 (237)
208 PRK09912 L-glyceraldehyde 3-ph 45.1 48 0.001 21.8 3.8 30 28-58 132-161 (346)
209 cd03412 CbiK_N Anaerobic cobal 44.6 55 0.0012 18.5 3.6 27 30-57 4-30 (127)
210 PF14698 ASL_C2: Argininosucci 44.5 10 0.00023 19.4 0.6 15 46-60 4-18 (70)
211 PRK05765 precorrin-3B C17-meth 44.3 79 0.0017 19.8 4.8 38 28-67 73-110 (246)
212 cd03811 GT1_WabH_like This fam 44.3 72 0.0016 19.3 4.9 41 27-67 188-228 (353)
213 cd03817 GT1_UGDG_like This fam 44.2 75 0.0016 19.6 5.4 41 27-67 201-241 (374)
214 PF07521 RMMBL: RNA-metabolisi 44.0 3.4 7.3E-05 19.2 -1.2 20 18-37 23-42 (43)
215 TIGR01205 D_ala_D_alaTIGR D-al 44.0 68 0.0015 20.4 4.4 32 32-63 5-37 (315)
216 cd03808 GT1_cap1E_like This fa 43.6 75 0.0016 19.3 5.9 41 27-67 187-227 (359)
217 COG1184 GCD2 Translation initi 43.6 32 0.0007 22.8 2.8 24 41-64 154-177 (301)
218 cd03415 CbiX_CbiC Archaeal sir 43.2 63 0.0014 18.4 4.1 25 30-55 4-28 (125)
219 PF14480 DNA_pol3_a_NI: DNA po 42.9 46 0.001 16.7 4.8 50 14-64 22-74 (76)
220 PF06441 EHN: Epoxide hydrolas 42.8 13 0.00028 20.9 0.9 9 27-35 92-100 (112)
221 PF13041 PPR_2: PPR repeat fam 42.5 9 0.0002 17.7 0.1 21 42-62 18-38 (50)
222 cd07422 MPP_ApaH Escherichia c 42.5 55 0.0012 21.0 3.7 28 28-55 112-139 (257)
223 cd03821 GT1_Bme6_like This fam 42.4 81 0.0018 19.4 5.4 40 27-66 202-241 (375)
224 PRK12326 preprotein translocas 42.2 88 0.0019 23.6 5.0 31 27-63 427-457 (764)
225 PF07819 PGAP1: PGAP1-like pro 42.1 84 0.0018 19.5 4.5 25 27-54 4-28 (225)
226 PF13500 AAA_26: AAA domain; P 42.0 75 0.0016 18.9 4.1 44 20-66 123-166 (199)
227 TIGR01293 Kv_beta voltage-depe 41.9 60 0.0013 21.0 3.9 31 27-58 113-143 (317)
228 PRK04940 hypothetical protein; 41.3 59 0.0013 19.9 3.6 25 28-52 125-149 (180)
229 cd00860 ThrRS_anticodon ThrRS 40.6 51 0.0011 16.6 4.0 22 42-63 13-34 (91)
230 cd06592 GH31_glucosidase_KIAA1 40.6 45 0.00097 21.6 3.2 41 27-67 13-53 (303)
231 PF01535 PPR: PPR repeat; Int 40.6 28 0.00062 13.6 2.2 16 43-58 16-31 (31)
232 PRK00166 apaH diadenosine tetr 40.6 57 0.0012 21.1 3.6 28 28-55 114-141 (275)
233 PF03358 FMN_red: NADPH-depend 40.4 69 0.0015 18.0 4.2 37 29-65 2-39 (152)
234 PRK08334 translation initiatio 40.1 60 0.0013 22.1 3.7 28 37-64 198-226 (356)
235 PF06713 bPH_4: Bacterial PH d 39.9 28 0.0006 17.9 1.8 30 28-57 44-74 (74)
236 COG3367 Uncharacterized conser 39.4 94 0.002 21.1 4.5 37 27-63 147-183 (339)
237 cd08199 EEVS 2-epi-5-epi-valio 39.4 1.2E+02 0.0025 20.3 5.1 37 28-67 27-63 (354)
238 PRK12756 phospho-2-dehydro-3-d 39.3 1E+02 0.0023 21.0 4.7 34 30-63 229-262 (348)
239 cd00578 L-fuc_L-ara-isomerases 39.2 1.2E+02 0.0026 20.7 5.2 41 22-64 84-133 (452)
240 cd01458 vWA_ku Ku70/Ku80 N-ter 39.1 82 0.0018 19.1 4.0 39 27-65 128-170 (218)
241 PF13170 DUF4003: Protein of u 38.8 13 0.00029 24.3 0.5 37 29-66 185-221 (297)
242 cd01452 VWA_26S_proteasome_sub 38.7 93 0.002 19.0 5.6 38 28-66 108-145 (187)
243 PRK05629 hypothetical protein; 38.5 53 0.0012 21.2 3.3 38 28-67 6-45 (318)
244 PRK06372 translation initiatio 38.5 57 0.0012 21.1 3.3 25 39-63 116-140 (253)
245 PF09198 T4-Gluco-transf: Bact 38.4 43 0.00093 15.1 2.1 23 39-61 16-38 (38)
246 PRK08535 translation initiatio 38.3 59 0.0013 21.4 3.5 26 39-64 153-178 (310)
247 cd03798 GT1_wlbH_like This fam 38.0 95 0.0021 18.9 5.6 41 27-67 201-241 (377)
248 PF13709 DUF4159: Domain of un 38.0 82 0.0018 19.4 3.9 32 27-58 53-84 (207)
249 cd00859 HisRS_anticodon HisRS 37.6 55 0.0012 16.1 3.1 23 42-64 13-35 (91)
250 PF02472 ExbD: Biopolymer tran 37.2 49 0.0011 18.1 2.7 28 30-59 94-121 (130)
251 PF10230 DUF2305: Uncharacteri 37.2 1E+02 0.0023 19.5 4.4 32 27-58 221-252 (266)
252 TIGR02544 III_secr_YscJ type I 37.2 60 0.0013 19.9 3.2 22 41-62 26-47 (193)
253 PRK12902 secA preprotein trans 37.2 93 0.002 24.0 4.6 32 27-64 439-470 (939)
254 PF09587 PGA_cap: Bacterial ca 37.1 65 0.0014 20.1 3.4 33 28-60 186-219 (250)
255 PF13394 Fer4_14: 4Fe-4S singl 36.8 36 0.00077 18.4 2.0 37 30-67 50-88 (119)
256 cd03822 GT1_ecORF704_like This 36.8 1E+02 0.0023 19.0 5.7 41 27-67 184-224 (366)
257 COG0124 HisS Histidyl-tRNA syn 36.7 1.3E+02 0.0029 20.9 5.1 35 27-64 335-369 (429)
258 COG0125 Tmk Thymidylate kinase 36.5 71 0.0015 19.8 3.5 32 37-68 11-42 (208)
259 CHL00122 secA preprotein trans 36.2 1.1E+02 0.0023 23.5 4.8 31 27-63 424-454 (870)
260 TIGR00511 ribulose_e2b2 ribose 36.2 53 0.0011 21.5 3.0 25 39-63 148-172 (301)
261 TIGR03101 hydr2_PEP hydrolase, 36.0 1.2E+02 0.0026 19.5 4.7 40 28-67 202-242 (266)
262 cd03812 GT1_CapH_like This fam 35.6 1.1E+02 0.0024 19.1 5.7 41 27-67 191-231 (358)
263 cd03800 GT1_Sucrose_synthase T 35.4 1.2E+02 0.0026 19.3 5.6 41 27-67 219-259 (398)
264 PRK02458 ribose-phosphate pyro 35.3 1E+02 0.0023 20.4 4.3 38 27-68 7-44 (323)
265 PRK06371 translation initiatio 35.3 64 0.0014 21.7 3.3 27 38-64 176-203 (329)
266 PRK10877 protein disulfide iso 35.2 64 0.0014 20.2 3.1 30 37-66 115-145 (232)
267 KOG3363|consensus 34.8 91 0.002 19.3 3.6 36 28-63 111-146 (196)
268 TIGR02836 spore_IV_A stage IV 34.7 1.7E+02 0.0037 20.9 6.1 50 7-60 162-212 (492)
269 cd03819 GT1_WavL_like This fam 34.3 1.2E+02 0.0026 19.0 5.8 41 27-67 184-224 (355)
270 PF05902 4_1_CTD: 4.1 protein 34.2 52 0.0011 18.8 2.4 25 29-57 71-95 (114)
271 PF02629 CoA_binding: CoA bind 34.2 77 0.0017 16.7 3.7 26 41-66 70-95 (96)
272 PF12010 DUF3502: Domain of un 34.1 21 0.00046 20.5 0.8 25 32-59 92-116 (134)
273 cd03801 GT1_YqgM_like This fam 34.0 1.1E+02 0.0024 18.5 5.0 41 27-67 198-238 (374)
274 KOG1466|consensus 34.0 44 0.00095 22.2 2.3 23 42-64 166-188 (313)
275 PF01841 Transglut_core: Trans 33.7 77 0.0017 16.6 3.0 22 43-64 55-76 (113)
276 PF02374 ArsA_ATPase: Anion-tr 33.6 57 0.0012 21.3 2.8 30 37-66 215-244 (305)
277 PF03129 HGTP_anticodon: Antic 33.6 74 0.0016 16.3 3.0 21 43-63 15-35 (94)
278 PF01008 IF-2B: Initiation fac 33.5 36 0.00078 21.6 1.9 24 40-63 141-164 (282)
279 PF08532 Glyco_hydro_42M: Beta 33.4 68 0.0015 19.5 3.0 23 42-64 28-50 (207)
280 PF11950 DUF3467: Protein of u 33.3 43 0.00093 18.0 1.9 17 40-56 63-79 (92)
281 cd02006 TPP_Gcl Thiamine pyrop 33.2 1.1E+02 0.0024 18.4 4.5 38 27-68 75-112 (202)
282 KOG3946|consensus 33.1 82 0.0018 21.2 3.4 41 28-68 131-179 (338)
283 PRK08057 cobalt-precorrin-6x r 33.1 80 0.0017 20.1 3.4 28 28-62 3-30 (248)
284 PRK13398 3-deoxy-7-phosphohept 33.0 1.4E+02 0.003 19.3 4.5 32 28-60 26-57 (266)
285 PLN03033 2-dehydro-3-deoxyphos 32.7 1.4E+02 0.0031 19.8 4.5 43 16-60 4-50 (290)
286 PF04244 DPRP: Deoxyribodipyri 32.5 75 0.0016 20.0 3.1 25 43-67 48-72 (224)
287 PRK02929 L-arabinose isomerase 32.4 1.2E+02 0.0025 21.6 4.3 42 22-64 93-148 (499)
288 cd06593 GH31_xylosidase_YicI Y 32.0 1.1E+02 0.0024 19.7 4.0 28 40-67 20-47 (308)
289 TIGR02189 GlrX-like_plant Glut 31.6 27 0.00059 18.8 1.0 36 24-59 58-97 (99)
290 TIGR01769 GGGP geranylgeranylg 31.5 1.3E+02 0.0029 18.7 6.2 42 21-67 19-61 (205)
291 PRK01906 tetraacyldisaccharide 31.5 1.5E+02 0.0033 19.8 4.6 34 27-67 243-276 (338)
292 COG1447 CelC Phosphotransferas 31.4 47 0.001 18.7 1.9 27 27-54 70-96 (105)
293 KOG4388|consensus 31.4 1.1E+02 0.0024 22.9 4.1 37 28-66 788-824 (880)
294 PRK12903 secA preprotein trans 31.2 1.6E+02 0.0034 22.8 4.9 31 27-63 426-456 (925)
295 TIGR02873 spore_ylxY probable 30.9 1.4E+02 0.0029 19.3 4.2 22 44-65 98-119 (268)
296 PF06850 PHB_depo_C: PHB de-po 30.9 74 0.0016 20.0 2.8 27 25-51 132-158 (202)
297 PRK12822 phospho-2-dehydro-3-d 30.5 1.8E+02 0.0039 19.9 4.8 34 30-63 230-263 (356)
298 PRK06036 translation initiatio 30.4 1.1E+02 0.0023 20.6 3.8 28 37-64 186-214 (339)
299 PRK10629 EnvZ/OmpR regulon mod 30.3 1.1E+02 0.0025 17.5 4.7 39 27-67 34-73 (127)
300 PRK07933 thymidylate kinase; V 30.3 1E+02 0.0023 18.8 3.5 29 38-66 9-37 (213)
301 PF13579 Glyco_trans_4_4: Glyc 30.3 96 0.0021 16.6 3.4 21 45-65 6-26 (160)
302 PF04389 Peptidase_M28: Peptid 30.2 83 0.0018 18.2 3.0 36 32-67 20-59 (179)
303 PF13477 Glyco_trans_4_2: Glyc 30.2 98 0.0021 16.8 3.9 23 44-66 11-33 (139)
304 PF07131 DUF1382: Protein of u 30.0 62 0.0013 16.4 2.0 20 41-60 7-26 (61)
305 PF13823 ADH_N_assoc: Alcohol 29.9 24 0.00051 14.4 0.4 12 30-41 3-14 (23)
306 PF13812 PPR_3: Pentatricopept 29.9 49 0.0011 13.2 2.4 17 43-59 17-33 (34)
307 TIGR03675 arCOG00543 arCOG0054 29.9 1.6E+02 0.0034 21.5 4.7 36 18-59 579-617 (630)
308 PRK00093 GTP-binding protein D 29.9 1.8E+02 0.0038 19.6 5.6 38 27-64 387-428 (435)
309 PF15643 Tox-PL-2: Papain fold 29.7 80 0.0017 17.7 2.6 21 44-64 23-43 (100)
310 COG2818 Tag 3-methyladenine DN 29.7 51 0.0011 20.5 2.0 18 43-60 140-157 (188)
311 cd04906 ACT_ThrD-I_1 First of 29.4 91 0.002 16.1 2.8 20 43-62 52-71 (85)
312 PF10096 DUF2334: Uncharacteri 29.2 1.5E+02 0.0033 18.6 5.8 27 40-66 12-38 (243)
313 PLN02924 thymidylate kinase 29.1 92 0.002 19.3 3.1 39 29-67 16-54 (220)
314 PF00532 Peripla_BP_1: Peripla 28.8 1.6E+02 0.0034 18.6 4.9 33 28-60 119-152 (279)
315 TIGR00524 eIF-2B_rel eIF-2B al 28.7 76 0.0016 20.9 2.8 25 39-63 159-184 (303)
316 KOG4238|consensus 28.7 1.4E+02 0.0031 21.1 4.1 32 27-58 448-483 (627)
317 COG0337 AroB 3-dehydroquinate 28.6 1.2E+02 0.0025 20.8 3.7 38 28-68 34-71 (360)
318 COG3509 LpqC Poly(3-hydroxybut 28.5 9.6 0.00021 25.4 -1.3 17 28-44 191-207 (312)
319 PRK11172 dkgB 2,5-diketo-D-glu 28.4 1.4E+02 0.0031 18.7 3.9 32 27-58 90-122 (267)
320 cd04951 GT1_WbdM_like This fam 28.4 1.5E+02 0.0033 18.4 5.2 40 28-67 188-227 (360)
321 PRK02395 hypothetical protein; 28.3 1.5E+02 0.0033 19.0 4.1 27 30-56 138-165 (279)
322 cd06595 GH31_xylosidase_XylS-l 28.3 96 0.0021 20.0 3.2 25 42-66 23-47 (292)
323 PF02233 PNTB: NAD(P) transhyd 28.3 93 0.002 22.0 3.3 39 28-66 308-347 (463)
324 cd03060 GST_N_Omega_like GST_N 28.3 82 0.0018 15.3 3.2 22 45-66 11-32 (71)
325 PF07137 VDE: Violaxanthin de- 28.1 40 0.00086 21.1 1.3 26 36-61 154-179 (198)
326 PF09413 DUF2007: Domain of un 28.1 84 0.0018 15.3 2.8 20 43-62 9-28 (67)
327 PF13369 Transglut_core2: Tran 28.0 1.3E+02 0.0028 17.4 3.6 28 40-67 65-92 (152)
328 cd04333 ProX_deacylase This CD 27.8 82 0.0018 17.9 2.6 20 47-66 2-21 (148)
329 PRK05720 mtnA methylthioribose 27.7 1.1E+02 0.0024 20.6 3.4 27 38-64 186-213 (344)
330 COG2344 AT-rich DNA-binding pr 27.5 1.1E+02 0.0024 19.4 3.2 23 40-62 153-175 (211)
331 PRK05907 hypothetical protein; 27.5 1.5E+02 0.0032 19.6 4.0 16 21-38 13-28 (311)
332 PF00763 THF_DHG_CYH: Tetrahyd 27.5 1.2E+02 0.0026 16.9 3.7 40 28-67 29-68 (117)
333 PF08444 Gly_acyl_tr_C: Aralky 27.5 84 0.0018 17.1 2.4 20 46-65 41-60 (89)
334 PF02082 Rrf2: Transcriptional 27.4 97 0.0021 15.9 2.6 20 39-58 35-54 (83)
335 TIGR01890 N-Ac-Glu-synth amino 27.4 1.5E+02 0.0034 20.1 4.2 27 28-63 49-75 (429)
336 PF04282 DUF438: Family of unk 27.3 79 0.0017 16.5 2.2 22 40-61 28-49 (71)
337 PF00496 SBP_bac_5: Bacterial 27.2 1.2E+02 0.0027 19.4 3.6 36 31-66 296-331 (374)
338 PF01884 PcrB: PcrB family; I 27.2 89 0.0019 19.9 2.8 40 22-67 28-67 (230)
339 TIGR01357 aroB 3-dehydroquinat 27.1 1.9E+02 0.0041 19.0 5.1 37 28-67 21-57 (344)
340 cd04336 YeaK YeaK is an unchar 26.8 1.1E+02 0.0023 17.5 3.0 20 48-67 3-22 (153)
341 cd00738 HGTP_anticodon HGTP an 26.6 98 0.0021 15.6 3.7 20 43-62 17-36 (94)
342 cd08197 DOIS 2-deoxy-scyllo-in 26.5 1.5E+02 0.0032 19.9 3.9 37 28-67 24-60 (355)
343 PRK15113 glutathione S-transfe 26.4 1.5E+02 0.0033 17.7 5.4 37 28-66 3-39 (214)
344 COG1282 PntB NAD/NADP transhyd 26.4 82 0.0018 22.0 2.7 55 12-66 292-348 (463)
345 PRK12899 secA preprotein trans 26.4 2E+02 0.0043 22.5 4.8 31 27-63 568-598 (970)
346 cd00423 Pterin_binding Pterin 26.3 1.8E+02 0.0038 18.4 4.2 43 22-65 116-171 (258)
347 PRK12901 secA preprotein trans 26.3 1E+02 0.0022 24.3 3.3 30 27-62 628-657 (1112)
348 PRK07535 methyltetrahydrofolat 26.1 1.8E+02 0.004 18.6 4.5 42 22-65 112-159 (261)
349 KOG0673|consensus 25.9 2E+02 0.0043 18.9 4.6 60 1-65 201-261 (293)
350 cd03059 GST_N_SspA GST_N famil 25.9 90 0.002 14.9 3.0 22 45-66 11-32 (73)
351 PF00248 Aldo_ket_red: Aldo/ke 25.8 1.3E+02 0.0028 18.7 3.4 31 28-58 100-130 (283)
352 PF09397 Ftsk_gamma: Ftsk gamm 25.8 1.1E+02 0.0023 15.6 2.6 20 40-59 31-50 (65)
353 PF10137 TIR-like: Predicted n 25.7 1.4E+02 0.003 17.1 4.3 6 30-35 2-7 (125)
354 PRK13104 secA preprotein trans 25.6 2.5E+02 0.0055 21.7 5.2 31 27-63 444-474 (896)
355 cd03825 GT1_wcfI_like This fam 25.5 1.8E+02 0.0038 18.2 5.1 25 43-67 16-40 (365)
356 PRK00466 acetyl-lysine deacety 25.5 1.7E+02 0.0037 18.9 4.0 34 34-67 88-121 (346)
357 PF13684 Dak1_2: Dihydroxyacet 25.5 2.1E+02 0.0045 18.9 5.1 38 28-68 265-302 (313)
358 PRK05772 translation initiatio 25.4 1.2E+02 0.0026 20.7 3.3 28 37-64 206-234 (363)
359 PF10113 Fibrillarin_2: Fibril 25.3 2.2E+02 0.0048 20.3 4.6 44 22-66 185-228 (505)
360 cd03794 GT1_wbuB_like This fam 25.2 1.7E+02 0.0038 18.0 5.3 40 27-67 219-258 (394)
361 PRK09444 pntB pyridine nucleot 25.1 1.2E+02 0.0025 21.6 3.2 39 28-66 307-346 (462)
362 cd03814 GT1_like_2 This family 25.1 1.8E+02 0.0038 18.0 5.4 39 28-67 197-235 (364)
363 PRK13103 secA preprotein trans 25.1 1.2E+02 0.0026 23.4 3.5 30 27-62 449-478 (913)
364 PRK15324 type III secretion sy 25.0 1.2E+02 0.0026 19.5 3.2 22 41-62 27-48 (252)
365 cd04885 ACT_ThrD-I Tandem C-te 25.0 1E+02 0.0022 15.1 3.4 29 28-60 38-66 (68)
366 COG1509 KamA Lysine 2,3-aminom 24.9 62 0.0013 22.2 1.9 29 28-56 282-315 (369)
367 PRK05331 putative phosphate ac 24.9 89 0.0019 20.8 2.6 33 25-59 292-324 (334)
368 PF06415 iPGM_N: BPG-independe 24.9 1.9E+02 0.0042 18.4 5.9 49 19-68 20-71 (223)
369 KOG1579|consensus 24.8 90 0.0019 21.0 2.6 36 31-68 223-258 (317)
370 PF13344 Hydrolase_6: Haloacid 24.7 71 0.0015 17.2 1.9 22 42-63 15-36 (101)
371 PF01903 CbiX: CbiX; InterPro 24.5 1.2E+02 0.0026 15.9 2.8 28 34-62 1-29 (105)
372 PRK01372 ddl D-alanine--D-alan 24.4 1.7E+02 0.0036 18.5 3.8 36 29-64 6-43 (304)
373 PF06057 VirJ: Bacterial virul 24.2 95 0.002 19.3 2.5 24 17-40 129-152 (192)
374 TIGR00512 salvage_mtnA S-methy 24.1 1.5E+02 0.0033 19.9 3.6 27 38-64 186-213 (331)
375 PF03720 UDPG_MGDP_dh_C: UDP-g 23.9 1.2E+02 0.0025 16.4 2.6 19 44-62 17-35 (106)
376 PF01936 NYN: NYN domain; Int 23.9 94 0.002 17.1 2.3 30 28-64 97-126 (146)
377 cd03799 GT1_amsK_like This is 23.7 1.9E+02 0.0041 17.9 5.6 40 28-67 179-218 (355)
378 PF03979 Sigma70_r1_1: Sigma-7 23.6 94 0.002 16.2 2.1 22 39-60 34-55 (82)
379 TIGR00756 PPR pentatricopeptid 23.4 67 0.0015 12.6 2.5 18 42-59 15-32 (35)
380 cd03820 GT1_amsD_like This fam 23.3 1.8E+02 0.0039 17.5 5.5 41 27-67 177-217 (348)
381 PRK15348 type III secretion sy 23.3 2E+02 0.0043 18.6 3.9 22 42-63 28-49 (249)
382 PRK00002 aroB 3-dehydroquinate 23.0 2.4E+02 0.0051 18.7 5.1 37 28-67 32-68 (358)
383 PF13439 Glyco_transf_4: Glyco 23.0 1.4E+02 0.0031 16.2 3.2 22 44-65 16-37 (177)
384 PRK07708 hypothetical protein; 22.9 1.9E+02 0.0041 18.1 3.7 30 38-67 21-50 (219)
385 TIGR00854 pts-sorbose PTS syst 22.9 1.4E+02 0.003 17.6 2.9 23 45-67 124-146 (151)
386 cd02014 TPP_POX Thiamine pyrop 22.9 1.7E+02 0.0037 17.1 3.6 38 27-68 69-106 (178)
387 PRK08299 isocitrate dehydrogen 22.9 2.7E+02 0.0059 19.4 5.1 37 27-66 7-43 (402)
388 PF12122 DUF3582: Protein of u 22.8 1.5E+02 0.0032 16.3 3.1 24 42-65 9-32 (101)
389 cd00215 PTS_IIA_lac PTS_IIA, P 22.8 56 0.0012 17.9 1.2 28 27-55 66-93 (97)
390 PF02780 Transketolase_C: Tran 22.7 1.5E+02 0.0032 16.2 3.3 21 43-63 21-41 (124)
391 PF03446 NAD_binding_2: NAD bi 22.7 96 0.0021 17.9 2.3 32 25-60 84-115 (163)
392 cd03064 TRX_Fd_NuoE TRX-like [ 22.7 86 0.0019 15.9 1.9 26 27-52 54-79 (80)
393 PF02223 Thymidylate_kin: Thym 22.6 1.5E+02 0.0033 17.2 3.2 22 44-66 11-32 (186)
394 PRK09628 oorB 2-oxoglutarate-a 22.5 2.1E+02 0.0044 18.6 3.9 39 27-68 86-124 (277)
395 cd04446 DEP_DEPDC4 DEP (Dishev 22.4 1.5E+02 0.0033 16.3 2.9 23 38-60 49-71 (95)
396 PLN02711 Probable galactinol-- 22.4 1.8E+02 0.004 22.1 3.9 30 39-68 225-254 (777)
397 TIGR01133 murG undecaprenyldip 22.4 1.8E+02 0.0039 18.4 3.6 32 30-63 3-34 (348)
398 PF13362 Toprim_3: Toprim doma 22.4 1.3E+02 0.0029 15.6 4.6 38 28-65 41-79 (96)
399 COG4022 Uncharacterized protei 22.4 1.3E+02 0.0028 19.7 2.9 31 37-67 74-104 (286)
400 PF05343 Peptidase_M42: M42 gl 22.3 2.3E+02 0.0051 18.4 4.4 37 31-67 211-247 (292)
401 cd03143 A4_beta-galactosidase_ 22.2 1.6E+02 0.0036 16.6 3.3 24 40-63 22-45 (154)
402 cd03809 GT1_mtfB_like This fam 22.2 2E+02 0.0044 17.7 5.2 40 27-66 194-233 (365)
403 PF05990 DUF900: Alpha/beta hy 22.2 2.1E+02 0.0045 17.8 4.9 37 27-66 18-55 (233)
404 TIGR00624 tag DNA-3-methyladen 22.1 53 0.0011 20.1 1.1 24 37-60 128-153 (179)
405 PRK05782 bifunctional sirohydr 22.1 2.3E+02 0.005 19.1 4.1 28 28-56 8-35 (335)
406 COG4947 Uncharacterized protei 22.0 37 0.0008 21.3 0.4 42 23-66 169-210 (227)
407 COG1609 PurR Transcriptional r 22.0 2.4E+02 0.0052 18.4 4.6 33 28-60 176-208 (333)
408 cd01524 RHOD_Pyr_redox Member 22.0 1.3E+02 0.0028 15.3 3.1 22 45-68 63-84 (90)
409 PF00542 Ribosomal_L12: Riboso 21.9 1.3E+02 0.0028 15.3 4.3 31 27-62 37-67 (68)
410 PF13361 UvrD_C: UvrD-like hel 21.9 1.2E+02 0.0027 18.9 2.8 20 43-62 87-106 (351)
411 TIGR02617 tnaA_trp_ase tryptop 21.9 1.4E+02 0.0031 21.2 3.2 20 43-62 331-350 (467)
412 TIGR00823 EIIA-LAC phosphotran 21.8 57 0.0012 18.0 1.1 28 27-55 68-95 (99)
413 PRK09864 putative peptidase; P 21.8 2.7E+02 0.0058 18.9 5.4 42 25-67 247-288 (356)
414 PF13471 Transglut_core3: Tran 21.6 1.6E+02 0.0034 16.1 3.4 22 43-64 59-80 (117)
415 PRK10329 glutaredoxin-like pro 21.6 1.4E+02 0.0029 15.4 3.0 21 45-65 13-33 (81)
416 cd01439 TCCD_inducible_PARP_li 21.5 25 0.00055 19.8 -0.4 12 31-42 1-12 (121)
417 PRK05434 phosphoglyceromutase; 21.5 3.1E+02 0.0068 19.6 6.0 49 19-68 102-153 (507)
418 PRK12475 thiamine/molybdopteri 21.5 1.4E+02 0.003 19.9 3.1 19 31-53 318-336 (338)
419 KOG2650|consensus 21.4 1.1E+02 0.0024 21.3 2.6 31 35-65 63-93 (418)
420 PF00258 Flavodoxin_1: Flavodo 21.4 1.4E+02 0.003 16.5 2.7 21 45-65 13-33 (143)
421 PF13353 Fer4_12: 4Fe-4S singl 21.3 1.6E+02 0.0034 16.1 6.3 45 19-65 45-91 (139)
422 PLN00103 isocitrate dehydrogen 21.3 2.9E+02 0.0064 19.3 4.6 37 27-66 9-45 (410)
423 TIGR00182 plsX fatty acid/phos 21.2 1.3E+02 0.0028 20.0 2.8 32 27-59 280-311 (322)
424 cd06589 GH31 The enzymes of gl 21.1 1.8E+02 0.0038 18.4 3.4 29 39-67 19-47 (265)
425 PF11775 CobT_C: Cobalamin bio 21.0 1.8E+02 0.004 18.5 3.3 22 45-66 36-57 (219)
426 KOG4081|consensus 21.0 1.8E+02 0.0039 16.7 3.0 28 37-64 37-66 (115)
427 PLN02834 3-dehydroquinate synt 20.9 3E+02 0.0065 19.1 5.0 37 28-67 101-139 (433)
428 COG4635 HemG Flavodoxin [Energ 20.8 2.2E+02 0.0048 17.5 4.4 34 29-63 2-35 (175)
429 PF04157 EAP30: EAP30/Vps36 fa 20.7 1.8E+02 0.0039 18.0 3.3 28 40-67 132-159 (223)
430 KOG3975|consensus 20.7 1.2E+02 0.0025 20.2 2.4 36 22-58 238-273 (301)
431 cd08195 DHQS Dehydroquinate sy 20.6 2.6E+02 0.0057 18.4 5.2 37 28-67 25-61 (345)
432 KOG3101|consensus 20.6 1.1E+02 0.0025 19.9 2.4 35 8-42 194-230 (283)
433 cd05844 GT1_like_7 Glycosyltra 20.5 2.4E+02 0.0051 17.8 5.4 40 28-67 188-227 (367)
434 cd01822 Lysophospholipase_L1_l 20.3 1.8E+02 0.0039 16.4 4.9 34 29-62 67-106 (177)
435 cd04702 ASRGL1_like ASRGL1_lik 20.3 2.3E+02 0.0049 18.5 3.7 18 28-46 1-18 (261)
436 PF13104 DUF3956: Protein of u 20.3 65 0.0014 15.1 0.9 33 28-60 11-43 (45)
437 cd06597 GH31_transferase_CtsY 20.3 1.9E+02 0.0042 19.1 3.5 26 40-65 20-45 (340)
438 smart00854 PGA_cap Bacterial c 20.2 1.9E+02 0.0042 17.8 3.4 34 27-60 174-208 (239)
439 PF03850 Tfb4: Transcription f 20.2 2.6E+02 0.0057 18.2 5.0 42 24-66 142-184 (276)
440 PLN02297 ribose-phosphate pyro 20.2 2.8E+02 0.0062 18.6 5.3 38 25-63 64-101 (326)
441 TIGR01742 SA_tandem_lipo Staph 20.0 84 0.0018 20.5 1.7 14 27-40 210-223 (255)
442 PRK11274 glcF glycolate oxidas 20.0 2.2E+02 0.0047 19.0 3.8 36 27-62 166-201 (407)
No 1
>COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]
Probab=99.64 E-value=7.7e-16 Score=106.14 Aligned_cols=59 Identities=24% Similarity=0.250 Sum_probs=56.9
Q ss_pred CHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 9 NLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 9 ~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.+.|...||+.+++++ ++|+||+||+.|.+||++|+++|+++|+.+|+++++++||++
T Consensus 534 ~~~~~~~~sp~~~~~~i-~~P~LliHG~~D~~v~~~q~~~~~~aL~~~g~~~~~~~~p~e 592 (620)
T COG1506 534 DREKYEDRSPIFYADNI-KTPLLLIHGEEDDRVPIEQAEQLVDALKRKGKPVELVVFPDE 592 (620)
T ss_pred ChHHHHhcChhhhhccc-CCCEEEEeecCCccCChHHHHHHHHHHHHcCceEEEEEeCCC
Confidence 67899999999999999 699999999999999999999999999999999999999985
No 2
>PF00326 Peptidase_S9: Prolyl oligopeptidase family This family belongs to family S9 of the peptidase classification.; InterPro: IPR001375 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain covers the active site serine of the serine peptidases belonging to MEROPS peptidase family S9 (prolyl oligopeptidase family, clan SC). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Examples of protein families containing this domain are: Prolyl endopeptidase (3.4.21.26 from EC) (PE) (also called post-proline cleaving enzyme). PE is an enzyme that cleaves peptide bonds on the C-terminal side of prolyl residues. The sequence of PE has been obtained from a mammalian species (pig) and from bacteria (Flavobacterium meningosepticum and Aeromonas hydrophila); there is a high degree of sequence conservation between these sequences. Escherichia coli protease II (3.4.21.83 from EC) (oligopeptidase B) (gene prtB) which cleaves peptide bonds on the C-terminal side of lysyl and argininyl residues. Dipeptidyl peptidase IV (3.4.14.5 from EC) (DPP IV). DPP IV is an enzyme that removes N-terminal dipeptides sequentially from polypeptides having unsubstituted N-termini provided that the penultimate residue is proline. Saccharomyces cerevisiae (Baker's yeast) vacuolar dipeptidyl aminopeptidases A and B (DPAP A and DPAP B), encoded by the STE13 and DAP2 genes respectively. DPAP A is responsible for the proteolytic maturation of the alpha-factor precursor. Acylamino-acid-releasing enzyme (3.4.19.1 from EC) (acyl-peptide hydrolase). This enzyme catalyses the hydrolysis of the amino-terminal peptide bond of an N-acetylated protein to generate a N-acetylated amino acid and a protein with a free amino-terminus. These proteins belong to MEROPS peptidase families S9A, S9B and S9C.; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis; PDB: 2AJ8_D 1ORV_D 2AJB_C 2BUC_D 1ORW_D 2AJC_D 2AJD_C 2BUA_A 2HU8_B 3O4J_B ....
Probab=99.64 E-value=8.7e-16 Score=93.08 Aligned_cols=61 Identities=26% Similarity=0.214 Sum_probs=57.7
Q ss_pred CCCHHHHHhcChhHHhhc--CCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 7 EDNLEGYKIAALNNKVDR--IRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 7 ~~~~~~y~~~sp~~~~~~--~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+++++.|+..||+..+.+ . .+|+||+||++|++||++|+.+|+++|+++|+++++++||++
T Consensus 123 ~~~~~~~~~~s~~~~~~~~~~-~~P~li~hG~~D~~Vp~~~s~~~~~~L~~~g~~~~~~~~p~~ 185 (213)
T PF00326_consen 123 WDNPEFYRELSPISPADNVQI-KPPVLIIHGENDPRVPPSQSLRLYNALRKAGKPVELLIFPGE 185 (213)
T ss_dssp TTSHHHHHHHHHGGGGGGCGG-GSEEEEEEETTBSSSTTHHHHHHHHHHHHTTSSEEEEEETT-
T ss_pred chhhhhhhhhccccccccccC-CCCEEEEccCCCCccCHHHHHHHHHHHHhcCCCEEEEEcCcC
Confidence 678999999999999999 7 699999999999999999999999999999999999999985
No 3
>KOG2100|consensus
Probab=99.57 E-value=6.1e-15 Score=103.75 Aligned_cols=66 Identities=42% Similarity=0.687 Sum_probs=61.4
Q ss_pred CCCCCcCCCHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 1 MGLPTFEDNLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 1 lG~p~~~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
||.| .++...|+++++..+++.++.+.+||+||+.|++||++|+.+++++|+.+|+++++++||++
T Consensus 658 mg~p--~~~~~~y~e~~~~~~~~~~~~~~~LliHGt~DdnVh~q~s~~~~~aL~~~gv~~~~~vypde 723 (755)
T KOG2100|consen 658 MGLP--SENDKGYEESSVSSPANNIKTPKLLLIHGTEDDNVHFQQSAILIKALQNAGVPFRLLVYPDE 723 (755)
T ss_pred cCCC--ccccchhhhccccchhhhhccCCEEEEEcCCcCCcCHHHHHHHHHHHHHCCCceEEEEeCCC
Confidence 7999 88888899999999999996555699999999999999999999999999999999999985
No 4
>KOG2281|consensus
Probab=99.41 E-value=4.6e-13 Score=93.03 Aligned_cols=66 Identities=33% Similarity=0.603 Sum_probs=62.0
Q ss_pred CCCCCcCCCHHHHHhcChhHHhhcCCC--CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 1 MGLPTFEDNLEGYKIAALNNKVDRIRD--KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 1 lG~p~~~~~~~~y~~~sp~~~~~~~~~--~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
||.| +.+.++|...|...++.++.+ ..+||+||.-|++||+.|+..|+++|.++|+++|+++||++
T Consensus 776 Mg~P--~~nE~gY~agSV~~~VeklpdepnRLlLvHGliDENVHF~Hts~Lvs~lvkagKpyeL~IfP~E 843 (867)
T KOG2281|consen 776 MGYP--DNNEHGYGAGSVAGHVEKLPDEPNRLLLVHGLIDENVHFAHTSRLVSALVKAGKPYELQIFPNE 843 (867)
T ss_pred cCCC--ccchhcccchhHHHHHhhCCCCCceEEEEecccccchhhhhHHHHHHHHHhCCCceEEEEcccc
Confidence 8999 889999999999999999853 55999999999999999999999999999999999999985
No 5
>PRK10115 protease 2; Provisional
Probab=99.16 E-value=6.4e-11 Score=82.93 Aligned_cols=61 Identities=18% Similarity=0.143 Sum_probs=52.0
Q ss_pred CCCCcCCC--HHHHHhcChhHHhhcCCCCc-EEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 2 GLPTFEDN--LEGYKIAALNNKVDRIRDKQ-YLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 2 G~p~~~~~--~~~y~~~sp~~~~~~~~~~p-~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
|.| ++. .+.++..||++++++. ..| +|++||..|++||+.|+.+|+.+|+++|++.++++|
T Consensus 581 G~p--~~~~~~~~l~~~SP~~~v~~~-~~P~lLi~~g~~D~RV~~~~~~k~~a~Lr~~~~~~~~vl~ 644 (686)
T PRK10115 581 GNP--QDPQYYEYMKSYSPYDNVTAQ-AYPHLLVTTGLHDSQVQYWEPAKWVAKLRELKTDDHLLLL 644 (686)
T ss_pred CCC--CCHHHHHHHHHcCchhccCcc-CCCceeEEecCCCCCcCchHHHHHHHHHHhcCCCCceEEE
Confidence 677 432 2334678999999998 477 778899999999999999999999999999999988
No 6
>PLN02442 S-formylglutathione hydrolase
Probab=99.00 E-value=1e-09 Score=69.68 Aligned_cols=60 Identities=13% Similarity=0.127 Sum_probs=51.1
Q ss_pred CCCHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCH-HHHHHHHHHHHHcCCceEEEEecC
Q psy472 7 EDNLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHF-QQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 7 ~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~-~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+..+.|...+|+..+... ++|+|++||+.|+.|+. .++..|+++|++.|.++++++||+
T Consensus 198 ~~~~~~~d~~~~~~~~~~~-~~pvli~~G~~D~~v~~~~~s~~~~~~l~~~g~~~~~~~~pg 258 (283)
T PLN02442 198 KADWEEYDATELVSKFNDV-SATILIDQGEADKFLKEQLLPENFEEACKEAGAPVTLRLQPG 258 (283)
T ss_pred hhhHHHcChhhhhhhcccc-CCCEEEEECCCCccccccccHHHHHHHHHHcCCCeEEEEeCC
Confidence 4445666777777777666 69999999999999998 479999999999999999999997
No 7
>PRK10566 esterase; Provisional
Probab=98.82 E-value=2.6e-08 Score=61.13 Aligned_cols=52 Identities=21% Similarity=0.303 Sum_probs=42.7
Q ss_pred ChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCc--eEEEEecCC
Q psy472 17 ALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIM--FQSQECYRR 68 (68)
Q Consensus 17 sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~--~~~~~~~~~ 68 (68)
++...+.++...|+|++||++|+.||++++.+++++|+.+|.+ ++++.|++.
T Consensus 176 ~~~~~~~~i~~~P~Lii~G~~D~~v~~~~~~~l~~~l~~~g~~~~~~~~~~~~~ 229 (249)
T PRK10566 176 EVTHQLEQLADRPLLLWHGLADDVVPAAESLRLQQALRERGLDKNLTCLWEPGV 229 (249)
T ss_pred ChhhhhhhcCCCCEEEEEcCCCCcCCHHHHHHHHHHHHhcCCCcceEEEecCCC
Confidence 3444455542489999999999999999999999999999985 788888873
No 8
>PRK11460 putative hydrolase; Provisional
Probab=98.81 E-value=1.7e-08 Score=62.54 Aligned_cols=42 Identities=12% Similarity=0.061 Sum_probs=39.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.+|+|++||++|+.||++.+..++++|++.|.++++++|+++
T Consensus 148 ~~pvli~hG~~D~vvp~~~~~~~~~~L~~~g~~~~~~~~~~~ 189 (232)
T PRK11460 148 ATTIHLIHGGEDPVIDVAHAVAAQEALISLGGDVTLDIVEDL 189 (232)
T ss_pred CCcEEEEecCCCCccCHHHHHHHHHHHHHCCCCeEEEEECCC
Confidence 589999999999999999999999999999999999999863
No 9
>PRK13604 luxD acyl transferase; Provisional
Probab=98.69 E-value=5.4e-08 Score=63.20 Aligned_cols=51 Identities=14% Similarity=0.140 Sum_probs=43.6
Q ss_pred hcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 15 IAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 15 ~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
..+|+..++++ +.|+|+|||+.|+.||++++..|+++++. -+.++.+|+|+
T Consensus 191 ~~s~i~~~~~l-~~PvLiIHG~~D~lVp~~~s~~l~e~~~s--~~kkl~~i~Ga 241 (307)
T PRK13604 191 LDSTINKMKGL-DIPFIAFTANNDSWVKQSEVIDLLDSIRS--EQCKLYSLIGS 241 (307)
T ss_pred cccHHHHHhhc-CCCEEEEEcCCCCccCHHHHHHHHHHhcc--CCcEEEEeCCC
Confidence 46788889889 59999999999999999999999997543 56788888874
No 10
>PF02230 Abhydrolase_2: Phospholipase/Carboxylesterase; InterPro: IPR003140 This entry represents the alpha/beta hydrolase domain found in phospholipases [], carboxylesterases [] and thioesterases.; GO: 0016787 hydrolase activity; PDB: 3U0V_A 1AUR_A 1AUO_B 1FJ2_B 3CN9_A 3CN7_A.
Probab=98.68 E-value=6.4e-08 Score=59.11 Aligned_cols=41 Identities=24% Similarity=0.253 Sum_probs=36.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..|+|++||+.|++||++.+...++.|++.+.+++++.|++
T Consensus 155 ~~pi~~~hG~~D~vvp~~~~~~~~~~L~~~~~~v~~~~~~g 195 (216)
T PF02230_consen 155 KTPILIIHGDEDPVVPFEWAEKTAEFLKAAGANVEFHEYPG 195 (216)
T ss_dssp TS-EEEEEETT-SSSTHHHHHHHHHHHHCTT-GEEEEEETT
T ss_pred CCcEEEEecCCCCcccHHHHHHHHHHHHhcCCCEEEEEcCC
Confidence 47999999999999999999999999999999999999994
No 11
>PRK05371 x-prolyl-dipeptidyl aminopeptidase; Provisional
Probab=98.65 E-value=1.4e-07 Score=67.13 Aligned_cols=57 Identities=19% Similarity=0.107 Sum_probs=51.9
Q ss_pred HHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 11 EGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 11 ~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+-|...|++.+++++ +.|+|++||..|.+|++.++.+++++|+++|++.++.+.+++
T Consensus 440 ~fW~~rn~~~~~~kI-kvPvLlIhGw~D~~V~~~~s~~ly~aL~~~g~pkkL~l~~g~ 496 (767)
T PRK05371 440 DFWDDRNYLKDADKI-KASVLVVHGLNDWNVKPKQVYQWWDALPENGVPKKLFLHQGG 496 (767)
T ss_pred HHHHhCCHhhHhhCC-CCCEEEEeeCCCCCCChHHHHHHHHHHHhcCCCeEEEEeCCC
Confidence 456788999999999 699999999999999999999999999999999999887753
No 12
>TIGR02821 fghA_ester_D S-formylglutathione hydrolase. This model describes a protein family from bacteria, yeast, and human, with a conserved critical role in formaldehyde detoxification as S-formylglutathione hydrolase (EC 3.1.2.12). Members in eukaryotes such as the human protein are better known as esterase D (EC 3.1.1.1), an enzyme with broad specificity, although S-formylglutathione hydrolase has now been demonstrated as well.
Probab=98.63 E-value=1e-07 Score=60.20 Aligned_cols=55 Identities=13% Similarity=0.067 Sum_probs=45.9
Q ss_pred HHhcChhHHhhcC-CCCcEEEEEeCCCCccCH-HHHHHHHHHHHHcCCceEEEEecC
Q psy472 13 YKIAALNNKVDRI-RDKQYLLVHGTMDDNVHF-QQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 13 y~~~sp~~~~~~~-~~~p~ll~hG~~D~~V~~-~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+...||...+... ..+|+++.||+.|+.|+. .++..+.++|+++|+++++..|||
T Consensus 196 ~~~~~~~~~~~~~~~~~plli~~G~~D~~v~~~~~~~~~~~~l~~~g~~v~~~~~~g 252 (275)
T TIGR02821 196 WRSYDASLLVADGGRHSTILIDQGTADQFLDEQLRPDAFEQACRAAGQALTLRRQAG 252 (275)
T ss_pred hhhcchHHHHhhcccCCCeeEeecCCCcccCccccHHHHHHHHHHcCCCeEEEEeCC
Confidence 4455666655442 148999999999999999 799999999999999999999997
No 13
>COG0400 Predicted esterase [General function prediction only]
Probab=98.62 E-value=1.1e-07 Score=58.73 Aligned_cols=43 Identities=23% Similarity=0.156 Sum_probs=40.5
Q ss_pred CCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 26 RDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 26 ~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.+|+|++||+.|++||...+.++.+.|++.|..++...|++|
T Consensus 145 ~~~pill~hG~~Dpvvp~~~~~~l~~~l~~~g~~v~~~~~~~G 187 (207)
T COG0400 145 AGTPILLSHGTEDPVVPLALAEALAEYLTASGADVEVRWHEGG 187 (207)
T ss_pred CCCeEEEeccCcCCccCHHHHHHHHHHHHHcCCCEEEEEecCC
Confidence 3699999999999999999999999999999999999999854
No 14
>PF01738 DLH: Dienelactone hydrolase family; InterPro: IPR002925 Dienelactone hydrolases play a crucial role in chlorocatechol degradation via the modified ortho cleavage pathway. Enzymes induced in 4-fluorobenzoate-utilizing bacteria have been classified into three groups on the basis of their specificity towards cis- and trans-dienelactone []. Some proteins contain repeated small fragments of this domain (for example rat kan-1 protein).; GO: 0016787 hydrolase activity; PDB: 1GGV_A 1ZIY_A 1ZI6_A 1ZIC_A 1ZJ5_A 1ZI8_A 1ZJ4_A 1ZI9_A 1ZIX_A 3F67_A.
Probab=98.56 E-value=3.2e-07 Score=55.84 Aligned_cols=50 Identities=14% Similarity=0.123 Sum_probs=40.2
Q ss_pred hhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 18 LNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 18 p~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.....+. +.|+|+++|..|+.++.+....+.++|+++|.++++++|||.
T Consensus 137 ~~~~~~~~-~~P~l~~~g~~D~~~~~~~~~~~~~~l~~~~~~~~~~~y~ga 186 (218)
T PF01738_consen 137 PLEDAPKI-KAPVLILFGENDPFFPPEEVEALEEALKAAGVDVEVHVYPGA 186 (218)
T ss_dssp HHHHGGG---S-EEEEEETT-TTS-HHHHHHHHHHHHCTTTTEEEEEETT-
T ss_pred chhhhccc-CCCEeecCccCCCCCChHHHHHHHHHHHhcCCcEEEEECCCC
Confidence 44456677 599999999999999999999999999999999999999973
No 15
>PF07859 Abhydrolase_3: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR013094 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents the catalytic domain fold-3 of alpha/beta hydrolase. ; GO: 0016787 hydrolase activity, 0008152 metabolic process; PDB: 3D7R_B 2C7B_B 3ZWQ_B 2YH2_B 3BXP_A 3D3N_A 1LZK_A 1LZL_A 2O7V_A 2O7R_A ....
Probab=98.37 E-value=9.9e-07 Score=52.99 Aligned_cols=51 Identities=22% Similarity=0.151 Sum_probs=40.8
Q ss_pred hcChhHH-hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 15 IAALNNK-VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 15 ~~sp~~~-~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
..||+.. ..+. -+|++|++|+.|..+ .++..|+++|++.|+++++++|++.
T Consensus 154 ~~sp~~~~~~~~-~Pp~~i~~g~~D~l~--~~~~~~~~~L~~~gv~v~~~~~~g~ 205 (211)
T PF07859_consen 154 LASPLNASDLKG-LPPTLIIHGEDDVLV--DDSLRFAEKLKKAGVDVELHVYPGM 205 (211)
T ss_dssp TTSGGGSSCCTT-CHEEEEEEETTSTTH--HHHHHHHHHHHHTT-EEEEEEETTE
T ss_pred cccccccccccc-CCCeeeeccccccch--HHHHHHHHHHHHCCCCEEEEEECCC
Confidence 5677765 1112 389999999999865 6899999999999999999999873
No 16
>PF03583 LIP: Secretory lipase ; InterPro: IPR005152 This entry represents a family of secreted lipases. Family members include the LIP lipases from Candida albicans, which are expressed and secreted during the infection cycle of these pathogens [].; GO: 0004806 triglyceride lipase activity, 0016042 lipid catabolic process
Probab=98.35 E-value=1.6e-06 Score=55.69 Aligned_cols=41 Identities=17% Similarity=0.062 Sum_probs=38.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcC-CceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHAD-IMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g-~~~~~~~~~~ 67 (68)
+.|++|.||..|++||+..+..++++++++| .+++++.|+.
T Consensus 219 ~~Pv~i~~g~~D~vvP~~~~~~l~~~~c~~G~a~V~~~~~~~ 260 (290)
T PF03583_consen 219 TVPVLIYQGTADEVVPPADTDALVAKWCAAGGADVEYVRYPG 260 (290)
T ss_pred CCCEEEEecCCCCCCChHHHHHHHHHHHHcCCCCEEEEecCC
Confidence 5899999999999999999999999999999 7999998874
No 17
>PRK10749 lysophospholipase L2; Provisional
Probab=98.32 E-value=1.9e-06 Score=55.54 Aligned_cols=47 Identities=11% Similarity=-0.029 Sum_probs=40.1
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC---ceEEEEecCC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI---MFQSQECYRR 68 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~---~~~~~~~~~~ 68 (68)
.+.++ +.|+|++||++|..|+++.+..+++++.+++. ..++++|+|.
T Consensus 254 ~~~~i-~~P~Lii~G~~D~vv~~~~~~~~~~~l~~~~~~~~~~~l~~~~ga 303 (330)
T PRK10749 254 GAGDI-TTPLLLLQAEEERVVDNRMHDRFCEARTAAGHPCEGGKPLVIKGA 303 (330)
T ss_pred hccCC-CCCEEEEEeCCCeeeCHHHHHHHHHHHhhcCCCCCCceEEEeCCC
Confidence 34567 69999999999999999999999999988774 4589999874
No 18
>COG1505 Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]
Probab=98.21 E-value=3.9e-06 Score=58.63 Aligned_cols=63 Identities=17% Similarity=0.110 Sum_probs=48.6
Q ss_pred CCCCcCCCHHHHHhcChhHHhhcC-CCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 2 GLPTFEDNLEGYKIAALNNKVDRI-RDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 2 G~p~~~~~~~~y~~~sp~~~~~~~-~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
|.|..-+..+...+-||.++++.- +-+|+||--|+.|++||+.|+..|+.+|++.|.++-+.+
T Consensus 554 G~Pd~P~d~~~l~~YSPy~nl~~g~kYP~~LITTs~~DDRVHPaHarKfaa~L~e~~~pv~~~e 617 (648)
T COG1505 554 GNPDDPEDRAFLLAYSPYHNLKPGQKYPPTLITTSLHDDRVHPAHARKFAAKLQEVGAPVLLRE 617 (648)
T ss_pred CCCCCHHHHHHHHhcCchhcCCccccCCCeEEEcccccccccchHHHHHHHHHHhcCCceEEEe
Confidence 566211222355677999998873 239999999999999999999999999999997665543
No 19
>PF12695 Abhydrolase_5: Alpha/beta hydrolase family; PDB: 3D0K_B 2I3D_B 3DOH_B 3DOI_B 3PFB_A 3S2Z_B 3PFC_A 3QM1_A 3PF8_B 3PF9_A ....
Probab=98.06 E-value=1.6e-05 Score=44.70 Aligned_cols=43 Identities=21% Similarity=0.084 Sum_probs=36.2
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.... ..|+|++||+.|+.++++++..+++++. .+.++++++++
T Consensus 100 ~~~~-~~pv~~i~g~~D~~~~~~~~~~~~~~~~---~~~~~~~i~g~ 142 (145)
T PF12695_consen 100 LAKI-RIPVLFIHGENDPLVPPEQVRRLYEALP---GPKELYIIPGA 142 (145)
T ss_dssp HTTT-TSEEEEEEETT-SSSHHHHHHHHHHHHC---SSEEEEEETTS
T ss_pred hhcc-CCcEEEEEECCCCcCCHHHHHHHHHHcC---CCcEEEEeCCC
Confidence 4455 5899999999999999999999998877 77888998874
No 20
>PLN02298 hydrolase, alpha/beta fold family protein
Probab=98.03 E-value=1.4e-05 Score=51.20 Aligned_cols=43 Identities=12% Similarity=0.105 Sum_probs=35.1
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.++ +.|+||+||+.|..||++++..+++++... ..++++|++
T Consensus 247 l~~i-~~PvLii~G~~D~ivp~~~~~~l~~~i~~~--~~~l~~~~~ 289 (330)
T PLN02298 247 LKDV-SIPFIVLHGSADVVTDPDVSRALYEEAKSE--DKTIKIYDG 289 (330)
T ss_pred hhhc-CCCEEEEecCCCCCCCHHHHHHHHHHhccC--CceEEEcCC
Confidence 4567 599999999999999999999999887543 356777776
No 21
>PRK10162 acetyl esterase; Provisional
Probab=98.01 E-value=1.9e-05 Score=51.03 Aligned_cols=39 Identities=10% Similarity=0.068 Sum_probs=36.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+|++|++|+.|..+ .++..|+++|+++|+++++++|++.
T Consensus 249 Pp~~i~~g~~D~L~--de~~~~~~~L~~aGv~v~~~~~~g~ 287 (318)
T PRK10162 249 PPCFIAGAEFDPLL--DDSRLLYQTLAAHQQPCEFKLYPGT 287 (318)
T ss_pred CCeEEEecCCCcCc--ChHHHHHHHHHHcCCCEEEEEECCC
Confidence 89999999999987 5899999999999999999999974
No 22
>PHA02857 monoglyceride lipase; Provisional
Probab=97.96 E-value=2.1e-05 Score=48.91 Aligned_cols=43 Identities=9% Similarity=0.128 Sum_probs=35.4
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.++ +.|+|++||++|..||+..+..+.+.+.. ..++.+|+++
T Consensus 205 l~~i-~~Pvliv~G~~D~i~~~~~~~~l~~~~~~---~~~~~~~~~~ 247 (276)
T PHA02857 205 IPKI-KTPILILQGTNNEISDVSGAYYFMQHANC---NREIKIYEGA 247 (276)
T ss_pred cccC-CCCEEEEecCCCCcCChHHHHHHHHHccC---CceEEEeCCC
Confidence 4566 59999999999999999999999887633 4678888763
No 23
>PF08840 BAAT_C: BAAT / Acyl-CoA thioester hydrolase C terminal; InterPro: IPR014940 Acyl-CoA thioesterases are a group of enzymes that catalyse the hydrolysis of acyl-CoAs to the free fatty acid and coenzyme A (CoASH), providing the potential to regulate intracellular levels of acyl-CoAs, free fatty acids and CoASH. Bile acid-CoA:amino acid N-acetyltransferase (BAAT) is involved in bile acid metabolism and may also act as an acyl-CoA thioesterase that regulates intracellular levels of free fatty acids []. This entry represents a catalytic domain is found at the C terminus of acyl-CoA thioester hydrolases and bile acid-CoA:amino acid N-acetyltransferases. ; PDB: 3K2I_B 3HLK_B.
Probab=97.93 E-value=3.2e-05 Score=47.61 Aligned_cols=46 Identities=17% Similarity=0.195 Sum_probs=32.9
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHH-HHHHHHHHHHcCCc--eEEEEecCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQ-SMMLAKSLQHADIM--FQSQECYRR 68 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~-s~~l~~~L~~~g~~--~~~~~~~~~ 68 (68)
++++ +.|+|+++|++|.+.|-.. +..+.+.|+++|.+ ++.+.|+++
T Consensus 111 vE~i-~~piLli~g~dD~~WpS~~~a~~i~~rL~~~~~~~~~~~l~Y~~a 159 (213)
T PF08840_consen 111 VEKI-KGPILLISGEDDQIWPSSEMAEQIEERLKAAGFPHNVEHLSYPGA 159 (213)
T ss_dssp GGG---SEEEEEEETT-SSS-HHHHHHHHHHHHHCTT-----EEEEETTB
T ss_pred HHHc-CCCEEEEEeCCCCccchHHHHHHHHHHHHHhCCCCcceEEEcCCC
Confidence 7778 6999999999999998854 55677889999986 788889874
No 24
>KOG1552|consensus
Probab=97.83 E-value=8.4e-06 Score=51.94 Aligned_cols=37 Identities=27% Similarity=0.433 Sum_probs=32.5
Q ss_pred hHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHc
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHA 56 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~ 56 (68)
+..++.+ +.|+|++||+.|++|++.|..+|+++.+..
T Consensus 185 i~kI~~i-~~PVLiiHgtdDevv~~sHg~~Lye~~k~~ 221 (258)
T KOG1552|consen 185 IEKISKI-TCPVLIIHGTDDEVVDFSHGKALYERCKEK 221 (258)
T ss_pred cCcceec-cCCEEEEecccCceecccccHHHHHhcccc
Confidence 5556667 699999999999999999999999987775
No 25
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=97.81 E-value=9e-05 Score=43.95 Aligned_cols=43 Identities=16% Similarity=0.096 Sum_probs=34.5
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++ +.|+|+++|+.|..||++.+..+.+.+. ..+++++++
T Consensus 182 ~~l~~i-~~Pvlii~g~~D~~~~~~~~~~~~~~~~----~~~~~~~~~ 224 (245)
T TIGR01738 182 QPLQNI-SVPFLRLYGYLDGLVPAKVVPYLDKLAP----HSELYIFAK 224 (245)
T ss_pred HHHhcC-CCCEEEEeecCCcccCHHHHHHHHHhCC----CCeEEEeCC
Confidence 446678 5999999999999999998888876543 567778875
No 26
>PLN02385 hydrolase; alpha/beta fold family protein
Probab=97.80 E-value=5.7e-05 Score=48.91 Aligned_cols=43 Identities=14% Similarity=0.146 Sum_probs=34.9
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.++ +.|+|++||++|..||+..+..+++.+... ..++++|++
T Consensus 275 l~~i-~~P~Lii~G~~D~vv~~~~~~~l~~~~~~~--~~~l~~i~~ 317 (349)
T PLN02385 275 LEEV-SLPLLILHGEADKVTDPSVSKFLYEKASSS--DKKLKLYED 317 (349)
T ss_pred cccC-CCCEEEEEeCCCCccChHHHHHHHHHcCCC--CceEEEeCC
Confidence 4567 699999999999999999999999876433 356777776
No 27
>COG1073 Hydrolases of the alpha/beta superfamily [General function prediction only]
Probab=97.78 E-value=6e-05 Score=46.29 Aligned_cols=53 Identities=17% Similarity=0.159 Sum_probs=42.7
Q ss_pred HHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 13 YKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 13 y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+...++...+..+...|+|++||..|..||..++..++++.... +.+..++++
T Consensus 218 ~~~~d~~~~~~~i~~~P~l~~~G~~D~~vp~~~~~~~~~~~~~~--~~~~~~~~~ 270 (299)
T COG1073 218 LLLLDPFDDAEKISPRPVLLVHGERDEVVPLRDAEDLYEAARER--PKKLLFVPG 270 (299)
T ss_pred hccCcchhhHhhcCCcceEEEecCCCcccchhhhHHHHhhhccC--CceEEEecC
Confidence 56667777788873369999999999999999999999988776 556666554
No 28
>TIGR01840 esterase_phb esterase, PHB depolymerase family. This model describes a subfamily among lipases of the ab-hydrolase family. This subfamily includes bacterial depolymerases for poly(3-hydroxybutyrate) (PHB) and related polyhydroxyalkanoates (PHA), as well as acetyl xylan esterases, feruloyl esterases, and others from fungi.
Probab=97.76 E-value=3.1e-05 Score=47.09 Aligned_cols=31 Identities=16% Similarity=0.122 Sum_probs=28.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
.+|++|+||++|.+||++++..++++|++..
T Consensus 168 ~p~~~i~hG~~D~vVp~~~~~~~~~~l~~~~ 198 (212)
T TIGR01840 168 TPIMSVVHGDADYTVLPGNADEIRDAMLKVY 198 (212)
T ss_pred CCeEEEEEcCCCceeCcchHHHHHHHHHHhc
Confidence 3667899999999999999999999999873
No 29
>PRK11071 esterase YqiA; Provisional
Probab=97.75 E-value=6.1e-05 Score=45.53 Aligned_cols=26 Identities=15% Similarity=0.245 Sum_probs=24.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKS 52 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~ 52 (68)
..|++++||++|+.||++++..++++
T Consensus 136 ~~~v~iihg~~De~V~~~~a~~~~~~ 161 (190)
T PRK11071 136 PDLIWLLQQTGDEVLDYRQAVAYYAA 161 (190)
T ss_pred hhhEEEEEeCCCCcCCHHHHHHHHHh
Confidence 47889999999999999999999985
No 30
>COG0657 Aes Esterase/lipase [Lipid metabolism]
Probab=97.74 E-value=7.8e-05 Score=47.63 Aligned_cols=50 Identities=14% Similarity=-0.014 Sum_probs=42.5
Q ss_pred hcChhHHhh-cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 15 IAALNNKVD-RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 15 ~~sp~~~~~-~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..||+.... .- -+|+|+++|+.|..++ ++..++++|+++|++++++.|++
T Consensus 233 ~~spl~~~~~~~-lPP~~i~~a~~D~l~~--~~~~~a~~L~~agv~~~~~~~~g 283 (312)
T COG0657 233 EASPLASDDLSG-LPPTLIQTAEFDPLRD--EGEAYAERLRAAGVPVELRVYPG 283 (312)
T ss_pred ccCccccccccC-CCCEEEEecCCCcchh--HHHHHHHHHHHcCCeEEEEEeCC
Confidence 466766442 11 3899999999999999 99999999999999999999987
No 31
>KOG2112|consensus
Probab=97.74 E-value=4.4e-05 Score=47.31 Aligned_cols=41 Identities=24% Similarity=0.191 Sum_probs=39.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..|+++.||+.|+.||..-.....+.|+..+..++++.|++
T Consensus 144 ~~~i~~~Hg~~d~~vp~~~g~~s~~~l~~~~~~~~f~~y~g 184 (206)
T KOG2112|consen 144 YTPILLCHGTADPLVPFRFGEKSAQFLKSLGVRVTFKPYPG 184 (206)
T ss_pred cchhheecccCCceeehHHHHHHHHHHHHcCCceeeeecCC
Confidence 47999999999999999999999999999999999999986
No 32
>PLN02652 hydrolase; alpha/beta fold family protein
Probab=97.71 E-value=0.00012 Score=48.95 Aligned_cols=44 Identities=14% Similarity=0.070 Sum_probs=35.1
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.++ +.|+|++||++|..||++++..+++++... ..++++|++.
T Consensus 320 L~~I-~vPvLIi~G~~D~vvp~~~a~~l~~~~~~~--~k~l~~~~ga 363 (395)
T PLN02652 320 FKSV-TVPFMVLHGTADRVTDPLASQDLYNEAASR--HKDIKLYDGF 363 (395)
T ss_pred cccC-CCCEEEEEeCCCCCCCHHHHHHHHHhcCCC--CceEEEECCC
Confidence 3556 599999999999999999999999876543 3466677763
No 33
>TIGR03611 RutD pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067.
Probab=97.62 E-value=0.00022 Score=42.80 Aligned_cols=44 Identities=30% Similarity=0.286 Sum_probs=33.8
Q ss_pred hHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
...+.++ +.|+|++||+.|..+|++++..+++.+. ..+++++++
T Consensus 191 ~~~~~~i-~~P~l~i~g~~D~~~~~~~~~~~~~~~~----~~~~~~~~~ 234 (257)
T TIGR03611 191 SARLDRI-QHPVLLIANRDDMLVPYTQSLRLAAALP----NAQLKLLPY 234 (257)
T ss_pred HHHhccc-CccEEEEecCcCcccCHHHHHHHHHhcC----CceEEEECC
Confidence 3446667 5999999999999999999998887653 235566654
No 34
>PRK06765 homoserine O-acetyltransferase; Provisional
Probab=97.60 E-value=0.00036 Score=46.57 Aligned_cols=49 Identities=14% Similarity=0.091 Sum_probs=41.8
Q ss_pred hhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 18 LNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 18 p~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
....+.++ +.|+|+++|+.|..+|++++..+++.+...+...+++++++
T Consensus 315 l~~~L~~I-~~PtLvI~G~~D~l~p~~~~~~la~~lp~~~~~a~l~~I~s 363 (389)
T PRK06765 315 LEEALSNI-EANVLMIPCKQDLLQPPRYNYKMVDILQKQGKYAEVYEIES 363 (389)
T ss_pred HHHHHhcC-CCCEEEEEeCCCCCCCHHHHHHHHHHhhhcCCCeEEEEECC
Confidence 44456677 59999999999999999999999999988777888888764
No 35
>TIGR02427 protocat_pcaD 3-oxoadipate enol-lactonase. Members of this family are 3-oxoadipate enol-lactonase. Note that the substrate is known as 3-oxoadipate enol-lactone, 2-oxo-2,3-dihydrofuran-5-acetate, 4,5-Dihydro-5-oxofuran-2-acetate, and 5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the catalyzes the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate all can be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this model were chosen from within protocatechuate degradation operons of at least three genes of the pathway, from genomes with the complete pathway through beta-ketoadipate.
Probab=97.60 E-value=0.00027 Score=41.90 Aligned_cols=44 Identities=14% Similarity=0.083 Sum_probs=33.7
Q ss_pred hHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
...+.++ +.|++++||..|..+|.+.+..+.+.+. ..+++++++
T Consensus 186 ~~~~~~~-~~Pvlii~g~~D~~~~~~~~~~~~~~~~----~~~~~~~~~ 229 (251)
T TIGR02427 186 RDRLGAI-AVPTLCIAGDQDGSTPPELVREIADLVP----GARFAEIRG 229 (251)
T ss_pred HHHhhhc-CCCeEEEEeccCCcCChHHHHHHHHhCC----CceEEEECC
Confidence 3445567 5999999999999999998888777653 346777765
No 36
>TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself.
Probab=97.57 E-value=0.00016 Score=44.75 Aligned_cols=42 Identities=19% Similarity=0.208 Sum_probs=34.6
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.++ +.|+|+++|+.|..|+.+.+.++++.+. .++++++++
T Consensus 218 ~l~~i-~~Pvlli~G~~D~~v~~~~~~~~~~~~~----~~~~~~i~~ 259 (282)
T TIGR03343 218 RLGEI-KAKTLVTWGRDDRFVPLDHGLKLLWNMP----DAQLHVFSR 259 (282)
T ss_pred HHhhC-CCCEEEEEccCCCcCCchhHHHHHHhCC----CCEEEEeCC
Confidence 35567 6999999999999999999998887764 577777775
No 37
>COG4099 Predicted peptidase [General function prediction only]
Probab=97.53 E-value=3.5e-05 Score=50.63 Aligned_cols=47 Identities=15% Similarity=0.169 Sum_probs=41.2
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
..++.+++.|++++|+..|..+|+++|.-++..|+..+.++.+..|-
T Consensus 308 ~lv~~lk~~piWvfhs~dDkv~Pv~nSrv~y~~lk~~~~kv~Ytaf~ 354 (387)
T COG4099 308 YLVRTLKKAPIWVFHSSDDKVIPVSNSRVLYERLKALDRKVNYTAFL 354 (387)
T ss_pred hhhhhhccCceEEEEecCCCccccCcceeehHHHHhhccccchhhhh
Confidence 44666667999999999999999999999999999999988877664
No 38
>TIGR01249 pro_imino_pep_1 proline iminopeptidase, Neisseria-type subfamily. This model represents one of two related families of proline iminopeptidase in the alpha/beta fold hydrolase family. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members.
Probab=97.43 E-value=0.00041 Score=44.16 Aligned_cols=43 Identities=21% Similarity=0.288 Sum_probs=33.3
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.++...|+|++||..|..||++.+..+++.+. ..+++++++
T Consensus 242 ~~~~i~~~P~lii~g~~D~~~p~~~~~~~~~~~~----~~~~~~~~~ 284 (306)
T TIGR01249 242 NISKIRNIPTYIVHGRYDLCCPLQSAWALHKAFP----EAELKVTNN 284 (306)
T ss_pred hhhhccCCCeEEEecCCCCCCCHHHHHHHHHhCC----CCEEEEECC
Confidence 3445524799999999999999999998887753 456777764
No 39
>TIGR01607 PST-A Plasmodium subtelomeric family (PST-A). These genes are preferentially located in the subtelomeric regions of the chromosomes of both P. falciparum and P. yoelii.
Probab=97.38 E-value=0.00044 Score=44.91 Aligned_cols=39 Identities=13% Similarity=0.129 Sum_probs=31.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.|+|++||+.|..|+++.+..+++++... ..++.+|++
T Consensus 270 ~~P~Lii~G~~D~vv~~~~~~~~~~~~~~~--~~~l~~~~g 308 (332)
T TIGR01607 270 DIPILFIHSKGDCVCSYEGTVSFYNKLSIS--NKELHTLED 308 (332)
T ss_pred CCCEEEEEeCCCCccCHHHHHHHHHhccCC--CcEEEEECC
Confidence 489999999999999999999888765432 456777776
No 40
>PRK05077 frsA fermentation/respiration switch protein; Reviewed
Probab=97.37 E-value=0.00057 Score=45.84 Aligned_cols=40 Identities=18% Similarity=-0.132 Sum_probs=31.2
Q ss_pred hcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 23 DRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 23 ~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.++ +.|+|++||+.|+.||++++..+++.. ...++.++|+
T Consensus 352 ~~i-~~PvLiI~G~~D~ivP~~~a~~l~~~~----~~~~l~~i~~ 391 (414)
T PRK05077 352 RRC-PTPMLSGYWKNDPFSPEEDSRLIASSS----ADGKLLEIPF 391 (414)
T ss_pred cCC-CCcEEEEecCCCCCCCHHHHHHHHHhC----CCCeEEEccC
Confidence 356 589999999999999999999877554 2455666665
No 41
>TIGR02240 PHA_depoly_arom poly(3-hydroxyalkanoate) depolymerase. This family consists of the polyhydroxyalkanoic acid (PHA) depolymerase of Pseudomonas oleovorans, Pseudomonas putida BM01, and related species. This enzyme is part of polyester storage and mobilization system as in many bacteria. However, species containing this enzyme are unusual in their capacity to produce aromatic polyesters when grown on carbon sources such as benzoic acid or phenylacetic acid.
Probab=97.34 E-value=0.00083 Score=41.88 Aligned_cols=43 Identities=14% Similarity=0.124 Sum_probs=33.4
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++ +.|+|+++|+.|..||++++..+.+.+. ..++.++++
T Consensus 201 ~~l~~i-~~P~lii~G~~D~~v~~~~~~~l~~~~~----~~~~~~i~~ 243 (276)
T TIGR02240 201 HWLHKI-QQPTLVLAGDDDPIIPLINMRLLAWRIP----NAELHIIDD 243 (276)
T ss_pred hHhhcC-CCCEEEEEeCCCCcCCHHHHHHHHHhCC----CCEEEEEcC
Confidence 345677 6999999999999999999999987653 345555553
No 42
>PRK10349 carboxylesterase BioH; Provisional
Probab=97.33 E-value=0.00099 Score=40.94 Aligned_cols=46 Identities=20% Similarity=0.158 Sum_probs=34.0
Q ss_pred ChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 17 ALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 17 sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+....+.++ +.|+|+++|..|..+|.+.+..+.+.+ ...++.++++
T Consensus 187 ~~~~~l~~i-~~P~lii~G~~D~~~~~~~~~~~~~~i----~~~~~~~i~~ 232 (256)
T PRK10349 187 DLRQPLQNV-SMPFLRLYGYLDGLVPRKVVPMLDKLW----PHSESYIFAK 232 (256)
T ss_pred ccHHHHhhc-CCCeEEEecCCCccCCHHHHHHHHHhC----CCCeEEEeCC
Confidence 333446667 699999999999999998877666554 3457777765
No 43
>COG0412 Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=97.31 E-value=0.00088 Score=41.99 Aligned_cols=45 Identities=18% Similarity=0.107 Sum_probs=41.6
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+. +.|+|+.+|+.|..+|...-..+.++|.++++.+++.+|++
T Consensus 154 ~~~~-~~pvl~~~~~~D~~~p~~~~~~~~~~~~~~~~~~~~~~y~g 198 (236)
T COG0412 154 APKI-KVPVLLHLAGEDPYIPAADVDALAAALEDAGVKVDLEIYPG 198 (236)
T ss_pred cccc-cCcEEEEecccCCCCChhHHHHHHHHHHhcCCCeeEEEeCC
Confidence 4467 59999999999999999999999999999999999999986
No 44
>PRK00175 metX homoserine O-acetyltransferase; Provisional
Probab=97.27 E-value=0.0014 Score=43.14 Aligned_cols=45 Identities=22% Similarity=0.118 Sum_probs=38.9
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+.++ +.|+|+++|..|..+|++.+..+++.+...+..+++.+++
T Consensus 304 ~l~~I-~~PtLvI~G~~D~~~p~~~~~~la~~i~~a~~~~~l~~i~ 348 (379)
T PRK00175 304 ALARI-KARFLVVSFTSDWLFPPARSREIVDALLAAGADVSYAEID 348 (379)
T ss_pred HHhcC-CCCEEEEEECCccccCHHHHHHHHHHHHhcCCCeEEEEeC
Confidence 34567 5899999999999999999999999999988887877664
No 45
>PLN02511 hydrolase
Probab=97.27 E-value=0.00046 Score=45.78 Aligned_cols=53 Identities=8% Similarity=-0.058 Sum_probs=38.9
Q ss_pred HHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 11 EGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 11 ~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+-|+..|+...+.++ +.|+|+++|.+|+.+|...... .+.+..-..++.++++
T Consensus 283 ~yy~~~s~~~~L~~I-~vPtLiI~g~dDpi~p~~~~~~---~~~~~~p~~~l~~~~~ 335 (388)
T PLN02511 283 AYYSNSSSSDSIKHV-RVPLLCIQAANDPIAPARGIPR---EDIKANPNCLLIVTPS 335 (388)
T ss_pred HHHHHcCchhhhccC-CCCeEEEEcCCCCcCCcccCcH---hHHhcCCCEEEEECCC
Confidence 346788888889999 6999999999999999865421 1223344567777765
No 46
>PF12697 Abhydrolase_6: Alpha/beta hydrolase family; PDB: 3LLC_A 3A2N_E 3A2M_A 3A2L_A 3AFI_F 3C5V_A 3C5W_P 3E0X_A 2ZJF_A 3QYJ_A ....
Probab=97.23 E-value=0.0013 Score=38.23 Aligned_cols=48 Identities=25% Similarity=0.184 Sum_probs=36.8
Q ss_pred cChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 16 AALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 16 ~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.++.....++ +.|+++++|..|..++.+.+..+.+.+ ..+++.+++++
T Consensus 166 ~~~~~~~~~~-~~pvl~i~g~~D~~~~~~~~~~~~~~~----~~~~~~~~~~~ 213 (228)
T PF12697_consen 166 ADLSEALPRI-KVPVLVIHGEDDPIVPPESAEELADKL----PNAELVVIPGA 213 (228)
T ss_dssp HHHHHHHHGS-SSEEEEEEETTSSSSHHHHHHHHHHHS----TTEEEEEETTS
T ss_pred cccccccccc-CCCeEEeecCCCCCCCHHHHHHHHHHC----CCCEEEEECCC
Confidence 3444567778 699999999999999977777776554 35788888853
No 47
>TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit. This model represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (TIGR01834) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerize short-chain-length hydroxyalkanoates.
Probab=97.19 E-value=0.00092 Score=43.44 Aligned_cols=43 Identities=14% Similarity=0.033 Sum_probs=34.4
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.++ +.|+|+++|+.|..+|++.+..+.+.+.. ...++.+|++
T Consensus 282 l~~i-~~Pvliv~G~~D~i~~~~~~~~~~~~~~~--~~~~~~~~~~ 324 (350)
T TIGR01836 282 LKNI-KMPILNIYAERDHLVPPDASKALNDLVSS--EDYTELSFPG 324 (350)
T ss_pred HHhC-CCCeEEEecCCCCcCCHHHHHHHHHHcCC--CCeEEEEcCC
Confidence 4456 58999999999999999999999887653 4566777764
No 48
>TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (PFAM model pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity.
Probab=97.18 E-value=0.00098 Score=40.80 Aligned_cols=41 Identities=20% Similarity=0.119 Sum_probs=32.1
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.++ +.|+|+++|+.|..||.+.+..+.+.+. ..++.++++
T Consensus 216 ~~~i-~~P~lii~g~~D~~vp~~~~~~~~~~~~----~~~~~~~~~ 256 (278)
T TIGR03056 216 LPRI-TIPLHLIAGEEDKAVPPDESKRAATRVP----TATLHVVPG 256 (278)
T ss_pred cccC-CCCEEEEEeCCCcccCHHHHHHHHHhcc----CCeEEEECC
Confidence 4456 5899999999999999998888876643 346677765
No 49
>PF02129 Peptidase_S15: X-Pro dipeptidyl-peptidase (S15 family); InterPro: IPR000383 This entry represents a domain found peptidases Xaa-Pro dipeptidyl-peptidase and glutaryl-7-aminocephalosporanic-acid acylase, which belong to MEROPS peptidase families S15 and S45 respectively []. It is also found in hydrolases from the CocE/NonD family. Cocaine esterase (CocE) hydrolyzes cocaine endowing the bacteria with the ability to utilise cocaine as a sole source of carbon and energy []. ; GO: 0004177 aminopeptidase activity, 0006508 proteolysis; PDB: 1LNS_A 3PUI_A 3PUH_B 1JU3_A 3I2I_A 3I2G_A 1JU4_A 3I2K_A 3IDA_A 3I2H_A ....
Probab=97.16 E-value=0.0024 Score=40.20 Aligned_cols=57 Identities=14% Similarity=0.053 Sum_probs=47.5
Q ss_pred CHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcC-CceEEEEecC
Q psy472 9 NLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHAD-IMFQSQECYR 67 (68)
Q Consensus 9 ~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g-~~~~~~~~~~ 67 (68)
..+.+++.++...+.++ +.|+|+++|-.|.... ..+.+.+++|++.+ .+..+++-|.
T Consensus 211 ~~~~w~~~~~~~~~~~i-~vP~l~v~Gw~D~~~~-~~~~~~~~~l~~~~~~~~~Liigpw 268 (272)
T PF02129_consen 211 YDPFWQERSPSERLDKI-DVPVLIVGGWYDTLFL-RGALRAYEALRAPGSKPQRLIIGPW 268 (272)
T ss_dssp SSHHHHTTBHHHHHGG---SEEEEEEETTCSSTS-HHHHHHHHHHCTTSTC-EEEEEESE
T ss_pred cCHHHHhCChHHHHhhC-CCCEEEecccCCcccc-hHHHHHHHHhhcCCCCCCEEEEeCC
Confidence 35678888888888999 6999999999998888 99999999999999 7888888664
No 50
>PLN02679 hydrolase, alpha/beta fold family protein
Probab=97.15 E-value=0.0017 Score=42.46 Aligned_cols=47 Identities=15% Similarity=0.007 Sum_probs=34.9
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHH-HHHHHHHHHcCCceEEEEecC
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQS-MMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s-~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++ +.|+|++||..|..+|+..+ .++.+.|.+.-...+++++++
T Consensus 286 ~~l~~i-~~PtLii~G~~D~~~p~~~~~~~~~~~l~~~ip~~~l~~i~~ 333 (360)
T PLN02679 286 KLIPRI-SLPILVLWGDQDPFTPLDGPVGKYFSSLPSQLPNVTLYVLEG 333 (360)
T ss_pred HHhhhc-CCCEEEEEeCCCCCcCchhhHHHHHHhhhccCCceEEEEcCC
Confidence 345567 59999999999999999754 335555655555678888886
No 51
>TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase. This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. This model has been rebuilt to identify sequences more broadly, including a number of sequences suggested to be homoserine O-acetyltransferase based on proximity to other Met biosynthesis genes.
Probab=97.14 E-value=0.0027 Score=41.22 Aligned_cols=45 Identities=29% Similarity=0.226 Sum_probs=36.5
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
..+.++ +.|+|+++|..|..+|+..+..+++.+......++++..
T Consensus 282 ~~l~~I-~~P~Lvi~G~~D~~~p~~~~~~~a~~i~~~~~~v~~~~i 326 (351)
T TIGR01392 282 EALSRI-KAPFLVVSITSDWLFPPAESRELAKALPAAGLRVTYVEI 326 (351)
T ss_pred HHHhhC-CCCEEEEEeCCccccCHHHHHHHHHHHhhcCCceEEEEe
Confidence 345567 599999999999999999999999999877666554443
No 52
>PLN03087 BODYGUARD 1 domain containing hydrolase; Provisional
Probab=97.13 E-value=0.00096 Score=45.85 Aligned_cols=39 Identities=10% Similarity=0.119 Sum_probs=32.5
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
++ +.|+|++||.+|..+|++.+..|++.+. ..+++++++
T Consensus 416 ~I-~vPtLII~Ge~D~ivP~~~~~~la~~iP----~a~l~vI~~ 454 (481)
T PLN03087 416 QL-KCDVAIFHGGDDELIPVECSYAVKAKVP----RARVKVIDD 454 (481)
T ss_pred hC-CCCEEEEEECCCCCCCHHHHHHHHHhCC----CCEEEEeCC
Confidence 56 5999999999999999999999877763 357777776
No 53
>COG2267 PldB Lysophospholipase [Lipid metabolism]
Probab=97.09 E-value=0.00087 Score=43.26 Aligned_cols=43 Identities=16% Similarity=0.040 Sum_probs=32.0
Q ss_pred hhcCCCCcEEEEEeCCCCccC-HHHHHHHHHHHHHcCCc-eEEEEecCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVH-FQQSMMLAKSLQHADIM-FQSQECYRR 68 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~-~~~s~~l~~~L~~~g~~-~~~~~~~~~ 68 (68)
.... ..|+||+||++|.+|+ .+.+.++.+ +.+.+ .++.+|+|+
T Consensus 224 ~~~~-~~PvLll~g~~D~vv~~~~~~~~~~~---~~~~~~~~~~~~~g~ 268 (298)
T COG2267 224 APAI-ALPVLLLQGGDDRVVDNVEGLARFFE---RAGSPDKELKVIPGA 268 (298)
T ss_pred cccc-cCCEEEEecCCCccccCcHHHHHHHH---hcCCCCceEEecCCc
Confidence 3345 5999999999999999 566666664 45554 588888874
No 54
>PF00561 Abhydrolase_1: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR000073 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents fold-1 of alpha/beta hydrolase.; PDB: 2VAT_E 2VAX_C 2VAV_H 2PSJ_A 2PSH_B 2PSE_A 2PSF_A 2PSD_A 2EDA_A 1CIJ_A ....
Probab=97.06 E-value=0.0022 Score=37.94 Aligned_cols=50 Identities=20% Similarity=0.124 Sum_probs=37.0
Q ss_pred HHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 13 YKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 13 y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+...++.....++ +.|+|+++|+.|..+|+..+..+.+.+.. .+++++++
T Consensus 162 ~~~~~~~~~l~~i-~~p~l~i~~~~D~~~p~~~~~~~~~~~~~----~~~~~~~~ 211 (230)
T PF00561_consen 162 FSVWDPSPALSNI-KVPTLIIWGEDDPLVPPESSEQLAKLIPN----SQLVLIEG 211 (230)
T ss_dssp HHHHHHHHHHTTT-TSEEEEEEETTCSSSHHHHHHHHHHHSTT----EEEEEETT
T ss_pred ccccccccccccc-CCCeEEEEeCCCCCCCHHHHHHHHHhcCC----CEEEECCC
Confidence 3344445556677 69999999999999999999986654333 66777765
No 55
>KOG4667|consensus
Probab=97.04 E-value=0.00074 Score=42.83 Aligned_cols=37 Identities=22% Similarity=0.246 Sum_probs=30.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..|+|-+||..|.+||++++..|++.+.. .++++.+|
T Consensus 199 ~C~VLTvhGs~D~IVPve~AkefAk~i~n----H~L~iIEg 235 (269)
T KOG4667|consen 199 QCRVLTVHGSEDEIVPVEDAKEFAKIIPN----HKLEIIEG 235 (269)
T ss_pred cCceEEEeccCCceeechhHHHHHHhccC----CceEEecC
Confidence 59999999999999999999999987655 44555544
No 56
>PRK06489 hypothetical protein; Provisional
Probab=97.00 E-value=0.0018 Score=42.24 Aligned_cols=43 Identities=9% Similarity=0.021 Sum_probs=33.0
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHH--HHHHHHHHHcCCceEEEEecC
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQS--MMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s--~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++ +.|+|+++|+.|..+|++.+ ..+++.+. ..+++++++
T Consensus 286 ~~L~~I-~~PvLvI~G~~D~~~p~~~~~~~~la~~ip----~a~l~~i~~ 330 (360)
T PRK06489 286 PDLEKI-KAPVLAINSADDERNPPETGVMEAALKRVK----HGRLVLIPA 330 (360)
T ss_pred HHHHhC-CCCEEEEecCCCcccChhhHHHHHHHHhCc----CCeEEEECC
Confidence 345667 59999999999999999876 56666543 357888876
No 57
>PRK07581 hypothetical protein; Validated
Probab=96.98 E-value=0.0024 Score=41.09 Aligned_cols=42 Identities=7% Similarity=-0.038 Sum_probs=33.2
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.++ +.|+|+++|..|..+|+..+..+++.+.. .++.++++
T Consensus 270 ~L~~I-~~PtLvI~G~~D~~~p~~~~~~l~~~ip~----a~l~~i~~ 311 (339)
T PRK07581 270 ALGSI-TAKTFVMPISTDLYFPPEDCEAEAALIPN----AELRPIES 311 (339)
T ss_pred HHhcC-CCCEEEEEeCCCCCCCHHHHHHHHHhCCC----CeEEEeCC
Confidence 45567 59999999999999999999888776532 56777764
No 58
>KOG1454|consensus
Probab=96.97 E-value=0.0024 Score=41.83 Aligned_cols=48 Identities=21% Similarity=0.165 Sum_probs=35.7
Q ss_pred cChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 16 AALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 16 ~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.++...+.+..+.|+|+++|..|..||.+.+..|.+.+ -.+++.+.++
T Consensus 253 ~~~~~~~~~i~~~pvlii~G~~D~~~p~~~~~~~~~~~----pn~~~~~I~~ 300 (326)
T KOG1454|consen 253 ENLLSLIKKIWKCPVLIIWGDKDQIVPLELAEELKKKL----PNAELVEIPG 300 (326)
T ss_pred chHHHhhccccCCceEEEEcCcCCccCHHHHHHHHhhC----CCceEEEeCC
Confidence 34555566664599999999999999999666666554 6667777664
No 59
>PRK10985 putative hydrolase; Provisional
Probab=96.95 E-value=0.0034 Score=40.44 Aligned_cols=54 Identities=13% Similarity=0.077 Sum_probs=39.7
Q ss_pred CHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 9 NLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 9 ~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+-|...++...+.++ ..|+|+++|..|..++.+....+.+. ....++.+.++
T Consensus 238 ~~~~y~~~~~~~~l~~i-~~P~lii~g~~D~~~~~~~~~~~~~~----~~~~~~~~~~~ 291 (324)
T PRK10985 238 AIDYYRQCSALPLLNQI-RKPTLIIHAKDDPFMTHEVIPKPESL----PPNVEYQLTEH 291 (324)
T ss_pred HHHHHHHCChHHHHhCC-CCCEEEEecCCCCCCChhhChHHHHh----CCCeEEEECCC
Confidence 35667788888888888 59999999999999998877665432 23455565554
No 60
>PF10503 Esterase_phd: Esterase PHB depolymerase
Probab=96.89 E-value=0.0013 Score=41.07 Aligned_cols=31 Identities=23% Similarity=0.269 Sum_probs=26.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
.-|+++|||+.|..|.+.++.++.+...+..
T Consensus 169 ~~P~~v~hG~~D~tV~~~n~~~~~~q~~~~~ 199 (220)
T PF10503_consen 169 GYPRIVFHGTADTTVNPQNADQLVAQWLNVH 199 (220)
T ss_pred CCCEEEEecCCCCccCcchHHHHHHHHHHcc
Confidence 3699999999999999999999998876543
No 61
>PRK03592 haloalkane dehalogenase; Provisional
Probab=96.88 E-value=0.0026 Score=39.96 Aligned_cols=42 Identities=10% Similarity=-0.016 Sum_probs=28.6
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..++ +.|+|++||..|..+++....++...+.. ..++.++++
T Consensus 224 l~~i-~~P~lii~G~~D~~~~~~~~~~~~~~~~~---~~~~~~i~~ 265 (295)
T PRK03592 224 LATS-DVPKLLINAEPGAILTTGAIRDWCRSWPN---QLEITVFGA 265 (295)
T ss_pred hccC-CCCeEEEeccCCcccCcHHHHHHHHHhhh---hcceeeccC
Confidence 3456 59999999999999966555565544333 345666654
No 62
>COG0429 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=96.84 E-value=0.0015 Score=43.33 Aligned_cols=41 Identities=20% Similarity=0.147 Sum_probs=35.2
Q ss_pred CCHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHH
Q psy472 8 DNLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMML 49 (68)
Q Consensus 8 ~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l 49 (68)
+..+=|+++|.+..+.++ .+|+||||..+|+.++.+..-..
T Consensus 256 da~dYYr~aSs~~~L~~I-r~PtLii~A~DDP~~~~~~iP~~ 296 (345)
T COG0429 256 DAEDYYRQASSLPLLPKI-RKPTLIINAKDDPFMPPEVIPKL 296 (345)
T ss_pred cHHHHHHhcccccccccc-ccceEEEecCCCCCCChhhCCcc
Confidence 556789999999999999 59999999999999998654443
No 63
>KOG1455|consensus
Probab=96.82 E-value=0.0014 Score=42.89 Aligned_cols=46 Identities=17% Similarity=0.214 Sum_probs=36.0
Q ss_pred hHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.... +.|+|++||++|.+..+.-|..|++....+.+. +.+|||
T Consensus 239 e~~l~~v-tvPflilHG~dD~VTDp~~Sk~Lye~A~S~DKT--lKlYpG 284 (313)
T KOG1455|consen 239 EKNLNEV-TVPFLILHGTDDKVTDPKVSKELYEKASSSDKT--LKLYPG 284 (313)
T ss_pred HHhcccc-cccEEEEecCCCcccCcHHHHHHHHhccCCCCc--eecccc
Confidence 3445556 699999999999999999999999976665554 555665
No 64
>PLN02824 hydrolase, alpha/beta fold family protein
Probab=96.71 E-value=0.0027 Score=39.88 Aligned_cols=41 Identities=12% Similarity=-0.018 Sum_probs=30.9
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.++ +.|+|+++|++|..+|.+.+..+.+. ....+++++++
T Consensus 230 l~~i-~~P~lvi~G~~D~~~~~~~~~~~~~~----~~~~~~~~i~~ 270 (294)
T PLN02824 230 LPAV-KCPVLIAWGEKDPWEPVELGRAYANF----DAVEDFIVLPG 270 (294)
T ss_pred Hhhc-CCCeEEEEecCCCCCChHHHHHHHhc----CCccceEEeCC
Confidence 5567 69999999999999999988775442 22356777765
No 65
>PF03959 FSH1: Serine hydrolase (FSH1); InterPro: IPR005645 This entry represents proteins belonging to the AB hydrolase family. It consists of serine hydrolases of unknown specificity [, ] and includes uncharacterised proteins.; PDB: 1YCD_A.
Probab=96.70 E-value=0.0065 Score=37.22 Aligned_cols=42 Identities=24% Similarity=0.117 Sum_probs=31.2
Q ss_pred hcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 23 DRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 23 ~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
..+ +.|+|-++|.+|..|+.+.+..|++..... .+++.+++|
T Consensus 158 ~~i-~iPtlHv~G~~D~~~~~~~s~~L~~~~~~~---~~v~~h~gG 199 (212)
T PF03959_consen 158 PKI-SIPTLHVIGENDPVVPPERSEALAEMFDPD---ARVIEHDGG 199 (212)
T ss_dssp TT----EEEEEEETT-SSS-HHHHHHHHHHHHHH---EEEEEESSS
T ss_pred ccC-CCCeEEEEeCCCCCcchHHHHHHHHhccCC---cEEEEECCC
Confidence 345 599999999999999999999999998876 666666654
No 66
>PLN02965 Probable pheophorbidase
Probab=96.69 E-value=0.0039 Score=38.51 Aligned_cols=39 Identities=5% Similarity=-0.055 Sum_probs=30.8
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+ +.|+|+++|++|..||+..+..|++.+.. ++++++++
T Consensus 191 ~i-~vP~lvi~g~~D~~~~~~~~~~~~~~~~~----a~~~~i~~ 229 (255)
T PLN02965 191 AE-KVPRVYIKTAKDNLFDPVRQDVMVENWPP----AQTYVLED 229 (255)
T ss_pred cC-CCCEEEEEcCCCCCCCHHHHHHHHHhCCc----ceEEEecC
Confidence 46 69999999999999999988888865443 35666654
No 67
>PF08386 Abhydrolase_4: TAP-like protein; InterPro: IPR013595 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents a C-terminal domain associated with putative hydrolases and bacterial peptidases that belong to MEROPS peptidase family S33 (clan SC). They are related to a tripeptidyl aminopeptidase from Streptomyces lividans (Q54410 from SWISSPROT). A member of this family (Q6E3K7 from SWISSPROT) is thought to be involved in the C-terminal processing of propionicin F, a bacteriocidin characterised from Propionibacterium freudenreichii []. ; GO: 0008233 peptidase activity
Probab=96.65 E-value=0.0047 Score=34.10 Aligned_cols=29 Identities=21% Similarity=0.282 Sum_probs=27.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
..|+|++.|+.|+..|++.+..+++.|..
T Consensus 34 ~~piL~l~~~~Dp~TP~~~a~~~~~~l~~ 62 (103)
T PF08386_consen 34 APPILVLGGTHDPVTPYEGARAMAARLPG 62 (103)
T ss_pred CCCEEEEecCcCCCCcHHHHHHHHHHCCC
Confidence 38999999999999999999999998775
No 68
>PRK14875 acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional
Probab=96.60 E-value=0.0041 Score=39.98 Aligned_cols=39 Identities=10% Similarity=0.083 Sum_probs=29.7
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.++ +.|+|++||..|..||+.++..+.. .+++.++++
T Consensus 309 ~l~~i-~~Pvlii~g~~D~~vp~~~~~~l~~-------~~~~~~~~~ 347 (371)
T PRK14875 309 RLASL-AIPVLVIWGEQDRIIPAAHAQGLPD-------GVAVHVLPG 347 (371)
T ss_pred HHhcC-CCCEEEEEECCCCccCHHHHhhccC-------CCeEEEeCC
Confidence 45567 5999999999999999988765432 356677764
No 69
>KOG1838|consensus
Probab=96.59 E-value=0.0025 Score=43.20 Aligned_cols=37 Identities=30% Similarity=0.256 Sum_probs=33.7
Q ss_pred CCCHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHH
Q psy472 7 EDNLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQ 44 (68)
Q Consensus 7 ~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~ 44 (68)
....+-|+++|+.++++++ +.|+|+++..+|+.||..
T Consensus 303 ~~~deYY~~aSs~~~v~~I-~VP~L~ina~DDPv~p~~ 339 (409)
T KOG1838|consen 303 KSVDEYYKKASSSNYVDKI-KVPLLCINAADDPVVPEE 339 (409)
T ss_pred CcHHHHHhhcchhhhcccc-cccEEEEecCCCCCCCcc
Confidence 3567899999999999999 599999999999999986
No 70
>PRK10673 acyl-CoA esterase; Provisional
Probab=96.57 E-value=0.0066 Score=36.94 Aligned_cols=40 Identities=10% Similarity=-0.034 Sum_probs=30.3
Q ss_pred hcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 23 DRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 23 ~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+ +.|+|+++|+.|..|+.+.+..+++.+. ..++.++++
T Consensus 192 ~~~-~~P~l~i~G~~D~~~~~~~~~~~~~~~~----~~~~~~~~~ 231 (255)
T PRK10673 192 PAW-PHPALFIRGGNSPYVTEAYRDDLLAQFP----QARAHVIAG 231 (255)
T ss_pred CCC-CCCeEEEECCCCCCCCHHHHHHHHHhCC----CcEEEEeCC
Confidence 345 5899999999999999988888776543 445666665
No 71
>PF05705 DUF829: Eukaryotic protein of unknown function (DUF829); InterPro: IPR008547 This signature identifies Transmembrane protein 53, that have no known function but are predicted to be integral membrane proteins.
Probab=96.56 E-value=0.013 Score=36.24 Aligned_cols=41 Identities=7% Similarity=0.017 Sum_probs=38.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..|-|.+.+++|+.|+++..+.+++..++.|.+++...|.+
T Consensus 178 ~~p~lylYS~~D~l~~~~~ve~~~~~~~~~G~~V~~~~f~~ 218 (240)
T PF05705_consen 178 RCPRLYLYSKADPLIPWRDVEEHAEEARRKGWDVRAEKFED 218 (240)
T ss_pred CCCeEEecCCCCcCcCHHHHHHHHHHHHHcCCeEEEecCCC
Confidence 48999999999999999999999999999999999988865
No 72
>PRK00870 haloalkane dehalogenase; Provisional
Probab=96.54 E-value=0.0037 Score=39.52 Aligned_cols=43 Identities=19% Similarity=0.136 Sum_probs=30.1
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.++ +.|++++||++|..+|... ..+.+.+.... ..++.++++
T Consensus 235 l~~i-~~P~lii~G~~D~~~~~~~-~~~~~~~~~~~-~~~~~~i~~ 277 (302)
T PRK00870 235 LERW-DKPFLTAFSDSDPITGGGD-AILQKRIPGAA-GQPHPTIKG 277 (302)
T ss_pred hhcC-CCceEEEecCCCCcccCch-HHHHhhccccc-ccceeeecC
Confidence 4567 6999999999999999866 77776665331 123445554
No 73
>PLN02578 hydrolase
Probab=96.54 E-value=0.0078 Score=39.23 Aligned_cols=33 Identities=15% Similarity=0.186 Sum_probs=27.7
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
.+.++ +.|+|++||+.|..|+...+..+.+.+.
T Consensus 291 ~l~~i-~~PvLiI~G~~D~~v~~~~~~~l~~~~p 323 (354)
T PLN02578 291 LLSKL-SCPLLLLWGDLDPWVGPAKAEKIKAFYP 323 (354)
T ss_pred HhhcC-CCCEEEEEeCCCCCCCHHHHHHHHHhCC
Confidence 34567 6999999999999999999888877653
No 74
>PLN02872 triacylglycerol lipase
Probab=96.46 E-value=0.0057 Score=41.06 Aligned_cols=57 Identities=12% Similarity=-0.029 Sum_probs=41.5
Q ss_pred CCHHHHHhcCh-hHHhhcCC-CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 8 DNLEGYKIAAL-NNKVDRIR-DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 8 ~~~~~y~~~sp-~~~~~~~~-~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+|...|-...| ...+++++ +.|+++++|+.|..+++..+..+.+.|... .++..+++
T Consensus 304 ~n~~~Yg~~~pP~Y~l~~i~~~~Pv~i~~G~~D~lv~~~dv~~l~~~Lp~~---~~l~~l~~ 362 (395)
T PLN02872 304 KNLKLYGQVNPPAFDLSLIPKSLPLWMGYGGTDGLADVTDVEHTLAELPSK---PELLYLEN 362 (395)
T ss_pred hhHHHhCCCCCCCcCcccCCCCccEEEEEcCCCCCCCHHHHHHHHHHCCCc---cEEEEcCC
Confidence 46667766664 44566662 379999999999999999999999887642 35555554
No 75
>TIGR01250 pro_imino_pep_2 proline-specific peptidases, Bacillus coagulans-type subfamily. This model describes a subfamily of the alpha/beta fold family of hydrolases. Characterized members include prolinases (Pro-Xaa dipeptidase, EC 3.4.13.8), prolyl aminopeptidases (EC 3.4.11.5), and a leucyl aminopeptidase
Probab=96.42 E-value=0.013 Score=35.50 Aligned_cols=42 Identities=14% Similarity=0.112 Sum_probs=29.8
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++ +.|+|+++|+.|. +++..+..+.+.+. ..++.++++
T Consensus 225 ~~l~~i-~~P~lii~G~~D~-~~~~~~~~~~~~~~----~~~~~~~~~ 266 (288)
T TIGR01250 225 DKLSEI-KVPTLLTVGEFDT-MTPEAAREMQELIA----GSRLVVFPD 266 (288)
T ss_pred HHhhcc-CCCEEEEecCCCc-cCHHHHHHHHHhcc----CCeEEEeCC
Confidence 345567 5999999999998 56777777766543 346667764
No 76
>PRK08775 homoserine O-acetyltransferase; Provisional
Probab=96.25 E-value=0.009 Score=38.69 Aligned_cols=41 Identities=17% Similarity=0.009 Sum_probs=32.5
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
+.++ +.|+|+++|..|..+|+..+..+++++.. ..++.+++
T Consensus 273 l~~I-~~PtLvi~G~~D~~~p~~~~~~~~~~i~p---~a~l~~i~ 313 (343)
T PRK08775 273 PEAI-RVPTVVVAVEGDRLVPLADLVELAEGLGP---RGSLRVLR 313 (343)
T ss_pred hhcC-CCCeEEEEeCCCEeeCHHHHHHHHHHcCC---CCeEEEEe
Confidence 3456 58999999999999999999888876632 35677775
No 77
>PF11144 DUF2920: Protein of unknown function (DUF2920); InterPro: IPR022605 This bacterial family of proteins has no known function.
Probab=95.99 E-value=0.027 Score=38.27 Aligned_cols=41 Identities=15% Similarity=0.038 Sum_probs=37.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
++-.+..|+..|+.+|.++-.+|++.|++.|.++++++..+
T Consensus 293 ~~~yvsYHs~~D~~~p~~~K~~l~~~l~~lgfda~l~lIkd 333 (403)
T PF11144_consen 293 KIIYVSYHSIKDDLAPAEDKEELYEILKNLGFDATLHLIKD 333 (403)
T ss_pred ceEEEEEeccCCCCCCHHHHHHHHHHHHHcCCCeEEEEecC
Confidence 46778899999999999999999999999999999998743
No 78
>PRK10439 enterobactin/ferric enterobactin esterase; Provisional
Probab=95.87 E-value=0.024 Score=38.25 Aligned_cols=40 Identities=13% Similarity=0.075 Sum_probs=35.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.+++|-+|+.|..+ +....+|.+.|+++|+++++.+|+++
T Consensus 350 lr~~i~~G~~E~~~-~~~~~~l~~~L~~~G~~~~~~~~~GG 389 (411)
T PRK10439 350 LRIVLEAGRREPMI-MRANQALYAQLHPAGHSVFWRQVDGG 389 (411)
T ss_pred ceEEEeCCCCCchH-HHHHHHHHHHHHHCCCcEEEEECCCC
Confidence 47899999998654 57789999999999999999999986
No 79
>PF07519 Tannase: Tannase and feruloyl esterase; InterPro: IPR011118 This family includes fungal tannase [] and feruloyl esterase [, ]. It also includes several bacterial homologues of unknown function.
Probab=95.86 E-value=0.01 Score=40.73 Aligned_cols=41 Identities=12% Similarity=0.036 Sum_probs=33.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC--------ceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI--------MFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~--------~~~~~~~~~ 67 (68)
..++|++||..|..||+..+..+++++.+.-- -+++.+.||
T Consensus 353 GGKLI~~HG~aD~~I~p~~ti~YY~~V~~~~g~~~~~v~dF~RlF~vPG 401 (474)
T PF07519_consen 353 GGKLILYHGWADPLIPPQGTIDYYERVVARMGGALADVDDFYRLFMVPG 401 (474)
T ss_pred CCeEEEEecCCCCccCCCcHHHHHHHHHHhcccccccccceeEEEecCC
Confidence 58999999999999999999999999877653 245555554
No 80
>PRK03204 haloalkane dehalogenase; Provisional
Probab=95.74 E-value=0.024 Score=35.87 Aligned_cols=37 Identities=22% Similarity=0.052 Sum_probs=26.2
Q ss_pred CCcEEEEEeCCCCccCHH-HHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQ-QSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~-~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.|+|+++|..|..+++. ....+.+.+. ..+++++++
T Consensus 227 ~~PtliI~G~~D~~~~~~~~~~~~~~~ip----~~~~~~i~~ 264 (286)
T PRK03204 227 TKPTLLVWGMKDVAFRPKTILPRLRATFP----DHVLVELPN 264 (286)
T ss_pred CCCeEEEecCCCcccCcHHHHHHHHHhcC----CCeEEEcCC
Confidence 499999999999998765 4555555443 346666665
No 81
>PLN03084 alpha/beta hydrolase fold protein; Provisional
Probab=95.71 E-value=0.033 Score=37.24 Aligned_cols=38 Identities=13% Similarity=0.036 Sum_probs=30.3
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
++ +.|+|+++|+.|..++.+.+..+++.. ..++.++++
T Consensus 323 ~i-~vPvLiI~G~~D~~v~~~~~~~~a~~~-----~a~l~vIp~ 360 (383)
T PLN03084 323 NW-KTPITVCWGLRDRWLNYDGVEDFCKSS-----QHKLIELPM 360 (383)
T ss_pred cC-CCCEEEEeeCCCCCcCHHHHHHHHHhc-----CCeEEEECC
Confidence 35 589999999999999998887777642 457777765
No 82
>PLN02894 hydrolase, alpha/beta fold family protein
Probab=95.69 E-value=0.039 Score=36.89 Aligned_cols=43 Identities=7% Similarity=-0.019 Sum_probs=29.9
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.+..+ +.|+++++|..|..++ ..+..+.+.+ +..+++++++++
T Consensus 320 ~l~~I-~vP~liI~G~~D~i~~-~~~~~~~~~~---~~~~~~~~i~~a 362 (402)
T PLN02894 320 SASEW-KVPTTFIYGRHDWMNY-EGAVEARKRM---KVPCEIIRVPQG 362 (402)
T ss_pred hcccC-CCCEEEEEeCCCCCCc-HHHHHHHHHc---CCCCcEEEeCCC
Confidence 35667 5999999999998665 5555554432 445778888763
No 83
>TIGR03695 menH_SHCHC 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase. This protein catalyzes the formation of SHCHC, or (1 R,6 R)-2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate, by elmination of pyruvate from 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate (SEPHCHC). Note that SHCHC synthase activity previously was attributed to MenD, which in fact is SEPHCHC synthase.
Probab=95.68 E-value=0.04 Score=32.38 Aligned_cols=42 Identities=19% Similarity=0.122 Sum_probs=29.0
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++ +.|+|+++|..|..++ .. .+.+.+..-..++.++++
T Consensus 188 ~~~~~~-~~P~l~i~g~~D~~~~-~~----~~~~~~~~~~~~~~~~~~ 229 (251)
T TIGR03695 188 PKLQAL-TIPVLYLCGEKDEKFV-QI----AKEMQKLLPNLTLVIIAN 229 (251)
T ss_pred HHhhCC-CCceEEEeeCcchHHH-HH----HHHHHhcCCCCcEEEEcC
Confidence 345567 5999999999998763 33 334455555678888876
No 84
>PLN02211 methyl indole-3-acetate methyltransferase
Probab=95.40 E-value=0.039 Score=34.82 Aligned_cols=28 Identities=14% Similarity=0.094 Sum_probs=25.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
+.|+++|+|++|..+|++....|++.+.
T Consensus 211 ~vP~l~I~g~~D~~ip~~~~~~m~~~~~ 238 (273)
T PLN02211 211 KVPRVYIKTLHDHVVKPEQQEAMIKRWP 238 (273)
T ss_pred ccceEEEEeCCCCCCCHHHHHHHHHhCC
Confidence 4799999999999999999999997754
No 85
>KOG4391|consensus
Probab=95.27 E-value=0.021 Score=36.56 Aligned_cols=44 Identities=14% Similarity=0.076 Sum_probs=32.3
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+..- +.|+|++.|.+|..||+-+..+|+...-.+.+ ++..||++
T Consensus 217 i~~~-~~P~LFiSGlkDelVPP~~Mr~Ly~~c~S~~K--rl~eFP~g 260 (300)
T KOG4391|consen 217 IGQC-RMPFLFISGLKDELVPPVMMRQLYELCPSRTK--RLAEFPDG 260 (300)
T ss_pred hccc-cCceEEeecCccccCCcHHHHHHHHhCchhhh--hheeCCCC
Confidence 3344 58999999999999999999999976433332 35556653
No 86
>KOG3043|consensus
Probab=95.11 E-value=0.098 Score=33.29 Aligned_cols=47 Identities=11% Similarity=0.156 Sum_probs=39.9
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC-ceEEEEecC
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI-MFQSQECYR 67 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~-~~~~~~~~~ 67 (68)
..+.+. +.|+|++-|..|..+|+.....+-++|+++-. .+++.+|++
T Consensus 158 ~D~~~v-k~Pilfl~ae~D~~~p~~~v~~~ee~lk~~~~~~~~v~~f~g 205 (242)
T KOG3043|consen 158 ADIANV-KAPILFLFAELDEDVPPKDVKAWEEKLKENPAVGSQVKTFSG 205 (242)
T ss_pred hHHhcC-CCCEEEEeecccccCCHHHHHHHHHHHhcCcccceeEEEcCC
Confidence 345667 59999999999999999999999999988876 458888876
No 87
>PRK05855 short chain dehydrogenase; Validated
Probab=94.98 E-value=0.024 Score=38.44 Aligned_cols=37 Identities=19% Similarity=0.007 Sum_probs=28.0
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 25 IRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 25 ~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
. +.|+|++||.+|..||..+...+++.+. ..++++++
T Consensus 232 ~-~~P~lii~G~~D~~v~~~~~~~~~~~~~----~~~~~~~~ 268 (582)
T PRK05855 232 T-DVPVQLIVPTGDPYVRPALYDDLSRWVP----RLWRREIK 268 (582)
T ss_pred c-cCceEEEEeCCCcccCHHHhccccccCC----cceEEEcc
Confidence 5 5899999999999999998888765432 34455544
No 88
>PRK07868 acyl-CoA synthetase; Validated
Probab=94.94 E-value=0.07 Score=39.42 Aligned_cols=31 Identities=16% Similarity=0.103 Sum_probs=27.3
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSL 53 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L 53 (68)
+.++ +.|+|+++|..|..+|++.+..+.+.+
T Consensus 293 L~~i-~~P~L~i~G~~D~ivp~~~~~~l~~~i 323 (994)
T PRK07868 293 LADI-TCPVLAFVGEVDDIGQPASVRGIRRAA 323 (994)
T ss_pred hhhC-CCCEEEEEeCCCCCCCHHHHHHHHHhC
Confidence 4567 589999999999999999999997765
No 89
>PF10605 3HBOH: 3HB-oligomer hydrolase (3HBOH) ; InterPro: IPR016582 This entry represents a group of predicted D-(-)-3-hydroxybutyrate oligomer hydrolases (also known as 3HB-oligomer hydrolase), which function in the degradation of poly-3-hydroxybutyrate (PHB). These enzymes catalyse the hydrolysis of D(-)-3-hydroxybutyrate oligomers (3HB-oligomers) into 3HB-monomers [, ].; GO: 0047989 hydroxybutyrate-dimer hydrolase activity, 0019605 butyrate metabolic process, 0005615 extracellular space
Probab=94.64 E-value=0.033 Score=39.77 Aligned_cols=40 Identities=15% Similarity=0.153 Sum_probs=31.0
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHHc--CCceEEEE
Q psy472 25 IRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHA--DIMFQSQE 64 (68)
Q Consensus 25 ~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~--g~~~~~~~ 64 (68)
++.+|.+++||..|..+|+.++-+-+-.|.+. |...++.+
T Consensus 553 L~GKPaIiVhGR~DaLlPvnh~Sr~Y~~ln~~~eG~~s~lrY 594 (690)
T PF10605_consen 553 LHGKPAIIVHGRSDALLPVNHTSRPYLGLNRQVEGRASRLRY 594 (690)
T ss_pred cCCCceEEEecccceecccCCCchHHHHHhhhhcccccceeE
Confidence 44689999999999999999999988887753 55333333
No 90
>TIGR01839 PHA_synth_II poly(R)-hydroxyalkanoic acid synthase, class II. This model represents the class II subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs, typically with six to fourteen carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=94.57 E-value=0.11 Score=36.71 Aligned_cols=43 Identities=14% Similarity=0.001 Sum_probs=35.8
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
++++ +.|+|.+-|..|.+||+..+..+.+.+ |-+.+++.+++|
T Consensus 437 L~~I-~~Pvl~va~~~DHIvPw~s~~~~~~l~---gs~~~fvl~~gG 479 (560)
T TIGR01839 437 LKKV-KCDSFSVAGTNDHITPWDAVYRSALLL---GGKRRFVLSNSG 479 (560)
T ss_pred hhcC-CCCeEEEecCcCCcCCHHHHHHHHHHc---CCCeEEEecCCC
Confidence 5667 699999999999999999999998755 336888887765
No 91
>TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I. This model represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=94.52 E-value=0.098 Score=36.65 Aligned_cols=33 Identities=12% Similarity=0.137 Sum_probs=27.6
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
.+..+ +.|+|+++|..|.+||++.+..+.+.+.
T Consensus 410 dL~~I-~vPvLvV~G~~D~IvP~~sa~~l~~~i~ 442 (532)
T TIGR01838 410 DLSKV-KVPVYIIATREDHIAPWQSAYRGAALLG 442 (532)
T ss_pred chhhC-CCCEEEEeeCCCCcCCHHHHHHHHHHCC
Confidence 35567 5999999999999999999998876543
No 92
>PRK11126 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase; Provisional
Probab=94.45 E-value=0.13 Score=31.00 Aligned_cols=36 Identities=11% Similarity=0.217 Sum_probs=24.8
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
...++ +.|++++||..|..+. .+++. ...+++++++
T Consensus 183 ~l~~i-~~P~lii~G~~D~~~~-----~~~~~-----~~~~~~~i~~ 218 (242)
T PRK11126 183 ALQAL-TFPFYYLCGERDSKFQ-----ALAQQ-----LALPLHVIPN 218 (242)
T ss_pred Hhhcc-CCCeEEEEeCCcchHH-----HHHHH-----hcCeEEEeCC
Confidence 45567 5999999999998552 23322 2577888876
No 93
>KOG1515|consensus
Probab=94.03 E-value=0.18 Score=33.50 Aligned_cols=39 Identities=15% Similarity=0.096 Sum_probs=35.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+|+|++-...|... .+...++++|++.|+.+++..|.++
T Consensus 269 p~tlv~~ag~D~L~--D~~~~Y~~~Lkk~Gv~v~~~~~e~~ 307 (336)
T KOG1515|consen 269 PPTLVVVAGYDVLR--DEGLAYAEKLKKAGVEVTLIHYEDG 307 (336)
T ss_pred CceEEEEeCchhhh--hhhHHHHHHHHHcCCeEEEEEECCC
Confidence 78999999999887 8999999999999999998888775
No 94
>PF05448 AXE1: Acetyl xylan esterase (AXE1); InterPro: IPR008391 This family consists of several bacterial acetyl xylan esterase proteins. Acetyl xylan esterases are enzymes that hydrolyse the ester linkages of the acetyl groups in position 2 and/or 3 of the xylose moieties of natural acetylated xylan from hardwood. These enzymes are one of the accessory enzymes which are part of the xylanolytic system, together with xylanases, beta-xylosidases, alpha-arabinofuranosidases and methylglucuronidases; these are all required for the complete hydrolysis of xylan [].; PDB: 1VLQ_H 3M81_E 3M82_D 3M83_C 3FCY_A 1ODS_F 1ODT_C 1L7A_A 3FYT_A 2XLB_F ....
Probab=93.52 E-value=0.22 Score=32.63 Aligned_cols=50 Identities=12% Similarity=-0.055 Sum_probs=35.7
Q ss_pred HhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 14 KIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 14 ~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.-.+..+.+..+ +.|+++.-|..|+++|+.-....+.++.. +-++.+||.
T Consensus 250 ~Y~D~~nfA~ri-~~pvl~~~gl~D~~cPP~t~fA~yN~i~~---~K~l~vyp~ 299 (320)
T PF05448_consen 250 SYFDAVNFARRI-KCPVLFSVGLQDPVCPPSTQFAAYNAIPG---PKELVVYPE 299 (320)
T ss_dssp HTT-HHHHGGG---SEEEEEEETT-SSS-HHHHHHHHCC--S---SEEEEEETT
T ss_pred hhhhHHHHHHHc-CCCEEEEEecCCCCCCchhHHHHHhccCC---CeeEEeccC
Confidence 344567778889 59999999999999999999999988753 368888875
No 95
>PF06821 Ser_hydrolase: Serine hydrolase; InterPro: IPR010662 This family contains a number of hypothetical bacterial proteins of unknown function, which may be cytosolic. The Crystal Structure Of The Yden Gene Product Swiss:P96671 from B. Subtilis has been solved. The structure shows an alpha-beta hydrolase fold suggesting an enzymatic function for these proteins [].; GO: 0016787 hydrolase activity; PDB: 3BDV_B 2QS9_A 1UXO_A.
Probab=93.34 E-value=0.17 Score=30.36 Aligned_cols=27 Identities=22% Similarity=0.297 Sum_probs=23.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
.|.+++.+++|+.||+..+..++++|.
T Consensus 115 ~~~~viaS~nDp~vp~~~a~~~A~~l~ 141 (171)
T PF06821_consen 115 FPSIVIASDNDPYVPFERAQRLAQRLG 141 (171)
T ss_dssp CCEEEEEETTBSSS-HHHHHHHHHHHT
T ss_pred CCeEEEEcCCCCccCHHHHHHHHHHcC
Confidence 566999999999999999999999874
No 96
>COG1647 Esterase/lipase [General function prediction only]
Probab=93.02 E-value=0.15 Score=32.51 Aligned_cols=45 Identities=18% Similarity=0.236 Sum_probs=34.6
Q ss_pred hHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
...++.+ ..|+|+++|.+|+.||.+.+..+++....... ++.+|.
T Consensus 174 ~~~~~~I-~~pt~vvq~~~D~mv~~~sA~~Iy~~v~s~~K--eL~~~e 218 (243)
T COG1647 174 RRSLDKI-YSPTLVVQGRQDEMVPAESANFIYDHVESDDK--ELKWLE 218 (243)
T ss_pred Hhhhhhc-ccchhheecccCCCCCHHHHHHHHHhccCCcc--eeEEEc
Confidence 4456677 59999999999999999999999987654433 444443
No 97
>KOG2551|consensus
Probab=92.91 E-value=0.17 Score=32.02 Aligned_cols=34 Identities=26% Similarity=0.289 Sum_probs=29.8
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHc
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHA 56 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~ 56 (68)
...+ +.|.|-+.|+.|++||..-|..|++...+.
T Consensus 159 ~~~i-~~PSLHi~G~~D~iv~~~~s~~L~~~~~~a 192 (230)
T KOG2551|consen 159 KRPL-STPSLHIFGETDTIVPSERSEQLAESFKDA 192 (230)
T ss_pred ccCC-CCCeeEEecccceeecchHHHHHHHhcCCC
Confidence 3346 699999999999999999999999987766
No 98
>PF10142 PhoPQ_related: PhoPQ-activated pathogenicity-related protein; InterPro: IPR009199 Proteins in this entry are believed to play a role in virulence/pathogenicity in Salmonella. Salmonella typhi PqaA has been shown to be activated by PhoP/Q two-component regulatory system, which regulates many virulence genes []. It has been also shown to confer resistance to antimicrobial peptides (melittin) []. Members of this family are predicted to belong to the alpha/beta hydrolase domain superfamily.
Probab=92.74 E-value=0.15 Score=34.22 Aligned_cols=55 Identities=13% Similarity=-0.046 Sum_probs=45.5
Q ss_pred CHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 9 NLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 9 ~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+..+..+|+.+.+++ +-|-+++-|++|+--.+..+.-+.++|.. +-.+.+.||
T Consensus 245 f~~L~~ivDP~~Y~~rL-~~PK~ii~atgDeFf~pD~~~~y~d~L~G---~K~lr~vPN 299 (367)
T PF10142_consen 245 FDKLMQIVDPYSYRDRL-TMPKYIINATGDEFFVPDSSNFYYDKLPG---EKYLRYVPN 299 (367)
T ss_pred HHHHHHhcCHHHHHHhc-CccEEEEecCCCceeccCchHHHHhhCCC---CeeEEeCCC
Confidence 34566788999999999 79999999999999999999999998875 334555554
No 99
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=92.67 E-value=0.27 Score=35.44 Aligned_cols=58 Identities=19% Similarity=0.176 Sum_probs=46.4
Q ss_pred CCCCcCCCHHHH---HhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEE
Q psy472 2 GLPTFEDNLEGY---KIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 2 G~p~~~~~~~~y---~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
|.| . .++.| +.-||..++....-+++|+.-|..|+.|.+-+-.+-+.+|+..+.....
T Consensus 584 GNP--~-d~e~y~yikSYSPYdNV~a~~YP~ilv~~Gl~D~rV~YwEpAKWvAkLR~~~td~~p 644 (682)
T COG1770 584 GNP--L-DPEYYDYIKSYSPYDNVEAQPYPAILVTTGLNDPRVQYWEPAKWVAKLRELKTDGNP 644 (682)
T ss_pred CCc--C-CHHHHHHHhhcCchhccccCCCCceEEEccccCCccccchHHHHHHHHhhcccCCCc
Confidence 566 4 45544 3458988888764599999999999999999999999999998876533
No 100
>PLN02980 2-oxoglutarate decarboxylase/ hydro-lyase/ magnesium ion binding / thiamin pyrophosphate binding
Probab=92.56 E-value=0.35 Score=37.94 Aligned_cols=45 Identities=18% Similarity=0.235 Sum_probs=32.8
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcC--------CceEEEEecC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHAD--------IMFQSQECYR 67 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g--------~~~~~~~~~~ 67 (68)
.+.++ +.|+|+++|..|..++ ..+..+.+.+.+.. ..++++++++
T Consensus 1563 ~L~~I-~~PtLlI~Ge~D~~~~-~~a~~~~~~i~~a~~~~~~~~~~~a~lvvI~~ 1615 (1655)
T PLN02980 1563 DLKQC-DTPLLLVVGEKDVKFK-QIAQKMYREIGKSKESGNDKGKEIIEIVEIPN 1615 (1655)
T ss_pred HHhhC-CCCEEEEEECCCCccH-HHHHHHHHHccccccccccccccceEEEEECC
Confidence 35567 5999999999999875 67777877765431 1258888876
No 101
>COG0596 MhpC Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=92.55 E-value=0.35 Score=27.92 Aligned_cols=42 Identities=24% Similarity=0.184 Sum_probs=27.3
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.... +.|+++++|..|..+|......+...+.. ..++.++++
T Consensus 217 ~~~~-~~P~l~i~g~~d~~~~~~~~~~~~~~~~~---~~~~~~~~~ 258 (282)
T COG0596 217 LARI-TVPTLIIHGEDDPVVPAELARRLAAALPN---DARLVVIPG 258 (282)
T ss_pred hccC-CCCeEEEecCCCCcCCHHHHHHHHhhCCC---CceEEEeCC
Confidence 3344 59999999999977777664444444332 456666554
No 102
>TIGR01849 PHB_depoly_PhaZ polyhydroxyalkanoate depolymerase, intracellular. This model represents an intracellular depolymerase for polyhydroxyalkanoate (PHA), a carbon and energy storing polyester that accumulates in granules in many bacterial species when carbon sources are abundant but other nutrients are limiting. This family is named for PHAs generally, rather than polyhydroxybutyrate (PHB) specificially as in Ralstonia eutropha H16, to avoid overcalling chemical specificity in other species. Note that this family lacks the classic GXSXG lipase motif and instead shows weak similarity to some
Probab=90.78 E-value=0.47 Score=32.33 Aligned_cols=32 Identities=22% Similarity=0.149 Sum_probs=27.1
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSL 53 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L 53 (68)
.+++++.|+|.+-|..|.+||+.|+..+.+..
T Consensus 333 l~~I~~~pll~V~ge~D~I~p~~qt~aa~~l~ 364 (406)
T TIGR01849 333 PGAITRVALLTVEGENDDISGLGQTKAALRLC 364 (406)
T ss_pred HHHCcccceEEEeccCCCcCCHHHhHHHHHHh
Confidence 34562399999999999999999999988864
No 103
>COG2945 Predicted hydrolase of the alpha/beta superfamily [General function prediction only]
Probab=90.34 E-value=0.37 Score=30.11 Aligned_cols=25 Identities=24% Similarity=0.349 Sum_probs=21.7
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKS 52 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~ 52 (68)
.|.+++||+.|++|.+......++.
T Consensus 150 ~~~lvi~g~~Ddvv~l~~~l~~~~~ 174 (210)
T COG2945 150 SPGLVIQGDADDVVDLVAVLKWQES 174 (210)
T ss_pred CCceeEecChhhhhcHHHHHHhhcC
Confidence 7899999999999999888876654
No 104
>KOG2237|consensus
Probab=89.94 E-value=0.28 Score=35.44 Aligned_cols=48 Identities=23% Similarity=0.215 Sum_probs=35.7
Q ss_pred HHHHhcChhHHhhcCC----CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 11 EGYKIAALNNKVDRIR----DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 11 ~~y~~~sp~~~~~~~~----~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
+-+-..||.+.+++.+ -+.+|+..+..|++|.+-++..+..+|+..-.
T Consensus 612 ~~~~~i~~y~pv~~i~~q~~YPS~lvtta~hD~RV~~~~~~K~vAklre~~~ 663 (712)
T KOG2237|consen 612 EDLIKISPYSPVDNIKKQVQYPSMLVTTADHDDRVGPLESLKWVAKLREATC 663 (712)
T ss_pred hhhheecccCccCCCchhccCcceEEeeccCCCcccccchHHHHHHHHHHhh
Confidence 3444444544444432 27899999999999999999999999997653
No 105
>KOG2984|consensus
Probab=89.81 E-value=0.26 Score=31.38 Aligned_cols=41 Identities=10% Similarity=0.079 Sum_probs=27.8
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+. +.|+||+||..|+.|+-.|.--+-.. -...+++++|.
T Consensus 212 lp~v-kcPtli~hG~kDp~~~~~hv~fi~~~----~~~a~~~~~pe 252 (277)
T KOG2984|consen 212 LPQV-KCPTLIMHGGKDPFCGDPHVCFIPVL----KSLAKVEIHPE 252 (277)
T ss_pred cccc-cCCeeEeeCCcCCCCCCCCccchhhh----cccceEEEccC
Confidence 4456 59999999999999998876554332 22344555544
No 106
>PF05728 UPF0227: Uncharacterised protein family (UPF0227); InterPro: IPR008886 Despite being classed as uncharacterised proteins, the members of this family are almost certainly enzymes in that they contain a domain distantly related to IPR000073 from INTERPRO. One of the members of this family YqiA has been shown to be a esterase []. Other members, which include the Escherichia coli (strain K12) YcfP protein are uncharacterised.
Probab=89.73 E-value=0.48 Score=28.91 Aligned_cols=24 Identities=17% Similarity=0.422 Sum_probs=20.8
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAK 51 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~ 51 (68)
.++++++|++|++++++.+....+
T Consensus 135 ~~~lvll~~~DEvLd~~~a~~~~~ 158 (187)
T PF05728_consen 135 ERYLVLLQTGDEVLDYREAVAKYR 158 (187)
T ss_pred ccEEEEEecCCcccCHHHHHHHhc
Confidence 699999999999999987766554
No 107
>KOG2382|consensus
Probab=89.43 E-value=0.81 Score=30.33 Aligned_cols=39 Identities=13% Similarity=0.098 Sum_probs=31.4
Q ss_pred hcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 23 DRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 23 ~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
... ..|+|+++|+++..|+.++-.+|.+.+.. ++++.++
T Consensus 250 ~~~-~~pvlfi~g~~S~fv~~~~~~~~~~~fp~----~e~~~ld 288 (315)
T KOG2382|consen 250 GPY-TGPVLFIKGLQSKFVPDEHYPRMEKIFPN----VEVHELD 288 (315)
T ss_pred ccc-ccceeEEecCCCCCcChhHHHHHHHhccc----hheeecc
Confidence 344 69999999999999999999888876555 6666655
No 108
>COG3545 Predicted esterase of the alpha/beta hydrolase fold [General function prediction only]
Probab=88.44 E-value=1.1 Score=27.58 Aligned_cols=43 Identities=12% Similarity=0.071 Sum_probs=31.8
Q ss_pred HHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 10 LEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 10 ~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
+......+|+.. ..+ --|.++++..+|+.|++++++.++++.-
T Consensus 102 ~~~~~tf~~~p~-~~l-pfps~vvaSrnDp~~~~~~a~~~a~~wg 144 (181)
T COG3545 102 PKHLMTFDPIPR-EPL-PFPSVVVASRNDPYVSYEHAEDLANAWG 144 (181)
T ss_pred hhhccccCCCcc-ccC-CCceeEEEecCCCCCCHHHHHHHHHhcc
Confidence 334444555553 344 2699999999999999999999998753
No 109
>COG2021 MET2 Homoserine acetyltransferase [Amino acid transport and metabolism]
Probab=88.34 E-value=1.7 Score=29.51 Aligned_cols=42 Identities=26% Similarity=0.164 Sum_probs=37.3
Q ss_pred cChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 16 AALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 16 ~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
-++.....++ +.|+|++-=+.|...|+++...++++|...+.
T Consensus 296 ~~l~~al~~i-~~~~lv~gi~sD~lfp~~~~~~~~~~L~~~~~ 337 (368)
T COG2021 296 GDLTAALARI-KAPVLVVGITSDWLFPPELQRALAEALPAAGA 337 (368)
T ss_pred CcHHHHHhcC-ccCEEEEEecccccCCHHHHHHHHHhccccCc
Confidence 3555667788 59999999999999999999999999999997
No 110
>PLN02606 palmitoyl-protein thioesterase
Probab=88.32 E-value=1.9 Score=28.48 Aligned_cols=37 Identities=16% Similarity=0.102 Sum_probs=25.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHH-cCCceEEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQH-ADIMFQSQE 64 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~-~g~~~~~~~ 64 (68)
..|++||||..|..-.. ....+.+.+.+ .+.+...+.
T Consensus 26 ~~PvViwHGlgD~~~~~-~~~~~~~~i~~~~~~pg~~v~ 63 (306)
T PLN02606 26 SVPFVLFHGFGGECSNG-KVSNLTQFLINHSGYPGTCVE 63 (306)
T ss_pred CCCEEEECCCCcccCCc-hHHHHHHHHHhCCCCCeEEEE
Confidence 48999999999987765 45555555543 377665554
No 111
>PF00756 Esterase: Putative esterase; InterPro: IPR000801 This family contains several seemingly unrelated proteins, including human esterase D; mycobacterial antigen 85, which is responsible for the high affinity of mycobacteria to fibronectin; Corynebacterium glutamicum major secreted protein PS1; and hypothetical proteins from Escherichia coli, yeast, mycobacteria and Haemophilus influenzae.; PDB: 3LS2_A 1VA5_B 1DQZ_B 3HRH_A 1DQY_A 2GZR_A 2GZS_A 3GFF_A 1R88_A 3E4D_D ....
Probab=87.20 E-value=0.38 Score=29.44 Aligned_cols=57 Identities=18% Similarity=0.107 Sum_probs=39.1
Q ss_pred HHHHhcChhHHhh----cCCCCcEEEEEeCCCCccC----------HHHHHHHHHHHHHcCCceEEEEecC
Q psy472 11 EGYKIAALNNKVD----RIRDKQYLLVHGTMDDNVH----------FQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 11 ~~y~~~sp~~~~~----~~~~~p~ll~hG~~D~~V~----------~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.|...+++..+. +.+..++++..|+.|.... ......+...|+.+|+.+.++.|++
T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~i~l~~G~~d~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~G 233 (251)
T PF00756_consen 163 EAWKENDPFDLIKALSQKKKPLRIYLDVGTKDEFGGWEDSAQILQFLANNRELAQLLKAKGIPHTYHVFPG 233 (251)
T ss_dssp GHHGGCHHHHHHHHHHHTTSEEEEEEEEETTSTTHHCSHHHHHHHHHHHHHHHHHHCCCEECTTESEEEHS
T ss_pred HHhhhccHHHHhhhhhcccCCCeEEEEeCCCCcccccccCHHHHHHHHHhHhhHHHHHHcCCCceEEEecC
Confidence 3444555555442 2224788999999999442 3455667777888999999999885
No 112
>COG3208 GrsT Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=87.08 E-value=1.7 Score=27.91 Aligned_cols=41 Identities=12% Similarity=0.001 Sum_probs=33.0
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.+ ..|+..+-|.+|..|..++...-. +..+..|++++|+|+
T Consensus 174 pl-~~pi~~~~G~~D~~vs~~~~~~W~---~~t~~~f~l~~fdGg 214 (244)
T COG3208 174 PL-ACPIHAFGGEKDHEVSRDELGAWR---EHTKGDFTLRVFDGG 214 (244)
T ss_pred Cc-CcceEEeccCcchhccHHHHHHHH---HhhcCCceEEEecCc
Confidence 35 599999999999999988766554 445668999999985
No 113
>COG3458 Acetyl esterase (deacetylase) [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=86.57 E-value=0.99 Score=29.80 Aligned_cols=40 Identities=20% Similarity=0.194 Sum_probs=33.0
Q ss_pred hhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 18 LNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 18 p~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
..+.+.++ +.|+|+.-|..|+++|+.-...++++|....-
T Consensus 251 ~~n~A~Ri-K~pvL~svgL~D~vcpPstqFA~yN~l~~~K~ 290 (321)
T COG3458 251 IVNLAARI-KVPVLMSVGLMDPVCPPSTQFAAYNALTTSKT 290 (321)
T ss_pred hhhHHHhh-ccceEEeecccCCCCCChhhHHHhhcccCCce
Confidence 34556677 59999999999999999999999988766543
No 114
>COG3571 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=84.97 E-value=0.74 Score=28.39 Aligned_cols=26 Identities=15% Similarity=0.234 Sum_probs=21.3
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHH
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQS 46 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s 46 (68)
.|...+ ++|.||+||+.|..=.-++.
T Consensus 136 ~HL~gl-~tPtli~qGtrD~fGtr~~V 161 (213)
T COG3571 136 EHLTGL-KTPTLITQGTRDEFGTRDEV 161 (213)
T ss_pred hhccCC-CCCeEEeecccccccCHHHH
Confidence 567778 59999999999987766665
No 115
>PF06028 DUF915: Alpha/beta hydrolase of unknown function (DUF915); InterPro: IPR010315 This family consists of bacterial proteins of unknown function, which are hydrolase-like.; PDB: 3LP5_A 3FLE_A 3DS8_A.
Probab=84.80 E-value=1.5 Score=28.08 Aligned_cols=41 Identities=22% Similarity=0.178 Sum_probs=36.5
Q ss_pred CCcEEEEEeC------CCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGT------MDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~------~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+..+|.|-|. .|-.||...+..+..-++.....++..+|.|
T Consensus 184 ~i~VLnI~G~~~~g~~sDG~V~~~Ss~sl~~L~~~~~~~Y~e~~v~G 230 (255)
T PF06028_consen 184 NIQVLNIYGDLEDGSNSDGIVPNASSLSLRYLLKNRAKSYQEKTVTG 230 (255)
T ss_dssp T-EEEEEEEESBTTCSBTSSSBHHHHCTHHHHCTTTSSEEEEEEEES
T ss_pred CeEEEEEecccCCCCCCCeEEeHHHHHHHHHHhhcccCceEEEEEEC
Confidence 4789999999 8999999999999999999899998888875
No 116
>PLN02633 palmitoyl protein thioesterase family protein
Probab=84.58 E-value=3.7 Score=27.29 Aligned_cols=38 Identities=11% Similarity=0.100 Sum_probs=24.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHH-cCCceEEEEe
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQH-ADIMFQSQEC 65 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~-~g~~~~~~~~ 65 (68)
..|+.||||..|..... -...+.+.+.+ .|++...+..
T Consensus 25 ~~P~ViwHG~GD~c~~~-g~~~~~~l~~~~~g~~~~~i~i 63 (314)
T PLN02633 25 SVPFIMLHGIGTQCSDA-TNANFTQLLTNLSGSPGFCLEI 63 (314)
T ss_pred CCCeEEecCCCcccCCc-hHHHHHHHHHhCCCCceEEEEE
Confidence 48999999999998875 33334444433 3555554443
No 117
>KOG2624|consensus
Probab=84.01 E-value=1.6 Score=29.86 Aligned_cols=50 Identities=20% Similarity=0.170 Sum_probs=41.8
Q ss_pred CCHHHHHhcC-hhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 8 DNLEGYKIAA-LNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 8 ~~~~~y~~~s-p~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
+|...|.... |...+++. +.|+.|++|..|-.+.++....+...+....+
T Consensus 313 ~N~~~Y~q~~pP~Y~l~~i-~~P~~l~~g~~D~l~~~~DV~~~~~~~~~~~~ 363 (403)
T KOG2624|consen 313 RNLKHYGQSTPPEYDLTNI-KVPTALYYGDNDWLADPEDVLILLLVLPNSVI 363 (403)
T ss_pred ccHhhcCCCCCCCCCcccc-ccCEEEEecCCcccCCHHHHHHHHHhcccccc
Confidence 4677777666 56667788 69999999999999999999999888877766
No 118
>PF02244 Propep_M14: Carboxypeptidase activation peptide; InterPro: IPR003146 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The peptidases are synthesised as inactive molecules, zymogens, with propeptides that must be removed by proteolytic cleavage to activate the enzyme. Structural studies of carboxypeptidases A and B reveal the propeptide to exist as a globular domain, followed by an extended alpha-helix; this shields the catalytic site, without specifically binding to it, while the substrate-binding site is blocked by making specific contacts [, ]. Members of this propeptide family are found in the metallocarboxypeptidases: A1, A2 [], A3, A4, A5, A6, U, insect gut carboxypeptidase and B [], and and are associated with peptidases belonging to MEROPS peptidase family M14A. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase.; GO: 0004180 carboxypeptidase activity, 0006508 proteolysis; PDB: 3D68_C 3D67_B 3D66_A 1PBA_A 3GLJ_A 1NSA_A 1KWM_A 2BOA_B 1PYT_A 1JQG_A ....
Probab=83.15 E-value=0.81 Score=23.31 Aligned_cols=30 Identities=13% Similarity=0.066 Sum_probs=25.5
Q ss_pred eCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 35 GTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 35 G~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
+..|-.||+.+...+.+.|.+.|+.++.++
T Consensus 34 ~~~dv~V~p~~~~~f~~~L~~~~i~~~v~i 63 (74)
T PF02244_consen 34 RPVDVMVPPEKLEEFEELLKEHGIEYEVLI 63 (74)
T ss_dssp SEEEEEEEGGGHHHHHHHHHHTT-EEEEEE
T ss_pred CeEEEEECHHHHHHHHHHHHHCCCcEEEEe
Confidence 335778999999999999999999999876
No 119
>TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily.
Probab=83.12 E-value=2.5 Score=26.64 Aligned_cols=20 Identities=15% Similarity=0.260 Sum_probs=16.3
Q ss_pred hhcCCCCcEEEEEeCCCCccC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVH 42 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~ 42 (68)
+..+ +.|+|+++|+.|...+
T Consensus 203 l~~~-~~P~ll~~g~~D~~~~ 222 (274)
T TIGR03100 203 LERF-QGPVLFILSGNDLTAQ 222 (274)
T ss_pred HHhc-CCcEEEEEcCcchhHH
Confidence 4456 5999999999999853
No 120
>cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III).
Probab=83.05 E-value=3.8 Score=21.65 Aligned_cols=28 Identities=14% Similarity=0.075 Sum_probs=16.0
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
+|+.||+.+..-.......++++|+++.
T Consensus 3 llv~HGs~~~s~~~~~~~~~~~~l~~~~ 30 (101)
T cd03409 3 LVVGHGSPYKDPYKKDIEAQAHNLAESL 30 (101)
T ss_pred EEEECCCCCCccHHHHHHHHHHHHHHHC
Confidence 4566777765333345556666665543
No 121
>PF02089 Palm_thioest: Palmitoyl protein thioesterase; InterPro: IPR002472 Neuronal ceroid lipofuscinoses (NCL) represent a group of encephalopathies that occur in 1 in 12,500 children. Mutations in the palmitoyl protein thioesterase gene causing infantile neuronal ceroid lipofuscinosis []. The most common mutation results in intracellular accumulation of the polypeptide and undetectable enzyme activity in the brain. Direct sequencing of cDNAs derived from brain RNA of INCL patients has shown a mis-sense transversion of A to T at nucleotide position 364, which results in substitution of Trp for Arg at position 122 in the protein - Arg 122 is immediately adjacent to a lipase consensus sequence that contains the putative active site Ser of PPT. The occurrence of this and two other independent mutations in the PPT gene strongly suggests that defects in this gene cause INCL.; GO: 0008474 palmitoyl-(protein) hydrolase activity, 0006464 protein modification process; PDB: 3GRO_B 1PJA_A 1EXW_A 1EH5_A 1EI9_A.
Probab=81.60 E-value=1.3 Score=28.84 Aligned_cols=26 Identities=15% Similarity=0.146 Sum_probs=12.2
Q ss_pred CCcEEEEEeCCCCccC---HHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVH---FQQSMMLAKS 52 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~---~~~s~~l~~~ 52 (68)
..|+.||||..|.... +.....+.++
T Consensus 5 ~~PvViwHGmGD~~~~~~~m~~i~~~i~~ 33 (279)
T PF02089_consen 5 PLPVVIWHGMGDSCCNPSSMGSIKELIEE 33 (279)
T ss_dssp S--EEEE--TT--S--TTTHHHHHHHHHH
T ss_pred CCcEEEEEcCccccCChhHHHHHHHHHHH
Confidence 3799999999998753 4555555543
No 122
>KOG4409|consensus
Probab=81.60 E-value=4.8 Score=27.30 Aligned_cols=53 Identities=11% Similarity=0.123 Sum_probs=35.9
Q ss_pred HHhcChhHHhhcCC-CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 13 YKIAALNNKVDRIR-DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 13 y~~~sp~~~~~~~~-~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+++..-++.+..++ +.|+++++| .++-++..-...+.+.| ....++.++.|++
T Consensus 288 ~Ar~Pm~~r~~~l~~~~pv~fiyG-~~dWmD~~~g~~~~~~~--~~~~~~~~~v~~a 341 (365)
T KOG4409|consen 288 WARRPMIQRLRELKKDVPVTFIYG-DRDWMDKNAGLEVTKSL--MKEYVEIIIVPGA 341 (365)
T ss_pred hhhhhHHHHHHhhccCCCEEEEec-CcccccchhHHHHHHHh--hcccceEEEecCC
Confidence 44455555555552 499999999 55667888888888877 4445667766653
No 123
>TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily.
Probab=80.30 E-value=6.4 Score=24.77 Aligned_cols=37 Identities=24% Similarity=0.240 Sum_probs=27.2
Q ss_pred CCcEEEEEeCCCCcc-CHHHHHHHHHHHHHcCCceEEE
Q psy472 27 DKQYLLVHGTMDDNV-HFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V-~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.++++++||..+..+ .......+++.|.++|..+-..
T Consensus 26 ~~~vv~i~gg~~~~~g~~~~~~~la~~l~~~G~~v~~~ 63 (274)
T TIGR03100 26 TTGVLIVVGGPQYRVGSHRQFVLLARRLAEAGFPVLRF 63 (274)
T ss_pred CCeEEEEeCCccccCCchhHHHHHHHHHHHCCCEEEEe
Confidence 368999999888766 4455667899999988655433
No 124
>PF08357 SEFIR: SEFIR domain; InterPro: IPR013568 This domain is found in IL17 receptors (IL17Rs, e.g. Q60943 from SWISSPROT) and SEF proteins (e.g. Q8QHJ9 from SWISSPROT). The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (IPR000157 from INTERPRO), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways [].
Probab=80.23 E-value=4.3 Score=23.20 Aligned_cols=37 Identities=16% Similarity=0.198 Sum_probs=31.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHc-CCceEEEEe
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHA-DIMFQSQEC 65 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~-g~~~~~~~~ 65 (68)
+++||++.- |.--+.+....|++.|++. |+++.+-.|
T Consensus 1 ~kVfI~Ys~-d~~~h~~~V~~la~~L~~~~g~~V~lD~~ 38 (150)
T PF08357_consen 1 RKVFISYSH-DSEEHKEWVLALAEFLRQNCGIDVILDQW 38 (150)
T ss_pred CeEEEEeCC-CCHHHHHHHHHHHHHHHhccCCceeecHH
Confidence 468899985 8888889999999999999 998876444
No 125
>TIGR00976 /NonD putative hydrolase, CocE/NonD family. This model represents a protein subfamily that includes the cocaine esterase CocE, several glutaryl-7-ACA acylases, and the putative diester hydrolase NonD of Streptomyces griseus (all hydrolases). This family shows extensive, low-level similarity to a family of xaa-pro dipeptidyl-peptidases, and local similarity by PSI-BLAST to many other hydrolases.
Probab=79.53 E-value=6.1 Score=27.56 Aligned_cols=48 Identities=17% Similarity=0.115 Sum_probs=34.6
Q ss_pred HHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 13 YKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 13 y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
+...+....++++ +.|+|++.|=.| .....+...+.++.+.+ +..+++
T Consensus 219 w~~~~~~~~~~~i-~vP~l~~~gw~D--~~~~g~~~~~~~~~~~~-~~~lil 266 (550)
T TIGR00976 219 WVSISLWRDLGGS-DVPTLVTGGWYD--NHSRGSIRLFLAVHRGG-AQRLVV 266 (550)
T ss_pred hccCchhhHhcCC-CCCEEEeCcccC--CCCchHHHHHHHHhhcC-CceEEE
Confidence 3345555567778 699999999999 36778888888877764 444444
No 126
>PF07755 DUF1611: Protein of unknown function (DUF1611); InterPro: IPR011669 This entry contains a number of hypothetical bacterial and archaeal proteins. The region is approximately 350 residues long. A member of this family (Q6M063 from SWISSPROT) is thought to associate with another subunit to form an H+-transporting ATPase, but no evidence has been found to support this.; PDB: 2G0T_A 2OBN_A.
Probab=79.27 E-value=8.3 Score=25.49 Aligned_cols=39 Identities=15% Similarity=0.131 Sum_probs=32.6
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
+. +.|.+++-||....==+.-+..|.++|+++|++..+.
T Consensus 109 ~~-~~~rv~~vGTDcavGK~tTal~L~~~l~~~G~~a~fv 147 (301)
T PF07755_consen 109 EV-KAKRVLTVGTDCAVGKMTTALELRRALRERGINAGFV 147 (301)
T ss_dssp G--SSEEEEEEESSSSSSHHHHHHHHHHHHHHTT--EEEE
T ss_pred cC-CCCEEEEEccCccccHHHHHHHHHHHHHHcCCCceEE
Confidence 44 5899999999998888999999999999999998775
No 127
>COG3243 PhaC Poly(3-hydroxyalkanoate) synthetase [Lipid metabolism]
Probab=77.93 E-value=6.1 Score=27.48 Aligned_cols=41 Identities=10% Similarity=-0.044 Sum_probs=31.6
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
...+ +.|++.+.|..|.++|+.-....++.+.. .+++....
T Consensus 326 L~~I-t~pvy~~a~~~DhI~P~~Sv~~g~~l~~g---~~~f~l~~ 366 (445)
T COG3243 326 LGDI-TCPVYNLAAEEDHIAPWSSVYLGARLLGG---EVTFVLSR 366 (445)
T ss_pred hhhc-ccceEEEeecccccCCHHHHHHHHHhcCC---ceEEEEec
Confidence 4456 69999999999999999988888876543 55555544
No 128
>PRK10834 vancomycin high temperature exclusion protein; Provisional
Probab=77.83 E-value=3.7 Score=26.25 Aligned_cols=39 Identities=21% Similarity=0.220 Sum_probs=30.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
..+.+|+.|.... -.+.++..|.+.|.+.|++-+-++..
T Consensus 81 k~~~ilvSGg~~~-~~~~Ea~~M~~yLi~~GVp~e~Ii~e 119 (239)
T PRK10834 81 KVNYLLLSGDNAL-QSYNEPMTMRKDLIAAGVDPSDIVLD 119 (239)
T ss_pred CCCEEEEeCCCCC-CCCCHHHHHHHHHHHcCCCHHHEEec
Confidence 3677888887644 46789999999999999987766544
No 129
>cd05560 Xcc1710_like Xcc1710_like family, specific to proteobacteria. Xcc1710 is a hypothetical protein from Xanthomonas campestris pv. campestris str. ATCC 33913, similar to Mth938, a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. Their three-dimensional structures have been determined, but their functions are unknown.
Probab=76.03 E-value=6.7 Score=21.84 Aligned_cols=34 Identities=15% Similarity=0.075 Sum_probs=26.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.|=+++-|+......+ ...+.+.|+++|+.+|.+
T Consensus 52 ~peiliiGTG~~~~~~--~~~~~~~l~~~gi~vE~m 85 (109)
T cd05560 52 QPEVILLGTGERQRFP--PPALLAPLLARGIGVEVM 85 (109)
T ss_pred CCCEEEEecCCCCCcC--CHHHHHHHHHcCCeEEEE
Confidence 4667888888887766 466778899999998875
No 130
>COG2382 Fes Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]
Probab=75.26 E-value=9.8 Score=25.19 Aligned_cols=25 Identities=20% Similarity=0.196 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHcCCceEEEEecCC
Q psy472 44 QQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 44 ~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
....+|++.|+++|+++.+..|+|+
T Consensus 254 ~pNr~L~~~L~~~g~~~~yre~~Gg 278 (299)
T COG2382 254 RPNRALAAQLEKKGIPYYYREYPGG 278 (299)
T ss_pred chhHHHHHHHHhcCCcceeeecCCC
Confidence 5566789999999999999999986
No 131
>PF02273 Acyl_transf_2: Acyl transferase; InterPro: IPR003157 LuxD proteins are bacterial acyl transferases. Together with an acyl-protein synthetase (LuxE) and reductase (LuxC), they form a multienzyme complex. This complex channels activated fatty acids into the aldehyde substrate for the luciferase-catalyzed bacterial bioluminescence reaction [, ]. ; GO: 0016746 transferase activity, transferring acyl groups, 0006631 fatty acid metabolic process; PDB: 1THT_B.
Probab=74.99 E-value=3.9 Score=26.77 Aligned_cols=36 Identities=17% Similarity=0.231 Sum_probs=25.9
Q ss_pred ChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHH
Q psy472 17 ALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSL 53 (68)
Q Consensus 17 sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L 53 (68)
|.+..++.+ +.|++.+++.+|.-|-.....++.+.+
T Consensus 186 ST~~~~k~l-~iP~iaF~A~~D~WV~q~eV~~~~~~~ 221 (294)
T PF02273_consen 186 STINDMKRL-SIPFIAFTANDDDWVKQSEVEELLDNI 221 (294)
T ss_dssp HHHHHHTT---S-EEEEEETT-TTS-HHHHHHHHTT-
T ss_pred hHHHHHhhC-CCCEEEEEeCCCccccHHHHHHHHHhc
Confidence 567778888 799999999999999999888887643
No 132
>cd03413 CbiK_C Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), C-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases, and is a homomeric enzyme that does not require ATP for its enzymatic activity.
Probab=74.76 E-value=12 Score=20.58 Aligned_cols=26 Identities=23% Similarity=0.213 Sum_probs=17.1
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
+|+-||+.|.. .+.-..|.+.+++.+
T Consensus 4 llvgHGSr~~~--~~~~~~l~~~l~~~~ 29 (103)
T cd03413 4 VFMGHGTDHPS--NAVYAALEYVLREED 29 (103)
T ss_pred EEEECCCCchh--hhHHHHHHHHHHhcC
Confidence 45558888875 366666777776654
No 133
>TIGR00666 PBP4 D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family. In E. coli, this protein is known as penicillin binding protein 4 (dacB). A signal sequence is cleaved from a precursor form. The protein is described as periplasmic in E. coli (Gram-negative) and extracellular in Actinomadura R39 (Gram-positive). Unlike some other proteins with similar activity, it does not form transpeptidation. It is not essential for viability. This family is related to class A beta-lactamases.
Probab=74.46 E-value=7.2 Score=25.90 Aligned_cols=31 Identities=16% Similarity=0.129 Sum_probs=28.4
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
--|++.|..|+....+.-..|+++|++.|+.
T Consensus 55 G~L~i~G~GDP~L~~~~L~~la~~l~~~Gi~ 85 (345)
T TIGR00666 55 GNLVLRFGGDPTLKRQDIRNLVATLKKSGVK 85 (345)
T ss_pred ccEEEEeecCCCcCHHHHHHHHHHHHHcCCc
Confidence 3688999999999999999999999999985
No 134
>PLN02213 sinapoylglucose-malate O-sinapoyltransferase/ carboxypeptidase
Probab=73.61 E-value=3.2 Score=27.08 Aligned_cols=31 Identities=23% Similarity=0.245 Sum_probs=27.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
..++||..|..|.+|++--++...++|.=.+
T Consensus 233 ~i~VliY~Gd~D~icn~~g~~~wi~~L~w~~ 263 (319)
T PLN02213 233 GYRSLIYSGDHDIAVPFLATQAWIRSLNYSP 263 (319)
T ss_pred CceEEEEECCcCeeCCcHhHHHHHHhcCCCC
Confidence 4799999999999999999999999987444
No 135
>PF06500 DUF1100: Alpha/beta hydrolase of unknown function (DUF1100); InterPro: IPR010520 Proteins in this entry display esterase activity toward pNP-butyrate []. This entry also includes 2,6-dihydropseudooxynicotine hydrolase which has a role in nicotine catabolism by cleaving a C-C bond in 2,6-dihydroxypseudooxyicotine [, ].; PDB: 3OUR_A 3MVE_B 2JBW_C.
Probab=73.42 E-value=6.2 Score=27.18 Aligned_cols=25 Identities=20% Similarity=0.306 Sum_probs=19.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAK 51 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~ 51 (68)
.+|+|.+.|..|+++|.+.+..++.
T Consensus 352 ~~plL~i~~~~D~v~P~eD~~lia~ 376 (411)
T PF06500_consen 352 PTPLLAINGEDDPVSPIEDSRLIAE 376 (411)
T ss_dssp SS-EEEEEETT-SSS-HHHHHHHHH
T ss_pred CcceEEeecCCCCCCCHHHHHHHHh
Confidence 4899999999999999999988775
No 136
>PF02113 Peptidase_S13: D-Ala-D-Ala carboxypeptidase 3 (S13) family; InterPro: IPR000667 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This family of serine peptidases belong to MEROPS peptidase family S13 (D-Ala-D-Ala carboxypeptidase C, clan SE). The predicted active site residues for members of this family and family S12 occur in the motif SXXK. D-Ala-D-Ala carboxypeptidase C is involved in the metabolism of cell components []; it is synthesised with a leader peptide to target it to the cell membrane []. After cleavage of the leader peptide, the enzyme is retained in the membrane by a C-terminal anchor []. There are three families of serine-type D-Ala-D-Ala peptidase (designated S11, S12 and S13), which are also known as low molecular weight penicillin-binding proteins []. Family S13 comprises D-Ala-D-Ala peptidases that have sufficient sequence similarity around their active sites to assume a distant evolutionary relationship to other clan members; members of the S13 family also bind penicillin and have D-amino-peptidase activity. Proteases of family S11 have exclusive D-Ala-D-Ala peptidase activity, while some members of S12 are C beta-lactamases [].; GO: 0004185 serine-type carboxypeptidase activity, 0006508 proteolysis; PDB: 3A3F_B 3A3E_B 3A3D_A 3A3I_B 2Y59_C 1W8Q_A 3ZVT_B 3ZVW_B 2VGJ_B 1W79_D ....
Probab=73.11 E-value=8.1 Score=26.57 Aligned_cols=32 Identities=25% Similarity=0.208 Sum_probs=27.7
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
--|++.|..|+....++...++++|+++|+.-
T Consensus 82 G~L~l~G~GDP~l~~~~l~~la~~l~~~Gi~~ 113 (444)
T PF02113_consen 82 GDLYLKGGGDPSLTSEDLWALAAQLKAAGIKR 113 (444)
T ss_dssp SEEEEEECSBTTBCHHHHHHHHHHHHHTT-SE
T ss_pred CcEEEEecCCCccCHHHHHHHHHHHHHcCCCe
Confidence 46778899999999999999999999999843
No 137
>PRK11113 D-alanyl-D-alanine carboxypeptidase/endopeptidase; Provisional
Probab=72.90 E-value=7.8 Score=26.94 Aligned_cols=30 Identities=13% Similarity=0.094 Sum_probs=27.8
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
-|++.|..|+.---+.-..|+++|++.|+.
T Consensus 100 dL~i~G~GDPtL~~~~L~~la~~l~~~GI~ 129 (477)
T PRK11113 100 DLIARFGGDPTLTRQDLRNMVATLKKSGVK 129 (477)
T ss_pred eEEEEEecCCCCCHHHHHHHHHHHHHcCCc
Confidence 588899999999999999999999999985
No 138
>KOG2521|consensus
Probab=72.46 E-value=11 Score=25.53 Aligned_cols=40 Identities=10% Similarity=-0.001 Sum_probs=34.8
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.+-+-|..|.++|..+.++++...++.|+.+...-|.+
T Consensus 226 ~~~ly~~s~~d~v~~~~~ie~f~~~~~~~g~~v~s~~~~d 265 (350)
T KOG2521|consen 226 WNQLYLYSDNDDVLPADEIEKFIALRREKGVNVKSVKFKD 265 (350)
T ss_pred ccceeecCCccccccHHHHHHHHHHHHhcCceEEEeeccC
Confidence 5667777999999999999999999999999988776654
No 139
>COG1647 Esterase/lipase [General function prediction only]
Probab=72.36 E-value=7 Score=25.10 Aligned_cols=38 Identities=18% Similarity=0.211 Sum_probs=29.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.-+|++||-. =.+.....|.+.|+++|-.+-.--|||
T Consensus 15 ~~AVLllHGFT---Gt~~Dvr~Lgr~L~e~GyTv~aP~ypG 52 (243)
T COG1647 15 NRAVLLLHGFT---GTPRDVRMLGRYLNENGYTVYAPRYPG 52 (243)
T ss_pred CEEEEEEeccC---CCcHHHHHHHHHHHHCCceEecCCCCC
Confidence 47899999954 456777889999999998776655554
No 140
>PF11339 DUF3141: Protein of unknown function (DUF3141); InterPro: IPR024501 This family of proteins appears to be predominantly expressed in Proteobacteria. Their function is unknown.
Probab=72.34 E-value=4.6 Score=28.88 Aligned_cols=26 Identities=15% Similarity=0.129 Sum_probs=23.0
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHH
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMM 48 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~ 48 (68)
+.++ ..|++++-|..|.+.|++|++.
T Consensus 293 Lr~I-r~Piivfas~gDnITPP~QaL~ 318 (581)
T PF11339_consen 293 LRNI-RSPIIVFASYGDNITPPQQALN 318 (581)
T ss_pred hhhC-CCCEEEEeccCCCCCChhHhcc
Confidence 5667 5999999999999999999954
No 141
>PF00450 Peptidase_S10: Serine carboxypeptidase; InterPro: IPR001563 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to MEROPS peptidase family S10 (clan SC). The type example is carboxypeptidase Y from Saccharomyces cerevisiae (Baker's yeast) []. All known carboxypeptidases are either metallo carboxypeptidases or serine carboxypeptidases (3.4.16.5 from EC and 3.4.16.6 from EC). The catalytic activity of the serine carboxypeptidases, like that of the trypsin family serine proteases, is provided by a charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which is itself hydrogen-bonded to a serine []. The sequences surrounding the active site serine and histidine residues are highly conserved in all the serine carboxypeptidases.; GO: 0004185 serine-type carboxypeptidase activity, 0006508 proteolysis; PDB: 1AC5_A 1WHS_B 3SC2_B 1WHT_A 1BCR_A 1BCS_A 1GXS_A 1IVY_A 1WPX_A 1YSC_A ....
Probab=72.06 E-value=5.1 Score=26.32 Aligned_cols=32 Identities=16% Similarity=0.199 Sum_probs=24.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
..++|+.+|..|-+|++-.++...+.|.=.+.
T Consensus 330 ~irVLiy~Gd~D~i~n~~Gt~~~i~~L~w~~~ 361 (415)
T PF00450_consen 330 GIRVLIYNGDLDLICNFLGTERWIDNLNWSGK 361 (415)
T ss_dssp T-EEEEEEETT-SSS-HHHHHHHHHCTECTEE
T ss_pred cceeEEeccCCCEEEEeccchhhhhccccCcc
Confidence 48999999999999999999999888765544
No 142
>COG0627 Predicted esterase [General function prediction only]
Probab=72.00 E-value=4.2 Score=26.85 Aligned_cols=41 Identities=17% Similarity=0.102 Sum_probs=34.7
Q ss_pred CCcEEEEEeCCCCccC--HHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVH--FQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~--~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..++++-.|..|.... ...+..|.+++.+.|++.++..+++
T Consensus 247 ~~~~~~d~g~ad~~~~~~~~~~~~~~~a~~~~g~~~~~~~~~~ 289 (316)
T COG0627 247 PPELLIDNGPADFFLAANNLSTRAFAEALRAAGIPNGVRDQPG 289 (316)
T ss_pred CCccccccccchhhhhhcccCHHHHHHHHHhcCCCceeeeCCC
Confidence 4788899999999885 3457889999999999999988854
No 143
>TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB. This model describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reducted sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterized polysaccharide deacetylase; the exact function this protein family is unknown.
Probab=71.31 E-value=13 Score=22.17 Aligned_cols=37 Identities=14% Similarity=0.228 Sum_probs=27.0
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
+.+...++|+|-..+...-.+.-..+...|+++|..+
T Consensus 148 ~~~~g~Iil~Hd~~~~~~t~~~l~~~i~~l~~~Gy~~ 184 (191)
T TIGR02764 148 NTKPGDIILLHASDSAKQTVKALPTIIKKLKEKGYEF 184 (191)
T ss_pred cCCCCCEEEEeCCCCcHhHHHHHHHHHHHHHHCCCEE
Confidence 4445789999965455455667788999999999655
No 144
>TIGR03012 sulf_tusD_dsrE sulfur relay protein TusD/DsrE. The three proteins TusB, TusC, and TusD form a heterohexamer responsible for a sulfur relay reaction. In large numbers of proteobacterial species, this complex acts on a Cys-derived persulfide moiety, delivered by the cysteine desulfurase IscS to TusA, then to TusBCD. The activated sulfur group is then transferred to TusE (DsrC), then by MnmA (TrmU) for modification of an anticodon nucleotide in tRNAs for Glu, Lys, and Gln. The sulfur relay complex TusBCD is also found, under the designation DsrEFH, in phototrophic and chemotrophic sulfur bacteria, such as Chromatium vinosum. In these organisms, it seems the primary purpose is related to sulfur flux, such as oxidation from sulfide to molecular sulfur to sulfate.
Probab=71.28 E-value=16 Score=20.77 Aligned_cols=38 Identities=16% Similarity=0.034 Sum_probs=27.5
Q ss_pred EEEEeCCCCccCHHHHHHHHHHHHHcCCc-eEEEEecCC
Q psy472 31 LLVHGTMDDNVHFQQSMMLAKSLQHADIM-FQSQECYRR 68 (68)
Q Consensus 31 ll~hG~~D~~V~~~~s~~l~~~L~~~g~~-~~~~~~~~~ 68 (68)
+++.+.-...-....+.+++.++.+.|-+ ..+.+|.+|
T Consensus 4 iv~~~~P~~~~~~~~al~~A~aa~~~gh~v~~vFf~~Dg 42 (127)
T TIGR03012 4 LLVTGPPYGTQAASSAYQFAQALLAKGHEIVRVFFYQDG 42 (127)
T ss_pred EEEeCCCCCcHHHHHHHHHHHHHHHCCCcEEEEEEehHH
Confidence 44555555556667888999999998887 578877664
No 145
>KOG2541|consensus
Probab=70.71 E-value=17 Score=24.05 Aligned_cols=25 Identities=16% Similarity=0.166 Sum_probs=21.5
Q ss_pred CcEEEEEeCCCCccC--HHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVH--FQQSMMLAKS 52 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~--~~~s~~l~~~ 52 (68)
.|+.++||..|.... +.+-.++.+.
T Consensus 24 ~P~ii~HGigd~c~~~~~~~~~q~l~~ 50 (296)
T KOG2541|consen 24 VPVIVWHGIGDSCSSLSMANLTQLLEE 50 (296)
T ss_pred CCEEEEeccCcccccchHHHHHHHHHh
Confidence 899999999999988 7777777766
No 146
>PF13676 TIR_2: TIR domain; PDB: 3H16_B 3UB4_A 2Y92_A 3UB3_A 3UB2_A.
Probab=70.46 E-value=5.7 Score=21.02 Aligned_cols=27 Identities=22% Similarity=0.228 Sum_probs=20.4
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
++|.|...|.. .+..+.+.|++.|..+
T Consensus 1 VFIS~~~~D~~----~a~~l~~~L~~~g~~v 27 (102)
T PF13676_consen 1 VFISYSSEDRE----FAERLAERLESAGIRV 27 (102)
T ss_dssp EEEEEEGGGCC----CHHHHHHHHHHTT--E
T ss_pred eEEEecCCcHH----HHHHHHHHHhhcCCEE
Confidence 57889988844 7888999999998854
No 147
>PF14714 KH_dom-like: KH-domain-like of EngA bacterial GTPase enzymes, C-terminal; PDB: 2HJG_A 1MKY_A.
Probab=68.67 E-value=15 Score=19.31 Aligned_cols=37 Identities=5% Similarity=-0.003 Sum_probs=26.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHH----cCCceEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQH----ADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~----~g~~~~~~ 63 (68)
.+|.+++++.....++.+-..-|.+.|++ .|++..+.
T Consensus 38 ~PPtFv~f~N~~~~~~~sY~ryL~n~lRe~f~f~G~Pi~l~ 78 (80)
T PF14714_consen 38 RPPTFVLFVNDPELLPESYKRYLENQLREAFGFEGVPIRLI 78 (80)
T ss_dssp TTTEEEEEES-CCC--HHHHHHHHHHHHHHH--TTS--EEE
T ss_pred CCCEEEEEeCCcccCCHHHHHHHHHHHHHHCCCCceeEEEE
Confidence 59999999999999999998888888777 35555543
No 148
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=67.97 E-value=12 Score=19.13 Aligned_cols=25 Identities=20% Similarity=0.129 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHcCCceEEEEecC
Q psy472 43 FQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..++.++.+.|+++|+++++.--|.
T Consensus 11 t~~a~~~ek~lk~~gi~~~liP~P~ 35 (73)
T PF11823_consen 11 THDAMKAEKLLKKNGIPVRLIPTPR 35 (73)
T ss_pred HHHHHHHHHHHHHCCCcEEEeCCCh
Confidence 4688899999999999988876553
No 149
>PRK00923 sirohydrochlorin cobaltochelatase; Reviewed
Probab=67.21 E-value=15 Score=20.45 Aligned_cols=27 Identities=15% Similarity=0.146 Sum_probs=18.6
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
+|+-||+.|+.- ......+++.+++++
T Consensus 5 vlv~hGS~~~~~-~~~~~~~~~~l~~~~ 31 (126)
T PRK00923 5 LLVGHGSRLPYN-KEVVTKIAEKIKEKH 31 (126)
T ss_pred EEEeCCCCChHH-HHHHHHHHHHHHHhC
Confidence 567799987632 366677788887754
No 150
>KOG4178|consensus
Probab=66.83 E-value=15 Score=24.58 Aligned_cols=20 Identities=20% Similarity=0.474 Sum_probs=17.9
Q ss_pred cCCCCcEEEEEeCCCCccCHH
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQ 44 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~ 44 (68)
.+ ..|++++.|..|..-++.
T Consensus 256 ~i-~iPv~fi~G~~D~v~~~p 275 (322)
T KOG4178|consen 256 KI-TIPVLFIWGDLDPVLPYP 275 (322)
T ss_pred cc-ccceEEEEecCcccccch
Confidence 45 599999999999999987
No 151
>PLN03016 sinapoylglucose-malate O-sinapoyltransferase
Probab=66.47 E-value=5.9 Score=27.21 Aligned_cols=32 Identities=25% Similarity=0.279 Sum_probs=28.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
.-++||..|..|-+|++--++...++|.=.+.
T Consensus 347 ~irVLiY~Gd~D~icn~~Gt~~wi~~L~w~~~ 378 (433)
T PLN03016 347 GYRSLIYSGDHDIAVPFLATQAWIRSLNYSPI 378 (433)
T ss_pred CceEEEEECCccccCCcHhHHHHHHhCCCCCC
Confidence 47999999999999999999999999875553
No 152
>cd03414 CbiX_SirB_C Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), C-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both CbiX and SirB are found in a wide range of bacteria.
Probab=65.87 E-value=12 Score=20.42 Aligned_cols=27 Identities=19% Similarity=0.272 Sum_probs=17.2
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
+|+-||+.|+.. ......+++.|.+..
T Consensus 4 llv~HGS~~~~~-~~~~~~l~~~l~~~~ 30 (117)
T cd03414 4 VLVGRGSSDPDA-NADVAKIARLLEEGT 30 (117)
T ss_pred EEEcCCCCCHHH-HHHHHHHHHHHHHhc
Confidence 567788877543 256666777776543
No 153
>KOG3253|consensus
Probab=65.87 E-value=19 Score=26.56 Aligned_cols=42 Identities=12% Similarity=0.068 Sum_probs=33.1
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+... +.|+|++.|..|..++.+..+.+..++++. +++++..+
T Consensus 300 Lldm-k~PVLFV~Gsnd~mcspn~ME~vreKMqA~---~elhVI~~ 341 (784)
T KOG3253|consen 300 LLDM-KQPVLFVIGSNDHMCSPNSMEEVREKMQAE---VELHVIGG 341 (784)
T ss_pred hHhc-CCceEEEecCCcccCCHHHHHHHHHHhhcc---ceEEEecC
Confidence 3345 599999999999999999999999887764 44555543
No 154
>PRK00207 sulfur transfer complex subunit TusD; Validated
Probab=65.78 E-value=21 Score=20.39 Aligned_cols=38 Identities=11% Similarity=0.003 Sum_probs=25.3
Q ss_pred EEEEeCCCCccCHHHHHHHHHHHHHcCCc-eEEEEecCC
Q psy472 31 LLVHGTMDDNVHFQQSMMLAKSLQHADIM-FQSQECYRR 68 (68)
Q Consensus 31 ll~hG~~D~~V~~~~s~~l~~~L~~~g~~-~~~~~~~~~ 68 (68)
+++++.-...--...+.+++.++.+.|-+ +.+.+|.+|
T Consensus 5 iv~~~~Py~~~~~~~al~~A~aa~~~gh~v~~vFf~~Dg 43 (128)
T PRK00207 5 IAVTGPAYGTQQASSAYQFAQALLAEGHELVSVFFYQDG 43 (128)
T ss_pred EEEcCCCCCCHHHHHHHHHHHHHHhCCCCeeEEEEehHH
Confidence 44444444445557888888888888876 577777653
No 155
>PF12646 DUF3783: Domain of unknown function (DUF3783); InterPro: IPR016621 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=65.68 E-value=14 Score=18.19 Aligned_cols=28 Identities=21% Similarity=0.336 Sum_probs=21.6
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcCCceE
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQ 61 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~ 61 (68)
++++++|-.+ ++...+.+++++.|++..
T Consensus 2 ~~ll~~g~~~-----~el~~~l~~~r~~~~~~~ 29 (58)
T PF12646_consen 2 EFLLFSGFSG-----EELDKFLDALRKAGIPIP 29 (58)
T ss_pred CEEEECCCCH-----HHHHHHHHHHHHcCCCcc
Confidence 6788888655 477888999999988543
No 156
>COG2949 SanA Uncharacterized membrane protein [Function unknown]
Probab=65.48 E-value=7 Score=24.88 Aligned_cols=38 Identities=21% Similarity=0.308 Sum_probs=32.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
+...||+.| +...+.+.+-..|.+.|.+.|++.+.++.
T Consensus 93 KV~~LLlSG-DN~~~sYnEp~tM~kdL~~~GVp~~~i~l 130 (235)
T COG2949 93 KVNYLLLSG-DNATVSYNEPRTMRKDLIAAGVPAKNIFL 130 (235)
T ss_pred CeeEEEEec-CCCcccccchHHHHHHHHHcCCCHHHeee
Confidence 588999999 45678899999999999999998876654
No 157
>PF08538 DUF1749: Protein of unknown function (DUF1749); InterPro: IPR013744 This is a plant and fungal family of unknown function. This family contains many hypothetical proteins. ; PDB: 2Q0X_B.
Probab=64.61 E-value=5.1 Score=26.48 Aligned_cols=33 Identities=18% Similarity=0.194 Sum_probs=9.6
Q ss_pred hcCCCCcEEEEEeCCCCccCHH-HHHHHHHHHHHc
Q psy472 23 DRIRDKQYLLVHGTMDDNVHFQ-QSMMLAKSLQHA 56 (68)
Q Consensus 23 ~~~~~~p~ll~hG~~D~~V~~~-~s~~l~~~L~~~ 56 (68)
..+ ..|+|++.|.+|+.||.. .-..|.+..++.
T Consensus 229 G~v-~~plLvl~Sg~DEyvP~~vdk~~Ll~rw~~a 262 (303)
T PF08538_consen 229 GKV-SKPLLVLYSGKDEYVPPWVDKEALLERWKAA 262 (303)
T ss_dssp GG---S-EEEEEE--TT------------------
T ss_pred ccC-CCceEEEecCCCceecccccccccccccccc
Confidence 345 479999999999999984 334455555543
No 158
>KOG1282|consensus
Probab=63.95 E-value=7.2 Score=27.16 Aligned_cols=27 Identities=30% Similarity=0.282 Sum_probs=25.3
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
-++||+.|+.|-+||+--++...++|.
T Consensus 364 ~rvliysGD~D~~~p~~gt~~~i~~L~ 390 (454)
T KOG1282|consen 364 YRVLIYSGDHDLVVPFLGTQAWIKSLN 390 (454)
T ss_pred eEEEEEeCCcceeCcchhhHHHHHhcc
Confidence 599999999999999999999988887
No 159
>COG1553 DsrE Uncharacterized conserved protein involved in intracellular sulfur reduction [Inorganic ion transport and metabolism]
Probab=63.27 E-value=15 Score=21.32 Aligned_cols=26 Identities=8% Similarity=-0.028 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHcC-CceEEEEecCC
Q psy472 43 FQQSMMLAKSLQHAD-IMFQSQECYRR 68 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g-~~~~~~~~~~~ 68 (68)
...+.+|+.+|.+.| ..+.+.+|.||
T Consensus 17 a~~A~~fA~all~~gh~~v~iFly~Dg 43 (126)
T COG1553 17 AFSALRFAEALLEQGHELVRLFLYQDG 43 (126)
T ss_pred HHHHHHHHHHHHHcCCeEEEEEEeecc
Confidence 457889999999995 88999998875
No 160
>cd03416 CbiX_SirB_N Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), N-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both are found in a wide range of bacteria. This subgroup also contains single domain proteins from archaea and bacteria which may represent the ancestral form of class II chelatases before domain duplication occurred.
Probab=63.23 E-value=18 Score=19.21 Aligned_cols=27 Identities=26% Similarity=0.267 Sum_probs=16.5
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
+|+-||+.++. .......+++.|++..
T Consensus 3 vlv~hGS~~~~-~~~~~~~l~~~l~~~~ 29 (101)
T cd03416 3 LLVGHGSRDPR-AAEALEALAERLRERL 29 (101)
T ss_pred EEEEcCCCCHH-HHHHHHHHHHHHHhhC
Confidence 46668877762 2245566777776654
No 161
>PTZ00472 serine carboxypeptidase (CBP1); Provisional
Probab=63.20 E-value=9.7 Score=26.29 Aligned_cols=31 Identities=16% Similarity=0.182 Sum_probs=27.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
..++||..|..|.+|++.-++...++|.=.|
T Consensus 364 gikVLiYnGd~D~icn~~Gt~~wi~~L~w~g 394 (462)
T PTZ00472 364 GVRVMIYAGDMDFICNWIGNKAWTLALQWPG 394 (462)
T ss_pred CceEEEEECCcCeecCcHhHHHHHHhCCCCC
Confidence 4799999999999999999999999987443
No 162
>cd00248 Mth938-like Mth938-like domain. The members of this family include: Mth938, 2P1, Xcr35, Rpa2829, and several uncharacterized sequences. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer. 2P1 is a partially characterized nuclear protein which is homologous to E3-3 from rat and known to be alternately spliced. Xcr35 and Rpa2829 are hypothetical proteins of unknown function from the Xanthomonas campestris and Rhodopseudomonas palustris genomes, respectively, for which the crystal structures have been determined.
Probab=62.68 E-value=18 Score=20.05 Aligned_cols=34 Identities=21% Similarity=0.220 Sum_probs=25.8
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.|=+++-|+......+. ..+.+.|+++|+.+|.+
T Consensus 52 ~peiliiGTG~~~~~~~--~~~~~~l~~~gI~vE~m 85 (109)
T cd00248 52 RPDILLIGTGAEIAFLP--RALRAALRAAGIGVEVM 85 (109)
T ss_pred CCCEEEEcCCCCCCcCC--HHHHHHHHHcCCeEEEe
Confidence 37788888888775543 44678899999998875
No 163
>COG1535 EntB Isochorismate hydrolase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=62.07 E-value=18 Score=22.68 Aligned_cols=41 Identities=17% Similarity=0.029 Sum_probs=32.0
Q ss_pred CCcEEEEEeCCCCccCH------------HHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHF------------QQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~------------~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
++++||||-.++-.|.+ .|..+|-.-.+++|+++-+..-|+
T Consensus 29 ~RavLLIhDMQ~YFv~~~~~~~~~~~~li~Ni~~Lr~~~~~~giPVvyTaqp~ 81 (218)
T COG1535 29 KRAVLLIHDMQNYFVSPWGENCPLMEQLIANIAKLRIWCKQAGIPVVYTAQPG 81 (218)
T ss_pred ccceeeeehhHHhhcCCCCCCCccHHHHHHHHHHHHHHHHHcCCcEEEEecCC
Confidence 58999999988776653 456677777889999998877665
No 164
>TIGR03230 lipo_lipase lipoprotein lipase. Members of this protein family are lipoprotein lipase (EC 3.1.1.34), a eukaryotic triacylglycerol lipase active in plasma and similar to pancreatic and hepatic triacylglycerol lipases (EC 3.1.1.3). It is also called clearing factor. It cleaves chylomicron and VLDL triacylglycerols; it also has phospholipase A-1 activity.
Probab=62.02 E-value=30 Score=24.07 Aligned_cols=40 Identities=13% Similarity=0.075 Sum_probs=24.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.++++++||-.+....-.....|.++|......+..+.++
T Consensus 41 ~ptvIlIHG~~~s~~~~~w~~~l~~al~~~~~d~nVI~VD 80 (442)
T TIGR03230 41 TKTFIVIHGWTVTGMFESWVPKLVAALYEREPSANVIVVD 80 (442)
T ss_pred CCeEEEECCCCcCCcchhhHHHHHHHHHhccCCCEEEEEE
Confidence 4899999998875422234445777765544445555443
No 165
>PF12715 Abhydrolase_7: Abhydrolase family; PDB: 3NUZ_C 3G8Y_A.
Probab=61.86 E-value=2.1 Score=29.22 Aligned_cols=36 Identities=17% Similarity=0.190 Sum_probs=20.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHH--HHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSL--QHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L--~~~g~~~~~~~~~~ 67 (68)
-+|+|++-|+.|..+|. +.++- ..+.-.++++.||+
T Consensus 306 PRPll~~nG~~Dklf~i-----V~~AY~~~~~p~n~~~~~~p~ 343 (390)
T PF12715_consen 306 PRPLLFENGGKDKLFPI-----VRRAYAIMGAPDNFQIHHYPK 343 (390)
T ss_dssp TS-EEESS-B-HHHHHH-----HHHHHHHTT-GGGEEE---GG
T ss_pred CCcchhhcCCcccccHH-----HHHHHHhcCCCcceEEeeccc
Confidence 48999999999998876 22333 34444788888874
No 166
>PF04083 Abhydro_lipase: Partial alpha/beta-hydrolase lipase region; InterPro: IPR006693 The alpha/beta hydrolase fold is common to several hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is similar: an alpha/beta sheet, not barrel, of eight beta-sheets connected by alpha-helices []. This entry represents the N-terminal part of an alpha/beta hydrolase domain found in a number of lipases.; GO: 0006629 lipid metabolic process; PDB: 1K8Q_B 1HLG_B.
Probab=61.51 E-value=5.6 Score=20.10 Aligned_cols=14 Identities=29% Similarity=0.513 Sum_probs=6.8
Q ss_pred CCcEEEEEeCCCCc
Q psy472 27 DKQYLLVHGTMDDN 40 (68)
Q Consensus 27 ~~p~ll~hG~~D~~ 40 (68)
++|+||.||..+..
T Consensus 43 k~pVll~HGL~~ss 56 (63)
T PF04083_consen 43 KPPVLLQHGLLQSS 56 (63)
T ss_dssp --EEEEE--TT--G
T ss_pred CCcEEEECCcccCh
Confidence 58999999987643
No 167
>PLN02209 serine carboxypeptidase
Probab=61.29 E-value=9.6 Score=26.25 Aligned_cols=32 Identities=28% Similarity=0.298 Sum_probs=28.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
..++|+..|..|-+|++--++...++|.=.+.
T Consensus 351 girVLiY~GD~D~icn~~Gte~wi~~L~w~~~ 382 (437)
T PLN02209 351 GYRSLIFSGDHDITMPFQATQAWIKSLNYSII 382 (437)
T ss_pred CceEEEEECCccccCCcHhHHHHHHhcCCccC
Confidence 47999999999999999999999999986665
No 168
>PRK09061 D-glutamate deacylase; Validated
Probab=61.01 E-value=46 Score=23.30 Aligned_cols=44 Identities=7% Similarity=0.078 Sum_probs=36.1
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
+... ..|+.+.|-...-.-+...+..+.+++++.|+++...+||
T Consensus 241 A~~~-G~rv~IsHlss~g~~~~~~~le~I~~Ar~~Gi~Vt~e~~P 284 (509)
T PRK09061 241 AAET-GAHMHICHVNSTSLRDIDRCLALVEKAQAQGLDVTTEAYP 284 (509)
T ss_pred HHHh-CCCEEEEeeccCCcccHHHHHHHHHHHHHcCCcEEEEecC
Confidence 3444 5789999988877677888999999999999999887776
No 169
>TIGR00668 apaH bis(5'-nucleosyl)-tetraphosphatase (symmetrical). Alternate names include diadenosine-tetraphosphatase and Ap4A hydrolase.
Probab=60.50 E-value=22 Score=23.26 Aligned_cols=42 Identities=12% Similarity=0.156 Sum_probs=30.0
Q ss_pred HHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 12 GYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 12 ~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
.|-+..|+.... . ...++++|+.-++..+++++..+++.+.+
T Consensus 100 ~wLr~lPl~i~~-~-~~~~~lVHAGi~P~w~l~~a~~~a~eve~ 141 (279)
T TIGR00668 100 NWLRRQPLLQHD-E-EKKLVMAHAGITPQWDLQTAKECARDVEA 141 (279)
T ss_pred HHHHcCCcEEEe-C-CCCEEEEecCCCCCCcHHHHHHHHHHHHH
Confidence 444455544321 1 34688999999999999999998887654
No 170
>PLN02757 sirohydrochlorine ferrochelatase
Probab=58.55 E-value=12 Score=22.14 Aligned_cols=29 Identities=7% Similarity=0.027 Sum_probs=22.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
.=+|+-||+.|+. .......+++.|++..
T Consensus 15 ~lllvgHGSrd~~-a~~~~~~la~~l~~~~ 43 (154)
T PLN02757 15 GVVIVDHGSRRKE-SNLMLEEFVAMYKQKT 43 (154)
T ss_pred EEEEEeCCCCCHH-HHHHHHHHHHHHHhhC
Confidence 4578889999997 5577778888887643
No 171
>COG0848 ExbD Biopolymer transport protein [Intracellular trafficking and secretion]
Probab=57.63 E-value=36 Score=19.79 Aligned_cols=31 Identities=16% Similarity=0.334 Sum_probs=25.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
..|.+++.+ |..|+++.....-+.|+++|..
T Consensus 96 ~~~~v~i~a--D~~v~y~~vv~vm~~l~~aG~~ 126 (137)
T COG0848 96 KNPRVVIRA--DKNVKYGTVVKVMDLLKEAGFK 126 (137)
T ss_pred CCceEEEEe--CCCCCHHHHHHHHHHHHHcCCc
Confidence 356556654 8999999999999999999984
No 172
>PF06649 DUF1161: Protein of unknown function (DUF1161); InterPro: IPR010595 This family consists of several short, hypothetical bacterial proteins of unknown function.
Probab=57.43 E-value=23 Score=17.47 Aligned_cols=22 Identities=5% Similarity=0.064 Sum_probs=17.4
Q ss_pred HHHHHHHHHcCC-ceEEEEecCC
Q psy472 47 MMLAKSLQHADI-MFQSQECYRR 68 (68)
Q Consensus 47 ~~l~~~L~~~g~-~~~~~~~~~~ 68 (68)
.++..+++.+|+ .|.+.+.|+.
T Consensus 6 ~eI~~KI~anGV~~ftLeiV~nd 28 (52)
T PF06649_consen 6 AEIEQKIIANGVSSFTLEIVPND 28 (52)
T ss_pred HHHHHHHHHcCCCceEEEEeeCc
Confidence 467889999999 7788887763
No 173
>PF12146 Hydrolase_4: Putative lysophospholipase; InterPro: IPR022742 This domain is found in bacteria and eukaryotes and is approximately 110 amino acids in length. Many members are annotated as being lysophospholipases, and others as alpha-beta hydrolase fold-containing proteins.
Probab=56.83 E-value=19 Score=18.68 Aligned_cols=31 Identities=19% Similarity=0.223 Sum_probs=22.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
+.=++|+||..+-. .--..+++.|.++|..+
T Consensus 16 k~~v~i~HG~~eh~---~ry~~~a~~L~~~G~~V 46 (79)
T PF12146_consen 16 KAVVVIVHGFGEHS---GRYAHLAEFLAEQGYAV 46 (79)
T ss_pred CEEEEEeCCcHHHH---HHHHHHHHHHHhCCCEE
Confidence 36799999986543 34455788888888765
No 174
>PF04430 DUF498: Protein of unknown function (DUF498/DUF598); InterPro: IPR007523 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This is entry represents an essential factor for the assembly of mitochondrial NADH:ubiquinone oxidoreductase complex (complex I) []. The crystal structure of this protein revealed a 3-layer beta+alpha/beta/alpha topology [].; PDB: 2K2E_A 2Q4Q_B 2AB1_A 2FVT_A 2CYJ_A 1IHN_B 2GM2_A 3CPK_A 2FI9_A.
Probab=56.68 E-value=6.5 Score=21.74 Aligned_cols=34 Identities=21% Similarity=0.212 Sum_probs=21.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.|=+++-|+......+. ..+.+.|+++|+.+|.+
T Consensus 53 ~pe~liiGtG~~~~~~~--~~~~~~l~~~GI~ve~m 86 (110)
T PF04430_consen 53 KPEVLIIGTGKRQLFLP--PELREYLRKKGIGVEVM 86 (110)
T ss_dssp S-SEEEEEETTS-SECT--HHHHHHHHTTT-EEEEE
T ss_pred CCcEEEEccCCccccCC--HHHHHHHHHcCCeEEEE
Confidence 67677777777644433 34677789999988875
No 175
>PRK13604 luxD acyl transferase; Provisional
Probab=55.87 E-value=25 Score=23.29 Aligned_cols=30 Identities=17% Similarity=0.136 Sum_probs=25.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
+-+++.||..... .+...+++.|.++|..+
T Consensus 38 ~~vIi~HGf~~~~---~~~~~~A~~La~~G~~v 67 (307)
T PRK13604 38 NTILIASGFARRM---DHFAGLAEYLSSNGFHV 67 (307)
T ss_pred CEEEEeCCCCCCh---HHHHHHHHHHHHCCCEE
Confidence 5688999999975 46999999999999865
No 176
>PF02698 DUF218: DUF218 domain; InterPro: IPR003848 This domain of unknown function is found in several uncharacterised proteins.; PDB: 3CA8_A.
Probab=55.56 E-value=16 Score=20.89 Aligned_cols=37 Identities=8% Similarity=-0.005 Sum_probs=21.8
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
.+.+++-|.....-...++..|.+.|.++|++-+-++
T Consensus 38 ~~~il~SGg~~~~~~~~ea~~~~~~l~~~gvp~~~I~ 74 (155)
T PF02698_consen 38 APRILFSGGYGHGDGRSEAEAMRDYLIELGVPEERII 74 (155)
T ss_dssp T--EEEE--SSTTHTS-HHHHHHHHHHHT---GGGEE
T ss_pred CCeEEECCCCCCCCCCCHHHHHHHHHHhcccchheeE
Confidence 5667777766655578899999999999999755443
No 177
>PRK08335 translation initiation factor IF-2B subunit alpha; Validated
Probab=55.49 E-value=20 Score=23.38 Aligned_cols=27 Identities=19% Similarity=0.251 Sum_probs=22.0
Q ss_pred CCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 38 DDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 38 D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
++.-|..|..+|+..|.+.|+++.++.
T Consensus 141 ~EsrP~~qG~~la~eL~~~GI~vtlI~ 167 (275)
T PRK08335 141 TESAPDYEGLALANELEFLGIEFEVIT 167 (275)
T ss_pred ecCCCchhHHHHHHHHHHCCCCEEEEe
Confidence 456677888889999999999988753
No 178
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=54.73 E-value=55 Score=21.42 Aligned_cols=40 Identities=18% Similarity=0.085 Sum_probs=34.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
+.++|++.+..|..-.+-.......++++.++.+......
T Consensus 146 ~~RIlii~~s~~~~~qYi~~mn~Ifaaqk~~I~Idv~~L~ 185 (279)
T TIGR00627 146 KSRILVISITPDMALQYIPLMNCIFSAQKQNIPIDVVSIG 185 (279)
T ss_pred cceEEEEECCCCchHHHHHHHHHHHHHHHcCceEEEEEeC
Confidence 4899999999998888888888899999999998876543
No 179
>PF13721 SecD-TM1: SecD export protein N-terminal TM region
Probab=53.95 E-value=36 Score=18.70 Aligned_cols=40 Identities=15% Similarity=0.232 Sum_probs=27.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.|.+-+.+.... +......++.+.|+++|+.++-....+
T Consensus 30 e~pAvqIs~~~~~-~~~~~~~~v~~~L~~~~I~~k~i~~~~ 69 (101)
T PF13721_consen 30 EDPAVQISASSAG-VQLPDAFQVEQALKAAGIAVKSIEQEG 69 (101)
T ss_pred CCCcEEEecCCCC-ccCChHHHHHHHHHHCCCCcceEEeeC
Confidence 3567777765422 234445799999999999998776554
No 180
>PF06342 DUF1057: Alpha/beta hydrolase of unknown function (DUF1057); InterPro: IPR010463 This entry consists of proteins of unknown function which have an alpha/beta hydrolase fold.
Probab=53.40 E-value=25 Score=23.35 Aligned_cols=33 Identities=15% Similarity=0.199 Sum_probs=27.6
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
+.+. +.|+|+..|..|-.|--+.+.+++.+.+.
T Consensus 208 ln~~-~ikvli~ygg~DhLIEeeI~~E~a~~f~~ 240 (297)
T PF06342_consen 208 LNKK-PIKVLIAYGGKDHLIEEEISFEFAMKFKG 240 (297)
T ss_pred hccC-CCcEEEEEcCcchhhHHHHHHHHHHHhCC
Confidence 4444 58999999999999999999999887643
No 181
>TIGR02803 ExbD_1 TonB system transport protein ExbD, group 1. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=52.81 E-value=26 Score=19.56 Aligned_cols=24 Identities=21% Similarity=0.355 Sum_probs=21.7
Q ss_pred CCCCccCHHHHHHHHHHHHHcCCc
Q psy472 36 TMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 36 ~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
..|..++++......+.++++|+.
T Consensus 91 ~aD~~~~~~~vv~v~d~~~~aG~~ 114 (122)
T TIGR02803 91 RADKTVDYGDLMKVMNLLRQAGYL 114 (122)
T ss_pred EcCCCCCHHHHHHHHHHHHHcCCC
Confidence 358999999999999999999985
No 182
>cd06259 YdcF-like YdcF-like. YdcF-like is a large family of mainly bacterial proteins, with a few members found in fungi, plants, and archaea. Escherichia coli YdcF has been shown to bind S-adenosyl-L-methionine (AdoMet), but a biochemical function has not been idenitified. The family also includes Escherichia coli sanA and Salmonella typhimurium sfiX, which are involved in vancomycin resistance; sfiX may also be involved in murein synthesis.
Probab=52.77 E-value=21 Score=20.22 Aligned_cols=37 Identities=14% Similarity=0.161 Sum_probs=28.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
.+.+++-|..-..-...++..|.+.|.+.|++.+-++
T Consensus 35 ~~~ii~sGg~~~~~~~~ea~~m~~~l~~~gv~~~~I~ 71 (150)
T cd06259 35 APKLIVSGGQGPGEGYSEAEAMARYLIELGVPAEAIL 71 (150)
T ss_pred CCEEEEcCCCCCCCCCCHHHHHHHHHHHcCCCHHHee
Confidence 7888888876665577899999999999998764433
No 183
>TIGR02801 tolR TolR protein. The model describes the inner membrane protein TolR, part of the TolR/TolQ complex that transduces energy from the proton-motive force, through TolA, to an outer membrane complex made up of TolB and Pal (peptidoglycan-associated lipoprotein). The complex is required to maintain outer membrane integrity, and defects may cause a defect in the import of some organic compounds in addition to the resulting morphologic. While several gene pairs homologous to talR and tolQ may be found in a single genome, but the scope of this model is set to favor finding only bone fide TolR, supported by operon structure as well as by score.
Probab=52.67 E-value=26 Score=19.66 Aligned_cols=28 Identities=21% Similarity=0.351 Sum_probs=23.6
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
.+++. .|..+++++...+.++++++|+.
T Consensus 95 ~v~i~--aD~~~~~~~vv~vmd~~~~~G~~ 122 (129)
T TIGR02801 95 PVLIR--ADKTVPYGEVIKVMALLKQAGIE 122 (129)
T ss_pred eEEEE--cCCCCCHHHHHHHHHHHHHcCCC
Confidence 34444 58999999999999999999984
No 184
>cd00858 GlyRS_anticodon GlyRS Glycyl-anticodon binding domain. GlyRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=52.35 E-value=39 Score=18.68 Aligned_cols=35 Identities=3% Similarity=0.019 Sum_probs=24.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
..++++.-. |.......+..+++.|+++|+.+++.
T Consensus 27 ~~v~Ii~~~-~~~~~~~~a~~la~~LR~~gi~v~~d 61 (121)
T cd00858 27 IKVAVLPLV-KRDELVEIAKEISEELRELGFSVKYD 61 (121)
T ss_pred cEEEEEecC-CcHHHHHHHHHHHHHHHHCCCEEEEe
Confidence 345665544 22334578899999999999988763
No 185
>PF04056 Ssl1: Ssl1-like; InterPro: IPR007198 Ssl1-like proteins are 40 kDa subunits of the transcription factor II H complex. This domain is often found associated with the C2H2 type Zn-finger (IPR007087 from INTERPRO).; GO: 0008270 zinc ion binding, 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent
Probab=52.25 E-value=34 Score=21.15 Aligned_cols=46 Identities=11% Similarity=0.081 Sum_probs=34.4
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+......+-+|++-| .-..+.+.......+.|++.++.+..+-+.
T Consensus 94 ~~~p~~~srEIlvi~g-Sl~t~Dp~di~~ti~~l~~~~IrvsvI~la 139 (193)
T PF04056_consen 94 KHMPSHGSREILVIFG-SLTTCDPGDIHETIESLKKENIRVSVISLA 139 (193)
T ss_pred hhCccccceEEEEEEe-ecccCCchhHHHHHHHHHHcCCEEEEEEEh
Confidence 3343332467888887 666888889999999999999988877553
No 186
>TIGR02804 ExbD_2 TonB system transport protein ExbD, group 2. Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR family modeled by TIGRFAMs TIGR02801. Members always are encoded next to a protein designated ExbB (TIGR02797), related to the TolQ family modeled by TIGRFAMs TIGR02796. ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.
Probab=52.17 E-value=24 Score=19.73 Aligned_cols=23 Identities=17% Similarity=0.284 Sum_probs=21.3
Q ss_pred CCCccCHHHHHHHHHHHHHcCCc
Q psy472 37 MDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 37 ~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
.|..+++++.....+.++++|+.
T Consensus 91 aD~~~~~~~vv~v~d~~~~~G~~ 113 (121)
T TIGR02804 91 SDKEAKFQDFVTITDMLKAKEHE 113 (121)
T ss_pred eCCCCCHhHHHHHHHHHHHcCCC
Confidence 58899999999999999999985
No 187
>COG3618 Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]
Probab=51.75 E-value=25 Score=23.09 Aligned_cols=38 Identities=13% Similarity=-0.061 Sum_probs=32.7
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
-...|+.-|...--..-.+.+++|.+.|..|++++++.
T Consensus 110 R~~l~~~p~~~~~a~~~r~~~~rL~~~gl~fdl~~~~~ 147 (279)
T COG3618 110 RRNLHVVPDGLFEAPAWRANVERLAKLGLHFDLQVDPH 147 (279)
T ss_pred eehhhcCCccchhhHHHHHHHHHHHhcCCeEEEEeChh
Confidence 34449999998888888999999999999999999874
No 188
>smart00460 TGc Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
Probab=51.16 E-value=29 Score=16.69 Aligned_cols=22 Identities=9% Similarity=-0.007 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHcCCceEEEE
Q psy472 43 FQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
.+.+..++..|+..|+++++..
T Consensus 10 ~~~a~l~~~llr~~GIpar~v~ 31 (68)
T smart00460 10 GEFAALFVALLRSLGIPARVVS 31 (68)
T ss_pred HHHHHHHHHHHHHCCCCeEEEe
Confidence 4567778889999999998864
No 189
>COG0859 RfaF ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane]
Probab=50.80 E-value=68 Score=20.94 Aligned_cols=35 Identities=14% Similarity=0.171 Sum_probs=27.1
Q ss_pred cEEEEEeC----CCCccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 29 QYLLVHGT----MDDNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 29 p~ll~hG~----~D~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
|.++++.. .+.+-|.++-.++++.|++.| .+..++
T Consensus 176 ~~i~i~pg~s~~~~K~wp~e~~~~l~~~l~~~~--~~Vvl~ 214 (334)
T COG0859 176 PYIVINPGASRGSAKRWPLEHYAELAELLIAKG--YQVVLF 214 (334)
T ss_pred CeEEEeccccccccCCCCHHHHHHHHHHHHHCC--CEEEEe
Confidence 55555544 799999999999999999999 444443
No 190
>PF01674 Lipase_2: Lipase (class 2); InterPro: IPR002918 Lipases or triacylglycerol acylhydrolases hydrolyse ester bonds in triacylglycerol giving diacylglycerol, monoacylglycerol, glycerol and free fatty acids []. This group of lipases has been called class 2 as they are not clearly related to other lipase families, and includes LipA and LipB from Bacillus subtilis [] and uncharacterised proteins from Caenorhabditis.; PDB: 2VTV_B 2X76_A 2X5X_A 2QXU_A 3QMM_A 1I6W_A 3D2C_J 2QXT_B 1R50_A 1T2N_A ....
Probab=50.79 E-value=15 Score=22.96 Aligned_cols=29 Identities=31% Similarity=0.386 Sum_probs=15.9
Q ss_pred CcEEEEEeCCC-CccCHHHHHHHHHHHHHcCCc
Q psy472 28 KQYLLVHGTMD-DNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 28 ~p~ll~hG~~D-~~V~~~~s~~l~~~L~~~g~~ 59 (68)
.|++|+||... ....+. .++..|+++|-.
T Consensus 2 ~PVVlVHG~~~~~~~~w~---~~~~~l~~~GY~ 31 (219)
T PF01674_consen 2 RPVVLVHGTGGNAYSNWS---TLAPYLKAAGYC 31 (219)
T ss_dssp --EEEE--TTTTTCGGCC---HHHHHHHHTT--
T ss_pred CCEEEECCCCcchhhCHH---HHHHHHHHcCCC
Confidence 69999999997 444443 355667788853
No 191
>PRK13973 thymidylate kinase; Provisional
Probab=49.92 E-value=32 Score=20.92 Aligned_cols=31 Identities=16% Similarity=-0.027 Sum_probs=25.7
Q ss_pred CCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 37 MDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 37 ~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
-|-.==-.|+..|++.|.+.|..+.+.-+|+
T Consensus 11 ~dGsGKtTq~~~l~~~l~~~g~~~~~~~~p~ 41 (213)
T PRK13973 11 GEGAGKSTQIRLLAERLRAAGYDVLVTREPG 41 (213)
T ss_pred CCCCCHHHHHHHHHHHHHHCCCeEEEEECCC
Confidence 5555566899999999999999998887775
No 192
>cd05125 Mth938_2P1-like Mth938_2P1-like domain. This model contains sequences that are similar to 2P1, a partially characterized nuclear protein, which is homologous to E3-3 from rat and known to be alternatively spliced. Its function is unknown. This family is part of the Mth938 family, for which structures, but no functional data are available.
Probab=49.69 E-value=41 Score=18.95 Aligned_cols=34 Identities=21% Similarity=0.215 Sum_probs=24.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.|=+++-|+.....++.. .+.+.|+++|+.+|.+
T Consensus 54 ~peiliiGtG~~~~~~~~--~~~~~l~~~gi~vevm 87 (114)
T cd05125 54 RPEILVIGTGRKSRPLSP--ELRKYFKKLGIAVEVV 87 (114)
T ss_pred CCCEEEEccCCCCCcCCH--HHHHHHHHcCCEEEEE
Confidence 566777788877555544 4457788999988875
No 193
>PF05853 DUF849: Prokaryotic protein of unknown function (DUF849); InterPro: IPR008567 This family consists of several hypothetical prokaryotic proteins with no known function.; PDB: 3C6C_A 2Y7G_B 2Y7F_B 2Y7D_D 2Y7E_B 3LOT_A 3FA5_B 3NO5_C 3E02_A 3E49_B ....
Probab=49.25 E-value=29 Score=22.35 Aligned_cols=26 Identities=12% Similarity=-0.000 Sum_probs=20.8
Q ss_pred cCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 41 VHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 41 V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
-++.....+++.++++|+..|+.+|.
T Consensus 123 n~~~~~~~~~~~~~e~Gi~pe~ev~d 148 (272)
T PF05853_consen 123 NTPADARELARRMRERGIKPEIEVFD 148 (272)
T ss_dssp --HHHHHHHHHHHHHTT-EEEEEESS
T ss_pred CCHHHHHHHHHHHHHcCCeEEEEEEc
Confidence 34677889999999999999999986
No 194
>PF12531 DUF3731: DNA-K related protein ; InterPro: IPR021030 Proteins in this family are bacterial proteins of approximately 250 amino acids in length. There are two conserved sequence motifs: RPG and WRR. The proteins in this family are frequently annotated as DNA-K related proteins however there is little accompanying literature to confirm this.
Probab=49.15 E-value=21 Score=23.10 Aligned_cols=27 Identities=15% Similarity=0.086 Sum_probs=23.5
Q ss_pred EEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 31 LLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 31 ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
..+||..+.+||++.+.....+|.+.+
T Consensus 147 ~p~yGs~h~Vvp~~~~~~wl~~ll~~d 173 (249)
T PF12531_consen 147 VPFYGSAHNVVPPEVAEQWLDALLALD 173 (249)
T ss_pred CcccCCcccccCHHHHHHHHHHHHhcC
Confidence 357999999999999999999887765
No 195
>PF12913 SH3_6: SH3 domain of the SH3b1 type; PDB: 3M1U_B.
Probab=48.44 E-value=18 Score=17.86 Aligned_cols=14 Identities=14% Similarity=0.304 Sum_probs=10.6
Q ss_pred CCcEEEEEeCCCCc
Q psy472 27 DKQYLLVHGTMDDN 40 (68)
Q Consensus 27 ~~p~ll~hG~~D~~ 40 (68)
..|++|.|-+.|..
T Consensus 26 gtPv~i~H~S~D~~ 39 (54)
T PF12913_consen 26 GTPVYILHTSRDGA 39 (54)
T ss_dssp T-EEEEEEE-TTSS
T ss_pred CCCEEEEEECCCCC
Confidence 58999999999974
No 196
>TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase. Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores in the peptidoglycan wall, or spore cortex. The proteins in this family are PdaA (yfjS), a member of a larger family of polysaccharide deacetylases, and are specificially involved in delta-lactam biosynthesis. PdaA acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the following transpeptidation for lactam ring formation, as heterologous expression in E. coli of CwlD and PdaA together is sufficient for delta-lactam production.
Probab=47.48 E-value=66 Score=19.84 Aligned_cols=36 Identities=25% Similarity=0.371 Sum_probs=26.0
Q ss_pred cCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceE
Q psy472 24 RIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQ 61 (68)
Q Consensus 24 ~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~ 61 (68)
+.+...++|+|...... .+.-..+...|+++|..|-
T Consensus 183 ~~~~g~IiLlHd~~~~t--~~aL~~ii~~lk~~Gy~fv 218 (224)
T TIGR02884 183 KIHPGAILLLHAVSKDN--AEALDKIIKDLKEQGYTFK 218 (224)
T ss_pred cCCCCcEEEEECCCCCH--HHHHHHHHHHHHHCCCEEE
Confidence 33347899999754433 4677789999999997653
No 197
>COG4814 Uncharacterized protein with an alpha/beta hydrolase fold [General function prediction only]
Probab=47.16 E-value=54 Score=21.66 Aligned_cols=42 Identities=14% Similarity=0.287 Sum_probs=36.2
Q ss_pred CCcEEEEEeCC------CCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 27 DKQYLLVHGTM------DDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 27 ~~p~ll~hG~~------D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.-+|++.|+- |-.||+..|.....-+..++..+.--+|+++
T Consensus 216 ~~evl~IaGDl~dg~~tDG~Vp~assls~~~lf~~~~ksy~e~~~~Gk 263 (288)
T COG4814 216 NTEVLLIAGDLDDGKQTDGAVPWASSLSIYHLFKKNGKSYIESLYKGK 263 (288)
T ss_pred CcEEEEEecccccCCcCCCceechHhHHHHHHhccCcceeEEEeeeCC
Confidence 47899999985 5689999999999999999999888788763
No 198
>PRK15478 cbiH cobalt-precorrin-3B C(17)-methyltransferase; Provisional
Probab=47.01 E-value=71 Score=20.11 Aligned_cols=38 Identities=5% Similarity=0.034 Sum_probs=29.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++++ .|+.+. .-+..+.+.|.+.++.+++.+.||
T Consensus 71 ~~Vv~L~s-GDP~~~-g~~~~~~~~l~~~~~~~~veviPG 108 (241)
T PRK15478 71 HNVALISS-GDAGIY-GMAGLVLELVSKQKLDVEVRLIPG 108 (241)
T ss_pred CeEEEEEC-CCCCcc-hhHHHHHHHHHhcCCCCcEEEeCC
Confidence 56766764 899885 577778888888888888888886
No 199
>PRK11024 colicin uptake protein TolR; Provisional
Probab=46.72 E-value=35 Score=19.58 Aligned_cols=28 Identities=14% Similarity=0.243 Sum_probs=23.4
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
.+++. .|..++++....+.+.++++|+.
T Consensus 105 ~V~i~--aD~~~~~~~vv~vmd~~k~aG~~ 132 (141)
T PRK11024 105 VFLIG--GAKDVPYDEIIKALNLLHSAGVK 132 (141)
T ss_pred eEEEE--cCCCCCHHHHHHHHHHHHHcCCC
Confidence 34455 58889999999999999999984
No 200
>cd01786 STE50_RA Ubiquitin-like domain of STE50_RA. STE50_RA The fungal adaptor protein STE50 is an essential component of three MAPK-mediated signalling pathways, which control the mating response, invasive/filamentous growth and osmotolerance (HOG pathway), respectively. STE50 functions in cell signalling between the activated G protein and STE11. The domain architecture of STE50 includes an amino-terminal SAM (sterile alpha motif) domain in addition to the carboxy-terminal ubiquitin-like RA (RAS-associated) domain. While the SAM domain interacts with STE11, the RA domain interacts with CDC42 and RAS. Modulation of signal transduction by STE50 specifically affects the pheromone-response pathway in yeast.
Probab=46.56 E-value=48 Score=18.41 Aligned_cols=36 Identities=8% Similarity=0.204 Sum_probs=27.2
Q ss_pred CcEEEEEeCCCCccCH-HHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHF-QQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~-~~s~~l~~~L~~~g~~~~~~ 63 (68)
=-++|.+|+..+.... +--..+.+.|++.|..--++
T Consensus 58 Y~L~I~YgdqER~L~ldEKPl~lFk~L~~~g~~P~fM 94 (98)
T cd01786 58 YVLVICYGDQERILDLDEKPVIIFKNLKQQGLHPAIM 94 (98)
T ss_pred eEEEEEeCCeeeeccccccHHHHHHHHHHcCCCCeeE
Confidence 4578888988888888 55677888899888755443
No 201
>PF08904 DUF1849: Domain of unknown function (DUF1849); InterPro: IPR015000 This group of proteins are functionally uncharacterised.
Probab=46.38 E-value=40 Score=21.79 Aligned_cols=31 Identities=0% Similarity=-0.033 Sum_probs=26.9
Q ss_pred CCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 37 MDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 37 ~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+...|.+|+..+.++.++-..-++..+|+|
T Consensus 127 ~~tlFPt~H~~~iI~aA~aG~~~~~~~vFDG 157 (252)
T PF08904_consen 127 GGTLFPTEHTRAIIEAAKAGERFLEAPVFDG 157 (252)
T ss_pred ccccCcHHHHHHHHHHHhcCCceEeeeeecC
Confidence 4788999999999999877777889999986
No 202
>PRK10494 hypothetical protein; Provisional
Probab=45.97 E-value=33 Score=21.91 Aligned_cols=38 Identities=11% Similarity=0.017 Sum_probs=26.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
.+.+++-|..-..-+..++..+.+.|.+.|++.+-++.
T Consensus 121 ~~~ii~SGg~~~~~~~sEA~~~~~~l~~lGVp~~~Ii~ 158 (259)
T PRK10494 121 GAKLIFTGGAAKTNTVSTAEVGARVAQSLGVPREDIIT 158 (259)
T ss_pred CCEEEEECCCCCCCCCCHHHHHHHHHHHcCCCHHHeee
Confidence 45566666432222567999999999999998765544
No 203
>COG3411 Ferredoxin [Energy production and conversion]
Probab=45.61 E-value=33 Score=17.61 Aligned_cols=39 Identities=10% Similarity=-0.089 Sum_probs=29.2
Q ss_pred CcEEEEEeCC--CCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 28 KQYLLVHGTM--DDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 28 ~p~ll~hG~~--D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.|+|+..-+. =.+|.++.+.+++++....|.+++-.+|.
T Consensus 17 gPvl~vYpegvWY~~V~p~~a~rIv~~hl~~Gr~Ve~~~~~ 57 (64)
T COG3411 17 GPVLVVYPEGVWYTRVDPEDARRIVQSHLLGGRPVEELIYH 57 (64)
T ss_pred CCEEEEecCCeeEeccCHHHHHHHHHHHHhCCCcchhhccc
Confidence 5666655332 24688999999999999999988766664
No 204
>cd00861 ProRS_anticodon_short ProRS Prolyl-anticodon binding domain, short version found predominantly in bacteria. ProRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=45.59 E-value=43 Score=17.15 Aligned_cols=22 Identities=18% Similarity=0.060 Sum_probs=17.4
Q ss_pred CHHHHHHHHHHHHHcCCceEEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
....+..++..|+++|+.+++.
T Consensus 16 ~~~~a~~la~~Lr~~g~~v~~d 37 (94)
T cd00861 16 QQELAEKLYAELQAAGVDVLLD 37 (94)
T ss_pred HHHHHHHHHHHHHHCCCEEEEE
Confidence 4467888999999999888763
No 205
>PRK11267 biopolymer transport protein ExbD; Provisional
Probab=45.42 E-value=37 Score=19.52 Aligned_cols=29 Identities=17% Similarity=0.393 Sum_probs=24.0
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
..+++. .|..++++....+.+.++++|+.
T Consensus 100 ~~V~I~--aD~~~~~~~vv~vmd~l~~aG~~ 128 (141)
T PRK11267 100 TTIFFR--ADKTVDYETLMKVMDTLHQAGYL 128 (141)
T ss_pred ceEEEE--cCCCCCHHHHHHHHHHHHHcCCC
Confidence 334455 58899999999999999999984
No 206
>cd03557 L-arabinose_isomerase L-Arabinose isomerase (AI) catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion into D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute.
Probab=45.33 E-value=44 Score=23.56 Aligned_cols=42 Identities=12% Similarity=0.116 Sum_probs=27.9
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHH----------H----HHHHHHHHHcCCceEEEE
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQ----------S----MMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~----------s----~~l~~~L~~~g~~~~~~~ 64 (68)
++.+ +.|+|++|----..+|++. + ..+...|.+.|++++++.
T Consensus 87 ~~~l-~~PvL~~~~q~~~~l~~~sidmd~m~l~qaahG~~e~~~il~R~gi~~~~v~ 142 (484)
T cd03557 87 LTAL-QKPLLHLHTQFNREIPWDTIDMDFMNLNQSAHGDREFGFIGSRMRIPRKVVV 142 (484)
T ss_pred HHHc-CCCEEEEccCCCccCCCCCccchHHhhhhhcCCcHHHHHHHHHcCCCeeEEE
Confidence 5567 6999999986533444422 2 124458899999987765
No 207
>COG2908 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.20 E-value=39 Score=21.73 Aligned_cols=22 Identities=27% Similarity=0.318 Sum_probs=14.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLA 50 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~ 50 (68)
+.++|++|| |...+.....+..
T Consensus 105 g~~~Ll~HG--D~f~t~~~~y~~~ 126 (237)
T COG2908 105 GKRILLAHG--DTFCTDDRAYQWF 126 (237)
T ss_pred CcEEEEEeC--CcccchHHHHHHH
Confidence 578899998 6666665544443
No 208
>PRK09912 L-glyceraldehyde 3-phosphate reductase; Provisional
Probab=45.09 E-value=48 Score=21.81 Aligned_cols=30 Identities=10% Similarity=0.153 Sum_probs=26.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
..++++|.- |..+|++.+.....+|++.|.
T Consensus 132 iDl~~lH~~-~~~~~~~e~~~al~~l~~~Gk 161 (346)
T PRK09912 132 VDIFYSHRV-DENTPMEETASALAHAVQSGK 161 (346)
T ss_pred EEEEEeCCC-CCCCCHHHHHHHHHHHHHcCC
Confidence 789999985 667899999998999999986
No 209
>cd03412 CbiK_N Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), N-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases and is a homomeric enzyme that does not require ATP for its enzymatic activity.
Probab=44.61 E-value=55 Score=18.48 Aligned_cols=27 Identities=15% Similarity=0.052 Sum_probs=18.5
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
+|+-||+.|+ -.-+.-..+++.+++.-
T Consensus 4 llv~fGS~~~-~~~~~~~~i~~~l~~~~ 30 (127)
T cd03412 4 LLVSFGTSYP-TAEKTIDAIEDKVRAAF 30 (127)
T ss_pred EEEeCCCCCH-HHHHHHHHHHHHHHHHC
Confidence 6788999998 44445566777776554
No 210
>PF14698 ASL_C2: Argininosuccinate lyase C-terminal; PDB: 1XWO_A 2E9F_A 1TJW_C 1TJU_A 1DCN_B 1K7W_B 1HY1_C 1TJV_B 1AUW_A 1U15_B ....
Probab=44.53 E-value=10 Score=19.42 Aligned_cols=15 Identities=33% Similarity=0.310 Sum_probs=7.8
Q ss_pred HHHHHHHHHHcCCce
Q psy472 46 SMMLAKSLQHADIMF 60 (68)
Q Consensus 46 s~~l~~~L~~~g~~~ 60 (68)
+..+++.|.+.|++|
T Consensus 4 ATdlAD~LVr~GipF 18 (70)
T PF14698_consen 4 ATDLADYLVRKGIPF 18 (70)
T ss_dssp HHHHHHHHHHTTS-H
T ss_pred HHHHHHHHHHcCCCH
Confidence 345555555556555
No 211
>PRK05765 precorrin-3B C17-methyltransferase; Provisional
Probab=44.32 E-value=79 Score=19.85 Aligned_cols=38 Identities=11% Similarity=0.016 Sum_probs=28.7
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.++.+ .|+.+- .-+..+.+.|.+.|++.++.+.||
T Consensus 73 ~~Vv~l~~-GDP~i~-~~~~~~~~~l~~~~~~~~veviPG 110 (246)
T PRK05765 73 NIVALVSS-GDPQVY-GMAGLVFELISRRKLDVDVEVIPG 110 (246)
T ss_pred CcEEEEeC-CCchhh-hhHHHHHHHHHhcCCCCCEEEeCC
Confidence 56777777 799884 446678888888888767777775
No 212
>cd03811 GT1_WabH_like This family is most closely related to the GT1 family of glycosyltransferases. WabH in Klebsiella pneumoniae has been shown to transfer a GlcNAc residue from UDP-GlcNAc onto the acceptor GalUA residue in the cellular outer core.
Probab=44.30 E-value=72 Score=19.32 Aligned_cols=41 Identities=12% Similarity=0.020 Sum_probs=30.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+...+++.|.-+..=.+....+.++.|.+.+.+++++++.+
T Consensus 188 ~~~~i~~~g~~~~~k~~~~~i~~~~~l~~~~~~~~l~i~G~ 228 (353)
T cd03811 188 DGPVILAVGRLSPQKGFDTLIRAFALLRKEGPDARLVILGD 228 (353)
T ss_pred CceEEEEEecchhhcChHHHHHHHHHhhhcCCCceEEEEcC
Confidence 46788888887766666777777777777777888887754
No 213
>cd03817 GT1_UGDG_like This family is most closely related to the GT1 family of glycosyltransferases. UDP-glucose-diacylglycerol glucosyltransferase (UGDG; also known as 1,2-diacylglycerol 3-glucosyltransferase) catalyzes the transfer of glucose from UDP-glucose to 1,2-diacylglycerol forming 3-D-glucosyl-1,2-diacylglycerol.
Probab=44.24 E-value=75 Score=19.56 Aligned_cols=41 Identities=12% Similarity=0.037 Sum_probs=31.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+.+++.|.-+..-...........+.+.+..++++++.+
T Consensus 201 ~~~~i~~~G~~~~~k~~~~l~~~~~~~~~~~~~~~l~i~G~ 241 (374)
T cd03817 201 DEPVLLYVGRLAKEKNIDFLIRAFARLLKEEPDVKLVIVGD 241 (374)
T ss_pred CCeEEEEEeeeecccCHHHHHHHHHHHHHhCCCeEEEEEeC
Confidence 36778888987777777788888888887777888888754
No 214
>PF07521 RMMBL: RNA-metabolising metallo-beta-lactamase; InterPro: IPR011108 The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in IPR001279 from INTERPRO and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in RNA metabolism [].; PDB: 3ZQ4_D 2I7T_A 2I7V_A 2YCB_B 3BK1_A 3T3N_A 3BK2_A 3T3O_A 3AF5_A 3AF6_A ....
Probab=44.04 E-value=3.4 Score=19.18 Aligned_cols=20 Identities=25% Similarity=0.529 Sum_probs=12.2
Q ss_pred hhHHhhcCCCCcEEEEEeCC
Q psy472 18 LNNKVDRIRDKQYLLVHGTM 37 (68)
Q Consensus 18 p~~~~~~~~~~p~ll~hG~~ 37 (68)
....++..+...++++||..
T Consensus 23 L~~~i~~~~p~~vilVHGe~ 42 (43)
T PF07521_consen 23 LLEFIEQLNPRKVILVHGEP 42 (43)
T ss_dssp HHHHHHHHCSSEEEEESSEH
T ss_pred HHHHHHhcCCCEEEEecCCC
Confidence 34444444336788999863
No 215
>TIGR01205 D_ala_D_alaTIGR D-alanine--D-alanine ligase. but a number of antibiotic resistance proteins score above the trusted cutoff of this model.
Probab=44.00 E-value=68 Score=20.37 Aligned_cols=32 Identities=13% Similarity=0.191 Sum_probs=18.7
Q ss_pred EEEeCCCCc-cCHHHHHHHHHHHHHcCCceEEE
Q psy472 32 LVHGTMDDN-VHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 32 l~hG~~D~~-V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
|+-|..++. |....+..+.++|++.|..+.+.
T Consensus 5 ~~gg~s~e~~~s~~s~~~i~~al~~~g~~v~~i 37 (315)
T TIGR01205 5 LFGGKSAEHEISLVSAAAVLKALRDLGYDVYPV 37 (315)
T ss_pred EeCCCCCCeeeeHHHHHHHHHHHhhcCCEEEEE
Confidence 344444444 56666667777777766665544
No 216
>cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides.
Probab=43.59 E-value=75 Score=19.34 Aligned_cols=41 Identities=10% Similarity=0.015 Sum_probs=32.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+.+++.|.-...=.+....+.++.|.+.+.+++++++..
T Consensus 187 ~~~~i~~~G~~~~~k~~~~li~~~~~l~~~~~~~~l~i~G~ 227 (359)
T cd03808 187 DDPVFLFVARLLKDKGIDELLEAARILKAKGPNVRLLLVGD 227 (359)
T ss_pred CCcEEEEEeccccccCHHHHHHHHHHHHhcCCCeEEEEEcC
Confidence 46889999988777777777778888877778888888754
No 217
>COG1184 GCD2 Translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family [Translation, ribosomal structure and biogenesis]
Probab=43.56 E-value=32 Score=22.84 Aligned_cols=24 Identities=17% Similarity=0.127 Sum_probs=20.4
Q ss_pred cCHHHHHHHHHHHHHcCCceEEEE
Q psy472 41 VHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 41 V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
-|-.+...|++.|.+.|+++.+++
T Consensus 154 RP~~eG~~~ak~L~~~gI~~~~I~ 177 (301)
T COG1184 154 RPRGEGRIMAKELRQSGIPVTVIV 177 (301)
T ss_pred CCcchHHHHHHHHHHcCCceEEEe
Confidence 466789999999999999888764
No 218
>cd03415 CbiX_CbiC Archaeal sirohydrochlorin cobalt chelatase (CbiX) single domain. Proteins in this subgroup contain a single CbiX domain N-terminal to a precorrin-8X methylmutase (CbiC) domain. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, while CbiC catalyzes the conversion of cobalt-precorrin 8 to cobyrinic acid by methyl rearrangement. Both CbiX and CbiC are involved in vitamin B12 biosynthesis.
Probab=43.24 E-value=63 Score=18.41 Aligned_cols=25 Identities=16% Similarity=0.260 Sum_probs=17.3
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
+++-||+.|+. ..+....+++.+++
T Consensus 4 llvgHGSR~~~-~~~~~~~la~~l~~ 28 (125)
T cd03415 4 IIITHGSRRNT-FNEDMEEWAAYLER 28 (125)
T ss_pred EEEecCCCChH-HHHHHHHHHHHHHh
Confidence 57788888884 34556667777754
No 219
>PF14480 DNA_pol3_a_NI: DNA polymerase III polC-type N-terminus I
Probab=42.93 E-value=46 Score=16.73 Aligned_cols=50 Identities=14% Similarity=0.067 Sum_probs=37.4
Q ss_pred HhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHc--CC-ceEEEE
Q psy472 14 KIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHA--DI-MFQSQE 64 (68)
Q Consensus 14 ~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~--g~-~~~~~~ 64 (68)
+.+. +..+.-.++...+-+|=..+..+|.+.-..+.+.|+.. ++ .+++.+
T Consensus 22 ~~~~-I~kv~v~k~~~~w~f~l~~~~~l~~~~~~~~~~~l~~~F~~ia~v~~~i 74 (76)
T PF14480_consen 22 EDAE-IEKVTVHKKSRKWRFHLSSPHILPFEVYQKFEEKLKKQFSHIAKVELII 74 (76)
T ss_pred cccE-EEEEEEEccCCEEEEEEEeCCcCCHHHHHHHHHHHHHHhCCcCeEEEEE
Confidence 4443 34444444689999999999999999999999999988 33 555543
No 220
>PF06441 EHN: Epoxide hydrolase N terminus; InterPro: IPR010497 This entry represents the N-terminal region of the eukaryotic epoxide hydrolase protein. Epoxide hydrolases (3.3.2.3 from EC) comprise a group of functionally related enzymes that catalyse the addition of water to oxirane compounds (epoxides), thereby usually generating vicinal trans-diols. EHs have been found in all types of living organisms, including mammals, invertebrates, plants, fungi and bacteria. In animals, the major interest in EH is directed towards their detoxification capacity for epoxides since they are important safeguards against the cytotoxic and genotoxic potential of oxirane derivatives that are often reactive electrophiles because of the high tension of the three-membered ring system and the strong polarisation of the C--O bonds. This is of significant relevance because epoxides are frequent intermediary metabolites, which arise during the biotransformation of foreign compounds []. This domain is often found in conjunction with IPR000073 from INTERPRO.; GO: 0004301 epoxide hydrolase activity, 0009636 response to toxin, 0016020 membrane; PDB: 3G0I_B 3G02_A 1QO7_A.
Probab=42.84 E-value=13 Score=20.89 Aligned_cols=9 Identities=44% Similarity=0.733 Sum_probs=5.7
Q ss_pred CCcEEEEEe
Q psy472 27 DKQYLLVHG 35 (68)
Q Consensus 27 ~~p~ll~hG 35 (68)
..|+||+||
T Consensus 92 aiPLll~HG 100 (112)
T PF06441_consen 92 AIPLLLLHG 100 (112)
T ss_dssp -EEEEEE--
T ss_pred CeEEEEECC
Confidence 489999999
No 221
>PF13041 PPR_2: PPR repeat family
Probab=42.55 E-value=9 Score=17.65 Aligned_cols=21 Identities=24% Similarity=0.396 Sum_probs=17.1
Q ss_pred CHHHHHHHHHHHHHcCCceEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~ 62 (68)
.++++.++.+.+++.|+....
T Consensus 18 ~~~~a~~l~~~M~~~g~~P~~ 38 (50)
T PF13041_consen 18 KFEEALKLFKEMKKRGIKPDS 38 (50)
T ss_pred CHHHHHHHHHHHHHcCCCCCH
Confidence 468899999999999986543
No 222
>cd07422 MPP_ApaH Escherichia coli ApaH and related proteins, metallophosphatase domain. ApaH (also known as symmetrically cleaving Ap4A hydrolase and bis(5'nucleosyl)-tetraphosphatase) is a bacterial member of the PPP (phosphoprotein phosphatase) family of serine/threonine phosphatases that hydrolyzes the nucleotide-signaling molecule diadenosine tetraphosphate (Ap(4)A) into two ADP and also hydrolyzes Ap(5)A, Gp(4)G, and other extending compounds. Null mutations in apaH result in high intracellular levels of Ap(4)A which correlate with multiple phenotypes, including a decreased expression of catabolite-repressible genes, a reduction in the expression of flagellar operons, and an increased sensitivity to UV and heat. Ap4A hydrolase is important in responding to heat shock and oxidative stress via regulating the concentration of Ap4A in bacteria. Ap4A hydrolase is also thought to play a role in siderophore production, but the mechanism by which ApaH interacts with siderophore pathwa
Probab=42.54 E-value=55 Score=20.99 Aligned_cols=28 Identities=21% Similarity=0.367 Sum_probs=24.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
..++++||.-.+..+..++..+++.+.+
T Consensus 112 ~~~l~vHAGi~p~w~~~~~~~~a~eve~ 139 (257)
T cd07422 112 LGILMVHAGIPPQWSIEQALKLAREVEA 139 (257)
T ss_pred ccEEEEccCCCCCCCHHHHHHHHHHHHH
Confidence 3789999999999999999998888665
No 223
>cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide.
Probab=42.38 E-value=81 Score=19.38 Aligned_cols=40 Identities=13% Similarity=0.042 Sum_probs=31.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
+.+.+++.|.-+..=......+.+..|.+.+.++++.+..
T Consensus 202 ~~~~i~~~G~~~~~K~~~~li~a~~~l~~~~~~~~l~i~G 241 (375)
T cd03821 202 DKRIILFLGRLHPKKGLDLLIEAFAKLAERFPDWHLVIAG 241 (375)
T ss_pred CCcEEEEEeCcchhcCHHHHHHHHHHhhhhcCCeEEEEEC
Confidence 4788999998887777777777777788777788887764
No 224
>PRK12326 preprotein translocase subunit SecA; Reviewed
Probab=42.17 E-value=88 Score=23.57 Aligned_cols=31 Identities=19% Similarity=0.105 Sum_probs=26.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
..|+|+.- ..+++|+.+.+.|++.|++++.+
T Consensus 427 GrPVLVgt------~sI~~SE~ls~~L~~~gI~h~vL 457 (764)
T PRK12326 427 GQPVLVGT------HDVAESEELAERLRAAGVPAVVL 457 (764)
T ss_pred CCCEEEEe------CCHHHHHHHHHHHHhCCCcceee
Confidence 58999976 46899999999999999988754
No 225
>PF07819 PGAP1: PGAP1-like protein; InterPro: IPR012908 The sequences found in this family are similar to PGAP1 (Q765A7 from SWISSPROT). This is an endoplasmic reticulum membrane protein with a catalytic serine-containing motif that is conserved in a number of lipases. PGAP1 functions as a GPI inositol-deacylase; this deacylation is important for the efficient transport of GPI-anchored proteins from the endoplasmic reticulum to the Golgi body [].; GO: 0016788 hydrolase activity, acting on ester bonds, 0006505 GPI anchor metabolic process, 0006886 intracellular protein transport, 0031227 intrinsic to endoplasmic reticulum membrane
Probab=42.11 E-value=84 Score=19.49 Aligned_cols=25 Identities=32% Similarity=0.450 Sum_probs=18.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
+.|+|++||- .=.+.|...++..+.
T Consensus 4 g~pVlFIhG~---~Gs~~q~rsl~~~~~ 28 (225)
T PF07819_consen 4 GIPVLFIHGN---AGSYKQVRSLASELQ 28 (225)
T ss_pred CCEEEEECcC---CCCHhHHHHHHHHHh
Confidence 5899999993 334678888887773
No 226
>PF13500 AAA_26: AAA domain; PDB: 3OF5_A 2IOJ_A 4A0G_B 4A0R_A 4A0H_B 4A0F_B 3FMI_C 3FPA_D 3FMF_C 3FGN_A ....
Probab=41.98 E-value=75 Score=18.86 Aligned_cols=44 Identities=9% Similarity=0.119 Sum_probs=32.4
Q ss_pred HHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 20 NKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 20 ~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.-+..+ +.|++++-+..... ++|+..-.+.++..|+++--.++.
T Consensus 123 dia~~L-~a~vIlV~~~~~g~--i~~~l~~~~~~~~~g~~v~GvI~N 166 (199)
T PF13500_consen 123 DIAKAL-GAPVILVASGRLGT--INHTLLTIEALKQRGIRVLGVILN 166 (199)
T ss_dssp HHHHHH-T-EEEEEEESSTTH--HHHHHHHHHHHHCTTS-EEEEEEE
T ss_pred HHHHHc-CCCEEEEeCCCCCC--HHHHHHHHHHHHhcCCCEEEEEEE
Confidence 445566 58999888887764 788888888899889888776664
No 227
>TIGR01293 Kv_beta voltage-dependent potassium channel beta subunit, animal. Plant beta subunits and their closely related bacterial homologs (in Deinococcus radiudurans, Xylella fastidiosa, etc.) appear more closely related to each other than to animal forms. However, the bacterial species lack convincing counterparts the Kv alpha subunit and the Kv beta homolog may serve as an enzyme. Cutoffs are set for this model such that yeast and plant forms and bacterial close homologs score between trusted and noise cutoffs.
Probab=41.89 E-value=60 Score=20.96 Aligned_cols=31 Identities=6% Similarity=0.140 Sum_probs=26.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
...++++|.- |+.+|++.+....+.|++.|.
T Consensus 113 ~iDl~~lH~~-~~~~~~~e~~~aL~~l~~~G~ 143 (317)
T TIGR01293 113 YVDIVFANRP-DPNTPMEETVRAMTYVINQGM 143 (317)
T ss_pred cEeEEEeccC-CCCCCHHHHHHHHHHHHHcCC
Confidence 3789999995 667889888888888999986
No 228
>PRK04940 hypothetical protein; Provisional
Probab=41.33 E-value=59 Score=19.92 Aligned_cols=25 Identities=12% Similarity=0.150 Sum_probs=21.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKS 52 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~ 52 (68)
..++++..+.|++.++.++......
T Consensus 125 ~r~~vllq~gDEvLDyr~a~~~y~~ 149 (180)
T PRK04940 125 DRCLVILSRNDEVLDSQRTAEELHP 149 (180)
T ss_pred ccEEEEEeCCCcccCHHHHHHHhcc
Confidence 5679999999999999999876643
No 229
>cd00860 ThrRS_anticodon ThrRS Threonyl-anticodon binding domain. ThrRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=40.65 E-value=51 Score=16.61 Aligned_cols=22 Identities=27% Similarity=0.357 Sum_probs=18.1
Q ss_pred CHHHHHHHHHHHHHcCCceEEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
....+..+++.|++.|+.+++.
T Consensus 13 ~~~~a~~~~~~Lr~~g~~v~~d 34 (91)
T cd00860 13 HLDYAKEVAKKLSDAGIRVEVD 34 (91)
T ss_pred HHHHHHHHHHHHHHCCCEEEEE
Confidence 5678899999999999887763
No 230
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=40.58 E-value=45 Score=21.63 Aligned_cols=41 Identities=5% Similarity=-0.012 Sum_probs=30.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..|++......-..+.-+....+++.+++.|++++.+..+.
T Consensus 13 ~~p~W~~W~~~~~~~s~~~v~~~~~~~~~~~iP~d~i~iD~ 53 (303)
T cd06592 13 RSPIWSTWARYKADINQETVLNYAQEIIDNGFPNGQIEIDD 53 (303)
T ss_pred CCCccCChhhhccCcCHHHHHHHHHHHHHcCCCCCeEEeCC
Confidence 36776666555566777888889999999999988877654
No 231
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=40.57 E-value=28 Score=13.63 Aligned_cols=16 Identities=13% Similarity=0.432 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHcCC
Q psy472 43 FQQSMMLAKSLQHADI 58 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~ 58 (68)
++.+..+.+.+++.|+
T Consensus 16 ~~~a~~~~~~M~~~g~ 31 (31)
T PF01535_consen 16 FEEALEVFDEMRERGI 31 (31)
T ss_pred HHHHHHHHHHHhHCcC
Confidence 5677778888777764
No 232
>PRK00166 apaH diadenosine tetraphosphatase; Reviewed
Probab=40.57 E-value=57 Score=21.10 Aligned_cols=28 Identities=14% Similarity=0.322 Sum_probs=23.3
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
..++++||.-++.....++..+++.+++
T Consensus 114 ~~~l~vHAGi~p~~~~~~~~~~a~eve~ 141 (275)
T PRK00166 114 LGLVMVHAGIPPQWDLATALALAREVEA 141 (275)
T ss_pred CCEEEEccCCCCCCCHHHHHHHHHHHHH
Confidence 5689999999999999888888777554
No 233
>PF03358 FMN_red: NADPH-dependent FMN reductase; InterPro: IPR005025 NADPH-dependent FMN reductase (1.5.1.29 from EC) reduces FMN and also reduces riboflavin and FAD, although more slowly. Members of this entry catalyse the reaction NAD(P)H + FMN = NAD(P)(+) + FMNH(2).; PDB: 3SVL_B 3GFS_F 3GFQ_A 1NNI_1 2GSW_B 3GFR_D 1T0I_B 3D7N_A 2R97_A 3B6K_A ....
Probab=40.41 E-value=69 Score=18.00 Aligned_cols=37 Identities=14% Similarity=0.222 Sum_probs=22.3
Q ss_pred cEEEEEeCCC-CccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 29 QYLLVHGTMD-DNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 29 p~ll~hG~~D-~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
.++++.|..- .-..-.-+..+.+.|++.|+.++++-.
T Consensus 2 kilii~gS~r~~~~t~~l~~~~~~~l~~~g~e~~~i~l 39 (152)
T PF03358_consen 2 KILIINGSPRKNSNTRKLAEAVAEQLEEAGAEVEVIDL 39 (152)
T ss_dssp EEEEEESSSSTTSHHHHHHHHHHHHHHHTTEEEEEEEC
T ss_pred EEEEEECcCCCCCHHHHHHHHHHHHHHHcCCEEEEEec
Confidence 4678888764 333334445556667777777666543
No 234
>PRK08334 translation initiation factor IF-2B subunit beta; Validated
Probab=40.07 E-value=60 Score=22.07 Aligned_cols=28 Identities=18% Similarity=0.107 Sum_probs=23.6
Q ss_pred CCCccCHHHHHHH-HHHHHHcCCceEEEE
Q psy472 37 MDDNVHFQQSMML-AKSLQHADIMFQSQE 64 (68)
Q Consensus 37 ~D~~V~~~~s~~l-~~~L~~~g~~~~~~~ 64 (68)
-|+.-|..|..+| +..|.+.|+++.++.
T Consensus 198 v~EtRP~~qG~rlta~eL~~~GI~vtlI~ 226 (356)
T PRK08334 198 VDETRPVLQGARLSAWEYHYDGIPLKLIS 226 (356)
T ss_pred ECCCCchhhHHHHHHHHHHHCCCCEEEEe
Confidence 4688899999988 578999999998764
No 235
>PF06713 bPH_4: Bacterial PH domain; InterPro: IPR009589 This entry is represented by Bacteriophage SP-beta, YolF. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical proteins specific to Oceanobacillus and Bacillus species. Members of this family are typically around 130 residues in length. The function of this family is unknown.
Probab=39.93 E-value=28 Score=17.86 Aligned_cols=30 Identities=10% Similarity=0.073 Sum_probs=23.4
Q ss_pred CcEEEEEeCCC-CccCHHHHHHHHHHHHHcC
Q psy472 28 KQYLLVHGTMD-DNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 28 ~p~ll~hG~~D-~~V~~~~s~~l~~~L~~~g 57 (68)
..+.|.+|..+ -.|.|.+-..|.++|++.+
T Consensus 44 ~rl~I~y~~~~~i~IsP~~~~~FI~~L~k~n 74 (74)
T PF06713_consen 44 DRLEIYYGKYKSILISPKDKEEFIAELQKRN 74 (74)
T ss_pred cEEEEEECCCCEEEEECCCHHHHHHHHHhhC
Confidence 56777777444 4788999999999998764
No 236
>COG3367 Uncharacterized conserved protein [Function unknown]
Probab=39.39 E-value=94 Score=21.10 Aligned_cols=37 Identities=19% Similarity=0.090 Sum_probs=32.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
+.+.+++-||...+=-..-+..|.++|++.|+...+.
T Consensus 147 ~a~~V~vvGTd~~vGKrTTa~~L~~~~~e~G~~a~fv 183 (339)
T COG3367 147 DAKVVLVVGTDCAVGKRTTALELREAAREEGIKAGFV 183 (339)
T ss_pred CCcEEEEeccccccchhHHHHHHHHHHHHhCCccceE
Confidence 4789999999998888999999999999999987653
No 237
>cd08199 EEVS 2-epi-5-epi-valiolone synthase (EEVS). 2-epi-5-epi-valiolone synthases catalyze the cyclization of sedoheptulose 7-phosphate to 2-epi-5-epi-valiolone in the biosynthesis of C(7)N-aminocyclitol-containing products. The cyclization product, 2-epi-5-epi-valiolone ((2S,3S,4S,5R)-5-(hydroxymethyl)cyclohexanon-2,3,4,5-tetrol), is a precursor of the valienamine moiety. The valienamine unit is responsible for their biological activities as various glycosidic hydrolases inhibitors. Two important microbial secondary metabolites, i.e., validamycin and acarbose, are used in agricultural and biomedical applications. Validamycine A is an antifungal antibiotic which has a strong trehalase inhibitory activity and has been used to control sheath blight disease in rice caused by Rhizoctonia solani. Acarbose is an alpha-glucosidase inhibitor used for the treatment of type II insulin-independent diabetes. Salbostatin produced by Streptomyces albus also belongs to this family. It exhibits s
Probab=39.38 E-value=1.2e+02 Score=20.33 Aligned_cols=37 Identities=19% Similarity=0.131 Sum_probs=25.3
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..++++. |..|.-.....+.+.|.+.|+.+...+|++
T Consensus 27 ~~~lvVt---d~~v~~~~~~~v~~~l~~~g~~~~~~v~~~ 63 (354)
T cd08199 27 GRRFVVV---DQNVDKLYGKKLREYFAHHNIPLTILVLRA 63 (354)
T ss_pred CeEEEEE---CccHHHHHHHHHHHHHHhcCCceEEEEeCC
Confidence 4566665 566654455678888888888887766764
No 238
>PRK12756 phospho-2-dehydro-3-deoxyheptonate aldolase; Provisional
Probab=39.27 E-value=1e+02 Score=20.98 Aligned_cols=34 Identities=15% Similarity=0.188 Sum_probs=27.8
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.+++-|..-+|-+.++.....++|+++|.+-.++
T Consensus 229 HvILRGg~~PNY~~~~v~~a~~~l~~~~l~~~im 262 (348)
T PRK12756 229 HIIMRGGKKPNYHAEDIAAACDTLREFDLPEHLV 262 (348)
T ss_pred EEEeeCCCCCCCCHHHHHHHHHHHHHCCCCCcEE
Confidence 4555666789999999999999999999875544
No 239
>cd00578 L-fuc_L-ara-isomerases L-fucose isomerase (FucIase) and L-arabinose isomerase (AI) family; composed of FucIase, AI and similar proteins. FucIase converts L-fucose, an aldohexose, to its ketose form, which prepares it for aldol cleavage (similar to the isomerization of glucose in glycolysis). L-fucose (or 6-deoxy-L-galactose) is found in various oligo- and polysaccharides in mammals, bacteria and plants. AI catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion to D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute.
Probab=39.16 E-value=1.2e+02 Score=20.72 Aligned_cols=41 Identities=17% Similarity=0.100 Sum_probs=28.5
Q ss_pred hhcCCCCcEEEEEeCCCC---------ccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 22 VDRIRDKQYLLVHGTMDD---------NVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~---------~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
+... +.|+|+++- .|+ +-...-...+...|.+.|+++..+.
T Consensus 84 ~~~~-~~Pvll~a~-~~~~~~~~~~~~~~s~~g~~~~~~~l~r~gi~~~~v~ 133 (452)
T cd00578 84 LSEL-RKPVLLLAT-QFNREIPDFMNLNQSACGLREFGNILARLGIPFKVVY 133 (452)
T ss_pred HHhc-CCCEEEEeC-CCCCCCCchhhhhcchhhhHHHHHHHHHcCCceeEEE
Confidence 4456 589987776 442 3334455678889999999987654
No 240
>cd01458 vWA_ku Ku70/Ku80 N-terminal domain. The Ku78 heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks (DSB) in a preferred orientation. DSB's are repaired by either homologues recombination or non-homologues end joining and facilitate repair by the non-homologous end-joining pathway (NHEJ). The Ku heterodimer is required for accurate process that tends to preserve the sequence at the junction. Ku78 is found in all three kingdoms of life. However, only the eukaryotic proteins have a vWA domain fused to them at their N-termini. The vWA domain is not involved in DNA binding but may very likey mediate Ku78's interactions with other proteins. Members of this subgroup lack the conserved MIDAS motif.
Probab=39.10 E-value=82 Score=19.11 Aligned_cols=39 Identities=18% Similarity=0.207 Sum_probs=28.5
Q ss_pred CCcEEEEEeCCCCc----cCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 27 DKQYLLVHGTMDDN----VHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 27 ~~p~ll~hG~~D~~----V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
...++|+--..|+. +..+++...++.|.+.|+.++.+-.
T Consensus 128 ~k~IvL~TDg~~p~~~~~~~~~~~~~~a~~l~~~gI~i~~i~i 170 (218)
T cd01458 128 HKRIFLFTNNDDPHGGDSIKDSQAAVKAEDLKDKGIELELFPL 170 (218)
T ss_pred ccEEEEECCCCCCCCCCHHHHHHHHHHHHHHHhCCcEEEEEec
Confidence 35677776655553 5678888899999999998877654
No 241
>PF13170 DUF4003: Protein of unknown function (DUF4003)
Probab=38.75 E-value=13 Score=24.25 Aligned_cols=37 Identities=11% Similarity=0.123 Sum_probs=30.3
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+|.+.+..|... +..+..+.+.|++.|+.+....||
T Consensus 185 ~iLaL~~~~~~~~-v~r~~~l~~~l~~~~~kik~~~yp 221 (297)
T PF13170_consen 185 HILALSEGDDQEK-VARVIELYNALKKNGVKIKYMHYP 221 (297)
T ss_pred HHHHhccccchHH-HHHHHHHHHHHHHcCCcccccccc
Confidence 3455666667777 999999999999999998887776
No 242
>cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association.
Probab=38.75 E-value=93 Score=19.04 Aligned_cols=38 Identities=13% Similarity=0.117 Sum_probs=29.3
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
..++++-|. |-..+..+....+++|++.|+.+..+-|-
T Consensus 108 ~rivi~v~S-~~~~d~~~i~~~~~~lkk~~I~v~vI~~G 145 (187)
T cd01452 108 QRIVAFVGS-PIEEDEKDLVKLAKRLKKNNVSVDIINFG 145 (187)
T ss_pred ceEEEEEec-CCcCCHHHHHHHHHHHHHcCCeEEEEEeC
Confidence 366777773 43457888889999999999998887664
No 243
>PRK05629 hypothetical protein; Validated
Probab=38.49 E-value=53 Score=21.24 Aligned_cols=38 Identities=11% Similarity=0.064 Sum_probs=25.9
Q ss_pred CcEEEEEeCCCCccCHHHHHH-HHHHHHHcCC-ceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMM-LAKSLQHADI-MFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~-l~~~L~~~g~-~~~~~~~~~ 67 (68)
.|+.+++|..+-. .+++.. +.+++...+. ++.+..|.+
T Consensus 6 ~~vyL~~G~e~~l--~~~~~~~i~~~~~~~~~~~~n~~~~d~ 45 (318)
T PRK05629 6 PPVHLVLGDDEFL--AERARLNIVHDIRSSMADSLQVTTLKA 45 (318)
T ss_pred CceEEEEeCHHHH--HHHHHHHHHHHHhccCCCCCceEEeec
Confidence 6999999966533 555555 6666665554 677777765
No 244
>PRK06372 translation initiation factor IF-2B subunit delta; Provisional
Probab=38.45 E-value=57 Score=21.07 Aligned_cols=25 Identities=16% Similarity=0.117 Sum_probs=20.9
Q ss_pred CccCHHHHHHHHHHHHHcCCceEEE
Q psy472 39 DNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
+.-|..+...+++.|.+.|+++.++
T Consensus 116 ESrP~~eG~~~a~~L~~~GI~vtli 140 (253)
T PRK06372 116 ESRPMLEGIDMAKLLVKSGIDVVLL 140 (253)
T ss_pred cCCCchHHHHHHHHHHHCCCCEEEE
Confidence 3457778899999999999999765
No 245
>PF09198 T4-Gluco-transf: Bacteriophage T4 beta-glucosyltransferase; InterPro: IPR015281 Members of this family are DNA-modifying enzymes encoded by bacteriophage T4 that transfer glucose from uridine diphosphoglucose to 5-hydroxymethyl cytosine bases of phage T4 DNA []. ; PDB: 1J39_A 1SXQ_B 1NZF_A 1M5R_B 1JEJ_A 1JIV_A 1NZD_A 1NVK_A 2BGU_A 1JIU_A ....
Probab=38.40 E-value=43 Score=15.09 Aligned_cols=23 Identities=9% Similarity=0.179 Sum_probs=17.9
Q ss_pred CccCHHHHHHHHHHHHHcCCceE
Q psy472 39 DNVHFQQSMMLAKSLQHADIMFQ 61 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~~~~ 61 (68)
..+|.+++.-|.+-+++-|.+++
T Consensus 16 kt~p~setiyl~~~~~~mgl~vd 38 (38)
T PF09198_consen 16 KTTPSSETIYLFKCISDMGLNVD 38 (38)
T ss_dssp SSHHHHHHHHHHHHHHTTT-EEE
T ss_pred eecCccceEeHHHHHHHhCCCCC
Confidence 35788899999999999887653
No 246
>PRK08535 translation initiation factor IF-2B subunit delta; Provisional
Probab=38.34 E-value=59 Score=21.36 Aligned_cols=26 Identities=19% Similarity=0.057 Sum_probs=20.9
Q ss_pred CccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 39 DNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
+.-|..+...+++.|.+.|+++.++.
T Consensus 153 EsrP~~~G~~~a~~L~~~GI~vtlI~ 178 (310)
T PRK08535 153 ETRPRNQGHITAKELAEYGIPVTLIV 178 (310)
T ss_pred cCCchhhHHHHHHHHHHCCCCEEEEe
Confidence 44577788889999999999888753
No 247
>cd03798 GT1_wlbH_like This family is most closely related to the GT1 family of glycosyltransferases. wlbH in Bordetella parapertussis has been shown to be required for the biosynthesis of a trisaccharide that, when attached to the B. pertussis lipopolysaccharide (LPS) core (band B), generates band A LPS.
Probab=38.04 E-value=95 Score=18.94 Aligned_cols=41 Identities=12% Similarity=-0.006 Sum_probs=32.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+...+++.|.-+..=..+.....++.+.+.+..++++++.+
T Consensus 201 ~~~~i~~~g~~~~~k~~~~li~~~~~~~~~~~~~~l~i~g~ 241 (377)
T cd03798 201 DKKVILFVGRLVPRKGIDYLIEALARLLKKRPDVHLVIVGD 241 (377)
T ss_pred CceEEEEeccCccccCHHHHHHHHHHHHhcCCCeEEEEEcC
Confidence 46788999988887778888888888887777888887653
No 248
>PF13709 DUF4159: Domain of unknown function (DUF4159)
Probab=37.95 E-value=82 Score=19.44 Aligned_cols=32 Identities=16% Similarity=0.098 Sum_probs=27.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
.-|++.++|..+-.-..++...|-+-|.+=|.
T Consensus 53 ~yP~ly~~g~~~~~~s~~e~~~Lr~Yl~~GGf 84 (207)
T PF13709_consen 53 FYPFLYWPGHGDFPLSDEEIANLRRYLENGGF 84 (207)
T ss_pred hCCEEEEeCCCCCCCCHHHHHHHHHHHHcCCE
Confidence 47999999999998888888888888876654
No 249
>cd00859 HisRS_anticodon HisRS Histidyl-anticodon binding domain. HisRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=37.65 E-value=55 Score=16.13 Aligned_cols=23 Identities=22% Similarity=0.286 Sum_probs=18.5
Q ss_pred CHHHHHHHHHHHHHcCCceEEEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
....+..++.+|++.|+.+++..
T Consensus 13 ~~~~a~~i~~~Lr~~g~~v~~~~ 35 (91)
T cd00859 13 ALSEALELAEQLRDAGIKAEIDY 35 (91)
T ss_pred HHHHHHHHHHHHHHCCCEEEEec
Confidence 34678899999999999887654
No 250
>PF02472 ExbD: Biopolymer transport protein ExbD/TolR; InterPro: IPR003400 This group of proteins are membrane bound transport proteins essential for ferric ion uptake in bacteria []. The family consists of ExbD, and TolR which are involved in TonB-dependent transport of various receptor bound substrates including colicins [].; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane; PDB: 2JWL_A 2JWK_A 2PFU_A.
Probab=37.25 E-value=49 Score=18.14 Aligned_cols=28 Identities=18% Similarity=0.381 Sum_probs=20.3
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
.+++.+ |..++++....+.++++++|+.
T Consensus 94 ~v~i~a--D~~~~y~~vv~vl~~l~~~g~~ 121 (130)
T PF02472_consen 94 RVLIRA--DKDAPYQDVVDVLDALREAGFT 121 (130)
T ss_dssp -EEEEE---TTS-HHHHHHHHHHHHHTT--
T ss_pred eEEEEe--CCCCCHHHHHHHHHHHHHcCCC
Confidence 566654 8889999999999999999984
No 251
>PF10230 DUF2305: Uncharacterised conserved protein (DUF2305); InterPro: IPR019363 This entry contains proteins that have no known function.
Probab=37.24 E-value=1e+02 Score=19.53 Aligned_cols=32 Identities=13% Similarity=0.062 Sum_probs=27.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
..++.++.|..|.=||.+....|.++......
T Consensus 221 ~~kl~f~fg~~D~Wvp~~~~~~l~~~~~~~~~ 252 (266)
T PF10230_consen 221 GDKLWFYFGQNDHWVPNETRDELIERYPGHEP 252 (266)
T ss_pred CCEEEEEEeCCCCCCCHHHHHHHHHHcCCCCC
Confidence 37999999999999999999999988775443
No 252
>TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family. All members of this protein family are predicted lipoproteins with a conserved Cys near the N-terminus for cleavage and modification, and are part of known or predicted type III secretion systems. Members are found in both plant and animal pathogens, including the obligately intracellular chlamydial species and (non-pathogenic) root nodule bacteria. The most closely related proteins outside this family are examples of the flagellar M-ring protein FliF.
Probab=37.23 E-value=60 Score=19.91 Aligned_cols=22 Identities=9% Similarity=0.024 Sum_probs=19.3
Q ss_pred cCHHHHHHHHHHHHHcCCceEE
Q psy472 41 VHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 41 V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
-...++-++..+|+++|+++++
T Consensus 26 L~~~da~~I~~~L~~~gI~y~~ 47 (193)
T TIGR02544 26 LSEREANEMLAVLMRHGIDAEK 47 (193)
T ss_pred CCHHHHHHHHHHHHHCCCCeEE
Confidence 3467899999999999999987
No 253
>PRK12902 secA preprotein translocase subunit SecA; Reviewed
Probab=37.23 E-value=93 Score=24.04 Aligned_cols=32 Identities=19% Similarity=0.186 Sum_probs=27.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
..|+|+.-. .++.|+.+++.|.+.|++++++-
T Consensus 439 GrPVLIgT~------SVe~SE~ls~~L~~~gi~h~vLN 470 (939)
T PRK12902 439 GRPVLVGTT------SVEKSELLSALLQEQGIPHNLLN 470 (939)
T ss_pred CCCEEEeeC------CHHHHHHHHHHHHHcCCchheee
Confidence 589999765 58999999999999999987753
No 254
>PF09587 PGA_cap: Bacterial capsule synthesis protein PGA_cap; InterPro: IPR019079 CapA is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein [].
Probab=37.05 E-value=65 Score=20.07 Aligned_cols=33 Identities=21% Similarity=0.177 Sum_probs=25.3
Q ss_pred CcEEEEE-eCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 28 KQYLLVH-GTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 28 ~p~ll~h-G~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
.-++++| |.....-|......++.+|.++|++.
T Consensus 186 ~vIv~~HwG~e~~~~p~~~q~~~a~~lidaGaDi 219 (250)
T PF09587_consen 186 VVIVSLHWGIEYENYPTPEQRELARALIDAGADI 219 (250)
T ss_pred EEEEEeccCCCCCCCCCHHHHHHHHHHHHcCCCE
Confidence 4455555 55777788888899999999999764
No 255
>PF13394 Fer4_14: 4Fe-4S single cluster domain; PDB: 1TV8_B 1TV7_A 2FB2_A 2FB3_A.
Probab=36.83 E-value=36 Score=18.38 Aligned_cols=37 Identities=8% Similarity=0.020 Sum_probs=23.1
Q ss_pred EEEEEeCCCCc--cCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 30 YLLVHGTMDDN--VHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 30 ~ll~hG~~D~~--V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.++| .++. ........+.+.+++.+-.....+..+
T Consensus 50 ~v~~~G-GEPll~~~~~~l~~~i~~~~~~~~~~~i~i~TN 88 (119)
T PF13394_consen 50 TVVFTG-GEPLLYLNPEDLIELIEYLKERGPEIKIRIETN 88 (119)
T ss_dssp EEEEES-SSGGGSTTHHHHHHHHCTSTT-----EEEEEE-
T ss_pred EEEEEC-CCCccccCHHHHHHHHHHHHhhCCCceEEEEeC
Confidence 478888 8898 456778889988888884444444443
No 256
>cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides.
Probab=36.80 E-value=1e+02 Score=19.05 Aligned_cols=41 Identities=10% Similarity=-0.024 Sum_probs=30.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+.+++.|.-...=.+........++.+.+.++++.++.+
T Consensus 184 ~~~~i~~~G~~~~~K~~~~ll~a~~~~~~~~~~~~l~i~G~ 224 (366)
T cd03822 184 GRPVLLTFGLLRPYKGLELLLEALPLLVAKHPDVRLLVAGE 224 (366)
T ss_pred CCeEEEEEeeccCCCCHHHHHHHHHHHHhhCCCeEEEEecc
Confidence 46778888877776667777777777877777888887654
No 257
>COG0124 HisS Histidyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]
Probab=36.67 E-value=1.3e+02 Score=20.93 Aligned_cols=35 Identities=20% Similarity=0.152 Sum_probs=28.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
..+++++.-..+. -..+..++++|+++|+.+++.+
T Consensus 335 ~~~v~v~~~~~~~---~~~a~~la~~LR~~g~~~~~~~ 369 (429)
T COG0124 335 RVDVYVVPLGEDA---EPEALKLAQKLRAAGISVEVDY 369 (429)
T ss_pred CCCEEEEEcCchh---HHHHHHHHHHHHHcCCcEEEEe
Confidence 4778888776665 4899999999999999988765
No 258
>COG0125 Tmk Thymidylate kinase [Nucleotide transport and metabolism]
Probab=36.53 E-value=71 Score=19.84 Aligned_cols=32 Identities=16% Similarity=0.089 Sum_probs=26.2
Q ss_pred CCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 37 MDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 37 ~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
-|-.==-.|+..+++.|++.|..+-+.-.|++
T Consensus 11 iDGaGKTT~~~~L~~~l~~~g~~v~~trEP~~ 42 (208)
T COG0125 11 IDGAGKTTQAELLKERLEERGIKVVLTREPGG 42 (208)
T ss_pred CCCCCHHHHHHHHHHHHHHcCCeEEEEeCCCC
Confidence 45555557899999999999999988888874
No 259
>CHL00122 secA preprotein translocase subunit SecA; Validated
Probab=36.25 E-value=1.1e+02 Score=23.49 Aligned_cols=31 Identities=16% Similarity=0.196 Sum_probs=27.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
..|+|+.-. .++.|+.+++.|.+.|+++++.
T Consensus 424 grPVLIgT~------SIe~SE~ls~~L~~~gi~h~vL 454 (870)
T CHL00122 424 GRPILIGTT------TIEKSELLSQLLKEYRLPHQLL 454 (870)
T ss_pred CCCEEEeeC------CHHHHHHHHHHHHHcCCcccee
Confidence 589999865 6899999999999999998764
No 260
>TIGR00511 ribulose_e2b2 ribose-1,5-bisphosphate isomerase, e2b2 family. The delineation of this family was based originally, in part, on a discussion and neighbor-joining phylogenetic study by Kyrpides and Woese of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. Recently, Sato, et al. assigned the function ribulose-1,5 bisphosphate isomerase.
Probab=36.22 E-value=53 Score=21.52 Aligned_cols=25 Identities=20% Similarity=0.118 Sum_probs=21.0
Q ss_pred CccCHHHHHHHHHHHHHcCCceEEE
Q psy472 39 DNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
+.-|..+...+++.|.+.|+++.+.
T Consensus 148 EsrP~~~G~~~a~~L~~~gI~vtlI 172 (301)
T TIGR00511 148 ETRPRKQGHITAKELRDYGIPVTLI 172 (301)
T ss_pred cCCCcchHHHHHHHHHHCCCCEEEE
Confidence 4557778899999999999999875
No 261
>TIGR03101 hydr2_PEP hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
Probab=36.04 E-value=1.2e+02 Score=19.47 Aligned_cols=40 Identities=3% Similarity=-0.177 Sum_probs=30.6
Q ss_pred CcEEEEEeCCC-CccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMD-DNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D-~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+++++.-..+ .--....+..+++.+++.|+.++.+.|++
T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~v~~~~~~~ 242 (266)
T TIGR03101 202 CPVHWFEVRPEEGATLSPVFSRLGEQWVQSGVEVTVDLVPG 242 (266)
T ss_pred CceEEEEeccccCCCCCHHHHHHHHHHHHcCCeEeeeecCC
Confidence 57777766433 33344678889999999999999999986
No 262
>cd03812 GT1_CapH_like This family is most closely related to the GT1 family of glycosyltransferases. capH in Staphylococcus aureus has been shown to be required for the biosynthesis of the type 1 capsular polysaccharide (CP1).
Probab=35.64 E-value=1.1e+02 Score=19.09 Aligned_cols=41 Identities=10% Similarity=-0.029 Sum_probs=32.1
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+.+++.|.-+..=......+.+..|.+.+.+++++++.+
T Consensus 191 ~~~~i~~vGr~~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~ 231 (358)
T cd03812 191 DKFVIGHVGRFSEQKNHEFLIEIFAELLKKNPNAKLLLVGD 231 (358)
T ss_pred CCEEEEEEeccccccChHHHHHHHHHHHHhCCCeEEEEEeC
Confidence 46788889987777667777788888888888888888754
No 263
>cd03800 GT1_Sucrose_synthase This family is most closely related to the GT1 family of glycosyltransferases. The sucrose-phosphate synthases in this family may be unique to plants and photosynthetic bacteria. This enzyme catalyzes the synthesis of sucrose 6-phosphate from fructose 6-phosphate and uridine 5'-diphosphate-glucose, a key regulatory step of sucrose metabolism. The activity of this enzyme is regulated by phosphorylation and moderated by the concentration of various metabolites and light.
Probab=35.36 E-value=1.2e+02 Score=19.31 Aligned_cols=41 Identities=15% Similarity=0.032 Sum_probs=32.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+++++-|.-+..=.+....+.+..|.+.+.++++.+..+
T Consensus 219 ~~~~i~~~gr~~~~k~~~~ll~a~~~l~~~~~~~~l~i~G~ 259 (398)
T cd03800 219 DKPRILAVGRLDPRKGIDTLIRAYAELPELRERANLVIVGG 259 (398)
T ss_pred CCcEEEEEcccccccCHHHHHHHHHHHHHhCCCeEEEEEEC
Confidence 36788888888877778888888888887777888887754
No 264
>PRK02458 ribose-phosphate pyrophosphokinase; Provisional
Probab=35.33 E-value=1e+02 Score=20.41 Aligned_cols=38 Identities=16% Similarity=0.052 Sum_probs=28.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.+++++.|... ..-+..+++.|.-.-..+++..||||
T Consensus 7 ~~~~~i~~~~~~----~~la~~ia~~lg~~l~~~~~~~FpdG 44 (323)
T PRK02458 7 DKQIKLFSLNSN----LEIAEKIAQAAGVPLGKLSSRQFSDG 44 (323)
T ss_pred CCCeEEEECCCC----HHHHHHHHHHhCCceeeeEEEECCCC
Confidence 368899998654 45667777777555568899999986
No 265
>PRK06371 translation initiation factor IF-2B subunit alpha; Provisional
Probab=35.26 E-value=64 Score=21.68 Aligned_cols=27 Identities=22% Similarity=0.075 Sum_probs=22.1
Q ss_pred CCccCHHHHHHH-HHHHHHcCCceEEEE
Q psy472 38 DDNVHFQQSMML-AKSLQHADIMFQSQE 64 (68)
Q Consensus 38 D~~V~~~~s~~l-~~~L~~~g~~~~~~~ 64 (68)
++.-|..|..+| +..|.+.|+++.++.
T Consensus 176 ~EsRP~~qG~rlta~eL~~~GI~vtlI~ 203 (329)
T PRK06371 176 DETRPRLQGARLTAWELAQEGIDHAIIA 203 (329)
T ss_pred CCCCCcchHHHHHHHHHHHCCCCEEEEc
Confidence 567788888775 889999999998753
No 266
>PRK10877 protein disulfide isomerase II DsbC; Provisional
Probab=35.15 E-value=64 Score=20.21 Aligned_cols=30 Identities=10% Similarity=-0.021 Sum_probs=21.5
Q ss_pred CCCccCHHHH-HHHHHHHHHcCCceEEEEec
Q psy472 37 MDDNVHFQQS-MMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 37 ~D~~V~~~~s-~~l~~~L~~~g~~~~~~~~~ 66 (68)
.|.++|+.+- ..-...+.+.|+.+.+..||
T Consensus 115 tDp~CpyCkkl~~~l~~~~~~~v~v~~~~~P 145 (232)
T PRK10877 115 TDITCGYCHKLHEQMKDYNALGITVRYLAFP 145 (232)
T ss_pred ECCCChHHHHHHHHHHHHhcCCeEEEEEecc
Confidence 5889999544 34555666678888888787
No 267
>KOG3363|consensus
Probab=34.78 E-value=91 Score=19.26 Aligned_cols=36 Identities=17% Similarity=0.098 Sum_probs=30.0
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.+=+++-|-.|..-|..-+.+...-++..++..|.+
T Consensus 111 kidlLIvG~Gd~~~p~~v~~~V~~F~k~~ki~lEi~ 146 (196)
T KOG3363|consen 111 KIDLLIVGCGDKKHPDKVRPSVRQFVKSHKIKLEIV 146 (196)
T ss_pred CccEEEEecCCcCCchhcCHHHHHHHHHhCcceEEe
Confidence 567888899999999888888888888888887764
No 268
>TIGR02836 spore_IV_A stage IV sporulation protein A. A comparative genome analysis of all sequenced genomes of shows a number of proteins conserved strictly among the endospore-forming subset of the Firmicutes. This protein, a member of this panel, is designated stage IV sporulation protein A. It acts in the mother cell compartment and plays a role in spore coat morphogenesis.
Probab=34.71 E-value=1.7e+02 Score=20.92 Aligned_cols=50 Identities=8% Similarity=0.191 Sum_probs=34.6
Q ss_pred CCCHHHHHhcChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHc-CCce
Q psy472 7 EDNLEGYKIAALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHA-DIMF 60 (68)
Q Consensus 7 ~~~~~~y~~~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~-g~~~ 60 (68)
.++...++.. .+..+... +.|++++-...|+..+- +..+++.|.+. ++++
T Consensus 162 Re~y~~aEe~-~i~eLk~~-~kPfiivlN~~dp~~~e--t~~l~~~l~eky~vpv 212 (492)
T TIGR02836 162 REDYVEAEER-VIEELKEL-NKPFIILLNSTHPYHPE--TEALRQELEEKYDVPV 212 (492)
T ss_pred cccchHHHHH-HHHHHHhc-CCCEEEEEECcCCCCch--hHHHHHHHHHHhCCce
Confidence 4444455554 56777788 69999999999987654 66677777544 6554
No 269
>cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases. WavL in Vibrio cholerae has been shown to be involved in the biosynthesis of the lipopolysaccharide core.
Probab=34.34 E-value=1.2e+02 Score=18.97 Aligned_cols=41 Identities=12% Similarity=-0.001 Sum_probs=30.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+++++-|.-+..=......+...+|++.+.++++++..+
T Consensus 184 ~~~~i~~~Gr~~~~Kg~~~li~~~~~l~~~~~~~~l~ivG~ 224 (355)
T cd03819 184 GKPVILLPGRLTRWKGQEVFIEALARLKKDDPDVHLLIVGD 224 (355)
T ss_pred CceEEEEeeccccccCHHHHHHHHHHHHhcCCCeEEEEEEC
Confidence 35678888877777677778888888888777888887653
No 270
>PF05902 4_1_CTD: 4.1 protein C-terminal domain (CTD); InterPro: IPR008379 There is a unique sequence domain at the C terminus of all known 4.1 proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24 kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates [].; GO: 0003779 actin binding, 0005198 structural molecule activity, 0005856 cytoskeleton
Probab=34.18 E-value=52 Score=18.82 Aligned_cols=25 Identities=20% Similarity=0.493 Sum_probs=14.8
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcC
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHAD 57 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g 57 (68)
.-.+++|+.| |.-. ..|+.+++++.
T Consensus 71 KRIvITGD~D--IDhD--qaLa~aI~eAk 95 (114)
T PF05902_consen 71 KRIVITGDAD--IDHD--QALAQAIKEAK 95 (114)
T ss_pred EEEEEecCCC--cchH--HHHHHHHHHHH
Confidence 3468899888 5533 34555555443
No 271
>PF02629 CoA_binding: CoA binding domain; InterPro: IPR003781 This domain has a Rossmann fold and is found in a number of proteins including succinyl CoA synthetases, malate and ATP-citrate ligases.; GO: 0005488 binding; PDB: 3IL2_B 3IKT_A 3IKV_B 2SCU_D 1JKJ_D 2NU7_A 1CQI_A 1JLL_A 2NU8_D 1SCU_D ....
Probab=34.18 E-value=77 Score=16.75 Aligned_cols=26 Identities=19% Similarity=-0.028 Sum_probs=19.8
Q ss_pred cCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 41 VHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 41 V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
||...+...+..|.++|+..=+-+.|
T Consensus 70 VP~~~a~~~~~~~~~~gIk~i~nft~ 95 (96)
T PF02629_consen 70 VPAEAAQEVADELVEAGIKGIVNFTP 95 (96)
T ss_dssp S-HHHHHHHHHHHHHTT-SEEEEESS
T ss_pred cCHHHHHHHHHHHHHcCCCEEEEeCC
Confidence 59999999999999999987665544
No 272
>PF12010 DUF3502: Domain of unknown function (DUF3502); InterPro: IPR022627 This domain is about 140 amino acids in length and is functionally uncharacterised. It is found in bacteria C-terminal to PF01547 from PFAM.
Probab=34.06 E-value=21 Score=20.46 Aligned_cols=25 Identities=24% Similarity=0.197 Sum_probs=18.8
Q ss_pred EEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 32 LVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 32 l~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
|..|.-| |-.....|.++|+++|++
T Consensus 92 L~~G~vd---~e~~~~~~~~kLk~AGid 116 (134)
T PF12010_consen 92 LETGLVD---PEEALPEFNEKLKAAGID 116 (134)
T ss_pred HHccCCC---HHHHHHHHHHHHHHhChH
Confidence 4567777 556677799999999863
No 273
>cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold.
Probab=33.99 E-value=1.1e+02 Score=18.51 Aligned_cols=41 Identities=10% Similarity=-0.027 Sum_probs=30.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+.+++-|..+..-...........+.+.+..++++++.+
T Consensus 198 ~~~~i~~~g~~~~~k~~~~~i~~~~~~~~~~~~~~l~i~G~ 238 (374)
T cd03801 198 DEPVILFVGRLVPRKGVDLLLEALAKLRKEYPDVRLVIVGD 238 (374)
T ss_pred CCeEEEEecchhhhcCHHHHHHHHHHHhhhcCCeEEEEEeC
Confidence 36788888988876677777777777777766788887653
No 274
>KOG1466|consensus
Probab=33.96 E-value=44 Score=22.17 Aligned_cols=23 Identities=17% Similarity=0.065 Sum_probs=19.2
Q ss_pred CHHHHHHHHHHHHHcCCceEEEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
|-.....|++.|+++|+|+.+++
T Consensus 166 Pd~sG~lm~~~L~~~~IPvtlvl 188 (313)
T KOG1466|consen 166 PDGSGKLMAKELKKLGIPVTLVL 188 (313)
T ss_pred CCCchhHHHHHHHhcCCCeEEEe
Confidence 44567899999999999998864
No 275
>PF01841 Transglut_core: Transglutaminase-like superfamily; InterPro: IPR002931 This domain is found in many proteins known to have transglutaminase activity, i.e. which cross-link proteins through an acyl-transfer reaction between the gamma-carboxamide group of peptide-bound glutamine and the epsilon-amino group of peptide-bound lysine, resulting in a epsilon-(gamma-glutamyl)lysine isopeptide bond. Tranglutaminases have been found in a diverse range of species, from bacteria through to mammals. The enzymes require calcium binding and their activity leads to post-translational modification of proteins through acyl-transfer reactions, involving peptidyl glutamine residues as acyl donors and a variety of primary amines as acyl acceptors, with the generation of proteinase resistant isopeptide bonds []. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterised transglutaminase, the human blood clotting factor XIIIa' []. On the basis of the experimentally demonstrated activity of the Methanobacterium phage psiM2 pseudomurein endoisopeptidase [], it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease []. A subunit of plasma Factor XIII revealed that each Factor XIIIA subunit is composed of four domains (termed N-terminal beta-sandwich, core domain (containing the catalytic and the regulatory sites), and C-terminal beta-barrels 1 and 2) and that two monomers assemble into the native dimer through the surfaces in domains 1 and 2, in opposite orientation. This organisation in four domains is highly conserved during evolution among transglutaminase isoforms [].; PDB: 2F4M_A 2F4O_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B ....
Probab=33.67 E-value=77 Score=16.63 Aligned_cols=22 Identities=9% Similarity=-0.005 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHcCCceEEEE
Q psy472 43 FQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
.+.+..|+..|+..|+++.++.
T Consensus 55 ~~~a~l~~allr~~Gipar~v~ 76 (113)
T PF01841_consen 55 EDYASLFVALLRALGIPARVVS 76 (113)
T ss_dssp HHHHHHHHHHHHHHT--EEEEE
T ss_pred HHHHHHHHHHHhhCCCceEEEE
Confidence 4677888999999999998764
No 276
>PF02374 ArsA_ATPase: Anion-transporting ATPase; PDB: 2WOO_A 3IBG_B 3SJA_A 3H84_B 3SJD_A 3ZS9_A 3A37_A 2WOJ_A 3SJC_B 3A36_B ....
Probab=33.61 E-value=57 Score=21.30 Aligned_cols=30 Identities=10% Similarity=0.044 Sum_probs=24.9
Q ss_pred CCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 37 MDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 37 ~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.=+..++.++.++...|+..|+++..++.+
T Consensus 215 ~pE~l~i~Et~r~~~~L~~~gi~v~~vVvN 244 (305)
T PF02374_consen 215 NPEPLAIAETERLLTELKLYGIPVDAVVVN 244 (305)
T ss_dssp ESSHHHHHHHHHHHHHHHHTT-EEEEEEEE
T ss_pred cCCcchHHHHHHHHHHHHhcCCccCeEEEE
Confidence 345678999999999999999999988764
No 277
>PF03129 HGTP_anticodon: Anticodon binding domain; InterPro: IPR004154 tRNA synthetases, or tRNA ligases are involved in protein synthesis. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases [] it is probably the anticodon binding domain [].; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding; PDB: 1KOG_B 1EVL_D 1EVK_B 1QF6_A 1FYF_B 2I4O_A 2I4M_A 2I4N_A 2I4L_A 1HC7_D ....
Probab=33.55 E-value=74 Score=16.35 Aligned_cols=21 Identities=24% Similarity=0.286 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHcCCceEEE
Q psy472 43 FQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~ 63 (68)
...+..+++.|.++|+.+++.
T Consensus 15 ~~~a~~l~~~L~~~gi~v~~d 35 (94)
T PF03129_consen 15 IEYAQELANKLRKAGIRVELD 35 (94)
T ss_dssp HHHHHHHHHHHHHTTSEEEEE
T ss_pred HHHHHHHHHHHHHCCCEEEEE
Confidence 467888999999999877654
No 278
>PF01008 IF-2B: Initiation factor 2 subunit family; InterPro: IPR000649 Initiation factor 2 binds to Met-tRNA, GTP and the small ribosomal subunit. The eukaryotic translation initiation factor EIF-2B is a complex made up of five different subunits, alpha, beta, gamma, delta and epsilon, and catalyses the exchange of EIF-2-bound GDP for GTP. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes; related proteins from archaebacteria and IF-2 from prokaryotes and also contains a subfamily of proteins in eukaryotes, archaeae (e.g. Pyrococcus furiosus), or eubacteria such as Bacillus subtilis and Thermotoga maritima. Many of these proteins were initially annotated as putative translation initiation factors despite the fact that there is no evidence for the requirement of an IF2 recycling factor in prokaryotic translation initiation. Recently, one of these proteins from B. subtilis has been functionally characterised as a 5-methylthioribose-1-phosphate isomerase (MTNA) []. This enzyme participates in the methionine salvage pathway catalysing the isomerisation of 5-methylthioribose-1-phosphate to 5-methylthioribulose-1-phosphate []. The methionine salvage pathway leads to the synthesis of methionine from methylthioadenosine, the end product of the spermidine and spermine anabolism in many species.; GO: 0044237 cellular metabolic process; PDB: 1VB5_A 1T5O_D 3A11_E 3VM6_C 1W2W_A 1T9K_A 3ECS_B 2YRF_A 2YVK_B 2A0U_A ....
Probab=33.46 E-value=36 Score=21.57 Aligned_cols=24 Identities=21% Similarity=0.277 Sum_probs=17.4
Q ss_pred ccCHHHHHHHHHHHHHcCCceEEE
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.-|..+...+++.|.+.|+++.++
T Consensus 141 s~P~~eG~~~a~~L~~~gi~v~~i 164 (282)
T PF01008_consen 141 SRPYNEGRLMAKELAEAGIPVTLI 164 (282)
T ss_dssp -TTTTHHHTHHHHHHHTT-EEEEE
T ss_pred CCcchhhhhHHHHhhhcceeEEEE
Confidence 345667788999999999888775
No 279
>PF08532 Glyco_hydro_42M: Beta-galactosidase trimerisation domain; InterPro: IPR013738 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is non catalytic domain B of beta-galactosidase enzymes belong to the glycosyl hydrolase 42 family. This domain is related to glutamine amidotransferase enzymes, but the catalytic residues are replaced by non functional amino acids. This domain is involved in trimerisation. ; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process; PDB: 1KWK_A 1KWG_A.
Probab=33.44 E-value=68 Score=19.46 Aligned_cols=23 Identities=13% Similarity=-0.026 Sum_probs=17.4
Q ss_pred CHHHHHHHHHHHHHcCCceEEEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
...+...++++|.+.|++++++-
T Consensus 28 y~~~~~~~y~al~~~gi~vDvv~ 50 (207)
T PF08532_consen 28 YRDQVRGWYRALRELGIPVDVVS 50 (207)
T ss_dssp HHHHHHHHHHHHHTTT--EEEE-
T ss_pred HHHHHHHHHHHHHHcCCceEEec
Confidence 45778889999999999999874
No 280
>PF11950 DUF3467: Protein of unknown function (DUF3467); InterPro: IPR021857 This entry is represented by Bacteriophage 92, 0rf53. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family of proteins are functionally uncharacterised. This protein is found in bacteria, archaea and viruses. Proteins in this family are typically between 101 to 118 amino acids in length.
Probab=33.31 E-value=43 Score=18.02 Aligned_cols=17 Identities=29% Similarity=0.225 Sum_probs=13.9
Q ss_pred ccCHHHHHHHHHHHHHc
Q psy472 40 NVHFQQSMMLAKSLQHA 56 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~ 56 (68)
+.++.++.+|+.+|.++
T Consensus 63 imsP~~AKrL~~aL~~~ 79 (92)
T PF11950_consen 63 IMSPQHAKRLLKALQQN 79 (92)
T ss_pred EeCHHHHHHHHHHHHHH
Confidence 45789999999998764
No 281
>cd02006 TPP_Gcl Thiamine pyrophosphate (TPP) family, Gcl subfamily, TPP-binding module; composed of proteins similar to Escherichia coli glyoxylate carboligase (Gcl). E. coli glyoxylate carboligase, plays a key role in glyoxylate metabolism where it catalyzes the condensation of two molecules of glyoxylate to give tartronic semialdehyde and carbon dioxide. This enzyme requires TPP, magnesium ion and FAD as cofactors.
Probab=33.18 E-value=1.1e+02 Score=18.36 Aligned_cols=38 Identities=18% Similarity=0.117 Sum_probs=29.5
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.|++.+.|+..-.-.. .+|.. +.+.++++-++++.|+
T Consensus 75 ~~~vv~i~GDG~f~m~~---~eL~T-a~~~~lpviivV~NN~ 112 (202)
T cd02006 75 DRQVVALSGDYDFQFMI---EELAV-GAQHRIPYIHVLVNNA 112 (202)
T ss_pred CCeEEEEEeChHhhccH---HHHHH-HHHhCCCeEEEEEeCc
Confidence 47899999988877775 34444 4778999999999875
No 282
>KOG3946|consensus
Probab=33.14 E-value=82 Score=21.21 Aligned_cols=41 Identities=17% Similarity=0.070 Sum_probs=31.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHc--------CCceEEEEecCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHA--------DIMFQSQECYRR 68 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~--------g~~~~~~~~~~~ 68 (68)
.|-..+-|..|-.||=.--..++++|.+. .+..++++|+|+
T Consensus 131 ~p~~~~vgatdsAvpcamll~laq~l~~~~~~~~~~s~lsL~LvFFDGE 179 (338)
T KOG3946|consen 131 FPGGMFVGATDSAVPCAMLLNLAQALDKILCSKVSASQLSLQLVFFDGE 179 (338)
T ss_pred CCCcceEeeccccccHHHHHHHHHHHHHHHhcccCcCceeEEEEEeccH
Confidence 56666899999999998888888877653 245677888764
No 283
>PRK08057 cobalt-precorrin-6x reductase; Reviewed
Probab=33.13 E-value=80 Score=20.15 Aligned_cols=28 Identities=29% Similarity=0.357 Sum_probs=22.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
..+|++-||. .+..+++.|.+.|.++-.
T Consensus 3 ~~IlvlgGT~-------egr~la~~L~~~g~~v~~ 30 (248)
T PRK08057 3 PRILLLGGTS-------EARALARALAAAGVDIVL 30 (248)
T ss_pred ceEEEEechH-------HHHHHHHHHHhCCCeEEE
Confidence 5688999986 588999999988876644
No 284
>PRK13398 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=32.96 E-value=1.4e+02 Score=19.29 Aligned_cols=32 Identities=13% Similarity=0.190 Sum_probs=26.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
.+++++-| =+-+=..++..++++.|++.|+.+
T Consensus 26 ~~~~~iaG-PCsie~~~~~~~~A~~lk~~g~~~ 57 (266)
T PRK13398 26 EEKIIIAG-PCAVESEEQMVKVAEKLKELGVHM 57 (266)
T ss_pred CCEEEEEe-CCcCCCHHHHHHHHHHHHHcCCCE
Confidence 47788877 566667899999999999999874
No 285
>PLN03033 2-dehydro-3-deoxyphosphooctonate aldolase; Provisional
Probab=32.71 E-value=1.4e+02 Score=19.84 Aligned_cols=43 Identities=12% Similarity=0.182 Sum_probs=31.6
Q ss_pred cChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHc----CCce
Q psy472 16 AALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHA----DIMF 60 (68)
Q Consensus 16 ~sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~----g~~~ 60 (68)
+|.+.+.-.. ..|+++|-| =+-+-.-++....++.|++. |+++
T Consensus 4 ~~~~~~~~~~-~~~~~lIAG-PC~iEs~e~~~~~A~~lk~~~~~~g~~~ 50 (290)
T PLN03033 4 SSLLYNQLKA-AEPFFLLAG-PNVIESEEHILRMAKHIKDISTKLGLPL 50 (290)
T ss_pred hhHHHHhcCC-CCCeEEEec-CChhcCHHHHHHHHHHHHHHHHhCCCcE
Confidence 3444444444 478888888 77888889999999999996 7644
No 286
>PF04244 DPRP: Deoxyribodipyrimidine photo-lyase-related protein; InterPro: IPR007357 This family appears to be related to DNA photolyases.; PDB: 3ZXS_A.
Probab=32.49 E-value=75 Score=20.00 Aligned_cols=25 Identities=16% Similarity=0.107 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHcCCceEEEEecC
Q psy472 43 FQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
......+++.|+++|..+.++.+++
T Consensus 48 ~saMRhfa~~L~~~G~~V~Y~~~~~ 72 (224)
T PF04244_consen 48 FSAMRHFADELRAKGFRVHYIELDD 72 (224)
T ss_dssp HHHHHHHHHHHHHTT--EEEE-TT-
T ss_pred HHHHHHHHHHHHhCCCEEEEEeCCC
Confidence 4556678999999999999987763
No 287
>PRK02929 L-arabinose isomerase; Provisional
Probab=32.38 E-value=1.2e+02 Score=21.62 Aligned_cols=42 Identities=12% Similarity=0.101 Sum_probs=28.4
Q ss_pred hhcCCCCcEEEEEeCCCCcc----------CHHHH----HHHHHHHHHcCCceEEEE
Q psy472 22 VDRIRDKQYLLVHGTMDDNV----------HFQQS----MMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V----------~~~~s----~~l~~~L~~~g~~~~~~~ 64 (68)
++.+ +.|+|++|---.+-+ ...|| ..+...|.+.|++++++.
T Consensus 93 ~~~l-~~PvL~~~~Q~~~e~p~~~id~d~m~lnqs~~G~~e~~~il~R~gi~~~~v~ 148 (499)
T PRK02929 93 LSAL-QKPLLHLHTQFNAEIPWDTIDMDFMNLNQSAHGDREFGFIGARLRKQRKVVV 148 (499)
T ss_pred HHHc-CCCEEEEecCCCccCCCCCCCcchhhhhhcccChHHHHHHHHHcCCCeeEEE
Confidence 5567 699999998222222 23333 357888999999987765
No 288
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=32.02 E-value=1.1e+02 Score=19.65 Aligned_cols=28 Identities=4% Similarity=-0.053 Sum_probs=23.1
Q ss_pred ccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
...-+..+++++.+++.|++++.+....
T Consensus 20 y~~~~~v~~~~~~~~~~~iP~d~~~lD~ 47 (308)
T cd06593 20 YYDEEEVNEFADGMRERNLPCDVIHLDC 47 (308)
T ss_pred CCCHHHHHHHHHHHHHcCCCeeEEEEec
Confidence 3566788999999999999999877653
No 289
>TIGR02189 GlrX-like_plant Glutaredoxin-like family. This family of glutaredoxin-like proteins is aparrently limited to plants. Multiple isoforms are found in A. thaliana and O.sativa.
Probab=31.59 E-value=27 Score=18.83 Aligned_cols=36 Identities=17% Similarity=0.189 Sum_probs=24.9
Q ss_pred cCCCCcEEEEEe----CCCCccCHHHHHHHHHHHHHcCCc
Q psy472 24 RIRDKQYLLVHG----TMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 24 ~~~~~p~ll~hG----~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
..++.|.+++-| .-|+..-..++-.|.+.|++.|..
T Consensus 58 g~~tvP~Vfi~g~~iGG~ddl~~l~~~G~L~~~l~~~~~~ 97 (99)
T TIGR02189 58 CSPAVPAVFVGGKLVGGLENVMALHISGSLVPMLKQAGAL 97 (99)
T ss_pred CCCCcCeEEECCEEEcCHHHHHHHHHcCCHHHHHHHhCcc
Confidence 333578887664 566666677777788888887764
No 290
>TIGR01769 GGGP geranylgeranylglyceryl phosphate synthase. This model represents geranylgeranylglyceryl phosphate synthase which catalyzes the first committed step in the synthesis of ether-linked membrane lipids in archaea. The active enzyme is reported to be a homopentamer in Methanobacterium thermoautotrophicum but is reported to be a homodimer in Thermoplasma acidophilum.
Probab=31.53 E-value=1.3e+02 Score=18.68 Aligned_cols=42 Identities=7% Similarity=0.023 Sum_probs=30.2
Q ss_pred HhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHc-CCceEEEEecC
Q psy472 21 KVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHA-DIMFQSQECYR 67 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~-g~~~~~~~~~~ 67 (68)
.+... .+..+++-|+ ..|..++..++.+++++. .+|+ .+||+
T Consensus 19 ~v~~~-gtDaI~VGGS--~gvt~~~~~~~v~~ik~~~~lPv--ilfp~ 61 (205)
T TIGR01769 19 NAKDA-GTDAIMVGGS--LGIVESNLDQTVKKIKKITNLPV--ILFPG 61 (205)
T ss_pred HHHhc-CCCEEEEcCc--CCCCHHHHHHHHHHHHhhcCCCE--EEECC
Confidence 45555 4788888887 557889999999999884 3554 34554
No 291
>PRK01906 tetraacyldisaccharide 4'-kinase; Provisional
Probab=31.50 E-value=1.5e+02 Score=19.85 Aligned_cols=34 Identities=12% Similarity=0.070 Sum_probs=26.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+++++-|-.+ -..|.+.|++.|...+.+.|||
T Consensus 243 ~~~v~a~aGIgn-------P~~F~~~L~~~G~~~~~~~fpD 276 (338)
T PRK01906 243 GERVLAAAGIGA-------PERFFATLRAAGLAPATRALPD 276 (338)
T ss_pred CCeEEEEEECCC-------HHHHHHHHHHcCCceeEEeCCC
Confidence 357888888665 4678888899999887788886
No 292
>COG1447 CelC Phosphotransferase system cellobiose-specific component IIA [Carbohydrate transport and metabolism]
Probab=31.43 E-value=47 Score=18.69 Aligned_cols=27 Identities=30% Similarity=0.272 Sum_probs=16.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQ 54 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~ 54 (68)
...+|++|+ +|..-.-.-...|++.+.
T Consensus 70 ~~slllvHA-QDHLMtt~~~~~l~ke~i 96 (105)
T COG1447 70 EVSLLLVHA-QDHLMTTITEKDLIKELI 96 (105)
T ss_pred ceeeeeeeh-hHHHHHHHHHHHHHHHHH
Confidence 488999999 676555444444444443
No 293
>KOG4388|consensus
Probab=31.36 E-value=1.1e+02 Score=22.87 Aligned_cols=37 Identities=19% Similarity=0.181 Sum_probs=31.3
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
+|+.|+.-..|+.. ..+..|+++|++.|.++.+.+..
T Consensus 788 Pp~~i~ac~mDP~L--DD~vmfA~kLr~lG~~v~l~vle 824 (880)
T KOG4388|consen 788 PPVHIVACAMDPML--DDSVMFARKLRNLGQPVTLRVLE 824 (880)
T ss_pred CCceEEEeccCcch--hHHHHHHHHHHhcCCceeehhhh
Confidence 78999998889875 57889999999999988876643
No 294
>PRK12903 secA preprotein translocase subunit SecA; Reviewed
Probab=31.22 E-value=1.6e+02 Score=22.85 Aligned_cols=31 Identities=19% Similarity=0.179 Sum_probs=26.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
..|+|+.-. .++.|+.+.+.|.+.|++++++
T Consensus 426 gqPVLVgT~------SIe~SE~ls~~L~~~gi~h~vL 456 (925)
T PRK12903 426 GQPILIGTA------QVEDSETLHELLLEANIPHTVL 456 (925)
T ss_pred CCCEEEEeC------cHHHHHHHHHHHHHCCCCceee
Confidence 589999754 7899999999999999998754
No 295
>TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family. Members of this protein family are most closely related to TIGR02764, a subset of polysaccharide deacetylase family proteins found in a species if and only if the species forms endospores like those of Bacillus subtilis or Clostridium tetani. This family is likewise restricted to spore-formers, but is not universal among them in having sequences with full-length matches to the model.
Probab=30.87 E-value=1.4e+02 Score=19.25 Aligned_cols=22 Identities=9% Similarity=-0.103 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHcCCceEEEEe
Q psy472 44 QQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 44 ~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
..+.++...|++.|++..+++.
T Consensus 98 ~~t~~iL~iLkk~~vkATFFv~ 119 (268)
T TIGR02873 98 EYLPEILQILKKHDVKATFFLE 119 (268)
T ss_pred chHHHHHHHHHHCCCCEEEEee
Confidence 5778999999999999988764
No 296
>PF06850 PHB_depo_C: PHB de-polymerase C-terminus; InterPro: IPR009656 This entry represents the C terminus of bacterial poly(3-hydroxybutyrate) (PHB) de-polymerase. This degrades PHB granules to oligomers and monomers of 3-hydroxy-butyric acid.
Probab=30.85 E-value=74 Score=20.02 Aligned_cols=27 Identities=30% Similarity=0.245 Sum_probs=22.6
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHH
Q psy472 25 IRDKQYLLVHGTMDDNVHFQQSMMLAK 51 (68)
Q Consensus 25 ~~~~p~ll~hG~~D~~V~~~~s~~l~~ 51 (68)
++++++|-+=|..|++..+.|+....+
T Consensus 132 I~~taLlTVEGe~DDIsg~GQT~AA~~ 158 (202)
T PF06850_consen 132 IRRTALLTVEGERDDISGPGQTHAAHD 158 (202)
T ss_pred cccceeEEeecCcccCCcchHHHHHHH
Confidence 446899999999999999999876543
No 297
>PRK12822 phospho-2-dehydro-3-deoxyheptonate aldolase; Provisional
Probab=30.49 E-value=1.8e+02 Score=19.92 Aligned_cols=34 Identities=12% Similarity=0.175 Sum_probs=27.4
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.+++-|..-+|-..++.....++|++.|.+-.++
T Consensus 230 HvILRGg~~PNY~~~~v~~a~~~l~~~~l~~~vm 263 (356)
T PRK12822 230 HIILRGGREPNYGLSDVTKASKLLHDEGLNHRLI 263 (356)
T ss_pred eEEEeCCCCCCCCHHHHHHHHHHHHHCCCCCcEE
Confidence 4555677789999999999999999999875444
No 298
>PRK06036 translation initiation factor IF-2B subunit alpha; Provisional
Probab=30.37 E-value=1.1e+02 Score=20.65 Aligned_cols=28 Identities=14% Similarity=0.049 Sum_probs=22.0
Q ss_pred CCCccCHHHHHHH-HHHHHHcCCceEEEE
Q psy472 37 MDDNVHFQQSMML-AKSLQHADIMFQSQE 64 (68)
Q Consensus 37 ~D~~V~~~~s~~l-~~~L~~~g~~~~~~~ 64 (68)
-++.-|..|..+| +..|.+.|+++.++.
T Consensus 186 v~EtRP~~qG~rlta~eL~~~GI~vtlI~ 214 (339)
T PRK06036 186 ACETRPLNQGSRLTTWELMQDNIPVTLIT 214 (339)
T ss_pred EcCCCchhhHHHHHHHHHHHcCCCEEEEe
Confidence 3467788888776 689999999998763
No 299
>PRK10629 EnvZ/OmpR regulon moderator; Provisional
Probab=30.30 E-value=1.1e+02 Score=17.53 Aligned_cols=39 Identities=10% Similarity=0.140 Sum_probs=29.3
Q ss_pred CCcEEEEEeC-CCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGT-MDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~-~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+-|.+=|.+. ....+ .....+.+.|.++|+.++.....+
T Consensus 34 edpavQIs~~~~g~~~--~~~~~v~~~L~~~gI~~ksi~~~~ 73 (127)
T PRK10629 34 QESTLAIRAVHQGASL--PDGFYVYQHLDANGIHIKSITPEN 73 (127)
T ss_pred CCceEEEecCCCCCcc--chHHHHHHHHHHCCCCcceEEeeC
Confidence 4677777776 44344 777889999999999988766554
No 300
>PRK07933 thymidylate kinase; Validated
Probab=30.29 E-value=1e+02 Score=18.82 Aligned_cols=29 Identities=10% Similarity=-0.039 Sum_probs=22.8
Q ss_pred CCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 38 DDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 38 D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
|-.==-.++..|++.|+..|..+.+..+|
T Consensus 9 dGsGKST~~~~L~~~L~~~g~~v~~~~~P 37 (213)
T PRK07933 9 DGAGKRTLTEALRAALEARGRSVATLAFP 37 (213)
T ss_pred CCCCHHHHHHHHHHHHHHCCCeEEEEecC
Confidence 33334468899999999999988888777
No 301
>PF13579 Glyco_trans_4_4: Glycosyl transferase 4-like domain; PDB: 3C4Q_B 3C4V_A 3C48_B 1Z2T_A.
Probab=30.26 E-value=96 Score=16.65 Aligned_cols=21 Identities=14% Similarity=-0.053 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHcCCceEEEEe
Q psy472 45 QSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~ 65 (68)
....++++|.+.|..+.+...
T Consensus 6 ~~~~l~~~L~~~G~~V~v~~~ 26 (160)
T PF13579_consen 6 YVRELARALAARGHEVTVVTP 26 (160)
T ss_dssp HHHHHHHHHHHTT-EEEEEEE
T ss_pred HHHHHHHHHHHCCCEEEEEec
Confidence 567899999999988887654
No 302
>PF04389 Peptidase_M28: Peptidase family M28; InterPro: IPR007484 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain is found in metallopeptidases belonging to the MEROPS peptidase family M28 (aminopeptidase Y, clan MH) []. They also contain a transferrin receptor-like dimerisation domain (IPR007365 from INTERPRO) and a protease-associated PA domain (IPR003137 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 3BXM_A 2C6P_A 1Z8L_C 3SJF_A 3BHX_A 2C6G_A 3D7F_A 2XEG_A 3BI1_A 2C6C_A ....
Probab=30.22 E-value=83 Score=18.16 Aligned_cols=36 Identities=14% Similarity=0.045 Sum_probs=25.9
Q ss_pred EEEeCCCCccCHHHHHHHHHHHHHcCC----ceEEEEecC
Q psy472 32 LVHGTMDDNVHFQQSMMLAKSLQHADI----MFQSQECYR 67 (68)
Q Consensus 32 l~hG~~D~~V~~~~s~~l~~~L~~~g~----~~~~~~~~~ 67 (68)
...|..|.-....-.+++++.|++.+. .+.+..|.+
T Consensus 20 ~~~GA~DnasGva~lLelAr~l~~~~~~~~~~i~fv~~~~ 59 (179)
T PF04389_consen 20 WSPGANDNASGVAALLELARVLKELKPQPKRTIRFVFFDG 59 (179)
T ss_dssp SSS-TTTTHHHHHHHHHHHHHHHHSTHSSSEEEEEEEESS
T ss_pred ccCCcccchHHHHHHHHHHHHHHHhhcccCccEEEEEecc
Confidence 567888999999999999999998543 345555543
No 303
>PF13477 Glyco_trans_4_2: Glycosyl transferase 4-like
Probab=30.20 E-value=98 Score=16.76 Aligned_cols=23 Identities=17% Similarity=-0.034 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHcCCceEEEEec
Q psy472 44 QQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 44 ~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.....+++.|++.|.++......
T Consensus 11 ~~~~~~~~~L~~~g~~V~ii~~~ 33 (139)
T PF13477_consen 11 TFIYNLAKELKKRGYDVHIITPR 33 (139)
T ss_pred HHHHHHHHHHHHCCCEEEEEEcC
Confidence 46788999999999988877653
No 304
>PF07131 DUF1382: Protein of unknown function (DUF1382); InterPro: IPR009814 This entry is represented by Bacteriophage lambda, Xis. This entry overlaps with IPR009750, both representing lambda Xis. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical Escherichia coli and Bacteriophage lambda-like proteins of around 60 residues in length. The function of this family is unknown.
Probab=29.96 E-value=62 Score=16.42 Aligned_cols=20 Identities=35% Similarity=0.619 Sum_probs=15.8
Q ss_pred cCHHHHHHHHHHHHHcCCce
Q psy472 41 VHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 41 V~~~~s~~l~~~L~~~g~~~ 60 (68)
|...+++.++..|.+.|+.|
T Consensus 7 v~LR~~lE~A~~La~~GIRF 26 (61)
T PF07131_consen 7 VDLRKALEMAHSLAHIGIRF 26 (61)
T ss_pred HHHHHHHHHHHHHHHcCcee
Confidence 34577888999999999865
No 305
>PF13823 ADH_N_assoc: Alcohol dehydrogenase GroES-associated; PDB: 2DPH_B.
Probab=29.95 E-value=24 Score=14.40 Aligned_cols=12 Identities=33% Similarity=0.476 Sum_probs=8.3
Q ss_pred EEEEEeCCCCcc
Q psy472 30 YLLVHGTMDDNV 41 (68)
Q Consensus 30 ~ll~hG~~D~~V 41 (68)
-+.+||..|-.|
T Consensus 3 Av~y~G~~~v~v 14 (23)
T PF13823_consen 3 AVVYHGPKDVRV 14 (23)
T ss_dssp EEEEEETTEEEE
T ss_pred ceEEeCCCceEE
Confidence 467888877544
No 306
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=29.94 E-value=49 Score=13.23 Aligned_cols=17 Identities=6% Similarity=0.169 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHcCCc
Q psy472 43 FQQSMMLAKSLQHADIM 59 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~ 59 (68)
++.+..+.+.+++.|+.
T Consensus 17 ~~~a~~~~~~M~~~gv~ 33 (34)
T PF13812_consen 17 PDAALQLFDEMKEQGVK 33 (34)
T ss_pred HHHHHHHHHHHHHhCCC
Confidence 56788888888888863
No 307
>TIGR03675 arCOG00543 arCOG00543 universal archaeal KH-domain/beta-lactamase-domain protein. This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782.
Probab=29.91 E-value=1.6e+02 Score=21.53 Aligned_cols=36 Identities=33% Similarity=0.431 Sum_probs=24.3
Q ss_pred hhHHhhcCCC--CcEEEEEeCCCCccCHHHHHHHHHHHHHc-CCc
Q psy472 18 LNNKVDRIRD--KQYLLVHGTMDDNVHFQQSMMLAKSLQHA-DIM 59 (68)
Q Consensus 18 p~~~~~~~~~--~p~ll~hG~~D~~V~~~~s~~l~~~L~~~-g~~ 59 (68)
.+.++...+. ..++++||.. ..+..|++.|++. +.+
T Consensus 579 L~~~v~~~~p~p~~v~lvHGe~------~~~~~la~~l~~~~~~~ 617 (630)
T TIGR03675 579 LMNYVRRMQPKPEKILLNHGEP------SKILDLASSIYKKFNIE 617 (630)
T ss_pred HHHHHHhcCCCCCEEEEEcCCH------HHHHHHHHHHHHHhCCc
Confidence 5566666622 3499999954 5778888888754 443
No 308
>PRK00093 GTP-binding protein Der; Reviewed
Probab=29.90 E-value=1.8e+02 Score=19.58 Aligned_cols=38 Identities=8% Similarity=0.079 Sum_probs=31.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHH----cCCceEEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQH----ADIMFQSQE 64 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~----~g~~~~~~~ 64 (68)
.+|.+++++..-..++.....-|.+.|++ .|+|.++.+
T Consensus 387 ~pp~f~~~~n~~~~~~~~y~~~l~~~~r~~~~~~g~p~~~~~ 428 (435)
T PRK00093 387 NPPTFVLFVNDPELLPFSYKRYLENQLREAFDFEGTPIRLEF 428 (435)
T ss_pred CCCEEEEEeCCcccCCHHHHHHHHHHHHHHcCCCcccEEEEE
Confidence 48999999988888999988888888877 367776654
No 309
>PF15643 Tox-PL-2: Papain fold toxin 2
Probab=29.73 E-value=80 Score=17.66 Aligned_cols=21 Identities=19% Similarity=0.072 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHcCCceEEEE
Q psy472 44 QQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 44 ~~s~~l~~~L~~~g~~~~~~~ 64 (68)
+=+..|.+.|+++|++.++.-
T Consensus 23 ~cA~Al~~~L~~~gI~Gk~i~ 43 (100)
T PF15643_consen 23 ECASALKQFLKQAGIPGKIIR 43 (100)
T ss_pred HHHHHHHHHHHHCCCCceEEE
Confidence 446678888999999887653
No 310
>COG2818 Tag 3-methyladenine DNA glycosylase [DNA replication, recombination, and repair]
Probab=29.66 E-value=51 Score=20.48 Aligned_cols=18 Identities=22% Similarity=0.312 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHcCCce
Q psy472 43 FQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~ 60 (68)
..-|..|+++|+++|..|
T Consensus 140 t~~S~~mskaLKkrGf~f 157 (188)
T COG2818 140 TELSDAMSKALKKRGFKF 157 (188)
T ss_pred chhHHHHHHHHHHccCee
Confidence 577899999999999654
No 311
>cd04906 ACT_ThrD-I_1 First of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes the first of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=29.38 E-value=91 Score=16.11 Aligned_cols=20 Identities=15% Similarity=0.086 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHcCCceEE
Q psy472 43 FQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~ 62 (68)
.++...+.++|+++|..+..
T Consensus 52 ~~~~~~i~~~L~~~G~~~~~ 71 (85)
T cd04906 52 AEELAELLEDLKSAGYEVVD 71 (85)
T ss_pred HHHHHHHHHHHHHCCCCeEE
Confidence 78899999999999987653
No 312
>PF10096 DUF2334: Uncharacterized protein conserved in bacteria (DUF2334); InterPro: IPR018763 This group of proteins has no known function.
Probab=29.17 E-value=1.5e+02 Score=18.64 Aligned_cols=27 Identities=15% Similarity=0.028 Sum_probs=13.8
Q ss_pred ccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+.++....+++.|.+.|+++-+-+.|
T Consensus 12 ~~~~~~l~~i~d~l~~~~ipf~v~vIP 38 (243)
T PF10096_consen 12 FSDLEKLKEIADYLYKYGIPFSVAVIP 38 (243)
T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEEEe
Confidence 344455555555555555555444443
No 313
>PLN02924 thymidylate kinase
Probab=29.07 E-value=92 Score=19.32 Aligned_cols=39 Identities=15% Similarity=0.037 Sum_probs=30.8
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..||.-=.-|-.==-.|+..|++.|...|..+.+.-+|+
T Consensus 16 g~~IviEGiDGsGKsTq~~~L~~~l~~~g~~v~~~~ep~ 54 (220)
T PLN02924 16 GALIVLEGLDRSGKSTQCAKLVSFLKGLGVAAELWRFPD 54 (220)
T ss_pred CeEEEEECCCCCCHHHHHHHHHHHHHhcCCCceeeeCCC
Confidence 444444448888888999999999999999987777764
No 314
>PF00532 Peripla_BP_1: Periplasmic binding proteins and sugar binding domain of LacI family; InterPro: IPR001761 This family includes the periplasmic binding proteins, and the LacI family transcriptional regulators. The periplasmic binding proteins are the primary receptors for chemotaxis and transport of many sugar based solutes. The LacI family of proteins consist of transcriptional regulators related to the lac repressor. In this case, generally the sugar binding domain binds a sugar which changes the DNA binding activity of the repressor domain (lacI) [, ].; PDB: 1BAP_A 7ABP_A 6ABP_A 1ABF_A 5ABP_A 2WRZ_B 9ABP_A 1APB_A 1ABE_A 8ABP_A ....
Probab=28.83 E-value=1.6e+02 Score=18.63 Aligned_cols=33 Identities=12% Similarity=0.212 Sum_probs=24.6
Q ss_pred Cc-EEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 28 KQ-YLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 28 ~p-~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
.. +.++.|..+...-.+-..-+.++|+++|+++
T Consensus 119 ~~~I~~i~~~~~~~~~~~R~~Gy~~Al~~~Gl~~ 152 (279)
T PF00532_consen 119 RRPIAFIGGPEDSSTSRERLQGYRDALKEAGLPI 152 (279)
T ss_dssp CSTEEEEEESTTTHHHHHHHHHHHHHHHHTTSCE
T ss_pred CCeEEEEecCcchHHHHHHHHHHHHHHHHcCCCC
Confidence 44 8888888777755555556888999998855
No 315
>TIGR00524 eIF-2B_rel eIF-2B alpha/beta/delta-related uncharacterized proteins. This model, eIF-2B_rel, describes half of a superfamily, where the other half consists of eukaryotic translation initiation factor 2B (eIF-2B) subunits alpha, beta, and delta. It is unclear whether the eIF-2B_rel set is monophyletic, or whether they are all more closely related to each other than to any eIF-2B subunit because the eIF-2B clade is highly derived. Members of this branch of the family are all uncharacterized with respect to function and are found in the Archaea, Bacteria, and Eukarya, although a number are described as putative translation intiation factor components. Proteins found by eIF-2B_rel include at least three clades, including a set of uncharacterized eukaryotic proteins, a set found in some but not all Archaea, and a set universal so far among the Archaea and closely related to several uncharacterized bacterial proteins.
Probab=28.74 E-value=76 Score=20.88 Aligned_cols=25 Identities=16% Similarity=0.012 Sum_probs=19.9
Q ss_pred CccCHHHH-HHHHHHHHHcCCceEEE
Q psy472 39 DNVHFQQS-MMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 39 ~~V~~~~s-~~l~~~L~~~g~~~~~~ 63 (68)
+.-|..+. ..+++.|.+.|+++.++
T Consensus 159 EsrP~~~G~~~~a~~L~~~gI~vtlI 184 (303)
T TIGR00524 159 ETRPRNQGSRLTAWELMQDGIDVTLI 184 (303)
T ss_pred CCCCccchHHHHHHHHHHCCCCEEEE
Confidence 44566666 78899999999999876
No 316
>KOG4238|consensus
Probab=28.67 E-value=1.4e+02 Score=21.05 Aligned_cols=32 Identities=25% Similarity=0.356 Sum_probs=25.5
Q ss_pred CCcEEEEEe----CCCCccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHG----TMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG----~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
+.|+||+|- ++|+.||..--..-..+..+.|+
T Consensus 448 k~pvlllhplggwtkdddvpl~~rmkqh~avl~e~v 483 (627)
T KOG4238|consen 448 KHPVLLLHPLGGWTKDDDVPLDWRMKQHAAVLEEGV 483 (627)
T ss_pred cCceEEEecCCCCccCCCccchhhhHHHHHHHHhcc
Confidence 589999996 68999999776666667677776
No 317
>COG0337 AroB 3-dehydroquinate synthetase [Amino acid transport and metabolism]
Probab=28.58 E-value=1.2e+02 Score=20.83 Aligned_cols=38 Identities=18% Similarity=0.104 Sum_probs=30.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.++++++ |+.|..-....+...|++.|+.+..++.|+|
T Consensus 34 ~k~~ivt---d~~v~~~y~~~~~~~l~~~g~~v~~~~lp~G 71 (360)
T COG0337 34 RKVAIVT---DETVAPLYLEKLLATLEAAGVEVDSIVLPDG 71 (360)
T ss_pred CeEEEEE---CchhHHHHHHHHHHHHHhcCCeeeEEEeCCC
Confidence 4566664 6778777889999999999999988888875
No 318
>COG3509 LpqC Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=28.50 E-value=9.6 Score=25.38 Aligned_cols=17 Identities=24% Similarity=0.344 Sum_probs=12.7
Q ss_pred CcEEEEEeCCCCccCHH
Q psy472 28 KQYLLVHGTMDDNVHFQ 44 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~ 44 (68)
.+++-+||+.|+.+|++
T Consensus 191 ~~~m~~~G~~Dp~~p~~ 207 (312)
T COG3509 191 VSVMAFHGTADPLNPYH 207 (312)
T ss_pred hhHHHhcCCCCCCCCCC
Confidence 56777888888877764
No 319
>PRK11172 dkgB 2,5-diketo-D-gluconate reductase B; Provisional
Probab=28.37 E-value=1.4e+02 Score=18.75 Aligned_cols=32 Identities=16% Similarity=0.151 Sum_probs=24.7
Q ss_pred CCcEEEEEeCCC-CccCHHHHHHHHHHHHHcCC
Q psy472 27 DKQYLLVHGTMD-DNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 27 ~~p~ll~hG~~D-~~V~~~~s~~l~~~L~~~g~ 58 (68)
..-++++|.-.. ..++...+.....+|++.|.
T Consensus 90 ~iDl~~lH~~~~~~~~~~~~~~~~l~~l~~~Gk 122 (267)
T PRK11172 90 YVDLTLIHWPSPNDEVSVEEFMQALLEAKKQGL 122 (267)
T ss_pred ceEEEEeCCCCCCCCCCHHHHHHHHHHHHHCCC
Confidence 378999998532 34688888888888888886
No 320
>cd04951 GT1_WbdM_like This family is most closely related to the GT1 family of glycosyltransferases and is named after WbdM in Escherichia coli. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have
Probab=28.36 E-value=1.5e+02 Score=18.41 Aligned_cols=40 Identities=8% Similarity=0.011 Sum_probs=26.7
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+++++-|.-...=......+.+..+.+.+.++++.++.+
T Consensus 188 ~~~~l~~g~~~~~kg~~~li~a~~~l~~~~~~~~l~i~G~ 227 (360)
T cd04951 188 TFVILAVGRLVEAKDYPNLLKAFAKLLSDYLDIKLLIAGD 227 (360)
T ss_pred CEEEEEEeeCchhcCcHHHHHHHHHHHhhCCCeEEEEEcC
Confidence 5677777766655555666666666666666778877754
No 321
>PRK02395 hypothetical protein; Provisional
Probab=28.32 E-value=1.5e+02 Score=18.98 Aligned_cols=27 Identities=22% Similarity=0.254 Sum_probs=17.8
Q ss_pred EEEE-EeCCCCccCHHHHHHHHHHHHHc
Q psy472 30 YLLV-HGTMDDNVHFQQSMMLAKSLQHA 56 (68)
Q Consensus 30 ~ll~-hG~~D~~V~~~~s~~l~~~L~~~ 56 (68)
++|+ ||+.++--.-+....+++.|++.
T Consensus 138 lvlvgHGS~~~~~a~~~~~~~a~~l~~~ 165 (279)
T PRK02395 138 LAVVGHGTERNENSAKAIYYHADRLRER 165 (279)
T ss_pred EEEEecCCCCchhHHHHHHHHHHHHHhh
Confidence 4444 99987644445666777777664
No 322
>cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=28.31 E-value=96 Score=19.99 Aligned_cols=25 Identities=8% Similarity=0.057 Sum_probs=21.6
Q ss_pred CHHHHHHHHHHHHHcCCceEEEEec
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.-++.+++++.++++++|++.++++
T Consensus 23 s~~ev~~v~~~~r~~~iP~D~i~lD 47 (292)
T cd06595 23 SDEEYLALMDRFKKHNIPLDVLVID 47 (292)
T ss_pred CHHHHHHHHHHHHHhCCCccEEEEe
Confidence 4578899999999999999988764
No 323
>PF02233 PNTB: NAD(P) transhydrogenase beta subunit; InterPro: IPR012136 NAD(P) transhydrogenase catalyses the transfer of reducing equivalents between NAD(H) and NADP(H), coupled to the translocation of protons across a membrane []. It is an integral membrane protein found in most organisms except for yeasts, plants and some bacterial species. In bacterial species it is located in the cytoplasmic membrane, while in mitochondria it is located in the inner membrane. Under most physiological conditions this enzyme synthesises NADPH, driven by consumption of the proton electrochemical gradient. The resulting NADPH is subsequently used for biosynthetic reactions or the reduction of glutathione. The global structure of this enzyme is similar in all organisms, consisting of three distinct domains, though the polypeptide composition can vary. Domain I binds NAD(+)/NADH, domain II is a hydrophobic membrane-spanning domain, and domain III binds NADP(+)/NADPH. Domain I is composed of two subdomains, both of which form a Rossman fold, while domain III consists of a single Rossman fold where the NADP(+) is flipped relative to the normal orientation of bound nucleotides within the Rossman fold [, , ]. Several residues within these domains are thought to make functionally important interdomain contacts for hydride transfer between these domains []. Proton translocation occurs through domain II and is thought to induce conformational changes which are transmitted across domain III to the site of hydride transfer between domains I and III. This entry represents the beta subunit found in bacterial two-subunit NADP(H) transhydrogenases. This subunit forms domain III and part of the transmembrane domain II. ; GO: 0008750 NAD(P)+ transhydrogenase (AB-specific) activity, 0050661 NADP binding, 0055114 oxidation-reduction process, 0016021 integral to membrane; PDB: 1PT9_A 1DJL_A 1U31_B 2BRU_C 1PTJ_C 1HZZ_C 2FRD_C 2FSV_C 1XLT_C 1U2G_C ....
Probab=28.31 E-value=93 Score=22.01 Aligned_cols=39 Identities=15% Similarity=0.049 Sum_probs=23.5
Q ss_pred CcEEEEEeCCCCccCH-HHHHHHHHHHHHcCCceEEEEec
Q psy472 28 KQYLLVHGTMDDNVHF-QQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~-~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
..++|+=|-.=-.-.- ....+|++.|+++|+.+++-+-|
T Consensus 308 ~~VvIVPGYGmAvAqAQ~~v~el~~~L~~~G~~V~faIHP 347 (463)
T PF02233_consen 308 KKVVIVPGYGMAVAQAQHAVAELADLLEERGVEVKFAIHP 347 (463)
T ss_dssp SEEEEEESHHHHHCTTHHHHHHHHHHHHHTT-EEEEEE-T
T ss_pred CceEEecCchHHHHHHHHHHHHHHHHHHhCCCEEEEEecc
Confidence 4566665532222222 23457999999999999987755
No 324
>cd03060 GST_N_Omega_like GST_N family, Omega-like subfamily; composed of uncharacterized proteins with similarity to class Omega GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Omega GSTs show little or no GSH-conjugating activity towards standard GST substrates. Instead, they catalyze the GSH dependent reduction of protein disulfides, dehydroascorbate and monomethylarsonate, activities which are more characteristic of glutaredoxins. Like Omega enzymes, proteins in this subfamily contain a conserved cysteine equivalent to the first cysteine in the CXXC motif of glutaredoxins, which is a r
Probab=28.30 E-value=82 Score=15.27 Aligned_cols=22 Identities=23% Similarity=0.129 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHcCCceEEEEec
Q psy472 45 QSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+.+...+|..+|++++.....
T Consensus 11 ~~~rv~~~L~~~gl~~e~~~v~ 32 (71)
T cd03060 11 YAMRARMALLLAGITVELREVE 32 (71)
T ss_pred HHHHHHHHHHHcCCCcEEEEeC
Confidence 4566677788999999887653
No 325
>PF07137 VDE: Violaxanthin de-epoxidase (VDE); InterPro: IPR010788 This family represents a conserved region approximately 350 residues long within plant violaxanthin de-epoxidase (VDE). In higher plants, violaxanthin de-epoxidase forms part of a conserved system that dissipates excess energy as heat in the light-harvesting complexes of photosystem II (PSII), thus protecting them from photo-inhibitory damage [].; GO: 0046422 violaxanthin de-epoxidase activity, 0055114 oxidation-reduction process, 0009507 chloroplast; PDB: 3CQN_B 3CQR_A.
Probab=28.10 E-value=40 Score=21.12 Aligned_cols=26 Identities=19% Similarity=0.125 Sum_probs=19.0
Q ss_pred CCCCccCHHHHHHHHHHHHHcCCceE
Q psy472 36 TMDDNVHFQQSMMLAKSLQHADIMFQ 61 (68)
Q Consensus 36 ~~D~~V~~~~s~~l~~~L~~~g~~~~ 61 (68)
+.+...|.+...++.+++++.|+.+.
T Consensus 154 trs~~lP~s~~p~l~~aa~k~G~d~~ 179 (198)
T PF07137_consen 154 TRSPTLPESIVPELRRAAKKAGIDFS 179 (198)
T ss_dssp ESSSS--GGGHHHHHHHHHHTT--GG
T ss_pred eCCCCCChHHhHHHHHHHHHhCCCHH
Confidence 35788999999999999999999763
No 326
>PF09413 DUF2007: Domain of unknown function (DUF2007); InterPro: IPR018551 This is a family of proteins with unknown function. ; PDB: 2HFV_A.
Probab=28.08 E-value=84 Score=15.29 Aligned_cols=20 Identities=10% Similarity=-0.019 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHcCCceEE
Q psy472 43 FQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~ 62 (68)
.-++..+...|+.+|+++..
T Consensus 9 ~~ea~~i~~~L~~~gI~~~v 28 (67)
T PF09413_consen 9 PIEAELIKGLLEENGIPAFV 28 (67)
T ss_dssp HHHHHHHHHHHHHTT--EE-
T ss_pred HHHHHHHHHHHHhCCCcEEE
Confidence 56778888999999998764
No 327
>PF13369 Transglut_core2: Transglutaminase-like superfamily
Probab=28.02 E-value=1.3e+02 Score=17.40 Aligned_cols=28 Identities=0% Similarity=0.025 Sum_probs=24.8
Q ss_pred ccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
-.|+.-+.-+..-.++.|++++..-||+
T Consensus 65 G~Pi~L~ily~~va~rlGl~~~~v~~Pg 92 (152)
T PF13369_consen 65 GIPISLAILYLEVARRLGLPAEPVNFPG 92 (152)
T ss_pred CCcHHHHHHHHHHHHHcCCeEEEEecCC
Confidence 5788888889999999999999999886
No 328
>cd04333 ProX_deacylase This CD, composed mainly of bacterial single-domain proteins, includes the Thermus thermophilus (Tt) YbaK-like protein, a homolog of the trans-acting Escherichia coli YbaK Cys-tRNA(Pro) deacylase and the Agrobacterium tumefaciens ProX Ala-tRNA(Pro) deacylase and also the cis-acting prolyl-tRNA synthetase-editing domain (ProRS-INS). While ProX and ProRS-INS hydrolyze misacylated Ala-tRNA(Pro), the E. coli YbaK hydrolyzes misacylated Cys-tRNA(Pro). A few CD members are N-terminal, YbaK-ProX-like domains of an uncharacterized protein with a C-terminal, predicted Fe-S protein domain.
Probab=27.79 E-value=82 Score=17.94 Aligned_cols=20 Identities=10% Similarity=-0.057 Sum_probs=17.1
Q ss_pred HHHHHHHHHcCCceEEEEec
Q psy472 47 MMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 47 ~~l~~~L~~~g~~~~~~~~~ 66 (68)
.++.+.|.++++.++...++
T Consensus 2 ~~~~~~L~~~~i~~~~~~~~ 21 (148)
T cd04333 2 ERVRAFLAARGLDLEVIELP 21 (148)
T ss_pred HHHHHHHHHCCCCCeEEECC
Confidence 46788899999999988887
No 329
>PRK05720 mtnA methylthioribose-1-phosphate isomerase; Reviewed
Probab=27.73 E-value=1.1e+02 Score=20.62 Aligned_cols=27 Identities=22% Similarity=0.035 Sum_probs=21.1
Q ss_pred CCccCHHHHHH-HHHHHHHcCCceEEEE
Q psy472 38 DDNVHFQQSMM-LAKSLQHADIMFQSQE 64 (68)
Q Consensus 38 D~~V~~~~s~~-l~~~L~~~g~~~~~~~ 64 (68)
++.-|..|..+ ++..|.+.|+++.++.
T Consensus 186 ~EsRP~~qG~~lta~eL~~~GI~vtlI~ 213 (344)
T PRK05720 186 DETRPRLQGARLTAWELYQAGIDVTVIT 213 (344)
T ss_pred cCCCChhhhHHHHHHHHHHCCCCEEEEc
Confidence 45667788775 5899999999998763
No 330
>COG2344 AT-rich DNA-binding protein [General function prediction only]
Probab=27.53 E-value=1.1e+02 Score=19.36 Aligned_cols=23 Identities=17% Similarity=0.062 Sum_probs=19.5
Q ss_pred ccCHHHHHHHHHHHHHcCCceEE
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
.||-.++...++.|.++|+.--+
T Consensus 153 tVPa~~AQ~vad~Lv~aGVkGIl 175 (211)
T COG2344 153 TVPAEHAQEVADRLVKAGVKGIL 175 (211)
T ss_pred EccHHHHHHHHHHHHHcCCceEE
Confidence 58999999999999999986543
No 331
>PRK05907 hypothetical protein; Provisional
Probab=27.51 E-value=1.5e+02 Score=19.57 Aligned_cols=16 Identities=13% Similarity=0.233 Sum_probs=12.1
Q ss_pred HhhcCCCCcEEEEEeCCC
Q psy472 21 KVDRIRDKQYLLVHGTMD 38 (68)
Q Consensus 21 ~~~~~~~~p~ll~hG~~D 38 (68)
.+++- .|+.+++|+..
T Consensus 13 ~~~~~--~~~y~~~g~~~ 28 (311)
T PRK05907 13 YYEEK--RPAVIVIGSSS 28 (311)
T ss_pred HHhcC--CceEEEecCCc
Confidence 34444 39999999888
No 332
>PF00763 THF_DHG_CYH: Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain; InterPro: IPR020630 Enzymes that participate in the transfer of one-carbon units require the coenzyme tetrahydrofolate (THF). Various reactions generate one-carbon derivatives of THF, which can be interconverted between different oxidation states by methylene-THF dehydrogenase (1.5.1.5 from EC), methenyl-THF cyclohydrolase (3.5.4.9 from EC) and formyl-THF synthetase (6.3.4.3 from EC) [, ]. The dehydrogenase and cyclohydrolase activities are expressed by a variety of multifunctional enzymes, including the tri-functional eukaryotic C1-tetrahydrofolate synthase []; a bifunctional eukaryotic mitochondrial protein; and the bifunctional Escherichia coli folD protein [, ]. Methylene-tetrahydrofolate dehydrogenase and methenyltetrahydrofolate cyclo-hydrolase share an overlapping active site [], and as such are usually located together in proteins, acting in tandem on the carbon-nitrogen bonds of substrates other than peptide bonds. This entry represents the N-terminal catalytic domain of these enzymes. ; GO: 0003824 catalytic activity, 0004488 methylenetetrahydrofolate dehydrogenase (NADP+) activity, 0009396 folic acid-containing compound biosynthetic process, 0055114 oxidation-reduction process; PDB: 2C2X_B 2C2Y_A 1EDZ_A 1EE9_A 4A26_B 3NGL_C 3NGX_A 1B0A_A 1DIA_A 1A4I_B ....
Probab=27.48 E-value=1.2e+02 Score=16.87 Aligned_cols=40 Identities=18% Similarity=0.048 Sum_probs=24.7
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+|.|.+--..|+--.........+..++.|+.++.+.|+.
T Consensus 29 ~P~Laii~vg~d~~S~~Y~~~k~k~~~~~Gi~~~~~~l~~ 68 (117)
T PF00763_consen 29 TPKLAIILVGDDPASISYVRSKQKAAEKLGIEFELIELPE 68 (117)
T ss_dssp --EEEEEEES--HHHHHHHHHHHHHHHHHT-EEEEEEE-T
T ss_pred CcEEEEEecCCChhHHHHHHHHHHHHHHcCCceEEEECCC
Confidence 5777666656655445555666688899999999998864
No 333
>PF08444 Gly_acyl_tr_C: Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region; InterPro: IPR013652 This entry represents mammalian-specific glycine N-acyltransferase (also called aralkyl acyl-CoA:amino acid N-acyltransferase; 2.3.1.13 from EC). Mitochondrial acyltransferases catalyse the transfer of an acyl group from acyl-CoA to the N terminus of glycine to produce N-acylglycine. These enzymes can conjugate a multitude of substrates to form a variety of N-acylglycines. The CoA derivatives of a number of aliphatic and aromatic acids, but not phenylacetyl-CoA or (indol-3-yl)acetyl-CoA, can act as donor [, ].
Probab=27.45 E-value=84 Score=17.11 Aligned_cols=20 Identities=20% Similarity=0.138 Sum_probs=15.7
Q ss_pred HHHHHHHHHHcCCceEEEEe
Q psy472 46 SMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 46 s~~l~~~L~~~g~~~~~~~~ 65 (68)
+..++++|.++|+++-.++-
T Consensus 41 ~~~~~~~L~~~g~P~Y~hv~ 60 (89)
T PF08444_consen 41 MYHLAQYLHKLGFPFYGHVD 60 (89)
T ss_pred HHHHHHHHHHCCCCeEeehH
Confidence 45678889999999877653
No 334
>PF02082 Rrf2: Transcriptional regulator; InterPro: IPR000944 The following uncharacterised bacterial proteins have been shown to be evolutionary related, Desulfovibrio vulgaris protein Rrf2; Escherichia coli hypothetical proteins yfhP and yjeB; Bacillus subtilis hypothetical proteins yhdE, yrzC and ywgB; Mycobacterium tuberculosis hypothetical protein Rv1287; and Synechocystis sp. (strain PCC 6803) hypothetical protein slr0846. These are small proteins of 12 to 18kDa which seem to contain a signal sequence, and may represent a family of probable transcriptional regulators.; PDB: 3T8T_A 3T8R_A 3K69_A 3LWF_C 1XD7_A 2Y75_E 1YLF_C.
Probab=27.44 E-value=97 Score=15.89 Aligned_cols=20 Identities=10% Similarity=0.059 Sum_probs=16.7
Q ss_pred CccCHHHHHHHHHHHHHcCC
Q psy472 39 DNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~ 58 (68)
..+|.....++.+.|+++|+
T Consensus 35 ~~i~~~~l~kil~~L~~~Gl 54 (83)
T PF02082_consen 35 LGISPSYLRKILQKLKKAGL 54 (83)
T ss_dssp HTS-HHHHHHHHHHHHHTTS
T ss_pred HCcCHHHHHHHHHHHhhCCe
Confidence 36788999999999999997
No 335
>TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase. This model represents a clade of amino-acid N-acetyltransferases acting mainly on glutamate in the first step of the "acetylated" ornithine biosynthesis pathway. For this reason it is also called N-acetylglutamate synthase. The enzyme may also act on aspartate.
Probab=27.44 E-value=1.5e+02 Score=20.12 Aligned_cols=27 Identities=19% Similarity=0.212 Sum_probs=19.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.++.|+||.. -++.+.|.+.|++.++.
T Consensus 49 ~~~vlVHGgg---------~~i~~~~~~~g~~~~~~ 75 (429)
T TIGR01890 49 VRLVLVHGAR---------PQIERILAARGRTPHYH 75 (429)
T ss_pred CcEEEEcCCC---------HHHHHHHHHcCCCceee
Confidence 5889999987 45666677888876654
No 336
>PF04282 DUF438: Family of unknown function (DUF438); InterPro: IPR007380 This is a a group of uncharacterised proteins.
Probab=27.26 E-value=79 Score=16.46 Aligned_cols=22 Identities=9% Similarity=0.107 Sum_probs=17.5
Q ss_pred ccCHHHHHHHHHHHHHcCCceE
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQ 61 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~ 61 (68)
.|++.....+-+.|.+.|++++
T Consensus 28 ~Vs~~EI~~~Eq~Li~eG~~~e 49 (71)
T PF04282_consen 28 DVSASEISAAEQELIQEGMPVE 49 (71)
T ss_pred CCCHHHHHHHHHHHHHcCCCHH
Confidence 5778888888888888888753
No 337
>PF00496 SBP_bac_5: Bacterial extracellular solute-binding proteins, family 5 Middle; InterPro: IPR000914 Bacterial high affinity transport systems are involved in active transport of solutes across the cytoplasmic membrane. The protein components of these traffic systems include one or two transmembrane protein components, one or two membrane-associated ATP-binding proteins and a high affinity periplasmic solute-binding protein. The latter are thought to bind the substrate in the vicinity of the inner membrane, and to transfer it to a complex of inner membrane proteins for concentration into Gram-positive bacteria which are surrounded by a single membrane and therefore have no periplasmic region the equivalent proteins are bound to the membrane via an N-terminal lipid anchor. These homologue proteins do not play an integral role in the transport process per se, but probably serve as receptors to trigger or initiate translocation of the solute throught the membrane by binding to external sites of the integral membrane proteins of the efflux system. In addition at least some solute-binding proteins function in the initiation of sensory transduction pathways. On the basis of sequence similarities, the vast majority of these solute-binding proteins can be grouped [] into eight families of clusters, which generally correlate with the nature of the solute bound. Family 5 currently includes periplasmic oligopeptide-binding proteins (oppA) of Gram-negative bacteria and homologous lipoproteins in Gram-positive bacteria (oppA, amiA or appA); periplasmic dipeptide-binding proteins of Escherichia coli (dppA) and Bacillus subtilis (dppE); periplasmic murein peptide-binding protein of E. coli (mppA); periplasmic peptide-binding proteins sapA of E. coli, Salmonella typhimurium and Haemophilus influenzae; periplasmic nickel-binding protein (nikA) of E. coli; haem-binding lipoprotein (hbpA or dppA) from H. influenzae; lipoprotein xP55 from Streptomyces lividans; and hypothetical proteins from H. influenzae (HI0213) and Rhizobium sp. (strain NGR234) symbiotic plasmid (y4tO and y4wM).; GO: 0005215 transporter activity, 0006810 transport; PDB: 1B51_A 1B0H_A 1QKA_A 1B9J_A 1B6H_A 1OLA_A 1B3L_C 1JEV_A 1B5H_A 1JET_A ....
Probab=27.22 E-value=1.2e+02 Score=19.40 Aligned_cols=36 Identities=19% Similarity=0.190 Sum_probs=28.7
Q ss_pred EEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 31 LLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 31 ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
+++--..+......-+..+.+.|++.|+.+++...+
T Consensus 296 ~~~~~~~~~~~~~~~a~~l~~~l~~~Gi~v~i~~~~ 331 (374)
T PF00496_consen 296 LIILYTSDDPIWKAIAEALQEQLKKIGIKVEIKPVD 331 (374)
T ss_dssp EEEEEETTSHHHHHHHHHHHHHHHHTTEEEEEEEES
T ss_pred ccccccccccchHHHHHHHHHHHhhcceeEEEEEeC
Confidence 444455667777799999999999999999887764
No 338
>PF01884 PcrB: PcrB family; InterPro: IPR008205 This entry represents geranylgeranylglyceryl phosphate (GGGP) synthase, which is a prenyltransferase that catalyses the transfer of the geranylgeranyl moiety of geranylgeranyl diphosphate (GGPP) to the C3 hydroxyl of sn-glycerol-1-phosphate (G1P). This reaction is the first ether-bond-formation step in the biosynthesis of archaeal membrane lipids. This entry also matches putative glycerol-1-phosphate prenyltransferases that may catalyse the transfer of a prenyl moiety to sn-glycerol-1-phosphate (G1P) []. Some of the prokaryotic proteins in this family are related to pcrB. The Staphylococcus aureus chromosomal gene pcrA encodes a protein with significant similarity (40% identity) to two Escherichia coli helicases: the helicase II encoded by the uvrD gene and the Rep helicase. PcrB gene seems to belong to an operon containing at least one other gene, pcrBA, downstream from pcrB []. The PcrB proteins often contain an FMN binding site although the function of these proteins is still unknown.; GO: 0016765 transferase activity, transferring alkyl or aryl (other than methyl) groups; PDB: 1VIZ_A 2F6X_B 2F6U_B 3VKD_A 3VKA_A 3VK5_B 3VKC_B 3VKB_B.
Probab=27.20 E-value=89 Score=19.93 Aligned_cols=40 Identities=13% Similarity=0.083 Sum_probs=25.4
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+... .+..+++-|+. ..+..++...+.++. .+.-..+||+
T Consensus 28 ~~~~-gtDai~VGGS~-~~~~~d~vv~~ik~~----~~lPvilfPg 67 (230)
T PF01884_consen 28 ACES-GTDAIIVGGSD-TGVTLDNVVALIKRV----TDLPVILFPG 67 (230)
T ss_dssp HHCT-T-SEEEEE-ST-HCHHHHHHHHHHHHH----SSS-EEEETS
T ss_pred HHhc-CCCEEEECCCC-CccchHHHHHHHHhc----CCCCEEEeCC
Confidence 3344 47888888877 788888888777765 4444566665
No 339
>TIGR01357 aroB 3-dehydroquinate synthase. This model represents 3-dehydroquinate synthase, the enzyme catalyzing the second of seven steps in the shikimate pathway of chorismate biosynthesis. Chorismate is the last common intermediate in the biosynthesis of all three aromatic amino acids.
Probab=27.13 E-value=1.9e+02 Score=18.99 Aligned_cols=37 Identities=11% Similarity=0.041 Sum_probs=21.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.++|++.| ..+.-.....+.+.|.+.|+.+...+|++
T Consensus 21 ~~~livtd---~~~~~~~~~~v~~~L~~~g~~~~~~~~~~ 57 (344)
T TIGR01357 21 SKLVIITD---ETVADLYADKLLEALQALGYNVLKLTVPD 57 (344)
T ss_pred CeEEEEEC---CchHHHHHHHHHHHHHhcCCceeEEEeCC
Confidence 35566653 44433345566667777777766555653
No 340
>cd04336 YeaK YeaK is an uncharacterized Echerichia coli protein with a YbaK-like domain of unknown function. The YbaK-like domain family includes the INS amino acid-editing domain of the bacterial class II prolyl tRNA synthetase (ProRS), and it's trans-acting homologs, YbaK, and ProX. The primary function of INS is to hydrolyze mischarged cysteinyl-tRNA(Pro)'s, thus helping ensure the fidelity of translation. Organisms whose ProRS lacks the INS domain express a single-domain INS homolog such as YbaK, ProX, or PrdX which supplies the function of INS in trans.
Probab=26.81 E-value=1.1e+02 Score=17.48 Aligned_cols=20 Identities=15% Similarity=0.106 Sum_probs=16.0
Q ss_pred HHHHHHHHcCCceEEEEecC
Q psy472 48 MLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 48 ~l~~~L~~~g~~~~~~~~~~ 67 (68)
++.+.|.+.|++++.+-+++
T Consensus 3 ~v~~~L~~~~i~y~~~~~~~ 22 (153)
T cd04336 3 RLQELLNTNGARFRVLDHPP 22 (153)
T ss_pred HHHHHHHHCCCCEEEEecCC
Confidence 56778999999999887664
No 341
>cd00738 HGTP_anticodon HGTP anticodon binding domain, as found at the C-terminus of histidyl, glycyl, threonyl and prolyl tRNA synthetases, which are classified as a group of class II aminoacyl-tRNA synthetases (aaRS). In aaRSs, the anticodon binding domain is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. This domain is also found in the accessory subunit of mitochondrial polymerase gamma (Pol gamma b).
Probab=26.63 E-value=98 Score=15.58 Aligned_cols=20 Identities=15% Similarity=0.011 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHcCCceEE
Q psy472 43 FQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~ 62 (68)
...+..++..|++.|+.+++
T Consensus 17 ~~~a~~~~~~Lr~~g~~v~~ 36 (94)
T cd00738 17 REYAQKLLNALLANGIRVLY 36 (94)
T ss_pred HHHHHHHHHHHHHCCCEEEe
Confidence 45677788888888876665
No 342
>cd08197 DOIS 2-deoxy-scyllo-inosose synthase (DOIS) catalyzes carbocycle formation from D-glucose-6-phosphate to 2-deoxy-scyllo-inosose through a multistep reaction in the biosynthesis of aminoglycoside antibiotics. 2-deoxy-scyllo-inosose synthase (DOIS) catalyzes carbocycle formation from D-glucose-6-phosphate to 2-deoxy-scyllo-inosose through a multistep reaction in the biosynthesis of aminoglycoside antibiotics. 2-Deoxystreptamine (DOS)-containing aminoglycoside antibiotics includes neomycin, kanamycin, gentamicin, and ribostamycin. They are important antibacterial agents. DOIS is a homologue of the dehydroquinate synthase which catalyzes the cyclization of 3-deoxy-D-arabino-heputulosonate-7-phosphate to dehydroquinate (DHQ) in the shikimate pathway.
Probab=26.47 E-value=1.5e+02 Score=19.87 Aligned_cols=37 Identities=24% Similarity=0.191 Sum_probs=24.3
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..++++.+ .++.-.....+.+.|++.|+.+..++|++
T Consensus 24 ~rvlvVtd---~~v~~~~~~~l~~~L~~~g~~~~~~~~~~ 60 (355)
T cd08197 24 DKYLLVTD---SNVEDLYGHRLLEYLREAGAPVELLSVPS 60 (355)
T ss_pred CeEEEEEC---ccHHHHHHHHHHHHHHhcCCceEEEEeCC
Confidence 45666655 33433355667788888888887777764
No 343
>PRK15113 glutathione S-transferase; Provisional
Probab=26.43 E-value=1.5e+02 Score=17.72 Aligned_cols=37 Identities=14% Similarity=0.066 Sum_probs=24.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.|.+.++|.....-++. .+..-.|...|++++.+..+
T Consensus 3 ~~~~~Ly~~~~~~s~~~--~rv~~~l~e~gi~~e~~~v~ 39 (214)
T PRK15113 3 KPAITLYSDAHFFSPYV--MSAFVALQEKGLPFELKTVD 39 (214)
T ss_pred CCeEEEEeCCCCCCchH--HHHHHHHHHcCCCCeEEEeC
Confidence 56777787655444544 45555578889999887654
No 344
>COG1282 PntB NAD/NADP transhydrogenase beta subunit [Energy production and conversion]
Probab=26.41 E-value=82 Score=22.01 Aligned_cols=55 Identities=9% Similarity=0.091 Sum_probs=33.3
Q ss_pred HHHhcChhHHhhcCCC-CcEEEEEeCCCCccCHHHH-HHHHHHHHHcCCceEEEEec
Q psy472 12 GYKIAALNNKVDRIRD-KQYLLVHGTMDDNVHFQQS-MMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 12 ~y~~~sp~~~~~~~~~-~p~ll~hG~~D~~V~~~~s-~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+++.|+-..+...++ ..+.+.=|-.=-+..-+|+ .+++++|+++|+++++-+-|
T Consensus 292 ~vk~~saedaA~~l~nA~sVIIvPGYGmAVAQAQh~v~E~~~~L~~~Gv~VrfaIHP 348 (463)
T COG1282 292 EVKEGSAEDAAELLKNASSVIIVPGYGMAVAQAQHPVAEITEKLRARGVNVRFAIHP 348 (463)
T ss_pred ceeccCHHHHHHHHhCCCeEEEecCchHHHHhhhhHHHHHHHHHHhcCCeeeEeecc
Confidence 3445555444433322 4556665544444444444 45999999999999887654
No 345
>PRK12899 secA preprotein translocase subunit SecA; Reviewed
Probab=26.41 E-value=2e+02 Score=22.49 Aligned_cols=31 Identities=16% Similarity=0.139 Sum_probs=26.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
..|+|+... .++.|+.++..|.+.|+++.+.
T Consensus 568 grPvLigt~------si~~se~ls~~L~~~gi~h~vL 598 (970)
T PRK12899 568 GNPILIGTE------SVEVSEKLSRILRQNRIEHTVL 598 (970)
T ss_pred CCCEEEEeC------cHHHHHHHHHHHHHcCCcceec
Confidence 589999875 5789999999999999998764
No 346
>cd00423 Pterin_binding Pterin binding enzymes. This family includes dihydropteroate synthase (DHPS) and cobalamin-dependent methyltransferases such as methyltetrahydrofolate, corrinoid iron-sulfur protein methyltransferase (MeTr) and methionine synthase (MetH). DHPS, a functional homodimer, catalyzes the condensation of p-aminobenzoic acid (pABA) in the de novo biosynthesis of folate, which is an essential cofactor in both nucleic acid and protein biosynthesis. Prokaryotes (and some lower eukaryotes) must synthesize folate de novo, while higher eukaryotes are able to utilize dietary folate and therefore lack DHPS. Sulfonamide drugs, which are substrate analogs of pABA, target DHPS. Cobalamin-dependent methyltransferases catalyze the transfer of a methyl group via a methyl- cob(III)amide intermediate. These include MeTr, a functional heterodimer, and the folate binding domain of MetH.
Probab=26.33 E-value=1.8e+02 Score=18.40 Aligned_cols=43 Identities=9% Similarity=-0.056 Sum_probs=28.0
Q ss_pred hhcCCCCcEEEEEeCCCCcc-------C------HHHHHHHHHHHHHcCCceEEEEe
Q psy472 22 VDRIRDKQYLLVHGTMDDNV-------H------FQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V-------~------~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
+... +.|+.++|-..+..- + .+...+..+.|.+.|++.+-+++
T Consensus 116 ~~~~-~~~vV~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~Gi~~~~Iil 171 (258)
T cd00423 116 AAEY-GAPVVLMHMDGTPQTMQNNPYYADVVDEVVEFLEERVEAATEAGIPPEDIIL 171 (258)
T ss_pred HHHc-CCCEEEECcCCCCcccccCCCcchHHHHHHHHHHHHHHHHHHcCCCHHHEEE
Confidence 4455 689999996543320 1 35666777788899987654443
No 347
>PRK12901 secA preprotein translocase subunit SecA; Reviewed
Probab=26.30 E-value=1e+02 Score=24.33 Aligned_cols=30 Identities=17% Similarity=0.191 Sum_probs=26.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
..|+|+.-. .++.|+.+.+.|++.|+++++
T Consensus 628 GrPVLVGT~------SVe~SE~lS~~L~~~gI~H~V 657 (1112)
T PRK12901 628 GRPVLVGTT------SVEISELLSRMLKMRKIPHNV 657 (1112)
T ss_pred CCCEEEEeC------cHHHHHHHHHHHHHcCCcHHH
Confidence 589999765 499999999999999998764
No 348
>PRK07535 methyltetrahydrofolate:corrinoid/iron-sulfur protein methyltransferase; Validated
Probab=26.13 E-value=1.8e+02 Score=18.58 Aligned_cols=42 Identities=7% Similarity=0.003 Sum_probs=27.8
Q ss_pred hhcCCCCcEEEEEeCCCCccC------HHHHHHHHHHHHHcCCceEEEEe
Q psy472 22 VDRIRDKQYLLVHGTMDDNVH------FQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~------~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
+... +.|+.++|... .-+| .+....+.+.+.+.|++-+-+++
T Consensus 112 ~~~~-g~~vv~m~~~~-~g~P~t~~~~~~~l~~~v~~a~~~GI~~~~Iil 159 (261)
T PRK07535 112 VKKY-NAPVVALTMDD-TGIPKDAEDRLAVAKELVEKADEYGIPPEDIYI 159 (261)
T ss_pred HHHh-CCCEEEEecCC-CCCCCCHHHHHHHHHHHHHHHHHcCCCHhHEEE
Confidence 4455 58999999632 3344 45566778888999997544443
No 349
>KOG0673|consensus
Probab=25.89 E-value=2e+02 Score=18.88 Aligned_cols=60 Identities=22% Similarity=0.256 Sum_probs=44.0
Q ss_pred CCCCCcCCCHHHHHhc-ChhHHhhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 1 MGLPTFEDNLEGYKIA-ALNNKVDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 1 lG~p~~~~~~~~y~~~-sp~~~~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
||.| . |.+.|.-- .-+.++-.+ .|==++|-..|-.|--.|...|...|++.-.+|.-+.+
T Consensus 201 lGVP--F-nIASYsLLT~miAhv~gl--~pgdfiH~lGdahvy~~Hv~al~~Ql~r~PrpFPkl~i 261 (293)
T KOG0673|consen 201 LGVP--F-NIASYSLLTCMIAHVCGL--KPGDFIHVLGDAHVYKDHVDALQEQLQRPPRPFPKLKI 261 (293)
T ss_pred cCcc--c-hhHHHHHHHHHHHHHhCC--CCCceEEecchhhhhHHHHHHHHHHHhcCCCCCCceee
Confidence 4666 3 45555432 245566666 57778999999999999999999999998888755443
No 350
>cd03059 GST_N_SspA GST_N family, Stringent starvation protein A (SspA) subfamily; SspA is a RNA polymerase (RNAP)-associated protein required for the lytic development of phage P1 and for stationary phase-induced acid tolerance of E. coli. It is implicated in survival during nutrient starvation. SspA adopts the GST fold with an N-terminal TRX-fold domain and a C-terminal alpha helical domain, but it does not bind glutathione (GSH) and lacks GST activity. SspA is highly conserved among gram-negative bacteria. Related proteins found in Neisseria (called RegF), Francisella and Vibrio regulate the expression of virulence factors necessary for pathogenesis.
Probab=25.87 E-value=90 Score=14.92 Aligned_cols=22 Identities=9% Similarity=-0.010 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHcCCceEEEEec
Q psy472 45 QSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+......|...|++++...+.
T Consensus 11 ~~~~v~~~l~~~gi~~~~~~v~ 32 (73)
T cd03059 11 YSHRVRIVLAEKGVSVEIIDVD 32 (73)
T ss_pred hHHHHHHHHHHcCCccEEEEcC
Confidence 4566666789999999886543
No 351
>PF00248 Aldo_ket_red: Aldo/keto reductase family; InterPro: IPR023210 The aldo-keto reductase family includes a number of related monomeric NADPH-dependent oxidoreductases, such as aldehyde reductase, aldose reductase, prostaglandin F synthase, xylose reductase, rho crystallin, and many others []. All possess a similar structure, with a beta-alpha-beta fold characteristic of nucleotide binding proteins []. The fold comprises a parallel beta-8/alpha-8-barrel, which contains a novel NADP-binding motif. The binding site is located in a large, deep, elliptical pocket in the C-terminal end of the beta sheet, the substrate being bound in an extended conformation. The hydrophobic nature of the pocket favours aromatic and apolar substrates over highly polar ones []. Binding of the NADPH coenzyme causes a massive conformational change, reorienting a loop, effectively locking the coenzyme in place. This binding is more similar to FAD- than to NAD(P)-binding oxidoreductases []. Some proteins of this entry contain a K+ ion channel beta chain regulatory domain; these are reported to have oxidoreductase activity []. This entry represents the NADP-dependent oxidoreductase domain found in these proteins.; PDB: 1C9W_A 4F40_B 1VBJ_A 1XGD_A 1X97_A 2ACS_A 1EF3_A 2ACU_A 1PWM_A 2NVD_A ....
Probab=25.84 E-value=1.3e+02 Score=18.68 Aligned_cols=31 Identities=13% Similarity=0.101 Sum_probs=25.0
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
.-++++|+-.........+.....+|++.|.
T Consensus 100 iDl~~lH~~~~~~~~~~~~~~~l~~l~~~G~ 130 (283)
T PF00248_consen 100 IDLLLLHWPDPSEDALEEVWEALEELKKEGK 130 (283)
T ss_dssp EEEEEESSSSTTSSHHHHHHHHHHHHHHTTS
T ss_pred hhccccccccccccccchhhhhhhhcccccc
Confidence 6899999955444448999999999999986
No 352
>PF09397 Ftsk_gamma: Ftsk gamma domain; InterPro: IPR018541 This domain directs oriented DNA translocation and forms a winged helix structure []. Mutated proteins with substitutions in the FtsK gamma DNA-recognition helix are impaired in DNA binding []. ; PDB: 2VE9_B 2VE8_E 2J5O_A 2J5P_A.
Probab=25.77 E-value=1.1e+02 Score=15.64 Aligned_cols=20 Identities=10% Similarity=0.376 Sum_probs=15.4
Q ss_pred ccCHHHHHHHHHHHHHcCCc
Q psy472 40 NVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~ 59 (68)
.|.+..+.++.+.|.+.|+-
T Consensus 31 rIGynrAariid~LE~~GiV 50 (65)
T PF09397_consen 31 RIGYNRAARIIDQLEEEGIV 50 (65)
T ss_dssp T--HHHHHHHHHHHHHCTSB
T ss_pred CCCHHHHHHHHHHHHHCCCC
Confidence 46788899999999999973
No 353
>PF10137 TIR-like: Predicted nucleotide-binding protein containing TIR-like domain; InterPro: IPR019302 This entry represents a TIR-like domain found in a family of prokaryotic predicted nucleotide-binding proteins. Their exact function has not, as yet, been defined.
Probab=25.72 E-value=1.4e+02 Score=17.07 Aligned_cols=6 Identities=50% Similarity=1.093 Sum_probs=4.0
Q ss_pred EEEEEe
Q psy472 30 YLLVHG 35 (68)
Q Consensus 30 ~ll~hG 35 (68)
+||.||
T Consensus 2 VFIvhg 7 (125)
T PF10137_consen 2 VFIVHG 7 (125)
T ss_pred EEEEeC
Confidence 566666
No 354
>PRK13104 secA preprotein translocase subunit SecA; Reviewed
Probab=25.55 E-value=2.5e+02 Score=21.70 Aligned_cols=31 Identities=19% Similarity=0.207 Sum_probs=27.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
..|+|++-. ..+.|..+++.|.+.|+++..+
T Consensus 444 g~PVLVgt~------Sie~sE~ls~~L~~~gi~h~vL 474 (896)
T PRK13104 444 KQPVLVGTV------SIEASEFLSQLLKKENIKHQVL 474 (896)
T ss_pred CCCEEEEeC------cHHHHHHHHHHHHHcCCCeEee
Confidence 589999876 5789999999999999998765
No 355
>cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis.
Probab=25.53 E-value=1.8e+02 Score=18.17 Aligned_cols=25 Identities=16% Similarity=0.050 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHHcCCceEEEEecC
Q psy472 43 FQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
-.++..+++.|.+.|..+.+....+
T Consensus 16 ~~~~~~l~~~l~~~G~~v~v~~~~~ 40 (365)
T cd03825 16 ARAAYRLHRALQAAGVDSTMLVQEK 40 (365)
T ss_pred HHHHHHHHHHHHhcCCceeEEEeec
Confidence 4678889999999998888776543
No 356
>PRK00466 acetyl-lysine deacetylase; Validated
Probab=25.51 E-value=1.7e+02 Score=18.93 Aligned_cols=34 Identities=15% Similarity=0.106 Sum_probs=22.6
Q ss_pred EeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 34 HGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 34 hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
-|..|..-.........++|++.|.+..+....+
T Consensus 88 rG~~DmKgg~aa~l~a~~~l~~~~~~i~~~~~~d 121 (346)
T PRK00466 88 RGAVDAKGPLISMIIAAWLLNEKGIKVMVSGLAD 121 (346)
T ss_pred cCccccchHHHHHHHHHHHHHHcCCCEEEEEEcC
Confidence 4555665566677777778888877766665554
No 357
>PF13684 Dak1_2: Dihydroxyacetone kinase family
Probab=25.50 E-value=2.1e+02 Score=18.91 Aligned_cols=38 Identities=13% Similarity=0.061 Sum_probs=29.6
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
-=+.++.| ..++-+.+..+.+.|++.--.+++.+|.|+
T Consensus 265 elvTi~~G---~~~~~~~a~~l~~~l~~~~p~~eve~~~Gg 302 (313)
T PF13684_consen 265 ELVTIYYG---EDVSEEEAEALAEFLEEKYPDVEVEVYDGG 302 (313)
T ss_pred eEEEEEec---CCCCHHHHHHHHHHHHHHhCCeEEEEEECC
Confidence 34667777 456777999999999998778888888775
No 358
>PRK05772 translation initiation factor IF-2B subunit alpha; Provisional
Probab=25.37 E-value=1.2e+02 Score=20.69 Aligned_cols=28 Identities=14% Similarity=0.062 Sum_probs=21.4
Q ss_pred CCCccCHHHHHHH-HHHHHHcCCceEEEE
Q psy472 37 MDDNVHFQQSMML-AKSLQHADIMFQSQE 64 (68)
Q Consensus 37 ~D~~V~~~~s~~l-~~~L~~~g~~~~~~~ 64 (68)
-++.-|..|..++ +..|.+.|+++.++.
T Consensus 206 v~EsRP~~qG~rlta~eL~~~GIpvtlI~ 234 (363)
T PRK05772 206 APETRPWLQGSRLTVYELMEEGIKVTLIT 234 (363)
T ss_pred ECCCCccchhHHHHHHHHHHCCCCEEEEe
Confidence 3566778887765 688999999988764
No 359
>PF10113 Fibrillarin_2: Fibrillarin-like archaeal protein; InterPro: IPR016760 Members of this protein family are HmdC, whose gene regularly occurs in the context of genes for HmdA (5,10-methenyltetrahydromethanopterin hydrogenase) and the radical SAM protein HmdB involved in biosynthesis of the HmdA cofactor. Bioinformatics suggests this protein, a homologue of eukaryotic fibrillarin, may be involved in biosynthesis of the guanylyl pyridinol cofactor in HmdA.
Probab=25.26 E-value=2.2e+02 Score=20.28 Aligned_cols=44 Identities=5% Similarity=-0.065 Sum_probs=37.4
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.++. -..+.-.+|...--.|.+.++..++..++.|.-.+-++|=
T Consensus 185 sssV-~VNl~~eQg~e~HaApldE~~~Va~~Akk~gkGveaI~~v 228 (505)
T PF10113_consen 185 SSSV-CVNLTEEQGGEEHAAPLDEMEEVAELAKKYGKGVEAIMHV 228 (505)
T ss_pred hhhh-eEEeeeccCCcccCCCHHHHHHHHHHHHHhCCCceEEEEe
Confidence 3344 4788889999999999999999999999999988877763
No 360
>cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen. It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase.
Probab=25.23 E-value=1.7e+02 Score=17.95 Aligned_cols=40 Identities=15% Similarity=0.047 Sum_probs=30.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+...+++-|..+..-......+.++++.+. .+++++++.+
T Consensus 219 ~~~~i~~~G~~~~~k~~~~l~~~~~~l~~~-~~~~l~i~G~ 258 (394)
T cd03794 219 DKFVVLYAGNIGRAQGLDTLLEAAALLKDR-PDIRFLIVGD 258 (394)
T ss_pred CcEEEEEecCcccccCHHHHHHHHHHHhhc-CCeEEEEeCC
Confidence 467788888888877778888888887776 6777777643
No 361
>PRK09444 pntB pyridine nucleotide transhydrogenase; Provisional
Probab=25.08 E-value=1.2e+02 Score=21.58 Aligned_cols=39 Identities=10% Similarity=-0.018 Sum_probs=25.6
Q ss_pred CcEEEEEeCCCCccCHHH-HHHHHHHHHHcCCceEEEEec
Q psy472 28 KQYLLVHGTMDDNVHFQQ-SMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~-s~~l~~~L~~~g~~~~~~~~~ 66 (68)
..++|+=|-.=-.-.-+| ..+|++.|+++|+++++=+-|
T Consensus 307 ~~ViIVPGYGmAVAqAQh~v~el~~~L~~~Gv~V~faIHP 346 (462)
T PRK09444 307 HSVIITPGYGMAVAQAQYPVAEITEKLRARGINVRFGIHP 346 (462)
T ss_pred CcEEEECChHHHHHHHHHHHHHHHHHHHHCCCeEEEEecc
Confidence 566666554433333333 346999999999999886644
No 362
>cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=25.05 E-value=1.8e+02 Score=17.96 Aligned_cols=39 Identities=10% Similarity=0.040 Sum_probs=28.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.+++.|.-...=........+++|.+. .+++++++.+
T Consensus 197 ~~~i~~~G~~~~~k~~~~~i~~~~~l~~~-~~~~l~i~G~ 235 (364)
T cd03814 197 RPVLLYVGRLAPEKNLEALLDADLPLRRR-PPVRLVIVGD 235 (364)
T ss_pred CeEEEEEeccccccCHHHHHHHHHHhhhc-CCceEEEEeC
Confidence 67888999776666667777777777766 5677777654
No 363
>PRK13103 secA preprotein translocase subunit SecA; Reviewed
Probab=25.05 E-value=1.2e+02 Score=23.37 Aligned_cols=30 Identities=13% Similarity=0.183 Sum_probs=26.2
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
..|+|+.- ..++.|+.+.+.|++.|+++++
T Consensus 449 GrPVLVGT------~SVe~SE~ls~~L~~~gi~h~V 478 (913)
T PRK13103 449 GRPVLVGT------ATIETSEHMSNLLKKEGIEHKV 478 (913)
T ss_pred CCCEEEEe------CCHHHHHHHHHHHHHcCCcHHH
Confidence 58999976 4789999999999999998764
No 364
>PRK15324 type III secretion system lipoprotein PrgK; Provisional
Probab=25.00 E-value=1.2e+02 Score=19.54 Aligned_cols=22 Identities=18% Similarity=0.120 Sum_probs=19.3
Q ss_pred cCHHHHHHHHHHHHHcCCceEE
Q psy472 41 VHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 41 V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
-...++-++...|.++|++++.
T Consensus 27 L~~~dAneIv~~L~~~gI~y~~ 48 (252)
T PRK15324 27 LDQEQANEVIAVLQMHNIEANK 48 (252)
T ss_pred CCHHHHHHHHHHHHHCCCCeEe
Confidence 4568999999999999999976
No 365
>cd04885 ACT_ThrD-I Tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes each of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=25.00 E-value=1e+02 Score=15.10 Aligned_cols=29 Identities=14% Similarity=0.078 Sum_probs=22.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
.++++.--+.| .+|...+.++|+++|..+
T Consensus 38 ~~v~v~ie~~~----~~~~~~i~~~L~~~G~~~ 66 (68)
T cd04885 38 ARVLVGIQVPD----REDLAELKERLEALGYPY 66 (68)
T ss_pred eEEEEEEEeCC----HHHHHHHHHHHHHcCCCc
Confidence 56666665554 589999999999999765
No 366
>COG1509 KamA Lysine 2,3-aminomutase [Amino acid transport and metabolism]
Probab=24.94 E-value=62 Score=22.19 Aligned_cols=29 Identities=21% Similarity=0.363 Sum_probs=25.1
Q ss_pred CcEEEEE-----eCCCCccCHHHHHHHHHHHHHc
Q psy472 28 KQYLLVH-----GTMDDNVHFQQSMMLAKSLQHA 56 (68)
Q Consensus 28 ~p~ll~h-----G~~D~~V~~~~s~~l~~~L~~~ 56 (68)
.|..++| |+++..|+.+.+.++.++|+..
T Consensus 282 ~PYYl~~~D~~~G~~hfr~~i~~~~~i~~~lr~~ 315 (369)
T COG1509 282 KPYYLHQLDLVQGAAHFRVPIAEGLQIVEELRGR 315 (369)
T ss_pred cceEEeccCccCCccceeccHHHHHHHHHHHHHh
Confidence 6777665 8899999999999999999875
No 367
>PRK05331 putative phosphate acyltransferase; Provisional
Probab=24.93 E-value=89 Score=20.85 Aligned_cols=33 Identities=18% Similarity=0.321 Sum_probs=26.3
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 25 IRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 25 ~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
+ +.+++..||..|.. -+.++...+..+.+.++.
T Consensus 292 ~-~g~vik~HG~s~~~-a~~~ai~~a~~~~~~~~~ 324 (334)
T PRK05331 292 L-NGIVIKSHGSSDAK-AFANAIRQAKEAVENNVP 324 (334)
T ss_pred C-CceEEEECCCCChH-HHHHHHHHHHHHHHhCHH
Confidence 5 58999999999876 677888888887777653
No 368
>PF06415 iPGM_N: BPG-independent PGAM N-terminus (iPGM_N); InterPro: IPR011258 This family represents the N-terminal region of the 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (or phosphoglyceromutase or BPG-independent PGAM) protein (5.4.2.1 from EC). The family is found in conjunction with Metalloenzyme (located in the C-terminal region of the protein). ; GO: 0004619 phosphoglycerate mutase activity, 0030145 manganese ion binding, 0006007 glucose catabolic process, 0005737 cytoplasm; PDB: 1EQJ_A 1EJJ_A 1O99_A 1O98_A 3IGZ_B 3IGY_B 3NVL_A 2IFY_A.
Probab=24.93 E-value=1.9e+02 Score=18.36 Aligned_cols=49 Identities=12% Similarity=0.089 Sum_probs=33.1
Q ss_pred hHHhhcCCCCcEEEEEeCCCCccC--HHHHHHHHHHHHHcCCc-eEEEEecCC
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNVH--FQQSMMLAKSLQHADIM-FQSQECYRR 68 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V~--~~~s~~l~~~L~~~g~~-~~~~~~~~~ 68 (68)
+.++.+. +..+-|+-=..|--|| ..|-..|.+.++++|+. +-+|.|-+|
T Consensus 20 ~~~~k~~-~~~lHl~GLlSdGGVHSh~~Hl~al~~~a~~~gv~~V~vH~f~DG 71 (223)
T PF06415_consen 20 IEHAKKN-GGRLHLMGLLSDGGVHSHIDHLFALIKLAKKQGVKKVYVHAFTDG 71 (223)
T ss_dssp HHHHCCT-T--EEEEEEESS-SSS--HHHHHHHHHHHHHTT-SEEEEEEEE-S
T ss_pred HHHHHhc-CCeEEEEEEecCCCccccHHHHHHHHHHHHHcCCCEEEEEEecCC
Confidence 4555555 4566676666777775 58889999999999985 888888765
No 369
>KOG1579|consensus
Probab=24.85 E-value=90 Score=21.00 Aligned_cols=36 Identities=19% Similarity=-0.081 Sum_probs=27.7
Q ss_pred EEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 31 LLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 31 ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
++.-|-+ .+++.+...+...|....-...+++|||+
T Consensus 223 ~~~IGvN--C~~~~~~~~~~~~L~~~~~~~~llvYPNs 258 (317)
T KOG1579|consen 223 LLGIGVN--CVSPNFVEPLLKELMAKLTKIPLLVYPNS 258 (317)
T ss_pred eEEEEec--cCCchhccHHHHHHhhccCCCeEEEecCC
Confidence 4444544 57788888888888877778899999985
No 370
>PF13344 Hydrolase_6: Haloacid dehalogenase-like hydrolase; PDB: 2HO4_B 1YV9_A 1WVI_B 3EPR_A 2P27_A 2OYC_A 2CFT_A 2P69_A 2CFS_A 2CFR_A ....
Probab=24.68 E-value=71 Score=17.18 Aligned_cols=22 Identities=5% Similarity=-0.093 Sum_probs=15.6
Q ss_pred CHHHHHHHHHHHHHcCCceEEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
++..+..+.++|+++|+++-++
T Consensus 15 ~ipga~e~l~~L~~~g~~~~~l 36 (101)
T PF13344_consen 15 PIPGAVEALDALRERGKPVVFL 36 (101)
T ss_dssp E-TTHHHHHHHHHHTTSEEEEE
T ss_pred cCcCHHHHHHHHHHcCCCEEEE
Confidence 5566788888888888776554
No 371
>PF01903 CbiX: CbiX; InterPro: IPR002762 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase []. In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase []. There are at least two distinct cobalamin biosynthetic pathways in bacteria []: Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway []; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis []; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii subsp. shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) []. There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents the CbiX protein, which functions as a cobalt-chelatase in the anaerobic biosynthesis of cobalamin. It catalyses the insertion of cobalt into sirohydrochlorin. The structure of CbiX from Archaeoglobus fulgidus consists of a central mixed beta-sheet flanked by four alpha-helices, although it is about half the size of other Class II tetrapyrrole chelatases []. The CbiX proteins found in archaea appear to be shorter than those found in eubacteria [].; GO: 0016829 lyase activity, 0046872 metal ion binding, 0009236 cobalamin biosynthetic process; PDB: 2XWQ_C 2DJ5_A 1TJN_A 2XWS_A 3LYH_B 2JH3_D.
Probab=24.49 E-value=1.2e+02 Score=15.93 Aligned_cols=28 Identities=21% Similarity=0.140 Sum_probs=17.2
Q ss_pred EeCCCCccCHHHHHHHHHHHHHcCC-ceEE
Q psy472 34 HGTMDDNVHFQQSMMLAKSLQHADI-MFQS 62 (68)
Q Consensus 34 hG~~D~~V~~~~s~~l~~~L~~~g~-~~~~ 62 (68)
||+.|+. .-.....+++.|++... +++.
T Consensus 1 HGSr~~~-~~~~~~~la~~l~~~~~~~v~~ 29 (105)
T PF01903_consen 1 HGSRDPE-ANAELEDLADRLRERLPVPVEV 29 (105)
T ss_dssp -STSSCH-HHHHHHHHHHHHHHHTSSEEEE
T ss_pred CCCCCHH-HHHHHHHHHHHHHhhcCCeEEE
Confidence 7877744 33566778888888654 4443
No 372
>PRK01372 ddl D-alanine--D-alanine ligase; Reviewed
Probab=24.39 E-value=1.7e+02 Score=18.51 Aligned_cols=36 Identities=11% Similarity=0.194 Sum_probs=25.0
Q ss_pred cEEEEEeC--CCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 29 QYLLVHGT--MDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 29 p~ll~hG~--~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
++.++.|. .+..+.......+.++|++.|..+.++-
T Consensus 6 ~v~~~~g~~~~~~~~~~~s~~~i~~al~~~g~~v~~i~ 43 (304)
T PRK01372 6 KVAVLMGGTSAEREVSLNSGAAVLAALREAGYDAHPID 43 (304)
T ss_pred EEEEEeCCCCCCceEeHHhHHHHHHHHHHCCCEEEEEe
Confidence 44455433 4556667788899999999998877653
No 373
>PF06057 VirJ: Bacterial virulence protein (VirJ); InterPro: IPR010333 This entry contains several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localised to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium [].
Probab=24.22 E-value=95 Score=19.32 Aligned_cols=24 Identities=8% Similarity=0.217 Sum_probs=18.8
Q ss_pred ChhHHhhcCCCCcEEEEEeCCCCc
Q psy472 17 ALNNKVDRIRDKQYLLVHGTMDDN 40 (68)
Q Consensus 17 sp~~~~~~~~~~p~ll~hG~~D~~ 40 (68)
.++..+.++...|+++|+|+.++.
T Consensus 129 ~~~pei~~l~~~~v~CiyG~~E~d 152 (192)
T PF06057_consen 129 PVIPEIAKLPPAPVQCIYGEDEDD 152 (192)
T ss_pred CchHHHHhCCCCeEEEEEcCCCCC
Confidence 466677777557999999988775
No 374
>TIGR00512 salvage_mtnA S-methyl-5-thioribose-1-phosphate isomerase. The delineation of this family was based in part on a discussion and neighbor-joining phylogenetic study, by Kyrpides and Woese, of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. This clade is now recognized to include the methionine salvage pathway enzyme MtnA.
Probab=24.09 E-value=1.5e+02 Score=19.88 Aligned_cols=27 Identities=19% Similarity=-0.028 Sum_probs=20.7
Q ss_pred CCccCHHHHHH-HHHHHHHcCCceEEEE
Q psy472 38 DDNVHFQQSMM-LAKSLQHADIMFQSQE 64 (68)
Q Consensus 38 D~~V~~~~s~~-l~~~L~~~g~~~~~~~ 64 (68)
++.-|..|..+ ++..|.+.|+++.++.
T Consensus 186 ~EsrP~~qG~rlta~~L~~~GI~vtlI~ 213 (331)
T TIGR00512 186 DETRPRLQGARLTAWELVQEGIPATLIT 213 (331)
T ss_pred CCCCchhhHHHHHHHHHHHCCCCEEEEc
Confidence 34567788775 5889999999998753
No 375
>PF03720 UDPG_MGDP_dh_C: UDP-glucose/GDP-mannose dehydrogenase family, UDP binding domain; InterPro: IPR014027 The UDP-glucose/GDP-mannose dehydrogenases are a small group of enzymes which possesses the ability to catalyse the NAD-dependent 2-fold oxidation of an alcohol to an acid without the release of an aldehyde intermediate [, ]. The enzymes have a wide range of functions. In plants UDP-glucose dehydrogenase, 1.1.1.22 from EC, is an important enzyme in the synthesis of hemicellulose and pectin [], which are the components of newly formed cell walls; while in zebrafish UDP-glucose dehydrogenase is required for cardiac valve formation []. In Xanthomonas campestris, a plant pathogen, UDP-glucose dehydrogenase is required for virulence []. GDP-mannose dehydrogenase, 1.1.1.132 from EC, catalyses the formation of GDP-mannuronic acid, which is the monomeric unit from which the exopolysaccharide alginate is formed. Alginate is secreted by a number of bacteria, which include Pseudomonas aeruginosa and Azotobacter vinelandii. In P. aeruginosa, alginate is believed to play an important role in the bacteria's resistance to antibiotics and the host immune response [], while in A. vinelandii it is essential for the encystment process []. This entry represents the C-terminal substrate-binding domain of these enzymes. Structural studies indicate that this domain forms an incomplete dinucleotide binding fold [, ].; GO: 0016616 oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor, 0051287 NAD binding, 0055114 oxidation-reduction process; PDB: 3GG2_D 1DLI_A 1DLJ_A 2Y0E_D 2Y0D_B 2Y0C_D 1MV8_B 1MUU_A 1MFZ_C 3TDK_B ....
Probab=23.95 E-value=1.2e+02 Score=16.40 Aligned_cols=19 Identities=11% Similarity=0.149 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHcCCceEE
Q psy472 44 QQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 44 ~~s~~l~~~L~~~g~~~~~ 62 (68)
+-+..+++.|++.|..+..
T Consensus 17 Sp~~~l~~~L~~~g~~V~~ 35 (106)
T PF03720_consen 17 SPALELIEELKERGAEVSV 35 (106)
T ss_dssp -HHHHHHHHHHHTT-EEEE
T ss_pred CHHHHHHHHHHHCCCEEEE
Confidence 4578899999999886543
No 376
>PF01936 NYN: NYN domain; InterPro: IPR021139 This highly conserved domain has no known function. However it contains many conserved aspartates, suggesting an enzymatic function such as an endonuclease or glycosyl hydrolase.; PDB: 2QIP_A.
Probab=23.91 E-value=94 Score=17.09 Aligned_cols=30 Identities=20% Similarity=0.182 Sum_probs=17.2
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEE
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
.-++|+-|+.| -.-+++.|++.|+.+.+..
T Consensus 97 d~ivLvSgD~D-------f~~~v~~l~~~g~~V~v~~ 126 (146)
T PF01936_consen 97 DTIVLVSGDSD-------FAPLVRKLRERGKRVIVVG 126 (146)
T ss_dssp SEEEEE---GG-------GHHHHHHHHHH--EEEEEE
T ss_pred CEEEEEECcHH-------HHHHHHHHHHcCCEEEEEE
Confidence 56777878766 4456777788888776655
No 377
>cd03799 GT1_amsK_like This is a family of GT1 glycosyltransferases found specifically in certain bacteria. amsK in Erwinia amylovora, has been reported to be involved in the biosynthesis of amylovoran, a exopolysaccharide acting as a virulence factor.
Probab=23.73 E-value=1.9e+02 Score=17.93 Aligned_cols=40 Identities=13% Similarity=0.071 Sum_probs=28.8
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
...+++.|.-...=.+....+..++|.+.+..++++++.+
T Consensus 179 ~~~i~~~g~~~~~k~~~~l~~~~~~l~~~~~~~~l~i~G~ 218 (355)
T cd03799 179 PLRILSVGRLVEKKGLDYLLEALALLKDRGIDFRLDIVGD 218 (355)
T ss_pred CeEEEEEeeeccccCHHHHHHHHHHHhhcCCCeEEEEEEC
Confidence 5567777877666667777777777777777788877643
No 378
>PF03979 Sigma70_r1_1: Sigma-70 factor, region 1.1; InterPro: IPR007127 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. This entry represents Region 1.1 which modulates DNA binding by region 2 and 4 when sigma is unbound by the core RNA polymerase [, ]. Region 1.1 is also involved in promoter binding.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2K6X_A.
Probab=23.57 E-value=94 Score=16.15 Aligned_cols=22 Identities=18% Similarity=0.211 Sum_probs=13.6
Q ss_pred CccCHHHHHHHHHHHHHcCCce
Q psy472 39 DNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
..+.+.+...++..|.+.|+.+
T Consensus 34 ~~~~~e~id~i~~~L~~~gI~V 55 (82)
T PF03979_consen 34 DDLDPEQIDEIYDTLEDEGIEV 55 (82)
T ss_dssp S---HHHHHHHHHHHHTT----
T ss_pred cCCCHHHHHHHHHHHHHCCCEE
Confidence 4477889999999999999865
No 379
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=23.36 E-value=67 Score=12.57 Aligned_cols=18 Identities=17% Similarity=0.265 Sum_probs=14.1
Q ss_pred CHHHHHHHHHHHHHcCCc
Q psy472 42 HFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~ 59 (68)
.++.+..+.+.+++.|+.
T Consensus 15 ~~~~a~~~~~~M~~~g~~ 32 (35)
T TIGR00756 15 RVEEALELFKEMLERGIE 32 (35)
T ss_pred CHHHHHHHHHHHHHcCCC
Confidence 356788888889888874
No 380
>cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of galactose alpha-1,6 linkages in amylovoran.
Probab=23.30 E-value=1.8e+02 Score=17.52 Aligned_cols=41 Identities=7% Similarity=0.028 Sum_probs=28.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
..+.+++-|.-...=.+....+..+.|.+...+++++++.+
T Consensus 177 ~~~~i~~~g~~~~~K~~~~l~~~~~~l~~~~~~~~l~i~G~ 217 (348)
T cd03820 177 KSKRILAVGRLVPQKGFDLLIEAWAKIAKKHPDWKLRIVGD 217 (348)
T ss_pred CCcEEEEEEeeccccCHHHHHHHHHHHHhcCCCeEEEEEeC
Confidence 36778888876665566666667777766666777777643
No 381
>PRK15348 type III secretion system lipoprotein SsaJ; Provisional
Probab=23.26 E-value=2e+02 Score=18.58 Aligned_cols=22 Identities=9% Similarity=0.009 Sum_probs=15.3
Q ss_pred CHHHHHHHHHHHHHcCCceEEE
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
..+++-++...|+++|++++..
T Consensus 28 ~~~dA~~I~a~L~~~gI~y~~~ 49 (249)
T PRK15348 28 PEDEANQMLALLMQHHIDAEKK 49 (249)
T ss_pred CHHHHHHHHHHHHHcCCCceEe
Confidence 4566777777777777777653
No 382
>PRK00002 aroB 3-dehydroquinate synthase; Reviewed
Probab=22.97 E-value=2.4e+02 Score=18.72 Aligned_cols=37 Identities=19% Similarity=0.047 Sum_probs=23.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.++|++.+ ..+.-.....+.+.|.+.|+.+..++|++
T Consensus 32 ~~~livtd---~~~~~~~~~~v~~~L~~~gi~~~~~~~~~ 68 (358)
T PRK00002 32 KKVAIVTD---ETVAPLYLEKLRASLEAAGFEVDVVVLPD 68 (358)
T ss_pred CeEEEEEC---CchHHHHHHHHHHHHHhcCCceEEEEeCC
Confidence 46777764 44433455667777888888877666654
No 383
>PF13439 Glyco_transf_4: Glycosyltransferase Family 4; PDB: 2JJM_E 3MBO_C 2GEJ_A 2GEK_A.
Probab=22.95 E-value=1.4e+02 Score=16.23 Aligned_cols=22 Identities=14% Similarity=0.066 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHcCCceEEEEe
Q psy472 44 QQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 44 ~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
..+..|+++|.+.|..+.+...
T Consensus 16 ~~~~~l~~~l~~~G~~v~v~~~ 37 (177)
T PF13439_consen 16 RVVLNLARALAKRGHEVTVVSP 37 (177)
T ss_dssp HHHHHHHHHHHHTT-EEEEEES
T ss_pred HHHHHHHHHHHHCCCEEEEEEc
Confidence 4677899999999988877643
No 384
>PRK07708 hypothetical protein; Validated
Probab=22.92 E-value=1.9e+02 Score=18.09 Aligned_cols=30 Identities=10% Similarity=0.171 Sum_probs=25.8
Q ss_pred CCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 38 DDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 38 D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+-....++..+++.|.+.|..-++.+++.
T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 50 (219)
T PRK07708 21 SDWMNIEEALQLAEDFEKTGRVKELEFYDE 50 (219)
T ss_pred eccccHHHHHHHHHHHhhcCCceeEEEecC
Confidence 567889999999999999999888877754
No 385
>TIGR00854 pts-sorbose PTS system, mannose/fructose/sorbose family, IIB component. Bacterial PTS transporters transport and concomitantly phosphorylate their sugar substrates, and typically consist of multiple subunits or protein domains.The Man family is unique in several respects among PTS permease families.It is the only PTS family in which members possess a IID protein. It is the only PTS family in which the IIB constituent is phosphorylated on a histidyl rather than a cysteyl residue. Its permease members exhibit broad specificity for a range of sugars, rather than being specific for just one or a few sugars. The mannose permease of E. coli, for example, can transport and phosphorylate glucose, mannose, fructose, glucosamine, N-acetylglucosamine, and other sugars. Other members of this can transport sorbose, fructose and N-acetylglucosamine. This family is specific for the IIB components of this family of PTS transporters.
Probab=22.91 E-value=1.4e+02 Score=17.59 Aligned_cols=23 Identities=9% Similarity=0.112 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHcCCceEEEEecC
Q psy472 45 QSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.=.+..++|.+.|+.++++..|+
T Consensus 124 ~e~~~l~~l~~~Gv~v~~q~vP~ 146 (151)
T TIGR00854 124 QDITAFRFLKQRGVKLFLRDVPS 146 (151)
T ss_pred HHHHHHHHHHHcCCEEEEEECcC
Confidence 34456778889999999998885
No 386
>cd02014 TPP_POX Thiamine pyrophosphate (TPP) family, Pyruvate oxidase (POX) subfamily, TPP-binding module; composed of proteins similar to Lactobacillus plantarum POX, which plays a key role in controlling acetate production under aerobic conditions. POX decarboxylates pyruvate, producing hydrogen peroxide and the energy-storage metabolite acetylphosphate. It requires FAD in addition to TPP and a divalent cation as cofactors.
Probab=22.90 E-value=1.7e+02 Score=17.12 Aligned_cols=38 Identities=11% Similarity=0.013 Sum_probs=27.6
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.+++.+.|+..-.-... .|.. +.+.++++.++++.|+
T Consensus 69 ~~~vv~i~GDG~f~~~~~---el~t-~~~~~lp~~~iv~NN~ 106 (178)
T cd02014 69 DRQVIALSGDGGFAMLMG---DLIT-AVKYNLPVIVVVFNNS 106 (178)
T ss_pred CCcEEEEEcchHHHhhHH---HHHH-HHHhCCCcEEEEEECC
Confidence 479999999887765532 2443 4667889999998874
No 387
>PRK08299 isocitrate dehydrogenase; Validated
Probab=22.88 E-value=2.7e+02 Score=19.37 Aligned_cols=37 Identities=5% Similarity=-0.060 Sum_probs=30.9
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
..|+.++||+. +-.+-+..+.+.|...++++++..|+
T Consensus 7 ~~~~~~~~gd~---i~~~~~~~~~~~~~~~~~~i~~~~~d 43 (402)
T PRK08299 7 KNPVVELDGDE---MTRIIWKFIKDKLILPYLDIDLEYYD 43 (402)
T ss_pred CCceEEecCCC---chHHHHHHHHHHHhccCCCeEEEEEc
Confidence 48999999954 55677788888999999999998886
No 388
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=22.83 E-value=1.5e+02 Score=16.34 Aligned_cols=24 Identities=13% Similarity=0.065 Sum_probs=17.3
Q ss_pred CHHHHHHHHHHHHHcCCceEEEEe
Q psy472 42 HFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 42 ~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
.+..+..|++-|+..|+++++...
T Consensus 9 n~r~AqaF~DYl~sqgI~~~i~~~ 32 (101)
T PF12122_consen 9 NPRAAQAFIDYLASQGIELQIEPE 32 (101)
T ss_dssp SHHHHHHHHHHHHHTT--EEEE-S
T ss_pred CHHHHHHHHHHHHHCCCeEEEEEC
Confidence 356788999999999998887653
No 389
>cd00215 PTS_IIA_lac PTS_IIA, PTS system, lactose/cellobiose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion. Separation into subunits is thought to occur after phosphorylation.
Probab=22.77 E-value=56 Score=17.92 Aligned_cols=28 Identities=29% Similarity=0.263 Sum_probs=21.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
...+|++|. +|..-...-...|++.+..
T Consensus 66 ~~slLlvHA-qDhLMta~~~~~la~e~I~ 93 (97)
T cd00215 66 EVSLLLVHA-QDHLMTTITLKDLAKEMIE 93 (97)
T ss_pred ccceeehhH-HHHHHHHHHHHHHHHHHHH
Confidence 478889998 7887777777777776654
No 390
>PF02780 Transketolase_C: Transketolase, C-terminal domain; InterPro: IPR005476 Transketolase 2.2.1.1 from EC (TK) catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor, such as ribose 5-phosphate, to form sedoheptulose 7-phosphate and glyceraldehyde 3- phosphate. This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. TK requires thiamine pyrophosphate as a cofactor. In most sources where TK has been purified, it is a homodimer of approximately 70 Kd subunits. TK sequences from a variety of eukaryotic and prokaryotic sources [, ] show that the enzyme has been evolutionarily conserved. In the peroxisomes of methylotrophic yeast Pichia angusta (Yeast) (Hansenula polymorpha), there is a highly related enzyme, dihydroxy-acetone synthase (DHAS) 2.2.1.3 from EC (also known as formaldehyde transketolase), which exhibits a very unusual specificity by including formaldehyde amongst its substrates. 1-deoxyxylulose-5-phosphate synthase (DXP synthase) [] is an enzyme so far found in bacteria (gene dxs) and plants (gene CLA1) which catalyzes the thiamine pyrophosphoate-dependent acyloin condensation reaction between carbon atoms 2 and 3 of pyruvate and glyceraldehyde 3-phosphate to yield 1-deoxy-D- xylulose-5-phosphate (dxp), a precursor in the biosynthetic pathway to isoprenoids, thiamine (vitamin B1), and pyridoxol (vitamin B6). DXP synthase is evolutionary related to TK. The N-terminal section, contains a histidine residue which appears to function in proton transfer during catalysis []. In the central section there are conserved acidic residues that are part of the active cleft and may participate in substrate-binding []. This family includes transketolase enzymes 2.2.1.1 from EC and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit P37941 from SWISSPROT 1.2.4.4 from EC. Both these enzymes utilise thiamine pyrophosphate as a cofactor, suggesting there may be common aspects in their mechanism of catalysis.; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 1NGS_B 1TKA_A 1TRK_B 1TKB_A 1GPU_B 1AY0_B 1TKC_B 2E6K_A 3JU3_A 2R8P_B ....
Probab=22.74 E-value=1.5e+02 Score=16.24 Aligned_cols=21 Identities=14% Similarity=0.170 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHcCCceEEE
Q psy472 43 FQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~ 63 (68)
...+.+.++.|.+.|+.+++.
T Consensus 21 ~~~al~A~~~L~~~Gi~~~vi 41 (124)
T PF02780_consen 21 VEEALEAAEELEEEGIKAGVI 41 (124)
T ss_dssp HHHHHHHHHHHHHTTCEEEEE
T ss_pred HHHHHHHHHHHHHcCCceeEE
Confidence 578899999999999987765
No 391
>PF03446 NAD_binding_2: NAD binding domain of 6-phosphogluconate dehydrogenase; InterPro: IPR006115 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) [, ]. Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved []. The protein is a homodimer in which the monomers act independently []: each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet []. NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket []. This family represents the NADP binding domain of 6-phosphogluconate dehydrogenase which adopts a Rossman fold. The C-terminal domain is described in IPR006114 from INTERPRO.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0006098 pentose-phosphate shunt, 0055114 oxidation-reduction process; PDB: 3AX6_D 3PDU_G 3Q3C_A 3OBB_A 4DLL_B 1PGP_A 1PGN_A 2PGD_A 1PGQ_A 1PGO_A ....
Probab=22.72 E-value=96 Score=17.94 Aligned_cols=32 Identities=16% Similarity=0.259 Sum_probs=21.7
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 25 IRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 25 ~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
.+...+++-++ .+.++.+.++++.+.++|+.+
T Consensus 84 l~~g~iiid~s----T~~p~~~~~~~~~~~~~g~~~ 115 (163)
T PF03446_consen 84 LRPGKIIIDMS----TISPETSRELAERLAAKGVRY 115 (163)
T ss_dssp S-TTEEEEE-S----S--HHHHHHHHHHHHHTTEEE
T ss_pred cccceEEEecC----Ccchhhhhhhhhhhhhcccee
Confidence 33356676666 567899999999999999643
No 392
>cd03064 TRX_Fd_NuoE TRX-like [2Fe-2S] Ferredoxin (Fd) family, NADH:ubiquinone oxidoreductase (Nuo) subunit E subfamily; Nuo, also called respiratory chain Complex 1, is the entry point for electrons into the respiratory chains of bacteria and the mitochondria of eukaryotes. It is a multisubunit complex with at least 14 core subunits. It catalyzes the electron transfer of NADH to quinone coupled with the transfer of protons across the membrane, providing the proton motive force required for energy-consuming processes. Electrons are transferred from NADH to quinone through a chain of iron-sulfur clusters in Nuo, including the [2Fe-2S] cluster present in NuoE core subunit, also called the 24 kD subunit of Complex 1. This subfamily also include formate dehydrogenases, NiFe hydrogenases and NAD-reducing hydrogenases, that contain a NuoE domain. A subset of these proteins contain both NuoE and NuoF in a single chain. NuoF, also called the 51 kD subunit of Complex 1, contains one [4Fe-4S] clu
Probab=22.72 E-value=86 Score=15.89 Aligned_cols=26 Identities=0% Similarity=0.049 Sum_probs=18.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKS 52 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~ 52 (68)
..|+++++|.-=..|.++...++.++
T Consensus 54 ~gP~v~v~g~~y~~vt~~~i~~i~~~ 79 (80)
T cd03064 54 LAPVMMINDDVYGRLTPEKVDAILEA 79 (80)
T ss_pred CCCEEEECCEEECCCCHHHHHHHHHh
Confidence 58999998766666777777766654
No 393
>PF02223 Thymidylate_kin: Thymidylate kinase; InterPro: IPR018094 Thymidylate kinase (2.7.4.9 from EC; dTMP kinase) catalyzes the phosphorylation of thymidine 5'-monophosphate (dTMP) to form thymidine 5'-diphosphate (dTDP) in the presence of ATP and magnesium: ATP + thymidine 5'-phosphate = ADP + thymidine 5'-diphosphate Thymidylate kinase is an ubiquitous enzyme of about 25 Kd and is important in the dTTP synthesis pathway for DNA synthesis. The function of dTMP kinase in eukaryotes comes from the study of a cell cycle mutant, cdc8, in Saccharomyces cerevisiae. Structural and functional analyses suggest that the cDNA codes for authentic human dTMP kinase. The mRNA levels and enzyme activities corresponded to cell cycle progression and cell growth stages[]. ; GO: 0004798 thymidylate kinase activity, 0005524 ATP binding, 0006233 dTDP biosynthetic process; PDB: 2PLR_B 1NMX_A 1NN0_A 2XX3_A 1NN3_A 1E9F_A 1E2Q_A 1E2D_A 1E9A_A 1E99_A ....
Probab=22.59 E-value=1.5e+02 Score=17.21 Aligned_cols=22 Identities=18% Similarity=0.078 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHcCCceEEEEec
Q psy472 44 QQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 44 ~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.++..|.++|...|.. .+..+|
T Consensus 11 T~~~~L~~~l~~~~~~-~~~~~~ 32 (186)
T PF02223_consen 11 TQIRLLAEALKEKGYK-VIITFP 32 (186)
T ss_dssp HHHHHHHHHHHHTTEE-EEEEES
T ss_pred HHHHHHHHHHHHcCCc-ccccCC
Confidence 5788899999999988 444444
No 394
>PRK09628 oorB 2-oxoglutarate-acceptor oxidoreductase subunit OorB; Reviewed
Probab=22.46 E-value=2.1e+02 Score=18.65 Aligned_cols=39 Identities=15% Similarity=-0.022 Sum_probs=27.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
+.+++.+.|+.|-. ......+.. +.+.++++.++++.|+
T Consensus 86 dr~VV~i~GDG~f~--~~g~~el~t-a~r~nlpi~iIV~NN~ 124 (277)
T PRK09628 86 DKHVIVVSGDGDGL--AIGGNHTIH-GCRRNIDLNFILINNF 124 (277)
T ss_pred CCeEEEEECchHHH--HhhHHHHHH-HHHhCcCeEEEEEECh
Confidence 57999999999942 122233332 3777999999999874
No 395
>cd04446 DEP_DEPDC4 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in DEPDC4-like proteins. DEPDC4 is a DEP domain containing protein of unknown function.
Probab=22.41 E-value=1.5e+02 Score=16.26 Aligned_cols=23 Identities=17% Similarity=0.247 Sum_probs=19.5
Q ss_pred CCccCHHHHHHHHHHHHHcCCce
Q psy472 38 DDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 38 D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
|..++=.++.+|.++|.+.|+-.
T Consensus 49 ~~~~tR~~Av~l~q~Ll~~gvi~ 71 (95)
T cd04446 49 DVDVPRAKAVRLCQALMDCRVFE 71 (95)
T ss_pred cccCCHHHHHHHHHHHHHcCCee
Confidence 45678899999999999999844
No 396
>PLN02711 Probable galactinol--sucrose galactosyltransferase
Probab=22.40 E-value=1.8e+02 Score=22.06 Aligned_cols=30 Identities=17% Similarity=-0.189 Sum_probs=26.9
Q ss_pred CccCHHHHHHHHHHHHHcCCceEEEEecCC
Q psy472 39 DNVHFQQSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
..|..+-..+-.+.|.+.|++..+++.+|+
T Consensus 225 ~~Vt~egI~~gl~~L~~~Gip~~~vIIDDG 254 (777)
T PLN02711 225 LTVHPQGVWEGVKGLVDGGCPPGLVLIDDG 254 (777)
T ss_pred ccCCHHHHHHHHHHHHhCCCCccEEEEcCC
Confidence 458889999999999999999999998875
No 397
>TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase. RL J Bacteriol 1993 Mar;175(6):1841-3
Probab=22.40 E-value=1.8e+02 Score=18.38 Aligned_cols=32 Identities=16% Similarity=0.165 Sum_probs=20.1
Q ss_pred EEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 30 YLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 30 ~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
++++-|..- -+...+..|++.|.++|..+.+.
T Consensus 3 i~~~~g~~~--g~~~~~~~La~~L~~~g~eV~vv 34 (348)
T TIGR01133 3 VVLAAGGTG--GHIFPALAVAEELIKRGVEVLWL 34 (348)
T ss_pred EEEEeCccH--HHHhHHHHHHHHHHhCCCEEEEE
Confidence 566666553 44446668888888888554443
No 398
>PF13362 Toprim_3: Toprim domain
Probab=22.39 E-value=1.3e+02 Score=15.62 Aligned_cols=38 Identities=16% Similarity=0.109 Sum_probs=28.3
Q ss_pred CcEEEEEeCCCCc-cCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 28 KQYLLVHGTMDDN-VHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 28 ~p~ll~hG~~D~~-V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
.+-+++-++.|.. .-...+..++++|.+.|+.+..+.-
T Consensus 41 ~~~vii~~D~D~~~~G~~~a~~~~~~~~~~g~~~~~~~p 79 (96)
T PF13362_consen 41 GRRVIIAADNDKANEGQKAAEKAAERLEAAGIAVSIVEP 79 (96)
T ss_pred CCeEEEEECCCCchhhHHHHHHHHHHHHhCCCeEEEECC
Confidence 4556888887776 3456777889999999998877643
No 399
>COG4022 Uncharacterized protein conserved in archaea [Function unknown]
Probab=22.37 E-value=1.3e+02 Score=19.67 Aligned_cols=31 Identities=19% Similarity=0.192 Sum_probs=27.2
Q ss_pred CCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 37 MDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 37 ~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.|+.||+..|+-|...|++.-++....+.++
T Consensus 74 ~~~~v~FGaSEimmt~L~~g~lDaaV~vcDg 104 (286)
T COG4022 74 EDTFVPFGASEIMMTGLKRGLLDAAVTVCDG 104 (286)
T ss_pred ecceecccHHHHHHHHHhhcccceEEEEecC
Confidence 6889999999999999999888888777655
No 400
>PF05343 Peptidase_M42: M42 glutamyl aminopeptidase; InterPro: IPR008007 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M42 (glutamyl aminopeptidase family, clan MH). For members of this family and family M28 the predicted metal ligands occur in the same order in the sequence: H, D, E, D/E, H; and the active site residues occur in the motifs HXD and EE. ; PDB: 2WYR_C 2CF4_A 1VHO_A 3ISX_A 3KL9_G 1YLO_F 3CPX_C 1VHE_A 2GRE_F 1XFO_A ....
Probab=22.27 E-value=2.3e+02 Score=18.42 Aligned_cols=37 Identities=11% Similarity=-0.003 Sum_probs=27.6
Q ss_pred EEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 31 LLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 31 ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.|..++.-.+.+..-...+.+..++.++++++.++.+
T Consensus 211 ~i~~~D~~~i~~~~l~~~l~~~A~~~~Ip~Q~~~~~~ 247 (292)
T PF05343_consen 211 VIRVGDSSMIPNPKLVDKLREIAEENGIPYQREVFSG 247 (292)
T ss_dssp EEEEEETTEESHHHHHHHHHHHHHHTT--EEEEEESS
T ss_pred EEEEccCCCCCCHHHHHHHHHHHHHcCCCeEEEecCC
Confidence 4455556678888888999999999999999976654
No 401
>cd03143 A4_beta-galactosidase_middle_domain A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to beta-galactosidase from Thermus thermophilus. Beta-Galactosidase hydrolyzes the beta-1,4-D-galactosidic linkage of lactose, as well as those of related chromogens, o-nitrophenyl-beta-D-galactopyranoside (ONP-Gal) and 5-bromo-4-chloro-3-indolyl-beta-D-galactoside (X-gal). This A4 beta-galactosidase middle domain lacks the catalytic triad of typical GATase1 domains. The reactive Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in typical GATase1 domains is not conserved in this group.
Probab=22.24 E-value=1.6e+02 Score=16.62 Aligned_cols=24 Identities=13% Similarity=0.110 Sum_probs=19.5
Q ss_pred ccCHHHHHHHHHHHHHcCCceEEE
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.-.......++++|.+.|+++++.
T Consensus 22 ~~y~~~~~~~~~~l~~~gi~~d~v 45 (154)
T cd03143 22 LRYLDLALALYRALRELGIPVDVV 45 (154)
T ss_pred cCHHHHHHHHHHHHHHCCCCEEEE
Confidence 334578888999999999998875
No 402
>cd03809 GT1_mtfB_like This family is most closely related to the GT1 family of glycosyltransferases. mtfB (mannosyltransferase B) in E. coli has been shown to direct the growth of the O9-specific polysaccharide chain. It transfers two mannoses into the position 3 of the previously synthesized polysaccharide.
Probab=22.19 E-value=2e+02 Score=17.71 Aligned_cols=40 Identities=13% Similarity=0.087 Sum_probs=31.7
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
+...+++-|.-...=.+....+....+.+.+.++++++..
T Consensus 194 ~~~~i~~~G~~~~~K~~~~~l~~~~~~~~~~~~~~l~i~G 233 (365)
T cd03809 194 PRPYFLYVGTIEPRKNLERLLEAFARLPAKGPDPKLVIVG 233 (365)
T ss_pred CCCeEEEeCCCccccCHHHHHHHHHHHHHhcCCCCEEEec
Confidence 4678888898887777888888888888887777777764
No 403
>PF05990 DUF900: Alpha/beta hydrolase of unknown function (DUF900); InterPro: IPR010297 This domain is associated with proteins of unknown function, which are hydrolase-like.
Probab=22.18 E-value=2.1e+02 Score=17.81 Aligned_cols=37 Identities=16% Similarity=0.086 Sum_probs=23.6
Q ss_pred CCcEEEEEeCCCCccCHHHH-HHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQS-MMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s-~~l~~~L~~~g~~~~~~~~~ 66 (68)
+.=++++|| -++.++.+ .++++-....+.+...+.|.
T Consensus 18 ~~vlvfVHG---yn~~f~~a~~r~aql~~~~~~~~~~i~Fs 55 (233)
T PF05990_consen 18 KEVLVFVHG---YNNSFEDALRRAAQLAHDLGFPGVVILFS 55 (233)
T ss_pred CeEEEEEeC---CCCCHHHHHHHHHHHHHHhCCCceEEEEE
Confidence 467889999 46677666 44555555555665555553
No 404
>TIGR00624 tag DNA-3-methyladenine glycosylase I. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=22.13 E-value=53 Score=20.13 Aligned_cols=24 Identities=21% Similarity=0.337 Sum_probs=18.7
Q ss_pred CCCccCHHH--HHHHHHHHHHcCCce
Q psy472 37 MDDNVHFQQ--SMMLAKSLQHADIMF 60 (68)
Q Consensus 37 ~D~~V~~~~--s~~l~~~L~~~g~~~ 60 (68)
....||... |..|.+.|+++|..+
T Consensus 128 ~~~~~p~~t~~S~~lskdLKkrGfkF 153 (179)
T TIGR00624 128 TDSEIPSSTPESKAMSKELKKRGFRF 153 (179)
T ss_pred ccccCCCCCHHHHHHHHHHHHcCCee
Confidence 445667755 999999999999754
No 405
>PRK05782 bifunctional sirohydrochlorin cobalt chelatase/precorrin-8X methylmutase; Validated
Probab=22.12 E-value=2.3e+02 Score=19.10 Aligned_cols=28 Identities=18% Similarity=0.197 Sum_probs=19.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHc
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHA 56 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~ 56 (68)
.=+|+-||+.|+. .-+....|++.|+++
T Consensus 8 aiLLvgHGSRdp~-~~~~~~~La~~l~~~ 35 (335)
T PRK05782 8 AIILIGHGSRRET-FNSDMEGMANYLKEK 35 (335)
T ss_pred eEEEEecCCCChH-HHHHHHHHHHHHHhc
Confidence 4478889999984 335666677777653
No 406
>COG4947 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.03 E-value=37 Score=21.31 Aligned_cols=42 Identities=17% Similarity=0.177 Sum_probs=29.4
Q ss_pred hcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 23 DRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 23 ~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
++++...+.+.-|..|+-.+ +..+|.+.|.++.++..+.++.
T Consensus 169 ~rlr~~~~vfc~G~e~~~L~--~~~~L~~~l~dKqipaw~~~Wg 210 (227)
T COG4947 169 ERLRRIDMVFCIGDEDPFLD--NNQHLSRLLSDKQIPAWMHVWG 210 (227)
T ss_pred HHHhhccEEEEecCcccccc--chHHHHHHhccccccHHHHHhc
Confidence 33434567788888877765 4567888889999887765543
No 407
>COG1609 PurR Transcriptional regulators [Transcription]
Probab=22.02 E-value=2.4e+02 Score=18.43 Aligned_cols=33 Identities=12% Similarity=0.242 Sum_probs=24.1
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
.++.++.|..+....-.--.-+.++|.++|+.+
T Consensus 176 ~~i~~i~~~~~~~~~~~R~~Gf~~al~~~~~~~ 208 (333)
T COG1609 176 RRIAFIGGPLDSSASRERLEGYRAALREAGLPI 208 (333)
T ss_pred ceEEEEeCCCccccHhHHHHHHHHHHHHCCCCC
Confidence 677777777766666666677778888888775
No 408
>cd01524 RHOD_Pyr_redox Member of the Rhodanese Homology Domain superfamily. Included in this CD are the Lactococcus lactis NADH oxidase, Bacillus cereus NADH dehydrogenase, and Bacteroides thetaiotaomicron pyridine nucleotide-disulphide oxidoreductase, and similar rhodanese-like domains found C-terminal of the pyridine nucleotide-disulphide oxidoreductase (Pyr-redox) domain and the Pyr-redox dimerization domain.
Probab=22.00 E-value=1.3e+02 Score=15.29 Aligned_cols=22 Identities=9% Similarity=0.137 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHcCCceEEEEecCC
Q psy472 45 QSMMLAKSLQHADIMFQSQECYRR 68 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~~~~ 68 (68)
.+...+..|++.|. +...+.||
T Consensus 63 ~a~~~a~~L~~~G~--~v~~l~GG 84 (90)
T cd01524 63 RGYIAARILTQNGF--KVKNLDGG 84 (90)
T ss_pred hHHHHHHHHHHCCC--CEEEecCC
Confidence 46677888999998 66776664
No 409
>PF00542 Ribosomal_L12: Ribosomal protein L7/L12 C-terminal domain; InterPro: IPR013823 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This entry represents the C-terminal domain of the large subunit ribosomal proteins, known as the L7/L12 family. L7/L12 is present in each 50S subunit in four copies organised as two dimers. The L8 protein complex consisting of two dimers of L7/L12 and L10 in Escherichia coli ribosomes is assembled on the conserved region of 23 S rRNA termed the GTPase-associated domain []. The L7/L12 dimer probably interacts with EF-Tu. L7 and L12 only differ in a single post translational modification of the addition of an acetyl group to the N terminus of L7.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1DD4_B 1DD3_A 1RQU_B 2GYA_5 2GYC_5 1RQS_A 1RQV_A 1CTF_A 2XUX_L.
Probab=21.88 E-value=1.3e+02 Score=15.31 Aligned_cols=31 Identities=16% Similarity=0.185 Sum_probs=24.8
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
+.|..+..| |+-+.+..+.++|.+.|..+++
T Consensus 37 ~~p~~ik~~-----v~keeAe~ik~~Le~aGa~v~l 67 (68)
T PF00542_consen 37 SLPKVIKEG-----VSKEEAEEIKKKLEAAGAKVEL 67 (68)
T ss_dssp TTTEEEEEE-----E-HHHHHHHHHHHHCCT-EEEE
T ss_pred hCCHHHHcC-----CCHHHHHHHHHHHHHcCCEEEe
Confidence 467777777 8999999999999999988775
No 410
>PF13361 UvrD_C: UvrD-like helicase C-terminal domain; PDB: 1UAA_B 3U4Q_A 3U44_A 1W36_B 3K70_B 2IS6_B 3LFU_A 2IS2_B 2IS1_B 2IS4_A ....
Probab=21.88 E-value=1.2e+02 Score=18.90 Aligned_cols=20 Identities=20% Similarity=0.232 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHcCCceEE
Q psy472 43 FQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~ 62 (68)
-.+...+.+.|.+.|+++..
T Consensus 87 ~~~~~~i~~~L~~~gIp~~~ 106 (351)
T PF13361_consen 87 NSQIKEIEDALKEAGIPYRI 106 (351)
T ss_dssp GGHHHHHHHHHHHTTS-EEE
T ss_pred chhHHHHHHHHhhhcceeEe
Confidence 46788899999999999743
No 411
>TIGR02617 tnaA_trp_ase tryptophanase, leader peptide-associated. Members of this family belong to the beta-eliminating lyase family (pfam01212) and act as tryptophanase (L-tryptophan indole-lyase). The tryptophanases of this family, as a rule, are found with a tryptophanase leader peptide (TnaC) encoded upstream. Both tryptophanases (4.1.99.1) and tyrosine phenol-lyases (EC 4.1.99.2) are found between trusted and noise cutoffs, but this model captures nearly all tryptophanases for which the leader peptide gene tnaC can be found upstream.
Probab=21.88 E-value=1.4e+02 Score=21.15 Aligned_cols=20 Identities=20% Similarity=0.330 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHcCCceEE
Q psy472 43 FQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~ 62 (68)
+.|..-|.++|.++|+++..
T Consensus 331 i~qv~yl~~~L~~~Gvpi~~ 350 (467)
T TIGR02617 331 INQVQYLVNGLEEIGVVCQQ 350 (467)
T ss_pred HHHHHHHHHHHHhCCCcEEe
Confidence 46778899999999999853
No 412
>TIGR00823 EIIA-LAC phosphotransferase system enzyme II, lactose-specific, factor III. operon. While the Lac permeases consist of two polypeptide chains (IIA and IICB), the Chb permease of E. coli consists of three (IIA, IIB and IIC). In B. subtilis, a PTS permease similar to the Chb permease of E. coli is believed to transport lichenan (a b-1,3;1,4 glucan) degradation products, oligosaccharides of 2-4 glucose units. This model is specific for the IIA subunit of the Lac PTS family.
Probab=21.81 E-value=57 Score=17.95 Aligned_cols=28 Identities=18% Similarity=0.112 Sum_probs=21.4
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHH
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQH 55 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~ 55 (68)
...+|++|. +|......-...|++.+..
T Consensus 68 ~~slLlvHA-QDhLMta~~~~~la~elI~ 95 (99)
T TIGR00823 68 EVSLTMVHA-QDHLMTTMLLKDLIAELIE 95 (99)
T ss_pred ccceeehhH-HHHHHHHHHHHHHHHHHHH
Confidence 477888888 8888777777777777654
No 413
>PRK09864 putative peptidase; Provisional
Probab=21.79 E-value=2.7e+02 Score=18.88 Aligned_cols=42 Identities=12% Similarity=0.039 Sum_probs=33.0
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 25 IRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 25 ~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
+.+.|++-+. +.=.+.+..-...+.+..++.|+++++.+.++
T Consensus 247 lG~Gp~i~~~-D~~~i~~~~l~~~l~~~A~~~~Ip~Q~~~~~~ 288 (356)
T PRK09864 247 LGQGPGLMLF-DKRYFPNQKLVAALKSCAAHNDLPLQFSTMKT 288 (356)
T ss_pred cCCCCeEEEc-cCCccCCHHHHHHHHHHHHHcCCCceEEEcCC
Confidence 3357888544 34577888888899999999999999988764
No 414
>PF13471 Transglut_core3: Transglutaminase-like superfamily
Probab=21.63 E-value=1.6e+02 Score=16.12 Aligned_cols=22 Identities=14% Similarity=0.146 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHcCCceEEEE
Q psy472 43 FQQSMMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 43 ~~~s~~l~~~L~~~g~~~~~~~ 64 (68)
..+|..+..-|...|++.++++
T Consensus 59 L~~ala~~~~L~~~gi~~~l~i 80 (117)
T PF13471_consen 59 LPRALALQRLLRRRGIPATLVI 80 (117)
T ss_pred HHHHHHHHHHHHhcCCCcEEEE
Confidence 4688888888999999888764
No 415
>PRK10329 glutaredoxin-like protein; Provisional
Probab=21.60 E-value=1.4e+02 Score=15.40 Aligned_cols=21 Identities=10% Similarity=0.177 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHcCCceEEEEe
Q psy472 45 QSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~ 65 (68)
.+.+.-+.|.+.|++|+.+-.
T Consensus 13 ~C~~ak~~L~~~gI~~~~idi 33 (81)
T PRK10329 13 QCHATKRAMESRGFDFEMINV 33 (81)
T ss_pred hHHHHHHHHHHCCCceEEEEC
Confidence 344566677889999886643
No 416
>cd01439 TCCD_inducible_PARP_like Poly(ADP-ribose) polymerases catalyse the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. 2,3,7,8-Tetrachlorodibenzo-p-dioxin (TCDD) causes pleotropic effects in mammalian species through modulating gene expression. TCCD indicible PARP (TiPARP) is a target of TCDD that may contribute to multiple responses to TCDD by modulating protein function through poly ADP-ribosylation
Probab=21.54 E-value=25 Score=19.83 Aligned_cols=12 Identities=58% Similarity=0.825 Sum_probs=8.0
Q ss_pred EEEEeCCCCccC
Q psy472 31 LLVHGTMDDNVH 42 (68)
Q Consensus 31 ll~hG~~D~~V~ 42 (68)
|||||+.-..+.
T Consensus 1 ~LfHGt~~~~~~ 12 (121)
T cd01439 1 LLFHGTSADAVE 12 (121)
T ss_pred CcccccChhhHH
Confidence 478888765543
No 417
>PRK05434 phosphoglyceromutase; Provisional
Probab=21.51 E-value=3.1e+02 Score=19.60 Aligned_cols=49 Identities=12% Similarity=0.075 Sum_probs=38.3
Q ss_pred hHHhhcCCCCcEEEEEeCCCCccCH--HHHHHHHHHHHHcCC-ceEEEEecCC
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNVHF--QQSMMLAKSLQHADI-MFQSQECYRR 68 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V~~--~~s~~l~~~L~~~g~-~~~~~~~~~~ 68 (68)
+.++++. +..+-|+-=-.|--||- .|-..|.+.++++|+ .+-+|.|-+|
T Consensus 102 ~~~~~~~-~~~lHl~GL~SdggVHsh~~hl~~l~~~a~~~g~~~v~vH~~~DG 153 (507)
T PRK05434 102 IDKAKKN-GGALHLMGLLSDGGVHSHIDHLFALLELAKEEGVKKVYVHAFLDG 153 (507)
T ss_pred HHHHHhc-CCeEEEEEeccCCCcccHHHHHHHHHHHHHHcCCCEEEEEEecCC
Confidence 3445444 46777777778888876 888889999999998 8889988765
No 418
>PRK12475 thiamine/molybdopterin biosynthesis MoeB-like protein; Provisional
Probab=21.50 E-value=1.4e+02 Score=19.85 Aligned_cols=19 Identities=26% Similarity=0.434 Sum_probs=12.5
Q ss_pred EEEEeCCCCccCHHHHHHHHHHH
Q psy472 31 LLVHGTMDDNVHFQQSMMLAKSL 53 (68)
Q Consensus 31 ll~hG~~D~~V~~~~s~~l~~~L 53 (68)
.++||+.| +..+..++..+
T Consensus 318 ~~~~g~~~----~~~a~~~~~~~ 336 (338)
T PRK12475 318 AFIHGTND----IKKAKRLYARY 336 (338)
T ss_pred EEEECCCC----HHHHHHHHHHh
Confidence 35899988 45666666544
No 419
>KOG2650|consensus
Probab=21.43 E-value=1.1e+02 Score=21.27 Aligned_cols=31 Identities=16% Similarity=0.063 Sum_probs=27.7
Q ss_pred eCCCCccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 35 GTMDDNVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 35 G~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
...|-.|++.....+.+-|+..++.+++++.
T Consensus 63 ~~~di~V~~~~~~~~~~~L~~~~i~~~v~i~ 93 (418)
T KOG2650|consen 63 QPVDILVPPEDLAAFKAFLKSANISYEVLIE 93 (418)
T ss_pred CCceEEECHHHHHHHHHHHHhcCCceEEEhh
Confidence 5679999999999999999999999997654
No 420
>PF00258 Flavodoxin_1: Flavodoxin; InterPro: IPR008254 This domain is found in a number of proteins including flavodoxin and nitric-oxide synthase. Flavodoxins are electron-transfer proteins that function in various electron transport systems. They bind one FMN molecule, which serves as a redox-active prosthetic group [] and are functionally interchangeable with ferredoxins. They have been isolated from prokaryotes, cyanobacteria, and some eukaryotic algae. Nitric oxide synthase (1.14.13.39 from EC) produces nitric oxide from L-arginie and NADPH. Nitric oxide acts as a messenger molecule in the body.; GO: 0010181 FMN binding, 0016491 oxidoreductase activity; PDB: 2WC1_A 2FVX_A 2FOX_A 6NUL_A 1FVX_A 2FAX_A 1FLN_A 1FLA_A 4NLL_A 2FDX_A ....
Probab=21.38 E-value=1.4e+02 Score=16.50 Aligned_cols=21 Identities=10% Similarity=0.177 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHcCCceEEEEe
Q psy472 45 QSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~ 65 (68)
-+..+++.|+++|++++++-.
T Consensus 13 ~A~~ia~~l~~~g~~~~~~~~ 33 (143)
T PF00258_consen 13 MAEAIAEGLRERGVEVRVVDL 33 (143)
T ss_dssp HHHHHHHHHHHTTSEEEEEEG
T ss_pred HHHHHHHHHHHcCCceeeech
Confidence 356677888888888776543
No 421
>PF13353 Fer4_12: 4Fe-4S single cluster domain; PDB: 3C8F_A 3CB8_A 3T7V_A 2YX0_A 3CAN_A.
Probab=21.33 E-value=1.6e+02 Score=16.10 Aligned_cols=45 Identities=4% Similarity=0.008 Sum_probs=32.6
Q ss_pred hHHhhcCCCCcEEEEEeCCCCcc--CHHHHHHHHHHHHHcCCceEEEEe
Q psy472 19 NNKVDRIRDKQYLLVHGTMDDNV--HFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 19 ~~~~~~~~~~p~ll~hG~~D~~V--~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
+..+... ....+.++| .++.. .......+.+.+++.+.......+
T Consensus 45 i~~~~~~-~~~~i~l~G-GEPll~~~~~~l~~i~~~~k~~~~~~~~~~t 91 (139)
T PF13353_consen 45 IEELKNY-GIKGIVLTG-GEPLLHENYDELLEILKYIKEKFPKKIIILT 91 (139)
T ss_dssp CHHHCCC-CCCEEEEEC-STGGGHHSHHHHHHHHHHHHHTT-SEEEEEE
T ss_pred hhHHhcC-CceEEEEcC-CCeeeeccHhHHHHHHHHHHHhCCCCeEEEE
Confidence 4445455 478888888 88999 789999999999999884333333
No 422
>PLN00103 isocitrate dehydrogenase (NADP+); Provisional
Probab=21.27 E-value=2.9e+02 Score=19.26 Aligned_cols=37 Identities=5% Similarity=-0.081 Sum_probs=29.0
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+|+.++||+. +-.+-+..+.++|...++++++..|+
T Consensus 9 ~~p~~~~~Gd~---~~~~~~~~~~~~~~~~~~~i~~~~~d 45 (410)
T PLN00103 9 ANPIVEMDGDE---MTRVIWKSIKDKLIFPFLDLDIKYFD 45 (410)
T ss_pred cCCeEEecCCc---chHHHHHHHHHHHhcCCCCeEEEEEc
Confidence 48999999955 33455667788888999999988875
No 423
>TIGR00182 plsX fatty acid/phospholipid synthesis protein PlsX. This protein of fatty acid/phospholipid biosynthesis, called PlsX after the member in Streptococcus pneumoniae, is proposed to be a phosphate acyltransferase that partners with PlsY (TIGR00023) in a two-step 1-acylglycerol-3-phosphate biosynthesis pathway alternative to the one-step PlsB (EC 2.3.1.15) pathway.
Probab=21.22 E-value=1.3e+02 Score=20.03 Aligned_cols=32 Identities=19% Similarity=0.349 Sum_probs=26.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCc
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIM 59 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~ 59 (68)
+..++..||..|.. -+.++...+..+.++++.
T Consensus 280 ~g~vvk~HG~s~~~-a~~~ai~~a~~~~~~~~~ 311 (322)
T TIGR00182 280 NKLVIKSHGSSDSR-AFFSAIRQAHEAVKSQVI 311 (322)
T ss_pred CceEEEEcCCCCHH-HHHHHHHHHHHHHHhCHH
Confidence 58899999999884 788888888888777753
No 424
>cd06589 GH31 The enzymes of glycosyl hydrolase family 31 (GH31) occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes
Probab=21.09 E-value=1.8e+02 Score=18.39 Aligned_cols=29 Identities=7% Similarity=0.028 Sum_probs=23.9
Q ss_pred CccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 39 DNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 39 ~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
....-+....+++.++++++|++.+....
T Consensus 19 ~~~~~~~v~~~~~~~~~~~iP~d~~~lD~ 47 (265)
T cd06589 19 GYGDQDKVLEVIDGMRENDIPLDGFVLDD 47 (265)
T ss_pred CCCCHHHHHHHHHHHHHcCCCccEEEECc
Confidence 34566788999999999999999887654
No 425
>PF11775 CobT_C: Cobalamin biosynthesis protein CobT VWA domain
Probab=21.05 E-value=1.8e+02 Score=18.50 Aligned_cols=22 Identities=14% Similarity=0.179 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHcCCceEEEEec
Q psy472 45 QSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 45 ~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.+..|+++|.+.|+++|..=|.
T Consensus 36 ~adila~aL~~~gvp~EVlGFt 57 (219)
T PF11775_consen 36 CADILARALERCGVPVEVLGFT 57 (219)
T ss_pred HHHHHHHHHHhCCCCeEEEeee
Confidence 4567999999999999987663
No 426
>KOG4081|consensus
Probab=20.98 E-value=1.8e+02 Score=16.66 Aligned_cols=28 Identities=11% Similarity=0.065 Sum_probs=21.0
Q ss_pred CCCccCHHHH--HHHHHHHHHcCCceEEEE
Q psy472 37 MDDNVHFQQS--MMLAKSLQHADIMFQSQE 64 (68)
Q Consensus 37 ~D~~V~~~~s--~~l~~~L~~~g~~~~~~~ 64 (68)
.|....+.+. ++..++|.+.|.++.+++
T Consensus 37 h~k~~qw~~~vVeqil~~LvKl~k~yKyiV 66 (115)
T KOG4081|consen 37 HDKVNQWTQKVVEQILTALVKLGKPYKYIV 66 (115)
T ss_pred cchHHHHHHHHHHHHHHHHHHcCCCeEEEE
Confidence 3555666544 678889999999998875
No 427
>PLN02834 3-dehydroquinate synthase
Probab=20.87 E-value=3e+02 Score=19.09 Aligned_cols=37 Identities=14% Similarity=0.023 Sum_probs=23.9
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEE--EEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQS--QECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~--~~~~~ 67 (68)
.+++++. |..|.-.....+.+.|.+.|+.+.. .+|++
T Consensus 101 ~rvlIVt---D~~v~~~~~~~v~~~L~~~g~~~~v~~~v~~~ 139 (433)
T PLN02834 101 KRVLVVT---NETVAPLYLEKVVEALTAKGPELTVESVILPD 139 (433)
T ss_pred CEEEEEE---CccHHHHHHHHHHHHHHhcCCceEEEEEEecC
Confidence 4556655 5555555666778888888876655 45554
No 428
>COG4635 HemG Flavodoxin [Energy production and conversion / Coenzyme metabolism]
Probab=20.80 E-value=2.2e+02 Score=17.53 Aligned_cols=34 Identities=21% Similarity=0.285 Sum_probs=25.1
Q ss_pred cEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 29 QYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 29 p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.+||..++.|-+-- .=+..++..|.+.|..+++.
T Consensus 2 k~LIlYstr~GqT~-kIA~~iA~~L~e~g~qvdi~ 35 (175)
T COG4635 2 KTLILYSTRDGQTR-KIAEYIASHLRESGIQVDIQ 35 (175)
T ss_pred ceEEEEecCCCcHH-HHHHHHHHHhhhcCCeeeee
Confidence 47888888887543 33556788899999888764
No 429
>PF04157 EAP30: EAP30/Vps36 family; InterPro: IPR007286 EAP30 is a subunit of the ELL complex. The ELL is an 80kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of transcription elongation, but is unable to repress initiation of transcription by RNA polymerase II as is the case of ELL. EAP30 is thought to lead to the derepression of ELL's transcriptional inhibitory activity. ; PDB: 2ZME_A 3CUQ_A 1W7P_D 1U5T_B.
Probab=20.73 E-value=1.8e+02 Score=18.00 Aligned_cols=28 Identities=11% Similarity=0.055 Sum_probs=23.9
Q ss_pred ccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.|.++...+-.+.|...|.++++.-|+.
T Consensus 132 lISp~Di~~A~~~l~~lg~g~~l~~~~s 159 (223)
T PF04157_consen 132 LISPEDILRACKLLEVLGLGFRLRKFGS 159 (223)
T ss_dssp T--HHHHHHHHHHHCCCTSSEEEEEETT
T ss_pred CcCHHHHHHHHHHHHHcCCCeEEEEeCC
Confidence 8899999999999999999999998873
No 430
>KOG3975|consensus
Probab=20.73 E-value=1.2e+02 Score=20.23 Aligned_cols=36 Identities=14% Similarity=0.068 Sum_probs=28.4
Q ss_pred hhcCCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCC
Q psy472 22 VDRIRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADI 58 (68)
Q Consensus 22 ~~~~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~ 58 (68)
++.. -.-+.+..|+.|.-||.+....+-+.+.+..+
T Consensus 238 ~een-~d~l~Fyygt~DgW~p~~~~d~~kdd~~eed~ 273 (301)
T KOG3975|consen 238 CEEN-LDSLWFYYGTNDGWVPSHYYDYYKDDVPEEDL 273 (301)
T ss_pred HHhc-CcEEEEEccCCCCCcchHHHHHHhhhcchhce
Confidence 4443 36788999999999999999999887766544
No 431
>cd08195 DHQS Dehydroquinate synthase (DHQS) catalyzes the conversion of DAHP to DHQ in shikimate pathway for aromatic compounds synthesis. Dehydroquinate synthase (DHQS) catalyzes the conversion of 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP) to dehydroquinate (DHQ) in the second step of the shikimate pathway. This pathway, which involves seven sequential enzymatic steps in the conversion of erythrose 4-phosphate and phosphoenolpyruvate into chorismate for subsequent synthesis of aromatic compounds, is found in bacteria, microbial eukaryotes, and plants, but not in mammals. Therefore, enzymes of this pathway are attractive targets for the development of non-toxic antimicrobial compounds, herbicides and anti-parasitic agents. The activity of DHQS requires nicotinamide adenine dinucleotide (NAD) as cofactor. A single active site in DHQS catalyzes five sequential reactions involving alcohol oxidation, phosphate elimination, carbonyl reduction, ring opening, and intramolecular aldol
Probab=20.64 E-value=2.6e+02 Score=18.37 Aligned_cols=37 Identities=22% Similarity=0.136 Sum_probs=24.4
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+++++. |..+.-.....+.+.|.+.|+.+...+|++
T Consensus 25 ~~~livt---d~~~~~~~~~~l~~~L~~~g~~~~~~~~~~ 61 (345)
T cd08195 25 SKILIVT---DENVAPLYLEKLKAALEAAGFEVEVIVIPA 61 (345)
T ss_pred CeEEEEE---CCchHHHHHHHHHHHHHhcCCceEEEEeCC
Confidence 4566665 344443556677777888888877777764
No 432
>KOG3101|consensus
Probab=20.55 E-value=1.1e+02 Score=19.87 Aligned_cols=35 Identities=17% Similarity=0.296 Sum_probs=21.7
Q ss_pred CCHHHHHhcChhHHhhcCC--CCcEEEEEeCCCCccC
Q psy472 8 DNLEGYKIAALNNKVDRIR--DKQYLLVHGTMDDNVH 42 (68)
Q Consensus 8 ~~~~~y~~~sp~~~~~~~~--~~p~ll~hG~~D~~V~ 42 (68)
++++.|+.-++..-+++.+ ..-+||=+|..|.-..
T Consensus 194 ~~ka~W~~yDat~lik~y~~~~~~ilIdqG~~D~Fl~ 230 (283)
T KOG3101|consen 194 DNKAQWEAYDATHLIKNYRGVGDDILIDQGAADNFLA 230 (283)
T ss_pred CChHHHhhcchHHHHHhcCCCCccEEEecCccchhhh
Confidence 4555555555544444432 3568888898887766
No 433
>cd05844 GT1_like_7 Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center
Probab=20.53 E-value=2.4e+02 Score=17.78 Aligned_cols=40 Identities=8% Similarity=-0.150 Sum_probs=27.3
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEEEecC
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECYR 67 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~~ 67 (68)
.+.+++-|.....=......+....|.+.+.++.+.+..+
T Consensus 188 ~~~i~~~G~~~~~K~~~~li~a~~~l~~~~~~~~l~ivG~ 227 (367)
T cd05844 188 PPRILFVGRFVEKKGPLLLLEAFARLARRVPEVRLVIIGD 227 (367)
T ss_pred CcEEEEEEeeccccChHHHHHHHHHHHHhCCCeEEEEEeC
Confidence 5577777877666666666666666766667777777643
No 434
>cd01822 Lysophospholipase_L1_like Lysophospholipase L1-like subgroup of SGNH-hydrolases. The best characterized member in this family is TesA, an E. coli periplasmic protein with thioesterase, esterase, arylesterase, protease and lysophospholipase activity.
Probab=20.34 E-value=1.8e+02 Score=16.36 Aligned_cols=34 Identities=15% Similarity=0.041 Sum_probs=21.7
Q ss_pred cEEEEEeCCCC--ccCHHH----HHHHHHHHHHcCCceEE
Q psy472 29 QYLLVHGTMDD--NVHFQQ----SMMLAKSLQHADIMFQS 62 (68)
Q Consensus 29 p~ll~hG~~D~--~V~~~~----s~~l~~~L~~~g~~~~~ 62 (68)
=++|.-|++|. .++.++ ...+.+++++.+..+-+
T Consensus 67 ~v~i~~G~ND~~~~~~~~~~~~~l~~li~~~~~~~~~vil 106 (177)
T cd01822 67 LVILELGGNDGLRGIPPDQTRANLRQMIETAQARGAPVLL 106 (177)
T ss_pred EEEEeccCcccccCCCHHHHHHHHHHHHHHHHHCCCeEEE
Confidence 47777888884 344543 35677777777665544
No 435
>cd04702 ASRGL1_like ASRGL1_like domains, a subfamily of the L-Asparaginase type 2-like enzymes. The wider family includes Glycosylasparaginase, Taspase 1 and L-Asparaginase type 2 enzymes. The proenzymes undergo autoproteolytic cleavage before a threonine to generate alpha and beta subunits. The threonine becomes the N-terminal residue of the beta subunit and is the catalytic residue. ASRGL1, or asparaginase-like 1, has been cloned from mammalian testis cDNA libraries. It has been identified as a sperm antigen that may induce the production of autoantibodies following obstruction of the male reproductive tract, e.g. vasectomy.
Probab=20.31 E-value=2.3e+02 Score=18.53 Aligned_cols=18 Identities=17% Similarity=0.281 Sum_probs=12.1
Q ss_pred CcEEEEEeCCCCccCHHHH
Q psy472 28 KQYLLVHGTMDDNVHFQQS 46 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s 46 (68)
.|++++||..=.. +.+..
T Consensus 1 ~p~i~vHgGAG~~-~~~~~ 18 (261)
T cd04702 1 RPVIIVHGGAGTI-PDERV 18 (261)
T ss_pred CcEEEEEcCCCCC-chhHH
Confidence 3789999988543 44433
No 436
>PF13104 DUF3956: Protein of unknown function (DUF3956)
Probab=20.29 E-value=65 Score=15.08 Aligned_cols=33 Identities=18% Similarity=0.186 Sum_probs=25.5
Q ss_pred CcEEEEEeCCCCccCHHHHHHHHHHHHHcCCce
Q psy472 28 KQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 28 ~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~ 60 (68)
-|.+++.=..=++.-.+-|.+.+-+|+..|++.
T Consensus 11 qp~lv~svagieiarleislqval~l~~lgipi 43 (45)
T PF13104_consen 11 QPFLVVSVAGIEIARLEISLQVALTLIALGIPI 43 (45)
T ss_pred CeeEEEEEeeeEEEEEeeeHHHHHHHHHcCCcc
Confidence 577777666666777788889999999998863
No 437
>cd06597 GH31_transferase_CtsY CtsY (cyclic tetrasaccharide-synthesizing enzyme Y) is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=20.28 E-value=1.9e+02 Score=19.13 Aligned_cols=26 Identities=8% Similarity=-0.142 Sum_probs=21.7
Q ss_pred ccCHHHHHHHHHHHHHcCCceEEEEe
Q psy472 40 NVHFQQSMMLAKSLQHADIMFQSQEC 65 (68)
Q Consensus 40 ~V~~~~s~~l~~~L~~~g~~~~~~~~ 65 (68)
.-.-+..+.+++.++++|+|+..+..
T Consensus 20 Y~~~~ev~~v~~~~~~~~iP~d~i~l 45 (340)
T cd06597 20 WDTQAEVMRQMDAHEEHGIPVTVVVI 45 (340)
T ss_pred CCCHHHHHHHHHHHHHcCCCeeEEEE
Confidence 34567888999999999999998755
No 438
>smart00854 PGA_cap Bacterial capsule synthesis protein PGA_cap. This protein is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein.
Probab=20.23 E-value=1.9e+02 Score=17.79 Aligned_cols=34 Identities=18% Similarity=0.067 Sum_probs=23.7
Q ss_pred CCcEEEEEeCCCC-ccCHHHHHHHHHHHHHcCCce
Q psy472 27 DKQYLLVHGTMDD-NVHFQQSMMLAKSLQHADIMF 60 (68)
Q Consensus 27 ~~p~ll~hG~~D~-~V~~~~s~~l~~~L~~~g~~~ 60 (68)
+.=+++.|...+. ..|......++++|.+.|++.
T Consensus 174 D~vIv~~H~G~e~~~~p~~~~~~~A~~l~~~G~Dv 208 (239)
T smart00854 174 DVVIVSLHWGVEYQYEPTDEQRELAHALIDAGADV 208 (239)
T ss_pred CEEEEEecCccccCCCCCHHHHHHHHHHHHcCCCE
Confidence 3456777776554 355566688999999888754
No 439
>PF03850 Tfb4: Transcription factor Tfb4; InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0006281 DNA repair, 0006355 regulation of transcription, DNA-dependent, 0000439 core TFIIH complex
Probab=20.17 E-value=2.6e+02 Score=18.19 Aligned_cols=42 Identities=14% Similarity=0.167 Sum_probs=33.5
Q ss_pred cCCCCcEEE-EEeCCCCccCHHHHHHHHHHHHHcCCceEEEEec
Q psy472 24 RIRDKQYLL-VHGTMDDNVHFQQSMMLAKSLQHADIMFQSQECY 66 (68)
Q Consensus 24 ~~~~~p~ll-~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~~~~ 66 (68)
.. +.++|+ +.+..|....+-..+...=+.++.+++.+.....
T Consensus 142 ~~-~~RILv~~s~s~d~~~QYi~~MN~iFaAqk~~v~IDv~~L~ 184 (276)
T PF03850_consen 142 SL-KSRILVIVSGSPDSSSQYIPLMNCIFAAQKQKVPIDVCKLG 184 (276)
T ss_pred Cc-CccEEEEEecCCCccHHHHHHHHHHHHHhcCCceeEEEEec
Confidence 45 479999 8999999988887777777779999988776553
No 440
>PLN02297 ribose-phosphate pyrophosphokinase
Probab=20.16 E-value=2.8e+02 Score=18.57 Aligned_cols=38 Identities=8% Similarity=0.039 Sum_probs=27.7
Q ss_pred CCCCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEEE
Q psy472 25 IRDKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQSQ 63 (68)
Q Consensus 25 ~~~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~~ 63 (68)
.+...+++++.+..+ -..-+.+.+..+|++.|..--..
T Consensus 64 vrg~~V~ivqs~~~p-d~lmELLl~~dAlr~~ga~~i~~ 101 (326)
T PLN02297 64 IRGQHVAFLASFSSP-AVIFEQLSVIYALPKLFVASFTL 101 (326)
T ss_pred cCCCeEEEECCCCCC-hHHHHHHHHHHHHHHcCCCEEEE
Confidence 334678999997655 45677788999999999854333
No 441
>TIGR01742 SA_tandem_lipo Staphylococcus tandem lipoproteins. Members of this family are predicted lipoproteins (mostly), found in Staphylococcus aureus in several different tandem clusters in pathogenicity islands. Members are also found, clustered, in Staphylococcus epidermidis.
Probab=20.02 E-value=84 Score=20.49 Aligned_cols=14 Identities=29% Similarity=0.342 Sum_probs=11.9
Q ss_pred CCcEEEEEeCCCCc
Q psy472 27 DKQYLLVHGTMDDN 40 (68)
Q Consensus 27 ~~p~ll~hG~~D~~ 40 (68)
..|.|++||+.|--
T Consensus 210 kapkL~lkG~gdlk 223 (255)
T TIGR01742 210 KAPKLLLKGSGDLK 223 (255)
T ss_pred CCcEEEEeeccCcC
Confidence 59999999999853
No 442
>PRK11274 glcF glycolate oxidase iron-sulfur subunit; Provisional
Probab=20.02 E-value=2.2e+02 Score=19.05 Aligned_cols=36 Identities=14% Similarity=0.230 Sum_probs=30.3
Q ss_pred CCcEEEEEeCCCCccCHHHHHHHHHHHHHcCCceEE
Q psy472 27 DKQYLLVHGTMDDNVHFQQSMMLAKSLQHADIMFQS 62 (68)
Q Consensus 27 ~~p~ll~hG~~D~~V~~~~s~~l~~~L~~~g~~~~~ 62 (68)
+..++++.|-.....-++-...+.+.|++.|+.+.+
T Consensus 166 ~~~V~~f~gC~~~~~~p~~~~a~~~vL~~~Gv~v~~ 201 (407)
T PRK11274 166 ARRVLMLEGCVQPAMSPNINAATARVLDRLGISLVV 201 (407)
T ss_pred CceEEEEccccccccCccHHHHHHHHHHHCCCEEEe
Confidence 357999999888888888888899999999998754
Done!