Query psy11781
Match_columns 1228
No_of_seqs 429 out of 1106
Neff 5.8
Searched_HMMs 46136
Date Fri Aug 16 18:48:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11781.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11781hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03022 MRJP: Major royal jel 100.0 1.4E-68 3E-73 602.3 24.9 284 28-344 1-287 (287)
2 PF03022 MRJP: Major royal jel 100.0 4.2E-67 9E-72 590.3 29.4 283 468-784 1-287 (287)
3 PF08450 SGL: SMP-30/Gluconola 99.2 2.9E-09 6.3E-14 117.6 22.6 233 410-752 1-246 (246)
4 PF08450 SGL: SMP-30/Gluconola 99.0 2.1E-08 4.5E-13 110.9 21.3 232 860-1202 1-246 (246)
5 COG3386 Gluconolactonase [Carb 98.8 6.6E-07 1.4E-11 102.9 22.4 248 417-756 21-280 (307)
6 COG3386 Gluconolactonase [Carb 98.5 6.2E-06 1.3E-10 95.0 21.2 246 867-1205 21-279 (307)
7 PLN02919 haloacid dehalogenase 98.5 7.5E-06 1.6E-10 108.2 22.2 271 409-750 568-876 (1057)
8 COG4257 Vgb Streptogramin lyas 98.3 7.7E-05 1.7E-09 82.7 20.1 250 468-774 62-315 (353)
9 PLN02919 haloacid dehalogenase 98.2 7.2E-05 1.6E-09 99.2 23.1 228 465-750 565-821 (1057)
10 COG4257 Vgb Streptogramin lyas 98.2 0.00012 2.5E-09 81.3 20.2 250 918-1224 62-315 (353)
11 PF10282 Lactonase: Lactonase, 97.5 0.0036 7.8E-08 73.4 17.9 170 468-712 144-321 (345)
12 PF10282 Lactonase: Lactonase, 97.4 0.0079 1.7E-07 70.5 19.6 176 918-1166 144-327 (345)
13 PF06977 SdiA-regulated: SdiA- 97.2 0.038 8.3E-07 62.3 20.7 206 469-750 23-239 (248)
14 PRK11028 6-phosphogluconolacto 97.1 0.032 7E-07 64.4 20.6 204 469-752 81-293 (330)
15 PRK11028 6-phosphogluconolacto 97.1 0.032 7E-07 64.4 20.6 203 920-1203 82-294 (330)
16 PF06977 SdiA-regulated: SdiA- 97.0 0.056 1.2E-06 60.9 19.9 206 919-1200 23-239 (248)
17 TIGR03866 PQQ_ABC_repeats PQQ- 96.8 0.32 6.9E-06 53.8 23.7 225 922-1201 35-267 (300)
18 PF02239 Cytochrom_D1: Cytochr 96.8 0.24 5.2E-06 59.0 23.8 329 502-970 17-358 (369)
19 TIGR02658 TTQ_MADH_Hv methylam 96.7 0.16 3.6E-06 60.0 21.1 236 929-1204 58-321 (352)
20 TIGR03866 PQQ_ABC_repeats PQQ- 96.5 0.44 9.6E-06 52.7 22.4 228 472-751 35-267 (300)
21 TIGR02604 Piru_Ver_Nterm putat 96.5 0.12 2.6E-06 61.4 18.5 75 466-551 12-89 (367)
22 TIGR02658 TTQ_MADH_Hv methylam 96.4 0.45 9.8E-06 56.4 22.9 84 478-568 57-142 (352)
23 TIGR02604 Piru_Ver_Nterm putat 96.4 0.14 3E-06 60.8 18.3 75 916-1001 12-89 (367)
24 KOG4659|consensus 96.1 0.43 9.2E-06 62.6 21.4 187 992-1204 325-555 (1899)
25 COG2706 3-carboxymuconate cycl 96.1 0.24 5.3E-06 57.5 17.8 170 470-715 147-325 (346)
26 COG2706 3-carboxymuconate cycl 96.1 0.22 4.9E-06 57.8 17.4 172 920-1167 147-327 (346)
27 COG3391 Uncharacterized conser 96.0 0.69 1.5E-05 55.3 21.6 193 468-752 74-273 (381)
28 COG3391 Uncharacterized conser 95.7 1.1 2.3E-05 53.7 21.9 192 919-1202 75-273 (381)
29 PRK02888 nitrous-oxide reducta 95.5 0.63 1.4E-05 58.4 19.3 119 1081-1208 326-459 (635)
30 KOG4499|consensus 95.0 0.79 1.7E-05 50.9 15.6 123 611-752 136-275 (310)
31 KOG4659|consensus 94.8 0.09 2E-06 68.4 9.1 118 631-753 412-554 (1899)
32 PF03088 Str_synth: Strictosid 94.8 0.056 1.2E-06 51.7 5.7 54 688-749 2-74 (89)
33 PF02239 Cytochrom_D1: Cytochr 94.6 2.8 6E-05 50.1 20.6 274 858-1189 77-364 (369)
34 PRK02888 nitrous-oxide reducta 94.3 2.1 4.5E-05 54.0 19.0 118 631-757 326-458 (635)
35 PF06433 Me-amine-dh_H: Methyl 94.1 0.16 3.5E-06 59.3 8.5 80 927-1020 248-328 (342)
36 COG3204 Uncharacterized protei 93.9 3.6 7.8E-05 47.4 18.3 67 1136-1207 235-310 (316)
37 COG3204 Uncharacterized protei 93.8 4.1 8.9E-05 46.9 18.5 191 499-757 106-310 (316)
38 PF06433 Me-amine-dh_H: Methyl 93.7 0.25 5.4E-06 57.8 9.2 81 477-571 248-329 (342)
39 PF03088 Str_synth: Strictosid 92.2 0.39 8.5E-06 46.0 6.5 36 531-570 1-53 (89)
40 PF05694 SBP56: 56kDa selenium 92.1 1.6 3.5E-05 52.6 12.9 125 951-1112 222-348 (461)
41 KOG1539|consensus 91.9 2.7 6E-05 53.5 14.9 194 467-719 448-653 (910)
42 KOG1539|consensus 91.4 5 0.00011 51.3 16.3 223 865-1169 416-653 (910)
43 PF07995 GSDH: Glucose / Sorbo 91.3 32 0.00069 40.5 22.6 131 632-776 8-156 (331)
44 KOG4499|consensus 91.2 0.73 1.6E-05 51.1 8.1 64 1135-1200 159-229 (310)
45 PF05694 SBP56: 56kDa selenium 90.6 2.5 5.5E-05 51.0 12.4 118 500-654 221-340 (461)
46 KOG1214|consensus 90.5 0.93 2E-05 57.2 9.0 138 608-772 1010-1151(1289)
47 KOG1520|consensus 89.6 3 6.6E-05 49.5 11.9 70 460-553 110-181 (376)
48 PF01436 NHL: NHL repeat; Int 89.5 0.52 1.1E-05 35.3 3.7 26 685-710 3-28 (28)
49 PF00058 Ldl_recept_b: Low-den 89.1 0.93 2E-05 37.2 5.3 41 1145-1192 1-42 (42)
50 TIGR03606 non_repeat_PQQ dehyd 88.1 25 0.00055 43.3 18.8 73 915-999 27-102 (454)
51 KOG1520|consensus 87.4 4.3 9.3E-05 48.2 11.2 162 530-751 117-283 (376)
52 PF01436 NHL: NHL repeat; Int 87.4 0.91 2E-05 33.9 3.8 26 1135-1160 3-28 (28)
53 TIGR03606 non_repeat_PQQ dehyd 86.9 20 0.00044 44.1 17.0 63 686-752 148-249 (454)
54 TIGR03075 PQQ_enz_alc_DH PQQ-d 86.4 1E+02 0.0022 38.9 33.7 77 951-1041 441-519 (527)
55 PF00058 Ldl_recept_b: Low-den 85.5 1.8 3.9E-05 35.5 5.0 41 695-742 1-42 (42)
56 KOG1214|consensus 85.5 3.3 7.1E-05 52.7 9.3 138 1058-1222 1010-1151(1289)
57 PF05096 Glu_cyclase_2: Glutam 85.4 63 0.0014 37.1 18.8 46 918-973 45-90 (264)
58 PF13360 PQQ_2: PQQ-like domai 84.3 66 0.0014 34.8 21.4 67 477-569 35-102 (238)
59 PF13449 Phytase-like: Esteras 83.8 8 0.00017 45.3 11.4 84 466-566 18-123 (326)
60 PF13360 PQQ_2: PQQ-like domai 83.6 70 0.0015 34.6 22.3 67 927-1019 35-102 (238)
61 PRK11138 outer membrane biogen 81.9 93 0.002 37.2 19.7 57 502-571 131-188 (394)
62 PRK04043 tolB translocation pr 81.3 12 0.00026 45.6 12.0 97 99-205 285-387 (419)
63 PF02333 Phytase: Phytase; In 80.3 45 0.00097 40.3 15.8 85 684-771 208-302 (381)
64 PF07995 GSDH: Glucose / Sorbo 80.2 54 0.0012 38.6 16.5 69 918-1001 2-71 (331)
65 cd00200 WD40 WD40 domain, foun 79.8 86 0.0019 33.1 21.9 76 920-1018 96-171 (289)
66 TIGR03032 conserved hypothetic 79.7 7.6 0.00016 45.3 8.8 89 915-1018 200-296 (335)
67 cd00200 WD40 WD40 domain, foun 79.7 87 0.0019 33.1 21.0 75 470-567 96-170 (289)
68 PF02333 Phytase: Phytase; In 79.4 60 0.0013 39.2 16.5 29 1134-1163 208-237 (381)
69 COG4247 Phy 3-phytase (myo-ino 77.2 1.1E+02 0.0024 35.1 16.4 61 532-592 55-115 (364)
70 PRK11138 outer membrane biogen 77.1 1.7E+02 0.0037 35.0 23.3 57 952-1021 131-188 (394)
71 TIGR03032 conserved hypothetic 76.9 9.9 0.00021 44.4 8.7 88 465-568 200-296 (335)
72 PRK05137 tolB translocation pr 76.8 1.2E+02 0.0026 36.8 18.7 61 950-1020 269-330 (435)
73 PRK04792 tolB translocation pr 76.3 99 0.0021 38.0 17.8 31 100-130 315-346 (448)
74 PRK04043 tolB translocation pr 75.8 34 0.00074 41.7 13.6 125 59-207 211-340 (419)
75 PRK04792 tolB translocation pr 74.4 30 0.00066 42.4 12.7 123 59-206 240-370 (448)
76 PRK02889 tolB translocation pr 73.7 76 0.0017 38.6 15.9 98 102-214 207-314 (427)
77 PRK00178 tolB translocation pr 73.1 39 0.00085 40.8 13.2 99 101-215 297-405 (430)
78 PF06739 SBBP: Beta-propeller 69.6 5.4 0.00012 32.1 3.2 31 1124-1154 3-33 (38)
79 PF06739 SBBP: Beta-propeller 68.3 5.4 0.00012 32.2 2.9 31 674-704 3-33 (38)
80 COG3823 Glutamine cyclotransfe 68.2 1.2E+02 0.0026 34.0 13.7 61 1144-1207 185-254 (262)
81 PF05787 DUF839: Bacterial pro 67.7 13 0.00029 46.6 7.6 69 1136-1205 352-458 (524)
82 PRK01029 tolB translocation pr 66.7 2.8E+02 0.0061 33.9 18.6 28 994-1021 200-228 (428)
83 PRK03629 tolB translocation pr 66.6 3.1E+02 0.0066 33.5 19.3 60 951-1020 267-327 (429)
84 PRK04922 tolB translocation pr 65.9 72 0.0016 38.8 13.4 99 101-215 302-410 (433)
85 PF05787 DUF839: Bacterial pro 65.9 15 0.00033 46.0 7.7 70 684-754 350-457 (524)
86 PF05096 Glu_cyclase_2: Glutam 65.7 79 0.0017 36.4 12.5 120 856-1020 87-211 (264)
87 smart00284 OLF Olfactomedin-li 63.8 1.8E+02 0.0038 33.5 14.8 158 387-568 51-214 (255)
88 COG4946 Uncharacterized protei 63.3 88 0.0019 38.4 12.6 87 465-572 399-487 (668)
89 PRK05137 tolB translocation pr 63.1 90 0.002 37.9 13.5 130 60-215 269-411 (435)
90 smart00135 LY Low-density lipo 60.9 15 0.00032 28.9 4.2 30 684-713 9-39 (43)
91 PF13449 Phytase-like: Esteras 60.6 37 0.00081 39.8 9.3 60 921-987 88-156 (326)
92 COG4247 Phy 3-phytase (myo-ino 60.4 1.7E+02 0.0037 33.6 13.5 50 993-1042 66-115 (364)
93 TIGR03075 PQQ_enz_alc_DH PQQ-d 59.3 4.7E+02 0.01 33.1 24.7 93 924-1016 240-337 (527)
94 KOG2055|consensus 58.6 1.7E+02 0.0038 35.9 14.0 218 951-1204 237-457 (514)
95 cd00216 PQQ_DH Dehydrogenases 58.6 4.5E+02 0.0097 32.7 22.5 68 62-130 257-328 (488)
96 TIGR02276 beta_rpt_yvtn 40-res 58.4 30 0.00066 27.2 5.7 32 928-971 3-34 (42)
97 PRK00178 tolB translocation pr 58.2 1.1E+02 0.0023 37.1 12.9 126 59-206 221-351 (430)
98 smart00135 LY Low-density lipo 57.8 19 0.00041 28.3 4.4 29 1135-1163 10-39 (43)
99 PF07433 DUF1513: Protein of u 56.3 4E+02 0.0087 31.5 16.5 82 951-1041 28-114 (305)
100 COG4946 Uncharacterized protei 56.3 1.3E+02 0.0029 36.9 12.6 86 915-1021 399-486 (668)
101 PRK04922 tolB translocation pr 56.1 1.1E+02 0.0025 37.1 12.7 126 59-206 226-356 (433)
102 PRK03629 tolB translocation pr 55.2 88 0.0019 38.1 11.5 31 99-129 295-326 (429)
103 COG3823 Glutamine cyclotransfe 54.5 3.1E+02 0.0067 30.9 13.9 41 470-521 48-88 (262)
104 TIGR03074 PQQ_membr_DH membran 52.2 7.2E+02 0.016 33.1 37.3 83 951-1041 641-740 (764)
105 PF07494 Reg_prop: Two compone 51.5 14 0.0003 26.7 2.3 16 469-484 6-21 (24)
106 PRK01029 tolB translocation pr 51.5 3.8E+02 0.0082 32.8 16.1 28 104-131 200-228 (428)
107 TIGR02800 propeller_TolB tol-p 51.4 1.9E+02 0.0041 34.4 13.4 133 60-214 213-352 (417)
108 KOG2055|consensus 50.8 2E+02 0.0044 35.4 12.8 217 501-753 237-456 (514)
109 PF08309 LVIVD: LVIVD repeat; 50.7 28 0.0006 28.9 4.2 26 531-563 5-30 (42)
110 PF07494 Reg_prop: Two compone 50.1 17 0.00036 26.4 2.5 16 919-934 6-21 (24)
111 KOG0268|consensus 49.9 1.1E+02 0.0024 36.5 10.3 180 949-1225 165-346 (433)
112 PRK02889 tolB translocation pr 48.9 1.9E+02 0.004 35.3 12.9 128 59-207 218-349 (427)
113 KOG1446|consensus 47.3 5.4E+02 0.012 30.3 18.3 183 953-1166 82-265 (311)
114 PF08309 LVIVD: LVIVD repeat; 46.4 35 0.00076 28.3 4.2 20 993-1013 11-30 (42)
115 TIGR02276 beta_rpt_yvtn 40-res 44.9 45 0.00097 26.2 4.6 27 992-1018 2-28 (42)
116 COG0823 TolB Periplasmic compo 43.1 2.4E+02 0.0051 34.8 12.5 127 61-213 218-352 (425)
117 COG3211 PhoX Predicted phospha 43.0 62 0.0014 40.7 7.4 68 686-753 419-520 (616)
118 COG5276 Uncharacterized conser 39.9 49 0.0011 38.5 5.5 31 527-564 86-116 (370)
119 COG3211 PhoX Predicted phospha 39.2 71 0.0015 40.2 7.1 69 1136-1204 419-521 (616)
120 PF02897 Peptidase_S9_N: Proly 39.1 5.7E+02 0.012 30.6 14.9 82 409-514 327-409 (414)
121 KOG0282|consensus 38.6 37 0.00081 41.4 4.5 133 501-655 322-461 (503)
122 TIGR02800 propeller_TolB tol-p 38.4 3.6E+02 0.0079 32.0 13.0 30 101-130 288-318 (417)
123 PF05567 Neisseria_PilC: Neiss 38.1 58 0.0013 38.6 6.1 58 949-1021 179-238 (335)
124 COG0823 TolB Periplasmic compo 36.9 1E+02 0.0022 37.9 8.0 87 102-203 204-299 (425)
125 PF14269 Arylsulfotran_2: Aryl 36.0 7.7E+02 0.017 28.9 16.7 171 827-1019 107-296 (299)
126 PF01011 PQQ: PQQ enzyme repea 35.9 56 0.0012 25.9 3.8 23 62-84 11-33 (38)
127 TIGR03300 assembly_YfgL outer 35.0 1.5E+02 0.0032 35.1 8.9 57 951-1020 115-172 (377)
128 KOG0316|consensus 34.6 6.6E+02 0.014 28.9 12.8 52 951-1013 81-132 (307)
129 PF07433 DUF1513: Protein of u 34.5 8.4E+02 0.018 28.9 18.5 70 501-580 28-103 (305)
130 PF14269 Arylsulfotran_2: Aryl 32.4 8.7E+02 0.019 28.4 19.2 96 469-570 58-182 (299)
131 TIGR03300 assembly_YfgL outer 31.9 1.5E+02 0.0033 34.9 8.4 67 478-570 65-132 (377)
132 COG3292 Predicted periplasmic 31.8 6.2E+02 0.013 32.4 13.2 85 468-569 419-507 (671)
133 KOG0268|consensus 31.3 4.4E+02 0.0095 31.8 11.3 181 498-775 164-346 (433)
134 PF07676 PD40: WD40-like Beta 31.0 36 0.00078 26.8 2.0 20 186-205 9-28 (39)
135 KOG0291|consensus 30.7 8.6E+02 0.019 32.1 14.4 60 503-571 416-475 (893)
136 PF05567 Neisseria_PilC: Neiss 30.7 91 0.002 37.0 6.1 58 499-571 179-238 (335)
137 PF01731 Arylesterase: Arylest 30.2 88 0.0019 30.0 4.8 36 705-750 36-72 (86)
138 COG5276 Uncharacterized conser 30.1 53 0.0012 38.2 3.8 63 58-128 58-120 (370)
139 PRK01742 tolB translocation pr 28.3 1.1E+03 0.025 28.5 19.4 61 500-570 227-288 (429)
140 COG1770 PtrB Protease II [Amin 27.9 1.4E+03 0.031 29.9 15.7 189 68-335 106-301 (682)
141 PF00930 DPPIV_N: Dipeptidyl p 27.8 3.5E+02 0.0076 32.0 10.4 26 1081-1106 286-315 (353)
142 PF01731 Arylesterase: Arylest 26.4 1.6E+02 0.0035 28.2 5.8 53 499-564 34-86 (86)
143 PRK01742 tolB translocation pr 26.1 3.5E+02 0.0075 32.9 10.2 99 102-215 215-323 (429)
144 cd00216 PQQ_DH Dehydrogenases 26.0 2.5E+02 0.0053 35.0 9.0 74 928-1020 110-192 (488)
145 KOG0282|consensus 23.7 80 0.0017 38.8 3.9 132 952-1104 323-460 (503)
146 COG4222 Uncharacterized protei 22.9 2.1E+02 0.0046 34.8 7.2 101 916-1020 67-181 (391)
147 COG4222 Uncharacterized protei 22.5 2.2E+02 0.0049 34.6 7.3 100 467-570 68-181 (391)
148 KOG0291|consensus 22.3 1.8E+02 0.0038 37.9 6.6 60 952-1020 415-474 (893)
149 KOG2048|consensus 21.4 1.4E+03 0.03 29.8 13.8 72 500-577 89-165 (691)
150 PF14870 PSII_BNR: Photosynthe 21.0 5.7E+02 0.012 30.1 10.1 83 467-571 186-269 (302)
151 PF00930 DPPIV_N: Dipeptidyl p 20.9 7E+02 0.015 29.5 11.1 177 27-216 114-315 (353)
152 COG3670 Lignostilbene-alpha,be 20.8 2.1E+02 0.0046 35.4 6.6 138 871-1020 261-415 (490)
No 1
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=100.00 E-value=1.4e-68 Score=602.30 Aligned_cols=284 Identities=37% Similarity=0.741 Sum_probs=238.6
Q ss_pred eEEEEEEEecCCeEEEeecCcccccCCCceecCCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEe-ecCCceEE
Q psy11781 28 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVV-EDCDHVFA 106 (1228)
Q Consensus 28 ~~v~~~~~d~~~~lw~ldtg~~~~~~~~~~~~~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~-~~~~~~~a 106 (1228)
+|||+++||+||||||||+|+++..+.+.+.||||||+|||+||++++||.||++++.++|+++|+|+|.. .+|+++||
T Consensus 1 ~sV~~v~iD~~~rLWVlD~G~~~~~~~~~~~~~pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~a 80 (287)
T PF03022_consen 1 VSVQRVQIDECGRLWVLDSGRPNGLQPPKQVCPPKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFA 80 (287)
T ss_dssp S-EEEEEE-TTSEEEEEE-CCHSSSSTTGHTS--EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEE
T ss_pred CcccEEEEcCCCCEEEEeCCCcCCCCCCCCCCCcEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEE
Confidence 68999999999999999999999777778899999999999999999999999999999999999999984 57999999
Q ss_pred EEeeCCcceEEEEEccCCceeeeecCccccCCCccceeeccceeeeccCeeeeecCcCCCCccccCCcccccccccccCC
Q psy11781 107 YVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNG 186 (1228)
Q Consensus 107 Yi~D~~~~g~iv~dl~~~~swR~~h~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 186 (1228)
||||++.+||||||+.+|+|||+.|+++.++|.+..|+++|+.|+|.+|++|+++++.
T Consensus 81 YItD~~~~glIV~dl~~~~s~Rv~~~~~~~~p~~~~~~i~g~~~~~~dg~~gial~~~---------------------- 138 (287)
T PF03022_consen 81 YITDSGGPGLIVYDLATGKSWRVLHNSFSPDPDAGPFTIGGESFQWPDGIFGIALSPI---------------------- 138 (287)
T ss_dssp EEEETTTCEEEEEETTTTEEEEEETCGCTTS-SSEEEEETTEEEEETTSEEEEEE-TT----------------------
T ss_pred EEeCCCcCcEEEEEccCCcEEEEecCCcceeccccceeccCceEecCCCccccccCCC----------------------
Confidence 9999999999999999999999999999999999999999999999999999999974
Q ss_pred CcccccccCCCceEEEEecccCeeEEEecccccCCcccCCccchhhhhhhcCccccccccceeeEcCCCeEEEeccCCCe
Q psy11781 187 TNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKHS 266 (1228)
Q Consensus 187 ~~~~~~~~~~~~~lYf~~lsS~~~~sV~T~~Lr~~~~~~~~~~~~~~v~~~G~rg~~sqs~~~~~d~~Gvi~f~~v~~~a 266 (1228)
++|||+||||||+|++||+|||++||+++.... .++.+.|+.+|+|| +|++|+++|++|+|||++++++|
T Consensus 139 -------~~d~r~LYf~~lss~~ly~v~T~~L~~~~~~~~-~~~~~~v~~lG~k~--~~s~g~~~D~~G~ly~~~~~~~a 208 (287)
T PF03022_consen 139 -------SPDGRWLYFHPLSSRKLYRVPTSVLRDPSLSDA-QALASQVQDLGDKG--SQSDGMAIDPNGNLYFTDVEQNA 208 (287)
T ss_dssp -------STTS-EEEEEETT-SEEEEEEHHHHCSTT--HH-H-HHHT-EEEEE-----SECEEEEETTTEEEEEECCCTE
T ss_pred -------CCCccEEEEEeCCCCcEEEEEHHHhhCcccccc-ccccccceeccccC--CCCceEEECCCCcEEEecCCCCe
Confidence 356899999999999999999999999987532 22567899999998 69999999999999999999999
Q ss_pred eEEEeCCCCcccccccceeecccceeecCcceeeeC--CCeEEEEeccccchhcccCCCceeeEEEEEecccccccCCCC
Q psy11781 267 VGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDP--NDNIWILSNKLHQYLYGFLDFNVYNYRILVANSNDISNAGPA 344 (1228)
Q Consensus 267 I~cwn~~~~~~~~~~~~~v~~~~~~l~~p~~l~i~~--~g~lw~~sn~l~~~~~~~l~~~~~n~ri~~~~~~~~~~~~~~ 344 (1228)
|+|||+.+ +|++++..+|+++.++|+||++|+|++ +|+||+++||||+|+++.|+++|+||||++++++++|+||+|
T Consensus 209 I~~w~~~~-~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snrl~~~~~~~~~~~e~nfri~~~~v~~~~~~t~C 287 (287)
T PF03022_consen 209 IGCWDPDG-PYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNRLQRFAYGNLNPDEVNFRIWRARVDELIKGTVC 287 (287)
T ss_dssp EEEEETTT-SB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S--SSTGGG---SS-SEEEEES-HHHHHTTSTT
T ss_pred EEEEeCCC-CcCccchheeEEcCceeeccceeeeccccCceEEEEECcchHhhccCCCCCccEEeEEEeCHHHHhcCCCC
Confidence 99999996 779999999999998999999999999 999999999999999999999999999999999999999999
No 2
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=100.00 E-value=4.2e-67 Score=590.34 Aligned_cols=283 Identities=37% Similarity=0.756 Sum_probs=240.2
Q ss_pred eEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeec-CCCCeEE
Q psy11781 468 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVE-DCDHVFA 546 (1228)
Q Consensus 468 vsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~-~c~~~~a 546 (1228)
+|||+++||+||||||||+|.++..+.+.+.|+|||++|||+|++++|+|+||++++++.|+|++|+||.++ +|+++||
T Consensus 1 ~sV~~v~iD~~~rLWVlD~G~~~~~~~~~~~~~pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~a 80 (287)
T PF03022_consen 1 VSVQRVQIDECGRLWVLDSGRPNGLQPPKQVCPPKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFA 80 (287)
T ss_dssp S-EEEEEE-TTSEEEEEE-CCHSSSSTTGHTS--EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEE
T ss_pred CcccEEEEcCCCCEEEEeCCCcCCCCCCCCCCCcEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEE
Confidence 699999999999999999999998777788999999999999999999999999999999999999999977 6999999
Q ss_pred EEecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccC
Q psy11781 547 YVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNG 626 (1228)
Q Consensus 547 YItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~ 626 (1228)
||||++.+||||||+++|++|||+|++|+++|.+..|+++|+.|++++|++||||+|.
T Consensus 81 YItD~~~~glIV~dl~~~~s~Rv~~~~~~~~p~~~~~~i~g~~~~~~dg~~gial~~~---------------------- 138 (287)
T PF03022_consen 81 YITDSGGPGLIVYDLATGKSWRVLHNSFSPDPDAGPFTIGGESFQWPDGIFGIALSPI---------------------- 138 (287)
T ss_dssp EEEETTTCEEEEEETTTTEEEEEETCGCTTS-SSEEEEETTEEEEETTSEEEEEE-TT----------------------
T ss_pred EEeCCCcCcEEEEEccCCcEEEEecCCcceeccccceeccCceEecCCCccccccCCC----------------------
Confidence 9999999999999999999999999999999999999999999999999999999984
Q ss_pred ccccccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCccc-cccceEeeccccccccccceEEccCCcEEEEcCCCC
Q psy11781 627 TNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSD-IDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKH 705 (1228)
Q Consensus 627 ~~~~i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~-~~~~v~~lG~~g~~~qs~G~a~D~~G~LY~t~~~~~ 705 (1228)
++++|+||||||+|.++|+|+|++|+++++ .+++ +.+.|+.+|+|| +|++|+++|++|+|||++++++
T Consensus 139 -------~~d~r~LYf~~lss~~ly~v~T~~L~~~~~--~~~~~~~~~v~~lG~k~--~~s~g~~~D~~G~ly~~~~~~~ 207 (287)
T PF03022_consen 139 -------SPDGRWLYFHPLSSRKLYRVPTSVLRDPSL--SDAQALASQVQDLGDKG--SQSDGMAIDPNGNLYFTDVEQN 207 (287)
T ss_dssp -------STTS-EEEEEETT-SEEEEEEHHHHCSTT----HHH-HHHT-EEEEE-----SECEEEEETTTEEEEEECCCT
T ss_pred -------CCCccEEEEEeCCCCcEEEEEHHHhhCccc--cccccccccceeccccC--CCCceEEECCCCcEEEecCCCC
Confidence 347899999999999999999999999996 4444 568899999998 7999999999999999999999
Q ss_pred eEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcC--CCcEEEEecCccccccCCCCCccccEEEEEEecchhccCCC
Q psy11781 706 SVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDP--NDNIWILSNKLHQYLYGFLDFNVYNYRILVANSNDIKNNNQ 783 (1228)
Q Consensus 706 aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~--dg~LyVlsnrl~~~~~~~~d~~e~nfrI~r~~v~~~i~~t~ 783 (1228)
+|+|||++ ++|.++++.+|+++++.|+||++++|+. +|+|||++||||++++++++.+|+||||++++++++|+||+
T Consensus 208 aI~~w~~~-~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snrl~~~~~~~~~~~e~nfri~~~~v~~~~~~t~ 286 (287)
T PF03022_consen 208 AIGCWDPD-GPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNRLQRFAYGNLNPDEVNFRIWRARVDELIKGTV 286 (287)
T ss_dssp EEEEEETT-TSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S--SSTGGG---SS-SEEEEES-HHHHHTTST
T ss_pred eEEEEeCC-CCcCccchheeEEcCceeeccceeeeccccCceEEEEECcchHhhccCCCCCccEEeEEEeCHHHHhcCCC
Confidence 99999999 9999999999998887799999999999 99999999999999999999999999999999999999999
Q ss_pred C
Q psy11781 784 P 784 (1228)
Q Consensus 784 c 784 (1228)
|
T Consensus 287 C 287 (287)
T PF03022_consen 287 C 287 (287)
T ss_dssp T
T ss_pred C
Confidence 9
No 3
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.17 E-value=2.9e-09 Score=117.61 Aligned_cols=233 Identities=15% Similarity=0.132 Sum_probs=147.8
Q ss_pred cccceEEe--CCeEEEEcccCCCCCCcEEEEEECCCCCCCCcccCCCCcCcCCCCCCCcceEEEEEEEe-CCCcEEEEeC
Q psy11781 410 LPLGIGIW--RSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVD-KCDRLWIMDT 486 (1228)
Q Consensus 410 ~P~gv~v~--~gRiFvt~PR~~~gvP~TL~~v~~~~~~~sp~l~PYP~~~wn~~~~c~~lvsV~~v~iD-~~gRLWVLDt 486 (1228)
+|.|+... ++++|++=-. --.|-.+..... .......| . +.++.++ +.|+|||.|.
T Consensus 1 l~Egp~~d~~~g~l~~~D~~-----~~~i~~~~~~~~--~~~~~~~~-----------~---~~G~~~~~~~g~l~v~~~ 59 (246)
T PF08450_consen 1 LGEGPVWDPRDGRLYWVDIP-----GGRIYRVDPDTG--EVEVIDLP-----------G---PNGMAFDRPDGRLYVADS 59 (246)
T ss_dssp CEEEEEEETTTTEEEEEETT-----TTEEEEEETTTT--EEEEEESS-----------S---EEEEEEECTTSEEEEEET
T ss_pred CCcceEEECCCCEEEEEEcC-----CCEEEEEECCCC--eEEEEecC-----------C---CceEEEEccCCEEEEEEc
Confidence 46677776 5888877322 225777765321 11111111 1 7888898 7899999998
Q ss_pred CCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCC--------CeEEE
Q psy11781 487 GVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFR--------YGLIV 558 (1228)
Q Consensus 487 G~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~--------~gLIV 558 (1228)
+. +.++|++++++......+.. -.+...+||+++| .++.+|+||+.. ++|+.
T Consensus 60 ~~--------------~~~~d~~~g~~~~~~~~~~~-~~~~~~~ND~~vd-----~~G~ly~t~~~~~~~~~~~~g~v~~ 119 (246)
T PF08450_consen 60 GG--------------IAVVDPDTGKVTVLADLPDG-GVPFNRPNDVAVD-----PDGNLYVTDSGGGGASGIDPGSVYR 119 (246)
T ss_dssp TC--------------EEEEETTTTEEEEEEEEETT-CSCTEEEEEEEE------TTS-EEEEEECCBCTTCGGSEEEEE
T ss_pred Cc--------------eEEEecCCCcEEEEeeccCC-CcccCCCceEEEc-----CCCCEEEEecCCCccccccccceEE
Confidence 75 45569999988776676432 1145678999999 678899999874 34555
Q ss_pred EEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccCccccccccCCCc
Q psy11781 559 YDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQR 638 (1228)
Q Consensus 559 yDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r 638 (1228)
++.. ++...+. .+ ....+||+++| +++
T Consensus 120 ~~~~-~~~~~~~---------------~~-----~~~pNGi~~s~--------------------------------dg~ 146 (246)
T PF08450_consen 120 IDPD-GKVTVVA---------------DG-----LGFPNGIAFSP--------------------------------DGK 146 (246)
T ss_dssp EETT-SEEEEEE---------------EE-----ESSEEEEEEET--------------------------------TSS
T ss_pred ECCC-CeEEEEe---------------cC-----cccccceEECC--------------------------------cch
Confidence 5555 3333222 11 12346777766 788
Q ss_pred eEEEeeCCCCceEEEechhhcCCCCCCCccccccceEeeccccccccccceEEccCCcEEEEcCCCCeEEEEeCCCCccC
Q psy11781 639 YMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKHSVGCWNTKTKVYL 718 (1228)
Q Consensus 639 ~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G~LY~t~~~~~aI~~wn~~~~~~~ 718 (1228)
+||+.-....++++++.+. +.. .-. ....+..+... .+.++||++|++|+||++....+.|.+++++ +..
T Consensus 147 ~lyv~ds~~~~i~~~~~~~---~~~--~~~-~~~~~~~~~~~--~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~-G~~- 216 (246)
T PF08450_consen 147 TLYVADSFNGRIWRFDLDA---DGG--ELS-NRRVFIDFPGG--PGYPDGLAVDSDGNLWVADWGGGRIVVFDPD-GKL- 216 (246)
T ss_dssp EEEEEETTTTEEEEEEEET---TTC--CEE-EEEEEEE-SSS--SCEEEEEEEBTTS-EEEEEETTTEEEEEETT-SCE-
T ss_pred heeecccccceeEEEeccc---ccc--cee-eeeeEEEcCCC--CcCCCcceEcCCCCEEEEEcCCCEEEEECCC-ccE-
Confidence 9999988888899998422 110 000 01112112111 1358999999999999999999999999998 643
Q ss_pred CCCeeEEEecCCceecCCceEEc-CC-CcEEEEecC
Q psy11781 719 PQTQDIVQTSRDILNFPNDLKID-PN-DNIWILSNK 752 (1228)
Q Consensus 719 ~~n~~iV~~~d~~L~wPd~l~Id-~d-g~LyVlsnr 752 (1228)
+..|.. + ...|+.++++ ++ +.|||+++|
T Consensus 217 ---~~~i~~-p--~~~~t~~~fgg~~~~~L~vTta~ 246 (246)
T PF08450_consen 217 ---LREIEL-P--VPRPTNCAFGGPDGKTLYVTTAR 246 (246)
T ss_dssp ---EEEEE--S--SSSEEEEEEESTTSSEEEEEEB-
T ss_pred ---EEEEcC-C--CCCEEEEEEECCCCCEEEEEeCC
Confidence 355542 2 3479999995 34 479999975
No 4
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.02 E-value=2.1e-08 Score=110.88 Aligned_cols=232 Identities=13% Similarity=0.101 Sum_probs=145.8
Q ss_pred cccceEEe--CCeEEEEeccCCCCCCcEEEEEECCCCCCCCccccCCCCCcCCCCCCCcceEEEEEEEe-CCCcEEEEeC
Q psy11781 860 LPLGIGIW--RSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVD-KCDRLWIMDT 936 (1228)
Q Consensus 860 ~p~gv~v~--~gRiFvt~PR~~~gvP~TLa~v~~~~~~~sP~l~PYP~~~wn~~~~c~~lvSV~~v~iD-~~grLWVLDt 936 (1228)
+|.|+... ++++|++=-. --+|-.+... ++.......| . ..++.++ +.|+|||.|.
T Consensus 1 l~Egp~~d~~~g~l~~~D~~-----~~~i~~~~~~--~~~~~~~~~~-----------~---~~G~~~~~~~g~l~v~~~ 59 (246)
T PF08450_consen 1 LGEGPVWDPRDGRLYWVDIP-----GGRIYRVDPD--TGEVEVIDLP-----------G---PNGMAFDRPDGRLYVADS 59 (246)
T ss_dssp CEEEEEEETTTTEEEEEETT-----TTEEEEEETT--TTEEEEEESS-----------S---EEEEEEECTTSEEEEEET
T ss_pred CCcceEEECCCCEEEEEEcC-----CCEEEEEECC--CCeEEEEecC-----------C---CceEEEEccCCEEEEEEc
Confidence 46677776 6888877322 2366666642 1111121222 1 7888888 7899999998
Q ss_pred CCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCC--------CeEEE
Q psy11781 937 GVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFR--------YGLIV 1008 (1228)
Q Consensus 937 G~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~--------~gLIV 1008 (1228)
+.. .++|++++++......+.. -.+....||+++| .++.+|+||+.. ++|+.
T Consensus 60 ~~~--------------~~~d~~~g~~~~~~~~~~~-~~~~~~~ND~~vd-----~~G~ly~t~~~~~~~~~~~~g~v~~ 119 (246)
T PF08450_consen 60 GGI--------------AVVDPDTGKVTVLADLPDG-GVPFNRPNDVAVD-----PDGNLYVTDSGGGGASGIDPGSVYR 119 (246)
T ss_dssp TCE--------------EEEETTTTEEEEEEEEETT-CSCTEEEEEEEE------TTS-EEEEEECCBCTTCGGSEEEEE
T ss_pred Cce--------------EEEecCCCcEEEEeeccCC-CcccCCCceEEEc-----CCCCEEEEecCCCccccccccceEE
Confidence 653 4459999988776776432 1145789999999 567899999975 34666
Q ss_pred EECCCCeEEEeecCCcCcCCCcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccccccCccccceecCCCc
Q psy11781 1009 YDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQR 1088 (1228)
Q Consensus 1009 ydl~~g~swRv~h~s~~pdP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~~~di~ls~d~r 1088 (1228)
++.. ++..++. ++ +. ...| |++||+++
T Consensus 120 ~~~~-~~~~~~~---------------~~--~~---~pNG--------------------------------i~~s~dg~ 146 (246)
T PF08450_consen 120 IDPD-GKVTVVA---------------DG--LG---FPNG--------------------------------IAFSPDGK 146 (246)
T ss_dssp EETT-SEEEEEE---------------EE--ES---SEEE--------------------------------EEEETTSS
T ss_pred ECCC-CeEEEEe---------------cC--cc---cccc--------------------------------eEECCcch
Confidence 6666 4433321 11 11 1123 45566899
Q ss_pred eEEEEeCCCCceEEEecchhcCCcccCCccccccceEEecc-ccCCCCcccEEEcCCCcEEEEecCCCeEEEEeCCCCcc
Q psy11781 1089 YMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGS-RFRNTQASASAINSNGVMFYNLVTKHSVGCWNTKTKVY 1167 (1228)
Q Consensus 1089 ~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~-rg~~sqs~g~~~D~~G~LY~t~~~~~aI~~wn~~~~~~ 1167 (1228)
+||+.-....++++++.. .+.. .+. ..+.+.+ .+..+..+||++|++|+||++....++|.++|++ +..
T Consensus 147 ~lyv~ds~~~~i~~~~~~---~~~~-----~~~-~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~-G~~ 216 (246)
T PF08450_consen 147 TLYVADSFNGRIWRFDLD---ADGG-----ELS-NRRVFIDFPGGPGYPDGLAVDSDGNLWVADWGGGRIVVFDPD-GKL 216 (246)
T ss_dssp EEEEEETTTTEEEEEEEE---TTTC-----CEE-EEEEEEE-SSSSCEEEEEEEBTTS-EEEEEETTTEEEEEETT-SCE
T ss_pred heeecccccceeEEEecc---cccc-----cee-eeeeEEEcCCCCcCCCcceEcCCCCEEEEEcCCCEEEEECCC-ccE
Confidence 999999888989999842 1110 011 1111211 1111348999999999999999999999999999 653
Q ss_pred ccCCeeEEEecCCceecCCceEEe-CC-CcEEEEeCC
Q psy11781 1168 LPQTQDIVQTSRDILNFPNDLKID-PN-DNIWILSNK 1202 (1228)
Q Consensus 1168 ~~~n~~~v~~~~~~L~wPd~l~id-~d-g~Lyvlsn~ 1202 (1228)
. ..+.. . ...|+.+++. ++ +.|||++++
T Consensus 217 ~----~~i~~-p--~~~~t~~~fgg~~~~~L~vTta~ 246 (246)
T PF08450_consen 217 L----REIEL-P--VPRPTNCAFGGPDGKTLYVTTAR 246 (246)
T ss_dssp E----EEEE--S--SSSEEEEEEESTTSSEEEEEEB-
T ss_pred E----EEEcC-C--CCCEEEEEEECCCCCEEEEEeCC
Confidence 2 34432 2 3479999995 33 589999975
No 5
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.77 E-value=6.6e-07 Score=102.92 Aligned_cols=248 Identities=15% Similarity=0.137 Sum_probs=146.3
Q ss_pred eCCeEEEEcccCCCCCCcEEEEEECCCCCCCCcccCC-CCcCcCC-CCCCCcceEEEEEEEeCCCcEEEEeCCCCCcccC
Q psy11781 417 WRSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPY-PNWSYFD-DSNCNSLISVFRMSVDKCDRLWIMDTGVTNILSS 494 (1228)
Q Consensus 417 ~~gRiFvt~PR~~~gvP~TL~~v~~~~~~~sp~l~PY-P~~~wn~-~~~c~~lvsV~~v~iD~~gRLWVLDtG~~~~~~~ 494 (1228)
.++.++..-|.|.+... .|-.++.. .+.+..| |.-.-++ ......+ .....+|..++|++++.|.
T Consensus 21 ~~~~~~gEgP~w~~~~~-~L~w~DI~----~~~i~r~~~~~g~~~~~~~p~~~--~~~~~~d~~g~Lv~~~~g~------ 87 (307)
T COG3386 21 DKGATLGEGPVWDPDRG-ALLWVDIL----GGRIHRLDPETGKKRVFPSPGGF--SSGALIDAGGRLIACEHGV------ 87 (307)
T ss_pred ecccccccCccCcCCCC-EEEEEeCC----CCeEEEecCCcCceEEEECCCCc--ccceeecCCCeEEEEcccc------
Confidence 46777778888865443 45555542 1334333 2100000 0001222 4456788999999999994
Q ss_pred cCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEE--EEcCCCeEEEEcCC
Q psy11781 495 IQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIV--YDFFKNTSYRLTHP 572 (1228)
Q Consensus 495 ~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIV--yDl~~g~swRv~~~ 572 (1228)
..+|+.++..+..+.-+.+ -.+...+||.+|| .++.+|++|.+... .= -....|..||+.
T Consensus 88 ---------~~~~~~~~~~~t~~~~~~~-~~~~~r~ND~~v~-----pdG~~wfgt~~~~~-~~~~~~~~~G~lyr~~-- 149 (307)
T COG3386 88 ---------RLLDPDTGGKITLLAEPED-GLPLNRPNDGVVD-----PDGRIWFGDMGYFD-LGKSEERPTGSLYRVD-- 149 (307)
T ss_pred ---------EEEeccCCceeEEeccccC-CCCcCCCCceeEc-----CCCCEEEeCCCccc-cCccccCCcceEEEEc--
Confidence 4556566666433333332 3455889999999 57899999998200 00 011222333332
Q ss_pred CCCcCCCCcceEEcCeEeeecC----CcceeecCcccCCcccCCCccccccccccccCccccccccCCCceEEEeeCCCC
Q psy11781 573 YMYPEPTQSTYILDNLKFRWVD----GIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSN 648 (1228)
Q Consensus 573 s~~pdP~~~~~~i~G~~f~~~d----Gi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~ 648 (1228)
|. |....+.. ..+|||+|| |+++||+......
T Consensus 150 -----p~-------g~~~~l~~~~~~~~NGla~Sp--------------------------------Dg~tly~aDT~~~ 185 (307)
T COG3386 150 -----PD-------GGVVRLLDDDLTIPNGLAFSP--------------------------------DGKTLYVADTPAN 185 (307)
T ss_pred -----CC-------CCEEEeecCcEEecCceEECC--------------------------------CCCEEEEEeCCCC
Confidence 10 11111111 135667765 7899999999999
Q ss_pred ceEEEechhhcCCCCCCCccccccceEeec-cccccccccceEEccCCcEEEEcCCC-CeEEEEeCCCCccCCCCeeEEE
Q psy11781 649 RHYYVSTTDLRNSSRYVNSSDIDEYFHYLG-SRFRNTQASASAINSNGVMFYNLVTK-HSVGCWNTKTKVYLPQTQDIVQ 726 (1228)
Q Consensus 649 ~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG-~~g~~~qs~G~a~D~~G~LY~t~~~~-~aI~~wn~~~~~~~~~n~~iV~ 726 (1228)
++++++... ..... . . ...+.... +. +-++|+++|++|+||.+.... .+|.+|+|+ ++. ++.+.
T Consensus 186 ~i~r~~~d~-~~g~~--~-~--~~~~~~~~~~~---G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l----~~~i~ 251 (307)
T COG3386 186 RIHRYDLDP-ATGPI--G-G--RRGFVDFDEEP---GLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKL----LGEIK 251 (307)
T ss_pred eEEEEecCc-ccCcc--C-C--cceEEEccCCC---CCCCceEEeCCCCEEEecccCCceEEEECCC-CcE----EEEEE
Confidence 999998764 22221 1 1 11122222 33 469999999999999765554 499999998 543 24442
Q ss_pred ecCCceecCCceEEcCC--CcEEEEecCcccc
Q psy11781 727 TSRDILNFPNDLKIDPN--DNIWILSNKLHQY 756 (1228)
Q Consensus 727 ~~d~~L~wPd~l~Id~d--g~LyVlsnrl~~~ 756 (1228)
. +. .+|+.+.+... ..|||++++....
T Consensus 252 l--P~-~~~t~~~FgG~~~~~L~iTs~~~~~~ 280 (307)
T COG3386 252 L--PV-KRPTNPAFGGPDLNTLYITSARSGMS 280 (307)
T ss_pred C--CC-CCCccceEeCCCcCEEEEEecCCCCC
Confidence 1 23 78999999753 4799999888543
No 6
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.53 E-value=6.2e-06 Score=95.00 Aligned_cols=246 Identities=15% Similarity=0.135 Sum_probs=142.7
Q ss_pred eCCeEEEEeccCCCCCCcEEEEEECCCCCCCCccccC-CCCCcCC-CCCCCcceEEEEEEEeCCCcEEEEeCCCCCcccc
Q psy11781 867 WRSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPY-PNWSYFD-DSNCNSLISVFRMSVDKCDRLWIMDTGVTNILSS 944 (1228)
Q Consensus 867 ~~gRiFvt~PR~~~gvP~TLa~v~~~~~~~sP~l~PY-P~~~wn~-~~~c~~lvSV~~v~iD~~grLWVLDtG~~~~~~~ 944 (1228)
..+.++..-|.|.+... .|-.++... +.+..| |.-.-++ ......+ .....+|..|+|++++.|..
T Consensus 21 ~~~~~~gEgP~w~~~~~-~L~w~DI~~----~~i~r~~~~~g~~~~~~~p~~~--~~~~~~d~~g~Lv~~~~g~~----- 88 (307)
T COG3386 21 DKGATLGEGPVWDPDRG-ALLWVDILG----GRIHRLDPETGKKRVFPSPGGF--SSGALIDAGGRLIACEHGVR----- 88 (307)
T ss_pred ecccccccCccCcCCCC-EEEEEeCCC----CeEEEecCCcCceEEEECCCCc--ccceeecCCCeEEEEccccE-----
Confidence 46777777788865443 455555311 134333 2101000 0011222 44567889999999999974
Q ss_pred ccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEE------CCCCeEEE
Q psy11781 945 IQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYD------FFKNTSYR 1018 (1228)
Q Consensus 945 ~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVyd------l~~g~swR 1018 (1228)
.+|+.++..+..+.- .+.-.+...+||.+|| .++.+|++|.+. .+ ...|+.||
T Consensus 89 ----------~~~~~~~~~~t~~~~-~~~~~~~~r~ND~~v~-----pdG~~wfgt~~~-----~~~~~~~~~~~G~lyr 147 (307)
T COG3386 89 ----------LLDPDTGGKITLLAE-PEDGLPLNRPNDGVVD-----PDGRIWFGDMGY-----FDLGKSEERPTGSLYR 147 (307)
T ss_pred ----------EEeccCCceeEEecc-ccCCCCcCCCCceeEc-----CCCCEEEeCCCc-----cccCccccCCcceEEE
Confidence 334456665433333 2333445789999999 468999999984 11 01111222
Q ss_pred eecCCcCcCCCcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccccccCcccc-ceecCCCceEEEEeCCC
Q psy11781 1019 LTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVD-KTIRDDQRYMYFHSLSS 1097 (1228)
Q Consensus 1019 v~h~s~~pdP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~~~d-i~ls~d~r~LYf~plss 1097 (1228)
+ +| +|...++..+- + .-.| |++|||+++|||.--..
T Consensus 148 ~-------~p-------~g~~~~l~~~~--~---------------------------~~~NGla~SpDg~tly~aDT~~ 184 (307)
T COG3386 148 V-------DP-------DGGVVRLLDDD--L---------------------------TIPNGLAFSPDGKTLYVADTPA 184 (307)
T ss_pred E-------cC-------CCCEEEeecCc--E---------------------------EecCceEECCCCCEEEEEeCCC
Confidence 1 01 01111111110 0 1123 67788999999999999
Q ss_pred CceEEEecchhcCCcccCCccccccceEEec-cccCCCCcccEEEcCCCcEEEEecCC-CeEEEEeCCCCccccCCeeEE
Q psy11781 1098 NRHYYVSTTDLRNSSRYVNSSDIDEYFHYLG-SRFRNTQASASAINSNGVMFYNLVTK-HSVGCWNTKTKVYLPQTQDIV 1175 (1228)
Q Consensus 1098 ~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG-~rg~~sqs~g~~~D~~G~LY~t~~~~-~aI~~wn~~~~~~~~~n~~~v 1175 (1228)
.++++++... ..... ... ..+.... ++ +-.+||++|++|+||...+.. .+|.+|+++ ++.. ..+
T Consensus 185 ~~i~r~~~d~-~~g~~----~~~-~~~~~~~~~~---G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l~----~~i 250 (307)
T COG3386 185 NRIHRYDLDP-ATGPI----GGR-RGFVDFDEEP---GLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKLL----GEI 250 (307)
T ss_pred CeEEEEecCc-ccCcc----CCc-ceEEEccCCC---CCCCceEEeCCCCEEEecccCCceEEEECCC-CcEE----EEE
Confidence 9999998754 11111 111 1122222 33 358999999999999765554 499999999 6532 323
Q ss_pred EecCCceecCCceEEeCC--CcEEEEeCCchh
Q psy11781 1176 QTSRDILNFPNDLKIDPN--DNIWILSNKLHQ 1205 (1228)
Q Consensus 1176 ~~~~~~L~wPd~l~id~d--g~Lyvlsn~l~~ 1205 (1228)
.. +. .+|+...+... ..|||++++...
T Consensus 251 ~l--P~-~~~t~~~FgG~~~~~L~iTs~~~~~ 279 (307)
T COG3386 251 KL--PV-KRPTNPAFGGPDLNTLYITSARSGM 279 (307)
T ss_pred EC--CC-CCCccceEeCCCcCEEEEEecCCCC
Confidence 21 22 68899988653 479999988854
No 7
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.46 E-value=7.5e-06 Score=108.24 Aligned_cols=271 Identities=10% Similarity=0.082 Sum_probs=154.0
Q ss_pred CcccceEEe--CCeEEEEcccCCCCCCcEEEEEECCCCCCCCcccCCCC-cCc-CCCC--CCCcceEEEEEEEeCCC-cE
Q psy11781 409 NLPLGIGIW--RSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPYPN-WSY-FDDS--NCNSLISVFRMSVDKCD-RL 481 (1228)
Q Consensus 409 ~~P~gv~v~--~gRiFvt~PR~~~gvP~TL~~v~~~~~~~sp~l~PYP~-~~w-n~~~--~c~~lvsV~~v~iD~~g-RL 481 (1228)
..|.|++++ .|++||+=..- + .+-.++... ..+.-..+ -+. ...+ +...|..++++.+|..+ .|
T Consensus 568 ~~P~gvavd~~~g~lyVaDs~n--~---rI~v~d~~G----~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~L 638 (1057)
T PLN02919 568 KFPGKLAIDLLNNRLFISDSNH--N---RIVVTDLDG----NFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLL 638 (1057)
T ss_pred CCCceEEEECCCCeEEEEECCC--C---eEEEEeCCC----CEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEE
Confidence 568899996 58999985331 1 233343211 11111111 000 0001 12468889999999864 69
Q ss_pred EEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECC--------Cccccc---CCCceeEEEEeecCCCCeEEEEec
Q psy11781 482 WIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILP--------TAQVFE---GSLFSNIVTEVVEDCDHVFAYVND 550 (1228)
Q Consensus 482 WVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p--------~~v~~~---~S~l~di~VD~~~~c~~~~aYItD 550 (1228)
||.|++.- .|.+||+.++.+. ++.=. ...... -..-.++++|. .++.+||+|
T Consensus 639 YVaDt~n~------------~Ir~id~~~~~V~-tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp----~~g~LyVad 701 (1057)
T PLN02919 639 YVADTENH------------ALREIDFVNETVR-TLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEP----VNEKVYIAM 701 (1057)
T ss_pred EEEeCCCc------------eEEEEecCCCEEE-EEeccCcccCCCCCChhhhHhhcCCCeEEEEec----CCCeEEEEE
Confidence 99999853 3888999887643 33110 000000 11234788882 368999999
Q ss_pred CCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEee--ecCCcceeecCcccCCcccCCCccccccccccccCcc
Q psy11781 551 VFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFR--WVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTN 628 (1228)
Q Consensus 551 s~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~--~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~ 628 (1228)
.+...|.+||..++....+.-.- . ....+|..-. ......|
T Consensus 702 ~~~~~I~v~d~~~g~v~~~~G~G-~------~~~~~g~~~~~~~~~~P~G------------------------------ 744 (1057)
T PLN02919 702 AGQHQIWEYNISDGVTRVFSGDG-Y------ERNLNGSSGTSTSFAQPSG------------------------------ 744 (1057)
T ss_pred CCCCeEEEEECCCCeEEEEecCC-c------cccCCCCccccccccCccE------------------------------
Confidence 99999999999887654433110 0 0001110000 0011223
Q ss_pred ccccccCCCceEEEeeCCCCceEEEechhhcCCCCCCC-ccccccceEeeccc-c-----ccccccceEEccCCcEEEEc
Q psy11781 629 VDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVN-SSDIDEYFHYLGSR-F-----RNTQASASAINSNGVMFYNL 701 (1228)
Q Consensus 629 ~~i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~-~~~~~~~v~~lG~~-g-----~~~qs~G~a~D~~G~LY~t~ 701 (1228)
|++++++++||+.-..+.++..++++--....+ +. +.........+|+. | ....+.|+++|.+|+||+++
T Consensus 745 --IavspdG~~LYVADs~n~~Irv~D~~tg~~~~~-~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVAD 821 (1057)
T PLN02919 745 --ISLSPDLKELYIADSESSSIRALDLKTGGSRLL-AGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVAD 821 (1057)
T ss_pred --EEEeCCCCEEEEEECCCCeEEEEECCCCcEEEE-EecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEE
Confidence 455567888999888887888887431100000 00 00000111112211 1 12357899999999999999
Q ss_pred CCCCeEEEEeCCCCccCCCCeeEEEe-----------cCCceecCCceEEcCCCcEEEEe
Q psy11781 702 VTKHSVGCWNTKTKVYLPQTQDIVQT-----------SRDILNFPNDLKIDPNDNIWILS 750 (1228)
Q Consensus 702 ~~~~aI~~wn~~~~~~~~~n~~iV~~-----------~d~~L~wPd~l~Id~dg~LyVls 750 (1228)
..++.|..||+.++. +..++. ....|..|.|++++++|.|||..
T Consensus 822 s~N~rIrviD~~tg~-----v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaD 876 (1057)
T PLN02919 822 SYNHKIKKLDPATKR-----VTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVAD 876 (1057)
T ss_pred CCCCEEEEEECCCCe-----EEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEE
Confidence 999999999997322 112221 12356789999999999999875
No 8
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=98.26 E-value=7.7e-05 Score=82.70 Aligned_cols=250 Identities=14% Similarity=0.180 Sum_probs=165.9
Q ss_pred eEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEE
Q psy11781 468 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAY 547 (1228)
Q Consensus 468 vsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aY 547 (1228)
...+.+..+.+|-+|.-+.|.-. |=..|.+|+++. +|.|+.. +.-..|+++ .++-++
T Consensus 62 ~ap~dvapapdG~VWft~qg~ga------------iGhLdP~tGev~-~ypLg~G-----a~Phgiv~g-----pdg~~W 118 (353)
T COG4257 62 SAPFDVAPAPDGAVWFTAQGTGA------------IGHLDPATGEVE-TYPLGSG-----ASPHGIVVG-----PDGSAW 118 (353)
T ss_pred CCccccccCCCCceEEecCcccc------------ceecCCCCCceE-EEecCCC-----CCCceEEEC-----CCCCee
Confidence 45678888999999999998754 556788999986 6888654 556678887 578999
Q ss_pred EecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeee---cCCcceeecCcccCCcccCCCccccccccccc
Q psy11781 548 VNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRW---VDGIFGMAISPELSGKYKRHPYEYYHYNVHHY 624 (1228)
Q Consensus 548 ItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~---~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~ 624 (1228)
|+|.+. +|.-+|.++.++-|+..+--+++-+-.+...++.-..| ..|+.| -|.|.... -++|..|
T Consensus 119 itd~~~-aI~R~dpkt~evt~f~lp~~~a~~nlet~vfD~~G~lWFt~q~G~yG-rLdPa~~~------i~vfpaP---- 186 (353)
T COG4257 119 ITDTGL-AIGRLDPKTLEVTRFPLPLEHADANLETAVFDPWGNLWFTGQIGAYG-RLDPARNV------ISVFPAP---- 186 (353)
T ss_pred EecCcc-eeEEecCcccceEEeecccccCCCcccceeeCCCccEEEeeccccce-ecCcccCc------eeeeccC----
Confidence 999988 99999999999999975432333222233333333344 345555 56664321 2333332
Q ss_pred cCcccc-ccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCccccccceEeeccccccccccceEEccCCcEEEEcCC
Q psy11781 625 NGTNVD-KTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVT 703 (1228)
Q Consensus 625 ~~~~~~-i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G~LY~t~~~ 703 (1228)
-+.+.. |...|||. +||..|.+.-+-+|++..=.-+.+ ..-..+ .. -+.++-.|+.|.+..++..
T Consensus 187 qG~gpyGi~atpdGs-vwyaslagnaiaridp~~~~aev~--------p~P~~~-~~----gsRriwsdpig~~wittwg 252 (353)
T COG4257 187 QGGGPYGICATPDGS-VWYASLAGNAIARIDPFAGHAEVV--------PQPNAL-KA----GSRRIWSDPIGRAWITTWG 252 (353)
T ss_pred CCCCCcceEECCCCc-EEEEeccccceEEcccccCCccee--------cCCCcc-cc----cccccccCccCcEEEeccC
Confidence 245555 88899987 999999999888887633111111 000000 11 2566788999999999999
Q ss_pred CCeEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcCCCcEEEEecCccccccCCCCCccccEEEEEEe
Q psy11781 704 KHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDNIWILSNKLHQYLYGFLDFNVYNYRILVAN 774 (1228)
Q Consensus 704 ~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~dg~LyVlsnrl~~~~~~~~d~~e~nfrI~r~~ 774 (1228)
..+++++||.++.+. ...+-. .. --|.++.||..|.+|...-.-. ++ +.+|..+..|++|-++
T Consensus 253 ~g~l~rfdPs~~sW~--eypLPg-s~---arpys~rVD~~grVW~sea~ag-ai-~rfdpeta~ftv~p~p 315 (353)
T COG4257 253 TGSLHRFDPSVTSWI--EYPLPG-SK---ARPYSMRVDRHGRVWLSEADAG-AI-GRFDPETARFTVLPIP 315 (353)
T ss_pred CceeeEeCcccccce--eeeCCC-CC---CCcceeeeccCCcEEeeccccC-ce-eecCcccceEEEecCC
Confidence 999999999844331 111111 11 2488999999999998442211 12 3567888889988765
No 9
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.23 E-value=7.2e-05 Score=99.17 Aligned_cols=228 Identities=16% Similarity=0.160 Sum_probs=130.2
Q ss_pred CcceEEEEEEEeC-CCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEEC-CCcccccC-------CCceeEEE
Q psy11781 465 NSLISVFRMSVDK-CDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYIL-PTAQVFEG-------SLFSNIVT 535 (1228)
Q Consensus 465 ~~lvsV~~v~iD~-~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~-p~~v~~~~-------S~l~di~V 535 (1228)
+.|..+.++.+|. .|+|||.|++.- +|++||+. ++++..+.- ...-...+ .....|++
T Consensus 565 s~l~~P~gvavd~~~g~lyVaDs~n~------------rI~v~d~~-G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIav 631 (1057)
T PLN02919 565 SPLKFPGKLAIDLLNNRLFISDSNHN------------RIVVTDLD-GNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAY 631 (1057)
T ss_pred ccCCCCceEEEECCCCeEEEEECCCC------------eEEEEeCC-CCEEEEEccCCCcCCCCCchhccccCCCcEEEE
Confidence 4577888999997 588999999863 48999985 555543322 11000011 22457889
Q ss_pred EeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCcc
Q psy11781 536 EVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYE 615 (1228)
Q Consensus 536 D~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~ 615 (1228)
|. +++.+||+|.....|-++|+.++....+...-. .|. .+..|..|.+- . -.+|
T Consensus 632 d~----~gn~LYVaDt~n~~Ir~id~~~~~V~tlag~G~-----------~g~--~~~gg~~~~~~-~------ln~P-- 685 (1057)
T PLN02919 632 NA----KKNLLYVADTENHALREIDFVNETVRTLAGNGT-----------KGS--DYQGGKKGTSQ-V------LNSP-- 685 (1057)
T ss_pred eC----CCCEEEEEeCCCceEEEEecCCCEEEEEeccCc-----------ccC--CCCCChhhhHh-h------cCCC--
Confidence 83 356799999999999999999887655432100 000 00011000000 0 0001
Q ss_pred ccccccccccCccccccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCccccccceEeeccccccccccceEEccCC
Q psy11781 616 YYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNG 695 (1228)
Q Consensus 616 ~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G 695 (1228)
.++++++++..||+.--....++.+...-=.-..+ ..+..........+......++.|++++++|
T Consensus 686 -------------~gVa~dp~~g~LyVad~~~~~I~v~d~~~g~v~~~-~G~G~~~~~~g~~~~~~~~~~P~GIavspdG 751 (1057)
T PLN02919 686 -------------WDVCFEPVNEKVYIAMAGQHQIWEYNISDGVTRVF-SGDGYERNLNGSSGTSTSFAQPSGISLSPDL 751 (1057)
T ss_pred -------------eEEEEecCCCeEEEEECCCCeEEEEECCCCeEEEE-ecCCccccCCCCccccccccCccEEEEeCCC
Confidence 01455566677888877666777765321000000 0000000000000111112468899999887
Q ss_pred c-EEEEcCCCCeEEEEeCCCCccCCCCeeEEEe-------------------cCCceecCCceEEcCCCcEEEEe
Q psy11781 696 V-MFYNLVTKHSVGCWNTKTKVYLPQTQDIVQT-------------------SRDILNFPNDLKIDPNDNIWILS 750 (1228)
Q Consensus 696 ~-LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~-------------------~d~~L~wPd~l~Id~dg~LyVls 750 (1228)
. ||+++...+.|.+||+.++. ..+++. .+..+..|.+++++++|.|||..
T Consensus 752 ~~LYVADs~n~~Irv~D~~tg~-----~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVAD 821 (1057)
T PLN02919 752 KELYIADSESSSIRALDLKTGG-----SRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVAD 821 (1057)
T ss_pred CEEEEEECCCCeEEEEECCCCc-----EEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEE
Confidence 5 99999999999999987322 122210 12347789999999999999876
No 10
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=98.21 E-value=0.00012 Score=81.32 Aligned_cols=250 Identities=14% Similarity=0.171 Sum_probs=165.4
Q ss_pred eEEEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEE
Q psy11781 918 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAY 997 (1228)
Q Consensus 918 vSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aY 997 (1228)
...+.+..+.+|-+|.-+.|... |=-.|.+|+++. +|.|+.. +.-.-|+++ .++-+.
T Consensus 62 ~ap~dvapapdG~VWft~qg~ga------------iGhLdP~tGev~-~ypLg~G-----a~Phgiv~g-----pdg~~W 118 (353)
T COG4257 62 SAPFDVAPAPDGAVWFTAQGTGA------------IGHLDPATGEVE-TYPLGSG-----ASPHGIVVG-----PDGSAW 118 (353)
T ss_pred CCccccccCCCCceEEecCcccc------------ceecCCCCCceE-EEecCCC-----CCCceEEEC-----CCCCee
Confidence 45677888999999999999863 446789999986 6999765 455567887 478899
Q ss_pred EEcCCCCeEEEEECCCCeEEEeecCCcCcCCCcceeEEcCEEEEe---eCCccceecCccccccccCCCccccccccccc
Q psy11781 998 VNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRW---VDGIFGMAISPELSGKYKRHPYEYYHYNVHHY 1074 (1228)
Q Consensus 998 ItD~~~~gLIVydl~~g~swRv~h~s~~pdP~~~~~~i~G~~f~~---~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~ 1074 (1228)
|+|.+. +|.-+|.++..+-|+.-+.-.++.+--+...++.-..| ..|+.| -|.|.... -+.|.-|
T Consensus 119 itd~~~-aI~R~dpkt~evt~f~lp~~~a~~nlet~vfD~~G~lWFt~q~G~yG-rLdPa~~~------i~vfpaP---- 186 (353)
T COG4257 119 ITDTGL-AIGRLDPKTLEVTRFPLPLEHADANLETAVFDPWGNLWFTGQIGAYG-RLDPARNV------ISVFPAP---- 186 (353)
T ss_pred EecCcc-eeEEecCcccceEEeecccccCCCcccceeeCCCccEEEeeccccce-ecCcccCc------eeeeccC----
Confidence 999998 99999999999999875433333332233334333334 345555 45553211 1222222
Q ss_pred cCcccc-ceecCCCceEEEEeCCCCceEEEecchhcCCcccCCccccccceEEeccccCCCCcccEEEcCCCcEEEEecC
Q psy11781 1075 NGTNVD-KTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVT 1153 (1228)
Q Consensus 1075 ~~~~~d-i~ls~d~r~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G~LY~t~~~ 1153 (1228)
-|.+.. |...|||. +||..|.+.-+-+|++..-.-+.+ ..... + .+ .+-.+..|..|.+..++-.
T Consensus 187 qG~gpyGi~atpdGs-vwyaslagnaiaridp~~~~aev~-p~P~~-----~---~~----gsRriwsdpig~~wittwg 252 (353)
T COG4257 187 QGGGPYGICATPDGS-VWYASLAGNAIARIDPFAGHAEVV-PQPNA-----L---KA----GSRRIWSDPIGRAWITTWG 252 (353)
T ss_pred CCCCCcceEECCCCc-EEEEeccccceEEcccccCCccee-cCCCc-----c---cc----cccccccCccCcEEEeccC
Confidence 245555 89999997 899999999998887643322221 00000 0 11 1445677899999999999
Q ss_pred CCeEEEEeCCCCccccCCeeEEEecCCceecCCceEEeCCCcEEEEeCCchhhhcCCCCCCcccEEEEEEe
Q psy11781 1154 KHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDNIWILSNKLHQYLYGFLDFNVYNYRILVAN 1224 (1228)
Q Consensus 1154 ~~aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~l~id~dg~Lyvlsn~l~~~~~~~~d~~e~nfrI~r~~ 1224 (1228)
..++.++||.++.+. ...+- ..-.-|.+|.+|..|.+|+....-. ++ +++|..+..|++|-++
T Consensus 253 ~g~l~rfdPs~~sW~--eypLP----gs~arpys~rVD~~grVW~sea~ag-ai-~rfdpeta~ftv~p~p 315 (353)
T COG4257 253 TGSLHRFDPSVTSWI--EYPLP----GSKARPYSMRVDRHGRVWLSEADAG-AI-GRFDPETARFTVLPIP 315 (353)
T ss_pred CceeeEeCcccccce--eeeCC----CCCCCcceeeeccCCcEEeeccccC-ce-eecCcccceEEEecCC
Confidence 999999999854432 12221 2234689999999999998542221 11 3457777778887654
No 11
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.48 E-value=0.0036 Score=73.36 Aligned_cols=170 Identities=17% Similarity=0.285 Sum_probs=102.8
Q ss_pred eEEEEEEEeCCCc-EEEEeCCCCCcccCcCccCCCEEEEEECCCCc--EEEEEECCCcccccCCCceeEEEEeecCCCCe
Q psy11781 468 ISVFRMSVDKCDR-LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNT--LIRKYILPTAQVFEGSLFSNIVTEVVEDCDHV 544 (1228)
Q Consensus 468 vsV~~v~iD~~gR-LWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~--vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~ 544 (1228)
-.+..+..+++|+ |||.|.|.- +|.+|++.+++ +...-.+ .+..++....|+++. ++.
T Consensus 144 ~h~H~v~~~pdg~~v~v~dlG~D------------~v~~~~~~~~~~~l~~~~~~---~~~~G~GPRh~~f~p----dg~ 204 (345)
T PF10282_consen 144 PHPHQVVFSPDGRFVYVPDLGAD------------RVYVYDIDDDTGKLTPVDSI---KVPPGSGPRHLAFSP----DGK 204 (345)
T ss_dssp TCEEEEEE-TTSSEEEEEETTTT------------EEEEEEE-TTS-TEEEEEEE---ECSTTSSEEEEEE-T----TSS
T ss_pred ccceeEEECCCCCEEEEEecCCC------------EEEEEEEeCCCceEEEeecc---ccccCCCCcEEEEcC----CcC
Confidence 4567788898865 999999963 38888887765 6542223 235678899999983 578
Q ss_pred EEEEecCCCCeEEEEEcC--CCeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccc
Q psy11781 545 FAYVNDVFRYGLIVYDFF--KNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVH 622 (1228)
Q Consensus 545 ~aYItDs~~~gLIVyDl~--~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~ 622 (1228)
++|++......|.||++. +++.-.+..-...|. +..|
T Consensus 205 ~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~-----------------~~~~------------------------ 243 (345)
T PF10282_consen 205 YAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPE-----------------GFTG------------------------ 243 (345)
T ss_dssp EEEEEETTTTEEEEEEEETTTTEEEEEEEEESCET-----------------TSCS------------------------
T ss_pred EEEEecCCCCcEEEEeecccCCceeEEEEeeeccc-----------------cccc------------------------
Confidence 999999999999999998 442222211111111 0000
Q ss_pred cccCccccccccCCCceEEEeeCCCCc--eEEEechhhcCCCCCCCccccccceEeeccccccccccceEEccCCc-EEE
Q psy11781 623 HYNGTNVDKTIRDDQRYMYFHSLSSNR--HYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGV-MFY 699 (1228)
Q Consensus 623 ~~~~~~~~i~is~d~r~LYf~~lss~~--ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G~-LY~ 699 (1228)
.....+|.|++|||+||..-..+.. +|.++. .++.+ ..++.+...| ..+.++++|++|. ||+
T Consensus 244 --~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~---~~g~l--------~~~~~~~~~G--~~Pr~~~~s~~g~~l~V 308 (345)
T PF10282_consen 244 --ENAPAEIAISPDGRFLYVSNRGSNSISVFDLDP---ATGTL--------TLVQTVPTGG--KFPRHFAFSPDGRYLYV 308 (345)
T ss_dssp --SSSEEEEEE-TTSSEEEEEECTTTEEEEEEECT---TTTTE--------EEEEEEEESS--SSEEEEEE-TTSSEEEE
T ss_pred --cCCceeEEEecCCCEEEEEeccCCEEEEEEEec---CCCce--------EEEEEEeCCC--CCccEEEEeCCCCEEEE
Confidence 0011236677799999999888776 444421 11121 1222222223 2478999997775 888
Q ss_pred EcCCCCeEEEEeC
Q psy11781 700 NLVTKHSVGCWNT 712 (1228)
Q Consensus 700 t~~~~~aI~~wn~ 712 (1228)
+.-..+.|.+|+.
T Consensus 309 a~~~s~~v~vf~~ 321 (345)
T PF10282_consen 309 ANQDSNTVSVFDI 321 (345)
T ss_dssp EETTTTEEEEEEE
T ss_pred EecCCCeEEEEEE
Confidence 8888888888754
No 12
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.41 E-value=0.0079 Score=70.51 Aligned_cols=176 Identities=17% Similarity=0.261 Sum_probs=105.3
Q ss_pred eEEEEEEEeCCCc-EEEEeCCCCCccccccccCCCEEEEEECCCCc--EEEEEECCCCcccCCCcceeeEEEeecCCCce
Q psy11781 918 ISVFRMSVDKCDR-LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNT--LIRKYILPTAQVFEGSLFSNIVTEVVEDCDHV 994 (1228)
Q Consensus 918 vSV~~v~iD~~gr-LWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~--lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~ 994 (1228)
-.+..+..+++|+ |||.|.|.- +|.+|++.+++ |...-.+ .+++++.-..++++. ++.
T Consensus 144 ~h~H~v~~~pdg~~v~v~dlG~D------------~v~~~~~~~~~~~l~~~~~~---~~~~G~GPRh~~f~p----dg~ 204 (345)
T PF10282_consen 144 PHPHQVVFSPDGRFVYVPDLGAD------------RVYVYDIDDDTGKLTPVDSI---KVPPGSGPRHLAFSP----DGK 204 (345)
T ss_dssp TCEEEEEE-TTSSEEEEEETTTT------------EEEEEEE-TTS-TEEEEEEE---ECSTTSSEEEEEE-T----TSS
T ss_pred ccceeEEECCCCCEEEEEecCCC------------EEEEEEEeCCCceEEEeecc---ccccCCCCcEEEEcC----CcC
Confidence 4456778888865 999999974 78888887765 6543333 234568888888874 478
Q ss_pred EEEEEcCCCCeEEEEECC--CCeEEEeecCCcCcCCCcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccc
Q psy11781 995 FAYVNDVFRYGLIVYDFF--KNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVH 1072 (1228)
Q Consensus 995 ~aYItD~~~~gLIVydl~--~g~swRv~h~s~~pdP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~ 1072 (1228)
++|++......|.||++. +|+.-.+..-.+.|. +..|
T Consensus 205 ~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~-----------------~~~~------------------------ 243 (345)
T PF10282_consen 205 YAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPE-----------------GFTG------------------------ 243 (345)
T ss_dssp EEEEEETTTTEEEEEEEETTTTEEEEEEEEESCET-----------------TSCS------------------------
T ss_pred EEEEecCCCCcEEEEeecccCCceeEEEEeeeccc-----------------cccc------------------------
Confidence 999999999999999998 543222211111111 0000
Q ss_pred cccCccccceecCCCceEEEEeCCCCceEEEecchhcCCcccCCccccccceEEeccccCCCCcccEEEcCCC-cEEEEe
Q psy11781 1073 HYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNG-VMFYNL 1151 (1228)
Q Consensus 1073 ~~~~~~~di~ls~d~r~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G-~LY~t~ 1151 (1228)
....++|.||||+|+||-.--.+..+-..+..- .+..+ ..++.+-..| ..+.++++|++| .||++.
T Consensus 244 --~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~-~~g~l--------~~~~~~~~~G--~~Pr~~~~s~~g~~l~Va~ 310 (345)
T PF10282_consen 244 --ENAPAEIAISPDGRFLYVSNRGSNSISVFDLDP-ATGTL--------TLVQTVPTGG--KFPRHFAFSPDGRYLYVAN 310 (345)
T ss_dssp --SSSEEEEEE-TTSSEEEEEECTTTEEEEEEECT-TTTTE--------EEEEEEEESS--SSEEEEEE-TTSSEEEEEE
T ss_pred --cCCceeEEEecCCCEEEEEeccCCEEEEEEEec-CCCce--------EEEEEEeCCC--CCccEEEEeCCCCEEEEEe
Confidence 001234889999999999888877544333200 11111 1122222222 236789999878 588888
Q ss_pred cCCCeEEEE--eCCCCc
Q psy11781 1152 VTKHSVGCW--NTKTKV 1166 (1228)
Q Consensus 1152 ~~~~aI~~w--n~~~~~ 1166 (1228)
-..+.|.+| |++++.
T Consensus 311 ~~s~~v~vf~~d~~tG~ 327 (345)
T PF10282_consen 311 QDSNTVSVFDIDPDTGK 327 (345)
T ss_dssp TTTTEEEEEEEETTTTE
T ss_pred cCCCeEEEEEEeCCCCc
Confidence 888888866 444343
No 13
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=97.17 E-value=0.038 Score=62.26 Aligned_cols=206 Identities=14% Similarity=0.167 Sum_probs=100.3
Q ss_pred EEEEEEEeC-CCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEE
Q psy11781 469 SVFRMSVDK-CDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAY 547 (1228)
Q Consensus 469 sV~~v~iD~-~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aY 547 (1228)
.+-++..|+ .++||++=-. ++.|+.+|+ +|++++++.|... .=+-+|++- +++...
T Consensus 23 e~SGLTy~pd~~tLfaV~d~------------~~~i~els~-~G~vlr~i~l~g~-----~D~EgI~y~-----g~~~~v 79 (248)
T PF06977_consen 23 ELSGLTYNPDTGTLFAVQDE------------PGEIYELSL-DGKVLRRIPLDGF-----GDYEGITYL-----GNGRYV 79 (248)
T ss_dssp -EEEEEEETTTTEEEEEETT------------TTEEEEEET-T--EEEEEE-SS------SSEEEEEE------STTEEE
T ss_pred CccccEEcCCCCeEEEEECC------------CCEEEEEcC-CCCEEEEEeCCCC-----CCceeEEEE-----CCCEEE
Confidence 477888886 5789987433 467999998 5899999999541 124456653 455566
Q ss_pred EecCCCCeEEEEEcCCCe--EEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCcccccccccccc
Q psy11781 548 VNDVFRYGLIVYDFFKNT--SYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYN 625 (1228)
Q Consensus 548 ItDs~~~gLIVyDl~~g~--swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~ 625 (1228)
|++.....|+++++.... .-+-. ...+.++ ..-.-.-|.=|+|.++
T Consensus 80 l~~Er~~~L~~~~~~~~~~~~~~~~---------~~~~~l~-~~~~~N~G~EGla~D~---------------------- 127 (248)
T PF06977_consen 80 LSEERDQRLYIFTIDDDTTSLDRAD---------VQKISLG-FPNKGNKGFEGLAYDP---------------------- 127 (248)
T ss_dssp EEETTTTEEEEEEE----TT--EEE---------EEEEE----S---SS--EEEEEET----------------------
T ss_pred EEEcCCCcEEEEEEeccccccchhh---------ceEEecc-cccCCCcceEEEEEcC----------------------
Confidence 777777888888884321 10000 0000000 0000012444666655
Q ss_pred CccccccccCCCceEEEeeCCCC-ceEEEechhhcCCCCCCCccccccceEeeccccccccccceEEc-cCCcEEEEcCC
Q psy11781 626 GTNVDKTIRDDQRYMYFHSLSSN-RHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAIN-SNGVMFYNLVT 703 (1228)
Q Consensus 626 ~~~~~i~is~d~r~LYf~~lss~-~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D-~~G~LY~t~~~ 703 (1228)
.++.||..-=-.. .+|.++.-. ....+ ...+. ..+.. ........+|+++| .+|+||+-.-+
T Consensus 128 ----------~~~~L~v~kE~~P~~l~~~~~~~-~~~~~--~~~~~-~~~~~--~~~~~~d~S~l~~~p~t~~lliLS~e 191 (248)
T PF06977_consen 128 ----------KTNRLFVAKERKPKRLYEVNGFP-GGFDL--FVSDD-QDLDD--DKLFVRDLSGLSYDPRTGHLLILSDE 191 (248)
T ss_dssp ----------TTTEEEEEEESSSEEEEEEESTT--SS----EEEE--HHHH---HT--SS---EEEEETTTTEEEEEETT
T ss_pred ----------CCCEEEEEeCCCChhhEEEcccc-Cccce--eeccc-ccccc--ccceeccccceEEcCCCCeEEEEECC
Confidence 3333444322221 356655311 11111 00000 00000 11112346889999 88999999999
Q ss_pred CCeEEEEeCCCCccCCCCeeEEEecC------CceecCCceEEcCCCcEEEEe
Q psy11781 704 KHSVGCWNTKTKVYLPQTQDIVQTSR------DILNFPNDLKIDPNDNIWILS 750 (1228)
Q Consensus 704 ~~aI~~wn~~~~~~~~~n~~iV~~~d------~~L~wPd~l~Id~dg~LyVls 750 (1228)
...|..+|.+ +... ..+.-.. +.+.=|-||++|++|.|||++
T Consensus 192 s~~l~~~d~~-G~~~----~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvs 239 (248)
T PF06977_consen 192 SRLLLELDRQ-GRVV----SSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVS 239 (248)
T ss_dssp TTEEEEE-TT---EE----EEEE-STTGGG-SS---SEEEEEE-TT--EEEEE
T ss_pred CCeEEEECCC-CCEE----EEEEeCCcccCcccccCCccEEEECCCCCEEEEc
Confidence 9999999988 5421 2222222 345558999999999999998
No 14
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.14 E-value=0.032 Score=64.45 Aligned_cols=204 Identities=12% Similarity=0.065 Sum_probs=112.3
Q ss_pred EEEEEEEeCCCc-EEEEeCCCCCcccCcCccCCCEEEEEECCCCcEE-EEEE-CCCcccccCCCceeEEEEeecCCCCeE
Q psy11781 469 SVFRMSVDKCDR-LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLI-RKYI-LPTAQVFEGSLFSNIVTEVVEDCDHVF 545 (1228)
Q Consensus 469 sV~~v~iD~~gR-LWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi-~~y~-~p~~v~~~~S~l~di~VD~~~~c~~~~ 545 (1228)
++..+.+|++|+ |++..-+ ..+|.+||+.++..+ .... ++ .......++++ .++.+
T Consensus 81 ~p~~i~~~~~g~~l~v~~~~------------~~~v~v~~~~~~g~~~~~~~~~~-----~~~~~~~~~~~----p~g~~ 139 (330)
T PRK11028 81 SPTHISTDHQGRFLFSASYN------------ANCVSVSPLDKDGIPVAPIQIIE-----GLEGCHSANID----PDNRT 139 (330)
T ss_pred CceEEEECCCCCEEEEEEcC------------CCeEEEEEECCCCCCCCceeecc-----CCCcccEeEeC----CCCCE
Confidence 355788888775 7776543 245899999765422 1111 11 11233445665 35789
Q ss_pred EEEecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCcccccccccccc
Q psy11781 546 AYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYN 625 (1228)
Q Consensus 546 aYItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~ 625 (1228)
+|+++.+...|-|||+.++... . +.... ...+ ..-.|..+|++
T Consensus 140 l~v~~~~~~~v~v~d~~~~g~l--~-----~~~~~-~~~~-----~~g~~p~~~~~------------------------ 182 (330)
T PRK11028 140 LWVPCLKEDRIRLFTLSDDGHL--V-----AQEPA-EVTT-----VEGAGPRHMVF------------------------ 182 (330)
T ss_pred EEEeeCCCCEEEEEEECCCCcc--c-----ccCCC-ceec-----CCCCCCceEEE------------------------
Confidence 9999999999999999873321 1 00000 0000 00012233444
Q ss_pred CccccccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCccccccceEeeccccc----cccccceEEccCC-cEEEE
Q psy11781 626 GTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFR----NTQASASAINSNG-VMFYN 700 (1228)
Q Consensus 626 ~~~~~i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~----~~qs~G~a~D~~G-~LY~t 700 (1228)
++++++||-.-..+..+..+..+.- ...+ ..++.++..+. ...+.+++++.+| .||.+
T Consensus 183 --------~pdg~~lyv~~~~~~~v~v~~~~~~-~~~~--------~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~ 245 (330)
T PRK11028 183 --------HPNQQYAYCVNELNSSVDVWQLKDP-HGEI--------ECVQTLDMMPADFSDTRWAADIHITPDGRHLYAC 245 (330)
T ss_pred --------CCCCCEEEEEecCCCEEEEEEEeCC-CCCE--------EEEEEEecCCCcCCCCccceeEEECCCCCEEEEe
Confidence 4588999998775666554443100 0111 11122211110 0123468888777 48888
Q ss_pred cCCCCeEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcCCCc-EEEEecC
Q psy11781 701 LVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDN-IWILSNK 752 (1228)
Q Consensus 701 ~~~~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~dg~-LyVlsnr 752 (1228)
+...+.|..|+.+ .. ...+.++..-. .-.+|.++.++++|. ||+ +|+
T Consensus 246 ~~~~~~I~v~~i~-~~--~~~~~~~~~~~-~~~~p~~~~~~~dg~~l~v-a~~ 293 (330)
T PRK11028 246 DRTASLISVFSVS-ED--GSVLSFEGHQP-TETQPRGFNIDHSGKYLIA-AGQ 293 (330)
T ss_pred cCCCCeEEEEEEe-CC--CCeEEEeEEEe-ccccCCceEECCCCCEEEE-EEc
Confidence 8888999999875 21 11233444222 236899999999995 555 455
No 15
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.14 E-value=0.032 Score=64.44 Aligned_cols=203 Identities=11% Similarity=0.081 Sum_probs=112.0
Q ss_pred EEEEEEeCCCc-EEEEeCCCCCccccccccCCCEEEEEECCCCcEE-EEEE-CCCCcccCCCcceeeEEEeecCCCceEE
Q psy11781 920 VFRMSVDKCDR-LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLI-RKYI-LPTAQVFEGSLFSNIVTEVVEDCDHVFA 996 (1228)
Q Consensus 920 V~~v~iD~~gr-LWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~li-r~y~-~p~~~~~~~S~l~di~VD~~~~c~~~~a 996 (1228)
+..+.+|++|+ |++..-+. .+|.+||+.++..+ .... ++. ......++++ .++.++
T Consensus 82 p~~i~~~~~g~~l~v~~~~~------------~~v~v~~~~~~g~~~~~~~~~~~-----~~~~~~~~~~----p~g~~l 140 (330)
T PRK11028 82 PTHISTDHQGRFLFSASYNA------------NCVSVSPLDKDGIPVAPIQIIEG-----LEGCHSANID----PDNRTL 140 (330)
T ss_pred ceEEEECCCCCEEEEEEcCC------------CeEEEEEECCCCCCCCceeeccC-----CCcccEeEeC----CCCCEE
Confidence 45688888876 66665432 27999999865322 2111 211 1222334554 257899
Q ss_pred EEEcCCCCeEEEEECCCCeEEEeecCCcCcCCCcceeEE-cCEEEEeeCCccceecCccccccccCCCcccccccccccc
Q psy11781 997 YVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYIL-DNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYN 1075 (1228)
Q Consensus 997 YItD~~~~gLIVydl~~g~swRv~h~s~~pdP~~~~~~i-~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~ 1075 (1228)
|+++.+...|-|||+.++.... +... ....+ .| .|.-+
T Consensus 141 ~v~~~~~~~v~v~d~~~~g~l~-------~~~~-~~~~~~~g------~~p~~--------------------------- 179 (330)
T PRK11028 141 WVPCLKEDRIRLFTLSDDGHLV-------AQEP-AEVTTVEG------AGPRH--------------------------- 179 (330)
T ss_pred EEeeCCCCEEEEEEECCCCccc-------ccCC-CceecCCC------CCCce---------------------------
Confidence 9999999999999998633211 0000 00000 01 11111
Q ss_pred CccccceecCCCceEEEEeCCCCceEEEecchhcCCcccCCccccccceEEeccccC----CCCcccEEEcCCC-cEEEE
Q psy11781 1076 GTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFR----NTQASASAINSNG-VMFYN 1150 (1228)
Q Consensus 1076 ~~~~di~ls~d~r~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~----~sqs~g~~~D~~G-~LY~t 1150 (1228)
++++|++++||-.-..+..+...+.. ..+ ..+ ..++.++..+. ...+.++.++.+| .||.+
T Consensus 180 -----~~~~pdg~~lyv~~~~~~~v~v~~~~---~~~-----~~~-~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~ 245 (330)
T PRK11028 180 -----MVFHPNQQYAYCVNELNSSVDVWQLK---DPH-----GEI-ECVQTLDMMPADFSDTRWAADIHITPDGRHLYAC 245 (330)
T ss_pred -----EEECCCCCEEEEEecCCCEEEEEEEe---CCC-----CCE-EEEEEEecCCCcCCCCccceeEEECCCCCEEEEe
Confidence 45667899999887756666555532 110 000 11222221110 0124468888877 48888
Q ss_pred ecCCCeEEEEeCCCCccccCCeeEEEecCCceecCCceEEeCCCc-EEEEeCCc
Q psy11781 1151 LVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDN-IWILSNKL 1203 (1228)
Q Consensus 1151 ~~~~~aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~l~id~dg~-Lyvlsn~l 1203 (1228)
+-..+.|..|+.+ .. .....++..- ..-.+|.++.++++|. ||+ +|+.
T Consensus 246 ~~~~~~I~v~~i~-~~--~~~~~~~~~~-~~~~~p~~~~~~~dg~~l~v-a~~~ 294 (330)
T PRK11028 246 DRTASLISVFSVS-ED--GSVLSFEGHQ-PTETQPRGFNIDHSGKYLIA-AGQK 294 (330)
T ss_pred cCCCCeEEEEEEe-CC--CCeEEEeEEE-eccccCCceEECCCCCEEEE-EEcc
Confidence 7778899998875 21 1123344432 2346899999999995 555 4553
No 16
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=96.99 E-value=0.056 Score=60.93 Aligned_cols=206 Identities=14% Similarity=0.160 Sum_probs=96.7
Q ss_pred EEEEEEEeC-CCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEE
Q psy11781 919 SVFRMSVDK-CDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAY 997 (1228)
Q Consensus 919 SV~~v~iD~-~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aY 997 (1228)
.+=++..|+ .++||.+=- .++.|+.+|+ ++++++++.|... .=+-.|++- +++...
T Consensus 23 e~SGLTy~pd~~tLfaV~d------------~~~~i~els~-~G~vlr~i~l~g~-----~D~EgI~y~-----g~~~~v 79 (248)
T PF06977_consen 23 ELSGLTYNPDTGTLFAVQD------------EPGEIYELSL-DGKVLRRIPLDGF-----GDYEGITYL-----GNGRYV 79 (248)
T ss_dssp -EEEEEEETTTTEEEEEET------------TTTEEEEEET-T--EEEEEE-SS------SSEEEEEE------STTEEE
T ss_pred CccccEEcCCCCeEEEEEC------------CCCEEEEEcC-CCCEEEEEeCCCC-----CCceeEEEE-----CCCEEE
Confidence 467788886 478998743 3458999997 5899999999541 113345542 345566
Q ss_pred EEcCCCCeEEEEECCCCe--EEEeecCCcCcCCCcceeEEcCEEEEeeCCccceecCccccccccCCCcccccccccccc
Q psy11781 998 VNDVFRYGLIVYDFFKNT--SYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYN 1075 (1228)
Q Consensus 998 ItD~~~~gLIVydl~~g~--swRv~h~s~~pdP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~ 1075 (1228)
|++.....|++++..... .-+-. ...+.++- .-.-.-|.=|+|..|
T Consensus 80 l~~Er~~~L~~~~~~~~~~~~~~~~---------~~~~~l~~-~~~~N~G~EGla~D~---------------------- 127 (248)
T PF06977_consen 80 LSEERDQRLYIFTIDDDTTSLDRAD---------VQKISLGF-PNKGNKGFEGLAYDP---------------------- 127 (248)
T ss_dssp EEETTTTEEEEEEE----TT--EEE---------EEEEE----S---SS--EEEEEET----------------------
T ss_pred EEEcCCCcEEEEEEeccccccchhh---------ceEEeccc-ccCCCcceEEEEEcC----------------------
Confidence 777777888888874321 11100 00000000 000011333444433
Q ss_pred CccccceecCCCceEEEEeCCCC-ceEEEecchhcCCcccCCccccccceEEeccccCCCCcccEEEc-CCCcEEEEecC
Q psy11781 1076 GTNVDKTIRDDQRYMYFHSLSSN-RHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAIN-SNGVMFYNLVT 1153 (1228)
Q Consensus 1076 ~~~~di~ls~d~r~LYf~plss~-~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D-~~G~LY~t~~~ 1153 (1228)
.+..||..==-.+ ++|.++.-. ....+ ...+. ..+. ..+......+|+++| .+|.||+-.-+
T Consensus 128 ----------~~~~L~v~kE~~P~~l~~~~~~~-~~~~~--~~~~~-~~~~--~~~~~~~d~S~l~~~p~t~~lliLS~e 191 (248)
T PF06977_consen 128 ----------KTNRLFVAKERKPKRLYEVNGFP-GGFDL--FVSDD-QDLD--DDKLFVRDLSGLSYDPRTGHLLILSDE 191 (248)
T ss_dssp ----------TTTEEEEEEESSSEEEEEEESTT--SS----EEEE--HHHH---HT--SS---EEEEETTTTEEEEEETT
T ss_pred ----------CCCEEEEEeCCCChhhEEEcccc-Cccce--eeccc-cccc--cccceeccccceEEcCCCCeEEEEECC
Confidence 3333433221111 355554311 11111 00000 0000 011112346788999 88999999889
Q ss_pred CCeEEEEeCCCCccccCCeeEEEecC------CceecCCceEEeCCCcEEEEe
Q psy11781 1154 KHSVGCWNTKTKVYLPQTQDIVQTSR------DILNFPNDLKIDPNDNIWILS 1200 (1228)
Q Consensus 1154 ~~aI~~wn~~~~~~~~~n~~~v~~~~------~~L~wPd~l~id~dg~Lyvls 1200 (1228)
...|...|.+ +... ..+.-.. ..+.=|-||++|++|+|||++
T Consensus 192 s~~l~~~d~~-G~~~----~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvs 239 (248)
T PF06977_consen 192 SRLLLELDRQ-GRVV----SSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVS 239 (248)
T ss_dssp TTEEEEE-TT---EE----EEEE-STTGGG-SS---SEEEEEE-TT--EEEEE
T ss_pred CCeEEEECCC-CCEE----EEEEeCCcccCcccccCCccEEEECCCCCEEEEc
Confidence 9999999988 6532 2222222 345569999999999999998
No 17
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.77 E-value=0.32 Score=53.82 Aligned_cols=225 Identities=13% Similarity=0.075 Sum_probs=114.2
Q ss_pred EEEEeCCCc-EEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEc
Q psy11781 922 RMSVDKCDR-LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVND 1000 (1228)
Q Consensus 922 ~v~iD~~gr-LWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD 1000 (1228)
.+.++.+|+ |++...+. -+|.+||+.++++++++..+.. ...+.++ .++..+|++.
T Consensus 35 ~l~~~~dg~~l~~~~~~~------------~~v~~~d~~~~~~~~~~~~~~~-------~~~~~~~----~~g~~l~~~~ 91 (300)
T TIGR03866 35 GITLSKDGKLLYVCASDS------------DTIQVIDLATGEVIGTLPSGPD-------PELFALH----PNGKILYIAN 91 (300)
T ss_pred ceEECCCCCEEEEEECCC------------CeEEEEECCCCcEEEeccCCCC-------ccEEEEC----CCCCEEEEEc
Confidence 456666765 56665332 1688999999998876544222 1234443 2456899998
Q ss_pred CCCCeEEEEECCCCeEEEee-cCCcCcCCCcceeEEcCEEEEee--CCccceecCccccccccCCCccccccccccccCc
Q psy11781 1001 VFRYGLIVYDFFKNTSYRLT-HPYMYPEPTQSTYILDNLKFRWV--DGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGT 1077 (1228)
Q Consensus 1001 ~~~~gLIVydl~~g~swRv~-h~s~~pdP~~~~~~i~G~~f~~~--dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~ 1077 (1228)
.....|.+||+.+++..+.. +. ..|....+.-+|..+... ++..-..+...... ...... .+.
T Consensus 92 ~~~~~l~~~d~~~~~~~~~~~~~---~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~~~-------~~~~~~----~~~ 157 (300)
T TIGR03866 92 EDDNLVTVIDIETRKVLAEIPVG---VEPEGMAVSPDGKIVVNTSETTNMAHFIDTKTYE-------IVDNVL----VDQ 157 (300)
T ss_pred CCCCeEEEEECCCCeEEeEeeCC---CCcceEEECCCCCEEEEEecCCCeEEEEeCCCCe-------EEEEEE----cCC
Confidence 77778999999987654432 11 111111122222222110 00000000000000 000000 011
Q ss_pred ccc-ceecCCCceEEEEeCCCCceEEEecchhcCCcccCCccccccceEE--eccccCCCCcccEEEcCCCcE-EEEecC
Q psy11781 1078 NVD-KTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHY--LGSRFRNTQASASAINSNGVM-FYNLVT 1153 (1228)
Q Consensus 1078 ~~d-i~ls~d~r~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~--lG~rg~~sqs~g~~~D~~G~L-Y~t~~~ 1153 (1228)
... ++.++++++||+..-.+.+++..+++-. .. ...+.. -+......++.+++++++|.. |++.-.
T Consensus 158 ~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~---~~-------~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~ 227 (300)
T TIGR03866 158 RPRFAEFTADGKELWVSSEIGGTVSVIDVATR---KV-------IKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGP 227 (300)
T ss_pred CccEEEECCCCCEEEEEcCCCCEEEEEEcCcc---ee-------eeeeeecccccccccCCccceEECCCCCEEEEEcCC
Confidence 122 6789999999988655555666654321 10 001110 011110123567888888864 777667
Q ss_pred CCeEEEEeCCCCccccCCeeEEEecCCceecCCceEEeCCCcEEEEeC
Q psy11781 1154 KHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDNIWILSN 1201 (1228)
Q Consensus 1154 ~~aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~l~id~dg~Lyvlsn 1201 (1228)
.+.|..||.++++. ...+.. . ..+.++++.++|...+.++
T Consensus 228 ~~~i~v~d~~~~~~----~~~~~~--~--~~~~~~~~~~~g~~l~~~~ 267 (300)
T TIGR03866 228 ANRVAVVDAKTYEV----LDYLLV--G--QRVWQLAFTPDEKYLLTTN 267 (300)
T ss_pred CCeEEEEECCCCcE----EEEEEe--C--CCcceEEECCCCCEEEEEc
Confidence 78899999873321 122221 1 2577889988886434433
No 18
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=96.76 E-value=0.24 Score=59.04 Aligned_cols=329 Identities=16% Similarity=0.184 Sum_probs=152.3
Q ss_pred EEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCCc
Q psy11781 502 KIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQS 581 (1228)
Q Consensus 502 KLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~~ 581 (1228)
+|.++|..|.+++++++-..+. ...+.+. -++.++|++. ..+.|-|+|+.+++.-+
T Consensus 17 ~v~viD~~t~~~~~~i~~~~~~------h~~~~~s----~Dgr~~yv~~-rdg~vsviD~~~~~~v~------------- 72 (369)
T PF02239_consen 17 SVAVIDGATNKVVARIPTGGAP------HAGLKFS----PDGRYLYVAN-RDGTVSVIDLATGKVVA------------- 72 (369)
T ss_dssp EEEEEETTT-SEEEEEE-STTE------EEEEE-T----T-SSEEEEEE-TTSEEEEEETTSSSEEE-------------
T ss_pred EEEEEECCCCeEEEEEcCCCCc------eeEEEec----CCCCEEEEEc-CCCeEEEEECCcccEEE-------------
Confidence 4889999999999988874431 1222221 2467999996 45789999999987433
Q ss_pred ceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccCccccccccCCCceEEEeeCCCCceEEEechhhcCC
Q psy11781 582 TYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNS 661 (1228)
Q Consensus 582 ~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~ 661 (1228)
.+.+++ +..|||+|+ |||++|-+-.....+--++++-|+-
T Consensus 73 ~i~~G~-------~~~~i~~s~--------------------------------DG~~~~v~n~~~~~v~v~D~~tle~- 112 (369)
T PF02239_consen 73 TIKVGG-------NPRGIAVSP--------------------------------DGKYVYVANYEPGTVSVIDAETLEP- 112 (369)
T ss_dssp EEE-SS-------EEEEEEE----------------------------------TTTEEEEEEEETTEEEEEETTT--E-
T ss_pred EEecCC-------CcceEEEcC--------------------------------CCCEEEEEecCCCceeEeccccccc-
Confidence 222221 123566654 8899998877666665566544321
Q ss_pred CCCCCccccccceEeeccc--cccccccceEEccCCc-EEEEcCCCCeEEEEeCCCCccCCCCeeEEEecCCceecCCce
Q psy11781 662 SRYVNSSDIDEYFHYLGSR--FRNTQASASAINSNGV-MFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDL 738 (1228)
Q Consensus 662 ~~~~~~~~~~~~v~~lG~~--g~~~qs~G~a~D~~G~-LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l 738 (1228)
.+.+..-+.. +..+...|+.....+. ..++.-+.+.|+.-|.. .+ .+..+-... .=.+|-+.
T Consensus 113 ---------v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~-d~---~~~~~~~i~--~g~~~~D~ 177 (369)
T PF02239_consen 113 ---------VKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVDYS-DP---KNLKVTTIK--VGRFPHDG 177 (369)
T ss_dssp ---------EEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEETT-TS---SCEEEEEEE----TTEEEE
T ss_pred ---------eeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEEec-cc---cccceeeec--cccccccc
Confidence 1112111110 1111223444444445 44566678999998876 32 222221111 22478899
Q ss_pred EEcCCCc-EEEEecCccccccCCCCCccccEEEEEEecchhccCCCCCCCCCCCCCCCCCCCCCcccCCCC-CcccccCC
Q psy11781 739 KIDPNDN-IWILSNKLHQYLYGFLDFNVYNYRILVANSNDIKNNNQPPNTKSNAGPARPSTDQTKSFHPSV-PSIYPSVP 816 (1228)
Q Consensus 739 ~Id~dg~-LyVlsnrl~~~~~~~~d~~e~nfrI~r~~v~~~i~~t~c~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 816 (1228)
.++++|. +++.+++-++.. -+|..+....+ .+++ .+. +.|..-.+-+|+.- |.|.-+--
T Consensus 178 ~~dpdgry~~va~~~sn~i~--viD~~~~k~v~-~i~~-------g~~---------p~~~~~~~~php~~g~vw~~~~~ 238 (369)
T PF02239_consen 178 GFDPDGRYFLVAANGSNKIA--VIDTKTGKLVA-LIDT-------GKK---------PHPGPGANFPHPGFGPVWATSGL 238 (369)
T ss_dssp EE-TTSSEEEEEEGGGTEEE--EEETTTTEEEE-EEE--------SSS---------BEETTEEEEEETTTEEEEEEEBS
T ss_pred ccCcccceeeecccccceeE--EEeeccceEEE-Eeec-------ccc---------ccccccccccCCCcceEEeeccc
Confidence 9999986 444444433221 22333333322 2221 111 22222223344333 22322211
Q ss_pred CCCC-C--CCC--ccEEEEEEeeeeeecCChHHHHhhccCCCCccCCCcccceEEe--CCeEEEEeccCCCCCCcEEEEE
Q psy11781 817 SNAY-P--SSS--HFQVVFRWKIMDFVFRDNKQKANLIRSNKFIPENNLPLGIGIW--RSTIFLSFPKWKAGIPFTLASF 889 (1228)
Q Consensus 817 ~~~~-~--~~~--~~~~vy~Wk~ldf~fp~~~~r~~ai~~g~y~p~n~~p~gv~v~--~gRiFvt~PR~~~gvP~TLa~v 889 (1228)
.... + ... ...-.+.||.+. -++..--+..++.+ ...++|.++-- +. --++..|
T Consensus 239 ~~~~~~~ig~~~v~v~d~~~wkvv~-----------------~I~~~G~glFi~thP~s~~vwvd~~~~-~~-~~~v~vi 299 (369)
T PF02239_consen 239 GYFAIPLIGTDPVSVHDDYAWKVVK-----------------TIPTQGGGLFIKTHPDSRYVWVDTFLN-PD-ADTVQVI 299 (369)
T ss_dssp SSSEEEEEE--TTT-STTTBTSEEE-----------------EEE-SSSS--EE--TT-SEEEEE-TT--SS-HT-EEEE
T ss_pred cceecccccCCccccchhhcCeEEE-----------------EEECCCCcceeecCCCCccEEeeccCC-CC-CceEEEE
Confidence 1000 0 000 000112222220 01222234566665 56788885332 22 4478888
Q ss_pred ECCCCCCCCccccCCCCCcCCCCCCCcceEEEEEEEeCCCc-EEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEE
Q psy11781 890 NMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVDKCDR-LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYI 968 (1228)
Q Consensus 890 ~~~~~~~sP~l~PYP~~~wn~~~~c~~lvSV~~v~iD~~gr-LWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~ 968 (1228)
+.+.-. ..+...- + .+ ..+..++..++|+ +||-.-..-+ +|++||.+|-+++++++
T Consensus 300 D~~tl~---~~~~i~~--~------~~-~~~~h~ef~~dG~~v~vS~~~~~~-----------~i~v~D~~Tl~~~~~i~ 356 (369)
T PF02239_consen 300 DKKTLK---VVKTITP--G------PG-KRVVHMEFNPDGKEVWVSVWDGNG-----------AIVVYDAKTLKEKKRIP 356 (369)
T ss_dssp ECCGTE---EEE-HHH--H------HT---EEEEEE-TTSSEEEEEEE--TT-----------EEEEEETTTTEEEEEEE
T ss_pred ECcCcc---eeEEEec--c------CC-CcEeccEECCCCCEEEEEEecCCC-----------EEEEEECCCcEEEEEEE
Confidence 752211 1111110 0 01 1277788888875 8997665432 89999999999999999
Q ss_pred CC
Q psy11781 969 LP 970 (1228)
Q Consensus 969 ~p 970 (1228)
++
T Consensus 357 ~~ 358 (369)
T PF02239_consen 357 VP 358 (369)
T ss_dssp --
T ss_pred ee
Confidence 54
No 19
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=96.66 E-value=0.16 Score=59.98 Aligned_cols=236 Identities=11% Similarity=0.032 Sum_probs=121.4
Q ss_pred CcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCC-cccCCCcceeeEEEeecCCCceEEEEEcCC-CCeE
Q psy11781 929 DRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTA-QVFEGSLFSNIVTEVVEDCDHVFAYVNDVF-RYGL 1006 (1228)
Q Consensus 929 grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~-~~~~~S~l~di~VD~~~~c~~~~aYItD~~-~~gL 1006 (1228)
..|+|.-+...... .-...-.|-+||++|.++++++++|+. -..-+..-..+++. -++.++||++.. ...+
T Consensus 58 ~~lyva~~~~~R~~---~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls----~dgk~l~V~n~~p~~~V 130 (352)
T TIGR02658 58 SFFAHASTVYSRIA---RGKRTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLT----PDNKTLLFYQFSPSPAV 130 (352)
T ss_pred CEEEEEeccccccc---cCCCCCEEEEEECccCcEEeEEccCCCchhhccCccceEEEC----CCCCEEEEecCCCCCEE
Confidence 36888877432110 111234799999999999999999865 11112222334443 247899999977 8999
Q ss_pred EEEECCCCeEEE-eec---CCcCc--CCCcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccccccCcccc
Q psy11781 1007 IVYDFFKNTSYR-LTH---PYMYP--EPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVD 1080 (1228)
Q Consensus 1007 IVydl~~g~swR-v~h---~s~~p--dP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~~~d 1080 (1228)
-|+|+++++.-+ |.- ...++ +-.+..+.++|......-+-.|.+. .+..+.||-.. +-++..
T Consensus 131 ~VvD~~~~kvv~ei~vp~~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~~~---------~~~~~vf~~~~---~~v~~r 198 (352)
T TIGR02658 131 GVVDLEGKAFVRMMDVPDCYHIFPTANDTFFMHCRDGSLAKVGYGTKGNPK---------IKPTEVFHPED---EYLINH 198 (352)
T ss_pred EEEECCCCcEEEEEeCCCCcEEEEecCCccEEEeecCceEEEEecCCCceE---------EeeeeeecCCc---cccccC
Confidence 999999998877 331 11111 2122223345554443222222211 11112222100 000000
Q ss_pred ceecC-CCceEEEEeCCCCceEEEe-----cchhcCCcccCCccccccceEEeccccCCCCccc---EEEcCC-CcEEEE
Q psy11781 1081 KTIRD-DQRYMYFHSLSSNRHYYVS-----TTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASA---SAINSN-GVMFYN 1150 (1228)
Q Consensus 1081 i~ls~-d~r~LYf~plss~~ly~V~-----T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs~g---~~~D~~-G~LY~t 1150 (1228)
=..++ |+++||-+.- ..+|.|+ +..+..-.+ +. .+++...=.++| ++++++ +.||+.
T Consensus 199 P~~~~~dg~~~~vs~e--G~V~~id~~~~~~~~~~~~~~----------~~-~~~~~~~wrP~g~q~ia~~~dg~~lyV~ 265 (352)
T TIGR02658 199 PAYSNKSGRLVWPTYT--GKIFQIDLSSGDAKFLPAIEA----------FT-EAEKADGWRPGGWQQVAYHRARDRIYLL 265 (352)
T ss_pred CceEcCCCcEEEEecC--CeEEEEecCCCcceecceeee----------cc-ccccccccCCCcceeEEEcCCCCEEEEE
Confidence 01234 6666555554 5588887 222221110 10 111100011233 899955 577774
Q ss_pred ec---------CCCeEEEEeCCCCccccCCeeEEEecCCceecCCceEEeCCCc-EEEEeCCch
Q psy11781 1151 LV---------TKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDN-IWILSNKLH 1204 (1228)
Q Consensus 1151 ~~---------~~~aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~l~id~dg~-Lyvlsn~l~ 1204 (1228)
.. .-+.|..+|+.|+ +++.+=. .=..|-++++.+||. +-+.+|+..
T Consensus 266 ~~~~~~~thk~~~~~V~ViD~~t~-------kvi~~i~-vG~~~~~iavS~Dgkp~lyvtn~~s 321 (352)
T TIGR02658 266 ADQRAKWTHKTASRFLFVVDAKTG-------KRLRKIE-LGHEIDSINVSQDAKPLLYALSTGD 321 (352)
T ss_pred ecCCccccccCCCCEEEEEECCCC-------eEEEEEe-CCCceeeEEECCCCCeEEEEeCCCC
Confidence 42 2257999999833 2333211 123788999999987 555555543
No 20
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.50 E-value=0.44 Score=52.71 Aligned_cols=228 Identities=14% Similarity=0.093 Sum_probs=112.9
Q ss_pred EEEEeCCCc-EEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEec
Q psy11781 472 RMSVDKCDR-LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVND 550 (1228)
Q Consensus 472 ~v~iD~~gR-LWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItD 550 (1228)
.+.++++|+ |++...+. -+|.+||+.++++++++..... ...+.++ .++..+|++.
T Consensus 35 ~l~~~~dg~~l~~~~~~~------------~~v~~~d~~~~~~~~~~~~~~~-------~~~~~~~----~~g~~l~~~~ 91 (300)
T TIGR03866 35 GITLSKDGKLLYVCASDS------------DTIQVIDLATGEVIGTLPSGPD-------PELFALH----PNGKILYIAN 91 (300)
T ss_pred ceEECCCCCEEEEEECCC------------CeEEEEECCCCcEEEeccCCCC-------ccEEEEC----CCCCEEEEEc
Confidence 456667665 56665431 2488999999998875543221 2234554 2456899998
Q ss_pred CCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeeec--CCcceeecCcccCCcccCCCccccccccccccCcc
Q psy11781 551 VFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWV--DGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTN 628 (1228)
Q Consensus 551 s~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~--dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~ 628 (1228)
.....|.+||+.+++..+..... ..|....+.-+|..+... ++..-..++.... +..+. ...+..
T Consensus 92 ~~~~~l~~~d~~~~~~~~~~~~~--~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~~--------~~~~~---~~~~~~ 158 (300)
T TIGR03866 92 EDDNLVTVIDIETRKVLAEIPVG--VEPEGMAVSPDGKIVVNTSETTNMAHFIDTKTY--------EIVDN---VLVDQR 158 (300)
T ss_pred CCCCeEEEEECCCCeEEeEeeCC--CCcceEEECCCCCEEEEEecCCCeEEEEeCCCC--------eEEEE---EEcCCC
Confidence 77778999999987654433110 011111111122211110 0000000111000 00000 000111
Q ss_pred cc-ccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCccccccceEeeccccccccccceEEccCCcE-EEEcCCCCe
Q psy11781 629 VD-KTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVM-FYNLVTKHS 706 (1228)
Q Consensus 629 ~~-i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G~L-Y~t~~~~~a 706 (1228)
.. ++.++++++||+..-.+..++.++++- ... ...+ .++.-+......++.+++++.+|.. |++....+.
T Consensus 159 ~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~---~~~---~~~~--~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~ 230 (300)
T TIGR03866 159 PRFAEFTADGKELWVSSEIGGTVSVIDVAT---RKV---IKKI--TFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANR 230 (300)
T ss_pred ccEEEECCCCCEEEEEcCCCCEEEEEEcCc---cee---eeee--eecccccccccCCccceEECCCCCEEEEEcCCCCe
Confidence 22 667899999998765454555554321 110 0000 0010010100123567888887764 777777788
Q ss_pred EEEEeCCCCccCCCCeeEEEecCCceecCCceEEcCCCcEEEEec
Q psy11781 707 VGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDNIWILSN 751 (1228)
Q Consensus 707 I~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~dg~LyVlsn 751 (1228)
|..||.++++. ...+.. . ..+.++++.++|...+.++
T Consensus 231 i~v~d~~~~~~----~~~~~~-~---~~~~~~~~~~~g~~l~~~~ 267 (300)
T TIGR03866 231 VAVVDAKTYEV----LDYLLV-G---QRVWQLAFTPDEKYLLTTN 267 (300)
T ss_pred EEEEECCCCcE----EEEEEe-C---CCcceEEECCCCCEEEEEc
Confidence 99999872221 122221 1 3577889998886434433
No 21
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=96.45 E-value=0.12 Score=61.38 Aligned_cols=75 Identities=13% Similarity=0.166 Sum_probs=45.7
Q ss_pred cceEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCC-EEEEEECCC--CcEEEEEECCCcccccCCCceeEEEEeecCCC
Q psy11781 466 SLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPP-KIMVFDLKT--NTLIRKYILPTAQVFEGSLFSNIVTEVVEDCD 542 (1228)
Q Consensus 466 ~lvsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~P-KLvvfDL~t--d~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~ 542 (1228)
.|.++..|.+|++|||||.++..--. ....+.... +|++++-.+ ++..+...|-+. ......|++. .
T Consensus 12 ~~~~P~~ia~d~~G~l~V~e~~~y~~-~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~----l~~p~Gi~~~-----~ 81 (367)
T TIGR02604 12 LLRNPIAVCFDERGRLWVAEGITYSR-PAGRQGPLGDRILILEDADGDGKYDKSNVFAEE----LSMVTGLAVA-----V 81 (367)
T ss_pred ccCCCceeeECCCCCEEEEeCCcCCC-CCCCCCCCCCEEEEEEcCCCCCCcceeEEeecC----CCCccceeEe-----c
Confidence 47888999999999999999854211 111111222 788887653 344444455332 2344667776 4
Q ss_pred CeEEEEecC
Q psy11781 543 HVFAYVNDV 551 (1228)
Q Consensus 543 ~~~aYItDs 551 (1228)
++ +||++.
T Consensus 82 ~G-lyV~~~ 89 (367)
T TIGR02604 82 GG-VYVATP 89 (367)
T ss_pred CC-EEEeCC
Confidence 56 888864
No 22
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=96.44 E-value=0.45 Score=56.36 Aligned_cols=84 Identities=15% Similarity=0.012 Sum_probs=57.9
Q ss_pred CCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCc-ccccCCCceeEEEEeecCCCCeEEEEecCC-CCe
Q psy11781 478 CDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTA-QVFEGSLFSNIVTEVVEDCDHVFAYVNDVF-RYG 555 (1228)
Q Consensus 478 ~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~-v~~~~S~l~di~VD~~~~c~~~~aYItDs~-~~g 555 (1228)
...|+|..+...... .-...-.|-+||++|.++++++++|++ -..-+..-..+++. -++.++||++.. ...
T Consensus 57 g~~lyva~~~~~R~~---~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls----~dgk~l~V~n~~p~~~ 129 (352)
T TIGR02658 57 GSFFAHASTVYSRIA---RGKRTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLT----PDNKTLLFYQFSPSPA 129 (352)
T ss_pred CCEEEEEeccccccc---cCCCCCEEEEEECccCcEEeEEccCCCchhhccCccceEEEC----CCCCEEEEecCCCCCE
Confidence 346888888422110 111235699999999999999999865 10112333455554 256799999977 899
Q ss_pred EEEEEcCCCeEEE
Q psy11781 556 LIVYDFFKNTSYR 568 (1228)
Q Consensus 556 LIVyDl~~g~swR 568 (1228)
+-|+|+.+++.-+
T Consensus 130 V~VvD~~~~kvv~ 142 (352)
T TIGR02658 130 VGVVDLEGKAFVR 142 (352)
T ss_pred EEEEECCCCcEEE
Confidence 9999999988776
No 23
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=96.35 E-value=0.14 Score=60.76 Aligned_cols=75 Identities=13% Similarity=0.166 Sum_probs=45.9
Q ss_pred cceEEEEEEEeCCCcEEEEeCCCCCccccccccCCC-EEEEEECCC--CcEEEEEECCCCcccCCCcceeeEEEeecCCC
Q psy11781 916 SLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPP-KIMVFDLKT--NTLIRKYILPTAQVFEGSLFSNIVTEVVEDCD 992 (1228)
Q Consensus 916 ~lvSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~p-KLvvfDL~t--d~lir~y~~p~~~~~~~S~l~di~VD~~~~c~ 992 (1228)
.|.++..|.+|++|||||.++..-... ...+..+. +|++++-++ ++..++..|.+.. .....|++. .
T Consensus 12 ~~~~P~~ia~d~~G~l~V~e~~~y~~~-~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~l----~~p~Gi~~~-----~ 81 (367)
T TIGR02604 12 LLRNPIAVCFDERGRLWVAEGITYSRP-AGRQGPLGDRILILEDADGDGKYDKSNVFAEEL----SMVTGLAVA-----V 81 (367)
T ss_pred ccCCCceeeECCCCCEEEEeCCcCCCC-CCCCCCCCCEEEEEEcCCCCCCcceeEEeecCC----CCccceeEe-----c
Confidence 478889999999999999998542111 11111222 888887654 3555555664432 233556665 3
Q ss_pred ceEEEEEcC
Q psy11781 993 HVFAYVNDV 1001 (1228)
Q Consensus 993 ~~~aYItD~ 1001 (1228)
++ +||++.
T Consensus 82 ~G-lyV~~~ 89 (367)
T TIGR02604 82 GG-VYVATP 89 (367)
T ss_pred CC-EEEeCC
Confidence 45 888864
No 24
>KOG4659|consensus
Probab=96.14 E-value=0.43 Score=62.59 Aligned_cols=187 Identities=14% Similarity=0.212 Sum_probs=104.1
Q ss_pred CceEEEEEcCCCCeEEEEECCCCeEEEeec----------CCcCcCCC----cceeEEcCEEEE---eeCCccc--eecC
Q psy11781 992 DHVFAYVNDVFRYGLIVYDFFKNTSYRLTH----------PYMYPEPT----QSTYILDNLKFR---WVDGIFG--MAIS 1052 (1228)
Q Consensus 992 ~~~~aYItD~~~~gLIVydl~~g~swRv~h----------~s~~pdP~----~~~~~i~G~~f~---~~dGi~G--iaLs 1052 (1228)
++++.||+. .|.+|-.=+.+|..+-|+- ..|.|-.. .+...|+..++- .+||-.. +.|+
T Consensus 325 nG~n~~it~--~Prvitt~mgdG~qR~veC~~C~G~a~~~~L~aPvala~a~DGSl~VGDfNyIRRI~~dg~v~tIl~L~ 402 (1899)
T KOG4659|consen 325 NGGNRLITE--EPRVITTAMGDGHQRDVECPKCEGKADSISLFAPVALAYAPDGSLIVGDFNYIRRISQDGQVSTILTLG 402 (1899)
T ss_pred CCcceEeec--CCceEEEeccCcccccccCCCCCCccccceeeceeeEEEcCCCcEEEccchheeeecCCCceEEEEEec
Confidence 467888887 7888888888888777761 13333211 122333332221 1444221 1222
Q ss_pred ccccccccCCCccccccccccccCccccceecCCCceEEEEeCCCCceEEEecchhcCCcccCCcccc-------ccceE
Q psy11781 1053 PELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDI-------DEYFH 1125 (1228)
Q Consensus 1053 p~~~~~~~~~p~~~~~~~~~~~~~~~~di~ls~d~r~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~-------~~~v~ 1125 (1228)
- ..|+--|| |++||=+.+||-+-.-++++|+|+.-.=+|... +-+.. ....+
T Consensus 403 ~-------t~~sh~Yy------------~AvsPvdgtlyvSdp~s~qv~rv~sl~~~d~~~--N~evvaG~Ge~Clp~de 461 (1899)
T KOG4659|consen 403 L-------TDTSHSYY------------IAVSPVDGTLYVSDPLSKQVWRVSSLEPQDSRN--NYEVVAGDGEVCLPADE 461 (1899)
T ss_pred C-------CCccceeE------------EEecCcCceEEecCCCcceEEEeccCCcccccc--CeeEEeccCcCcccccc
Confidence 1 11222222 789999999999999999999997522243332 11110 01112
Q ss_pred EeccccC--C---CCcccEEEcCCCcEEEEecCCCeEEEEeCCCCc-----------cccCC-eeEEEecCCceecCCce
Q psy11781 1126 YLGSRFR--N---TQASASAINSNGVMFYNLVTKHSVGCWNTKTKV-----------YLPQT-QDIVQTSRDILNFPNDL 1188 (1228)
Q Consensus 1126 ~lG~rg~--~---sqs~g~~~D~~G~LY~t~~~~~aI~~wn~~~~~-----------~~~~n-~~~v~~~~~~L~wPd~l 1188 (1228)
--|+-+. . -.+-|+++|+.|+|||.+-.. |...|.+ +- ..+.. -.++-.-+-+|.||+++
T Consensus 462 sCGDGalA~dA~L~~PkGIa~dk~g~lYfaD~t~--IR~iD~~-giIstlig~~~~~~~p~~C~~~~kl~~~~leWPT~L 538 (1899)
T KOG4659|consen 462 SCGDGALAQDAQLIFPKGIAFDKMGNLYFADGTR--IRVIDTT-GIISTLIGTTPDQHPPRTCAQITKLVDLQLEWPTSL 538 (1899)
T ss_pred ccCcchhcccceeccCCceeEccCCcEEEecccE--EEEeccC-ceEEEeccCCCCccCccccccccchhheeeecccce
Confidence 2222111 0 126789999999999997654 4444433 21 11111 01111113579999999
Q ss_pred EEeC-CCcEEEEeCCch
Q psy11781 1189 KIDP-NDNIWILSNKLH 1204 (1228)
Q Consensus 1189 ~id~-dg~Lyvlsn~l~ 1204 (1228)
+|++ |+.|||+.+.+-
T Consensus 539 aV~Pmdnsl~Vld~nvv 555 (1899)
T KOG4659|consen 539 AVDPMDNSLLVLDTNVV 555 (1899)
T ss_pred eecCCCCeEEEeecceE
Confidence 9998 778999998763
No 25
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.12 E-value=0.24 Score=57.51 Aligned_cols=170 Identities=18% Similarity=0.208 Sum_probs=106.7
Q ss_pred EEEEEEeCCC-cEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEE
Q psy11781 470 VFRMSVDKCD-RLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYV 548 (1228)
Q Consensus 470 V~~v~iD~~g-RLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYI 548 (1228)
+.-..+++.+ .||+.|-|.- ||.+||+..+++-...+. .+++++.-.-|++.. ++.+||+
T Consensus 147 ~H~a~~tP~~~~l~v~DLG~D------------ri~~y~~~dg~L~~~~~~---~v~~G~GPRHi~FHp----n~k~aY~ 207 (346)
T COG2706 147 VHSANFTPDGRYLVVPDLGTD------------RIFLYDLDDGKLTPADPA---EVKPGAGPRHIVFHP----NGKYAYL 207 (346)
T ss_pred cceeeeCCCCCEEEEeecCCc------------eEEEEEcccCcccccccc---ccCCCCCcceEEEcC----CCcEEEE
Confidence 4455678877 7999999963 599999997777643332 458889999999973 5789999
Q ss_pred ecCCCCeEEEEEcCC--CeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccC
Q psy11781 549 NDVFRYGLIVYDFFK--NTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNG 626 (1228)
Q Consensus 549 tDs~~~gLIVyDl~~--g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~ 626 (1228)
.-.-...++||.... |+..++.--.+.|+ +|+ |. .+
T Consensus 208 v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~----dF~--g~-----~~------------------------------- 245 (346)
T COG2706 208 VNELNSTVDVLEYNPAVGKFEELQTIDTLPE----DFT--GT-----NW------------------------------- 245 (346)
T ss_pred EeccCCEEEEEEEcCCCceEEEeeeeccCcc----ccC--CC-----Cc-------------------------------
Confidence 998899999999887 66666654444444 121 10 01
Q ss_pred ccccccccCCCceEEEeeCCCC--ceEEEechhhcCCCCCCCccccccceEeecccccccc-ccceEEccCCcEEEE-cC
Q psy11781 627 TNVDKTIRDDQRYMYFHSLSSN--RHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQ-ASASAINSNGVMFYN-LV 702 (1228)
Q Consensus 627 ~~~~i~is~d~r~LYf~~lss~--~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~q-s~G~a~D~~G~LY~t-~~ 702 (1228)
++.|-|++|||.||-+--.-. ..|+|+. +.. ....++....-+| +..+.+++.|.+.++ .-
T Consensus 246 -~aaIhis~dGrFLYasNRg~dsI~~f~V~~----~~g----------~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q 310 (346)
T COG2706 246 -AAAIHISPDGRFLYASNRGHDSIAVFSVDP----DGG----------KLELVGITPTEGQFPRDFNINPSGRFLIAANQ 310 (346)
T ss_pred -eeEEEECCCCCEEEEecCCCCeEEEEEEcC----CCC----------EEEEEEEeccCCcCCccceeCCCCCEEEEEcc
Confidence 123667889999998654433 2444431 211 1222232222235 789999977765555 33
Q ss_pred CCCe--EEEEeCCCC
Q psy11781 703 TKHS--VGCWNTKTK 715 (1228)
Q Consensus 703 ~~~a--I~~wn~~~~ 715 (1228)
..+. |+..|+.|+
T Consensus 311 ~sd~i~vf~~d~~TG 325 (346)
T COG2706 311 KSDNITVFERDKETG 325 (346)
T ss_pred CCCcEEEEEEcCCCc
Confidence 3343 444455433
No 26
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.11 E-value=0.22 Score=57.79 Aligned_cols=172 Identities=19% Similarity=0.206 Sum_probs=109.8
Q ss_pred EEEEEEeCCC-cEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEE
Q psy11781 920 VFRMSVDKCD-RLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYV 998 (1228)
Q Consensus 920 V~~v~iD~~g-rLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYI 998 (1228)
+.-..++..+ .||+.|-|.- ||.+||+..++|-...+. .+++++.-..|+... ++.+||+
T Consensus 147 ~H~a~~tP~~~~l~v~DLG~D------------ri~~y~~~dg~L~~~~~~---~v~~G~GPRHi~FHp----n~k~aY~ 207 (346)
T COG2706 147 VHSANFTPDGRYLVVPDLGTD------------RIFLYDLDDGKLTPADPA---EVKPGAGPRHIVFHP----NGKYAYL 207 (346)
T ss_pred cceeeeCCCCCEEEEeecCCc------------eEEEEEcccCcccccccc---ccCCCCCcceEEEcC----CCcEEEE
Confidence 4455677877 7999999985 899999998888754333 447888888899874 5789999
Q ss_pred EcCCCCeEEEEECCC--CeEEEeecCCcCcCCCcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccccccC
Q psy11781 999 NDVFRYGLIVYDFFK--NTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNG 1076 (1228)
Q Consensus 999 tD~~~~gLIVydl~~--g~swRv~h~s~~pdP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~ 1076 (1228)
.-.-...++||.... |+..++.--.+-|+. | . |- -
T Consensus 208 v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~d----F-------~---g~-----------------------------~ 244 (346)
T COG2706 208 VNELNSTVDVLEYNPAVGKFEELQTIDTLPED----F-------T---GT-----------------------------N 244 (346)
T ss_pred EeccCCEEEEEEEcCCCceEEEeeeeccCccc----c-------C---CC-----------------------------C
Confidence 999999999998887 666666533333321 1 1 11 1
Q ss_pred ccccceecCCCceEEEEeCC--CCceEEEecchhcCCcccCCccccccceEEeccccCCCC-cccEEEcCCCcEEEEec-
Q psy11781 1077 TNVDKTIRDDQRYMYFHSLS--SNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQ-ASASAINSNGVMFYNLV- 1152 (1228)
Q Consensus 1077 ~~~di~ls~d~r~LYf~pls--s~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sq-s~g~~~D~~G~LY~t~~- 1152 (1228)
-+++|-||+|||.||-+--. |-..|+|+ .+ ....+.++....-+| +-...++..|.+.+..-
T Consensus 245 ~~aaIhis~dGrFLYasNRg~dsI~~f~V~----~~----------~g~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q 310 (346)
T COG2706 245 WAAAIHISPDGRFLYASNRGHDSIAVFSVD----PD----------GGKLELVGITPTEGQFPRDFNINPSGRFLIAANQ 310 (346)
T ss_pred ceeEEEECCCCCEEEEecCCCCeEEEEEEc----CC----------CCEEEEEEEeccCCcCCccceeCCCCCEEEEEcc
Confidence 23458899999999975322 23334442 11 111233333332234 67888998886655533
Q ss_pred CCC--eEEEEeCCCCcc
Q psy11781 1153 TKH--SVGCWNTKTKVY 1167 (1228)
Q Consensus 1153 ~~~--aI~~wn~~~~~~ 1167 (1228)
..+ .|+..|..|+.+
T Consensus 311 ~sd~i~vf~~d~~TG~L 327 (346)
T COG2706 311 KSDNITVFERDKETGRL 327 (346)
T ss_pred CCCcEEEEEEcCCCceE
Confidence 333 355666665543
No 27
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=95.95 E-value=0.69 Score=55.30 Aligned_cols=193 Identities=16% Similarity=0.179 Sum_probs=120.0
Q ss_pred eEEEEEEEeCCC-cEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEE
Q psy11781 468 ISVFRMSVDKCD-RLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFA 546 (1228)
Q Consensus 468 vsV~~v~iD~~g-RLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~a 546 (1228)
....++.++..+ +++|.+... ..+.++|++++++++...+.. ....+++|. ++.++
T Consensus 74 ~~p~~i~v~~~~~~vyv~~~~~------------~~v~vid~~~~~~~~~~~vG~-------~P~~~~~~~----~~~~v 130 (381)
T COG3391 74 VYPAGVAVNPAGNKVYVTTGDS------------NTVSVIDTATNTVLGSIPVGL-------GPVGLAVDP----DGKYV 130 (381)
T ss_pred ccccceeeCCCCCeEEEecCCC------------CeEEEEcCcccceeeEeeecc-------CCceEEECC----CCCEE
Confidence 344455555433 477777652 348999999999998777754 466788884 56799
Q ss_pred EEecCC--CCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccc
Q psy11781 547 YVNDVF--RYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHY 624 (1228)
Q Consensus 547 YItDs~--~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~ 624 (1228)
|++|++ ..-+.|.|-.+++.-+-. .++. .-.|+|++|
T Consensus 131 YV~n~~~~~~~vsvid~~t~~~~~~~-------------~vG~-------~P~~~a~~p--------------------- 169 (381)
T COG3391 131 YVANAGNGNNTVSVIDAATNKVTATI-------------PVGN-------TPTGVAVDP--------------------- 169 (381)
T ss_pred EEEecccCCceEEEEeCCCCeEEEEE-------------ecCC-------CcceEEECC---------------------
Confidence 999995 577777777776544321 1111 114566655
Q ss_pred cCccccccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCccccccceEeeccccccccccceEEccCCc-EEEEcCC
Q psy11781 625 NGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGV-MFYNLVT 703 (1228)
Q Consensus 625 ~~~~~~i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G~-LY~t~~~ 703 (1228)
+|..+|-..-.+..++.++++.. .. . . ...-..++ . ...+.+.+++.+|. +|+.+-.
T Consensus 170 -----------~g~~vyv~~~~~~~v~vi~~~~~---~v--~--~-~~~~~~~~-~--~~~P~~i~v~~~g~~~yV~~~~ 227 (381)
T COG3391 170 -----------DGNKVYVTNSDDNTVSVIDTSGN---SV--V--R-GSVGSLVG-V--GTGPAGIAVDPDGNRVYVANDG 227 (381)
T ss_pred -----------CCCeEEEEecCCCeEEEEeCCCc---ce--e--c-cccccccc-c--CCCCceEEECCCCCEEEEEecc
Confidence 67788888866666777763221 10 0 0 00000111 1 12368999998887 7877777
Q ss_pred C--CeEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcCCCc-EEEEecC
Q psy11781 704 K--HSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDN-IWILSNK 752 (1228)
Q Consensus 704 ~--~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~dg~-LyVlsnr 752 (1228)
. +.|...|..++.. .... ..-+.+ +|.++.+.++|. +||..++
T Consensus 228 ~~~~~v~~id~~~~~v----~~~~-~~~~~~-~~~~v~~~p~g~~~yv~~~~ 273 (381)
T COG3391 228 SGSNNVLKIDTATGNV----TATD-LPVGSG-APRGVAVDPAGKAAYVANSQ 273 (381)
T ss_pred CCCceEEEEeCCCceE----EEec-cccccC-CCCceeECCCCCEEEEEecC
Confidence 6 6999999972211 1111 133466 899999999985 6776666
No 28
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=95.72 E-value=1.1 Score=53.68 Aligned_cols=192 Identities=15% Similarity=0.148 Sum_probs=118.4
Q ss_pred EEEEEEEeCCC-cEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEE
Q psy11781 919 SVFRMSVDKCD-RLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAY 997 (1228)
Q Consensus 919 SV~~v~iD~~g-rLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aY 997 (1228)
...++.++..+ +++|.+... ..+.++|++++++++.+.+.. .-..+++|. ++.++|
T Consensus 75 ~p~~i~v~~~~~~vyv~~~~~------------~~v~vid~~~~~~~~~~~vG~-------~P~~~~~~~----~~~~vY 131 (381)
T COG3391 75 YPAGVAVNPAGNKVYVTTGDS------------NTVSVIDTATNTVLGSIPVGL-------GPVGLAVDP----DGKYVY 131 (381)
T ss_pred cccceeeCCCCCeEEEecCCC------------CeEEEEcCcccceeeEeeecc-------CCceEEECC----CCCEEE
Confidence 33444444433 477776653 268999999999998887754 346677774 467999
Q ss_pred EEcCC--CCeEEEEECCCCeEEEeecCCcCcCCCcceeEEcCEEEEeeCCccceecCccccccccCCCcccccccccccc
Q psy11781 998 VNDVF--RYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYN 1075 (1228)
Q Consensus 998 ItD~~--~~gLIVydl~~g~swRv~h~s~~pdP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~ 1075 (1228)
++|.+ ..-+.|.|-.+++.-+. ..+.. ...|+
T Consensus 132 V~n~~~~~~~vsvid~~t~~~~~~-------------~~vG~-------~P~~~-------------------------- 165 (381)
T COG3391 132 VANAGNGNNTVSVIDAATNKVTAT-------------IPVGN-------TPTGV-------------------------- 165 (381)
T ss_pred EEecccCCceEEEEeCCCCeEEEE-------------EecCC-------CcceE--------------------------
Confidence 99995 57778888777654442 00010 01233
Q ss_pred CccccceecCCCceEEEEeCCCCceEEEecchhcCCcccCCccccccceEEeccccCCCCcccEEEcCCCc-EEEEecCC
Q psy11781 1076 GTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGV-MFYNLVTK 1154 (1228)
Q Consensus 1076 ~~~~di~ls~d~r~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G~-LY~t~~~~ 1154 (1228)
+++|++..+|-..-.+..++.++++-. .+ ... ........|. -..+.+++.+|. +|+.+-..
T Consensus 166 ------a~~p~g~~vyv~~~~~~~v~vi~~~~~---~v--~~~-~~~~~~~~~~-----~P~~i~v~~~g~~~yV~~~~~ 228 (381)
T COG3391 166 ------AVDPDGNKVYVTNSDDNTVSVIDTSGN---SV--VRG-SVGSLVGVGT-----GPAGIAVDPDGNRVYVANDGS 228 (381)
T ss_pred ------EECCCCCeEEEEecCCCeEEEEeCCCc---ce--ecc-ccccccccCC-----CCceEEECCCCCEEEEEeccC
Confidence 455678889988877777877773221 11 000 0000011222 257899998887 77777776
Q ss_pred --CeEEEEeCCCCccccCCeeEEEecCCceecCCceEEeCCCc-EEEEeCC
Q psy11781 1155 --HSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDN-IWILSNK 1202 (1228)
Q Consensus 1155 --~aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~l~id~dg~-Lyvlsn~ 1202 (1228)
+.|...|..++.. +.... .-+.+ +|.++.++++|. +|+..++
T Consensus 229 ~~~~v~~id~~~~~v----~~~~~-~~~~~-~~~~v~~~p~g~~~yv~~~~ 273 (381)
T COG3391 229 GSNNVLKIDTATGNV----TATDL-PVGSG-APRGVAVDPAGKAAYVANSQ 273 (381)
T ss_pred CCceEEEEeCCCceE----EEecc-ccccC-CCCceeECCCCCEEEEEecC
Confidence 5999999883221 12111 23567 999999999884 6666555
No 29
>PRK02888 nitrous-oxide reductase; Validated
Probab=95.54 E-value=0.63 Score=58.39 Aligned_cols=119 Identities=13% Similarity=0.194 Sum_probs=78.2
Q ss_pred ceecCCCceEEEEeCCCCceEEEecchhcC---CcccCCccccccceEEeccccCCCCcccEEEcCCCcEEEEecCCCeE
Q psy11781 1081 KTIRDDQRYMYFHSLSSNRHYYVSTTDLRN---SSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKHSV 1157 (1228)
Q Consensus 1081 i~ls~d~r~LYf~plss~~ly~V~T~~Lr~---~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G~LY~t~~~~~aI 1157 (1228)
+.+||||+++|-.-.-|..+-.|+++-++. ..+ ...+.+..+++. |..+ -..++|.+|+.|.++.-.+.|
T Consensus 326 V~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~-~~~~~vvaevev-GlGP-----LHTaFDg~G~aytslf~dsqv 398 (635)
T PRK02888 326 VNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKI-KPRDAVVAEPEL-GLGP-----LHTAFDGRGNAYTTLFLDSQI 398 (635)
T ss_pred eEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccC-CccceEEEeecc-CCCc-----ceEEECCCCCEEEeEeeccee
Confidence 679999999999999999999999988764 111 012233444553 5433 456889999999999999999
Q ss_pred EEEeCCCCc---cccCCeeEEEecCCceec-CCceEE------eCCCcEEEEeCCchh--hhc
Q psy11781 1158 GCWNTKTKV---YLPQTQDIVQTSRDILNF-PNDLKI------DPNDNIWILSNKLHQ--YLY 1208 (1228)
Q Consensus 1158 ~~wn~~~~~---~~~~n~~~v~~~~~~L~w-Pd~l~i------d~dg~Lyvlsn~l~~--~~~ 1208 (1228)
.+||.+ +. |..+...-|.+ .-...+ |-.+.. +++|.-.|.-|++.+ |+.
T Consensus 399 ~kwn~~-~a~~~~~g~~~~~v~~-k~dV~y~pgh~~~~~g~t~~~dgk~l~~~nk~skdrfl~ 459 (635)
T PRK02888 399 VKWNIE-AAIRAYKGEKVDPIVQ-KLDVHYQPGHNHASMGETKEADGKWLVSLNKFSKDRFLP 459 (635)
T ss_pred EEEehH-HHHHHhccccCCccee-cccCCCccceeeecCCCcCCCCCCEEEEccccccccccC
Confidence 999987 42 33333222221 111222 333333 678866677788874 653
No 30
>KOG4499|consensus
Probab=94.96 E-value=0.79 Score=50.89 Aligned_cols=123 Identities=15% Similarity=0.190 Sum_probs=73.8
Q ss_pred CCCcccc-ccccccccCc----ccc--ccccCCCceEEEeeCCCCce--EE--EechhhcCCCCCCCccccccceEeecc
Q psy11781 611 RHPYEYY-HYNVHHYNGT----NVD--KTIRDDQRYMYFHSLSSNRH--YY--VSTTDLRNSSRYVNSSDIDEYFHYLGS 679 (1228)
Q Consensus 611 ~~~~~~~-~~~~~~~~~~----~~~--i~is~d~r~LYf~~lss~~l--y~--V~T~~Lr~~~~~~~~~~~~~~v~~lG~ 679 (1228)
+|..++| ....|+++.+ +.. |+-|-+.+.+||.--.+..+ |. .+|-.|.|... --++.+ .+..+
T Consensus 136 ~~~g~Ly~~~~~h~v~~i~~~v~IsNgl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~---i~dlrk-~~~~e- 210 (310)
T KOG4499|consen 136 PIGGELYSWLAGHQVELIWNCVGISNGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKV---IFDLRK-SQPFE- 210 (310)
T ss_pred ccccEEEEeccCCCceeeehhccCCccccccccCcEEEEEccCceEEeeeecCCCcccccCcce---eEEecc-CCCcC-
Confidence 3455666 5555555432 222 66778889999987777655 33 45555544321 001111 11111
Q ss_pred ccccccccceEEccCCcEEEEcCCCCeEEEEeCCCCccCCCCeeEEEecCCceecC----CceEEcCC--CcEEEEecC
Q psy11781 680 RFRNTQASASAINSNGVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFP----NDLKIDPN--DNIWILSNK 752 (1228)
Q Consensus 680 ~g~~~qs~G~a~D~~G~LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wP----d~l~Id~d--g~LyVlsnr 752 (1228)
.-.++||++|.+|+||++-.+...|.+.|+.|+... + ++.+| +..+++.. ..|||++++
T Consensus 211 ---~~~PDGm~ID~eG~L~Va~~ng~~V~~~dp~tGK~L---~--------eiklPt~qitsccFgGkn~d~~yvT~aa 275 (310)
T KOG4499|consen 211 ---SLEPDGMTIDTEGNLYVATFNGGTVQKVDPTTGKIL---L--------EIKLPTPQITSCCFGGKNLDILYVTTAA 275 (310)
T ss_pred ---CCCCCcceEccCCcEEEEEecCcEEEEECCCCCcEE---E--------EEEcCCCceEEEEecCCCccEEEEEehh
Confidence 124899999999999999999999999999855422 1 12233 34555544 358888854
No 31
>KOG4659|consensus
Probab=94.80 E-value=0.09 Score=68.39 Aligned_cols=118 Identities=16% Similarity=0.242 Sum_probs=71.7
Q ss_pred ccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCcccc-------ccceEeeccccc-----cccccceEEccCCcEE
Q psy11781 631 KTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDI-------DEYFHYLGSRFR-----NTQASASAINSNGVMF 698 (1228)
Q Consensus 631 i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~-------~~~v~~lG~~g~-----~~qs~G~a~D~~G~LY 698 (1228)
|++||=..+||.+-..++++|+|+.-.-++... +.+.+ ....+--|+-+. ...+.|+++|+.|+||
T Consensus 412 ~AvsPvdgtlyvSdp~s~qv~rv~sl~~~d~~~--N~evvaG~Ge~Clp~desCGDGalA~dA~L~~PkGIa~dk~g~lY 489 (1899)
T KOG4659|consen 412 IAVSPVDGTLYVSDPLSKQVWRVSSLEPQDSRN--NYEVVAGDGEVCLPADESCGDGALAQDAQLIFPKGIAFDKMGNLY 489 (1899)
T ss_pred EEecCcCceEEecCCCcceEEEeccCCcccccc--CeeEEeccCcCccccccccCcchhcccceeccCCceeEccCCcEE
Confidence 566667789999999999999997533233321 11101 111222333221 1237899999999999
Q ss_pred EEcCCCCeEEEEeCCCCccC------CCCee------EEEecCCceecCCceEEcC-CCcEEEEecCc
Q psy11781 699 YNLVTKHSVGCWNTKTKVYL------PQTQD------IVQTSRDILNFPNDLKIDP-NDNIWILSNKL 753 (1228)
Q Consensus 699 ~t~~~~~aI~~wn~~~~~~~------~~n~~------iV~~~d~~L~wPd~l~Id~-dg~LyVlsnrl 753 (1228)
|+|... |-..|.+ +-.. +.... +.-..+-+|.||++++|++ |+.|||+.+.+
T Consensus 490 faD~t~--IR~iD~~-giIstlig~~~~~~~p~~C~~~~kl~~~~leWPT~LaV~Pmdnsl~Vld~nv 554 (1899)
T KOG4659|consen 490 FADGTR--IRVIDTT-GIISTLIGTTPDQHPPRTCAQITKLVDLQLEWPTSLAVDPMDNSLLVLDTNV 554 (1899)
T ss_pred EecccE--EEEeccC-ceEEEeccCCCCccCccccccccchhheeeecccceeecCCCCeEEEeecce
Confidence 998763 4445544 2210 00011 1111234789999999998 66899999876
No 32
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=94.79 E-value=0.056 Score=51.65 Aligned_cols=54 Identities=17% Similarity=0.453 Sum_probs=37.2
Q ss_pred ceEEccC-CcEEEEcCC-----------------CCeEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcCCCc-EEE
Q psy11781 688 ASAINSN-GVMFYNLVT-----------------KHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDN-IWI 748 (1228)
Q Consensus 688 G~a~D~~-G~LY~t~~~-----------------~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~dg~-LyV 748 (1228)
+++++++ |.|||++.+ .+++.+|||.|+ ...++. +.|.||+|+++.+|+. |.|
T Consensus 2 dldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~-----~~~vl~---~~L~fpNGVals~d~~~vlv 73 (89)
T PF03088_consen 2 DLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTK-----ETTVLL---DGLYFPNGVALSPDESFVLV 73 (89)
T ss_dssp EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTT-----EEEEEE---EEESSEEEEEE-TTSSEEEE
T ss_pred ceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCC-----eEEEeh---hCCCccCeEEEcCCCCEEEE
Confidence 5788876 999999875 578999999843 335665 3699999999999986 444
Q ss_pred E
Q psy11781 749 L 749 (1228)
Q Consensus 749 l 749 (1228)
.
T Consensus 74 ~ 74 (89)
T PF03088_consen 74 A 74 (89)
T ss_dssp E
T ss_pred E
Confidence 3
No 33
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=94.58 E-value=2.8 Score=50.15 Aligned_cols=274 Identities=12% Similarity=0.096 Sum_probs=126.3
Q ss_pred CCcccceEEe-CCeEEEEeccCCCCCCcEEEEEECCCCCCCCccccCCCCCcCCCCCCCcceEEEEEEEeCCCcEEEEeC
Q psy11781 858 NNLPLGIGIW-RSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVDKCDRLWIMDT 936 (1228)
Q Consensus 858 n~~p~gv~v~-~gRiFvt~PR~~~gvP~TLa~v~~~~~~~sP~l~PYP~~~wn~~~~c~~lvSV~~v~iD~~grLWVLDt 936 (1228)
..-|.|++++ +||..+.--.+ |-++..++.+.-. .++=.|.-...... .=..|.+|........||+-.
T Consensus 77 G~~~~~i~~s~DG~~~~v~n~~----~~~v~v~D~~tle---~v~~I~~~~~~~~~---~~~Rv~aIv~s~~~~~fVv~l 146 (369)
T PF02239_consen 77 GGNPRGIAVSPDGKYVYVANYE----PGTVSVIDAETLE---PVKTIPTGGMPVDG---PESRVAAIVASPGRPEFVVNL 146 (369)
T ss_dssp SSEEEEEEE--TTTEEEEEEEE----TTEEEEEETTT-----EEEEEE--EE-TTT---S---EEEEEE-SSSSEEEEEE
T ss_pred CCCcceEEEcCCCCEEEEEecC----CCceeEecccccc---ceeecccccccccc---cCCCceeEEecCCCCEEEEEE
Confidence 4458899996 66655544333 5578878753211 12211221111110 011233344444455566532
Q ss_pred CCCCccccccccCCCEEEEEECCCCc--EEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCC
Q psy11781 937 GVTNILSSIQQLCPPKIMVFDLKTNT--LIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKN 1014 (1228)
Q Consensus 937 G~~~~~~~~~~~~~pKLvvfDL~td~--lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g 1014 (1228)
-. .++|++.|....+ .++++.. +.++.|..+|. ++.|.|++-.+...|.|+|.+++
T Consensus 147 kd-----------~~~I~vVdy~d~~~~~~~~i~~-------g~~~~D~~~dp----dgry~~va~~~sn~i~viD~~~~ 204 (369)
T PF02239_consen 147 KD-----------TGEIWVVDYSDPKNLKVTTIKV-------GRFPHDGGFDP----DGRYFLVAANGSNKIAVIDTKTG 204 (369)
T ss_dssp TT-----------TTEEEEEETTTSSCEEEEEEE---------TTEEEEEE-T----TSSEEEEEEGGGTEEEEEETTTT
T ss_pred cc-----------CCeEEEEEeccccccceeeecc-------cccccccccCc----ccceeeecccccceeEEEeeccc
Confidence 22 1256666665543 2333332 35778888873 56799998888889999999999
Q ss_pred eEEEeecCCcCcCCCcc-eeEE-cCEEEEe---eCCccceecCccccccccCCCccccccc-cccccCccccceecCCCc
Q psy11781 1015 TSYRLTHPYMYPEPTQS-TYIL-DNLKFRW---VDGIFGMAISPELSGKYKRHPYEYYHYN-VHHYNGTNVDKTIRDDQR 1088 (1228)
Q Consensus 1015 ~swRv~h~s~~pdP~~~-~~~i-~G~~f~~---~dGi~GiaLsp~~~~~~~~~p~~~~~~~-~~~~~~~~~di~ls~d~r 1088 (1228)
+.-....---.|.|..+ .|.. ..-.+ | -.|.+-+++.-..+. ..|+...+..- .+...|.+.-+..+|+++
T Consensus 205 k~v~~i~~g~~p~~~~~~~~php~~g~v-w~~~~~~~~~~~~ig~~~v--~v~d~~~wkvv~~I~~~G~glFi~thP~s~ 281 (369)
T PF02239_consen 205 KLVALIDTGKKPHPGPGANFPHPGFGPV-WATSGLGYFAIPLIGTDPV--SVHDDYAWKVVKTIPTQGGGLFIKTHPDSR 281 (369)
T ss_dssp EEEEEEE-SSSBEETTEEEEEETTTEEE-EEEEBSSSSEEEEEE--TT--T-STTTBTSEEEEEE-SSSS--EE--TT-S
T ss_pred eEEEEeeccccccccccccccCCCcceE-EeeccccceecccccCCcc--ccchhhcCeEEEEEECCCCcceeecCCCCc
Confidence 88765432222222111 1111 11111 2 122222222211111 12444444222 234445553488999999
Q ss_pred eEEEE-eCCC--CceEEEecchhcCCcccCCccccccceEEeccccCCCCcccEEEcCCC-cEEEEecCCC-eEEEEeCC
Q psy11781 1089 YMYFH-SLSS--NRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNG-VMFYNLVTKH-SVGCWNTK 1163 (1228)
Q Consensus 1089 ~LYf~-plss--~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G-~LY~t~~~~~-aI~~wn~~ 1163 (1228)
+||.. +|+. ..+..|+++-|. .++.+...+ ..-...+.+.++| .+|++.-..+ +|..+|++
T Consensus 282 ~vwvd~~~~~~~~~v~viD~~tl~-------------~~~~i~~~~-~~~~~h~ef~~dG~~v~vS~~~~~~~i~v~D~~ 347 (369)
T PF02239_consen 282 YVWVDTFLNPDADTVQVIDKKTLK-------------VVKTITPGP-GKRVVHMEFNPDGKEVWVSVWDGNGAIVVYDAK 347 (369)
T ss_dssp EEEEE-TT-SSHT-EEEEECCGTE-------------EEE-HHHHH-T--EEEEEE-TTSSEEEEEEE--TTEEEEEETT
T ss_pred cEEeeccCCCCCceEEEEECcCcc-------------eeEEEeccC-CCcEeccEECCCCCEEEEEEecCCCEEEEEECC
Confidence 99999 3332 345555554441 112222111 0014567788777 7999999888 99999998
Q ss_pred CCccccCCeeEEEecCCceecCCceE
Q psy11781 1164 TKVYLPQTQDIVQTSRDILNFPNDLK 1189 (1228)
Q Consensus 1164 ~~~~~~~n~~~v~~~~~~L~wPd~l~ 1189 (1228)
| ++++..=. +.-|+|..
T Consensus 348 T-------l~~~~~i~--~~tP~G~f 364 (369)
T PF02239_consen 348 T-------LKEKKRIP--VPTPTGKF 364 (369)
T ss_dssp T-------TEEEEEEE----SEEEEE
T ss_pred C-------cEEEEEEE--eeCCCeEe
Confidence 3 33443321 55666654
No 34
>PRK02888 nitrous-oxide reductase; Validated
Probab=94.30 E-value=2.1 Score=53.98 Aligned_cols=118 Identities=16% Similarity=0.211 Sum_probs=73.7
Q ss_pred ccccCCCceEEEeeCCCCceEEEechhhcC---CCCCCCccccccceEeeccccccccccceEEccCCcEEEEcCCCCeE
Q psy11781 631 KTIRDDQRYMYFHSLSSNRHYYVSTTDLRN---SSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKHSV 707 (1228)
Q Consensus 631 i~is~d~r~LYf~~lss~~ly~V~T~~Lr~---~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G~LY~t~~~~~aI 707 (1228)
+.+||||+++|-.-..|..+--|+++-++. ..+ -..+.+...++. |.. +-..++|.+|+.|.++.-.++|
T Consensus 326 V~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~-~~~~~vvaevev-GlG-----PLHTaFDg~G~aytslf~dsqv 398 (635)
T PRK02888 326 VNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKI-KPRDAVVAEPEL-GLG-----PLHTAFDGRGNAYTTLFLDSQI 398 (635)
T ss_pred eEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccC-CccceEEEeecc-CCC-----cceEEECCCCCEEEeEeeccee
Confidence 568899999999999999998888877654 111 011122334443 432 4677889999999999999999
Q ss_pred EEEeCCCC---ccCCCCee-EEEecCCceecCCceEE------cCCCcEEEEecCcc--ccc
Q psy11781 708 GCWNTKTK---VYLPQTQD-IVQTSRDILNFPNDLKI------DPNDNIWILSNKLH--QYL 757 (1228)
Q Consensus 708 ~~wn~~~~---~~~~~n~~-iV~~~d~~L~wPd~l~I------d~dg~LyVlsnrl~--~~~ 757 (1228)
.+||.. + .|..+... ++..-+--. =|-.+.. +++|.-.|.-|+|- +|+
T Consensus 399 ~kwn~~-~a~~~~~g~~~~~v~~k~dV~y-~pgh~~~~~g~t~~~dgk~l~~~nk~skdrfl 458 (635)
T PRK02888 399 VKWNIE-AAIRAYKGEKVDPIVQKLDVHY-QPGHNHASMGETKEADGKWLVSLNKFSKDRFL 458 (635)
T ss_pred EEEehH-HHHHHhccccCCcceecccCCC-ccceeeecCCCcCCCCCCEEEEcccccccccc
Confidence 999987 4 23333322 222111111 1333433 56776556667774 355
No 35
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=94.11 E-value=0.16 Score=59.34 Aligned_cols=80 Identities=21% Similarity=0.242 Sum_probs=56.9
Q ss_pred CCCcEEEE-eCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCe
Q psy11781 927 KCDRLWIM-DTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYG 1005 (1228)
Q Consensus 927 ~~grLWVL-DtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~g 1005 (1228)
+.+||+|| +.|... ......-.|++||++|+|.++|++++.. ..+|.|-- -++-..|..+.+...
T Consensus 248 ~~~rlyvLMh~g~~g----sHKdpgteVWv~D~~t~krv~Ri~l~~~-------~~Si~Vsq---d~~P~L~~~~~~~~~ 313 (342)
T PF06433_consen 248 ASGRLYVLMHQGGEG----SHKDPGTEVWVYDLKTHKRVARIPLEHP-------IDSIAVSQ---DDKPLLYALSAGDGT 313 (342)
T ss_dssp TTTEEEEEEEE--TT-----TTS-EEEEEEEETTTTEEEEEEEEEEE-------ESEEEEES---SSS-EEEEEETTTTE
T ss_pred ccCeEEEEecCCCCC----CccCCceEEEEEECCCCeEEEEEeCCCc-------cceEEEcc---CCCcEEEEEcCCCCe
Confidence 57899998 666532 2344556899999999999999999654 23566631 256799999998899
Q ss_pred EEEEECCCCeEEEee
Q psy11781 1006 LIVYDFFKNTSYRLT 1020 (1228)
Q Consensus 1006 LIVydl~~g~swRv~ 1020 (1228)
|.|||..+|+.-|..
T Consensus 314 l~v~D~~tGk~~~~~ 328 (342)
T PF06433_consen 314 LDVYDAATGKLVRSI 328 (342)
T ss_dssp EEEEETTT--EEEEE
T ss_pred EEEEeCcCCcEEeeh
Confidence 999999999888853
No 36
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=93.90 E-value=3.6 Score=47.36 Aligned_cols=67 Identities=10% Similarity=0.121 Sum_probs=45.8
Q ss_pred cccEEEc-CCCcEEEEecCCCeEEEEeCCCCcc------ccCCeeEEEecCCceecCCceEEeCCCcEEEEe--CCchhh
Q psy11781 1136 ASASAIN-SNGVMFYNLVTKHSVGCWNTKTKVY------LPQTQDIVQTSRDILNFPNDLKIDPNDNIWILS--NKLHQY 1206 (1228)
Q Consensus 1136 s~g~~~D-~~G~LY~t~~~~~aI~~wn~~~~~~------~~~n~~~v~~~~~~L~wPd~l~id~dg~Lyvls--n~l~~~ 1206 (1228)
-+|+.+| .+|.|++=--+...+.-.|.. +.- ...+.++ ...+.=+-|+++|.+|.|||+| |.+-+|
T Consensus 235 vSgl~~~~~~~~LLVLS~ESr~l~Evd~~-G~~~~~lsL~~g~~gL----~~dipqaEGiamDd~g~lYIvSEPnlfy~F 309 (316)
T COG3204 235 VSGLEFNAITNSLLVLSDESRRLLEVDLS-GEVIELLSLTKGNHGL----SSDIPQAEGIAMDDDGNLYIVSEPNLFYRF 309 (316)
T ss_pred cccceecCCCCcEEEEecCCceEEEEecC-CCeeeeEEeccCCCCC----cccCCCcceeEECCCCCEEEEecCCcceec
Confidence 5688888 677777766677778888877 542 1112222 1246678999999999999999 555554
Q ss_pred h
Q psy11781 1207 L 1207 (1228)
Q Consensus 1207 ~ 1207 (1228)
.
T Consensus 310 ~ 310 (316)
T COG3204 310 T 310 (316)
T ss_pred c
Confidence 3
No 37
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=93.80 E-value=4.1 Score=46.91 Aligned_cols=191 Identities=14% Similarity=0.103 Sum_probs=102.4
Q ss_pred CCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcCC
Q psy11781 499 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEP 578 (1228)
Q Consensus 499 ~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pdP 578 (1228)
.||.||-+++ +|+++++++++.- . =+-.|..= +++..=|+|....+|.++.+..+..--....
T Consensus 106 ~p~~iVElt~-~GdlirtiPL~g~-~----DpE~Ieyi-----g~n~fvi~dER~~~l~~~~vd~~t~~~~~~~------ 168 (316)
T COG3204 106 KPAAIVELTK-EGDLIRTIPLTGF-S----DPETIEYI-----GGNQFVIVDERDRALYLFTVDADTTVISAKV------ 168 (316)
T ss_pred CCceEEEEec-CCceEEEeccccc-C----ChhHeEEe-----cCCEEEEEehhcceEEEEEEcCCccEEeccc------
Confidence 5788999998 6899999999541 1 11122221 2233336777778888777776533222111
Q ss_pred CCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccCccccccccCCCceEEEeeC-CCCceEEEechh
Q psy11781 579 TQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSL-SSNRHYYVSTTD 657 (1228)
Q Consensus 579 ~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~l-ss~~ly~V~T~~ 657 (1228)
..+..+... ...-|.-|+|-+| .++.|||.-= .--.+|.+.
T Consensus 169 --~~i~L~~~~-k~N~GfEGlA~d~--------------------------------~~~~l~~aKEr~P~~I~~~~--- 210 (316)
T COG3204 169 --QKIPLGTTN-KKNKGFEGLAWDP--------------------------------VDHRLFVAKERNPIGIFEVT--- 210 (316)
T ss_pred --eEEeccccC-CCCcCceeeecCC--------------------------------CCceEEEEEccCCcEEEEEe---
Confidence 011111110 0123555666665 3445555322 222366665
Q ss_pred hcCC-CCCCC---ccccccceEeeccccccccccceEEc-cCCcEEEEcCCCCeEEEEeCCCCcc------CCCCeeEEE
Q psy11781 658 LRNS-SRYVN---SSDIDEYFHYLGSRFRNTQASASAIN-SNGVMFYNLVTKHSVGCWNTKTKVY------LPQTQDIVQ 726 (1228)
Q Consensus 658 Lr~~-~~~~~---~~~~~~~v~~lG~~g~~~qs~G~a~D-~~G~LY~t~~~~~aI~~wn~~~~~~------~~~n~~iV~ 726 (1228)
+.+ ++... +......+. ...-+|+.+| .+|+|++=.-+...+.-.|.. ++- ...+.++-
T Consensus 211 -~~~~~l~~~~~~~~~~~~~~f-------~~DvSgl~~~~~~~~LLVLS~ESr~l~Evd~~-G~~~~~lsL~~g~~gL~- 280 (316)
T COG3204 211 -QSPSSLSVHASLDPTADRDLF-------VLDVSGLEFNAITNSLLVLSDESRRLLEVDLS-GEVIELLSLTKGNHGLS- 280 (316)
T ss_pred -cCCcccccccccCcccccceE-------eeccccceecCCCCcEEEEecCCceEEEEecC-CCeeeeEEeccCCCCCc-
Confidence 111 11000 000111111 0125788888 678888877788888888887 441 11111111
Q ss_pred ecCCceecCCceEEcCCCcEEEEe--cCccccc
Q psy11781 727 TSRDILNFPNDLKIDPNDNIWILS--NKLHQYL 757 (1228)
Q Consensus 727 ~~d~~L~wPd~l~Id~dg~LyVls--nrl~~~~ 757 (1228)
..+.=+.|+++|.+|.|||+| |.|-+|-
T Consensus 281 ---~dipqaEGiamDd~g~lYIvSEPnlfy~F~ 310 (316)
T COG3204 281 ---SDIPQAEGIAMDDDGNLYIVSEPNLFYRFT 310 (316)
T ss_pred ---ccCCCcceeEECCCCCEEEEecCCcceecc
Confidence 134558899999999999999 5555543
No 38
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=93.75 E-value=0.25 Score=57.78 Aligned_cols=81 Identities=21% Similarity=0.240 Sum_probs=57.6
Q ss_pred CCCcEEEE-eCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCe
Q psy11781 477 KCDRLWIM-DTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYG 555 (1228)
Q Consensus 477 ~~gRLWVL-DtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~g 555 (1228)
+.+||+|| +.|..+. .....-.|++||++|++.++|++++.. ..+|.|-- -+.-.+|..+.+...
T Consensus 248 ~~~rlyvLMh~g~~gs----HKdpgteVWv~D~~t~krv~Ri~l~~~-------~~Si~Vsq---d~~P~L~~~~~~~~~ 313 (342)
T PF06433_consen 248 ASGRLYVLMHQGGEGS----HKDPGTEVWVYDLKTHKRVARIPLEHP-------IDSIAVSQ---DDKPLLYALSAGDGT 313 (342)
T ss_dssp TTTEEEEEEEE--TT-----TTS-EEEEEEEETTTTEEEEEEEEEEE-------ESEEEEES---SSS-EEEEEETTTTE
T ss_pred ccCeEEEEecCCCCCC----ccCCceEEEEEECCCCeEEEEEeCCCc-------cceEEEcc---CCCcEEEEEcCCCCe
Confidence 68899987 7665432 334567899999999999999999554 33666642 256789999888899
Q ss_pred EEEEEcCCCeEEEEcC
Q psy11781 556 LIVYDFFKNTSYRLTH 571 (1228)
Q Consensus 556 LIVyDl~~g~swRv~~ 571 (1228)
|.|||..+|+.-|...
T Consensus 314 l~v~D~~tGk~~~~~~ 329 (342)
T PF06433_consen 314 LDVYDAATGKLVRSIE 329 (342)
T ss_dssp EEEEETTT--EEEEE-
T ss_pred EEEEeCcCCcEEeehh
Confidence 9999999998888653
No 39
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=92.17 E-value=0.39 Score=45.97 Aligned_cols=36 Identities=19% Similarity=0.179 Sum_probs=22.2
Q ss_pred eeEEEEeecCCCCeEEEEecCC-----------------CCeEEEEEcCCCeEEEEc
Q psy11781 531 SNIVTEVVEDCDHVFAYVNDVF-----------------RYGLIVYDFFKNTSYRLT 570 (1228)
Q Consensus 531 ~di~VD~~~~c~~~~aYItDs~-----------------~~gLIVyDl~~g~swRv~ 570 (1228)
|++.|+- +++.+|+||++ .+.|+.||..+++..-++
T Consensus 1 ndldv~~----~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~ 53 (89)
T PF03088_consen 1 NDLDVDQ----DTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLL 53 (89)
T ss_dssp -EEEE-T----TT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEE
T ss_pred CceeEec----CCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEeh
Confidence 4677771 34999999996 367999999987655444
No 40
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=92.12 E-value=1.6 Score=52.64 Aligned_cols=125 Identities=16% Similarity=0.160 Sum_probs=60.9
Q ss_pred CEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeec--CCCceEEEEEcCCCCeEEEEECCCCeEEEeecCCcCcCC
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVE--DCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEP 1028 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~--~c~~~~aYItD~~~~gLIVydl~~g~swRv~h~s~~pdP 1028 (1228)
-+|++||+++.++++++.|+++ .-+...+|. +-...+.|+.=+-...|..+=...+-.|..+-
T Consensus 222 ~~l~vWD~~~r~~~Q~idLg~~--------g~~pLEvRflH~P~~~~gFvg~aLss~i~~~~k~~~g~W~a~k------- 286 (461)
T PF05694_consen 222 HSLHVWDWSTRKLLQTIDLGEE--------GQMPLEVRFLHDPDANYGFVGCALSSSIWRFYKDDDGEWAAEK------- 286 (461)
T ss_dssp -EEEEEETTTTEEEEEEES-TT--------EEEEEEEEE-SSTT--EEEEEEE--EEEEEEEE-ETTEEEEEE-------
T ss_pred CeEEEEECCCCcEeeEEecCCC--------CCceEEEEecCCCCccceEEEEeccceEEEEEEcCCCCeeeeE-------
Confidence 3899999999999999999875 112333332 44566777766666666666664455577532
Q ss_pred CcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccccccCccccceecCCCceEEEEeCCCCceEEEecchh
Q psy11781 1029 TQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDL 1108 (1228)
Q Consensus 1029 ~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~~~di~ls~d~r~LYf~plss~~ly~V~T~~L 1108 (1228)
.+.|..++.+ | =.|-+... +|- ..+++=.||.||.|+|+||++----..+-.-+-+.-
T Consensus 287 ---Vi~ip~~~v~---~---~~lp~ml~--------~~~-----~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYDISDP 344 (461)
T PF05694_consen 287 ---VIDIPAKKVE---G---WILPEMLK--------PFG-----AVPPLITDILISLDDRFLYVSNWLHGDVRQYDISDP 344 (461)
T ss_dssp ---EEEE--EE-----S---S---GGGG--------GG------EE------EEE-TTS-EEEEEETTTTEEEEEE-SST
T ss_pred ---EEECCCcccC---c---cccccccc--------ccc-----cCCCceEeEEEccCCCEEEEEcccCCcEEEEecCCC
Confidence 3444444332 1 00100000 000 011334579999999999999877665544443344
Q ss_pred cCCc
Q psy11781 1109 RNSS 1112 (1228)
Q Consensus 1109 r~~~ 1112 (1228)
.++.
T Consensus 345 ~~Pk 348 (461)
T PF05694_consen 345 FNPK 348 (461)
T ss_dssp TS-E
T ss_pred CCCc
Confidence 4443
No 41
>KOG1539|consensus
Probab=91.88 E-value=2.7 Score=53.51 Aligned_cols=194 Identities=15% Similarity=0.204 Sum_probs=125.0
Q ss_pred ceEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEE
Q psy11781 467 LISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFA 546 (1228)
Q Consensus 467 lvsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~a 546 (1228)
.+++..+.++.||..-++=...- -|=+|++..|-..++|- ++- ..+.-+.++++|- -+-.
T Consensus 448 ~~~~~av~vs~CGNF~~IG~S~G------------~Id~fNmQSGi~r~sf~--~~~-ah~~~V~gla~D~-----~n~~ 507 (910)
T KOG1539|consen 448 DINATAVCVSFCGNFVFIGYSKG------------TIDRFNMQSGIHRKSFG--DSP-AHKGEVTGLAVDG-----TNRL 507 (910)
T ss_pred CcceEEEEEeccCceEEEeccCC------------eEEEEEcccCeeecccc--cCc-cccCceeEEEecC-----CCce
Confidence 47888999999999877744332 27789999988877664 111 1235578899983 3345
Q ss_pred EEecCCCCeEEEEEcCCCe---EEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCcccccc----
Q psy11781 547 YVNDVFRYGLIVYDFFKNT---SYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHY---- 619 (1228)
Q Consensus 547 YItDs~~~gLIVyDl~~g~---swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~---- 619 (1228)
-||=...+-|-.||+++.. +||+.+. +.+....+..+++++|+..-.. .+||.
T Consensus 508 ~vsa~~~Gilkfw~f~~k~l~~~l~l~~~------------~~~iv~hr~s~l~a~~~ddf~I--------~vvD~~t~k 567 (910)
T KOG1539|consen 508 LVSAGADGILKFWDFKKKVLKKSLRLGSS------------ITGIVYHRVSDLLAIALDDFSI--------RVVDVVTRK 567 (910)
T ss_pred EEEccCcceEEEEecCCcceeeeeccCCC------------cceeeeeehhhhhhhhcCceeE--------EEEEchhhh
Confidence 5665556667788888765 4555432 2333345556677677654211 12222
Q ss_pred ccccccCcccc---ccccCCCceEEEeeC-CCCceEEEechhhcCCCCCCCccccccceEeeccccccccccceEEccCC
Q psy11781 620 NVHHYNGTNVD---KTIRDDQRYMYFHSL-SSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNG 695 (1228)
Q Consensus 620 ~~~~~~~~~~~---i~is~d~r~LYf~~l-ss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G 695 (1228)
.++...+.+.. .+.|+|||||--.+| ++-+.|-|+|..|-|.-+ . +. .+..+.+..+|
T Consensus 568 vvR~f~gh~nritd~~FS~DgrWlisasmD~tIr~wDlpt~~lID~~~--v------------d~----~~~sls~SPng 629 (910)
T KOG1539|consen 568 VVREFWGHGNRITDMTFSPDGRWLISASMDSTIRTWDLPTGTLIDGLL--V------------DS----PCTSLSFSPNG 629 (910)
T ss_pred hhHHhhccccceeeeEeCCCCcEEEEeecCCcEEEEeccCcceeeeEe--c------------CC----cceeeEECCCC
Confidence 23333334333 579999999999999 778899999988766542 1 11 25667777776
Q ss_pred c-EEEEcCCCCeEEEEeCCCCccCC
Q psy11781 696 V-MFYNLVTKHSVGCWNTKTKVYLP 719 (1228)
Q Consensus 696 ~-LY~t~~~~~aI~~wn~~~~~~~~ 719 (1228)
- |--+-+.+++|+.|.-. .-|..
T Consensus 630 D~LAT~Hvd~~gIylWsNk-slF~~ 653 (910)
T KOG1539|consen 630 DFLATVHVDQNGIYLWSNK-SLFKS 653 (910)
T ss_pred CEEEEEEecCceEEEEEch-hHhee
Confidence 4 44456788999999876 55543
No 42
>KOG1539|consensus
Probab=91.36 E-value=5 Score=51.34 Aligned_cols=223 Identities=15% Similarity=0.171 Sum_probs=136.0
Q ss_pred EEeCCeEEEEeccCCCCCCcEEEEEECCCCCCCCccccCCCCCcCCCCCCCcceEEEEEEEeCCCcEEEEeCCCCCcccc
Q psy11781 865 GIWRSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVDKCDRLWIMDTGVTNILSS 944 (1228)
Q Consensus 865 ~v~~gRiFvt~PR~~~gvP~TLa~v~~~~~~~sP~l~PYP~~~wn~~~~c~~lvSV~~v~iD~~grLWVLDtG~~~~~~~ 944 (1228)
+.+.|..+.+.=+|++ .|+++...++. ..++ --.++..+.++.||..=++=....
T Consensus 416 ~~h~~~~~~~tW~~~n---~~~G~~~L~~~----~~~~-------------~~~~~~av~vs~CGNF~~IG~S~G----- 470 (910)
T KOG1539|consen 416 TAHKGKRSAYTWNFRN---KTSGRHVLDPK----RFKK-------------DDINATAVCVSFCGNFVFIGYSKG----- 470 (910)
T ss_pred EEecCcceEEEEeccC---cccccEEecCc----cccc-------------cCcceEEEEEeccCceEEEeccCC-----
Confidence 4577888888855543 35555543211 1222 236788889999998877644332
Q ss_pred ccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCe---EEEeec
Q psy11781 945 IQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNT---SYRLTH 1021 (1228)
Q Consensus 945 ~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~---swRv~h 1021 (1228)
-|=+|++..|-..++|- +.-+ .+.-+..+++|-. +..-||=...+-|-.||++++. +||+-+
T Consensus 471 -------~Id~fNmQSGi~r~sf~--~~~a-h~~~V~gla~D~~-----n~~~vsa~~~Gilkfw~f~~k~l~~~l~l~~ 535 (910)
T KOG1539|consen 471 -------TIDRFNMQSGIHRKSFG--DSPA-HKGEVTGLAVDGT-----NRLLVSAGADGILKFWDFKKKVLKKSLRLGS 535 (910)
T ss_pred -------eEEEEEcccCeeecccc--cCcc-ccCceeEEEecCC-----CceEEEccCcceEEEEecCCcceeeeeccCC
Confidence 57889999998877664 1111 2345778888843 3445665555667788888765 556544
Q ss_pred CCcCcCCCcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccc----------cccCccccceecCCCceEE
Q psy11781 1022 PYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVH----------HYNGTNVDKTIRDDQRYMY 1091 (1228)
Q Consensus 1022 ~s~~pdP~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~----------~~~~~~~di~ls~d~r~LY 1091 (1228)
. +.+.......+++++++..-.- ..|... |.+++. |.+.|||||||-
T Consensus 536 ~------------~~~iv~hr~s~l~a~~~ddf~I----------~vvD~~t~kvvR~f~gh~nrit-d~~FS~DgrWli 592 (910)
T KOG1539|consen 536 S------------ITGIVYHRVSDLLAIALDDFSI----------RVVDVVTRKVVREFWGHGNRIT-DMTFSPDGRWLI 592 (910)
T ss_pred C------------cceeeeeehhhhhhhhcCceeE----------EEEEchhhhhhHHhhcccccee-eeEeCCCCcEEE
Confidence 2 1222223334455554432100 011111 222221 267999999999
Q ss_pred EEeC-CCCceEEEecchhcCCcccCCccccccceEEeccccCCCCcccEEEcCCC-cEEEEecCCCeEEEEeCCCCcccc
Q psy11781 1092 FHSL-SSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNG-VMFYNLVTKHSVGCWNTKTKVYLP 1169 (1228)
Q Consensus 1092 f~pl-ss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G-~LY~t~~~~~aI~~wn~~~~~~~~ 1169 (1228)
-.+| ++-+.|-|+|..|-|.-. -+. .+..+.+.++| .|-.+-+.+++|+-|.-. .-|..
T Consensus 593 sasmD~tIr~wDlpt~~lID~~~--------------vd~----~~~sls~SPngD~LAT~Hvd~~gIylWsNk-slF~~ 653 (910)
T KOG1539|consen 593 SASMDSTIRTWDLPTGTLIDGLL--------------VDS----PCTSLSFSPNGDFLATVHVDQNGIYLWSNK-SLFKS 653 (910)
T ss_pred EeecCCcEEEEeccCcceeeeEe--------------cCC----cceeeEECCCCCEEEEEEecCceEEEEEch-hHhee
Confidence 9999 888999999988877542 012 26677788888 455557788999999776 44543
No 43
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=91.30 E-value=32 Score=40.51 Aligned_cols=131 Identities=14% Similarity=0.092 Sum_probs=70.7
Q ss_pred cccCCCceEEEeeCCCCceEEEechhhcCCCCCCCcc-ccccceEeeccccccccccceEEcc----CCcEEEEcCCC--
Q psy11781 632 TIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSS-DIDEYFHYLGSRFRNTQASASAINS----NGVMFYNLVTK-- 704 (1228)
Q Consensus 632 ~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~-~~~~~v~~lG~~g~~~qs~G~a~D~----~G~LY~t~~~~-- 704 (1228)
+..||+ .||..-..+ +++.+. .++.. ... .....+...+++| .-|+++|. +|.||+.-...
T Consensus 8 a~~pdG-~l~v~e~~G-~i~~~~----~~g~~--~~~v~~~~~v~~~~~~g----llgia~~p~f~~n~~lYv~~t~~~~ 75 (331)
T PF07995_consen 8 AFLPDG-RLLVAERSG-RIWVVD----KDGSL--KTPVADLPEVFADGERG----LLGIAFHPDFASNGYLYVYYTNADE 75 (331)
T ss_dssp EEETTS-CEEEEETTT-EEEEEE----TTTEE--CEEEEE-TTTBTSTTBS----EEEEEE-TTCCCC-EEEEEEEEE-T
T ss_pred EEeCCC-cEEEEeCCc-eEEEEe----CCCcC--cceecccccccccccCC----cccceeccccCCCCEEEEEEEcccC
Confidence 344565 467777755 688887 12221 000 0012344445555 57889986 59999987754
Q ss_pred ------CeEEEEeCCCCc-cCCCCeeEEEec-C---CceecCCceEEcCCCcEEEEecCccccccCCCCCccccEEEEEE
Q psy11781 705 ------HSVGCWNTKTKV-YLPQTQDIVQTS-R---DILNFPNDLKIDPNDNIWILSNKLHQYLYGFLDFNVYNYRILVA 773 (1228)
Q Consensus 705 ------~aI~~wn~~~~~-~~~~n~~iV~~~-d---~~L~wPd~l~Id~dg~LyVlsnrl~~~~~~~~d~~e~nfrI~r~ 773 (1228)
++|.+|.-. .. -......+|... . ..-.+..+|.+++||.|||....-.. .....+.+...=.|+|+
T Consensus 76 ~~~~~~~~v~r~~~~-~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgpDG~LYvs~G~~~~-~~~~~~~~~~~G~ilri 153 (331)
T PF07995_consen 76 DGGDNDNRVVRFTLS-DGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGPDGKLYVSVGDGGN-DDNAQDPNSLRGKILRI 153 (331)
T ss_dssp SSSSEEEEEEEEEEE-TTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-TTSEEEEEEB-TTT-GGGGCSTTSSTTEEEEE
T ss_pred CCCCcceeeEEEecc-CCccccccceEEEEEeCCCCCCCCCCccccCCCCCcEEEEeCCCCC-cccccccccccceEEEe
Confidence 789988775 22 111122233211 1 23456678999999999999854433 11223444555678998
Q ss_pred ecc
Q psy11781 774 NSN 776 (1228)
Q Consensus 774 ~v~ 776 (1228)
+.+
T Consensus 154 ~~d 156 (331)
T PF07995_consen 154 DPD 156 (331)
T ss_dssp ETT
T ss_pred ccc
Confidence 743
No 44
>KOG4499|consensus
Probab=91.17 E-value=0.73 Score=51.14 Aligned_cols=64 Identities=13% Similarity=0.181 Sum_probs=47.1
Q ss_pred CcccEEEc-CCCcEEEEecCCCeEEEEe--CCCCccccCCeeEEEe----cCCceecCCceEEeCCCcEEEEe
Q psy11781 1135 QASASAIN-SNGVMFYNLVTKHSVGCWN--TKTKVYLPQTQDIVQT----SRDILNFPNDLKIDPNDNIWILS 1200 (1228)
Q Consensus 1135 qs~g~~~D-~~G~LY~t~~~~~aI~~wn--~~~~~~~~~n~~~v~~----~~~~L~wPd~l~id~dg~Lyvls 1200 (1228)
-+.|++-| +.-.+||.+..+..|-.|| -.|+... |-+.|.. .+..=+.||||.||.+|+|||.+
T Consensus 159 IsNgl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~s--nr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~ 229 (310)
T KOG4499|consen 159 ISNGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLS--NRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVAT 229 (310)
T ss_pred CCccccccccCcEEEEEccCceEEeeeecCCCccccc--CcceeEEeccCCCcCCCCCCcceEccCCcEEEEE
Confidence 57899999 6679999999999998888 3324332 3333331 12234589999999999999987
No 45
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=90.57 E-value=2.5 Score=51.05 Aligned_cols=118 Identities=14% Similarity=0.111 Sum_probs=59.7
Q ss_pred CCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeec--CCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcC
Q psy11781 500 PPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVE--DCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPE 577 (1228)
Q Consensus 500 ~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~--~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pd 577 (1228)
.-+|++||+++.+++++++|+++- -+.+.+|. +-...+.|+.=+-...|..|=...+..|...-
T Consensus 221 G~~l~vWD~~~r~~~Q~idLg~~g--------~~pLEvRflH~P~~~~gFvg~aLss~i~~~~k~~~g~W~a~k------ 286 (461)
T PF05694_consen 221 GHSLHVWDWSTRKLLQTIDLGEEG--------QMPLEVRFLHDPDANYGFVGCALSSSIWRFYKDDDGEWAAEK------ 286 (461)
T ss_dssp --EEEEEETTTTEEEEEEES-TTE--------EEEEEEEE-SSTT--EEEEEEE--EEEEEEEE-ETTEEEEEE------
T ss_pred cCeEEEEECCCCcEeeEEecCCCC--------CceEEEEecCCCCccceEEEEeccceEEEEEEcCCCCeeeeE------
Confidence 467999999999999999998641 12334443 44567888877777777777665556687651
Q ss_pred CCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccCccccccccCCCceEEEeeCCCCceEEEe
Q psy11781 578 PTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVS 654 (1228)
Q Consensus 578 P~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~V~ 654 (1228)
.+.|..++..- =.|-+.-+ +...+-++=.||.||.|+|+||++--.-..+-.-+
T Consensus 287 ----Vi~ip~~~v~~------~~lp~ml~-------------~~~~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYD 340 (461)
T PF05694_consen 287 ----VIDIPAKKVEG------WILPEMLK-------------PFGAVPPLITDILISLDDRFLYVSNWLHGDVRQYD 340 (461)
T ss_dssp ----EEEE--EE--S------S---GGGG-------------GG-EE------EEE-TTS-EEEEEETTTTEEEEEE
T ss_pred ----EEECCCcccCc------cccccccc-------------ccccCCCceEeEEEccCCCEEEEEcccCCcEEEEe
Confidence 33444333220 01100000 00000112235889999999999987766544444
No 46
>KOG1214|consensus
Probab=90.47 E-value=0.93 Score=57.25 Aligned_cols=138 Identities=12% Similarity=0.156 Sum_probs=93.4
Q ss_pred cccCCCcccc-ccccccccCccccccccCCCceEEEeeCCCCceEEEechhhcCCCCCCCccccccceEeeccccccccc
Q psy11781 608 KYKRHPYEYY-HYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQA 686 (1228)
Q Consensus 608 ~~~~~~~~~~-~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs 686 (1228)
+++|...+++ +.+.+-+.+ |..|...|++||+-|+++ +|.++-|. +. + -...| .++ .+.+
T Consensus 1010 ~~~K~~ak~~l~~p~~IiVG----idfDC~e~mvyWtDv~g~---SI~rasL~-G~----E--p~ti~----n~~-L~SP 1070 (1289)
T KOG1214|consen 1010 RLQKDAAKTLLSLPGSIIVG----IDFDCRERMVYWTDVAGR---SISRASLE-GA----E--PETIV----NSG-LISP 1070 (1289)
T ss_pred hhchhhhhceEecccceeee----eecccccceEEEeecCCC---cccccccc-CC----C--Cceee----ccc-CCCc
Confidence 4566655554 544433322 456778899999999998 66766664 22 1 11111 222 3568
Q ss_pred cceEEc-cCCcEEEEcCCCCeEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcC-CCcEEEEe-cCccccccCCCCC
Q psy11781 687 SASAIN-SNGVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDP-NDNIWILS-NKLHQYLYGFLDF 763 (1228)
Q Consensus 687 ~G~a~D-~~G~LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~-dg~LyVls-nrl~~~~~~~~d~ 763 (1228)
.|+|+| ..-|+|++|...+.|..-.-+ +.+. +++.. +.|+=|-++.+|. .|+||-.. ||=...+ ++.+.
T Consensus 1071 EGiAVDh~~Rn~ywtDS~lD~IevA~Ld-G~~r----kvLf~--tdLVNPR~iv~D~~rgnLYwtDWnRenPkI-ets~m 1142 (1289)
T KOG1214|consen 1071 EGIAVDHIRRNMYWTDSVLDKIEVALLD-GSER----KVLFY--TDLVNPRAIVVDPIRGNLYWTDWNRENPKI-ETSSM 1142 (1289)
T ss_pred cceeeeeccceeeeeccccchhheeecC-Ccee----eEEEe--ecccCcceEEeecccCceeeccccccCCcc-eeecc
Confidence 999999 677999999999998876666 4432 34433 4588899999996 66898887 6666666 56677
Q ss_pred ccccEEEEE
Q psy11781 764 NVYNYRILV 772 (1228)
Q Consensus 764 ~e~nfrI~r 772 (1228)
+..|=|||-
T Consensus 1143 DG~NrRili 1151 (1289)
T KOG1214|consen 1143 DGENRRILI 1151 (1289)
T ss_pred CCccceEEe
Confidence 788888875
No 47
>KOG1520|consensus
Probab=89.59 E-value=3 Score=49.45 Aligned_cols=70 Identities=11% Similarity=0.187 Sum_probs=49.9
Q ss_pred CCCCCCcceEEEEEEEeCCC-cEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcc-cccCCCceeEEEEe
Q psy11781 460 DDSNCNSLISVFRMSVDKCD-RLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQ-VFEGSLFSNIVTEV 537 (1228)
Q Consensus 460 ~~~~c~~lvsV~~v~iD~~g-RLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v-~~~~S~l~di~VD~ 537 (1228)
..+-|- .+.+++.|..+ -|.|.|.=. | |++++...++.. .+..++ -++-.++|++.||
T Consensus 110 ~e~~CG---RPLGl~f~~~ggdL~VaDAYl----G---------L~~V~p~g~~a~---~l~~~~~G~~~kf~N~ldI~- 169 (376)
T KOG1520|consen 110 TEPLCG---RPLGIRFDKKGGDLYVADAYL----G---------LLKVGPEGGLAE---LLADEAEGKPFKFLNDLDID- 169 (376)
T ss_pred cccccC---CcceEEeccCCCeEEEEecce----e---------eEEECCCCCcce---eccccccCeeeeecCceeEc-
Confidence 344464 57889999877 999999732 2 888888776622 222222 2345789999999
Q ss_pred ecCCCCeEEEEecCCC
Q psy11781 538 VEDCDHVFAYVNDVFR 553 (1228)
Q Consensus 538 ~~~c~~~~aYItDs~~ 553 (1228)
.++.+|+||++.
T Consensus 170 ----~~g~vyFTDSSs 181 (376)
T KOG1520|consen 170 ----PEGVVYFTDSSS 181 (376)
T ss_pred ----CCCeEEEecccc
Confidence 589999999983
No 48
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=89.49 E-value=0.52 Score=35.27 Aligned_cols=26 Identities=19% Similarity=0.349 Sum_probs=23.8
Q ss_pred cccceEEccCCcEEEEcCCCCeEEEE
Q psy11781 685 QASASAINSNGVMFYNLVTKHSVGCW 710 (1228)
Q Consensus 685 qs~G~a~D~~G~LY~t~~~~~aI~~w 710 (1228)
++.|+++|++|+||++|.+++.|.++
T Consensus 3 ~P~gvav~~~g~i~VaD~~n~rV~vf 28 (28)
T PF01436_consen 3 YPHGVAVDSDGNIYVADSGNHRVQVF 28 (28)
T ss_dssp SEEEEEEETTSEEEEEECCCTEEEEE
T ss_pred CCcEEEEeCCCCEEEEECCCCEEEEC
Confidence 57899999999999999999999875
No 49
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=89.13 E-value=0.93 Score=37.15 Aligned_cols=41 Identities=17% Similarity=0.268 Sum_probs=34.2
Q ss_pred CcEEEEecCCC-eEEEEeCCCCccccCCeeEEEecCCceecCCceEEeC
Q psy11781 1145 GVMFYNLVTKH-SVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDP 1192 (1228)
Q Consensus 1145 G~LY~t~~~~~-aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~l~id~ 1192 (1228)
|.||.++.... .|.+-+.+ +. +.+++.++ .|.+|.|++||.
T Consensus 1 ~~iYWtD~~~~~~I~~a~~d-Gs----~~~~vi~~--~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLD-GS----NRRTVISD--DLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETT-ST----SEEEEEES--STSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECC-CC----CeEEEEEC--CCCCcCEEEECC
Confidence 57999999999 99999988 53 56777764 499999999984
No 50
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=88.08 E-value=25 Score=43.35 Aligned_cols=73 Identities=12% Similarity=0.214 Sum_probs=48.3
Q ss_pred CcceEEEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCccc-CCCcceeeEEEeec--CC
Q psy11781 915 NSLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVF-EGSLFSNIVTEVVE--DC 991 (1228)
Q Consensus 915 ~~lvSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~-~~S~l~di~VD~~~--~c 991 (1228)
++|...-+|....+|+|||..-+.. +|++++..+++......+|.-+.. ....|.+|+++... .-
T Consensus 27 ~GL~~Pw~maflPDG~llVtER~~G------------~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~ 94 (454)
T TIGR03606 27 SGLNKPWALLWGPDNQLWVTERATG------------KILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEK 94 (454)
T ss_pred CCCCCceEEEEcCCCeEEEEEecCC------------EEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccC
Confidence 5677888888888999999875321 577777766554334456543322 35778899998542 02
Q ss_pred CceEEEEE
Q psy11781 992 DHVFAYVN 999 (1228)
Q Consensus 992 ~~~~aYIt 999 (1228)
.+.++|++
T Consensus 95 ~n~~lYvs 102 (454)
T TIGR03606 95 GNPYVYIS 102 (454)
T ss_pred CCcEEEEE
Confidence 45788887
No 51
>KOG1520|consensus
Probab=87.42 E-value=4.3 Score=48.24 Aligned_cols=162 Identities=14% Similarity=0.169 Sum_probs=95.3
Q ss_pred ceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeeecCCcceeecCcccCCcc
Q psy11781 530 FSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKY 609 (1228)
Q Consensus 530 l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~ 609 (1228)
--.|+.|. ..+-+||+|+ -.||.+.+...+++-.+.. .++|++|.+.+++ .+
T Consensus 117 PLGl~f~~----~ggdL~VaDA-YlGL~~V~p~g~~a~~l~~------------~~~G~~~kf~N~l---dI-------- 168 (376)
T KOG1520|consen 117 PLGIRFDK----KGGDLYVADA-YLGLLKVGPEGGLAELLAD------------EAEGKPFKFLNDL---DI-------- 168 (376)
T ss_pred cceEEecc----CCCeEEEEec-ceeeEEECCCCCcceeccc------------cccCeeeeecCce---eE--------
Confidence 44677773 3458999995 5799999999998555442 2688888875532 23
Q ss_pred cCCCccccccccccccCccccccccCCCceEEEeeCCCCceEE-EechhhcCCC---CCCCccccccceEeecccccccc
Q psy11781 610 KRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYY-VSTTDLRNSS---RYVNSSDIDEYFHYLGSRFRNTQ 685 (1228)
Q Consensus 610 ~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~-V~T~~Lr~~~---~~~~~~~~~~~v~~lG~~g~~~q 685 (1228)
++ +..+||+.-|++.-.+ +--++|.... +..-+ -..+..+.+=+. ..-
T Consensus 169 ------------------------~~-~g~vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD-~~tK~~~VLld~--L~F 220 (376)
T KOG1520|consen 169 ------------------------DP-EGVVYFTDSSSKYDRRDFVFAALEGDPTGRLFRYD-PSTKVTKVLLDG--LYF 220 (376)
T ss_pred ------------------------cC-CCeEEEeccccccchhheEEeeecCCCccceEEec-Ccccchhhhhhc--ccc
Confidence 23 3468888777732110 1111121111 00000 001122222211 134
Q ss_pred ccceEEccC-CcEEEEcCCCCeEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcCCCcEEEEec
Q psy11781 686 ASASAINSN-GVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPNDNIWILSN 751 (1228)
Q Consensus 686 s~G~a~D~~-G~LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~dg~LyVlsn 751 (1228)
+.|+++..+ ..+-+++.....|.++=.. ++- ..+.++.++.-+ -+||.+..+++|..||...
T Consensus 221 ~NGlaLS~d~sfvl~~Et~~~ri~rywi~-g~k-~gt~EvFa~~LP--G~PDNIR~~~~G~fWVal~ 283 (376)
T KOG1520|consen 221 PNGLALSPDGSFVLVAETTTARIKRYWIK-GPK-AGTSEVFAEGLP--GYPDNIRRDSTGHFWVALH 283 (376)
T ss_pred cccccCCCCCCEEEEEeeccceeeeeEec-CCc-cCchhhHhhcCC--CCCcceeECCCCCEEEEEe
Confidence 788888844 4677888888888887776 442 233355554222 5899999999999999773
No 52
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=87.39 E-value=0.91 Score=33.95 Aligned_cols=26 Identities=19% Similarity=0.349 Sum_probs=23.2
Q ss_pred CcccEEEcCCCcEEEEecCCCeEEEE
Q psy11781 1135 QASASAINSNGVMFYNLVTKHSVGCW 1160 (1228)
Q Consensus 1135 qs~g~~~D~~G~LY~t~~~~~aI~~w 1160 (1228)
++.|+++|++|+||++|.++++|.++
T Consensus 3 ~P~gvav~~~g~i~VaD~~n~rV~vf 28 (28)
T PF01436_consen 3 YPHGVAVDSDGNIYVADSGNHRVQVF 28 (28)
T ss_dssp SEEEEEEETTSEEEEEECCCTEEEEE
T ss_pred CCcEEEEeCCCCEEEEECCCCEEEEC
Confidence 46799999999999999999999764
No 53
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=86.87 E-value=20 Score=44.12 Aligned_cols=63 Identities=13% Similarity=0.202 Sum_probs=40.0
Q ss_pred ccceEEccCCcEEEEcCCC--------------------------------CeEEEEeCCCCccCCCCe-------eEEE
Q psy11781 686 ASASAINSNGVMFYNLVTK--------------------------------HSVGCWNTKTKVYLPQTQ-------DIVQ 726 (1228)
Q Consensus 686 s~G~a~D~~G~LY~t~~~~--------------------------------~aI~~wn~~~~~~~~~n~-------~iV~ 726 (1228)
...++++.+|.||++.-.. .+|.|.|++ +.--+.|- .|.+
T Consensus 148 GgrI~FgPDG~LYVs~GD~g~~~~~n~~~~~~aQ~~~~~~~~~~~d~~~~~GkILRin~D-GsiP~dNPf~~g~~~eIyA 226 (454)
T TIGR03606 148 GGRLVFGPDGKIYYTIGEQGRNQGANFFLPNQAQHTPTQQELNGKDYHAYMGKVLRLNLD-GSIPKDNPSINGVVSHIFT 226 (454)
T ss_pred CceEEECCCCcEEEEECCCCCCCcccccCcchhccccccccccccCcccCceEEEEEcCC-CCCCCCCCccCCCcceEEE
Confidence 5678899999999974332 267888877 43211111 1222
Q ss_pred ecCCceecCCceEEcCCCcEEEEecC
Q psy11781 727 TSRDILNFPNDLKIDPNDNIWILSNK 752 (1228)
Q Consensus 727 ~~d~~L~wPd~l~Id~dg~LyVlsnr 752 (1228)
. .++=|-+|+++++|.||+..+.
T Consensus 227 ~---G~RNp~Gla~dp~G~Lw~~e~G 249 (454)
T TIGR03606 227 Y---GHRNPQGLAFTPDGTLYASEQG 249 (454)
T ss_pred E---eccccceeEECCCCCEEEEecC
Confidence 1 2334778999999999997744
No 54
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=86.38 E-value=1e+02 Score=38.85 Aligned_cols=77 Identities=12% Similarity=0.163 Sum_probs=52.2
Q ss_pred CEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCC-eEEEee-cCCcCcCC
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKN-TSYRLT-HPYMYPEP 1028 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g-~swRv~-h~s~~pdP 1028 (1228)
..|.++|++|++++=+++.+... ..+. ++. .++.+|+.+ ..+-|.+||.++| .-|++. ...+.+-|
T Consensus 441 g~l~AiD~~tGk~~W~~~~~~p~--~~~~---l~t------~g~lvf~g~-~~G~l~a~D~~TGe~lw~~~~g~~~~a~P 508 (527)
T TIGR03075 441 GSLIAWDPITGKIVWEHKEDFPL--WGGV---LAT------AGDLVFYGT-LEGYFKAFDAKTGEELWKFKTGSGIVGPP 508 (527)
T ss_pred eeEEEEeCCCCceeeEecCCCCC--CCcc---eEE------CCcEEEEEC-CCCeEEEEECCCCCEeEEEeCCCCceecC
Confidence 57999999999999877764321 1121 222 245566665 3568999999999 568876 34466655
Q ss_pred CcceeEEcCEEEE
Q psy11781 1029 TQSTYILDNLKFR 1041 (1228)
Q Consensus 1029 ~~~~~~i~G~~f~ 1041 (1228)
..|.++|+.+-
T Consensus 509 --~ty~~~G~qYv 519 (527)
T TIGR03075 509 --VTYEQDGKQYV 519 (527)
T ss_pred --EEEEeCCEEEE
Confidence 45668898774
No 55
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=85.52 E-value=1.8 Score=35.49 Aligned_cols=41 Identities=17% Similarity=0.277 Sum_probs=33.2
Q ss_pred CcEEEEcCCCC-eEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcC
Q psy11781 695 GVMFYNLVTKH-SVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDP 742 (1228)
Q Consensus 695 G~LY~t~~~~~-aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~ 742 (1228)
|.||.++...+ .|.+-+.+ + .+.++|.++ .|.+|.|++||.
T Consensus 1 ~~iYWtD~~~~~~I~~a~~d-G----s~~~~vi~~--~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLD-G----SNRRTVISD--DLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETT-S----TSEEEEEES--STSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECC-C----CCeEEEEEC--CCCCcCEEEECC
Confidence 57999999999 99998888 3 345666653 499999999974
No 56
>KOG1214|consensus
Probab=85.49 E-value=3.3 Score=52.71 Aligned_cols=138 Identities=12% Similarity=0.156 Sum_probs=92.9
Q ss_pred cccCCCcccc-ccccccccCccccceecCCCceEEEEeCCCCceEEEecchhcCCcccCCccccccceEEeccccCCCCc
Q psy11781 1058 KYKRHPYEYY-HYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFRNTQA 1136 (1228)
Q Consensus 1058 ~~~~~p~~~~-~~~~~~~~~~~~di~ls~d~r~LYf~plss~~ly~V~T~~Lr~~~~~~~~~~~~~~v~~lG~rg~~sqs 1136 (1228)
+++|.-..++ |.|.+ +=+.|..+.-.|++||+-|+++ +|.++-|.-.. .. . +-.++. +..
T Consensus 1010 ~~~K~~ak~~l~~p~~----IiVGidfDC~e~mvyWtDv~g~---SI~rasL~G~E----p~------t-i~n~~L-~SP 1070 (1289)
T KOG1214|consen 1010 RLQKDAAKTLLSLPGS----IIVGIDFDCRERMVYWTDVAGR---SISRASLEGAE----PE------T-IVNSGL-ISP 1070 (1289)
T ss_pred hhchhhhhceEecccc----eeeeeecccccceEEEeecCCC---ccccccccCCC----Cc------e-eecccC-CCc
Confidence 4445445555 55532 2334778889999999999999 67766665211 11 1 111221 237
Q ss_pred ccEEEc-CCCcEEEEecCCCeEEEEeCCCCccccCCeeEEEecCCceecCCceEEeC-CCcEEEEe-CCchhhhcCCCCC
Q psy11781 1137 SASAIN-SNGVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDP-NDNIWILS-NKLHQYLYGFLDF 1213 (1228)
Q Consensus 1137 ~g~~~D-~~G~LY~t~~~~~aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~l~id~-dg~Lyvls-n~l~~~~~~~~d~ 1213 (1228)
.|+++| -.-|||+++.....|..-..+ +.++ ++++. +.|+=|-++.+|. .|+||-+. ||=+.++ ++.+.
T Consensus 1071 EGiAVDh~~Rn~ywtDS~lD~IevA~Ld-G~~r----kvLf~--tdLVNPR~iv~D~~rgnLYwtDWnRenPkI-ets~m 1142 (1289)
T KOG1214|consen 1071 EGIAVDHIRRNMYWTDSVLDKIEVALLD-GSER----KVLFY--TDLVNPRAIVVDPIRGNLYWTDWNRENPKI-ETSSM 1142 (1289)
T ss_pred cceeeeeccceeeeeccccchhheeecC-Ccee----eEEEe--ecccCcceEEeecccCceeeccccccCCcc-eeecc
Confidence 899999 556999999999999877766 6544 45554 4589999999997 56898876 6655555 44567
Q ss_pred CcccEEEEE
Q psy11781 1214 NVYNYRILV 1222 (1228)
Q Consensus 1214 ~e~nfrI~r 1222 (1228)
+..|=||+-
T Consensus 1143 DG~NrRili 1151 (1289)
T KOG1214|consen 1143 DGENRRILI 1151 (1289)
T ss_pred CCccceEEe
Confidence 777778774
No 57
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=85.39 E-value=63 Score=37.15 Aligned_cols=46 Identities=17% Similarity=0.181 Sum_probs=33.9
Q ss_pred eEEEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCc
Q psy11781 918 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQ 973 (1228)
Q Consensus 918 vSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~ 973 (1228)
.-.++...+.+|+||.= ||.- +.-+|..+|++|++++++..||+..
T Consensus 45 aFTQGL~~~~~g~LyES-TG~y---------G~S~l~~~d~~tg~~~~~~~l~~~~ 90 (264)
T PF05096_consen 45 AFTQGLEFLDDGTLYES-TGLY---------GQSSLRKVDLETGKVLQSVPLPPRY 90 (264)
T ss_dssp -EEEEEEEEETTEEEEE-ECST---------TEEEEEEEETTTSSEEEEEE-TTT-
T ss_pred ccCccEEecCCCEEEEe-CCCC---------CcEEEEEEECCCCcEEEEEECCccc
Confidence 44678888778888753 4443 3458999999999999999998864
No 58
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=84.28 E-value=66 Score=34.80 Aligned_cols=67 Identities=16% Similarity=0.194 Sum_probs=47.6
Q ss_pred CCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeE
Q psy11781 477 KCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGL 556 (1228)
Q Consensus 477 ~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gL 556 (1228)
..++|++.+ ...+|+++|.+|++++-++.++..+... ..++ ++.+|+.... +.|
T Consensus 35 ~~~~v~~~~-------------~~~~l~~~d~~tG~~~W~~~~~~~~~~~------~~~~------~~~v~v~~~~-~~l 88 (238)
T PF13360_consen 35 DGGRVYVAS-------------GDGNLYALDAKTGKVLWRFDLPGPISGA------PVVD------GGRVYVGTSD-GSL 88 (238)
T ss_dssp ETTEEEEEE-------------TTSEEEEEETTTSEEEEEEECSSCGGSG------EEEE------TTEEEEEETT-SEE
T ss_pred eCCEEEEEc-------------CCCEEEEEECCCCCEEEEeeccccccce------eeec------ccccccccce-eee
Confidence 466777774 2356999999999999999996654322 3444 5677877743 389
Q ss_pred EEEEcCCCe-EEEE
Q psy11781 557 IVYDFFKNT-SYRL 569 (1228)
Q Consensus 557 IVyDl~~g~-swRv 569 (1228)
+.+|..+|+ .|++
T Consensus 89 ~~~d~~tG~~~W~~ 102 (238)
T PF13360_consen 89 YALDAKTGKVLWSI 102 (238)
T ss_dssp EEEETTTSCEEEEE
T ss_pred EecccCCcceeeee
Confidence 999966654 6774
No 59
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=83.84 E-value=8 Score=45.35 Aligned_cols=84 Identities=17% Similarity=0.271 Sum_probs=56.0
Q ss_pred cceEEEEEEEe-CCCcEEEE-eCCCCCcccCcCccCCCEEEEEECCCC-------cEEEEEECCCcccccCCCce-----
Q psy11781 466 SLISVFRMSVD-KCDRLWIM-DTGVTNILSSIQQLCPPKIMVFDLKTN-------TLIRKYILPTAQVFEGSLFS----- 531 (1228)
Q Consensus 466 ~lvsV~~v~iD-~~gRLWVL-DtG~~~~~~~~~~~~~PKLvvfDL~td-------~vi~~y~~p~~v~~~~S~l~----- 531 (1228)
.|-..-++..| ..+++|+| |.|. + ..+|.++.+++..+ ++...+.|... .+..+.
T Consensus 18 ~~GGlSgl~~~~~~~~~~avSD~g~-~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~---~G~~~~~~~~D 86 (326)
T PF13449_consen 18 PFGGLSGLDYDPDDGRFYAVSDRGP-N-------KGPPRFYTFRIDYDQGGIGGVTILDMIPLRDP---DGQPFPKNGLD 86 (326)
T ss_pred ccCcEeeEEEeCCCCEEEEEECCCC-C-------CCCCcEEEEEeeccCCCccceEeccceeccCC---CCCcCCcCCCC
Confidence 34444567777 78999977 8886 3 25788888877652 13333344221 122232
Q ss_pred --eEEEEeecCCCCeEEEEecCCC------CeEEEEEcCCCeE
Q psy11781 532 --NIVTEVVEDCDHVFAYVNDVFR------YGLIVYDFFKNTS 566 (1228)
Q Consensus 532 --di~VD~~~~c~~~~aYItDs~~------~gLIVyDl~~g~s 566 (1228)
.|++. .++.+||++.+. +.|+.||.. |+.
T Consensus 87 ~Egi~~~-----~~g~~~is~E~~~~~~~~p~I~~~~~~-G~~ 123 (326)
T PF13449_consen 87 PEGIAVP-----PDGSFWISSEGGRTGGIPPRIRRFDLD-GRV 123 (326)
T ss_pred hhHeEEe-----cCCCEEEEeCCccCCCCCCEEEEECCC-Ccc
Confidence 57784 578899999999 999999988 555
No 60
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=83.64 E-value=70 Score=34.59 Aligned_cols=67 Identities=16% Similarity=0.209 Sum_probs=47.6
Q ss_pred CCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeE
Q psy11781 927 KCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGL 1006 (1228)
Q Consensus 927 ~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gL 1006 (1228)
..++|++.+ ...+|++||.+|++++-++.++..+... ..++ ++.+|+..... .|
T Consensus 35 ~~~~v~~~~-------------~~~~l~~~d~~tG~~~W~~~~~~~~~~~------~~~~------~~~v~v~~~~~-~l 88 (238)
T PF13360_consen 35 DGGRVYVAS-------------GDGNLYALDAKTGKVLWRFDLPGPISGA------PVVD------GGRVYVGTSDG-SL 88 (238)
T ss_dssp ETTEEEEEE-------------TTSEEEEEETTTSEEEEEEECSSCGGSG------EEEE------TTEEEEEETTS-EE
T ss_pred eCCEEEEEc-------------CCCEEEEEECCCCCEEEEeeccccccce------eeec------cccccccccee-ee
Confidence 456777774 2348999999999999999996654322 3443 56778877443 99
Q ss_pred EEEECCCCe-EEEe
Q psy11781 1007 IVYDFFKNT-SYRL 1019 (1228)
Q Consensus 1007 IVydl~~g~-swRv 1019 (1228)
+.+|.++|+ .|++
T Consensus 89 ~~~d~~tG~~~W~~ 102 (238)
T PF13360_consen 89 YALDAKTGKVLWSI 102 (238)
T ss_dssp EEEETTTSCEEEEE
T ss_pred EecccCCcceeeee
Confidence 999966665 6774
No 61
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=81.93 E-value=93 Score=37.19 Aligned_cols=57 Identities=7% Similarity=0.069 Sum_probs=42.4
Q ss_pred EEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCe-EEEEcC
Q psy11781 502 KIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNT-SYRLTH 571 (1228)
Q Consensus 502 KLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~-swRv~~ 571 (1228)
+|++||.+|++++-++.++..+ ....++. ++.+|+... .+.|+.+|.++|+ .|+...
T Consensus 131 ~l~ald~~tG~~~W~~~~~~~~------~ssP~v~------~~~v~v~~~-~g~l~ald~~tG~~~W~~~~ 188 (394)
T PRK11138 131 QVYALNAEDGEVAWQTKVAGEA------LSRPVVS------DGLVLVHTS-NGMLQALNESDGAVKWTVNL 188 (394)
T ss_pred EEEEEECCCCCCcccccCCCce------ecCCEEE------CCEEEEECC-CCEEEEEEccCCCEeeeecC
Confidence 5999999999999988886542 2334454 467888763 4579999998887 588764
No 62
>PRK04043 tolB translocation protein TolB; Provisional
Probab=81.35 E-value=12 Score=45.57 Aligned_cols=97 Identities=11% Similarity=0.090 Sum_probs=57.0
Q ss_pred ecCCceEEEEeeC-CcceEEEEEccCCceeeeec-----CccccCCCccceeeccceeeeccCeeeeecCcCCCCccccC
Q psy11781 99 EDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTH-----PYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRH 172 (1228)
Q Consensus 99 ~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h-----~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~ 172 (1228)
.-.|+-+||.+|- +.+.|.++|+.+|+.=|+.. ..+.||-..+-|.....+-.+..|.+-+++-....|..+++
T Consensus 285 SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~~~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~L 364 (419)
T PRK04043 285 VEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGKNNSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRL 364 (419)
T ss_pred CCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCCcCceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEEC
Confidence 4458889999987 56799999999988877743 22444444443333221111111323333333334444444
Q ss_pred CcccccccccccCCCcccccccCCCceEEEEec
Q psy11781 173 PYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSL 205 (1228)
Q Consensus 173 ~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~l 205 (1228)
+.. |.+...+.|+||+.|+|..-
T Consensus 365 T~~----------~~~~~p~~SPDG~~I~f~~~ 387 (419)
T PRK04043 365 TAN----------GVNQFPRFSSDGGSIMFIKY 387 (419)
T ss_pred CCC----------CCcCCeEECCCCCEEEEEEc
Confidence 432 34445788999999999854
No 63
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=80.34 E-value=45 Score=40.29 Aligned_cols=85 Identities=14% Similarity=0.137 Sum_probs=45.0
Q ss_pred ccccceEEc-cCCcEEEEcCCCCeEEEEeCCCCccCCCCeeEEEec-CCcee-cCCceEEc--CC--CcEEEEe---cCc
Q psy11781 684 TQASASAIN-SNGVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTS-RDILN-FPNDLKID--PN--DNIWILS---NKL 753 (1228)
Q Consensus 684 ~qs~G~a~D-~~G~LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~~-d~~L~-wPd~l~Id--~d--g~LyVls---nrl 753 (1228)
+|+.|+++| ++|.||+++-. -+|++++.+ -. ....-.+|+.- .+.|. =.-||+|- .+ |+|.+.+ |.+
T Consensus 208 sQ~EGCVVDDe~g~LYvgEE~-~GIW~y~Ae-p~-~~~~~~~v~~~~g~~l~aDvEGlaly~~~~g~gYLivSsQG~~sf 284 (381)
T PF02333_consen 208 SQPEGCVVDDETGRLYVGEED-VGIWRYDAE-PE-GGNDRTLVASADGDGLVADVEGLALYYGSDGKGYLIVSSQGDNSF 284 (381)
T ss_dssp S-EEEEEEETTTTEEEEEETT-TEEEEEESS-CC-C-S--EEEEEBSSSSB-S-EEEEEEEE-CCC-EEEEEEEGGGTEE
T ss_pred CcceEEEEecccCCEEEecCc-cEEEEEecC-CC-CCCcceeeecccccccccCccceEEEecCCCCeEEEEEcCCCCeE
Confidence 699999999 89999999877 589999886 11 11111344322 22232 34577773 33 4565544 233
Q ss_pred cccccCCCCCccccEEEE
Q psy11781 754 HQYLYGFLDFNVYNYRIL 771 (1228)
Q Consensus 754 ~~~~~~~~d~~e~nfrI~ 771 (1228)
..|-...-+.-.-.|+|-
T Consensus 285 ~Vy~r~~~~~~~g~f~i~ 302 (381)
T PF02333_consen 285 AVYDREGPNAYVGSFRIG 302 (381)
T ss_dssp EEEESSTT--EEEEEEEE
T ss_pred EEEecCCCCcccceEEec
Confidence 334322223334456664
No 64
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=80.19 E-value=54 Score=38.61 Aligned_cols=69 Identities=19% Similarity=0.087 Sum_probs=41.3
Q ss_pred eEEEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcE-EEEEECCCCcccCCCcceeeEEEeecCCCceEE
Q psy11781 918 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTL-IRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFA 996 (1228)
Q Consensus 918 vSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~l-ir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~a 996 (1228)
..+-.|....+|+|+|.+. .. +|.+++ +.+.. -....+++-.......+..|++|..- -.++++
T Consensus 2 ~~P~~~a~~pdG~l~v~e~-~G------------~i~~~~-~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f-~~n~~l 66 (331)
T PF07995_consen 2 NNPRSMAFLPDGRLLVAER-SG------------RIWVVD-KDGSLKTPVADLPEVFADGERGLLGIAFHPDF-ASNGYL 66 (331)
T ss_dssp SSEEEEEEETTSCEEEEET-TT------------EEEEEE-TTTEECEEEEE-TTTBTSTTBSEEEEEE-TTC-CCC-EE
T ss_pred CCceEEEEeCCCcEEEEeC-Cc------------eEEEEe-CCCcCcceecccccccccccCCcccceecccc-CCCCEE
Confidence 3456788888999999887 21 788888 55554 22234543223445788999998521 124677
Q ss_pred EEEcC
Q psy11781 997 YVNDV 1001 (1228)
Q Consensus 997 YItD~ 1001 (1228)
|++=+
T Consensus 67 Yv~~t 71 (331)
T PF07995_consen 67 YVYYT 71 (331)
T ss_dssp EEEEE
T ss_pred EEEEE
Confidence 76544
No 65
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=79.78 E-value=86 Score=33.08 Aligned_cols=76 Identities=12% Similarity=0.119 Sum_probs=44.8
Q ss_pred EEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEE
Q psy11781 920 VFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVN 999 (1228)
Q Consensus 920 V~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYIt 999 (1228)
+..+...+++++.+.-.. ..+|.+||+.+++.++.+.-. ...+..+.++. ++.+++.+
T Consensus 96 i~~~~~~~~~~~~~~~~~------------~~~i~~~~~~~~~~~~~~~~~------~~~i~~~~~~~----~~~~l~~~ 153 (289)
T cd00200 96 VSSVAFSPDGRILSSSSR------------DKTIKVWDVETGKCLTTLRGH------TDWVNSVAFSP----DGTFVASS 153 (289)
T ss_pred EEEEEEcCCCCEEEEecC------------CCeEEEEECCCcEEEEEeccC------CCcEEEEEEcC----cCCEEEEE
Confidence 444455555555555431 126889999988887766521 23455566652 24556555
Q ss_pred cCCCCeEEEEECCCCeEEE
Q psy11781 1000 DVFRYGLIVYDFFKNTSYR 1018 (1228)
Q Consensus 1000 D~~~~gLIVydl~~g~swR 1018 (1228)
. ..+.|.+||+.+++..+
T Consensus 154 ~-~~~~i~i~d~~~~~~~~ 171 (289)
T cd00200 154 S-QDGTIKLWDLRTGKCVA 171 (289)
T ss_pred c-CCCcEEEEEccccccce
Confidence 4 34578999998655433
No 66
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=79.68 E-value=7.6 Score=45.35 Aligned_cols=89 Identities=12% Similarity=0.110 Sum_probs=53.1
Q ss_pred CcceEEEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCce
Q psy11781 915 NSLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHV 994 (1228)
Q Consensus 915 ~~lvSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~ 994 (1228)
++|.-..+.+- -+|+||++|+|.. +|..+|+.||+....-.+|. ...--.++.++.|= ...+-+.
T Consensus 200 ~GLsmPhSPRW-hdgrLwvldsgtG------------ev~~vD~~~G~~e~Va~vpG-~~rGL~f~G~llvV-gmSk~R~ 264 (335)
T TIGR03032 200 SGLSMPHSPRW-YQGKLWLLNSGRG------------ELGYVDPQAGKFQPVAFLPG-FTRGLAFAGDFAFV-GLSKLRE 264 (335)
T ss_pred cCccCCcCCcE-eCCeEEEEECCCC------------EEEEEcCCCCcEEEEEECCC-CCcccceeCCEEEE-EeccccC
Confidence 33444444333 3689999999987 68999999998887778865 22222344443331 1111111
Q ss_pred EEE-----EE---cCCCCeEEEEECCCCeEEE
Q psy11781 995 FAY-----VN---DVFRYGLIVYDFFKNTSYR 1018 (1228)
Q Consensus 995 ~aY-----It---D~~~~gLIVydl~~g~swR 1018 (1228)
-.. |+ +....||.|+|+++|..--
T Consensus 265 ~~~f~glpl~~~l~~~~CGv~vidl~tG~vv~ 296 (335)
T TIGR03032 265 SRVFGGLPIEERLDALGCGVAVIDLNSGDVVH 296 (335)
T ss_pred CCCcCCCchhhhhhhhcccEEEEECCCCCEEE
Confidence 111 11 1124899999999997443
No 67
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=79.67 E-value=87 Score=33.06 Aligned_cols=75 Identities=12% Similarity=0.130 Sum_probs=45.1
Q ss_pred EEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEe
Q psy11781 470 VFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVN 549 (1228)
Q Consensus 470 V~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYIt 549 (1228)
+..+...+.+++.+.-.. .-+|.+||+.+++.++.+.- .......+.++. .+.+++.+
T Consensus 96 i~~~~~~~~~~~~~~~~~------------~~~i~~~~~~~~~~~~~~~~------~~~~i~~~~~~~----~~~~l~~~ 153 (289)
T cd00200 96 VSSVAFSPDGRILSSSSR------------DKTIKVWDVETGKCLTTLRG------HTDWVNSVAFSP----DGTFVASS 153 (289)
T ss_pred EEEEEEcCCCCEEEEecC------------CCeEEEEECCCcEEEEEecc------CCCcEEEEEEcC----cCCEEEEE
Confidence 444555555666555431 12488999998887766552 123456666652 24555555
Q ss_pred cCCCCeEEEEEcCCCeEE
Q psy11781 550 DVFRYGLIVYDFFKNTSY 567 (1228)
Q Consensus 550 Ds~~~gLIVyDl~~g~sw 567 (1228)
. ..+.|.+||+.+++..
T Consensus 154 ~-~~~~i~i~d~~~~~~~ 170 (289)
T cd00200 154 S-QDGTIKLWDLRTGKCV 170 (289)
T ss_pred c-CCCcEEEEEccccccc
Confidence 4 3567889999865543
No 68
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=79.38 E-value=60 Score=39.23 Aligned_cols=29 Identities=7% Similarity=0.253 Sum_probs=24.9
Q ss_pred CCcccEEEc-CCCcEEEEecCCCeEEEEeCC
Q psy11781 1134 TQASASAIN-SNGVMFYNLVTKHSVGCWNTK 1163 (1228)
Q Consensus 1134 sqs~g~~~D-~~G~LY~t~~~~~aI~~wn~~ 1163 (1228)
+|+.|+++| ++|.||+++-. .+|++++.+
T Consensus 208 sQ~EGCVVDDe~g~LYvgEE~-~GIW~y~Ae 237 (381)
T PF02333_consen 208 SQPEGCVVDDETGRLYVGEED-VGIWRYDAE 237 (381)
T ss_dssp S-EEEEEEETTTTEEEEEETT-TEEEEEESS
T ss_pred CcceEEEEecccCCEEEecCc-cEEEEEecC
Confidence 689999999 99999999866 689999876
No 69
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=77.22 E-value=1.1e+02 Score=35.07 Aligned_cols=61 Identities=13% Similarity=0.031 Sum_probs=36.6
Q ss_pred eEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCCcceEEcCeEeee
Q psy11781 532 NIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRW 592 (1228)
Q Consensus 532 di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~~~~~i~G~~f~~ 592 (1228)
|.+|=+.....+.-+-||-.-..||+||||.......+...-+.--...-.|.++|+.+..
T Consensus 55 DPAIwVh~t~P~kS~vItt~Kk~Gl~VYDLsGkqLqs~~~Gk~NNVDLrygF~LgG~~idi 115 (364)
T COG4247 55 DPAIWVHATNPDKSLVITTVKKAGLRVYDLSGKQLQSVNPGKYNNVDLRYGFQLGGQSIDI 115 (364)
T ss_pred CcceEeccCCcCcceEEEeeccCCeEEEecCCCeeeecCCCcccccccccCcccCCeEEEE
Confidence 3343333323455567777778999999999987777664322211122256777776653
No 70
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=77.07 E-value=1.7e+02 Score=34.99 Aligned_cols=57 Identities=7% Similarity=0.069 Sum_probs=42.6
Q ss_pred EEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCe-EEEeec
Q psy11781 952 KIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNT-SYRLTH 1021 (1228)
Q Consensus 952 KLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~-swRv~h 1021 (1228)
+|++||.+|++++-++.++..+ ....++. ++.+|+... .+.|+.+|.++|+ .|+...
T Consensus 131 ~l~ald~~tG~~~W~~~~~~~~------~ssP~v~------~~~v~v~~~-~g~l~ald~~tG~~~W~~~~ 188 (394)
T PRK11138 131 QVYALNAEDGEVAWQTKVAGEA------LSRPVVS------DGLVLVHTS-NGMLQALNESDGAVKWTVNL 188 (394)
T ss_pred EEEEEECCCCCCcccccCCCce------ecCCEEE------CCEEEEECC-CCEEEEEEccCCCEeeeecC
Confidence 7999999999999999886542 2334443 467888754 4579999999887 588763
No 71
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=76.93 E-value=9.9 Score=44.43 Aligned_cols=88 Identities=11% Similarity=0.132 Sum_probs=54.3
Q ss_pred CcceEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEE-EeecCCCC
Q psy11781 465 NSLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVT-EVVEDCDH 543 (1228)
Q Consensus 465 ~~lvsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~V-D~~~~c~~ 543 (1228)
++|.-+.+.+- -+||||++|+|.-. |..+|+++|+....-.+|.- ...-.++.++.| -.. +.+
T Consensus 200 ~GLsmPhSPRW-hdgrLwvldsgtGe------------v~~vD~~~G~~e~Va~vpG~-~rGL~f~G~llvVgmS--k~R 263 (335)
T TIGR03032 200 SGLSMPHSPRW-YQGKLWLLNSGRGE------------LGYVDPQAGKFQPVAFLPGF-TRGLAFAGDFAFVGLS--KLR 263 (335)
T ss_pred cCccCCcCCcE-eCCeEEEEECCCCE------------EEEEcCCCCcEEEEEECCCC-CcccceeCCEEEEEec--ccc
Confidence 44444444444 36999999999854 89999999998877788752 333344445443 211 112
Q ss_pred eEEE-----Eec---CCCCeEEEEEcCCCeEEE
Q psy11781 544 VFAY-----VND---VFRYGLIVYDFFKNTSYR 568 (1228)
Q Consensus 544 ~~aY-----ItD---s~~~gLIVyDl~~g~swR 568 (1228)
.... |++ ....||.|.|+.+|..--
T Consensus 264 ~~~~f~glpl~~~l~~~~CGv~vidl~tG~vv~ 296 (335)
T TIGR03032 264 ESRVFGGLPIEERLDALGCGVAVIDLNSGDVVH 296 (335)
T ss_pred CCCCcCCCchhhhhhhhcccEEEEECCCCCEEE
Confidence 1222 221 225899999999997443
No 72
>PRK05137 tolB translocation protein TolB; Provisional
Probab=76.75 E-value=1.2e+02 Score=36.81 Aligned_cols=61 Identities=16% Similarity=0.161 Sum_probs=38.2
Q ss_pred CCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCC-CCeEEEEECCCCeEEEee
Q psy11781 950 PPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVF-RYGLIVYDFFKNTSYRLT 1020 (1228)
Q Consensus 950 ~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~-~~gLIVydl~~g~swRv~ 1020 (1228)
.+.|+++|++++++.+ |... .......... . +.+..||.+|.. .+-|.++|+.+++..|+.
T Consensus 269 ~~~Iy~~d~~~~~~~~---Lt~~----~~~~~~~~~s--p-DG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt 330 (435)
T PRK05137 269 NTDIYTMDLRSGTTTR---LTDS----PAIDTSPSYS--P-DGSQIVFESDRSGSPQLYVMNADGSNPRRIS 330 (435)
T ss_pred CceEEEEECCCCceEE---ccCC----CCccCceeEc--C-CCCEEEEEECCCCCCeEEEEECCCCCeEEee
Confidence 4578999998887643 2111 1111112222 1 245678888754 478999999999888886
No 73
>PRK04792 tolB translocation protein TolB; Provisional
Probab=76.33 E-value=99 Score=37.98 Aligned_cols=31 Identities=10% Similarity=0.076 Sum_probs=24.2
Q ss_pred cCCceEEEEeeC-CcceEEEEEccCCceeeee
Q psy11781 100 DCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLT 130 (1228)
Q Consensus 100 ~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~ 130 (1228)
..|+-++|.+|. +.+-|.++|+.+++.=|++
T Consensus 315 pDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt 346 (448)
T PRK04792 315 PDGKSLIFTSERGGKPQIYRVNLASGKVSRLT 346 (448)
T ss_pred CCCCEEEEEECCCCCceEEEEECCCCCEEEEe
Confidence 447778888886 6788999999988866653
No 74
>PRK04043 tolB translocation protein TolB; Provisional
Probab=75.84 E-value=34 Score=41.71 Aligned_cols=125 Identities=16% Similarity=0.220 Sum_probs=70.9
Q ss_pred cCCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeeeec-Ccccc
Q psy11781 59 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTH-PYMYP 136 (1228)
Q Consensus 59 ~~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h-~~~~~ 136 (1228)
+.|.|.++||.|++.-+=..++.... ..+ ..--|+-+||..+. +..-|.++|+.+++.-|+++ +...-
T Consensus 211 ~~~~Iyv~dl~tg~~~~lt~~~g~~~---------~~~-~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~~d~ 280 (419)
T PRK04043 211 RKPTLYKYNLYTGKKEKIASSQGMLV---------VSD-VSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPGIDV 280 (419)
T ss_pred CCCEEEEEECCCCcEEEEecCCCcEE---------eeE-ECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccCCCccC
Confidence 36889999998886544333333211 111 12236667777765 67889999999888666643 22122
Q ss_pred CCCccceeeccceeeec---cCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEeccc
Q psy11781 137 EPTQSTYILDNLKFRWV---DGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSS 207 (1228)
Q Consensus 137 ~p~~~~~~~~~~~f~~~---~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~lsS 207 (1228)
.| .+.=+|.++-+. .|.+-++.-+...|+.+++.+. |.. ..++|+||+.|.|.....
T Consensus 281 ~p---~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~----------g~~-~~~~SPDG~~Ia~~~~~~ 340 (419)
T PRK04043 281 NG---NFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFH----------GKN-NSSVSTYKNYIVYSSRET 340 (419)
T ss_pred cc---EECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccC----------CCc-CceECCCCCEEEEEEcCC
Confidence 22 233344432222 3444444444445665555441 222 247899999999987654
No 75
>PRK04792 tolB translocation protein TolB; Provisional
Probab=74.43 E-value=30 Score=42.39 Aligned_cols=123 Identities=19% Similarity=0.141 Sum_probs=64.9
Q ss_pred cCCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeeeecC-----
Q psy11781 59 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTHP----- 132 (1228)
Q Consensus 59 ~~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h~----- 132 (1228)
+.++|..+|+.+++..+--.++... ...++ ..-|+.+||..+. +..-|.++|+.+++.-|+...
T Consensus 240 g~~~L~~~dl~tg~~~~lt~~~g~~-------~~~~w---SPDG~~La~~~~~~g~~~Iy~~dl~tg~~~~lt~~~~~~~ 309 (448)
T PRK04792 240 RKAEIFVQDIYTQVREKVTSFPGIN-------GAPRF---SPDGKKLALVLSKDGQPEIYVVDIATKALTRITRHRAIDT 309 (448)
T ss_pred CCcEEEEEECCCCCeEEecCCCCCc-------CCeeE---CCCCCEEEEEEeCCCCeEEEEEECCCCCeEECccCCCCcc
Confidence 4678888999888653322222210 11222 2227778888765 566799999999988777332
Q ss_pred --ccccCCCccceeeccceeeeccCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEecc
Q psy11781 133 --YMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLS 206 (1228)
Q Consensus 133 --~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~ls 206 (1228)
.+.||...+-|+-+ .+|.+-++.-....|+.+++... ...+.+.+.|+||+.|||....
T Consensus 310 ~p~wSpDG~~I~f~s~------~~g~~~Iy~~dl~~g~~~~Lt~~---------g~~~~~~~~SpDG~~l~~~~~~ 370 (448)
T PRK04792 310 EPSWHPDGKSLIFTSE------RGGKPQIYRVNLASGKVSRLTFE---------GEQNLGGSITPDGRSMIMVNRT 370 (448)
T ss_pred ceEECCCCCEEEEEEC------CCCCceEEEEECCCCCEEEEecC---------CCCCcCeeECCCCCEEEEEEec
Confidence 22233222211110 12322233222223443333221 1122356889999999997653
No 76
>PRK02889 tolB translocation protein TolB; Provisional
Probab=73.74 E-value=76 Score=38.57 Aligned_cols=98 Identities=18% Similarity=0.188 Sum_probs=55.7
Q ss_pred CceEEEEeeC-CcceEEEEEccCCceeeee-------cCccccCCCccceeeccceeeeccCeeeeecCcCCCCccccCC
Q psy11781 102 DHVFAYVNDV-FRYGLIVYDFFKNTSYRLT-------HPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHP 173 (1228)
Q Consensus 102 ~~~~aYi~D~-~~~g~iv~dl~~~~swR~~-------h~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~~ 173 (1228)
|+-+||.++. +++.|.++|+.+|+.=++. ++.+.||...+-|+.. .+|.+-++.-....|+.+++.
T Consensus 207 G~~la~~s~~~~~~~I~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~------~~g~~~Iy~~d~~~~~~~~lt 280 (427)
T PRK02889 207 GTKLAYVSFESKKPVVYVHDLATGRRRVVANFKGSNSAPAWSPDGRTLAVALS------RDGNSQIYTVNADGSGLRRLT 280 (427)
T ss_pred CCEEEEEEccCCCcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEEc------cCCCceEEEEECCCCCcEECC
Confidence 6779999876 5678999999999876663 2344444443322211 244333333222223322222
Q ss_pred cccccccccccCCCcccccccCCCceEEEEec--ccCeeEEEe
Q psy11781 174 YEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSL--SSNRHYYVS 214 (1228)
Q Consensus 174 ~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~l--sS~~~~sV~ 214 (1228)
. ..|.+...+.|+||+.|+|..= ....+|.++
T Consensus 281 ~---------~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~ 314 (427)
T PRK02889 281 Q---------SSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMP 314 (427)
T ss_pred C---------CCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEE
Confidence 1 1244556789999999999642 233566654
No 77
>PRK00178 tolB translocation protein TolB; Provisional
Probab=73.09 E-value=39 Score=40.75 Aligned_cols=99 Identities=11% Similarity=0.118 Sum_probs=56.8
Q ss_pred CCceEEEEeeC-CcceEEEEEccCCceeeeec-------CccccCCCccceeeccceeeeccCeeeeecCcCCCCccccC
Q psy11781 101 CDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTH-------PYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRH 172 (1228)
Q Consensus 101 ~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h-------~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~ 172 (1228)
.|+.++|.+|. +.+.|.++|+.+++.=|+.. +.+.|+...+-|.-. .+|.+-+++-....|+.+++
T Consensus 297 Dg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~~~~~~~~~Spdg~~i~~~~~------~~~~~~l~~~dl~tg~~~~l 370 (430)
T PRK00178 297 DGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVGNYNARPRLSADGKTLVMVHR------QDGNFHVAAQDLQRGSVRIL 370 (430)
T ss_pred CCCEEEEEECCCCCceEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEEc------cCCceEEEEEECCCCCEEEc
Confidence 47788999887 57789999999888555531 223333333322111 12333333333333433333
Q ss_pred CcccccccccccCCCcccccccCCCceEEEEeccc--CeeEEEec
Q psy11781 173 PYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSS--NRHYYVST 215 (1228)
Q Consensus 173 ~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~lsS--~~~~sV~T 215 (1228)
.. .+.+.....|+||+.|+|+...+ .++|.++.
T Consensus 371 t~----------~~~~~~p~~spdg~~i~~~~~~~g~~~l~~~~~ 405 (430)
T PRK00178 371 TD----------TSLDESPSVAPNGTMLIYATRQQGRGVLMLVSI 405 (430)
T ss_pred cC----------CCCCCCceECCCCCEEEEEEecCCceEEEEEEC
Confidence 22 13455778999999999987643 44666655
No 78
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=69.59 E-value=5.4 Score=32.14 Aligned_cols=31 Identities=23% Similarity=0.233 Sum_probs=22.8
Q ss_pred eEEeccccCCCCcccEEEcCCCcEEEEecCC
Q psy11781 1124 FHYLGSRFRNTQASASAINSNGVMFYNLVTK 1154 (1228)
Q Consensus 1124 v~~lG~rg~~sqs~g~~~D~~G~LY~t~~~~ 1154 (1228)
++.+|..+....+.++++|++||+|++-...
T Consensus 3 ~~~lG~~~~~~~~~~IavD~~GNiYv~G~T~ 33 (38)
T PF06739_consen 3 STQLGGPGAQDYGNGIAVDSNGNIYVTGYTN 33 (38)
T ss_pred EEEeCCCCCceeEEEEEECCCCCEEEEEeec
Confidence 4567754433458899999999999985543
No 79
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=68.31 E-value=5.4 Score=32.17 Aligned_cols=31 Identities=23% Similarity=0.233 Sum_probs=22.4
Q ss_pred eEeeccccccccccceEEccCCcEEEEcCCC
Q psy11781 674 FHYLGSRFRNTQASASAINSNGVMFYNLVTK 704 (1228)
Q Consensus 674 v~~lG~~g~~~qs~G~a~D~~G~LY~t~~~~ 704 (1228)
++.+|..+....+.++++|++|++|++-...
T Consensus 3 ~~~lG~~~~~~~~~~IavD~~GNiYv~G~T~ 33 (38)
T PF06739_consen 3 STQLGGPGAQDYGNGIAVDSNGNIYVTGYTN 33 (38)
T ss_pred EEEeCCCCCceeEEEEEECCCCCEEEEEeec
Confidence 4566744433458999999999999985543
No 80
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=68.19 E-value=1.2e+02 Score=34.00 Aligned_cols=61 Identities=11% Similarity=0.184 Sum_probs=39.0
Q ss_pred CCcEEEEecCCCeEEEEeCCCCccc--------cCCeeEEEecCCceecCCceEEeCCC-cEEEEeCCchhhh
Q psy11781 1144 NGVMFYNLVTKHSVGCWNTKTKVYL--------PQTQDIVQTSRDILNFPNDLKIDPND-NIWILSNKLHQYL 1207 (1228)
Q Consensus 1144 ~G~LY~t~~~~~aI~~wn~~~~~~~--------~~n~~~v~~~~~~L~wPd~l~id~dg-~Lyvlsn~l~~~~ 1207 (1228)
+|.||-.-...++|.+.+|++++.. ..+.... ..+..=++|++.++++ .+|++--+++..+
T Consensus 185 dG~lyANVw~t~~I~rI~p~sGrV~~widlS~L~~~~~~~---~~~~nvlNGIA~~~~~~r~~iTGK~wp~lf 254 (262)
T COG3823 185 DGELYANVWQTTRIARIDPDSGRVVAWIDLSGLLKELNLD---KSNDNVLNGIAHDPQQDRFLITGKLWPLLF 254 (262)
T ss_pred ccEEEEeeeeecceEEEcCCCCcEEEEEEccCCchhcCcc---ccccccccceeecCcCCeEEEecCcCceeE
Confidence 4777777777788999988866531 0111211 2233457899988776 7888777777654
No 81
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=67.67 E-value=13 Score=46.60 Aligned_cols=69 Identities=16% Similarity=0.369 Sum_probs=49.9
Q ss_pred cccEEEc-CCCcEEEEecCCC-------------------eEEEEeCCCCc---cccCCeeEEEe--c------------
Q psy11781 1136 ASASAIN-SNGVMFYNLVTKH-------------------SVGCWNTKTKV---YLPQTQDIVQT--S------------ 1178 (1228)
Q Consensus 1136 s~g~~~D-~~G~LY~t~~~~~-------------------aI~~wn~~~~~---~~~~n~~~v~~--~------------ 1178 (1228)
..++.++ .+|.|||+..+.. .|++|++. +. ...-...++.. +
T Consensus 352 pEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~-~~d~~~~~f~~~~~~~~g~~~~~~~~~~~~~ 430 (524)
T PF05787_consen 352 PEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPD-GNDHAATTFTWELFLVGGDPTDASGNGSNKC 430 (524)
T ss_pred ccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEeccc-CCccccceeEEEEEEEecCcccccccccCcc
Confidence 5688988 7799999988777 79999987 32 11112333332 1
Q ss_pred -CCceecCCceEEeCCCcEEEEeCCchh
Q psy11781 1179 -RDILNFPNDLKIDPNDNIWILSNKLHQ 1205 (1228)
Q Consensus 1179 -~~~L~wPd~l~id~dg~Lyvlsn~l~~ 1205 (1228)
...|.-||.|++|++|.|||.+..-..
T Consensus 431 ~~~~f~sPDNL~~d~~G~LwI~eD~~~~ 458 (524)
T PF05787_consen 431 DDNGFASPDNLAFDPDGNLWIQEDGGGS 458 (524)
T ss_pred cCCCcCCCCceEECCCCCEEEEeCCCCC
Confidence 346889999999999999999965543
No 82
>PRK01029 tolB translocation protein TolB; Provisional
Probab=66.70 E-value=2.8e+02 Score=33.94 Aligned_cols=28 Identities=7% Similarity=0.096 Sum_probs=24.0
Q ss_pred eEEEEEcCC-CCeEEEEECCCCeEEEeec
Q psy11781 994 VFAYVNDVF-RYGLIVYDFFKNTSYRLTH 1021 (1228)
Q Consensus 994 ~~aYItD~~-~~gLIVydl~~g~swRv~h 1021 (1228)
.++|+++.. .+.|.++|+.+|+..++.+
T Consensus 200 ~~~y~S~~~g~~~I~~~~l~~g~~~~lt~ 228 (428)
T PRK01029 200 PYLYVSYKLGVPKIFLGSLENPAGKKILA 228 (428)
T ss_pred EEEEEEccCCCceEEEEECCCCCceEeec
Confidence 488999865 5789999999999999874
No 83
>PRK03629 tolB translocation protein TolB; Provisional
Probab=66.60 E-value=3.1e+02 Score=33.49 Aligned_cols=60 Identities=17% Similarity=0.143 Sum_probs=39.3
Q ss_pred CEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCC-CCeEEEEECCCCeEEEee
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVF-RYGLIVYDFFKNTSYRLT 1020 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~-~~gLIVydl~~g~swRv~ 1020 (1228)
+.|+++|++++++.+ +... .......... .+.+..+|.+|.+ ...|.++|+.+++..++.
T Consensus 267 ~~I~~~d~~tg~~~~---lt~~----~~~~~~~~wS---PDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt 327 (429)
T PRK03629 267 LNLYVMDLASGQIRQ---VTDG----RSNNTEPTWF---PDSQNLAYTSDQAGRPQVYKVNINGGAPQRIT 327 (429)
T ss_pred cEEEEEECCCCCEEE---ccCC----CCCcCceEEC---CCCCEEEEEeCCCCCceEEEEECCCCCeEEee
Confidence 479999999988753 2111 1112223332 1245678999864 568999999999888875
No 84
>PRK04922 tolB translocation protein TolB; Provisional
Probab=65.91 E-value=72 Score=38.77 Aligned_cols=99 Identities=12% Similarity=0.113 Sum_probs=54.2
Q ss_pred CCceEEEEeeC-CcceEEEEEccCCceeeee-------cCccccCCCccceeeccceeeeccCeeeeecCcCCCCccccC
Q psy11781 101 CDHVFAYVNDV-FRYGLIVYDFFKNTSYRLT-------HPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRH 172 (1228)
Q Consensus 101 ~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~-------h~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~ 172 (1228)
.|+-++|.+|- +..-|.++|+.+++-=|+. .+.+.|+...+-|.-. .+|.+-+++-....|+.+.+
T Consensus 302 DG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~~~~~~~~SpDG~~Ia~~~~------~~~~~~I~v~d~~~g~~~~L 375 (433)
T PRK04922 302 DGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQGNYNARASVSPDGKKIAMVHG------SGGQYRIAVMDLSTGSVRTL 375 (433)
T ss_pred CCCEEEEEECCCCCceEEEEECCCCCeEEeecCCCCccCEEECCCCCEEEEEEC------CCCceeEEEEECCCCCeEEC
Confidence 37778999886 5677999999888755552 1233333333211100 12222222222223333222
Q ss_pred CcccccccccccCCCcccccccCCCceEEEEecc--cCeeEEEec
Q psy11781 173 PYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLS--SNRHYYVST 215 (1228)
Q Consensus 173 ~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~ls--S~~~~sV~T 215 (1228)
.. ++.+..++.|+||+.|+|..-. ...+|.+++
T Consensus 376 t~----------~~~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~ 410 (433)
T PRK04922 376 TP----------GSLDESPSFAPNGSMVLYATREGGRGVLAAVST 410 (433)
T ss_pred CC----------CCCCCCceECCCCCEEEEEEecCCceEEEEEEC
Confidence 22 1234467899999999998763 345777765
No 85
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=65.86 E-value=15 Score=46.01 Aligned_cols=70 Identities=16% Similarity=0.365 Sum_probs=50.9
Q ss_pred ccccceEEc-cCCcEEEEcCCCC-------------------eEEEEeCCCCc---cCCCCeeEEE--ec----------
Q psy11781 684 TQASASAIN-SNGVMFYNLVTKH-------------------SVGCWNTKTKV---YLPQTQDIVQ--TS---------- 728 (1228)
Q Consensus 684 ~qs~G~a~D-~~G~LY~t~~~~~-------------------aI~~wn~~~~~---~~~~n~~iV~--~~---------- 728 (1228)
....|++++ .+|.|||+..+.. .|++|++. +. ...-...++. -+
T Consensus 350 ~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~-~~d~~~~~f~~~~~~~~g~~~~~~~~~~~ 428 (524)
T PF05787_consen 350 DRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPD-GNDHAATTFTWELFLVGGDPTDASGNGSN 428 (524)
T ss_pred cCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEeccc-CCccccceeEEEEEEEecCcccccccccC
Confidence 347899999 7899999987666 89999987 32 1111223332 22
Q ss_pred ---CCceecCCceEEcCCCcEEEEecCcc
Q psy11781 729 ---RDILNFPNDLKIDPNDNIWILSNKLH 754 (1228)
Q Consensus 729 ---d~~L~wPd~l~Id~dg~LyVlsnrl~ 754 (1228)
...|.-||.|+++++|.|||.+..-.
T Consensus 429 ~~~~~~f~sPDNL~~d~~G~LwI~eD~~~ 457 (524)
T PF05787_consen 429 KCDDNGFASPDNLAFDPDGNLWIQEDGGG 457 (524)
T ss_pred cccCCCcCCCCceEECCCCCEEEEeCCCC
Confidence 44688999999999999999996654
No 86
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=65.67 E-value=79 Score=36.39 Aligned_cols=120 Identities=20% Similarity=0.225 Sum_probs=75.0
Q ss_pred cCCCcccceEEeCCeEEEEeccCCCCCC-----cEEEEEECCCCCCCCccccCCCCCcCCCCCCCcceEEEEEEEeCCCc
Q psy11781 856 PENNLPLGIGIWRSTIFLSFPKWKAGIP-----FTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVDKCDR 930 (1228)
Q Consensus 856 p~n~~p~gv~v~~gRiFvt~PR~~~gvP-----~TLa~v~~~~~~~sP~l~PYP~~~wn~~~~c~~lvSV~~v~iD~~gr 930 (1228)
|....-.|+++.++++|.=. |+.|+- .||..+. --+||..-| +|. -| ..+
T Consensus 87 ~~~~FgEGit~~~d~l~qLT--Wk~~~~f~yd~~tl~~~~---------~~~y~~EGW-------GLt------~d-g~~ 141 (264)
T PF05096_consen 87 PPRYFGEGITILGDKLYQLT--WKEGTGFVYDPNTLKKIG---------TFPYPGEGW-------GLT------SD-GKR 141 (264)
T ss_dssp TTT--EEEEEEETTEEEEEE--SSSSEEEEEETTTTEEEE---------EEE-SSS---------EEE------EC-SSC
T ss_pred CccccceeEEEECCEEEEEE--ecCCeEEEEccccceEEE---------EEecCCcce-------EEE------cC-CCE
Confidence 45567789999999999888 988753 2333332 247886555 333 33 457
Q ss_pred EEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEE
Q psy11781 931 LWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYD 1010 (1228)
Q Consensus 931 LWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVyd 1010 (1228)
||+=|- ..+|..+|.+|=++++++.... .-.+-..+|.|.. -++++|--==....|++.|
T Consensus 142 Li~SDG-------------S~~L~~~dP~~f~~~~~i~V~~-~g~pv~~LNELE~------i~G~IyANVW~td~I~~Id 201 (264)
T PF05096_consen 142 LIMSDG-------------SSRLYFLDPETFKEVRTIQVTD-NGRPVSNLNELEY------INGKIYANVWQTDRIVRID 201 (264)
T ss_dssp EEEE-S-------------SSEEEEE-TTT-SEEEEEE-EE-TTEE---EEEEEE------ETTEEEEEETTSSEEEEEE
T ss_pred EEEECC-------------ccceEEECCcccceEEEEEEEE-CCEECCCcEeEEE------EcCEEEEEeCCCCeEEEEe
Confidence 887764 2378999998888999888853 2334466776665 3688886655578999999
Q ss_pred CCCCeEEEee
Q psy11781 1011 FFKNTSYRLT 1020 (1228)
Q Consensus 1011 l~~g~swRv~ 1020 (1228)
.++|+--.+.
T Consensus 202 p~tG~V~~~i 211 (264)
T PF05096_consen 202 PETGKVVGWI 211 (264)
T ss_dssp TTT-BEEEEE
T ss_pred CCCCeEEEEE
Confidence 9999887765
No 87
>smart00284 OLF Olfactomedin-like domains.
Probab=63.77 E-value=1.8e+02 Score=33.47 Aligned_cols=158 Identities=13% Similarity=0.159 Sum_probs=86.8
Q ss_pred eecCCHHHHHhhccCCCc-ccCCCcccceEEeCCeEEEEcccCCCCCCcEEEEEECCCCCCCCcccCCCCcCcCCCCC-C
Q psy11781 387 FVFRDNKQKANLIRSNKF-IPENNLPLGIGIWRSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSN-C 464 (1228)
Q Consensus 387 f~~p~~~~r~~ai~~g~y-~p~n~~P~gv~v~~gRiFvt~PR~~~gvP~TL~~v~~~~~~~sp~l~PYP~~~wn~~~~-c 464 (1228)
++|++.++-...-.+..| .|....=+|..|.+|-+|..--+ ..+|...+..... --..+.-|++..|+.-. +
T Consensus 51 ~ey~~~~~f~~~~~~~~~~Lp~~~~GtG~VVYngslYY~~~~-----s~~iiKydL~t~~-v~~~~~Lp~a~y~~~~~Y~ 124 (255)
T smart00284 51 REYSSMSDFQMGKNPTDHPLPHAGQGTGVVVYNGSLYFNKFN-----SHDICRFDLTTET-YQKEPLLNGAGYNNRFPYA 124 (255)
T ss_pred EEecCHHHHhccCCceEEECCCccccccEEEECceEEEEecC-----CccEEEEECCCCc-EEEEEecCccccccccccc
Confidence 455666554433222222 23333346788889999985321 3568888764321 11234456655443211 2
Q ss_pred CcceEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCe
Q psy11781 465 NSLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHV 544 (1228)
Q Consensus 465 ~~lvsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~ 544 (1228)
.+=-+-..+.+|+.| |||+=...-+. | .=-|-.+|..|=++++++..+- ...+.=+.+.| | |
T Consensus 125 ~~~~sdiDlAvDE~G-LWvIYat~~~~-g------~ivvSkLnp~tL~ve~tW~T~~---~k~sa~naFmv-----C--G 186 (255)
T smart00284 125 WGGFSDIDLAVDENG-LWVIYATEQNA-G------KIVISKLNPATLTIENTWITTY---NKRSASNAFMI-----C--G 186 (255)
T ss_pred cCCCccEEEEEcCCc-eEEEEeccCCC-C------CEEEEeeCcccceEEEEEEcCC---CcccccccEEE-----e--e
Confidence 222455788999988 99993332110 0 0013356766667888888732 23344466667 5 6
Q ss_pred EEEEecCC--CCe--EEEEEcCCCeEEE
Q psy11781 545 FAYVNDVF--RYG--LIVYDFFKNTSYR 568 (1228)
Q Consensus 545 ~aYItDs~--~~g--LIVyDl~~g~swR 568 (1228)
.+|++++. ..+ -.+||..+++.-.
T Consensus 187 vLY~~~s~~~~~~~I~yayDt~t~~~~~ 214 (255)
T smart00284 187 ILYVTRSLGSKGEKVFYAYDTNTGKEGH 214 (255)
T ss_pred EEEEEccCCCCCcEEEEEEECCCCccce
Confidence 89999973 233 5677777765443
No 88
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=63.33 E-value=88 Score=38.42 Aligned_cols=87 Identities=15% Similarity=0.192 Sum_probs=57.7
Q ss_pred CcceEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCe
Q psy11781 465 NSLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHV 544 (1228)
Q Consensus 465 ~~lvsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~ 544 (1228)
..|-++.++.++.+|.--|+-.-+. .|+++|+.||++- .+.++ ...++.++++.. ..+-
T Consensus 399 ~~lg~I~av~vs~dGK~~vvaNdr~------------el~vididngnv~---~idkS---~~~lItdf~~~~---nsr~ 457 (668)
T COG4946 399 KDLGNIEAVKVSPDGKKVVVANDRF------------ELWVIDIDNGNVR---LIDKS---EYGLITDFDWHP---NSRW 457 (668)
T ss_pred CCccceEEEEEcCCCcEEEEEcCce------------EEEEEEecCCCee---Eeccc---ccceeEEEEEcC---Ccee
Confidence 4577888888888887666544333 3899999998864 22221 224566666652 2456
Q ss_pred EEEEecCC--CCeEEEEEcCCCeEEEEcCC
Q psy11781 545 FAYVNDVF--RYGLIVYDFFKNTSYRLTHP 572 (1228)
Q Consensus 545 ~aYItDs~--~~gLIVyDl~~g~swRv~~~ 572 (1228)
+||--=++ ...|=+||+.+++..+++.+
T Consensus 458 iAYafP~gy~tq~Iklydm~~~Kiy~vTT~ 487 (668)
T COG4946 458 IAYAFPEGYYTQSIKLYDMDGGKIYDVTTP 487 (668)
T ss_pred EEEecCcceeeeeEEEEecCCCeEEEecCC
Confidence 67743334 46788999999999999854
No 89
>PRK05137 tolB translocation protein TolB; Provisional
Probab=63.14 E-value=90 Score=37.94 Aligned_cols=130 Identities=14% Similarity=0.109 Sum_probs=66.4
Q ss_pred CCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeeeec-------
Q psy11781 60 PPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTH------- 131 (1228)
Q Consensus 60 ~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h------- 131 (1228)
.+.|..+|++++++ ++.+--.. ...... -...|+-+||.+|. +.+-|-++|+.+++.-|+..
T Consensus 269 ~~~Iy~~d~~~~~~-~~Lt~~~~------~~~~~~---~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~~~~~~ 338 (435)
T PRK05137 269 NTDIYTMDLRSGTT-TRLTDSPA------IDTSPS---YSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGGGRYST 338 (435)
T ss_pred CceEEEEECCCCce-EEccCCCC------ccCcee---EcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCCCcccC
Confidence 46677777776654 33321111 001111 23447889999975 56789999998888777732
Q ss_pred CccccCCCccceeeccceeeeccCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEeccc----
Q psy11781 132 PYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSS---- 207 (1228)
Q Consensus 132 ~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~lsS---- 207 (1228)
+.+.|+-..+-|.-. ..|.+-+++-....+..+....+ +.+-.++.|+||+.|||..-..
T Consensus 339 ~~~SpdG~~ia~~~~------~~~~~~i~~~d~~~~~~~~lt~~----------~~~~~p~~spDG~~i~~~~~~~~~~~ 402 (435)
T PRK05137 339 PVWSPRGDLIAFTKQ------GGGQFSIGVMKPDGSGERILTSG----------FLVEGPTWAPNGRVIMFFRQTPGSGG 402 (435)
T ss_pred eEECCCCCEEEEEEc------CCCceEEEEEECCCCceEeccCC----------CCCCCCeECCCCCEEEEEEccCCCCC
Confidence 233344333322111 11222222222212222222111 1122678899999999976543
Q ss_pred -CeeEEEec
Q psy11781 208 -NRHYYVST 215 (1228)
Q Consensus 208 -~~~~sV~T 215 (1228)
..+|.+..
T Consensus 403 ~~~L~~~dl 411 (435)
T PRK05137 403 APKLYTVDL 411 (435)
T ss_pred cceEEEEEC
Confidence 35666653
No 90
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=60.85 E-value=15 Score=28.90 Aligned_cols=30 Identities=10% Similarity=0.168 Sum_probs=26.7
Q ss_pred ccccceEEc-cCCcEEEEcCCCCeEEEEeCC
Q psy11781 684 TQASASAIN-SNGVMFYNLVTKHSVGCWNTK 713 (1228)
Q Consensus 684 ~qs~G~a~D-~~G~LY~t~~~~~aI~~wn~~ 713 (1228)
..+.|+++| ..+.||+++.....|.+.+-+
T Consensus 9 ~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~ 39 (43)
T smart00135 9 GHPNGLAVDWIEGRLYWTDWGLDVIEVANLD 39 (43)
T ss_pred CCcCEEEEeecCCEEEEEeCCCCEEEEEeCC
Confidence 457899999 778999999999999998876
No 91
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=60.63 E-value=37 Score=39.83 Aligned_cols=60 Identities=25% Similarity=0.395 Sum_probs=47.0
Q ss_pred EEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCc---------ccCCCcceeeEEEe
Q psy11781 921 FRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQ---------VFEGSLFSNIVTEV 987 (1228)
Q Consensus 921 ~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~---------~~~~S~l~di~VD~ 987 (1228)
-+|+++..|.+||-|-|.... ..+|.|+.||+. +++++++.+|... .+.+..|..|++..
T Consensus 88 Egi~~~~~g~~~is~E~~~~~------~~~p~I~~~~~~-G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~ 156 (326)
T PF13449_consen 88 EGIAVPPDGSFWISSEGGRTG------GIPPRIRRFDLD-GRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSP 156 (326)
T ss_pred hHeEEecCCCEEEEeCCccCC------CCCCEEEEECCC-CcccceEccccccccccCccccccCCCCeEEEEECC
Confidence 345556899999999998532 238999999988 9999999999865 34567788888873
No 92
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=60.44 E-value=1.7e+02 Score=33.60 Aligned_cols=50 Identities=14% Similarity=0.079 Sum_probs=31.0
Q ss_pred ceEEEEEcCCCCeEEEEECCCCeEEEeecCCcCcCCCcceeEEcCEEEEe
Q psy11781 993 HVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILDNLKFRW 1042 (1228)
Q Consensus 993 ~~~aYItD~~~~gLIVydl~~g~swRv~h~s~~pdP~~~~~~i~G~~f~~ 1042 (1228)
+.-+-||-....||+||||..+...-+..--+.--...-.|-++|+.+..
T Consensus 66 ~kS~vItt~Kk~Gl~VYDLsGkqLqs~~~Gk~NNVDLrygF~LgG~~idi 115 (364)
T COG4247 66 DKSLVITTVKKAGLRVYDLSGKQLQSVNPGKYNNVDLRYGFQLGGQSIDI 115 (364)
T ss_pred CcceEEEeeccCCeEEEecCCCeeeecCCCcccccccccCcccCCeEEEE
Confidence 45566777778999999999876665542111111122346677877653
No 93
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=59.27 E-value=4.7e+02 Score=33.12 Aligned_cols=93 Identities=17% Similarity=0.061 Sum_probs=50.2
Q ss_pred EEeCCCcEEEEeCCCCCccccccc----cCCCEEEEEECCCCcEEEEEEC-CCCcccCCCcceeeEEEeecCCCceEEEE
Q psy11781 924 SVDKCDRLWIMDTGVTNILSSIQQ----LCPPKIMVFDLKTNTLIRKYIL-PTAQVFEGSLFSNIVTEVVEDCDHVFAYV 998 (1228)
Q Consensus 924 ~iD~~grLWVLDtG~~~~~~~~~~----~~~pKLvvfDL~td~lir~y~~-p~~~~~~~S~l~di~VD~~~~c~~~~aYI 998 (1228)
.+|+.-.|-.+=+|.+.-...... .-.-.|+++|++|+++.=.|.. |.++-.-+....-+.+|+..+-...-+.+
T Consensus 240 s~D~~~~lvy~~tGnp~p~~~~~r~gdnl~~~s~vAld~~TG~~~W~~Q~~~~D~wD~d~~~~p~l~d~~~~G~~~~~v~ 319 (527)
T TIGR03075 240 SYDPETNLIYFGTGNPSPWNSHLRPGDNLYTSSIVARDPDTGKIKWHYQTTPHDEWDYDGVNEMILFDLKKDGKPRKLLA 319 (527)
T ss_pred eEcCCCCeEEEeCCCCCCCCCCCCCCCCccceeEEEEccccCCEEEeeeCCCCCCccccCCCCcEEEEeccCCcEEEEEE
Confidence 556654444455555321100000 1234899999999999987876 55544334444455566532111113444
Q ss_pred EcCCCCeEEEEECCCCeE
Q psy11781 999 NDVFRYGLIVYDFFKNTS 1016 (1228)
Q Consensus 999 tD~~~~gLIVydl~~g~s 1016 (1228)
.=...+=+.|+|..+|+.
T Consensus 320 ~~~K~G~~~vlDr~tG~~ 337 (527)
T TIGR03075 320 HADRNGFFYVLDRTNGKL 337 (527)
T ss_pred EeCCCceEEEEECCCCce
Confidence 433445677777777765
No 94
>KOG2055|consensus
Probab=58.62 E-value=1.7e+02 Score=35.92 Aligned_cols=218 Identities=16% Similarity=0.196 Sum_probs=109.7
Q ss_pred CEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEeecCCcCcCCCc
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQ 1030 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv~h~s~~pdP~~ 1030 (1228)
-+|+.+|=+++..+..+.|-. +-+....+- .|+.. .-++=....=+.+|||.+.+.-++.-+.-..++..
T Consensus 237 lrifqvDGk~N~~lqS~~l~~------fPi~~a~f~---p~G~~-~i~~s~rrky~ysyDle~ak~~k~~~~~g~e~~~~ 306 (514)
T KOG2055|consen 237 LRIFQVDGKVNPKLQSIHLEK------FPIQKAEFA---PNGHS-VIFTSGRRKYLYSYDLETAKVTKLKPPYGVEEKSM 306 (514)
T ss_pred EEEEEecCccChhheeeeecc------Cccceeeec---CCCce-EEEecccceEEEEeeccccccccccCCCCcccchh
Confidence 389999999999998888833 223333332 12222 44455556778999999999888875555554444
Q ss_pred ceeEE-cCEEEEeeCCccce-ecCccccccccCCCccccccccccccCccccceecCCCceEEEEeCCCCceEEEecchh
Q psy11781 1031 STYIL-DNLKFRWVDGIFGM-AISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDL 1108 (1228)
Q Consensus 1031 ~~~~i-~G~~f~~~dGi~Gi-aLsp~~~~~~~~~p~~~~~~~~~~~~~~~~di~ls~d~r~LYf~plss~~ly~V~T~~L 1108 (1228)
..|.| +-.+|..-.|..|- -|--...+ + +--.....|.-.+++.|-|++.||-+--.| .+|.-+ |
T Consensus 307 e~FeVShd~~fia~~G~~G~I~lLhakT~-------e--li~s~KieG~v~~~~fsSdsk~l~~~~~~G-eV~v~n---l 373 (514)
T KOG2055|consen 307 ERFEVSHDSNFIAIAGNNGHIHLLHAKTK-------E--LITSFKIEGVVSDFTFSSDSKELLASGGTG-EVYVWN---L 373 (514)
T ss_pred heeEecCCCCeEEEcccCceEEeehhhhh-------h--hhheeeeccEEeeEEEecCCcEEEEEcCCc-eEEEEe---c
Confidence 44433 12224333343331 00000000 0 000001123334477888888888776666 354443 3
Q ss_pred cCCcccCCccccccceEEeccccCCCCcccEEEcCCCcEEEEecCCCeE-EEEeCCCCccccCCeeEEEecCCceecCCc
Q psy11781 1109 RNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKHSV-GCWNTKTKVYLPQTQDIVQTSRDILNFPND 1187 (1228)
Q Consensus 1109 r~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G~LY~t~~~~~aI-~~wn~~~~~~~~~n~~~v~~~~~~L~wPd~ 1187 (1228)
+... ....+++-|.-. .+.+++..+|. |++--.+.+| ..+|-+ ..+...+-.-+..=+.---=.+.
T Consensus 374 ~~~~-------~~~rf~D~G~v~----gts~~~S~ng~-ylA~GS~~GiVNIYd~~-s~~~s~~PkPik~~dNLtt~Its 440 (514)
T KOG2055|consen 374 RQNS-------CLHRFVDDGSVH----GTSLCISLNGS-YLATGSDSGIVNIYDGN-SCFASTNPKPIKTVDNLTTAITS 440 (514)
T ss_pred CCcc-------eEEEEeecCccc----eeeeeecCCCc-eEEeccCcceEEEeccc-hhhccCCCCchhhhhhhheeeee
Confidence 3222 223344444432 34556667776 4444444444 355533 22222221112211111123567
Q ss_pred eEEeCCCcEEEEeCCch
Q psy11781 1188 LKIDPNDNIWILSNKLH 1204 (1228)
Q Consensus 1188 l~id~dg~Lyvlsn~l~ 1204 (1228)
|++.++++|..++++..
T Consensus 441 l~Fn~d~qiLAiaS~~~ 457 (514)
T KOG2055|consen 441 LQFNHDAQILAIASRVK 457 (514)
T ss_pred eeeCcchhhhhhhhhcc
Confidence 77888888777766554
No 95
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=58.60 E-value=4.5e+02 Score=32.72 Aligned_cols=68 Identities=9% Similarity=0.039 Sum_probs=41.8
Q ss_pred eEEEEEccCCcEEEEEEcCCccccc-CCcccceEEEEeecCCc--eEEEEeeCCcceEEEEEccCCce-eeee
Q psy11781 62 KIMVFDLKTNTLIRKYILPTAQVFE-GSLFSNIVTEVVEDCDH--VFAYVNDVFRYGLIVYDFFKNTS-YRLT 130 (1228)
Q Consensus 62 kl~~~dl~t~~~~~~~~~p~~~~~~-~s~~~~i~~d~~~~~~~--~~aYi~D~~~~g~iv~dl~~~~s-wR~~ 130 (1228)
.|+++|++|+++.+++.+....... ++.-.-+..+.....++ .++|+....+. +..+|..+|+- |+..
T Consensus 257 ~l~Ald~~tG~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g~~~G~-l~ald~~tG~~~W~~~ 328 (488)
T cd00216 257 SIVALDADTGKVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHAPKNGF-FYVLDRTTGKLISARP 328 (488)
T ss_pred eEEEEcCCCCCEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEECCCce-EEEEECCCCcEeeEeE
Confidence 7999999999999999876543321 11111122222112233 36777654333 88999999874 7764
No 96
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=58.43 E-value=30 Score=27.22 Aligned_cols=32 Identities=22% Similarity=0.395 Sum_probs=26.3
Q ss_pred CCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCC
Q psy11781 928 CDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPT 971 (1228)
Q Consensus 928 ~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~ 971 (1228)
.++|||.+.+.- +|.+||+++.++++++....
T Consensus 3 ~~~lyv~~~~~~------------~v~~id~~~~~~~~~i~vg~ 34 (42)
T TIGR02276 3 GTKLYVTNSGSN------------TVSVIDTATNKVIATIPVGG 34 (42)
T ss_pred CCEEEEEeCCCC------------EEEEEECCCCeEEEEEECCC
Confidence 457999987643 78999999999999998853
No 97
>PRK00178 tolB translocation protein TolB; Provisional
Probab=58.18 E-value=1.1e+02 Score=37.07 Aligned_cols=126 Identities=16% Similarity=0.116 Sum_probs=64.0
Q ss_pred cCCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeeeec-Ccccc
Q psy11781 59 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTH-PYMYP 136 (1228)
Q Consensus 59 ~~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h-~~~~~ 136 (1228)
+.|.|..+|+++++..+--.++.. ....++ ..-|+-+||..+. +..-|-++|+.+++..|+.+ ....-
T Consensus 221 ~~~~l~~~~l~~g~~~~l~~~~g~-------~~~~~~---SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~~~ 290 (430)
T PRK00178 221 KRPRIFVQNLDTGRREQITNFEGL-------NGAPAW---SPDGSKLAFVLSKDGNPEIYVMDLASRQLSRVTNHPAIDT 290 (430)
T ss_pred CCCEEEEEECCCCCEEEccCCCCC-------cCCeEE---CCCCCEEEEEEccCCCceEEEEECCCCCeEEcccCCCCcC
Confidence 357888888888765432222211 111222 2236778887765 55689999999998887743 21111
Q ss_pred CCCccceeeccceeee---ccCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEecc
Q psy11781 137 EPTQSTYILDNLKFRW---VDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLS 206 (1228)
Q Consensus 137 ~p~~~~~~~~~~~f~~---~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~ls 206 (1228)
.| .+.-+|..+-+ .+|.+-++.-....|+.+++... .+.+...++|+||+.|+|....
T Consensus 291 ~~---~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~---------~~~~~~~~~Spdg~~i~~~~~~ 351 (430)
T PRK00178 291 EP---FWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFV---------GNYNARPRLSADGKTLVMVHRQ 351 (430)
T ss_pred Ce---EECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecC---------CCCccceEECCCCCEEEEEEcc
Confidence 11 12222222111 12333333322223433333221 0111245789999999998654
No 98
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=57.84 E-value=19 Score=28.28 Aligned_cols=29 Identities=10% Similarity=0.186 Sum_probs=25.8
Q ss_pred CcccEEEc-CCCcEEEEecCCCeEEEEeCC
Q psy11781 1135 QASASAIN-SNGVMFYNLVTKHSVGCWNTK 1163 (1228)
Q Consensus 1135 qs~g~~~D-~~G~LY~t~~~~~aI~~wn~~ 1163 (1228)
.+.|+++| ..+.||+++.....|.+.+-+
T Consensus 10 ~~~~la~d~~~~~lYw~D~~~~~I~~~~~~ 39 (43)
T smart00135 10 HPNGLAVDWIEGRLYWTDWGLDVIEVANLD 39 (43)
T ss_pred CcCEEEEeecCCEEEEEeCCCCEEEEEeCC
Confidence 47899999 778999999999999998876
No 99
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=56.30 E-value=4e+02 Score=31.46 Aligned_cols=82 Identities=21% Similarity=0.198 Sum_probs=52.1
Q ss_pred CEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEE----cCCCCeEEEEECCCCeEEEee-cCCcC
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVN----DVFRYGLIVYDFFKNTSYRLT-HPYMY 1025 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYIt----D~~~~gLIVydl~~g~swRv~-h~s~~ 1025 (1228)
--+++||..++++.+++.-|+. -.|.-=.+=. -++...|.| +.+.+-|-|||.+. ...|+. .+++.
T Consensus 28 ~~~~v~D~~~g~~~~~~~a~~g-----RHFyGHg~fs---~dG~~LytTEnd~~~g~G~IgVyd~~~-~~~ri~E~~s~G 98 (305)
T PF07433_consen 28 TFALVFDCRTGQLLQRLWAPPG-----RHFYGHGVFS---PDGRLLYTTENDYETGRGVIGVYDAAR-GYRRIGEFPSHG 98 (305)
T ss_pred cEEEEEEcCCCceeeEEcCCCC-----CEEecCEEEc---CCCCEEEEeccccCCCcEEEEEEECcC-CcEEEeEecCCC
Confidence 4799999999999986655443 4444422211 147789998 55667788999994 455554 56666
Q ss_pred cCCCcceeEEcCEEEE
Q psy11781 1026 PEPTQSTYILDNLKFR 1041 (1228)
Q Consensus 1026 pdP~~~~~~i~G~~f~ 1041 (1228)
-.|-...+.-+|+.+-
T Consensus 99 IGPHel~l~pDG~tLv 114 (305)
T PF07433_consen 99 IGPHELLLMPDGETLV 114 (305)
T ss_pred cChhhEEEcCCCCEEE
Confidence 5665444444454443
No 100
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=56.30 E-value=1.3e+02 Score=36.94 Aligned_cols=86 Identities=14% Similarity=0.165 Sum_probs=57.4
Q ss_pred CcceEEEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCce
Q psy11781 915 NSLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHV 994 (1228)
Q Consensus 915 ~~lvSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~ 994 (1228)
..|-.++++.++.+|.--|+-.-+. .|+++|+.||++- .+.++ ...++.++++.. ..+-
T Consensus 399 ~~lg~I~av~vs~dGK~~vvaNdr~------------el~vididngnv~---~idkS---~~~lItdf~~~~---nsr~ 457 (668)
T COG4946 399 KDLGNIEAVKVSPDGKKVVVANDRF------------ELWVIDIDNGNVR---LIDKS---EYGLITDFDWHP---NSRW 457 (668)
T ss_pred CCccceEEEEEcCCCcEEEEEcCce------------EEEEEEecCCCee---Eeccc---ccceeEEEEEcC---Ccee
Confidence 4578888999988887666644443 6899999998763 22111 123555555542 2455
Q ss_pred EEEEEcCC--CCeEEEEECCCCeEEEeec
Q psy11781 995 FAYVNDVF--RYGLIVYDFFKNTSYRLTH 1021 (1228)
Q Consensus 995 ~aYItD~~--~~gLIVydl~~g~swRv~h 1021 (1228)
+||-==++ ...|=+||+.++|..++..
T Consensus 458 iAYafP~gy~tq~Iklydm~~~Kiy~vTT 486 (668)
T COG4946 458 IAYAFPEGYYTQSIKLYDMDGGKIYDVTT 486 (668)
T ss_pred EEEecCcceeeeeEEEEecCCCeEEEecC
Confidence 66654444 3678999999999999974
No 101
>PRK04922 tolB translocation protein TolB; Provisional
Probab=56.10 E-value=1.1e+02 Score=37.05 Aligned_cols=126 Identities=16% Similarity=0.061 Sum_probs=64.0
Q ss_pred cCCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeeeecC-cccc
Q psy11781 59 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTHP-YMYP 136 (1228)
Q Consensus 59 ~~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h~-~~~~ 136 (1228)
+.+.|..+|+++++..+--.++... ...++ ..-|+-+||..+. +..-|.++|+.+++..++.+. ...-
T Consensus 226 ~~~~l~~~dl~~g~~~~l~~~~g~~-------~~~~~---SpDG~~l~~~~s~~g~~~Iy~~d~~~g~~~~lt~~~~~~~ 295 (433)
T PRK04922 226 GRSAIYVQDLATGQRELVASFRGIN-------GAPSF---SPDGRRLALTLSRDGNPEIYVMDLGSRQLTRLTNHFGIDT 295 (433)
T ss_pred CCcEEEEEECCCCCEEEeccCCCCc-------cCceE---CCCCCEEEEEEeCCCCceEEEEECCCCCeEECccCCCCcc
Confidence 4577888888887653322232211 11222 2236667787665 556899999999988777332 1111
Q ss_pred CCCccceeeccceeeec---cCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEecc
Q psy11781 137 EPTQSTYILDNLKFRWV---DGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLS 206 (1228)
Q Consensus 137 ~p~~~~~~~~~~~f~~~---~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~ls 206 (1228)
. ..|.-+|..+-+. .|-+-++.-....|+.+++... ++....+++|+||+.|+|....
T Consensus 296 ~---~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~---------g~~~~~~~~SpDG~~Ia~~~~~ 356 (433)
T PRK04922 296 E---PTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQ---------GNYNARASVSPDGKKIAMVHGS 356 (433)
T ss_pred c---eEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecC---------CCCccCEEECCCCCEEEEEECC
Confidence 1 1222233222211 2222222222223344443321 1122257899999999997653
No 102
>PRK03629 tolB translocation protein TolB; Provisional
Probab=55.20 E-value=88 Score=38.11 Aligned_cols=31 Identities=16% Similarity=0.087 Sum_probs=25.0
Q ss_pred ecCCceEEEEeeC-CcceEEEEEccCCceeee
Q psy11781 99 EDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRL 129 (1228)
Q Consensus 99 ~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~ 129 (1228)
...|+-+||.+|. +..-|.++|+.+++.-|+
T Consensus 295 SPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~l 326 (429)
T PRK03629 295 FPDSQNLAYTSDQAGRPQVYKVNINGGAPQRI 326 (429)
T ss_pred CCCCCEEEEEeCCCCCceEEEEECCCCCeEEe
Confidence 4457889999987 567888999998887776
No 103
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=54.49 E-value=3.1e+02 Score=30.87 Aligned_cols=41 Identities=24% Similarity=0.095 Sum_probs=30.3
Q ss_pred EEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCC
Q psy11781 470 VFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPT 521 (1228)
Q Consensus 470 V~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~ 521 (1228)
.|++..|.. -...-+|. .+.-||-++||.+++++..-.|++
T Consensus 48 TQGL~~~~g--~i~esTG~---------yg~S~ir~~~L~~gq~~~s~~l~~ 88 (262)
T COG3823 48 TQGLEYLDG--HILESTGL---------YGFSKIRVSDLTTGQEIFSEKLAP 88 (262)
T ss_pred hcceeeeCC--EEEEeccc---------cccceeEEEeccCceEEEEeecCC
Confidence 467777644 33445664 466899999999999999888873
No 104
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=52.17 E-value=7.2e+02 Score=33.13 Aligned_cols=83 Identities=18% Similarity=0.210 Sum_probs=54.9
Q ss_pred CEEEEEECCCCcEEEEEECCCC--cc----------cCC-Cccee-eEEEeecCCCceEEEEEcCCCCeEEEEECCCCe-
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTA--QV----------FEG-SLFSN-IVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNT- 1015 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~--~~----------~~~-S~l~d-i~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~- 1015 (1228)
-.|.++|++|+|++-+.++... .. +.+ ..+.. ++. ..+.+|++=...+-|..||.++|+
T Consensus 641 G~l~AiDl~tGk~~W~~~~g~~~~~~p~~~~~~~~~~~g~p~~gG~l~T------agglvF~~gt~d~~l~A~D~~tGk~ 714 (764)
T TIGR03074 641 GYMAAIDLKTGKVVWQHPNGTVRDTGPMGIRMPLPIPIGVPTLGGPLAT------AGGLVFIGATQDNYLRAYDLSTGKE 714 (764)
T ss_pred EEEEEEECCCCcEeeeeECCccccccccccccccccccCCcccCCcEEE------cCCEEEEEeCCCCEEEEEECCCCce
Confidence 4799999999999988888321 00 001 11122 444 368899953355789999998876
Q ss_pred EEEeecC-CcCcCCCcceeE-EcCEEEE
Q psy11781 1016 SYRLTHP-YMYPEPTQSTYI-LDNLKFR 1041 (1228)
Q Consensus 1016 swRv~h~-s~~pdP~~~~~~-i~G~~f~ 1041 (1228)
-||..-+ .....| .++. ++|+++-
T Consensus 715 lW~~~l~~~~~a~P--~tY~~~~GkQYV 740 (764)
T TIGR03074 715 LWKARLPAGGQATP--MTYMGKDGKQYV 740 (764)
T ss_pred eeEeeCCCCcccCC--EEEEecCCEEEE
Confidence 6888743 455555 4677 7888874
No 105
>PF07494 Reg_prop: Two component regulator propeller; InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=51.51 E-value=14 Score=26.75 Aligned_cols=16 Identities=25% Similarity=0.590 Sum_probs=13.4
Q ss_pred EEEEEEEeCCCcEEEE
Q psy11781 469 SVFRMSVDKCDRLWIM 484 (1228)
Q Consensus 469 sV~~v~iD~~gRLWVL 484 (1228)
.|+.|..|++|+|||-
T Consensus 6 ~I~~i~~D~~G~lWig 21 (24)
T PF07494_consen 6 NIYSIYEDSDGNLWIG 21 (24)
T ss_dssp CEEEEEE-TTSCEEEE
T ss_pred eEEEEEEcCCcCEEEE
Confidence 5889999999999984
No 106
>PRK01029 tolB translocation protein TolB; Provisional
Probab=51.49 E-value=3.8e+02 Score=32.80 Aligned_cols=28 Identities=7% Similarity=0.011 Sum_probs=24.4
Q ss_pred eEEEEeeC-CcceEEEEEccCCceeeeec
Q psy11781 104 VFAYVNDV-FRYGLIVYDFFKNTSYRLTH 131 (1228)
Q Consensus 104 ~~aYi~D~-~~~g~iv~dl~~~~swR~~h 131 (1228)
.|||+++. +++.|-+.|+.+|+..|+++
T Consensus 200 ~~~y~S~~~g~~~I~~~~l~~g~~~~lt~ 228 (428)
T PRK01029 200 PYLYVSYKLGVPKIFLGSLENPAGKKILA 228 (428)
T ss_pred EEEEEEccCCCceEEEEECCCCCceEeec
Confidence 48899997 68899999999999999844
No 107
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=51.37 E-value=1.9e+02 Score=34.38 Aligned_cols=133 Identities=18% Similarity=0.170 Sum_probs=69.6
Q ss_pred CCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeeeec-CccccC
Q psy11781 60 PPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTH-PYMYPE 137 (1228)
Q Consensus 60 ~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h-~~~~~~ 137 (1228)
.+.|.++|+.+++..+--.++... ....+ ...|+.++|..+. +..-|.++|+++++.-++.+ ....-.
T Consensus 213 ~~~i~v~d~~~g~~~~~~~~~~~~-------~~~~~---spDg~~l~~~~~~~~~~~i~~~d~~~~~~~~l~~~~~~~~~ 282 (417)
T TIGR02800 213 KPEIYVQDLATGQREKVASFPGMN-------GAPAF---SPDGSKLAVSLSKDGNPDIYVMDLDGKQLTRLTNGPGIDTE 282 (417)
T ss_pred CcEEEEEECCCCCEEEeecCCCCc-------cceEE---CCCCCEEEEEECCCCCccEEEEECCCCCEEECCCCCCCCCC
Confidence 378888898888654332232211 11222 2336778887765 56689999999998877733 211111
Q ss_pred CCccceeeccceeeec---cCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEeccc--CeeEE
Q psy11781 138 PTQSTYILDNLKFRWV---DGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSS--NRHYY 212 (1228)
Q Consensus 138 p~~~~~~~~~~~f~~~---~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~lsS--~~~~s 212 (1228)
| .+.-+|..+-+. .|.+-++.-....++.+++... .+....+++|+||+.|+|..... .++|.
T Consensus 283 ~---~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~---------~~~~~~~~~spdg~~i~~~~~~~~~~~i~~ 350 (417)
T TIGR02800 283 P---SWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFR---------GGYNASPSWSPDGDLIAFVHREGGGFNIAV 350 (417)
T ss_pred E---EECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecC---------CCCccCeEECCCCCEEEEEEccCCceEEEE
Confidence 2 222344332221 2222222222223333333221 12223678899999999987754 24555
Q ss_pred Ee
Q psy11781 213 VS 214 (1228)
Q Consensus 213 V~ 214 (1228)
+.
T Consensus 351 ~d 352 (417)
T TIGR02800 351 MD 352 (417)
T ss_pred Ee
Confidence 54
No 108
>KOG2055|consensus
Probab=50.78 E-value=2e+02 Score=35.40 Aligned_cols=217 Identities=17% Similarity=0.186 Sum_probs=108.1
Q ss_pred CEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcCCCC
Q psy11781 501 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQ 580 (1228)
Q Consensus 501 PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pdP~~ 580 (1228)
-+|..+|-+++..++.+.|- .+-+....+- .|+.. .-++=..+.-+.+||+.+.+.-++...+-.+++..
T Consensus 237 lrifqvDGk~N~~lqS~~l~------~fPi~~a~f~---p~G~~-~i~~s~rrky~ysyDle~ak~~k~~~~~g~e~~~~ 306 (514)
T KOG2055|consen 237 LRIFQVDGKVNPKLQSIHLE------KFPIQKAEFA---PNGHS-VIFTSGRRKYLYSYDLETAKVTKLKPPYGVEEKSM 306 (514)
T ss_pred EEEEEecCccChhheeeeec------cCccceeeec---CCCce-EEEecccceEEEEeeccccccccccCCCCcccchh
Confidence 45999999999988888873 2344555554 13322 44555567789999999998888875443333333
Q ss_pred cceEEc-CeEeeecCCcceee-c-CcccCCcccCCCccccccccccccCccccccccCCCceEEEeeCCCCceEEEechh
Q psy11781 581 STYILD-NLKFRWVDGIFGMA-I-SPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTD 657 (1228)
Q Consensus 581 ~~~~i~-G~~f~~~dGi~GIA-L-sp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~V~T~~ 657 (1228)
..|.|. -.+|....|.+|-- | .... .++.+ -=.++|.-.+++.+.+++.|+-.--.| .+|-.+
T Consensus 307 e~FeVShd~~fia~~G~~G~I~lLhakT--------~eli~--s~KieG~v~~~~fsSdsk~l~~~~~~G-eV~v~n--- 372 (514)
T KOG2055|consen 307 ERFEVSHDSNFIAIAGNNGHIHLLHAKT--------KELIT--SFKIEGVVSDFTFSSDSKELLASGGTG-EVYVWN--- 372 (514)
T ss_pred heeEecCCCCeEEEcccCceEEeehhhh--------hhhhh--eeeeccEEeeEEEecCCcEEEEEcCCc-eEEEEe---
Confidence 333221 11122223332210 0 0000 00000 002233333477788888888776666 454443
Q ss_pred hcCCCCCCCccccccceEeeccccccccccceEEccCCcEEEEcCCCCeEEEEeCCCCccCCCCeeEEEecCCceecCCc
Q psy11781 658 LRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPND 737 (1228)
Q Consensus 658 Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D~~G~LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~ 737 (1228)
|+... ....+++-|.. ....+++..+|.++-+-...+-|..+|-+ ..+...+-.-|..=+.---=.+.
T Consensus 373 l~~~~-------~~~rf~D~G~v----~gts~~~S~ng~ylA~GS~~GiVNIYd~~-s~~~s~~PkPik~~dNLtt~Its 440 (514)
T KOG2055|consen 373 LRQNS-------CLHRFVDDGSV----HGTSLCISLNGSYLATGSDSGIVNIYDGN-SCFASTNPKPIKTVDNLTTAITS 440 (514)
T ss_pred cCCcc-------eEEEEeecCcc----ceeeeeecCCCceEEeccCcceEEEeccc-hhhccCCCCchhhhhhhheeeee
Confidence 32222 12233333322 23455666777755555556656666643 22221111111100100012456
Q ss_pred eEEcCCCcEEEEecCc
Q psy11781 738 LKIDPNDNIWILSNKL 753 (1228)
Q Consensus 738 l~Id~dg~LyVlsnrl 753 (1228)
|.+.++++|..++++.
T Consensus 441 l~Fn~d~qiLAiaS~~ 456 (514)
T KOG2055|consen 441 LQFNHDAQILAIASRV 456 (514)
T ss_pred eeeCcchhhhhhhhhc
Confidence 7777888876666655
No 109
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=50.66 E-value=28 Score=28.90 Aligned_cols=26 Identities=27% Similarity=0.512 Sum_probs=20.4
Q ss_pred eeEEEEeecCCCCeEEEEecCCCCeEEEEEcCC
Q psy11781 531 SNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFK 563 (1228)
Q Consensus 531 ~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~ 563 (1228)
.++.|. +.||||+|. ..||.|+|..+
T Consensus 5 ~~v~v~------g~yaYva~~-~~Gl~IvDISn 30 (42)
T PF08309_consen 5 RDVAVS------GNYAYVADG-NNGLVIVDISN 30 (42)
T ss_pred EEEEEE------CCEEEEEeC-CCCEEEEECCC
Confidence 455664 789999984 57999999876
No 110
>PF07494 Reg_prop: Two component regulator propeller; InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=50.15 E-value=17 Score=26.36 Aligned_cols=16 Identities=25% Similarity=0.590 Sum_probs=13.3
Q ss_pred EEEEEEEeCCCcEEEE
Q psy11781 919 SVFRMSVDKCDRLWIM 934 (1228)
Q Consensus 919 SV~~v~iD~~grLWVL 934 (1228)
.|+.|..|++|+|||-
T Consensus 6 ~I~~i~~D~~G~lWig 21 (24)
T PF07494_consen 6 NIYSIYEDSDGNLWIG 21 (24)
T ss_dssp CEEEEEE-TTSCEEEE
T ss_pred eEEEEEEcCCcCEEEE
Confidence 5789999999999984
No 111
>KOG0268|consensus
Probab=49.88 E-value=1.1e+02 Score=36.52 Aligned_cols=180 Identities=12% Similarity=0.183 Sum_probs=101.0
Q ss_pred CCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEeecCCcCcCC
Q psy11781 949 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEP 1028 (1228)
Q Consensus 949 ~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv~h~s~~pdP 1028 (1228)
|..+|-+||+..+.-++.+....+.+.. +.... .++.+--+=....+|++||+..++--+=
T Consensus 165 cGe~i~IWD~~R~~Pv~smswG~Dti~s------vkfNp----vETsILas~~sDrsIvLyD~R~~~Pl~K--------- 225 (433)
T KOG0268|consen 165 CGEQIDIWDEQRDNPVSSMSWGADSISS------VKFNP----VETSILASCASDRSIVLYDLRQASPLKK--------- 225 (433)
T ss_pred cCceeeecccccCCccceeecCCCceeE------EecCC----CcchheeeeccCCceEEEecccCCccce---------
Confidence 8889999999999998888886552211 11111 1333433334568999999998754331
Q ss_pred CcceeEEcCEEEEeeCCccceecCccccccccCCCccccccccccccCccccceecCCCceEEEEeCCCCceEEEecchh
Q psy11781 1029 TQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTDL 1108 (1228)
Q Consensus 1029 ~~~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~~~di~ls~d~r~LYf~plss~~ly~V~T~~L 1108 (1228)
+.+.- -..+|+-.| |.|++-... .+ ..||.-+.+.|
T Consensus 226 ----Vi~~m-------RTN~IswnP-----------eafnF~~a~------------ED----------~nlY~~DmR~l 261 (433)
T KOG0268|consen 226 ----VILTM-------RTNTICWNP-----------EAFNFVAAN------------ED----------HNLYTYDMRNL 261 (433)
T ss_pred ----eeeec-------cccceecCc-----------cccceeecc------------cc----------ccceehhhhhh
Confidence 11000 011233332 233322110 11 23566666666
Q ss_pred cCCcccCCccccccceEEeccccCCCCcccEEEc--CCCcEEEEecCCCeEEEEeCCCCccccCCeeEEEecCCceecCC
Q psy11781 1109 RNSSRYVNSSDIDEYFHYLGSRFRNTQASASAIN--SNGVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFPN 1186 (1228)
Q Consensus 1109 r~~~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D--~~G~LY~t~~~~~aI~~wn~~~~~~~~~n~~~v~~~~~~L~wPd 1186 (1228)
..+.. |. ++- -+.-|.+| .+|.=|++-.=..+|..++.+ .- .-+-++. ..|+|-.-
T Consensus 262 ~~p~~----------v~----~dh--vsAV~dVdfsptG~EfvsgsyDksIRIf~~~-~~----~SRdiYh-tkRMq~V~ 319 (433)
T KOG0268|consen 262 SRPLN----------VH----KDH--VSAVMDVDFSPTGQEFVSGSYDKSIRIFPVN-HG----HSRDIYH-TKRMQHVF 319 (433)
T ss_pred cccch----------hh----ccc--ceeEEEeccCCCcchhccccccceEEEeecC-CC----cchhhhh-Hhhhheee
Confidence 65442 00 110 13445555 888888888878888888876 22 1122444 35788887
Q ss_pred ceEEeCCCcEEEEeCCchhhhcCCCCCCcccEEEEEEec
Q psy11781 1187 DLKIDPNDNIWILSNKLHQYLYGFLDFNVYNYRILVANS 1225 (1228)
Q Consensus 1187 ~l~id~dg~Lyvlsn~l~~~~~~~~d~~e~nfrI~r~~v 1225 (1228)
+++...|. -||+| | .++.|.|+||++-
T Consensus 320 ~Vk~S~Ds-kyi~S--------G---Sdd~nvRlWka~A 346 (433)
T KOG0268|consen 320 CVKYSMDS-KYIIS--------G---SDDGNVRLWKAKA 346 (433)
T ss_pred EEEEeccc-cEEEe--------c---CCCcceeeeecch
Confidence 88876653 34544 2 4577888888753
No 112
>PRK02889 tolB translocation protein TolB; Provisional
Probab=48.88 E-value=1.9e+02 Score=35.27 Aligned_cols=128 Identities=15% Similarity=0.073 Sum_probs=63.5
Q ss_pred cCCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeeeecCccccC
Q psy11781 59 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRLTHPYMYPE 137 (1228)
Q Consensus 59 ~~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~h~~~~~~ 137 (1228)
+.|.|..+||.+++..+--.++.... ..++ ..-|+-+||..+- +..-|-++|+.+++..|+++....
T Consensus 218 ~~~~I~~~dl~~g~~~~l~~~~g~~~-------~~~~---SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~-- 285 (427)
T PRK02889 218 KKPVVYVHDLATGRRRVVANFKGSNS-------APAW---SPDGRTLAVALSRDGNSQIYTVNADGSGLRRLTQSSGI-- 285 (427)
T ss_pred CCcEEEEEECCCCCEEEeecCCCCcc-------ceEE---CCCCCEEEEEEccCCCceEEEEECCCCCcEECCCCCCC--
Confidence 35788888888886543222332110 1111 2225567776543 667788899988887776332110
Q ss_pred CCccceeeccceeeec---cCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEeccc
Q psy11781 138 PTQSTYILDNLKFRWV---DGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSS 207 (1228)
Q Consensus 138 p~~~~~~~~~~~f~~~---~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~lsS 207 (1228)
.....+.-+|..+-+. .|.+-++.-....|+.+++... ++.....++|+||+.|+|..-.+
T Consensus 286 ~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~---------g~~~~~~~~SpDG~~Ia~~s~~~ 349 (427)
T PRK02889 286 DTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFT---------GSYNTSPRISPDGKLLAYISRVG 349 (427)
T ss_pred CcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecC---------CCCcCceEECCCCCEEEEEEccC
Confidence 0111233333332211 2333333222222333333221 12233578899999999976543
No 113
>KOG1446|consensus
Probab=47.27 E-value=5.4e+02 Score=30.34 Aligned_cols=183 Identities=11% Similarity=0.161 Sum_probs=0.0
Q ss_pred EEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEeecCCcCcCCCcce
Q psy11781 953 IMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQST 1032 (1228)
Q Consensus 953 LvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv~h~s~~pdP~~~~ 1032 (1228)
|--.+|.+|+.+| .|+.. .-..+.|.+ |...-.+|+=+-...|-.||++..+=.-+++.+-.|-+ .
T Consensus 82 IryLsl~dNkylR--YF~GH----~~~V~sL~~-----sP~~d~FlS~S~D~tvrLWDlR~~~cqg~l~~~~~pi~---A 147 (311)
T KOG1446|consen 82 IRYLSLHDNKYLR--YFPGH----KKRVNSLSV-----SPKDDTFLSSSLDKTVRLWDLRVKKCQGLLNLSGRPIA---A 147 (311)
T ss_pred eEEEEeecCceEE--EcCCC----CceEEEEEe-----cCCCCeEEecccCCeEEeeEecCCCCceEEecCCCcce---e
Q ss_pred eEEcCEEEEeeCCccceecCccccccccCCCccccccccccccCcccc-ceecCCCceEEEEeCCCCceEEEecchhcCC
Q psy11781 1033 YILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVD-KTIRDDQRYMYFHSLSSNRHYYVSTTDLRNS 1111 (1228)
Q Consensus 1033 ~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~~~d-i~ls~d~r~LYf~plss~~ly~V~T~~Lr~~ 1111 (1228)
|.-.|.-|....|-..|-|..... +-+-|-.+|.++. .+--... |+-||||+.|--+...|. .|.|+
T Consensus 148 fDp~GLifA~~~~~~~IkLyD~Rs--~dkgPF~tf~i~~--~~~~ew~~l~FS~dGK~iLlsT~~s~-~~~lD------- 215 (311)
T KOG1446|consen 148 FDPEGLIFALANGSELIKLYDLRS--FDKGPFTTFSITD--NDEAEWTDLEFSPDGKSILLSTNASF-IYLLD------- 215 (311)
T ss_pred ECCCCcEEEEecCCCeEEEEEecc--cCCCCceeEccCC--CCccceeeeEEcCCCCEEEEEeCCCc-EEEEE-------
Q ss_pred cccCCccccccceEEeccccCCCCcccEEEcCCCcEEEEecCCCeEEEEeCCCCc
Q psy11781 1112 SRYVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNLVTKHSVGCWNTKTKV 1166 (1228)
Q Consensus 1112 ~~~~~~~~~~~~v~~lG~rg~~sqs~g~~~D~~G~LY~t~~~~~aI~~wn~~~~~ 1166 (1228)
+-+.+....|+..-..+ .-+...++-.+|...++-.....|..|+.+++.
T Consensus 216 ---Af~G~~~~tfs~~~~~~--~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~ 265 (311)
T KOG1446|consen 216 ---AFDGTVKSTFSGYPNAG--NLPLSATFTPDSKFVLSGSDDGTIHVWNLETGK 265 (311)
T ss_pred ---ccCCcEeeeEeeccCCC--CcceeEEECCCCcEEEEecCCCcEEEEEcCCCc
No 114
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=46.39 E-value=35 Score=28.30 Aligned_cols=20 Identities=35% Similarity=0.519 Sum_probs=17.2
Q ss_pred ceEEEEEcCCCCeEEEEECCC
Q psy11781 993 HVFAYVNDVFRYGLIVYDFFK 1013 (1228)
Q Consensus 993 ~~~aYItD~~~~gLIVydl~~ 1013 (1228)
+.||||+|.. .||.|+|..+
T Consensus 11 g~yaYva~~~-~Gl~IvDISn 30 (42)
T PF08309_consen 11 GNYAYVADGN-NGLVIVDISN 30 (42)
T ss_pred CCEEEEEeCC-CCEEEEECCC
Confidence 7899999754 6999999986
No 115
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=44.87 E-value=45 Score=26.23 Aligned_cols=27 Identities=22% Similarity=0.106 Sum_probs=22.9
Q ss_pred CceEEEEEcCCCCeEEEEECCCCeEEE
Q psy11781 992 DHVFAYVNDVFRYGLIVYDFFKNTSYR 1018 (1228)
Q Consensus 992 ~~~~aYItD~~~~gLIVydl~~g~swR 1018 (1228)
++.++|+++.+...|.++|..+++.-+
T Consensus 2 d~~~lyv~~~~~~~v~~id~~~~~~~~ 28 (42)
T TIGR02276 2 DGTKLYVTNSGSNTVSVIDTATNKVIA 28 (42)
T ss_pred CCCEEEEEeCCCCEEEEEECCCCeEEE
Confidence 467899999999999999998876544
No 116
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=43.09 E-value=2.4e+02 Score=34.76 Aligned_cols=127 Identities=17% Similarity=0.173 Sum_probs=79.4
Q ss_pred CeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC-CcceEEEEEccCCceeee-------ecC
Q psy11781 61 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV-FRYGLIVYDFFKNTSYRL-------THP 132 (1228)
Q Consensus 61 pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~-~~~g~iv~dl~~~~swR~-------~h~ 132 (1228)
|+|...||++++.-+--.|+.....+.-.. | |.-+|+..+- +...|-|+|+.++.-+|+ .||
T Consensus 218 ~~i~~~~l~~g~~~~i~~~~g~~~~P~fsp-----D-----G~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~~~gi~~~P 287 (425)
T COG0823 218 PRIYYLDLNTGKRPVILNFNGNNGAPAFSP-----D-----GSKLAFSSSRDGSPDIYLMDLDGKNLPRLTNGFGINTSP 287 (425)
T ss_pred ceEEEEeccCCccceeeccCCccCCccCCC-----C-----CCEEEEEECCCCCccEEEEcCCCCcceecccCCccccCc
Confidence 899999999997666655665433221100 1 3455555555 788888889888888887 568
Q ss_pred ccccCCCccceeeccceeeeccCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEecccCeeEE
Q psy11781 133 YMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYY 212 (1228)
Q Consensus 133 ~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~lsS~~~~s 212 (1228)
++.||-...=|+-+ ..|.|=+++-.+..++.+|+... .|.+.-..+|+||+.+=|-. ++...+.
T Consensus 288 s~spdG~~ivf~Sd------r~G~p~I~~~~~~g~~~~riT~~---------~~~~~~p~~SpdG~~i~~~~-~~~g~~~ 351 (425)
T COG0823 288 SWSPDGSKIVFTSD------RGGRPQIYLYDLEGSQVTRLTFS---------GGGNSNPVWSPDGDKIVFES-SSGGQWD 351 (425)
T ss_pred cCCCCCCEEEEEeC------CCCCcceEEECCCCCceeEeecc---------CCCCcCccCCCCCCEEEEEe-ccCCcee
Confidence 88888777755533 35655444444434445555542 13333567899999999988 3333333
Q ss_pred E
Q psy11781 213 V 213 (1228)
Q Consensus 213 V 213 (1228)
|
T Consensus 352 i 352 (425)
T COG0823 352 I 352 (425)
T ss_pred e
Confidence 3
No 117
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=42.95 E-value=62 Score=40.67 Aligned_cols=68 Identities=21% Similarity=0.393 Sum_probs=45.9
Q ss_pred ccceEEc-cCCcEEEEcCCCC----------------eEEEEeCCCCccCCCC--eeE--EEecCCceec----------
Q psy11781 686 ASASAIN-SNGVMFYNLVTKH----------------SVGCWNTKTKVYLPQT--QDI--VQTSRDILNF---------- 734 (1228)
Q Consensus 686 s~G~a~D-~~G~LY~t~~~~~----------------aI~~wn~~~~~~~~~n--~~i--V~~~d~~L~w---------- 734 (1228)
+.++++. .+|.+||++.+.. .|++|-+.++.++... ..+ .+.+...+.-
T Consensus 419 pE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~~~~~ 498 (616)
T COG3211 419 PEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANINANW 498 (616)
T ss_pred ccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccCccccc
Confidence 5667776 6778999988765 5899988743332221 122 2233333444
Q ss_pred ---CCceEEcCCCcEEEEecCc
Q psy11781 735 ---PNDLKIDPNDNIWILSNKL 753 (1228)
Q Consensus 735 ---Pd~l~Id~dg~LyVlsnrl 753 (1228)
||+|+||+.|.|||.+..-
T Consensus 499 f~~PDnl~fD~~GrLWi~TDg~ 520 (616)
T COG3211 499 FNSPDNLAFDPWGRLWIQTDGS 520 (616)
T ss_pred ccCCCceEECCCCCEEEEecCC
Confidence 9999999999999999654
No 118
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=39.87 E-value=49 Score=38.49 Aligned_cols=31 Identities=26% Similarity=0.522 Sum_probs=25.6
Q ss_pred CCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCC
Q psy11781 527 GSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKN 564 (1228)
Q Consensus 527 ~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g 564 (1228)
.+.++|++| .+.+||++| +..||.++|..+=
T Consensus 86 ~~l~~Dv~v------se~yvyvad-~ssGL~IvDIS~P 116 (370)
T COG5276 86 RDLFADVRV------SEEYVYVAD-WSSGLRIVDISTP 116 (370)
T ss_pred hhhhheeEe------cccEEEEEc-CCCceEEEeccCC
Confidence 356788888 478999999 7789999999873
No 119
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=39.20 E-value=71 Score=40.22 Aligned_cols=69 Identities=22% Similarity=0.393 Sum_probs=45.3
Q ss_pred cccEEEc-CCCcEEEEecCCC----------------eEEEEeCCCCccccC--CeeE--EEecCCceec----------
Q psy11781 1136 ASASAIN-SNGVMFYNLVTKH----------------SVGCWNTKTKVYLPQ--TQDI--VQTSRDILNF---------- 1184 (1228)
Q Consensus 1136 s~g~~~D-~~G~LY~t~~~~~----------------aI~~wn~~~~~~~~~--n~~~--v~~~~~~L~w---------- 1184 (1228)
+.++++. .+|.+||++.++. .|.+|-+.++-+... ..++ .+.+...+.-
T Consensus 419 pE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~~~~~ 498 (616)
T COG3211 419 PEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANINANW 498 (616)
T ss_pred ccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccCccccc
Confidence 3456665 5677999988766 489998874322111 1222 2333334555
Q ss_pred ---CCceEEeCCCcEEEEeCCch
Q psy11781 1185 ---PNDLKIDPNDNIWILSNKLH 1204 (1228)
Q Consensus 1185 ---Pd~l~id~dg~Lyvlsn~l~ 1204 (1228)
||++++|+.|.|||.+.+-.
T Consensus 499 f~~PDnl~fD~~GrLWi~TDg~~ 521 (616)
T COG3211 499 FNSPDNLAFDPWGRLWIQTDGSG 521 (616)
T ss_pred ccCCCceEECCCCCEEEEecCCC
Confidence 99999999999999996544
No 120
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=39.14 E-value=5.7e+02 Score=30.62 Aligned_cols=82 Identities=23% Similarity=0.380 Sum_probs=47.4
Q ss_pred CcccceEEeCCeEEEEcccCCCCCCcEEEEEECCCCCCCCcccCCCCcCcCCCCCCCcceEEEEEEEe-CCCcEEEEeCC
Q psy11781 409 NLPLGIGIWRSTIFLSFPKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVD-KCDRLWIMDTG 487 (1228)
Q Consensus 409 ~~P~gv~v~~gRiFvt~PR~~~gvP~TL~~v~~~~~~~sp~l~PYP~~~wn~~~~c~~lvsV~~v~iD-~~gRLWVLDtG 487 (1228)
....++.+.+++++++. ..+|. ..|..+... .+......|-|. .-+|..+.-+ ..+.||+.=+|
T Consensus 327 ~~l~~~~~~~~~Lvl~~--~~~~~-~~l~v~~~~-~~~~~~~~~~p~-----------~g~v~~~~~~~~~~~~~~~~ss 391 (414)
T PF02897_consen 327 VSLEDVSLFKDYLVLSY--RENGS-SRLRVYDLD-DGKESREIPLPE-----------AGSVSGVSGDFDSDELRFSYSS 391 (414)
T ss_dssp EEEEEEEEETTEEEEEE--EETTE-EEEEEEETT--TEEEEEEESSS-----------SSEEEEEES-TT-SEEEEEEEE
T ss_pred eeEEEEEEECCEEEEEE--EECCc-cEEEEEECC-CCcEEeeecCCc-----------ceEEeccCCCCCCCEEEEEEeC
Confidence 35688888899999987 33443 466666653 011112223332 1223333333 35567776555
Q ss_pred CCCcccCcCccCCCEEEEEECCCCcEE
Q psy11781 488 VTNILSSIQQLCPPKIMVFDLKTNTLI 514 (1228)
Q Consensus 488 ~~~~~~~~~~~~~PKLvvfDL~td~vi 514 (1228)
.. .|+.+..+|++++++.
T Consensus 392 ~~---------~P~~~y~~d~~t~~~~ 409 (414)
T PF02897_consen 392 FT---------TPPTVYRYDLATGELT 409 (414)
T ss_dssp TT---------EEEEEEEEETTTTCEE
T ss_pred CC---------CCCEEEEEECCCCCEE
Confidence 43 5889999999999865
No 121
>KOG0282|consensus
Probab=38.60 E-value=37 Score=41.44 Aligned_cols=133 Identities=11% Similarity=0.155 Sum_probs=78.1
Q ss_pred CEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEc-CCCCC----
Q psy11781 501 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLT-HPYMY---- 575 (1228)
Q Consensus 501 PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~-~~s~~---- 575 (1228)
-||+.||.+++++++.|.-. -...++|.+= +++-=+|+=+....++||+...+---... .+.|+
T Consensus 322 ~ki~~wDiRs~kvvqeYd~h------Lg~i~~i~F~-----~~g~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~ 390 (503)
T KOG0282|consen 322 KKIRQWDIRSGKVVQEYDRH------LGAILDITFV-----DEGRRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPC 390 (503)
T ss_pred CcEEEEeccchHHHHHHHhh------hhheeeeEEc-----cCCceEeeeccCccEEEEEcCCCccchhhcchhhccCcc
Confidence 47999999999988766541 2345555542 23444566555668999998876544332 12222
Q ss_pred --cCCCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccCccccccccCCCceEEEeeCCCCceEEE
Q psy11781 576 --PEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYV 653 (1228)
Q Consensus 576 --pdP~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~V 653 (1228)
-.|.. .+ +..+.+.-..++|+ +. .+||.+..+.+. .|..+|+...+.-||||++|..+.-.|. +|--
T Consensus 391 ~~~~P~~-~~-~~aQs~dN~i~ifs-~~-----~~~r~nkkK~fe--Gh~vaGys~~v~fSpDG~~l~SGdsdG~-v~~w 459 (503)
T KOG0282|consen 391 LTLHPNG-KW-FAAQSMDNYIAIFS-TV-----PPFRLNKKKRFE--GHSVAGYSCQVDFSPDGRTLCSGDSDGK-VNFW 459 (503)
T ss_pred eecCCCC-Ce-ehhhccCceEEEEe-cc-----cccccCHhhhhc--ceeccCceeeEEEcCCCCeEEeecCCcc-EEEe
Confidence 12211 11 12222222234444 22 245666666553 4677788878899999999999988874 5555
Q ss_pred ec
Q psy11781 654 ST 655 (1228)
Q Consensus 654 ~T 655 (1228)
++
T Consensus 460 dw 461 (503)
T KOG0282|consen 460 DW 461 (503)
T ss_pred ec
Confidence 43
No 122
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=38.37 E-value=3.6e+02 Score=32.02 Aligned_cols=30 Identities=20% Similarity=0.092 Sum_probs=23.2
Q ss_pred CCceEEEEeeC-CcceEEEEEccCCceeeee
Q psy11781 101 CDHVFAYVNDV-FRYGLIVYDFFKNTSYRLT 130 (1228)
Q Consensus 101 ~~~~~aYi~D~-~~~g~iv~dl~~~~swR~~ 130 (1228)
.|+-+++.+|. +.+.|.++|+.+++..|+.
T Consensus 288 dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~ 318 (417)
T TIGR02800 288 DGKSIAFTSDRGGSPQIYMMDADGGEVRRLT 318 (417)
T ss_pred CCCEEEEEECCCCCceEEEEECCCCCEEEee
Confidence 46778888876 5678999999988876663
No 123
>PF05567 Neisseria_PilC: Neisseria PilC beta-propeller domain; InterPro: IPR008707 This domain is found in several PilC protein sequences from Neisseria gonorrhoeae and Neisseria meningitidis. PilC is a phase-variable protein associated with pilus-mediated adherence of pathogenic Neisseria to target cells [].; PDB: 3HX6_A.
Probab=38.12 E-value=58 Score=38.61 Aligned_cols=58 Identities=21% Similarity=0.226 Sum_probs=32.4
Q ss_pred CCCEEEEEECCC-CcEEEEEECCCCcccCCCcceee-EEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEeec
Q psy11781 949 CPPKIMVFDLKT-NTLIRKYILPTAQVFEGSLFSNI-VTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTH 1021 (1228)
Q Consensus 949 ~~pKLvvfDL~t-d~lir~y~~p~~~~~~~S~l~di-~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv~h 1021 (1228)
..++|+++|++| +++++++..+... ..+..+ .+|...+-.-.+||..|.+ |..||+.-
T Consensus 179 ~~~~lyi~d~~t~G~l~~~i~~~~~~----~gl~~~~~~D~d~DG~~D~vYaGDl~-----------GnlwR~dl 238 (335)
T PF05567_consen 179 GGAALYILDADTTGALIKKIDVPGGS----GGLSSPAVVDSDGDGYVDRVYAGDLG-----------GNLWRFDL 238 (335)
T ss_dssp --EEEEEEETTT---EEEEEEE--ST----T-EEEEEEE-TTSSSEE-EEEEEETT-----------SEEEEEE-
T ss_pred CCcEEEEEECCCCCceEEEEecCCCC----ccccccEEEeccCCCeEEEEEEEcCC-----------CcEEEEEC
Confidence 357999999999 9999999886542 134443 3454322223578988863 56677653
No 124
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=36.87 E-value=1e+02 Score=37.89 Aligned_cols=87 Identities=15% Similarity=0.112 Sum_probs=61.0
Q ss_pred CceEEEEeeC-Cc-ceEEEEEccCCceeee-------ecCccccCCCccceeeccceeeeccCeeeeecCcCCCCccccC
Q psy11781 102 DHVFAYVNDV-FR-YGLIVYDFFKNTSYRL-------THPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRH 172 (1228)
Q Consensus 102 ~~~~aYi~D~-~~-~g~iv~dl~~~~swR~-------~h~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~ 172 (1228)
++.+||.+=. ++ +-++++|+.+++.=++ .+|.|.||-..+-|... -||.+-|++-.-..++.+++
T Consensus 204 ~~~~~y~~f~~~~~~~i~~~~l~~g~~~~i~~~~g~~~~P~fspDG~~l~f~~~------rdg~~~iy~~dl~~~~~~~L 277 (425)
T COG0823 204 GKKLAYVSFELGGCPRIYYLDLNTGKRPVILNFNGNNGAPAFSPDGSKLAFSSS------RDGSPDIYLMDLDGKNLPRL 277 (425)
T ss_pred CCceEEEEEecCCCceEEEEeccCCccceeeccCCccCCccCCCCCCEEEEEEC------CCCCccEEEEcCCCCcceec
Confidence 5667777433 34 6799999999998888 34666666666544333 37877777776555443332
Q ss_pred CcccccccccccCCCcccccccCCCceEEEE
Q psy11781 173 PYEYYHYNVHHYNGTNVDKTIRDDQRYMYFH 203 (1228)
Q Consensus 173 ~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~ 203 (1228)
.+..|.+....+|+||+.+||.
T Consensus 278 ---------t~~~gi~~~Ps~spdG~~ivf~ 299 (425)
T COG0823 278 ---------TNGFGINTSPSWSPDGSKIVFT 299 (425)
T ss_pred ---------ccCCccccCccCCCCCCEEEEE
Confidence 3445888889999999999998
No 125
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=36.05 E-value=7.7e+02 Score=28.85 Aligned_cols=171 Identities=13% Similarity=0.135 Sum_probs=97.9
Q ss_pred EEEEEEeeeeeecCChHHHH--hhccC--CCCccCCCc-ccceEEe-CCeEEEEeccCCCCCCcEEEEEECCCCC-----
Q psy11781 827 QVVFRWKIMDFVFRDNKQKA--NLIRS--NKFIPENNL-PLGIGIW-RSTIFLSFPKWKAGIPFTLASFNMNDPS----- 895 (1228)
Q Consensus 827 ~~vy~Wk~ldf~fp~~~~r~--~ai~~--g~y~p~n~~-p~gv~v~-~gRiFvt~PR~~~gvP~TLa~v~~~~~~----- 895 (1228)
+++|+|+-+|..-+++.... ..... ....|...+ .+.|+.+ +|.++||+ | .--++..|+.....
T Consensus 107 evlfeW~a~DH~~~~~~~~~~~~~~~~g~~~~~~~D~~HiNsV~~~~~G~yLiS~-R----~~~~i~~I~~~tG~I~W~l 181 (299)
T PF14269_consen 107 EVLFEWSASDHVDPNDSYDSQDPLPGSGGSSSFPWDYFHINSVDKDDDGDYLISS-R----NTSTIYKIDPSTGKIIWRL 181 (299)
T ss_pred CEEEEEEhhheecccccccccccccCCCcCCCCCCCccEeeeeeecCCccEEEEe-c----ccCEEEEEECCCCcEEEEe
Confidence 79999999999776665521 11111 122233333 5667775 78899987 3 34578888743221
Q ss_pred -C----CCccccCCCCCcCCCCCCCcceEEEEEEEeCCCcEEEEeCCCCCccccccccCCCEEEEEECCCCc--EEEEEE
Q psy11781 896 -E----SPILLPYPNWSYFDDSNCNSLISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNT--LIRKYI 968 (1228)
Q Consensus 896 -~----sP~l~PYP~~~wn~~~~c~~lvSV~~v~iD~~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~--lir~y~ 968 (1228)
+ +-.+ +--.|.|+.. ..+.+.. +..++|=|.|.+.... ........+++.+|.++.+ +++++.
T Consensus 182 gG~~~~df~~-~~~~f~~QHd---ar~~~~~----~~~~~IslFDN~~~~~--~~~~~s~~~v~~ld~~~~~~~~~~~~~ 251 (299)
T PF14269_consen 182 GGKRNSDFTL-PATNFSWQHD---ARFLNES----NDDGTISLFDNANSDF--NGTEPSRGLVLELDPETMTVTLVREYS 251 (299)
T ss_pred CCCCCCcccc-cCCcEeeccC---CEEeccC----CCCCEEEEEcCCCCCC--CCCcCCCceEEEEECCCCEEEEEEEee
Confidence 0 1112 2333556543 1122111 5677888889865433 2344566899999999774 556665
Q ss_pred -CCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEe
Q psy11781 969 -LPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRL 1019 (1228)
Q Consensus 969 -~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv 1019 (1228)
=|..+.... .-+... + .++.+.|+....+.+.=|+....--|.+
T Consensus 252 ~~~~~~~s~~--~G~~Q~-L----~nGn~li~~g~~g~~~E~~~~G~vv~~~ 296 (299)
T PF14269_consen 252 DHPDGFYSPS--QGSAQR-L----PNGNVLIGWGNNGRISEFTPDGEVVWEA 296 (299)
T ss_pred cCCCcccccC--CCcceE-C----CCCCEEEecCCCceEEEECCCCCEEEEE
Confidence 334333221 111221 1 3578889888888888888777666654
No 126
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=35.89 E-value=56 Score=25.95 Aligned_cols=23 Identities=17% Similarity=0.216 Sum_probs=20.3
Q ss_pred eEEEEEccCCcEEEEEEcCCccc
Q psy11781 62 KIMVFDLKTNTLIRKYILPTAQV 84 (1228)
Q Consensus 62 kl~~~dl~t~~~~~~~~~p~~~~ 84 (1228)
.|.++|++||+++++|.....+.
T Consensus 11 ~l~AlD~~TG~~~W~~~~~~~~~ 33 (38)
T PF01011_consen 11 YLYALDAKTGKVLWKFQTGPPVD 33 (38)
T ss_dssp EEEEEETTTTSEEEEEESSSGGG
T ss_pred EEEEEECCCCCEEEeeeCCCCCc
Confidence 79999999999999999877543
No 127
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=35.00 E-value=1.5e+02 Score=35.07 Aligned_cols=57 Identities=11% Similarity=0.136 Sum_probs=36.5
Q ss_pred CEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeE-EEee
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTS-YRLT 1020 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~s-wRv~ 1020 (1228)
-+|++||++|++++.++.++.. .+...+++ ++.+|+.. ..+.|..+|.++|+. |+..
T Consensus 115 g~l~ald~~tG~~~W~~~~~~~------~~~~p~v~------~~~v~v~~-~~g~l~a~d~~tG~~~W~~~ 172 (377)
T TIGR03300 115 GEVIALDAEDGKELWRAKLSSE------VLSPPLVA------NGLVVVRT-NDGRLTALDAATGERLWTYS 172 (377)
T ss_pred CEEEEEECCCCcEeeeeccCce------eecCCEEE------CCEEEEEC-CCCeEEEEEcCCCceeeEEc
Confidence 4677777777777777666443 23345543 77777753 466677777777777 5554
No 128
>KOG0316|consensus
Probab=34.56 E-value=6.6e+02 Score=28.86 Aligned_cols=52 Identities=13% Similarity=0.204 Sum_probs=32.4
Q ss_pred CEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCC
Q psy11781 951 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFK 1013 (1228)
Q Consensus 951 pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~ 1013 (1228)
-.+.+||.+|||++|++.=... ..|.++.. ++.-+-.+=+....+=+||-++
T Consensus 81 k~v~vwDV~TGkv~Rr~rgH~a------qVNtV~fN-----eesSVv~SgsfD~s~r~wDCRS 132 (307)
T KOG0316|consen 81 KAVQVWDVNTGKVDRRFRGHLA------QVNTVRFN-----EESSVVASGSFDSSVRLWDCRS 132 (307)
T ss_pred ceEEEEEcccCeeeeecccccc------eeeEEEec-----CcceEEEeccccceeEEEEccc
Confidence 5799999999999998875443 24444443 3344445544455555555544
No 129
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=34.52 E-value=8.4e+02 Score=28.87 Aligned_cols=70 Identities=26% Similarity=0.299 Sum_probs=45.6
Q ss_pred CEEEEEECCCCcEEEEEECCCcccccCCCcee-EEEEeecCCCCeEEEEe----cCCCCeEEEEEcCCCeEEEEc-CCCC
Q psy11781 501 PKIMVFDLKTNTLIRKYILPTAQVFEGSLFSN-IVTEVVEDCDHVFAYVN----DVFRYGLIVYDFFKNTSYRLT-HPYM 574 (1228)
Q Consensus 501 PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~d-i~VD~~~~c~~~~aYIt----Ds~~~gLIVyDl~~g~swRv~-~~s~ 574 (1228)
--+++||..++++.+++.-|+ +..|.. -++. -++..+|.| +.+.+-|-|||... ...|+. .++.
T Consensus 28 ~~~~v~D~~~g~~~~~~~a~~-----gRHFyGHg~fs----~dG~~LytTEnd~~~g~G~IgVyd~~~-~~~ri~E~~s~ 97 (305)
T PF07433_consen 28 TFALVFDCRTGQLLQRLWAPP-----GRHFYGHGVFS----PDGRLLYTTENDYETGRGVIGVYDAAR-GYRRIGEFPSH 97 (305)
T ss_pred cEEEEEEcCCCceeeEEcCCC-----CCEEecCEEEc----CCCCEEEEeccccCCCcEEEEEEECcC-CcEEEeEecCC
Confidence 459999999999998665543 344443 3443 246789988 55677888999994 455554 3444
Q ss_pred CcCCCC
Q psy11781 575 YPEPTQ 580 (1228)
Q Consensus 575 ~pdP~~ 580 (1228)
--.|..
T Consensus 98 GIGPHe 103 (305)
T PF07433_consen 98 GIGPHE 103 (305)
T ss_pred CcChhh
Confidence 444443
No 130
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=32.41 E-value=8.7e+02 Score=28.40 Aligned_cols=96 Identities=8% Similarity=0.000 Sum_probs=53.9
Q ss_pred EEEEEEEeCCCcEEEE-------eCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCC-------------
Q psy11781 469 SVFRMSVDKCDRLWIM-------DTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGS------------- 528 (1228)
Q Consensus 469 sV~~v~iD~~gRLWVL-------DtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S------------- 528 (1228)
....+.+.++|++||. |.-..+.. ....+-.-.+.-+|+.|++++.++..-+.+....+
T Consensus 58 d~He~~it~~gt~lvt~~~~~~~dls~~gg~-~~g~i~d~~~~EiDi~TgevlfeW~a~DH~~~~~~~~~~~~~~~~g~~ 136 (299)
T PF14269_consen 58 DHHEFEITPDGTALVTAYNPTPADLSPVGGP-EDGWILDDVFQEIDIETGEVLFEWSASDHVDPNDSYDSQDPLPGSGGS 136 (299)
T ss_pred CccceEEcCCCcEEEEEccceeccccccCcC-CCccEecceeEEeccCCCCEEEEEEhhheecccccccccccccCCCcC
Confidence 3555666677777771 11111110 01112345688899999999988876433321111
Q ss_pred --------CceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCe-EEEEc
Q psy11781 529 --------LFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNT-SYRLT 570 (1228)
Q Consensus 529 --------~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~-swRv~ 570 (1228)
.+|+|..+ .++...|+=-....|+++|.++|+ .|++.
T Consensus 137 ~~~~~D~~HiNsV~~~-----~~G~yLiS~R~~~~i~~I~~~tG~I~W~lg 182 (299)
T PF14269_consen 137 SSFPWDYFHINSVDKD-----DDGDYLISSRNTSTIYKIDPSTGKIIWRLG 182 (299)
T ss_pred CCCCCCccEeeeeeec-----CCccEEEEecccCEEEEEECCCCcEEEEeC
Confidence 24455444 344445655556889999977765 56664
No 131
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=31.85 E-value=1.5e+02 Score=34.89 Aligned_cols=67 Identities=12% Similarity=0.180 Sum_probs=49.0
Q ss_pred CCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEE
Q psy11781 478 CDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLI 557 (1228)
Q Consensus 478 ~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLI 557 (1228)
.++|++.+.. -.|.+||.+||+++-++.++..+ ...++++ ++.+|+.+. .+.|.
T Consensus 65 ~~~v~v~~~~-------------g~v~a~d~~tG~~~W~~~~~~~~------~~~p~v~------~~~v~v~~~-~g~l~ 118 (377)
T TIGR03300 65 GGKVYAADAD-------------GTVVALDAETGKRLWRVDLDERL------SGGVGAD------GGLVFVGTE-KGEVI 118 (377)
T ss_pred CCEEEEECCC-------------CeEEEEEccCCcEeeeecCCCCc------ccceEEc------CCEEEEEcC-CCEEE
Confidence 5688887643 13999999999999988887643 2345665 578999874 45899
Q ss_pred EEEcCCCe-EEEEc
Q psy11781 558 VYDFFKNT-SYRLT 570 (1228)
Q Consensus 558 VyDl~~g~-swRv~ 570 (1228)
.+|..+|+ .|+..
T Consensus 119 ald~~tG~~~W~~~ 132 (377)
T TIGR03300 119 ALDAEDGKELWRAK 132 (377)
T ss_pred EEECCCCcEeeeec
Confidence 99998776 46543
No 132
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=31.81 E-value=6.2e+02 Score=32.39 Aligned_cols=85 Identities=12% Similarity=0.063 Sum_probs=50.2
Q ss_pred eEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeec----CCCC
Q psy11781 468 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVE----DCDH 543 (1228)
Q Consensus 468 vsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~----~c~~ 543 (1228)
.-|-.|.-|.++.||+==++- |+-.+++.+++...+.+.. ...++|..--|.++... ....
T Consensus 419 s~V~aived~dnsLWIGTs~G--------------lvk~~pe~~~v~n~ln~~~-~~l~~s~~~~lg~~~~g~Lw~a~g~ 483 (671)
T COG3292 419 SEVSAIVEDPDNSLWIGTSGG--------------LVKRDPESGRVLNLLNPGS-HGLDGSRVEQLGLGPDGRLWLAAGS 483 (671)
T ss_pred cceeeeeecCCCCEEEeccCC--------------eEecCccccchhccccccc-CcCCcchhhhhccCCCCceEEEecc
Confidence 446677889999999844332 6677888888887666633 34566666666666321 1122
Q ss_pred eEEEEecCCCCeEEEEEcCCCeEEEE
Q psy11781 544 VFAYVNDVFRYGLIVYDFFKNTSYRL 569 (1228)
Q Consensus 544 ~~aYItDs~~~gLIVyDl~~g~swRv 569 (1228)
+. |.-|+ ...+.+|++.....|.-
T Consensus 484 gv-a~~~s-q~~~~~~~l~~~~q~qy 507 (671)
T COG3292 484 GV-AAWES-QRRMFVPILDLTGQRQY 507 (671)
T ss_pred ee-eeecc-ccceecccccccceeeE
Confidence 22 22232 34566677766655543
No 133
>KOG0268|consensus
Probab=31.35 E-value=4.4e+02 Score=31.82 Aligned_cols=181 Identities=12% Similarity=0.149 Sum_probs=102.1
Q ss_pred cCCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcCCCCCcC
Q psy11781 498 LCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPE 577 (1228)
Q Consensus 498 ~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~~s~~pd 577 (1228)
.|+.+|-+||+..+.-++.+....+. +..+... --++.+--+-....+|++||+.++.--+=
T Consensus 164 TcGe~i~IWD~~R~~Pv~smswG~Dt------i~svkfN----pvETsILas~~sDrsIvLyD~R~~~Pl~K-------- 225 (433)
T KOG0268|consen 164 TCGEQIDIWDEQRDNPVSSMSWGADS------ISSVKFN----PVETSILASCASDRSIVLYDLRQASPLKK-------- 225 (433)
T ss_pred ccCceeeecccccCCccceeecCCCc------eeEEecC----CCcchheeeeccCCceEEEecccCCccce--------
Confidence 48999999999998888877775442 2222222 12344444445678999999998643221
Q ss_pred CCCcceEEcCeEeeecCCcceeecCcccCCcccCCCccccccccccccCccccccccCCCceEEEeeCCCCceEEEechh
Q psy11781 578 PTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVSTTD 657 (1228)
Q Consensus 578 P~~~~~~i~G~~f~~~dGi~GIALsp~~~~~~~~~~~~~~~~~~~~~~~~~~~i~is~d~r~LYf~~lss~~ly~V~T~~ 657 (1228)
..+ . --.++|+.+|.. + .|-.+---..+|.-++..
T Consensus 226 -----Vi~-----~--mRTN~IswnPea-----------f----------------------nF~~a~ED~nlY~~DmR~ 260 (433)
T KOG0268|consen 226 -----VIL-----T--MRTNTICWNPEA-----------F----------------------NFVAANEDHNLYTYDMRN 260 (433)
T ss_pred -----eee-----e--ccccceecCccc-----------c----------------------ceeeccccccceehhhhh
Confidence 111 0 112455555521 0 111111112366666666
Q ss_pred hcCCCCCCCccccccceEeeccccccccccceEEc--cCCcEEEEcCCCCeEEEEeCCCCccCCCCeeEEEecCCceecC
Q psy11781 658 LRNSSRYVNSSDIDEYFHYLGSRFRNTQASASAIN--SNGVMFYNLVTKHSVGCWNTKTKVYLPQTQDIVQTSRDILNFP 735 (1228)
Q Consensus 658 Lr~~~~~~~~~~~~~~v~~lG~~g~~~qs~G~a~D--~~G~LY~t~~~~~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wP 735 (1228)
|..+.- . ..| - -+.-|.+| .+|.=|++-.=..+|-.|+.+ .. .-+-|+ ...|||-.
T Consensus 261 l~~p~~------v-----~~d-h----vsAV~dVdfsptG~EfvsgsyDksIRIf~~~-~~----~SRdiY-htkRMq~V 318 (433)
T KOG0268|consen 261 LSRPLN------V-----HKD-H----VSAVMDVDFSPTGQEFVSGSYDKSIRIFPVN-HG----HSRDIY-HTKRMQHV 318 (433)
T ss_pred hcccch------h-----hcc-c----ceeEEEeccCCCcchhccccccceEEEeecC-CC----cchhhh-hHhhhhee
Confidence 644331 0 001 1 13445555 788888887777888888886 21 112333 34578877
Q ss_pred CceEEcCCCcEEEEecCccccccCCCCCccccEEEEEEec
Q psy11781 736 NDLKIDPNDNIWILSNKLHQYLYGFLDFNVYNYRILVANS 775 (1228)
Q Consensus 736 d~l~Id~dg~LyVlsnrl~~~~~~~~d~~e~nfrI~r~~v 775 (1228)
-+++..-|. -||+| -.++.|.|+||++.
T Consensus 319 ~~Vk~S~Ds-kyi~S-----------GSdd~nvRlWka~A 346 (433)
T KOG0268|consen 319 FCVKYSMDS-KYIIS-----------GSDDGNVRLWKAKA 346 (433)
T ss_pred eEEEEeccc-cEEEe-----------cCCCcceeeeecch
Confidence 777776554 24433 24688999999875
No 134
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=31.05 E-value=36 Score=26.76 Aligned_cols=20 Identities=30% Similarity=0.562 Sum_probs=15.4
Q ss_pred CCcccccccCCCceEEEEec
Q psy11781 186 GTNVDKTIRDDQRYMYFHSL 205 (1228)
Q Consensus 186 g~~~~~~~~~~~~~lYf~~l 205 (1228)
|.+...++|+||+.|||+.-
T Consensus 9 ~~~~~p~~SpDGk~i~f~s~ 28 (39)
T PF07676_consen 9 GDDGSPAWSPDGKYIYFTSN 28 (39)
T ss_dssp SSEEEEEE-TTSSEEEEEEE
T ss_pred ccccCEEEecCCCEEEEEec
Confidence 45557899999999999863
No 135
>KOG0291|consensus
Probab=30.74 E-value=8.6e+02 Score=32.07 Aligned_cols=60 Identities=13% Similarity=0.236 Sum_probs=51.7
Q ss_pred EEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcC
Q psy11781 503 IMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTH 571 (1228)
Q Consensus 503 LvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~ 571 (1228)
+-+|||+..+=.|++..|.. ..|..++||. .+..+.+.+-....|.||++++|+---++.
T Consensus 416 VRAwDlkRYrNfRTft~P~p-----~QfscvavD~----sGelV~AG~~d~F~IfvWS~qTGqllDiLs 475 (893)
T KOG0291|consen 416 VRAWDLKRYRNFRTFTSPEP-----IQFSCVAVDP----SGELVCAGAQDSFEIFVWSVQTGQLLDILS 475 (893)
T ss_pred EEeeeecccceeeeecCCCc-----eeeeEEEEcC----CCCEEEeeccceEEEEEEEeecCeeeehhc
Confidence 88999999999999999875 4588999994 466777888888999999999999998883
No 136
>PF05567 Neisseria_PilC: Neisseria PilC beta-propeller domain; InterPro: IPR008707 This domain is found in several PilC protein sequences from Neisseria gonorrhoeae and Neisseria meningitidis. PilC is a phase-variable protein associated with pilus-mediated adherence of pathogenic Neisseria to target cells [].; PDB: 3HX6_A.
Probab=30.69 E-value=91 Score=37.02 Aligned_cols=58 Identities=21% Similarity=0.226 Sum_probs=32.3
Q ss_pred CCCEEEEEECCC-CcEEEEEECCCcccccCCCceeE-EEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEcC
Q psy11781 499 CPPKIMVFDLKT-NTLIRKYILPTAQVFEGSLFSNI-VTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLTH 571 (1228)
Q Consensus 499 ~~PKLvvfDL~t-d~vi~~y~~p~~v~~~~S~l~di-~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~~ 571 (1228)
..++|+++|++| +++++++..+... ..+..+ .+|...+---.+||..|.+ |..||+.-
T Consensus 179 ~~~~lyi~d~~t~G~l~~~i~~~~~~----~gl~~~~~~D~d~DG~~D~vYaGDl~-----------GnlwR~dl 238 (335)
T PF05567_consen 179 GGAALYILDADTTGALIKKIDVPGGS----GGLSSPAVVDSDGDGYVDRVYAGDLG-----------GNLWRFDL 238 (335)
T ss_dssp --EEEEEEETTT---EEEEEEE--ST----T-EEEEEEE-TTSSSEE-EEEEEETT-----------SEEEEEE-
T ss_pred CCcEEEEEECCCCCceEEEEecCCCC----ccccccEEEeccCCCeEEEEEEEcCC-----------CcEEEEEC
Confidence 468999999999 9999999886542 134444 3453322123478888753 57788764
No 137
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=30.22 E-value=88 Score=29.98 Aligned_cols=36 Identities=22% Similarity=0.460 Sum_probs=24.8
Q ss_pred CeEEEEeCCCCccCCCCeeEEEecCCceecCCceEEcCCC-cEEEEe
Q psy11781 705 HSVGCWNTKTKVYLPQTQDIVQTSRDILNFPNDLKIDPND-NIWILS 750 (1228)
Q Consensus 705 ~aI~~wn~~~~~~~~~n~~iV~~~d~~L~wPd~l~Id~dg-~LyVls 750 (1228)
..|..|+++ ...+++ +.|.+|+||.+++++ .|||.+
T Consensus 36 ~~Vvyyd~~-------~~~~va---~g~~~aNGI~~s~~~k~lyVa~ 72 (86)
T PF01731_consen 36 GNVVYYDGK-------EVKVVA---SGFSFANGIAISPDKKYLYVAS 72 (86)
T ss_pred ceEEEEeCC-------EeEEee---ccCCCCceEEEcCCCCEEEEEe
Confidence 345556665 134554 478999999999876 577766
No 138
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=30.10 E-value=53 Score=38.23 Aligned_cols=63 Identities=19% Similarity=0.255 Sum_probs=43.3
Q ss_pred ecCCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeCCcceEEEEEccCCceee
Q psy11781 58 LCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYR 128 (1228)
Q Consensus 58 ~~~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~~~~g~iv~dl~~~~swR 128 (1228)
..|=-|.++|. |+...+|-.---..+..-++++|+||- +-+||++| +..||.++|..+-+|=|
T Consensus 58 ~qG~~l~i~di-tn~~~~t~~~l~~~i~~~~l~~Dv~vs------e~yvyvad-~ssGL~IvDIS~P~sP~ 120 (370)
T COG5276 58 GQGFILAILDI-TNVSLQTHDVLLSVINARDLFADVRVS------EEYVYVAD-WSSGLRIVDISTPDSPT 120 (370)
T ss_pred ccCceEeeccc-cCcccccCcceEEEEehhhhhheeEec------ccEEEEEc-CCCceEEEeccCCCCcc
Confidence 66778888887 555544433333333344567777664 56999999 89999999998866533
No 139
>PRK01742 tolB translocation protein TolB; Provisional
Probab=28.26 E-value=1.1e+03 Score=28.47 Aligned_cols=61 Identities=15% Similarity=0.080 Sum_probs=33.0
Q ss_pred CCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEec-CCCCeEEEEEcCCCeEEEEc
Q psy11781 500 PPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVND-VFRYGLIVYDFFKNTSYRLT 570 (1228)
Q Consensus 500 ~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItD-s~~~gLIVyDl~~g~swRv~ 570 (1228)
.++|.++|+.+++..+.-.++.. ...+++. ++ ++..||..+ .+..-|.++|+.+++..++.
T Consensus 227 ~~~i~i~dl~tg~~~~l~~~~g~-------~~~~~wS--PD-G~~La~~~~~~g~~~Iy~~d~~~~~~~~lt 288 (429)
T PRK01742 227 KSQLVVHDLRSGARKVVASFRGH-------NGAPAFS--PD-GSRLAFASSKDGVLNIYVMGANGGTPSQLT 288 (429)
T ss_pred CcEEEEEeCCCCceEEEecCCCc-------cCceeEC--CC-CCEEEEEEecCCcEEEEEEECCCCCeEeec
Confidence 35688888888764332223221 1123332 21 334566554 34455778888887766654
No 140
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=27.90 E-value=1.4e+03 Score=29.86 Aligned_cols=189 Identities=18% Similarity=0.204 Sum_probs=0.0
Q ss_pred ccCCcEEEEEEcCCcccccCCcccceEEEEeecCCceEEEEeeC---CcceEEEEEccCCceeeeecCccccCCCcccee
Q psy11781 68 LKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV---FRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYI 144 (1228)
Q Consensus 68 l~t~~~~~~~~~p~~~~~~~s~~~~i~~d~~~~~~~~~aYi~D~---~~~g~iv~dl~~~~swR~~h~~~~~~p~~~~~~ 144 (1228)
+..+.......|-.+.-.+++-+-.+-.=-...-..-.||-.|. -.+.|.|.||+||+- +||.
T Consensus 106 ~~~g~~~eevlLD~n~~A~g~~f~~Lg~~~~s~D~~~la~s~D~~G~e~y~lr~kdL~tg~~--------~~d~------ 171 (682)
T COG1770 106 PDEGGEGEEVLLDVNKEAEGHDFFSLGAASISPDHNLLAYSVDVLGDEQYTLRFKDLATGEE--------LPDE------ 171 (682)
T ss_pred cCCCCCceeEeecchhccCcccceeeeeeeeCCCCceEEEEEecccccEEEEEEEecccccc--------cchh------
Q ss_pred eccceeeeccCeeeeecCcCCCCccccCCcccccccccccCCCcccccccCCCceEEEEecccC-eeEEEecccccCCcc
Q psy11781 145 LDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSN-RHYYVSTTDLRNSSR 223 (1228)
Q Consensus 145 ~~~~~f~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~lsS~-~~~sV~T~~Lr~~~~ 223 (1228)
..++.+ +++-.+|+++|||.-+--. +-..|--..|-....
T Consensus 172 --------i~~~~~-------------------------------~~~Wa~d~~~lfYt~~d~~~rp~kv~~h~~gt~~~ 212 (682)
T COG1770 172 --------ITNTSG-------------------------------SFAWAADGKTLFYTRLDENHRPDKVWRHRLGTPGS 212 (682)
T ss_pred --------hccccc-------------------------------ceEEecCCCeEEEEEEcCCCCcceEEEEecCCCCC
Q ss_pred cCCccchhhhhhhcCccccccccceeeEcCCCeEEEec--cCCCeeEEEeCCCCcccccccceeecccc-eeecCcceee
Q psy11781 224 YVNSSDIDEYFHYLGSRFRNTQASASAINSNGVMFYNL--VTKHSVGCWNTKTKVYLPQTQDIVQTSRD-ILNFPNDLKI 300 (1228)
Q Consensus 224 ~~~~~~~~~~v~~~G~rg~~sqs~~~~~d~~Gvi~f~~--v~~~aI~cwn~~~~~~~~~~~~~v~~~~~-~l~~p~~l~i 300 (1228)
. ++-|-...+-+. --+...++++-.|+... -+...|-|.+..++. .++.+.-+|+ .+.+=-+
T Consensus 213 ~------d~lvyeE~d~~f--~~~v~~s~s~~yi~i~~~~~~tsE~~ll~a~~p~----~~p~vv~pr~~g~eY~~e--- 277 (682)
T COG1770 213 S------DELVYEEKDDRF--FLSVGRSRSEAYIVISLGSHITSEVRLLDADDPE----AEPKVVLPRENGVEYSVE--- 277 (682)
T ss_pred c------ceEEEEcCCCcE--EEEeeeccCCceEEEEcCCCcceeEEEEecCCCC----CceEEEEEcCCCcEEeee---
Q ss_pred eCCCeEEEEeccccchhcccCCCceeeEEEEEecc
Q psy11781 301 DPNDNIWILSNKLHQYLYGFLDFNVYNYRILVANS 335 (1228)
Q Consensus 301 ~~~g~lw~~sn~l~~~~~~~l~~~~~n~ri~~~~~ 335 (1228)
...++.|++||. +..||+|+++.+
T Consensus 278 h~~d~f~i~sN~-----------~gknf~l~~ap~ 301 (682)
T COG1770 278 HGGDRFYILSNA-----------DGKNFKLVRAPV 301 (682)
T ss_pred ecCcEEEEEecC-----------CCcceEEEEccC
No 141
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=27.81 E-value=3.5e+02 Score=31.97 Aligned_cols=26 Identities=23% Similarity=0.339 Sum_probs=21.8
Q ss_pred ceecCCCceEEEEeCC----CCceEEEecc
Q psy11781 1081 KTIRDDQRYMYFHSLS----SNRHYYVSTT 1106 (1228)
Q Consensus 1081 i~ls~d~r~LYf~pls----s~~ly~V~T~ 1106 (1228)
+.++++++.|||.+-. -++||+|+..
T Consensus 286 ~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~ 315 (353)
T PF00930_consen 286 LGWDEDNNRIYFTANGDNPGERHLYRVSLD 315 (353)
T ss_dssp EEEECTSSEEEEEESSGGTTSBEEEEEETT
T ss_pred ceEcCCCCEEEEEecCCCCCceEEEEEEeC
Confidence 5789999999999876 5589999865
No 142
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=26.41 E-value=1.6e+02 Score=28.21 Aligned_cols=53 Identities=19% Similarity=0.189 Sum_probs=37.6
Q ss_pred CCCEEEEEECCCCcEEEEEECCCcccccCCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCC
Q psy11781 499 CPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKN 564 (1228)
Q Consensus 499 ~~PKLvvfDL~td~vi~~y~~p~~v~~~~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g 564 (1228)
+.--++-||.++-++ ++..-++-|.|.+|. ++.++||++..+..|-||..+++
T Consensus 34 ~~~~Vvyyd~~~~~~---------va~g~~~aNGI~~s~----~~k~lyVa~~~~~~I~vy~~~~~ 86 (86)
T PF01731_consen 34 PWGNVVYYDGKEVKV---------VASGFSFANGIAISP----DKKYLYVASSLAHSIHVYKRHKD 86 (86)
T ss_pred CCceEEEEeCCEeEE---------eeccCCCCceEEEcC----CCCEEEEEeccCCeEEEEEecCC
Confidence 344577777754222 222336778899983 46899999999999999988753
No 143
>PRK01742 tolB translocation protein TolB; Provisional
Probab=26.07 E-value=3.5e+02 Score=32.94 Aligned_cols=99 Identities=19% Similarity=0.249 Sum_probs=54.1
Q ss_pred CceEEEEeeC-CcceEEEEEccCCceeeee-------cCccccCCCccceeeccceeeeccCeeeeecCcCCCCccccCC
Q psy11781 102 DHVFAYVNDV-FRYGLIVYDFFKNTSYRLT-------HPYMYPEPTQSTYILDNLKFRWVDGIFGMAISPELSGKYKRHP 173 (1228)
Q Consensus 102 ~~~~aYi~D~-~~~g~iv~dl~~~~swR~~-------h~~~~~~p~~~~~~~~~~~f~~~~G~~~~~~~~~~~~~~~~~~ 173 (1228)
|+-+||+++. +++-|.++|+.+|+.=++. .+.+.||-..+-+.. ..+|..-+++-....|+.++.+
T Consensus 215 G~~la~~s~~~~~~~i~i~dl~tg~~~~l~~~~g~~~~~~wSPDG~~La~~~------~~~g~~~Iy~~d~~~~~~~~lt 288 (429)
T PRK01742 215 GSKLAYVSFENKKSQLVVHDLRSGARKVVASFRGHNGAPAFSPDGSRLAFAS------SKDGVLNIYVMGANGGTPSQLT 288 (429)
T ss_pred CCEEEEEEecCCCcEEEEEeCCCCceEEEecCCCccCceeECCCCCEEEEEE------ecCCcEEEEEEECCCCCeEeec
Confidence 6778999875 5678999999888643332 233444433332211 1244332322222223333222
Q ss_pred cccccccccccCCCcccccccCCCceEEEEec--ccCeeEEEec
Q psy11781 174 YEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSL--SSNRHYYVST 215 (1228)
Q Consensus 174 ~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~l--sS~~~~sV~T 215 (1228)
. ..+.+...+.|+||+.|+|..- ...++|.+++
T Consensus 289 ~---------~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~ 323 (429)
T PRK01742 289 S---------GAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSA 323 (429)
T ss_pred c---------CCCCcCCEEECCCCCEEEEEECCCCCceEEEEEC
Confidence 1 1234557889999999999753 2346666553
No 144
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=25.97 E-value=2.5e+02 Score=35.00 Aligned_cols=74 Identities=11% Similarity=0.104 Sum_probs=48.4
Q ss_pred CCcEEEEeCCCCCccccccccCCCEEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcC------
Q psy11781 928 CDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDV------ 1001 (1228)
Q Consensus 928 ~grLWVLDtG~~~~~~~~~~~~~pKLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~------ 1001 (1228)
.+++++-+.. -.|+++|.+|++++-++.+..............+|+ ++.+|+...
T Consensus 110 ~~~V~v~~~~-------------g~v~AlD~~TG~~~W~~~~~~~~~~~~~i~ssP~v~------~~~v~vg~~~~~~~~ 170 (488)
T cd00216 110 PRKVFFGTFD-------------GRLVALDAETGKQVWKFGNNDQVPPGYTMTGAPTIV------KKLVIIGSSGAEFFA 170 (488)
T ss_pred CCeEEEecCC-------------CeEEEEECCCCCEeeeecCCCCcCcceEecCCCEEE------CCEEEEecccccccc
Confidence 4788875532 179999999999999998865431111112334554 366777643
Q ss_pred --CCCeEEEEECCCCe-EEEee
Q psy11781 1002 --FRYGLIVYDFFKNT-SYRLT 1020 (1228)
Q Consensus 1002 --~~~gLIVydl~~g~-swRv~ 1020 (1228)
..+.|+.+|.++|+ -|+..
T Consensus 171 ~~~~g~v~alD~~TG~~~W~~~ 192 (488)
T cd00216 171 CGVRGALRAYDVETGKLLWRFY 192 (488)
T ss_pred CCCCcEEEEEECCCCceeeEee
Confidence 23679999998776 67765
No 145
>KOG0282|consensus
Probab=23.74 E-value=80 Score=38.78 Aligned_cols=132 Identities=12% Similarity=0.190 Sum_probs=74.3
Q ss_pred EEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEee-cCCcCcCCCc
Q psy11781 952 KIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLT-HPYMYPEPTQ 1030 (1228)
Q Consensus 952 KLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv~-h~s~~pdP~~ 1030 (1228)
||+.||.+++++++.|.=.- ...++|..= +++-=+|+=+-...++||+....---+.. .+.|+.-|--
T Consensus 323 ki~~wDiRs~kvvqeYd~hL------g~i~~i~F~-----~~g~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~~ 391 (503)
T KOG0282|consen 323 KIRQWDIRSGKVVQEYDRHL------GAILDITFV-----DEGRRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPCL 391 (503)
T ss_pred cEEEEeccchHHHHHHHhhh------hheeeeEEc-----cCCceEeeeccCccEEEEEcCCCccchhhcchhhccCcce
Confidence 89999999999987665321 234444432 23444555555569999998877555432 2223322210
Q ss_pred -----ceeEEcCEEEEeeCCccceecCccccccccCCCccccccccccccCccccceecCCCceEEEEeCCCCceEEEe
Q psy11781 1031 -----STYILDNLKFRWVDGIFGMAISPELSGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLSSNRHYYVS 1104 (1228)
Q Consensus 1031 -----~~~~i~G~~f~~~dGi~GiaLsp~~~~~~~~~p~~~~~~~~~~~~~~~~di~ls~d~r~LYf~plss~~ly~V~ 1104 (1228)
+.+. ..+.+.-.-++|++ . + +++.++.+.| -+|...|.+..+.-||||++|...--.|. +|-=+
T Consensus 392 ~~~P~~~~~-~aQs~dN~i~ifs~-~-~----~~r~nkkK~f--eGh~vaGys~~v~fSpDG~~l~SGdsdG~-v~~wd 460 (503)
T KOG0282|consen 392 TLHPNGKWF-AAQSMDNYIAIFST-V-P----PFRLNKKKRF--EGHSVAGYSCQVDFSPDGRTLCSGDSDGK-VNFWD 460 (503)
T ss_pred ecCCCCCee-hhhccCceEEEEec-c-c----ccccCHhhhh--cceeccCceeeEEEcCCCCeEEeecCCcc-EEEee
Confidence 1111 12222112234431 1 1 2455566655 24455677777899999999988877665 55444
No 146
>COG4222 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.95 E-value=2.1e+02 Score=34.83 Aligned_cols=101 Identities=18% Similarity=0.181 Sum_probs=52.8
Q ss_pred cceEEEEEEEeCCC-cEEEE-eCCCCCccccccccCCCEEEEEECCCCc--EEEEEEC--CC-Ccc-------cCCCcce
Q psy11781 916 SLISVFRMSVDKCD-RLWIM-DTGVTNILSSIQQLCPPKIMVFDLKTNT--LIRKYIL--PT-AQV-------FEGSLFS 981 (1228)
Q Consensus 916 ~lvSV~~v~iD~~g-rLWVL-DtG~~~~~~~~~~~~~pKLvvfDL~td~--lir~y~~--p~-~~~-------~~~S~l~ 981 (1228)
.+.---++..|..| +-|+| |.|........-..--+-.+.+|..+++ ++..+.| |+ .+. ..+..+.
T Consensus 67 p~~G~Sgi~~d~~~~~f~~lSDng~g~K~nS~dy~l~l~~~~~~~~~g~v~~~~~~~L~Dp~~k~pf~i~~~~~~~ralt 146 (391)
T COG4222 67 PVGGFSGITYDPQGDGYWALSDNGRGSKLNSRDYTLRLYRLGIDFGNGKVTPLDTTFLSDPNKKTPFPITGEDPEGRALT 146 (391)
T ss_pred CCCceeeeEEccCCCeEEEEeCCCcccccCccceEEEccccccccCCcceEEeEEEEeecCCCCCCccccccCchhhccc
Confidence 34446677888888 59999 8887653322211111122223334443 4444455 22 011 1122333
Q ss_pred eeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEee
Q psy11781 982 NIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLT 1020 (1228)
Q Consensus 982 di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv~ 1020 (1228)
.--.|+.. -..++|.|..++.|+.+|+. |+--++.
T Consensus 147 ~~d~~~~s---~~~~~igdefgP~l~~f~~~-Gk~~~~~ 181 (391)
T COG4222 147 PADFDVES---SQGAWIGDEFGPYLLEFDAN-GKLVRVL 181 (391)
T ss_pred CCCcceee---ccccccccccCcceEEECCC-Ccccccc
Confidence 22222221 12238999999999999999 5555554
No 147
>COG4222 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.53 E-value=2.2e+02 Score=34.63 Aligned_cols=100 Identities=18% Similarity=0.179 Sum_probs=49.7
Q ss_pred ceEEEEEEEeCCC-cEEEE-eCCCCCcccCcCccCCCEEEEEECCCCc--EEEEEEC--CC-cc-------cccCCCcee
Q psy11781 467 LISVFRMSVDKCD-RLWIM-DTGVTNILSSIQQLCPPKIMVFDLKTNT--LIRKYIL--PT-AQ-------VFEGSLFSN 532 (1228)
Q Consensus 467 lvsV~~v~iD~~g-RLWVL-DtG~~~~~~~~~~~~~PKLvvfDL~td~--vi~~y~~--p~-~v-------~~~~S~l~d 532 (1228)
+.---++..|..+ +.|+| |.|.....-..-..--+-.+-+|..+++ ++..+.| |+ .+ ...+..+..
T Consensus 68 ~~G~Sgi~~d~~~~~f~~lSDng~g~K~nS~dy~l~l~~~~~~~~~g~v~~~~~~~L~Dp~~k~pf~i~~~~~~~ralt~ 147 (391)
T COG4222 68 VGGFSGITYDPQGDGYWALSDNGRGSKLNSRDYTLRLYRLGIDFGNGKVTPLDTTFLSDPNKKTPFPITGEDPEGRALTP 147 (391)
T ss_pred CCceeeeEEccCCCeEEEEeCCCcccccCccceEEEccccccccCCcceEEeEEEEeecCCCCCCccccccCchhhcccC
Confidence 4445677788887 68988 7776442211100000111222233443 3344444 21 01 112233333
Q ss_pred EEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeEEEEc
Q psy11781 533 IVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLT 570 (1228)
Q Consensus 533 i~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~swRv~ 570 (1228)
.-.|+. +-..+||.|...+.|+.+|+. |+--++.
T Consensus 148 ~d~~~~---s~~~~~igdefgP~l~~f~~~-Gk~~~~~ 181 (391)
T COG4222 148 ADFDVE---SSQGAWIGDEFGPYLLEFDAN-GKLVRVL 181 (391)
T ss_pred CCccee---eccccccccccCcceEEECCC-Ccccccc
Confidence 222222 112338999999999999999 5555554
No 148
>KOG0291|consensus
Probab=22.27 E-value=1.8e+02 Score=37.85 Aligned_cols=60 Identities=13% Similarity=0.237 Sum_probs=52.5
Q ss_pred EEEEEECCCCcEEEEEECCCCcccCCCcceeeEEEeecCCCceEEEEEcCCCCeEEEEECCCCeEEEee
Q psy11781 952 KIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTSYRLT 1020 (1228)
Q Consensus 952 KLvvfDL~td~lir~y~~p~~~~~~~S~l~di~VD~~~~c~~~~aYItD~~~~gLIVydl~~g~swRv~ 1020 (1228)
.+-+|||+..+=.|++..|+. ..|.-++||. .+..+.+.+.....|.||++++|+---++
T Consensus 415 tVRAwDlkRYrNfRTft~P~p-----~QfscvavD~----sGelV~AG~~d~F~IfvWS~qTGqllDiL 474 (893)
T KOG0291|consen 415 TVRAWDLKRYRNFRTFTSPEP-----IQFSCVAVDP----SGELVCAGAQDSFEIFVWSVQTGQLLDIL 474 (893)
T ss_pred eEEeeeecccceeeeecCCCc-----eeeeEEEEcC----CCCEEEeeccceEEEEEEEeecCeeeehh
Confidence 577999999999999999886 4688899995 35678888888899999999999999988
No 149
>KOG2048|consensus
Probab=21.40 E-value=1.4e+03 Score=29.77 Aligned_cols=72 Identities=15% Similarity=0.099 Sum_probs=44.7
Q ss_pred CCEEEEEECCCCcEEEEEECCCccccc----CCCceeEEEEeecCCCCeEEEEecCCCCeEEEEEcCCCeE-EEEcCCCC
Q psy11781 500 PPKIMVFDLKTNTLIRKYILPTAQVFE----GSLFSNIVTEVVEDCDHVFAYVNDVFRYGLIVYDFFKNTS-YRLTHPYM 574 (1228)
Q Consensus 500 ~PKLvvfDL~td~vi~~y~~p~~v~~~----~S~l~di~VD~~~~c~~~~aYItDs~~~gLIVyDl~~g~s-wRv~~~s~ 574 (1228)
.-+|.-|||.+.+..+.++.-....-. .+. ..+.| +|++|.+|+=+. +++.|-|+..-++. =|++.=++
T Consensus 89 sg~i~EwDl~~lk~~~~~d~~gg~IWsiai~p~~-~~l~I----gcddGvl~~~s~-~p~~I~~~r~l~rq~sRvLslsw 162 (691)
T KOG2048|consen 89 SGSITEWDLHTLKQKYNIDSNGGAIWSIAINPEN-TILAI----GCDDGVLYDFSI-GPDKITYKRSLMRQKSRVLSLSW 162 (691)
T ss_pred CceEEEEecccCceeEEecCCCcceeEEEeCCcc-ceEEe----ecCCceEEEEec-CCceEEEEeecccccceEEEEEe
Confidence 346999999999988777764332211 110 22333 699999998885 46777777655555 56664444
Q ss_pred CcC
Q psy11781 575 YPE 577 (1228)
Q Consensus 575 ~pd 577 (1228)
.|+
T Consensus 163 ~~~ 165 (691)
T KOG2048|consen 163 NPT 165 (691)
T ss_pred cCC
Confidence 444
No 150
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=20.99 E-value=5.7e+02 Score=30.14 Aligned_cols=83 Identities=10% Similarity=0.113 Sum_probs=42.1
Q ss_pred ceEEEEEEEeCCCcEEEEeCCCCCcccCcCccCCCEEEEEECCCCcEEEEEECCCcccccCC-CceeEEEEeecCCCCeE
Q psy11781 467 LISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEGS-LFSNIVTEVVEDCDHVF 545 (1228)
Q Consensus 467 lvsV~~v~iD~~gRLWVLDtG~~~~~~~~~~~~~PKLvvfDL~td~vi~~y~~p~~v~~~~S-~l~di~VD~~~~c~~~~ 545 (1228)
-.+++.|-.++++.||++..|- .+.+.-..+. ......+..-.+..+ .+-|++.. .+.-
T Consensus 186 ~~riq~~gf~~~~~lw~~~~Gg--------------~~~~s~~~~~-~~~w~~~~~~~~~~~~~~ld~a~~-----~~~~ 245 (302)
T PF14870_consen 186 SRRIQSMGFSPDGNLWMLARGG--------------QIQFSDDPDD-GETWSEPIIPIKTNGYGILDLAYR-----PPNE 245 (302)
T ss_dssp SS-EEEEEE-TTS-EEEEETTT--------------EEEEEE-TTE-EEEE---B-TTSS--S-EEEEEES-----SSS-
T ss_pred cceehhceecCCCCEEEEeCCc--------------EEEEccCCCC-ccccccccCCcccCceeeEEEEec-----CCCC
Confidence 3677888889999999999874 2223211111 111222111111222 25566665 3456
Q ss_pred EEEecCCCCeEEEEEcCCCeEEEEcC
Q psy11781 546 AYVNDVFRYGLIVYDFFKNTSYRLTH 571 (1228)
Q Consensus 546 aYItDs~~~gLIVyDl~~g~swRv~~ 571 (1228)
+|+ ++..|.+.++...|++|+...
T Consensus 246 ~wa--~gg~G~l~~S~DgGktW~~~~ 269 (302)
T PF14870_consen 246 IWA--VGGSGTLLVSTDGGKTWQKDR 269 (302)
T ss_dssp EEE--EESTT-EEEESSTTSS-EE-G
T ss_pred EEE--EeCCccEEEeCCCCccceECc
Confidence 776 456678899999999999763
No 151
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=20.85 E-value=7e+02 Score=29.46 Aligned_cols=177 Identities=14% Similarity=0.160 Sum_probs=87.4
Q ss_pred ceEEEEEEEecCCeEEEeecCccc--ccCCCce----ec----CCeEEEEEccCCcEEEEEEcCCcccccCCcccceEEE
Q psy11781 27 LISVFRMSVDKCDRLWIMDTGVTN--ILSSIQQ----LC----PPKIMVFDLKTNTLIRKYILPTAQVFEGSLFSNIVTE 96 (1228)
Q Consensus 27 ~~~v~~~~~d~~~~lw~ldtg~~~--~~~~~~~----~~----~pkl~~~dl~t~~~~~~~~~p~~~~~~~s~~~~i~~d 96 (1228)
-|...++--.....+++++.+... ++..-+. +| .-+|.++||++++.. ....|......+.|+.++..
T Consensus 114 ~la~~~~d~~~v~~~~~~~~~~~~~~yp~~~~~~YPk~G~~np~v~l~v~~~~~~~~~-~~~~~~~~~~~~~yl~~v~W- 191 (353)
T PF00930_consen 114 YLAFLRFDEREVPEYPLPDYSPPDSQYPEVESIRYPKAGDPNPRVSLFVVDLASGKTT-ELDPPNSLNPQDYYLTRVGW- 191 (353)
T ss_dssp EEEEEEEE-TTS-EEEEEEESSSTESS-EEEEEE--BTTS---EEEEEEEESSSTCCC-EE---HHHHTSSEEEEEEEE-
T ss_pred EEEEEEECCcCCceEEeeccCCccccCCcccccccCCCCCcCCceEEEEEECCCCcEE-EeeeccccCCCccCccccee-
Confidence 455555555567888888886653 1111111 23 338899999999766 55556444445666655532
Q ss_pred EeecCCceEEEEeeC--CcceEEEEEccCCceeee----ecCccccCCCcccee--eccceeee---ccCeeeeecCcCC
Q psy11781 97 VVEDCDHVFAYVNDV--FRYGLIVYDFFKNTSYRL----THPYMYPEPTQSTYI--LDNLKFRW---VDGIFGMAISPEL 165 (1228)
Q Consensus 97 ~~~~~~~~~aYi~D~--~~~g~iv~dl~~~~swR~----~h~~~~~~p~~~~~~--~~~~~f~~---~~G~~~~~~~~~~ 165 (1228)
..+....++-..+= .+--|+.+|..+++.=.+ .+.++ +......- -+|..|-| .+|--=+++-...
T Consensus 192 -~~d~~~l~~~~~nR~q~~~~l~~~d~~tg~~~~~~~e~~~~Wv--~~~~~~~~~~~~~~~~l~~s~~~G~~hly~~~~~ 268 (353)
T PF00930_consen 192 -SPDGKRLWVQWLNRDQNRLDLVLCDASTGETRVVLEETSDGWV--DVYDPPHFLGPDGNEFLWISERDGYRHLYLYDLD 268 (353)
T ss_dssp -EETTEEEEEEEEETTSTEEEEEEEEECTTTCEEEEEEESSSSS--SSSSEEEE-TTTSSEEEEEEETTSSEEEEEEETT
T ss_pred -cCCCcEEEEEEcccCCCEEEEEEEECCCCceeEEEEecCCcce--eeecccccccCCCCEEEEEEEcCCCcEEEEEccc
Confidence 22222133333333 355566677766654222 22333 11111111 24444444 2554333444333
Q ss_pred CCccccCCcccccccccccCCCcccccccCCCceEEEEecc----cCeeEEEecc
Q psy11781 166 SGKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSLS----SNRHYYVSTT 216 (1228)
Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~lYf~~ls----S~~~~sV~T~ 216 (1228)
.+..+.++-+ .|+|... +.++.+++.|||..-. =+++|+|+..
T Consensus 269 ~~~~~~lT~G--~~~V~~i------~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~ 315 (353)
T PF00930_consen 269 GGKPRQLTSG--DWEVTSI------LGWDEDNNRIYFTANGDNPGERHLYRVSLD 315 (353)
T ss_dssp SSEEEESS-S--SS-EEEE------EEEECTSSEEEEEESSGGTTSBEEEEEETT
T ss_pred ccceeccccC--ceeeccc------ceEcCCCCEEEEEecCCCCCceEEEEEEeC
Confidence 3333333332 2333222 3345788999999886 4599999888
No 152
>COG3670 Lignostilbene-alpha,beta-dioxygenase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=20.78 E-value=2.1e+02 Score=35.43 Aligned_cols=138 Identities=14% Similarity=0.140 Sum_probs=74.7
Q ss_pred EEEEeccCCCCCCcEEEEEECCCCCCCCccccC---CCCCcCCC---CCCCcceEEEEEEEeCCCcEEEEeCCCCCcccc
Q psy11781 871 IFLSFPKWKAGIPFTLASFNMNDPSESPILLPY---PNWSYFDD---SNCNSLISVFRMSVDKCDRLWIMDTGVTNILSS 944 (1228)
Q Consensus 871 iFvt~PR~~~gvP~TLa~v~~~~~~~sP~l~PY---P~~~wn~~---~~c~~lvSV~~v~iD~~grLWVLDtG~~~~~~~ 944 (1228)
-+....||.++.|..+..+..+.+ ..++=| |..-||-. .+|++-+-+--+..|.- -=.+|.+.-...+.
T Consensus 261 ~~~~~~~~~pe~ptri~V~pR~g~---~~irwfE~~p~fvfH~~NAye~~d~~v~~d~~~~~~~--~~~~~~~~~~~~g~ 335 (490)
T COG3670 261 ADGHAMRWRPELPTRILVLPREGD---SEIRWFEAEPGFVFHFLNAYEEGDEVVLVDFLRYDDF--SQTLDTLGEGPGGD 335 (490)
T ss_pred ccccceeecCCCCcEEEEEccCCC---CceeEEecCCeEEEEecceeecCCCcEEEEEEeeccc--cccccccccccccc
Confidence 377788888888988888876321 123322 44455532 34555444433333211 13355554433333
Q ss_pred ccccCCCEEEEE--ECCCCcEEEEEECCCCcccCCCcceeeEEEeec-CCCceEEEEEcCCC-----Ce---EEEEECCC
Q psy11781 945 IQQLCPPKIMVF--DLKTNTLIRKYILPTAQVFEGSLFSNIVTEVVE-DCDHVFAYVNDVFR-----YG---LIVYDFFK 1013 (1228)
Q Consensus 945 ~~~~~~pKLvvf--DL~td~lir~y~~p~~~~~~~S~l~di~VD~~~-~c~~~~aYItD~~~-----~g---LIVydl~~ 1013 (1228)
.....+|+|..| ||+|+++...-.. . ..-.--+|+.+. +-...|+|++=... .| |+.+|+++
T Consensus 336 ~~~~a~~~L~R~~lnl~tg~v~~~~~~------~-r~~EFP~in~~~~G~p~RY~Y~~~~rp~~~l~~g~~~l~k~D~~t 408 (490)
T COG3670 336 FRTLAPPRLHRWTLNLKTGKVAEEQLD------D-RACEFPRINPRYVGQPYRYTYMSGARPGTFLFQGLQTLAKHDLET 408 (490)
T ss_pred ccccCCCcceEEEEccccchhHHHhhh------h-hhcccCccChhhcCCceEEEEEecccCCccchhhcceeEEEeccC
Confidence 333457888765 8999974321100 0 001112333322 23457899883221 24 99999999
Q ss_pred CeEEEee
Q psy11781 1014 NTSYRLT 1020 (1228)
Q Consensus 1014 g~swRv~ 1020 (1228)
|+++...
T Consensus 409 g~~~~~~ 415 (490)
T COG3670 409 GTSQVYS 415 (490)
T ss_pred CcEEEEe
Confidence 9999975
Done!