Query psy4774
Match_columns 120
No_of_seqs 123 out of 1024
Neff 7.6
Searched_HMMs 46136
Date Fri Aug 16 17:21:28 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4774.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4774hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1520|consensus 99.9 6.7E-28 1.5E-32 187.5 8.4 109 4-115 107-219 (376)
2 PF03088 Str_synth: Strictosid 99.5 6.9E-15 1.5E-19 94.7 3.0 53 63-115 1-57 (89)
3 PF08450 SGL: SMP-30/Gluconola 99.4 3.5E-12 7.5E-17 94.2 8.6 92 12-115 40-134 (246)
4 COG3386 Gluconolactonase [Carb 99.2 2.5E-10 5.3E-15 88.2 10.6 95 9-113 64-160 (307)
5 PF08450 SGL: SMP-30/Gluconola 98.9 5.8E-09 1.3E-13 76.9 8.2 75 11-106 133-214 (246)
6 PLN02919 haloacid dehalogenase 98.9 1.7E-08 3.8E-13 88.6 11.0 72 10-81 802-880 (1057)
7 PLN02919 haloacid dehalogenase 98.9 2.3E-08 4.9E-13 87.9 11.2 77 4-80 616-704 (1057)
8 TIGR02604 Piru_Ver_Nterm putat 98.8 6.5E-08 1.4E-12 75.9 11.1 104 10-115 70-184 (367)
9 COG4257 Vgb Streptogramin lyas 98.8 5.2E-08 1.1E-12 74.4 9.5 71 10-84 102-172 (353)
10 PF03088 Str_synth: Strictosid 98.7 1E-07 2.2E-12 61.4 7.2 57 15-79 1-77 (89)
11 COG3386 Gluconolactonase [Carb 98.7 1.9E-07 4.1E-12 72.3 9.0 66 7-80 106-184 (307)
12 COG4257 Vgb Streptogramin lyas 98.6 3.2E-07 6.9E-12 70.2 7.9 70 6-80 183-253 (353)
13 TIGR02604 Piru_Ver_Nterm putat 98.4 1E-06 2.2E-11 69.2 7.6 61 11-79 123-203 (367)
14 KOG4659|consensus 98.3 5.8E-06 1.3E-10 73.2 9.0 72 2-79 355-427 (1899)
15 KOG4659|consensus 98.2 5.3E-06 1.1E-10 73.5 8.7 74 4-79 467-553 (1899)
16 PF01436 NHL: NHL repeat; Int 98.0 8.6E-06 1.9E-10 41.3 3.1 22 59-80 1-22 (28)
17 PF03022 MRJP: Major royal jel 97.8 8.4E-05 1.8E-09 57.0 7.1 63 12-79 186-255 (287)
18 PF01436 NHL: NHL repeat; Int 97.7 5E-05 1.1E-09 38.5 3.3 22 12-33 2-23 (28)
19 KOG4499|consensus 97.7 0.00029 6.3E-09 53.2 8.2 79 11-106 157-242 (310)
20 PF06977 SdiA-regulated: SdiA- 97.6 0.00018 3.8E-09 54.3 6.3 68 12-80 171-242 (248)
21 PF03022 MRJP: Major royal jel 97.6 0.00056 1.2E-08 52.4 8.9 66 14-80 3-87 (287)
22 KOG1520|consensus 97.5 0.00021 4.4E-09 56.7 5.0 60 12-79 161-239 (376)
23 PF10282 Lactonase: Lactonase, 97.3 0.0011 2.4E-08 51.6 7.6 63 13-80 246-313 (345)
24 COG3391 Uncharacterized conser 97.2 0.0046 1E-07 49.0 9.9 62 10-79 114-180 (381)
25 COG2133 Glucose/sorbosone dehy 97.0 0.0059 1.3E-07 49.0 8.7 75 12-105 314-397 (399)
26 PF05787 DUF839: Bacterial pro 96.9 0.006 1.3E-07 50.6 8.5 70 10-79 348-455 (524)
27 TIGR02658 TTQ_MADH_Hv methylam 96.8 0.021 4.5E-07 45.2 10.2 66 9-79 44-125 (352)
28 PF10282 Lactonase: Lactonase, 96.8 0.0085 1.8E-07 46.6 7.9 73 8-80 188-266 (345)
29 COG3211 PhoX Predicted phospha 96.8 0.0085 1.8E-07 50.0 8.1 70 10-79 415-520 (616)
30 COG3391 Uncharacterized conser 96.7 0.024 5.2E-07 44.9 10.2 69 9-80 157-228 (381)
31 PF07995 GSDH: Glucose / Sorbo 96.7 0.0058 1.3E-07 47.5 6.5 63 12-79 253-325 (331)
32 TIGR03032 conserved hypothetic 96.7 0.0076 1.7E-07 47.1 6.9 58 11-79 202-261 (335)
33 COG2706 3-carboxymuconate cycl 96.6 0.035 7.6E-07 43.7 9.9 61 13-78 245-309 (346)
34 PF07995 GSDH: Glucose / Sorbo 96.5 0.008 1.7E-07 46.8 6.3 65 12-78 2-71 (331)
35 PRK11028 6-phosphogluconolacto 96.5 0.033 7.1E-07 42.5 9.4 71 9-79 172-248 (330)
36 TIGR03606 non_repeat_PQQ dehyd 96.5 0.016 3.4E-07 47.4 7.8 68 10-77 28-103 (454)
37 smart00135 LY Low-density lipo 96.4 0.0093 2E-07 31.7 4.3 23 58-80 7-30 (43)
38 TIGR02658 TTQ_MADH_Hv methylam 96.4 0.044 9.5E-07 43.4 9.6 60 12-79 245-320 (352)
39 PRK11028 6-phosphogluconolacto 96.4 0.071 1.5E-06 40.7 10.4 61 13-79 229-294 (330)
40 COG2706 3-carboxymuconate cycl 96.2 0.12 2.6E-06 40.8 10.7 72 9-80 188-265 (346)
41 TIGR03866 PQQ_ABC_repeats PQQ- 96.1 0.13 2.9E-06 37.4 10.4 60 11-78 30-92 (300)
42 COG3823 Glutamine cyclotransfe 96.1 0.025 5.5E-07 42.2 6.3 58 22-79 185-249 (262)
43 COG3204 Uncharacterized protei 96.0 0.051 1.1E-06 42.2 8.1 67 15-82 236-306 (316)
44 KOG4499|consensus 95.9 0.074 1.6E-06 40.4 8.4 81 12-103 212-296 (310)
45 PF01731 Arylesterase: Arylest 95.8 0.05 1.1E-06 34.7 6.1 37 34-80 38-75 (86)
46 PF06739 SBBP: Beta-propeller 95.6 0.0093 2E-07 32.2 1.8 21 60-80 13-33 (38)
47 TIGR03606 non_repeat_PQQ dehyd 95.5 0.25 5.5E-06 40.4 10.7 93 11-106 78-208 (454)
48 COG3211 PhoX Predicted phospha 95.5 0.052 1.1E-06 45.4 6.5 63 10-78 498-573 (616)
49 TIGR03866 PQQ_ABC_repeats PQQ- 95.2 0.51 1.1E-05 34.3 10.6 59 12-78 207-268 (300)
50 KOG1214|consensus 95.1 0.13 2.8E-06 45.0 8.1 61 11-80 1067-1132(1289)
51 PF05096 Glu_cyclase_2: Glutam 95.1 0.16 3.4E-06 38.8 7.8 64 12-79 130-193 (264)
52 PF00058 Ldl_recept_b: Low-den 94.7 0.14 3.1E-06 27.9 5.0 40 23-69 1-42 (42)
53 PF13449 Phytase-like: Esteras 94.3 0.54 1.2E-05 36.4 9.3 66 13-80 86-168 (326)
54 PF02239 Cytochrom_D1: Cytochr 94.2 0.24 5.3E-06 39.2 7.2 57 14-78 39-97 (369)
55 smart00135 LY Low-density lipo 94.2 0.16 3.5E-06 26.7 4.4 31 10-40 7-39 (43)
56 PF06739 SBBP: Beta-propeller 94.1 0.052 1.1E-06 29.2 2.2 20 12-31 13-32 (38)
57 PF06977 SdiA-regulated: SdiA- 94.0 0.41 9E-06 36.1 7.7 67 14-80 120-192 (248)
58 PF05787 DUF839: Bacterial pro 93.6 0.43 9.4E-06 39.7 7.8 22 8-29 432-453 (524)
59 COG2133 Glucose/sorbosone dehy 93.5 0.25 5.4E-06 39.9 6.1 60 13-80 178-260 (399)
60 PF14269 Arylsulfotran_2: Aryl 93.2 1.9 4.1E-05 33.3 10.3 88 13-106 145-241 (299)
61 KOG1214|consensus 93.0 0.17 3.6E-06 44.4 4.7 62 12-80 1025-1089(1289)
62 PF14339 DUF4394: Domain of un 92.9 1.5 3.2E-05 33.0 9.1 75 12-106 27-104 (236)
63 PF14517 Tachylectin: Tachylec 92.7 0.17 3.7E-06 37.8 3.8 75 14-106 132-207 (229)
64 PF05096 Glu_cyclase_2: Glutam 92.6 0.98 2.1E-05 34.5 7.9 37 12-48 45-84 (264)
65 PF02333 Phytase: Phytase; In 92.3 0.83 1.8E-05 36.7 7.5 68 10-80 206-281 (381)
66 TIGR02276 beta_rpt_yvtn 40-res 91.2 1.3 2.8E-05 23.1 6.0 38 23-68 4-42 (42)
67 PF02239 Cytochrom_D1: Cytochr 91.1 1.4 3.1E-05 34.9 7.6 37 9-45 75-113 (369)
68 PF07433 DUF1513: Protein of u 91.1 3.9 8.5E-05 31.9 9.8 65 10-79 3-71 (305)
69 PF13449 Phytase-like: Esteras 90.8 3.2 6.9E-05 32.2 9.2 70 11-80 19-105 (326)
70 PRK04792 tolB translocation pr 89.1 6.3 0.00014 31.8 10.0 54 15-76 221-279 (448)
71 TIGR03032 conserved hypothetic 89.1 3.3 7.2E-05 32.7 8.0 70 13-110 153-238 (335)
72 PF13360 PQQ_2: PQQ-like domai 89.1 3.8 8.1E-05 29.2 7.9 27 18-44 32-58 (238)
73 PF01731 Arylesterase: Arylest 88.8 0.63 1.4E-05 29.6 3.2 52 63-116 1-55 (86)
74 PF13360 PQQ_2: PQQ-like domai 88.7 5.2 0.00011 28.4 8.5 62 19-106 170-231 (238)
75 TIGR03803 Gloeo_Verruco Gloeo_ 88.7 1.3 2.9E-05 23.3 3.9 30 70-110 1-31 (34)
76 KOG1446|consensus 88.7 8.8 0.00019 30.0 9.9 64 12-79 188-252 (311)
77 PRK02888 nitrous-oxide reducta 88.6 1.9 4.1E-05 36.8 6.7 62 9-76 318-391 (635)
78 PRK01029 tolB translocation pr 88.6 8.5 0.00018 31.0 10.4 75 15-111 330-411 (428)
79 PRK04792 tolB translocation pr 88.4 7.9 0.00017 31.3 10.1 57 14-78 308-369 (448)
80 PRK04922 tolB translocation pr 87.9 7 0.00015 31.2 9.5 54 15-76 207-265 (433)
81 COG3204 Uncharacterized protei 87.6 0.68 1.5E-05 36.1 3.3 29 12-40 284-312 (316)
82 smart00284 OLF Olfactomedin-li 87.5 5.3 0.00011 30.4 8.0 57 16-79 132-194 (255)
83 PRK05137 tolB translocation pr 87.1 12 0.00026 29.8 10.4 55 14-76 248-307 (435)
84 COG4946 Uncharacterized protei 87.0 7.1 0.00015 32.7 8.9 57 9-72 399-456 (668)
85 PRK04043 tolB translocation pr 87.0 10 0.00023 30.5 9.9 35 15-49 191-230 (419)
86 KOG0266|consensus 86.8 7.1 0.00015 31.6 9.0 61 12-79 247-308 (456)
87 PRK03629 tolB translocation pr 86.7 14 0.00031 29.6 10.6 53 15-75 246-303 (429)
88 TIGR02800 propeller_TolB tol-p 86.4 14 0.0003 28.8 10.5 55 16-78 282-341 (417)
89 PRK05137 tolB translocation pr 86.3 11 0.00024 30.0 9.8 55 14-76 204-263 (435)
90 KOG3567|consensus 85.9 1.2 2.6E-05 36.6 4.0 26 55-80 462-487 (501)
91 PF07494 Reg_prop: Two compone 85.6 0.96 2.1E-05 21.6 2.1 18 60-77 5-22 (24)
92 PF14517 Tachylectin: Tachylec 85.5 1.3 2.9E-05 33.2 3.8 65 16-80 85-150 (229)
93 cd00216 PQQ_DH Dehydrogenases 85.4 4.8 0.0001 32.9 7.4 23 22-44 61-83 (488)
94 PRK02889 tolB translocation pr 85.3 16 0.00035 29.2 10.2 34 15-48 199-236 (427)
95 TIGR02800 propeller_TolB tol-p 85.3 15 0.00032 28.7 9.9 56 15-78 193-253 (417)
96 PRK00178 tolB translocation pr 85.1 17 0.00037 28.7 10.7 54 15-76 246-304 (430)
97 PRK11138 outer membrane biogen 84.9 6.8 0.00015 30.8 7.8 23 22-44 256-278 (394)
98 PRK04922 tolB translocation pr 84.8 18 0.00039 28.9 10.3 56 15-78 295-355 (433)
99 PRK01742 tolB translocation pr 84.7 19 0.0004 28.8 10.7 54 15-76 251-309 (429)
100 KOG0272|consensus 84.5 5.6 0.00012 32.5 7.1 60 14-80 306-366 (459)
101 PRK12690 flgF flagellar basal 84.4 9.5 0.00021 28.6 8.0 64 11-75 75-148 (238)
102 TIGR03075 PQQ_enz_alc_DH PQQ-d 84.0 7.4 0.00016 32.4 7.9 23 22-44 69-91 (527)
103 PRK00178 tolB translocation pr 83.8 18 0.00039 28.6 9.9 34 15-48 202-239 (430)
104 PRK11138 outer membrane biogen 83.6 6.8 0.00015 30.8 7.3 28 16-44 289-316 (394)
105 PRK03629 tolB translocation pr 83.6 21 0.00046 28.6 10.7 56 14-77 289-349 (429)
106 PRK04043 tolB translocation pr 83.6 18 0.00039 29.1 9.8 56 16-79 237-297 (419)
107 KOG0319|consensus 83.2 5.2 0.00011 34.7 6.7 62 12-80 22-83 (775)
108 PF14583 Pectate_lyase22: Olig 83.2 3.6 7.7E-05 33.2 5.6 93 16-113 85-186 (386)
109 PRK12689 flgF flagellar basal 83.0 12 0.00027 28.2 8.2 65 11-77 81-155 (253)
110 PF14269 Arylsulfotran_2: Aryl 83.0 18 0.00039 27.9 9.3 68 21-106 210-290 (299)
111 smart00564 PQQ beta-propeller 82.9 2.7 5.8E-05 20.8 3.3 24 21-44 5-28 (33)
112 PRK02888 nitrous-oxide reducta 82.8 6.5 0.00014 33.7 7.2 21 9-29 232-253 (635)
113 COG1520 FOG: WD40-like repeat 82.7 8.7 0.00019 29.9 7.6 58 18-80 64-121 (370)
114 PRK12636 flgG flagellar basal 81.8 12 0.00026 28.3 7.7 67 11-78 88-164 (263)
115 COG3292 Predicted periplasmic 81.4 8.5 0.00018 32.8 7.2 62 15-81 250-311 (671)
116 KOG4649|consensus 81.0 15 0.00033 28.6 8.0 54 19-79 101-156 (354)
117 smart00284 OLF Olfactomedin-li 80.8 19 0.00041 27.4 8.5 57 22-79 83-149 (255)
118 TIGR03074 PQQ_membr_DH membran 80.6 12 0.00026 32.8 8.2 23 22-44 194-216 (764)
119 TIGR03300 assembly_YfgL outer 79.9 14 0.00029 28.7 7.8 23 22-44 241-263 (377)
120 PF14583 Pectate_lyase22: Olig 79.4 20 0.00044 28.9 8.6 54 18-78 42-101 (386)
121 PRK01742 tolB translocation pr 79.0 31 0.00067 27.5 9.9 55 14-76 206-265 (429)
122 PF05935 Arylsulfotrans: Aryls 79.0 27 0.0006 28.6 9.5 37 13-49 272-310 (477)
123 PRK12818 flgG flagellar basal 78.7 11 0.00023 28.5 6.6 64 11-75 89-166 (256)
124 COG0823 TolB Periplasmic compo 78.2 15 0.00032 29.9 7.6 34 14-47 240-277 (425)
125 COG3292 Predicted periplasmic 77.7 2.7 5.7E-05 35.7 3.3 59 13-79 166-225 (671)
126 PRK12641 flgF flagellar basal 77.1 27 0.00059 26.3 8.4 63 10-76 71-144 (252)
127 KOG1446|consensus 76.3 15 0.00033 28.7 6.9 62 14-80 143-209 (311)
128 KOG0291|consensus 76.1 35 0.00075 30.2 9.4 56 13-75 16-71 (893)
129 COG4787 FlgF Flagellar basal b 75.9 31 0.00068 25.9 8.3 70 9-80 72-150 (251)
130 PF01011 PQQ: PQQ enzyme repea 75.9 3.8 8.3E-05 21.4 2.6 22 23-44 1-22 (38)
131 TIGR03075 PQQ_enz_alc_DH PQQ-d 75.1 30 0.00065 28.8 8.8 29 15-44 114-142 (527)
132 PRK12819 flgG flagellar basal 74.8 23 0.0005 26.7 7.5 65 11-76 83-161 (257)
133 KOG0278|consensus 74.7 24 0.00051 27.4 7.4 55 19-79 232-287 (334)
134 TIGR03300 assembly_YfgL outer 74.1 22 0.00047 27.5 7.4 28 16-44 60-87 (377)
135 PF06433 Me-amine-dh_H: Methyl 73.8 29 0.00063 27.6 8.0 59 12-78 235-309 (342)
136 PF15416 DUF4623: Domain of un 72.4 29 0.00063 28.0 7.6 70 11-80 182-262 (442)
137 PRK02889 tolB translocation pr 72.0 49 0.0011 26.4 10.2 55 14-76 242-301 (427)
138 PF11161 DUF2944: Protein of u 71.7 23 0.0005 25.8 6.5 52 24-80 76-128 (187)
139 PRK12694 flgG flagellar basal 71.6 36 0.00079 25.6 7.9 61 11-76 89-160 (260)
140 cd00200 WD40 WD40 domain, foun 70.3 32 0.0007 23.6 8.6 60 13-79 221-281 (289)
141 PF02191 OLF: Olfactomedin-lik 70.2 17 0.00037 27.4 5.8 58 16-80 127-190 (250)
142 TIGR02608 delta_60_rpt delta-6 70.1 7.1 0.00015 22.7 3.0 30 14-44 3-40 (55)
143 TIGR03118 PEPCTERM_chp_1 conse 68.9 24 0.00052 27.9 6.4 66 10-79 242-323 (336)
144 PF07433 DUF1513: Protein of u 67.2 54 0.0012 25.7 8.1 51 12-71 217-269 (305)
145 PRK12640 flgF flagellar basal 66.1 26 0.00057 26.3 6.1 64 11-76 74-147 (246)
146 KOG0291|consensus 65.8 28 0.00061 30.7 6.7 59 13-78 437-497 (893)
147 PRK12692 flgG flagellar basal 65.7 57 0.0012 24.7 8.2 58 11-76 89-160 (262)
148 COG1520 FOG: WD40-like repeat 65.6 40 0.00088 26.2 7.3 30 16-45 105-134 (370)
149 cd00200 WD40 WD40 domain, foun 64.2 44 0.00095 22.9 9.1 60 12-78 136-197 (289)
150 KOG4378|consensus 63.6 63 0.0014 27.4 8.1 60 11-79 208-270 (673)
151 PF02393 US22: US22 like; Int 63.4 6 0.00013 25.9 2.0 25 93-117 87-113 (125)
152 PF13570 PQQ_3: PQQ-like domai 62.8 12 0.00026 19.5 2.8 24 16-40 16-39 (40)
153 TIGR03506 FlgEFG_subfam fagell 62.0 61 0.0013 23.9 8.1 66 11-77 74-150 (231)
154 PF08662 eIF2A: Eukaryotic tra 61.8 54 0.0012 23.3 6.9 59 12-79 101-163 (194)
155 KOG0282|consensus 61.8 14 0.00031 30.6 4.1 63 10-79 298-362 (503)
156 COG4993 Gcd Glucose dehydrogen 61.8 54 0.0012 28.5 7.6 23 22-44 214-236 (773)
157 PTZ00421 coronin; Provisional 61.7 92 0.002 25.8 9.6 60 13-79 127-188 (493)
158 KOG2110|consensus 61.5 52 0.0011 26.5 7.1 62 12-78 174-237 (391)
159 KOG0640|consensus 60.8 44 0.00095 26.7 6.5 64 13-80 218-282 (430)
160 KOG0318|consensus 60.5 1.1E+02 0.0023 26.1 9.3 31 9-39 403-433 (603)
161 PRK12693 flgG flagellar basal 58.5 76 0.0016 23.8 7.7 58 11-76 89-160 (261)
162 PRK12817 flgG flagellar basal 58.3 78 0.0017 23.8 7.9 66 11-77 85-166 (260)
163 KOG0643|consensus 58.2 51 0.0011 25.8 6.4 62 12-79 148-210 (327)
164 PRK12642 flgF flagellar basal 58.0 65 0.0014 24.0 6.9 65 10-75 72-146 (241)
165 KOG2106|consensus 57.1 54 0.0012 27.8 6.7 62 11-79 446-511 (626)
166 PF05935 Arylsulfotrans: Aryls 57.1 45 0.00097 27.4 6.4 56 17-78 153-208 (477)
167 PF08309 LVIVD: LVIVD repeat; 56.6 29 0.00062 18.9 3.6 28 12-40 2-29 (42)
168 PF11763 DIPSY: Cell-wall adhe 56.5 60 0.0013 22.0 5.8 25 58-82 80-104 (123)
169 KOG1379|consensus 56.5 64 0.0014 25.6 6.8 62 11-78 167-232 (330)
170 PF05567 Neisseria_PilC: Neiss 56.3 21 0.00045 28.0 4.2 54 34-106 183-240 (335)
171 KOG0303|consensus 56.3 1.1E+02 0.0024 25.1 9.6 86 14-106 176-275 (472)
172 PRK12643 flgF flagellar basal 56.2 39 0.00083 24.9 5.3 64 10-76 73-146 (209)
173 PF13964 Kelch_6: Kelch motif 54.4 24 0.00052 19.1 3.2 27 22-48 11-44 (50)
174 KOG0318|consensus 54.3 1.4E+02 0.003 25.5 9.5 66 11-80 190-256 (603)
175 KOG0771|consensus 54.2 46 0.00099 27.0 5.8 60 12-79 145-206 (398)
176 PTZ00486 apyrase Superfamily; 54.1 1.1E+02 0.0024 24.5 8.7 52 22-78 124-181 (352)
177 PRK01029 tolB translocation pr 54.1 78 0.0017 25.5 7.3 34 16-49 189-228 (428)
178 KOG4649|consensus 53.2 47 0.001 26.0 5.5 55 25-81 25-83 (354)
179 PRK12816 flgG flagellar basal 53.1 46 0.001 25.2 5.5 58 11-76 89-162 (264)
180 KOG3881|consensus 52.8 75 0.0016 25.9 6.7 63 13-81 204-270 (412)
181 KOG1274|consensus 52.8 93 0.002 28.0 7.8 28 16-43 101-129 (933)
182 TIGR02488 flgG_G_neg flagellar 52.5 98 0.0021 23.2 8.1 58 11-76 87-158 (259)
183 PF06079 Apyrase: Apyrase; In 51.0 90 0.0019 24.3 6.8 52 22-78 63-118 (291)
184 KOG4441|consensus 50.7 76 0.0016 26.8 6.9 57 14-79 325-389 (571)
185 KOG1274|consensus 50.2 1.1E+02 0.0024 27.5 7.9 77 12-88 139-221 (933)
186 PF14220 DUF4329: Domain of un 50.2 12 0.00025 25.4 1.6 14 93-106 98-111 (123)
187 PF00400 WD40: WD domain, G-be 49.9 33 0.00071 17.0 3.4 26 11-36 11-36 (39)
188 PHA03098 kelch-like protein; P 49.6 1.4E+02 0.003 24.4 8.2 72 21-110 436-518 (534)
189 KOG0272|consensus 48.3 39 0.00084 27.8 4.6 62 12-80 262-324 (459)
190 PF03178 CPSF_A: CPSF A subuni 48.3 83 0.0018 23.9 6.3 33 16-49 134-168 (321)
191 PF14870 PSII_BNR: Photosynthe 47.1 1.3E+02 0.0029 23.3 7.5 35 12-47 187-221 (302)
192 cd05845 Ig2_L1-CAM_like Second 46.9 18 0.00039 23.2 2.1 17 63-79 50-66 (95)
193 PF12275 DUF3616: Protein of u 46.4 64 0.0014 25.5 5.5 66 14-80 2-78 (330)
194 PF00930 DPPIV_N: Dipeptidyl p 46.2 1.4E+02 0.003 23.1 8.2 58 17-77 189-255 (353)
195 KOG0266|consensus 45.7 1.1E+02 0.0024 24.7 7.0 63 12-80 289-355 (456)
196 KOG2139|consensus 45.5 86 0.0019 25.5 6.0 57 16-79 200-258 (445)
197 PF14870 PSII_BNR: Photosynthe 44.7 1.5E+02 0.0032 23.1 8.7 93 11-106 103-207 (302)
198 cd00216 PQQ_DH Dehydrogenases 44.2 1.5E+02 0.0033 24.2 7.6 30 16-45 104-133 (488)
199 PF05694 SBP56: 56kDa selenium 44.0 94 0.002 25.8 6.2 68 12-79 312-394 (461)
200 PHA03219 nuclear protein UL24; 42.8 20 0.00042 28.1 2.1 24 94-117 241-266 (300)
201 KOG0288|consensus 42.3 2E+02 0.0043 23.8 9.0 66 10-79 340-407 (459)
202 COG5134 Uncharacterized conser 41.9 33 0.00071 25.8 3.0 31 9-40 83-113 (272)
203 KOG0289|consensus 41.9 1.3E+02 0.0029 25.0 6.7 56 16-78 352-408 (506)
204 PF15533 Toxin_54: Putative to 41.8 22 0.00049 21.4 1.8 17 63-79 37-53 (66)
205 PF00930 DPPIV_N: Dipeptidyl p 41.8 1.1E+02 0.0025 23.6 6.3 53 17-78 286-346 (353)
206 PF01344 Kelch_1: Kelch motif; 41.5 54 0.0012 17.1 3.5 28 22-49 11-45 (47)
207 KOG2096|consensus 41.4 1E+02 0.0022 24.8 5.8 65 9-79 329-393 (420)
208 KOG0772|consensus 41.3 94 0.002 26.5 5.9 61 15-80 368-430 (641)
209 PF14157 YmzC: YmzC-like prote 41.3 24 0.00052 21.1 1.9 14 98-111 42-57 (63)
210 PRK13684 Ycf48-like protein; P 40.8 1.7E+02 0.0037 22.7 7.5 14 65-78 274-287 (334)
211 PRK12691 flgG flagellar basal 40.5 1.6E+02 0.0034 22.2 7.6 57 11-76 89-160 (262)
212 PTZ00420 coronin; Provisional 40.3 2.3E+02 0.005 24.1 9.4 60 12-79 126-187 (568)
213 COG3823 Glutamine cyclotransfe 39.8 83 0.0018 23.8 4.9 62 12-79 131-198 (262)
214 KOG0299|consensus 39.1 94 0.002 25.8 5.5 56 15-76 384-443 (479)
215 PF06970 RepA_N: Replication i 37.5 19 0.00041 22.2 1.0 16 62-78 39-54 (76)
216 PF13511 DUF4124: Domain of un 37.4 25 0.00054 20.0 1.5 15 66-80 18-32 (60)
217 KOG3567|consensus 37.3 57 0.0012 27.2 4.0 67 13-79 169-238 (501)
218 PRK13684 Ycf48-like protein; P 37.0 1.8E+02 0.0039 22.5 6.7 36 12-47 132-167 (334)
219 PHA02713 hypothetical protein; 36.6 2.5E+02 0.0055 23.5 8.0 50 22-78 351-406 (557)
220 KOG4441|consensus 35.6 2.7E+02 0.0059 23.5 8.5 78 16-112 470-555 (571)
221 PF10647 Gmad1: Lipoprotein Lp 35.4 1.2E+02 0.0027 22.5 5.4 54 13-71 67-123 (253)
222 PF08857 ParBc_2: Putative Par 34.7 28 0.00061 24.7 1.7 16 64-79 47-62 (163)
223 KOG0263|consensus 34.6 2.9E+02 0.0062 24.3 7.9 59 11-78 451-512 (707)
224 PLN00033 photosystem II stabil 34.4 2.2E+02 0.0047 23.0 6.9 30 13-43 282-311 (398)
225 PF06462 Hyd_WA: Propeller; I 34.0 42 0.00092 17.0 1.9 14 65-78 4-17 (32)
226 COG4946 Uncharacterized protei 33.8 3E+02 0.0065 23.5 7.9 13 94-106 452-465 (668)
227 PTZ00421 coronin; Provisional 33.8 2.8E+02 0.006 23.0 7.8 32 13-44 170-202 (493)
228 PHA03175 UL43 envelope protein 33.6 25 0.00055 28.3 1.4 24 94-117 335-360 (413)
229 cd00819 PEPCK_GTP Phosphoenolp 33.0 1E+02 0.0022 26.4 4.9 59 17-79 279-338 (579)
230 PHA03344 US22 family homolog; 32.8 30 0.00065 29.9 1.8 26 93-118 280-307 (672)
231 KOG3545|consensus 32.0 1.2E+02 0.0025 23.2 4.7 53 23-79 78-143 (249)
232 KOG2055|consensus 31.4 3.2E+02 0.0069 23.0 8.4 62 11-79 344-407 (514)
233 PLN02258 9-cis-epoxycarotenoid 31.2 3.4E+02 0.0073 23.3 7.9 59 18-82 291-356 (590)
234 KOG0263|consensus 30.0 1.3E+02 0.0027 26.5 5.0 62 12-80 578-640 (707)
235 PF13418 Kelch_4: Galactose ox 29.6 86 0.0019 16.6 2.9 27 21-47 11-44 (49)
236 KOG0268|consensus 28.6 36 0.00078 27.6 1.5 64 10-79 228-292 (433)
237 PF09910 DUF2139: Uncharacteri 28.3 1.4E+02 0.003 23.7 4.6 38 12-49 106-148 (339)
238 KOG0282|consensus 28.0 3E+02 0.0066 23.1 6.7 57 16-80 263-321 (503)
239 PLN00033 photosystem II stabil 27.8 3.3E+02 0.0071 22.0 7.2 24 16-39 243-266 (398)
240 KOG1407|consensus 27.4 3E+02 0.0066 21.5 6.8 27 14-40 192-219 (313)
241 PF13186 SPASM: Iron-sulfur cl 27.1 37 0.00081 18.9 1.1 14 16-29 7-20 (64)
242 PRK04210 phosphoenolpyruvate c 26.9 1.7E+02 0.0036 25.2 5.2 60 17-79 294-354 (601)
243 KOG1273|consensus 26.8 3E+02 0.0066 22.1 6.3 60 13-79 25-85 (405)
244 COG1692 Calcineurin-like phosp 26.7 57 0.0012 25.0 2.3 32 42-79 167-198 (266)
245 KOG0649|consensus 26.2 1.5E+02 0.0032 23.1 4.4 35 14-48 117-152 (325)
246 KOG2055|consensus 26.1 3E+02 0.0065 23.1 6.4 55 16-74 262-318 (514)
247 PF14398 ATPgrasp_YheCD: YheC/ 25.8 51 0.0011 24.8 1.9 19 63-81 214-232 (262)
248 KOG1215|consensus 25.6 1.3E+02 0.0028 26.5 4.6 58 14-78 525-586 (877)
249 PTZ00420 coronin; Provisional 25.4 3.8E+02 0.0082 22.8 7.1 32 13-44 169-201 (568)
250 PF00821 PEPCK: Phosphoenolpyr 25.2 1.4E+02 0.003 25.6 4.4 57 17-77 281-338 (586)
251 smart00706 TECPR Beta propelle 25.2 74 0.0016 15.9 1.9 11 65-75 22-32 (35)
252 COG4189 Predicted transcriptio 24.5 2.7E+02 0.0058 21.5 5.4 29 43-71 175-203 (308)
253 KOG1963|consensus 24.5 1.8E+02 0.0038 26.0 5.0 55 14-76 254-309 (792)
254 PHA03346 US22 family homolog; 24.3 55 0.0012 27.5 2.0 26 93-118 120-147 (520)
255 KOG0645|consensus 23.9 3.6E+02 0.0078 21.2 7.4 62 15-81 65-127 (312)
256 KOG0973|consensus 23.5 1.9E+02 0.0041 26.3 5.1 65 10-77 170-237 (942)
257 PHA00691 hypothetical protein 23.3 79 0.0017 18.7 2.0 23 93-115 9-32 (68)
258 PF14251 DUF4346: Domain of un 23.0 1.4E+02 0.003 20.2 3.3 16 64-79 44-60 (119)
259 PRK05841 flgE flagellar hook p 22.9 5E+02 0.011 22.5 8.4 26 53-78 468-493 (603)
260 KOG0315|consensus 22.6 3.8E+02 0.0081 20.9 8.0 83 15-119 128-213 (311)
261 KOG0771|consensus 22.4 3.5E+02 0.0076 22.1 6.0 65 12-80 187-253 (398)
262 PF15232 DUF4585: Domain of un 22.2 2.1E+02 0.0045 17.8 4.2 32 16-47 9-44 (75)
263 PF09910 DUF2139: Uncharacteri 22.0 4.1E+02 0.0089 21.1 9.3 64 14-79 58-126 (339)
264 PHA03338 US22 family homolog; 21.8 85 0.0018 24.7 2.4 25 93-117 272-298 (344)
265 PF12216 m04gp34like: Immune e 21.8 84 0.0018 24.2 2.4 21 11-31 134-155 (272)
266 COG4004 Uncharacterized protei 21.7 2.4E+02 0.0051 18.3 4.0 28 16-43 25-53 (96)
267 KOG0271|consensus 21.6 4.7E+02 0.01 21.6 6.7 61 12-79 116-177 (480)
268 PHA02790 Kelch-like protein; P 21.5 4.5E+02 0.0098 21.4 7.8 50 21-77 317-369 (480)
269 PHA03219 nuclear protein UL24; 20.9 74 0.0016 24.9 2.0 26 93-118 94-121 (300)
270 KOG3914|consensus 20.9 2E+02 0.0044 23.4 4.4 56 12-75 108-167 (390)
271 KOG0279|consensus 20.8 3.2E+02 0.007 21.5 5.4 51 12-70 193-244 (315)
272 COG4247 Phy 3-phytase (myo-ino 20.8 1.3E+02 0.0029 23.5 3.3 28 12-39 205-233 (364)
273 PF10584 Proteasome_A_N: Prote 20.7 22 0.00047 17.0 -0.6 8 67-74 8-15 (23)
No 1
>KOG1520|consensus
Probab=99.95 E-value=6.7e-28 Score=187.50 Aligned_cols=109 Identities=41% Similarity=0.706 Sum_probs=102.1
Q ss_pred CcccccCCCCccceEECCCC-cEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCc
Q psy4774 4 GFKYEQQCGRPLGMKFDKNG-ALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKY 82 (120)
Q Consensus 4 ~~~~~~~~g~P~Gl~~d~~G-~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~ 82 (120)
..++++.||||+||+|+.+| +|||||++.|+++|++++|+.+.+++ +.+|+|+.+.|+++|+++|.||||||+.+|
T Consensus 107 ~~~~e~~CGRPLGl~f~~~ggdL~VaDAYlGL~~V~p~g~~a~~l~~---~~~G~~~kf~N~ldI~~~g~vyFTDSSsk~ 183 (376)
T KOG1520|consen 107 SFETEPLCGRPLGIRFDKKGGDLYVADAYLGLLKVGPEGGLAELLAD---EAEGKPFKFLNDLDIDPEGVVYFTDSSSKY 183 (376)
T ss_pred ceecccccCCcceEEeccCCCeEEEEecceeeEEECCCCCcceeccc---cccCeeeeecCceeEcCCCeEEEecccccc
Confidence 78889999999999999998 99999999999999999888777775 789999999999999999999999999999
Q ss_pred ccccceecce-ecCCceEEEEcCCC--ceEecCccc
Q psy4774 83 KLYDGLFDGL-TSGSGSFIKIGPNT--QEVLQKQTD 115 (120)
Q Consensus 83 ~~~~~~~~~~-~~~~G~l~~~d~~~--~~~l~~~L~ 115 (120)
++++++.+++ ..++|||++|||.+ +++|+++|.
T Consensus 184 ~~rd~~~a~l~g~~~GRl~~YD~~tK~~~VLld~L~ 219 (376)
T KOG1520|consen 184 DRRDFVFAALEGDPTGRLFRYDPSTKVTKVLLDGLY 219 (376)
T ss_pred chhheEEeeecCCCccceEEecCcccchhhhhhccc
Confidence 9999999887 68999999999999 889999985
No 2
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=99.52 E-value=6.9e-15 Score=94.70 Aligned_cols=53 Identities=36% Similarity=0.597 Sum_probs=40.4
Q ss_pred CceEEcCC-CCEEEEeCCCCcccccceecce-ecCCceEEEEcCCC--ceEecCccc
Q psy4774 63 NSVTVDSD-GMVYWSDSSTKYKLYDGLFDGL-TSGSGSFIKIGPNT--QEVLQKQTD 115 (120)
Q Consensus 63 ndl~v~~~-G~iy~TDs~~~~~~~~~~~~~~-~~~~G~l~~~d~~~--~~~l~~~L~ 115 (120)
|||+|+++ |.|||||++.+|+++++.++++ ..++|||++|||.+ +++|+++|.
T Consensus 1 ndldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~~L~ 57 (89)
T PF03088_consen 1 NDLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLDGLY 57 (89)
T ss_dssp -EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEEEES
T ss_pred CceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehhCCC
Confidence 79999998 9999999999999999998877 79999999999999 899999986
No 3
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.36 E-value=3.5e-12 Score=94.19 Aligned_cols=92 Identities=35% Similarity=0.457 Sum_probs=69.1
Q ss_pred CCccceEEC-CCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCC-CCcCCCceEEcCCCCEEEEeCCCCccccccee
Q psy4774 12 GRPLGMKFD-KNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGA-KPQIPNSVTVDSDGMVYWSDSSTKYKLYDGLF 89 (120)
Q Consensus 12 g~P~Gl~~d-~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~-~~~~pndl~v~~~G~iy~TDs~~~~~~~~~~~ 89 (120)
..|.|++++ ++|+||||+. .++..+|++++++++++. ...+. +++.|||++++++|+|||||+.....
T Consensus 40 ~~~~G~~~~~~~g~l~v~~~-~~~~~~d~~~g~~~~~~~---~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~------ 109 (246)
T PF08450_consen 40 PGPNGMAFDRPDGRLYVADS-GGIAVVDPDTGKVTVLAD---LPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGA------ 109 (246)
T ss_dssp SSEEEEEEECTTSEEEEEET-TCEEEEETTTTEEEEEEE---EETTCSCTEEEEEEEE-TTS-EEEEEECCBCT------
T ss_pred CCCceEEEEccCCEEEEEEc-CceEEEecCCCcEEEEee---ccCCCcccCCCceEEEcCCCCEEEEecCCCcc------
Confidence 339999999 7899999986 556777999999999986 23344 89999999999999999999974311
Q ss_pred cceecCCceEEEEcCCC-ceEecCccc
Q psy4774 90 DGLTSGSGSFIKIGPNT-QEVLQKQTD 115 (120)
Q Consensus 90 ~~~~~~~G~l~~~d~~~-~~~l~~~L~ 115 (120)
.....|+||++++++ ++++.+++.
T Consensus 110 --~~~~~g~v~~~~~~~~~~~~~~~~~ 134 (246)
T PF08450_consen 110 --SGIDPGSVYRIDPDGKVTVVADGLG 134 (246)
T ss_dssp --TCGGSEEEEEEETTSEEEEEEEEES
T ss_pred --ccccccceEEECCCCeEEEEecCcc
Confidence 011128899998887 777776654
No 4
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=99.19 E-value=2.5e-10 Score=88.22 Aligned_cols=95 Identities=24% Similarity=0.364 Sum_probs=71.7
Q ss_pred cCCCCccceEECCCCcEEEEECCCCEEEEECCCCcE-EEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcccccc
Q psy4774 9 QQCGRPLGMKFDKNGALHVADAYFGLYKVNVTTGQT-EQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKLYDG 87 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~-~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~~~~ 87 (120)
.....++++.++.+|.|++|+. |+.+++++++.. +.++. ..++.+.+.|||+.++++|++||+|+.. +.
T Consensus 64 ~p~~~~~~~~~d~~g~Lv~~~~--g~~~~~~~~~~~~t~~~~---~~~~~~~~r~ND~~v~pdG~~wfgt~~~-~~---- 133 (307)
T COG3386 64 SPGGFSSGALIDAGGRLIACEH--GVRLLDPDTGGKITLLAE---PEDGLPLNRPNDGVVDPDGRIWFGDMGY-FD---- 133 (307)
T ss_pred CCCCcccceeecCCCeEEEEcc--ccEEEeccCCceeEEecc---ccCCCCcCCCCceeEcCCCCEEEeCCCc-cc----
Confidence 3356689999999999999985 555565555555 77775 5678899999999999999999999984 11
Q ss_pred eecceecCCceEEEEcCCC-ceEecCc
Q psy4774 88 LFDGLTSGSGSFIKIGPNT-QEVLQKQ 113 (120)
Q Consensus 88 ~~~~~~~~~G~l~~~d~~~-~~~l~~~ 113 (120)
......++.|+|||++|.+ ++.++.+
T Consensus 134 ~~~~~~~~~G~lyr~~p~g~~~~l~~~ 160 (307)
T COG3386 134 LGKSEERPTGSLYRVDPDGGVVRLLDD 160 (307)
T ss_pred cCccccCCcceEEEEcCCCCEEEeecC
Confidence 1112257899999999977 5555544
No 5
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.92 E-value=5.8e-09 Score=76.95 Aligned_cols=75 Identities=31% Similarity=0.654 Sum_probs=54.8
Q ss_pred CCCccceEECCCCc-EEEEECCCC-EEEEECC--CCcE---EEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcc
Q psy4774 11 CGRPLGMKFDKNGA-LHVADAYFG-LYKVNVT--TGQT---EQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYK 83 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~-l~V~d~~~g-i~~vd~~--~g~~---~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~ 83 (120)
...|+||+++++|+ |||+|...+ |++++.+ ++++ ++++.. .+ ....|.++++|++|+||+++..
T Consensus 133 ~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~----~~-~~g~pDG~~vD~~G~l~va~~~---- 203 (246)
T PF08450_consen 133 LGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDF----PG-GPGYPDGLAVDSDGNLWVADWG---- 203 (246)
T ss_dssp ESSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-----SS-SSCEEEEEEEBTTS-EEEEEET----
T ss_pred cccccceEECCcchheeecccccceeEEEeccccccceeeeeeEEEc----CC-CCcCCCcceEcCCCCEEEEEcC----
Confidence 57899999999995 899998754 9998864 3322 344431 22 2246899999999999999885
Q ss_pred cccceecceecCCceEEEEcCCC
Q psy4774 84 LYDGLFDGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 84 ~~~~~~~~~~~~~G~l~~~d~~~ 106 (120)
.|+|++++|++
T Consensus 204 ------------~~~I~~~~p~G 214 (246)
T PF08450_consen 204 ------------GGRIVVFDPDG 214 (246)
T ss_dssp ------------TTEEEEEETTS
T ss_pred ------------CCEEEEECCCc
Confidence 46888888877
No 6
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.88 E-value=1.7e-08 Score=88.61 Aligned_cols=72 Identities=24% Similarity=0.400 Sum_probs=58.0
Q ss_pred CCCCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccc--cC----CCCCcCCCceEEcCCCCEEEEeCCCC
Q psy4774 10 QCGRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTE--ID----GAKPQIPNSVTVDSDGMVYWSDSSTK 81 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~--~~----g~~~~~pndl~v~~~G~iy~TDs~~~ 81 (120)
....|.|++++++|+|||+|..++ |.++|++++.+..++..+.. .+ ...|+.|++|+++++|+|||+|+.+.
T Consensus 802 ~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn 880 (1057)
T PLN02919 802 LLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNS 880 (1057)
T ss_pred hccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCC
Confidence 456899999999999999999865 88899988888888753210 01 12678999999999999999999753
No 7
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.87 E-value=2.3e-08 Score=87.88 Aligned_cols=77 Identities=21% Similarity=0.287 Sum_probs=58.6
Q ss_pred CcccccCCCCccceEECCCC-cEEEEECCCC-EEEEECCCCcEEEEeccccc---cCC------CCCcCCCceEEcC-CC
Q psy4774 4 GFKYEQQCGRPLGMKFDKNG-ALHVADAYFG-LYKVNVTTGQTEQLISMDTE---IDG------AKPQIPNSVTVDS-DG 71 (120)
Q Consensus 4 ~~~~~~~~g~P~Gl~~d~~G-~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~---~~g------~~~~~pndl~v~~-~G 71 (120)
|..++....+|.||+++++| .|||||..++ |.++|..++.+++++..+.. ..| ..|+.|.+|++++ +|
T Consensus 616 G~~~~a~f~~P~GIavd~~gn~LYVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g 695 (1057)
T PLN02919 616 GSFEDATFNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNE 695 (1057)
T ss_pred CchhccccCCCcEEEEeCCCCEEEEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCC
Confidence 33445567889999999986 4899998755 88899988888888753211 111 2488999999999 68
Q ss_pred CEEEEeCCC
Q psy4774 72 MVYWSDSST 80 (120)
Q Consensus 72 ~iy~TDs~~ 80 (120)
.|||||+..
T Consensus 696 ~LyVad~~~ 704 (1057)
T PLN02919 696 KVYIAMAGQ 704 (1057)
T ss_pred eEEEEECCC
Confidence 999999864
No 8
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=98.81 E-value=6.5e-08 Score=75.94 Aligned_cols=104 Identities=18% Similarity=0.200 Sum_probs=68.7
Q ss_pred CCCCccceEECCCCcEEEEECCCCEEEEE-CC-----CCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCc-
Q psy4774 10 QCGRPLGMKFDKNGALHVADAYFGLYKVN-VT-----TGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKY- 82 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd-~~-----~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~- 82 (120)
....|.||++.++| |||++.. .|+++. .+ +++.++++.........+.+.+|++++++||.|||++.....
T Consensus 70 ~l~~p~Gi~~~~~G-lyV~~~~-~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gpDG~LYv~~G~~~~~ 147 (367)
T TIGR02604 70 ELSMVTGLAVAVGG-VYVATPP-DILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGPDGWLYFNHGNTLAS 147 (367)
T ss_pred CCCCccceeEecCC-EEEeCCC-eEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECCCCCEEEecccCCCc
Confidence 35789999999998 9998654 488883 32 126667765211100114677999999999999999996321
Q ss_pred -ccccceecc-eecCCceEEEEcCCC--ceEecCccc
Q psy4774 83 -KLYDGLFDG-LTSGSGSFIKIGPNT--QEVLQKQTD 115 (120)
Q Consensus 83 -~~~~~~~~~-~~~~~G~l~~~d~~~--~~~l~~~L~ 115 (120)
......... .....|+++|++|++ +++++.++.
T Consensus 148 ~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a~G~r 184 (367)
T TIGR02604 148 KVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVAHGFQ 184 (367)
T ss_pred eeccCCCccCcccccCceEEEEecCCCeEEEEecCcC
Confidence 100111111 123568999999998 888888864
No 9
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=98.79 E-value=5.2e-08 Score=74.43 Aligned_cols=71 Identities=11% Similarity=0.161 Sum_probs=57.5
Q ss_pred CCCCccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCccc
Q psy4774 10 QCGRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKL 84 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~ 84 (120)
.+.+|.||.+++||..||||..++|.|+|+|+.+++.+--.. ...-...|-.+||+.|+||||.+...|++
T Consensus 102 ~Ga~Phgiv~gpdg~~Witd~~~aI~R~dpkt~evt~f~lp~----~~a~~nlet~vfD~~G~lWFt~q~G~yGr 172 (353)
T COG4257 102 SGASPHGIVVGPDGSAWITDTGLAIGRLDPKTLEVTRFPLPL----EHADANLETAVFDPWGNLWFTGQIGAYGR 172 (353)
T ss_pred CCCCCceEEECCCCCeeEecCcceeEEecCcccceEEeeccc----ccCCCcccceeeCCCccEEEeecccccee
Confidence 357899999999999999999999999999999988884211 11122357899999999999999876665
No 10
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=98.70 E-value=1e-07 Score=61.41 Aligned_cols=57 Identities=23% Similarity=0.389 Sum_probs=44.4
Q ss_pred cceEECCC-CcEEEEECCC-----------------C-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EE
Q psy4774 15 LGMKFDKN-GALHVADAYF-----------------G-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VY 74 (120)
Q Consensus 15 ~Gl~~d~~-G~l~V~d~~~-----------------g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy 74 (120)
++++++++ |.+|++|+.. | ++++||.+++.++++. .+.+||||++++|+. |.
T Consensus 1 ndldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~--------~L~fpNGVals~d~~~vl 72 (89)
T PF03088_consen 1 NDLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLD--------GLYFPNGVALSPDESFVL 72 (89)
T ss_dssp -EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEE--------EESSEEEEEE-TTSSEEE
T ss_pred CceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehh--------CCCccCeEEEcCCCCEEE
Confidence 47899999 9999999631 3 9999999999999974 377999999999986 99
Q ss_pred EEeCC
Q psy4774 75 WSDSS 79 (120)
Q Consensus 75 ~TDs~ 79 (120)
|+++.
T Consensus 73 v~Et~ 77 (89)
T PF03088_consen 73 VAETG 77 (89)
T ss_dssp EEEGG
T ss_pred EEecc
Confidence 99996
No 11
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.65 E-value=1.9e-07 Score=72.26 Aligned_cols=66 Identities=27% Similarity=0.455 Sum_probs=53.4
Q ss_pred cccCCCCccceEECCCCcEEEEECC-----C------C-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CE
Q psy4774 7 YEQQCGRPLGMKFDKNGALHVADAY-----F------G-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MV 73 (120)
Q Consensus 7 ~~~~~g~P~Gl~~d~~G~l~V~d~~-----~------g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~i 73 (120)
......|||-..++++|++|+.++. . | |++++|. |.++.++. ..+..||+|+++||| .+
T Consensus 106 ~~~~~~r~ND~~v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~-g~~~~l~~-------~~~~~~NGla~SpDg~tl 177 (307)
T COG3386 106 DGLPLNRPNDGVVDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPD-GGVVRLLD-------DDLTIPNGLAFSPDGKTL 177 (307)
T ss_pred CCCCcCCCCceeEcCCCCEEEeCCCccccCccccCCcceEEEEcCC-CCEEEeec-------CcEEecCceEECCCCCEE
Confidence 3455689999999999999999876 2 3 9999994 66666642 137789999999999 89
Q ss_pred EEEeCCC
Q psy4774 74 YWSDSST 80 (120)
Q Consensus 74 y~TDs~~ 80 (120)
|++|+..
T Consensus 178 y~aDT~~ 184 (307)
T COG3386 178 YVADTPA 184 (307)
T ss_pred EEEeCCC
Confidence 9999964
No 12
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=98.57 E-value=3.2e-07 Score=70.17 Aligned_cols=70 Identities=16% Similarity=0.165 Sum_probs=55.0
Q ss_pred ccccCCCCccceEECCCCcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 6 KYEQQCGRPLGMKFDKNGALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 6 ~~~~~~g~P~Gl~~d~~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
+..++++.|+||...++|++|++..+ +-|.+||+.++..+++... ++ .-+.-..++.|+.|.+|+|+...
T Consensus 183 fpaPqG~gpyGi~atpdGsvwyaslagnaiaridp~~~~aev~p~P----~~-~~~gsRriwsdpig~~wittwg~ 253 (353)
T COG4257 183 FPAPQGGGPYGICATPDGSVWYASLAGNAIARIDPFAGHAEVVPQP----NA-LKAGSRRIWSDPIGRAWITTWGT 253 (353)
T ss_pred eccCCCCCCcceEECCCCcEEEEeccccceEEcccccCCcceecCC----Cc-ccccccccccCccCcEEEeccCC
Confidence 34678899999999999999999765 5699999999977777421 11 12334579999999999998864
No 13
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=98.42 E-value=1e-06 Score=69.19 Aligned_cols=61 Identities=25% Similarity=0.325 Sum_probs=50.9
Q ss_pred CCCccceEECCCCcEEEEECC--------------------CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCC
Q psy4774 11 CGRPLGMKFDKNGALHVADAY--------------------FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSD 70 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~--------------------~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~ 70 (120)
.+.|+++++++||.||+++.. .+|+|++|++++.++++. | +..|++++++++
T Consensus 123 ~~~~~~l~~gpDG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a~------G--~rnp~Gl~~d~~ 194 (367)
T TIGR02604 123 HHSLNSLAWGPDGWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVAH------G--FQNPYGHSVDSW 194 (367)
T ss_pred cccccCceECCCCCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEEec------C--cCCCccceECCC
Confidence 355999999999999998651 139999998888887752 3 678999999999
Q ss_pred CCEEEEeCC
Q psy4774 71 GMVYWSDSS 79 (120)
Q Consensus 71 G~iy~TDs~ 79 (120)
|++|+||..
T Consensus 195 G~l~~tdn~ 203 (367)
T TIGR02604 195 GDVFFCDND 203 (367)
T ss_pred CCEEEEccC
Confidence 999999985
No 14
>KOG4659|consensus
Probab=98.25 E-value=5.8e-06 Score=73.24 Aligned_cols=72 Identities=19% Similarity=0.385 Sum_probs=47.8
Q ss_pred CCCcccccCCCCccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcC-CCCEEEEeCC
Q psy4774 2 FDGFKYEQQCGRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDS-DGMVYWSDSS 79 (120)
Q Consensus 2 ~~~~~~~~~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~-~G~iy~TDs~ 79 (120)
|+|..+-...=.|-.+|..+||+|||-|... |.||.+ +|+++.++.... ..-..--=+|++| +|.||++|+.
T Consensus 355 C~G~a~~~~L~aPvala~a~DGSl~VGDfNy-IRRI~~-dg~v~tIl~L~~----t~~sh~Yy~AvsPvdgtlyvSdp~ 427 (1899)
T KOG4659|consen 355 CEGKADSISLFAPVALAYAPDGSLIVGDFNY-IRRISQ-DGQVSTILTLGL----TDTSHSYYIAVSPVDGTLYVSDPL 427 (1899)
T ss_pred CCCccccceeeceeeEEEcCCCcEEEccchh-eeeecC-CCceEEEEEecC----CCccceeEEEecCcCceEEecCCC
Confidence 8899888888889999999999999988633 777776 477766664310 0111112455565 5666666654
No 15
>KOG4659|consensus
Probab=98.25 E-value=5.3e-06 Score=73.48 Aligned_cols=74 Identities=27% Similarity=0.368 Sum_probs=58.5
Q ss_pred CcccccCCCCccceEECCCCcEEEEECCCCEEEEECCCCcEEEEecccc------------ccCCCCCcCCCceEEcC-C
Q psy4774 4 GFKYEQQCGRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDT------------EIDGAKPQIPNSVTVDS-D 70 (120)
Q Consensus 4 ~~~~~~~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~------------~~~g~~~~~pndl~v~~-~ 70 (120)
+...+.+...|.||++|+.|.||.+|..+ |.+||. +|-++++.+... +....++.||-+|+|+| +
T Consensus 467 alA~dA~L~~PkGIa~dk~g~lYfaD~t~-IR~iD~-~giIstlig~~~~~~~p~~C~~~~kl~~~~leWPT~LaV~Pmd 544 (1899)
T KOG4659|consen 467 ALAQDAQLIFPKGIAFDKMGNLYFADGTR-IRVIDT-TGIISTLIGTTPDQHPPRTCAQITKLVDLQLEWPTSLAVDPMD 544 (1899)
T ss_pred hhcccceeccCCceeEccCCcEEEecccE-EEEecc-CceEEEeccCCCCccCccccccccchhheeeecccceeecCCC
Confidence 45556777899999999999999999744 888886 688888765311 12223678999999999 8
Q ss_pred CCEEEEeCC
Q psy4774 71 GMVYWSDSS 79 (120)
Q Consensus 71 G~iy~TDs~ 79 (120)
+.||+-|..
T Consensus 545 nsl~Vld~n 553 (1899)
T KOG4659|consen 545 NSLLVLDTN 553 (1899)
T ss_pred CeEEEeecc
Confidence 999999986
No 16
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=97.99 E-value=8.6e-06 Score=41.32 Aligned_cols=22 Identities=45% Similarity=0.697 Sum_probs=19.6
Q ss_pred CcCCCceEEcCCCCEEEEeCCC
Q psy4774 59 PQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 59 ~~~pndl~v~~~G~iy~TDs~~ 80 (120)
|..|++|+++++|+||++|+.+
T Consensus 1 f~~P~gvav~~~g~i~VaD~~n 22 (28)
T PF01436_consen 1 FNYPHGVAVDSDGNIYVADSGN 22 (28)
T ss_dssp BSSEEEEEEETTSEEEEEECCC
T ss_pred CcCCcEEEEeCCCCEEEEECCC
Confidence 5689999999999999999863
No 17
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=97.81 E-value=8.4e-05 Score=56.96 Aligned_cols=63 Identities=24% Similarity=0.301 Sum_probs=48.6
Q ss_pred CCccceEECCCCcEEEEECC-CCEEEEECCC----CcEEEEeccccccCCCCCcCCCceEEcC--CCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVADAY-FGLYKVNVTT----GQTEQLISMDTEIDGAKPQIPNSVTVDS--DGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~-~gi~~vd~~~----g~~~~~~~~~~~~~g~~~~~pndl~v~~--~G~iy~TDs~ 79 (120)
+...|+++|++|+||.++.. +.|.+.++++ .+.++++. +.+.+.+|+++.+++ +|.||++-+.
T Consensus 186 ~~s~g~~~D~~G~ly~~~~~~~aI~~w~~~~~~~~~~~~~l~~-----d~~~l~~pd~~~i~~~~~g~L~v~snr 255 (287)
T PF03022_consen 186 SQSDGMAIDPNGNLYFTDVEQNAIGCWDPDGPYTPENFEILAQ-----DPRTLQWPDGLKIDPEGDGYLWVLSNR 255 (287)
T ss_dssp -SECEEEEETTTEEEEEECCCTEEEEEETTTSB-GCCEEEEEE------CC-GSSEEEEEE-T--TS-EEEEE-S
T ss_pred CCCceEEECCCCcEEEecCCCCeEEEEeCCCCcCccchheeEE-----cCceeeccceeeeccccCceEEEEECc
Confidence 46689999999999999987 5699999864 26777763 455699999999999 9999998876
No 18
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=97.74 E-value=5e-05 Score=38.45 Aligned_cols=22 Identities=32% Similarity=0.443 Sum_probs=19.8
Q ss_pred CCccceEECCCCcEEEEECCCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG 33 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g 33 (120)
..|.||+++++|+|||||..++
T Consensus 2 ~~P~gvav~~~g~i~VaD~~n~ 23 (28)
T PF01436_consen 2 NYPHGVAVDSDGNIYVADSGNH 23 (28)
T ss_dssp SSEEEEEEETTSEEEEEECCCT
T ss_pred cCCcEEEEeCCCCEEEEECCCC
Confidence 4799999999999999998765
No 19
>KOG4499|consensus
Probab=97.69 E-value=0.00029 Score=53.19 Aligned_cols=79 Identities=20% Similarity=0.431 Sum_probs=56.7
Q ss_pred CCCccceEECCCC-cEEEEECCC-CE--EEEECCCCcE---EEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcc
Q psy4774 11 CGRPLGMKFDKNG-ALHVADAYF-GL--YKVNVTTGQT---EQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYK 83 (120)
Q Consensus 11 ~g~P~Gl~~d~~G-~l~V~d~~~-gi--~~vd~~~g~~---~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~ 83 (120)
.+-|+||++|.+- .+|+.|.-+ -| +.+|..+|.+ .++...- +.....-..|++++||.+|+||++-..
T Consensus 157 v~IsNgl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlr-k~~~~e~~~PDGm~ID~eG~L~Va~~n---- 231 (310)
T KOG4499|consen 157 VGISNGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLR-KSQPFESLEPDGMTIDTEGNLYVATFN---- 231 (310)
T ss_pred ccCCccccccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEec-cCCCcCCCCCCcceEccCCcEEEEEec----
Confidence 4679999999887 678888763 36 5566777764 2443321 111122347999999999999999886
Q ss_pred cccceecceecCCceEEEEcCCC
Q psy4774 84 LYDGLFDGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 84 ~~~~~~~~~~~~~G~l~~~d~~~ 106 (120)
.|+|+++||.+
T Consensus 232 ------------g~~V~~~dp~t 242 (310)
T KOG4499|consen 232 ------------GGTVQKVDPTT 242 (310)
T ss_pred ------------CcEEEEECCCC
Confidence 47899999887
No 20
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=97.63 E-value=0.00018 Score=54.29 Aligned_cols=68 Identities=21% Similarity=0.368 Sum_probs=39.5
Q ss_pred CCccceEECCC-CcEEEEECC-CCEEEEECCCCcEEEEeccccccCC--CCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 12 GRPLGMKFDKN-GALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDG--AKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 12 g~P~Gl~~d~~-G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g--~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
..|.|+++++. |+|||-... .-|+.+|. +|++...........| ..+.+|-||++|++|+||++.-.+
T Consensus 171 ~d~S~l~~~p~t~~lliLS~es~~l~~~d~-~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsEpN 242 (248)
T PF06977_consen 171 RDLSGLSYDPRTGHLLILSDESRLLLELDR-QGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSEPN 242 (248)
T ss_dssp S---EEEEETTTTEEEEEETTTTEEEEE-T-T--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEETTT
T ss_pred ccccceEEcCCCCeEEEEECCCCeEEEECC-CCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcCCc
Confidence 45899999998 689987554 56999997 5775444433222223 467889999999999999887654
No 21
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=97.60 E-value=0.00056 Score=52.41 Aligned_cols=66 Identities=24% Similarity=0.310 Sum_probs=45.4
Q ss_pred ccceEECCCCcEEEEECCC-------------CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCC------CCEE
Q psy4774 14 PLGMKFDKNGALHVADAYF-------------GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSD------GMVY 74 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~~-------------gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~------G~iy 74 (120)
..++.+|+.|+|||.|.+. .|+.+|.+++++.......... -.+-.+.|+++||.. +.+|
T Consensus 3 V~~v~iD~~~rLWVlD~G~~~~~~~~~~~~~pKLv~~Dl~t~~li~~~~~p~~~-~~~~s~lndl~VD~~~~~~~~~~aY 81 (287)
T PF03022_consen 3 VQRVQIDECGRLWVLDSGRPNGLQPPKQVCPPKLVAFDLKTNQLIRRYPFPPDI-APPDSFLNDLVVDVRDGNCDDGFAY 81 (287)
T ss_dssp EEEEEE-TTSEEEEEE-CCHSSSSTTGHTS--EEEEEETTTTCEEEEEE--CCC-S-TCGGEEEEEEECTTTTS-SEEEE
T ss_pred ccEEEEcCCCCEEEEeCCCcCCCCCCCCCCCcEEEEEECCCCcEEEEEECChHH-cccccccceEEEEccCCCCcceEEE
Confidence 3689999999999999752 2999999988864333321111 114567899999983 4799
Q ss_pred EEeCCC
Q psy4774 75 WSDSST 80 (120)
Q Consensus 75 ~TDs~~ 80 (120)
+||++.
T Consensus 82 ItD~~~ 87 (287)
T PF03022_consen 82 ITDSGG 87 (287)
T ss_dssp EEETTT
T ss_pred EeCCCc
Confidence 999984
No 22
>KOG1520|consensus
Probab=97.47 E-value=0.00021 Score=56.67 Aligned_cols=60 Identities=22% Similarity=0.372 Sum_probs=45.6
Q ss_pred CCccceEECCCCcEEEEECCC-----------------C-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-
Q psy4774 12 GRPLGMKFDKNGALHVADAYF-----------------G-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM- 72 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~-----------------g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~- 72 (120)
-..+++.++++|.+|..|+.. | ++++|+.+...+++.+ .+.+||+++.++|+.
T Consensus 161 kf~N~ldI~~~g~vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD~~tK~~~VLld--------~L~F~NGlaLS~d~sf 232 (376)
T KOG1520|consen 161 KFLNDLDIDPEGVVYFTDSSSKYDRRDFVFAALEGDPTGRLFRYDPSTKVTKVLLD--------GLYFPNGLALSPDGSF 232 (376)
T ss_pred eecCceeEcCCCeEEEeccccccchhheEEeeecCCCccceEEecCcccchhhhhh--------cccccccccCCCCCCE
Confidence 346799999999999998531 2 7778876665556652 588999999999975
Q ss_pred EEEEeCC
Q psy4774 73 VYWSDSS 79 (120)
Q Consensus 73 iy~TDs~ 79 (120)
+.|++..
T Consensus 233 vl~~Et~ 239 (376)
T KOG1520|consen 233 VLVAETT 239 (376)
T ss_pred EEEEeec
Confidence 6677775
No 23
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.34 E-value=0.0011 Score=51.57 Aligned_cols=63 Identities=17% Similarity=0.397 Sum_probs=44.9
Q ss_pred CccceEECCCC-cEEEEECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeCCC
Q psy4774 13 RPLGMKFDKNG-ALHVADAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDSST 80 (120)
Q Consensus 13 ~P~Gl~~d~~G-~l~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs~~ 80 (120)
.|.+|++.++| .|||++.... ++.+|.++|+++.+... ...| ..|.+++++++|+ ||+++...
T Consensus 246 ~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~--~~~G---~~Pr~~~~s~~g~~l~Va~~~s 313 (345)
T PF10282_consen 246 APAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTV--PTGG---KFPRHFAFSPDGRYLYVANQDS 313 (345)
T ss_dssp SEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEE--EESS---SSEEEEEE-TTSSEEEEEETTT
T ss_pred CceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEE--eCCC---CCccEEEEeCCCCEEEEEecCC
Confidence 68899999999 5788886644 66666678888876432 2222 3589999999985 88888763
No 24
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=97.20 E-value=0.0046 Score=48.95 Aligned_cols=62 Identities=31% Similarity=0.439 Sum_probs=47.3
Q ss_pred CCCCccceEECCCC-cEEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeCC
Q psy4774 10 QCGRPLGMKFDKNG-ALHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDSS 79 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G-~l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs~ 79 (120)
.+..|.+++++++| .+||++.. .-+..+|..++++..... .|. .|-+++++++|+ +|++|+.
T Consensus 114 vG~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~-----vG~---~P~~~a~~p~g~~vyv~~~~ 180 (381)
T COG3391 114 VGLGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIP-----VGN---TPTGVAVDPDGNKVYVTNSD 180 (381)
T ss_pred eccCCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEEEe-----cCC---CcceEEECCCCCeEEEEecC
Confidence 34489999999998 99999984 338999988776544421 232 458999999997 9999954
No 25
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=97.00 E-value=0.0059 Score=49.03 Aligned_cols=75 Identities=21% Similarity=0.369 Sum_probs=52.6
Q ss_pred CCccceEECCC-------CcEEEEECC-CCEEEEECCCCcEEEEec-cccccCCCCCcCCCceEEcCCCCEEEEeCCCCc
Q psy4774 12 GRPLGMKFDKN-------GALHVADAY-FGLYKVNVTTGQTEQLIS-MDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKY 82 (120)
Q Consensus 12 g~P~Gl~~d~~-------G~l~V~d~~-~gi~~vd~~~g~~~~~~~-~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~ 82 (120)
--|+||+|=.. |.++|+... ..+++++++ |..+++.. +... +.. .-|-||++.+||.||+||..
T Consensus 314 ~ApsGmaFy~G~~fP~~r~~lfV~~hgsw~~~~~~~~-g~~~~~~~~fl~~-d~~--gR~~dV~v~~DGallv~~D~--- 386 (399)
T COG2133 314 IAPSGMAFYTGDLFPAYRGDLFVGAHGSWPVLRLRPD-GNYKVVLTGFLSG-DLG--GRPRDVAVAPDGALLVLTDQ--- 386 (399)
T ss_pred cccceeEEecCCcCccccCcEEEEeecceeEEEeccC-CCcceEEEEEEec-CCC--CcccceEECCCCeEEEeecC---
Confidence 44799999853 689998765 458888886 44333322 1111 111 45789999999999999996
Q ss_pred ccccceecceecCCceEEEEcCC
Q psy4774 83 KLYDGLFDGLTSGSGSFIKIGPN 105 (120)
Q Consensus 83 ~~~~~~~~~~~~~~G~l~~~d~~ 105 (120)
.+|+|||+.+.
T Consensus 387 ------------~~g~i~Rv~~~ 397 (399)
T COG2133 387 ------------GDGRILRVSYA 397 (399)
T ss_pred ------------CCCeEEEecCC
Confidence 36899999764
No 26
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=96.93 E-value=0.006 Score=50.56 Aligned_cols=70 Identities=17% Similarity=0.314 Sum_probs=49.9
Q ss_pred CCCCccceEECCC-CcEEEEECC-C-------------------CEEEEECCCC-------cEEEEecccc---------
Q psy4774 10 QCGRPLGMKFDKN-GALHVADAY-F-------------------GLYKVNVTTG-------QTEQLISMDT--------- 52 (120)
Q Consensus 10 ~~g~P~Gl~~d~~-G~l~V~d~~-~-------------------gi~~vd~~~g-------~~~~~~~~~~--------- 52 (120)
...||.||++++. |.+|++-.. . +|++++++.+ +++.++..+.
T Consensus 348 ~f~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~ 427 (524)
T PF05787_consen 348 PFDRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGS 427 (524)
T ss_pred cccCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCccccccccc
Confidence 4689999999997 699998421 1 3999998755 5555543211
Q ss_pred -ccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 53 -EIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 53 -~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
......|..|+.|+|+++|+|||..=.
T Consensus 428 ~~~~~~~f~sPDNL~~d~~G~LwI~eD~ 455 (524)
T PF05787_consen 428 NKCDDNGFASPDNLAFDPDGNLWIQEDG 455 (524)
T ss_pred CcccCCCcCCCCceEECCCCCEEEEeCC
Confidence 012347999999999999999975443
No 27
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=96.78 E-value=0.021 Score=45.22 Aligned_cols=66 Identities=14% Similarity=0.052 Sum_probs=46.8
Q ss_pred cCCCCccceEECCCC-cEEEEEC---------CC-CEEEEECCCCcEEEEeccccccCCC----CCcCCCceEEcCCC-C
Q psy4774 9 QQCGRPLGMKFDKNG-ALHVADA---------YF-GLYKVNVTTGQTEQLISMDTEIDGA----KPQIPNSVTVDSDG-M 72 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G-~l~V~d~---------~~-gi~~vd~~~g~~~~~~~~~~~~~g~----~~~~pndl~v~~~G-~ 72 (120)
+.+.+|.|+ +.+|| .||||.. .. -|..+|.++++..-=. ..... -...|+.+++++|| .
T Consensus 44 ~~G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i----~~p~~p~~~~~~~~~~~~ls~dgk~ 118 (352)
T TIGR02658 44 DGGFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADI----ELPEGPRFLVGTYPWMTSLTPDNKT 118 (352)
T ss_pred EccCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEE----ccCCCchhhccCccceEEECCCCCE
Confidence 567799997 99998 7899987 32 2778898877643222 12222 24678899999999 5
Q ss_pred EEEEeCC
Q psy4774 73 VYWSDSS 79 (120)
Q Consensus 73 iy~TDs~ 79 (120)
+||++-.
T Consensus 119 l~V~n~~ 125 (352)
T TIGR02658 119 LLFYQFS 125 (352)
T ss_pred EEEecCC
Confidence 9988744
No 28
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=96.77 E-value=0.0085 Score=46.58 Aligned_cols=73 Identities=16% Similarity=0.348 Sum_probs=47.6
Q ss_pred ccCCCCccceEECCCC-cEEEEECCCC---EEEEECCCCcEEEEeccccccCCC-CCcCCCceEEcCCCC-EEEEeCCC
Q psy4774 8 EQQCGRPLGMKFDKNG-ALHVADAYFG---LYKVNVTTGQTEQLISMDTEIDGA-KPQIPNSVTVDSDGM-VYWSDSST 80 (120)
Q Consensus 8 ~~~~g~P~Gl~~d~~G-~l~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~-~~~~pndl~v~~~G~-iy~TDs~~ 80 (120)
.+.+..|..|+|+++| .+||+....+ ++.++.++|+.+.+.....-..+. .-+.|.+|++++||+ ||+++...
T Consensus 188 ~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~ 266 (345)
T PF10282_consen 188 VPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGS 266 (345)
T ss_dssp CSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTT
T ss_pred cccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccC
Confidence 3557789999999998 6788876643 555564577766554321111121 234789999999995 89999864
No 29
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=96.77 E-value=0.0085 Score=49.96 Aligned_cols=70 Identities=14% Similarity=0.147 Sum_probs=49.9
Q ss_pred CCCCccceEECCC-CcEEEEECC-----------------CCEEEEECCCC-------cEEEEeccccccC---------
Q psy4774 10 QCGRPLGMKFDKN-GALHVADAY-----------------FGLYKVNVTTG-------QTEQLISMDTEID--------- 55 (120)
Q Consensus 10 ~~g~P~Gl~~d~~-G~l~V~d~~-----------------~gi~~vd~~~g-------~~~~~~~~~~~~~--------- 55 (120)
.--||.+|++.+. |++|++... .+|+|+-+.++ +++.++..+....
T Consensus 415 ~mdRpE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~ 494 (616)
T COG3211 415 PMDRPEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANI 494 (616)
T ss_pred cccCccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccCc
Confidence 3468999999997 589997421 13999988776 6777765322111
Q ss_pred -CCCCcCCCceEEcCCCCEEEE-eCC
Q psy4774 56 -GAKPQIPNSVTVDSDGMVYWS-DSS 79 (120)
Q Consensus 56 -g~~~~~pndl~v~~~G~iy~T-Ds~ 79 (120)
..-|+.|++|+||+.|+||+. |.+
T Consensus 495 ~~~~f~~PDnl~fD~~GrLWi~TDg~ 520 (616)
T COG3211 495 NANWFNSPDNLAFDPWGRLWIQTDGS 520 (616)
T ss_pred ccccccCCCceEECCCCCEEEEecCC
Confidence 123888999999999999964 443
No 30
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=96.72 E-value=0.024 Score=44.88 Aligned_cols=69 Identities=19% Similarity=0.257 Sum_probs=48.6
Q ss_pred cCCCCccceEECCCCc-EEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeCCC
Q psy4774 9 QQCGRPLGMKFDKNGA-LHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDSST 80 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~-l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs~~ 80 (120)
..+..|.|++++++|. +||++...+ |..++..+.++.. ... ...-..+..|.+++++++|. +|+++...
T Consensus 157 ~vG~~P~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~v~~-~~~--~~~~~~~~~P~~i~v~~~g~~~yV~~~~~ 228 (381)
T COG3391 157 PVGNTPTGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVR-GSV--GSLVGVGTGPAGIAVDPDGNRVYVANDGS 228 (381)
T ss_pred ecCCCcceEEECCCCCeEEEEecCCCeEEEEeCCCcceec-ccc--ccccccCCCCceEEECCCCCEEEEEeccC
Confidence 3444689999999996 999996644 8888875443332 210 00112567899999999996 99999875
No 31
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=96.71 E-value=0.0058 Score=47.54 Aligned_cols=63 Identities=27% Similarity=0.395 Sum_probs=39.3
Q ss_pred CCccceEECC-------CCcEEEEECCC-CEEEEECCCC-cEEEEeccccccCCCCCc-CCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDK-------NGALHVADAYF-GLYKVNVTTG-QTEQLISMDTEIDGAKPQ-IPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~-------~G~l~V~d~~~-gi~~vd~~~g-~~~~~~~~~~~~~g~~~~-~pndl~v~~~G~iy~TDs~ 79 (120)
.-|.|+.|-+ +|.++|++... .|+++..+++ ++..... .-+ .+. .|-+|++++||.|||+|..
T Consensus 253 ~ap~G~~~y~g~~fp~~~g~~~~~~~~~~~i~~~~~~~~~~~~~~~~----~~~-~~~~r~~~v~~~pDG~Lyv~~d~ 325 (331)
T PF07995_consen 253 SAPTGIIFYRGSAFPEYRGDLFVADYGGGRIWRLDLDEDGSVTEEEE----FLG-GFGGRPRDVAQGPDGALYVSDDS 325 (331)
T ss_dssp --EEEEEEE-SSSSGGGTTEEEEEETTTTEEEEEEEETTEEEEEEEE----ECT-TSSS-EEEEEEETTSEEEEEE-T
T ss_pred cccCceEEECCccCccccCcEEEecCCCCEEEEEeeecCCCccceEE----ccc-cCCCCceEEEEcCCCeEEEEECC
Confidence 4578888763 56799999864 4888876533 3221111 111 222 4679999999999999984
No 32
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=96.69 E-value=0.0076 Score=47.10 Aligned_cols=58 Identities=17% Similarity=0.209 Sum_probs=45.6
Q ss_pred CCCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEE-EEeCC
Q psy4774 11 CGRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVY-WSDSS 79 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy-~TDs~ 79 (120)
...|.+.++. +|+||++|.+.| |.++|+++|+.++++. .+| +|.+|++. |++. ++=|.
T Consensus 202 LsmPhSPRWh-dgrLwvldsgtGev~~vD~~~G~~e~Va~----vpG----~~rGL~f~--G~llvVgmSk 261 (335)
T TIGR03032 202 LSMPHSPRWY-QGKLWLLNSGRGELGYVDPQAGKFQPVAF----LPG----FTRGLAFA--GDFAFVGLSK 261 (335)
T ss_pred ccCCcCCcEe-CCeEEEEECCCCEEEEEcCCCCcEEEEEE----CCC----CCccccee--CCEEEEEecc
Confidence 3567777886 789999999976 9999998899999984 455 67899999 7655 44444
No 33
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.58 E-value=0.035 Score=43.72 Aligned_cols=61 Identities=13% Similarity=0.265 Sum_probs=43.1
Q ss_pred CccceEECCCCc-EEEEECCC---CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 13 RPLGMKFDKNGA-LHVADAYF---GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 13 ~P~Gl~~d~~G~-l~V~d~~~---gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
....|++.++|+ ||+|+-+. .+++||+.+|+++.+... . ..-.+|.++.+++.|++.+.-.
T Consensus 245 ~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~--~---teg~~PR~F~i~~~g~~Liaa~ 309 (346)
T COG2706 245 WAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLELVGIT--P---TEGQFPRDFNINPSGRFLIAAN 309 (346)
T ss_pred ceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEe--c---cCCcCCccceeCCCCCEEEEEc
Confidence 345699999995 57777553 288899988887766432 2 2335689999999987554433
No 34
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=96.55 E-value=0.008 Score=46.78 Aligned_cols=65 Identities=20% Similarity=0.274 Sum_probs=44.4
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECCCCcE-EEEeccccccCCCCCcCCCceEEcCC----CCEEEEeC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVTTGQT-EQLISMDTEIDGAKPQIPNSVTVDSD----GMVYWSDS 78 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~-~~~~~~~~~~~g~~~~~pndl~v~~~----G~iy~TDs 78 (120)
.+|-+|++.|+|+|||++....|+.++. +|.. ..+.... ++.........+|+++++ +.||++-+
T Consensus 2 ~~P~~~a~~pdG~l~v~e~~G~i~~~~~-~g~~~~~v~~~~-~v~~~~~~gllgia~~p~f~~n~~lYv~~t 71 (331)
T PF07995_consen 2 NNPRSMAFLPDGRLLVAERSGRIWVVDK-DGSLKTPVADLP-EVFADGERGLLGIAFHPDFASNGYLYVYYT 71 (331)
T ss_dssp SSEEEEEEETTSCEEEEETTTEEEEEET-TTEECEEEEE-T-TTBTSTTBSEEEEEE-TTCCCC-EEEEEEE
T ss_pred CCceEEEEeCCCcEEEEeCCceEEEEeC-CCcCcceecccc-cccccccCCcccceeccccCCCCEEEEEEE
Confidence 4799999999999999988444888884 5665 4444321 222333445679999995 88998776
No 35
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.51 E-value=0.033 Score=42.52 Aligned_cols=71 Identities=15% Similarity=0.245 Sum_probs=43.4
Q ss_pred cCCCCccceEECCCC-cEEEEECCCC-E--EEEECCCCcEEEEeccccccC-CCCCcCCCceEEcCCCC-EEEEeCC
Q psy4774 9 QQCGRPLGMKFDKNG-ALHVADAYFG-L--YKVNVTTGQTEQLISMDTEID-GAKPQIPNSVTVDSDGM-VYWSDSS 79 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G-~l~V~d~~~g-i--~~vd~~~g~~~~~~~~~~~~~-g~~~~~pndl~v~~~G~-iy~TDs~ 79 (120)
+.+..|.+++++++| .+||++...+ | +.++..+++.+.+.....-.. .....++.+++++|+|. +|+++..
T Consensus 172 ~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~ 248 (330)
T PRK11028 172 VEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRT 248 (330)
T ss_pred CCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCC
Confidence 346679999999998 5688876533 4 445543566554432110001 01123567899999985 8888764
No 36
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=96.48 E-value=0.016 Score=47.40 Aligned_cols=68 Identities=18% Similarity=0.277 Sum_probs=46.3
Q ss_pred CCCCccceEECCCCcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCC-------CCEEEEe
Q psy4774 10 QCGRPLGMKFDKNGALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSD-------GMVYWSD 77 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~-------G~iy~TD 77 (120)
-...|-+|++.++|+|||++-. ..|++++++++..+.+..............+.+|+++|+ +.||++=
T Consensus 28 GL~~Pw~maflPDG~llVtER~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~~lYvsy 103 (454)
T TIGR03606 28 GLNKPWALLWGPDNQLWVTERATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNPYVYISY 103 (454)
T ss_pred CCCCceEEEEcCCCeEEEEEecCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCcEEEEEE
Confidence 3578999999999999999874 459999876666555543211111112445679999865 4799884
No 37
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=96.39 E-value=0.0093 Score=31.73 Aligned_cols=23 Identities=35% Similarity=0.570 Sum_probs=19.8
Q ss_pred CCcCCCceEEcCC-CCEEEEeCCC
Q psy4774 58 KPQIPNSVTVDSD-GMVYWSDSST 80 (120)
Q Consensus 58 ~~~~pndl~v~~~-G~iy~TDs~~ 80 (120)
.+..||++++|+. +.||+||+..
T Consensus 7 ~~~~~~~la~d~~~~~lYw~D~~~ 30 (43)
T smart00135 7 GLGHPNGLAVDWIEGRLYWTDWGL 30 (43)
T ss_pred CCCCcCEEEEeecCCEEEEEeCCC
Confidence 5678999999996 6899999973
No 38
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=96.39 E-value=0.044 Score=43.42 Aligned_cols=60 Identities=22% Similarity=0.287 Sum_probs=42.6
Q ss_pred CCccc---eEECCCC-cEEEEEC----------CCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC--EEE
Q psy4774 12 GRPLG---MKFDKNG-ALHVADA----------YFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM--VYW 75 (120)
Q Consensus 12 g~P~G---l~~d~~G-~l~V~d~----------~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~--iy~ 75 (120)
=+|.| ++++++| +|||+.. .+.|+.+|.+++++...+. -|. .|.+|++++||. ||+
T Consensus 245 wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~-----vG~---~~~~iavS~Dgkp~lyv 316 (352)
T TIGR02658 245 WRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIE-----LGH---EIDSINVSQDAKPLLYA 316 (352)
T ss_pred cCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEe-----CCC---ceeeEEECCCCCeEEEE
Confidence 46777 9999886 8899531 1359999998887654432 132 478999999975 677
Q ss_pred EeCC
Q psy4774 76 SDSS 79 (120)
Q Consensus 76 TDs~ 79 (120)
|+..
T Consensus 317 tn~~ 320 (352)
T TIGR02658 317 LSTG 320 (352)
T ss_pred eCCC
Confidence 7753
No 39
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.35 E-value=0.071 Score=40.66 Aligned_cols=61 Identities=11% Similarity=0.134 Sum_probs=40.4
Q ss_pred CccceEECCCCc-EEEEECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeCC
Q psy4774 13 RPLGMKFDKNGA-LHVADAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDSS 79 (120)
Q Consensus 13 ~P~Gl~~d~~G~-l~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs~ 79 (120)
.|.+|+++++|+ |||++...+ ++.++.+++..+.+... ..+ ..|.++.++++| .||++...
T Consensus 229 ~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~~---~~~---~~p~~~~~~~dg~~l~va~~~ 294 (330)
T PRK11028 229 WAADIHITPDGRHLYACDRTASLISVFSVSEDGSVLSFEGHQ---PTE---TQPRGFNIDHSGKYLIAAGQK 294 (330)
T ss_pred cceeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEEE---ecc---ccCCceEECCCCCEEEEEEcc
Confidence 456799999995 788865433 55556554444444321 112 468999999998 68888763
No 40
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.16 E-value=0.12 Score=40.81 Aligned_cols=72 Identities=19% Similarity=0.336 Sum_probs=49.3
Q ss_pred cCCCCccceEECCCCcE-EEEECCCC---EEEEECCCCcEEEEeccccccCC-CCCcCCCceEEcCCCC-EEEEeCCC
Q psy4774 9 QQCGRPLGMKFDKNGAL-HVADAYFG---LYKVNVTTGQTEQLISMDTEIDG-AKPQIPNSVTVDSDGM-VYWSDSST 80 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~l-~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g-~~~~~pndl~v~~~G~-iy~TDs~~ 80 (120)
..+..|.-|+|.|+|++ |+...-.+ ++.+++..|+.+.+-...+-.++ ...++..+|.+++||+ ||+++-..
T Consensus 188 ~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~ 265 (346)
T COG2706 188 KPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGH 265 (346)
T ss_pred CCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCC
Confidence 45677999999999977 66544333 77788877887766432111111 1245677999999996 89998763
No 41
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.10 E-value=0.13 Score=37.39 Aligned_cols=60 Identities=22% Similarity=0.433 Sum_probs=39.9
Q ss_pred CCCccceEECCCCc-EEEEECCC-CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeC
Q psy4774 11 CGRPLGMKFDKNGA-LHVADAYF-GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDS 78 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~-l~V~d~~~-gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs 78 (120)
...|.+++++++|+ +|++.... .|..+|.++++...... .+ ..+..++++++|+ +|++.+
T Consensus 30 ~~~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~-----~~---~~~~~~~~~~~g~~l~~~~~ 92 (300)
T TIGR03866 30 GQRPRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLP-----SG---PDPELFALHPNGKILYIANE 92 (300)
T ss_pred CCCCCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEecc-----CC---CCccEEEECCCCCEEEEEcC
Confidence 35588999999996 56776554 47788987776543221 11 1246788999886 667654
No 42
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=96.08 E-value=0.025 Score=42.19 Aligned_cols=58 Identities=12% Similarity=0.216 Sum_probs=40.7
Q ss_pred CCcEEEE-ECCCCEEEEECCCCcEEEEecccc-----ccCCCCCcCCCceEEcCCC-CEEEEeCC
Q psy4774 22 NGALHVA-DAYFGLYKVNVTTGQTEQLISMDT-----EIDGAKPQIPNSVTVDSDG-MVYWSDSS 79 (120)
Q Consensus 22 ~G~l~V~-d~~~gi~~vd~~~g~~~~~~~~~~-----~~~g~~~~~pndl~v~~~G-~iy~TDs~ 79 (120)
+|.||.- -....|.||+|++|++..+.+... ..++...+-+|+||.++++ ++|+|--.
T Consensus 185 dG~lyANVw~t~~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~ 249 (262)
T COG3823 185 DGELYANVWQTTRIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKL 249 (262)
T ss_pred ccEEEEeeeeecceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCc
Confidence 5666642 223569999999999877765311 2233456789999999974 89999765
No 43
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=96.04 E-value=0.051 Score=42.17 Aligned_cols=67 Identities=22% Similarity=0.413 Sum_probs=44.4
Q ss_pred cceEECCC-CcEEEE-ECCCCEEEEECCCCcEEEEeccccccCCC--CCcCCCceEEcCCCCEEEEeCCCCc
Q psy4774 15 LGMKFDKN-GALHVA-DAYFGLYKVNVTTGQTEQLISMDTEIDGA--KPQIPNSVTVDSDGMVYWSDSSTKY 82 (120)
Q Consensus 15 ~Gl~~d~~-G~l~V~-d~~~gi~~vd~~~g~~~~~~~~~~~~~g~--~~~~pndl~v~~~G~iy~TDs~~~~ 82 (120)
.|+.+++. ++|+|- +..+-++.+|.+ |.+.-.........|- ....+-||+.|++|+||++--.+.|
T Consensus 236 Sgl~~~~~~~~LLVLS~ESr~l~Evd~~-G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g~lYIvSEPnlf 306 (316)
T COG3204 236 SGLEFNAITNSLLVLSDESRRLLEVDLS-GEVIELLSLTKGNHGLSSDIPQAEGIAMDDDGNLYIVSEPNLF 306 (316)
T ss_pred ccceecCCCCcEEEEecCCceEEEEecC-CCeeeeEEeccCCCCCcccCCCcceeEECCCCCEEEEecCCcc
Confidence 48899976 588875 555669999985 6643332221122232 4567899999999999977665443
No 44
>KOG4499|consensus
Probab=95.95 E-value=0.074 Score=40.45 Aligned_cols=81 Identities=25% Similarity=0.257 Sum_probs=50.4
Q ss_pred CCccceEECCCCcEEEEECCC-CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCC--CCEEEEeCCCCcccccce
Q psy4774 12 GRPLGMKFDKNGALHVADAYF-GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSD--GMVYWSDSSTKYKLYDGL 88 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~-gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~--G~iy~TDs~~~~~~~~~~ 88 (120)
-.|-|+++|.+|+|||+-... .|+++||.+|++-.-.. . |-......+|... ..+|+|-..- |... .
T Consensus 212 ~~PDGm~ID~eG~L~Va~~ng~~V~~~dp~tGK~L~eik----l---Pt~qitsccFgGkn~d~~yvT~aa~-~~dp--~ 281 (310)
T KOG4499|consen 212 LEPDGMTIDTEGNLYVATFNGGTVQKVDPTTGKILLEIK----L---PTPQITSCCFGGKNLDILYVTTAAK-FDDP--V 281 (310)
T ss_pred CCCCcceEccCCcEEEEEecCcEEEEECCCCCcEEEEEE----c---CCCceEEEEecCCCccEEEEEehhc-ccCc--h
Confidence 458999999999999986553 49999999998522221 1 2223456677754 5677777642 2211 0
Q ss_pred ecceecC-CceEEEEc
Q psy4774 89 FDGLTSG-SGSFIKIG 103 (120)
Q Consensus 89 ~~~~~~~-~G~l~~~d 103 (120)
+ ...+| .|.+|++.
T Consensus 282 ~-~~~~p~aG~iykit 296 (310)
T KOG4499|consen 282 R-TNTDPNAGTIYKIT 296 (310)
T ss_pred h-cccCCCCccEEEec
Confidence 1 11234 67777774
No 45
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=95.82 E-value=0.05 Score=34.70 Aligned_cols=37 Identities=14% Similarity=0.203 Sum_probs=28.0
Q ss_pred EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeCCC
Q psy4774 34 LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDSST 80 (120)
Q Consensus 34 i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs~~ 80 (120)
|+-+|++ ++.++++ .+.+||||++++++ .||++++..
T Consensus 38 Vvyyd~~--~~~~va~--------g~~~aNGI~~s~~~k~lyVa~~~~ 75 (86)
T PF01731_consen 38 VVYYDGK--EVKVVAS--------GFSFANGIAISPDKKYLYVASSLA 75 (86)
T ss_pred EEEEeCC--EeEEeec--------cCCCCceEEEcCCCCEEEEEeccC
Confidence 6667763 4555552 47899999999985 799999974
No 46
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=95.57 E-value=0.0093 Score=32.21 Aligned_cols=21 Identities=29% Similarity=0.531 Sum_probs=18.0
Q ss_pred cCCCceEEcCCCCEEEEeCCC
Q psy4774 60 QIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 60 ~~pndl~v~~~G~iy~TDs~~ 80 (120)
..+++|++|++|+||++-.+.
T Consensus 13 ~~~~~IavD~~GNiYv~G~T~ 33 (38)
T PF06739_consen 13 DYGNGIAVDSNGNIYVTGYTN 33 (38)
T ss_pred eeEEEEEECCCCCEEEEEeec
Confidence 368999999999999987763
No 47
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=95.54 E-value=0.25 Score=40.43 Aligned_cols=93 Identities=9% Similarity=0.170 Sum_probs=55.6
Q ss_pred CCCccceEECCC-------CcEEEEEC----------CCCEEEEECCC--C---cEEEEeccccccCCCCCcCCCceEEc
Q psy4774 11 CGRPLGMKFDKN-------GALHVADA----------YFGLYKVNVTT--G---QTEQLISMDTEIDGAKPQIPNSVTVD 68 (120)
Q Consensus 11 ~g~P~Gl~~d~~-------G~l~V~d~----------~~gi~~vd~~~--g---~~~~~~~~~~~~~g~~~~~pndl~v~ 68 (120)
.+.++||+++|+ +.|||+-. ...|.|+..+. . ..++++. .......+.-..|+|+
T Consensus 78 e~GLlglal~PdF~~~~~n~~lYvsyt~~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~---~lP~~~~H~GgrI~Fg 154 (454)
T TIGR03606 78 HNGLLGLALHPDFMQEKGNPYVYISYTYKNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLA---GLPAGNDHNGGRLVFG 154 (454)
T ss_pred CCceeeEEECCCccccCCCcEEEEEEeccCCCCCccCCcEEEEEEecCCCCccccceEEEe---cCCCCCCcCCceEEEC
Confidence 467899999876 36899831 23487775431 1 1233432 2222335667799999
Q ss_pred CCCCEEEEeCCCCcc------ccccee----------cceecCCceEEEEcCCC
Q psy4774 69 SDGMVYWSDSSTKYK------LYDGLF----------DGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 69 ~~G~iy~TDs~~~~~------~~~~~~----------~~~~~~~G~l~~~d~~~ 106 (120)
+||.||||--..... ...... .......|.|+|+|+++
T Consensus 155 PDG~LYVs~GD~g~~~~~n~~~~~~aQ~~~~~~~~~~~d~~~~~GkILRin~DG 208 (454)
T TIGR03606 155 PDGKIYYTIGEQGRNQGANFFLPNQAQHTPTQQELNGKDYHAYMGKVLRLNLDG 208 (454)
T ss_pred CCCcEEEEECCCCCCCcccccCcchhccccccccccccCcccCceEEEEEcCCC
Confidence 999999987654210 000000 01234679999999887
No 48
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=95.47 E-value=0.052 Score=45.45 Aligned_cols=63 Identities=19% Similarity=0.311 Sum_probs=40.8
Q ss_pred CCCCccceEECCCCcEEEE-ECCC--------CEEEE---ECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEE
Q psy4774 10 QCGRPLGMKFDKNGALHVA-DAYF--------GLYKV---NVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWS 76 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~-d~~~--------gi~~v---d~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~T 76 (120)
..+.|-+|+||+.|+|||| |... |+..+ ++++|++..+.. ...|-.+ -+.+++||| ++|+.
T Consensus 498 ~f~~PDnl~fD~~GrLWi~TDg~~s~~~~~~~G~~~m~~~~p~~g~~~rf~t---~P~g~E~---tG~~FspD~~TlFV~ 571 (616)
T COG3211 498 WFNSPDNLAFDPWGRLWIQTDGSGSTLRNRFRGVTQMLTPDPKTGTIKRFLT---GPIGCEF---TGPCFSPDGKTLFVN 571 (616)
T ss_pred cccCCCceEECCCCCEEEEecCCCCccCcccccccccccCCCccceeeeecc---CCCccee---ecceeCCCCceEEEE
Confidence 3677999999999999998 4321 45555 444555555542 2223333 489999998 56655
Q ss_pred eC
Q psy4774 77 DS 78 (120)
Q Consensus 77 Ds 78 (120)
-.
T Consensus 572 vQ 573 (616)
T COG3211 572 VQ 573 (616)
T ss_pred ec
Confidence 44
No 49
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=95.21 E-value=0.51 Score=34.26 Aligned_cols=59 Identities=20% Similarity=0.208 Sum_probs=40.5
Q ss_pred CCccceEECCCCcE-EEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeC
Q psy4774 12 GRPLGMKFDKNGAL-HVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDS 78 (120)
Q Consensus 12 g~P~Gl~~d~~G~l-~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs 78 (120)
..|.+++++++|+. |++... ..|..+|.+++++..... .+ ..+..++++++|. ||.+..
T Consensus 207 ~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~-----~~---~~~~~~~~~~~g~~l~~~~~ 268 (300)
T TIGR03866 207 VQPVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLL-----VG---QRVWQLAFTPDEKYLLTTNG 268 (300)
T ss_pred CCccceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEE-----eC---CCcceEEECCCCCEEEEEcC
Confidence 35789999999965 666543 458888988777654431 12 2467999999985 566654
No 50
>KOG1214|consensus
Probab=95.13 E-value=0.13 Score=45.04 Aligned_cols=61 Identities=20% Similarity=0.357 Sum_probs=45.3
Q ss_pred CCCccceEECCCC-cEEEEECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcC-CCCEEEEeCCC
Q psy4774 11 CGRPLGMKFDKNG-ALHVADAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDS-DGMVYWSDSST 80 (120)
Q Consensus 11 ~g~P~Gl~~d~~G-~l~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~-~G~iy~TDs~~ 80 (120)
.+.|.|||+|.-+ ++|-+|+... +-++| +.+..+++. . -|..|..|++|+ .|+||+||-..
T Consensus 1067 L~SPEGiAVDh~~Rn~ywtDS~lD~IevA~Ld--G~~rkvLf~-----t--dLVNPR~iv~D~~rgnLYwtDWnR 1132 (1289)
T KOG1214|consen 1067 LISPEGIAVDHIRRNMYWTDSVLDKIEVALLD--GSERKVLFY-----T--DLVNPRAIVVDPIRGNLYWTDWNR 1132 (1289)
T ss_pred CCCccceeeeeccceeeeeccccchhheeecC--CceeeEEEe-----e--cccCcceEEeecccCceeeccccc
Confidence 4779999999887 8999998643 44454 334555653 2 366789999999 68999999864
No 51
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=95.12 E-value=0.16 Score=38.80 Aligned_cols=64 Identities=20% Similarity=0.200 Sum_probs=38.9
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
+..=||+.|. ..||++|....|+.+||++-+...-... ..+|.|....|-|..- +|.||.---.
T Consensus 130 ~EGWGLt~dg-~~Li~SDGS~~L~~~dP~~f~~~~~i~V--~~~g~pv~~LNELE~i-~G~IyANVW~ 193 (264)
T PF05096_consen 130 GEGWGLTSDG-KRLIMSDGSSRLYFLDPETFKEVRTIQV--TDNGRPVSNLNELEYI-NGKIYANVWQ 193 (264)
T ss_dssp SS--EEEECS-SCEEEE-SSSEEEEE-TTT-SEEEEEE---EETTEE---EEEEEEE-TTEEEEEETT
T ss_pred CcceEEEcCC-CEEEEECCccceEEECCcccceEEEEEE--EECCEECCCcEeEEEE-cCEEEEEeCC
Confidence 4556888774 4899999988899999986544322221 2357788888998887 5878766554
No 52
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=94.69 E-value=0.14 Score=27.92 Aligned_cols=40 Identities=23% Similarity=0.287 Sum_probs=30.1
Q ss_pred CcEEEEECCC--CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcC
Q psy4774 23 GALHVADAYF--GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDS 69 (120)
Q Consensus 23 G~l~V~d~~~--gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~ 69 (120)
++||-+|... .|.+.+.++...++++. ..+..|++|+||+
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~-------~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGSNRRTVIS-------DDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTSTSEEEEEE-------SSTSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECCCCCeEEEEE-------CCCCCcCEEEECC
Confidence 4789999874 47778887666677764 1378899999984
No 53
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=94.34 E-value=0.54 Score=36.45 Aligned_cols=66 Identities=14% Similarity=0.284 Sum_probs=41.8
Q ss_pred CccceEECCCCcEEEEECCC-------CEEEEECCCCcE-EEEecccccc-------CCCC-CcCCCceEEcCCCC-EEE
Q psy4774 13 RPLGMKFDKNGALHVADAYF-------GLYKVNVTTGQT-EQLISMDTEI-------DGAK-PQIPNSVTVDSDGM-VYW 75 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~-------gi~~vd~~~g~~-~~~~~~~~~~-------~g~~-~~~pndl~v~~~G~-iy~ 75 (120)
-|.||++.++|.+||++.+. .|++++++ |++ +.+.-. ... .+.+ =...-+|++.++|. ||+
T Consensus 86 D~Egi~~~~~g~~~is~E~~~~~~~~p~I~~~~~~-G~~~~~~~vP-~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~ 163 (326)
T PF13449_consen 86 DPEGIAVPPDGSFWISSEGGRTGGIPPRIRRFDLD-GRVIRRFPVP-AAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFA 163 (326)
T ss_pred ChhHeEEecCCCEEEEeCCccCCCCCCEEEEECCC-CcccceEccc-cccccccCccccccCCCCeEEEEECCCCCEEEE
Confidence 35599998899999998765 49999986 665 333100 011 1111 12234899999998 776
Q ss_pred EeCCC
Q psy4774 76 SDSST 80 (120)
Q Consensus 76 TDs~~ 80 (120)
.--+.
T Consensus 164 ~~E~~ 168 (326)
T PF13449_consen 164 AMESP 168 (326)
T ss_pred EECcc
Confidence 55543
No 54
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=94.18 E-value=0.24 Score=39.21 Aligned_cols=57 Identities=30% Similarity=0.459 Sum_probs=36.5
Q ss_pred ccceEECCCC-cEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeC
Q psy4774 14 PLGMKFDKNG-ALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDS 78 (120)
Q Consensus 14 P~Gl~~d~~G-~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs 78 (120)
+.++++.+|| .+||+.....|..||+.++++.--+ . .| ..|.++++++||. +|+++.
T Consensus 39 h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i----~-~G---~~~~~i~~s~DG~~~~v~n~ 97 (369)
T PF02239_consen 39 HAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATI----K-VG---GNPRGIAVSPDGKYVYVANY 97 (369)
T ss_dssp EEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEE----E--S---SEEEEEEE--TTTEEEEEEE
T ss_pred eeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEE----e-cC---CCcceEEEcCCCCEEEEEec
Confidence 4678899999 5899875445999999887743222 1 12 2468999999996 777765
No 55
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=94.17 E-value=0.16 Score=26.70 Aligned_cols=31 Identities=23% Similarity=0.201 Sum_probs=24.8
Q ss_pred CCCCccceEECCCC-cEEEEECCCC-EEEEECC
Q psy4774 10 QCGRPLGMKFDKNG-ALHVADAYFG-LYKVNVT 40 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G-~l~V~d~~~g-i~~vd~~ 40 (120)
....|+||++|+.+ .||-+|...+ |.+.+.+
T Consensus 7 ~~~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~ 39 (43)
T smart00135 7 GLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLD 39 (43)
T ss_pred CCCCcCEEEEeecCCEEEEEeCCCCEEEEEeCC
Confidence 45789999999996 7899998755 7777764
No 56
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=94.07 E-value=0.052 Score=29.21 Aligned_cols=20 Identities=25% Similarity=0.302 Sum_probs=17.0
Q ss_pred CCccceEECCCCcEEEEECC
Q psy4774 12 GRPLGMKFDKNGALHVADAY 31 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~ 31 (120)
-.|.+|++|++|++||+-..
T Consensus 13 ~~~~~IavD~~GNiYv~G~T 32 (38)
T PF06739_consen 13 DYGNGIAVDSNGNIYVTGYT 32 (38)
T ss_pred eeEEEEEECCCCCEEEEEee
Confidence 36899999999999998543
No 57
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=93.96 E-value=0.41 Score=36.11 Aligned_cols=67 Identities=19% Similarity=0.116 Sum_probs=37.1
Q ss_pred ccceEECCC-CcEEEEECC--CCEEEEEC--CCCcEEEEeccccccCCCCCcCCCceEEcCC-CCEEEEeCCC
Q psy4774 14 PLGMKFDKN-GALHVADAY--FGLYKVNV--TTGQTEQLISMDTEIDGAKPQIPNSVTVDSD-GMVYWSDSST 80 (120)
Q Consensus 14 P~Gl~~d~~-G~l~V~d~~--~gi~~vd~--~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~-G~iy~TDs~~ 80 (120)
--||++|+. ++|||+... .+|+.++. ......+.....-......+.-|.++++++. |++|+....+
T Consensus 120 ~EGla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~es 192 (248)
T PF06977_consen 120 FEGLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSDES 192 (248)
T ss_dssp -EEEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEEEEEETTT
T ss_pred eEEEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeEEEEECCC
Confidence 359999998 588888543 46888875 2233333332100112334556899999985 8999876654
No 58
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=93.58 E-value=0.43 Score=39.71 Aligned_cols=22 Identities=23% Similarity=0.557 Sum_probs=18.9
Q ss_pred ccCCCCccceEECCCCcEEEEE
Q psy4774 8 EQQCGRPLGMKFDKNGALHVAD 29 (120)
Q Consensus 8 ~~~~g~P~Gl~~d~~G~l~V~d 29 (120)
......|-+|+|+++|+||||.
T Consensus 432 ~~~f~sPDNL~~d~~G~LwI~e 453 (524)
T PF05787_consen 432 DNGFASPDNLAFDPDGNLWIQE 453 (524)
T ss_pred CCCcCCCCceEECCCCCEEEEe
Confidence 4556789999999999999993
No 59
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=93.52 E-value=0.25 Score=39.87 Aligned_cols=60 Identities=13% Similarity=0.184 Sum_probs=39.0
Q ss_pred CccceEECCCCcEEEEECCC-------------C-EEEEEC--------CCCcEEEEeccccccCCCCCcCCCceEEcCC
Q psy4774 13 RPLGMKFDKNGALHVADAYF-------------G-LYKVNV--------TTGQTEQLISMDTEIDGAKPQIPNSVTVDSD 70 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~-------------g-i~~vd~--------~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~ 70 (120)
.-.-|+|++||+|||+-... | |++++. .....++++ .-+..|.+++++|.
T Consensus 178 ~g~~l~f~pDG~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a~~~~~d~p~~~~~i~s--------~G~RN~qGl~w~P~ 249 (399)
T COG2133 178 FGGRLVFGPDGKLYVTTGSNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPNSEIWS--------YGHRNPQGLAWHPV 249 (399)
T ss_pred CcccEEECCCCcEEEEeCCCCCcccccCccccccceeeeccCcccccCCCCCCcceEE--------eccCCccceeecCC
Confidence 33459999999999973221 2 445443 222222332 12445779999997
Q ss_pred -CCEEEEeCCC
Q psy4774 71 -GMVYWSDSST 80 (120)
Q Consensus 71 -G~iy~TDs~~ 80 (120)
|.||++|-..
T Consensus 250 tg~Lw~~e~g~ 260 (399)
T COG2133 250 TGALWTTEHGP 260 (399)
T ss_pred CCcEEEEecCC
Confidence 9999999976
No 60
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=93.16 E-value=1.9 Score=33.32 Aligned_cols=88 Identities=15% Similarity=0.238 Sum_probs=56.9
Q ss_pred CccceEECCCCcEEEEECC-CCEEEEECCCCcEEEEeccc--c--ccCCCCCcCCCceEEc----CCCCEEEEeCCCCcc
Q psy4774 13 RPLGMKFDKNGALHVADAY-FGLYKVNVTTGQTEQLISMD--T--EIDGAKPQIPNSVTVD----SDGMVYWSDSSTKYK 83 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~--~--~~~g~~~~~pndl~v~----~~G~iy~TDs~~~~~ 83 (120)
+.+.+..+++|+++|+--. .-|++|++++|++.=..... + +..+..|.+-+|..+- ++++|-+=|..+.-
T Consensus 145 HiNsV~~~~~G~yLiS~R~~~~i~~I~~~tG~I~W~lgG~~~~df~~~~~~f~~QHdar~~~~~~~~~~IslFDN~~~~- 223 (299)
T PF14269_consen 145 HINSVDKDDDGDYLISSRNTSTIYKIDPSTGKIIWRLGGKRNSDFTLPATNFSWQHDARFLNESNDDGTISLFDNANSD- 223 (299)
T ss_pred EeeeeeecCCccEEEEecccCEEEEEECCCCcEEEEeCCCCCCcccccCCcEeeccCCEEeccCCCCCEEEEEcCCCCC-
Confidence 3568899999999987544 55999999899864333210 0 0012237777888887 67777777773210
Q ss_pred cccceecceecCCceEEEEcCCC
Q psy4774 84 LYDGLFDGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 84 ~~~~~~~~~~~~~G~l~~~d~~~ 106 (120)
. .......|+++.+|+.+
T Consensus 224 ---~--~~~~~s~~~v~~ld~~~ 241 (299)
T PF14269_consen 224 ---F--NGTEPSRGLVLELDPET 241 (299)
T ss_pred ---C--CCCcCCCceEEEEECCC
Confidence 0 11145678999999888
No 61
>KOG1214|consensus
Probab=93.05 E-value=0.17 Score=44.42 Aligned_cols=62 Identities=19% Similarity=0.362 Sum_probs=48.9
Q ss_pred CCccceEECCCC-cEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeCCC
Q psy4774 12 GRPLGMKFDKNG-ALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDSST 80 (120)
Q Consensus 12 g~P~Gl~~d~~G-~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs~~ 80 (120)
.-|-||.||=.. .+|-+|.. .-|-+-..++++.+.+++ +.|..|.+|+||--+ ++|+|||..
T Consensus 1025 ~IiVGidfDC~e~mvyWtDv~g~SI~rasL~G~Ep~ti~n-------~~L~SPEGiAVDh~~Rn~ywtDS~l 1089 (1289)
T KOG1214|consen 1025 SIIVGIDFDCRERMVYWTDVAGRSISRASLEGAEPETIVN-------SGLISPEGIAVDHIRRNMYWTDSVL 1089 (1289)
T ss_pred ceeeeeecccccceEEEeecCCCccccccccCCCCceeec-------ccCCCccceeeeeccceeeeecccc
Confidence 457799999776 55668876 448888887788888874 358899999999865 899999974
No 62
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=92.92 E-value=1.5 Score=33.01 Aligned_cols=75 Identities=17% Similarity=0.262 Sum_probs=47.7
Q ss_pred CCccceEECCC-CcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCc-CCCceEEcCC-CCEEEEeCCCCcccccce
Q psy4774 12 GRPLGMKFDKN-GALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQ-IPNSVTVDSD-GMVYWSDSSTKYKLYDGL 88 (120)
Q Consensus 12 g~P~Gl~~d~~-G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~-~pndl~v~~~-G~iy~TDs~~~~~~~~~~ 88 (120)
.+-.||.+-|. |.||-.....+|+.||+.+|..+.+... .. ...+. ..-++.|.|- .+|.|.-+
T Consensus 27 e~l~GID~Rpa~G~LYgl~~~g~lYtIn~~tG~aT~vg~s--~~-~~al~g~~~gvDFNP~aDRlRvvs~---------- 93 (236)
T PF14339_consen 27 ESLVGIDFRPANGQLYGLGSTGRLYTINPATGAATPVGAS--PL-TVALSGTAFGVDFNPAADRLRVVSN---------- 93 (236)
T ss_pred CeEEEEEeecCCCCEEEEeCCCcEEEEECCCCeEEEeecc--cc-cccccCceEEEecCcccCcEEEEcc----------
Confidence 44568888776 7999886667799999999988777210 11 11221 1336666663 56665533
Q ss_pred ecceecCCceEEEEcCCC
Q psy4774 89 FDGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 89 ~~~~~~~~G~l~~~d~~~ 106 (120)
+|+=+|++|++
T Consensus 94 -------~GqNlR~npdt 104 (236)
T PF14339_consen 94 -------TGQNLRLNPDT 104 (236)
T ss_pred -------CCcEEEECCCC
Confidence 35667777776
No 63
>PF14517 Tachylectin: Tachylectin; PDB: 1TL2_A.
Probab=92.70 E-value=0.17 Score=37.84 Aligned_cols=75 Identities=17% Similarity=0.158 Sum_probs=38.6
Q ss_pred ccceEECCCCcEEEEECCCCEEEE-ECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcccccceecce
Q psy4774 14 PLGMKFDKNGALHVADAYFGLYKV-NVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKLYDGLFDGL 92 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~~gi~~v-d~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~~~~~~~~~ 92 (120)
..-+-++++|.||+.+....+++. .|.++. ..+.+....+.+.-...+.-|.+.++|+||..++.
T Consensus 132 f~~vfa~~~GvLY~i~~dg~~~~~~~p~~~~-~~W~~~s~~v~~~gw~~~~~i~~~~~g~L~~V~~~------------- 197 (229)
T PF14517_consen 132 FDAVFAGPNGVLYAITPDGRLYRRYRPDGGS-DRWLSGSGLVGGGGWDSFHFIFFSPDGNLWAVKSN------------- 197 (229)
T ss_dssp EEEEEE-TTS-EEEEETTE-EEEE---SSTT---HHHH-EEEESSSGGGEEEEEE-TTS-EEEE-ET-------------
T ss_pred ceEEEeCCCccEEEEcCCCceEEeCCCCCCC-CccccccceeccCCcccceEEeeCCCCcEEEEecC-------------
Confidence 345778999999988753336766 454332 22221111112223334678999999999998764
Q ss_pred ecCCceEEEEcCCC
Q psy4774 93 TSGSGSFIKIGPNT 106 (120)
Q Consensus 93 ~~~~G~l~~~d~~~ 106 (120)
|+|||+.+..
T Consensus 198 ----G~lyr~~~p~ 207 (229)
T PF14517_consen 198 ----GKLYRGRPPQ 207 (229)
T ss_dssp ----TEEEEES---
T ss_pred ----CEEeccCCcc
Confidence 8999987654
No 64
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=92.61 E-value=0.98 Score=34.54 Aligned_cols=37 Identities=30% Similarity=0.512 Sum_probs=29.0
Q ss_pred CCccceEECCCCcEEEEECCCC---EEEEECCCCcEEEEe
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG---LYKVNVTTGQTEQLI 48 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g---i~~vd~~~g~~~~~~ 48 (120)
.+.-||.+..+|.||.+....| |.++|+++|++..-.
T Consensus 45 aFTQGL~~~~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~ 84 (264)
T PF05096_consen 45 AFTQGLEFLDDGTLYESTGLYGQSSLRKVDLETGKVLQSV 84 (264)
T ss_dssp -EEEEEEEEETTEEEEEECSTTEEEEEEEETTTSSEEEEE
T ss_pred ccCccEEecCCCEEEEeCCCCCcEEEEEEECCCCcEEEEE
Confidence 4455999988899999977665 999999999875444
No 65
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=92.33 E-value=0.83 Score=36.68 Aligned_cols=68 Identities=22% Similarity=0.369 Sum_probs=41.6
Q ss_pred CCCCccceEECCC-CcEEEEECCCCEEEEECC--CCcEEEEeccccccCCCCCc-CCCceEEc--CC--CCEEEEeCCC
Q psy4774 10 QCGRPLGMKFDKN-GALHVADAYFGLYKVNVT--TGQTEQLISMDTEIDGAKPQ-IPNSVTVD--SD--GMVYWSDSST 80 (120)
Q Consensus 10 ~~g~P~Gl~~d~~-G~l~V~d~~~gi~~vd~~--~g~~~~~~~~~~~~~g~~~~-~pndl~v~--~~--G~iy~TDs~~ 80 (120)
..+.|.|+++|.. |.|||++...|||+++.+ .+....++. ...+..|. -.-+|++- .+ |.|.+|+...
T Consensus 206 ~~sQ~EGCVVDDe~g~LYvgEE~~GIW~y~Aep~~~~~~~~v~---~~~g~~l~aDvEGlaly~~~~g~gYLivSsQG~ 281 (381)
T PF02333_consen 206 VGSQPEGCVVDDETGRLYVGEEDVGIWRYDAEPEGGNDRTLVA---SADGDGLVADVEGLALYYGSDGKGYLIVSSQGD 281 (381)
T ss_dssp -SS-EEEEEEETTTTEEEEEETTTEEEEEESSCCC-S--EEEE---EBSSSSB-S-EEEEEEEE-CCC-EEEEEEEGGG
T ss_pred CCCcceEEEEecccCCEEEecCccEEEEEecCCCCCCcceeee---cccccccccCccceEEEecCCCCeEEEEEcCCC
Confidence 3467899999987 799999999999999743 444333332 22343442 23477773 33 5677777754
No 66
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=91.18 E-value=1.3 Score=23.15 Aligned_cols=38 Identities=21% Similarity=0.250 Sum_probs=23.8
Q ss_pred CcEEEEECCC-CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEc
Q psy4774 23 GALHVADAYF-GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVD 68 (120)
Q Consensus 23 G~l~V~d~~~-gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~ 68 (120)
+.|||++... .|..+|+++++...-. .. | ..|.+++++
T Consensus 4 ~~lyv~~~~~~~v~~id~~~~~~~~~i----~v-g---~~P~~i~~~ 42 (42)
T TIGR02276 4 TKLYVTNSGSNTVSVIDTATNKVIATI----PV-G---GYPFGVAVS 42 (42)
T ss_pred CEEEEEeCCCCEEEEEECCCCeEEEEE----EC-C---CCCceEEeC
Confidence 3699998764 4888998776543222 12 2 246777764
No 67
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=91.14 E-value=1.4 Score=34.90 Aligned_cols=37 Identities=19% Similarity=0.230 Sum_probs=23.8
Q ss_pred cCCCCccceEECCCCc-EEEEECC-CCEEEEECCCCcEE
Q psy4774 9 QQCGRPLGMKFDKNGA-LHVADAY-FGLYKVNVTTGQTE 45 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~-l~V~d~~-~gi~~vd~~~g~~~ 45 (120)
..+..|.|+++.++|+ +||++.. ..+..+|.++.++.
T Consensus 75 ~~G~~~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v 113 (369)
T PF02239_consen 75 KVGGNPRGIAVSPDGKYVYVANYEPGTVSVIDAETLEPV 113 (369)
T ss_dssp E-SSEEEEEEE--TTTEEEEEEEETTEEEEEETTT--EE
T ss_pred ecCCCcceEEEcCCCCEEEEEecCCCceeEeccccccce
Confidence 4577899999999996 5666554 55888898776543
No 68
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=91.08 E-value=3.9 Score=31.94 Aligned_cols=65 Identities=20% Similarity=0.147 Sum_probs=42.0
Q ss_pred CCCCccceEECCCCcEEEE-ECCCC--EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeCC
Q psy4774 10 QCGRPLGMKFDKNGALHVA-DAYFG--LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDSS 79 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~-d~~~g--i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs~ 79 (120)
...|..|++..|.....|+ --.-| .+.+|+.+|+...... ..++.-|+ ---+|++||+ ||.|+.-
T Consensus 3 lP~RgH~~a~~p~~~~avafaRRPG~~~~v~D~~~g~~~~~~~---a~~gRHFy--GHg~fs~dG~~LytTEnd 71 (305)
T PF07433_consen 3 LPARGHGVAAHPTRPEAVAFARRPGTFALVFDCRTGQLLQRLW---APPGRHFY--GHGVFSPDGRLLYTTEND 71 (305)
T ss_pred CCccccceeeCCCCCeEEEEEeCCCcEEEEEEcCCCceeeEEc---CCCCCEEe--cCEEEcCCCCEEEEeccc
Confidence 3468889999996544443 22223 8899999988765543 33444443 3567889984 7777664
No 69
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=90.78 E-value=3.2 Score=32.16 Aligned_cols=70 Identities=21% Similarity=0.303 Sum_probs=38.9
Q ss_pred CCCccceEECC-CCcEEEE-ECCC-----CEEEEECCC-----CcEEEEec-cccccCCCCCc----CCCceEEcCCCCE
Q psy4774 11 CGRPLGMKFDK-NGALHVA-DAYF-----GLYKVNVTT-----GQTEQLIS-MDTEIDGAKPQ----IPNSVTVDSDGMV 73 (120)
Q Consensus 11 ~g~P~Gl~~d~-~G~l~V~-d~~~-----gi~~vd~~~-----g~~~~~~~-~~~~~~g~~~~----~pndl~v~~~G~i 73 (120)
.|.=.||.+++ +|.+|+. |.+. .++.+.... +.++..-. .....+|.++. -+-+|++.++|.+
T Consensus 19 ~GGlSgl~~~~~~~~~~avSD~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~G~~~~~~~~D~Egi~~~~~g~~ 98 (326)
T PF13449_consen 19 FGGLSGLDYDPDDGRFYAVSDRGPNKGPPRFYTFRIDYDQGGIGGVTILDMIPLRDPDGQPFPKNGLDPEGIAVPPDGSF 98 (326)
T ss_pred cCcEeeEEEeCCCCEEEEEECCCCCCCCCcEEEEEeeccCCCccceEeccceeccCCCCCcCCcCCCChhHeEEecCCCE
Confidence 34457999994 5566554 5433 255543321 12211100 00123455553 4569999889999
Q ss_pred EEEeCCC
Q psy4774 74 YWSDSST 80 (120)
Q Consensus 74 y~TDs~~ 80 (120)
|+++-..
T Consensus 99 ~is~E~~ 105 (326)
T PF13449_consen 99 WISSEGG 105 (326)
T ss_pred EEEeCCc
Confidence 9999864
No 70
>PRK04792 tolB translocation protein TolB; Provisional
Probab=89.14 E-value=6.3 Score=31.83 Aligned_cols=54 Identities=17% Similarity=0.253 Sum_probs=33.4
Q ss_pred cceEECCCCc-EE-EEECC--CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 15 LGMKFDKNGA-LH-VADAY--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 15 ~Gl~~d~~G~-l~-V~d~~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
...++.|||+ |+ +.+.. ..|+.+|.++|+.+.+.. ..+. ....+++|||. |+++
T Consensus 221 ~~p~wSPDG~~La~~s~~~g~~~L~~~dl~tg~~~~lt~----~~g~----~~~~~wSPDG~~La~~ 279 (448)
T PRK04792 221 MSPAWSPDGRKLAYVSFENRKAEIFVQDIYTQVREKVTS----FPGI----NGAPRFSPDGKKLALV 279 (448)
T ss_pred cCceECCCCCEEEEEEecCCCcEEEEEECCCCCeEEecC----CCCC----cCCeeECCCCCEEEEE
Confidence 4689999995 43 33322 249999998888776643 2221 12567777774 6555
No 71
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=89.11 E-value=3.3 Score=32.66 Aligned_cols=70 Identities=21% Similarity=0.338 Sum_probs=41.2
Q ss_pred CccceEECCCCcE-EEEECC-------------CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 13 RPLGMKFDKNGAL-HVADAY-------------FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 13 ~P~Gl~~d~~G~l-~V~d~~-------------~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
+-||||+. +|+. ||+-.. .|=+.+|..+++ ++++ | |.+|+.--.- +|.|||+|+
T Consensus 153 HLNGlA~~-~g~p~yVTa~~~sD~~~gWR~~~~~gG~vidv~s~e--vl~~------G--LsmPhSPRWh-dgrLwvlds 220 (335)
T TIGR03032 153 HLNGMALD-DGEPRYVTALSQSDVADGWREGRRDGGCVIDIPSGE--VVAS------G--LSMPHSPRWY-QGKLWLLNS 220 (335)
T ss_pred eecceeee-CCeEEEEEEeeccCCcccccccccCCeEEEEeCCCC--EEEc------C--ccCCcCCcEe-CCeEEEEEC
Confidence 36899997 4544 775311 122335655553 4443 2 3344433332 489999999
Q ss_pred CCCcccccceecceecCCceEEEEcCCC--ceEe
Q psy4774 79 STKYKLYDGLFDGLTSGSGSFIKIGPNT--QEVL 110 (120)
Q Consensus 79 ~~~~~~~~~~~~~~~~~~G~l~~~d~~~--~~~l 110 (120)
. +|.|+++|+++ .+++
T Consensus 221 g----------------tGev~~vD~~~G~~e~V 238 (335)
T TIGR03032 221 G----------------RGELGYVDPQAGKFQPV 238 (335)
T ss_pred C----------------CCEEEEEcCCCCcEEEE
Confidence 6 57888888873 4444
No 72
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=89.09 E-value=3.8 Score=29.15 Aligned_cols=27 Identities=26% Similarity=0.383 Sum_probs=22.0
Q ss_pred EECCCCcEEEEECCCCEEEEECCCCcE
Q psy4774 18 KFDKNGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 18 ~~d~~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
.+..++.+|+++....|+.+|+++|++
T Consensus 32 ~~~~~~~v~~~~~~~~l~~~d~~tG~~ 58 (238)
T PF13360_consen 32 AVPDGGRVYVASGDGNLYALDAKTGKV 58 (238)
T ss_dssp EEEETTEEEEEETTSEEEEEETTTSEE
T ss_pred EEEeCCEEEEEcCCCEEEEEECCCCCE
Confidence 554578999998777799999988974
No 73
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=88.82 E-value=0.63 Score=29.60 Aligned_cols=52 Identities=12% Similarity=0.029 Sum_probs=35.2
Q ss_pred CceEEcCCCCEEEEeCCCCccccccee---cceecCCceEEEEcCCCceEecCcccc
Q psy4774 63 NSVTVDSDGMVYWSDSSTKYKLYDGLF---DGLTSGSGSFIKIGPNTQEVLQKQTDK 116 (120)
Q Consensus 63 ndl~v~~~G~iy~TDs~~~~~~~~~~~---~~~~~~~G~l~~~d~~~~~~l~~~L~~ 116 (120)
|||+.-.....|+|.-.. | .+.|+. ..+..+.|.|+.||+++++++++++..
T Consensus 1 NDIvavG~~sFy~TNDhy-f-~~~~l~~lE~~l~~~~~~Vvyyd~~~~~~va~g~~~ 55 (86)
T PF01731_consen 1 NDIVAVGPDSFYVTNDHY-F-TDPFLRLLETYLGLPWGNVVYYDGKEVKVVASGFSF 55 (86)
T ss_pred CCEEEECcCcEEEECchh-h-CcHHHHHHHHHhcCCCceEEEEeCCEeEEeeccCCC
Confidence 677777667899988752 2 222322 223568899999998778888888653
No 74
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=88.70 E-value=5.2 Score=28.40 Aligned_cols=62 Identities=19% Similarity=0.193 Sum_probs=39.2
Q ss_pred ECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcccccceecceecCCce
Q psy4774 19 FDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKLYDGLFDGLTSGSGS 98 (120)
Q Consensus 19 ~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~~~~~~~~~~~~~G~ 98 (120)
+-.+|.+|++.....++.+|.++|+.. + .. . +..+..+....++.||+++. .|+
T Consensus 170 ~~~~~~v~~~~~~g~~~~~d~~tg~~~-w-~~--~-----~~~~~~~~~~~~~~l~~~~~-----------------~~~ 223 (238)
T PF13360_consen 170 VISDGRVYVSSGDGRVVAVDLATGEKL-W-SK--P-----ISGIYSLPSVDGGTLYVTSS-----------------DGR 223 (238)
T ss_dssp ECCTTEEEEECCTSSEEEEETTTTEEE-E-EE--C-----SS-ECECEECCCTEEEEEET-----------------TTE
T ss_pred EEECCEEEEEcCCCeEEEEECCCCCEE-E-Ee--c-----CCCccCCceeeCCEEEEEeC-----------------CCE
Confidence 334568999876655888899888743 3 21 1 22233444556678888884 267
Q ss_pred EEEEcCCC
Q psy4774 99 FIKIGPNT 106 (120)
Q Consensus 99 l~~~d~~~ 106 (120)
|+.+|+.+
T Consensus 224 l~~~d~~t 231 (238)
T PF13360_consen 224 LYALDLKT 231 (238)
T ss_dssp EEEEETTT
T ss_pred EEEEECCC
Confidence 77777665
No 75
>TIGR03803 Gloeo_Verruco Gloeo_Verruco repeat. This model describes a rare protein repeat, found so far in two species of Verrucomicrobia (Chthoniobacter flavus and Verrucomicrobium spinosum) and in four different proteins of Gloeobacter violaceus PCC7421. In the Verrucomicrobial species, the repeat region is followed by a PEP-CTERM protein-sorting signal, suggesting an extracellular location.
Probab=88.70 E-value=1.3 Score=23.27 Aligned_cols=30 Identities=27% Similarity=0.462 Sum_probs=22.3
Q ss_pred CCCEEEEeCCCCcccccceecceecCCceEEEEcCCC-ceEe
Q psy4774 70 DGMVYWSDSSTKYKLYDGLFDGLTSGSGSFIKIGPNT-QEVL 110 (120)
Q Consensus 70 ~G~iy~TDs~~~~~~~~~~~~~~~~~~G~l~~~d~~~-~~~l 110 (120)
+|++|.|-+... ....|-|+|+++.+ +++|
T Consensus 1 dg~lYGTT~~GG-----------~~~~GTvf~~~~~g~~t~L 31 (34)
T TIGR03803 1 GGTLYGTTSGGG-----------ASGFGTLYRLSTAGGTTVL 31 (34)
T ss_pred CCcEEEEcccCC-----------CCCceeEEEEcCCCCeEEE
Confidence 578999888432 23568999999998 7554
No 76
>KOG1446|consensus
Probab=88.66 E-value=8.8 Score=30.01 Aligned_cols=64 Identities=17% Similarity=0.221 Sum_probs=43.5
Q ss_pred CCccceEECCCC-cEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNG-ALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G-~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
.+=..|.|.++| .|+++....-++.+|.=+|.+-.=.. . .....+.|-+-.+.|||...++-+.
T Consensus 188 ~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs---~-~~~~~~~~~~a~ftPds~Fvl~gs~ 252 (311)
T KOG1446|consen 188 AEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFS---G-YPNAGNLPLSATFTPDSKFVLSGSD 252 (311)
T ss_pred cceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEe---e-ccCCCCcceeEEECCCCcEEEEecC
Confidence 345689999999 55667665558888866776322111 1 2224556778899999998888775
No 77
>PRK02888 nitrous-oxide reductase; Validated
Probab=88.60 E-value=1.9 Score=36.84 Aligned_cols=62 Identities=16% Similarity=0.217 Sum_probs=40.0
Q ss_pred cCCCCccceEECCCCc-EEEEECC-CCEEEEECCCCcE---------E-EEeccccccCCCCCcCCCceEEcCCCCEEEE
Q psy4774 9 QQCGRPLGMKFDKNGA-LHVADAY-FGLYKVNVTTGQT---------E-QLISMDTEIDGAKPQIPNSVTVDSDGMVYWS 76 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~-l~V~d~~-~gi~~vd~~~g~~---------~-~~~~~~~~~~g~~~~~pndl~v~~~G~iy~T 76 (120)
+.+.+|-|+++.|||+ +||+... .-+..||.+..+. . +++.. +. | ..|-..+||++|+.|.|
T Consensus 318 PVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~vvaev--ev-G---lGPLHTaFDg~G~ayts 391 (635)
T PRK02888 318 PVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAVVAEP--EL-G---LGPLHTAFDGRGNAYTT 391 (635)
T ss_pred ECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceEEEee--cc-C---CCcceEEECCCCCEEEe
Confidence 5678899999999995 5555443 4477788654331 1 22211 11 2 24668999999999954
No 78
>PRK01029 tolB translocation protein TolB; Provisional
Probab=88.59 E-value=8.5 Score=30.98 Aligned_cols=75 Identities=13% Similarity=0.146 Sum_probs=46.5
Q ss_pred cceEECCCCcEEE-EECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeCCCCccccccee
Q psy4774 15 LGMKFDKNGALHV-ADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDSSTKYKLYDGLF 89 (120)
Q Consensus 15 ~Gl~~d~~G~l~V-~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs~~~~~~~~~~~ 89 (120)
...++.|+|+.++ +... ..|+.+|.++|+.+.+.. + + .......+++||. |+|+-...
T Consensus 330 ~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~------~-~-~~~~~p~wSpDG~~L~f~~~~~--------- 392 (428)
T PRK01029 330 SCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTT------S-P-ENKESPSWAIDSLHLVYSAGNS--------- 392 (428)
T ss_pred cceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccC------C-C-CCccceEECCCCCEEEEEECCC---------
Confidence 4578899996433 3322 248889998888877742 1 1 1235678899984 77665431
Q ss_pred cceecCCceEEEEcCCC--ceEec
Q psy4774 90 DGLTSGSGSFIKIGPNT--QEVLQ 111 (120)
Q Consensus 90 ~~~~~~~G~l~~~d~~~--~~~l~ 111 (120)
....||.+|.++ .+.|.
T Consensus 393 -----g~~~L~~vdl~~g~~~~Lt 411 (428)
T PRK01029 393 -----NESELYLISLITKKTRKIV 411 (428)
T ss_pred -----CCceEEEEECCCCCEEEee
Confidence 235777777665 44443
No 79
>PRK04792 tolB translocation protein TolB; Provisional
Probab=88.38 E-value=7.9 Score=31.29 Aligned_cols=57 Identities=18% Similarity=0.339 Sum_probs=34.3
Q ss_pred ccceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeC
Q psy4774 14 PLGMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDS 78 (120)
Q Consensus 14 P~Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs 78 (120)
....++.++|+ |+++... ..|++++.++|+.+.+.. .+.. ....++++|| .|||+..
T Consensus 308 ~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~-----~g~~---~~~~~~SpDG~~l~~~~~ 369 (448)
T PRK04792 308 DTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTF-----EGEQ---NLGGSITPDGRSMIMVNR 369 (448)
T ss_pred ccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEec-----CCCC---CcCeeECCCCCEEEEEEe
Confidence 34578899985 4333222 349999988888776631 2211 1245778887 4666544
No 80
>PRK04922 tolB translocation protein TolB; Provisional
Probab=87.92 E-value=7 Score=31.21 Aligned_cols=54 Identities=22% Similarity=0.344 Sum_probs=32.7
Q ss_pred cceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 15 LGMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 15 ~Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
...++.++|+ |+.+... ..|+.++.++|+.+.+.. ..+. .....+++||. |+|+
T Consensus 207 ~~p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~----~~g~----~~~~~~SpDG~~l~~~ 265 (433)
T PRK04922 207 LSPAWSPDGKKLAYVSFERGRSAIYVQDLATGQRELVAS----FRGI----NGAPSFSPDGRRLALT 265 (433)
T ss_pred ccccCCCCCCEEEEEecCCCCcEEEEEECCCCCEEEecc----CCCC----ccCceECCCCCEEEEE
Confidence 4678899995 4434322 238999988888777753 1221 12456777764 5554
No 81
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=87.56 E-value=0.68 Score=36.08 Aligned_cols=29 Identities=17% Similarity=0.319 Sum_probs=25.3
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVT 40 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~ 40 (120)
-+|.|+++|.+|+|||+...+..+++.++
T Consensus 284 pqaEGiamDd~g~lYIvSEPnlfy~F~~~ 312 (316)
T COG3204 284 PQAEGIAMDDDGNLYIVSEPNLFYRFTPQ 312 (316)
T ss_pred CCcceeEECCCCCEEEEecCCcceecccC
Confidence 46889999999999999988888888763
No 82
>smart00284 OLF Olfactomedin-like domains.
Probab=87.46 E-value=5.3 Score=30.43 Aligned_cols=57 Identities=26% Similarity=0.353 Sum_probs=36.5
Q ss_pred ceEECCCCcEEEE---ECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 16 GMKFDKNGALHVA---DAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 16 Gl~~d~~G~l~V~---d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
-+|+|..| |||. ....| |-|+||++-+++..... ... +-...|.+.+- |.||+|+|.
T Consensus 132 DlAvDE~G-LWvIYat~~~~g~ivvSkLnp~tL~ve~tW~T--~~~--k~sa~naFmvC--GvLY~~~s~ 194 (255)
T smart00284 132 DLAVDENG-LWVIYATEQNAGKIVISKLNPATLTIENTWIT--TYN--KRSASNAFMIC--GILYVTRSL 194 (255)
T ss_pred EEEEcCCc-eEEEEeccCCCCCEEEEeeCcccceEEEEEEc--CCC--cccccccEEEe--eEEEEEccC
Confidence 58888877 6554 33334 45899987666555542 222 23345777777 899999973
No 83
>PRK05137 tolB translocation protein TolB; Provisional
Probab=87.11 E-value=12 Score=29.84 Aligned_cols=55 Identities=18% Similarity=0.395 Sum_probs=32.8
Q ss_pred ccceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 14 PLGMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 14 P~Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
-...++.|+|+ |+++-.. ..|+.+|.++++...+.. ..+ ......+++||. |+|+
T Consensus 248 ~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~----~~~----~~~~~~~spDG~~i~f~ 307 (435)
T PRK05137 248 TFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRSGTTTRLTD----SPA----IDTSPSYSPDGSQIVFE 307 (435)
T ss_pred ccCcEECCCCCEEEEEEecCCCceEEEEECCCCceEEccC----CCC----ccCceeEcCCCCEEEEE
Confidence 35678999995 4444222 349999998888776642 111 123456677764 4443
No 84
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=86.97 E-value=7.1 Score=32.68 Aligned_cols=57 Identities=14% Similarity=0.277 Sum_probs=39.6
Q ss_pred cCCCCccceEECCCCc-EEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC
Q psy4774 9 QQCGRPLGMKFDKNGA-LHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM 72 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~-l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~ 72 (120)
...|+-..++++++|. +.|++...-|+.+|.++|+++.+-. .+-...-++++.++++
T Consensus 399 ~~lg~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idk-------S~~~lItdf~~~~nsr 456 (668)
T COG4946 399 KDLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDK-------SEYGLITDFDWHPNSR 456 (668)
T ss_pred CCccceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEecc-------cccceeEEEEEcCCce
Confidence 3446666789999997 5555554559999999999887732 1223456888888764
No 85
>PRK04043 tolB translocation protein TolB; Provisional
Probab=86.96 E-value=10 Score=30.51 Aligned_cols=35 Identities=26% Similarity=0.274 Sum_probs=25.4
Q ss_pred cceEECCCCc--EEEEECC---CCEEEEECCCCcEEEEec
Q psy4774 15 LGMKFDKNGA--LHVADAY---FGLYKVNVTTGQTEQLIS 49 (120)
Q Consensus 15 ~Gl~~d~~G~--l~V~d~~---~gi~~vd~~~g~~~~~~~ 49 (120)
....+.|+|+ +|+.... ..|+.+|..+|+.+.+..
T Consensus 191 ~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl~tg~~~~lt~ 230 (419)
T PRK04043 191 IFPKWANKEQTAFYYTSYGERKPTLYKYNLYTGKKEKIAS 230 (419)
T ss_pred EeEEECCCCCcEEEEEEccCCCCEEEEEECCCCcEEEEec
Confidence 3678899985 5554433 239999999999888863
No 86
>KOG0266|consensus
Probab=86.85 E-value=7.1 Score=31.64 Aligned_cols=61 Identities=20% Similarity=0.186 Sum_probs=43.0
Q ss_pred CCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
...+.++|.++|+++++-...+ |...|.++|+...... .+.. ..+.+++.++|++.++-+.
T Consensus 247 ~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~----~hs~---~is~~~f~~d~~~l~s~s~ 308 (456)
T KOG0266|consen 247 TYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLK----GHSD---GISGLAFSPDGNLLVSASY 308 (456)
T ss_pred CceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeee----ccCC---ceEEEEECCCCCEEEEcCC
Confidence 3457899999998888866655 5555777776554442 1222 3578999999998888864
No 87
>PRK03629 tolB translocation protein TolB; Provisional
Probab=86.68 E-value=14 Score=29.55 Aligned_cols=53 Identities=21% Similarity=0.253 Sum_probs=33.6
Q ss_pred cceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEE
Q psy4774 15 LGMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYW 75 (120)
Q Consensus 15 ~Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~ 75 (120)
..+++.|||+ |+++... ..|+.+|.++++.+.+... .. ......++|||. |+|
T Consensus 246 ~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~----~~----~~~~~~wSPDG~~I~f 303 (429)
T PRK03629 246 GAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDG----RS----NNTEPTWFPDSQNLAY 303 (429)
T ss_pred CCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCC----CC----CcCceEECCCCCEEEE
Confidence 3579999995 5444222 2499999988887777431 11 234677888875 433
No 88
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=86.36 E-value=14 Score=28.81 Aligned_cols=55 Identities=15% Similarity=0.265 Sum_probs=33.2
Q ss_pred ceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeC
Q psy4774 16 GMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDS 78 (120)
Q Consensus 16 Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs 78 (120)
..++.++|+ |+++... ..|+.++.++++...+.. .+ .....++++++|. |+++..
T Consensus 282 ~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~-----~~---~~~~~~~~spdg~~i~~~~~ 341 (417)
T TIGR02800 282 EPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTF-----RG---GYNASPSWSPDGDLIAFVHR 341 (417)
T ss_pred CEEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeec-----CC---CCccCeEECCCCCEEEEEEc
Confidence 457888885 4444322 249999988777766642 12 1234667788774 555554
No 89
>PRK05137 tolB translocation protein TolB; Provisional
Probab=86.32 E-value=11 Score=30.03 Aligned_cols=55 Identities=25% Similarity=0.298 Sum_probs=33.0
Q ss_pred ccceEECCCCc-E-EEEECC--CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 14 PLGMKFDKNGA-L-HVADAY--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 14 P~Gl~~d~~G~-l-~V~d~~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
-...++.+||+ | |+.... ..|+.+|.++|+.+.+.. ..+. ....+++|||. |+|+
T Consensus 204 v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~~----~~g~----~~~~~~SPDG~~la~~ 263 (435)
T PRK05137 204 VLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVGN----FPGM----TFAPRFSPDGRKVVMS 263 (435)
T ss_pred eEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEeec----CCCc----ccCcEECCCCCEEEEE
Confidence 34678899995 3 333322 349999998888777653 2221 13556777764 4444
No 90
>KOG3567|consensus
Probab=85.90 E-value=1.2 Score=36.63 Aligned_cols=26 Identities=19% Similarity=0.455 Sum_probs=22.9
Q ss_pred CCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 55 DGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 55 ~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
.+..|..|++|.+|+||..|+||..+
T Consensus 462 g~~~fylphgl~~dkdgf~~~tdvas 487 (501)
T KOG3567|consen 462 GKNLFYLPHGLSIDKDGFYWVTDVAS 487 (501)
T ss_pred cCCceecCCcceecCCCcEEeecccc
Confidence 34479999999999999999999974
No 91
>PF07494 Reg_prop: Two component regulator propeller; InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=85.57 E-value=0.96 Score=21.60 Aligned_cols=18 Identities=28% Similarity=0.296 Sum_probs=13.6
Q ss_pred cCCCceEEcCCCCEEEEe
Q psy4774 60 QIPNSVTVDSDGMVYWSD 77 (120)
Q Consensus 60 ~~pndl~v~~~G~iy~TD 77 (120)
+....|..|++|+|||.-
T Consensus 5 n~I~~i~~D~~G~lWigT 22 (24)
T PF07494_consen 5 NNIYSIYEDSDGNLWIGT 22 (24)
T ss_dssp SCEEEEEE-TTSCEEEEE
T ss_pred CeEEEEEEcCCcCEEEEe
Confidence 455789999999999863
No 92
>PF14517 Tachylectin: Tachylectin; PDB: 1TL2_A.
Probab=85.53 E-value=1.3 Score=33.17 Aligned_cols=65 Identities=20% Similarity=0.225 Sum_probs=32.5
Q ss_pred ceEECCCCcEEEEECCCCEEEEECCCCcEEEEec-cccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 16 GMKFDKNGALHVADAYFGLYKVNVTTGQTEQLIS-MDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~-~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
-|.+++.|-||..+....|+|..+-+.....+.. ....+.+..-+..+-|-++++|.||.-++..
T Consensus 85 ~i~~d~~G~LYaV~~~G~lyR~~~~~~~~~~W~~~~~~~iG~~GW~~f~~vfa~~~GvLY~i~~dg 150 (229)
T PF14517_consen 85 FIFFDPTGVLYAVTPDGKLYRHPRPTNGSDNWIGGSGKKIGGTGWNDFDAVFAGPNGVLYAITPDG 150 (229)
T ss_dssp EEEE-TTS-EEEEETT-EEEEES---STT--HHH-HSEEEE-SSGGGEEEEEE-TTS-EEEEETTE
T ss_pred EEEecCCccEEEeccccceeeccCCCccCcchhhccceecccCCCccceEEEeCCCccEEEEcCCC
Confidence 5889999999988765447777532222122221 1111212233445678888889888777764
No 93
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=85.43 E-value=4.8 Score=32.90 Aligned_cols=23 Identities=17% Similarity=0.442 Sum_probs=19.0
Q ss_pred CCcEEEEECCCCEEEEECCCCcE
Q psy4774 22 NGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 22 ~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
+|.+|+.+....|+.+|.++|++
T Consensus 61 ~g~vy~~~~~g~l~AlD~~tG~~ 83 (488)
T cd00216 61 DGDMYFTTSHSALFALDAATGKV 83 (488)
T ss_pred CCEEEEeCCCCcEEEEECCCChh
Confidence 67899987766699999988874
No 94
>PRK02889 tolB translocation protein TolB; Provisional
Probab=85.31 E-value=16 Score=29.20 Aligned_cols=34 Identities=15% Similarity=0.244 Sum_probs=23.4
Q ss_pred cceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEe
Q psy4774 15 LGMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLI 48 (120)
Q Consensus 15 ~Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~ 48 (120)
...++.|+|+ |+.+... ..|+.+|..+|+...+.
T Consensus 199 ~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l~ 236 (427)
T PRK02889 199 ISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVVA 236 (427)
T ss_pred ccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEee
Confidence 3678999995 4333322 23999999888877775
No 95
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=85.28 E-value=15 Score=28.67 Aligned_cols=56 Identities=23% Similarity=0.279 Sum_probs=33.3
Q ss_pred cceEECCCCcEEE-EECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeC
Q psy4774 15 LGMKFDKNGALHV-ADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDS 78 (120)
Q Consensus 15 ~Gl~~d~~G~l~V-~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs 78 (120)
...++.++|+.++ +... ..|+.++.++|+...+.. ..+. ....++++||. |+++.+
T Consensus 193 ~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~----~~~~----~~~~~~spDg~~l~~~~~ 253 (417)
T TIGR02800 193 LSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKVAS----FPGM----NGAPAFSPDGSKLAVSLS 253 (417)
T ss_pred ecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEeec----CCCC----ccceEECCCCCEEEEEEC
Confidence 3567888986543 3322 238888988887666642 2221 22467788874 666543
No 96
>PRK00178 tolB translocation protein TolB; Provisional
Probab=85.10 E-value=17 Score=28.74 Aligned_cols=54 Identities=15% Similarity=0.254 Sum_probs=32.8
Q ss_pred cceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 15 LGMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 15 ~Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
...++.++|+ |+++-.. ..|+.+|.++++.+.+.. ..+ ......+++||. |||+
T Consensus 246 ~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~----~~~----~~~~~~~spDg~~i~f~ 304 (430)
T PRK00178 246 GAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSRVTN----HPA----IDTEPFWGKDGRTLYFT 304 (430)
T ss_pred CCeEECCCCCEEEEEEccCCCceEEEEECCCCCeEEccc----CCC----CcCCeEECCCCCEEEEE
Confidence 3578999995 4433222 249999998888776642 111 123456777764 5554
No 97
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=84.88 E-value=6.8 Score=30.76 Aligned_cols=23 Identities=17% Similarity=0.222 Sum_probs=18.5
Q ss_pred CCcEEEEECCCCEEEEECCCCcE
Q psy4774 22 NGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 22 ~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
++.+|++.....++.+|+++|+.
T Consensus 256 ~~~vy~~~~~g~l~ald~~tG~~ 278 (394)
T PRK11138 256 GGVVYALAYNGNLVALDLRSGQI 278 (394)
T ss_pred CCEEEEEEcCCeEEEEECCCCCE
Confidence 57899887666699999999974
No 98
>PRK04922 tolB translocation protein TolB; Provisional
Probab=84.81 E-value=18 Score=28.90 Aligned_cols=56 Identities=20% Similarity=0.322 Sum_probs=34.2
Q ss_pred cceEECCCCc-EEEE-ECC--CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeC
Q psy4774 15 LGMKFDKNGA-LHVA-DAY--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDS 78 (120)
Q Consensus 15 ~Gl~~d~~G~-l~V~-d~~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs 78 (120)
...++.++|+ |+++ +.. ..|+.++.++++.+.+.. .+. .....+++++| .|+++..
T Consensus 295 ~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~-----~g~---~~~~~~~SpDG~~Ia~~~~ 355 (433)
T PRK04922 295 TEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTF-----QGN---YNARASVSPDGKKIAMVHG 355 (433)
T ss_pred cceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeec-----CCC---CccCEEECCCCCEEEEEEC
Confidence 4578999996 4433 322 239999987777766632 221 22357888887 4666543
No 99
>PRK01742 tolB translocation protein TolB; Provisional
Probab=84.67 E-value=19 Score=28.80 Aligned_cols=54 Identities=20% Similarity=0.285 Sum_probs=33.4
Q ss_pred cceEECCCCcE-EEEECCC---CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 15 LGMKFDKNGAL-HVADAYF---GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 15 ~Gl~~d~~G~l-~V~d~~~---gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
..+++.|||+. +++.... .|+.+|.++++...+.. ..+ ......+++||. |+|+
T Consensus 251 ~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~----~~~----~~~~~~wSpDG~~i~f~ 309 (429)
T PRK01742 251 GAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTS----GAG----NNTEPSWSPDGQSILFT 309 (429)
T ss_pred CceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeecc----CCC----CcCCEEECCCCCEEEEE
Confidence 46899999964 4443222 38889987787766642 111 234677788875 5554
No 100
>KOG0272|consensus
Probab=84.48 E-value=5.6 Score=32.47 Aligned_cols=60 Identities=22% Similarity=0.315 Sum_probs=42.7
Q ss_pred ccceEECCCCcEEEEECCCCEEEE-ECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 14 PLGMKFDKNGALHVADAYFGLYKV-NVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~~gi~~v-d~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
-.+|+|.+||.|..+.....+-|| |..+|+...+.. | -....-+|+++|+|....|-++-
T Consensus 306 v~~iaf~~DGSL~~tGGlD~~~RvWDlRtgr~im~L~------g-H~k~I~~V~fsPNGy~lATgs~D 366 (459)
T KOG0272|consen 306 VFSIAFQPDGSLAATGGLDSLGRVWDLRTGRCIMFLA------G-HIKEILSVAFSPNGYHLATGSSD 366 (459)
T ss_pred cceeEecCCCceeeccCccchhheeecccCcEEEEec------c-cccceeeEeECCCceEEeecCCC
Confidence 457999999999876443333333 556787776653 2 34456799999999999998874
No 101
>PRK12690 flgF flagellar basal body rod protein FlgF; Reviewed
Probab=84.41 E-value=9.5 Score=28.56 Aligned_cols=64 Identities=23% Similarity=0.292 Sum_probs=38.8
Q ss_pred CCCccceEECCCCcEEEEECC------CCEEEEECCCCcEEEEecc-ccccCCCCCcCC---CceEEcCCCCEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADAY------FGLYKVNVTTGQTEQLISM-DTEIDGAKPQIP---NSVTVDSDGMVYW 75 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~------~gi~~vd~~~g~~~~~~~~-~~~~~g~~~~~p---ndl~v~~~G~iy~ 75 (120)
.++|+-+|++.+|=+-|-+.. .|-+++|. +|.+..-.+. .....|.|+..| ..+.|++||+|+.
T Consensus 75 Tg~~lDlAI~G~GFF~V~~~~G~~yTR~G~F~~d~-~G~Lvt~~G~~vlg~~g~pI~lp~~~~~~~I~~dG~i~~ 148 (238)
T PRK12690 75 TGGQFDFAIEGEGFFMVETPQGERLTRAGSFTPNA-EGELVDPDGNRLLDAGGAPIFIPPDARSVAVGADGTLSA 148 (238)
T ss_pred cCCceeEEECCCcEEEEEcCCCCEEeeCCCeEECC-CCCEEcCCCCEeECCCCCccccCCCCceEEECCCCeEEE
Confidence 478999999888744443311 35667776 4654321110 001234477777 3799999999965
No 102
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=83.99 E-value=7.4 Score=32.37 Aligned_cols=23 Identities=26% Similarity=0.616 Sum_probs=19.3
Q ss_pred CCcEEEEECCCCEEEEECCCCcE
Q psy4774 22 NGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 22 ~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
+|.||+++....|+.+|.++|+.
T Consensus 69 ~g~vyv~s~~g~v~AlDa~TGk~ 91 (527)
T TIGR03075 69 DGVMYVTTSYSRVYALDAKTGKE 91 (527)
T ss_pred CCEEEEECCCCcEEEEECCCCce
Confidence 57999988766799999999974
No 103
>PRK00178 tolB translocation protein TolB; Provisional
Probab=83.84 E-value=18 Score=28.60 Aligned_cols=34 Identities=24% Similarity=0.392 Sum_probs=23.4
Q ss_pred cceEECCCCc-E-EEEECC--CCEEEEECCCCcEEEEe
Q psy4774 15 LGMKFDKNGA-L-HVADAY--FGLYKVNVTTGQTEQLI 48 (120)
Q Consensus 15 ~Gl~~d~~G~-l-~V~d~~--~gi~~vd~~~g~~~~~~ 48 (120)
...++.|+|+ | |++... ..|+.++.++|+.+.+.
T Consensus 202 ~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~~l~ 239 (430)
T PRK00178 202 LSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRREQIT 239 (430)
T ss_pred eeeeECCCCCEEEEEEcCCCCCEEEEEECCCCCEEEcc
Confidence 5678899995 4 333322 24999999888877764
No 104
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=83.60 E-value=6.8 Score=30.78 Aligned_cols=28 Identities=18% Similarity=0.353 Sum_probs=20.1
Q ss_pred ceEECCCCcEEEEECCCCEEEEECCCCcE
Q psy4774 16 GMKFDKNGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
.++.+ +|.||+++....++.+|+++|+.
T Consensus 289 ~~~~~-~~~vy~~~~~g~l~ald~~tG~~ 316 (394)
T PRK11138 289 DFAVD-GGRIYLVDQNDRVYALDTRGGVE 316 (394)
T ss_pred CcEEE-CCEEEEEcCCCeEEEEECCCCcE
Confidence 34443 56888887666699999888863
No 105
>PRK03629 tolB translocation protein TolB; Provisional
Probab=83.60 E-value=21 Score=28.59 Aligned_cols=56 Identities=20% Similarity=0.265 Sum_probs=34.3
Q ss_pred ccceEECCCCcEE--EEECC--CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEe
Q psy4774 14 PLGMKFDKNGALH--VADAY--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSD 77 (120)
Q Consensus 14 P~Gl~~d~~G~l~--V~d~~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TD 77 (120)
.....+.++|+.+ +++.. ..|++++.++++.+.+.. .+. .....++++||. |+++.
T Consensus 289 ~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~-----~~~---~~~~~~~SpDG~~Ia~~~ 349 (429)
T PRK03629 289 NTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITW-----EGS---QNQDADVSSDGKFMVMVS 349 (429)
T ss_pred cCceEECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeec-----CCC---CccCEEECCCCCEEEEEE
Confidence 4568899999633 33322 359999988887776632 121 124577788874 54543
No 106
>PRK04043 tolB translocation protein TolB; Provisional
Probab=83.59 E-value=18 Score=29.15 Aligned_cols=56 Identities=16% Similarity=0.160 Sum_probs=34.4
Q ss_pred ceEECCCC-cEEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeCC
Q psy4774 16 GMKFDKNG-ALHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDSS 79 (120)
Q Consensus 16 Gl~~d~~G-~l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs~ 79 (120)
..++.||| .|++.... ..|+.++.++++.+.+.. ..+. -..-.++||| .|||+-..
T Consensus 237 ~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~----~~~~----d~~p~~SPDG~~I~F~Sdr 297 (419)
T PRK04043 237 VSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITN----YPGI----DVNGNFVEDDKRIVFVSDR 297 (419)
T ss_pred eeEECCCCCEEEEEEccCCCcEEEEEECCCCcEEEccc----CCCc----cCccEECCCCCEEEEEECC
Confidence 35688998 45544332 249999988888777743 1221 1123688888 47777653
No 107
>KOG0319|consensus
Probab=83.22 E-value=5.2 Score=34.71 Aligned_cols=62 Identities=18% Similarity=0.230 Sum_probs=46.1
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
|.| ++++++|.++.|-....|..+|.+++++. + . ....+.+-...++++++|+.+.++-+.+
T Consensus 22 GG~--~~~s~nG~~L~t~~~d~Vi~idv~t~~~~-l-~---s~~~ed~d~ita~~l~~d~~~L~~a~rs 83 (775)
T KOG0319|consen 22 GGP--VAWSSNGQHLYTACGDRVIIIDVATGSIA-L-P---SGSNEDEDEITALALTPDEEVLVTASRS 83 (775)
T ss_pred CCc--eeECCCCCEEEEecCceEEEEEccCCcee-c-c---cCCccchhhhheeeecCCccEEEEeecc
Confidence 445 99999999999988778999998888764 2 1 1222344456899999998877777754
No 108
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=83.19 E-value=3.6 Score=33.18 Aligned_cols=93 Identities=14% Similarity=0.078 Sum_probs=47.4
Q ss_pred ceEECCCC-cEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCC--ccc-c--ccee
Q psy4774 16 GMKFDKNG-ALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTK--YKL-Y--DGLF 89 (120)
Q Consensus 16 Gl~~d~~G-~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~--~~~-~--~~~~ 89 (120)
|..+.+++ .||-......|+++|.++++.+++... + ..++.--..+++.+++.++..-..+ +.. . .++.
T Consensus 85 g~~~s~~~~~~~Yv~~~~~l~~vdL~T~e~~~vy~~----p-~~~~g~gt~v~n~d~t~~~g~e~~~~d~~~l~~~~~f~ 159 (386)
T PF14583_consen 85 GGFLSPDDRALYYVKNGRSLRRVDLDTLEERVVYEV----P-DDWKGYGTWVANSDCTKLVGIEISREDWKPLTKWKGFR 159 (386)
T ss_dssp T-EE-TTSSEEEEEETTTEEEEEETTT--EEEEEE-------TTEEEEEEEEE-TTSSEEEEEEEEGGG-----SHHHHH
T ss_pred ceEEecCCCeEEEEECCCeEEEEECCcCcEEEEEEC----C-cccccccceeeCCCccEEEEEEEeehhccCccccHHHH
Confidence 55555665 554344556799999999888777642 1 1222112333466787775543211 111 1 1233
Q ss_pred cce-ecCCceEEEEcCCC--ceEecCc
Q psy4774 90 DGL-TSGSGSFIKIGPNT--QEVLQKQ 113 (120)
Q Consensus 90 ~~~-~~~~G~l~~~d~~~--~~~l~~~ 113 (120)
+.+ ++|..+|+++|.++ .++|.+.
T Consensus 160 e~~~a~p~~~i~~idl~tG~~~~v~~~ 186 (386)
T PF14583_consen 160 EFYEARPHCRIFTIDLKTGERKVVFED 186 (386)
T ss_dssp HHHHC---EEEEEEETTT--EEEEEEE
T ss_pred HHHhhCCCceEEEEECCCCceeEEEec
Confidence 444 78999999999888 7666543
No 109
>PRK12689 flgF flagellar basal body rod protein FlgF; Reviewed
Probab=83.03 E-value=12 Score=28.19 Aligned_cols=65 Identities=22% Similarity=0.308 Sum_probs=38.5
Q ss_pred CCCccceEECCCCcEEEEEC----C--CCEEEEECCCCcEEEEecc-ccccCCCCCcCCC---ceEEcCCCCEEEEe
Q psy4774 11 CGRPLGMKFDKNGALHVADA----Y--FGLYKVNVTTGQTEQLISM-DTEIDGAKPQIPN---SVTVDSDGMVYWSD 77 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~----~--~gi~~vd~~~g~~~~~~~~-~~~~~g~~~~~pn---dl~v~~~G~iy~TD 77 (120)
-++|+-+|+..+|=+-|-+. + .|-+++|. +|.+..--+. .....| |+..|. ++.|++||+|+..+
T Consensus 81 Tg~~lDlAI~G~GFF~V~~~~G~~yTR~G~F~~d~-~G~Lvt~~G~~vlg~~g-pI~lp~~~~~i~I~~dG~I~~~~ 155 (253)
T PRK12689 81 TKNPLDVAIDGDAFLAVQTPQGERYTRDGALEINA-QGQLVTSDGYPVLGTGG-PITFQPTDTGIAISPDGTVSVNE 155 (253)
T ss_pred CCCceeEEECCCcEEEEEeCCCcEEEeCCceEECC-CCCEEcCCCCEeecCCC-CeEeCCCCCcEEECCCCeEEEec
Confidence 47899999988874434321 1 35667776 4654322110 001123 666663 79999999997643
No 110
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=83.00 E-value=18 Score=27.90 Aligned_cols=68 Identities=13% Similarity=0.139 Sum_probs=43.2
Q ss_pred CCCcEEEEECC----------CC-EEEEECCCCcEEEEeccccccCCCCCc--CCCceEEcCCCCEEEEeCCCCcccccc
Q psy4774 21 KNGALHVADAY----------FG-LYKVNVTTGQTEQLISMDTEIDGAKPQ--IPNSVTVDSDGMVYWSDSSTKYKLYDG 87 (120)
Q Consensus 21 ~~G~l~V~d~~----------~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~--~pndl~v~~~G~iy~TDs~~~~~~~~~ 87 (120)
.++.|.|-|.. .+ ++++|+++.+++.+..... ...++. ..-.+-.-++|+++++...
T Consensus 210 ~~~~IslFDN~~~~~~~~~~s~~~v~~ld~~~~~~~~~~~~~~--~~~~~~s~~~G~~Q~L~nGn~li~~g~-------- 279 (299)
T PF14269_consen 210 DDGTISLFDNANSDFNGTEPSRGLVLELDPETMTVTLVREYSD--HPDGFYSPSQGSAQRLPNGNVLIGWGN-------- 279 (299)
T ss_pred CCCEEEEEcCCCCCCCCCcCCCceEEEEECCCCEEEEEEEeec--CCCcccccCCCcceECCCCCEEEecCC--------
Confidence 56677777662 22 8999998666655543210 122232 2336666778999999986
Q ss_pred eecceecCCceEEEEcCCC
Q psy4774 88 LFDGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 88 ~~~~~~~~~G~l~~~d~~~ 106 (120)
.|++..|++++
T Consensus 280 --------~g~~~E~~~~G 290 (299)
T PF14269_consen 280 --------NGRISEFTPDG 290 (299)
T ss_pred --------CceEEEECCCC
Confidence 36777777766
No 111
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=82.90 E-value=2.7 Score=20.78 Aligned_cols=24 Identities=25% Similarity=0.429 Sum_probs=18.2
Q ss_pred CCCcEEEEECCCCEEEEECCCCcE
Q psy4774 21 KNGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 21 ~~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
.+|.+|++.....++.+|.++|+.
T Consensus 5 ~~~~v~~~~~~g~l~a~d~~~G~~ 28 (33)
T smart00564 5 SDGTVYVGSTDGTLYALDAKTGEI 28 (33)
T ss_pred ECCEEEEEcCCCEEEEEEcccCcE
Confidence 356788887656699999888874
No 112
>PRK02888 nitrous-oxide reductase; Validated
Probab=82.84 E-value=6.5 Score=33.70 Aligned_cols=21 Identities=14% Similarity=-0.005 Sum_probs=16.4
Q ss_pred cCCCCccceEECCCC-cEEEEE
Q psy4774 9 QQCGRPLGMKFDKNG-ALHVAD 29 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G-~l~V~d 29 (120)
...++|.+++++++| .+|++.
T Consensus 232 ~Vdgnpd~v~~spdGk~afvTs 253 (635)
T PRK02888 232 MVDGNLDNVDTDYDGKYAFSTC 253 (635)
T ss_pred EeCCCcccceECCCCCEEEEec
Confidence 456899999999998 556763
No 113
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=82.71 E-value=8.7 Score=29.92 Aligned_cols=58 Identities=19% Similarity=0.217 Sum_probs=35.6
Q ss_pred EECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 18 KFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 18 ~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
.++.+|++|+......|+.+|++++++. +.. ...+ .....+.=.+..+|+|||++...
T Consensus 64 ~~~~dg~v~~~~~~G~i~A~d~~~g~~~-W~~---~~~~-~~~~~~~~~~~~~G~i~~g~~~g 121 (370)
T COG1520 64 PADGDGTVYVGTRDGNIFALNPDTGLVK-WSY---PLLG-AVAQLSGPILGSDGKIYVGSWDG 121 (370)
T ss_pred cEeeCCeEEEecCCCcEEEEeCCCCcEE-ecc---cCcC-cceeccCceEEeCCeEEEecccc
Confidence 3777899999833334999999888632 211 1111 12234444444488999988864
No 114
>PRK12636 flgG flagellar basal body rod protein FlgG; Provisional
Probab=81.78 E-value=12 Score=28.34 Aligned_cols=67 Identities=27% Similarity=0.505 Sum_probs=40.7
Q ss_pred CCCccceEECCCCcEEEEEC------CCCEEEEECCCCcEEEEecc-ccccCCCCCcCC---CceEEcCCCCEEEEeC
Q psy4774 11 CGRPLGMKFDKNGALHVADA------YFGLYKVNVTTGQTEQLISM-DTEIDGAKPQIP---NSVTVDSDGMVYWSDS 78 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~------~~gi~~vd~~~g~~~~~~~~-~~~~~g~~~~~p---ndl~v~~~G~iy~TDs 78 (120)
-++|+-+|+..+|-+-|-.. ..|-+++|. +|.+..--+. .....+.|+..| ..+.|+++|.|+.+..
T Consensus 88 Tg~~lDlAI~G~GfF~V~~~~g~~YTR~G~F~~d~-~G~Lvt~~G~~vlg~~g~pI~~p~~~~~~~i~~dG~I~~~~~ 164 (263)
T PRK12636 88 TGRPLDLAISGDGFFRVGDGDNTAYTRAGNFYLDN-EGNIVNADGLYLLGMNGGRIQIPTDAQSFSIGADGTVSYVDA 164 (263)
T ss_pred CCCceeEEEcCCcEEEEEeCCCCEEEeCCCeEECC-CCCEEcCCCCEeecCCCCceEeCCCCceEEECCCCeEEEEec
Confidence 47899999988875544331 135777776 4664321110 001222467676 3799999999987643
No 115
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=81.36 E-value=8.5 Score=32.80 Aligned_cols=62 Identities=18% Similarity=0.208 Sum_probs=42.7
Q ss_pred cceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCC
Q psy4774 15 LGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTK 81 (120)
Q Consensus 15 ~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~ 81 (120)
+-+.-|++|++||... +|+++...+...+..-. .....++...|.+..|.+|++|+++...+
T Consensus 250 ~ll~qD~qG~lWiGTe-nGl~r~~l~rq~Lq~~~----~~~~l~~S~vnsL~~D~dGsLWv~t~~gi 311 (671)
T COG3292 250 LLLVQDAQGELWIGTE-NGLWRTRLPRQGLQIPL----SKMHLGVSTVNSLWLDTDGSLWVGTYGGI 311 (671)
T ss_pred eeeecccCCCEEEeec-ccceeEecCCCCccccc----cccCCccccccceeeccCCCEeeeccCce
Confidence 3467788899998743 56666554433333221 22345677789999999999999999765
No 116
>KOG4649|consensus
Probab=81.05 E-value=15 Score=28.59 Aligned_cols=54 Identities=20% Similarity=0.244 Sum_probs=29.6
Q ss_pred ECCCCcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcC-CCCEEEEeCC
Q psy4774 19 FDKNGALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDS-DGMVYWSDSS 79 (120)
Q Consensus 19 ~d~~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~-~G~iy~TDs~ 79 (120)
.|.++-|.-|..+ ...+.+|+++ +--++-. +..|.-|..| ++++ +|.||++-..
T Consensus 101 ~d~~~glIycgshd~~~yalD~~~-~~cVyks---kcgG~~f~sP---~i~~g~~sly~a~t~ 156 (354)
T KOG4649|consen 101 CDFDGGLIYCGSHDGNFYALDPKT-YGCVYKS---KCGGGTFVSP---VIAPGDGSLYAAITA 156 (354)
T ss_pred EcCCCceEEEecCCCcEEEecccc-cceEEec---ccCCceeccc---eecCCCceEEEEecc
Confidence 4455444334443 4477777763 3344432 4555555433 5666 6788887765
No 117
>smart00284 OLF Olfactomedin-like domains.
Probab=80.76 E-value=19 Score=27.41 Aligned_cols=57 Identities=16% Similarity=0.290 Sum_probs=31.8
Q ss_pred CCcEEEEECC-CCEEEEECCCCcEEEEeccc-cccCCCCCcC----C--CceEEcCCC--CEEEEeCC
Q psy4774 22 NGALHVADAY-FGLYKVNVTTGQTEQLISMD-TEIDGAKPQI----P--NSVTVDSDG--MVYWSDSS 79 (120)
Q Consensus 22 ~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~-~~~~g~~~~~----p--ndl~v~~~G--~iy~TDs~ 79 (120)
+|.||.--.. ..|+|+|..++.+....... ..++. ++.. . =|+++|++| -||-|...
T Consensus 83 ngslYY~~~~s~~iiKydL~t~~v~~~~~Lp~a~y~~-~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~ 149 (255)
T smart00284 83 NGSLYFNKFNSHDICRFDLTTETYQKEPLLNGAGYNN-RFPYAWGGFSDIDLAVDENGLWVIYATEQN 149 (255)
T ss_pred CceEEEEecCCccEEEEECCCCcEEEEEecCcccccc-ccccccCCCccEEEEEcCCceEEEEeccCC
Confidence 4778774333 56999999887764332110 01111 2211 1 288888888 57777543
No 118
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=80.59 E-value=12 Score=32.77 Aligned_cols=23 Identities=13% Similarity=0.219 Sum_probs=19.9
Q ss_pred CCcEEEEECCCCEEEEECCCCcE
Q psy4774 22 NGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 22 ~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
+|.||+|.....|+.+|.++|+.
T Consensus 194 gg~lYv~t~~~~V~ALDa~TGk~ 216 (764)
T TIGR03074 194 GDTLYLCTPHNKVIALDAATGKE 216 (764)
T ss_pred CCEEEEECCCCeEEEEECCCCcE
Confidence 57999998877799999999973
No 119
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=79.90 E-value=14 Score=28.66 Aligned_cols=23 Identities=9% Similarity=0.167 Sum_probs=18.4
Q ss_pred CCcEEEEECCCCEEEEECCCCcE
Q psy4774 22 NGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 22 ~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
++.+|++.....++.+|+++|+.
T Consensus 241 ~~~vy~~~~~g~l~a~d~~tG~~ 263 (377)
T TIGR03300 241 GGQVYAVSYQGRVAALDLRSGRV 263 (377)
T ss_pred CCEEEEEEcCCEEEEEECCCCcE
Confidence 56888887666799999988864
No 120
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=79.44 E-value=20 Score=28.92 Aligned_cols=54 Identities=26% Similarity=0.381 Sum_probs=25.4
Q ss_pred EECCCC-cEEEE-EC--CCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CE-EEEeC
Q psy4774 18 KFDKNG-ALHVA-DA--YFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MV-YWSDS 78 (120)
Q Consensus 18 ~~d~~G-~l~V~-d~--~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~i-y~TDs 78 (120)
.|.++| +|+.+ +. ..+++.+|.++++.+++.. +...+.. +..++++. .+ ||-+.
T Consensus 42 ~ft~dG~kllF~s~~dg~~nly~lDL~t~~i~QLTd------g~g~~~~-g~~~s~~~~~~~Yv~~~ 101 (386)
T PF14583_consen 42 CFTDDGRKLLFASDFDGNRNLYLLDLATGEITQLTD------GPGDNTF-GGFLSPDDRALYYVKNG 101 (386)
T ss_dssp -B-TTS-EEEEEE-TTSS-EEEEEETTT-EEEE---------SS-B-TT-T-EE-TTSSEEEEEETT
T ss_pred CcCCCCCEEEEEeccCCCcceEEEEcccCEEEECcc------CCCCCcc-ceEEecCCCeEEEEECC
Confidence 345677 44443 43 2459999999999999863 2222223 45555654 44 45554
No 121
>PRK01742 tolB translocation protein TolB; Provisional
Probab=78.99 E-value=31 Score=27.52 Aligned_cols=55 Identities=13% Similarity=0.209 Sum_probs=33.2
Q ss_pred ccceEECCCCcEEE-EECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 14 PLGMKFDKNGALHV-ADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 14 P~Gl~~d~~G~l~V-~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
-...++.|+|+.++ +... ..|+.+|.++|+.+.+.. ..+. -..++++|||. |+++
T Consensus 206 v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~~----~~g~----~~~~~wSPDG~~La~~ 265 (429)
T PRK01742 206 LMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKVVAS----FRGH----NGAPAFSPDGSRLAFA 265 (429)
T ss_pred cccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEEEec----CCCc----cCceeECCCCCEEEEE
Confidence 46789999996443 2221 238889988887766643 2221 12467777774 5554
No 122
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=78.97 E-value=27 Score=28.59 Aligned_cols=37 Identities=14% Similarity=0.310 Sum_probs=26.0
Q ss_pred CccceEECC-CCcEEEEECC-CCEEEEECCCCcEEEEec
Q psy4774 13 RPLGMKFDK-NGALHVADAY-FGLYKVNVTTGQTEQLIS 49 (120)
Q Consensus 13 ~P~Gl~~d~-~G~l~V~d~~-~gi~~vd~~~g~~~~~~~ 49 (120)
+-++|.+++ +++|+|+.-. ..|++||.+++++.-+++
T Consensus 272 H~Nsi~yd~~dd~iivSsR~~s~V~~Id~~t~~i~Wilg 310 (477)
T PF05935_consen 272 HINSIDYDPSDDSIIVSSRHQSAVIKIDYRTGKIKWILG 310 (477)
T ss_dssp -EEEEEEETTTTEEEEEETTT-EEEEEE-TTS-EEEEES
T ss_pred ccCccEEeCCCCeEEEEcCcceEEEEEECCCCcEEEEeC
Confidence 467899998 5677777655 469999988899876665
No 123
>PRK12818 flgG flagellar basal body rod protein FlgG; Reviewed
Probab=78.75 E-value=11 Score=28.51 Aligned_cols=64 Identities=16% Similarity=0.242 Sum_probs=35.9
Q ss_pred CCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEecc-ccccC-----CCCCcCC-CceEEcCCCCEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISM-DTEID-----GAKPQIP-NSVTVDSDGMVYW 75 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~-~~~~~-----g~~~~~p-ndl~v~~~G~iy~ 75 (120)
-++|+-+|+..+|=+-|.+. + .|-+++|. +|.+..--+. ..... +.|+..| .++.|+++|+|+.
T Consensus 89 Tg~~lDlAI~G~GFF~V~~~~G~~~YTR~G~F~~d~-~G~Lvt~~G~~vlg~~~~~~~~~pI~lp~~~i~i~~dG~i~~ 166 (256)
T PRK12818 89 TDKPTDFAIQGRGFFTVERNAGNNYYTRDGHFHVDT-QGYLVNDSGYYVLGRNNATGAREPINVGNGKFSTDADGNISL 166 (256)
T ss_pred cCCcceEEECCCceEEEEcCCCCeEEeeCCCeeECC-CCCEEcCCCCEEeccccccCCcCCeEECCCCceECCCCeEEE
Confidence 47889999988774444332 1 34566775 4654221110 00110 1255444 3899999999976
No 124
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=78.16 E-value=15 Score=29.89 Aligned_cols=34 Identities=15% Similarity=0.250 Sum_probs=21.2
Q ss_pred ccceEECCCCc-EEEEECCC---CEEEEECCCCcEEEE
Q psy4774 14 PLGMKFDKNGA-LHVADAYF---GLYKVNVTTGQTEQL 47 (120)
Q Consensus 14 P~Gl~~d~~G~-l~V~d~~~---gi~~vd~~~g~~~~~ 47 (120)
-...+|-+||+ |.+|-... .|+.+|..+++...+
T Consensus 240 ~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~L 277 (425)
T COG0823 240 NGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRL 277 (425)
T ss_pred cCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceec
Confidence 33567888884 44454433 399999876665554
No 125
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=77.69 E-value=2.7 Score=35.70 Aligned_cols=59 Identities=25% Similarity=0.336 Sum_probs=40.9
Q ss_pred CccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCC-cCCCceEEcCCCCEEEEeCC
Q psy4774 13 RPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKP-QIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~-~~pndl~v~~~G~iy~TDs~ 79 (120)
+-..|.+|.+|+|||+ ...|++++|+.+|+.-.+.. .|+ ...|-++.|-.|.+||.--.
T Consensus 166 ~V~aLv~D~~g~lWvg-T~dGL~~fd~~~gkalql~s-------~~~dk~I~al~~d~qg~LWVGTdq 225 (671)
T COG3292 166 PVVALVFDANGRLWVG-TPDGLSYFDAGRGKALQLAS-------PPLDKAINALIADVQGRLWVGTDQ 225 (671)
T ss_pred cceeeeeeccCcEEEe-cCCcceEEccccceEEEcCC-------CcchhhHHHHHHHhcCcEEEEecc
Confidence 3456899999999988 34789999998887655542 233 44566677777777764443
No 126
>PRK12641 flgF flagellar basal body rod protein FlgF; Reviewed
Probab=77.13 E-value=27 Score=26.31 Aligned_cols=63 Identities=25% Similarity=0.475 Sum_probs=39.1
Q ss_pred CCCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEecccccc--CCCCCcCCC--ceEEcCCCCEEEE
Q psy4774 10 QCGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISMDTEI--DGAKPQIPN--SVTVDSDGMVYWS 76 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~~~~~--~g~~~~~pn--dl~v~~~G~iy~T 76 (120)
.-++|+-+|++.+|=+-|.+. + .|-+++|. +|.+. .. +..+ ++.|+..|. .+.|++||.|++.
T Consensus 71 ~Tg~~lDlAI~G~GFF~V~~~~G~~~YTR~G~F~~d~-~G~L~--~~-G~~Vl~~~gpI~lp~~~~i~I~~dG~I~~~ 144 (252)
T PRK12641 71 YTGRNLDLFIKDNGWLTIKDTNGQEAYTKNGHLKINS-KRKLT--VQ-NNEVIGNNGNIIIPKNINLKISSNGVITSI 144 (252)
T ss_pred eCCCceeEEEcCCcEEEEEcCCCCeEEeeCCCeeECC-CCCEE--eC-CcEeccCCCceEcCCCCcEEECCCceEEEE
Confidence 348899999988774444321 1 35677776 47663 21 1111 122666663 8999999999876
No 127
>KOG1446|consensus
Probab=76.33 E-value=15 Score=28.74 Aligned_cols=62 Identities=13% Similarity=0.137 Sum_probs=37.6
Q ss_pred ccceEECCCCcEEEEECCC-CEEEEECCC---CcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeCCC
Q psy4774 14 PLGMKFDKNGALHVADAYF-GLYKVNVTT---GQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDSST 80 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~~-gi~~vd~~~---g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs~~ 80 (120)
+--.|+||+|-++++-... .|.-+|... |--+.+ .+........++|-++++| .|.+|...+
T Consensus 143 ~pi~AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf-----~i~~~~~~ew~~l~FS~dGK~iLlsT~~s 209 (311)
T KOG1446|consen 143 RPIAAFDPEGLIFALANGSELIKLYDLRSFDKGPFTTF-----SITDNDEAEWTDLEFSPDGKSILLSTNAS 209 (311)
T ss_pred CcceeECCCCcEEEEecCCCeEEEEEecccCCCCceeE-----ccCCCCccceeeeEEcCCCCEEEEEeCCC
Confidence 4457999999888876654 454455431 222222 2232345667899999999 455555543
No 128
>KOG0291|consensus
Probab=76.08 E-value=35 Score=30.17 Aligned_cols=56 Identities=14% Similarity=0.194 Sum_probs=36.3
Q ss_pred CccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEE
Q psy4774 13 RPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYW 75 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~ 75 (120)
|-.++.|++||+=.++-.++.|.++|.+..+..++-- + .+.. .-.++++|+|++.+
T Consensus 16 r~Gnl~ft~dG~sviSPvGNrvsv~dLknN~S~Tl~~---e-~~~N---I~~ialSp~g~lll 71 (893)
T KOG0291|consen 16 RAGNLVFTKDGNSVISPVGNRVSVFDLKNNKSYTLPL---E-TRYN---ITRIALSPDGTLLL 71 (893)
T ss_pred ecCcEEECCCCCEEEeccCCEEEEEEccCCcceeEEe---e-cCCc---eEEEEeCCCceEEE
Confidence 3457999999987778788889999987655444311 1 1112 24677777776543
No 129
>COG4787 FlgF Flagellar basal body rod protein [Cell motility and secretion]
Probab=75.94 E-value=31 Score=25.94 Aligned_cols=70 Identities=29% Similarity=0.392 Sum_probs=44.8
Q ss_pred cCCCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEeccccccCCCCCcCCC--ceEEcCCCCEEEEeCC
Q psy4774 9 QQCGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPN--SVTVDSDGMVYWSDSS 79 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pn--dl~v~~~G~iy~TDs~ 79 (120)
...|||+-++++.||-|-|-+. + +|-++|++ +|+.+.=-... -.+|.|+..|- -|.|..||.|-.-...
T Consensus 72 ~~TgR~LDvaiq~DGwlaVq~~dG~EaYTRnG~~qI~a-~g~lTiqg~pV-iG~ggpI~vPp~~~v~I~~DGtIsa~~~g 149 (251)
T COG4787 72 DYTGRPLDVAIQGDGWLAVQDADGSEAYTRNGNIQIDA-TGQLTIQGHPV-IGEGGPITVPPGAKVTIAADGTISALNPG 149 (251)
T ss_pred cccCCcceEEEccCceEEEEcCCCcchheecCceEECc-ccceecCCCee-ecCCCccccCCCceEEEecCceEEeccCC
Confidence 3459999999999997777542 1 47899998 46653211100 01344555554 5777889998876665
Q ss_pred C
Q psy4774 80 T 80 (120)
Q Consensus 80 ~ 80 (120)
.
T Consensus 150 ~ 150 (251)
T COG4787 150 D 150 (251)
T ss_pred C
Confidence 3
No 130
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=75.93 E-value=3.8 Score=21.42 Aligned_cols=22 Identities=27% Similarity=0.365 Sum_probs=14.5
Q ss_pred CcEEEEECCCCEEEEECCCCcE
Q psy4774 23 GALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 23 G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
|.+|++.....|+-+|.++|++
T Consensus 1 ~~v~~~~~~g~l~AlD~~TG~~ 22 (38)
T PF01011_consen 1 GRVYVGTPDGYLYALDAKTGKV 22 (38)
T ss_dssp TEEEEETTTSEEEEEETTTTSE
T ss_pred CEEEEeCCCCEEEEEECCCCCE
Confidence 4577773323388888888874
No 131
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=75.13 E-value=30 Score=28.82 Aligned_cols=29 Identities=17% Similarity=0.295 Sum_probs=22.4
Q ss_pred cceEECCCCcEEEEECCCCEEEEECCCCcE
Q psy4774 15 LGMKFDKNGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 15 ~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
.|+++. ++++|++.....|+.+|.++|++
T Consensus 114 rg~av~-~~~v~v~t~dg~l~ALDa~TGk~ 142 (527)
T TIGR03075 114 RGVALY-DGKVFFGTLDARLVALDAKTGKV 142 (527)
T ss_pred ccceEE-CCEEEEEcCCCEEEEEECCCCCE
Confidence 455665 56899887666799999999985
No 132
>PRK12819 flgG flagellar basal body rod protein FlgG; Reviewed
Probab=74.75 E-value=23 Score=26.68 Aligned_cols=65 Identities=17% Similarity=0.254 Sum_probs=37.2
Q ss_pred CCCccceEECC----CCcEEEEEC-------CCCEEEEECCCCcEEEEecc-ccccCCCCCcCCC--ceEEcCCCCEEEE
Q psy4774 11 CGRPLGMKFDK----NGALHVADA-------YFGLYKVNVTTGQTEQLISM-DTEIDGAKPQIPN--SVTVDSDGMVYWS 76 (120)
Q Consensus 11 ~g~P~Gl~~d~----~G~l~V~d~-------~~gi~~vd~~~g~~~~~~~~-~~~~~g~~~~~pn--dl~v~~~G~iy~T 76 (120)
-++|+-+|+.. .+-+|+... ..|-+++|. +|.+..--+. .....|.|+..|. ++.|++||.|+..
T Consensus 83 Tg~~lDlAI~G~~~~g~gFf~v~~~G~~~yTR~G~F~~d~-~G~Lvt~~G~~vlg~~g~pI~lp~~~~v~I~~dG~I~~~ 161 (257)
T PRK12819 83 TNSDTDFFLDDGPAGTSSFFVTSKNGETFLTRDGSFTLNS-DRYLQTASGAFVMGENNERIRIPEGAKVAVQADGTLYDA 161 (257)
T ss_pred cCCcccEEEecCcCCCCEEEEEcCCCCeeEeeCCCeeECC-CCCEEcCCCCEEecCCCCceEeCCCCcEEEcCCCEEEEE
Confidence 47899999974 112344321 135677776 4664322110 0012343666663 7999999999875
No 133
>KOG0278|consensus
Probab=74.70 E-value=24 Score=27.37 Aligned_cols=55 Identities=22% Similarity=0.212 Sum_probs=36.3
Q ss_pred ECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 19 FDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 19 ~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
..|+-.+|||..... ++++|-.+|+-... +.-.-+.-.+-|-|+|+|.+|-+-|.
T Consensus 232 L~P~k~~fVaGged~~~~kfDy~TgeEi~~------~nkgh~gpVhcVrFSPdGE~yAsGSE 287 (334)
T KOG0278|consen 232 LHPKKEFFVAGGEDFKVYKFDYNTGEEIGS------YNKGHFGPVHCVRFSPDGELYASGSE 287 (334)
T ss_pred ccCCCceEEecCcceEEEEEeccCCceeee------cccCCCCceEEEEECCCCceeeccCC
Confidence 445567888865433 88888777752222 11123444578999999999998885
No 134
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=74.05 E-value=22 Score=27.50 Aligned_cols=28 Identities=21% Similarity=0.209 Sum_probs=20.5
Q ss_pred ceEECCCCcEEEEECCCCEEEEECCCCcE
Q psy4774 16 GMKFDKNGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
+..++ ++.+|+......|+.+|+++|++
T Consensus 60 ~p~v~-~~~v~v~~~~g~v~a~d~~tG~~ 87 (377)
T TIGR03300 60 QPAVA-GGKVYAADADGTVVALDAETGKR 87 (377)
T ss_pred ceEEE-CCEEEEECCCCeEEEEEccCCcE
Confidence 44555 56888887655699999888974
No 135
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=73.79 E-value=29 Score=27.59 Aligned_cols=59 Identities=20% Similarity=0.295 Sum_probs=34.3
Q ss_pred CCccc---eEECC-CCcEEEEEC-C-C--------CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC--CEEE
Q psy4774 12 GRPLG---MKFDK-NGALHVADA-Y-F--------GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG--MVYW 75 (120)
Q Consensus 12 g~P~G---l~~d~-~G~l~V~d~-~-~--------gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G--~iy~ 75 (120)
=||-| +++++ .++|||.-. + . -||.+|.+++++-.-... .. + .+.|.|+.+. .||.
T Consensus 235 WrPGG~Q~~A~~~~~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l----~~-~---~~Si~Vsqd~~P~L~~ 306 (342)
T PF06433_consen 235 WRPGGWQLIAYHAASGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPL----EH-P---IDSIAVSQDDKPLLYA 306 (342)
T ss_dssp EEE-SSS-EEEETTTTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEE----EE-E---ESEEEEESSSS-EEEE
T ss_pred cCCcceeeeeeccccCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEEeC----CC-c---cceEEEccCCCcEEEE
Confidence 35666 67765 469998631 1 1 199999998864332221 11 1 3578888765 4655
Q ss_pred EeC
Q psy4774 76 SDS 78 (120)
Q Consensus 76 TDs 78 (120)
++.
T Consensus 307 ~~~ 309 (342)
T PF06433_consen 307 LSA 309 (342)
T ss_dssp EET
T ss_pred EcC
Confidence 555
No 136
>PF15416 DUF4623: Domain of unknown function (DUF4623)
Probab=72.42 E-value=29 Score=27.95 Aligned_cols=70 Identities=20% Similarity=0.270 Sum_probs=45.2
Q ss_pred CCCccceEECCCCcEEEEECCC------CEEEEECCCCcEEEEecccc-ccCCC--CCcCCCceEEcCCC--CEEEEeCC
Q psy4774 11 CGRPLGMKFDKNGALHVADAYF------GLYKVNVTTGQTEQLISMDT-EIDGA--KPQIPNSVTVDSDG--MVYWSDSS 79 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~~------gi~~vd~~~g~~~~~~~~~~-~~~g~--~~~~pndl~v~~~G--~iy~TDs~ 79 (120)
+..|+-|+...+|++|++.-.. -|+..+..+.+.+++++..- .++|. +-.---.+.+|.+| -+||.|..
T Consensus 182 GTf~yNmgAl~nGH~Y~asLSG~~~SPLKiY~w~tPts~PevIa~inV~~I~gAg~RhGDn~S~nlD~nGnGyiFFgdna 261 (442)
T PF15416_consen 182 GTFSYNMGALVNGHSYLASLSGGKASPLKIYYWETPTSAPEVIADINVGDIPGAGNRHGDNFSLNLDENGNGYIFFGDNA 261 (442)
T ss_pred cccccchhhhcCCeEEEEeccCCCCCceEEEEecCCCCCceEEEeeeeccCcccccccCcceeEEeccCCceEEEecCCc
Confidence 4567778888899999996432 27777755788899987421 23332 21111256677764 68999886
Q ss_pred C
Q psy4774 80 T 80 (120)
Q Consensus 80 ~ 80 (120)
.
T Consensus 262 a 262 (442)
T PF15416_consen 262 A 262 (442)
T ss_pred c
Confidence 4
No 137
>PRK02889 tolB translocation protein TolB; Provisional
Probab=71.96 E-value=49 Score=26.40 Aligned_cols=55 Identities=24% Similarity=0.361 Sum_probs=32.8
Q ss_pred ccceEECCCCc-EEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 14 PLGMKFDKNGA-LHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 14 P~Gl~~d~~G~-l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
....++.|||+ |+++-.. ..|+.+|.++++.+.+.. ..+ ......+++||. |+|+
T Consensus 242 ~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~----~~~----~~~~~~wSpDG~~l~f~ 301 (427)
T PRK02889 242 NSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLRRLTQ----SSG----IDTEPFFSPDGRSIYFT 301 (427)
T ss_pred ccceEECCCCCEEEEEEccCCCceEEEEECCCCCcEECCC----CCC----CCcCeEEcCCCCEEEEE
Confidence 34688999995 4443222 349999987776665532 111 123567888885 5554
No 138
>PF11161 DUF2944: Protein of unknown function (DUF2946); InterPro: IPR021332 This family of proteins with unknown function appear to be restricted to Proteobacteria.
Probab=71.67 E-value=23 Score=25.78 Aligned_cols=52 Identities=15% Similarity=0.225 Sum_probs=35.8
Q ss_pred cEEEE-ECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 24 ALHVA-DAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 24 ~l~V~-d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
++||- +..-=|+|+.++.+...... -.|.++..+..+..|++|+||+.-+..
T Consensus 76 RVYV~Le~tP~v~Rl~~~~~~~~l~t-----hTg~~~~~~~~~~lDe~G~l~l~t~~g 128 (187)
T PF11161_consen 76 RVYVELEYTPWVWRLQPEGGDLGLVT-----HTGAPFEAPRACWLDEQGRLYLATPLG 128 (187)
T ss_pred EEEEEeccCceEEEeccCCCCCceee-----cCCCcccchhheeECCCCCEEEecCCc
Confidence 67775 33333888887433333332 257788889999999999999986643
No 139
>PRK12694 flgG flagellar basal body rod protein FlgG; Reviewed
Probab=71.60 E-value=36 Score=25.63 Aligned_cols=61 Identities=21% Similarity=0.268 Sum_probs=35.5
Q ss_pred CCCccceEECCCCcEEEEECC--------CCEEEEECCCCcEEEEeccccccCCCCCcCC---CceEEcCCCCEEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADAY--------FGLYKVNVTTGQTEQLISMDTEIDGAKPQIP---NSVTVDSDGMVYWS 76 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~--------~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~p---ndl~v~~~G~iy~T 76 (120)
-++|+-+|++.+|=+ +.... .|-+++|. +|.+..--+ ...-+ |+..| .++.|+++|.|+..
T Consensus 89 Tg~~lD~AI~G~GfF-~V~~~~G~~~yTR~G~F~~d~-~G~Lvt~~G--~~Vl~-pI~vp~~~~~~~I~~dG~I~~~ 160 (260)
T PRK12694 89 TGNSKDVAINGQGFF-QVLMPDGTTAYTRDGSFQTNA-QGQLVTSSG--YPLQP-AITIPQNATSLTIGKDGTVSVT 160 (260)
T ss_pred CCCcceEEEcCCcEE-EEEcCCCCeEEeeCCCceECC-CCCEECCCC--CEecc-ceecCCCcceeEECCCCeEEEe
Confidence 478889999877743 44322 24555665 355322110 11112 45556 37999999999874
No 140
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=70.25 E-value=32 Score=23.59 Aligned_cols=60 Identities=20% Similarity=0.202 Sum_probs=36.5
Q ss_pred CccceEECCCCcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 13 RPLGMKFDKNGALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
....+++++++.++++... ..|..++.++++..... .+ .-..+..+.++++|..+++-+.
T Consensus 221 ~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~------~~-~~~~i~~~~~~~~~~~l~~~~~ 281 (289)
T cd00200 221 GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL------SG-HTNSVTSLAWSPDGKRLASGSA 281 (289)
T ss_pred ceEEEEEcCCCcEEEEEcCCCcEEEEEcCCceeEEEc------cc-cCCcEEEEEECCCCCEEEEecC
Confidence 5667888888877777553 33666676544432222 11 1124578888888877776554
No 141
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=70.20 E-value=17 Score=27.40 Aligned_cols=58 Identities=29% Similarity=0.342 Sum_probs=37.4
Q ss_pred ceEECCCCcEEEE---ECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 16 GMKFDKNGALHVA---DAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 16 Gl~~d~~G~l~V~---d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
-+|+|..| |||. ....| |.++||++-+++.-... ... +-..-|.+.+- |.||++++..
T Consensus 127 D~AvDE~G-LWvIYat~~~~g~ivvskld~~tL~v~~tw~T--~~~--k~~~~naFmvC--GvLY~~~s~~ 190 (250)
T PF02191_consen 127 DFAVDENG-LWVIYATEDNNGNIVVSKLDPETLSVEQTWNT--SYP--KRSAGNAFMVC--GVLYATDSYD 190 (250)
T ss_pred EEEEcCCC-EEEEEecCCCCCcEEEEeeCcccCceEEEEEe--ccC--chhhcceeeEe--eEEEEEEECC
Confidence 58888776 6664 33344 77889987777655542 222 22334666666 8999999975
No 142
>TIGR02608 delta_60_rpt delta-60 repeat domain. This domain occurs in tandem repeats, as many as 13, in proteins from Bdellovibrio bacteriovorus, Azotobacter vinelandii, Geobacter sulfurreducens, Pirellula sp. 1, Myxococcus xanthus, and others, many of which are Deltaproteobacteria. The periodicity of the repeat ranges from about 57 to 61 amino acids, and a core region of about 54 is represented by this model and seed alignment.
Probab=70.12 E-value=7.1 Score=22.68 Aligned_cols=30 Identities=20% Similarity=0.272 Sum_probs=20.8
Q ss_pred ccceEECCCCcEEEEECC-C-------CEEEEECCCCcE
Q psy4774 14 PLGMKFDKNGALHVADAY-F-------GLYKVNVTTGQT 44 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~-~-------gi~~vd~~~g~~ 44 (120)
-..+++.+||+|+|+-.. . .|+|+++ +|.+
T Consensus 3 ~~~~~~q~DGkIlv~G~~~~~~~~~~~~l~Rln~-DGsL 40 (55)
T TIGR02608 3 AYAVAVQSDGKILVAGYVDNSSGNNDFVLARLNA-DGSL 40 (55)
T ss_pred eEEEEECCCCcEEEEEEeecCCCcccEEEEEECC-CCCc
Confidence 457899999999998532 1 1777776 3553
No 143
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes and uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=68.90 E-value=24 Score=27.91 Aligned_cols=66 Identities=23% Similarity=0.361 Sum_probs=42.8
Q ss_pred CCCCccceEECC------CCcEEEEECCCC-EEEEECCCCcE-EEEeccccccCCCCC--------cCCCceEEcCCCCE
Q psy4774 10 QCGRPLGMKFDK------NGALHVADAYFG-LYKVNVTTGQT-EQLISMDTEIDGAKP--------QIPNSVTVDSDGMV 73 (120)
Q Consensus 10 ~~g~P~Gl~~d~------~G~l~V~d~~~g-i~~vd~~~g~~-~~~~~~~~~~~g~~~--------~~pndl~v~~~G~i 73 (120)
...-|-||++.| .|.|+|-..+.| |-.+|+.+|+. -.+. ...|.|+ .+.|+..-.+...|
T Consensus 242 ~LNaPWG~a~APa~FG~~sg~lLVGNFGDG~InaFD~~sG~~~g~L~----~~~G~pi~i~GLWgL~fGng~~~~~~ntL 317 (336)
T TIGR03118 242 RLNAPWGLAIAPESFGSLSGALLVGNFGDGTINAYDPQSGAQLGQLL----DPDNHPVKVDGLWSLTFGNGVSGGSANYL 317 (336)
T ss_pred cccCCceeeeChhhhCCCCCCeEEeecCCceeEEecCCCCceeeeec----CCCCCeEEecCeEEeeeCCCcCCCCcceE
Confidence 346688999976 469999998866 88899877753 3332 3345443 33344433444589
Q ss_pred EEEeCC
Q psy4774 74 YWSDSS 79 (120)
Q Consensus 74 y~TDs~ 79 (120)
|||--.
T Consensus 318 yFaAGp 323 (336)
T TIGR03118 318 YFTAGP 323 (336)
T ss_pred EEEeCC
Confidence 998664
No 144
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=67.25 E-value=54 Score=25.67 Aligned_cols=51 Identities=20% Similarity=0.183 Sum_probs=33.2
Q ss_pred CCccceEECCCCcEEEEECCCC--EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG--LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG 71 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g--i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G 71 (120)
+.-..|+++++|.++.+.+.+| +...|..+|++.... ++.-.-+|+..++|
T Consensus 217 ~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~tg~~~~~~---------~l~D~cGva~~~~~ 269 (305)
T PF07433_consen 217 GYIGSIAADRDGRLIAVTSPRGGRVAVWDAATGRLLGSV---------PLPDACGVAPTDDG 269 (305)
T ss_pred CceEEEEEeCCCCEEEEECCCCCEEEEEECCCCCEeecc---------ccCceeeeeecCCc
Confidence 4455799999998777666544 666777788654332 33344567777666
No 145
>PRK12640 flgF flagellar basal body rod protein FlgF; Reviewed
Probab=66.09 E-value=26 Score=26.32 Aligned_cols=64 Identities=27% Similarity=0.432 Sum_probs=38.6
Q ss_pred CCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEecc-ccccCCCCCcCC--CceEEcCCCCEEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISM-DTEIDGAKPQIP--NSVTVDSDGMVYWS 76 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~-~~~~~g~~~~~p--ndl~v~~~G~iy~T 76 (120)
.++|+-+|+..+|=+-|-+. + .|-+++|. +|.+....+. .....| |+..| ..+.|+++|.|+..
T Consensus 74 Tg~~lDlAI~G~GFF~V~~~~G~~~yTR~G~F~~d~-~G~Lvt~~G~~vlg~~g-pI~lp~~~~i~I~~dG~I~~~ 147 (246)
T PRK12640 74 TGRPLDVALQGDGWLAVQAPDGSEAYTRNGSLQVDA-NGQLRTANGLPVLGDGG-PIAVPPGAKITIGADGTISAL 147 (246)
T ss_pred cCCcceEEECCCcEEEEEcCCCCEEEEeCCCeeECC-CCCEEcCCCCCccCCCc-ceecCCCCCEEECCCCEEEEe
Confidence 48899999988874444321 1 35667776 4664332110 001122 66666 37999999999875
No 146
>KOG0291|consensus
Probab=65.79 E-value=28 Score=30.69 Aligned_cols=59 Identities=25% Similarity=0.304 Sum_probs=43.0
Q ss_pred CccceEECCCCcEEEEECC--CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 13 RPLGMKFDKNGALHVADAY--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
+-..+++|+.|.|.++... ..|+..+.++|++--+.. .+..| ..+|.++++|++..|-|
T Consensus 437 QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLs----GHEgP---Vs~l~f~~~~~~LaS~S 497 (893)
T KOG0291|consen 437 QFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILS----GHEGP---VSGLSFSPDGSLLASGS 497 (893)
T ss_pred eeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhc----CCCCc---ceeeEEccccCeEEecc
Confidence 3457999999998877544 348888888998654442 23335 36899999999888877
No 147
>PRK12692 flgG flagellar basal body rod protein FlgG; Reviewed
Probab=65.70 E-value=57 Score=24.70 Aligned_cols=58 Identities=28% Similarity=0.527 Sum_probs=34.9
Q ss_pred CCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEeccccccCCC----CCcCC---CceEEcCCCCEEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISMDTEIDGA----KPQIP---NSVTVDSDGMVYWS 76 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~~~~~~g~----~~~~p---ndl~v~~~G~iy~T 76 (120)
-++|+-+|+..+|=+-|-.. + .|-+++|. +|.+..- .|. ++..| ..+.|++||.|+..
T Consensus 89 Tg~~lD~AI~G~GFF~V~~~~G~~~yTR~G~F~~d~-~G~Lvt~-------~G~~Vl~~I~lp~~~~~i~I~~dG~I~~~ 160 (262)
T PRK12692 89 TGNQLDLAVNGRGYFQVTSPNGEIQYTRAGSFNKNA-AGQLVTM-------EGYAVDPAILIPQNTTQVTINESGQVFAK 160 (262)
T ss_pred CCCcceEEEcCCceEEEECCCCCeEEEeCCCceECC-CCCEEcC-------CCCCcccccccCCCCcceEECCCCEEEEe
Confidence 47888999987774444321 1 24555665 3553221 233 24455 38999999999764
No 148
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=65.65 E-value=40 Score=26.17 Aligned_cols=30 Identities=23% Similarity=0.311 Sum_probs=20.4
Q ss_pred ceEECCCCcEEEEECCCCEEEEECCCCcEE
Q psy4774 16 GMKFDKNGALHVADAYFGLYKVNVTTGQTE 45 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~ 45 (120)
+-.+..+|+||+.+....++.+|..+|+..
T Consensus 105 ~~~~~~~G~i~~g~~~g~~y~ld~~~G~~~ 134 (370)
T COG1520 105 GPILGSDGKIYVGSWDGKLYALDASTGTLV 134 (370)
T ss_pred CceEEeCCeEEEecccceEEEEECCCCcEE
Confidence 333344889998866445888898678643
No 149
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=64.25 E-value=44 Score=22.90 Aligned_cols=60 Identities=18% Similarity=0.173 Sum_probs=35.3
Q ss_pred CCccceEECCCCcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeC
Q psy4774 12 GRPLGMKFDKNGALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDS 78 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs 78 (120)
.....+++++++.++++... ..|..+|..+++...... ... .....+.+.++| .++++..
T Consensus 136 ~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~----~~~---~~i~~~~~~~~~~~l~~~~~ 197 (289)
T cd00200 136 DWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLT----GHT---GEVNSVAFSPDGEKLLSSSS 197 (289)
T ss_pred CcEEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeEe----cCc---cccceEEECCCcCEEEEecC
Confidence 45678999998888777553 347777776554322221 111 135677888877 3444443
No 150
>KOG4378|consensus
Probab=63.62 E-value=63 Score=27.43 Aligned_cols=60 Identities=12% Similarity=0.123 Sum_probs=37.7
Q ss_pred CCCccceEECCCCcEEEEECC--CCEEEEECCCCc-EEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 11 CGRPLGMKFDKNGALHVADAY--FGLYKVNVTTGQ-TEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~--~gi~~vd~~~g~-~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
..-..||.|.|...++++..+ ..|+.+|....+ ...++ + ..|| .-|+|.++|.+.+.-++
T Consensus 208 sAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~s~~~l~-----y-~~Pl---stvaf~~~G~~L~aG~s 270 (673)
T KOG4378|consen 208 SAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQASTDRLT-----Y-SHPL---STVAFSECGTYLCAGNS 270 (673)
T ss_pred cCCcCcceecCCccceEEEecccceEEEeecccccccceee-----e-cCCc---ceeeecCCceEEEeecC
Confidence 445679999999855554433 448888864222 12232 1 2365 48899999987777665
No 151
>PF02393 US22: US22 like; InterPro: IPR003360 Herpesviruses are large and complex DNA viruses, widely found in nature. Human cytomegalovirus (HCMV), an important human pathogen, defines the betaherpesvirus family. Mouse cytomegalovirus (MCMV) and rat cytomegalovirus serve as biological model systems for HCMV. HCMV, MCMV, and rat CMV display the largest genomes among the herpesviruses and are essentially co-linear over the central 180 kb of the 230-kb genomes. Betaherpesviruses, which include the CMVs as well as human herpesviruses 6 and 7, differ from alpha- and gammaherpesviruses by the presence of additional gene families such as the US22 gene family, which are mainly clustered at the ends of the genome. The US22 family was first described in HCMV. This gene family comprises 12 members in both HCMV and MCMV and 11 in rat CMV []. Members of the US22 gene family are characterised by stretches of hydrophobic and charged residues as well as up to four conserved sequence motifs which are specific for betaherpesviruses. Motif I differs between the HCMV US and UL family members []. Motifs I and II have consensus sequences, while motifs III and IV are less well defined but have stretches of non-polar residues [, ]. Members of this gene family are widely divergent in function and their involvement in viral replication []. This entry contains US22 family members from the Cytomegalovirus, Muromegalovirus and the Roseolovirus taxonomic groups. The name sake of this family US22 is an early nuclear protein that is secreted from cells []. The US22 family may have a role in virus replication and pathogenesis [].
Probab=63.39 E-value=6 Score=25.91 Aligned_cols=25 Identities=16% Similarity=0.240 Sum_probs=19.8
Q ss_pred ecCCceEEEEcCCC--ceEecCccccc
Q psy4774 93 TSGSGSFIKIGPNT--QEVLQKQTDKL 117 (120)
Q Consensus 93 ~~~~G~l~~~d~~~--~~~l~~~L~~~ 117 (120)
-...|+||.||+.. +-.|+++|.+|
T Consensus 87 ~~~~G~Vy~yd~~~~~l~~lA~~l~~F 113 (125)
T PF02393_consen 87 VGESGRVYAYDPEDDRLYRLADSLEEF 113 (125)
T ss_pred EeCCCeEEEEEcCCCEEEEEeCCHHHH
Confidence 34668999999876 88888888776
No 152
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=62.75 E-value=12 Score=19.50 Aligned_cols=24 Identities=17% Similarity=0.228 Sum_probs=16.6
Q ss_pred ceEECCCCcEEEEECCCCEEEEECC
Q psy4774 16 GMKFDKNGALHVADAYFGLYKVNVT 40 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~vd~~ 40 (120)
+++++ +|.+||.+....|+.+|++
T Consensus 16 ~~~v~-~g~vyv~~~dg~l~ald~~ 39 (40)
T PF13570_consen 16 SPAVA-GGRVYVGTGDGNLYALDAA 39 (40)
T ss_dssp --EEC-TSEEEEE-TTSEEEEEETT
T ss_pred CCEEE-CCEEEEEcCCCEEEEEeCC
Confidence 45676 6799998876669989875
No 153
>TIGR03506 FlgEFG_subfam fagellar hook-basal body proteins. This model encompasses three closely related flagellar proteins usually denoted FlgE, FlgF and FlgG. The names have often been mis-assigned, however. Three equivalog models, TIGR02489, TIGR02490 and TIGR00488, respectively, separate the individual forms into three genome-context consistent groups. The major differences between these genes are architectural, with variable central sections between relatively conserved N- and C-terminal domains. More distantly related are two other flagellar apparatus familis, FlgC (TIGR01395) which consists of little else but the N-and C-terminal domains and FlgK (TIGR02492) with a substantial but different central domain.
Probab=62.01 E-value=61 Score=23.86 Aligned_cols=66 Identities=21% Similarity=0.309 Sum_probs=38.0
Q ss_pred CCCccceEECCCCcEEEEEC------CCCEEEEECCCCcEEEEecc-ccccCC-CCCcCC-C--ceEEcCCCCEEEEe
Q psy4774 11 CGRPLGMKFDKNGALHVADA------YFGLYKVNVTTGQTEQLISM-DTEIDG-AKPQIP-N--SVTVDSDGMVYWSD 77 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~------~~gi~~vd~~~g~~~~~~~~-~~~~~g-~~~~~p-n--dl~v~~~G~iy~TD 77 (120)
.++|+-+|++.+|-+-|-.. ..|-+++|.+ |.+..-.+. .....+ .|+..| . ++.|+++|.|+...
T Consensus 74 Tg~~lD~AI~G~GfF~V~~~~G~~yTR~G~F~~d~~-G~Lvt~~G~~vl~~~~~gpI~~~~~~~~~~i~~dG~i~~~~ 150 (231)
T TIGR03506 74 TGNSLDLAISGDGFFVVQGPDGTAYTRAGNFQLDAD-GYLVNPDGYYLLAWTGSGPVTVPPDGASVSIGSDGTVSATY 150 (231)
T ss_pred CCCeeeEEEcCCeeEEEEcCCCCEEEcCcceeECCC-CCEECCCCCEeccCCccCCEEECCCCceEEECCCcEEEEEe
Confidence 47899999988774444321 1357777764 654211110 001111 356666 3 69999999998753
No 154
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=61.81 E-value=54 Score=23.30 Aligned_cols=59 Identities=14% Similarity=0.148 Sum_probs=38.8
Q ss_pred CCccceEECCCCcEEEEE-CC--CC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVAD-AY--FG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d-~~--~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
...+.|.+.|+|+++++- .. .| |...|.+ +.+.+... + -.....++.+|+|+.++|-++
T Consensus 101 ~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~--~~~~i~~~----~---~~~~t~~~WsPdGr~~~ta~t 163 (194)
T PF08662_consen 101 QPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVR--KKKKISTF----E---HSDATDVEWSPDGRYLATATT 163 (194)
T ss_pred CCceEEEECCCCCEEEEEEccCCCcEEEEEECC--CCEEeecc----c---cCcEEEEEEcCCCCEEEEEEe
Confidence 345679999999887753 22 34 5566765 33445421 1 123589999999998887665
No 155
>KOG0282|consensus
Probab=61.78 E-value=14 Score=30.61 Aligned_cols=63 Identities=14% Similarity=0.209 Sum_probs=48.5
Q ss_pred CCCCccceEECCCC-cEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 10 QCGRPLGMKFDKNG-ALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G-~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
..-.|.-+.+.|++ ++++|....+ |...|..+|+ ++. ++ +.-+...|+|.|-++|.=++|-|-
T Consensus 298 ~~~~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~k---vvq---eY-d~hLg~i~~i~F~~~g~rFissSD 362 (503)
T KOG0282|consen 298 LDKVPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGK---VVQ---EY-DRHLGAILDITFVDEGRRFISSSD 362 (503)
T ss_pred cCCCceeeecCCCCCcEEEEecCCCcEEEEeccchH---HHH---HH-HhhhhheeeeEEccCCceEeeecc
Confidence 34568899999999 9999977754 9999988887 332 34 346777899999999988887663
No 156
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=61.75 E-value=54 Score=28.52 Aligned_cols=23 Identities=17% Similarity=0.328 Sum_probs=19.8
Q ss_pred CCcEEEEECCCCEEEEECCCCcE
Q psy4774 22 NGALHVADAYFGLYKVNVTTGQT 44 (120)
Q Consensus 22 ~G~l~V~d~~~gi~~vd~~~g~~ 44 (120)
++.||||-.++.++-+|.++|+.
T Consensus 214 gdtlYvcTphn~v~ALDa~TGke 236 (773)
T COG4993 214 GDTLYVCTPHNRVFALDAATGKE 236 (773)
T ss_pred CCEEEEecCcceeEEeeccCCce
Confidence 36899999988899999999873
No 157
>PTZ00421 coronin; Provisional
Probab=61.74 E-value=92 Score=25.79 Aligned_cols=60 Identities=13% Similarity=0.119 Sum_probs=37.8
Q ss_pred CccceEECCCC-cEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 13 RPLGMKFDKNG-ALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 13 ~P~Gl~~d~~G-~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
....++|.+++ +++++-...+ |...|.++++...... .+. ...+.+++.++|.+.+|-+.
T Consensus 127 ~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~----~h~---~~V~sla~spdG~lLatgs~ 188 (493)
T PTZ00421 127 KVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIK----CHS---DQITSLEWNLDGSLLCTTSK 188 (493)
T ss_pred cEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEc----CCC---CceEEEEEECCCCEEEEecC
Confidence 34578999875 6776655555 5556877665432221 111 23578888899888877665
No 158
>KOG2110|consensus
Probab=61.51 E-value=52 Score=26.55 Aligned_cols=62 Identities=23% Similarity=0.303 Sum_probs=39.2
Q ss_pred CCccceEECCCCcEEEEECCCC-EEEE-ECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG-LYKV-NVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g-i~~v-d~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
+--..|+|+++|.++..-...| |+|| ...+|+.-.-.. -|......-.|+|++++...-+-|
T Consensus 174 ~~lAalafs~~G~llATASeKGTVIRVf~v~~G~kl~eFR-----RG~~~~~IySL~Fs~ds~~L~~sS 237 (391)
T KOG2110|consen 174 GPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQKLYEFR-----RGTYPVSIYSLSFSPDSQFLAASS 237 (391)
T ss_pred CceeEEEECCCCCEEEEeccCceEEEEEEcCCccEeeeee-----CCceeeEEEEEEECCCCCeEEEec
Confidence 4456799999999998766666 6665 444565432221 233334456899999987443333
No 159
>KOG0640|consensus
Probab=60.79 E-value=44 Score=26.72 Aligned_cols=64 Identities=19% Similarity=0.150 Sum_probs=41.8
Q ss_pred CccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 13 RPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
.-..|.|.|.|..+.+...+- +..+|.++-+.-+-++ .+.+--...+.+--++.|++|+|-|.-
T Consensus 218 ~vrsiSfHPsGefllvgTdHp~~rlYdv~T~Qcfvsan----Pd~qht~ai~~V~Ys~t~~lYvTaSkD 282 (430)
T KOG0640|consen 218 PVRSISFHPSGEFLLVGTDHPTLRLYDVNTYQCFVSAN----PDDQHTGAITQVRYSSTGSLYVTASKD 282 (430)
T ss_pred eeeeEeecCCCceEEEecCCCceeEEeccceeEeeecC----cccccccceeEEEecCCccEEEEeccC
Confidence 345799999997766544333 4445665555443333 222233457899999999999999963
No 160
>KOG0318|consensus
Probab=60.52 E-value=1.1e+02 Score=26.12 Aligned_cols=31 Identities=13% Similarity=0.187 Sum_probs=22.6
Q ss_pred cCCCCccceEECCCCcEEEEECCCCEEEEEC
Q psy4774 9 QQCGRPLGMKFDKNGALHVADAYFGLYKVNV 39 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~ 39 (120)
..+.+|.|++..++|.+.|.-...+|+.+.-
T Consensus 403 ~lg~QP~~lav~~d~~~avv~~~~~iv~l~~ 433 (603)
T KOG0318|consen 403 KLGSQPKGLAVLSDGGTAVVACISDIVLLQD 433 (603)
T ss_pred ecCCCceeEEEcCCCCEEEEEecCcEEEEec
Confidence 6678999999999986655444456666653
No 161
>PRK12693 flgG flagellar basal body rod protein FlgG; Provisional
Probab=58.51 E-value=76 Score=23.80 Aligned_cols=58 Identities=29% Similarity=0.483 Sum_probs=35.5
Q ss_pred CCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEeccccccCCC----CCcCC---CceEEcCCCCEEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISMDTEIDGA----KPQIP---NSVTVDSDGMVYWS 76 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~~~~~~g~----~~~~p---ndl~v~~~G~iy~T 76 (120)
-++|+-+|+..+|=+-|... + .|-+++|. +|.+..- +|. ++..| ..+.|+++|.|+..
T Consensus 89 T~~~lD~Ai~G~GfF~v~~~~G~~~yTR~G~F~~d~-~G~Lvt~-------~G~~vl~~I~~p~~~~~~~i~~dG~I~~~ 160 (261)
T PRK12693 89 TGNSLDVAIEGQGFFQVQLPDGTIAYTRDGSFKLDQ-DGQLVTS-------GGYPLQPEITIPENATSITIGTDGTVSVT 160 (261)
T ss_pred CCCcceEEECCCcEEEEEcCCCCeEEeeCCCeeECC-CCCEECC-------CCCEEeeecccCCCCceEEECCCCeEEEe
Confidence 47899999988774444321 1 24566676 3553211 232 24455 37999999999774
No 162
>PRK12817 flgG flagellar basal body rod protein FlgG; Reviewed
Probab=58.27 E-value=78 Score=23.84 Aligned_cols=66 Identities=23% Similarity=0.287 Sum_probs=37.2
Q ss_pred CCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEecc-ccc--cCC-----CCCcCC-CceEEcCCCCEE
Q psy4774 11 CGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISM-DTE--IDG-----AKPQIP-NSVTVDSDGMVY 74 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~-~~~--~~g-----~~~~~p-ndl~v~~~G~iy 74 (120)
-++|+-+|+..+|=+-|... + .|-+++|. +|.+..--+. ... ..| .++..| .++.|+++|.|+
T Consensus 85 Tg~~lD~Ai~G~GfF~V~~~~G~~~yTR~G~F~~d~-~G~Lvt~~G~~vl~~~~~g~~~~~~~i~~~~~~~~i~~dG~i~ 163 (260)
T PRK12817 85 TGNKTDLAIDGEGFFRVIMADGTYAYTRAGNFNIDS-NGMLVDDNGNRLEIQLEAGVGINNRNTGFDSNNFTVDEDGGIS 163 (260)
T ss_pred cCCcceEEECCCcEEEEEcCCCCeEEEeCCceeECC-CCCEEcCCCCEEEecccCcccccCCcccCCCCceEECCCCeEE
Confidence 47899999987774444321 1 24566775 3654321110 000 011 245445 489999999998
Q ss_pred EEe
Q psy4774 75 WSD 77 (120)
Q Consensus 75 ~TD 77 (120)
..+
T Consensus 164 ~~~ 166 (260)
T PRK12817 164 VKN 166 (260)
T ss_pred Eec
Confidence 755
No 163
>KOG0643|consensus
Probab=58.18 E-value=51 Score=25.77 Aligned_cols=62 Identities=16% Similarity=0.187 Sum_probs=43.5
Q ss_pred CCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
..+.-..+++-+.-+|+-...| |-++|.++|+. ++.. .+.+.. ..|++-++++.+.|+|-|.
T Consensus 148 skit~a~Wg~l~~~ii~Ghe~G~is~~da~~g~~--~v~s-~~~h~~---~Ind~q~s~d~T~FiT~s~ 210 (327)
T KOG0643|consen 148 SKITSALWGPLGETIIAGHEDGSISIYDARTGKE--LVDS-DEEHSS---KINDLQFSRDRTYFITGSK 210 (327)
T ss_pred cceeeeeecccCCEEEEecCCCcEEEEEcccCce--eeec-hhhhcc---ccccccccCCcceEEeccc
Confidence 4455667778887777765555 88899888753 2221 123333 5799999999999999885
No 164
>PRK12642 flgF flagellar basal body rod protein FlgF; Reviewed
Probab=58.01 E-value=65 Score=24.05 Aligned_cols=65 Identities=20% Similarity=0.346 Sum_probs=38.1
Q ss_pred CCCCccceEECCCCcEEEEECC------CCEEEEECCCCcEEEEecc-ccccCCCCCcCC---CceEEcCCCCEEE
Q psy4774 10 QCGRPLGMKFDKNGALHVADAY------FGLYKVNVTTGQTEQLISM-DTEIDGAKPQIP---NSVTVDSDGMVYW 75 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~~------~gi~~vd~~~g~~~~~~~~-~~~~~g~~~~~p---ndl~v~~~G~iy~ 75 (120)
..++|+-+|+..+|=+-|-... .|-+++|. +|.+....+. .....|.|+..| ..+.|+++|.|+.
T Consensus 72 ~Tg~~lDlAI~G~GFF~V~~~~g~~yTR~G~F~~d~-~G~Lvt~~G~~vl~~~g~~I~ip~~~~~~~i~~dG~i~~ 146 (241)
T PRK12642 72 KTGNPLDFAVKGDAWFSFDTPAGQVYTRDGRFTMTS-TGELVSVTGYPVLDAGGAPIQLNPGGGEPTIGADGAIYQ 146 (241)
T ss_pred eCCCcceEEECCCcEEEEEcCCCCEEEeCCCeeECC-CCCEECCCCCEecCCCCCceEeCCCCCCceEcCCceEEE
Confidence 3488999999888744443211 35677776 4664322110 001123366666 3789999999964
No 165
>KOG2106|consensus
Probab=57.11 E-value=54 Score=27.75 Aligned_cols=62 Identities=18% Similarity=0.297 Sum_probs=40.8
Q ss_pred CCCcc-ceEECCCCcEEEEECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 11 CGRPL-GMKFDKNGALHVADAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 11 ~g~P~-Gl~~d~~G~l~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
++.|+ -+++.++|..+..-...+ |++|+.. |..-..++ +..|.+ .--|..++|++...|.|.
T Consensus 446 d~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs~~-g~~y~r~~---k~~gs~---ithLDwS~Ds~~~~~~S~ 511 (626)
T KOG2106|consen 446 DNEQLSVVRYSPDGAFLAVGSHDNHIYIYRVSAN-GRKYSRVG---KCSGSP---ITHLDWSSDSQFLVSNSG 511 (626)
T ss_pred cCCceEEEEEcCCCCEEEEecCCCeEEEEEECCC-CcEEEEee---eecCce---eEEeeecCCCceEEeccC
Confidence 34444 588999998877655544 7788864 44433443 455633 457888888888888875
No 166
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=57.07 E-value=45 Score=27.35 Aligned_cols=56 Identities=18% Similarity=0.249 Sum_probs=30.5
Q ss_pred eEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 17 MKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 17 l~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
+...++|+|++... ..+.++|.. |++.... ..++....+-+|+...++|++.+.-.
T Consensus 153 ~~~l~nG~ll~~~~-~~~~e~D~~-G~v~~~~----~l~~~~~~~HHD~~~l~nGn~L~l~~ 208 (477)
T PF05935_consen 153 FKQLPNGNLLIGSG-NRLYEIDLL-GKVIWEY----DLPGGYYDFHHDIDELPNGNLLILAS 208 (477)
T ss_dssp EEE-TTS-EEEEEB-TEEEEE-TT---EEEEE----E--TTEE-B-S-EEE-TTS-EEEEEE
T ss_pred eeEcCCCCEEEecC-CceEEEcCC-CCEEEee----ecCCcccccccccEECCCCCEEEEEe
Confidence 67888999987755 669999984 7754443 23332234569999999998776555
No 167
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=56.61 E-value=29 Score=18.86 Aligned_cols=28 Identities=25% Similarity=0.306 Sum_probs=20.8
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVT 40 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~ 40 (120)
|...++++. .+.+||++...|+..+|..
T Consensus 2 G~a~~v~v~-g~yaYva~~~~Gl~IvDIS 29 (42)
T PF08309_consen 2 GDARDVAVS-GNYAYVADGNNGLVIVDIS 29 (42)
T ss_pred ceEEEEEEE-CCEEEEEeCCCCEEEEECC
Confidence 345566775 3588999988999998864
No 168
>PF11763 DIPSY: Cell-wall adhesin ligand-binding C-terminal; InterPro: IPR021746 The DIPSY domain is characterised by the distinctive D*I*PSY motif at the very C terminus of yeast cell-wall glycoproteins. It appears not to be conserved in any other species, however. In fungi, cell adhesion is required for flocculation, mating and virulence, and is mediated by covalently bound cell wall proteins termed adhesins. Map4, an adhesin required for mating in Schizosaccharomyces pombe, is N-glycosylated and O-glycosylated, and is an endogenous substrate for the mannosyl transferase Oma4p. Map4 has a modular structure with an N-terminal signal peptide, a serine and threonine (S/T)-rich domain that includes nine repeats of 36 amino acids (rich in serine and threonine residues, but lacking glutamines), and a C-terminal DIPSY domain with no glycosyl-phosphatidyl inositol (GPI)-anchor signal. The N-terminal S/T-rich regions, are required for cell wall attachment, but the C-terminal DIPSY domain is required for agglutination and mating in liquid and solid media [].
Probab=56.50 E-value=60 Score=21.96 Aligned_cols=25 Identities=16% Similarity=0.345 Sum_probs=19.5
Q ss_pred CCcCCCceEEcCCCCEEEEeCCCCc
Q psy4774 58 KPQIPNSVTVDSDGMVYWSDSSTKY 82 (120)
Q Consensus 58 ~~~~pndl~v~~~G~iy~TDs~~~~ 82 (120)
.+..|-.|.+..+|+||||-....+
T Consensus 80 ~~~ep~~l~~l~dgri~~ts~~~d~ 104 (123)
T PF11763_consen 80 SFSEPLDLHTLSDGRIWFTSNEYDF 104 (123)
T ss_pred CCCCcEEEEEecCCcEEEEcccccC
Confidence 4556779999999999999865443
No 169
>KOG1379|consensus
Probab=56.50 E-value=64 Score=25.57 Aligned_cols=62 Identities=23% Similarity=0.237 Sum_probs=39.5
Q ss_pred CCCccceEECC--CCcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeC
Q psy4774 11 CGRPLGMKFDK--NGALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDS 78 (120)
Q Consensus 11 ~g~P~Gl~~d~--~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs 78 (120)
++....|+.-. ++.|++++-+ .|...+. +|++ ++.+ ...--.||+|--|++-|.|. -|..|.
T Consensus 167 GSSTAcI~~l~~~~~~Lh~aNLGDSGF~VvR--~G~v-v~~S---~~Q~H~FN~PyQLs~~p~~~~~~~~d~ 232 (330)
T KOG1379|consen 167 GSSTACILALDRENGKLHTANLGDSGFLVVR--EGKV-VFRS---PEQQHYFNTPYQLSSPPEGYSSYISDV 232 (330)
T ss_pred CcceeeeeeeecCCCeEEEeeccCcceEEEE--CCEE-EEcC---chheeccCCceeeccCCccccccccCC
Confidence 33344455444 7899999876 5777775 3653 3332 23334699999999999874 455554
No 170
>PF05567 Neisseria_PilC: Neisseria PilC beta-propeller domain; InterPro: IPR008707 This domain is found in several PilC protein sequences from Neisseria gonorrhoeae and Neisseria meningitidis. PilC is a phase-variable protein associated with pilus-mediated adherence of pathogenic Neisseria to target cells [].; PDB: 3HX6_A.
Probab=56.35 E-value=21 Score=28.04 Aligned_cols=54 Identities=28% Similarity=0.410 Sum_probs=24.7
Q ss_pred EEEEECCC-CcEEEEeccccccCCCCCcCCCceEEcCCC---CEEEEeCCCCcccccceecceecCCceEEEEcCCC
Q psy4774 34 LYKVNVTT-GQTEQLISMDTEIDGAKPQIPNSVTVDSDG---MVYWSDSSTKYKLYDGLFDGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 34 i~~vd~~~-g~~~~~~~~~~~~~g~~~~~pndl~v~~~G---~iy~TDs~~~~~~~~~~~~~~~~~~G~l~~~d~~~ 106 (120)
|+.+|.++ |++..-.... ...+ -+..|.-+..+.|| .+|+.|-. |.|||+|...
T Consensus 183 lyi~d~~t~G~l~~~i~~~-~~~~-gl~~~~~~D~d~DG~~D~vYaGDl~-----------------GnlwR~dl~~ 240 (335)
T PF05567_consen 183 LYILDADTTGALIKKIDVP-GGSG-GLSSPAVVDSDGDGYVDRVYAGDLG-----------------GNLWRFDLSS 240 (335)
T ss_dssp EEEEETTT---EEEEEEE---STT--EEEEEEE-TTSSSEE-EEEEEETT-----------------SEEEEEE--T
T ss_pred EEEEECCCCCceEEEEecC-CCCc-cccccEEEeccCCCeEEEEEEEcCC-----------------CcEEEEECCC
Confidence 78888887 7643222210 0011 22333333334455 56777664 7888887653
No 171
>KOG0303|consensus
Probab=56.34 E-value=1.1e+02 Score=25.12 Aligned_cols=86 Identities=16% Similarity=0.221 Sum_probs=51.1
Q ss_pred ccceEECCCCcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcccccc-----
Q psy4774 14 PLGMKFDKNGALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKLYDG----- 87 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~~~~----- 87 (120)
-..+.|+.+|.++++-.. .-|..+||.+|++..-.. ...|. -|-...+-.+|.|+-|-.+ +...++.
T Consensus 176 i~S~sfn~dGs~l~TtckDKkvRv~dpr~~~~v~e~~---~heG~---k~~Raifl~~g~i~tTGfs-r~seRq~aLwdp 248 (472)
T KOG0303|consen 176 VYSMSFNRDGSLLCTTCKDKKVRVIDPRRGTVVSEGV---AHEGA---KPARAIFLASGKIFTTGFS-RMSERQIALWDP 248 (472)
T ss_pred EEEEEeccCCceeeeecccceeEEEcCCCCcEeeecc---cccCC---CcceeEEeccCceeeeccc-cccccceeccCc
Confidence 357889999999987665 458888998876433321 12232 2557777788885554443 3332221
Q ss_pred ---eec----ceecCCceEEE-EcCCC
Q psy4774 88 ---LFD----GLTSGSGSFIK-IGPNT 106 (120)
Q Consensus 88 ---~~~----~~~~~~G~l~~-~d~~~ 106 (120)
-.. -+...+|-|+- ||+++
T Consensus 249 ~nl~eP~~~~elDtSnGvl~PFyD~dt 275 (472)
T KOG0303|consen 249 NNLEEPIALQELDTSNGVLLPFYDPDT 275 (472)
T ss_pred ccccCcceeEEeccCCceEEeeecCCC
Confidence 111 12467787764 47777
No 172
>PRK12643 flgF flagellar basal body rod protein FlgF; Reviewed
Probab=56.19 E-value=39 Score=24.91 Aligned_cols=64 Identities=27% Similarity=0.432 Sum_probs=37.8
Q ss_pred CCCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEecc-ccccCCCCCcCC--CceEEcCCCCEEEE
Q psy4774 10 QCGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISM-DTEIDGAKPQIP--NSVTVDSDGMVYWS 76 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~-~~~~~g~~~~~p--ndl~v~~~G~iy~T 76 (120)
.-++|+-+|+..+|-+-|.+. + .|-+++|. +|.+ ..-+. .....| |+..| ..|.|++||+|+..
T Consensus 73 ~Tg~~lDlAI~G~GFF~V~~~~G~~~YTR~G~F~~d~-~G~L-t~~G~~Vlg~~g-pI~ip~~~~i~I~~dG~I~~~ 146 (209)
T PRK12643 73 FSGRPLDVALQQDGYLAVQLPDGSEAYTRNGNIQISA-NGQM-TVQGYPLMGDNG-PIDVPPQAAVTIAADGTISAL 146 (209)
T ss_pred eCCCceeEEECCCcEEEEEcCCCCeEEeeCCCceECC-CCCC-cCCCcCcccCCC-ceEcCCCCcEEECCCCeEEEe
Confidence 347899999988775545321 1 35666776 3654 22110 001112 56666 27999999999764
No 173
>PF13964 Kelch_6: Kelch motif
Probab=54.42 E-value=24 Score=19.09 Aligned_cols=27 Identities=26% Similarity=0.271 Sum_probs=20.7
Q ss_pred CCcEEEEECC-------CCEEEEECCCCcEEEEe
Q psy4774 22 NGALHVADAY-------FGLYKVNVTTGQTEQLI 48 (120)
Q Consensus 22 ~G~l~V~d~~-------~gi~~vd~~~g~~~~~~ 48 (120)
+++|||.-.. +.+.++|+++++++.+.
T Consensus 11 ~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~ 44 (50)
T PF13964_consen 11 GGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLP 44 (50)
T ss_pred CCEEEEECCCCCCCCccccEEEEcCCCCcEEECC
Confidence 5789997432 23899999999888875
No 174
>KOG0318|consensus
Probab=54.31 E-value=1.4e+02 Score=25.48 Aligned_cols=66 Identities=12% Similarity=0.131 Sum_probs=45.3
Q ss_pred CCCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 11 CGRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
..+-+.+++.|||+++++-...| |+.+|-++|+..-..+. .++ --...-.|.-+||+.=+.|-|.-
T Consensus 190 skFV~~VRysPDG~~Fat~gsDgki~iyDGktge~vg~l~~---~~a-HkGsIfalsWsPDs~~~~T~SaD 256 (603)
T KOG0318|consen 190 SKFVNCVRYSPDGSRFATAGSDGKIYIYDGKTGEKVGELED---SDA-HKGSIFALSWSPDSTQFLTVSAD 256 (603)
T ss_pred ccceeeEEECCCCCeEEEecCCccEEEEcCCCccEEEEecC---CCC-ccccEEEEEECCCCceEEEecCC
Confidence 45678999999999999866655 99999888865433321 011 01234478888888888887764
No 175
>KOG0771|consensus
Probab=54.15 E-value=46 Score=27.04 Aligned_cols=60 Identities=13% Similarity=0.133 Sum_probs=36.6
Q ss_pred CCccceEECCCCcEEEEECCCCEEEE-E-CCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKV-N-VTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~v-d-~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
++-.-+++..+|.++.+-...|.+|| + | .. .++... ..+ -...+||+|++||-+.+|=+.
T Consensus 145 g~~k~vaf~~~gs~latgg~dg~lRv~~~P-s~--~t~l~e--~~~---~~eV~DL~FS~dgk~lasig~ 206 (398)
T KOG0771|consen 145 GQQKVVAFNGDGSKLATGGTDGTLRVWEWP-SM--LTILEE--IAH---HAEVKDLDFSPDGKFLASIGA 206 (398)
T ss_pred CcceEEEEcCCCCEeeeccccceEEEEecC-cc--hhhhhh--Hhh---cCccccceeCCCCcEEEEecC
Confidence 33466888888888877666776665 4 3 22 233321 111 234689999999965555443
No 176
>PTZ00486 apyrase Superfamily; Provisional
Probab=54.15 E-value=1.1e+02 Score=24.45 Aligned_cols=52 Identities=19% Similarity=0.287 Sum_probs=31.5
Q ss_pred CCcEEEEECCCC-EEEEECCCCcEE--EEeccccccCC---CCCcCCCceEEcCCCCEEEEeC
Q psy4774 22 NGALHVADAYFG-LYKVNVTTGQTE--QLISMDTEIDG---AKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 22 ~G~l~V~d~~~g-i~~vd~~~g~~~--~~~~~~~~~~g---~~~~~pndl~v~~~G~iy~TDs 78 (120)
+|+||..|-..| |++++.+.+++- +++. ..+| ..|+ .-++.-.+..+|+--.
T Consensus 124 ngkLys~DDrTGiVy~i~~~~~~~~PwvIL~---dGdG~~~kGfK--~EWaTVKd~~LyVGs~ 181 (352)
T PTZ00486 124 NGKLYGFDDRTGIVYEIDIDKKKAYPRHILS---DGNGNSDKGMK--IEWATVYDDKLYVGSI 181 (352)
T ss_pred CCEEEEEeCCceEEEEEEcCCCcEeeEEEEe---cCCCCCCCCcc--eeeEEEECCEEEEecc
Confidence 579999998888 888875544433 2332 2233 2343 4566667777886433
No 177
>PRK01029 tolB translocation protein TolB; Provisional
Probab=54.10 E-value=78 Score=25.45 Aligned_cols=34 Identities=6% Similarity=0.090 Sum_probs=23.5
Q ss_pred ceEECCCCc---E-EEEECC--CCEEEEECCCCcEEEEec
Q psy4774 16 GMKFDKNGA---L-HVADAY--FGLYKVNVTTGQTEQLIS 49 (120)
Q Consensus 16 Gl~~d~~G~---l-~V~d~~--~gi~~vd~~~g~~~~~~~ 49 (120)
.-++.|||+ + |++... ..|+.++..+|+.+.+..
T Consensus 189 sP~wSPDG~~~~~~y~S~~~g~~~I~~~~l~~g~~~~lt~ 228 (428)
T PRK01029 189 TPTWMHIGSGFPYLYVSYKLGVPKIFLGSLENPAGKKILA 228 (428)
T ss_pred cceEccCCCceEEEEEEccCCCceEEEEECCCCCceEeec
Confidence 458899984 2 355432 349999998888877764
No 178
>KOG4649|consensus
Probab=53.25 E-value=47 Score=25.99 Aligned_cols=55 Identities=18% Similarity=0.216 Sum_probs=34.3
Q ss_pred EEEEECC-CCEEEEECCCCcE--EEEeccccccCCCCCcCCCceEEcC-CCCEEEEeCCCC
Q psy4774 25 LHVADAY-FGLYKVNVTTGQT--EQLISMDTEIDGAKPQIPNSVTVDS-DGMVYWSDSSTK 81 (120)
Q Consensus 25 l~V~d~~-~gi~~vd~~~g~~--~~~~~~~~~~~g~~~~~pndl~v~~-~G~iy~TDs~~~ 81 (120)
+.++..+ .-+.-||+++|++ +.+.+ ..+++.++.--|-+++.- +|-+||-+++..
T Consensus 25 ~v~igSHs~~~~avd~~sG~~~We~ilg--~RiE~sa~vvgdfVV~GCy~g~lYfl~~~tG 83 (354)
T KOG4649|consen 25 LVVIGSHSGIVIAVDPQSGNLIWEAILG--VRIECSAIVVGDFVVLGCYSGGLYFLCVKTG 83 (354)
T ss_pred EEEEecCCceEEEecCCCCcEEeehhhC--ceeeeeeEEECCEEEEEEccCcEEEEEecch
Confidence 3334444 4477799999974 34433 255666665444555554 799999999753
No 179
>PRK12816 flgG flagellar basal body rod protein FlgG; Reviewed
Probab=53.08 E-value=46 Score=25.22 Aligned_cols=58 Identities=26% Similarity=0.488 Sum_probs=35.4
Q ss_pred CCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEeccccccCCC----CCcCC-----CceEEcCCCCEE
Q psy4774 11 CGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISMDTEIDGA----KPQIP-----NSVTVDSDGMVY 74 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~~~~~~g~----~~~~p-----ndl~v~~~G~iy 74 (120)
-++|+-+|+..+|=+-|... + .|-+++|. +|.+..- +|. |+..| .++.|+++|.|+
T Consensus 89 Tg~~lDlAI~G~GFF~V~~~~G~~~YTR~G~F~~d~-~G~Lvt~-------~G~~vl~~I~lp~~~~~~~i~I~~dG~I~ 160 (264)
T PRK12816 89 TGNKLDVAIEGEGFFKILMPDGTYAYTRDGSFKIDA-NGQLVTS-------NGYRLLPEIIFPENYILNSITISEEGIVS 160 (264)
T ss_pred CCCcceEEECCCcEEEEEcCCCCeEEeeCCCeeECC-CCCEECC-------CCCEecceeecCCCcccccEEECCCCeEE
Confidence 47889999988774444321 1 24556665 3553211 232 24445 479999999998
Q ss_pred EE
Q psy4774 75 WS 76 (120)
Q Consensus 75 ~T 76 (120)
..
T Consensus 161 ~~ 162 (264)
T PRK12816 161 VK 162 (264)
T ss_pred Ee
Confidence 74
No 180
>KOG3881|consensus
Probab=52.77 E-value=75 Score=25.87 Aligned_cols=63 Identities=8% Similarity=0.081 Sum_probs=39.9
Q ss_pred CccceEECCC--CcEEEEECC-CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEEeCCCC
Q psy4774 13 RPLGMKFDKN--GALHVADAY-FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDSSTK 81 (120)
Q Consensus 13 ~P~Gl~~d~~--G~l~V~d~~-~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~TDs~~~ 81 (120)
.+.++.|-+. ..-+++-.. ..+..+|+.. +..+++.+ ... -+....+...++|+ ||++|+...
T Consensus 204 W~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~~-qRRPV~~f--d~~---E~~is~~~l~p~gn~Iy~gn~~g~ 270 (412)
T KOG3881|consen 204 WITDIRFLEGSPNYKFATITRYHQVRLYDTRH-QRRPVAQF--DFL---ENPISSTGLTPSGNFIYTGNTKGQ 270 (412)
T ss_pred eeccceecCCCCCceEEEEecceeEEEecCcc-cCcceeEe--ccc---cCcceeeeecCCCcEEEEecccch
Confidence 3567888876 566665444 4477788864 45566543 111 22345778888884 999999753
No 181
>KOG1274|consensus
Probab=52.77 E-value=93 Score=27.99 Aligned_cols=28 Identities=11% Similarity=0.175 Sum_probs=18.7
Q ss_pred ceEECCCCcEEEEECC-CCEEEEECCCCc
Q psy4774 16 GMKFDKNGALHVADAY-FGLYKVNVTTGQ 43 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~-~gi~~vd~~~g~ 43 (120)
-++++.+|++.++-.. ..|..++.+++.
T Consensus 101 ~~~v~g~g~~iaagsdD~~vK~~~~~D~s 129 (933)
T KOG1274|consen 101 DLAVSGSGKMIAAGSDDTAVKLLNLDDSS 129 (933)
T ss_pred EEEEecCCcEEEeecCceeEEEEeccccc
Confidence 5789989988877544 456666654443
No 182
>TIGR02488 flgG_G_neg flagellar basal-body rod protein FlgG, Gram-negative bacteria. This family consists of the FlgG protein of the flagellar apparatus in the Proteobacteria and spirochetes.
Probab=52.50 E-value=98 Score=23.23 Aligned_cols=58 Identities=34% Similarity=0.515 Sum_probs=35.6
Q ss_pred CCCccceEECCCCcEEEEEC-----C--CCEEEEECCCCcEEEEeccccccCCC----CCcCC---CceEEcCCCCEEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADA-----Y--FGLYKVNVTTGQTEQLISMDTEIDGA----KPQIP---NSVTVDSDGMVYWS 76 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~-----~--~gi~~vd~~~g~~~~~~~~~~~~~g~----~~~~p---ndl~v~~~G~iy~T 76 (120)
-++|+-+|+..+|=+-|-.. + .|-+++|. +|.+..- +|. |+..| ..+.|+++|.|+..
T Consensus 87 Tg~~lD~AI~G~GfF~V~~~~g~~~yTR~G~F~~d~-~G~Lvt~-------~G~~Vl~~I~lp~~~~~~~I~~dG~i~~~ 158 (259)
T TIGR02488 87 TGNDLDLAIEGEGFFQVLMPDGTTAYTRDGAFKINA-EGQLVTS-------NGYPLQPEITIPENATSITVGSDGEVSVR 158 (259)
T ss_pred cCCcceEEEcCCcEEEEEcCCCCeEEeeCCceEECC-CCCEECC-------CCCEecCceecCCCCceEEECCCCeEEEe
Confidence 47889999988774433321 1 24566776 4653221 232 24445 36999999999874
No 183
>PF06079 Apyrase: Apyrase; InterPro: IPR009283 This family consists of several eukaryotic apyrase (or adenosine diphosphatase) proteins (3.6.1.5 from EC), and related nucleoside diphosphatases (3.6.1.6 from EC). The salivary apyrases of blood-feeding arthropods are nucleotide hydrolysing enzymes implicated in the inhibition of host platelet aggregation through the hydrolysis of extracellular adenosine diphosphate [].; GO: 0005509 calcium ion binding, 0016462 pyrophosphatase activity; PDB: 2H2N_A 1S18_A 2H2U_A 1S1D_B.
Probab=51.03 E-value=90 Score=24.34 Aligned_cols=52 Identities=19% Similarity=0.272 Sum_probs=25.4
Q ss_pred CCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCC---CCcCCCceEEcCCCCEEEEeC
Q psy4774 22 NGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGA---KPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 22 ~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~---~~~~pndl~v~~~G~iy~TDs 78 (120)
+|+||..|-..| |++|..+ +.-+.+=. ...+|. .|+ .-.+--.|+.||+--.
T Consensus 63 ngkLys~DDrTGiVyeI~~~--~~vPwviL-~dGdG~~~kGfK--~EWaTVKd~~LyvGs~ 118 (291)
T PF06079_consen 63 NGKLYSFDDRTGIVYEIKGD--KAVPWVIL-SDGDGNTSKGFK--AEWATVKDDKLYVGSI 118 (291)
T ss_dssp TTEEEEEETTT-EEEEEETT--EEEEEEE--BSTTTTESSB------EEEEETTEEEEE--
T ss_pred CCEEeeeeCCCceEEEEeCC--ceeceEEE-eCCCCCcccccc--ceeeEEeCCeeeeccC
Confidence 578999988877 7788753 44333211 122332 222 2335555677775433
No 184
>KOG4441|consensus
Probab=50.73 E-value=76 Score=26.81 Aligned_cols=57 Identities=26% Similarity=0.328 Sum_probs=36.1
Q ss_pred ccceEECCCCcEEEEECCC-C------EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcC-CCCEEEEeCC
Q psy4774 14 PLGMKFDKNGALHVADAYF-G------LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDS-DGMVYWSDSS 79 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~~-g------i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~-~G~iy~TDs~ 79 (120)
-.|+++. +|.|||+-... | +.++|+.+++++.+++... .-.++.+.. +|.||+.--.
T Consensus 325 ~~~~~~~-~~~lYv~GG~~~~~~~l~~ve~YD~~~~~W~~~a~M~~--------~R~~~~v~~l~g~iYavGG~ 389 (571)
T KOG4441|consen 325 RVGVAVL-NGKLYVVGGYDSGSDRLSSVERYDPRTNQWTPVAPMNT--------KRSDFGVAVLDGKLYAVGGF 389 (571)
T ss_pred cccEEEE-CCEEEEEccccCCCcccceEEEecCCCCceeccCCccC--------ccccceeEEECCEEEEEecc
Confidence 3466666 46999995443 2 8889999888877764311 112444444 5788876554
No 185
>KOG1274|consensus
Probab=50.18 E-value=1.1e+02 Score=27.51 Aligned_cols=77 Identities=10% Similarity=0.094 Sum_probs=43.5
Q ss_pred CCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCC-CcCCCceEEcCCC-CEEEEeCCC---Ccccc
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAK-PQIPNSVTVDSDG-MVYWSDSST---KYKLY 85 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~-~~~pndl~v~~~G-~iy~TDs~~---~~~~~ 85 (120)
+.-++|.++|+|+++++..-.| |..++.+++.+.-......+..+.. -..++-++..|+| .+-+--... .|.+.
T Consensus 139 apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~r~ 218 (933)
T KOG1274|consen 139 APVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVYSRK 218 (933)
T ss_pred CceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEEccC
Confidence 4457999999999998877777 4445776665433322111111111 2346778889984 443332221 25555
Q ss_pred cce
Q psy4774 86 DGL 88 (120)
Q Consensus 86 ~~~ 88 (120)
.|.
T Consensus 219 ~we 221 (933)
T KOG1274|consen 219 GWE 221 (933)
T ss_pred Cce
Confidence 553
No 186
>PF14220 DUF4329: Domain of unknown function (DUF4329)
Probab=50.15 E-value=12 Score=25.41 Aligned_cols=14 Identities=29% Similarity=0.463 Sum_probs=13.0
Q ss_pred ecCCceEEEEcCCC
Q psy4774 93 TSGSGSFIKIGPNT 106 (120)
Q Consensus 93 ~~~~G~l~~~d~~~ 106 (120)
..|.||||+||+++
T Consensus 98 ~TP~Grl~~~~~~~ 111 (123)
T PF14220_consen 98 GTPGGRLWKYDPST 111 (123)
T ss_pred eCCCCcEEEEcCch
Confidence 67999999999988
No 187
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=49.87 E-value=33 Score=16.99 Aligned_cols=26 Identities=8% Similarity=0.163 Sum_probs=18.0
Q ss_pred CCCccceEECCCCcEEEEECCCCEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADAYFGLYK 36 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~~gi~~ 36 (120)
.+....|++.++++++++-...+.++
T Consensus 11 ~~~i~~i~~~~~~~~~~s~~~D~~i~ 36 (39)
T PF00400_consen 11 SSSINSIAWSPDGNFLASGSSDGTIR 36 (39)
T ss_dssp SSSEEEEEEETTSSEEEEEETTSEEE
T ss_pred CCcEEEEEEecccccceeeCCCCEEE
Confidence 35667889999988877755555333
No 188
>PHA03098 kelch-like protein; Provisional
Probab=49.61 E-value=1.4e+02 Score=24.39 Aligned_cols=72 Identities=13% Similarity=0.058 Sum_probs=41.5
Q ss_pred CCCcEEEEECC---------CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcccccceecc
Q psy4774 21 KNGALHVADAY---------FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKLYDGLFDG 91 (120)
Q Consensus 21 ~~G~l~V~d~~---------~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~~~~~~~~ 91 (120)
-++.|||+... ..++++|+++++++.+... . .+.... .+++ -+|.||+.-....
T Consensus 436 ~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~~W~~~~~~----~-~~r~~~-~~~~-~~~~iyv~GG~~~---------- 498 (534)
T PHA03098 436 HDGKIYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSL----N-FPRINA-SLCI-FNNKIYVVGGDKY---------- 498 (534)
T ss_pred ECCEEEEECCccCCCCCcccceEEEecCCCCceeeCCCC----C-cccccc-eEEE-ECCEEEEEcCCcC----------
Confidence 35789998532 1288899998888877531 1 111112 2222 2678998754321
Q ss_pred eecCCceEEEEcCCC--ceEe
Q psy4774 92 LTSGSGSFIKIGPNT--QEVL 110 (120)
Q Consensus 92 ~~~~~G~l~~~d~~~--~~~l 110 (120)
......+++||+++ .+.+
T Consensus 499 -~~~~~~v~~yd~~~~~W~~~ 518 (534)
T PHA03098 499 -EYYINEIEVYDDKTNTWTLF 518 (534)
T ss_pred -CcccceeEEEeCCCCEEEec
Confidence 01134788899887 5444
No 189
>KOG0272|consensus
Probab=48.29 E-value=39 Score=27.76 Aligned_cols=62 Identities=16% Similarity=0.222 Sum_probs=40.2
Q ss_pred CCccceEECCCCcEEEEECCCCEEEE-ECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKV-NVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~v-d~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
.|-.-.+|+|+|+.+.+-....-+|+ |.+++ .+.+.. +.+.+ ...++++.+||.|..|-..-
T Consensus 262 ~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk-~ElL~Q---EGHs~---~v~~iaf~~DGSL~~tGGlD 324 (459)
T KOG0272|consen 262 ARVSRVAFHPSGKFLGTASFDSTWRLWDLETK-SELLLQ---EGHSK---GVFSIAFQPDGSLAATGGLD 324 (459)
T ss_pred hhheeeeecCCCceeeecccccchhhcccccc-hhhHhh---ccccc---ccceeEecCCCceeeccCcc
Confidence 44556899999998887655444443 44443 344432 22222 35699999999999997753
No 190
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=48.29 E-value=83 Score=23.85 Aligned_cols=33 Identities=12% Similarity=0.104 Sum_probs=20.7
Q ss_pred ceEECCCCcEEEEECCCC--EEEEECCCCcEEEEec
Q psy4774 16 GMKFDKNGALHVADAYFG--LYKVNVTTGQTEQLIS 49 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~g--i~~vd~~~g~~~~~~~ 49 (120)
.|..- ++.|+|.|...+ +++++.+..+...++.
T Consensus 134 sl~~~-~~~I~vgD~~~sv~~~~~~~~~~~l~~va~ 168 (321)
T PF03178_consen 134 SLSVF-KNYILVGDAMKSVSLLRYDEENNKLILVAR 168 (321)
T ss_dssp EEEEE-TTEEEEEESSSSEEEEEEETTTE-EEEEEE
T ss_pred EEecc-ccEEEEEEcccCEEEEEEEccCCEEEEEEe
Confidence 34444 237888899887 5566875445777764
No 191
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=47.10 E-value=1.3e+02 Score=23.29 Aligned_cols=35 Identities=20% Similarity=0.219 Sum_probs=19.0
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECCCCcEEEE
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQL 47 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~ 47 (120)
.|-..|.|.+++.||+... .|.+++.......+.+
T Consensus 187 ~riq~~gf~~~~~lw~~~~-Gg~~~~s~~~~~~~~w 221 (302)
T PF14870_consen 187 RRIQSMGFSPDGNLWMLAR-GGQIQFSDDPDDGETW 221 (302)
T ss_dssp S-EEEEEE-TTS-EEEEET-TTEEEEEE-TTEEEEE
T ss_pred ceehhceecCCCCEEEEeC-CcEEEEccCCCCcccc
Confidence 4445788899999988763 4455554323344444
No 192
>cd05845 Ig2_L1-CAM_like Second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig2_L1-CAM_like: domain similar to the second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=46.87 E-value=18 Score=23.21 Aligned_cols=17 Identities=18% Similarity=0.585 Sum_probs=14.8
Q ss_pred CceEEcCCCCEEEEeCC
Q psy4774 63 NSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 63 ndl~v~~~G~iy~TDs~ 79 (120)
..+.++++|+|||++..
T Consensus 50 ~Ri~~~~~GnL~fs~v~ 66 (95)
T cd05845 50 ERVSMGQNGNLYFANVE 66 (95)
T ss_pred ccEEECCCceEEEEEEe
Confidence 46888899999999886
No 193
>PF12275 DUF3616: Protein of unknown function (DUF3616); InterPro: IPR022060 This family of proteins is found in bacteria. Proteins in this family are typically between 335 and 392 amino acids in length. There is a conserved GLRGPV sequence motif.
Probab=46.39 E-value=64 Score=25.50 Aligned_cols=66 Identities=20% Similarity=0.308 Sum_probs=35.3
Q ss_pred ccceEECCCCcEEEE-ECCCCEEEEECCC-CcEEEEec-----ccc--ccCC--CCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 14 PLGMKFDKNGALHVA-DAYFGLYKVNVTT-GQTEQLIS-----MDT--EIDG--AKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~-d~~~gi~~vd~~~-g~~~~~~~-----~~~--~~~g--~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
|.+...-++++|||+ |...++.|+-... +....+.. ... ...+ ....-.-+++. .+|.||++-|-+
T Consensus 2 ~Sa~~~~~d~~l~va~DE~~~i~rL~~~~~~~~~~~~~~~~~~l~~~~~lp~~~~~e~DiEGla~-~~gyly~igSHS 78 (330)
T PF12275_consen 2 PSAAVQLPDGRLWVASDETANIERLTLDDAGGEDRFGDHASFPLADFFDLPGPKDKEIDIEGLAY-ADGYLYVIGSHS 78 (330)
T ss_pred CccceEcCCCeEEEEecCCCCeeEEEecCCCcccccccccccccccccccCCCCCcccchhhhhc-cCCeEEEEccCc
Confidence 567788888999997 5556666653221 21111110 000 0000 11222346777 678999999964
No 194
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=46.17 E-value=1.4e+02 Score=23.12 Aligned_cols=58 Identities=19% Similarity=0.278 Sum_probs=32.7
Q ss_pred eEECCCCc-EEEEECCC-----CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEc-CCC--CEEEEe
Q psy4774 17 MKFDKNGA-LHVADAYF-----GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVD-SDG--MVYWSD 77 (120)
Q Consensus 17 l~~d~~G~-l~V~d~~~-----gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~-~~G--~iy~TD 77 (120)
+.+.++++ |++.-..+ -++.+|.++|+.+++.. +....-+.......+- ++| -||+++
T Consensus 189 v~W~~d~~~l~~~~~nR~q~~~~l~~~d~~tg~~~~~~~---e~~~~Wv~~~~~~~~~~~~~~~~l~~s~ 255 (353)
T PF00930_consen 189 VGWSPDGKRLWVQWLNRDQNRLDLVLCDASTGETRVVLE---ETSDGWVDVYDPPHFLGPDGNEFLWISE 255 (353)
T ss_dssp EEEEETTEEEEEEEEETTSTEEEEEEEEECTTTCEEEEE---EESSSSSSSSSEEEE-TTTSSEEEEEEE
T ss_pred ceecCCCcEEEEEEcccCCCEEEEEEEECCCCceeEEEE---ecCCcceeeecccccccCCCCEEEEEEE
Confidence 45556666 77764322 27788988888877764 3333334444556654 443 355555
No 195
>KOG0266|consensus
Probab=45.68 E-value=1.1e+02 Score=24.72 Aligned_cols=63 Identities=24% Similarity=0.298 Sum_probs=40.9
Q ss_pred CCccceEECCCCcEEEEECCCC-EEEEECCCCcEE---EEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTE---QLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~---~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
+.-.+++|.++|+++++..+.+ |...|..+++.. .+. .... +- ...-+.++++|...++-...
T Consensus 289 ~~is~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~----~~~~-~~-~~~~~~fsp~~~~ll~~~~d 355 (456)
T KOG0266|consen 289 DGISGLAFSPDGNLLVSASYDGTIRVWDLETGSKLCLKLLS----GAEN-SA-PVTSVQFSPNGKYLLSASLD 355 (456)
T ss_pred CceEEEEECCCCCEEEEcCCCccEEEEECCCCceeeeeccc----CCCC-CC-ceeEEEECCCCcEEEEecCC
Confidence 3456788999999988876666 555688877732 221 1111 11 34688888998777766643
No 196
>KOG2139|consensus
Probab=45.47 E-value=86 Score=25.55 Aligned_cols=57 Identities=19% Similarity=0.296 Sum_probs=38.5
Q ss_pred ceEECCCCcEEEEECC--CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 16 GMKFDKNGALHVADAY--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
.|+.++||..++..+. ..|..-|+++|+..++.. ..+....-|-.+|||..+|.-+-
T Consensus 200 smqwn~dgt~l~tAS~gsssi~iWdpdtg~~~pL~~-------~glgg~slLkwSPdgd~lfaAt~ 258 (445)
T KOG2139|consen 200 SMQWNEDGTILVTASFGSSSIMIWDPDTGQKIPLIP-------KGLGGFSLLKWSPDGDVLFAATC 258 (445)
T ss_pred EEEEcCCCCEEeecccCcceEEEEcCCCCCcccccc-------cCCCceeeEEEcCCCCEEEEecc
Confidence 5788888877776443 347777888887777742 11223446788999988877663
No 197
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=44.69 E-value=1.5e+02 Score=23.07 Aligned_cols=93 Identities=19% Similarity=0.244 Sum_probs=42.1
Q ss_pred CCCccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCC-----CCcCCCceEEcCCCCEEEE-eCCCC-cc
Q psy4774 11 CGRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGA-----KPQIPNSVTVDSDGMVYWS-DSSTK-YK 83 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~-----~~~~pndl~v~~~G~iy~T-Ds~~~-~~ 83 (120)
.+.|.++....++..+++.....|++=...+...+.+.. +..+. +..--.=|+++..|++|.| |+... |.
T Consensus 103 pgs~~~i~~l~~~~~~l~~~~G~iy~T~DgG~tW~~~~~---~~~gs~~~~~r~~dG~~vavs~~G~~~~s~~~G~~~w~ 179 (302)
T PF14870_consen 103 PGSPFGITALGDGSAELAGDRGAIYRTTDGGKTWQAVVS---ETSGSINDITRSSDGRYVAVSSRGNFYSSWDPGQTTWQ 179 (302)
T ss_dssp SS-EEEEEEEETTEEEEEETT--EEEESSTTSSEEEEE----S----EEEEEE-TTS-EEEEETTSSEEEEE-TT-SS-E
T ss_pred CCCeeEEEEcCCCcEEEEcCCCcEEEeCCCCCCeeEccc---CCcceeEeEEECCCCcEEEEECcccEEEEecCCCccce
Confidence 456777777666777766554446666543335555543 11111 1111112556666777766 44321 22
Q ss_pred cc-----cceecceecCCceEEEEcCCC
Q psy4774 84 LY-----DGLFDGLTSGSGSFIKIGPNT 106 (120)
Q Consensus 84 ~~-----~~~~~~~~~~~G~l~~~d~~~ 106 (120)
+. ..+..+-..++|.||.....+
T Consensus 180 ~~~r~~~~riq~~gf~~~~~lw~~~~Gg 207 (302)
T PF14870_consen 180 PHNRNSSRRIQSMGFSPDGNLWMLARGG 207 (302)
T ss_dssp EEE--SSS-EEEEEE-TTS-EEEEETTT
T ss_pred EEccCccceehhceecCCCCEEEEeCCc
Confidence 21 112222256788888888666
No 198
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=44.23 E-value=1.5e+02 Score=24.20 Aligned_cols=30 Identities=13% Similarity=0.137 Sum_probs=21.9
Q ss_pred ceEECCCCcEEEEECCCCEEEEECCCCcEE
Q psy4774 16 GMKFDKNGALHVADAYFGLYKVNVTTGQTE 45 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~ 45 (120)
|+++..++.+|+.+....|+.+|.++|+..
T Consensus 104 g~~~~~~~~V~v~~~~g~v~AlD~~TG~~~ 133 (488)
T cd00216 104 GVAYWDPRKVFFGTFDGRLVALDAETGKQV 133 (488)
T ss_pred CcEEccCCeEEEecCCCeEEEEECCCCCEe
Confidence 445543478998876566999999999743
No 199
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=43.95 E-value=94 Score=25.79 Aligned_cols=68 Identities=15% Similarity=0.206 Sum_probs=29.5
Q ss_pred CCccceEECCCC-cEEEEECCCC-EEEEECCCCc-EE----EEecccc------ccCCCCC-cCCCceEEcCCC-CEEEE
Q psy4774 12 GRPLGMKFDKNG-ALHVADAYFG-LYKVNVTTGQ-TE----QLISMDT------EIDGAKP-QIPNSVTVDSDG-MVYWS 76 (120)
Q Consensus 12 g~P~Gl~~d~~G-~l~V~d~~~g-i~~vd~~~g~-~~----~~~~~~~------~~~g~~~-~~pndl~v~~~G-~iy~T 76 (120)
+.+.-|.+..|+ -|||+.-..| |.++|..+-. .. +.++..- .+.|.++ ..|+=|-++.|| +||||
T Consensus 312 ~LitDI~iSlDDrfLYvs~W~~GdvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlYvT 391 (461)
T PF05694_consen 312 PLITDILISLDDRFLYVSNWLHGDVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLYVT 391 (461)
T ss_dssp -----EEE-TTS-EEEEEETTTTEEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEEEE
T ss_pred CceEeEEEccCCCEEEEEcccCCcEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEEEE
Confidence 445567777776 6799887777 7778754322 11 1211000 1123332 357889999999 89999
Q ss_pred eCC
Q psy4774 77 DSS 79 (120)
Q Consensus 77 Ds~ 79 (120)
.|-
T Consensus 392 nSL 394 (461)
T PF05694_consen 392 NSL 394 (461)
T ss_dssp ---
T ss_pred eec
Confidence 996
No 200
>PHA03219 nuclear protein UL24; Provisional
Probab=42.79 E-value=20 Score=28.06 Aligned_cols=24 Identities=8% Similarity=-0.032 Sum_probs=21.0
Q ss_pred cCCceEEEEcCCC--ceEecCccccc
Q psy4774 94 SGSGSFIKIGPNT--QEVLQKQTDKL 117 (120)
Q Consensus 94 ~~~G~l~~~d~~~--~~~l~~~L~~~ 117 (120)
.+.|+||.+|+.. +..|+|+|+.|
T Consensus 241 ~~~G~Vya~d~~~~~~~rvadsl~~f 266 (300)
T PHA03219 241 VYTGRVYACDLRNLRYIRLGDNLTAF 266 (300)
T ss_pred ccCceEEEEcCCCCeEEEEecCHHHH
Confidence 6899999999998 88889988765
No 201
>KOG0288|consensus
Probab=42.26 E-value=2e+02 Score=23.80 Aligned_cols=66 Identities=21% Similarity=0.208 Sum_probs=35.0
Q ss_pred CCCCccceEECCCC-cEEEEECCCCEEEEECCCCcEEEEeccccccCCCC-CcCCCceEEcCCCCEEEEeCC
Q psy4774 10 QCGRPLGMKFDKNG-ALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAK-PQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G-~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~-~~~pndl~v~~~G~iy~TDs~ 79 (120)
.+|+--.+....+| .|+.|.-...+-.+|..+.++...+. ..|.. -.-.+.+.|+|+|....+-|.
T Consensus 340 ~gg~vtSl~ls~~g~~lLsssRDdtl~viDlRt~eI~~~~s----A~g~k~asDwtrvvfSpd~~YvaAGS~ 407 (459)
T KOG0288|consen 340 LGGRVTSLDLSMDGLELLSSSRDDTLKVIDLRTKEIRQTFS----AEGFKCASDWTRVVFSPDGSYVAAGSA 407 (459)
T ss_pred cCcceeeEeeccCCeEEeeecCCCceeeeecccccEEEEee----ccccccccccceeEECCCCceeeeccC
Confidence 34444455666666 55656222345556655544443332 22221 122457788888777766665
No 202
>COG5134 Uncharacterized conserved protein [Function unknown]
Probab=41.91 E-value=33 Score=25.77 Aligned_cols=31 Identities=19% Similarity=0.395 Sum_probs=24.5
Q ss_pred cCCCCccceEECCCCcEEEEECCCCEEEEECC
Q psy4774 9 QQCGRPLGMKFDKNGALHVADAYFGLYKVNVT 40 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~ 40 (120)
+.|++|..++.||.+..||...+ |+..|+|+
T Consensus 83 ~~C~n~i~~RTDPkN~~YV~EsG-g~R~i~pq 113 (272)
T COG5134 83 HLCSNPIDVRTDPKNTEYVVESG-GRRKIEPQ 113 (272)
T ss_pred cCCCCceeeecCCCCceEEEecC-ceeecCcc
Confidence 56899999999999999999763 45556543
No 203
>KOG0289|consensus
Probab=41.87 E-value=1.3e+02 Score=24.95 Aligned_cols=56 Identities=11% Similarity=0.237 Sum_probs=35.7
Q ss_pred ceEECCCCcEEEEECCCCEEEE-ECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 16 GMKFDKNGALHVADAYFGLYKV-NVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~v-d~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
..+|.|||.|+.+-...++++| |.+.+ ..++. .+|. -.-.-.|.|+.+|.-.+|-.
T Consensus 352 s~~fHpDgLifgtgt~d~~vkiwdlks~--~~~a~----Fpgh-t~~vk~i~FsENGY~Lat~a 408 (506)
T KOG0289|consen 352 SAAFHPDGLIFGTGTPDGVVKIWDLKSQ--TNVAK----FPGH-TGPVKAISFSENGYWLATAA 408 (506)
T ss_pred EeeEcCCceEEeccCCCceEEEEEcCCc--ccccc----CCCC-CCceeEEEeccCceEEEEEe
Confidence 5789999999998877887776 55433 23332 2331 11234788998886555544
No 204
>PF15533 Toxin_54: Putative toxin 54
Probab=41.80 E-value=22 Score=21.43 Aligned_cols=17 Identities=18% Similarity=0.352 Sum_probs=13.8
Q ss_pred CceEEcCCCCEEEEeCC
Q psy4774 63 NSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 63 ndl~v~~~G~iy~TDs~ 79 (120)
-||..|.+|+||+-=-.
T Consensus 37 yDlykD~~gni~ik~KG 53 (66)
T PF15533_consen 37 YDLYKDREGNIYIKPKG 53 (66)
T ss_pred ceeEEcCCCCEEEecCC
Confidence 49999999999985443
No 205
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=41.77 E-value=1.1e+02 Score=23.59 Aligned_cols=53 Identities=21% Similarity=0.305 Sum_probs=33.5
Q ss_pred eEECCCC-cEEEEECC-----CCEEEEECC-CCcEEEEeccccccCCCCCcCC-CceEEcCCCCEEEEeC
Q psy4774 17 MKFDKNG-ALHVADAY-----FGLYKVNVT-TGQTEQLISMDTEIDGAKPQIP-NSVTVDSDGMVYWSDS 78 (120)
Q Consensus 17 l~~d~~G-~l~V~d~~-----~gi~~vd~~-~g~~~~~~~~~~~~~g~~~~~p-ndl~v~~~G~iy~TDs 78 (120)
+++|+++ .||..... ++|++++.+ +++++.+.. .. .. ..+.++++|..|+--.
T Consensus 286 ~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~~~~~~~LT~----~~-----~~~~~~~~Spdg~y~v~~~ 346 (353)
T PF00930_consen 286 LGWDEDNNRIYFTANGDNPGERHLYRVSLDSGGEPKCLTC----ED-----GDHYSASFSPDGKYYVDTY 346 (353)
T ss_dssp EEEECTSSEEEEEESSGGTTSBEEEEEETTETTEEEESST----TS-----STTEEEEE-TTSSEEEEEE
T ss_pred ceEcCCCCEEEEEecCCCCCceEEEEEEeCCCCCeEeccC----CC-----CCceEEEECCCCCEEEEEE
Confidence 5667775 56554322 469999988 777776632 11 22 4899999998776443
No 206
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=41.53 E-value=54 Score=17.11 Aligned_cols=28 Identities=18% Similarity=0.150 Sum_probs=20.4
Q ss_pred CCcEEEEECC-------CCEEEEECCCCcEEEEec
Q psy4774 22 NGALHVADAY-------FGLYKVNVTTGQTEQLIS 49 (120)
Q Consensus 22 ~G~l~V~d~~-------~gi~~vd~~~g~~~~~~~ 49 (120)
++.|||.-.. ..+.++|+++++++.+.+
T Consensus 11 ~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~ 45 (47)
T PF01344_consen 11 GNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPP 45 (47)
T ss_dssp TTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEE
T ss_pred CCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCC
Confidence 5789997432 128889999888887753
No 207
>KOG2096|consensus
Probab=41.41 E-value=1e+02 Score=24.75 Aligned_cols=65 Identities=14% Similarity=0.158 Sum_probs=47.3
Q ss_pred cCCCCccceEECCCCcEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 9 QQCGRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 9 ~~~g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
..++.|.=|++.|.|.++.+..+.-|..+..++|+...-++ .+.+.- ...|.++++|....|-.-
T Consensus 329 aag~~p~RL~lsP~g~~lA~s~gs~l~~~~se~g~~~~~~e---~~h~~~---Is~is~~~~g~~~atcGd 393 (420)
T KOG2096|consen 329 AAGSEPVRLELSPSGDSLAVSFGSDLKVFASEDGKDYPELE---DIHSTT---ISSISYSSDGKYIATCGD 393 (420)
T ss_pred hcCCCceEEEeCCCCcEEEeecCCceEEEEcccCccchhHH---HhhcCc---eeeEEecCCCcEEeeecc
Confidence 34677889999999999988877667777777777544433 344433 458899999988877764
No 208
>KOG0772|consensus
Probab=41.28 E-value=94 Score=26.45 Aligned_cols=61 Identities=13% Similarity=0.207 Sum_probs=34.4
Q ss_pred cceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCC-CceEEcCCCCEEEEeCCC
Q psy4774 15 LGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIP-NSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 15 ~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~p-ndl~v~~~G~iy~TDs~~ 80 (120)
..|+|..+|+.+.+-...+ +-..|...-+ +.+. ...|-+-.+| -+..|+|+..|.||-++.
T Consensus 368 tsi~FS~dg~~LlSRg~D~tLKvWDLrq~k-kpL~----~~tgL~t~~~~tdc~FSPd~kli~TGtS~ 430 (641)
T KOG0772|consen 368 TSISFSYDGNYLLSRGFDDTLKVWDLRQFK-KPLN----VRTGLPTPFPGTDCCFSPDDKLILTGTSA 430 (641)
T ss_pred eEEEeccccchhhhccCCCceeeeeccccc-cchh----hhcCCCccCCCCccccCCCceEEEecccc
Confidence 4688888888777644332 2222322111 1221 1123222222 489999999999999874
No 209
>PF14157 YmzC: YmzC-like protein; PDB: 3KVP_E.
Probab=41.26 E-value=24 Score=21.14 Aligned_cols=14 Identities=14% Similarity=0.195 Sum_probs=11.0
Q ss_pred eEEEEcCCC--ceEec
Q psy4774 98 SFIKIGPNT--QEVLQ 111 (120)
Q Consensus 98 ~l~~~d~~~--~~~l~ 111 (120)
.+|+|||++ ++++-
T Consensus 42 KIfkyd~~tNei~L~K 57 (63)
T PF14157_consen 42 KIFKYDEDTNEITLKK 57 (63)
T ss_dssp EEEEEETTTTEEEEEE
T ss_pred EEEEeCCCCCeEEEEE
Confidence 779999998 66553
No 210
>PRK13684 Ycf48-like protein; Provisional
Probab=40.80 E-value=1.7e+02 Score=22.67 Aligned_cols=14 Identities=14% Similarity=0.140 Sum_probs=7.7
Q ss_pred eEEcCCCCEEEEeC
Q psy4774 65 VTVDSDGMVYWSDS 78 (120)
Q Consensus 65 l~v~~~G~iy~TDs 78 (120)
+++..+|.||.|..
T Consensus 274 ~~~G~~G~v~~S~d 287 (334)
T PRK13684 274 WAGGGNGTLLVSKD 287 (334)
T ss_pred EEEcCCCeEEEeCC
Confidence 44445566666544
No 211
>PRK12691 flgG flagellar basal body rod protein FlgG; Reviewed
Probab=40.52 E-value=1.6e+02 Score=22.16 Aligned_cols=57 Identities=25% Similarity=0.364 Sum_probs=33.9
Q ss_pred CCCccceEECCCCcEEEEECCC--------CEEEEECCCCcEEEEeccccccCCC----CCcCC---CceEEcCCCCEEE
Q psy4774 11 CGRPLGMKFDKNGALHVADAYF--------GLYKVNVTTGQTEQLISMDTEIDGA----KPQIP---NSVTVDSDGMVYW 75 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~~--------gi~~vd~~~g~~~~~~~~~~~~~g~----~~~~p---ndl~v~~~G~iy~ 75 (120)
-++|+-+|+..+| .|+..... |-+++|. +|.+.. .+|. ++..| .++.|+++|.|+.
T Consensus 89 Tg~~lDlAI~G~G-fF~V~~~~G~~~yTR~G~F~~d~-~G~Lvt-------~~G~~vl~~I~vp~~~~~~~i~~dG~i~~ 159 (262)
T PRK12691 89 TGNDLDLAIQGRG-YFQIQLPDGETAYTRAGAFNRSA-DGQIVT-------SDGYPVQPGITIPQDATSITINASGQVSA 159 (262)
T ss_pred cCCceeEEEcCCc-EEEEEcCCCCEEEeeCCCeeECC-CCCEEC-------CCCCEeEeeeecCCCCceEEECCCCEEEE
Confidence 4778888988776 34443222 3455665 354321 1232 24445 3799999999976
Q ss_pred E
Q psy4774 76 S 76 (120)
Q Consensus 76 T 76 (120)
.
T Consensus 160 ~ 160 (262)
T PRK12691 160 T 160 (262)
T ss_pred E
Confidence 4
No 212
>PTZ00420 coronin; Provisional
Probab=40.28 E-value=2.3e+02 Score=24.07 Aligned_cols=60 Identities=5% Similarity=-0.002 Sum_probs=36.6
Q ss_pred CCccceEECCCCcE-EEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGAL-HVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l-~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
..-..++++|++.. +++-...+ |...|.++++...-.. .. ..+..++++++|.+.+|-+.
T Consensus 126 ~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i~-----~~---~~V~SlswspdG~lLat~s~ 187 (568)
T PTZ00420 126 KKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQIN-----MP---KKLSSLKWNIKGNLLSGTCV 187 (568)
T ss_pred CcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCcEEEEEe-----cC---CcEEEEEECCCCCEEEEEec
Confidence 34567899998854 44433344 5556877665322111 11 13568999999998887553
No 213
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=39.76 E-value=83 Score=23.82 Aligned_cols=62 Identities=19% Similarity=0.144 Sum_probs=36.8
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECCCCc--EEEEeccccccCCCCCcCCCceEEcCCC----CEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVTTGQ--TEQLISMDTEIDGAKPQIPNSVTVDSDG----MVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~--~~~~~~~~~~~~g~~~~~pndl~v~~~G----~iy~TDs~ 79 (120)
|.--||+.|.+ +|+.+|...-+...||++=+ .++.+ ..+|.|..-.|-+.--. | |||.|+.-
T Consensus 131 GeGWgLt~d~~-~LimsdGsatL~frdP~tfa~~~~v~V----T~~g~pv~~LNELE~Vd-G~lyANVw~t~~I 198 (262)
T COG3823 131 GEGWGLTSDDK-NLIMSDGSATLQFRDPKTFAELDTVQV----TDDGVPVSKLNELEWVD-GELYANVWQTTRI 198 (262)
T ss_pred CcceeeecCCc-ceEeeCCceEEEecCHHHhhhcceEEE----EECCeecccccceeeec-cEEEEeeeeecce
Confidence 34447777644 68888876667778887521 12222 23677777777775443 4 55555553
No 214
>KOG0299|consensus
Probab=39.08 E-value=94 Score=25.80 Aligned_cols=56 Identities=16% Similarity=0.232 Sum_probs=36.8
Q ss_pred cceEECCCCcEEEEECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCC-EEEE
Q psy4774 15 LGMKFDKNGALHVADAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWS 76 (120)
Q Consensus 15 ~Gl~~d~~G~l~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~-iy~T 76 (120)
.+|+.-+.-+|+.+.+..| +|++...-.+++.+.+. +.. .+.|.|.|.++|. |++.
T Consensus 384 tsla~i~~sdL~asGS~~G~vrLW~i~~g~r~i~~l~~l--s~~----GfVNsl~f~~sgk~ivag 443 (479)
T KOG0299|consen 384 TSLAVIPGSDLLASGSWSGCVRLWKIEDGLRAINLLYSL--SLV----GFVNSLAFSNSGKRIVAG 443 (479)
T ss_pred eeeEecccCceEEecCCCCceEEEEecCCccccceeeec--ccc----cEEEEEEEccCCCEEEEe
Confidence 4677777777887666555 77776432356666532 222 3789999999997 5555
No 215
>PF06970 RepA_N: Replication initiator protein A (RepA) N-terminus; InterPro: IPR010724 This entry represents the N terminus (approximately 80 residues) of replication initiator protein A (RepA), a DNA replication initiator in plasmids []. Most proteins in this entry are bacterial, but archaeal and eukaryotic members are also included.
Probab=37.46 E-value=19 Score=22.21 Aligned_cols=16 Identities=38% Similarity=0.418 Sum_probs=11.9
Q ss_pred CCceEEcCCCCEEEEeC
Q psy4774 62 PNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 62 pndl~v~~~G~iy~TDs 78 (120)
-|+ ++|.+|+|||-=+
T Consensus 39 kn~-wiDe~G~vYi~~s 54 (76)
T PF06970_consen 39 KNG-WIDENGNVYIIFS 54 (76)
T ss_pred hcC-cCCCCCCEEEEee
Confidence 345 8999999997543
No 216
>PF13511 DUF4124: Domain of unknown function (DUF4124)
Probab=37.41 E-value=25 Score=19.99 Aligned_cols=15 Identities=40% Similarity=0.654 Sum_probs=12.6
Q ss_pred EEcCCCCEEEEeCCC
Q psy4774 66 TVDSDGMVYWSDSST 80 (120)
Q Consensus 66 ~v~~~G~iy~TDs~~ 80 (120)
++|.+|++-|||...
T Consensus 18 ~~D~~G~v~ysd~P~ 32 (60)
T PF13511_consen 18 WVDENGVVHYSDTPP 32 (60)
T ss_pred EECCCCCEEECccCC
Confidence 678899999999864
No 217
>KOG3567|consensus
Probab=37.27 E-value=57 Score=27.21 Aligned_cols=67 Identities=21% Similarity=0.209 Sum_probs=45.3
Q ss_pred CccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccc-cCCCC-CcCCCceEEcCCCCEEEEeCC
Q psy4774 13 RPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTE-IDGAK-PQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~-~~g~~-~~~pndl~v~~~G~iy~TDs~ 79 (120)
.|+|+..|.+|+.++++...+ ..+.+|+.++.-.+++..+. ..+.+ |.-|-+++|..+|.+.+-|..
T Consensus 169 vhyg~t~df~~~~d~TgV~mH~t~kp~pkla~~~L~l~~~tvp~~~~~~f~~~tsc~v~~n~~ihvfa~r 238 (501)
T KOG3567|consen 169 VHYGLTIDFDGNYDVTGVGMHQTEKPQPKLAKTMLLLGDGTVPGEGTKHFETPTSCAVEENGPIHVFAYR 238 (501)
T ss_pred eccccccCCCCCcccccceeeeeccCCchhhceEEeecCCccCCCCccccCCCceEEEecCcceeeEEee
Confidence 478888888889999987744 55668876665444432111 11222 444559999999999988875
No 218
>PRK13684 Ycf48-like protein; Provisional
Probab=36.97 E-value=1.8e+02 Score=22.55 Aligned_cols=36 Identities=19% Similarity=0.191 Sum_probs=18.2
Q ss_pred CCccceEECCCCcEEEEECCCCEEEEECCCCcEEEE
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYKVNVTTGQTEQL 47 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~~~~~ 47 (120)
+.|.++.....+.++++....+|++=+..+...+.+
T Consensus 132 ~~~~~i~~~~~~~~~~~g~~G~i~~S~DgG~tW~~~ 167 (334)
T PRK13684 132 GSPYLITALGPGTAEMATNVGAIYRTTDGGKNWEAL 167 (334)
T ss_pred CCceEEEEECCCcceeeeccceEEEECCCCCCceeC
Confidence 345555554445666665444466655433344444
No 219
>PHA02713 hypothetical protein; Provisional
Probab=36.64 E-value=2.5e+02 Score=23.47 Aligned_cols=50 Identities=8% Similarity=-0.035 Sum_probs=30.4
Q ss_pred CCcEEEEECCC------CEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 22 NGALHVADAYF------GLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 22 ~G~l~V~d~~~------gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
+|.|||.-... .+.++||++.+++.+... .. +.... .++ .-+|.||+.-.
T Consensus 351 ~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~~~~m----p~-~r~~~-~~~-~~~g~IYviGG 406 (557)
T PHA02713 351 DDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDM----PI-ALSSY-GMC-VLDQYIYIIGG 406 (557)
T ss_pred CCEEEEECCcCCCCCCceEEEEECCCCeEEECCCC----Cc-ccccc-cEE-EECCEEEEEeC
Confidence 57999985432 278899988888877542 11 11112 222 23589998754
No 220
>KOG4441|consensus
Probab=35.55 E-value=2.7e+02 Score=23.51 Aligned_cols=78 Identities=19% Similarity=0.233 Sum_probs=45.2
Q ss_pred ceEECCCCcEEEEECCCC------EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCccccccee
Q psy4774 16 GMKFDKNGALHVADAYFG------LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKLYDGLF 89 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~g------i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~~~~~~ 89 (120)
|+++. ++.|||+-...+ +-++||++.+++.+.... .-...-++++. ++.+|+.--...
T Consensus 470 g~a~~-~~~iYvvGG~~~~~~~~~VE~ydp~~~~W~~v~~m~------~~rs~~g~~~~-~~~ly~vGG~~~-------- 533 (571)
T KOG4441|consen 470 GVAVL-NGKIYVVGGFDGTSALSSVERYDPETNQWTMVAPMT------SPRSAVGVVVL-GGKLYAVGGFDG-------- 533 (571)
T ss_pred eEEEE-CCEEEEECCccCCCccceEEEEcCCCCceeEcccCc------cccccccEEEE-CCEEEEEecccC--------
Confidence 56665 579999855433 788999999998886421 11112344444 367887765311
Q ss_pred cceecCCceEEEEcCCC--ceEecC
Q psy4774 90 DGLTSGSGSFIKIGPNT--QEVLQK 112 (120)
Q Consensus 90 ~~~~~~~G~l~~~d~~~--~~~l~~ 112 (120)
......|-.|||.+ .+.+.+
T Consensus 534 ---~~~l~~ve~ydp~~d~W~~~~~ 555 (571)
T KOG4441|consen 534 ---NNNLNTVECYDPETDTWTEVTE 555 (571)
T ss_pred ---ccccceeEEcCCCCCceeeCCC
Confidence 12234556667766 444433
No 221
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=35.40 E-value=1.2e+02 Score=22.46 Aligned_cols=54 Identities=20% Similarity=0.211 Sum_probs=30.3
Q ss_pred CccceEECCCCcEEEEECCCCE-EEE-ECCCCcEEEEeccccccCCCCCc-CCCceEEcCCC
Q psy4774 13 RPLGMKFDKNGALHVADAYFGL-YKV-NVTTGQTEQLISMDTEIDGAKPQ-IPNSVTVDSDG 71 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~gi-~~v-d~~~g~~~~~~~~~~~~~g~~~~-~pndl~v~~~G 71 (120)
.-....++++|.+|+++..... ..+ +..+++...+. .+-..+. ....|.+++||
T Consensus 67 ~l~~PS~d~~g~~W~v~~~~~~~~~~~~~~~g~~~~~~-----v~~~~~~~~I~~l~vSpDG 123 (253)
T PF10647_consen 67 SLTRPSWDPDGWVWTVDDGSGGVRVVRDSASGTGEPVE-----VDWPGLRGRITALRVSPDG 123 (253)
T ss_pred ccccccccCCCCEEEEEcCCCceEEEEecCCCcceeEE-----ecccccCCceEEEEECCCC
Confidence 4445689999999998765432 222 22344443331 1111111 45688888888
No 222
>PF08857 ParBc_2: Putative ParB-like nuclease; InterPro: IPR014956 These proteins are probably distantly related to IPR003115 from INTERPRO. Suggesting these, uncharacterised proteins have a nuclease function. ; PDB: 2HWJ_E.
Probab=34.70 E-value=28 Score=24.71 Aligned_cols=16 Identities=25% Similarity=0.580 Sum_probs=10.3
Q ss_pred ceEEcCCCCEEEEeCC
Q psy4774 64 SVTVDSDGMVYWSDSS 79 (120)
Q Consensus 64 dl~v~~~G~iy~TDs~ 79 (120)
=+++.|+|.+|+||--
T Consensus 47 pvViGP~g~lyl~D~H 62 (163)
T PF08857_consen 47 PVVIGPGGQLYLTDHH 62 (163)
T ss_dssp EEEE-STT-EEE-S-H
T ss_pred CEEECCCCCeEEECCc
Confidence 5788999999999983
No 223
>KOG0263|consensus
Probab=34.61 E-value=2.9e+02 Score=24.35 Aligned_cols=59 Identities=14% Similarity=0.137 Sum_probs=34.9
Q ss_pred CCCccceEECCCCcEEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 11 CGRPLGMKFDKNGALHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
+|--+|..|.|+.+++++.+. -.+|.++.. ...++. +.+-.| .=++.|+|.|.-|.|-|
T Consensus 451 ~GPVyg~sFsPd~rfLlScSED~svRLWsl~t~--s~~V~y----~GH~~P---VwdV~F~P~GyYFatas 512 (707)
T KOG0263|consen 451 SGPVYGCSFSPDRRFLLSCSEDSSVRLWSLDTW--SCLVIY----KGHLAP---VWDVQFAPRGYYFATAS 512 (707)
T ss_pred CCceeeeeecccccceeeccCCcceeeeecccc--eeEEEe----cCCCcc---eeeEEecCCceEEEecC
Confidence 455679999999999887544 346666642 222332 112122 23677888766555554
No 224
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=34.43 E-value=2.2e+02 Score=23.03 Aligned_cols=30 Identities=30% Similarity=0.346 Sum_probs=20.0
Q ss_pred CccceEECCCCcEEEEECCCCEEEEECCCCc
Q psy4774 13 RPLGMKFDKNGALHVADAYFGLYKVNVTTGQ 43 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~gi~~vd~~~g~ 43 (120)
+-.++.+.++|.+|++-...++++-+. +|+
T Consensus 282 ~l~~v~~~~dg~l~l~g~~G~l~~S~d-~G~ 311 (398)
T PLN00033 282 RIQNMGWRADGGLWLLTRGGGLYVSKG-TGL 311 (398)
T ss_pred ceeeeeEcCCCCEEEEeCCceEEEecC-CCC
Confidence 345788888999998876555555443 453
No 225
>PF06462 Hyd_WA: Propeller; InterPro: IPR006624 Tectonins I and II are two dominant proteins in the nuclei and nuclear matrix from plasmodia of Physarum polycephalum (Slime mold) which encode 217 and 353 amino acids, respectively. Tectonin I is homologous to the C-terminal two-thirds of tectonin II. Both proteins contain six tandem repeats that are each 33-37 amino acids in length and define a new consensus sequence. Homologous repeats are found in L-6, a bacterial lipopolysaccharide-binding lectin from horseshoe crab hemocytes. The repetitive sequences of the tectonins and L-6 are reminiscent of the WD repeats of the beta-subunit of G proteins, suggesting that they form beta-propeller domains. The tectonins may be lectins that function as part of a transmembrane signalling complex during phagocytosis [].
Probab=34.03 E-value=42 Score=16.97 Aligned_cols=14 Identities=43% Similarity=0.679 Sum_probs=9.4
Q ss_pred eEEcCCCCEEEEeC
Q psy4774 65 VTVDSDGMVYWSDS 78 (120)
Q Consensus 65 l~v~~~G~iy~TDs 78 (120)
.+++.+|.||+=-.
T Consensus 4 Wav~~~G~v~~R~G 17 (32)
T PF06462_consen 4 WAVTSDGSVYFRTG 17 (32)
T ss_pred EEEcCCCCEEEECc
Confidence 35677888886543
No 226
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=33.85 E-value=3e+02 Score=23.47 Aligned_cols=13 Identities=15% Similarity=0.353 Sum_probs=8.6
Q ss_pred cCCceEEEEc-CCC
Q psy4774 94 SGSGSFIKIG-PNT 106 (120)
Q Consensus 94 ~~~G~l~~~d-~~~ 106 (120)
.|++|-+.|. |++
T Consensus 452 ~~nsr~iAYafP~g 465 (668)
T COG4946 452 HPNSRWIAYAFPEG 465 (668)
T ss_pred cCCceeEEEecCcc
Confidence 5777777775 555
No 227
>PTZ00421 coronin; Provisional
Probab=33.80 E-value=2.8e+02 Score=23.02 Aligned_cols=32 Identities=13% Similarity=0.226 Sum_probs=23.7
Q ss_pred CccceEECCCCcEEEEECCCC-EEEEECCCCcE
Q psy4774 13 RPLGMKFDKNGALHVADAYFG-LYKVNVTTGQT 44 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~ 44 (120)
.-..+++.++|+++++-...+ |..+|+.+++.
T Consensus 170 ~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~~ 202 (493)
T PTZ00421 170 QITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTI 202 (493)
T ss_pred ceEEEEEECCCCEEEEecCCCEEEEEECCCCcE
Confidence 356799999999888766655 66678877764
No 228
>PHA03175 UL43 envelope protein; Provisional
Probab=33.58 E-value=25 Score=28.26 Aligned_cols=24 Identities=17% Similarity=0.325 Sum_probs=21.2
Q ss_pred cCCceEEEEcCCC--ceEecCccccc
Q psy4774 94 SGSGSFIKIGPNT--QEVLQKQTDKL 117 (120)
Q Consensus 94 ~~~G~l~~~d~~~--~~~l~~~L~~~ 117 (120)
...|.||.|||.+ ++-|+++|..|
T Consensus 335 nE~G~VF~vdP~~~~i~fLA~~L~~F 360 (413)
T PHA03175 335 NECGAVFGVHPDTNQAHFLARGLLGF 360 (413)
T ss_pred ccCceEEEECCCCCeeeehhhchhHH
Confidence 4679999999998 99999999875
No 229
>cd00819 PEPCK_GTP Phosphoenolpyruvate carboxykinase (PEPCK), a critical gluconeogenic enzyme, catalyzes the first committed step in the diversion of tricarboxylic acid cycle intermediates toward gluconeogenesis. It catalyzes the reversible decarboxylation and phosphorylation of oxaloacetate to yield phosphoenolpyruvate and carbon dioxide, using a nucleotide molecule (GTP) for the phosphoryl transfer, and has a strict requirement for divalent metal ions for activity. PEPCK's separate into two phylogenetic groups based on their nucleotide substrate specificity, this model describes the GTP-dependent group.
Probab=32.97 E-value=1e+02 Score=26.42 Aligned_cols=59 Identities=24% Similarity=0.420 Sum_probs=32.2
Q ss_pred eEECCCCcEEEEECCCCEEEEECCCCcEE-EEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 17 MKFDKNGALHVADAYFGLYKVNVTTGQTE-QLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 17 l~~d~~G~l~V~d~~~gi~~vd~~~g~~~-~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
|.+++||+||..+...|++-|-|.++..+ +.+- .... .. ..-..+++.+||.+|+-...
T Consensus 279 m~~~~dG~l~AINPE~GfFGVapGtn~~tnP~am--~~l~-~n-~IFTNVa~t~DG~vwWeG~~ 338 (579)
T cd00819 279 MKFGEDGRLYAINPEAGFFGVAPGTNAKTNPNAM--ATLH-KN-TIFTNVALTEDGDVWWEGLT 338 (579)
T ss_pred eEECCCCcEEEEcCCCCeeEeCCCCCCCcCHHHH--HHhc-CC-ceEEEEeEcCCCCeeCCCCC
Confidence 56666777877776677776655433211 0000 0011 11 12346888889999976553
No 230
>PHA03344 US22 family homolog; Provisional
Probab=32.78 E-value=30 Score=29.89 Aligned_cols=26 Identities=15% Similarity=0.262 Sum_probs=22.2
Q ss_pred ecCCceEEEEcCCC--ceEecCcccccC
Q psy4774 93 TSGSGSFIKIGPNT--QEVLQKQTDKLG 118 (120)
Q Consensus 93 ~~~~G~l~~~d~~~--~~~l~~~L~~~~ 118 (120)
-...||||.||+.. +-+++.+|.+|.
T Consensus 280 MG~~GRVYVYD~eeD~LyLVA~dLDELA 307 (672)
T PHA03344 280 MDEEGRFFLYDAESDGLFLAAENIDELA 307 (672)
T ss_pred EeCCceEEEEEcCCCEEEEeecCHHHHH
Confidence 45789999999987 999999988875
No 231
>KOG3545|consensus
Probab=31.98 E-value=1.2e+02 Score=23.19 Aligned_cols=53 Identities=23% Similarity=0.266 Sum_probs=29.7
Q ss_pred CcEEEEEC-CCCEEEEECCCCcEEEEeccccccCCCCCcCC----------CceEEcCCC--CEEEEeCC
Q psy4774 23 GALHVADA-YFGLYKVNVTTGQTEQLISMDTEIDGAKPQIP----------NSVTVDSDG--MVYWSDSS 79 (120)
Q Consensus 23 G~l~V~d~-~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~p----------ndl~v~~~G--~iy~TDs~ 79 (120)
|.+|--.. ...|++++.+++.+ .... ..++.....+ .|+++|.+| -||-|.-.
T Consensus 78 Gs~yynk~~t~~ivky~l~~~~~--~~~~--~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWviYat~~~ 143 (249)
T KOG3545|consen 78 GSLYYNKAGTRNIIKYDLETRTV--AGSA--ALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVIYATPEN 143 (249)
T ss_pred ceEEeeccCCcceEEEEeeccee--eeee--eccccccCCCcccccCCCccccceecccceeEEeccccc
Confidence 45554332 24599999875432 2111 2233333334 689999998 57766654
No 232
>KOG2055|consensus
Probab=31.37 E-value=3.2e+02 Score=22.98 Aligned_cols=62 Identities=15% Similarity=0.180 Sum_probs=35.0
Q ss_pred CCCccceEECCCCcE-EEEECCCCEEEEECCCCcEE-EEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 11 CGRPLGMKFDKNGAL-HVADAYFGLYKVNVTTGQTE-QLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 11 ~g~P~Gl~~d~~G~l-~V~d~~~gi~~vd~~~g~~~-~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
.|.-.++.|.++|+. |+|.....|+..|....++. .+.. +|. ..--.++++.+|..+.|-|.
T Consensus 344 eG~v~~~~fsSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D-----~G~--v~gts~~~S~ng~ylA~GS~ 407 (514)
T KOG2055|consen 344 EGVVSDFTFSSDSKELLASGGTGEVYVWNLRQNSCLHRFVD-----DGS--VHGTSLCISLNGSYLATGSD 407 (514)
T ss_pred ccEEeeEEEecCCcEEEEEcCCceEEEEecCCcceEEEEee-----cCc--cceeeeeecCCCceEEeccC
Confidence 366778999999955 54544344888887654432 2221 221 12235666677775555554
No 233
>PLN02258 9-cis-epoxycarotenoid dioxygenase NCED
Probab=31.21 E-value=3.4e+02 Score=23.25 Aligned_cols=59 Identities=14% Similarity=0.289 Sum_probs=33.2
Q ss_pred EECCC-CcEEEEECC------CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCc
Q psy4774 18 KFDKN-GALHVADAY------FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKY 82 (120)
Q Consensus 18 ~~d~~-G~l~V~d~~------~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~ 82 (120)
.+|+. |+||..... ..++++|+ +|++...... .-....+.+|+++.++ .+.|-|..-.|
T Consensus 291 KvDP~TGel~~f~y~~~~~p~l~~~~~d~-~G~~~~~~~i----~lp~p~~~HDFaiTen-Y~Vf~d~Pl~~ 356 (590)
T PLN02258 291 KVDPVTGELFALSYDVVKKPYLKYFRFSP-DGEKSPDVEI----PLDQPTMMHDFAITEN-FVVIPDQQVVF 356 (590)
T ss_pred eEcCCCCeEEEEEEeccCCCcEEEEEECC-CCCEEeeEEe----eCCCCccccceeccCc-eEEEEccCceE
Confidence 67774 777654321 22666776 4665544432 2223456789998875 55555554333
No 234
>KOG0263|consensus
Probab=29.97 E-value=1.3e+02 Score=26.47 Aligned_cols=62 Identities=18% Similarity=0.237 Sum_probs=41.8
Q ss_pred CCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
+.-..+++.|.|+-+++-...| |...|..+|+. +.. .-+. -...+.|.|+.+|+|.++++..
T Consensus 578 ~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~--v~~----l~~H-t~ti~SlsFS~dg~vLasgg~D 640 (707)
T KOG0263|consen 578 GPVTALAFSPCGRYLASGDEDGLIKIWDLANGSL--VKQ----LKGH-TGTIYSLSFSRDGNVLASGGAD 640 (707)
T ss_pred CceEEEEEcCCCceEeecccCCcEEEEEcCCCcc--hhh----hhcc-cCceeEEEEecCCCEEEecCCC
Confidence 3345799999997777654455 55567766543 221 1222 3456899999999999999974
No 235
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=29.61 E-value=86 Score=16.55 Aligned_cols=27 Identities=15% Similarity=0.202 Sum_probs=16.9
Q ss_pred CCCcEEEEECC-------CCEEEEECCCCcEEEE
Q psy4774 21 KNGALHVADAY-------FGLYKVNVTTGQTEQL 47 (120)
Q Consensus 21 ~~G~l~V~d~~-------~gi~~vd~~~g~~~~~ 47 (120)
.++.|||.-.. .-++++|+++++++.+
T Consensus 11 ~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~ 44 (49)
T PF13418_consen 11 GDNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRL 44 (49)
T ss_dssp -TTEEEEE--EEE-TEE---EEEEETTTTEEEE-
T ss_pred eCCeEEEECCCCCCCcccCCEEEEECCCCEEEEC
Confidence 45788886321 2388999988888777
No 236
>KOG0268|consensus
Probab=28.56 E-value=36 Score=27.59 Aligned_cols=64 Identities=16% Similarity=0.144 Sum_probs=41.4
Q ss_pred CCCCccceEECCCCcEEEE-ECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 10 QCGRPLGMKFDKNGALHVA-DAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~-d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
...++++|.++|++-.||+ +....++-+|...-+ +++ ...-.-.+..-|++++|-|.=++|-|.
T Consensus 228 ~~mRTN~IswnPeafnF~~a~ED~nlY~~DmR~l~-~p~-----~v~~dhvsAV~dVdfsptG~Efvsgsy 292 (433)
T KOG0268|consen 228 LTMRTNTICWNPEAFNFVAANEDHNLYTYDMRNLS-RPL-----NVHKDHVSAVMDVDFSPTGQEFVSGSY 292 (433)
T ss_pred eeccccceecCccccceeeccccccceehhhhhhc-ccc-----hhhcccceeEEEeccCCCcchhccccc
Confidence 4568999999998866665 444668888864221 111 111112234558999999999998885
No 237
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.26 E-value=1.4e+02 Score=23.71 Aligned_cols=38 Identities=32% Similarity=0.536 Sum_probs=28.1
Q ss_pred CCccceEECCC-CcEEEE--ECC--CCEEEEECCCCcEEEEec
Q psy4774 12 GRPLGMKFDKN-GALHVA--DAY--FGLYKVNVTTGQTEQLIS 49 (120)
Q Consensus 12 g~P~Gl~~d~~-G~l~V~--d~~--~gi~~vd~~~g~~~~~~~ 49 (120)
|.-.-|-.||- .+||++ |.+ .||+++|.++|+.+.+.+
T Consensus 106 GEVSdIlYdP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~ 148 (339)
T PF09910_consen 106 GEVSDILYDPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSS 148 (339)
T ss_pred cchhheeeCCCcCEEEEEecCCcceeeeEEEcccCCceeeccC
Confidence 33445777775 488887 443 579999999999888864
No 238
>KOG0282|consensus
Probab=28.02 E-value=3e+02 Score=23.10 Aligned_cols=57 Identities=18% Similarity=0.274 Sum_probs=37.6
Q ss_pred ceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCC-CEEEEeCCC
Q psy4774 16 GMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG-MVYWSDSST 80 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G-~iy~TDs~~ 80 (120)
.+++..+|.=+.+-...+ |--.|.++|++..-.. .+. .|.-+-+-|++ ++++.-.+.
T Consensus 263 d~~~s~~g~~fLS~sfD~~lKlwDtETG~~~~~f~-----~~~---~~~cvkf~pd~~n~fl~G~sd 321 (503)
T KOG0282|consen 263 DASFNNCGTSFLSASFDRFLKLWDTETGQVLSRFH-----LDK---VPTCVKFHPDNQNIFLVGGSD 321 (503)
T ss_pred hhhccccCCeeeeeecceeeeeeccccceEEEEEe-----cCC---CceeeecCCCCCcEEEEecCC
Confidence 477888885555544444 5556888998765543 222 35778888887 888877764
No 239
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=27.82 E-value=3.3e+02 Score=22.00 Aligned_cols=24 Identities=8% Similarity=0.029 Sum_probs=15.5
Q ss_pred ceEECCCCcEEEEECCCCEEEEEC
Q psy4774 16 GMKFDKNGALHVADAYFGLYKVNV 39 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~gi~~vd~ 39 (120)
++...++|.+++......+++-..
T Consensus 243 ~v~~~~dG~~~~vg~~G~~~~s~d 266 (398)
T PLN00033 243 TVNRSPDGDYVAVSSRGNFYLTWE 266 (398)
T ss_pred eEEEcCCCCEEEEECCccEEEecC
Confidence 456778888887766444665543
No 240
>KOG1407|consensus
Probab=27.43 E-value=3e+02 Score=21.51 Aligned_cols=27 Identities=19% Similarity=0.331 Sum_probs=18.0
Q ss_pred ccceEECCCCcEEEEECCCCEEEE-ECC
Q psy4774 14 PLGMKFDKNGALHVADAYFGLYKV-NVT 40 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~~gi~~v-d~~ 40 (120)
-..|.|+|+|+-+...+...++.+ |++
T Consensus 192 CicI~f~p~GryfA~GsADAlvSLWD~~ 219 (313)
T KOG1407|consen 192 CICIEFDPDGRYFATGSADALVSLWDVD 219 (313)
T ss_pred eEEEEECCCCceEeeccccceeeccChh
Confidence 456999999988877655554443 443
No 241
>PF13186 SPASM: Iron-sulfur cluster-binding domain
Probab=27.15 E-value=37 Score=18.86 Aligned_cols=14 Identities=14% Similarity=0.235 Sum_probs=12.1
Q ss_pred ceEECCCCcEEEEE
Q psy4774 16 GMKFDKNGALHVAD 29 (120)
Q Consensus 16 Gl~~d~~G~l~V~d 29 (120)
.+.++++|++|.|.
T Consensus 7 ~~~I~~dG~v~pC~ 20 (64)
T PF13186_consen 7 SLYIDPDGDVYPCC 20 (64)
T ss_pred EEEEeeCccEEeCC
Confidence 57899999999994
No 242
>PRK04210 phosphoenolpyruvate carboxykinase; Provisional
Probab=26.91 E-value=1.7e+02 Score=25.22 Aligned_cols=60 Identities=18% Similarity=0.350 Sum_probs=33.8
Q ss_pred eEECCCCcEEEEECCCCEEEEECCCCcEEE-EeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 17 MKFDKNGALHVADAYFGLYKVNVTTGQTEQ-LISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 17 l~~d~~G~l~V~d~~~gi~~vd~~~g~~~~-~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
|.++.||+||..+...|++-|-|.++..+- .+- ....... ..-..|++.+||.+|+-...
T Consensus 294 m~~~~dG~l~AiNPE~GfFGVapGtn~~tnP~am--~~l~~~n-~IFTNValt~DG~vwWeG~~ 354 (601)
T PRK04210 294 IRPGEDGRLYAINPEAGFFGVAPGTNEKTNPNAM--ATLKPGN-VIFTNVALTDDGDVWWEGMT 354 (601)
T ss_pred eeECCCCcEEEEccCCCeeEeCCCCCCCcCHHHH--HhcccCC-eEEeeeEECCCCCeecCCCC
Confidence 667778888887777777777554332110 000 0010001 22347888889999976654
No 243
>KOG1273|consensus
Probab=26.79 E-value=3e+02 Score=22.14 Aligned_cols=60 Identities=13% Similarity=0.101 Sum_probs=42.2
Q ss_pred CccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 13 RPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
....+.|.+-|.++.....+| |+.+|..|-.+..+. ..+-.| ...|+.+++|+..+|-|.
T Consensus 25 ~a~~~~Fs~~G~~lAvGc~nG~vvI~D~~T~~iar~l----saH~~p---i~sl~WS~dgr~LltsS~ 85 (405)
T KOG1273|consen 25 LAECCQFSRWGDYLAVGCANGRVVIYDFDTFRIARML----SAHVRP---ITSLCWSRDGRKLLTSSR 85 (405)
T ss_pred ccceEEeccCcceeeeeccCCcEEEEEccccchhhhh----hccccc---eeEEEecCCCCEeeeecC
Confidence 367899999999888776666 777787664432222 112223 458999999999999885
No 244
>COG1692 Calcineurin-like phosphoesterase [General function prediction only]
Probab=26.67 E-value=57 Score=25.00 Aligned_cols=32 Identities=13% Similarity=0.252 Sum_probs=22.4
Q ss_pred CcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 42 GQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 42 g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
|+++.+++.- .-..-+|.-|-+.|+.|+||..
T Consensus 167 GrvsavvGTH------THV~TaD~rIL~~GTayiTDvG 198 (266)
T COG1692 167 GRVSAVVGTH------THVPTADERILPKGTAYITDVG 198 (266)
T ss_pred CeEEEEEecc------CccccccceecCCCcEEEecCc
Confidence 5666665421 1123479999999999999985
No 245
>KOG0649|consensus
Probab=26.22 E-value=1.5e+02 Score=23.05 Aligned_cols=35 Identities=17% Similarity=0.226 Sum_probs=25.9
Q ss_pred ccceEECCC-CcEEEEECCCCEEEEECCCCcEEEEe
Q psy4774 14 PLGMKFDKN-GALHVADAYFGLYKVNVTTGQTEQLI 48 (120)
Q Consensus 14 P~Gl~~d~~-G~l~V~d~~~gi~~vd~~~g~~~~~~ 48 (120)
-+.|.+||+ ++|+.|....-++.+|.++|+.+...
T Consensus 117 INam~ldP~enSi~~AgGD~~~y~~dlE~G~i~r~~ 152 (325)
T KOG0649|consen 117 INAMWLDPSENSILFAGGDGVIYQVDLEDGRIQREY 152 (325)
T ss_pred cceeEeccCCCcEEEecCCeEEEEEEecCCEEEEEE
Confidence 368889965 68887753334999999999876554
No 246
>KOG2055|consensus
Probab=26.14 E-value=3e+02 Score=23.11 Aligned_cols=55 Identities=18% Similarity=0.205 Sum_probs=31.4
Q ss_pred ceEECCCCcEEEEECCC-C-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEE
Q psy4774 16 GMKFDKNGALHVADAYF-G-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVY 74 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~-g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy 74 (120)
-.+|.++|.=-|+-+++ - ++.+|.++++++.+-. .-|.+-.+..-+.|+++++..
T Consensus 262 ~a~f~p~G~~~i~~s~rrky~ysyDle~ak~~k~~~----~~g~e~~~~e~FeVShd~~fi 318 (514)
T KOG2055|consen 262 KAEFAPNGHSVIFTSGRRKYLYSYDLETAKVTKLKP----PYGVEEKSMERFEVSHDSNFI 318 (514)
T ss_pred eeeecCCCceEEEecccceEEEEeeccccccccccC----CCCcccchhheeEecCCCCeE
Confidence 35678888633333332 2 8888998888877743 233333334455555555533
No 247
>PF14398 ATPgrasp_YheCD: YheC/D like ATP-grasp
Probab=25.84 E-value=51 Score=24.80 Aligned_cols=19 Identities=16% Similarity=0.452 Sum_probs=16.3
Q ss_pred CceEEcCCCCEEEEeCCCC
Q psy4774 63 NSVTVDSDGMVYWSDSSTK 81 (120)
Q Consensus 63 ndl~v~~~G~iy~TDs~~~ 81 (120)
=|+++|.+|.+|+=+..++
T Consensus 214 iDl~iD~~g~iWliEvN~k 232 (262)
T PF14398_consen 214 IDLGIDKNGKIWLIEVNSK 232 (262)
T ss_pred EEEEEcCCCCEEEEEEeCC
Confidence 3999999999999988653
No 248
>KOG1215|consensus
Probab=25.62 E-value=1.3e+02 Score=26.49 Aligned_cols=58 Identities=22% Similarity=0.454 Sum_probs=0.0
Q ss_pred ccceEECCC-CcEEEEECC--CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcC-CCCEEEEeC
Q psy4774 14 PLGMKFDKN-GALHVADAY--FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDS-DGMVYWSDS 78 (120)
Q Consensus 14 P~Gl~~d~~-G~l~V~d~~--~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~-~G~iy~TDs 78 (120)
|..+++++. |.+|.+|-. ..|.|-..+.-..++++. .+ +.+||++++|- +..+|+.|+
T Consensus 525 ~r~~~v~p~~g~~~wtd~~~~~~i~ra~~dg~~~~~l~~-----~~--~~~p~glt~d~~~~~~yw~d~ 586 (877)
T KOG1215|consen 525 PRSIAVDPEKGLMFWTDWGQPPRIERASLDGSERAVLVT-----NG--ILWPNGLTIDYETDRLYWADA 586 (877)
T ss_pred ccceeeccccCeeEEecCCCCchhhhhcCCCCCceEEEe-----CC--ccCCCcceEEeecceeEEEcc
No 249
>PTZ00420 coronin; Provisional
Probab=25.41 E-value=3.8e+02 Score=22.84 Aligned_cols=32 Identities=9% Similarity=0.129 Sum_probs=24.2
Q ss_pred CccceEECCCCcEEEEECCCC-EEEEECCCCcE
Q psy4774 13 RPLGMKFDKNGALHVADAYFG-LYKVNVTTGQT 44 (120)
Q Consensus 13 ~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~ 44 (120)
....++++++|+++++....+ |..+|+.+++.
T Consensus 169 ~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~ 201 (568)
T PTZ00420 169 KLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEI 201 (568)
T ss_pred cEEEEEECCCCCEEEEEecCCEEEEEECCCCcE
Confidence 456899999999998765544 77778887754
No 250
>PF00821 PEPCK: Phosphoenolpyruvate carboxykinase; InterPro: IPR008209 Phosphoenolpyruvate carboxykinase (PEPCK) catalyses the first committed (rate-limiting) step in hepatic gluconeogenesis, namely the reversible decarboxylation of oxaloacetate to phosphoenolpyruvate (PEP) and carbon dioxide, using either ATP or GTP as a source of phosphate. The ATP-utilising (4.1.1.49 from EC) and GTP-utilising (4.1.1.32 from EC) enzymes form two divergent subfamilies, which have little sequence similarity but which retain conserved active site residues. ATP-utilising PEPCKs are monomers or oligomers of identical subunits found in certain bacteria, yeast, trypanosomatids, and plants, while GTP-utilising PEPCKs are mainly monomers found in animals and some bacteria []. Both require divalent cations for activity, such as magnesium or manganese. One cation interacts with the enzyme at metal binding site 1 to elicit activation, while the second cation interacts at metal binding site 2 to serve as a metal-nucleotide substrate. In bacteria, fungi and plants, PEPCK is involved in the glyoxylate bypass, an alternative to the tricarboxylic acid cycle. PEPCK helps to regulate blood glucose levels. The rate of gluconeogenesis can be controlled through transcriptional regulation of the PEPCK gene by cAMP (the mediator of glucagon and catecholamines), glucocorticoids and insulin. In general, PEPCK expression is induced by glucagon, catecholamines and glucocorticoids during periods of fasting and in response to stress, but is inhibited by (glucose-induced) insulin upon feeding []. With type II diabetes, this regulation system can fail, resulting in increased gluconeogenesis that in turn raises glucose levels []. PEPCK consists of an N-terminal and a catalytic C-terminal domain, with the active site and metal ions located in a cleft between them. Both domains have an alpha/beta topology that is partly similar to one another [, ]. Substrate binding causes PEPCK to undergo a conformational change, which accelerates catalysis by forcing bulk solvent molecules out of the active site []. PCK uses an alpha/beta/alpha motif for nucleotide binding, this motif differing from other kinase domains. GTP-utilising PEPCK has a PEP-binding domain and two kinase motifs to bind GTP and magnesium. This entry represents GTP-utilising phosphoenolpyruvate carboxykinase enzymes.; GO: 0004611 phosphoenolpyruvate carboxykinase activity, 0005525 GTP binding, 0006094 gluconeogenesis; PDB: 2FAH_A 2FAF_A 2QZY_B 2ZCI_D 3MOE_A 3DT7_B 2RKD_A 2RKA_A 2RK8_A 2QF2_B ....
Probab=25.21 E-value=1.4e+02 Score=25.65 Aligned_cols=57 Identities=25% Similarity=0.472 Sum_probs=25.3
Q ss_pred eEECCCCcEEEEECCCCEEEEECCCCcEE-EEeccccccCCCCCcCCCceEEcCCCCEEEEe
Q psy4774 17 MKFDKNGALHVADAYFGLYKVNVTTGQTE-QLISMDTEIDGAKPQIPNSVTVDSDGMVYWSD 77 (120)
Q Consensus 17 l~~d~~G~l~V~d~~~gi~~vd~~~g~~~-~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TD 77 (120)
|.++.||+||..+..+|++=|-|.++..+ +.+-. .. ... ..-..|+..+||.+|+.-
T Consensus 281 mr~~~dG~lrAiNPE~GfFGVapGtn~~tnP~am~--~l-~~n-tIFTNVa~t~dG~vwWeG 338 (586)
T PF00821_consen 281 MRFGEDGRLRAINPENGFFGVAPGTNMKTNPNAMA--TL-GKN-TIFTNVALTDDGDVWWEG 338 (586)
T ss_dssp EEE-TTSBEEEE---SEEEEE-TT-STTT-HHHHH--HT-TBS-EEEES-EEETTS-EE-TT
T ss_pred eeEcCCCcEEEeecccceEEeccCCCCCCCHHHHH--Hh-hcc-ceEeeeeecCCCceeecc
Confidence 67788888888887788777765433210 01000 01 111 112357777788888643
No 251
>smart00706 TECPR Beta propeller repeats in Physarum polycephalum tectonins, Limulus lectin L-6 and animal hypothetical proteins.
Probab=25.16 E-value=74 Score=15.88 Aligned_cols=11 Identities=45% Similarity=0.734 Sum_probs=5.9
Q ss_pred eEEcCCCCEEE
Q psy4774 65 VTVDSDGMVYW 75 (120)
Q Consensus 65 l~v~~~G~iy~ 75 (120)
.+++.+|+||+
T Consensus 22 W~V~~~g~i~~ 32 (35)
T smart00706 22 WAVNSDGNIYR 32 (35)
T ss_pred EEEcCCCCEEE
Confidence 34445566665
No 252
>COG4189 Predicted transcriptional regulator [Transcription]
Probab=24.54 E-value=2.7e+02 Score=21.48 Aligned_cols=29 Identities=17% Similarity=0.425 Sum_probs=21.4
Q ss_pred cEEEEeccccccCCCCCcCCCceEEcCCC
Q psy4774 43 QTEQLISMDTEIDGAKPQIPNSVTVDSDG 71 (120)
Q Consensus 43 ~~~~~~~~~~~~~g~~~~~pndl~v~~~G 71 (120)
+++.-.+..++.++..-++|.|+.+.=+|
T Consensus 175 ~lE~smElSSE~P~Tn~nWPSDIs~~iNg 203 (308)
T COG4189 175 ELEFSMELSSEVPGTNDNWPSDISFYING 203 (308)
T ss_pred ceEEEEEecccCCCCCCCCCcceEEEECC
Confidence 34444555557778788999999999887
No 253
>KOG1963|consensus
Probab=24.53 E-value=1.8e+02 Score=25.96 Aligned_cols=55 Identities=22% Similarity=0.301 Sum_probs=33.7
Q ss_pred ccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEE
Q psy4774 14 PLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWS 76 (120)
Q Consensus 14 P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~T 76 (120)
-++++|..+|.-+.+....+ +++-..++++ ..++++ =|.++ -++++++|+++|-.
T Consensus 254 V~~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~-kqfLPR----Lgs~I---~~i~vS~ds~~~sl 309 (792)
T KOG1963|consen 254 VNSLSFSSDGAYLLSGGREGVLVLWQLETGK-KQFLPR----LGSPI---LHIVVSPDSDLYSL 309 (792)
T ss_pred cceeEEecCCceEeecccceEEEEEeecCCC-cccccc----cCCee---EEEEEcCCCCeEEE
Confidence 46788888884444544455 4444556666 444431 23343 48999999988844
No 254
>PHA03346 US22 family homolog; Provisional
Probab=24.31 E-value=55 Score=27.54 Aligned_cols=26 Identities=12% Similarity=0.184 Sum_probs=22.2
Q ss_pred ecCCceEEEEcCCC--ceEecCcccccC
Q psy4774 93 TSGSGSFIKIGPNT--QEVLQKQTDKLG 118 (120)
Q Consensus 93 ~~~~G~l~~~d~~~--~~~l~~~L~~~~ 118 (120)
-...|+||.||+.+ +-+|+.+|.+|.
T Consensus 120 mG~~GRVYvYD~~~d~LyLvA~dLdefA 147 (520)
T PHA03346 120 MGAETRLYIYEPSQDILILVAPHLDELA 147 (520)
T ss_pred EeCCCeEEEEecCCCEEEEEeCCHHHHH
Confidence 45789999999988 899999988774
No 255
>KOG0645|consensus
Probab=23.87 E-value=3.6e+02 Score=21.15 Aligned_cols=62 Identities=13% Similarity=0.191 Sum_probs=41.1
Q ss_pred cceEECCCCcEEEEECCCCEEEE-ECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCC
Q psy4774 15 LGMKFDKNGALHVADAYFGLYKV-NVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTK 81 (120)
Q Consensus 15 ~Gl~~d~~G~l~V~d~~~gi~~v-d~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~ 81 (120)
..+|+.|.|+++++-+......| ...+++.+.++. .+| .-+..-.++++++|++..|-|..+
T Consensus 65 RsvAwsp~g~~La~aSFD~t~~Iw~k~~~efecv~~----lEG-HEnEVK~Vaws~sG~~LATCSRDK 127 (312)
T KOG0645|consen 65 RSVAWSPHGRYLASASFDATVVIWKKEDGEFECVAT----LEG-HENEVKCVAWSASGNYLATCSRDK 127 (312)
T ss_pred eeeeecCCCcEEEEeeccceEEEeecCCCceeEEee----eec-cccceeEEEEcCCCCEEEEeeCCC
Confidence 46889999997766555554443 334567666653 344 224455899999999999988643
No 256
>KOG0973|consensus
Probab=23.48 E-value=1.9e+02 Score=26.32 Aligned_cols=65 Identities=12% Similarity=0.057 Sum_probs=38.0
Q ss_pred CCCCccceEECCCCcEEEEECCCC---EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEe
Q psy4774 10 QCGRPLGMKFDKNGALHVADAYFG---LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSD 77 (120)
Q Consensus 10 ~~g~P~Gl~~d~~G~l~V~d~~~g---i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TD 77 (120)
..+.+.|+.+||-|+-+.+....+ |++.+. -|-...+.....+..+.. +.-.+.-+|||....+=
T Consensus 170 H~s~VKGvs~DP~Gky~ASqsdDrtikvwrt~d-w~i~k~It~pf~~~~~~T--~f~RlSWSPDG~~las~ 237 (942)
T KOG0973|consen 170 HQSLVKGVSWDPIGKYFASQSDDRTLKVWRTSD-WGIEKSITKPFEESPLTT--FFLRLSWSPDGHHLASP 237 (942)
T ss_pred ccccccceEECCccCeeeeecCCceEEEEEccc-ceeeEeeccchhhCCCcc--eeeecccCCCcCeecch
Confidence 357899999999999887766655 444332 233334433222222222 33577788888765443
No 257
>PHA00691 hypothetical protein
Probab=23.33 E-value=79 Score=18.69 Aligned_cols=23 Identities=9% Similarity=0.075 Sum_probs=17.2
Q ss_pred ecCCceEEEEcCCC-ceEecCccc
Q psy4774 93 TSGSGSFIKIGPNT-QEVLQKQTD 115 (120)
Q Consensus 93 ~~~~G~l~~~d~~~-~~~l~~~L~ 115 (120)
...+|++|.+.+++ .++++.|+.
T Consensus 9 ~~ENGr~WVL~K~~~Y~V~vSG~T 32 (68)
T PHA00691 9 AYENGRVWVLKKSDSYTVFVSGVT 32 (68)
T ss_pred hccCCeEEEEEeCCcEEEEEeccc
Confidence 34678999998766 788877764
No 258
>PF14251 DUF4346: Domain of unknown function (DUF4346)
Probab=23.05 E-value=1.4e+02 Score=20.22 Aligned_cols=16 Identities=25% Similarity=0.347 Sum_probs=11.1
Q ss_pred ceEEcCC-CCEEEEeCC
Q psy4774 64 SVTVDSD-GMVYWSDSS 79 (120)
Q Consensus 64 dl~v~~~-G~iy~TDs~ 79 (120)
|+++||+ |.+.-+...
T Consensus 44 Gla~Dpetge~i~~~g~ 60 (119)
T PF14251_consen 44 GLAVDPETGEVIPCRGK 60 (119)
T ss_pred cceeCCCCCCEEEEecC
Confidence 6888885 667666654
No 259
>PRK05841 flgE flagellar hook protein FlgE; Validated
Probab=22.94 E-value=5e+02 Score=22.45 Aligned_cols=26 Identities=27% Similarity=0.387 Sum_probs=20.4
Q ss_pred ccCCCCCcCCCceEEcCCCCEEEEeC
Q psy4774 53 EIDGAKPQIPNSVTVDSDGMVYWSDS 78 (120)
Q Consensus 53 ~~~g~~~~~pndl~v~~~G~iy~TDs 78 (120)
..+|.+.....++.|+++|.|+.+-+
T Consensus 468 ~qdG~~~G~l~~v~I~~dGtI~~~~s 493 (603)
T PRK05841 468 SQDGKPRGILRDVRIDENGVISLAFS 493 (603)
T ss_pred ccCCccceeEEEEEECCCcEEEEEec
Confidence 34666666778999999999998744
No 260
>KOG0315|consensus
Probab=22.63 E-value=3.8e+02 Score=20.91 Aligned_cols=83 Identities=13% Similarity=0.035 Sum_probs=0.0
Q ss_pred cceEECCCC-cEEEEECCCCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCCCcccccceeccee
Q psy4774 15 LGMKFDKNG-ALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYKLYDGLFDGLT 93 (120)
Q Consensus 15 ~Gl~~d~~G-~l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~~~~~~~~~~~~~~ 93 (120)
+-+...|+. .|+++|....|...|.......-.. -.+...+...+++.+||.....-..
T Consensus 128 n~vvlhpnQteLis~dqsg~irvWDl~~~~c~~~l------iPe~~~~i~sl~v~~dgsml~a~nn-------------- 187 (311)
T KOG0315|consen 128 NTVVLHPNQTELISGDQSGNIRVWDLGENSCTHEL------IPEDDTSIQSLTVMPDGSMLAAANN-------------- 187 (311)
T ss_pred ceEEecCCcceEEeecCCCcEEEEEccCCcccccc------CCCCCcceeeEEEcCCCcEEEEecC--------------
Q ss_pred cCCceEEEEcCCC--ceEecCcccccCC
Q psy4774 94 SGSGSFIKIGPNT--QEVLQKQTDKLGA 119 (120)
Q Consensus 94 ~~~G~l~~~d~~~--~~~l~~~L~~~~~ 119 (120)
.|++|..+.-+ ...-+.=+.++-|
T Consensus 188 --kG~cyvW~l~~~~~~s~l~P~~k~~a 213 (311)
T KOG0315|consen 188 --KGNCYVWRLLNHQTASELEPVHKFQA 213 (311)
T ss_pred --CccEEEEEccCCCccccceEhhheec
No 261
>KOG0771|consensus
Probab=22.43 E-value=3.5e+02 Score=22.11 Aligned_cols=65 Identities=12% Similarity=0.067 Sum_probs=37.3
Q ss_pred CCccceEECCCCcEEEEECCCC--EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG--LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSST 80 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g--i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~~ 80 (120)
+.-..|.|.+||.+++.-.... ||.++ +|. .++........+.|..++-.+....-++|+..+..
T Consensus 187 ~eV~DL~FS~dgk~lasig~d~~~VW~~~--~g~--~~a~~t~~~k~~~~~~cRF~~d~~~~~l~laa~~~ 253 (398)
T KOG0771|consen 187 AEVKDLDFSPDGKFLASIGADSARVWSVN--TGA--ALARKTPFSKDEMFSSCRFSVDNAQETLRLAASQF 253 (398)
T ss_pred CccccceeCCCCcEEEEecCCceEEEEec--cCc--hhhhcCCcccchhhhhceecccCCCceEEEEEecC
Confidence 4456899999998888755434 55554 552 23321111223456666555554445777777753
No 262
>PF15232 DUF4585: Domain of unknown function (DUF4585)
Probab=22.18 E-value=2.1e+02 Score=17.76 Aligned_cols=32 Identities=25% Similarity=0.189 Sum_probs=22.0
Q ss_pred ceEECCC-CcEEEEECCCC--EE-EEECCCCcEEEE
Q psy4774 16 GMKFDKN-GALHVADAYFG--LY-KVNVTTGQTEQL 47 (120)
Q Consensus 16 Gl~~d~~-G~l~V~d~~~g--i~-~vd~~~g~~~~~ 47 (120)
=+-.|++ |+-|+.|.... +. -+|+|+|+...+
T Consensus 9 KvL~DP~SG~Yy~vd~P~Qp~~k~lfDPETGqYVeV 44 (75)
T PF15232_consen 9 KVLQDPESGQYYVVDAPVQPKTKTLFDPETGQYVEV 44 (75)
T ss_pred cEeecCCCCCEEEEecCCCcceeeeecCCCCcEEEE
Confidence 3567776 78999998754 22 269999985433
No 263
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=21.97 E-value=4.1e+02 Score=21.13 Aligned_cols=64 Identities=9% Similarity=0.144 Sum_probs=37.4
Q ss_pred ccceEECCCCc--EEEEECCCCEEEEECCCCcEEEEeccccccCCCC--CcCCCceEEcC-CCCEEEEeCC
Q psy4774 14 PLGMKFDKNGA--LHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAK--PQIPNSVTVDS-DGMVYWSDSS 79 (120)
Q Consensus 14 P~Gl~~d~~G~--l~V~d~~~gi~~vd~~~g~~~~~~~~~~~~~g~~--~~~pndl~v~~-~G~iy~TDs~ 79 (120)
|..+.-..+|+ +-..+-+.+|..+|.++++++.+... +++.+. -....+|.-|| +..|+++-.-
T Consensus 58 Pa~y~gk~~g~~~IdF~NKYSHVH~yd~e~~~VrLLWke--sih~~~~WaGEVSdIlYdP~~D~LLlAR~D 126 (339)
T PF09910_consen 58 PAVYEGKGDGRATIDFRNKYSHVHEYDTENDSVRLLWKE--SIHDKTKWAGEVSDILYDPYEDRLLLARAD 126 (339)
T ss_pred CceeeeccCCceEEEEeeccceEEEEEcCCCeEEEEEec--ccCCccccccchhheeeCCCcCEEEEEecC
Confidence 44444444442 33334456799999888999988753 333321 12245777777 4567776554
No 264
>PHA03338 US22 family homolog; Provisional
Probab=21.78 E-value=85 Score=24.68 Aligned_cols=25 Identities=8% Similarity=-0.001 Sum_probs=20.8
Q ss_pred ecCCceEEEEcCCC--ceEecCccccc
Q psy4774 93 TSGSGSFIKIGPNT--QEVLQKQTDKL 117 (120)
Q Consensus 93 ~~~~G~l~~~d~~~--~~~l~~~L~~~ 117 (120)
..++|+||.+|... -..|+|+|+.|
T Consensus 272 ~~~~GrV~A~D~~d~~YvrlaD~LpgF 298 (344)
T PHA03338 272 GLPSGRVFAGDACDGSYVLIADSIPGF 298 (344)
T ss_pred ecCCceEEEEEccCCeEEEeecCCCce
Confidence 46999999999776 77888988865
No 265
>PF12216 m04gp34like: Immune evasion protein; InterPro: IPR022022 The proteins in this family are related to the m04 encoded protein gp34 of pathogenic microorganisms such as Murid herpesvirus 1. m06 and m152 genes are expressed earlier in the intracellular replication phases of these microorganism' life cycles. They function to inhibit MHC-1 loading and export. gp34 is theorized to prevent immune reactions from NK cells which would ordinarily recognise and attack cells lacking MHC.
Probab=21.77 E-value=84 Score=24.20 Aligned_cols=21 Identities=29% Similarity=0.463 Sum_probs=17.4
Q ss_pred CCCccceEECCC-CcEEEEECC
Q psy4774 11 CGRPLGMKFDKN-GALHVADAY 31 (120)
Q Consensus 11 ~g~P~Gl~~d~~-G~l~V~d~~ 31 (120)
.+.+.|+.+|++ |+|||....
T Consensus 134 ~~~~~Gf~Vd~~sG~L~i~sna 155 (272)
T PF12216_consen 134 VGNKDGFKVDPSSGNLYISSNA 155 (272)
T ss_pred cCCCCCeEEcCCCceEEEecCc
Confidence 367899999999 899998543
No 266
>COG4004 Uncharacterized protein conserved in archaea [Function unknown]
Probab=21.68 E-value=2.4e+02 Score=18.29 Aligned_cols=28 Identities=14% Similarity=0.075 Sum_probs=17.3
Q ss_pred ceEECCCCcEEEEECCCC-EEEEECCCCc
Q psy4774 16 GMKFDKNGALHVADAYFG-LYKVNVTTGQ 43 (120)
Q Consensus 16 Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~ 43 (120)
|..|..+|..+|+..+.. .+.+.+++++
T Consensus 25 g~~v~~eGD~ivas~pgis~ieik~E~kk 53 (96)
T COG4004 25 GWTVSEEGDRIVASSPGISRIEIKPENKK 53 (96)
T ss_pred CeeEeecccEEEEecCCceEEEEecccce
Confidence 566777788777765432 4555566554
No 267
>KOG0271|consensus
Probab=21.57 E-value=4.7e+02 Score=21.63 Aligned_cols=61 Identities=20% Similarity=0.174 Sum_probs=34.8
Q ss_pred CCccceEECCCCcEEEEECCCCEEE-EECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEeCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFGLYK-VNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSS 79 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~gi~~-vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TDs~ 79 (120)
+--...+|.|+|.-+|+..+..-+| .|+.+- +++.. ..| --++.--|+.+|||....|-+.
T Consensus 116 e~Vl~~~fsp~g~~l~tGsGD~TvR~WD~~Te--Tp~~t----~Kg-H~~WVlcvawsPDgk~iASG~~ 177 (480)
T KOG0271|consen 116 EAVLSVQFSPTGSRLVTGSGDTTVRLWDLDTE--TPLFT----CKG-HKNWVLCVAWSPDGKKIASGSK 177 (480)
T ss_pred CcEEEEEecCCCceEEecCCCceEEeeccCCC--Cccee----ecC-CccEEEEEEECCCcchhhcccc
Confidence 3345788999998888876644333 365432 22221 122 1244556777777766666554
No 268
>PHA02790 Kelch-like protein; Provisional
Probab=21.51 E-value=4.5e+02 Score=21.41 Aligned_cols=50 Identities=12% Similarity=0.048 Sum_probs=29.5
Q ss_pred CCCcEEEEECC---CCEEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCCCCEEEEe
Q psy4774 21 KNGALHVADAY---FGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSD 77 (120)
Q Consensus 21 ~~G~l~V~d~~---~gi~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~TD 77 (120)
-+|.|||.-.. ..+.++++.+++++.+.+. . .+.. +--++.-+|.||+.-
T Consensus 317 ~~~~iYviGG~~~~~sve~ydp~~n~W~~~~~l----~-~~r~--~~~~~~~~g~IYviG 369 (480)
T PHA02790 317 ANNKLYVVGGLPNPTSVERWFHGDAAWVNMPSL----L-KPRC--NPAVASINNVIYVIG 369 (480)
T ss_pred ECCEEEEECCcCCCCceEEEECCCCeEEECCCC----C-CCCc--ccEEEEECCEEEEec
Confidence 36899998543 3388889887777766532 1 1111 222333468888863
No 269
>PHA03219 nuclear protein UL24; Provisional
Probab=20.95 E-value=74 Score=24.89 Aligned_cols=26 Identities=8% Similarity=-0.001 Sum_probs=21.1
Q ss_pred ecCCceEEEEcCCC--ceEecCcccccC
Q psy4774 93 TSGSGSFIKIGPNT--QEVLQKQTDKLG 118 (120)
Q Consensus 93 ~~~~G~l~~~d~~~--~~~l~~~L~~~~ 118 (120)
-...|++|.|++.+ +-.|++++.+|.
T Consensus 94 lG~~GrVY~Y~~~dDaLY~vA~s~~eFa 121 (300)
T PHA03219 94 VSEEGYVFCYVREDFAVYYVARNLMEFA 121 (300)
T ss_pred EcCCCeEEEEecCCCEEEEeecCHHHHH
Confidence 45789999998766 999999887763
No 270
>KOG3914|consensus
Probab=20.92 E-value=2e+02 Score=23.36 Aligned_cols=56 Identities=21% Similarity=0.373 Sum_probs=28.4
Q ss_pred CCccceEECCCC-cEEEEECCCCEEEEECC---CCcEEEEeccccccCCCCCcCCCceEEcCCCCEEE
Q psy4774 12 GRPLGMKFDKNG-ALHVADAYFGLYKVNVT---TGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYW 75 (120)
Q Consensus 12 g~P~Gl~~d~~G-~l~V~d~~~gi~~vd~~---~g~~~~~~~~~~~~~g~~~~~pndl~v~~~G~iy~ 75 (120)
-+|..|.+..+. ...|+|...-++++|.- .+..+.+++ -+...-||++++|+...+
T Consensus 108 ~~~~ai~~~~~~~sv~v~dkagD~~~~di~s~~~~~~~~~lG--------hvSml~dVavS~D~~~Ii 167 (390)
T KOG3914|consen 108 KRPTAISFIREDTSVLVADKAGDVYSFDILSADSGRCEPILG--------HVSMLLDVAVSPDDQFII 167 (390)
T ss_pred cCcceeeeeeccceEEEEeecCCceeeeeecccccCcchhhh--------hhhhhheeeecCCCCEEE
Confidence 457777777665 66666655444444321 133333322 122345777776664333
No 271
>KOG0279|consensus
Probab=20.83 E-value=3.2e+02 Score=21.45 Aligned_cols=51 Identities=20% Similarity=0.278 Sum_probs=30.7
Q ss_pred CCccceEECCCCcEEEEECCCC-EEEEECCCCcEEEEeccccccCCCCCcCCCceEEcCC
Q psy4774 12 GRPLGMKFDKNGALHVADAYFG-LYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSD 70 (120)
Q Consensus 12 g~P~Gl~~d~~G~l~V~d~~~g-i~~vd~~~g~~~~~~~~~~~~~g~~~~~pndl~v~~~ 70 (120)
+.-+-+++.|||.|.++-...| ++-.|...++. +.. .+ .+...|.++|+|+
T Consensus 193 ~~v~t~~vSpDGslcasGgkdg~~~LwdL~~~k~--lys----l~--a~~~v~sl~fspn 244 (315)
T KOG0279|consen 193 GYVNTVTVSPDGSLCASGGKDGEAMLWDLNEGKN--LYS----LE--AFDIVNSLCFSPN 244 (315)
T ss_pred ccEEEEEECCCCCEEecCCCCceEEEEEccCCce--eEe----cc--CCCeEeeEEecCC
Confidence 3455689999999998755455 55566555543 211 01 2334567777775
No 272
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=20.77 E-value=1.3e+02 Score=23.51 Aligned_cols=28 Identities=25% Similarity=0.392 Sum_probs=22.5
Q ss_pred CCccceEECCC-CcEEEEECCCCEEEEEC
Q psy4774 12 GRPLGMKFDKN-GALHVADAYFGLYKVNV 39 (120)
Q Consensus 12 g~P~Gl~~d~~-G~l~V~d~~~gi~~vd~ 39 (120)
.+..|+..|.+ |.|||+....+||++..
T Consensus 205 tQTEG~VaDdEtG~LYIaeEdvaiWK~~A 233 (364)
T COG4247 205 TQTEGMVADDETGFLYIAEEDVAIWKYEA 233 (364)
T ss_pred CcccceeeccccceEEEeeccceeeeccc
Confidence 35668888766 89999988888999853
No 273
>PF10584 Proteasome_A_N: Proteasome subunit A N-terminal signature; InterPro: IPR000426 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis on sequence similarities into two groups, alpha (A) and beta (B). This family contains the alpha subunit sequences which range from 210 to 290 amino acids. These sequences are classified as non-peptidase homologues in MEROPS peptidase family T1 (clan PB(T)). ; GO: 0004175 endopeptidase activity, 0006511 ubiquitin-dependent protein catabolic process, 0019773 proteasome core complex, alpha-subunit complex; PDB: 3H4P_M 1IRU_O 3UN4_U 1FNT_A 3OEV_G 3OEU_U 3SDK_U 3DY3_G 3MG7_G 3L5Q_C ....
Probab=20.74 E-value=22 Score=17.02 Aligned_cols=8 Identities=25% Similarity=0.729 Sum_probs=4.2
Q ss_pred EcCCCCEE
Q psy4774 67 VDSDGMVY 74 (120)
Q Consensus 67 v~~~G~iy 74 (120)
++|+|+||
T Consensus 8 FSp~Grl~ 15 (23)
T PF10584_consen 8 FSPDGRLF 15 (23)
T ss_dssp BBTTSSBH
T ss_pred ECCCCeEE
Confidence 45555553
Done!