Query psy14331
Match_columns 265
No_of_seqs 157 out of 1436
Neff 10.5
Searched_HMMs 46136
Date Fri Aug 16 17:07:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14331.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14331hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1230|consensus 100.0 1.4E-36 3E-41 243.2 21.4 226 26-258 44-276 (521)
2 KOG4441|consensus 100.0 1.2E-36 2.6E-41 266.4 18.1 218 10-257 334-555 (571)
3 PLN02153 epithiospecifier prot 100.0 1.5E-35 3.2E-40 247.6 22.6 226 23-263 48-290 (341)
4 KOG4441|consensus 100.0 6.3E-36 1.4E-40 261.9 18.5 222 13-263 288-514 (571)
5 PLN02193 nitrile-specifier pro 100.0 2.7E-34 5.9E-39 248.5 22.3 215 23-263 191-416 (470)
6 PHA02713 hypothetical protein; 100.0 1.5E-34 3.3E-39 254.1 18.1 206 23-255 318-540 (557)
7 PHA02713 hypothetical protein; 100.0 4.3E-34 9.3E-39 251.2 20.5 212 25-262 272-503 (557)
8 PHA03098 kelch-like protein; P 100.0 2.7E-33 5.9E-38 247.4 20.8 206 22-263 308-517 (534)
9 PHA02790 Kelch-like protein; P 100.0 4.6E-33 1E-37 241.4 19.0 188 22-262 284-475 (480)
10 TIGR03548 mutarot_permut cycli 100.0 8.8E-33 1.9E-37 229.4 18.8 212 25-263 39-285 (323)
11 PLN02153 epithiospecifier prot 100.0 1.7E-31 3.8E-36 223.1 25.0 212 32-263 4-231 (341)
12 PLN02193 nitrile-specifier pro 100.0 3.4E-31 7.4E-36 229.2 21.3 217 12-258 231-469 (470)
13 TIGR03548 mutarot_permut cycli 100.0 2.2E-30 4.9E-35 215.0 17.0 193 21-237 84-316 (323)
14 KOG1230|consensus 100.0 2.4E-30 5.2E-35 207.5 15.3 233 23-264 96-347 (521)
15 KOG4693|consensus 100.0 2.5E-30 5.3E-35 197.0 12.9 213 26-263 45-282 (392)
16 PHA03098 kelch-like protein; P 100.0 3.9E-29 8.6E-34 220.9 20.6 202 26-263 265-470 (534)
17 KOG4693|consensus 100.0 5.7E-30 1.2E-34 195.0 12.2 206 15-235 95-313 (392)
18 KOG4152|consensus 100.0 1.4E-28 3.1E-33 202.3 16.7 240 8-259 91-365 (830)
19 KOG0379|consensus 100.0 1.7E-28 3.8E-33 212.0 17.2 209 26-263 89-307 (482)
20 TIGR03547 muta_rot_YjhT mutatr 100.0 4.9E-28 1.1E-32 202.9 18.5 212 23-255 83-344 (346)
21 PRK14131 N-acetylneuraminic ac 100.0 5.4E-28 1.2E-32 204.3 18.4 217 23-262 104-373 (376)
22 KOG0379|consensus 100.0 8.9E-28 1.9E-32 207.6 20.0 191 42-262 55-254 (482)
23 TIGR03547 muta_rot_YjhT mutatr 100.0 1.6E-27 3.6E-32 199.8 20.5 171 42-235 2-211 (346)
24 PRK14131 N-acetylneuraminic ac 99.9 3.6E-26 7.7E-31 193.3 20.8 177 36-236 18-233 (376)
25 PHA02790 Kelch-like protein; P 99.9 6E-26 1.3E-30 197.1 20.7 164 58-262 270-437 (480)
26 KOG4152|consensus 99.9 6.1E-23 1.3E-27 169.4 17.3 221 23-262 55-307 (830)
27 KOG2437|consensus 99.7 5E-17 1.1E-21 134.4 5.3 195 31-236 235-459 (723)
28 COG3055 Uncharacterized protei 99.5 4E-13 8.7E-18 107.4 13.7 174 41-237 30-241 (381)
29 COG3055 Uncharacterized protei 99.5 1.1E-12 2.3E-17 105.0 14.2 202 21-237 109-362 (381)
30 KOG2437|consensus 99.4 1.3E-12 2.8E-17 108.7 7.3 151 98-262 257-417 (723)
31 PF13964 Kelch_6: Kelch motif 99.3 1.1E-11 2.3E-16 73.3 6.3 50 159-214 1-50 (50)
32 PF13964 Kelch_6: Kelch motif 99.2 2.9E-11 6.4E-16 71.4 5.9 50 101-160 1-50 (50)
33 PLN02772 guanylate kinase 99.1 1E-09 2.2E-14 91.2 10.4 90 157-260 22-111 (398)
34 PF01344 Kelch_1: Kelch motif; 99.1 3E-10 6.4E-15 66.2 4.8 46 101-154 1-46 (47)
35 PF13415 Kelch_3: Galactose ox 99.1 5.9E-10 1.3E-14 65.4 6.0 48 169-221 1-48 (49)
36 PF07646 Kelch_2: Kelch motif; 99.1 6.3E-10 1.4E-14 65.3 6.1 48 159-207 1-48 (49)
37 PF01344 Kelch_1: Kelch motif; 99.0 3.6E-10 7.8E-15 65.8 4.7 46 159-207 1-46 (47)
38 PF13418 Kelch_4: Galactose ox 99.0 6.2E-10 1.3E-14 65.4 4.8 47 101-154 1-47 (49)
39 PF13415 Kelch_3: Galactose ox 99.0 1.5E-09 3.2E-14 63.7 5.8 49 114-168 1-49 (49)
40 PF13418 Kelch_4: Galactose ox 99.0 1.1E-09 2.4E-14 64.3 4.7 46 159-207 1-47 (49)
41 PF07646 Kelch_2: Kelch motif; 99.0 2.1E-09 4.6E-14 63.0 5.8 48 101-154 1-48 (49)
42 PLN02772 guanylate kinase 98.9 1.6E-08 3.4E-13 84.2 10.7 87 99-199 22-110 (398)
43 PF13854 Kelch_5: Kelch motif 98.8 1.5E-08 3.3E-13 57.1 5.5 41 157-199 2-42 (42)
44 smart00612 Kelch Kelch domain. 98.7 2E-08 4.4E-13 58.2 4.3 47 116-170 1-47 (47)
45 PF13854 Kelch_5: Kelch motif 98.7 5.2E-08 1.1E-12 54.9 5.1 41 98-146 1-42 (42)
46 smart00612 Kelch Kelch domain. 98.6 7.3E-08 1.6E-12 55.8 4.4 45 171-222 1-45 (47)
47 PF03089 RAG2: Recombination a 98.6 1.6E-05 3.5E-10 62.5 18.1 165 61-238 40-233 (337)
48 PF07250 Glyoxal_oxid_N: Glyox 98.0 0.00025 5.4E-09 55.9 13.2 103 114-234 77-190 (243)
49 PF03089 RAG2: Recombination a 97.9 0.00068 1.5E-08 53.6 13.0 137 40-183 80-233 (337)
50 PF07250 Glyoxal_oxid_N: Glyox 97.8 0.00014 3E-09 57.4 7.8 88 138-236 48-140 (243)
51 TIGR01640 F_box_assoc_1 F-box 97.3 0.048 1.1E-06 43.0 16.6 109 135-263 69-183 (230)
52 TIGR01640 F_box_assoc_1 F-box 97.0 0.12 2.7E-06 40.7 20.1 180 26-232 15-215 (230)
53 PF07893 DUF1668: Protein of u 96.8 0.089 1.9E-06 44.3 14.3 120 113-263 75-213 (342)
54 PF07893 DUF1668: Protein of u 96.4 0.45 9.7E-06 40.1 16.2 119 58-206 75-216 (342)
55 PF12768 Rax2: Cortical protei 96.3 0.072 1.6E-06 43.3 10.7 106 119-233 2-110 (281)
56 KOG2055|consensus 95.9 0.38 8.2E-06 41.1 12.9 145 53-234 219-366 (514)
57 PRK11138 outer membrane biogen 95.4 1.6 3.4E-05 37.5 16.9 104 58-206 68-189 (394)
58 PRK11138 outer membrane biogen 95.4 1.6 3.5E-05 37.5 18.4 133 26-203 80-231 (394)
59 cd00094 HX Hemopexin-like repe 95.1 1.1 2.4E-05 34.4 16.3 93 114-233 62-167 (194)
60 PF12217 End_beta_propel: Cata 94.2 2.3 4.9E-05 33.9 12.3 114 114-235 200-335 (367)
61 KOG2055|consensus 94.2 1.3 2.9E-05 37.9 11.6 137 53-231 263-405 (514)
62 PF13360 PQQ_2: PQQ-like domai 93.8 2.6 5.6E-05 33.0 18.9 130 58-234 35-183 (238)
63 PF12768 Rax2: Cortical protei 93.2 3.9 8.5E-05 33.4 13.3 108 83-206 15-130 (281)
64 cd00094 HX Hemopexin-like repe 92.6 3.7 8.1E-05 31.4 12.5 61 114-200 110-178 (194)
65 PF05096 Glu_cyclase_2: Glutam 92.2 5.3 0.00011 32.2 11.9 95 114-234 55-149 (264)
66 PRK13684 Ycf48-like protein; P 91.5 7.5 0.00016 32.6 14.7 80 137-232 153-233 (334)
67 PF02191 OLF: Olfactomedin-lik 91.4 6.4 0.00014 31.6 13.3 130 83-230 88-236 (250)
68 PRK04792 tolB translocation pr 91.0 7.6 0.00017 34.1 12.8 144 26-206 243-391 (448)
69 TIGR03075 PQQ_enz_alc_DH PQQ-d 91.0 11 0.00025 33.8 14.2 114 58-205 68-199 (527)
70 PF10282 Lactonase: Lactonase, 89.5 6.6 0.00014 33.1 10.8 143 27-206 168-333 (345)
71 PRK04792 tolB translocation pr 89.1 15 0.00033 32.3 16.2 61 136-206 286-347 (448)
72 PF08268 FBA_3: F-box associat 88.7 6.6 0.00014 27.7 9.2 83 167-263 3-86 (129)
73 KOG0310|consensus 88.1 17 0.00036 31.6 12.4 105 58-202 78-190 (487)
74 TIGR03300 assembly_YfgL outer 86.7 19 0.00041 30.6 16.9 101 58-206 64-174 (377)
75 PF08268 FBA_3: F-box associat 86.5 9.2 0.0002 27.0 11.6 83 114-206 5-89 (129)
76 PF10282 Lactonase: Lactonase, 86.2 19 0.00041 30.3 17.4 125 47-206 143-286 (345)
77 TIGR03300 assembly_YfgL outer 85.5 22 0.00047 30.2 15.8 61 114-200 279-341 (377)
78 TIGR02800 propeller_TolB tol-p 85.3 23 0.0005 30.4 13.0 61 136-206 302-363 (417)
79 TIGR03866 PQQ_ABC_repeats PQQ- 83.5 21 0.00046 28.6 16.6 38 27-67 55-92 (300)
80 PRK00178 tolB translocation pr 82.6 32 0.00069 29.9 16.7 61 136-206 267-328 (430)
81 PRK11028 6-phosphogluconolacto 81.1 30 0.00066 28.7 15.4 69 109-198 41-111 (330)
82 PF05096 Glu_cyclase_2: Glutam 81.0 24 0.00053 28.5 9.4 102 58-199 54-159 (264)
83 PRK11028 6-phosphogluconolacto 80.2 33 0.00071 28.5 17.3 93 27-146 59-158 (330)
84 cd00216 PQQ_DH Dehydrogenases 78.9 47 0.001 29.6 14.6 138 33-205 39-193 (488)
85 KOG2048|consensus 78.8 53 0.0011 30.1 13.1 94 92-206 418-514 (691)
86 PF13360 PQQ_2: PQQ-like domai 78.6 29 0.00063 27.0 17.1 66 114-206 36-106 (238)
87 PRK03629 tolB translocation pr 77.2 49 0.0011 28.9 14.8 61 136-206 311-372 (429)
88 PRK00178 tolB translocation pr 76.8 49 0.0011 28.7 14.7 144 26-206 224-372 (430)
89 KOG2321|consensus 76.5 58 0.0013 29.4 14.3 52 135-199 154-207 (703)
90 TIGR03866 PQQ_ABC_repeats PQQ- 76.3 38 0.00082 27.1 14.6 64 113-200 41-106 (300)
91 PF03178 CPSF_A: CPSF A subuni 75.9 44 0.00096 27.7 10.3 77 136-229 62-144 (321)
92 PF03178 CPSF_A: CPSF A subuni 75.2 46 0.001 27.6 11.6 93 113-231 97-190 (321)
93 TIGR03075 PQQ_enz_alc_DH PQQ-d 72.0 77 0.0017 28.7 13.5 70 114-205 69-148 (527)
94 PF14870 PSII_BNR: Photosynthe 71.7 57 0.0012 27.1 13.9 152 33-231 45-204 (302)
95 PLN00033 photosystem II stabil 71.4 68 0.0015 27.8 17.6 71 147-234 271-348 (398)
96 PF14870 PSII_BNR: Photosynthe 70.4 61 0.0013 26.9 15.3 152 33-231 4-162 (302)
97 PRK04922 tolB translocation pr 70.3 74 0.0016 27.8 15.2 60 136-206 316-377 (433)
98 PRK05137 tolB translocation pr 69.7 76 0.0016 27.7 15.8 144 26-206 227-375 (435)
99 PRK04922 tolB translocation pr 68.6 80 0.0017 27.5 16.2 61 136-206 272-333 (433)
100 PF02239 Cytochrom_D1: Cytochr 68.2 77 0.0017 27.1 10.8 91 27-147 18-111 (369)
101 PF02191 OLF: Olfactomedin-lik 65.8 69 0.0015 25.7 13.5 113 97-234 64-190 (250)
102 PRK03629 tolB translocation pr 64.2 99 0.0021 27.0 17.0 62 136-206 267-328 (429)
103 TIGR02800 propeller_TolB tol-p 63.6 96 0.0021 26.6 16.0 61 136-206 258-319 (417)
104 PF13088 BNR_2: BNR repeat-lik 61.1 35 0.00075 27.4 6.5 126 92-229 147-275 (275)
105 COG2706 3-carboxymuconate cycl 60.6 1E+02 0.0022 26.0 13.5 150 27-207 169-333 (346)
106 PF08450 SGL: SMP-30/Gluconola 60.6 81 0.0018 24.8 13.8 105 83-206 21-131 (246)
107 KOG1332|consensus 59.9 90 0.0019 25.1 10.2 55 142-206 241-296 (299)
108 TIGR03074 PQQ_membr_DH membran 58.4 1.7E+02 0.0038 27.9 18.5 39 165-204 312-353 (764)
109 smart00284 OLF Olfactomedin-li 55.4 1.1E+02 0.0024 24.7 16.1 158 43-230 69-241 (255)
110 PRK13684 Ycf48-like protein; P 54.0 1.3E+02 0.0029 25.2 15.8 131 33-206 33-169 (334)
111 COG2706 3-carboxymuconate cycl 52.9 1.4E+02 0.003 25.2 16.0 70 135-211 166-237 (346)
112 PF02239 Cytochrom_D1: Cytochr 51.7 1.5E+02 0.0033 25.3 9.3 155 27-218 60-223 (369)
113 PF15525 DUF4652: Domain of un 49.0 1.2E+02 0.0026 23.2 11.9 70 134-207 86-158 (200)
114 COG4257 Vgb Streptogramin lyas 48.4 42 0.00092 27.4 4.7 69 26-110 255-324 (353)
115 PRK04043 tolB translocation pr 48.4 1.9E+02 0.0041 25.3 16.0 62 135-206 256-318 (419)
116 PRK02889 tolB translocation pr 45.6 2.1E+02 0.0045 25.0 15.9 61 136-206 308-369 (427)
117 PRK04043 tolB translocation pr 44.4 2.2E+02 0.0047 24.9 17.9 149 26-207 214-367 (419)
118 PRK02889 tolB translocation pr 43.3 2.2E+02 0.0049 24.7 16.0 61 136-206 264-325 (427)
119 COG4257 Vgb Streptogramin lyas 42.1 2E+02 0.0043 23.8 13.0 61 135-206 253-314 (353)
120 KOG2321|consensus 41.1 2.8E+02 0.0062 25.3 10.2 76 44-146 130-207 (703)
121 PF07734 FBA_1: F-box associat 40.6 1.5E+02 0.0032 21.8 8.7 81 166-264 2-90 (164)
122 PRK01029 tolB translocation pr 38.9 2.7E+02 0.0058 24.4 12.4 61 136-206 351-412 (428)
123 PRK05137 tolB translocation pr 38.9 2.6E+02 0.0057 24.3 17.3 61 136-206 270-331 (435)
124 PF09910 DUF2139: Uncharacteri 38.8 2.3E+02 0.005 23.6 16.1 102 81-201 75-185 (339)
125 PF14298 DUF4374: Domain of un 37.7 2.9E+02 0.0062 24.4 13.0 67 188-265 366-432 (435)
126 PF13570 PQQ_3: PQQ-like domai 36.7 69 0.0015 16.9 4.6 25 164-198 16-40 (40)
127 PF13540 RCC1_2: Regulator of 36.5 59 0.0013 16.2 2.6 18 215-232 8-25 (30)
128 PF12217 End_beta_propel: Cata 35.0 2.5E+02 0.0054 22.9 10.1 65 162-235 193-260 (367)
129 KOG0278|consensus 34.8 2.5E+02 0.0054 22.8 12.8 83 138-235 207-289 (334)
130 KOG2445|consensus 33.5 2.4E+02 0.0053 23.5 6.9 61 170-232 184-257 (361)
131 KOG0318|consensus 31.7 3.6E+02 0.0079 24.3 8.0 76 169-256 454-530 (603)
132 cd00200 WD40 WD40 domain, foun 31.3 2.4E+02 0.0052 21.5 16.9 23 114-147 104-126 (289)
133 PF06739 SBBP: Beta-propeller 31.0 90 0.002 16.6 3.0 30 37-67 2-31 (38)
134 cd00200 WD40 WD40 domain, foun 30.8 2.4E+02 0.0053 21.5 16.2 83 136-235 199-283 (289)
135 KOG0308|consensus 30.7 2.3E+02 0.005 26.2 6.8 78 102-199 117-203 (735)
136 PF14830 Haemocyan_bet_s: Haem 30.3 44 0.00095 22.7 1.9 25 114-143 36-60 (103)
137 PF05262 Borrelia_P83: Borreli 30.3 4.1E+02 0.0088 23.9 8.5 65 134-206 373-437 (489)
138 KOG0649|consensus 30.2 3E+02 0.0064 22.3 13.6 139 52-234 119-267 (325)
139 KOG0278|consensus 30.0 3E+02 0.0066 22.3 8.3 55 114-181 235-290 (334)
140 smart00284 OLF Olfactomedin-li 29.9 3E+02 0.0065 22.3 16.8 141 59-231 34-192 (255)
141 PLN00033 photosystem II stabil 29.7 3.8E+02 0.0082 23.3 16.6 130 33-204 73-214 (398)
142 PF14781 BBS2_N: Ciliary BBSom 29.5 2.2E+02 0.0047 20.5 5.6 52 83-146 72-127 (136)
143 PF14583 Pectate_lyase22: Olig 29.2 3.5E+02 0.0076 23.4 7.5 84 103-207 37-120 (386)
144 smart00155 PLDc Phospholipase 27.7 84 0.0018 15.2 2.4 18 217-234 6-23 (28)
145 KOG0310|consensus 27.6 4.4E+02 0.0095 23.4 13.2 52 169-234 249-300 (487)
146 KOG4649|consensus 27.6 3.5E+02 0.0075 22.2 13.0 88 84-202 33-130 (354)
147 PTZ00421 coronin; Provisional 26.8 4.7E+02 0.01 23.5 15.4 23 114-147 179-201 (493)
148 PF15525 DUF4652: Domain of un 26.7 3E+02 0.0064 21.2 7.6 19 188-206 87-105 (200)
149 PF08450 SGL: SMP-30/Gluconola 25.3 3.3E+02 0.0072 21.3 14.4 92 110-230 7-102 (246)
150 KOG3545|consensus 25.2 3.6E+02 0.0079 21.7 6.9 149 4-175 67-235 (249)
151 TIGR03074 PQQ_membr_DH membran 24.5 6.4E+02 0.014 24.3 13.3 27 114-152 194-222 (764)
152 PLN02919 haloacid dehalogenase 24.5 7.3E+02 0.016 24.9 20.0 106 108-233 745-878 (1057)
153 PRK01742 tolB translocation pr 23.3 5E+02 0.011 22.6 14.3 180 5-235 213-393 (429)
154 PF13859 BNR_3: BNR repeat-lik 22.7 3.1E+02 0.0068 22.9 6.1 43 190-234 150-193 (310)
155 KOG0289|consensus 21.6 5.7E+02 0.012 22.6 12.9 99 85-207 370-472 (506)
156 KOG3881|consensus 20.9 5.5E+02 0.012 22.2 9.8 53 135-198 268-321 (412)
157 KOG0289|consensus 20.9 5.9E+02 0.013 22.5 9.9 59 85-155 412-473 (506)
158 cd00216 PQQ_DH Dehydrogenases 20.2 6.3E+02 0.014 22.5 13.7 71 114-206 61-139 (488)
159 KOG1538|consensus 20.1 2.7E+02 0.0058 26.1 5.3 60 47-123 12-73 (1081)
No 1
>KOG1230|consensus
Probab=100.00 E-value=1.4e-36 Score=243.22 Aligned_cols=226 Identities=55% Similarity=1.029 Sum_probs=203.1
Q ss_pred HHHHhhcccc-eEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecC--CCC
Q psy14331 26 DIEAEEKRKN-KVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSA--PGA 98 (265)
Q Consensus 26 ~~~~~d~~~~-~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~--~~~ 98 (265)
.+..++.... .-....+++|.||...++++.|..+.|++|||..+++....+ .+++++||. |... ++.
T Consensus 44 ~iq~~eaK~~e~~~e~~~~~PspRsn~sl~~nPekeELilfGGEf~ngqkT~v-----YndLy~Yn~k~~eWkk~~spn~ 118 (521)
T KOG1230|consen 44 IIQSLEAKQIEHVVETSVPPPSPRSNPSLFANPEKEELILFGGEFYNGQKTHV-----YNDLYSYNTKKNEWKKVVSPNA 118 (521)
T ss_pred HHHhhhhhccceeeeccCCCCCCCCCcceeeccCcceeEEecceeecceeEEE-----eeeeeEEeccccceeEeccCCC
Confidence 3445555442 223335689999999999999999999999999999999988 999999999 7654 667
Q ss_pred CCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCc
Q psy14331 99 PPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGF 178 (265)
Q Consensus 99 p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~ 178 (265)
|.||++|++++++ .+.+|++||........+|.+..++|+||..+++|+++...+.|.+|.+|.|++...+|+||||+
T Consensus 119 P~pRsshq~va~~--s~~l~~fGGEfaSPnq~qF~HYkD~W~fd~~trkweql~~~g~PS~RSGHRMvawK~~lilFGGF 196 (521)
T KOG1230|consen 119 PPPRSSHQAVAVP--SNILWLFGGEFASPNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSPRSGHRMVAWKRQLILFGGF 196 (521)
T ss_pred cCCCccceeEEec--cCeEEEeccccCCcchhhhhhhhheeeeeeccchheeeccCCCCCCCccceeEEeeeeEEEEcce
Confidence 8999999999986 68999999999888888899999999999999999999988889999999999999999999999
Q ss_pred cCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecCc
Q psy14331 179 HDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPDS 258 (265)
Q Consensus 179 ~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~~ 258 (265)
......+.|+|+||+||+++-+|.++.+.+-.|.+|++|.+.+..++.|||.||++......+++.....+|+|.++++.
T Consensus 197 hd~nr~y~YyNDvy~FdLdtykW~Klepsga~PtpRSGcq~~vtpqg~i~vyGGYsK~~~kK~~dKG~~hsDmf~L~p~~ 276 (521)
T KOG1230|consen 197 HDSNRDYIYYNDVYAFDLDTYKWSKLEPSGAGPTPRSGCQFSVTPQGGIVVYGGYSKQRVKKDVDKGTRHSDMFLLKPED 276 (521)
T ss_pred ecCCCceEEeeeeEEEeccceeeeeccCCCCCCCCCCcceEEecCCCcEEEEcchhHhhhhhhhhcCceeeeeeeecCCc
Confidence 98888889999999999999999999998778999999999888899999999999877788899999999999999988
No 2
>KOG4441|consensus
Probab=100.00 E-value=1.2e-36 Score=266.45 Aligned_cols=218 Identities=23% Similarity=0.387 Sum_probs=191.3
Q ss_pred eeeeecccccHHHHHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEE
Q psy14331 10 MSWTIMFNEDIEKIVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILM 89 (265)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~y 89 (265)
.+|..+|.++....++.++.||+.+++|.. .++|+.+|..++++++ ++.||++||. ++... ++++++|
T Consensus 334 ~lYv~GG~~~~~~~l~~ve~YD~~~~~W~~-~a~M~~~R~~~~v~~l--~g~iYavGG~--dg~~~-------l~svE~Y 401 (571)
T KOG4441|consen 334 KLYVVGGYDSGSDRLSSVERYDPRTNQWTP-VAPMNTKRSDFGVAVL--DGKLYAVGGF--DGEKS-------LNSVECY 401 (571)
T ss_pred EEEEEccccCCCcccceEEEecCCCCceec-cCCccCccccceeEEE--CCEEEEEecc--ccccc-------cccEEEe
Confidence 344444444256788999999999999999 9999999999999997 9999999995 44433 7999999
Q ss_pred Ee----eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceE
Q psy14331 90 NL----FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRM 165 (265)
Q Consensus 90 d~----w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~ 165 (265)
|+ |...++|+.+|++|+++++ +++||++||...... .++++++|||.+++|+.+++|. .+|.++++
T Consensus 402 Dp~~~~W~~va~m~~~r~~~gv~~~---~g~iYi~GG~~~~~~-----~l~sve~YDP~t~~W~~~~~M~--~~R~~~g~ 471 (571)
T KOG4441|consen 402 DPVTNKWTPVAPMLTRRSGHGVAVL---GGKLYIIGGGDGSSN-----CLNSVECYDPETNTWTLIAPMN--TRRSGFGV 471 (571)
T ss_pred cCCCCcccccCCCCcceeeeEEEEE---CCEEEEEcCcCCCcc-----ccceEEEEcCCCCceeecCCcc--cccccceE
Confidence 99 9999999999999999998 999999999865541 4899999999999999999985 89999999
Q ss_pred EEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccc
Q psy14331 166 IALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKG 245 (265)
Q Consensus 166 ~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~ 245 (265)
++++++||++||.++. ....++++|||.+++|+.+.++ +.+|..++++++ ++++|++||++.....++++.|
T Consensus 472 a~~~~~iYvvGG~~~~----~~~~~VE~ydp~~~~W~~v~~m---~~~rs~~g~~~~-~~~ly~vGG~~~~~~l~~ve~y 543 (571)
T KOG4441|consen 472 AVLNGKIYVVGGFDGT----SALSSVERYDPETNQWTMVAPM---TSPRSAVGVVVL-GGKLYAVGGFDGNNNLNTVECY 543 (571)
T ss_pred EEECCEEEEECCccCC----CccceEEEEcCCCCceeEcccC---ccccccccEEEE-CCEEEEEecccCccccceeEEc
Confidence 9999999999999883 3367799999999999999885 778888888777 9999999999999989999999
Q ss_pred eeeceEEEEecC
Q psy14331 246 IVHTDTFLLTPD 257 (265)
Q Consensus 246 ~~~~~~w~~~~~ 257 (265)
+|..|.|...+.
T Consensus 544 dp~~d~W~~~~~ 555 (571)
T KOG4441|consen 544 DPETDTWTEVTE 555 (571)
T ss_pred CCCCCceeeCCC
Confidence 999999988877
No 3
>PLN02153 epithiospecifier protein
Probab=100.00 E-value=1.5e-35 Score=247.62 Aligned_cols=226 Identities=18% Similarity=0.295 Sum_probs=170.1
Q ss_pred HHHHHHHhhcccceEEEeccCCCCCC---CceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecC
Q psy14331 23 IVRDIEAEEKRKNKVIEKVVPEPTRR---ANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSA 95 (265)
Q Consensus 23 ~~~~~~~~d~~~~~w~~~~~~~p~~r---~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~ 95 (265)
..++++.||+.+++|..+.+..+.|| .+|+++++ +++||+|||.. .... ++++++||+ |...
T Consensus 48 ~~~~~~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~--~~~iyv~GG~~--~~~~-------~~~v~~yd~~t~~W~~~ 116 (341)
T PLN02153 48 IDKDLYVFDFNTHTWSIAPANGDVPRISCLGVRMVAV--GTKLYIFGGRD--EKRE-------FSDFYSYDTVKNEWTFL 116 (341)
T ss_pred eeCcEEEEECCCCEEEEcCccCCCCCCccCceEEEEE--CCEEEEECCCC--CCCc-------cCcEEEEECCCCEEEEe
Confidence 45689999999999988433222333 36777776 99999999953 2221 689999999 9988
Q ss_pred CCC-----CCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEEC
Q psy14331 96 PGA-----PPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIALK 169 (265)
Q Consensus 96 ~~~-----p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~~ 169 (265)
++| |.+|..|+++++ +++|||+||............++++++||+++++|+.++.+. .|.+|..|++++++
T Consensus 117 ~~~~~~~~p~~R~~~~~~~~---~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~~~W~~l~~~~~~~~~r~~~~~~~~~ 193 (341)
T PLN02153 117 TKLDEEGGPEARTFHSMASD---ENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENFEKRGGAGFAVVQ 193 (341)
T ss_pred ccCCCCCCCCCceeeEEEEE---CCEEEEECCccCCCccCCCcccceEEEEECCCCeEeeCCCCCCCCCCCCcceEEEEC
Confidence 776 889999999877 899999999864322111123578999999999999998765 45789999999999
Q ss_pred CEEEEEcCccCC----CCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccc
Q psy14331 170 KHLVVFGGFHDN----LREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKG 245 (265)
Q Consensus 170 ~~iyv~GG~~~~----~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~ 245 (265)
++|||+||.... +......+++++||+.+++|+++...+..|.+|..|+++++ +++|||+||.............
T Consensus 194 ~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~~W~~~~~~g~~P~~r~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~ 272 (341)
T PLN02153 194 GKIWVVYGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSVFAHAVV-GKYIIIFGGEVWPDLKGHLGPG 272 (341)
T ss_pred CeEEEEeccccccccCCccceecCceEEEEcCCCcEEeccccCCCCCCcceeeeEEE-CCEEEEECcccCCccccccccc
Confidence 999999997421 11112257899999999999999887667889999998877 8999999997533211111112
Q ss_pred eeeceEEEEecCcccccc
Q psy14331 246 IVHTDTFLLTPDSKTLRS 263 (265)
Q Consensus 246 ~~~~~~w~~~~~~~~w~~ 263 (265)
...+++|.||+++++|+.
T Consensus 273 ~~~n~v~~~d~~~~~W~~ 290 (341)
T PLN02153 273 TLSNEGYALDTETLVWEK 290 (341)
T ss_pred cccccEEEEEcCccEEEe
Confidence 345689999999999974
No 4
>KOG4441|consensus
Probab=100.00 E-value=6.3e-36 Score=261.93 Aligned_cols=222 Identities=22% Similarity=0.352 Sum_probs=189.0
Q ss_pred eeccccc-HHHHHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe
Q psy14331 13 TIMFNED-IEKIVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL 91 (265)
Q Consensus 13 ~~~~~~~-~~~~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~ 91 (265)
+..|+.. ..+..+.+..||+.++.|.. .+++|.+|..++++++ +++||++||.+. +... ++++++||+
T Consensus 288 ~~vGG~~~~~~~~~~ve~yd~~~~~w~~-~a~m~~~r~~~~~~~~--~~~lYv~GG~~~-~~~~-------l~~ve~YD~ 356 (571)
T KOG4441|consen 288 VAVGGYNRQGQSLRSVECYDPKTNEWSS-LAPMPSPRCRVGVAVL--NGKLYVVGGYDS-GSDR-------LSSVERYDP 356 (571)
T ss_pred EEECCCCCCCcccceeEEecCCcCcEee-cCCCCcccccccEEEE--CCEEEEEccccC-CCcc-------cceEEEecC
Confidence 3334433 35677889999999999999 8999999999999987 999999999532 2221 799999999
Q ss_pred ----eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEE
Q psy14331 92 ----FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIA 167 (265)
Q Consensus 92 ----w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~ 167 (265)
|...++|+.+|..++++++ ++.||++||.+... .++++++||+.+++|+.+++|. .+|.+|++++
T Consensus 357 ~~~~W~~~a~M~~~R~~~~v~~l---~g~iYavGG~dg~~------~l~svE~YDp~~~~W~~va~m~--~~r~~~gv~~ 425 (571)
T KOG4441|consen 357 RTNQWTPVAPMNTKRSDFGVAVL---DGKLYAVGGFDGEK------SLNSVECYDPVTNKWTPVAPML--TRRSGHGVAV 425 (571)
T ss_pred CCCceeccCCccCccccceeEEE---CCEEEEEecccccc------ccccEEEecCCCCcccccCCCC--cceeeeEEEE
Confidence 9999999999999999999 99999999987443 4889999999999999999885 6999999999
Q ss_pred ECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcccccccccccccee
Q psy14331 168 LKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIV 247 (265)
Q Consensus 168 ~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~ 247 (265)
.+++||++||.+... ..++++++|||.+++|+.++++ +.+|.+++++++ +++||++||.+......++++|++
T Consensus 426 ~~g~iYi~GG~~~~~---~~l~sve~YDP~t~~W~~~~~M---~~~R~~~g~a~~-~~~iYvvGG~~~~~~~~~VE~ydp 498 (571)
T KOG4441|consen 426 LGGKLYIIGGGDGSS---NCLNSVECYDPETNTWTLIAPM---NTRRSGFGVAVL-NGKIYVVGGFDGTSALSSVERYDP 498 (571)
T ss_pred ECCEEEEEcCcCCCc---cccceEEEEcCCCCceeecCCc---ccccccceEEEE-CCEEEEECCccCCCccceEEEEcC
Confidence 999999999998764 2589999999999999999996 889999998877 999999999998655666788888
Q ss_pred eceEEEEecCcccccc
Q psy14331 248 HTDTFLLTPDSKTLRS 263 (265)
Q Consensus 248 ~~~~w~~~~~~~~w~~ 263 (265)
..+.|..-..+..-|.
T Consensus 499 ~~~~W~~v~~m~~~rs 514 (571)
T KOG4441|consen 499 ETNQWTMVAPMTSPRS 514 (571)
T ss_pred CCCceeEcccCccccc
Confidence 8888877665554443
No 5
>PLN02193 nitrile-specifier protein
Probab=100.00 E-value=2.7e-34 Score=248.51 Aligned_cols=215 Identities=22% Similarity=0.331 Sum_probs=171.7
Q ss_pred HHHHHHHhhcccceEEEecc--CCCC-CCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecC
Q psy14331 23 IVRDIEAEEKRKNKVIEKVV--PEPT-RRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSA 95 (265)
Q Consensus 23 ~~~~~~~~d~~~~~w~~~~~--~~p~-~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~ 95 (265)
..+++++||+.+++|..+.. .+|. +|.+|+++++ +++||+|||.. ... .++++++||+ |...
T Consensus 191 ~~~~v~~yD~~~~~W~~~~~~g~~P~~~~~~~~~v~~--~~~lYvfGG~~--~~~-------~~ndv~~yD~~t~~W~~l 259 (470)
T PLN02193 191 IDKHLYVFDLETRTWSISPATGDVPHLSCLGVRMVSI--GSTLYVFGGRD--ASR-------QYNGFYSFDTTTNEWKLL 259 (470)
T ss_pred eeCcEEEEECCCCEEEeCCCCCCCCCCcccceEEEEE--CCEEEEECCCC--CCC-------CCccEEEEECCCCEEEEc
Confidence 34679999999999987433 2344 3567887776 99999999953 221 1789999999 9988
Q ss_pred CCC---CCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEECCE
Q psy14331 96 PGA---PPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIALKKH 171 (265)
Q Consensus 96 ~~~---p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~~~~ 171 (265)
+++ |.+|..|+++++ +++|||+||..... .++++++||+.+++|+.++... .|.+|..|++++++++
T Consensus 260 ~~~~~~P~~R~~h~~~~~---~~~iYv~GG~~~~~------~~~~~~~yd~~t~~W~~~~~~~~~~~~R~~~~~~~~~gk 330 (470)
T PLN02193 260 TPVEEGPTPRSFHSMAAD---EENVYVFGGVSATA------RLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGK 330 (470)
T ss_pred CcCCCCCCCccceEEEEE---CCEEEEECCCCCCC------CcceEEEEECCCCEEEeCCCCCCCCCCCCCcEEEEECCc
Confidence 877 889999998877 89999999985432 3688999999999999987543 4678999999999999
Q ss_pred EEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceE
Q psy14331 172 LVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDT 251 (265)
Q Consensus 172 iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~ 251 (265)
||++||.++. ..+++++||+.+++|+++.+++..|.+|..|+++++ +++|||+||................+++
T Consensus 331 iyviGG~~g~-----~~~dv~~yD~~t~~W~~~~~~g~~P~~R~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~~~ndv 404 (470)
T PLN02193 331 VWVVYGFNGC-----EVDDVHYYDPVQDKWTQVETFGVRPSERSVFASAAV-GKHIVIFGGEIAMDPLAHVGPGQLTDGT 404 (470)
T ss_pred EEEEECCCCC-----ccCceEEEECCCCEEEEeccCCCCCCCcceeEEEEE-CCEEEEECCccCCccccccCccceeccE
Confidence 9999997543 268999999999999999987777889999998877 8999999998653211111122456789
Q ss_pred EEEecCcccccc
Q psy14331 252 FLLTPDSKTLRS 263 (265)
Q Consensus 252 w~~~~~~~~w~~ 263 (265)
|.||+.+++|++
T Consensus 405 ~~~D~~t~~W~~ 416 (470)
T PLN02193 405 FALDTETLQWER 416 (470)
T ss_pred EEEEcCcCEEEE
Confidence 999999999984
No 6
>PHA02713 hypothetical protein; Provisional
Probab=100.00 E-value=1.5e-34 Score=254.09 Aligned_cols=206 Identities=16% Similarity=0.206 Sum_probs=168.1
Q ss_pred HHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCC
Q psy14331 23 IVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGA 98 (265)
Q Consensus 23 ~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~ 98 (265)
..+.++.||+.+++|.. .++||.+|..++++++ +++||++||. ++... ++++++||+ |...++|
T Consensus 318 ~~~~v~~Yd~~~n~W~~-~~~m~~~R~~~~~~~~--~g~IYviGG~--~~~~~-------~~sve~Ydp~~~~W~~~~~m 385 (557)
T PHA02713 318 SLNKVYKINIENKIHVE-LPPMIKNRCRFSLAVI--DDTIYAIGGQ--NGTNV-------ERTIECYTMGDDKWKMLPDM 385 (557)
T ss_pred ccceEEEEECCCCeEee-CCCCcchhhceeEEEE--CCEEEEECCc--CCCCC-------CceEEEEECCCCeEEECCCC
Confidence 56779999999999988 8899999999999887 9999999995 33221 688999999 9999999
Q ss_pred CCCccccceEEEecCCCEEEEEcCcccCCCc------------cccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEE
Q psy14331 99 PPPRCSHQMVALSADKGQLWMFGGEFSSASE------------SQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMI 166 (265)
Q Consensus 99 p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~------------~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~ 166 (265)
|.+|..++++++ +++||++||.+..... ......+.+++|||++++|+.++++ |.+|..++++
T Consensus 386 p~~r~~~~~~~~---~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~~v~~m--~~~r~~~~~~ 460 (557)
T PHA02713 386 PIALSSYGMCVL---DQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNF--WTGTIRPGVV 460 (557)
T ss_pred CcccccccEEEE---CCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCCeEeecCCC--CcccccCcEE
Confidence 999999999888 9999999997532100 0001257899999999999999887 4789999999
Q ss_pred EECCEEEEEcCccCCCCccccCCceEEEEcCC-CceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccc
Q psy14331 167 ALKKHLVVFGGFHDNLREAKYYNDVHIFDLET-YAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKG 245 (265)
Q Consensus 167 ~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~-~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~ 245 (265)
+++++||++||.++.. ...+.+++|||.+ ++|+.++++ |.+|..++++++ +++||++||.++. .++++|
T Consensus 461 ~~~~~IYv~GG~~~~~---~~~~~ve~Ydp~~~~~W~~~~~m---~~~r~~~~~~~~-~~~iyv~Gg~~~~---~~~e~y 530 (557)
T PHA02713 461 SHKDDIYVVCDIKDEK---NVKTCIFRYNTNTYNGWELITTT---ESRLSALHTILH-DNTIMMLHCYESY---MLQDTF 530 (557)
T ss_pred EECCEEEEEeCCCCCC---ccceeEEEecCCCCCCeeEcccc---CcccccceeEEE-CCEEEEEeeecce---eehhhc
Confidence 9999999999986432 2235689999999 899999984 889999998887 9999999998763 356666
Q ss_pred eeeceEEEEe
Q psy14331 246 IVHTDTFLLT 255 (265)
Q Consensus 246 ~~~~~~w~~~ 255 (265)
++.++.|..-
T Consensus 531 d~~~~~W~~~ 540 (557)
T PHA02713 531 NVYTYEWNHI 540 (557)
T ss_pred Ccccccccch
Confidence 6666666543
No 7
>PHA02713 hypothetical protein; Provisional
Probab=100.00 E-value=4.3e-34 Score=251.24 Aligned_cols=212 Identities=16% Similarity=0.278 Sum_probs=170.4
Q ss_pred HHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCC
Q psy14331 25 RDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPP 100 (265)
Q Consensus 25 ~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~ 100 (265)
..+++||+.+++|.. .+++|.+|..|+++++ +++||++||...+... ++++++||+ |...++||.
T Consensus 272 ~~v~~yd~~~~~W~~-l~~mp~~r~~~~~a~l--~~~IYviGG~~~~~~~--------~~~v~~Yd~~~n~W~~~~~m~~ 340 (557)
T PHA02713 272 PCILVYNINTMEYSV-ISTIPNHIINYASAIV--DNEIIIAGGYNFNNPS--------LNKVYKINIENKIHVELPPMIK 340 (557)
T ss_pred CCEEEEeCCCCeEEE-CCCCCccccceEEEEE--CCEEEEEcCCCCCCCc--------cceEEEEECCCCeEeeCCCCcc
Confidence 357889999999998 7899999999988886 9999999995212111 689999999 999999999
Q ss_pred CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccC
Q psy14331 101 PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHD 180 (265)
Q Consensus 101 ~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~ 180 (265)
+|..++++++ +++||++||..... ..+++++||+.+++|+.++++ |.+|..+++++++++||++||.+.
T Consensus 341 ~R~~~~~~~~---~g~IYviGG~~~~~------~~~sve~Ydp~~~~W~~~~~m--p~~r~~~~~~~~~g~IYviGG~~~ 409 (557)
T PHA02713 341 NRCRFSLAVI---DDTIYAIGGQNGTN------VERTIECYTMGDDKWKMLPDM--PIALSSYGMCVLDQYIYIIGGRTE 409 (557)
T ss_pred hhhceeEEEE---CCEEEEECCcCCCC------CCceEEEEECCCCeEEECCCC--CcccccccEEEECCEEEEEeCCCc
Confidence 9999999888 99999999975332 367899999999999999877 589999999999999999999864
Q ss_pred CCC--------------ccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccc-cccccccc
Q psy14331 181 NLR--------------EAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQS-VKKDVDKG 245 (265)
Q Consensus 181 ~~~--------------~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~-~~~~~~~~ 245 (265)
... .....+.+++|||.+++|+.++++ +.+|..++++++ +++|||+||.+... ....+++|
T Consensus 410 ~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~~v~~m---~~~r~~~~~~~~-~~~IYv~GG~~~~~~~~~~ve~Y 485 (557)
T PHA02713 410 HIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNF---WTGTIRPGVVSH-KDDIYVVCDIKDEKNVKTCIFRY 485 (557)
T ss_pred ccccccccccccccccccccccceEEEECCCCCeEeecCCC---CcccccCcEEEE-CCEEEEEeCCCCCCccceeEEEe
Confidence 311 011257899999999999999985 788998888877 89999999986432 22345667
Q ss_pred eeec-eEEEEecCccccc
Q psy14331 246 IVHT-DTFLLTPDSKTLR 262 (265)
Q Consensus 246 ~~~~-~~w~~~~~~~~w~ 262 (265)
+|.. +.|...+.+..-|
T Consensus 486 dp~~~~~W~~~~~m~~~r 503 (557)
T PHA02713 486 NTNTYNGWELITTTESRL 503 (557)
T ss_pred cCCCCCCeeEccccCccc
Confidence 7766 5676666555433
No 8
>PHA03098 kelch-like protein; Provisional
Probab=100.00 E-value=2.7e-33 Score=247.36 Aligned_cols=206 Identities=19% Similarity=0.253 Sum_probs=170.5
Q ss_pred HHHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCC
Q psy14331 22 KIVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPG 97 (265)
Q Consensus 22 ~~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~ 97 (265)
...++++.||+.+++|.. .+++|.+|..|+++++ +++||++||.. .... ++++++||+ |...++
T Consensus 308 ~~~~~v~~yd~~~~~W~~-~~~~~~~R~~~~~~~~--~~~lyv~GG~~--~~~~-------~~~v~~yd~~~~~W~~~~~ 375 (534)
T PHA03098 308 LSVNSVVSYDTKTKSWNK-VPELIYPRKNPGVTVF--NNRIYVIGGIY--NSIS-------LNTVESWKPGESKWREEPP 375 (534)
T ss_pred CeeccEEEEeCCCCeeeE-CCCCCcccccceEEEE--CCEEEEEeCCC--CCEe-------cceEEEEcCCCCceeeCCC
Confidence 345679999999999988 8889999999998886 99999999953 2211 689999999 999999
Q ss_pred CCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcC
Q psy14331 98 APPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGG 177 (265)
Q Consensus 98 ~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG 177 (265)
+|.+|..|+++.+ +++||++||...... .++++++||+.+++|+.++++ |.+|..|+++..+++||++||
T Consensus 376 lp~~r~~~~~~~~---~~~iYv~GG~~~~~~-----~~~~v~~yd~~t~~W~~~~~~--p~~r~~~~~~~~~~~iyv~GG 445 (534)
T PHA03098 376 LIFPRYNPCVVNV---NNLIYVIGGISKNDE-----LLKTVECFSLNTNKWSKGSPL--PISHYGGCAIYHDGKIYVIGG 445 (534)
T ss_pred cCcCCccceEEEE---CCEEEEECCcCCCCc-----ccceEEEEeCCCCeeeecCCC--CccccCceEEEECCEEEEECC
Confidence 9999999999887 899999999754331 268899999999999998866 588999999999999999999
Q ss_pred ccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecC
Q psy14331 178 FHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPD 257 (265)
Q Consensus 178 ~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~ 257 (265)
....... ..++.+++||+.+++|+.+..+ |.+|..++++++ +++|||+||.+... ..++++.||++
T Consensus 446 ~~~~~~~-~~~~~v~~yd~~~~~W~~~~~~---~~~r~~~~~~~~-~~~iyv~GG~~~~~---------~~~~v~~yd~~ 511 (534)
T PHA03098 446 ISYIDNI-KVYNIVESYNPVTNKWTELSSL---NFPRINASLCIF-NNKIYVVGGDKYEY---------YINEIEVYDDK 511 (534)
T ss_pred ccCCCCC-cccceEEEecCCCCceeeCCCC---CcccccceEEEE-CCEEEEEcCCcCCc---------ccceeEEEeCC
Confidence 8644221 2256799999999999999874 778888888777 89999999987543 24567888888
Q ss_pred cccccc
Q psy14331 258 SKTLRS 263 (265)
Q Consensus 258 ~~~w~~ 263 (265)
+++|+.
T Consensus 512 ~~~W~~ 517 (534)
T PHA03098 512 TNTWTL 517 (534)
T ss_pred CCEEEe
Confidence 888863
No 9
>PHA02790 Kelch-like protein; Provisional
Probab=100.00 E-value=4.6e-33 Score=241.42 Aligned_cols=188 Identities=16% Similarity=0.244 Sum_probs=159.2
Q ss_pred HHHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCC
Q psy14331 22 KIVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPG 97 (265)
Q Consensus 22 ~~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~ 97 (265)
...+.++.||+.+++|.. .+++|.+|..++++++ +++||++||. +. .+.+++||+ |...++
T Consensus 284 ~~~~~v~~Ydp~~~~W~~-~~~m~~~r~~~~~v~~--~~~iYviGG~--~~----------~~sve~ydp~~n~W~~~~~ 348 (480)
T PHA02790 284 EIHNNAIAVNYISNNWIP-IPPMNSPRLYASGVPA--NNKLYVVGGL--PN----------PTSVERWFHGDAAWVNMPS 348 (480)
T ss_pred CcCCeEEEEECCCCEEEE-CCCCCchhhcceEEEE--CCEEEEECCc--CC----------CCceEEEECCCCeEEECCC
Confidence 455678899999999999 8899999999988886 9999999994 21 356899999 999999
Q ss_pred CCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcC
Q psy14331 98 APPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGG 177 (265)
Q Consensus 98 ~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG 177 (265)
||.+|..|+++++ +++||++||.... .+.+++|||.+++|+.++++ |.+|..|++++++++||++||
T Consensus 349 l~~~r~~~~~~~~---~g~IYviGG~~~~--------~~~ve~ydp~~~~W~~~~~m--~~~r~~~~~~~~~~~IYv~GG 415 (480)
T PHA02790 349 LLKPRCNPAVASI---NNVIYVIGGHSET--------DTTTEYLLPNHDQWQFGPST--YYPHYKSCALVFGRRLFLVGR 415 (480)
T ss_pred CCCCCcccEEEEE---CCEEEEecCcCCC--------CccEEEEeCCCCEEEeCCCC--CCccccceEEEECCEEEEECC
Confidence 9999999999888 9999999997432 35789999999999999887 488999999999999999998
Q ss_pred ccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecC
Q psy14331 178 FHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPD 257 (265)
Q Consensus 178 ~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~ 257 (265)
. +++||+.+++|+.++++ |.+|.+++++++ +++||++||.+.... .+.+.+||++
T Consensus 416 ~------------~e~ydp~~~~W~~~~~m---~~~r~~~~~~v~-~~~IYviGG~~~~~~---------~~~ve~Yd~~ 470 (480)
T PHA02790 416 N------------AEFYCESSNTWTLIDDP---IYPRDNPELIIV-DNKLLLIGGFYRGSY---------IDTIEVYNNR 470 (480)
T ss_pred c------------eEEecCCCCcEeEcCCC---CCCccccEEEEE-CCEEEEECCcCCCcc---------cceEEEEECC
Confidence 3 57899999999999885 788999998877 999999999864321 2446777777
Q ss_pred ccccc
Q psy14331 258 SKTLR 262 (265)
Q Consensus 258 ~~~w~ 262 (265)
+++|+
T Consensus 471 ~~~W~ 475 (480)
T PHA02790 471 TYSWN 475 (480)
T ss_pred CCeEE
Confidence 77775
No 10
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=100.00 E-value=8.8e-33 Score=229.37 Aligned_cols=212 Identities=16% Similarity=0.225 Sum_probs=159.4
Q ss_pred HHHHHhh-cc-cceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----e----ec
Q psy14331 25 RDIEAEE-KR-KNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----F----VS 94 (265)
Q Consensus 25 ~~~~~~d-~~-~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w----~~ 94 (265)
++++.++ +. +.+|.. .+++|.+|..++++++ +++||++||. +.... ++++++||+ | ..
T Consensus 39 ~~v~~~~~~~~~~~W~~-~~~lp~~r~~~~~~~~--~~~lyviGG~--~~~~~-------~~~v~~~d~~~~~w~~~~~~ 106 (323)
T TIGR03548 39 KGIYIAKDENSNLKWVK-DGQLPYEAAYGASVSV--ENGIYYIGGS--NSSER-------FSSVYRITLDESKEELICET 106 (323)
T ss_pred eeeEEEecCCCceeEEE-cccCCccccceEEEEE--CCEEEEEcCC--CCCCC-------ceeEEEEEEcCCceeeeeeE
Confidence 4566664 33 237988 7889999988887776 9999999995 22221 689999998 6 67
Q ss_pred CCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEE
Q psy14331 95 APGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVV 174 (265)
Q Consensus 95 ~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv 174 (265)
.++||.+|..|+++++ +++||++||..... ..+++++||+.+++|+.+++++ ..+|..|++++++++|||
T Consensus 107 ~~~lp~~~~~~~~~~~---~~~iYv~GG~~~~~------~~~~v~~yd~~~~~W~~~~~~p-~~~r~~~~~~~~~~~iYv 176 (323)
T TIGR03548 107 IGNLPFTFENGSACYK---DGTLYVGGGNRNGK------PSNKSYLFNLETQEWFELPDFP-GEPRVQPVCVKLQNELYV 176 (323)
T ss_pred cCCCCcCccCceEEEE---CCEEEEEeCcCCCc------cCceEEEEcCCCCCeeECCCCC-CCCCCcceEEEECCEEEE
Confidence 8999999999999887 89999999974322 3689999999999999998764 147888888899999999
Q ss_pred EcCccCCCCccccCCceEEEEcCCCceEEeccCCC--CCCCCCCceEEEeCCCeEEEEcCcccccccccc----------
Q psy14331 175 FGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGA--GPAPRSGCQMAATPDGKILISGGYSKQSVKKDV---------- 242 (265)
Q Consensus 175 ~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~--~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~---------- 242 (265)
+||.+... ..++++||+.+++|+.+.++.. .|..+..++.+++.+++|||+||.+.....+.+
T Consensus 177 ~GG~~~~~-----~~~~~~yd~~~~~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~ 251 (323)
T TIGR03548 177 FGGGSNIA-----YTDGYKYSPKKNQWQKVADPTTDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVIDLATMKDES 251 (323)
T ss_pred EcCCCCcc-----ccceEEEecCCCeeEECCCCCCCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHhhhhhccchh
Confidence 99985432 4578999999999999987632 344444555555657899999998753211111
Q ss_pred -------------ccceeeceEEEEecCcccccc
Q psy14331 243 -------------DKGIVHTDTFLLTPDSKTLRS 263 (265)
Q Consensus 243 -------------~~~~~~~~~w~~~~~~~~w~~ 263 (265)
+.+...+++++||+.+++|+.
T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~~W~~ 285 (323)
T TIGR03548 252 LKGYKKEYFLKPPEWYNWNRKILIYNVRTGKWKS 285 (323)
T ss_pred hhhhHHHHhCCCccccCcCceEEEEECCCCeeeE
Confidence 112234678999999999874
No 11
>PLN02153 epithiospecifier protein
Probab=100.00 E-value=1.7e-31 Score=223.13 Aligned_cols=212 Identities=21% Similarity=0.381 Sum_probs=159.9
Q ss_pred cccceEEEecc---CCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCC-CCcc
Q psy14331 32 KRKNKVIEKVV---PEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAP-PPRC 103 (265)
Q Consensus 32 ~~~~~w~~~~~---~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p-~~r~ 103 (265)
+....|..+.. .+|.+|..|+++++ +++|||+||..... ... .+++++||+ |...++++ .||.
T Consensus 4 ~~~~~W~~~~~~~~~~P~pR~~h~~~~~--~~~iyv~GG~~~~~--~~~-----~~~~~~yd~~~~~W~~~~~~~~~p~~ 74 (341)
T PLN02153 4 TLQGGWIKVEQKGGKGPGPRCSHGIAVV--GDKLYSFGGELKPN--EHI-----DKDLYVFDFNTHTWSIAPANGDVPRI 74 (341)
T ss_pred ccCCeEEEecCCCCCCCCCCCcceEEEE--CCEEEEECCccCCC--Cce-----eCcEEEEECCCCEEEEcCccCCCCCC
Confidence 35677988544 37899999998886 89999999953221 112 589999999 99887664 3443
Q ss_pred ---ccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC---CCCCCcCceEEEECCEEEEEcC
Q psy14331 104 ---SHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD---TPPSRSGHRMIALKKHLVVFGG 177 (265)
Q Consensus 104 ---~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~---~p~~~~~~~~~~~~~~iyv~GG 177 (265)
.|+++++ +++||++||..... .++++++||+.+++|+.++++. .|.+|..|++++.+++||||||
T Consensus 75 ~~~~~~~~~~---~~~iyv~GG~~~~~------~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG 145 (341)
T PLN02153 75 SCLGVRMVAV---GTKLYIFGGRDEKR------EFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGG 145 (341)
T ss_pred ccCceEEEEE---CCEEEEECCCCCCC------ccCcEEEEECCCCEEEEeccCCCCCCCCCceeeEEEEECCEEEEECC
Confidence 5767777 89999999975432 3689999999999999987663 3789999999999999999999
Q ss_pred ccCCCC--ccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEe
Q psy14331 178 FHDNLR--EAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLT 255 (265)
Q Consensus 178 ~~~~~~--~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~ 255 (265)
...... ....++++++||+.+++|+.++.++..|.+|.+|+++++ +++|||+||............ ...+++++||
T Consensus 146 ~~~~~~~~~~~~~~~v~~yd~~~~~W~~l~~~~~~~~~r~~~~~~~~-~~~iyv~GG~~~~~~~gG~~~-~~~~~v~~yd 223 (341)
T PLN02153 146 VSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENFEKRGGAGFAVV-QGKIWVVYGFATSILPGGKSD-YESNAVQFFD 223 (341)
T ss_pred ccCCCccCCCcccceEEEEECCCCeEeeCCCCCCCCCCCCcceEEEE-CCeEEEEeccccccccCCccc-eecCceEEEE
Confidence 864321 112357899999999999999987656788999988777 899999999753211110011 1246789999
Q ss_pred cCcccccc
Q psy14331 256 PDSKTLRS 263 (265)
Q Consensus 256 ~~~~~w~~ 263 (265)
+.+++|++
T Consensus 224 ~~~~~W~~ 231 (341)
T PLN02153 224 PASGKWTE 231 (341)
T ss_pred cCCCcEEe
Confidence 99999975
No 12
>PLN02193 nitrile-specifier protein
Probab=100.00 E-value=3.4e-31 Score=229.22 Aligned_cols=217 Identities=20% Similarity=0.334 Sum_probs=167.3
Q ss_pred eeecccccHHHHHHHHHHhhcccceEEEeccCC---CCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEE
Q psy14331 12 WTIMFNEDIEKIVRDIEAEEKRKNKVIEKVVPE---PTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLIL 88 (265)
Q Consensus 12 ~~~~~~~~~~~~~~~~~~~d~~~~~w~~~~~~~---p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~ 88 (265)
+|++++.+....++++++||+.+++|.. ..++ |.+|..|+++++ +++||||||. +.... ++++++
T Consensus 231 lYvfGG~~~~~~~ndv~~yD~~t~~W~~-l~~~~~~P~~R~~h~~~~~--~~~iYv~GG~--~~~~~-------~~~~~~ 298 (470)
T PLN02193 231 LYVFGGRDASRQYNGFYSFDTTTNEWKL-LTPVEEGPTPRSFHSMAAD--EENVYVFGGV--SATAR-------LKTLDS 298 (470)
T ss_pred EEEECCCCCCCCCccEEEEECCCCEEEE-cCcCCCCCCCccceEEEEE--CCEEEEECCC--CCCCC-------cceEEE
Confidence 4455554434456789999999999998 5544 889999998886 9999999995 32221 689999
Q ss_pred EEe----eecCCC---CCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCC
Q psy14331 89 MNL----FVSAPG---APPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSR 160 (265)
Q Consensus 89 yd~----w~~~~~---~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~ 160 (265)
||+ |...+. +|.+|..|+++++ +++||++||.... ..+++++||+.+++|+.++++. .|.+|
T Consensus 299 yd~~t~~W~~~~~~~~~~~~R~~~~~~~~---~gkiyviGG~~g~-------~~~dv~~yD~~t~~W~~~~~~g~~P~~R 368 (470)
T PLN02193 299 YNIVDKKWFHCSTPGDSFSIRGGAGLEVV---QGKVWVVYGFNGC-------EVDDVHYYDPVQDKWTQVETFGVRPSER 368 (470)
T ss_pred EECCCCEEEeCCCCCCCCCCCCCcEEEEE---CCcEEEEECCCCC-------ccCceEEEECCCCEEEEeccCCCCCCCc
Confidence 999 887654 6788999988877 8999999996432 2689999999999999998763 57899
Q ss_pred cCceEEEECCEEEEEcCccCCC-----CccccCCceEEEEcCCCceEEeccCC---CCCCCCCCceE--EEeCC-CeEEE
Q psy14331 161 SGHRMIALKKHLVVFGGFHDNL-----REAKYYNDVHIFDLETYAWKKIEPLG---AGPAPRSGCQM--AATPD-GKILI 229 (265)
Q Consensus 161 ~~~~~~~~~~~iyv~GG~~~~~-----~~~~~~~~v~~y~~~~~~W~~~~~~~---~~p~~~~~~~~--~~~~~-~~iyi 229 (265)
..|+++.++++||||||..... ....+.+++++||+.+++|+.+..++ ..|.+|..+++ +.+.+ +.|++
T Consensus 369 ~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~ndv~~~D~~t~~W~~~~~~~~~~~~P~~R~~~~~~~~~~~~~~~~~~ 448 (470)
T PLN02193 369 SVFASAAVGKHIVIFGGEIAMDPLAHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSRGWTASTTGTIDGKKGLVM 448 (470)
T ss_pred ceeEEEEECCEEEEECCccCCccccccCccceeccEEEEEcCcCEEEEcccCCCCCCCCCCCccccceeeEEcCCceEEE
Confidence 9999999999999999985321 01235679999999999999998654 25777866643 22323 45999
Q ss_pred EcCccccccccccccceeeceEEEEecCc
Q psy14331 230 SGGYSKQSVKKDVDKGIVHTDTFLLTPDS 258 (265)
Q Consensus 230 ~GG~~~~~~~~~~~~~~~~~~~w~~~~~~ 258 (265)
+||..... ...+|+|.+++++
T Consensus 449 fGG~~~~~--------~~~~D~~~~~~~~ 469 (470)
T PLN02193 449 HGGKAPTN--------DRFDDLFFYGIDS 469 (470)
T ss_pred EcCCCCcc--------ccccceEEEecCC
Confidence 99997543 3678999998875
No 13
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.97 E-value=2.2e-30 Score=214.96 Aligned_cols=193 Identities=24% Similarity=0.354 Sum_probs=148.7
Q ss_pred HHHHHHHHHhhcccceEE---EeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----ee
Q psy14331 21 EKIVRDIEAEEKRKNKVI---EKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FV 93 (265)
Q Consensus 21 ~~~~~~~~~~d~~~~~w~---~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~ 93 (265)
...++++++||+.+++|. ...+++|.+|..|+++++ +++||++||.. ++.. ++++++||+ |.
T Consensus 84 ~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~--~~~iYv~GG~~-~~~~--------~~~v~~yd~~~~~W~ 152 (323)
T TIGR03548 84 SERFSSVYRITLDESKEELICETIGNLPFTFENGSACYK--DGTLYVGGGNR-NGKP--------SNKSYLFNLETQEWF 152 (323)
T ss_pred CCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEE--CCEEEEEeCcC-CCcc--------CceEEEEcCCCCCee
Confidence 345678999999999982 337789999999998886 99999999952 2221 689999999 99
Q ss_pred cCCCCCC-CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC---CCCCCcCceEEE-E
Q psy14331 94 SAPGAPP-PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD---TPPSRSGHRMIA-L 168 (265)
Q Consensus 94 ~~~~~p~-~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~---~p~~~~~~~~~~-~ 168 (265)
..+++|. +|..|+++.+ +++|||+||..... ..++++||+++++|+.++++. .|.++..+++++ .
T Consensus 153 ~~~~~p~~~r~~~~~~~~---~~~iYv~GG~~~~~-------~~~~~~yd~~~~~W~~~~~~~~~~~p~~~~~~~~~~~~ 222 (323)
T TIGR03548 153 ELPDFPGEPRVQPVCVKL---QNELYVFGGGSNIA-------YTDGYKYSPKKNQWQKVADPTTDSEPISLLGAASIKIN 222 (323)
T ss_pred ECCCCCCCCCCcceEEEE---CCEEEEEcCCCCcc-------ccceEEEecCCCeeEECCCCCCCCCceeccceeEEEEC
Confidence 9999884 7888877777 89999999975321 457899999999999998763 244444444444 4
Q ss_pred CCEEEEEcCccCCC----------------------------CccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEE
Q psy14331 169 KKHLVVFGGFHDNL----------------------------REAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMA 220 (265)
Q Consensus 169 ~~~iyv~GG~~~~~----------------------------~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~ 220 (265)
+++||++||.+... ....+.+++++||+.+++|+.+..+ +..+|.+++++
T Consensus 223 ~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~~W~~~~~~--p~~~r~~~~~~ 300 (323)
T TIGR03548 223 ESLLLCIGGFNKDVYNDAVIDLATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVRTGKWKSIGNS--PFFARCGAALL 300 (323)
T ss_pred CCEEEEECCcCHHHHHHHHhhhhhccchhhhhhHHHHhCCCccccCcCceEEEEECCCCeeeEcccc--cccccCchheE
Confidence 78999999986421 0112347899999999999999874 23578889888
Q ss_pred EeCCCeEEEEcCccccc
Q psy14331 221 ATPDGKILISGGYSKQS 237 (265)
Q Consensus 221 ~~~~~~iyi~GG~~~~~ 237 (265)
++ +++||++||.....
T Consensus 301 ~~-~~~iyv~GG~~~pg 316 (323)
T TIGR03548 301 LT-GNNIFSINGELKPG 316 (323)
T ss_pred EE-CCEEEEEeccccCC
Confidence 77 89999999986554
No 14
>KOG1230|consensus
Probab=99.97 E-value=2.4e-30 Score=207.54 Aligned_cols=233 Identities=28% Similarity=0.524 Sum_probs=189.3
Q ss_pred HHHHHHHhhcccceEEEecc-CCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecC--
Q psy14331 23 IVRDIEAEEKRKNKVIEKVV-PEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSA-- 95 (265)
Q Consensus 23 ~~~~~~~~d~~~~~w~~~~~-~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~-- 95 (265)
..++++.||..+++|..+.. +.|.||.+|++++++ .+.+|+|||.....++..+ -..+++|.||+ |+.+
T Consensus 96 vYndLy~Yn~k~~eWkk~~spn~P~pRsshq~va~~-s~~l~~fGGEfaSPnq~qF---~HYkD~W~fd~~trkweql~~ 171 (521)
T KOG1230|consen 96 VYNDLYSYNTKKNEWKKVVSPNAPPPRSSHQAVAVP-SNILWLFGGEFASPNQEQF---HHYKDLWLFDLKTRKWEQLEF 171 (521)
T ss_pred EeeeeeEEeccccceeEeccCCCcCCCccceeEEec-cCeEEEeccccCCcchhhh---hhhhheeeeeeccchheeecc
Confidence 34689999999999988654 679999999999984 4699999997544333221 12789999999 7654
Q ss_pred CCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEE-CCEEE
Q psy14331 96 PGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIAL-KKHLV 173 (265)
Q Consensus 96 ~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~-~~~iy 173 (265)
+..|.||++|.+++. .++|++|||..+.... ..+++++++||+.+-+|+++.+.. .|.||.++++.+. .+.||
T Consensus 172 ~g~PS~RSGHRMvaw---K~~lilFGGFhd~nr~--y~YyNDvy~FdLdtykW~Klepsga~PtpRSGcq~~vtpqg~i~ 246 (521)
T KOG1230|consen 172 GGGPSPRSGHRMVAW---KRQLILFGGFHDSNRD--YIYYNDVYAFDLDTYKWSKLEPSGAGPTPRSGCQFSVTPQGGIV 246 (521)
T ss_pred CCCCCCCccceeEEe---eeeEEEEcceecCCCc--eEEeeeeEEEeccceeeeeccCCCCCCCCCCcceEEecCCCcEE
Confidence 567999999999998 9999999998766433 357999999999999999998854 5999999999888 99999
Q ss_pred EEcCccCCCC-----ccccCCceEEEEcCC-----CceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccc
Q psy14331 174 VFGGFHDNLR-----EAKYYNDVHIFDLET-----YAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVD 243 (265)
Q Consensus 174 v~GG~~~~~~-----~~~~~~~v~~y~~~~-----~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~ 243 (265)
|.||++.... .-..+++++.+++.. =.|+++.+.+..|.+|+++++++..+++-+.|||..+..-..+..
T Consensus 247 vyGGYsK~~~kK~~dKG~~hsDmf~L~p~~~~~dKw~W~kvkp~g~kPspRsgfsv~va~n~kal~FGGV~D~eeeeEsl 326 (521)
T KOG1230|consen 247 VYGGYSKQRVKKDVDKGTRHSDMFLLKPEDGREDKWVWTKVKPSGVKPSPRSGFSVAVAKNHKALFFGGVCDLEEEEESL 326 (521)
T ss_pred EEcchhHhhhhhhhhcCceeeeeeeecCCcCCCcceeEeeccCCCCCCCCCCceeEEEecCCceEEecceecccccchhh
Confidence 9999854321 124578999999998 469999999999999999999999888999999987643333344
Q ss_pred cceeeceEEEEecCccccccc
Q psy14331 244 KGIVHTDTFLLTPDSKTLRSC 264 (265)
Q Consensus 244 ~~~~~~~~w~~~~~~~~w~~~ 264 (265)
.....+|+|.||...++|+.+
T Consensus 327 ~g~F~NDLy~fdlt~nrW~~~ 347 (521)
T KOG1230|consen 327 SGEFFNDLYFFDLTRNRWSEG 347 (521)
T ss_pred hhhhhhhhhheecccchhhHh
Confidence 456788999999999999863
No 15
>KOG4693|consensus
Probab=99.97 E-value=2.5e-30 Score=197.02 Aligned_cols=213 Identities=20% Similarity=0.345 Sum_probs=174.3
Q ss_pred HHHHhhcccceEEEecc------------CCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe--
Q psy14331 26 DIEAEEKRKNKVIEKVV------------PEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL-- 91 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~------------~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~-- 91 (265)
++..+|..+-+|..+.+ ..|..|++|+.+.. .+++|+.||.+.+... .|-+++||+
T Consensus 45 DVH~lNa~~~RWtk~pp~~~ka~i~~~yp~VPyqRYGHtvV~y--~d~~yvWGGRND~ega--------CN~Ly~fDp~t 114 (392)
T KOG4693|consen 45 DVHVLNAENYRWTKMPPGITKATIESPYPAVPYQRYGHTVVEY--QDKAYVWGGRNDDEGA--------CNLLYEFDPET 114 (392)
T ss_pred eeEEeeccceeEEecCcccccccccCCCCccchhhcCceEEEE--cceEEEEcCccCcccc--------cceeeeecccc
Confidence 46688999999987433 24678999998886 9999999996443333 689999999
Q ss_pred --eecC---CCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCC-CCCCcCceE
Q psy14331 92 --FVSA---PGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDT-PPSRSGHRM 165 (265)
Q Consensus 92 --w~~~---~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~-p~~~~~~~~ 165 (265)
|.+. .-.|..|.+|++|++ ++.+|+|||..+... +.++++.++|..|.+|+.+...+. |.=|..|++
T Consensus 115 ~~W~~p~v~G~vPgaRDGHsAcV~---gn~MyiFGGye~~a~----~FS~d~h~ld~~TmtWr~~~Tkg~PprwRDFH~a 187 (392)
T KOG4693|consen 115 NVWKKPEVEGFVPGARDGHSACVW---GNQMYIFGGYEEDAQ----RFSQDTHVLDFATMTWREMHTKGDPPRWRDFHTA 187 (392)
T ss_pred ccccccceeeecCCccCCceeeEE---CcEEEEecChHHHHH----hhhccceeEeccceeeeehhccCCCchhhhhhhh
Confidence 6543 447889999999999 999999999855433 458999999999999999987774 445678999
Q ss_pred EEECCEEEEEcCccCCCC-----ccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcccccccc
Q psy14331 166 IALKKHLVVFGGFHDNLR-----EAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKK 240 (265)
Q Consensus 166 ~~~~~~iyv~GG~~~~~~-----~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~ 240 (265)
+++++.+|||||...... ..++.+.|..+|..+..|.+....+..|.+|.+|++.+. +++||+|||++...
T Consensus 188 ~~~~~~MYiFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~GRRSHS~fvY-ng~~Y~FGGYng~l--- 263 (392)
T KOG4693|consen 188 SVIDGMMYIFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPGGRRSHSTFVY-NGKMYMFGGYNGTL--- 263 (392)
T ss_pred hhccceEEEeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCCcccccceEEE-cceEEEecccchhh---
Confidence 999999999999865422 246778899999999999999888888999999998877 99999999998653
Q ss_pred ccccceeeceEEEEecCcccccc
Q psy14331 241 DVDKGIVHTDTFLLTPDSKTLRS 263 (265)
Q Consensus 241 ~~~~~~~~~~~w~~~~~~~~w~~ 263 (265)
-.-.+|+|+|||.+..|..
T Consensus 264 ----n~HfndLy~FdP~t~~W~~ 282 (392)
T KOG4693|consen 264 ----NVHFNDLYCFDPKTSMWSV 282 (392)
T ss_pred ----hhhhcceeecccccchhee
Confidence 1246789999999999974
No 16
>PHA03098 kelch-like protein; Provisional
Probab=99.97 E-value=3.9e-29 Score=220.93 Aligned_cols=202 Identities=18% Similarity=0.264 Sum_probs=159.5
Q ss_pred HHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCC
Q psy14331 26 DIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPP 101 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~ 101 (265)
.+..|++..++|.. ..+.|. +..|+++++ ++.||++||....... .+++++||+ |...++||.+
T Consensus 265 ~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~--~~~lyv~GG~~~~~~~--------~~~v~~yd~~~~~W~~~~~~~~~ 332 (534)
T PHA03098 265 NYITNYSPLSEINT-IIDIHY-VYCFGSVVL--NNVIYFIGGMNKNNLS--------VNSVVSYDTKTKSWNKVPELIYP 332 (534)
T ss_pred eeeecchhhhhccc-ccCccc-cccceEEEE--CCEEEEECCCcCCCCe--------eccEEEEeCCCCeeeECCCCCcc
Confidence 34567888888877 444442 344566665 9999999995322211 679999999 9999999999
Q ss_pred ccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCC
Q psy14331 102 RCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDN 181 (265)
Q Consensus 102 r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~ 181 (265)
|..|+++.+ +++||++||.... ...+++++||+.+++|+.++++ |.+|..|+++.++++||++||....
T Consensus 333 R~~~~~~~~---~~~lyv~GG~~~~------~~~~~v~~yd~~~~~W~~~~~l--p~~r~~~~~~~~~~~iYv~GG~~~~ 401 (534)
T PHA03098 333 RKNPGVTVF---NNRIYVIGGIYNS------ISLNTVESWKPGESKWREEPPL--IFPRYNPCVVNVNNLIYVIGGISKN 401 (534)
T ss_pred cccceEEEE---CCEEEEEeCCCCC------EecceEEEEcCCCCceeeCCCc--CcCCccceEEEECCEEEEECCcCCC
Confidence 999999888 8999999997632 2378899999999999998876 5899999999999999999997543
Q ss_pred CCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecCcccc
Q psy14331 182 LREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPDSKTL 261 (265)
Q Consensus 182 ~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~~~~w 261 (265)
. ..++++++||+.+++|+.+.+ .|.+|.+++++.. +++|||+||.+.... ....+.+++||+.+++|
T Consensus 402 ~---~~~~~v~~yd~~t~~W~~~~~---~p~~r~~~~~~~~-~~~iyv~GG~~~~~~------~~~~~~v~~yd~~~~~W 468 (534)
T PHA03098 402 D---ELLKTVECFSLNTNKWSKGSP---LPISHYGGCAIYH-DGKIYVIGGISYIDN------IKVYNIVESYNPVTNKW 468 (534)
T ss_pred C---cccceEEEEeCCCCeeeecCC---CCccccCceEEEE-CCEEEEECCccCCCC------CcccceEEEecCCCCce
Confidence 3 236889999999999999987 4788988988776 899999999864321 11234588888888888
Q ss_pred cc
Q psy14331 262 RS 263 (265)
Q Consensus 262 ~~ 263 (265)
+.
T Consensus 469 ~~ 470 (534)
T PHA03098 469 TE 470 (534)
T ss_pred ee
Confidence 74
No 17
>KOG4693|consensus
Probab=99.97 E-value=5.7e-30 Score=195.03 Aligned_cols=206 Identities=20% Similarity=0.309 Sum_probs=166.7
Q ss_pred cccccHHHHHHHHHHhhcccceEEEe--ccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe-
Q psy14331 15 MFNEDIEKIVRDIEAEEKRKNKVIEK--VVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL- 91 (265)
Q Consensus 15 ~~~~~~~~~~~~~~~~d~~~~~w~~~--~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~- 91 (265)
++-.+-+.+-+.+++||++++.|+.. ....|..|-+|++|+. ++.+|+|||+..+ ...+ ++++.++|.
T Consensus 95 GGRND~egaCN~Ly~fDp~t~~W~~p~v~G~vPgaRDGHsAcV~--gn~MyiFGGye~~--a~~F-----S~d~h~ld~~ 165 (392)
T KOG4693|consen 95 GGRNDDEGACNLLYEFDPETNVWKKPEVEGFVPGARDGHSACVW--GNQMYIFGGYEED--AQRF-----SQDTHVLDFA 165 (392)
T ss_pred cCccCcccccceeeeeccccccccccceeeecCCccCCceeeEE--CcEEEEecChHHH--HHhh-----hccceeEecc
Confidence 34344445778899999999999753 3478999999999997 9999999996333 3333 789999999
Q ss_pred ---eecC---CCCCCCccccceEEEecCCCEEEEEcCcccCCCcc---ccccCCceeEEEcCCCcEEEEecCC-CCCCCc
Q psy14331 92 ---FVSA---PGAPPPRCSHQMVALSADKGQLWMFGGEFSSASES---QFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRS 161 (265)
Q Consensus 92 ---w~~~---~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~---~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~ 161 (265)
|... ...|.=|..|+++++ ++.+|+|||+.+..... ...+.+.+..+|+.|+.|...+... .|..|.
T Consensus 166 TmtWr~~~Tkg~PprwRDFH~a~~~---~~~MYiFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~GRR 242 (392)
T KOG4693|consen 166 TMTWREMHTKGDPPRWRDFHTASVI---DGMMYIFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPGGRR 242 (392)
T ss_pred ceeeeehhccCCCchhhhhhhhhhc---cceEEEeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCCccc
Confidence 7654 445666899999988 89999999985432211 1135678889999999999986665 689999
Q ss_pred CceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccc
Q psy14331 162 GHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSK 235 (265)
Q Consensus 162 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~ 235 (265)
.|++.+.++++|+|||.++... ..++++|+|||.+..|..+...+..|.+|..+.+++. ++++|+|||...
T Consensus 243 SHS~fvYng~~Y~FGGYng~ln--~HfndLy~FdP~t~~W~~I~~~Gk~P~aRRRqC~~v~-g~kv~LFGGTsP 313 (392)
T KOG4693|consen 243 SHSTFVYNGKMYMFGGYNGTLN--VHFNDLYCFDPKTSMWSVISVRGKYPSARRRQCSVVS-GGKVYLFGGTSP 313 (392)
T ss_pred ccceEEEcceEEEecccchhhh--hhhcceeecccccchheeeeccCCCCCcccceeEEEE-CCEEEEecCCCC
Confidence 9999999999999999987643 5589999999999999999999999999988777666 999999999764
No 18
>KOG4152|consensus
Probab=99.96 E-value=1.4e-28 Score=202.33 Aligned_cols=240 Identities=22% Similarity=0.333 Sum_probs=189.7
Q ss_pred cceeeeecccccHHHHHHHHHHhhcccceEEEecc------CCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCc
Q psy14331 8 HAMSWTIMFNEDIEKIVRDIEAEEKRKNKVIEKVV------PEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPK 81 (265)
Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~w~~~~~------~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~ 81 (265)
+.-.|..++..+..+..++++++..+.-.|..+-+ ++|+||.+|+...+ +++.|+|||...+.+......+.
T Consensus 91 GtrilvFGGMvEYGkYsNdLYELQasRWeWkrlkp~~p~nG~pPCPRlGHSFsl~--gnKcYlFGGLaNdseDpknNvPr 168 (830)
T KOG4152|consen 91 GTRILVFGGMVEYGKYSNDLYELQASRWEWKRLKPKTPKNGPPPCPRLGHSFSLV--GNKCYLFGGLANDSEDPKNNVPR 168 (830)
T ss_pred CceEEEEccEeeeccccchHHHhhhhhhhHhhcCCCCCCCCCCCCCccCceeEEe--ccEeEEeccccccccCcccccch
Confidence 33445556777888899999999988888877543 68999999999986 99999999974444443344677
Q ss_pred ccCceEEEEe--------ee---cCCCCCCCccccceEEEecCCC---EEEEEcCcccCCCccccccCCceeEEEcCCCc
Q psy14331 82 ALDHLILMNL--------FV---SAPGAPPPRCSHQMVALSADKG---QLWMFGGEFSSASESQFHHYKDLWVFRMGEKK 147 (265)
Q Consensus 82 ~~~~~~~yd~--------w~---~~~~~p~~r~~~~~~~~~~~~~---~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~ 147 (265)
+++++|.+++ |. ....+|.+|.+|+++++...++ ++||+||.... .+.|+|.+|+++-.
T Consensus 169 YLnDlY~leL~~Gsgvv~W~ip~t~Gv~P~pRESHTAViY~eKDs~~skmvvyGGM~G~-------RLgDLW~Ldl~Tl~ 241 (830)
T KOG4152|consen 169 YLNDLYILELRPGSGVVAWDIPITYGVLPPPRESHTAVIYTEKDSKKSKMVVYGGMSGC-------RLGDLWTLDLDTLT 241 (830)
T ss_pred hhcceEEEEeccCCceEEEecccccCCCCCCcccceeEEEEeccCCcceEEEEcccccc-------cccceeEEecceee
Confidence 8999999998 65 3466899999999999965544 79999998654 38999999999999
Q ss_pred EEEEecCC-CCCCCcCceEEEECCEEEEEcCccCC----------CCccccCCceEEEEcCCCceEEeccC----CCCCC
Q psy14331 148 WEKIVCKD-TPPSRSGHRMIALKKHLVVFGGFHDN----------LREAKYYNDVHIFDLETYAWKKIEPL----GAGPA 212 (265)
Q Consensus 148 W~~~~~~~-~p~~~~~~~~~~~~~~iyv~GG~~~~----------~~~~~~~~~v~~y~~~~~~W~~~~~~----~~~p~ 212 (265)
|.+....+ .|.||.-|+++.+++++|||||+-.. .......+.+-++|+.+..|+.+-.. ...|.
T Consensus 242 W~kp~~~G~~PlPRSLHsa~~IGnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~clNldt~~W~tl~~d~~ed~tiPR 321 (830)
T KOG4152|consen 242 WNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLACLNLDTMAWETLLMDTLEDNTIPR 321 (830)
T ss_pred cccccccCCCCCCcccccceeecceeEEecceeeeeccccccccccceeeeccceeeeeecchheeeeeecccccccccc
Confidence 99987655 68999999999999999999998321 11124457788999999999977432 22789
Q ss_pred CCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecCcc
Q psy14331 213 PRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPDSK 259 (265)
Q Consensus 213 ~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~~~ 259 (265)
+|.+|.++++ +.++||..|++... +......+-.|+|-+|+...
T Consensus 322 ~RAGHCAvAi-gtRlYiWSGRDGYr--KAwnnQVCCkDlWyLdTekP 365 (830)
T KOG4152|consen 322 ARAGHCAVAI-GTRLYIWSGRDGYR--KAWNNQVCCKDLWYLDTEKP 365 (830)
T ss_pred ccccceeEEe-ccEEEEEeccchhh--HhhccccchhhhhhhcccCC
Confidence 9999998888 89999999987543 55555677889999988653
No 19
>KOG0379|consensus
Probab=99.96 E-value=1.7e-28 Score=211.98 Aligned_cols=209 Identities=29% Similarity=0.498 Sum_probs=175.4
Q ss_pred HHHHhhcccceEEEecc--CCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecC---C
Q psy14331 26 DIEAEEKRKNKVIEKVV--PEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSA---P 96 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~--~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~---~ 96 (265)
+++.+|..+..|....+ ..|.+|.+|.++++ +++||+|||... .... ++++++||+ |... .
T Consensus 89 dl~~~d~~~~~w~~~~~~g~~p~~r~g~~~~~~--~~~l~lfGG~~~--~~~~------~~~l~~~d~~t~~W~~l~~~~ 158 (482)
T KOG0379|consen 89 DLYVLDLESQLWTKPAATGDEPSPRYGHSLSAV--GDKLYLFGGTDK--KYRN------LNELHSLDLSTRTWSLLSPTG 158 (482)
T ss_pred eeEEeecCCcccccccccCCCCCcccceeEEEE--CCeEEEEccccC--CCCC------hhheEeccCCCCcEEEecCcC
Confidence 49999999999977444 67899999999997 899999999632 1211 689999999 6543 4
Q ss_pred CCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEECCEEEEE
Q psy14331 97 GAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIALKKHLVVF 175 (265)
Q Consensus 97 ~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~~~~iyv~ 175 (265)
..|.+|.+|++++. +++|||+||...... ..+++++||+++.+|.++...+ .|.||.+|+++++++++++|
T Consensus 159 ~~P~~r~~Hs~~~~---g~~l~vfGG~~~~~~-----~~ndl~i~d~~~~~W~~~~~~g~~P~pR~gH~~~~~~~~~~v~ 230 (482)
T KOG0379|consen 159 DPPPPRAGHSATVV---GTKLVVFGGIGGTGD-----SLNDLHIYDLETSTWSELDTQGEAPSPRYGHAMVVVGNKLLVF 230 (482)
T ss_pred CCCCCcccceEEEE---CCEEEEECCccCccc-----ceeeeeeeccccccceecccCCCCCCCCCCceEEEECCeEEEE
Confidence 46899999999988 899999999876542 4899999999999999997766 68899999999999999999
Q ss_pred cCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEe
Q psy14331 176 GGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLT 255 (265)
Q Consensus 176 GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~ 255 (265)
||.+.. ..+++++|.+|+.+..|..+...+..|.+|+.|.++.. +++++|+||..... -.+..+.|.|+
T Consensus 231 gG~~~~---~~~l~D~~~ldl~~~~W~~~~~~g~~p~~R~~h~~~~~-~~~~~l~gG~~~~~-------~~~l~~~~~l~ 299 (482)
T KOG0379|consen 231 GGGDDG---DVYLNDVHILDLSTWEWKLLPTGGDLPSPRSGHSLTVS-GDHLLLFGGGTDPK-------QEPLGDLYGLD 299 (482)
T ss_pred eccccC---CceecceEeeecccceeeeccccCCCCCCcceeeeEEE-CCEEEEEcCCcccc-------ccccccccccc
Confidence 998732 26799999999999999999999999999999999854 89999999987641 02567788888
Q ss_pred cCcccccc
Q psy14331 256 PDSKTLRS 263 (265)
Q Consensus 256 ~~~~~w~~ 263 (265)
.....|..
T Consensus 300 ~~~~~w~~ 307 (482)
T KOG0379|consen 300 LETLVWSK 307 (482)
T ss_pred ccccceee
Confidence 88888764
No 20
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.96 E-value=4.9e-28 Score=202.94 Aligned_cols=212 Identities=16% Similarity=0.158 Sum_probs=147.9
Q ss_pred HHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCC-c--e---eeCC-----------------
Q psy14331 23 IVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQ-K--F---VFGS----------------- 79 (265)
Q Consensus 23 ~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~-~--~---~~~~----------------- 79 (265)
.++++++||+.+++|..+..++|.+|.+|+++.+ .+++||++||...... . . ....
T Consensus 83 ~~~~v~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~-~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 161 (346)
T TIGR03547 83 VFDDVYRYDPKKNSWQKLDTRSPVGLLGASGFSL-HNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPP 161 (346)
T ss_pred ecccEEEEECCCCEEecCCCCCCCcccceeEEEE-eCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCCh
Confidence 4678999999999999844356777877776622 4999999999531100 0 0 0000
Q ss_pred --CcccCceEEEEe----eecCCCCCC-CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEE--cCCCcEEE
Q psy14331 80 --PKALDHLILMNL----FVSAPGAPP-PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFR--MGEKKWEK 150 (265)
Q Consensus 80 --~~~~~~~~~yd~----w~~~~~~p~-~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd--~~~~~W~~ 150 (265)
...++++++||+ |...++||. +|..++++.+ +++|||+||...... ...+++.|+ +.+++|+.
T Consensus 162 ~~~~~~~~v~~YDp~t~~W~~~~~~p~~~r~~~~~~~~---~~~iyv~GG~~~~~~-----~~~~~~~y~~~~~~~~W~~ 233 (346)
T TIGR03547 162 EDYFWNKNVLSYDPSTNQWRNLGENPFLGTAGSAIVHK---GNKLLLINGEIKPGL-----RTAEVKQYLFTGGKLEWNK 233 (346)
T ss_pred hHcCccceEEEEECCCCceeECccCCCCcCCCceEEEE---CCEEEEEeeeeCCCc-----cchheEEEEecCCCceeee
Confidence 001478999999 999999996 6888887777 899999999754321 134566665 57789999
Q ss_pred EecCCCCC-----CCcCceEEEECCEEEEEcCccCCCCc-------------cccCCceEEEEcCCCceEEeccCCCCCC
Q psy14331 151 IVCKDTPP-----SRSGHRMIALKKHLVVFGGFHDNLRE-------------AKYYNDVHIFDLETYAWKKIEPLGAGPA 212 (265)
Q Consensus 151 ~~~~~~p~-----~~~~~~~~~~~~~iyv~GG~~~~~~~-------------~~~~~~v~~y~~~~~~W~~~~~~~~~p~ 212 (265)
+++++.|. .+..|++++++++|||+||.+..... ......+++||+++++|+.+.++ |.
T Consensus 234 ~~~m~~~r~~~~~~~~~~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~l---p~ 310 (346)
T TIGR03547 234 LPPLPPPKSSSQEGLAGAFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSEVYALDNGKWSKVGKL---PQ 310 (346)
T ss_pred cCCCCCCCCCccccccEEeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEeeEEEecCCcccccCCC---CC
Confidence 98875321 12355577889999999998532110 01124689999999999999885 77
Q ss_pred CCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEe
Q psy14331 213 PRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLT 255 (265)
Q Consensus 213 ~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~ 255 (265)
+|..++++++ +++|||+||.+... ...++++.+.
T Consensus 311 ~~~~~~~~~~-~~~iyv~GG~~~~~--------~~~~~v~~~~ 344 (346)
T TIGR03547 311 GLAYGVSVSW-NNGVLLIGGENSGG--------KAVTDVYLLS 344 (346)
T ss_pred CceeeEEEEc-CCEEEEEeccCCCC--------CEeeeEEEEE
Confidence 8887776555 99999999987543 2455666553
No 21
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.96 E-value=5.4e-28 Score=204.31 Aligned_cols=217 Identities=18% Similarity=0.179 Sum_probs=152.5
Q ss_pred HHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCce----------------e-----e----
Q psy14331 23 IVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKF----------------V-----F---- 77 (265)
Q Consensus 23 ~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~----------------~-----~---- 77 (265)
.++++++||+.+++|..+....|.++.+|+++++ .+++||++||........ . +
T Consensus 104 ~~~~v~~YD~~~n~W~~~~~~~p~~~~~~~~~~~-~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~ 182 (376)
T PRK14131 104 VFDDVYKYDPKTNSWQKLDTRSPVGLAGHVAVSL-HNGKAYITGGVNKNIFDGYFEDLAAAGKDKTPKDKINDAYFDKKP 182 (376)
T ss_pred EcccEEEEeCCCCEEEeCCCCCCCcccceEEEEe-eCCEEEEECCCCHHHHHHHHhhhhhcccchhhhhhhHHHHhcCCh
Confidence 4678999999999999944345777777876662 389999999953210000 0 0
Q ss_pred CCCcccCceEEEEe----eecCCCCCC-CccccceEEEecCCCEEEEEcCcccCCCccccccCCcee--EEEcCCCcEEE
Q psy14331 78 GSPKALDHLILMNL----FVSAPGAPP-PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLW--VFRMGEKKWEK 150 (265)
Q Consensus 78 ~~~~~~~~~~~yd~----w~~~~~~p~-~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~--~yd~~~~~W~~ 150 (265)
......+++++||+ |...+++|. +|..|+++.+ +++|||+||....+.. ..+++ .||+++++|+.
T Consensus 183 ~~~~~~~~v~~YD~~t~~W~~~~~~p~~~~~~~a~v~~---~~~iYv~GG~~~~~~~-----~~~~~~~~~~~~~~~W~~ 254 (376)
T PRK14131 183 EDYFFNKEVLSYDPSTNQWKNAGESPFLGTAGSAVVIK---GNKLWLINGEIKPGLR-----TDAVKQGKFTGNNLKWQK 254 (376)
T ss_pred hhcCcCceEEEEECCCCeeeECCcCCCCCCCcceEEEE---CCEEEEEeeeECCCcC-----ChhheEEEecCCCcceee
Confidence 00001478999999 999999996 6777777766 8999999997543321 34444 56778999999
Q ss_pred EecCCCCCCCc--------CceEEEECCEEEEEcCccCCCCc-------------cccCCceEEEEcCCCceEEeccCCC
Q psy14331 151 IVCKDTPPSRS--------GHRMIALKKHLVVFGGFHDNLRE-------------AKYYNDVHIFDLETYAWKKIEPLGA 209 (265)
Q Consensus 151 ~~~~~~p~~~~--------~~~~~~~~~~iyv~GG~~~~~~~-------------~~~~~~v~~y~~~~~~W~~~~~~~~ 209 (265)
+++++ .+|. ++.+++++++|||+||.+..... ......+++||+++++|+.+..
T Consensus 255 ~~~~p--~~~~~~~~~~~~~~~a~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~--- 329 (376)
T PRK14131 255 LPDLP--PAPGGSSQEGVAGAFAGYSNGVLLVAGGANFPGARENYQNGKLYAHEGLKKSWSDEIYALVNGKWQKVGE--- 329 (376)
T ss_pred cCCCC--CCCcCCcCCccceEeceeECCEEEEeeccCCCCChhhhhcCCcccccCCcceeehheEEecCCcccccCc---
Confidence 98774 4432 23356779999999997542210 0011246789999999999887
Q ss_pred CCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecCccccc
Q psy14331 210 GPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPDSKTLR 262 (265)
Q Consensus 210 ~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~~~~w~ 262 (265)
+|.+|..++++++ +++|||+||..... ...++++.|.+..+.++
T Consensus 330 lp~~r~~~~av~~-~~~iyv~GG~~~~~--------~~~~~v~~~~~~~~~~~ 373 (376)
T PRK14131 330 LPQGLAYGVSVSW-NNGVLLIGGETAGG--------KAVSDVTLLSWDGKKLT 373 (376)
T ss_pred CCCCccceEEEEe-CCEEEEEcCCCCCC--------cEeeeEEEEEEcCCEEE
Confidence 4888988877666 99999999986432 35667888887776654
No 22
>KOG0379|consensus
Probab=99.96 E-value=8.9e-28 Score=207.57 Aligned_cols=191 Identities=28% Similarity=0.520 Sum_probs=162.1
Q ss_pred cCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eec---CCCCCCCccccceEEEecCC
Q psy14331 42 VPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVS---APGAPPPRCSHQMVALSADK 114 (265)
Q Consensus 42 ~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~---~~~~p~~r~~~~~~~~~~~~ 114 (265)
...|.+|..|+++.+ ++++|||||........ -.++|++|. |.. ....|.+|++|+++++ +
T Consensus 55 ~~~p~~R~~hs~~~~--~~~~~vfGG~~~~~~~~-------~~dl~~~d~~~~~w~~~~~~g~~p~~r~g~~~~~~---~ 122 (482)
T KOG0379|consen 55 GVGPIPRAGHSAVLI--GNKLYVFGGYGSGDRLT-------DLDLYVLDLESQLWTKPAATGDEPSPRYGHSLSAV---G 122 (482)
T ss_pred CCCcchhhccceeEE--CCEEEEECCCCCCCccc-------cceeEEeecCCcccccccccCCCCCcccceeEEEE---C
Confidence 357899999999997 99999999963333321 115999999 543 3446789999999999 8
Q ss_pred CEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEE
Q psy14331 115 GQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHI 193 (265)
Q Consensus 115 ~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~ 193 (265)
++||++||..... ..+++++.||+.+++|..+.+.. .|++|.+|+++++++++|||||.+.... ..+++|+
T Consensus 123 ~~l~lfGG~~~~~-----~~~~~l~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~---~~ndl~i 194 (482)
T KOG0379|consen 123 DKLYLFGGTDKKY-----RNLNELHSLDLSTRTWSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGD---SLNDLHI 194 (482)
T ss_pred CeEEEEccccCCC-----CChhheEeccCCCCcEEEecCcCCCCCCcccceEEEECCEEEEECCccCccc---ceeeeee
Confidence 9999999987522 34789999999999999998887 4999999999999999999999977652 5899999
Q ss_pred EEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcc-ccccccccccceeeceEEEEecCccccc
Q psy14331 194 FDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYS-KQSVKKDVDKGIVHTDTFLLTPDSKTLR 262 (265)
Q Consensus 194 y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~-~~~~~~~~~~~~~~~~~w~~~~~~~~w~ 262 (265)
||+.+.+|.++...+..|.+|.+|+++++ +++++|+||.+ .+ ...+|+|.+|+.+..|.
T Consensus 195 ~d~~~~~W~~~~~~g~~P~pR~gH~~~~~-~~~~~v~gG~~~~~---------~~l~D~~~ldl~~~~W~ 254 (482)
T KOG0379|consen 195 YDLETSTWSELDTQGEAPSPRYGHAMVVV-GNKLLVFGGGDDGD---------VYLNDVHILDLSTWEWK 254 (482)
T ss_pred eccccccceecccCCCCCCCCCCceEEEE-CCeEEEEeccccCC---------ceecceEeeecccceee
Confidence 99999999999999999999999999888 89999998887 33 37889999999999887
No 23
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.96 E-value=1.6e-27 Score=199.77 Aligned_cols=171 Identities=20% Similarity=0.309 Sum_probs=132.5
Q ss_pred cCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCC-CCccccceEEEecCC
Q psy14331 42 VPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAP-PPRCSHQMVALSADK 114 (265)
Q Consensus 42 ~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p-~~r~~~~~~~~~~~~ 114 (265)
+++|.+|..++++++ +++|||+||. . .+++++||+ |..+++|| .+|..++++++ +
T Consensus 2 ~~lp~~~~~~~~~~~--~~~vyv~GG~--~-----------~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~---~ 63 (346)
T TIGR03547 2 PDLPVGFKNGTGAII--GDKVYVGLGS--A-----------GTSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAAI---D 63 (346)
T ss_pred CCCCccccCceEEEE--CCEEEEEccc--c-----------CCeeEEEECCCCCCCceECCCCCCCCcccceEEEE---C
Confidence 468899998877676 9999999994 1 257889985 99999999 58999988887 8
Q ss_pred CEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEE-EECCEEEEEcCccCCCC----------
Q psy14331 115 GQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMI-ALKKHLVVFGGFHDNLR---------- 183 (265)
Q Consensus 115 ~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~-~~~~~iyv~GG~~~~~~---------- 183 (265)
++|||+||............++++++||+.+++|+.++. ..|.+|.+|+++ +.+++||++||.+....
T Consensus 64 ~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~~W~~~~~-~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~ 142 (346)
T TIGR03547 64 GKLYVFGGIGKANSEGSPQVFDDVYRYDPKKNSWQKLDT-RSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAA 142 (346)
T ss_pred CEEEEEeCCCCCCCCCcceecccEEEEECCCCEEecCCC-CCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhc
Confidence 999999997533211111246889999999999999974 346777777776 68999999999853210
Q ss_pred --------------------ccccCCceEEEEcCCCceEEeccCCCCCC-CCCCceEEEeCCCeEEEEcCccc
Q psy14331 184 --------------------EAKYYNDVHIFDLETYAWKKIEPLGAGPA-PRSGCQMAATPDGKILISGGYSK 235 (265)
Q Consensus 184 --------------------~~~~~~~v~~y~~~~~~W~~~~~~~~~p~-~~~~~~~~~~~~~~iyi~GG~~~ 235 (265)
.....+++++||+.+++|+.+.++ |. +|.+++++++ +++|||+||...
T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~YDp~t~~W~~~~~~---p~~~r~~~~~~~~-~~~iyv~GG~~~ 211 (346)
T TIGR03547 143 DKDSEPKDKLIAAYFSQPPEDYFWNKNVLSYDPSTNQWRNLGEN---PFLGTAGSAIVHK-GNKLLLINGEIK 211 (346)
T ss_pred CccchhhhhhHHHHhCCChhHcCccceEEEEECCCCceeECccC---CCCcCCCceEEEE-CCEEEEEeeeeC
Confidence 001247899999999999999885 54 6778887766 899999999864
No 24
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.95 E-value=3.6e-26 Score=193.26 Aligned_cols=177 Identities=19% Similarity=0.328 Sum_probs=133.9
Q ss_pred eEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCC-CCccccceE
Q psy14331 36 KVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAP-PPRCSHQMV 108 (265)
Q Consensus 36 ~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p-~~r~~~~~~ 108 (265)
.+.. .+++|.+|..++.+++ +++|||+||. . .+.+++||+ |.+.+++| .+|..++++
T Consensus 18 ~~~~-l~~lP~~~~~~~~~~~--~~~iyv~gG~--~-----------~~~~~~~d~~~~~~~W~~l~~~p~~~r~~~~~v 81 (376)
T PRK14131 18 NAEQ-LPDLPVPFKNGTGAID--NNTVYVGLGS--A-----------GTSWYKLDLNAPSKGWTKIAAFPGGPREQAVAA 81 (376)
T ss_pred eccc-CCCCCcCccCCeEEEE--CCEEEEEeCC--C-----------CCeEEEEECCCCCCCeEECCcCCCCCcccceEE
Confidence 3444 6789999888877775 9999999993 1 245788886 99999998 589888888
Q ss_pred EEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEE-ECCEEEEEcCccCCC-----
Q psy14331 109 ALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIA-LKKHLVVFGGFHDNL----- 182 (265)
Q Consensus 109 ~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~-~~~~iyv~GG~~~~~----- 182 (265)
.+ +++|||+||............++++++||+.+++|+.++.. .|.++.+|++++ .+++||++||.....
T Consensus 82 ~~---~~~IYV~GG~~~~~~~~~~~~~~~v~~YD~~~n~W~~~~~~-~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~ 157 (376)
T PRK14131 82 FI---DGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTR-SPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYF 157 (376)
T ss_pred EE---CCEEEEEcCCCCCCCCCceeEcccEEEEeCCCCEEEeCCCC-CCCcccceEEEEeeCCEEEEECCCCHHHHHHHH
Confidence 77 89999999976411111112368999999999999999753 367777788777 799999999975310
Q ss_pred -------------------------CccccCCceEEEEcCCCceEEeccCCCCCC-CCCCceEEEeCCCeEEEEcCcccc
Q psy14331 183 -------------------------REAKYYNDVHIFDLETYAWKKIEPLGAGPA-PRSGCQMAATPDGKILISGGYSKQ 236 (265)
Q Consensus 183 -------------------------~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~-~~~~~~~~~~~~~~iyi~GG~~~~ 236 (265)
......+++++||+.+++|+.+.++ |. +|.+++++.+ +++|||+||....
T Consensus 158 ~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~YD~~t~~W~~~~~~---p~~~~~~~a~v~~-~~~iYv~GG~~~~ 233 (376)
T PRK14131 158 EDLAAAGKDKTPKDKINDAYFDKKPEDYFFNKEVLSYDPSTNQWKNAGES---PFLGTAGSAVVIK-GNKLWLINGEIKP 233 (376)
T ss_pred hhhhhcccchhhhhhhHHHHhcCChhhcCcCceEEEEECCCCeeeECCcC---CCCCCCcceEEEE-CCEEEEEeeeECC
Confidence 0001357899999999999998875 53 6777777666 8999999997543
No 25
>PHA02790 Kelch-like protein; Provisional
Probab=99.95 E-value=6e-26 Score=197.10 Aligned_cols=164 Identities=15% Similarity=0.221 Sum_probs=136.0
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFH 133 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~ 133 (265)
.++.||++||. +... ..+.+++||+ |...++||.+|..++++.+ +++||++||...
T Consensus 270 ~~~~lyviGG~--~~~~-------~~~~v~~Ydp~~~~W~~~~~m~~~r~~~~~v~~---~~~iYviGG~~~-------- 329 (480)
T PHA02790 270 VGEVVYLIGGW--MNNE-------IHNNAIAVNYISNNWIPIPPMNSPRLYASGVPA---NNKLYVVGGLPN-------- 329 (480)
T ss_pred ECCEEEEEcCC--CCCC-------cCCeEEEEECCCCEEEECCCCCchhhcceEEEE---CCEEEEECCcCC--------
Confidence 38999999995 2221 1688999999 9999999999999988877 999999999642
Q ss_pred cCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCC
Q psy14331 134 HYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAP 213 (265)
Q Consensus 134 ~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~ 213 (265)
.+++++||+.+++|+.++++ |.+|..|++++++++||++||.+.. .+.+++|||.+++|+.++++ |.+
T Consensus 330 -~~sve~ydp~~n~W~~~~~l--~~~r~~~~~~~~~g~IYviGG~~~~------~~~ve~ydp~~~~W~~~~~m---~~~ 397 (480)
T PHA02790 330 -PTSVERWFHGDAAWVNMPSL--LKPRCNPAVASINNVIYVIGGHSET------DTTTEYLLPNHDQWQFGPST---YYP 397 (480)
T ss_pred -CCceEEEECCCCeEEECCCC--CCCCcccEEEEECCEEEEecCcCCC------CccEEEEeCCCCEEEeCCCC---CCc
Confidence 25689999999999999877 5899999999999999999997543 36789999999999999884 788
Q ss_pred CCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecCccccc
Q psy14331 214 RSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPDSKTLR 262 (265)
Q Consensus 214 ~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~~~~w~ 262 (265)
|..++++++ +++||++||. .+.|++..+.|..-+++..-|
T Consensus 398 r~~~~~~~~-~~~IYv~GG~--------~e~ydp~~~~W~~~~~m~~~r 437 (480)
T PHA02790 398 HYKSCALVF-GRRLFLVGRN--------AEFYCESSNTWTLIDDPIYPR 437 (480)
T ss_pred cccceEEEE-CCEEEEECCc--------eEEecCCCCcEeEcCCCCCCc
Confidence 988888777 9999999984 345777778887776665433
No 26
>KOG4152|consensus
Probab=99.91 E-value=6.1e-23 Score=169.40 Aligned_cols=221 Identities=22% Similarity=0.406 Sum_probs=164.9
Q ss_pred HHHHHHHhhcccceEEEec--cCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecC-
Q psy14331 23 IVRDIEAEEKRKNKVIEKV--VPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSA- 95 (265)
Q Consensus 23 ~~~~~~~~d~~~~~w~~~~--~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~- 95 (265)
+...+..||..++.|.... ...|.+-+.|..+. ++.+||+|||...-+.. +|++|.+.. |..+
T Consensus 55 iiDELHvYNTatnqWf~PavrGDiPpgcAA~Gfvc--dGtrilvFGGMvEYGkY--------sNdLYELQasRWeWkrlk 124 (830)
T KOG4152|consen 55 IIDELHVYNTATNQWFAPAVRGDIPPGCAAFGFVC--DGTRILVFGGMVEYGKY--------SNDLYELQASRWEWKRLK 124 (830)
T ss_pred chhhhhhhccccceeecchhcCCCCCchhhcceEe--cCceEEEEccEeeeccc--------cchHHHhhhhhhhHhhcC
Confidence 6788999999999997532 25566666666555 59999999996444443 788887777 4433
Q ss_pred ------CCCCCCccccceEEEecCCCEEEEEcCcccC---CCccccccCCceeEEEcCCC----cEEEEecCC-CCCCCc
Q psy14331 96 ------PGAPPPRCSHQMVALSADKGQLWMFGGEFSS---ASESQFHHYKDLWVFRMGEK----KWEKIVCKD-TPPSRS 161 (265)
Q Consensus 96 ------~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~---~~~~~~~~~~~~~~yd~~~~----~W~~~~~~~-~p~~~~ 161 (265)
...|-||.+|+..++ +++-|+|||...+ ..++.-.+++|++.+++.-+ .|...-..+ .|.+|.
T Consensus 125 p~~p~nG~pPCPRlGHSFsl~---gnKcYlFGGLaNdseDpknNvPrYLnDlY~leL~~Gsgvv~W~ip~t~Gv~P~pRE 201 (830)
T KOG4152|consen 125 PKTPKNGPPPCPRLGHSFSLV---GNKCYLFGGLANDSEDPKNNVPRYLNDLYILELRPGSGVVAWDIPITYGVLPPPRE 201 (830)
T ss_pred CCCCCCCCCCCCccCceeEEe---ccEeEEeccccccccCcccccchhhcceEEEEeccCCceEEEecccccCCCCCCcc
Confidence 234568999999888 8999999998432 22334467999999998744 488765444 789999
Q ss_pred CceEEEE------CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccc
Q psy14331 162 GHRMIAL------KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSK 235 (265)
Q Consensus 162 ~~~~~~~------~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~ 235 (265)
.|++++. ..++||+||.++.. +.++|.+|+++.+|.+....+..|.+|+-|++..+ ++++|||||.-.
T Consensus 202 SHTAViY~eKDs~~skmvvyGGM~G~R-----LgDLW~Ldl~Tl~W~kp~~~G~~PlPRSLHsa~~I-GnKMyvfGGWVP 275 (830)
T KOG4152|consen 202 SHTAVIYTEKDSKKSKMVVYGGMSGCR-----LGDLWTLDLDTLTWNKPSLSGVAPLPRSLHSATTI-GNKMYVFGGWVP 275 (830)
T ss_pred cceeEEEEeccCCcceEEEEccccccc-----ccceeEEecceeecccccccCCCCCCcccccceee-cceeEEecceee
Confidence 9999887 23899999998865 78999999999999999999989999999998877 999999999753
Q ss_pred cc-----cccccccceeeceEEEEecCccccc
Q psy14331 236 QS-----VKKDVDKGIVHTDTFLLTPDSKTLR 262 (265)
Q Consensus 236 ~~-----~~~~~~~~~~~~~~w~~~~~~~~w~ 262 (265)
.. .........+.+..=+++.++..|.
T Consensus 276 l~~~~~~~~~hekEWkCTssl~clNldt~~W~ 307 (830)
T KOG4152|consen 276 LVMDDVKVATHEKEWKCTSSLACLNLDTMAWE 307 (830)
T ss_pred eeccccccccccceeeeccceeeeeecchhee
Confidence 21 1111222334444445666666775
No 27
>KOG2437|consensus
Probab=99.67 E-value=5e-17 Score=134.37 Aligned_cols=195 Identities=19% Similarity=0.364 Sum_probs=146.2
Q ss_pred hcccceEEEecc---------CCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecC--
Q psy14331 31 EKRKNKVIEKVV---------PEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSA-- 95 (265)
Q Consensus 31 d~~~~~w~~~~~---------~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~-- 95 (265)
.+-+..|.+... .-|..|.+|+++..+..+.+|+.|| .++... +.++|.|+. |+..
T Consensus 235 ~ey~~~W~~i~~~~~~~~~~~~~p~~RgGHQMV~~~~~~CiYLYGG--WdG~~~-------l~DFW~Y~v~e~~W~~iN~ 305 (723)
T KOG2437|consen 235 QEYKPRWSQIIPKSTKGDGEDNRPGMRGGHQMVIDVQTECVYLYGG--WDGTQD-------LADFWAYSVKENQWTCINR 305 (723)
T ss_pred ccccccccccCchhhcccccccCccccCcceEEEeCCCcEEEEecC--cccchh-------HHHHHhhcCCcceeEEeec
Confidence 445566866433 3488999999999877789999999 677665 789999999 7643
Q ss_pred -CCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecC----CCCCCCcCceEEEECC
Q psy14331 96 -PGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCK----DTPPSRSGHRMIALKK 170 (265)
Q Consensus 96 -~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~----~~p~~~~~~~~~~~~~ 170 (265)
...|..|+.|.++. +....+||+.|-.........+...+++|+||..++.|..+.-. +.|...+.|.|++.++
T Consensus 306 ~t~~PG~RsCHRMVi-d~S~~KLYLlG~Y~~sS~r~~~s~RsDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd~~ 384 (723)
T KOG2437|consen 306 DTEGPGARSCHRMVI-DISRRKLYLLGRYLDSSVRNSKSLRSDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVDSE 384 (723)
T ss_pred CCCCCcchhhhhhhh-hhhHhHHhhhhhccccccccccccccceEEEecCCceeEEecccccccCCcceeecceeeEecC
Confidence 33788899999874 33446999999876655555556688999999999999988533 2588899999999988
Q ss_pred E--EEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCC-------CCCCCCCceEEEeCC-CeEEEEcCcccc
Q psy14331 171 H--LVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGA-------GPAPRSGCQMAATPD-GKILISGGYSKQ 236 (265)
Q Consensus 171 ~--iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~-------~p~~~~~~~~~~~~~-~~iyi~GG~~~~ 236 (265)
+ +|||||..-.... ..+..+|.||.....|+.+..... ....|.+|.+-...+ +++|++||....
T Consensus 385 k~~iyVfGGr~~~~~e-~~f~GLYaf~~~~~~w~~l~e~~~~~~~vvE~~~sR~ghcmE~~~~n~~ly~fggq~s~ 459 (723)
T KOG2437|consen 385 KHMIYVFGGRILTCNE-PQFSGLYAFNCQCQTWKLLREDSCNAGPVVEDIQSRIGHCMEFHSKNRCLYVFGGQRSK 459 (723)
T ss_pred cceEEEecCeeccCCC-ccccceEEEecCCccHHHHHHHHhhcCcchhHHHHHHHHHHHhcCCCCeEEeccCcccc
Confidence 7 9999998544321 346789999999999997765311 123466666655544 479999997543
No 28
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=99.52 E-value=4e-13 Score=107.43 Aligned_cols=174 Identities=19% Similarity=0.306 Sum_probs=127.7
Q ss_pred ccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCCC-CccccceEEEecC
Q psy14331 41 VVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAPP-PRCSHQMVALSAD 113 (265)
Q Consensus 41 ~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p~-~r~~~~~~~~~~~ 113 (265)
.+.+|.+-..-+-+.+ ++.+||-=| .. -...+.+|+ |++....|- +|....++++
T Consensus 30 lPdlPvg~KnG~Ga~i--g~~~YVGLG--s~-----------G~afy~ldL~~~~k~W~~~a~FpG~~rnqa~~a~~--- 91 (381)
T COG3055 30 LPDLPVGFKNGAGALI--GDTVYVGLG--SA-----------GTAFYVLDLKKPGKGWTKIADFPGGARNQAVAAVI--- 91 (381)
T ss_pred CCCCCcccccccccee--cceEEEEec--cC-----------CccceehhhhcCCCCceEcccCCCcccccchheee---
Confidence 4556777665555554 889999777 22 246677787 999999885 5777777766
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCC---------
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLR--------- 183 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~--------- 183 (265)
+++||++||...... ......+++++||+.+++|+++.... |....++.++..++ +||++||.+....
T Consensus 92 ~~kLyvFgG~Gk~~~-~~~~~~nd~Y~y~p~~nsW~kl~t~s-P~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~ 169 (381)
T COG3055 92 GGKLYVFGGYGKSVS-SSPQVFNDAYRYDPSTNSWHKLDTRS-PTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGA 169 (381)
T ss_pred CCeEEEeeccccCCC-CCceEeeeeEEecCCCChhheecccc-ccccccceeEecCCceEEEEccccHHhhhhhHHhhhh
Confidence 999999999865444 22356899999999999999997654 67777788888877 9999999743211
Q ss_pred ---------------------ccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccc
Q psy14331 184 ---------------------EAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQS 237 (265)
Q Consensus 184 ---------------------~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~ 237 (265)
.+.+..++..|+|.+++|+.+... +..++++.++ +..++++.++-|.-.+.
T Consensus 170 a~~d~~~~~~i~~~yf~~~~~dy~~n~ev~sy~p~~n~W~~~G~~--pf~~~aGsa~-~~~~n~~~lInGEiKpG 241 (381)
T COG3055 170 AGKDKEAVDKIIAHYFDKKAEDYFFNKEVLSYDPSTNQWRNLGEN--PFYGNAGSAV-VIKGNKLTLINGEIKPG 241 (381)
T ss_pred hcccHHHHHHHHHHHhCCCHHHhcccccccccccccchhhhcCcC--cccCccCcce-eecCCeEEEEcceecCC
Confidence 112446899999999999999864 4566777554 45577788888875443
No 29
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=99.49 E-value=1.1e-12 Score=105.03 Aligned_cols=202 Identities=17% Similarity=0.175 Sum_probs=136.0
Q ss_pred HHHHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCc-------------e----------ee
Q psy14331 21 EKIVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQK-------------F----------VF 77 (265)
Q Consensus 21 ~~~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~-------------~----------~~ 77 (265)
.+..++.+.|||++++|..+....|....+++++.. .+.++|++||...+-.. . ..
T Consensus 109 ~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~-~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i~~~yf~~ 187 (381)
T COG3055 109 PQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSL-NGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKIIAHYFDK 187 (381)
T ss_pred ceEeeeeEEecCCCChhheeccccccccccceeEec-CCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHHHHHHhCC
Confidence 345678999999999999977788888888888886 45599999995110000 0 00
Q ss_pred CCC--cccCceEEEEe----eecCCCCC-CCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCC--CcE
Q psy14331 78 GSP--KALDHLILMNL----FVSAPGAP-PPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGE--KKW 148 (265)
Q Consensus 78 ~~~--~~~~~~~~yd~----w~~~~~~p-~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~--~~W 148 (265)
++. ....+++.|++ |......| .++++.+.+.- ++++.++-|+--.+. .+..+.+++... -+|
T Consensus 188 ~~~dy~~n~ev~sy~p~~n~W~~~G~~pf~~~aGsa~~~~---~n~~~lInGEiKpGL-----Rt~~~k~~~~~~~~~~w 259 (381)
T COG3055 188 KAEDYFFNKEVLSYDPSTNQWRNLGENPFYGNAGSAVVIK---GNKLTLINGEIKPGL-----RTAEVKQADFGGDNLKW 259 (381)
T ss_pred CHHHhcccccccccccccchhhhcCcCcccCccCcceeec---CCeEEEEcceecCCc-----cccceeEEEeccCceee
Confidence 001 11567888999 88888776 45666544433 778999988765553 255666777653 479
Q ss_pred EEEecCCCCCC-----CcCceEEEECCEEEEEcCccCC---------------CCccccCCceEEEEcCCCceEEeccCC
Q psy14331 149 EKIVCKDTPPS-----RSGHRMIALKKHLVVFGGFHDN---------------LREAKYYNDVHIFDLETYAWKKIEPLG 208 (265)
Q Consensus 149 ~~~~~~~~p~~-----~~~~~~~~~~~~iyv~GG~~~~---------------~~~~~~~~~v~~y~~~~~~W~~~~~~~ 208 (265)
+.+...+.|.. ..++-.-..++.+.+.||..-. +......++||.+| .+.|+.+..+
T Consensus 260 ~~l~~lp~~~~~~~eGvAGaf~G~s~~~~lv~GGAnF~Ga~~~y~~Gk~~AH~Gl~K~w~~~Vy~~d--~g~Wk~~GeL- 336 (381)
T COG3055 260 LKLSDLPAPIGSNKEGVAGAFSGKSNGEVLVAGGANFPGALKAYKNGKFYAHEGLSKSWNSEVYIFD--NGSWKIVGEL- 336 (381)
T ss_pred eeccCCCCCCCCCccccceeccceeCCeEEEecCCCChhHHHHHHhcccccccchhhhhhceEEEEc--CCceeeeccc-
Confidence 99876653222 2222233347889999986322 12234567899998 9999999985
Q ss_pred CCCCCCCCceEEEeCCCeEEEEcCccccc
Q psy14331 209 AGPAPRSGCQMAATPDGKILISGGYSKQS 237 (265)
Q Consensus 209 ~~p~~~~~~~~~~~~~~~iyi~GG~~~~~ 237 (265)
|.++..-..+.. ++++|++||.+...
T Consensus 337 --p~~l~YG~s~~~-nn~vl~IGGE~~~G 362 (381)
T COG3055 337 --PQGLAYGVSLSY-NNKVLLIGGETSGG 362 (381)
T ss_pred --CCCccceEEEec-CCcEEEEccccCCC
Confidence 776665544444 89999999998664
No 30
>KOG2437|consensus
Probab=99.37 E-value=1.3e-12 Score=108.67 Aligned_cols=151 Identities=24% Similarity=0.412 Sum_probs=115.6
Q ss_pred CCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEE--CCEEEE
Q psy14331 98 APPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIAL--KKHLVV 174 (265)
Q Consensus 98 ~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~--~~~iyv 174 (265)
-|..|.+|+++.. ..++.||++||.+..+. +.++|.|+...+.|..+..-. .|-.|.+|.|+.. ..++|+
T Consensus 257 ~p~~RgGHQMV~~-~~~~CiYLYGGWdG~~~------l~DFW~Y~v~e~~W~~iN~~t~~PG~RsCHRMVid~S~~KLYL 329 (723)
T KOG2437|consen 257 RPGMRGGHQMVID-VQTECVYLYGGWDGTQD------LADFWAYSVKENQWTCINRDTEGPGARSCHRMVIDISRRKLYL 329 (723)
T ss_pred CccccCcceEEEe-CCCcEEEEecCcccchh------HHHHHhhcCCcceeEEeecCCCCCcchhhhhhhhhhhHhHHhh
Confidence 3667999999866 33469999999987653 899999999999999987665 7899999999887 459999
Q ss_pred EcCccCCCCc--cccCCceEEEEcCCCceEEeccCC---CCCCCCCCceEEEeCCCe--EEEEcCcccccccccccccee
Q psy14331 175 FGGFHDNLRE--AKYYNDVHIFDLETYAWKKIEPLG---AGPAPRSGCQMAATPDGK--ILISGGYSKQSVKKDVDKGIV 247 (265)
Q Consensus 175 ~GG~~~~~~~--~~~~~~v~~y~~~~~~W~~~~~~~---~~p~~~~~~~~~~~~~~~--iyi~GG~~~~~~~~~~~~~~~ 247 (265)
.|-.-+.... ...-.++|+||..++.|..+.... ..|...+.|.|++. +++ +||+||+....- -..
T Consensus 330 lG~Y~~sS~r~~~s~RsDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd-~~k~~iyVfGGr~~~~~------e~~ 402 (723)
T KOG2437|consen 330 LGRYLDSSVRNSKSLRSDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVD-SEKHMIYVFGGRILTCN------EPQ 402 (723)
T ss_pred hhhccccccccccccccceEEEecCCceeEEecccccccCCcceeecceeeEe-cCcceEEEecCeeccCC------Ccc
Confidence 9976443221 233468999999999999887432 24777788999876 555 999999864321 124
Q ss_pred eceEEEEecCccccc
Q psy14331 248 HTDTFLLTPDSKTLR 262 (265)
Q Consensus 248 ~~~~w~~~~~~~~w~ 262 (265)
...+|.|+.....|.
T Consensus 403 f~GLYaf~~~~~~w~ 417 (723)
T KOG2437|consen 403 FSGLYAFNCQCQTWK 417 (723)
T ss_pred ccceEEEecCCccHH
Confidence 556889998888885
No 31
>PF13964 Kelch_6: Kelch motif
Probab=99.30 E-value=1.1e-11 Score=73.33 Aligned_cols=50 Identities=40% Similarity=0.897 Sum_probs=43.7
Q ss_pred CCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCC
Q psy14331 159 SRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPR 214 (265)
Q Consensus 159 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~ 214 (265)
+|.+|++++++++|||+||.... ....+++++||+++++|++++++ |.+|
T Consensus 1 pR~~~s~v~~~~~iyv~GG~~~~---~~~~~~v~~yd~~t~~W~~~~~m---p~pR 50 (50)
T PF13964_consen 1 PRYGHSAVVVGGKIYVFGGYDNS---GKYSNDVERYDPETNTWEQLPPM---PTPR 50 (50)
T ss_pred CCccCEEEEECCEEEEECCCCCC---CCccccEEEEcCCCCcEEECCCC---CCCC
Confidence 68899999999999999999774 25689999999999999999985 6665
No 32
>PF13964 Kelch_6: Kelch motif
Probab=99.23 E-value=2.9e-11 Score=71.40 Aligned_cols=50 Identities=26% Similarity=0.503 Sum_probs=42.4
Q ss_pred CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCC
Q psy14331 101 PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSR 160 (265)
Q Consensus 101 ~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~ 160 (265)
||..|+++++ +++|||+||..... ..++++++||+++++|+.+++++ .+|
T Consensus 1 pR~~~s~v~~---~~~iyv~GG~~~~~-----~~~~~v~~yd~~t~~W~~~~~mp--~pR 50 (50)
T PF13964_consen 1 PRYGHSAVVV---GGKIYVFGGYDNSG-----KYSNDVERYDPETNTWEQLPPMP--TPR 50 (50)
T ss_pred CCccCEEEEE---CCEEEEECCCCCCC-----CccccEEEEcCCCCcEEECCCCC--CCC
Confidence 6889999988 99999999986641 45899999999999999998774 554
No 33
>PLN02772 guanylate kinase
Probab=99.08 E-value=1e-09 Score=91.20 Aligned_cols=90 Identities=22% Similarity=0.328 Sum_probs=76.3
Q ss_pred CCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcccc
Q psy14331 157 PPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQ 236 (265)
Q Consensus 157 p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~ 236 (265)
+.++.+++++.+++++||+||.+... +..+.+++||+.+.+|......+..|.+|.+|+++++.+++|+|+++...
T Consensus 22 ~~~~~~~tav~igdk~yv~GG~~d~~---~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~~- 97 (398)
T PLN02772 22 VKPKNRETSVTIGDKTYVIGGNHEGN---TLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGSA- 97 (398)
T ss_pred CCCCCcceeEEECCEEEEEcccCCCc---cccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEeCCCC-
Confidence 46888999999999999999976643 34789999999999999999999999999999999998889999987643
Q ss_pred ccccccccceeeceEEEEecCccc
Q psy14331 237 SVKKDVDKGIVHTDTFLLTPDSKT 260 (265)
Q Consensus 237 ~~~~~~~~~~~~~~~w~~~~~~~~ 260 (265)
+.+++|.+..++..
T Consensus 98 ----------~~~~~w~l~~~t~~ 111 (398)
T PLN02772 98 ----------PDDSIWFLEVDTPF 111 (398)
T ss_pred ----------CccceEEEEcCCHH
Confidence 24678888777653
No 34
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.06 E-value=3e-10 Score=66.18 Aligned_cols=46 Identities=20% Similarity=0.427 Sum_probs=39.8
Q ss_pred CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecC
Q psy14331 101 PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCK 154 (265)
Q Consensus 101 ~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~ 154 (265)
||..|+++++ +++||++||..... ..++++++||+.+++|+.+++|
T Consensus 1 pR~~~~~~~~---~~~iyv~GG~~~~~-----~~~~~v~~yd~~~~~W~~~~~m 46 (47)
T PF01344_consen 1 PRSGHAAVVV---GNKIYVIGGYDGNN-----QPTNSVEVYDPETNTWEELPPM 46 (47)
T ss_dssp -BBSEEEEEE---TTEEEEEEEBESTS-----SBEEEEEEEETTTTEEEEEEEE
T ss_pred CCccCEEEEE---CCEEEEEeeecccC-----ceeeeEEEEeCCCCEEEEcCCC
Confidence 6889999988 99999999987722 3589999999999999999876
No 35
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=99.06 E-value=5.9e-10 Score=65.41 Aligned_cols=48 Identities=42% Similarity=0.769 Sum_probs=40.3
Q ss_pred CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEE
Q psy14331 169 KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAA 221 (265)
Q Consensus 169 ~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~ 221 (265)
+++||||||.+... ...++++|+||+.+++|+++.. .|.+|.+|++++
T Consensus 1 g~~~~vfGG~~~~~--~~~~nd~~~~~~~~~~W~~~~~---~P~~R~~h~~~~ 48 (49)
T PF13415_consen 1 GNKLYVFGGYDDDG--GTRLNDVWVFDLDTNTWTRIGD---LPPPRSGHTATV 48 (49)
T ss_pred CCEEEEECCcCCCC--CCEecCEEEEECCCCEEEECCC---CCCCccceEEEE
Confidence 57899999998421 2668999999999999999955 699999999875
No 36
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=99.06 E-value=6.3e-10 Score=65.28 Aligned_cols=48 Identities=35% Similarity=0.708 Sum_probs=40.0
Q ss_pred CCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccC
Q psy14331 159 SRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 159 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
+|.+|++++++++||||||.. ........++++.||+++++|+.++++
T Consensus 1 ~r~~hs~~~~~~kiyv~GG~~-~~~~~~~~~~v~~~d~~t~~W~~~~~~ 48 (49)
T PF07646_consen 1 PRYGHSAVVLDGKIYVFGGYG-TDNGGSSSNDVWVFDTETNQWTELSPM 48 (49)
T ss_pred CccceEEEEECCEEEEECCcc-cCCCCcccceeEEEECCCCEEeecCCC
Confidence 588999999999999999991 111125689999999999999999875
No 37
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.05 E-value=3.6e-10 Score=65.81 Aligned_cols=46 Identities=33% Similarity=0.720 Sum_probs=40.5
Q ss_pred CCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccC
Q psy14331 159 SRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 159 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
||..|++++++++|||+||..... ...+++++||+.+++|+.++++
T Consensus 1 pR~~~~~~~~~~~iyv~GG~~~~~---~~~~~v~~yd~~~~~W~~~~~m 46 (47)
T PF01344_consen 1 PRSGHAAVVVGNKIYVIGGYDGNN---QPTNSVEVYDPETNTWEELPPM 46 (47)
T ss_dssp -BBSEEEEEETTEEEEEEEBESTS---SBEEEEEEEETTTTEEEEEEEE
T ss_pred CCccCEEEEECCEEEEEeeecccC---ceeeeEEEEeCCCCEEEEcCCC
Confidence 588999999999999999998732 5689999999999999999885
No 38
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=99.01 E-value=6.2e-10 Score=65.40 Aligned_cols=47 Identities=30% Similarity=0.760 Sum_probs=30.4
Q ss_pred CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecC
Q psy14331 101 PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCK 154 (265)
Q Consensus 101 ~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~ 154 (265)
||..|+++.+. +++|||+||..... ..++++++||+++++|++++++
T Consensus 1 pR~~h~~~~~~--~~~i~v~GG~~~~~-----~~~~d~~~~d~~~~~W~~~~~~ 47 (49)
T PF13418_consen 1 PRYGHSAVSIG--DNSIYVFGGRDSSG-----SPLNDLWIFDIETNTWTRLPSM 47 (49)
T ss_dssp --BS-EEEEE---TTEEEEE--EEE-T-----EE---EEEEETTTTEEEE--SS
T ss_pred CcceEEEEEEe--CCeEEEECCCCCCC-----cccCCEEEEECCCCEEEECCCC
Confidence 69999999884 58999999997764 3589999999999999999654
No 39
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=98.99 E-value=1.5e-09 Score=63.72 Aligned_cols=49 Identities=33% Similarity=0.752 Sum_probs=39.8
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL 168 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~ 168 (265)
+++|||+||...... ..++++++||+.+++|+++.. .|.+|.+|+++++
T Consensus 1 g~~~~vfGG~~~~~~----~~~nd~~~~~~~~~~W~~~~~--~P~~R~~h~~~~i 49 (49)
T PF13415_consen 1 GNKLYVFGGYDDDGG----TRLNDVWVFDLDTNTWTRIGD--LPPPRSGHTATVI 49 (49)
T ss_pred CCEEEEECCcCCCCC----CEecCEEEEECCCCEEEECCC--CCCCccceEEEEC
Confidence 478999999874221 458999999999999999944 4799999999863
No 40
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.97 E-value=1.1e-09 Score=64.32 Aligned_cols=46 Identities=33% Similarity=0.691 Sum_probs=30.8
Q ss_pred CCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccC
Q psy14331 159 SRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 159 ~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
||.+|+++.+ +++||||||.+... ..++++++||+.+++|+++.++
T Consensus 1 pR~~h~~~~~~~~~i~v~GG~~~~~---~~~~d~~~~d~~~~~W~~~~~~ 47 (49)
T PF13418_consen 1 PRYGHSAVSIGDNSIYVFGGRDSSG---SPLNDLWIFDIETNTWTRLPSM 47 (49)
T ss_dssp --BS-EEEEE-TTEEEEE--EEE-T---EE---EEEEETTTTEEEE--SS
T ss_pred CcceEEEEEEeCCeEEEECCCCCCC---cccCCEEEEECCCCEEEECCCC
Confidence 6889999998 58999999997764 4689999999999999999664
No 41
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.97 E-value=2.1e-09 Score=63.03 Aligned_cols=48 Identities=29% Similarity=0.685 Sum_probs=39.0
Q ss_pred CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecC
Q psy14331 101 PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCK 154 (265)
Q Consensus 101 ~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~ 154 (265)
||+.|+++++ +++|||+||....... ...+++++||+++++|+.++++
T Consensus 1 ~r~~hs~~~~---~~kiyv~GG~~~~~~~---~~~~~v~~~d~~t~~W~~~~~~ 48 (49)
T PF07646_consen 1 PRYGHSAVVL---DGKIYVFGGYGTDNGG---SSSNDVWVFDTETNQWTELSPM 48 (49)
T ss_pred CccceEEEEE---CCEEEEECCcccCCCC---cccceeEEEECCCCEEeecCCC
Confidence 6889999988 9999999998111111 3589999999999999999865
No 42
>PLN02772 guanylate kinase
Probab=98.90 E-value=1.6e-08 Score=84.19 Aligned_cols=87 Identities=13% Similarity=0.260 Sum_probs=71.3
Q ss_pred CCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEE-CCEEEEEc
Q psy14331 99 PPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIAL-KKHLVVFG 176 (265)
Q Consensus 99 p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~-~~~iyv~G 176 (265)
+.++..|+++.+ ++++||+||.++.. ...+.+++||+.+.+|......+ .|.+|.+|+++++ +++|+|++
T Consensus 22 ~~~~~~~tav~i---gdk~yv~GG~~d~~-----~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~ 93 (398)
T PLN02772 22 VKPKNRETSVTI---GDKTYVIGGNHEGN-----TLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIK 93 (398)
T ss_pred CCCCCcceeEEE---CCEEEEEcccCCCc-----cccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEe
Confidence 568899999988 99999999976643 13789999999999999887666 6899999999999 67999999
Q ss_pred CccCCCCccccCCceEEEEcCCC
Q psy14331 177 GFHDNLREAKYYNDVHIFDLETY 199 (265)
Q Consensus 177 G~~~~~~~~~~~~~v~~y~~~~~ 199 (265)
+-... -+++|.+...+.
T Consensus 94 ~~~~~------~~~~w~l~~~t~ 110 (398)
T PLN02772 94 KGSAP------DDSIWFLEVDTP 110 (398)
T ss_pred CCCCC------ccceEEEEcCCH
Confidence 76544 367888877663
No 43
>PF13854 Kelch_5: Kelch motif
Probab=98.81 E-value=1.5e-08 Score=57.12 Aligned_cols=41 Identities=29% Similarity=0.762 Sum_probs=35.0
Q ss_pred CCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCC
Q psy14331 157 PPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETY 199 (265)
Q Consensus 157 p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~ 199 (265)
|.+|..|++++++++|||+||... .....++++|+||+.+.
T Consensus 2 P~~R~~hs~~~~~~~iyi~GG~~~--~~~~~~~d~~~l~l~sf 42 (42)
T PF13854_consen 2 PSPRYGHSAVVVGNNIYIFGGYSG--NNNSYSNDLYVLDLPSF 42 (42)
T ss_pred CCCccceEEEEECCEEEEEcCccC--CCCCEECcEEEEECCCC
Confidence 789999999999999999999985 12256899999998763
No 44
>smart00612 Kelch Kelch domain.
Probab=98.72 E-value=2e-08 Score=58.20 Aligned_cols=47 Identities=23% Similarity=0.503 Sum_probs=38.4
Q ss_pred EEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC
Q psy14331 116 QLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK 170 (265)
Q Consensus 116 ~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~ 170 (265)
+||++||.... ...+++++||+.+++|+.++++ |.+|..|+++++++
T Consensus 1 ~iyv~GG~~~~------~~~~~v~~yd~~~~~W~~~~~~--~~~r~~~~~~~~~g 47 (47)
T smart00612 1 KIYVVGGFDGG------QRLKSVEVYDPETNKWTPLPSM--PTPRSGHGVAVING 47 (47)
T ss_pred CEEEEeCCCCC------ceeeeEEEECCCCCeEccCCCC--CCccccceEEEeCC
Confidence 48999997542 2378999999999999998876 58999999887754
No 45
>PF13854 Kelch_5: Kelch motif
Probab=98.69 E-value=5.2e-08 Score=54.92 Aligned_cols=41 Identities=27% Similarity=0.552 Sum_probs=34.3
Q ss_pred CCCCccccceEEEecCCCEEEEEcCccc-CCCccccccCCceeEEEcCCC
Q psy14331 98 APPPRCSHQMVALSADKGQLWMFGGEFS-SASESQFHHYKDLWVFRMGEK 146 (265)
Q Consensus 98 ~p~~r~~~~~~~~~~~~~~iyv~GG~~~-~~~~~~~~~~~~~~~yd~~~~ 146 (265)
+|.+|..|+++.+ +++||++||... .+ ..++++|+||+.++
T Consensus 1 ~P~~R~~hs~~~~---~~~iyi~GG~~~~~~-----~~~~d~~~l~l~sf 42 (42)
T PF13854_consen 1 IPSPRYGHSAVVV---GNNIYIFGGYSGNNN-----SYSNDLYVLDLPSF 42 (42)
T ss_pred CCCCccceEEEEE---CCEEEEEcCccCCCC-----CEECcEEEEECCCC
Confidence 4889999999988 899999999874 22 45899999998763
No 46
>smart00612 Kelch Kelch domain.
Probab=98.61 E-value=7.3e-08 Score=55.80 Aligned_cols=45 Identities=33% Similarity=0.667 Sum_probs=37.6
Q ss_pred EEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEe
Q psy14331 171 HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAAT 222 (265)
Q Consensus 171 ~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~ 222 (265)
+||++||.... ...+++++||+.+++|+.+++ +|.+|..++++++
T Consensus 1 ~iyv~GG~~~~----~~~~~v~~yd~~~~~W~~~~~---~~~~r~~~~~~~~ 45 (47)
T smart00612 1 KIYVVGGFDGG----QRLKSVEVYDPETNKWTPLPS---MPTPRSGHGVAVI 45 (47)
T ss_pred CEEEEeCCCCC----ceeeeEEEECCCCCeEccCCC---CCCccccceEEEe
Confidence 48999998652 347899999999999999987 4888998988766
No 47
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=98.60 E-value=1.6e-05 Score=62.55 Aligned_cols=165 Identities=13% Similarity=0.143 Sum_probs=95.2
Q ss_pred EEEEECcEEeCCCceeeCCCcccCceEEEEe---------------eecCCCCCCCccccceEEEecCCCE-EEEEcCcc
Q psy14331 61 QLILFGGEFYDGQKFVFGSPKALDHLILMNL---------------FVSAPGAPPPRCSHQMVALSADKGQ-LWMFGGEF 124 (265)
Q Consensus 61 ~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~---------------w~~~~~~p~~r~~~~~~~~~~~~~~-iyv~GG~~ 124 (265)
..++-||.+.+++. ++++|.... -.-.++.|.+|++|++-++...++. ..+|||+.
T Consensus 40 ~YlIHGGrTPNNEl--------S~~LY~ls~~s~~cNkK~tl~C~EKeLvGdvP~aRYGHt~~vV~SrGKta~VlFGGRS 111 (337)
T PF03089_consen 40 QYLIHGGRTPNNEL--------SSSLYILSVDSRGCNKKVTLCCQEKELVGDVPEARYGHTINVVHSRGKTACVLFGGRS 111 (337)
T ss_pred eEEecCCcCCCccc--------ccceEEEEeecCCCCceeEEEEecceecCCCCcccccceEEEEEECCcEEEEEECCcc
Confidence 34455665444443 677888777 2335788999999999888665544 56789973
Q ss_pred cC--CCcc------ccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEc
Q psy14331 125 SS--ASES------QFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDL 196 (265)
Q Consensus 125 ~~--~~~~------~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~ 196 (265)
-- ..++ .......++..|++-+..+.-....+-.....|-+.+.++.+|++||..-..+. --..+++...
T Consensus 112 Y~P~~qRTTenWNsVvDC~P~VfLiDleFGC~tah~lpEl~dG~SFHvslar~D~VYilGGHsl~sd~--Rpp~l~rlkV 189 (337)
T PF03089_consen 112 YMPPGQRTTENWNSVVDCPPQVFLIDLEFGCCTAHTLPELQDGQSFHVSLARNDCVYILGGHSLESDS--RPPRLYRLKV 189 (337)
T ss_pred cCCccccchhhcceeccCCCeEEEEeccccccccccchhhcCCeEEEEEEecCceEEEEccEEccCCC--CCCcEEEEEE
Confidence 11 1100 112345688888888776654333233566677788889999999998543221 0123444432
Q ss_pred CC---CceEEeccCCCCCCCCCCce--EEEeCCCeEEEEcCcccccc
Q psy14331 197 ET---YAWKKIEPLGAGPAPRSGCQ--MAATPDGKILISGGYSKQSV 238 (265)
Q Consensus 197 ~~---~~W~~~~~~~~~p~~~~~~~--~~~~~~~~iyi~GG~~~~~~ 238 (265)
+- .-...+.. ++.+.+..+ ++.+...+.+|+||+..+..
T Consensus 190 dLllGSP~vsC~v---l~~glSisSAIvt~~~~~e~iIlGGY~sdsQ 233 (337)
T PF03089_consen 190 DLLLGSPAVSCTV---LQGGLSISSAIVTQTGPHEYIILGGYQSDSQ 233 (337)
T ss_pred eecCCCceeEEEE---CCCCceEeeeeEeecCCCceEEEecccccce
Confidence 21 22333333 222332222 23343467889999976653
No 48
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=98.02 E-value=0.00025 Score=55.93 Aligned_cols=103 Identities=16% Similarity=0.184 Sum_probs=64.6
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCC----CcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccccC
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGE----KKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYY 188 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~----~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~ 188 (265)
+|++++.||.... ...+-.|++.+ ..|.+.... +-.+|+..++..+ +++++|+||....
T Consensus 77 dG~ll~tGG~~~G--------~~~ir~~~p~~~~~~~~w~e~~~~-m~~~RWYpT~~~L~DG~vlIvGG~~~~------- 140 (243)
T PF07250_consen 77 DGRLLQTGGDNDG--------NKAIRIFTPCTSDGTCDWTESPND-MQSGRWYPTATTLPDGRVLIVGGSNNP------- 140 (243)
T ss_pred CCCEEEeCCCCcc--------ccceEEEecCCCCCCCCceECccc-ccCCCccccceECCCCCEEEEeCcCCC-------
Confidence 8999999997542 34566788765 579887542 3488999888888 7899999998522
Q ss_pred CceEEEEcCC------CceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcc
Q psy14331 189 NDVHIFDLET------YAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYS 234 (265)
Q Consensus 189 ~~v~~y~~~~------~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~ 234 (265)
.+.|-|.. ..|..+.............-+....+++||+++...
T Consensus 141 --t~E~~P~~~~~~~~~~~~~l~~~~~~~~~nlYP~~~llPdG~lFi~an~~ 190 (243)
T PF07250_consen 141 --TYEFWPPKGPGPGPVTLPFLSQTSDTLPNNLYPFVHLLPDGNLFIFANRG 190 (243)
T ss_pred --cccccCCccCCCCceeeecchhhhccCccccCceEEEcCCCCEEEEEcCC
Confidence 22333331 123323221111112233445567799999998764
No 49
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=97.87 E-value=0.00068 Score=53.62 Aligned_cols=137 Identities=15% Similarity=0.181 Sum_probs=77.0
Q ss_pred eccCCCCCCCceeeEEe-cCC-CEEEEECcEEeCCCce-----eeCCCcccCceEEEEe------eecCCCCCCCccccc
Q psy14331 40 KVVPEPTRRANFSFLAH-PDK-DQLILFGGEFYDGQKF-----VFGSPKALDHLILMNL------FVSAPGAPPPRCSHQ 106 (265)
Q Consensus 40 ~~~~~p~~r~~~~~~~~-~~~-~~lyv~GG~~~~~~~~-----~~~~~~~~~~~~~yd~------w~~~~~~p~~r~~~~ 106 (265)
+++..|.+|++|++.++ +.+ ..+++|||..+-.... ......+...++.+|+ -..++.+....+.|-
T Consensus 80 LvGdvP~aRYGHt~~vV~SrGKta~VlFGGRSY~P~~qRTTenWNsVvDC~P~VfLiDleFGC~tah~lpEl~dG~SFHv 159 (337)
T PF03089_consen 80 LVGDVPEARYGHTINVVHSRGKTACVLFGGRSYMPPGQRTTENWNSVVDCPPQVFLIDLEFGCCTAHTLPELQDGQSFHV 159 (337)
T ss_pred ecCCCCcccccceEEEEEECCcEEEEEECCcccCCccccchhhcceeccCCCeEEEEeccccccccccchhhcCCeEEEE
Confidence 35689999999997765 223 3678899963322111 1123345667888888 344555666778888
Q ss_pred eEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcC--CC-cEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCC
Q psy14331 107 MVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMG--EK-KWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNL 182 (265)
Q Consensus 107 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~--~~-~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~ 182 (265)
+.+- ++.+|++||..-... ..-..+++..++ .+ -.-....+..-.+...+.++.. ....+|+||+..+.
T Consensus 160 slar---~D~VYilGGHsl~sd----~Rpp~l~rlkVdLllGSP~vsC~vl~~glSisSAIvt~~~~~e~iIlGGY~sds 232 (337)
T PF03089_consen 160 SLAR---NDCVYILGGHSLESD----SRPPRLYRLKVDLLLGSPAVSCTVLQGGLSISSAIVTQTGPHEYIILGGYQSDS 232 (337)
T ss_pred EEec---CceEEEEccEEccCC----CCCCcEEEEEEeecCCCceeEEEECCCCceEeeeeEeecCCCceEEEecccccc
Confidence 7766 899999999743221 123456655432 22 1222222210011222122222 45788899987654
Q ss_pred C
Q psy14331 183 R 183 (265)
Q Consensus 183 ~ 183 (265)
+
T Consensus 233 Q 233 (337)
T PF03089_consen 233 Q 233 (337)
T ss_pred e
Confidence 3
No 50
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=97.77 E-value=0.00014 Score=57.39 Aligned_cols=88 Identities=18% Similarity=0.227 Sum_probs=67.0
Q ss_pred eeEEEcCCCcEEEEecCCCCCCCcCceEEE-ECCEEEEEcCccCCCCccccCCceEEEEcCC----CceEEeccCCCCCC
Q psy14331 138 LWVFRMGEKKWEKIVCKDTPPSRSGHRMIA-LKKHLVVFGGFHDNLREAKYYNDVHIFDLET----YAWKKIEPLGAGPA 212 (265)
Q Consensus 138 ~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~-~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~----~~W~~~~~~~~~p~ 212 (265)
-..||+.+++++.+... ...+|.+.+. -++.+++.||.... ...+..|++.+ ..|.+.... +..
T Consensus 48 s~~yD~~tn~~rpl~v~---td~FCSgg~~L~dG~ll~tGG~~~G------~~~ir~~~p~~~~~~~~w~e~~~~--m~~ 116 (243)
T PF07250_consen 48 SVEYDPNTNTFRPLTVQ---TDTFCSGGAFLPDGRLLQTGGDNDG------NKAIRIFTPCTSDGTCDWTESPND--MQS 116 (243)
T ss_pred EEEEecCCCcEEeccCC---CCCcccCcCCCCCCCEEEeCCCCcc------ccceEEEecCCCCCCCCceECccc--ccC
Confidence 55899999999887654 3444444433 38899999998653 35677788765 679887654 578
Q ss_pred CCCCceEEEeCCCeEEEEcCcccc
Q psy14331 213 PRSGCQMAATPDGKILISGGYSKQ 236 (265)
Q Consensus 213 ~~~~~~~~~~~~~~iyi~GG~~~~ 236 (265)
+|...++..+.+++++|+||....
T Consensus 117 ~RWYpT~~~L~DG~vlIvGG~~~~ 140 (243)
T PF07250_consen 117 GRWYPTATTLPDGRVLIVGGSNNP 140 (243)
T ss_pred CCccccceECCCCCEEEEeCcCCC
Confidence 899999999999999999998744
No 51
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=97.28 E-value=0.048 Score=43.00 Aligned_cols=109 Identities=17% Similarity=0.254 Sum_probs=66.0
Q ss_pred CCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEE-eccCCCCCCC
Q psy14331 135 YKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKK-IEPLGAGPAP 213 (265)
Q Consensus 135 ~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~-~~~~~~~p~~ 213 (265)
...+++|+..++.|+.+.... +........+.+++.+|-+.-...... ...|..||+.+.+|.. ++. |..
T Consensus 69 ~~~~~Vys~~~~~Wr~~~~~~-~~~~~~~~~v~~~G~lyw~~~~~~~~~----~~~IvsFDl~~E~f~~~i~~----P~~ 139 (230)
T TIGR01640 69 QSEHQVYTLGSNSWRTIECSP-PHHPLKSRGVCINGVLYYLAYTLKTNP----DYFIVSFDVSSERFKEFIPL----PCG 139 (230)
T ss_pred CccEEEEEeCCCCccccccCC-CCccccCCeEEECCEEEEEEEECCCCC----cEEEEEEEcccceEeeeeec----Ccc
Confidence 356889999999999987432 121222236677999988874322110 1269999999999995 544 322
Q ss_pred CC----CceEEEeCCCeEEEEcCccccccccccccceeeceEEEEe-cCcccccc
Q psy14331 214 RS----GCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLT-PDSKTLRS 263 (265)
Q Consensus 214 ~~----~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~-~~~~~w~~ 263 (265)
.. ...++.+ +++|.++....... .=++|.++ .....|.+
T Consensus 140 ~~~~~~~~~L~~~-~G~L~~v~~~~~~~----------~~~IWvl~d~~~~~W~k 183 (230)
T TIGR01640 140 NSDSVDYLSLINY-KGKLAVLKQKKDTN----------NFDLWVLNDAGKQEWSK 183 (230)
T ss_pred ccccccceEEEEE-CCEEEEEEecCCCC----------cEEEEEECCCCCCceeE
Confidence 21 2344444 68888765432111 12588886 33556765
No 52
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=96.98 E-value=0.12 Score=40.68 Aligned_cols=180 Identities=15% Similarity=0.231 Sum_probs=96.0
Q ss_pred HHHHhhcccceEEEeccCCCCCC---Cc--eeeEEec--CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eec
Q psy14331 26 DIEAEEKRKNKVIEKVVPEPTRR---AN--FSFLAHP--DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVS 94 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~~~p~~r---~~--~~~~~~~--~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~ 94 (265)
.+..+||.+++|.. +++++.++ .. .++...+ ..=++..+... ..... ...+.+|+. |..
T Consensus 15 ~~~V~NP~T~~~~~-LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~--~~~~~-------~~~~~Vys~~~~~Wr~ 84 (230)
T TIGR01640 15 RLVVWNPSTGQSRW-LPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDR--SGNRN-------QSEHQVYTLGSNSWRT 84 (230)
T ss_pred cEEEECCCCCCEEe-cCCCCCcccccccceEEEeecccCCcEEEEEEEee--cCCCC-------CccEEEEEeCCCCccc
Confidence 57789999999988 44333221 11 1122211 12244444331 11110 356788888 887
Q ss_pred CCCCCCC-ccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEE-EecCCCCCCC----cCceEEEE
Q psy14331 95 APGAPPP-RCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEK-IVCKDTPPSR----SGHRMIAL 168 (265)
Q Consensus 95 ~~~~p~~-r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~-~~~~~~p~~~----~~~~~~~~ 168 (265)
....+.. ......+.+ +|.||-+.-...... ...+..||+.+.+|+. ++. |... ....++..
T Consensus 85 ~~~~~~~~~~~~~~v~~---~G~lyw~~~~~~~~~------~~~IvsFDl~~E~f~~~i~~---P~~~~~~~~~~~L~~~ 152 (230)
T TIGR01640 85 IECSPPHHPLKSRGVCI---NGVLYYLAYTLKTNP------DYFIVSFDVSSERFKEFIPL---PCGNSDSVDYLSLINY 152 (230)
T ss_pred cccCCCCccccCCeEEE---CCEEEEEEEECCCCC------cEEEEEEEcccceEeeeeec---CccccccccceEEEEE
Confidence 7643322 222225555 899988864322110 1269999999999995 543 2222 13345666
Q ss_pred CCEEEEEcCccCCCCccccCCceEEEE-cCCCceEEeccCCCCCCCCCC---ceEEEeCCCeEEEEcC
Q psy14331 169 KKHLVVFGGFHDNLREAKYYNDVHIFD-LETYAWKKIEPLGAGPAPRSG---CQMAATPDGKILISGG 232 (265)
Q Consensus 169 ~~~iyv~GG~~~~~~~~~~~~~v~~y~-~~~~~W~~~~~~~~~p~~~~~---~~~~~~~~~~iyi~GG 232 (265)
++++.++....... .-+||..+ -....|++.-..+..+..... .......+++|++...
T Consensus 153 ~G~L~~v~~~~~~~-----~~~IWvl~d~~~~~W~k~~~i~~~~~~~~~~~~~~~~~~~~g~I~~~~~ 215 (230)
T TIGR01640 153 KGKLAVLKQKKDTN-----NFDLWVLNDAGKQEWSKLFTVPIPPLPDLVDDNFLSGFTDKGEIVLCCE 215 (230)
T ss_pred CCEEEEEEecCCCC-----cEEEEEECCCCCCceeEEEEEcCcchhhhhhheeEeEEeeCCEEEEEeC
Confidence 88888776532211 24677775 335679987665321222221 1233455778887654
No 53
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=96.75 E-value=0.089 Score=44.26 Aligned_cols=120 Identities=10% Similarity=0.155 Sum_probs=69.8
Q ss_pred CCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCcccc-CCce
Q psy14331 113 DKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKY-YNDV 191 (265)
Q Consensus 113 ~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~-~~~v 191 (265)
.+++|+..+.. ..+.+||+++..-...+.+. .+.....++.++++||++............ ...+
T Consensus 75 ~gskIv~~d~~------------~~t~vyDt~t~av~~~P~l~--~pk~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~F 140 (342)
T PF07893_consen 75 HGSKIVAVDQS------------GRTLVYDTDTRAVATGPRLH--SPKRCPISVSVGDKLYAMDRSPFPEPAGRPDFPCF 140 (342)
T ss_pred cCCeEEEEcCC------------CCeEEEECCCCeEeccCCCC--CCCcceEEEEeCCeEEEeeccCccccccCccceeE
Confidence 38888888653 44889999998877665543 444455677778999999876433211000 0033
Q ss_pred EE--EEc------CC--CceEEeccCCCCCCCCCC-------ceEEEeCCCeEEE-EcCccccccccccccceeeceEEE
Q psy14331 192 HI--FDL------ET--YAWKKIEPLGAGPAPRSG-------CQMAATPDGKILI-SGGYSKQSVKKDVDKGIVHTDTFL 253 (265)
Q Consensus 192 ~~--y~~------~~--~~W~~~~~~~~~p~~~~~-------~~~~~~~~~~iyi-~GG~~~~~~~~~~~~~~~~~~~w~ 253 (265)
+. |++ .. =.|+.+++ +|..+.. .+-+++.+..|+| +-|.. ...|.
T Consensus 141 E~l~~~~~~~~~~~~~~w~W~~LP~---PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~--------------~GTys 203 (342)
T PF07893_consen 141 EALVYRPPPDDPSPEESWSWRSLPP---PPFVRDRRYSDYRITSYAVVDGRTIFVSVNGRR--------------WGTYS 203 (342)
T ss_pred EEeccccccccccCCCcceEEcCCC---CCccccCCcccceEEEEEEecCCeEEEEecCCc--------------eEEEE
Confidence 33 342 12 25777665 2333321 2223333446777 32221 24899
Q ss_pred EecCcccccc
Q psy14331 254 LTPDSKTLRS 263 (265)
Q Consensus 254 ~~~~~~~w~~ 263 (265)
||+...+|++
T Consensus 204 fDt~~~~W~~ 213 (342)
T PF07893_consen 204 FDTESHEWRK 213 (342)
T ss_pred EEcCCcceee
Confidence 9999999986
No 54
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=96.38 E-value=0.45 Score=40.07 Aligned_cols=119 Identities=13% Similarity=0.232 Sum_probs=69.3
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFH 133 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~ 133 (265)
.+.+|+..+. ......||. -...+.++.+...-.++.+ +++||++........... .
T Consensus 75 ~gskIv~~d~---------------~~~t~vyDt~t~av~~~P~l~~pk~~pisv~V---G~~LY~m~~~~~~~~~~~-~ 135 (342)
T PF07893_consen 75 HGSKIVAVDQ---------------SGRTLVYDTDTRAVATGPRLHSPKRCPISVSV---GDKLYAMDRSPFPEPAGR-P 135 (342)
T ss_pred cCCeEEEEcC---------------CCCeEEEECCCCeEeccCCCCCCCcceEEEEe---CCeEEEeeccCccccccC-c
Confidence 4888888876 345789999 4455566665555555555 778999987643322100 0
Q ss_pred cCCceeEE--Ec--------CCCcEEEEecCCCCCCCcC-------ceEEEE-CCEEEE-EcCccCCCCccccCCceEEE
Q psy14331 134 HYKDLWVF--RM--------GEKKWEKIVCKDTPPSRSG-------HRMIAL-KKHLVV-FGGFHDNLREAKYYNDVHIF 194 (265)
Q Consensus 134 ~~~~~~~y--d~--------~~~~W~~~~~~~~p~~~~~-------~~~~~~-~~~iyv-~GG~~~~~~~~~~~~~v~~y 194 (265)
....++.+ +. ..-.|+.+++. |..+.. .+-+++ +..|+| .-+.. ...+.|
T Consensus 136 ~~~~FE~l~~~~~~~~~~~~~~w~W~~LP~P--Pf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~---------~GTysf 204 (342)
T PF07893_consen 136 DFPCFEALVYRPPPDDPSPEESWSWRSLPPP--PFVRDRRYSDYRITSYAVVDGRTIFVSVNGRR---------WGTYSF 204 (342)
T ss_pred cceeEEEeccccccccccCCCcceEEcCCCC--CccccCCcccceEEEEEEecCCeEEEEecCCc---------eEEEEE
Confidence 00144444 31 12367777653 222221 233444 667777 33321 247899
Q ss_pred EcCCCceEEecc
Q psy14331 195 DLETYAWKKIEP 206 (265)
Q Consensus 195 ~~~~~~W~~~~~ 206 (265)
|+.+.+|++...
T Consensus 205 Dt~~~~W~~~Gd 216 (342)
T PF07893_consen 205 DTESHEWRKHGD 216 (342)
T ss_pred EcCCcceeeccc
Confidence 999999999965
No 55
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=96.33 E-value=0.072 Score=43.30 Aligned_cols=106 Identities=17% Similarity=0.297 Sum_probs=62.0
Q ss_pred EEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcC
Q psy14331 119 MFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLE 197 (265)
Q Consensus 119 v~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~ 197 (265)
++||..+.... .....+-.||+.+.+|..+... ..-.-..+... ++++|+.|-....+ .....+..||..
T Consensus 2 ~VGG~F~~aGs---L~C~~lC~yd~~~~qW~~~g~~---i~G~V~~l~~~~~~~Llv~G~ft~~~---~~~~~la~yd~~ 72 (281)
T PF12768_consen 2 YVGGSFTSAGS---LPCPGLCLYDTDNSQWSSPGNG---ISGTVTDLQWASNNQLLVGGNFTLNG---TNSSNLATYDFK 72 (281)
T ss_pred EEeeecCCCCC---cCCCEEEEEECCCCEeecCCCC---ceEEEEEEEEecCCEEEEEEeeEECC---CCceeEEEEecC
Confidence 45665443322 1257788999999999987543 11112233333 67888877553332 114568899999
Q ss_pred CCceEEeccC--CCCCCCCCCceEEEeCCCeEEEEcCc
Q psy14331 198 TYAWKKIEPL--GAGPAPRSGCQMAATPDGKILISGGY 233 (265)
Q Consensus 198 ~~~W~~~~~~--~~~p~~~~~~~~~~~~~~~iyi~GG~ 233 (265)
+.+|..+... ..+|.+............++++.|..
T Consensus 73 ~~~w~~~~~~~s~~ipgpv~a~~~~~~d~~~~~~aG~~ 110 (281)
T PF12768_consen 73 NQTWSSLGGGSSNSIPGPVTALTFISNDGSNFWVAGRS 110 (281)
T ss_pred CCeeeecCCcccccCCCcEEEEEeeccCCceEEEecee
Confidence 9999988873 23454432222222223467777765
No 56
>KOG2055|consensus
Probab=95.88 E-value=0.38 Score=41.05 Aligned_cols=145 Identities=14% Similarity=0.167 Sum_probs=75.6
Q ss_pred eEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe--eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCcc
Q psy14331 53 FLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL--FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASES 130 (265)
Q Consensus 53 ~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~--w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~ 130 (265)
+-+. -...+.+.+|. ++. -.++..|- -..+.++-..++.-..+.....+....+.+|+
T Consensus 219 v~FH-p~~plllvaG~--d~~----------lrifqvDGk~N~~lqS~~l~~fPi~~a~f~p~G~~~i~~s~r------- 278 (514)
T KOG2055|consen 219 VQFH-PTAPLLLVAGL--DGT----------LRIFQVDGKVNPKLQSIHLEKFPIQKAEFAPNGHSVIFTSGR------- 278 (514)
T ss_pred EEec-CCCceEEEecC--CCc----------EEEEEecCccChhheeeeeccCccceeeecCCCceEEEeccc-------
Confidence 3344 35678888883 322 13333333 22333443334433333343334436666665
Q ss_pred ccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCC
Q psy14331 131 QFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGA 209 (265)
Q Consensus 131 ~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~ 209 (265)
..-++.||+++.+-+++.++. .+.+....-.+...+.++++-|.. ..|+..+..++.|..--.+
T Consensus 279 ----rky~ysyDle~ak~~k~~~~~g~e~~~~e~FeVShd~~fia~~G~~---------G~I~lLhakT~eli~s~Ki-- 343 (514)
T KOG2055|consen 279 ----RKYLYSYDLETAKVTKLKPPYGVEEKSMERFEVSHDSNFIAIAGNN---------GHIHLLHAKTKELITSFKI-- 343 (514)
T ss_pred ----ceEEEEeeccccccccccCCCCcccchhheeEecCCCCeEEEcccC---------ceEEeehhhhhhhhheeee--
Confidence 345889999999988887665 222221212233345566666653 4577888888888644332
Q ss_pred CCCCCCCceEEEeCCCeEEEEcCcc
Q psy14331 210 GPAPRSGCQMAATPDGKILISGGYS 234 (265)
Q Consensus 210 ~p~~~~~~~~~~~~~~~iyi~GG~~ 234 (265)
+......+ ..-.+..|+++||.+
T Consensus 344 -eG~v~~~~-fsSdsk~l~~~~~~G 366 (514)
T KOG2055|consen 344 -EGVVSDFT-FSSDSKELLASGGTG 366 (514)
T ss_pred -ccEEeeEE-EecCCcEEEEEcCCc
Confidence 22222222 222233577777764
No 57
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=95.38 E-value=1.6 Score=37.55 Aligned_cols=104 Identities=9% Similarity=0.129 Sum_probs=61.0
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCC--------CCccccceEEEecCCCEEEEEcCc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAP--------PPRCSHQMVALSADKGQLWMFGGE 123 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p--------~~r~~~~~~~~~~~~~~iyv~GG~ 123 (265)
.++++|+... ...++++|. |...-.-. ..+.... ..+. ++++|+.+.
T Consensus 68 ~~~~vy~~~~---------------~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~-~~v~--~~~v~v~~~- 128 (394)
T PRK11138 68 AYNKVYAADR---------------AGLVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGG-VTVA--GGKVYIGSE- 128 (394)
T ss_pred ECCEEEEECC---------------CCeEEEEECCCCcEeeEEcCCCcccccccccccccccc-cEEE--CCEEEEEcC-
Confidence 4899999876 356778887 66432211 0111122 2232 788887543
Q ss_pred ccCCCccccccCCceeEEEcCCC--cEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCc-
Q psy14331 124 FSSASESQFHHYKDLWVFRMGEK--KWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYA- 200 (265)
Q Consensus 124 ~~~~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~- 200 (265)
...++.+|.+++ .|+.-... .. ..+.++.++.+|+..+ ...++.+|+.+.+
T Consensus 129 -----------~g~l~ald~~tG~~~W~~~~~~---~~--~ssP~v~~~~v~v~~~----------~g~l~ald~~tG~~ 182 (394)
T PRK11138 129 -----------KGQVYALNAEDGEVAWQTKVAG---EA--LSRPVVSDGLVLVHTS----------NGMLQALNESDGAV 182 (394)
T ss_pred -----------CCEEEEEECCCCCCcccccCCC---ce--ecCCEEECCEEEEECC----------CCEEEEEEccCCCE
Confidence 246899998876 58654221 11 1233445788887433 2468999998754
Q ss_pred -eEEecc
Q psy14331 201 -WKKIEP 206 (265)
Q Consensus 201 -W~~~~~ 206 (265)
|+.-..
T Consensus 183 ~W~~~~~ 189 (394)
T PRK11138 183 KWTVNLD 189 (394)
T ss_pred eeeecCC
Confidence 876543
No 58
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=95.36 E-value=1.6 Score=37.50 Aligned_cols=133 Identities=10% Similarity=0.049 Sum_probs=71.5
Q ss_pred HHHHhhcccce--EEEeccCCC-------CCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe-----
Q psy14331 26 DIEAEEKRKNK--VIEKVVPEP-------TRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----- 91 (265)
Q Consensus 26 ~~~~~d~~~~~--w~~~~~~~p-------~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----- 91 (265)
.+.++|..+.+ |+....... ..+...+.++ .++++|+.+. ...++++|.
T Consensus 80 ~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v--~~~~v~v~~~---------------~g~l~ald~~tG~~ 142 (394)
T PRK11138 80 LVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTV--AGGKVYIGSE---------------KGQVYALNAEDGEV 142 (394)
T ss_pred eEEEEECCCCcEeeEEcCCCcccccccccccccccccEE--ECCEEEEEcC---------------CCEEEEEECCCCCC
Confidence 36788876444 865332210 1122223344 4888887544 357888887
Q ss_pred -eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCc--EEEEecCCCCCCCcCceEEEE
Q psy14331 92 -FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKK--WEKIVCKDTPPSRSGHRMIAL 168 (265)
Q Consensus 92 -w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~--W~~~~~~~~p~~~~~~~~~~~ 168 (265)
|....+ .+. ..+-++. ++.+|+..+ ...++.+|+++++ |+.-...+....+...+-++.
T Consensus 143 ~W~~~~~--~~~-~ssP~v~---~~~v~v~~~------------~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~v~ 204 (394)
T PRK11138 143 AWQTKVA--GEA-LSRPVVS---DGLVLVHTS------------NGMLQALNESDGAVKWTVNLDVPSLTLRGESAPATA 204 (394)
T ss_pred cccccCC--Cce-ecCCEEE---CCEEEEECC------------CCEEEEEEccCCCEeeeecCCCCcccccCCCCCEEE
Confidence 765432 221 1222333 788887543 2458999998875 776432110011112233445
Q ss_pred CCEEEEEcCccCCCCccccCCceEEEEcCCC--ceEE
Q psy14331 169 KKHLVVFGGFHDNLREAKYYNDVHIFDLETY--AWKK 203 (265)
Q Consensus 169 ~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~--~W~~ 203 (265)
++.+|+..+ ...++.+|+.+. .|+.
T Consensus 205 ~~~v~~~~~----------~g~v~a~d~~~G~~~W~~ 231 (394)
T PRK11138 205 FGGAIVGGD----------NGRVSAVLMEQGQLIWQQ 231 (394)
T ss_pred CCEEEEEcC----------CCEEEEEEccCChhhhee
Confidence 677766433 245788888875 4875
No 59
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=95.13 E-value=1.1 Score=34.35 Aligned_cols=93 Identities=18% Similarity=0.224 Sum_probs=48.1
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEE---EEecCCCCC--CCcCceEEEE--CCEEEEEcCccCCCCccc
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWE---KIVCKDTPP--SRSGHRMIAL--KKHLVVFGGFHDNLREAK 186 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~---~~~~~~~p~--~~~~~~~~~~--~~~iyv~GG~~~~~~~~~ 186 (265)
++++|++-|. ..|+|+..+.... .+.....|. .... ++... ++++|+|-|
T Consensus 62 ~~~~yfFkg~-------------~yw~~~~~~~~~~~Pk~i~~~~~~~~~~~iD-AA~~~~~~~~~yfFkg--------- 118 (194)
T cd00094 62 TGKIYFFKGD-------------KYWVYTGKNLEPGYPKPISDLGFPPTVKQID-AALRWPDNGKTYFFKG--------- 118 (194)
T ss_pred CCEEEEECCC-------------EEEEEcCcccccCCCcchhhcCCCCCCCCcc-EEEEEcCCCEEEEEeC---------
Confidence 4899999773 4777876542221 111111121 2222 33334 579999987
Q ss_pred cCCceEEEEcCCCceEEe-----ccC-CCCCCCCCCceEEEeCCCeEEEEcCc
Q psy14331 187 YYNDVHIFDLETYAWKKI-----EPL-GAGPAPRSGCQMAATPDGKILISGGY 233 (265)
Q Consensus 187 ~~~~v~~y~~~~~~W~~~-----~~~-~~~p~~~~~~~~~~~~~~~iyi~GG~ 233 (265)
+..|+||..+++.+.- ... ..+|.. -.++....++++|++-|.
T Consensus 119 --~~y~ry~~~~~~v~~~yP~~i~~~w~g~p~~--idaa~~~~~~~~yfF~g~ 167 (194)
T cd00094 119 --DKYWRYDEKTQKMDPGYPKLIETDFPGVPDK--VDAAFRWLDGYYYFFKGD 167 (194)
T ss_pred --CEEEEEeCCCccccCCCCcchhhcCCCcCCC--cceeEEeCCCcEEEEECC
Confidence 4578888766544211 110 112221 233444545889998765
No 60
>PF12217 End_beta_propel: Catalytic beta propeller domain of bacteriophage endosialidase; InterPro: IPR024428 This entry represents the beta propeller domain of endosialidases, which consists of catalytically active part of the enzymes. This core domain forms stable SDS-resistant trimers. There is a nested beta barrel domain in this domain. This domain is typically between 443 and 460 amino acids in length [].; PDB: 1V0E_B 1V0F_E 3JU4_A 3GVL_A 3GVK_B 3GVJ_A.
Probab=94.25 E-value=2.3 Score=33.95 Aligned_cols=114 Identities=16% Similarity=0.198 Sum_probs=52.9
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcC--ceEEEECCEEEEEcCccCCCCcc------
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSG--HRMIALKKHLVVFGGFHDNLREA------ 185 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~--~~~~~~~~~iyv~GG~~~~~~~~------ 185 (265)
+++||+.-..+... .+-+.+.+-+.....|+.+... ...++ .-.+.+++.||+||-....+++.
T Consensus 200 ~g~LyLtTRgt~~~-----~~GS~L~rs~d~G~~w~slrfp---~nvHhtnlPFakvgD~l~mFgsERA~~EWE~G~~D~ 271 (367)
T PF12217_consen 200 DGVLYLTTRGTLPT-----NPGSSLHRSDDNGQNWSSLRFP---NNVHHTNLPFAKVGDVLYMFGSERAENEWEGGEPDN 271 (367)
T ss_dssp TTEEEEEEEES-TT-----S---EEEEESSTTSS-EEEE-T---T---SS---EEEETTEEEEEEE-SSTT-SSTT----
T ss_pred CCEEEEEEcCcCCC-----CCcceeeeecccCCchhhcccc---ccccccCCCceeeCCEEEEEeccccccccccCCCcc
Confidence 89999986433221 1235566777777889998754 33322 23466799999999653322211
Q ss_pred --------ccCCce--EEEEcCCCceEEeccC---CCCCCCCCCceEEEeCCCeEE-EEcCccc
Q psy14331 186 --------KYYNDV--HIFDLETYAWKKIEPL---GAGPAPRSGCQMAATPDGKIL-ISGGYSK 235 (265)
Q Consensus 186 --------~~~~~v--~~y~~~~~~W~~~~~~---~~~p~~~~~~~~~~~~~~~iy-i~GG~~~ 235 (265)
+++..+ -..+++.-.|..+... +..-..-.+-+.+++.++.|| ||||.+.
T Consensus 272 RY~~~yPRtF~~k~nv~~W~~d~~ew~nitdqIYqG~ivNSavGVGSv~~KD~~lyy~FGgED~ 335 (367)
T PF12217_consen 272 RYRANYPRTFMLKVNVSDWSLDDVEWVNITDQIYQGGIVNSAVGVGSVVVKDGWLYYIFGGEDF 335 (367)
T ss_dssp -SS-B--EEEEEEEETTT---TT---EEEEE-BB--SSS---SEEEEEEEETTEEEEEEEEB-S
T ss_pred cccccCCceEEEEeecccCCccceEEEEeecceeccccccccccceeEEEECCEEEEEecCccc
Confidence 111111 1135666778777652 111112223344566688765 8999763
No 61
>KOG2055|consensus
Probab=94.22 E-value=1.3 Score=37.94 Aligned_cols=137 Identities=14% Similarity=0.167 Sum_probs=71.8
Q ss_pred eEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCC--CCCccccceEEEecCCCEEEEEcCcccC
Q psy14331 53 FLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGA--PPPRCSHQMVALSADKGQLWMFGGEFSS 126 (265)
Q Consensus 53 ~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~--p~~r~~~~~~~~~~~~~~iyv~GG~~~~ 126 (265)
+++.|.|...++++|. ..-++.||+ -+++.++ ...+..+.-- +.. ++.++++-|.
T Consensus 263 a~f~p~G~~~i~~s~r--------------rky~ysyDle~ak~~k~~~~~g~e~~~~e~Fe-VSh-d~~fia~~G~--- 323 (514)
T KOG2055|consen 263 AEFAPNGHSVIFTSGR--------------RKYLYSYDLETAKVTKLKPPYGVEEKSMERFE-VSH-DSNFIAIAGN--- 323 (514)
T ss_pred eeecCCCceEEEeccc--------------ceEEEEeeccccccccccCCCCcccchhheeE-ecC-CCCeEEEccc---
Confidence 4444456657777773 456888999 3333221 1122222221 211 4455566564
Q ss_pred CCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 127 ASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 127 ~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
..-+..+...++.|.- .+.++-.....+....+..|+++||. ..||.+|+..+.-...-.
T Consensus 324 --------~G~I~lLhakT~eli~--s~KieG~v~~~~fsSdsk~l~~~~~~----------GeV~v~nl~~~~~~~rf~ 383 (514)
T KOG2055|consen 324 --------NGHIHLLHAKTKELIT--SFKIEGVVSDFTFSSDSKELLASGGT----------GEVYVWNLRQNSCLHRFV 383 (514)
T ss_pred --------CceEEeehhhhhhhhh--eeeeccEEeeEEEecCCcEEEEEcCC----------ceEEEEecCCcceEEEEe
Confidence 2446777777887742 22222222333333345688889884 579999999875333322
Q ss_pred CCCCCCCCCCceEEEeCCCeEEEEc
Q psy14331 207 LGAGPAPRSGCQMAATPDGKILISG 231 (265)
Q Consensus 207 ~~~~p~~~~~~~~~~~~~~~iyi~G 231 (265)
. -...++.+.+.-.++.++.+|
T Consensus 384 D---~G~v~gts~~~S~ng~ylA~G 405 (514)
T KOG2055|consen 384 D---DGSVHGTSLCISLNGSYLATG 405 (514)
T ss_pred e---cCccceeeeeecCCCceEEec
Confidence 1 222344455544566645454
No 62
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=93.81 E-value=2.6 Score=33.01 Aligned_cols=130 Identities=18% Similarity=0.226 Sum_probs=74.2
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQ 131 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~ 131 (265)
.++.+|+..+ ...++++|. |.... +.+-... .... ++.+|+...
T Consensus 35 ~~~~v~~~~~---------------~~~l~~~d~~tG~~~W~~~~--~~~~~~~-~~~~---~~~v~v~~~--------- 84 (238)
T PF13360_consen 35 DGGRVYVASG---------------DGNLYALDAKTGKVLWRFDL--PGPISGA-PVVD---GGRVYVGTS--------- 84 (238)
T ss_dssp ETTEEEEEET---------------TSEEEEEETTTSEEEEEEEC--SSCGGSG-EEEE---TTEEEEEET---------
T ss_pred eCCEEEEEcC---------------CCEEEEEECCCCCEEEEeec--cccccce-eeec---ccccccccc---------
Confidence 4889999854 578999997 66554 2222212 2333 889988863
Q ss_pred cccCCceeEEEcCCC--cEEE-EecCCCCCC-CcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCc--eEEec
Q psy14331 132 FHHYKDLWVFRMGEK--KWEK-IVCKDTPPS-RSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYA--WKKIE 205 (265)
Q Consensus 132 ~~~~~~~~~yd~~~~--~W~~-~~~~~~p~~-~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~--W~~~~ 205 (265)
-+.++.+|..++ .|+. ....+ +.. .........++.+|+... ...++.+|+.+.+ |+.-.
T Consensus 85 ---~~~l~~~d~~tG~~~W~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~----------~g~l~~~d~~tG~~~w~~~~ 150 (238)
T PF13360_consen 85 ---DGSLYALDAKTGKVLWSIYLTSSP-PAGVRSSSSPAVDGDRLYVGTS----------SGKLVALDPKTGKLLWKYPV 150 (238)
T ss_dssp ---TSEEEEEETTTSCEEEEEEE-SSC-TCSTB--SEEEEETTEEEEEET----------CSEEEEEETTTTEEEEEEES
T ss_pred ---eeeeEecccCCcceeeeecccccc-ccccccccCceEecCEEEEEec----------cCcEEEEecCCCcEEEEeec
Confidence 246899998776 5883 43321 122 222334444677777553 2568999998754 77655
Q ss_pred cCCCCCCCCC-------CceEEEeCCCeEEEEcCcc
Q psy14331 206 PLGAGPAPRS-------GCQMAATPDGKILISGGYS 234 (265)
Q Consensus 206 ~~~~~p~~~~-------~~~~~~~~~~~iyi~GG~~ 234 (265)
.. +.... ..+...+.++.+|+..+..
T Consensus 151 ~~---~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g 183 (238)
T PF13360_consen 151 GE---PRGSSPISSFSDINGSPVISDGRVYVSSGDG 183 (238)
T ss_dssp ST---T-SS--EEEETTEEEEEECCTTEEEEECCTS
T ss_pred CC---CCCCcceeeecccccceEEECCEEEEEcCCC
Confidence 43 21111 0122334357888877654
No 63
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=93.22 E-value=3.9 Score=33.36 Aligned_cols=108 Identities=14% Similarity=0.171 Sum_probs=63.8
Q ss_pred cCceEEEEe----eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecC---C
Q psy14331 83 LDHLILMNL----FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCK---D 155 (265)
Q Consensus 83 ~~~~~~yd~----w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~---~ 155 (265)
+..+=.||. |.....-..... .++... .+++||+.|-.+..+. ....+..||..+.+|..+... .
T Consensus 15 C~~lC~yd~~~~qW~~~g~~i~G~V-~~l~~~--~~~~Llv~G~ft~~~~-----~~~~la~yd~~~~~w~~~~~~~s~~ 86 (281)
T PF12768_consen 15 CPGLCLYDTDNSQWSSPGNGISGTV-TDLQWA--SNNQLLVGGNFTLNGT-----NSSNLATYDFKNQTWSSLGGGSSNS 86 (281)
T ss_pred CCEEEEEECCCCEeecCCCCceEEE-EEEEEe--cCCEEEEEEeeEECCC-----CceeEEEEecCCCeeeecCCccccc
Confidence 567778887 888766533322 222222 3678888876544431 145688999999999988762 1
Q ss_pred CCCCCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 156 TPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 156 ~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.|.+......... ...+++.|.. ..+ ..-|..| +..+|..+..
T Consensus 87 ipgpv~a~~~~~~d~~~~~~aG~~-~~g-----~~~l~~~--dGs~W~~i~~ 130 (281)
T PF12768_consen 87 IPGPVTALTFISNDGSNFWVAGRS-ANG-----STFLMKY--DGSSWSSIGS 130 (281)
T ss_pred CCCcEEEEEeeccCCceEEEecee-cCC-----CceEEEE--cCCceEeccc
Confidence 3444422222222 3467777765 221 2345555 6778888776
No 64
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=92.57 E-value=3.7 Score=31.40 Aligned_cols=61 Identities=16% Similarity=0.327 Sum_probs=37.3
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEE-----ec--CCCCCCCcCceEEEE-CCEEEEEcCccCCCCcc
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKI-----VC--KDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREA 185 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~-----~~--~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~ 185 (265)
++++|++.| +..++||..+++...- .. ...|... .++... ++++|+|-|
T Consensus 110 ~~~~yfFkg-------------~~y~ry~~~~~~v~~~yP~~i~~~w~g~p~~i--daa~~~~~~~~yfF~g-------- 166 (194)
T cd00094 110 NGKTYFFKG-------------DKYWRYDEKTQKMDPGYPKLIETDFPGVPDKV--DAAFRWLDGYYYFFKG-------- 166 (194)
T ss_pred CCEEEEEeC-------------CEEEEEeCCCccccCCCCcchhhcCCCcCCCc--ceeEEeCCCcEEEEEC--------
Confidence 689999988 3477888655543211 00 1122222 234444 488999987
Q ss_pred ccCCceEEEEcCCCc
Q psy14331 186 KYYNDVHIFDLETYA 200 (265)
Q Consensus 186 ~~~~~v~~y~~~~~~ 200 (265)
+..++||..+.+
T Consensus 167 ---~~y~~~d~~~~~ 178 (194)
T cd00094 167 ---DQYWRFDPRSKE 178 (194)
T ss_pred ---CEEEEEeCccce
Confidence 468999988766
No 65
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=92.15 E-value=5.3 Score=32.16 Aligned_cols=95 Identities=23% Similarity=0.319 Sum_probs=62.9
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEE
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHI 193 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~ 193 (265)
++.||.--|... .+.+.++|+.+++-.....+ |..-++-+++.++++||..-=. ......
T Consensus 55 ~g~LyESTG~yG---------~S~l~~~d~~tg~~~~~~~l--~~~~FgEGit~~~d~l~qLTWk---------~~~~f~ 114 (264)
T PF05096_consen 55 DGTLYESTGLYG---------QSSLRKVDLETGKVLQSVPL--PPRYFGEGITILGDKLYQLTWK---------EGTGFV 114 (264)
T ss_dssp TTEEEEEECSTT---------EEEEEEEETTTSSEEEEEE---TTT--EEEEEEETTEEEEEESS---------SSEEEE
T ss_pred CCEEEEeCCCCC---------cEEEEEEECCCCcEEEEEEC--CccccceeEEEECCEEEEEEec---------CCeEEE
Confidence 789999888643 36788999999986665555 4666777889999999998643 345778
Q ss_pred EEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcc
Q psy14331 194 FDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYS 234 (265)
Q Consensus 194 y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~ 234 (265)
||+.+ .+.+... +.+.-+.+++.- +..+++.-|.+
T Consensus 115 yd~~t--l~~~~~~---~y~~EGWGLt~d-g~~Li~SDGS~ 149 (264)
T PF05096_consen 115 YDPNT--LKKIGTF---PYPGEGWGLTSD-GKRLIMSDGSS 149 (264)
T ss_dssp EETTT--TEEEEEE---E-SSS--EEEEC-SSCEEEE-SSS
T ss_pred Ecccc--ceEEEEE---ecCCcceEEEcC-CCEEEEECCcc
Confidence 98875 4555552 444567777643 66788887754
No 66
>PRK13684 Ycf48-like protein; Provisional
Probab=91.52 E-value=7.5 Score=32.64 Aligned_cols=80 Identities=16% Similarity=0.199 Sum_probs=41.2
Q ss_pred ceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEE-EcCCCceEEeccCCCCCCCCC
Q psy14331 137 DLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIF-DLETYAWKKIEPLGAGPAPRS 215 (265)
Q Consensus 137 ~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y-~~~~~~W~~~~~~~~~p~~~~ 215 (265)
.+++=+-...+|+.+... ..-..+.+....+..++..|..+ .++.- +....+|+.+.. +..+.
T Consensus 153 ~i~~S~DgG~tW~~~~~~---~~g~~~~i~~~~~g~~v~~g~~G---------~i~~s~~~gg~tW~~~~~----~~~~~ 216 (334)
T PRK13684 153 AIYRTTDGGKNWEALVED---AAGVVRNLRRSPDGKYVAVSSRG---------NFYSTWEPGQTAWTPHQR----NSSRR 216 (334)
T ss_pred eEEEECCCCCCceeCcCC---CcceEEEEEECCCCeEEEEeCCc---------eEEEEcCCCCCeEEEeeC----CCccc
Confidence 355555556799987543 22233344444443444433322 23332 334467998865 33344
Q ss_pred CceEEEeCCCeEEEEcC
Q psy14331 216 GCQMAATPDGKILISGG 232 (265)
Q Consensus 216 ~~~~~~~~~~~iyi~GG 232 (265)
-..++...++.++++|.
T Consensus 217 l~~i~~~~~g~~~~vg~ 233 (334)
T PRK13684 217 LQSMGFQPDGNLWMLAR 233 (334)
T ss_pred ceeeeEcCCCCEEEEec
Confidence 44555555667777664
No 67
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=91.37 E-value=6.4 Score=31.56 Aligned_cols=130 Identities=16% Similarity=0.155 Sum_probs=73.0
Q ss_pred cCceEEEEe-------eecCCCC------CCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCC---
Q psy14331 83 LDHLILMNL-------FVSAPGA------PPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK--- 146 (265)
Q Consensus 83 ~~~~~~yd~-------w~~~~~~------p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~--- 146 (265)
++.+.+||+ +..++.. |....+++-+-++++++-|+|+-...+.+. .--+-+.|+.+-
T Consensus 88 s~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g------~ivvskld~~tL~v~ 161 (250)
T PF02191_consen 88 SRNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNG------NIVVSKLDPETLSVE 161 (250)
T ss_pred CceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCC------cEEEEeeCcccCceE
Confidence 789999999 2223221 222344555666666777777755433321 122456677653
Q ss_pred -cEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeC--
Q psy14331 147 -KWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATP-- 223 (265)
Q Consensus 147 -~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~-- 223 (265)
+|..- . ..+....+.++-+.||++-...... ..-.+.||+.+++=+.+.-. .+.+-..++++...
T Consensus 162 ~tw~T~--~---~k~~~~naFmvCGvLY~~~s~~~~~-----~~I~yafDt~t~~~~~~~i~--f~~~~~~~~~l~YNP~ 229 (250)
T PF02191_consen 162 QTWNTS--Y---PKRSAGNAFMVCGVLYATDSYDTRD-----TEIFYAFDTYTGKEEDVSIP--FPNPYGNISMLSYNPR 229 (250)
T ss_pred EEEEec--c---CchhhcceeeEeeEEEEEEECCCCC-----cEEEEEEECCCCceeceeee--eccccCceEeeeECCC
Confidence 46431 1 2334445777788999987664432 23457899998876655543 23333334444332
Q ss_pred CCeEEEE
Q psy14331 224 DGKILIS 230 (265)
Q Consensus 224 ~~~iyi~ 230 (265)
+.+||+.
T Consensus 230 dk~LY~w 236 (250)
T PF02191_consen 230 DKKLYAW 236 (250)
T ss_pred CCeEEEE
Confidence 5678887
No 68
>PRK04792 tolB translocation protein TolB; Provisional
Probab=91.05 E-value=7.6 Score=34.08 Aligned_cols=144 Identities=14% Similarity=0.136 Sum_probs=72.8
Q ss_pred HHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCC
Q psy14331 26 DIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPP 101 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~ 101 (265)
.++.+|..+.+-.. ....+. ...+...-|+++.|++... .++ ..+++.+|+ ...... .
T Consensus 243 ~L~~~dl~tg~~~~-lt~~~g--~~~~~~wSPDG~~La~~~~--~~g----------~~~Iy~~dl~tg~~~~lt~---~ 304 (448)
T PRK04792 243 EIFVQDIYTQVREK-VTSFPG--INGAPRFSPDGKKLALVLS--KDG----------QPEIYVVDIATKALTRITR---H 304 (448)
T ss_pred EEEEEECCCCCeEE-ecCCCC--CcCCeeECCCCCEEEEEEe--CCC----------CeEEEEEECCCCCeEECcc---C
Confidence 46666666555444 222222 1223445567777776533 121 357888888 222221 1
Q ss_pred ccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEEC-CEEEEEcCccC
Q psy14331 102 RCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALK-KHLVVFGGFHD 180 (265)
Q Consensus 102 r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~-~~iyv~GG~~~ 180 (265)
............++.|++...... ..+++.+|..+++++.+.... .........-+ +.|++.. ...
T Consensus 305 ~~~~~~p~wSpDG~~I~f~s~~~g---------~~~Iy~~dl~~g~~~~Lt~~g---~~~~~~~~SpDG~~l~~~~-~~~ 371 (448)
T PRK04792 305 RAIDTEPSWHPDGKSLIFTSERGG---------KPQIYRVNLASGKVSRLTFEG---EQNLGGSITPDGRSMIMVN-RTN 371 (448)
T ss_pred CCCccceEECCCCCEEEEEECCCC---------CceEEEEECCCCCEEEEecCC---CCCcCeeECCCCCEEEEEE-ecC
Confidence 111122223333455655432211 357999999999988875321 11111122224 4555443 222
Q ss_pred CCCccccCCceEEEEcCCCceEEecc
Q psy14331 181 NLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 181 ~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
. ...|+.+|+.+...+.+..
T Consensus 372 g------~~~I~~~dl~~g~~~~lt~ 391 (448)
T PRK04792 372 G------KFNIARQDLETGAMQVLTS 391 (448)
T ss_pred C------ceEEEEEECCCCCeEEccC
Confidence 1 2468999999988777654
No 69
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=91.03 E-value=11 Score=33.82 Aligned_cols=114 Identities=11% Similarity=0.103 Sum_probs=60.8
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCCCCc--------cccceEEEecCCCEEEEEcCc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAPPPR--------CSHQMVALSADKGQLWMFGGE 123 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p~~r--------~~~~~~~~~~~~~~iyv~GG~ 123 (265)
.++.||+... ...++.+|. |......+... .....++. +++||+...
T Consensus 68 ~~g~vyv~s~---------------~g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~---~~~v~v~t~- 128 (527)
T TIGR03075 68 VDGVMYVTTS---------------YSRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALY---DGKVFFGTL- 128 (527)
T ss_pred ECCEEEEECC---------------CCcEEEEECCCCceeeEecCCCCcccccccccccccccceEE---CCEEEEEcC-
Confidence 3889998665 356778887 76554333211 11222222 778887432
Q ss_pred ccCCCccccccCCceeEEEcCCCc--EEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCc-
Q psy14331 124 FSSASESQFHHYKDLWVFRMGEKK--WEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYA- 200 (265)
Q Consensus 124 ~~~~~~~~~~~~~~~~~yd~~~~~--W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~- 200 (265)
-..+..+|.++++ |+.-............+-++.++.+|+-...... .....|..||.++.+
T Consensus 129 -----------dg~l~ALDa~TGk~~W~~~~~~~~~~~~~tssP~v~~g~Vivg~~~~~~----~~~G~v~AlD~~TG~~ 193 (527)
T TIGR03075 129 -----------DARLVALDAKTGKVVWSKKNGDYKAGYTITAAPLVVKGKVITGISGGEF----GVRGYVTAYDAKTGKL 193 (527)
T ss_pred -----------CCEEEEEECCCCCEEeecccccccccccccCCcEEECCEEEEeeccccc----CCCcEEEEEECCCCce
Confidence 2458999998874 7653211100011122344557777764221111 123568889998854
Q ss_pred -eEEec
Q psy14331 201 -WKKIE 205 (265)
Q Consensus 201 -W~~~~ 205 (265)
|+.-.
T Consensus 194 lW~~~~ 199 (527)
T TIGR03075 194 VWRRYT 199 (527)
T ss_pred eEeccC
Confidence 76443
No 70
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=89.52 E-value=6.6 Score=33.07 Aligned_cols=143 Identities=13% Similarity=0.110 Sum_probs=73.0
Q ss_pred HHHhhcccce--EEEec-cCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe---------eec
Q psy14331 27 IEAEEKRKNK--VIEKV-VPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL---------FVS 94 (265)
Q Consensus 27 ~~~~d~~~~~--w~~~~-~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~---------w~~ 94 (265)
++.|+..... ..... ...|..-....+++.++++.+||+.-. .+.+..|+. ...
T Consensus 168 v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~--------------s~~v~v~~~~~~~g~~~~~~~ 233 (345)
T PF10282_consen 168 VYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNEL--------------SNTVSVFDYDPSDGSLTEIQT 233 (345)
T ss_dssp EEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETT--------------TTEEEEEEEETTTTEEEEEEE
T ss_pred EEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCC--------------CCcEEEEeecccCCceeEEEE
Confidence 6666665554 43311 123333333456677788899998652 344444444 222
Q ss_pred CCCCCC---CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEc--CCCcEEEEecCC--CCCCCcCceEEE
Q psy14331 95 APGAPP---PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRM--GEKKWEKIVCKD--TPPSRSGHRMIA 167 (265)
Q Consensus 95 ~~~~p~---~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~--~~~~W~~~~~~~--~p~~~~~~~~~~ 167 (265)
.+.+|. .....+.+.+...++.||+.-.. .+.+..|++ .+++.+.+.... ...||. ++.
T Consensus 234 ~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~-----------~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~---~~~ 299 (345)
T PF10282_consen 234 ISTLPEGFTGENAPAEIAISPDGRFLYVSNRG-----------SNSISVFDLDPATGTLTLVQTVPTGGKFPRH---FAF 299 (345)
T ss_dssp EESCETTSCSSSSEEEEEE-TTSSEEEEEECT-----------TTEEEEEEECTTTTTEEEEEEEEESSSSEEE---EEE
T ss_pred eeeccccccccCCceeEEEecCCCEEEEEecc-----------CCEEEEEEEecCCCceEEEEEEeCCCCCccE---EEE
Confidence 233332 22234445565556788887543 356777776 456666554332 112332 222
Q ss_pred --ECCEEEEEcCccCCCCccccCCceEEE--EcCCCceEEecc
Q psy14331 168 --LKKHLVVFGGFHDNLREAKYYNDVHIF--DLETYAWKKIEP 206 (265)
Q Consensus 168 --~~~~iyv~GG~~~~~~~~~~~~~v~~y--~~~~~~W~~~~~ 206 (265)
.++.+|+.... .+.|..| |..+..++.+..
T Consensus 300 s~~g~~l~Va~~~---------s~~v~vf~~d~~tG~l~~~~~ 333 (345)
T PF10282_consen 300 SPDGRYLYVANQD---------SNTVSVFDIDPDTGKLTPVGS 333 (345)
T ss_dssp -TTSSEEEEEETT---------TTEEEEEEEETTTTEEEEEEE
T ss_pred eCCCCEEEEEecC---------CCeEEEEEEeCCCCcEEEecc
Confidence 24566664432 2445555 567788887764
No 71
>PRK04792 tolB translocation protein TolB; Provisional
Probab=89.07 E-value=15 Score=32.26 Aligned_cols=61 Identities=8% Similarity=0.065 Sum_probs=36.4
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|..+++.+.+.... .........-++ .|++.....+ ...++.+|+.+.+++++..
T Consensus 286 ~~Iy~~dl~tg~~~~lt~~~---~~~~~p~wSpDG~~I~f~s~~~g-------~~~Iy~~dl~~g~~~~Lt~ 347 (448)
T PRK04792 286 PEIYVVDIATKALTRITRHR---AIDTEPSWHPDGKSLIFTSERGG-------KPQIYRVNLASGKVSRLTF 347 (448)
T ss_pred eEEEEEECCCCCeEECccCC---CCccceEECCCCCEEEEEECCCC-------CceEEEEECCCCCEEEEec
Confidence 57999999998887765321 111111222234 4544432211 2479999999999988764
No 72
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=88.71 E-value=6.6 Score=27.69 Aligned_cols=83 Identities=12% Similarity=0.113 Sum_probs=49.3
Q ss_pred EECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccce
Q psy14331 167 ALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGI 246 (265)
Q Consensus 167 ~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~ 246 (265)
.+||.+|-..-... .....|.+||..+.+|+.+..+...........++.. +|+|-++.-..... .
T Consensus 3 cinGvly~~a~~~~-----~~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~-~G~L~~v~~~~~~~--------~ 68 (129)
T PF08268_consen 3 CINGVLYWLAWSED-----SDNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEY-KGKLALVSYNDQGE--------P 68 (129)
T ss_pred EECcEEEeEEEECC-----CCCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEe-CCeEEEEEecCCCC--------c
Confidence 35777777665411 2257799999999999988873111233344555555 78877764332111 1
Q ss_pred eeceEEEEe-cCcccccc
Q psy14331 247 VHTDTFLLT-PDSKTLRS 263 (265)
Q Consensus 247 ~~~~~w~~~-~~~~~w~~ 263 (265)
..=++|.++ .+...|.+
T Consensus 69 ~~~~iWvLeD~~k~~Wsk 86 (129)
T PF08268_consen 69 DSIDIWVLEDYEKQEWSK 86 (129)
T ss_pred ceEEEEEeeccccceEEE
Confidence 234588885 44567765
No 73
>KOG0310|consensus
Probab=88.14 E-value=17 Score=31.65 Aligned_cols=105 Identities=14% Similarity=0.221 Sum_probs=55.1
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe-------eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCcc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL-------FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASES 130 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~-------w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~ 130 (265)
.+++|..+|+. ...+.+||. .......|..+ +-....++.+++.|+.+.
T Consensus 78 ~DG~LlaaGD~--------------sG~V~vfD~k~r~iLR~~~ah~apv~~-----~~f~~~d~t~l~s~sDd~----- 133 (487)
T KOG0310|consen 78 SDGRLLAAGDE--------------SGHVKVFDMKSRVILRQLYAHQAPVHV-----TKFSPQDNTMLVSGSDDK----- 133 (487)
T ss_pred cCCeEEEccCC--------------cCcEEEeccccHHHHHHHhhccCceeE-----EEecccCCeEEEecCCCc-----
Confidence 57888888883 456677775 01111112111 122233889999988432
Q ss_pred ccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCC-ceE
Q psy14331 131 QFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETY-AWK 202 (265)
Q Consensus 131 ~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~-~W~ 202 (265)
-+-..|..+..- .........-..+..+...+++|++-||+++. |-.||+.+. .|.
T Consensus 134 ------v~k~~d~s~a~v-~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~---------vrl~DtR~~~~~v 190 (487)
T KOG0310|consen 134 ------VVKYWDLSTAYV-QAELSGHTDYVRCGDISPANDHIVVTGSYDGK---------VRLWDTRSLTSRV 190 (487)
T ss_pred ------eEEEEEcCCcEE-EEEecCCcceeEeeccccCCCeEEEecCCCce---------EEEEEeccCCcee
Confidence 122334433332 22222111222233444558899999998654 667888776 443
No 74
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=86.67 E-value=19 Score=30.62 Aligned_cols=101 Identities=10% Similarity=0.086 Sum_probs=58.3
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQ 131 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~ 131 (265)
.++++|+.+. ...++++|. |...-.- . .. +...+. ++.+|+.+.
T Consensus 64 ~~~~v~v~~~---------------~g~v~a~d~~tG~~~W~~~~~~--~-~~-~~p~v~--~~~v~v~~~--------- 113 (377)
T TIGR03300 64 AGGKVYAADA---------------DGTVVALDAETGKRLWRVDLDE--R-LS-GGVGAD--GGLVFVGTE--------- 113 (377)
T ss_pred ECCEEEEECC---------------CCeEEEEEccCCcEeeeecCCC--C-cc-cceEEc--CCEEEEEcC---------
Confidence 3888888765 357888886 6543222 1 11 122332 777876543
Q ss_pred cccCCceeEEEcCCCc--EEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCC--ceEEecc
Q psy14331 132 FHHYKDLWVFRMGEKK--WEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETY--AWKKIEP 206 (265)
Q Consensus 132 ~~~~~~~~~yd~~~~~--W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~--~W~~~~~ 206 (265)
-..++.+|..+++ |+.-.. ... ....+..++.+|+..+ ...++.+|+.+. .|+.-..
T Consensus 114 ---~g~l~ald~~tG~~~W~~~~~----~~~-~~~p~v~~~~v~v~~~----------~g~l~a~d~~tG~~~W~~~~~ 174 (377)
T TIGR03300 114 ---KGEVIALDAEDGKELWRAKLS----SEV-LSPPLVANGLVVVRTN----------DGRLTALDAATGERLWTYSRV 174 (377)
T ss_pred ---CCEEEEEECCCCcEeeeeccC----cee-ecCCEEECCEEEEECC----------CCeEEEEEcCCCceeeEEccC
Confidence 2468999987764 865321 111 1223345777777533 246899998865 4875443
No 75
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=86.53 E-value=9.2 Score=26.95 Aligned_cols=83 Identities=18% Similarity=0.213 Sum_probs=54.3
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecC-CCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceE
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCK-DTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVH 192 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~-~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~ 192 (265)
+|.+|-..-... .....+.+||+.+.+|+.++.. ..........++..+++|-++.-..... ...-++|
T Consensus 5 nGvly~~a~~~~-------~~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~---~~~~~iW 74 (129)
T PF08268_consen 5 NGVLYWLAWSED-------SDNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVSYNDQGE---PDSIDIW 74 (129)
T ss_pred CcEEEeEEEECC-------CCCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEEecCCCC---cceEEEE
Confidence 788887766511 1257799999999999988753 1123445567777888888865443221 1134677
Q ss_pred EE-EcCCCceEEecc
Q psy14331 193 IF-DLETYAWKKIEP 206 (265)
Q Consensus 193 ~y-~~~~~~W~~~~~ 206 (265)
.. |..+..|.+...
T Consensus 75 vLeD~~k~~Wsk~~~ 89 (129)
T PF08268_consen 75 VLEDYEKQEWSKKHI 89 (129)
T ss_pred EeeccccceEEEEEE
Confidence 77 456788987755
No 76
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=86.22 E-value=19 Score=30.30 Aligned_cols=125 Identities=16% Similarity=0.219 Sum_probs=64.9
Q ss_pred CCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCC--CCCCCccccceEEEecCCCEEE
Q psy14331 47 RRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAP--GAPPPRCSHQMVALSADKGQLW 118 (265)
Q Consensus 47 ~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~--~~p~~r~~~~~~~~~~~~~~iy 118 (265)
....|.+.+.|+++.+|+.-= + .+.++.|+. ..... ..|.. .+...+.+...++.+|
T Consensus 143 ~~h~H~v~~~pdg~~v~v~dl----G----------~D~v~~~~~~~~~~~l~~~~~~~~~~G-~GPRh~~f~pdg~~~Y 207 (345)
T PF10282_consen 143 GPHPHQVVFSPDGRFVYVPDL----G----------ADRVYVYDIDDDTGKLTPVDSIKVPPG-SGPRHLAFSPDGKYAY 207 (345)
T ss_dssp STCEEEEEE-TTSSEEEEEET----T----------TTEEEEEEE-TTS-TEEEEEEEECSTT-SSEEEEEE-TTSSEEE
T ss_pred cccceeEEECCCCCEEEEEec----C----------CCEEEEEEEeCCCceEEEeeccccccC-CCCcEEEEcCCcCEEE
Confidence 345677788888888988521 1 577888888 11101 12221 2222233444467899
Q ss_pred EEcCcccCCCccccccCCceeEEEcC--CCcEEEEecCC-CCCC---C-cCceEEEE--CCEEEEEcCccCCCCccccCC
Q psy14331 119 MFGGEFSSASESQFHHYKDLWVFRMG--EKKWEKIVCKD-TPPS---R-SGHRMIAL--KKHLVVFGGFHDNLREAKYYN 189 (265)
Q Consensus 119 v~GG~~~~~~~~~~~~~~~~~~yd~~--~~~W~~~~~~~-~p~~---~-~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~ 189 (265)
|..-. .+.+.+|+.. ++.++.+.... .|.. . ..+.+... ++.+|+--.. .+
T Consensus 208 v~~e~-----------s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~---------~~ 267 (345)
T PF10282_consen 208 VVNEL-----------SNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRG---------SN 267 (345)
T ss_dssp EEETT-----------TTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECT---------TT
T ss_pred EecCC-----------CCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEecc---------CC
Confidence 98664 3456666555 66666554322 1221 1 12222222 4577875422 46
Q ss_pred ceEEEEc--CCCceEEecc
Q psy14331 190 DVHIFDL--ETYAWKKIEP 206 (265)
Q Consensus 190 ~v~~y~~--~~~~W~~~~~ 206 (265)
.|..|++ .+.+.+.+..
T Consensus 268 sI~vf~~d~~~g~l~~~~~ 286 (345)
T PF10282_consen 268 SISVFDLDPATGTLTLVQT 286 (345)
T ss_dssp EEEEEEECTTTTTEEEEEE
T ss_pred EEEEEEEecCCCceEEEEE
Confidence 7888887 3456655554
No 77
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=85.51 E-value=22 Score=30.25 Aligned_cols=61 Identities=15% Similarity=0.075 Sum_probs=37.1
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCC--cEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCce
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK--KWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDV 191 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v 191 (265)
++.||+... ...+.++|..++ .|+.-. . ..+...+.+..++.+|+.. . ...+
T Consensus 279 ~~~vyv~~~------------~G~l~~~d~~tG~~~W~~~~-~---~~~~~ssp~i~g~~l~~~~-~---------~G~l 332 (377)
T TIGR03300 279 DNRLYVTDA------------DGVVVALDRRSGSELWKNDE-L---KYRQLTAPAVVGGYLVVGD-F---------EGYL 332 (377)
T ss_pred CCEEEEECC------------CCeEEEEECCCCcEEEcccc-c---cCCccccCEEECCEEEEEe-C---------CCEE
Confidence 889998753 245889998776 476522 1 1222233345677787743 2 2468
Q ss_pred EEEEcCCCc
Q psy14331 192 HIFDLETYA 200 (265)
Q Consensus 192 ~~y~~~~~~ 200 (265)
+++|+.+.+
T Consensus 333 ~~~d~~tG~ 341 (377)
T TIGR03300 333 HWLSREDGS 341 (377)
T ss_pred EEEECCCCC
Confidence 889987754
No 78
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=85.26 E-value=23 Score=30.42 Aligned_cols=61 Identities=7% Similarity=0.110 Sum_probs=36.4
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceE-EEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRM-IALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~-~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
..++.+|..+.+++.+.... ...... ..-+++.+++...... ...|+.+|+.+..++.+..
T Consensus 302 ~~iy~~d~~~~~~~~l~~~~----~~~~~~~~spdg~~i~~~~~~~~------~~~i~~~d~~~~~~~~l~~ 363 (417)
T TIGR02800 302 PQIYMMDADGGEVRRLTFRG----GYNASPSWSPDGDLIAFVHREGG------GFNIAVMDLDGGGERVLTD 363 (417)
T ss_pred ceEEEEECCCCCEEEeecCC----CCccCeEECCCCCEEEEEEccCC------ceEEEEEeCCCCCeEEccC
Confidence 47999999988887775332 111122 2224555555443221 2468999998877766654
No 79
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=83.54 E-value=21 Score=28.56 Aligned_cols=38 Identities=21% Similarity=0.289 Sum_probs=19.2
Q ss_pred HHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECc
Q psy14331 27 IEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGG 67 (265)
Q Consensus 27 ~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG 67 (265)
+..+|..+.+......... ....+++.++++.+|+.++
T Consensus 55 v~~~d~~~~~~~~~~~~~~---~~~~~~~~~~g~~l~~~~~ 92 (300)
T TIGR03866 55 IQVIDLATGEVIGTLPSGP---DPELFALHPNGKILYIANE 92 (300)
T ss_pred EEEEECCCCcEEEeccCCC---CccEEEECCCCCEEEEEcC
Confidence 5566766655533222111 1233445556777777655
No 80
>PRK00178 tolB translocation protein TolB; Provisional
Probab=82.62 E-value=32 Score=29.89 Aligned_cols=61 Identities=3% Similarity=0.089 Sum_probs=36.4
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEEC-CEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALK-KHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|..++..+.+.... .........-+ +.|++.....+ ...++.+|+.+.+++++..
T Consensus 267 ~~Iy~~d~~~~~~~~lt~~~---~~~~~~~~spDg~~i~f~s~~~g-------~~~iy~~d~~~g~~~~lt~ 328 (430)
T PRK00178 267 PEIYVMDLASRQLSRVTNHP---AIDTEPFWGKDGRTLYFTSDRGG-------KPQIYKVNVNGGRAERVTF 328 (430)
T ss_pred ceEEEEECCCCCeEEcccCC---CCcCCeEECCCCCEEEEEECCCC-------CceEEEEECCCCCEEEeec
Confidence 57999999999887765321 11111122223 45555432211 2468999999988887764
No 81
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=81.07 E-value=30 Score=28.67 Aligned_cols=69 Identities=16% Similarity=0.106 Sum_probs=33.0
Q ss_pred EEecCCCEEEEEcCcccCCCccccccCCceeEEEcC-CCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccc
Q psy14331 109 ALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMG-EKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAK 186 (265)
Q Consensus 109 ~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~-~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~ 186 (265)
.+...++.||+.+.. .+.+..|+.. +++++.+.....+ ..-.+.+..- ++.+|+..-.
T Consensus 41 ~~spd~~~lyv~~~~-----------~~~i~~~~~~~~g~l~~~~~~~~~-~~p~~i~~~~~g~~l~v~~~~-------- 100 (330)
T PRK11028 41 VISPDKRHLYVGVRP-----------EFRVLSYRIADDGALTFAAESPLP-GSPTHISTDHQGRFLFSASYN-------- 100 (330)
T ss_pred EECCCCCEEEEEECC-----------CCcEEEEEECCCCceEEeeeecCC-CCceEEEECCCCCEEEEEEcC--------
Confidence 344445567775442 2456677765 4567655433211 1111222222 3456665421
Q ss_pred cCCceEEEEcCC
Q psy14331 187 YYNDVHIFDLET 198 (265)
Q Consensus 187 ~~~~v~~y~~~~ 198 (265)
.+.+..|++.+
T Consensus 101 -~~~v~v~~~~~ 111 (330)
T PRK11028 101 -ANCVSVSPLDK 111 (330)
T ss_pred -CCeEEEEEECC
Confidence 24577787754
No 82
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=81.03 E-value=24 Score=28.47 Aligned_cols=102 Identities=20% Similarity=0.223 Sum_probs=63.2
Q ss_pred CCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccc
Q psy14331 58 DKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFH 133 (265)
Q Consensus 58 ~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~ 133 (265)
.++.||.--|. -+ -+.+.++|+ -....++|..-++-..+.+ +++||.+-=.
T Consensus 54 ~~g~LyESTG~--yG----------~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~---~d~l~qLTWk---------- 108 (264)
T PF05096_consen 54 DDGTLYESTGL--YG----------QSSLRKVDLETGKVLQSVPLPPRYFGEGITIL---GDKLYQLTWK---------- 108 (264)
T ss_dssp ETTEEEEEECS--TT----------EEEEEEEETTTSSEEEEEE-TTT--EEEEEEE---TTEEEEEESS----------
T ss_pred CCCEEEEeCCC--CC----------cEEEEEEECCCCcEEEEEECCccccceeEEEE---CCEEEEEEec----------
Confidence 47788887773 22 256778888 3334456655566666666 8999998432
Q ss_pred cCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCC
Q psy14331 134 HYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETY 199 (265)
Q Consensus 134 ~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~ 199 (265)
....++||..+- +.+.... .+..+-+++..++.+++--| .+.++..||.+-
T Consensus 109 -~~~~f~yd~~tl--~~~~~~~--y~~EGWGLt~dg~~Li~SDG----------S~~L~~~dP~~f 159 (264)
T PF05096_consen 109 -EGTGFVYDPNTL--KKIGTFP--YPGEGWGLTSDGKRLIMSDG----------SSRLYFLDPETF 159 (264)
T ss_dssp -SSEEEEEETTTT--EEEEEEE---SSS--EEEECSSCEEEE-S----------SSEEEEE-TTT-
T ss_pred -CCeEEEEccccc--eEEEEEe--cCCcceEEEcCCCEEEEECC----------ccceEEECCccc
Confidence 467899998764 4554443 45566678877888888877 367888888763
No 83
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=80.16 E-value=33 Score=28.47 Aligned_cols=93 Identities=9% Similarity=0.048 Sum_probs=44.9
Q ss_pred HHHhhcc-cceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEee------ecCCCCC
Q psy14331 27 IEAEEKR-KNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNLF------VSAPGAP 99 (265)
Q Consensus 27 ~~~~d~~-~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~w------~~~~~~p 99 (265)
+..|+.. ..++.. ....+.+...+.+++.++++.+|+..-. .+.+..|++- ......+
T Consensus 59 i~~~~~~~~g~l~~-~~~~~~~~~p~~i~~~~~g~~l~v~~~~--------------~~~v~v~~~~~~g~~~~~~~~~~ 123 (330)
T PRK11028 59 VLSYRIADDGALTF-AAESPLPGSPTHISTDHQGRFLFSASYN--------------ANCVSVSPLDKDGIPVAPIQIIE 123 (330)
T ss_pred EEEEEECCCCceEE-eeeecCCCCceEEEECCCCCEEEEEEcC--------------CCeEEEEEECCCCCCCCceeecc
Confidence 4445554 344543 2222222233456666677788886431 2455556551 1111122
Q ss_pred CCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCC
Q psy14331 100 PPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK 146 (265)
Q Consensus 100 ~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~ 146 (265)
.....|.++ +...++.+|+..-. .+.+.+||+.+.
T Consensus 124 ~~~~~~~~~-~~p~g~~l~v~~~~-----------~~~v~v~d~~~~ 158 (330)
T PRK11028 124 GLEGCHSAN-IDPDNRTLWVPCLK-----------EDRIRLFTLSDD 158 (330)
T ss_pred CCCcccEeE-eCCCCCEEEEeeCC-----------CCEEEEEEECCC
Confidence 222335443 33345567765432 356889998763
No 84
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=78.88 E-value=47 Score=29.56 Aligned_cols=138 Identities=12% Similarity=0.104 Sum_probs=67.4
Q ss_pred ccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe------eecCCCCCCCc----
Q psy14331 33 RKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL------FVSAPGAPPPR---- 102 (265)
Q Consensus 33 ~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~------w~~~~~~p~~r---- 102 (265)
.+..|...... ......+.++. ++++|+... ...++.+|. |......+..+
T Consensus 39 ~~~~W~~~~~~--~~~~~~sPvv~--~g~vy~~~~---------------~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~ 99 (488)
T cd00216 39 LKVAWTFSTGD--ERGQEGTPLVV--DGDMYFTTS---------------HSALFALDAATGKVLWRYDPKLPADRGCCD 99 (488)
T ss_pred ceeeEEEECCC--CCCcccCCEEE--CCEEEEeCC---------------CCcEEEEECCCChhhceeCCCCCccccccc
Confidence 34567653221 11222334443 889998765 346777777 66543322111
Q ss_pred -cccceEEEecCC-CEEEEEcCcccCCCccccccCCceeEEEcCCC--cEEEEecCCC-CCCCcCceEEEECCEEEEEcC
Q psy14331 103 -CSHQMVALSADK-GQLWMFGGEFSSASESQFHHYKDLWVFRMGEK--KWEKIVCKDT-PPSRSGHRMIALKKHLVVFGG 177 (265)
Q Consensus 103 -~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~~-p~~~~~~~~~~~~~~iyv~GG 177 (265)
.....+.. + ++||+... ...++.+|.+++ .|+.-..... +......+.++.++.+|+ |.
T Consensus 100 ~~~~g~~~~---~~~~V~v~~~------------~g~v~AlD~~TG~~~W~~~~~~~~~~~~~i~ssP~v~~~~v~v-g~ 163 (488)
T cd00216 100 VVNRGVAYW---DPRKVFFGTF------------DGRLVALDAETGKQVWKFGNNDQVPPGYTMTGAPTIVKKLVII-GS 163 (488)
T ss_pred cccCCcEEc---cCCeEEEecC------------CCeEEEEECCCCCEeeeecCCCCcCcceEecCCCEEECCEEEE-ec
Confidence 11112222 5 77887543 246889998876 4765422110 000012233444666654 43
Q ss_pred ccCCCCccccCCceEEEEcCCC--ceEEec
Q psy14331 178 FHDNLREAKYYNDVHIFDLETY--AWKKIE 205 (265)
Q Consensus 178 ~~~~~~~~~~~~~v~~y~~~~~--~W~~~~ 205 (265)
.+...........++++|..+. .|+.-.
T Consensus 164 ~~~~~~~~~~~g~v~alD~~TG~~~W~~~~ 193 (488)
T cd00216 164 SGAEFFACGVRGALRAYDVETGKLLWRFYT 193 (488)
T ss_pred cccccccCCCCcEEEEEECCCCceeeEeec
Confidence 2211000012357899999875 587544
No 85
>KOG2048|consensus
Probab=78.81 E-value=53 Score=30.07 Aligned_cols=94 Identities=13% Similarity=0.212 Sum_probs=58.2
Q ss_pred eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC---CCCCCcCceEEEE
Q psy14331 92 FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD---TPPSRSGHRMIAL 168 (265)
Q Consensus 92 w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~---~p~~~~~~~~~~~ 168 (265)
-+....+|..+...+.......++++++.. .+ ..+++.++.++.+-.++.... ..++...-.....
T Consensus 418 ~~~v~~~~~~~~~a~~i~ftid~~k~~~~s-~~----------~~~le~~el~~ps~kel~~~~~~~~~~~I~~l~~Ssd 486 (691)
T KOG2048|consen 418 VINVDDVPLALLDASAISFTIDKNKLFLVS-KN----------IFSLEEFELETPSFKELKSIQSQAKCPSISRLVVSSD 486 (691)
T ss_pred EEEeccchhhhccceeeEEEecCceEEEEe-cc----------cceeEEEEecCcchhhhhccccccCCCcceeEEEcCC
Confidence 566778888887777766666688888886 11 345677777666555543332 1112222122223
Q ss_pred CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 169 KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 169 ~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
++.|-++++ ...|.+||+++.+-..+..
T Consensus 487 G~yiaa~~t----------~g~I~v~nl~~~~~~~l~~ 514 (691)
T KOG2048|consen 487 GNYIAAIST----------RGQIFVYNLETLESHLLKV 514 (691)
T ss_pred CCEEEEEec----------cceEEEEEcccceeecchh
Confidence 667777775 3568999999987776664
No 86
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=78.62 E-value=29 Score=26.95 Aligned_cols=66 Identities=14% Similarity=0.195 Sum_probs=43.2
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCc--EEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCce
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKK--WEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDV 191 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~--W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v 191 (265)
++.+|+..+ ...++++|..+++ |+.-.. .+. .......++.+|+... -+.+
T Consensus 36 ~~~v~~~~~------------~~~l~~~d~~tG~~~W~~~~~----~~~-~~~~~~~~~~v~v~~~----------~~~l 88 (238)
T PF13360_consen 36 GGRVYVASG------------DGNLYALDAKTGKVLWRFDLP----GPI-SGAPVVDGGRVYVGTS----------DGSL 88 (238)
T ss_dssp TTEEEEEET------------TSEEEEEETTTSEEEEEEECS----SCG-GSGEEEETTEEEEEET----------TSEE
T ss_pred CCEEEEEcC------------CCEEEEEECCCCCEEEEeecc----ccc-cceeeecccccccccc----------eeee
Confidence 888888843 3679999998875 655431 222 2224667888888763 2369
Q ss_pred EEEEcCCC--ceE-Eecc
Q psy14331 192 HIFDLETY--AWK-KIEP 206 (265)
Q Consensus 192 ~~y~~~~~--~W~-~~~~ 206 (265)
+.+|..+. .|+ ....
T Consensus 89 ~~~d~~tG~~~W~~~~~~ 106 (238)
T PF13360_consen 89 YALDAKTGKVLWSIYLTS 106 (238)
T ss_dssp EEEETTTSCEEEEEEE-S
T ss_pred EecccCCcceeeeecccc
Confidence 99998775 598 4444
No 87
>PRK03629 tolB translocation protein TolB; Provisional
Probab=77.20 E-value=49 Score=28.86 Aligned_cols=61 Identities=10% Similarity=0.107 Sum_probs=35.2
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|+.++.-+.+.... .........-++ .|++.....+ ...++.+|+.+..++.+..
T Consensus 311 ~~Iy~~d~~~g~~~~lt~~~---~~~~~~~~SpDG~~Ia~~~~~~g-------~~~I~~~dl~~g~~~~Lt~ 372 (429)
T PRK03629 311 PQVYKVNINGGAPQRITWEG---SQNQDADVSSDGKFMVMVSSNGG-------QQHIAKQDLATGGVQVLTD 372 (429)
T ss_pred ceEEEEECCCCCeEEeecCC---CCccCEEECCCCCEEEEEEccCC-------CceEEEEECCCCCeEEeCC
Confidence 47889999888766664321 111111222244 4444433221 2468999999998887775
No 88
>PRK00178 tolB translocation protein TolB; Provisional
Probab=76.80 E-value=49 Score=28.69 Aligned_cols=144 Identities=13% Similarity=0.094 Sum_probs=71.3
Q ss_pred HHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCC
Q psy14331 26 DIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPP 101 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~ 101 (265)
.++.+|..+.+-.. ....+. ........|++++|++..- .++ ..+++.+|. ...+...
T Consensus 224 ~l~~~~l~~g~~~~-l~~~~g--~~~~~~~SpDG~~la~~~~--~~g----------~~~Iy~~d~~~~~~~~lt~~--- 285 (430)
T PRK00178 224 RIFVQNLDTGRREQ-ITNFEG--LNGAPAWSPDGSKLAFVLS--KDG----------NPEIYVMDLASRQLSRVTNH--- 285 (430)
T ss_pred EEEEEECCCCCEEE-ccCCCC--CcCCeEECCCCCEEEEEEc--cCC----------CceEEEEECCCCCeEEcccC---
Confidence 35556666555544 222221 1123444456667765432 111 357888888 2222221
Q ss_pred ccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccC
Q psy14331 102 RCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHD 180 (265)
Q Consensus 102 r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~ 180 (265)
............+++|++..... + ..+++.+|..+++++.+.... .........- ++.|++....++
T Consensus 286 ~~~~~~~~~spDg~~i~f~s~~~--g-------~~~iy~~d~~~g~~~~lt~~~---~~~~~~~~Spdg~~i~~~~~~~~ 353 (430)
T PRK00178 286 PAIDTEPFWGKDGRTLYFTSDRG--G-------KPQIYKVNVNGGRAERVTFVG---NYNARPRLSADGKTLVMVHRQDG 353 (430)
T ss_pred CCCcCCeEECCCCCEEEEEECCC--C-------CceEEEEECCCCCEEEeecCC---CCccceEECCCCCEEEEEEccCC
Confidence 11111222333345666553221 1 356999999998888775321 1111112222 345555443221
Q ss_pred CCCccccCCceEEEEcCCCceEEecc
Q psy14331 181 NLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 181 ~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
...|+.+|+.+...+.+..
T Consensus 354 -------~~~l~~~dl~tg~~~~lt~ 372 (430)
T PRK00178 354 -------NFHVAAQDLQRGSVRILTD 372 (430)
T ss_pred -------ceEEEEEECCCCCEEEccC
Confidence 2359999999988877765
No 89
>KOG2321|consensus
Probab=76.51 E-value=58 Score=29.38 Aligned_cols=52 Identities=8% Similarity=0.166 Sum_probs=31.9
Q ss_pred CCceeEEEcCCCcEEEEecCCCCCCCcCceEEEEC--CEEEEEcCccCCCCccccCCceEEEEcCCC
Q psy14331 135 YKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALK--KHLVVFGGFHDNLREAKYYNDVHIFDLETY 199 (265)
Q Consensus 135 ~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~--~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~ 199 (265)
.++++++|++.+.|- .+.. .....-.++.++ ..++.+||.++ .|+.+|+...
T Consensus 154 g~evYRlNLEqGrfL--~P~~--~~~~~lN~v~in~~hgLla~Gt~~g---------~VEfwDpR~k 207 (703)
T KOG2321|consen 154 GSEVYRLNLEQGRFL--NPFE--TDSGELNVVSINEEHGLLACGTEDG---------VVEFWDPRDK 207 (703)
T ss_pred CcceEEEEccccccc--cccc--cccccceeeeecCccceEEecccCc---------eEEEecchhh
Confidence 467999999999884 3332 111222333343 47888998643 4777777654
No 90
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=76.32 E-value=38 Score=27.07 Aligned_cols=64 Identities=17% Similarity=0.153 Sum_probs=34.3
Q ss_pred CCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE--CCEEEEEcCccCCCCccccCCc
Q psy14331 113 DKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL--KKHLVVFGGFHDNLREAKYYND 190 (265)
Q Consensus 113 ~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~~ 190 (265)
.++.+|+.++. .+.+..||..+.+....-... +.+ ..++.. ++.+|+.++. .+.
T Consensus 41 dg~~l~~~~~~-----------~~~v~~~d~~~~~~~~~~~~~-~~~---~~~~~~~~g~~l~~~~~~---------~~~ 96 (300)
T TIGR03866 41 DGKLLYVCASD-----------SDTIQVIDLATGEVIGTLPSG-PDP---ELFALHPNGKILYIANED---------DNL 96 (300)
T ss_pred CCCEEEEEECC-----------CCeEEEEECCCCcEEEeccCC-CCc---cEEEECCCCCEEEEEcCC---------CCe
Confidence 34567777653 245888998887654321111 111 122222 3456665442 245
Q ss_pred eEEEEcCCCc
Q psy14331 191 VHIFDLETYA 200 (265)
Q Consensus 191 v~~y~~~~~~ 200 (265)
+..+|+.+.+
T Consensus 97 l~~~d~~~~~ 106 (300)
T TIGR03866 97 VTVIDIETRK 106 (300)
T ss_pred EEEEECCCCe
Confidence 8889988754
No 91
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=75.86 E-value=44 Score=27.68 Aligned_cols=77 Identities=21% Similarity=0.272 Sum_probs=47.1
Q ss_pred CceeEEEcCCC-----cEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCc-eEEeccCCC
Q psy14331 136 KDLWVFRMGEK-----KWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYA-WKKIEPLGA 209 (265)
Q Consensus 136 ~~~~~yd~~~~-----~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~-W~~~~~~~~ 209 (265)
..+..|+.... +.+.+..... +-.-.+++..++++.+.-| +.+..|++...+ +...+..
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~--~g~V~ai~~~~~~lv~~~g-----------~~l~v~~l~~~~~l~~~~~~-- 126 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEV--KGPVTAICSFNGRLVVAVG-----------NKLYVYDLDNSKTLLKKAFY-- 126 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEE--SS-EEEEEEETTEEEEEET-----------TEEEEEEEETTSSEEEEEEE--
T ss_pred cEEEEEEEEcccccceEEEEEEEEee--cCcceEhhhhCCEEEEeec-----------CEEEEEEccCcccchhhhee--
Confidence 77999999885 5666544432 2223466677888666555 468888888877 8877764
Q ss_pred CCCCCCCceEEEeCCCeEEE
Q psy14331 210 GPAPRSGCQMAATPDGKILI 229 (265)
Q Consensus 210 ~p~~~~~~~~~~~~~~~iyi 229 (265)
..+-....+.+. ++.|++
T Consensus 127 -~~~~~i~sl~~~-~~~I~v 144 (321)
T PF03178_consen 127 -DSPFYITSLSVF-KNYILV 144 (321)
T ss_dssp --BSSSEEEEEEE-TTEEEE
T ss_pred -cceEEEEEEecc-ccEEEE
Confidence 222233444444 565444
No 92
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=75.21 E-value=46 Score=27.57 Aligned_cols=93 Identities=18% Similarity=0.161 Sum_probs=53.1
Q ss_pred CCCEEEEEcCcccCCCccccccCCceeEEEcCCCc-EEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCce
Q psy14331 113 DKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKK-WEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDV 191 (265)
Q Consensus 113 ~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~-W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v 191 (265)
.+++|.+.-| +.+..|+...++ +....... .+....++.+.++.|++---..+ -.+
T Consensus 97 ~~~~lv~~~g-------------~~l~v~~l~~~~~l~~~~~~~--~~~~i~sl~~~~~~I~vgD~~~s--------v~~ 153 (321)
T PF03178_consen 97 FNGRLVVAVG-------------NKLYVYDLDNSKTLLKKAFYD--SPFYITSLSVFKNYILVGDAMKS--------VSL 153 (321)
T ss_dssp ETTEEEEEET-------------TEEEEEEEETTSSEEEEEEE---BSSSEEEEEEETTEEEEEESSSS--------EEE
T ss_pred hCCEEEEeec-------------CEEEEEEccCcccchhhheec--ceEEEEEEeccccEEEEEEcccC--------EEE
Confidence 3777666555 357788888887 88887664 23345566666886665322111 234
Q ss_pred EEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEc
Q psy14331 192 HIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISG 231 (265)
Q Consensus 192 ~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~G 231 (265)
..|+.+..+-..++.. +.++...++..+.++..++++
T Consensus 154 ~~~~~~~~~l~~va~d---~~~~~v~~~~~l~d~~~~i~~ 190 (321)
T PF03178_consen 154 LRYDEENNKLILVARD---YQPRWVTAAEFLVDEDTIIVG 190 (321)
T ss_dssp EEEETTTE-EEEEEEE---SS-BEEEEEEEE-SSSEEEEE
T ss_pred EEEEccCCEEEEEEec---CCCccEEEEEEecCCcEEEEE
Confidence 5678877778888774 445555555555244544443
No 93
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=72.00 E-value=77 Score=28.65 Aligned_cols=70 Identities=11% Similarity=0.162 Sum_probs=40.9
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCC--cEEEEecCC--C-C---CCCcCceEEEECCEEEEEcCccCCCCcc
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK--KWEKIVCKD--T-P---PSRSGHRMIALKKHLVVFGGFHDNLREA 185 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~--~-p---~~~~~~~~~~~~~~iyv~GG~~~~~~~~ 185 (265)
++.||+... ...++.+|..++ .|+.-.... . + ......+.++.++++|+...
T Consensus 69 ~g~vyv~s~------------~g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~-------- 128 (527)
T TIGR03075 69 DGVMYVTTS------------YSRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTL-------- 128 (527)
T ss_pred CCEEEEECC------------CCcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcC--------
Confidence 889998654 245888998876 477543221 0 0 00111234555777776322
Q ss_pred ccCCceEEEEcCCC--ceEEec
Q psy14331 186 KYYNDVHIFDLETY--AWKKIE 205 (265)
Q Consensus 186 ~~~~~v~~y~~~~~--~W~~~~ 205 (265)
-..|+++|..+. .|+.-.
T Consensus 129 --dg~l~ALDa~TGk~~W~~~~ 148 (527)
T TIGR03075 129 --DARLVALDAKTGKVVWSKKN 148 (527)
T ss_pred --CCEEEEEECCCCCEEeeccc
Confidence 246899999875 487543
No 94
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=71.74 E-value=57 Score=27.06 Aligned_cols=152 Identities=15% Similarity=0.218 Sum_probs=63.0
Q ss_pred ccceEEEeccCCCCC-CCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCC-CCCCccccc
Q psy14331 33 RKNKVIEKVVPEPTR-RANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPG-APPPRCSHQ 106 (265)
Q Consensus 33 ~~~~w~~~~~~~p~~-r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~-~p~~r~~~~ 106 (265)
.-.+|.......+.+ ...+..+.. .++..|++|- ..-++.-.- |+..+- .+.|-....
T Consensus 45 GG~tW~~~~~~~~~~~~~~l~~I~f-~~~~g~ivG~---------------~g~ll~T~DgG~tW~~v~l~~~lpgs~~~ 108 (302)
T PF14870_consen 45 GGKTWQPVSLDLDNPFDYHLNSISF-DGNEGWIVGE---------------PGLLLHTTDGGKTWERVPLSSKLPGSPFG 108 (302)
T ss_dssp TTSS-EE-----S-----EEEEEEE-ETTEEEEEEE---------------TTEEEEESSTTSS-EE----TT-SS-EEE
T ss_pred CCccccccccCCCccceeeEEEEEe-cCCceEEEcC---------------CceEEEecCCCCCcEEeecCCCCCCCeeE
Confidence 345687744444433 233332322 3778888764 122222222 877642 233434444
Q ss_pred eEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCcc
Q psy14331 107 MVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREA 185 (265)
Q Consensus 107 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~ 185 (265)
...+. ++.+.++|.. ..+++=.-...+|+.+.... ......+... ++++++++-.
T Consensus 109 i~~l~--~~~~~l~~~~------------G~iy~T~DgG~tW~~~~~~~---~gs~~~~~r~~dG~~vavs~~------- 164 (302)
T PF14870_consen 109 ITALG--DGSAELAGDR------------GAIYRTTDGGKTWQAVVSET---SGSINDITRSSDGRYVAVSSR------- 164 (302)
T ss_dssp EEEEE--TTEEEEEETT--------------EEEESSTTSSEEEEE-S-------EEEEEE-TTS-EEEEETT-------
T ss_pred EEEcC--CCcEEEEcCC------------CcEEEeCCCCCCeeEcccCC---cceeEeEEECCCCcEEEEECc-------
Confidence 44443 6777777542 34555555567999886432 2112222333 5566666543
Q ss_pred ccCCce-EEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEc
Q psy14331 186 KYYNDV-HIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISG 231 (265)
Q Consensus 186 ~~~~~v-~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~G 231 (265)
..+ ...|+-...|+.... +..|.--.+....++.++++.
T Consensus 165 ---G~~~~s~~~G~~~w~~~~r----~~~~riq~~gf~~~~~lw~~~ 204 (302)
T PF14870_consen 165 ---GNFYSSWDPGQTTWQPHNR----NSSRRIQSMGFSPDGNLWMLA 204 (302)
T ss_dssp ---SSEEEEE-TT-SS-EEEE------SSS-EEEEEE-TTS-EEEEE
T ss_pred ---ccEEEEecCCCccceEEcc----CccceehhceecCCCCEEEEe
Confidence 223 346777778988876 334444555555566666654
No 95
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=71.42 E-value=68 Score=27.81 Aligned_cols=71 Identities=8% Similarity=0.009 Sum_probs=34.9
Q ss_pred cEEEEecCCCCCCCcCceEEE-ECCEEEEEcCccCCCCccccCCceEEEEcCCCc-----eEEeccCCCCCCCCC-CceE
Q psy14331 147 KWEKIVCKDTPPSRSGHRMIA-LKKHLVVFGGFHDNLREAKYYNDVHIFDLETYA-----WKKIEPLGAGPAPRS-GCQM 219 (265)
Q Consensus 147 ~W~~~~~~~~p~~~~~~~~~~-~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~-----W~~~~~~~~~p~~~~-~~~~ 219 (265)
.|+.+... .++.-.++.. .++.+++.|... .+..-+-.... |.++.. +..+. -..+
T Consensus 271 ~W~~~~~~---~~~~l~~v~~~~dg~l~l~g~~G----------~l~~S~d~G~~~~~~~f~~~~~----~~~~~~l~~v 333 (398)
T PLN00033 271 YWQPHNRA---SARRIQNMGWRADGGLWLLTRGG----------GLYVSKGTGLTEEDFDFEEADI----KSRGFGILDV 333 (398)
T ss_pred ceEEecCC---CccceeeeeEcCCCCEEEEeCCc----------eEEEecCCCCcccccceeeccc----CCCCcceEEE
Confidence 49988643 3333333333 367888877532 23333333333 444333 21222 2334
Q ss_pred EEeCCCeEEEEcCcc
Q psy14331 220 AATPDGKILISGGYS 234 (265)
Q Consensus 220 ~~~~~~~iyi~GG~~ 234 (265)
....++.++++|...
T Consensus 334 ~~~~d~~~~a~G~~G 348 (398)
T PLN00033 334 GYRSKKEAWAAGGSG 348 (398)
T ss_pred EEcCCCcEEEEECCC
Confidence 444466888888653
No 96
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=70.41 E-value=61 Score=26.87 Aligned_cols=152 Identities=14% Similarity=0.169 Sum_probs=64.3
Q ss_pred ccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCC-CCC-ccccc
Q psy14331 33 RKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGA-PPP-RCSHQ 106 (265)
Q Consensus 33 ~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~-p~~-r~~~~ 106 (265)
..+.|+.+. +|....-..+.+. +.++-|++|- ...+++=+- |+....- +.+ .....
T Consensus 4 ~~~~W~~v~--l~t~~~l~dV~F~-d~~~G~~VG~---------------~g~il~T~DGG~tW~~~~~~~~~~~~~~l~ 65 (302)
T PF14870_consen 4 SGNSWQQVS--LPTDKPLLDVAFV-DPNHGWAVGA---------------YGTILKTTDGGKTWQPVSLDLDNPFDYHLN 65 (302)
T ss_dssp SS--EEEEE---S-SS-EEEEEES-SSS-EEEEET---------------TTEEEEESSTTSS-EE-----S-----EEE
T ss_pred cCCCcEEee--cCCCCceEEEEEe-cCCEEEEEec---------------CCEEEEECCCCccccccccCCCccceeeEE
Confidence 456787743 3333344455555 7889999876 233333222 7765432 222 22222
Q ss_pred eEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCcc
Q psy14331 107 MVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREA 185 (265)
Q Consensus 107 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~ 185 (265)
.+... ++..|++|.. .-++.-.-...+|++++... +.+.....+..+ ++.+.++|..
T Consensus 66 ~I~f~--~~~g~ivG~~------------g~ll~T~DgG~tW~~v~l~~-~lpgs~~~i~~l~~~~~~l~~~~------- 123 (302)
T PF14870_consen 66 SISFD--GNEGWIVGEP------------GLLLHTTDGGKTWERVPLSS-KLPGSPFGITALGDGSAELAGDR------- 123 (302)
T ss_dssp EEEEE--TTEEEEEEET------------TEEEEESSTTSS-EE----T-T-SS-EEEEEEEETTEEEEEETT-------
T ss_pred EEEec--CCceEEEcCC------------ceEEEecCCCCCcEEeecCC-CCCCCeeEEEEcCCCcEEEEcCC-------
Confidence 33332 6788887642 12333333467899986322 233333344444 5677776642
Q ss_pred ccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEc
Q psy14331 186 KYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISG 231 (265)
Q Consensus 186 ~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~G 231 (265)
..|++-.-.-.+|+.+... .. .....+....++++++++
T Consensus 124 ---G~iy~T~DgG~tW~~~~~~---~~-gs~~~~~r~~dG~~vavs 162 (302)
T PF14870_consen 124 ---GAIYRTTDGGKTWQAVVSE---TS-GSINDITRSSDGRYVAVS 162 (302)
T ss_dssp -----EEEESSTTSSEEEEE-S--------EEEEEE-TTS-EEEEE
T ss_pred ---CcEEEeCCCCCCeeEcccC---Cc-ceeEeEEECCCCcEEEEE
Confidence 3455555456789987763 21 222223334455555554
No 97
>PRK04922 tolB translocation protein TolB; Provisional
Probab=70.27 E-value=74 Score=27.75 Aligned_cols=60 Identities=13% Similarity=0.212 Sum_probs=35.9
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEE-E-CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIA-L-KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~-~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|..+++.+.+.... ........ - ++.|++..+.++ ...|+.+|+.+...+.+..
T Consensus 316 ~~iy~~dl~~g~~~~lt~~g----~~~~~~~~SpDG~~Ia~~~~~~~-------~~~I~v~d~~~g~~~~Lt~ 377 (433)
T PRK04922 316 PQIYRVAASGGSAERLTFQG----NYNARASVSPDGKKIAMVHGSGG-------QYRIAVMDLSTGSVRTLTP 377 (433)
T ss_pred ceEEEEECCCCCeEEeecCC----CCccCEEECCCCCEEEEEECCCC-------ceeEEEEECCCCCeEECCC
Confidence 46999999888887775321 11112222 2 445555444211 2368999998888876654
No 98
>PRK05137 tolB translocation protein TolB; Provisional
Probab=69.73 E-value=76 Score=27.68 Aligned_cols=144 Identities=11% Similarity=0.085 Sum_probs=68.9
Q ss_pred HHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCC
Q psy14331 26 DIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPP 101 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~ 101 (265)
.++.+|..+..... ....+.. ..+...-|++++|++... .++ ..++|.+|. ...+...+
T Consensus 227 ~i~~~dl~~g~~~~-l~~~~g~--~~~~~~SPDG~~la~~~~--~~g----------~~~Iy~~d~~~~~~~~Lt~~~-- 289 (435)
T PRK05137 227 RVYLLDLETGQREL-VGNFPGM--TFAPRFSPDGRKVVMSLS--QGG----------NTDIYTMDLRSGTTTRLTDSP-- 289 (435)
T ss_pred EEEEEECCCCcEEE-eecCCCc--ccCcEECCCCCEEEEEEe--cCC----------CceEEEEECCCCceEEccCCC--
Confidence 46667776666554 3322221 123344456777765533 121 357888887 22222211
Q ss_pred ccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEEC-CEEEEEcCccC
Q psy14331 102 RCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALK-KHLVVFGGFHD 180 (265)
Q Consensus 102 r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~-~~iyv~GG~~~ 180 (265)
...........+++|+...... + ..+++.+|..+...+.+... ..........-+ +.|++... ..
T Consensus 290 -~~~~~~~~spDG~~i~f~s~~~--g-------~~~Iy~~d~~g~~~~~lt~~---~~~~~~~~~SpdG~~ia~~~~-~~ 355 (435)
T PRK05137 290 -AIDTSPSYSPDGSQIVFESDRS--G-------SPQLYVMNADGSNPRRISFG---GGRYSTPVWSPRGDLIAFTKQ-GG 355 (435)
T ss_pred -CccCceeEcCCCCEEEEEECCC--C-------CCeEEEEECCCCCeEEeecC---CCcccCeEECCCCCEEEEEEc-CC
Confidence 1111122333344555432211 1 35689999888877776532 111211122223 44544332 11
Q ss_pred CCCccccCCceEEEEcCCCceEEecc
Q psy14331 181 NLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 181 ~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
. ...++.+|+.+...+.+..
T Consensus 356 ~------~~~i~~~d~~~~~~~~lt~ 375 (435)
T PRK05137 356 G------QFSIGVMKPDGSGERILTS 375 (435)
T ss_pred C------ceEEEEEECCCCceEeccC
Confidence 1 2468889987766655543
No 99
>PRK04922 tolB translocation protein TolB; Provisional
Probab=68.57 E-value=80 Score=27.52 Aligned_cols=61 Identities=5% Similarity=0.038 Sum_probs=34.6
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|+.++.-+.+.... .........-++ .|+......+ ...++.+|..+.+.+++..
T Consensus 272 ~~Iy~~d~~~g~~~~lt~~~---~~~~~~~~spDG~~l~f~sd~~g-------~~~iy~~dl~~g~~~~lt~ 333 (433)
T PRK04922 272 PEIYVMDLGSRQLTRLTNHF---GIDTEPTWAPDGKSIYFTSDRGG-------RPQIYRVAASGGSAERLTF 333 (433)
T ss_pred ceEEEEECCCCCeEECccCC---CCccceEECCCCCEEEEEECCCC-------CceEEEEECCCCCeEEeec
Confidence 57999999988766654321 111111222244 4444332211 2468999998888887764
No 100
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=68.16 E-value=77 Score=27.12 Aligned_cols=91 Identities=13% Similarity=0.103 Sum_probs=44.2
Q ss_pred HHHhhcccceEEEeccCCCCCCCcee-eEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe--eecCCCCCCCcc
Q psy14331 27 IEAEEKRKNKVIEKVVPEPTRRANFS-FLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL--FVSAPGAPPPRC 103 (265)
Q Consensus 27 ~~~~d~~~~~w~~~~~~~p~~r~~~~-~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~--w~~~~~~p~~r~ 103 (265)
+...|..+.+-.... +.....|. .++-+++..+|+.+. -..+-.+|+ .......+....
T Consensus 18 v~viD~~t~~~~~~i---~~~~~~h~~~~~s~Dgr~~yv~~r---------------dg~vsviD~~~~~~v~~i~~G~~ 79 (369)
T PF02239_consen 18 VAVIDGATNKVVARI---PTGGAPHAGLKFSPDGRYLYVANR---------------DGTVSVIDLATGKVVATIKVGGN 79 (369)
T ss_dssp EEEEETTT-SEEEEE---E-STTEEEEEE-TT-SSEEEEEET---------------TSEEEEEETTSSSEEEEEE-SSE
T ss_pred EEEEECCCCeEEEEE---cCCCCceeEEEecCCCCEEEEEcC---------------CCeEEEEECCcccEEEEEecCCC
Confidence 555666665543222 22222244 444456888999864 245778888 222333344444
Q ss_pred ccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCc
Q psy14331 104 SHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKK 147 (265)
Q Consensus 104 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~ 147 (265)
.+..+.. . +++..+.+... .+++..+|.++.+
T Consensus 80 ~~~i~~s-~-DG~~~~v~n~~----------~~~v~v~D~~tle 111 (369)
T PF02239_consen 80 PRGIAVS-P-DGKYVYVANYE----------PGTVSVIDAETLE 111 (369)
T ss_dssp EEEEEE----TTTEEEEEEEE----------TTEEEEEETTT--
T ss_pred cceEEEc-C-CCCEEEEEecC----------CCceeEecccccc
Confidence 4444433 2 45544444432 3568889987764
No 101
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=65.81 E-value=69 Score=25.74 Aligned_cols=113 Identities=19% Similarity=0.217 Sum_probs=58.1
Q ss_pred CCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEE---EEecCC----CCCCCcC---ceEE
Q psy14331 97 GAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWE---KIVCKD----TPPSRSG---HRMI 166 (265)
Q Consensus 97 ~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~---~~~~~~----~p~~~~~---~~~~ 166 (265)
.+|.+-.+...+++ ++.+|---.. ++++.+||+.++.-. .++... .|..... .-.+
T Consensus 64 ~Lp~~~~GtG~vVY---ngslYY~~~~-----------s~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~A 129 (250)
T PF02191_consen 64 KLPYPWQGTGHVVY---NGSLYYNKYN-----------SRNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFA 129 (250)
T ss_pred EEeceeccCCeEEE---CCcEEEEecC-----------CceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEE
Confidence 35555566666666 7777665331 578999999998655 232111 0111111 1233
Q ss_pred EECCEEEEEcCccCCCCccccCCceEEEEcCCC----ceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcc
Q psy14331 167 ALKKHLVVFGGFHDNLREAKYYNDVHIFDLETY----AWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYS 234 (265)
Q Consensus 167 ~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~----~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~ 234 (265)
+..+.|+++=....... .-.|-..|+.+. +|..- .+....+.+..++ |.||++.-.+
T Consensus 130 vDE~GLWvIYat~~~~g----~ivvskld~~tL~v~~tw~T~-----~~k~~~~naFmvC--GvLY~~~s~~ 190 (250)
T PF02191_consen 130 VDENGLWVIYATEDNNG----NIVVSKLDPETLSVEQTWNTS-----YPKRSAGNAFMVC--GVLYATDSYD 190 (250)
T ss_pred EcCCCEEEEEecCCCCC----cEEEEeeCcccCceEEEEEec-----cCchhhcceeeEe--eEEEEEEECC
Confidence 34556777654332210 123456777664 45532 2444444443333 7888875443
No 102
>PRK03629 tolB translocation protein TolB; Provisional
Probab=64.18 E-value=99 Score=26.98 Aligned_cols=62 Identities=8% Similarity=0.051 Sum_probs=34.0
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|..+++.+.+.... . ........-+++.++|...... ...+|.+|+.+..-+++..
T Consensus 267 ~~I~~~d~~tg~~~~lt~~~--~-~~~~~~wSPDG~~I~f~s~~~g------~~~Iy~~d~~~g~~~~lt~ 328 (429)
T PRK03629 267 LNLYVMDLASGQIRQVTDGR--S-NNTEPTWFPDSQNLAYTSDQAG------RPQVYKVNINGGAPQRITW 328 (429)
T ss_pred cEEEEEECCCCCEEEccCCC--C-CcCceEECCCCCEEEEEeCCCC------CceEEEEECCCCCeEEeec
Confidence 46999999988877664321 1 1111112224443333322111 2478899998877776654
No 103
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=63.59 E-value=96 Score=26.61 Aligned_cols=61 Identities=7% Similarity=0.096 Sum_probs=35.1
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|+.++..+.+.... ..... ....-++ .|++.....+ ...|+.+|..+..++.+..
T Consensus 258 ~~i~~~d~~~~~~~~l~~~~--~~~~~-~~~s~dg~~l~~~s~~~g-------~~~iy~~d~~~~~~~~l~~ 319 (417)
T TIGR02800 258 PDIYVMDLDGKQLTRLTNGP--GIDTE-PSWSPDGKSIAFTSDRGG-------SPQIYMMDADGGEVRRLTF 319 (417)
T ss_pred ccEEEEECCCCCEEECCCCC--CCCCC-EEECCCCCEEEEEECCCC-------CceEEEEECCCCCEEEeec
Confidence 56999999988776664332 11111 1112233 4544432211 2479999999888877764
No 104
>PF13088 BNR_2: BNR repeat-like domain; PDB: 2F11_A 2F0Z_A 1VCU_B 2F25_B 1SO7_A 2F29_A 1SNT_A 2F13_A 2F28_A 2F27_A ....
Probab=61.08 E-value=35 Score=27.39 Aligned_cols=126 Identities=16% Similarity=0.210 Sum_probs=60.6
Q ss_pred eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcC-CCcEEEEecCCCCCCCcCceEEEE-C
Q psy14331 92 FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMG-EKKWEKIVCKDTPPSRSGHRMIAL-K 169 (265)
Q Consensus 92 w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~-~~~W~~~~~~~~p~~~~~~~~~~~-~ 169 (265)
|......+..........+...++.|+++-... .. ....+.+... ..+|+.......|.+.....++.. +
T Consensus 147 W~~~~~~~~~~~~~e~~~~~~~dG~l~~~~R~~-~~-------~~~~~~~S~D~G~TWs~~~~~~~~~~~~~~~~~~~~~ 218 (275)
T PF13088_consen 147 WSSGSPIPDGQGECEPSIVELPDGRLLAVFRTE-GN-------DDIYISRSTDGGRTWSPPQPTNLPNPNSSISLVRLSD 218 (275)
T ss_dssp EEEEEECECSEEEEEEEEEEETTSEEEEEEEEC-SS-------TEEEEEEESSTTSS-EEEEEEECSSCCEEEEEEECTT
T ss_pred eeccccccccCCcceeEEEECCCCcEEEEEEcc-CC-------CcEEEEEECCCCCcCCCceecccCcccCCceEEEcCC
Confidence 776655533222222333333478888886542 11 1123333333 347998765444555544444444 5
Q ss_pred CEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCC-ceEEEeCCCeEEE
Q psy14331 170 KHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSG-CQMAATPDGKILI 229 (265)
Q Consensus 170 ~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~-~~~~~~~~~~iyi 229 (265)
+.++++....... ..-.++.-.-...+|.........+....+ ..++...+++|+|
T Consensus 219 g~~~~~~~~~~~r----~~l~l~~S~D~g~tW~~~~~i~~~~~~~~~Y~~~~~~~dg~l~i 275 (275)
T PF13088_consen 219 GRLLLVYNNPDGR----SNLSLYVSEDGGKTWSRPKTIDDGPNGDSGYPSLTQLPDGKLYI 275 (275)
T ss_dssp SEEEEEEECSSTS----EEEEEEEECTTCEEEEEEEEEEEEE-CCEEEEEEEEEETTEEEE
T ss_pred CCEEEEEECCCCC----CceEEEEEeCCCCcCCccEEEeCCCCCcEECCeeEEeCCCcCCC
Confidence 6777777621111 112233333336789877654222322233 3444554678886
No 105
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=60.62 E-value=1e+02 Score=25.96 Aligned_cols=150 Identities=15% Similarity=0.049 Sum_probs=81.2
Q ss_pred HHHhhcccceEEEeccCCCCCCCc-eeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe-------eecCCCC
Q psy14331 27 IEAEEKRKNKVIEKVVPEPTRRAN-FSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL-------FVSAPGA 98 (265)
Q Consensus 27 ~~~~d~~~~~w~~~~~~~p~~r~~-~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~-------w~~~~~~ 98 (265)
+..|+.............-.+-.+ -.+++.|.++.+|++.=. ++ .-.++.||. -.....|
T Consensus 169 i~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL--~s----------tV~v~~y~~~~g~~~~lQ~i~tl 236 (346)
T COG2706 169 IFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNEL--NS----------TVDVLEYNPAVGKFEELQTIDTL 236 (346)
T ss_pred EEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEecc--CC----------EEEEEEEcCCCceEEEeeeeccC
Confidence 556666666665433311112121 136777889999998542 11 224566666 2233334
Q ss_pred CC---CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEE--EcCCCcEEEEecCC--CCCCCcCceEEEECCE
Q psy14331 99 PP---PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVF--RMGEKKWEKIVCKD--TPPSRSGHRMIALKKH 171 (265)
Q Consensus 99 p~---~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~y--d~~~~~W~~~~~~~--~p~~~~~~~~~~~~~~ 171 (265)
|. .-...+++.+..+++.||+.-.. .+.+.+| |+.+++-..+...+ .-.||.. ....-++.
T Consensus 237 P~dF~g~~~~aaIhis~dGrFLYasNRg-----------~dsI~~f~V~~~~g~L~~~~~~~teg~~PR~F-~i~~~g~~ 304 (346)
T COG2706 237 PEDFTGTNWAAAIHISPDGRFLYASNRG-----------HDSIAVFSVDPDGGKLELVGITPTEGQFPRDF-NINPSGRF 304 (346)
T ss_pred ccccCCCCceeEEEECCCCCEEEEecCC-----------CCeEEEEEEcCCCCEEEEEEEeccCCcCCccc-eeCCCCCE
Confidence 42 34556677777777788887543 2344444 55566554443322 2246643 33333444
Q ss_pred EEEEcCccCCCCccccCCceEEEEcCCCceEEeccC
Q psy14331 172 LVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 172 iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
|++. +.+++ .-.+++-|.++.+-..+...
T Consensus 305 Liaa-~q~sd------~i~vf~~d~~TG~L~~~~~~ 333 (346)
T COG2706 305 LIAA-NQKSD------NITVFERDKETGRLTLLGRY 333 (346)
T ss_pred EEEE-ccCCC------cEEEEEEcCCCceEEecccc
Confidence 5554 44333 23566778888888888774
No 106
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=60.61 E-value=81 Score=24.81 Aligned_cols=105 Identities=12% Similarity=0.032 Sum_probs=53.4
Q ss_pred cCceEEEEe---eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC--C-
Q psy14331 83 LDHLILMNL---FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD--T- 156 (265)
Q Consensus 83 ~~~~~~yd~---w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~--~- 156 (265)
-..++++|+ ....-..+. .. .+.+...++.+|+.... .+..+|+.+++++.+.... .
T Consensus 21 ~~~i~~~~~~~~~~~~~~~~~---~~-G~~~~~~~g~l~v~~~~-------------~~~~~d~~~g~~~~~~~~~~~~~ 83 (246)
T PF08450_consen 21 GGRIYRVDPDTGEVEVIDLPG---PN-GMAFDRPDGRLYVADSG-------------GIAVVDPDTGKVTVLADLPDGGV 83 (246)
T ss_dssp TTEEEEEETTTTEEEEEESSS---EE-EEEEECTTSEEEEEETT-------------CEEEEETTTTEEEEEEEEETTCS
T ss_pred CCEEEEEECCCCeEEEEecCC---Cc-eEEEEccCCEEEEEEcC-------------ceEEEecCCCcEEEEeeccCCCc
Confidence 357888888 222212222 22 22333236888888652 2456699999998876542 1
Q ss_pred CCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 157 PPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 157 p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
+..+..-.++.-++.+|+-.-....... .....++++++. .+.+.+..
T Consensus 84 ~~~~~ND~~vd~~G~ly~t~~~~~~~~~-~~~g~v~~~~~~-~~~~~~~~ 131 (246)
T PF08450_consen 84 PFNRPNDVAVDPDGNLYVTDSGGGGASG-IDPGSVYRIDPD-GKVTVVAD 131 (246)
T ss_dssp CTEEEEEEEE-TTS-EEEEEECCBCTTC-GGSEEEEEEETT-SEEEEEEE
T ss_pred ccCCCceEEEcCCCCEEEEecCCCcccc-ccccceEEECCC-CeEEEEec
Confidence 1222222222236777773311111000 001579999999 66555544
No 107
>KOG1332|consensus
Probab=59.88 E-value=90 Score=25.06 Aligned_cols=55 Identities=18% Similarity=0.275 Sum_probs=29.7
Q ss_pred EcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCC-CceEEecc
Q psy14331 142 RMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLET-YAWKKIEP 206 (265)
Q Consensus 142 d~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~-~~W~~~~~ 206 (265)
+-+...|+.--....|.+.+..+-...++.|-+-|| -+.+.++-... .+|+++..
T Consensus 241 ~~e~e~wk~tll~~f~~~~w~vSWS~sGn~LaVs~G----------dNkvtlwke~~~Gkw~~v~~ 296 (299)
T KOG1332|consen 241 DEEYEPWKKTLLEEFPDVVWRVSWSLSGNILAVSGG----------DNKVTLWKENVDGKWEEVGE 296 (299)
T ss_pred cCccCcccccccccCCcceEEEEEeccccEEEEecC----------CcEEEEEEeCCCCcEEEccc
Confidence 334567875433223344443333333455555555 35677776665 58998865
No 108
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=58.41 E-value=1.7e+02 Score=27.89 Aligned_cols=39 Identities=21% Similarity=0.253 Sum_probs=21.3
Q ss_pred EEEECCEEEEEcCccCCCC-ccccCCceEEEEcCCCc--eEEe
Q psy14331 165 MIALKKHLVVFGGFHDNLR-EAKYYNDVHIFDLETYA--WKKI 204 (265)
Q Consensus 165 ~~~~~~~iyv~GG~~~~~~-~~~~~~~v~~y~~~~~~--W~~~ 204 (265)
-++.++.+|+ |+...+.. .......|..||..+.+ |+.-
T Consensus 312 P~V~~g~VIv-G~~v~d~~~~~~~~G~I~A~Da~TGkl~W~~~ 353 (764)
T TIGR03074 312 PLVAGTTVVI-GGRVADNYSTDEPSGVIRAFDVNTGALVWAWD 353 (764)
T ss_pred CEEECCEEEE-EecccccccccCCCcEEEEEECCCCcEeeEEe
Confidence 3455776666 54322210 01224568899998864 7643
No 109
>smart00284 OLF Olfactomedin-like domains.
Probab=55.37 E-value=1.1e+02 Score=24.69 Aligned_cols=158 Identities=13% Similarity=0.037 Sum_probs=78.7
Q ss_pred CCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCC---------ccccceEE
Q psy14331 43 PEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPP---------RCSHQMVA 109 (265)
Q Consensus 43 ~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~---------r~~~~~~~ 109 (265)
.+|.+-.+-..++. ++.+|.--. . ++.+.+||+ -.....+|.+ ..+++-+-
T Consensus 69 ~Lp~~~~GtG~VVY--ngslYY~~~------~--------s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiD 132 (255)
T smart00284 69 PLPHAGQGTGVVVY--NGSLYFNKF------N--------SHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDID 132 (255)
T ss_pred ECCCccccccEEEE--CceEEEEec------C--------CccEEEEECCCCcEEEEEecCccccccccccccCCCccEE
Confidence 35555555555553 777776432 1 578999999 1112223322 22344455
Q ss_pred EecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCC
Q psy14331 110 LSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYN 189 (265)
Q Consensus 110 ~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~ 189 (265)
++++++-|.|+=...++.. .=-+-++|+.+-.-...-.... ..+....++++-+.||+.-...... ..
T Consensus 133 lAvDE~GLWvIYat~~~~g------~ivvSkLnp~tL~ve~tW~T~~-~k~sa~naFmvCGvLY~~~s~~~~~-----~~ 200 (255)
T smart00284 133 LAVDENGLWVIYATEQNAG------KIVISKLNPATLTIENTWITTY-NKRSASNAFMICGILYVTRSLGSKG-----EK 200 (255)
T ss_pred EEEcCCceEEEEeccCCCC------CEEEEeeCcccceEEEEEEcCC-CcccccccEEEeeEEEEEccCCCCC-----cE
Confidence 5555666666533222211 1124467776543222212221 2334446777788899985322111 23
Q ss_pred ceEEEEcCCCceEEeccCCCCCCCCCCceEEEeC--CCeEEEE
Q psy14331 190 DVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATP--DGKILIS 230 (265)
Q Consensus 190 ~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~--~~~iyi~ 230 (265)
-.+.||+.+++=+.+... .+.....++++... +.+||+.
T Consensus 201 I~yayDt~t~~~~~~~i~--f~n~y~~~s~l~YNP~d~~LY~w 241 (255)
T smart00284 201 VFYAYDTNTGKEGHLDIP--FENMYEYISMLDYNPNDRKLYAW 241 (255)
T ss_pred EEEEEECCCCccceeeee--eccccccceeceeCCCCCeEEEE
Confidence 357899998764443332 22222334443221 5678876
No 110
>PRK13684 Ycf48-like protein; Provisional
Probab=54.03 E-value=1.3e+02 Score=25.22 Aligned_cols=131 Identities=14% Similarity=0.114 Sum_probs=61.7
Q ss_pred ccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEE-Ee---eecCCC-CCCCccccce
Q psy14331 33 RKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILM-NL---FVSAPG-APPPRCSHQM 107 (265)
Q Consensus 33 ~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~y-d~---w~~~~~-~p~~r~~~~~ 107 (265)
..+.|+++ . .|....-..+++. +.++.|++|- ...+++= |- |+.... +|........
T Consensus 33 ~~~~W~~~-~-~~~~~~l~~v~F~-d~~~g~avG~---------------~G~il~T~DgG~tW~~~~~~~~~~~~~l~~ 94 (334)
T PRK13684 33 SSSPWQVI-D-LPTEANLLDIAFT-DPNHGWLVGS---------------NRTLLETNDGGETWEERSLDLPEENFRLIS 94 (334)
T ss_pred cCCCcEEE-e-cCCCCceEEEEEe-CCCcEEEEEC---------------CCEEEEEcCCCCCceECccCCcccccceee
Confidence 45579774 3 2333333445555 5667777774 1223332 22 666532 2222222222
Q ss_pred EEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccc
Q psy14331 108 VALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAK 186 (265)
Q Consensus 108 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~ 186 (265)
+... ++..++.|.. ..+++=+-...+|+.+.... ..+.....+..+ ++.+++.|..
T Consensus 95 v~~~--~~~~~~~G~~------------g~i~~S~DgG~tW~~~~~~~-~~~~~~~~i~~~~~~~~~~~g~~-------- 151 (334)
T PRK13684 95 ISFK--GDEGWIVGQP------------SLLLHTTDGGKNWTRIPLSE-KLPGSPYLITALGPGTAEMATNV-------- 151 (334)
T ss_pred eEEc--CCcEEEeCCC------------ceEEEECCCCCCCeEccCCc-CCCCCceEEEEECCCcceeeecc--------
Confidence 2222 4456666531 22343333345899885321 112222233333 3446665532
Q ss_pred cCCceEEEEcCCCceEEecc
Q psy14331 187 YYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 187 ~~~~v~~y~~~~~~W~~~~~ 206 (265)
..+++-+-.-.+|+.+..
T Consensus 152 --G~i~~S~DgG~tW~~~~~ 169 (334)
T PRK13684 152 --GAIYRTTDGGKNWEALVE 169 (334)
T ss_pred --ceEEEECCCCCCceeCcC
Confidence 335555545678998876
No 111
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=52.88 E-value=1.4e+02 Score=25.18 Aligned_cols=70 Identities=10% Similarity=0.221 Sum_probs=43.7
Q ss_pred CCceeEEEcCCCcEEEEecCCC-CCCCcCceEEEEC-CEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCC
Q psy14331 135 YKDLWVFRMGEKKWEKIVCKDT-PPSRSGHRMIALK-KHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGP 211 (265)
Q Consensus 135 ~~~~~~yd~~~~~W~~~~~~~~-p~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p 211 (265)
.+++..|+...+.-+...+... |-.--.|-+.--+ ...|++.-.++. -.++.||+...+.+++.....+|
T Consensus 166 ~Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL~st-------V~v~~y~~~~g~~~~lQ~i~tlP 237 (346)
T COG2706 166 TDRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNELNST-------VDVLEYNPAVGKFEELQTIDTLP 237 (346)
T ss_pred CceEEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEeccCCE-------EEEEEEcCCCceEEEeeeeccCc
Confidence 5779999998777665544432 1222223444443 468888776443 35778888888888887654444
No 112
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=51.73 E-value=1.5e+02 Score=25.29 Aligned_cols=155 Identities=13% Similarity=0.128 Sum_probs=67.4
Q ss_pred HHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe-------eecCCCCC
Q psy14331 27 IEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL-------FVSAPGAP 99 (265)
Q Consensus 27 ~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~-------w~~~~~~p 99 (265)
+..+|..+.+-. ...+......++++-++++.+|+..- . .+.+-.+|. ......++
T Consensus 60 vsviD~~~~~~v---~~i~~G~~~~~i~~s~DG~~~~v~n~--~------------~~~v~v~D~~tle~v~~I~~~~~~ 122 (369)
T PF02239_consen 60 VSVIDLATGKVV---ATIKVGGNPRGIAVSPDGKYVYVANY--E------------PGTVSVIDAETLEPVKTIPTGGMP 122 (369)
T ss_dssp EEEEETTSSSEE---EEEE-SSEEEEEEE--TTTEEEEEEE--E------------TTEEEEEETTT--EEEEEE--EE-
T ss_pred EEEEECCcccEE---EEEecCCCcceEEEcCCCCEEEEEec--C------------CCceeEeccccccceeeccccccc
Confidence 556677666642 22333444455555445666655433 1 345556665 22222222
Q ss_pred C-CccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcC
Q psy14331 100 P-PRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGG 177 (265)
Q Consensus 100 ~-~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG 177 (265)
. .+....+.+++...+..|++--.. ..+++..|....+=...... +..+.-|-...- +++.|+.+
T Consensus 123 ~~~~~~Rv~aIv~s~~~~~fVv~lkd----------~~~I~vVdy~d~~~~~~~~i--~~g~~~~D~~~dpdgry~~va- 189 (369)
T PF02239_consen 123 VDGPESRVAAIVASPGRPEFVVNLKD----------TGEIWVVDYSDPKNLKVTTI--KVGRFPHDGGFDPDGRYFLVA- 189 (369)
T ss_dssp TTTS---EEEEEE-SSSSEEEEEETT----------TTEEEEEETTTSSCEEEEEE--E--TTEEEEEE-TTSSEEEEE-
T ss_pred ccccCCCceeEEecCCCCEEEEEEcc----------CCeEEEEEeccccccceeee--cccccccccccCcccceeeec-
Confidence 2 122222333433456656664432 46788888665421111111 134444444444 33333332
Q ss_pred ccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCce
Q psy14331 178 FHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQ 218 (265)
Q Consensus 178 ~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~ 218 (265)
... .+.+-..|..+.+-..+-..+..|.+.....
T Consensus 190 ~~~-------sn~i~viD~~~~k~v~~i~~g~~p~~~~~~~ 223 (369)
T PF02239_consen 190 ANG-------SNKIAVIDTKTGKLVALIDTGKKPHPGPGAN 223 (369)
T ss_dssp EGG-------GTEEEEEETTTTEEEEEEE-SSSBEETTEEE
T ss_pred ccc-------cceeEEEeeccceEEEEeecccccccccccc
Confidence 211 3578888888876554444333344433333
No 113
>PF15525 DUF4652: Domain of unknown function (DUF4652)
Probab=48.97 E-value=1.2e+02 Score=23.22 Aligned_cols=70 Identities=10% Similarity=0.238 Sum_probs=43.9
Q ss_pred cCCceeEEEcCCCcEEEEecCCCCCCCcCceEEE-E-CC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEeccC
Q psy14331 134 HYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIA-L-KK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 134 ~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~-~-~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
...++|.+|..++.|..+...+.+ ....+--+. + +. .++|+|...+.- .--..|+.|++.++.-+.+...
T Consensus 86 giGkIYIkn~~~~~~~~L~i~~~~-~k~sPK~i~WiDD~~L~vIIG~a~GTv---S~GGnLy~~nl~tg~~~~ly~~ 158 (200)
T PF15525_consen 86 GIGKIYIKNLNNNNWWSLQIDQNE-EKYSPKYIEWIDDNNLAVIIGYAHGTV---SKGGNLYKYNLNTGNLTELYEW 158 (200)
T ss_pred cceeEEEEecCCCceEEEEecCcc-cccCCceeEEecCCcEEEEEccccceE---ccCCeEEEEEccCCceeEeeec
Confidence 478999999999988777544311 122222222 2 33 456666433321 1136799999999999888874
No 114
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=48.39 E-value=42 Score=27.39 Aligned_cols=69 Identities=14% Similarity=0.136 Sum_probs=40.0
Q ss_pred HHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe-eecCCCCCCCccc
Q psy14331 26 DIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL-FVSAPGAPPPRCS 104 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~-w~~~~~~p~~r~~ 104 (265)
.+.+|||+...|.+..-+-..+|.. ++-+. ..+++++- +-. .+.+.+||+ -....-+|.+|-.
T Consensus 255 ~l~rfdPs~~sW~eypLPgs~arpy-s~rVD-~~grVW~s-----ea~---------agai~rfdpeta~ftv~p~pr~n 318 (353)
T COG4257 255 SLHRFDPSVTSWIEYPLPGSKARPY-SMRVD-RHGRVWLS-----EAD---------AGAIGRFDPETARFTVLPIPRPN 318 (353)
T ss_pred eeeEeCcccccceeeeCCCCCCCcc-eeeec-cCCcEEee-----ccc---------cCceeecCcccceEEEecCCCCC
Confidence 5788999999998754344444543 33443 46666662 111 578889999 2222234555554
Q ss_pred cceEEE
Q psy14331 105 HQMVAL 110 (265)
Q Consensus 105 ~~~~~~ 110 (265)
.....+
T Consensus 319 ~gn~ql 324 (353)
T COG4257 319 SGNIQL 324 (353)
T ss_pred CCceec
Confidence 444444
No 115
>PRK04043 tolB translocation protein TolB; Provisional
Probab=48.38 E-value=1.9e+02 Score=25.29 Aligned_cols=62 Identities=11% Similarity=0.150 Sum_probs=39.1
Q ss_pred CCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 135 YKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 135 ~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
..+++.+|..++.++.+.... . ........- +++|+..-...+ ...|+.+|+.+...+++..
T Consensus 256 ~~~Iy~~dl~~g~~~~LT~~~--~-~d~~p~~SPDG~~I~F~Sdr~g-------~~~Iy~~dl~~g~~~rlt~ 318 (419)
T PRK04043 256 QPDIYLYDTNTKTLTQITNYP--G-IDVNGNFVEDDKRIVFVSDRLG-------YPNIFMKKLNSGSVEQVVF 318 (419)
T ss_pred CcEEEEEECCCCcEEEcccCC--C-ccCccEECCCCCEEEEEECCCC-------CceEEEEECCCCCeEeCcc
Confidence 367999999999988875432 1 111111222 446666543322 3579999999988877765
No 116
>PRK02889 tolB translocation protein TolB; Provisional
Probab=45.60 E-value=2.1e+02 Score=24.97 Aligned_cols=61 Identities=11% Similarity=0.060 Sum_probs=33.9
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.++..++..+.+.... .........- ++.|+......+ ...|+.+|+.+...+.+..
T Consensus 308 ~~Iy~~~~~~g~~~~lt~~g---~~~~~~~~SpDG~~Ia~~s~~~g-------~~~I~v~d~~~g~~~~lt~ 369 (427)
T PRK02889 308 PQIYRMPASGGAAQRVTFTG---SYNTSPRISPDGKLLAYISRVGG-------AFKLYVQDLATGQVTALTD 369 (427)
T ss_pred cEEEEEECCCCceEEEecCC---CCcCceEECCCCCEEEEEEccCC-------cEEEEEEECCCCCeEEccC
Confidence 46888888888777765221 1111112222 344544333211 1368999998888776654
No 117
>PRK04043 tolB translocation protein TolB; Provisional
Probab=44.44 E-value=2.2e+02 Score=24.90 Aligned_cols=149 Identities=10% Similarity=0.107 Sum_probs=79.8
Q ss_pred HHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe----eecCCCCCCC
Q psy14331 26 DIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL----FVSAPGAPPP 101 (265)
Q Consensus 26 ~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~----w~~~~~~p~~ 101 (265)
+++.+|..+.+=..+ ...+. .......-|++++|.+.-. .++ ..++|.+|. +..+...+.
T Consensus 214 ~Iyv~dl~tg~~~~l-t~~~g--~~~~~~~SPDG~~la~~~~--~~g----------~~~Iy~~dl~~g~~~~LT~~~~- 277 (419)
T PRK04043 214 TLYKYNLYTGKKEKI-ASSQG--MLVVSDVSKDGSKLLLTMA--PKG----------QPDIYLYDTNTKTLTQITNYPG- 277 (419)
T ss_pred EEEEEECCCCcEEEE-ecCCC--cEEeeEECCCCCEEEEEEc--cCC----------CcEEEEEECCCCcEEEcccCCC-
Confidence 578888877665552 22221 1122334456777776543 121 368999997 444333222
Q ss_pred ccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccC
Q psy14331 102 RCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHD 180 (265)
Q Consensus 102 r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~ 180 (265)
..... ....++++|++...... ..+++.+|..+++.+++.... .... ...-++ .|........
T Consensus 278 -~d~~p-~~SPDG~~I~F~Sdr~g---------~~~Iy~~dl~~g~~~rlt~~g----~~~~-~~SPDG~~Ia~~~~~~~ 341 (419)
T PRK04043 278 -IDVNG-NFVEDDKRIVFVSDRLG---------YPNIFMKKLNSGSVEQVVFHG----KNNS-SVSTYKNYIVYSSRETN 341 (419)
T ss_pred -ccCcc-EECCCCCEEEEEECCCC---------CceEEEEECCCCCeEeCccCC----CcCc-eECCCCCEEEEEEcCCC
Confidence 12222 24344567777654321 357999999999887765322 1122 222234 4444332211
Q ss_pred CCCccccCCceEEEEcCCCceEEeccC
Q psy14331 181 NLREAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 181 ~~~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
. ........++.+|+.+..++.+...
T Consensus 342 ~-~~~~~~~~I~v~d~~~g~~~~LT~~ 367 (419)
T PRK04043 342 N-EFGKNTFNLYLISTNSDYIRRLTAN 367 (419)
T ss_pred c-ccCCCCcEEEEEECCCCCeEECCCC
Confidence 1 1001135799999999999888763
No 118
>PRK02889 tolB translocation protein TolB; Provisional
Probab=43.25 E-value=2.2e+02 Score=24.74 Aligned_cols=61 Identities=2% Similarity=-0.021 Sum_probs=33.4
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.++|.+|..++..+.+... ..........-++ .|+......+ ...+|.++..+...+.+..
T Consensus 264 ~~Iy~~d~~~~~~~~lt~~---~~~~~~~~wSpDG~~l~f~s~~~g-------~~~Iy~~~~~~g~~~~lt~ 325 (427)
T PRK02889 264 SQIYTVNADGSGLRRLTQS---SGIDTEPFFSPDGRSIYFTSDRGG-------APQIYRMPASGGAAQRVTF 325 (427)
T ss_pred ceEEEEECCCCCcEECCCC---CCCCcCeEEcCCCCEEEEEecCCC-------CcEEEEEECCCCceEEEec
Confidence 5799999887776655432 1111112222244 4544322111 2468888888877777654
No 119
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=42.08 E-value=2e+02 Score=23.75 Aligned_cols=61 Identities=15% Similarity=0.092 Sum_probs=41.5
Q ss_pred CCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 135 YKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 135 ~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
...+.+||+.+-.|++-+... -.+|-. ++-+. .+++++.-- -.+.|.+||+++.+++.+..
T Consensus 253 ~g~l~rfdPs~~sW~eypLPg-s~arpy-s~rVD~~grVW~sea---------~agai~rfdpeta~ftv~p~ 314 (353)
T COG4257 253 TGSLHRFDPSVTSWIEYPLPG-SKARPY-SMRVDRHGRVWLSEA---------DAGAIGRFDPETARFTVLPI 314 (353)
T ss_pred CceeeEeCcccccceeeeCCC-CCCCcc-eeeeccCCcEEeecc---------ccCceeecCcccceEEEecC
Confidence 467999999999999876443 234433 33333 556666321 14678999999999998876
No 120
>KOG2321|consensus
Probab=41.09 E-value=2.8e+02 Score=25.32 Aligned_cols=76 Identities=14% Similarity=0.046 Sum_probs=41.8
Q ss_pred CCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe--eecCCCCCCCccccceEEEecCCCEEEEEc
Q psy14331 44 EPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL--FVSAPGAPPPRCSHQMVALSADKGQLWMFG 121 (265)
Q Consensus 44 ~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~--w~~~~~~p~~r~~~~~~~~~~~~~~iyv~G 121 (265)
+-.|+.+.-++...-.--||+.|- -+++|++|+ -..+.++-..-...-++.+.. -..|+++|
T Consensus 130 ~RIP~~GRDm~y~~~scDly~~gs---------------g~evYRlNLEqGrfL~P~~~~~~~lN~v~in~-~hgLla~G 193 (703)
T KOG2321|consen 130 TRIPKFGRDMKYHKPSCDLYLVGS---------------GSEVYRLNLEQGRFLNPFETDSGELNVVSINE-EHGLLACG 193 (703)
T ss_pred eecCcCCccccccCCCccEEEeec---------------CcceEEEEccccccccccccccccceeeeecC-ccceEEec
Confidence 344566666665433445676665 589999999 222333222212222333321 35788888
Q ss_pred CcccCCCccccccCCceeEEEcCCC
Q psy14331 122 GEFSSASESQFHHYKDLWVFRMGEK 146 (265)
Q Consensus 122 G~~~~~~~~~~~~~~~~~~yd~~~~ 146 (265)
|.. +.++.+|+...
T Consensus 194 t~~-----------g~VEfwDpR~k 207 (703)
T KOG2321|consen 194 TED-----------GVVEFWDPRDK 207 (703)
T ss_pred ccC-----------ceEEEecchhh
Confidence 853 44777777654
No 121
>PF07734 FBA_1: F-box associated; InterPro: IPR006527 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=40.62 E-value=1.5e+02 Score=21.82 Aligned_cols=81 Identities=9% Similarity=0.108 Sum_probs=43.9
Q ss_pred EEECCEEEEEcCccCCCCccccCCceEEEEcCCCce-EEeccCCCCCCCCC----CceEEEeCCCeEEEEcCcccccccc
Q psy14331 166 IALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAW-KKIEPLGAGPAPRS----GCQMAATPDGKILISGGYSKQSVKK 240 (265)
Q Consensus 166 ~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W-~~~~~~~~~p~~~~----~~~~~~~~~~~iyi~GG~~~~~~~~ 240 (265)
+.++|.+|=+.-...... ..-|.+||+.+.+. +.++. |.... ...+..+.+++|-++--...
T Consensus 2 V~vnG~~hW~~~~~~~~~----~~~IlsFDl~~E~F~~~~~l----P~~~~~~~~~~~L~~v~~~~L~~~~~~~~----- 68 (164)
T PF07734_consen 2 VFVNGALHWLAYDENNDE----KDFILSFDLSTEKFGRSLPL----PFCNDDDDDSVSLSVVRGDCLCVLYQCDE----- 68 (164)
T ss_pred EEECCEEEeeEEecCCCC----ceEEEEEeccccccCCEECC----CCccCccCCEEEEEEecCCEEEEEEeccC-----
Confidence 345777776665433211 11589999999999 55554 22222 23332344566666622111
Q ss_pred ccccceeeceEEEEec---Cccccccc
Q psy14331 241 DVDKGIVHTDTFLLTP---DSKTLRSC 264 (265)
Q Consensus 241 ~~~~~~~~~~~w~~~~---~~~~w~~~ 264 (265)
...-++|..+. ...+|++-
T Consensus 69 -----~~~~~IWvm~~~~~~~~SWtK~ 90 (164)
T PF07734_consen 69 -----TSKIEIWVMKKYGYGKESWTKL 90 (164)
T ss_pred -----CccEEEEEEeeeccCcceEEEE
Confidence 12356888873 25678763
No 122
>PRK01029 tolB translocation protein TolB; Provisional
Probab=38.91 E-value=2.7e+02 Score=24.37 Aligned_cols=61 Identities=16% Similarity=0.235 Sum_probs=36.3
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++++|+.+++.+.+.... ..... ....-++ .|++.....+ ...++.+|+.+.+.+.+..
T Consensus 351 ~~I~v~dl~~g~~~~Lt~~~--~~~~~-p~wSpDG~~L~f~~~~~g-------~~~L~~vdl~~g~~~~Lt~ 412 (428)
T PRK01029 351 RQICVYDLATGRDYQLTTSP--ENKES-PSWAIDSLHLVYSAGNSN-------ESELYLISLITKKTRKIVI 412 (428)
T ss_pred cEEEEEECCCCCeEEccCCC--CCccc-eEECCCCCEEEEEECCCC-------CceEEEEECCCCCEEEeec
Confidence 57999999999887775331 11111 1222244 4444433211 3579999999888877764
No 123
>PRK05137 tolB translocation protein TolB; Provisional
Probab=38.87 E-value=2.6e+02 Score=24.31 Aligned_cols=61 Identities=5% Similarity=0.055 Sum_probs=34.1
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEECC-EEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKK-HLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.+++.+|+.++.-+.+.... .........-++ .|+...-..+ ...++.+|..+...+.+..
T Consensus 270 ~~Iy~~d~~~~~~~~Lt~~~---~~~~~~~~spDG~~i~f~s~~~g-------~~~Iy~~d~~g~~~~~lt~ 331 (435)
T PRK05137 270 TDIYTMDLRSGTTTRLTDSP---AIDTSPSYSPDGSQIVFESDRSG-------SPQLYVMNADGSNPRRISF 331 (435)
T ss_pred ceEEEEECCCCceEEccCCC---CccCceeEcCCCCEEEEEECCCC-------CCeEEEEECCCCCeEEeec
Confidence 57999999988776664321 111112222234 4443221111 2468999988877777654
No 124
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=38.83 E-value=2.3e+02 Score=23.59 Aligned_cols=102 Identities=11% Similarity=0.104 Sum_probs=60.8
Q ss_pred cccCceEEEEe--------eecCCCCCCCccc-cceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEE
Q psy14331 81 KALDHLILMNL--------FVSAPGAPPPRCS-HQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKI 151 (265)
Q Consensus 81 ~~~~~~~~yd~--------w~~~~~~p~~r~~-~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~ 151 (265)
.-.+.+..||. |...-.-+..-.+ -+-++++..+++|++.-+. ... ---+|..|..++.=+.+
T Consensus 75 NKYSHVH~yd~e~~~VrLLWkesih~~~~WaGEVSdIlYdP~~D~LLlAR~D-Gh~-------nLGvy~ldr~~g~~~~L 146 (339)
T PF09910_consen 75 NKYSHVHEYDTENDSVRLLWKESIHDKTKWAGEVSDILYDPYEDRLLLARAD-GHA-------NLGVYSLDRRTGKAEKL 146 (339)
T ss_pred eccceEEEEEcCCCeEEEEEecccCCccccccchhheeeCCCcCEEEEEecC-Ccc-------eeeeEEEcccCCceeec
Confidence 33678999997 6655444433222 3345676677888887542 111 24588888888887777
Q ss_pred ecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCce
Q psy14331 152 VCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAW 201 (265)
Q Consensus 152 ~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W 201 (265)
...+ ... .+...+..++-+ .+.. .-...|.+||+.+++|
T Consensus 147 ~~~p--s~K---G~~~~D~a~F~i--~~~~----~g~~~i~~~Dli~~~~ 185 (339)
T PF09910_consen 147 SSNP--SLK---GTLVHDYACFGI--NNFH----KGVSGIHCLDLISGKW 185 (339)
T ss_pred cCCC--CcC---ceEeeeeEEEec--cccc----cCCceEEEEEccCCeE
Confidence 6442 221 233333334432 1111 2267899999999999
No 125
>PF14298 DUF4374: Domain of unknown function (DUF4374)
Probab=37.74 E-value=2.9e+02 Score=24.36 Aligned_cols=67 Identities=18% Similarity=0.206 Sum_probs=40.1
Q ss_pred CCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccccccccccccceeeceEEEEecCcccccccC
Q psy14331 188 YNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSKQSVKKDVDKGIVHTDTFLLTPDSKTLRSCL 265 (265)
Q Consensus 188 ~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~~~~~~~~~~~~~~~~~w~~~~~~~~w~~~~ 265 (265)
.+++.++|..+.+...+..+|........- ...+.++++||-=-.... -..-+|++|+.+.+..++|
T Consensus 366 ~~~laI~d~~~kt~t~V~glP~~~is~~~~-~~~ve~G~aYi~Vtt~~g----------~~~~IY~iDp~TatAtKGl 432 (435)
T PF14298_consen 366 AKKLAIFDVSNKTFTWVTGLPADLISGFGN-APYVENGKAYIPVTTEDG----------SDPYIYKIDPATATATKGL 432 (435)
T ss_pred cceEEEEEccCceeEEeccCChhhcccccc-ceEeeCCEEEEEEeecCC----------CceeEEEEcCcccccccce
Confidence 567889999998888877653211112221 234557777763211110 0123999999999888775
No 126
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=36.65 E-value=69 Score=16.93 Aligned_cols=25 Identities=16% Similarity=0.068 Sum_probs=15.3
Q ss_pred eEEEECCEEEEEcCccCCCCccccCCceEEEEcCC
Q psy14331 164 RMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLET 198 (265)
Q Consensus 164 ~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~ 198 (265)
+.++.++.+|+.+. -..++++|+++
T Consensus 16 ~~~v~~g~vyv~~~----------dg~l~ald~~t 40 (40)
T PF13570_consen 16 SPAVAGGRVYVGTG----------DGNLYALDAAT 40 (40)
T ss_dssp --EECTSEEEEE-T----------TSEEEEEETT-
T ss_pred CCEEECCEEEEEcC----------CCEEEEEeCCC
Confidence 44566888888665 25688888764
No 127
>PF13540 RCC1_2: Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=36.49 E-value=59 Score=16.18 Aligned_cols=18 Identities=28% Similarity=0.301 Sum_probs=13.0
Q ss_pred CCceEEEeCCCeEEEEcC
Q psy14331 215 SGCQMAATPDGKILISGG 232 (265)
Q Consensus 215 ~~~~~~~~~~~~iyi~GG 232 (265)
..|+++...++++|..|-
T Consensus 8 ~~ht~al~~~g~v~~wG~ 25 (30)
T PF13540_consen 8 GYHTCALTSDGEVYCWGD 25 (30)
T ss_dssp SSEEEEEE-TTEEEEEE-
T ss_pred CCEEEEEEcCCCEEEEcC
Confidence 457888787899999984
No 128
>PF12217 End_beta_propel: Catalytic beta propeller domain of bacteriophage endosialidase; InterPro: IPR024428 This entry represents the beta propeller domain of endosialidases, which consists of catalytically active part of the enzymes. This core domain forms stable SDS-resistant trimers. There is a nested beta barrel domain in this domain. This domain is typically between 443 and 460 amino acids in length [].; PDB: 1V0E_B 1V0F_E 3JU4_A 3GVL_A 3GVK_B 3GVJ_A.
Probab=35.00 E-value=2.5e+02 Score=22.89 Aligned_cols=65 Identities=12% Similarity=0.073 Sum_probs=36.6
Q ss_pred CceEEEECCEEEEEc-CccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCC--ceEEEeCCCeEEEEcCccc
Q psy14331 162 GHRMIALKKHLVVFG-GFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSG--CQMAATPDGKILISGGYSK 235 (265)
Q Consensus 162 ~~~~~~~~~~iyv~G-G~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~--~~~~~~~~~~iyi~GG~~~ 235 (265)
..++-..++.+|+.- |.... ..-+.+.+-+.....|..+.- |...+. ...+.+ ++.|||||-..+
T Consensus 193 EPCvkyY~g~LyLtTRgt~~~----~~GS~L~rs~d~G~~w~slrf----p~nvHhtnlPFakv-gD~l~mFgsERA 260 (367)
T PF12217_consen 193 EPCVKYYDGVLYLTTRGTLPT----NPGSSLHRSDDNGQNWSSLRF----PNNVHHTNLPFAKV-GDVLYMFGSERA 260 (367)
T ss_dssp EEEEEEETTEEEEEEEES-TT----S---EEEEESSTTSS-EEEE-----TT---SS---EEEE-TTEEEEEEE-SS
T ss_pred cchhhhhCCEEEEEEcCcCCC----CCcceeeeecccCCchhhccc----cccccccCCCceee-CCEEEEEecccc
Confidence 345566699999975 43332 234567788888899999887 332222 222344 899999996543
No 129
>KOG0278|consensus
Probab=34.81 E-value=2.5e+02 Score=22.78 Aligned_cols=83 Identities=20% Similarity=0.157 Sum_probs=44.6
Q ss_pred eeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCc
Q psy14331 138 LWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGC 217 (265)
Q Consensus 138 ~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~ 217 (265)
+-..|+.+. ..++...+|......++ .-+..+||.||.+ -.++.||-.++.=...-..+ -+.+ .|
T Consensus 207 V~Fwdaksf--~~lKs~k~P~nV~SASL-~P~k~~fVaGged---------~~~~kfDy~TgeEi~~~nkg-h~gp--Vh 271 (334)
T KOG0278|consen 207 VKFWDAKSF--GLLKSYKMPCNVESASL-HPKKEFFVAGGED---------FKVYKFDYNTGEEIGSYNKG-HFGP--VH 271 (334)
T ss_pred eEEeccccc--cceeeccCccccccccc-cCCCceEEecCcc---------eEEEEEeccCCceeeecccC-CCCc--eE
Confidence 445555443 23333334444443222 1255899999973 35778888776433221211 1222 24
Q ss_pred eEEEeCCCeEEEEcCccc
Q psy14331 218 QMAATPDGKILISGGYSK 235 (265)
Q Consensus 218 ~~~~~~~~~iyi~GG~~~ 235 (265)
++-...++.+|..|-.+.
T Consensus 272 cVrFSPdGE~yAsGSEDG 289 (334)
T KOG0278|consen 272 CVRFSPDGELYASGSEDG 289 (334)
T ss_pred EEEECCCCceeeccCCCc
Confidence 444456889999986654
No 130
>KOG2445|consensus
Probab=33.55 E-value=2.4e+02 Score=23.53 Aligned_cols=61 Identities=16% Similarity=0.160 Sum_probs=34.9
Q ss_pred CEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCC-----------CCCCCCCceEEEeCCC--eEEEEcC
Q psy14331 170 KHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGA-----------GPAPRSGCQMAATPDG--KILISGG 232 (265)
Q Consensus 170 ~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~-----------~p~~~~~~~~~~~~~~--~iyi~GG 232 (265)
..++++|-...... .-...||.|+-..++|.++..++. +..+|..|-+++..++ .||=+++
T Consensus 184 ~p~iAvgs~e~a~~--~~~~~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~Gr~y~~lAvA~kDgv~I~~v~~ 257 (361)
T KOG2445|consen 184 EPLIAVGSDEDAPH--LNKVKIYEYNENGRKWLKVAELPDHTDPIRDISWAPNIGRSYHLLAVATKDGVRIFKVKV 257 (361)
T ss_pred CceEEEEcccCCcc--ccceEEEEecCCcceeeeehhcCCCCCcceeeeeccccCCceeeEEEeecCcEEEEEEee
Confidence 45777765442211 112346777878889999887654 3345666665544444 3444554
No 131
>KOG0318|consensus
Probab=31.70 E-value=3.6e+02 Score=24.32 Aligned_cols=76 Identities=26% Similarity=0.329 Sum_probs=40.3
Q ss_pred CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeC-CCeEEEEcCcccccccccccccee
Q psy14331 169 KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATP-DGKILISGGYSKQSVKKDVDKGIV 247 (265)
Q Consensus 169 ~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~-~~~iyi~GG~~~~~~~~~~~~~~~ 247 (265)
++...++||.++ +++.|.+....-...... ..-|..-+.+... ++.++..|-....-+.-++.....
T Consensus 454 ~~~~vaVGG~Dg---------kvhvysl~g~~l~ee~~~---~~h~a~iT~vaySpd~~yla~~Da~rkvv~yd~~s~~~ 521 (603)
T KOG0318|consen 454 DGSEVAVGGQDG---------KVHVYSLSGDELKEEAKL---LEHRAAITDVAYSPDGAYLAAGDASRKVVLYDVASREV 521 (603)
T ss_pred CCCEEEEecccc---------eEEEEEecCCcccceeee---ecccCCceEEEECCCCcEEEEeccCCcEEEEEcccCce
Confidence 567777888643 488888887554333221 1123333444444 445555554444443334444445
Q ss_pred eceEEEEec
Q psy14331 248 HTDTFLLTP 256 (265)
Q Consensus 248 ~~~~w~~~~ 256 (265)
.++.|.|.+
T Consensus 522 ~~~~w~FHt 530 (603)
T KOG0318|consen 522 KTNRWAFHT 530 (603)
T ss_pred ecceeeeee
Confidence 666777654
No 132
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=31.29 E-value=2.4e+02 Score=21.54 Aligned_cols=23 Identities=17% Similarity=0.247 Sum_probs=14.2
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCc
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKK 147 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~ 147 (265)
++.+++.++. -..+..||+.+.+
T Consensus 104 ~~~~~~~~~~-----------~~~i~~~~~~~~~ 126 (289)
T cd00200 104 DGRILSSSSR-----------DKTIKVWDVETGK 126 (289)
T ss_pred CCCEEEEecC-----------CCeEEEEECCCcE
Confidence 3466666652 2458889887543
No 133
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=31.05 E-value=90 Score=16.62 Aligned_cols=30 Identities=3% Similarity=-0.044 Sum_probs=16.9
Q ss_pred EEEeccCCCCCCCceeeEEecCCCEEEEECc
Q psy14331 37 VIEKVVPEPTRRANFSFLAHPDKDQLILFGG 67 (265)
Q Consensus 37 w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG 67 (265)
|..........-..+++++. ..+.+||.|-
T Consensus 2 w~~~lG~~~~~~~~~~IavD-~~GNiYv~G~ 31 (38)
T PF06739_consen 2 WSTQLGGPGAQDYGNGIAVD-SNGNIYVTGY 31 (38)
T ss_pred cEEEeCCCCCceeEEEEEEC-CCCCEEEEEe
Confidence 54434422222345666676 4566999886
No 134
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=30.84 E-value=2.4e+02 Score=21.49 Aligned_cols=83 Identities=18% Similarity=0.174 Sum_probs=36.8
Q ss_pred CceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcCCCce-EEeccCCCCCCC
Q psy14331 136 KDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAW-KKIEPLGAGPAP 213 (265)
Q Consensus 136 ~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W-~~~~~~~~~p~~ 213 (265)
..+..||..+.+.. ..... ....-.++... ++.+++.++. ...+..||..+.+- ..+.. ..
T Consensus 199 ~~i~i~d~~~~~~~--~~~~~-~~~~i~~~~~~~~~~~~~~~~~---------~~~i~i~~~~~~~~~~~~~~-----~~ 261 (289)
T cd00200 199 GTIKLWDLSTGKCL--GTLRG-HENGVNSVAFSPDGYLLASGSE---------DGTIRVWDLRTGECVQTLSG-----HT 261 (289)
T ss_pred CcEEEEECCCCcee--cchhh-cCCceEEEEEcCCCcEEEEEcC---------CCcEEEEEcCCceeEEEccc-----cC
Confidence 45788888764332 22210 11111222222 3455555542 24578888875332 22221 11
Q ss_pred CCCceEEEeCCCeEEEEcCccc
Q psy14331 214 RSGCQMAATPDGKILISGGYSK 235 (265)
Q Consensus 214 ~~~~~~~~~~~~~iyi~GG~~~ 235 (265)
..-..+....++..++.++.+.
T Consensus 262 ~~i~~~~~~~~~~~l~~~~~d~ 283 (289)
T cd00200 262 NSVTSLAWSPDGKRLASGSADG 283 (289)
T ss_pred CcEEEEEECCCCCEEEEecCCC
Confidence 1112333333446777766543
No 135
>KOG0308|consensus
Probab=30.73 E-value=2.3e+02 Score=26.21 Aligned_cols=78 Identities=14% Similarity=0.221 Sum_probs=41.3
Q ss_pred ccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEE------ecCCCC-CCCcC-ceEEEE-CCEE
Q psy14331 102 RCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKI------VCKDTP-PSRSG-HRMIAL-KKHL 172 (265)
Q Consensus 102 r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~------~~~~~p-~~~~~-~~~~~~-~~~i 172 (265)
+..-.+.++...++.+++-||.+ ..++.+|.++..=+.+ +...++ .++.. .+.+.- .+.+
T Consensus 117 ~DYVkcla~~ak~~~lvaSgGLD-----------~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ 185 (735)
T KOG0308|consen 117 KDYVKCLAYIAKNNELVASGGLD-----------RKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTI 185 (735)
T ss_pred cchheeeeecccCceeEEecCCC-----------ccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceE
Confidence 34445555544578899999974 4577777765522222 211112 22222 122221 3468
Q ss_pred EEEcCccCCCCccccCCceEEEEcCCC
Q psy14331 173 VVFGGFHDNLREAKYYNDVHIFDLETY 199 (265)
Q Consensus 173 yv~GG~~~~~~~~~~~~~v~~y~~~~~ 199 (265)
++-||. .+++..||+.+.
T Consensus 186 ivsGgt---------ek~lr~wDprt~ 203 (735)
T KOG0308|consen 186 IVSGGT---------EKDLRLWDPRTC 203 (735)
T ss_pred EEecCc---------ccceEEeccccc
Confidence 888876 345666776653
No 136
>PF14830 Haemocyan_bet_s: Haemocyanin beta-sandwich; PDB: 1JS8_B 3QJO_A 1LNL_A.
Probab=30.35 E-value=44 Score=22.70 Aligned_cols=25 Identities=20% Similarity=0.595 Sum_probs=12.8
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEc
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRM 143 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~ 143 (265)
.|.++|+||..+ +. ..++++++||+
T Consensus 36 AG~F~vLGG~~E--Mp---W~FdRlykydI 60 (103)
T PF14830_consen 36 AGTFFVLGGEKE--MP---WAFDRLYKYDI 60 (103)
T ss_dssp EEEEEE---TTS--------B-SS-EEEE-
T ss_pred ccEEEEcCCCcc--Cc---cccCccchhhH
Confidence 589999999643 32 45899999997
No 137
>PF05262 Borrelia_P83: Borrelia P83/100 protein; InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=30.30 E-value=4.1e+02 Score=23.90 Aligned_cols=65 Identities=18% Similarity=0.118 Sum_probs=38.2
Q ss_pred cCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEecc
Q psy14331 134 HYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 134 ~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~ 206 (265)
.++.+..+|+.++.=-.-.+. ...++..+....+.+.+++|.++.. .-.+...|+.+..-..-..
T Consensus 373 ~ls~LvllD~~tg~~l~~S~~---~~Ir~r~~~~~~~~~vaI~g~~G~~-----~ikLvlid~~tLev~kes~ 437 (489)
T PF05262_consen 373 YLSELVLLDSDTGDTLKRSPV---NGIRGRTFYEREDDLVAIAGCSGNA-----AIKLVLIDPETLEVKKESE 437 (489)
T ss_pred cceeEEEEeCCCCceeccccc---ceeccceeEEcCCCEEEEeccCCch-----heEEEecCcccceeeeecc
Confidence 378899999999864333333 2333445666778888888874442 1234444565554444333
No 138
>KOG0649|consensus
Probab=30.19 E-value=3e+02 Score=22.27 Aligned_cols=139 Identities=16% Similarity=0.228 Sum_probs=0.0
Q ss_pred eeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe---eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCC
Q psy14331 52 SFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL---FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSAS 128 (265)
Q Consensus 52 ~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~---w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~ 128 (265)
++..+|..+.++..|| -..+++.|+ -.+..---.--+-|+.+.-. .+-=.+.|+.+..
T Consensus 119 am~ldP~enSi~~AgG---------------D~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~--~~~qilsG~EDGt-- 179 (325)
T KOG0649|consen 119 AMWLDPSENSILFAGG---------------DGVIYQVDLEDGRIQREYRGHTDYVHSVVGRN--ANGQILSGAEDGT-- 179 (325)
T ss_pred eeEeccCCCcEEEecC---------------CeEEEEEEecCCEEEEEEcCCcceeeeeeecc--cCcceeecCCCcc--
Q ss_pred ccccccCCceeEEEcCCCcEEEE------ecCCCCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceE
Q psy14331 129 ESQFHHYKDLWVFRMGEKKWEKI------VCKDTPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWK 202 (265)
Q Consensus 129 ~~~~~~~~~~~~yd~~~~~W~~~------~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~ 202 (265)
+-..|..+.+-.++ +....|.-...-.+...+..-.++|| -..+-.+++.+.+=+
T Consensus 180 ---------vRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGg----------Gp~lslwhLrsse~t 240 (325)
T KOG0649|consen 180 ---------VRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGG----------GPKLSLWHLRSSEST 240 (325)
T ss_pred ---------EEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEecC----------CCceeEEeccCCCce
Q ss_pred EeccCCCCCCCCCCceEEEeCCCeEEEEc-Ccc
Q psy14331 203 KIEPLGAGPAPRSGCQMAATPDGKILISG-GYS 234 (265)
Q Consensus 203 ~~~~~~~~p~~~~~~~~~~~~~~~iyi~G-G~~ 234 (265)
.+-+. +..-..+.+.++.+.+.| |..
T Consensus 241 ~vfpi------pa~v~~v~F~~d~vl~~G~g~~ 267 (325)
T KOG0649|consen 241 CVFPI------PARVHLVDFVDDCVLIGGEGNH 267 (325)
T ss_pred EEEec------ccceeEeeeecceEEEeccccc
No 139
>KOG0278|consensus
Probab=30.00 E-value=3e+02 Score=22.32 Aligned_cols=55 Identities=16% Similarity=0.093 Sum_probs=32.6
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEE-ECCEEEEEcCccCC
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIA-LKKHLVVFGGFHDN 181 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~-~~~~iyv~GG~~~~ 181 (265)
+..+||.||.. -.+++||..++.=...-..+.+.|. |++-. -++.+|..|-.++.
T Consensus 235 ~k~~fVaGged-----------~~~~kfDy~TgeEi~~~nkgh~gpV--hcVrFSPdGE~yAsGSEDGT 290 (334)
T KOG0278|consen 235 KKEFFVAGGED-----------FKVYKFDYNTGEEIGSYNKGHFGPV--HCVRFSPDGELYASGSEDGT 290 (334)
T ss_pred CCceEEecCcc-----------eEEEEEeccCCceeeecccCCCCce--EEEEECCCCceeeccCCCce
Confidence 56899999964 2377888888752222112112222 22211 28899999987665
No 140
>smart00284 OLF Olfactomedin-like domains.
Probab=29.89 E-value=3e+02 Score=22.26 Aligned_cols=141 Identities=11% Similarity=0.044 Sum_probs=70.8
Q ss_pred CCEEEEECcEEeCCCceeeCCCcccCceEEEEe--------eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCcc
Q psy14331 59 KDQLILFGGEFYDGQKFVFGSPKALDHLILMNL--------FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASES 130 (265)
Q Consensus 59 ~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~--------w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~ 130 (265)
++++|++.+.. .. .+.++.|.- ....-.+|.+-.+...+++ ++.+|.--..
T Consensus 34 ~~~~wv~~~~~--~~---------~~~v~ey~~~~~f~~~~~~~~~~Lp~~~~GtG~VVY---ngslYY~~~~------- 92 (255)
T smart00284 34 KSLYWYMPLNT--RV---------LRSVREYSSMSDFQMGKNPTDHPLPHAGQGTGVVVY---NGSLYFNKFN------- 92 (255)
T ss_pred CceEEEEcccc--CC---------CcEEEEecCHHHHhccCCceEEECCCccccccEEEE---CceEEEEecC-------
Confidence 46788876631 11 345556655 2222346666667777777 8888875332
Q ss_pred ccccCCceeEEEcCCCcEEEEecCC-------CCCCCcCc---eEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCc
Q psy14331 131 QFHHYKDLWVFRMGEKKWEKIVCKD-------TPPSRSGH---RMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYA 200 (265)
Q Consensus 131 ~~~~~~~~~~yd~~~~~W~~~~~~~-------~p~~~~~~---~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~ 200 (265)
..++.+||+.+.+-.....++ .|....++ -.++..+.|+++=....... .-.|-..|+.+-.
T Consensus 93 ----s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g----~ivvSkLnp~tL~ 164 (255)
T smart00284 93 ----SHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAG----KIVISKLNPATLT 164 (255)
T ss_pred ----CccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCC----CEEEEeeCcccce
Confidence 467999999998754333221 11111111 12333445655522211100 1234567777644
Q ss_pred eEEeccCCCCCCCCCCceEEEeCCCeEEEEc
Q psy14331 201 WKKIEPLGAGPAPRSGCQMAATPDGKILISG 231 (265)
Q Consensus 201 W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~G 231 (265)
-++.-.++ .+....+.+..++ |.||++-
T Consensus 165 ve~tW~T~-~~k~sa~naFmvC--GvLY~~~ 192 (255)
T smart00284 165 IENTWITT-YNKRSASNAFMIC--GILYVTR 192 (255)
T ss_pred EEEEEEcC-CCcccccccEEEe--eEEEEEc
Confidence 33322221 2343444333333 7888874
No 141
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=29.66 E-value=3.8e+02 Score=23.33 Aligned_cols=130 Identities=15% Similarity=0.172 Sum_probs=64.4
Q ss_pred ccceEEEeccCCCCCCCceeeEEe-cCCCEEEEECcEEeCCCceeeCCCcccCceEEEE-e---eecCCCCCC----Cc-
Q psy14331 33 RKNKVIEKVVPEPTRRANFSFLAH-PDKDQLILFGGEFYDGQKFVFGSPKALDHLILMN-L---FVSAPGAPP----PR- 102 (265)
Q Consensus 33 ~~~~w~~~~~~~p~~r~~~~~~~~-~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd-~---w~~~~~~p~----~r- 102 (265)
.-.+|+++..+......-..+.++ .+.++-+++|- ...+..-+ - |+... .|. .+
T Consensus 73 ~G~~W~q~~~p~~~~~~L~~V~F~~~d~~~GwAVG~---------------~G~IL~T~DGG~tW~~~~-~~~~~~~~~~ 136 (398)
T PLN00033 73 QSSEWEQVDLPIDPGVVLLDIAFVPDDPTHGFLLGT---------------RQTLLETKDGGKTWVPRS-IPSAEDEDFN 136 (398)
T ss_pred CCCccEEeecCCCCCCceEEEEeccCCCCEEEEEcC---------------CCEEEEEcCCCCCceECc-cCcccccccc
Confidence 344798843222222233445553 25678888887 22333332 2 76643 111 12
Q ss_pred cccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC-CCCCCcCceEEEE-CCEEEEEcCccC
Q psy14331 103 CSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD-TPPSRSGHRMIAL-KKHLVVFGGFHD 180 (265)
Q Consensus 103 ~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~-~p~~~~~~~~~~~-~~~iyv~GG~~~ 180 (265)
+....+... ++..|++|-. ..++.=+-...+|+.+.... .|.. ......+ ++.++++|..
T Consensus 137 ~~l~~v~f~--~~~g~~vG~~------------G~il~T~DgG~tW~~~~~~~~~p~~--~~~i~~~~~~~~~ivg~~-- 198 (398)
T PLN00033 137 YRFNSISFK--GKEGWIIGKP------------AILLHTSDGGETWERIPLSPKLPGE--PVLIKATGPKSAEMVTDE-- 198 (398)
T ss_pred cceeeeEEE--CCEEEEEcCc------------eEEEEEcCCCCCceECccccCCCCC--ceEEEEECCCceEEEecc--
Confidence 122222332 6778888532 22333333457899876432 1222 2233344 4567777742
Q ss_pred CCCccccCCceEEEEcCCCceEEe
Q psy14331 181 NLREAKYYNDVHIFDLETYAWKKI 204 (265)
Q Consensus 181 ~~~~~~~~~~v~~y~~~~~~W~~~ 204 (265)
..+++-+-...+|+.+
T Consensus 199 --------G~v~~S~D~G~tW~~~ 214 (398)
T PLN00033 199 --------GAIYVTSNAGRNWKAA 214 (398)
T ss_pred --------ceEEEECCCCCCceEc
Confidence 3355555556789887
No 142
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=29.49 E-value=2.2e+02 Score=20.51 Aligned_cols=52 Identities=13% Similarity=0.095 Sum_probs=27.2
Q ss_pred cCceEEEEe----eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCC
Q psy14331 83 LDHLILMNL----FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK 146 (265)
Q Consensus 83 ~~~~~~yd~----w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~ 146 (265)
.+.+..||. -.....+|.....-....+......+.++||- ..+.-||-+.+
T Consensus 72 ~t~llaYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGGn------------csi~Gfd~~G~ 127 (136)
T PF14781_consen 72 QTSLLAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGGN------------CSIQGFDYEGN 127 (136)
T ss_pred cceEEEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECce------------EEEEEeCCCCc
Confidence 578999999 11222333332222222222224678889984 34566665544
No 143
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=29.22 E-value=3.5e+02 Score=23.43 Aligned_cols=84 Identities=12% Similarity=0.171 Sum_probs=39.7
Q ss_pred cccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEECCEEEEEcCccCCC
Q psy14331 103 CSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIALKKHLVVFGGFHDNL 182 (265)
Q Consensus 103 ~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~~~~iyv~GG~~~~~ 182 (265)
+.+.-+... +++-++|++..+. ...++.+|+++.+=.++.... .....+..++.-++.+|.+-.
T Consensus 37 YF~~~~ft~--dG~kllF~s~~dg--------~~nly~lDL~t~~i~QLTdg~-g~~~~g~~~s~~~~~~~Yv~~----- 100 (386)
T PF14583_consen 37 YFYQNCFTD--DGRKLLFASDFDG--------NRNLYLLDLATGEITQLTDGP-GDNTFGGFLSPDDRALYYVKN----- 100 (386)
T ss_dssp -TTS--B-T--TS-EEEEEE-TTS--------S-EEEEEETTT-EEEE---SS--B-TTT-EE-TTSSEEEEEET-----
T ss_pred eecCCCcCC--CCCEEEEEeccCC--------CcceEEEEcccCEEEECccCC-CCCccceEEecCCCeEEEEEC-----
Confidence 444444443 5656666664332 467899999999888876542 122333333333566655532
Q ss_pred CccccCCceEEEEcCCCceEEeccC
Q psy14331 183 REAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 183 ~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
...|.+.|+.+.+=+.+-..
T Consensus 101 -----~~~l~~vdL~T~e~~~vy~~ 120 (386)
T PF14583_consen 101 -----GRSLRRVDLDTLEERVVYEV 120 (386)
T ss_dssp -----TTEEEEEETTT--EEEEEE-
T ss_pred -----CCeEEEEECCcCcEEEEEEC
Confidence 13678888888776566553
No 144
>smart00155 PLDc Phospholipase D. Active site motifs. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, aspartic acid, and/or asparagine residues which may contribute to the active site. An E. coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not be
Probab=27.73 E-value=84 Score=15.18 Aligned_cols=18 Identities=17% Similarity=0.095 Sum_probs=11.9
Q ss_pred ceEEEeCCCeEEEEcCcc
Q psy14331 217 CQMAATPDGKILISGGYS 234 (265)
Q Consensus 217 ~~~~~~~~~~iyi~GG~~ 234 (265)
|+-..+.+++..++|+.+
T Consensus 6 H~K~~v~D~~~~~iGs~N 23 (28)
T smart00155 6 HTKLMIVDDEIAYIGSAN 23 (28)
T ss_pred EeEEEEEcCCEEEEeCcc
Confidence 444444488888888875
No 145
>KOG0310|consensus
Probab=27.57 E-value=4.4e+02 Score=23.41 Aligned_cols=52 Identities=23% Similarity=0.442 Sum_probs=29.5
Q ss_pred CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcc
Q psy14331 169 KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYS 234 (265)
Q Consensus 169 ~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~ 234 (265)
++.-++-||.++. +-.|| +..|+.+.... .|.+.. .+.+..+++-.++|..+
T Consensus 249 ~~~rLlS~sLD~~---------VKVfd--~t~~Kvv~s~~-~~~pvL--siavs~dd~t~viGmsn 300 (487)
T KOG0310|consen 249 DSTRLLSGSLDRH---------VKVFD--TTNYKVVHSWK-YPGPVL--SIAVSPDDQTVVIGMSN 300 (487)
T ss_pred CCceEeecccccc---------eEEEE--ccceEEEEeee-ccccee--eEEecCCCceEEEeccc
Confidence 4466667777543 66787 45566555532 244433 34444467777777654
No 146
>KOG4649|consensus
Probab=27.55 E-value=3.5e+02 Score=22.20 Aligned_cols=88 Identities=13% Similarity=0.215 Sum_probs=49.6
Q ss_pred CceEEEEe------eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCC--cEEEEecCC
Q psy14331 84 DHLILMNL------FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK--KWEKIVCKD 155 (265)
Q Consensus 84 ~~~~~yd~------w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~ 155 (265)
..+...|+ |+.. ...|...+++++ ++. .|+|-. ...++.++.+++ -|......
T Consensus 33 ~~~~avd~~sG~~~We~i---lg~RiE~sa~vv---gdf-VV~GCy-----------~g~lYfl~~~tGs~~w~f~~~~- 93 (354)
T KOG4649|consen 33 GIVIAVDPQSGNLIWEAI---LGVRIECSAIVV---GDF-VVLGCY-----------SGGLYFLCVKTGSQIWNFVILE- 93 (354)
T ss_pred ceEEEecCCCCcEEeehh---hCceeeeeeEEE---CCE-EEEEEc-----------cCcEEEEEecchhheeeeeehh-
Confidence 34555566 6554 346888888877 666 555543 344777777776 47665432
Q ss_pred CCCCCcCceEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCc--eE
Q psy14331 156 TPPSRSGHRMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYA--WK 202 (265)
Q Consensus 156 ~p~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~--W~ 202 (265)
.....+.+.-++.++.+|-.+ ..+|..|+.+.. |+
T Consensus 94 ---~vk~~a~~d~~~glIycgshd---------~~~yalD~~~~~cVyk 130 (354)
T KOG4649|consen 94 ---TVKVRAQCDFDGGLIYCGSHD---------GNFYALDPKTYGCVYK 130 (354)
T ss_pred ---hhccceEEcCCCceEEEecCC---------CcEEEecccccceEEe
Confidence 222223333344455556543 347788887753 55
No 147
>PTZ00421 coronin; Provisional
Probab=26.85 E-value=4.7e+02 Score=23.50 Aligned_cols=23 Identities=17% Similarity=0.143 Sum_probs=15.9
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCCc
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKK 147 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~ 147 (265)
++.+++.|+.+ ..+..+|+.+++
T Consensus 179 dG~lLatgs~D-----------g~IrIwD~rsg~ 201 (493)
T PTZ00421 179 DGSLLCTTSKD-----------KKLNIIDPRDGT 201 (493)
T ss_pred CCCEEEEecCC-----------CEEEEEECCCCc
Confidence 56777777743 347789988765
No 148
>PF15525 DUF4652: Domain of unknown function (DUF4652)
Probab=26.73 E-value=3e+02 Score=21.17 Aligned_cols=19 Identities=16% Similarity=0.359 Sum_probs=15.4
Q ss_pred CCceEEEEcCCCceEEecc
Q psy14331 188 YNDVHIFDLETYAWKKIEP 206 (265)
Q Consensus 188 ~~~v~~y~~~~~~W~~~~~ 206 (265)
..+||.+|..++.|..+..
T Consensus 87 iGkIYIkn~~~~~~~~L~i 105 (200)
T PF15525_consen 87 IGKIYIKNLNNNNWWSLQI 105 (200)
T ss_pred ceeEEEEecCCCceEEEEe
Confidence 6789999999988876643
No 149
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=25.27 E-value=3.3e+02 Score=21.26 Aligned_cols=92 Identities=17% Similarity=0.178 Sum_probs=54.3
Q ss_pred EecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE--CCEEEEEcCccCCCCcccc
Q psy14331 110 LSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL--KKHLVVFGGFHDNLREAKY 187 (265)
Q Consensus 110 ~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~~ 187 (265)
.+..++.||+..-. ...++++++.+++-+..... . ..+++.. ++.+|+...
T Consensus 7 ~d~~~g~l~~~D~~-----------~~~i~~~~~~~~~~~~~~~~---~---~~G~~~~~~~g~l~v~~~---------- 59 (246)
T PF08450_consen 7 WDPRDGRLYWVDIP-----------GGRIYRVDPDTGEVEVIDLP---G---PNGMAFDRPDGRLYVADS---------- 59 (246)
T ss_dssp EETTTTEEEEEETT-----------TTEEEEEETTTTEEEEEESS---S---EEEEEEECTTSEEEEEET----------
T ss_pred EECCCCEEEEEEcC-----------CCEEEEEECCCCeEEEEecC---C---CceEEEEccCCEEEEEEc----------
Confidence 33346788887322 46799999999876655432 2 2234443 678888754
Q ss_pred CCceEEEEcCCCceEEeccCCC--CCCCCCCceEEEeCCCeEEEE
Q psy14331 188 YNDVHIFDLETYAWKKIEPLGA--GPAPRSGCQMAATPDGKILIS 230 (265)
Q Consensus 188 ~~~v~~y~~~~~~W~~~~~~~~--~p~~~~~~~~~~~~~~~iyi~ 230 (265)
..+..+|+.+.+++.+..... .+..+.+- +++-.++.+|+.
T Consensus 60 -~~~~~~d~~~g~~~~~~~~~~~~~~~~~~ND-~~vd~~G~ly~t 102 (246)
T PF08450_consen 60 -GGIAVVDPDTGKVTVLADLPDGGVPFNRPND-VAVDPDGNLYVT 102 (246)
T ss_dssp -TCEEEEETTTTEEEEEEEEETTCSCTEEEEE-EEE-TTS-EEEE
T ss_pred -CceEEEecCCCcEEEEeeccCCCcccCCCce-EEEcCCCCEEEE
Confidence 345667999999988877421 12222222 334446778874
No 150
>KOG3545|consensus
Probab=25.17 E-value=3.6e+02 Score=21.67 Aligned_cols=149 Identities=11% Similarity=0.021 Sum_probs=0.0
Q ss_pred cccccceeeeecccccHHHHHHHHHHhhcccceEEEeccCCCCCCCcee----------eEEecCCCEEEEECcEEeCCC
Q psy14331 4 IWDRHAMSWTIMFNEDIEKIVRDIEAEEKRKNKVIEKVVPEPTRRANFS----------FLAHPDKDQLILFGGEFYDGQ 73 (265)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~----------~~~~~~~~~lyv~GG~~~~~~ 73 (265)
.|+.-.-+.+.+.--......+.+..||..+..-.. ...+|..-..-+ +-+..+++-|+++ ....
T Consensus 67 ~~~gTg~VVynGs~yynk~~t~~ivky~l~~~~~~~-~~~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWvi----Yat~ 141 (249)
T KOG3545|consen 67 SWDGTGHVVYNGSLYYNKAGTRNIIKYDLETRTVAG-SAALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVI----YATP 141 (249)
T ss_pred CccccceEEEcceEEeeccCCcceEEEEeecceeee-eeeccccccCCCcccccCCCccccceecccceeEE----eccc
Q ss_pred ceeeCCCcccCceEEEEe------eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCc
Q psy14331 74 KFVFGSPKALDHLILMNL------FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKK 147 (265)
Q Consensus 74 ~~~~~~~~~~~~~~~yd~------w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~ 147 (265)
.... .-.+-++|+ -+....++....+.+-++| |.||++-...... ..--+.||..+++
T Consensus 142 ~~~g-----~iv~skLdp~tl~~e~tW~T~~~k~~~~~aF~iC----GvLY~v~S~~~~~-------~~i~yaydt~~~~ 205 (249)
T KOG3545|consen 142 ENAG-----TIVLSKLDPETLEVERTWNTTLPKRSAGNAFMIC----GVLYVVHSYNCTH-------TQISYAYDTTTGT 205 (249)
T ss_pred ccCC-----cEEeeccCHHHhheeeeeccccCCCCcCceEEEe----eeeEEEeccccCC-------ceEEEEEEcCCCc
Q ss_pred EEEEecCCCCCCCcCceEEEE----CCEEEEE
Q psy14331 148 WEKIVCKDTPPSRSGHRMIAL----KKHLVVF 175 (265)
Q Consensus 148 W~~~~~~~~p~~~~~~~~~~~----~~~iyv~ 175 (265)
=..+...- ..++.+...+. +.++|++
T Consensus 206 ~~~~~ipf--~N~y~~~~~idYNP~D~~LY~w 235 (249)
T KOG3545|consen 206 QERIDLPF--PNPYSYATMIDYNPRDRRLYAW 235 (249)
T ss_pred eecccccc--cchhhhhhccCCCcccceeeEe
No 151
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=24.55 E-value=6.4e+02 Score=24.26 Aligned_cols=27 Identities=11% Similarity=0.195 Sum_probs=18.9
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCC--cEEEEe
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK--KWEKIV 152 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~--~W~~~~ 152 (265)
++.||+... .+.++.+|..++ .|+.-.
T Consensus 194 gg~lYv~t~------------~~~V~ALDa~TGk~lW~~d~ 222 (764)
T TIGR03074 194 GDTLYLCTP------------HNKVIALDAATGKEKWKFDP 222 (764)
T ss_pred CCEEEEECC------------CCeEEEEECCCCcEEEEEcC
Confidence 899999855 245788887765 476543
No 152
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=24.47 E-value=7.3e+02 Score=24.93 Aligned_cols=106 Identities=11% Similarity=0.182 Sum_probs=55.8
Q ss_pred EEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC--CCC--------------CC--cCceEEEE-
Q psy14331 108 VALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD--TPP--------------SR--SGHRMIAL- 168 (265)
Q Consensus 108 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~--~p~--------------~~--~~~~~~~~- 168 (265)
+.+...++.||+.-.. .+.+.++|+.++.-..+.... .+. .. .-.+++..
T Consensus 745 IavspdG~~LYVADs~-----------n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~ 813 (1057)
T PLN02919 745 ISLSPDLKELYIADSE-----------SSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAK 813 (1057)
T ss_pred EEEeCCCCEEEEEECC-----------CCeEEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeC
Confidence 4444445569987553 367889998876543322100 000 00 01122222
Q ss_pred CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCC-------CCCC--CCceEEEeCCCeEEEEcCc
Q psy14331 169 KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAG-------PAPR--SGCQMAATPDGKILISGGY 233 (265)
Q Consensus 169 ~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~-------p~~~--~~~~~~~~~~~~iyi~GG~ 233 (265)
++.+||.... .+.|.++|+.+.....+...+.. .... .-++++.-.++++||.-..
T Consensus 814 dG~LYVADs~---------N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~ 878 (1057)
T PLN02919 814 DGQIYVADSY---------NHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTN 878 (1057)
T ss_pred CCcEEEEECC---------CCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECC
Confidence 5678887543 35689999988877766543210 0000 1134444456788887544
No 153
>PRK01742 tolB translocation protein TolB; Provisional
Probab=23.30 E-value=5e+02 Score=22.58 Aligned_cols=180 Identities=13% Similarity=0.088 Sum_probs=0.0
Q ss_pred ccccceeeeecccccHHHHHHHHHHhhcccceEEEeccCCCCCCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccC
Q psy14331 5 WDRHAMSWTIMFNEDIEKIVRDIEAEEKRKNKVIEKVVPEPTRRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALD 84 (265)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~w~~~~~~~p~~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~ 84 (265)
.+...+.+....+... .++.+|..+..-.. ....+. .......-|+++.|++... .++. .
T Consensus 213 PDG~~la~~s~~~~~~-----~i~i~dl~tg~~~~-l~~~~g--~~~~~~wSPDG~~La~~~~--~~g~----------~ 272 (429)
T PRK01742 213 PDGSKLAYVSFENKKS-----QLVVHDLRSGARKV-VASFRG--HNGAPAFSPDGSRLAFASS--KDGV----------L 272 (429)
T ss_pred CCCCEEEEEEecCCCc-----EEEEEeCCCCceEE-EecCCC--ccCceeECCCCCEEEEEEe--cCCc----------E
Q ss_pred ceEEEEe-eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcCc
Q psy14331 85 HLILMNL-FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSGH 163 (265)
Q Consensus 85 ~~~~yd~-w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~~ 163 (265)
++|.+|+ -.....+...............+..|++...... ..++|.++..+..-+.+... . ...
T Consensus 273 ~Iy~~d~~~~~~~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g---------~~~I~~~~~~~~~~~~l~~~---~--~~~ 338 (429)
T PRK01742 273 NIYVMGANGGTPSQLTSGAGNNTEPSWSPDGQSILFTSDRSG---------SPQVYRMSASGGGASLVGGR---G--YSA 338 (429)
T ss_pred EEEEEECCCCCeEeeccCCCCcCCEEECCCCCEEEEEECCCC---------CceEEEEECCCCCeEEecCC---C--CCc
Q ss_pred eEEEECCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCccc
Q psy14331 164 RMIALKKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYSK 235 (265)
Q Consensus 164 ~~~~~~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~~ 235 (265)
.....++.|++.++ +.++.+|+.+..++.+... ..........++++++++....
T Consensus 339 ~~SpDG~~ia~~~~-----------~~i~~~Dl~~g~~~~lt~~------~~~~~~~~sPdG~~i~~~s~~g 393 (429)
T PRK01742 339 QISADGKTLVMING-----------DNVVKQDLTSGSTEVLSST------FLDESPSISPNGIMIIYSSTQG 393 (429)
T ss_pred cCCCCCCEEEEEcC-----------CCEEEEECCCCCeEEecCC------CCCCCceECCCCCEEEEEEcCC
No 154
>PF13859 BNR_3: BNR repeat-like domain; PDB: 3B69_A.
Probab=22.73 E-value=3.1e+02 Score=22.86 Aligned_cols=43 Identities=23% Similarity=0.388 Sum_probs=24.6
Q ss_pred ceEEEEcC-CCceEEeccCCCCCCCCCCceEEEeCCCeEEEEcCcc
Q psy14331 190 DVHIFDLE-TYAWKKIEPLGAGPAPRSGCQMAATPDGKILISGGYS 234 (265)
Q Consensus 190 ~v~~y~~~-~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyi~GG~~ 234 (265)
.+.+|... ..+|+.-..+ .|.....++++-..+++|+++.-.+
T Consensus 150 SlIiYS~d~g~~W~lskg~--s~~gC~~psv~EWe~gkLlM~~~c~ 193 (310)
T PF13859_consen 150 SLIIYSTDDGKTWKLSKGM--SPAGCSDPSVVEWEDGKLLMMTACD 193 (310)
T ss_dssp EEEEEESSTTSS-EE-S------TT-EEEEEEEE-TTEEEEEEE-T
T ss_pred EEEEEECCCccceEecccc--CCCCcceEEEEeccCCeeEEEEecc
Confidence 45678877 5789987765 4566666777777578988875443
No 155
>KOG0289|consensus
Probab=21.56 E-value=5.7e+02 Score=22.58 Aligned_cols=99 Identities=12% Similarity=0.161 Sum_probs=50.2
Q ss_pred ceEEEEe--eecCCCCCCCccccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCCCCCCCcC
Q psy14331 85 HLILMNL--FVSAPGAPPPRCSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKDTPPSRSG 162 (265)
Q Consensus 85 ~~~~yd~--w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~p~~~~~ 162 (265)
.+-.||+ .......|..-..-.++... .+|...+.+- ++ ..+.++|+...+ ..+....+... .
T Consensus 370 ~vkiwdlks~~~~a~Fpght~~vk~i~Fs-ENGY~Lat~a-dd----------~~V~lwDLRKl~--n~kt~~l~~~~-~ 434 (506)
T KOG0289|consen 370 VVKIWDLKSQTNVAKFPGHTGPVKAISFS-ENGYWLATAA-DD----------GSVKLWDLRKLK--NFKTIQLDEKK-E 434 (506)
T ss_pred eEEEEEcCCccccccCCCCCCceeEEEec-cCceEEEEEe-cC----------CeEEEEEehhhc--ccceeeccccc-c
Confidence 4556777 44455555432323333332 2444444433 21 237788876554 22222222222 1
Q ss_pred ceEEEE--CCEEEEEcCccCCCCccccCCceEEEEcCCCceEEeccC
Q psy14331 163 HRMIAL--KKHLVVFGGFHDNLREAKYYNDVHIFDLETYAWKKIEPL 207 (265)
Q Consensus 163 ~~~~~~--~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~~~W~~~~~~ 207 (265)
...... .+...+++|. .-.||.|+-.+.+|+++...
T Consensus 435 v~s~~fD~SGt~L~~~g~---------~l~Vy~~~k~~k~W~~~~~~ 472 (506)
T KOG0289|consen 435 VNSLSFDQSGTYLGIAGS---------DLQVYICKKKTKSWTEIKEL 472 (506)
T ss_pred ceeEEEcCCCCeEEeecc---------eeEEEEEecccccceeeehh
Confidence 122222 4567777763 13467777778999999885
No 156
>KOG3881|consensus
Probab=20.93 E-value=5.5e+02 Score=22.21 Aligned_cols=53 Identities=19% Similarity=0.112 Sum_probs=26.4
Q ss_pred CCceeEEEcCCCcEEEEecCCCCCCCcCceEEEE-CCEEEEEcCccCCCCccccCCceEEEEcCC
Q psy14331 135 YKDLWVFRMGEKKWEKIVCKDTPPSRSGHRMIAL-KKHLVVFGGFHDNLREAKYYNDVHIFDLET 198 (265)
Q Consensus 135 ~~~~~~yd~~~~~W~~~~~~~~p~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~v~~y~~~~ 198 (265)
..++..||..+..-.-..-.. ..-.-.+.... ...+...+|.+. .|-+||.++
T Consensus 268 ~g~l~~FD~r~~kl~g~~~kg--~tGsirsih~hp~~~~las~GLDR---------yvRIhD~kt 321 (412)
T KOG3881|consen 268 KGQLAKFDLRGGKLLGCGLKG--ITGSIRSIHCHPTHPVLASCGLDR---------YVRIHDIKT 321 (412)
T ss_pred cchhheecccCceeeccccCC--ccCCcceEEEcCCCceEEeeccce---------eEEEeeccc
Confidence 467889998877543221111 00000111112 335777777643 355677666
No 157
>KOG0289|consensus
Probab=20.91 E-value=5.9e+02 Score=22.49 Aligned_cols=59 Identities=12% Similarity=0.187 Sum_probs=34.8
Q ss_pred ceEEEEe--eecCCCCCCCc-cccceEEEecCCCEEEEEcCcccCCCccccccCCceeEEEcCCCcEEEEecCC
Q psy14331 85 HLILMNL--FVSAPGAPPPR-CSHQMVALSADKGQLWMFGGEFSSASESQFHHYKDLWVFRMGEKKWEKIVCKD 155 (265)
Q Consensus 85 ~~~~yd~--w~~~~~~p~~r-~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~~W~~~~~~~ 155 (265)
.+.++|+ -......+.+- ........+. .|...+++|. .-.++.|+-.+..|+.+....
T Consensus 412 ~V~lwDLRKl~n~kt~~l~~~~~v~s~~fD~-SGt~L~~~g~-----------~l~Vy~~~k~~k~W~~~~~~~ 473 (506)
T KOG0289|consen 412 SVKLWDLRKLKNFKTIQLDEKKEVNSLSFDQ-SGTYLGIAGS-----------DLQVYICKKKTKSWTEIKELA 473 (506)
T ss_pred eEEEEEehhhcccceeeccccccceeEEEcC-CCCeEEeecc-----------eeEEEEEecccccceeeehhh
Confidence 3788888 33333333222 2333334432 3566666664 245788888899999997664
No 158
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=20.16 E-value=6.3e+02 Score=22.54 Aligned_cols=71 Identities=11% Similarity=0.094 Sum_probs=40.6
Q ss_pred CCEEEEEcCcccCCCccccccCCceeEEEcCCC--cEEEEecCC-C-C-CCCcCceEEEEC-CEEEEEcCccCCCCcccc
Q psy14331 114 KGQLWMFGGEFSSASESQFHHYKDLWVFRMGEK--KWEKIVCKD-T-P-PSRSGHRMIALK-KHLVVFGGFHDNLREAKY 187 (265)
Q Consensus 114 ~~~iyv~GG~~~~~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~-~-p-~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~ 187 (265)
++++|+.... ..++.+|..++ .|+.-.... . . .+.....+++.+ +++|+-..
T Consensus 61 ~g~vy~~~~~------------g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v~~~---------- 118 (488)
T cd00216 61 DGDMYFTTSH------------SALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFFGTF---------- 118 (488)
T ss_pred CCEEEEeCCC------------CcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEEecC----------
Confidence 8899987542 45888898875 487643221 0 0 011111233445 77776432
Q ss_pred CCceEEEEcCCC--ceEEecc
Q psy14331 188 YNDVHIFDLETY--AWKKIEP 206 (265)
Q Consensus 188 ~~~v~~y~~~~~--~W~~~~~ 206 (265)
...|+.+|..+. .|+.-..
T Consensus 119 ~g~v~AlD~~TG~~~W~~~~~ 139 (488)
T cd00216 119 DGRLVALDAETGKQVWKFGNN 139 (488)
T ss_pred CCeEEEEECCCCCEeeeecCC
Confidence 246899998864 5876543
No 159
>KOG1538|consensus
Probab=20.05 E-value=2.7e+02 Score=26.10 Aligned_cols=60 Identities=22% Similarity=0.219 Sum_probs=36.7
Q ss_pred CCCceeeEEecCCCEEEEECcEEeCCCceeeCCCcccCceEEEEe--eecCCCCCCCccccceEEEecCCCEEEEEcCc
Q psy14331 47 RRANFSFLAHPDKDQLILFGGEFYDGQKFVFGSPKALDHLILMNL--FVSAPGAPPPRCSHQMVALSADKGQLWMFGGE 123 (265)
Q Consensus 47 ~r~~~~~~~~~~~~~lyv~GG~~~~~~~~~~~~~~~~~~~~~yd~--w~~~~~~p~~r~~~~~~~~~~~~~~iyv~GG~ 123 (265)
...-|.+++-|+|..+.+..| +.++.||+ -..+.++-.-..---++++ ..+|+.|..||.
T Consensus 12 ~hci~d~afkPDGsqL~lAAg----------------~rlliyD~ndG~llqtLKgHKDtVycVAy-s~dGkrFASG~a 73 (1081)
T KOG1538|consen 12 EHCINDIAFKPDGTQLILAAG----------------SRLLVYDTSDGTLLQPLKGHKDTVYCVAY-AKDGKRFASGSA 73 (1081)
T ss_pred ccchheeEECCCCceEEEecC----------------CEEEEEeCCCcccccccccccceEEEEEE-ccCCceeccCCC
Confidence 335567788889999999988 68899998 2222222211111112222 247888888885
Done!