Query psy5768
Match_columns 652
No_of_seqs 340 out of 3177
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 17:30:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5768.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5768hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 100.0 1E-51 2.2E-56 436.0 26.5 292 279-579 991-1286(1289)
2 KOG1215|consensus 100.0 2.1E-47 4.5E-52 450.0 41.4 480 92-588 202-710 (877)
3 KOG1214|consensus 100.0 5E-37 1.1E-41 325.0 20.1 249 2-265 992-1284(1289)
4 KOG1215|consensus 100.0 1.2E-34 2.5E-39 341.9 35.1 416 9-454 415-877 (877)
5 PF08450 SGL: SMP-30/Gluconola 99.8 4.9E-16 1.1E-20 156.7 27.1 213 316-538 2-232 (246)
6 PLN02919 haloacid dehalogenase 99.7 3.3E-15 7.1E-20 177.4 37.7 299 35-419 565-919 (1057)
7 PLN02919 haloacid dehalogenase 99.7 7.2E-15 1.6E-19 174.6 30.3 210 311-524 565-839 (1057)
8 PF08450 SGL: SMP-30/Gluconola 99.6 3.6E-13 7.8E-18 135.8 29.2 231 40-375 2-245 (246)
9 PF10282 Lactonase: Lactonase, 99.5 1.1E-10 2.5E-15 123.5 39.6 303 110-496 13-344 (345)
10 PRK11028 6-phosphogluconolacto 99.5 6.8E-11 1.5E-15 124.7 37.0 300 100-497 2-327 (330)
11 PRK11028 6-phosphogluconolacto 99.5 8E-10 1.7E-14 116.5 37.4 307 2-409 3-327 (330)
12 PF10282 Lactonase: Lactonase, 99.4 4.5E-09 9.7E-14 111.4 40.6 308 3-408 2-344 (345)
13 COG3386 Gluconolactonase [Carb 99.3 1.1E-10 2.4E-15 119.8 20.6 204 316-528 27-252 (307)
14 TIGR03866 PQQ_ABC_repeats PQQ- 99.2 1.8E-07 4E-12 96.2 37.7 298 2-411 2-300 (300)
15 COG2706 3-carboxymuconate cycl 99.2 1.9E-07 4E-12 94.1 33.8 297 100-479 3-323 (346)
16 KOG4659|consensus 99.2 3.5E-09 7.7E-14 119.5 22.3 148 36-195 363-553 (1899)
17 COG3391 Uncharacterized conser 99.1 2.5E-07 5.5E-12 99.2 32.7 295 13-420 13-317 (381)
18 TIGR03866 PQQ_ABC_repeats PQQ- 99.1 2.7E-06 5.8E-11 87.5 38.0 135 357-497 158-298 (300)
19 TIGR02604 Piru_Ver_Nterm putat 99.0 1.2E-07 2.6E-12 101.3 27.8 250 83-423 13-341 (367)
20 KOG4659|consensus 99.0 2.2E-08 4.9E-13 113.2 22.4 242 314-583 365-679 (1899)
21 PF00058 Ldl_recept_b: Low-den 99.0 1.2E-09 2.5E-14 77.0 6.1 42 143-184 1-42 (42)
22 PF14670 FXa_inhibition: Coagu 99.0 2.2E-10 4.8E-15 76.7 2.2 35 240-274 1-36 (36)
23 PF00058 Ldl_recept_b: Low-den 99.0 8.8E-10 1.9E-14 77.6 5.2 42 414-456 1-42 (42)
24 PF07995 GSDH: Glucose / Sorbo 99.0 4.3E-07 9.4E-12 95.4 27.9 148 37-194 1-200 (331)
25 COG3391 Uncharacterized conser 99.0 1.8E-07 4E-12 100.3 25.5 204 316-527 76-292 (381)
26 PF14670 FXa_inhibition: Coagu 99.0 2.1E-10 4.5E-15 76.9 1.8 35 551-586 1-36 (36)
27 PF06977 SdiA-regulated: SdiA- 99.0 8E-07 1.7E-11 88.4 27.2 199 313-514 21-246 (248)
28 PF07995 GSDH: Glucose / Sorbo 98.9 3.5E-07 7.6E-12 96.1 25.7 222 314-539 2-314 (331)
29 TIGR02658 TTQ_MADH_Hv methylam 98.9 9.4E-06 2E-10 84.8 33.4 280 98-443 11-337 (352)
30 PF00057 Ldl_recept_a: Low-den 98.9 1.6E-09 3.4E-14 73.6 2.8 36 591-626 1-36 (37)
31 COG2706 3-carboxymuconate cycl 98.9 3.7E-05 8.1E-10 77.7 35.0 294 1-390 3-325 (346)
32 cd00112 LDLa Low Density Lipop 98.8 2.4E-09 5.3E-14 71.9 2.4 34 593-626 1-34 (35)
33 PF02239 Cytochrom_D1: Cytochr 98.8 9.6E-05 2.1E-09 78.8 36.6 341 100-539 6-366 (369)
34 TIGR02604 Piru_Ver_Nterm putat 98.7 1.7E-06 3.8E-11 92.4 22.8 198 312-517 12-299 (367)
35 COG3386 Gluconolactonase [Carb 98.7 5.7E-06 1.2E-10 85.2 24.9 143 321-468 118-277 (307)
36 smart00192 LDLa Low-density li 98.6 2E-08 4.2E-13 66.6 2.6 32 593-624 2-33 (33)
37 KOG4499|consensus 98.6 6.3E-06 1.4E-10 78.4 20.0 198 317-527 18-250 (310)
38 TIGR02658 TTQ_MADH_Hv methylam 98.6 0.00033 7.1E-09 73.4 34.8 276 48-387 11-331 (352)
39 PF06977 SdiA-regulated: SdiA- 98.5 2.6E-05 5.7E-10 77.6 22.6 176 4-194 37-240 (248)
40 TIGR03606 non_repeat_PQQ dehyd 98.4 0.00037 8.1E-09 75.2 30.0 149 36-194 28-248 (454)
41 COG3204 Uncharacterized protei 98.4 3.6E-05 7.8E-10 76.1 19.8 190 314-506 86-301 (316)
42 smart00135 LY Low-density lipo 98.4 1E-06 2.2E-11 62.4 6.1 42 440-481 2-43 (43)
43 KOG1520|consensus 98.3 4.6E-05 1E-09 78.4 17.8 146 314-465 115-281 (376)
44 TIGR03606 non_repeat_PQQ dehyd 98.3 7.7E-05 1.7E-09 80.4 20.3 155 311-468 27-250 (454)
45 PF02239 Cytochrom_D1: Cytochr 98.2 0.0072 1.6E-07 64.5 34.1 335 3-407 8-363 (369)
46 KOG1520|consensus 98.2 1.6E-05 3.4E-10 81.7 12.7 173 327-505 77-280 (376)
47 PRK04792 tolB translocation pr 98.1 0.0014 3E-08 72.1 26.5 204 316-526 220-432 (448)
48 smart00135 LY Low-density lipo 98.1 6.4E-06 1.4E-10 58.2 5.4 41 76-121 2-42 (43)
49 PF03022 MRJP: Major royal jel 98.1 0.0018 4E-08 66.5 24.6 175 11-192 34-252 (287)
50 PF03022 MRJP: Major royal jel 98.0 0.0023 5.1E-08 65.7 24.0 148 356-508 61-255 (287)
51 PRK05137 tolB translocation pr 97.9 0.0059 1.3E-07 66.9 26.9 199 315-520 203-414 (435)
52 PRK04922 tolB translocation pr 97.9 0.0045 9.8E-08 67.8 24.9 204 316-526 206-418 (433)
53 TIGR02800 propeller_TolB tol-p 97.9 0.01 2.2E-07 64.6 27.6 204 316-525 192-403 (417)
54 PRK00178 tolB translocation pr 97.8 0.012 2.7E-07 64.3 28.0 202 315-523 200-410 (430)
55 PF03088 Str_synth: Strictosid 97.8 9E-05 2E-09 61.1 7.8 73 406-479 2-89 (89)
56 PRK03629 tolB translocation pr 97.8 0.013 2.8E-07 64.1 27.3 206 315-526 200-413 (429)
57 TIGR03118 PEPCTERM_chp_1 conse 97.8 0.0073 1.6E-07 60.5 22.1 213 311-533 20-294 (336)
58 PRK02889 tolB translocation pr 97.8 0.016 3.4E-07 63.5 27.2 205 316-526 198-410 (427)
59 PRK04043 tolB translocation pr 97.7 0.014 3E-07 63.5 25.8 203 316-526 190-407 (419)
60 COG4257 Vgb Streptogramin lyas 97.7 0.003 6.6E-08 61.9 18.1 207 325-540 72-285 (353)
61 COG3204 Uncharacterized protei 97.7 0.0043 9.4E-08 61.7 18.5 177 3-195 100-303 (316)
62 PRK04043 tolB translocation pr 97.6 0.017 3.8E-07 62.7 24.0 189 326-521 156-360 (419)
63 COG4257 Vgb Streptogramin lyas 97.6 0.054 1.2E-06 53.5 24.3 266 92-425 66-340 (353)
64 PRK05137 tolB translocation pr 97.6 0.021 4.5E-07 62.7 24.3 172 11-197 182-356 (435)
65 PRK04792 tolB translocation pr 97.5 0.02 4.4E-07 63.0 23.7 179 337-521 199-385 (448)
66 PF03088 Str_synth: Strictosid 97.5 0.00095 2.1E-08 55.1 9.5 70 318-387 2-88 (89)
67 PRK01029 tolB translocation pr 97.5 0.1 2.3E-06 56.9 28.1 202 320-523 191-408 (428)
68 PRK03629 tolB translocation pr 97.5 0.034 7.3E-07 60.9 24.1 171 11-196 179-352 (429)
69 TIGR02800 propeller_TolB tol-p 97.4 0.036 7.7E-07 60.3 24.1 179 336-520 170-356 (417)
70 PRK04922 tolB translocation pr 97.4 0.038 8.3E-07 60.5 23.2 179 337-521 185-371 (433)
71 PRK02889 tolB translocation pr 97.4 0.045 9.7E-07 59.9 23.5 181 336-521 176-363 (427)
72 PRK00178 tolB translocation pr 97.3 0.066 1.4E-06 58.6 23.8 180 337-521 180-366 (430)
73 PRK01742 tolB translocation pr 97.3 0.062 1.3E-06 58.8 23.3 201 316-526 206-411 (429)
74 PRK02888 nitrous-oxide reducta 97.2 0.014 3E-07 64.7 16.9 185 324-519 140-352 (635)
75 PF07645 EGF_CA: Calcium-bindi 97.2 0.00034 7.4E-09 49.2 2.9 39 236-274 1-42 (42)
76 PRK01742 tolB translocation pr 97.2 0.064 1.4E-06 58.7 22.1 178 336-522 184-365 (429)
77 cd00200 WD40 WD40 domain, foun 97.0 0.13 2.8E-06 51.0 21.7 176 3-196 107-283 (289)
78 PF12999 PRKCSH-like: Glucosid 97.0 0.00042 9.1E-09 64.1 3.0 53 594-650 34-92 (176)
79 cd00200 WD40 WD40 domain, foun 96.9 0.47 1E-05 46.9 30.8 202 316-527 54-258 (289)
80 PF12662 cEGF: Complement Clr- 96.9 0.0005 1.1E-08 41.4 1.5 22 257-278 1-23 (24)
81 PF02333 Phytase: Phytase; In 96.9 0.39 8.4E-06 50.8 23.7 142 3-162 70-237 (381)
82 TIGR03118 PEPCTERM_chp_1 conse 96.8 0.18 3.8E-06 50.9 19.5 177 353-539 20-252 (336)
83 PF02333 Phytase: Phytase; In 96.8 0.18 3.9E-06 53.2 20.3 117 399-517 153-289 (381)
84 KOG1219|consensus 96.8 0.0013 2.9E-08 79.5 4.9 68 544-625 3865-3935(4289)
85 PF12999 PRKCSH-like: Glucosid 96.8 0.001 2.3E-08 61.5 3.2 56 569-631 55-117 (176)
86 COG2133 Glucose/sorbosone dehy 96.7 0.1 2.2E-06 55.4 18.1 161 356-518 177-397 (399)
87 PF12662 cEGF: Complement Clr- 96.5 0.0015 3.3E-08 39.3 1.6 20 569-588 1-21 (24)
88 COG4946 Uncharacterized protei 96.3 1.9 4.1E-05 45.9 38.8 135 327-466 373-508 (668)
89 PF12947 EGF_3: EGF domain; I 96.3 0.0026 5.7E-08 42.8 1.9 34 240-274 1-36 (36)
90 PF05096 Glu_cyclase_2: Glutam 96.3 1.1 2.4E-05 44.9 21.0 176 286-477 67-261 (264)
91 KOG4499|consensus 96.3 0.069 1.5E-06 51.5 11.9 94 63-165 140-245 (310)
92 PF06433 Me-amine-dh_H: Methyl 96.2 1.9 4.1E-05 44.8 29.7 263 99-423 2-321 (342)
93 PRK01029 tolB translocation pr 96.1 1.5 3.3E-05 47.9 23.5 170 11-195 165-347 (428)
94 COG2133 Glucose/sorbosone dehy 96.1 0.14 3E-06 54.4 14.4 123 37-163 238-398 (399)
95 PF01436 NHL: NHL repeat; Int 96.0 0.021 4.6E-07 36.1 5.0 26 356-382 2-27 (28)
96 PF07645 EGF_CA: Calcium-bindi 96.0 0.0064 1.4E-07 42.7 2.7 37 549-586 3-42 (42)
97 TIGR03032 conserved hypothetic 95.9 1.1 2.3E-05 45.8 19.1 177 357-540 104-314 (335)
98 PF01436 NHL: NHL repeat; Int 95.6 0.024 5.3E-07 35.8 4.1 24 91-115 5-28 (28)
99 COG4946 Uncharacterized protei 95.3 0.57 1.2E-05 49.6 15.0 130 3-150 374-508 (668)
100 TIGR03032 conserved hypothetic 95.3 1.6 3.4E-05 44.6 17.7 99 1-121 18-134 (335)
101 smart00179 EGF_CA Calcium-bind 95.2 0.017 3.6E-07 39.5 2.7 36 237-275 2-39 (39)
102 PF09064 Tme5_EGF_like: Thromb 95.1 0.02 4.4E-07 37.1 2.4 28 555-584 5-32 (34)
103 cd01475 vWA_Matrilin VWA_Matri 95.0 0.014 3.1E-07 57.7 2.4 39 235-273 185-224 (224)
104 COG5276 Uncharacterized conser 94.8 5 0.00011 40.5 23.9 221 277-514 95-325 (370)
105 PF13449 Phytase-like: Esteras 94.4 3 6.5E-05 43.7 18.3 111 354-466 18-166 (326)
106 PF00057 Ldl_recept_a: Low-den 93.9 0.034 7.3E-07 37.8 1.5 20 632-651 1-20 (37)
107 PF14583 Pectate_lyase22: Olig 93.9 5.7 0.00012 42.1 18.5 157 326-487 49-234 (386)
108 PRK02888 nitrous-oxide reducta 93.6 3.2 7E-05 46.6 17.0 147 357-517 236-403 (635)
109 cd01475 vWA_Matrilin VWA_Matri 93.5 0.047 1E-06 54.0 2.6 35 548-583 187-221 (224)
110 KOG0291|consensus 93.4 11 0.00023 42.8 20.2 179 3-197 364-543 (893)
111 PF05096 Glu_cyclase_2: Glutam 93.4 9.5 0.00021 38.3 22.8 159 357-527 46-212 (264)
112 COG4247 Phy 3-phytase (myo-ino 93.3 9.1 0.0002 37.8 18.1 172 3-191 69-264 (364)
113 COG0823 TolB Periplasmic compo 93.2 8.4 0.00018 42.0 19.4 178 337-523 219-406 (425)
114 smart00181 EGF Epidermal growt 92.9 0.098 2.1E-06 34.7 2.6 25 556-581 6-31 (35)
115 PF06247 Plasmod_Pvs28: Plasmo 92.8 0.021 4.6E-07 52.9 -1.1 81 566-649 16-103 (197)
116 cd00112 LDLa Low Density Lipop 92.7 0.054 1.2E-06 36.3 1.0 18 634-651 1-18 (35)
117 PF05787 DUF839: Bacterial pro 92.6 5.9 0.00013 44.3 17.5 62 83-149 349-453 (524)
118 PF06433 Me-amine-dh_H: Methyl 92.6 15 0.00032 38.4 23.3 195 322-527 103-329 (342)
119 smart00181 EGF Epidermal growt 92.6 0.1 2.2E-06 34.6 2.3 29 239-268 1-30 (35)
120 PF01731 Arylesterase: Arylest 92.6 0.38 8.2E-06 39.5 6.1 42 434-476 42-83 (86)
121 smart00192 LDLa Low-density li 91.8 0.1 2.3E-06 34.3 1.6 19 633-651 1-19 (33)
122 PF01731 Arylesterase: Arylest 91.5 0.59 1.3E-05 38.4 6.0 40 346-385 44-83 (86)
123 PF13449 Phytase-like: Esteras 91.1 13 0.00028 39.0 17.4 106 313-421 19-166 (326)
124 KOG0315|consensus 91.1 17 0.00036 35.9 17.8 173 3-194 12-187 (311)
125 PF00008 EGF: EGF-like domain 91.1 0.16 3.4E-06 33.2 1.8 29 547-580 2-30 (32)
126 KOG0273|consensus 90.4 7.9 0.00017 41.3 14.4 125 5-148 251-376 (524)
127 TIGR02276 beta_rpt_yvtn 40-res 90.3 1.2 2.7E-05 30.5 6.1 41 366-410 2-42 (42)
128 PF05787 DUF839: Bacterial pro 89.4 3.2 6.8E-05 46.5 11.5 69 36-109 348-457 (524)
129 KOG0266|consensus 89.0 19 0.00041 39.7 17.3 130 3-150 217-353 (456)
130 cd00053 EGF Epidermal growth f 88.7 0.38 8.3E-06 31.5 2.4 24 245-268 6-31 (36)
131 PF12947 EGF_3: EGF domain; I 88.6 0.24 5.2E-06 33.3 1.3 29 551-580 1-31 (36)
132 smart00179 EGF_CA Calcium-bind 88.6 0.46 9.9E-06 32.1 2.7 22 557-579 10-33 (39)
133 PF13360 PQQ_2: PQQ-like domai 88.1 27 0.00059 34.0 23.3 61 458-522 173-234 (238)
134 KOG0285|consensus 87.8 37 0.00081 35.2 17.3 101 311-418 149-252 (460)
135 TIGR02276 beta_rpt_yvtn 40-res 87.8 2.4 5.3E-05 29.0 6.1 40 457-497 2-41 (42)
136 COG0823 TolB Periplasmic compo 87.7 15 0.00033 40.0 15.0 144 9-167 260-407 (425)
137 COG5276 Uncharacterized conser 87.4 36 0.00078 34.6 23.9 181 319-511 90-279 (370)
138 KOG2397|consensus 87.3 0.62 1.3E-05 49.8 3.9 51 569-626 61-115 (480)
139 KOG1219|consensus 87.1 0.68 1.5E-05 57.7 4.5 56 543-609 3903-3960(4289)
140 KOG1446|consensus 86.6 40 0.00086 34.3 23.5 140 4-161 29-169 (311)
141 cd00054 EGF_CA Calcium-binding 86.5 0.68 1.5E-05 30.8 2.6 30 237-267 2-33 (38)
142 KOG0310|consensus 86.2 46 0.001 35.9 16.9 153 38-201 111-264 (487)
143 KOG2106|consensus 85.7 45 0.00097 36.3 16.4 146 370-527 342-498 (626)
144 cd00053 EGF Epidermal growth f 85.7 0.8 1.7E-05 29.9 2.6 21 560-581 12-32 (36)
145 PTZ00421 coronin; Provisional 84.7 72 0.0016 35.6 28.2 145 3-162 90-245 (493)
146 PF09064 Tme5_EGF_like: Thromb 83.8 0.74 1.6E-05 30.1 1.5 28 245-273 6-33 (34)
147 PF00930 DPPIV_N: Dipeptidyl p 83.4 42 0.00091 35.5 15.9 93 403-498 236-336 (353)
148 PF00008 EGF: EGF-like domain 82.4 1.1 2.4E-05 29.2 2.0 18 250-267 11-29 (32)
149 KOG1225|consensus 82.1 2.4 5.2E-05 46.8 5.7 13 569-581 264-276 (525)
150 PF08662 eIF2A: Eukaryotic tra 81.7 50 0.0011 31.6 18.1 129 326-465 32-162 (194)
151 PF08662 eIF2A: Eukaryotic tra 80.7 44 0.00095 32.0 13.5 60 2-70 74-133 (194)
152 KOG0279|consensus 80.7 67 0.0015 32.4 17.3 174 2-186 29-204 (315)
153 KOG4649|consensus 80.6 65 0.0014 32.2 14.3 58 359-433 34-91 (354)
154 PF02897 Peptidase_S9_N: Proly 80.3 90 0.002 33.6 22.9 193 324-521 77-312 (414)
155 KOG4289|consensus 79.9 2.3 5E-05 51.0 4.8 12 569-580 1738-1749(2531)
156 PF00930 DPPIV_N: Dipeptidyl p 79.7 27 0.00059 36.9 12.8 104 37-146 234-343 (353)
157 KOG4289|consensus 79.6 1.9 4.1E-05 51.7 4.0 70 546-631 1242-1317(2531)
158 PRK13616 lipoprotein LpqB; Pro 79.4 1.2E+02 0.0026 34.6 19.1 186 315-516 351-565 (591)
159 TIGR03300 assembly_YfgL outer 79.2 92 0.002 33.1 22.6 106 413-527 241-347 (377)
160 KOG2397|consensus 78.8 1.4 3.1E-05 47.1 2.6 45 596-644 43-88 (480)
161 PF13360 PQQ_2: PQQ-like domai 78.8 67 0.0014 31.2 17.2 105 413-527 36-149 (238)
162 KOG0293|consensus 78.6 96 0.0021 33.0 16.0 145 38-195 270-415 (519)
163 PF02897 Peptidase_S9_N: Proly 77.8 1.1E+02 0.0023 33.1 25.7 196 318-518 128-357 (414)
164 COG3823 Glutamine cyclotransfe 76.6 66 0.0014 31.1 12.5 69 94-163 180-260 (262)
165 KOG1446|consensus 76.3 95 0.0021 31.7 23.0 187 333-527 77-271 (311)
166 PTZ00420 coronin; Provisional 76.1 1.5E+02 0.0032 33.7 27.3 103 3-121 89-200 (568)
167 KOG4227|consensus 75.5 61 0.0013 34.0 12.9 128 303-433 96-226 (609)
168 COG3823 Glutamine cyclotransfe 74.7 85 0.0019 30.4 16.0 65 412-478 184-260 (262)
169 KOG0641|consensus 74.6 59 0.0013 31.5 11.8 106 4-120 197-305 (350)
170 TIGR03075 PQQ_enz_alc_DH PQQ-d 73.3 1.7E+02 0.0036 33.0 20.3 95 318-423 238-334 (527)
171 KOG4328|consensus 72.7 1.2E+02 0.0027 32.6 14.6 156 316-475 237-397 (498)
172 KOG1274|consensus 71.9 94 0.002 36.4 14.5 143 39-196 15-160 (933)
173 PRK11138 outer membrane biogen 71.3 1.5E+02 0.0033 31.7 22.2 104 413-525 256-360 (394)
174 KOG0289|consensus 71.0 1.5E+02 0.0033 31.7 17.6 143 38-189 304-447 (506)
175 smart00284 OLF Olfactomedin-li 68.8 1.3E+02 0.0029 30.2 14.5 139 49-193 83-242 (255)
176 COG4247 Phy 3-phytase (myo-ino 67.5 1.4E+02 0.003 29.9 18.3 174 353-541 150-334 (364)
177 KOG3514|consensus 66.6 54 0.0012 39.0 11.2 79 75-163 475-560 (1591)
178 KOG3914|consensus 65.1 1.9E+02 0.0041 30.6 14.3 165 359-527 66-232 (390)
179 PF02191 OLF: Olfactomedin-lik 64.4 1.6E+02 0.0035 29.5 17.3 147 325-474 78-246 (250)
180 PF05694 SBP56: 56kDa selenium 63.9 26 0.00057 37.7 7.8 62 403-466 313-393 (461)
181 smart00284 OLF Olfactomedin-li 63.4 1.7E+02 0.0037 29.4 16.7 149 318-472 79-249 (255)
182 PTZ00421 coronin; Provisional 62.1 2.6E+02 0.0057 31.2 31.7 103 313-420 75-186 (493)
183 PF01826 TIL: Trypsin Inhibito 61.6 7.8 0.00017 28.6 2.6 17 572-588 35-51 (55)
184 PLN00181 protein SPA1-RELATED; 60.8 3.5E+02 0.0076 32.2 26.6 157 315-479 485-650 (793)
185 COG3211 PhoX Predicted phospha 60.5 75 0.0016 35.4 10.6 69 36-108 415-520 (616)
186 PRK11138 outer membrane biogen 58.6 2.6E+02 0.0056 29.9 19.6 61 458-522 256-316 (394)
187 PF12661 hEGF: Human growth fa 58.1 6.2 0.00013 20.1 1.0 9 259-267 1-9 (13)
188 KOG4328|consensus 56.6 2.9E+02 0.0063 29.9 14.6 146 314-464 187-340 (498)
189 KOG0272|consensus 56.0 2.6E+02 0.0057 29.9 13.2 111 14-140 328-440 (459)
190 PF06247 Plasmod_Pvs28: Plasmo 55.2 8.5 0.00018 36.1 2.1 44 561-609 57-104 (197)
191 COG1520 FOG: WD40-like repeat 55.0 2.8E+02 0.0061 29.3 15.1 105 413-523 111-222 (370)
192 KOG0303|consensus 53.5 3.1E+02 0.0066 29.3 15.1 153 38-203 132-292 (472)
193 COG3211 PhoX Predicted phospha 53.3 1.1E+02 0.0023 34.2 10.3 112 311-422 414-574 (616)
194 KOG0272|consensus 52.8 3.2E+02 0.007 29.3 16.9 103 392-498 336-439 (459)
195 PTZ00420 coronin; Provisional 52.7 4E+02 0.0086 30.3 30.5 103 313-420 74-185 (568)
196 PF06739 SBBP: Beta-propeller 52.4 12 0.00026 25.4 2.1 19 131-150 13-31 (38)
197 KOG0279|consensus 51.0 2.8E+02 0.0061 28.1 22.7 219 299-527 51-271 (315)
198 PF02191 OLF: Olfactomedin-lik 50.6 2.7E+02 0.0059 27.9 15.3 139 49-193 78-237 (250)
199 KOG0308|consensus 50.2 2.5E+02 0.0053 31.9 12.4 142 2-162 184-327 (735)
200 KOG2048|consensus 49.5 4.5E+02 0.0097 30.0 38.5 156 314-476 383-547 (691)
201 TIGR03075 PQQ_enz_alc_DH PQQ-d 48.9 1.1E+02 0.0023 34.5 10.1 97 92-198 238-336 (527)
202 KOG0285|consensus 48.9 3.4E+02 0.0075 28.5 16.9 162 353-527 149-316 (460)
203 KOG0294|consensus 48.4 3.3E+02 0.0071 28.1 17.5 180 1-202 2-196 (362)
204 KOG1225|consensus 47.9 30 0.00066 38.4 5.3 71 569-648 233-326 (525)
205 TIGR03300 assembly_YfgL outer 47.3 3.7E+02 0.008 28.4 19.1 61 458-522 241-301 (377)
206 KOG2110|consensus 47.3 3.7E+02 0.0079 28.4 15.4 147 2-163 99-249 (391)
207 PF14583 Pectate_lyase22: Olig 47.0 3.9E+02 0.0085 28.6 16.4 76 45-127 43-119 (386)
208 PF14781 BBS2_N: Ciliary BBSom 45.2 1.6E+02 0.0036 26.4 8.4 57 43-106 76-133 (136)
209 PF05694 SBP56: 56kDa selenium 43.6 94 0.002 33.7 7.9 61 132-193 313-392 (461)
210 PF09910 DUF2139: Uncharacteri 43.6 3.8E+02 0.0083 27.5 12.7 61 407-469 40-100 (339)
211 KOG0319|consensus 43.6 5.7E+02 0.012 29.5 14.6 147 361-516 25-177 (775)
212 PF14339 DUF4394: Domain of un 43.5 1.4E+02 0.003 29.6 8.5 60 314-374 27-92 (236)
213 PLN00181 protein SPA1-RELATED; 43.4 6.3E+02 0.014 30.0 27.4 114 315-434 534-648 (793)
214 PF14339 DUF4394: Domain of un 43.4 1.4E+02 0.0031 29.5 8.6 37 38-77 27-63 (236)
215 TIGR02171 Fb_sc_TIGR02171 Fibr 42.0 5.3E+02 0.011 30.9 14.1 61 4-72 323-387 (912)
216 KOG0315|consensus 41.8 3.7E+02 0.0081 26.9 24.3 130 280-417 54-183 (311)
217 KOG0308|consensus 41.8 5.8E+02 0.013 29.1 16.6 174 4-189 133-312 (735)
218 PF08309 LVIVD: LVIVD repeat; 41.4 1.1E+02 0.0024 21.3 5.5 30 492-522 4-33 (42)
219 KOG0286|consensus 38.3 4.6E+02 0.0099 26.9 19.5 171 4-190 70-289 (343)
220 KOG0263|consensus 38.3 6.9E+02 0.015 28.9 18.3 166 2-186 464-631 (707)
221 KOG0269|consensus 37.4 6.1E+02 0.013 29.5 13.2 124 311-440 131-256 (839)
222 KOG0318|consensus 37.4 6.2E+02 0.013 28.1 38.1 186 314-517 191-391 (603)
223 PRK13616 lipoprotein LpqB; Pro 37.3 6.8E+02 0.015 28.6 19.6 144 37-196 399-560 (591)
224 KOG0273|consensus 35.8 6.2E+02 0.013 27.7 22.6 113 316-435 279-392 (524)
225 PF00954 S_locus_glycop: S-loc 35.4 2E+02 0.0043 24.5 7.6 12 569-580 97-108 (110)
226 KOG0646|consensus 35.4 6.2E+02 0.013 27.5 12.8 27 175-202 218-244 (476)
227 KOG1036|consensus 35.2 5.2E+02 0.011 26.6 15.9 157 9-195 73-253 (323)
228 KOG2096|consensus 34.8 5.4E+02 0.012 26.7 13.0 164 8-189 206-388 (420)
229 KOG0276|consensus 33.7 7.6E+02 0.017 28.1 30.2 99 38-145 14-112 (794)
230 KOG0649|consensus 33.5 5E+02 0.011 25.9 14.0 125 34-170 111-243 (325)
231 KOG0268|consensus 33.5 6E+02 0.013 26.9 13.7 154 354-516 186-345 (433)
232 PTZ00214 high cysteine membran 33.0 57 0.0012 38.6 4.8 75 569-650 681-770 (800)
233 PF14781 BBS2_N: Ciliary BBSom 32.3 80 0.0017 28.3 4.5 55 319-375 76-134 (136)
234 TIGR03074 PQQ_membr_DH membran 32.2 9.2E+02 0.02 28.6 17.9 99 405-513 378-482 (764)
235 KOG3509|consensus 31.4 42 0.0009 40.0 3.3 59 590-648 29-90 (964)
236 KOG0649|consensus 30.7 4.7E+02 0.01 26.1 9.7 82 445-527 113-195 (325)
237 KOG1272|consensus 30.2 74 0.0016 34.3 4.6 130 2-163 184-324 (545)
238 KOG3658|consensus 30.1 77 0.0017 35.9 4.9 16 633-649 565-580 (764)
239 KOG4378|consensus 29.6 8E+02 0.017 27.1 16.6 156 356-518 122-280 (673)
240 KOG0319|consensus 29.5 9.4E+02 0.02 27.9 37.7 460 35-527 60-544 (775)
241 KOG4260|consensus 29.3 25 0.00054 34.9 1.0 34 235-268 234-269 (350)
242 KOG0281|consensus 28.4 5.1E+02 0.011 27.1 9.9 191 315-526 199-396 (499)
243 PF08954 DUF1900: Domain of un 28.4 1.6E+02 0.0035 26.5 5.9 54 361-414 16-69 (136)
244 cd00216 PQQ_DH Dehydrogenases 28.3 8.5E+02 0.018 27.0 21.4 33 494-527 401-433 (488)
245 KOG1274|consensus 28.2 1.1E+03 0.023 28.2 21.2 141 7-162 22-168 (933)
246 KOG0275|consensus 28.1 3.9E+02 0.0085 27.6 9.0 91 3-107 408-498 (508)
247 KOG4441|consensus 27.1 9.7E+02 0.021 27.3 16.2 158 360-525 327-506 (571)
248 KOG1272|consensus 27.0 1.9E+02 0.0042 31.3 6.9 144 2-164 222-366 (545)
249 KOG2106|consensus 26.9 9E+02 0.019 26.8 18.5 102 38-146 369-472 (626)
250 KOG0640|consensus 26.8 3.1E+02 0.0068 28.2 8.0 101 3-118 186-291 (430)
251 KOG1407|consensus 26.6 6.8E+02 0.015 25.3 15.6 170 4-189 35-204 (313)
252 KOG0646|consensus 26.2 7.7E+02 0.017 26.9 11.1 52 10-72 197-249 (476)
253 KOG0284|consensus 26.1 1.8E+02 0.0039 31.0 6.4 102 38-148 139-240 (464)
254 KOG0289|consensus 25.9 8.7E+02 0.019 26.3 20.4 193 315-515 305-502 (506)
255 KOG0268|consensus 25.9 3.4E+02 0.0074 28.6 8.3 143 313-462 187-331 (433)
256 KOG0284|consensus 25.6 6.2E+02 0.013 27.2 10.1 102 314-420 265-368 (464)
257 KOG0303|consensus 25.5 8.6E+02 0.019 26.1 16.7 159 313-480 131-297 (472)
258 KOG0270|consensus 24.8 9.2E+02 0.02 26.2 16.5 162 311-476 284-448 (463)
259 KOG0772|consensus 24.8 9.1E+02 0.02 26.8 11.4 63 9-80 187-254 (641)
260 PF04885 Stig1: Stigma-specifi 24.4 92 0.002 28.0 3.5 50 584-647 85-134 (136)
261 KOG4611|consensus 23.9 78 0.0017 32.7 3.4 20 627-648 98-117 (747)
262 KOG0276|consensus 23.7 1.1E+03 0.024 26.8 30.9 104 4-122 70-175 (794)
263 KOG0196|consensus 23.6 75 0.0016 36.9 3.4 56 570-642 259-320 (996)
264 PF13570 PQQ_3: PQQ-like domai 23.4 1.4E+02 0.003 20.0 3.7 24 494-518 16-39 (40)
265 PHA02887 EGF-like protein; Pro 22.4 60 0.0013 28.0 1.8 38 235-275 81-122 (126)
266 PRK10115 protease 2; Provision 22.3 1.3E+03 0.028 27.0 21.1 190 1-204 237-435 (686)
267 KOG0277|consensus 22.0 8.2E+02 0.018 24.6 14.4 82 302-385 95-177 (311)
268 KOG4649|consensus 21.6 8.5E+02 0.018 24.7 17.8 53 135-189 242-294 (354)
269 KOG0266|consensus 21.6 1.1E+03 0.023 25.8 25.2 155 316-480 162-321 (456)
270 PF15492 Nbas_N: Neuroblastoma 21.6 8.7E+02 0.019 24.7 13.3 27 3-29 57-83 (282)
271 COG1770 PtrB Protease II [Amin 21.6 1.3E+03 0.028 26.7 24.9 163 358-524 176-354 (682)
272 KOG3658|consensus 21.3 80 0.0017 35.8 3.0 17 591-608 564-580 (764)
273 COG4222 Uncharacterized protei 21.2 1E+03 0.022 25.7 11.1 64 402-465 138-218 (391)
274 KOG0772|consensus 20.8 6.5E+02 0.014 27.9 9.4 67 3-72 229-301 (641)
275 KOG4378|consensus 20.6 1.2E+03 0.025 25.9 17.1 156 314-480 122-283 (673)
276 KOG0282|consensus 20.4 1.2E+03 0.025 25.7 11.4 168 10-195 279-453 (503)
No 1
>KOG1214|consensus
Probab=100.00 E-value=1e-51 Score=436.04 Aligned_cols=292 Identities=26% Similarity=0.501 Sum_probs=256.6
Q ss_pred eEEEEeeecceeEEecCCCCCCCC--CceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-ecc
Q psy5768 279 AFIMYSRVNRIDSIHMTDKSDLNS--PFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQ 355 (652)
Q Consensus 279 ~~Ll~s~~~~i~~i~l~~~~~~~~--p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~ 355 (652)
.||||+++..|.+++| .+..+.. ....|..+. ..+++||||.++++|||+|+...+|.|..|+|+..++++ .++
T Consensus 991 t~LL~aqg~~I~~lpl-ng~~~~K~~ak~~l~~p~--~IiVGidfDC~e~mvyWtDv~g~SI~rasL~G~Ep~ti~n~~L 1067 (1289)
T KOG1214|consen 991 TFLLYAQGQQIGYLPL-NGTRLQKDAAKTLLSLPG--SIIVGIDFDCRERMVYWTDVAGRSISRASLEGAEPETIVNSGL 1067 (1289)
T ss_pred ceEEEeccceEEEeec-CcchhchhhhhceEeccc--ceeeeeecccccceEEEeecCCCccccccccCCCCceeecccC
Confidence 5999999999999999 5554432 222344332 458899999999999999999999999999999999999 899
Q ss_pred CceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecC
Q psy5768 356 GSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSG 435 (652)
Q Consensus 356 ~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG 435 (652)
.+|+||||||++||+||||+...+|+++.|+|+. ++. +..+++.+||+|++||.+|.||||||++.+|+|++..|||
T Consensus 1068 ~SPEGiAVDh~~Rn~ywtDS~lD~IevA~LdG~~--rkv-Lf~tdLVNPR~iv~D~~rgnLYwtDWnRenPkIets~mDG 1144 (1289)
T KOG1214|consen 1068 ISPEGIAVDHIRRNMYWTDSVLDKIEVALLDGSE--RKV-LFYTDLVNPRAIVVDPIRGNLYWTDWNRENPKIETSSMDG 1144 (1289)
T ss_pred CCccceeeeeccceeeeeccccchhheeecCCce--eeE-EEeecccCcceEEeecccCceeeccccccCCcceeeccCC
Confidence 9999999999999999999999999999999975 444 4458999999999999999999999999999999999999
Q ss_pred CCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCCCeEEEE
Q psy5768 436 FGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVIHAVLRA 515 (652)
Q Consensus 436 ~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~~~I~~~ 515 (652)
+|+++|+.+++..||||++|+..+.|.|+|+++++++.+..+|..|+++.+ .++.||+|+-+++.+|||||+.++|..+
T Consensus 1145 ~NrRilin~DigLPNGLtfdpfs~~LCWvDAGt~rleC~~p~g~gRR~i~~-~LqYPF~itsy~~~fY~TDWk~n~vvsv 1223 (1289)
T KOG1214|consen 1145 ENRRILINTDIGLPNGLTFDPFSKLLCWVDAGTKRLECTLPDGTGRRVIQN-NLQYPFSITSYADHFYHTDWKRNGVVSV 1223 (1289)
T ss_pred ccceEEeecccCCCCCceeCcccceeeEEecCCcceeEecCCCCcchhhhh-cccCceeeeeccccceeeccccCceEEe
Confidence 999999999999999999999999999999999999999999999999988 4699999999999999999999999999
Q ss_pred EccCCceEEEE-ecccCCcceeEEEeccCCCCCCCCCCCCCCCCccccccCCCCceeeeccCcee
Q psy5768 516 NKYTGEEVYTL-RKNIRRPMGIVAISDNLDACAKTPCRHLNGNCDDICKLDETGQVVCSCFTGKV 579 (652)
Q Consensus 516 ~k~~g~~~~~~-~~~~~~p~~i~~~~~~~~~~~~~~C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~ 579 (652)
++..++..... ...-...+||.++.+.- +...+||+.+||||.||||+.-.+ ..|.||+.-+
T Consensus 1224 ~~~~~~~td~~~p~~~s~lyGItav~~~C-p~gstpCSedNGGCqHLCLpgqng-avcecpdnvk 1286 (1289)
T KOG1214|consen 1224 NKHSGQFTDEYLPEQRSHLYGITAVYPYC-PTGSTPCSEDNGGCQHLCLPGQNG-AVCECPDNVK 1286 (1289)
T ss_pred eccccccccccccccccceEEEEeccccC-CCCCCcccccCCcceeecccCcCC-ccccCCccce
Confidence 99887655433 23335689999885432 335699999999999999987765 8999998754
No 2
>KOG1215|consensus
Probab=100.00 E-value=2.1e-47 Score=450.02 Aligned_cols=480 Identities=30% Similarity=0.549 Sum_probs=393.2
Q ss_pred cEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCC-CCeEEEEeCCCCCcEEEE
Q psy5768 92 HIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGK-IPLIARAGLDGKKQTILA 170 (652)
Q Consensus 92 ~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~-~~~I~~~~ldg~~~~~~~ 170 (652)
++..|...+.+||++.. +|+++..+|+.+.+........|.+++..|..|++||++++. .+.|+.+.+++..+..+.
T Consensus 202 ~~~~d~~~~~~~~~~~~--~~~~~~c~g~~~~i~~~~~~Dg~~dc~~~~de~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ 279 (877)
T KOG1215|consen 202 ADDYDESEGRIYWTDDS--RIEVTRCDGSSRCILISEVCDGPRDCVDGPDEGVMNCSDATCEAPEIECADGDCSDRQKLC 279 (877)
T ss_pred ccccccccCcccccCCc--ceeEEEecCCCcEEeehhccCCCcccccCCcCceeEeeccccCCcceeecCCCCccceEEe
Confidence 44556777889998754 799999999877777777789999999999999999999974 468999999999999988
Q ss_pred eecccCceeEEEeccCCEEEE-EeCCCCcEEEEEecCCC-----CceEEE-----eec--------CCCCCcceeeeeee
Q psy5768 171 QEIIMPIKDITLDLKFFSAFY-RNLSKGNIHIISLSNLS-----DVSTIS-----MKP--------YGDSYLKDIKIYSK 231 (652)
Q Consensus 171 ~~~~~~p~gl~lD~~~~~ly~-~d~~g~~~~~i~~~~~~-----~~~~~~-----~~~--------~~~~~~~~i~v~~~ 231 (652)
...+.+|+|++.|.-...+|| ..+++++.+ |...... ...... ... .......+.++++.
T Consensus 280 ~g~~d~pdg~de~~~~~~~~~~~~~d~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~ 358 (877)
T KOG1215|consen 280 DGDLDCPDGLDEDYCKKKLYWSMNVDGSGRR-ILLSKLCHGYWTDGLNECAERVLKCSHKCPDVSVGPRCDCMGAKVLPL 358 (877)
T ss_pred cCccCCCCcccccccccceeeeeecccCCce-eeecccCccccccccccchhhcccccCCCCccccCCcccCCccceecc
Confidence 888889999999999999999 567777766 5553311 110000 000 00011233444544
Q ss_pred ccCCCCCCCCCCCCCCcccce-ecCCCceEEEeCCc-cccCCCccc---ccceEEEEeeecceeEEecCCCCCCCCCcee
Q psy5768 232 DAQTGTNPCGVNNGGCAELCL-YNGVSAVCACAHGV-VAQDGKSCS---EYDAFIMYSRVNRIDSIHMTDKSDLNSPFES 306 (652)
Q Consensus 232 ~~q~~~n~C~~~ng~Cs~lC~-~~~~~~~C~C~~G~-l~~dg~~C~---~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~ 306 (652)
......++|...||+|+|+|+ ..+.++.|.|+.|| +..++ |. ..++||+++.+..|+++.+ +..+...|
T Consensus 359 ~~~~~~~~~~~~~g~Csq~C~~~~p~~~~c~c~~g~~~~~~~--c~~~~~~~~~l~~s~~~~ir~~~~-~~~~~~~p--- 432 (877)
T KOG1215|consen 359 GARTDSNPCESDNGGCSQLCVPNSPGTFKCACSPGYELRLDK--CEASDQPEAFLLFSNRHDIRRISL-DCSDVSRP--- 432 (877)
T ss_pred cccccCCcccccCCccceeccCCCCCceeEecCCCcEeccCC--ceecCCCCcEEEEecCccceeccc-CCCcceEE---
Confidence 444467999999999999999 45668999999998 44455 54 5889999999999999999 44333222
Q ss_pred eeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEc
Q psy5768 307 IRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDL 385 (652)
Q Consensus 307 ~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~ 385 (652)
+ ..+.++.++++|..++++||+|.....|.+...++.....+. .++-.++|||+||+++++||+|.....|.+..+
T Consensus 433 ~---~~~~~~~~~d~d~~~~~i~~~d~~~~~i~~~~~~~~~~~~~~~~g~~~~~~lavD~~~~~~y~tDe~~~~i~v~~~ 509 (877)
T KOG1215|consen 433 L---EGIKNAVALDFDVLNNRIYWADLSDEKICRASQDGSSECELCGDGLCIPEGLAVDWIGDNIYWTDEGNCLIEVADL 509 (877)
T ss_pred c---cCCccceEEEEEecCCEEEEEeccCCeEeeeccCCCccceEeccCccccCcEEEEeccCCceecccCCceeEEEEc
Confidence 2 223678999999999999999999999999999987776644 788899999999999999999999999999998
Q ss_pred CCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEe
Q psy5768 386 DSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGD 465 (652)
Q Consensus 386 ~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D 465 (652)
++.. +++++. ..+..|++++++|.+|+|||+||+.. ++|+|+.|||..+.+++..++.||+||++|...+++||+|
T Consensus 510 ~g~~--~~vl~~-~~l~~~r~~~v~p~~g~~~wtd~~~~-~~i~ra~~dg~~~~~l~~~~~~~p~glt~d~~~~~~yw~d 585 (877)
T KOG1215|consen 510 DGSS--RKVLVS-KDLDLPRSIAVDPEKGLMFWTDWGQP-PRIERASLDGSERAVLVTNGILWPNGLTIDYETDRLYWAD 585 (877)
T ss_pred cCCc--eeEEEe-cCCCCccceeeccccCeeEEecCCCC-chhhhhcCCCCCceEEEeCCccCCCcceEEeecceeEEEc
Confidence 8764 455554 55699999999999999999999974 5999999999999999999999999999999999999999
Q ss_pred CCCC-eEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCCCeEEEEEccCCceEEEEecccCCcceeEEE-eccC
Q psy5768 466 ARLD-KIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVIHAVLRANKYTGEEVYTLRKNIRRPMGIVAI-SDNL 543 (652)
Q Consensus 466 ~~~~-~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~~~I~~~~k~~g~~~~~~~~~~~~p~~i~~~-~~~~ 543 (652)
.... .|++++++|..|+.+....+.||++++++++++||+||....+.+.++..+.....+......|..++++ +...
T Consensus 586 ~~~~~~i~~~~~~g~~r~~~~~~~~~~p~~~~~~~~~iyw~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 665 (877)
T KOG1215|consen 586 AKLDYTIESANMDGQNRRVVDSEDLPHPFGLSVFEDYIYWTDWSNRAISRAEKHKGSDSRTSRSNLAQPLDIILVHHSSS 665 (877)
T ss_pred ccCCcceeeeecCCCceEEeccccCCCceEEEEecceeEEeeccccceEeeecccCCcceeeecccCcccceEEEecccc
Confidence 9999 7999999999998444445799999999999999999999999999998886622444456788888888 5666
Q ss_pred CCCCCCCCCCCCCCCccccccCCCCceeeeccCceeeccC-CcccC
Q psy5768 544 DACAKTPCRHLNGNCDDICKLDETGQVVCSCFTGKVLMED-NRSCT 588 (652)
Q Consensus 544 ~~~~~~~C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~l~~d-~~C~~ 588 (652)
++...|+|..+|++|+|+|++.|.+. +|+||.|+.|..+ ++|.+
T Consensus 666 ~~~~~n~C~~~n~~c~~KOG~~p~~~-~c~c~~~~~l~~~~~~C~~ 710 (877)
T KOG1215|consen 666 RPTGVNPCESSNGGCSQLCLPRPQGS-TCACPEGYRLSPDGKSCSS 710 (877)
T ss_pred CCCCCCcccccCCCCCeeeecCCCCC-eeeCCCCCeecCCCCeecC
Confidence 77788999998999999999999976 9999999999888 88986
No 3
>KOG1214|consensus
Probab=100.00 E-value=5e-37 Score=325.05 Aligned_cols=249 Identities=29% Similarity=0.429 Sum_probs=202.3
Q ss_pred eEEEecCCCCeEEEEecCCCeeEE-----EecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEE
Q psy5768 2 FIAVSSPTQSKIVVCNLEGEYQTT-----ILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRET 76 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~g~~~~~-----~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~ 76 (652)
|++++. +++|+...++|..... |+.. .-+-|+|||||-++.+|||+| +...+|.|+.|+|++.++
T Consensus 992 ~LL~aq--g~~I~~lplng~~~~K~~ak~~l~~------p~~IiVGidfDC~e~mvyWtD--v~g~SI~rasL~G~Ep~t 1061 (1289)
T KOG1214|consen 992 FLLYAQ--GQQIGYLPLNGTRLQKDAAKTLLSL------PGSIIVGIDFDCRERMVYWTD--VAGRSISRASLEGAEPET 1061 (1289)
T ss_pred eEEEec--cceEEEeecCcchhchhhhhceEec------ccceeeeeecccccceEEEee--cCCCccccccccCCCCce
Confidence 677774 7889999988875543 3322 236789999999999999999 888999999999999999
Q ss_pred EEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecC-CCCe
Q psy5768 77 VVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESG-KIPL 155 (652)
Q Consensus 77 v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~-~~~~ 155 (652)
++..+ +.+|+ ||||||+.+|+||+|+..++|+|+.|||+.+++|+.++|-+||+|++|+..|.||||||. .+|+
T Consensus 1062 i~n~~-L~SPE----GiAVDh~~Rn~ywtDS~lD~IevA~LdG~~rkvLf~tdLVNPR~iv~D~~rgnLYwtDWnRenPk 1136 (1289)
T KOG1214|consen 1062 IVNSG-LISPE----GIAVDHIRRNMYWTDSVLDKIEVALLDGSERKVLFYTDLVNPRAIVVDPIRGNLYWTDWNRENPK 1136 (1289)
T ss_pred eeccc-CCCcc----ceeeeeccceeeeeccccchhheeecCCceeeEEEeecccCcceEEeecccCceeeccccccCCc
Confidence 99999 99999 999999999999999999999999999999999999999999999999999999999998 5899
Q ss_pred EEEEeCCCCCcEEEEeecccCceeEEEeccCCEEEEEeC----------CCCcEEEEEec-----------------CCC
Q psy5768 156 IARAGLDGKKQTILAQEIIMPIKDITLDLKFFSAFYRNL----------SKGNIHIISLS-----------------NLS 208 (652)
Q Consensus 156 I~~~~ldg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~----------~g~~~~~i~~~-----------------~~~ 208 (652)
|++..|||+++++|+++.++.||||++|+....|-|+|. +|..+|+++.. +++
T Consensus 1137 Iets~mDG~NrRilin~DigLPNGLtfdpfs~~LCWvDAGt~rleC~~p~g~gRR~i~~~LqYPF~itsy~~~fY~TDWk 1216 (1289)
T KOG1214|consen 1137 IETSSMDGENRRILINTDIGLPNGLTFDPFSKLLCWVDAGTKRLECTLPDGTGRRVIQNNLQYPFSITSYADHFYHTDWK 1216 (1289)
T ss_pred ceeeccCCccceEEeecccCCCCCceeCcccceeeEEecCCcceeEecCCCCcchhhhhcccCceeeeeccccceeeccc
Confidence 999999999999999999999999999999999999774 44445544431 111
Q ss_pred CceEEEeec-----------CCCCCcceeeeeeeccCCCCCCCCCCCCCCcccceecCCCceEEEeCC
Q psy5768 209 DVSTISMKP-----------YGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHG 265 (652)
Q Consensus 209 ~~~~~~~~~-----------~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G 265 (652)
.-.++++.. .....+++|...-+.=-.+.+||.++||||+|||+..-++..|.|+..
T Consensus 1217 ~n~vvsv~~~~~~~td~~~p~~~s~lyGItav~~~Cp~gstpCSedNGGCqHLCLpgqngavcecpdn 1284 (1289)
T KOG1214|consen 1217 RNGVVSVNKHSGQFTDEYLPEQRSHLYGITAVYPYCPTGSTPCSEDNGGCQHLCLPGQNGAVCECPDN 1284 (1289)
T ss_pred cCceEEeeccccccccccccccccceEEEEeccccCCCCCCcccccCCcceeecccCcCCccccCCcc
Confidence 111222211 111123333322222226789999999999999997666899999874
No 4
>KOG1215|consensus
Probab=100.00 E-value=1.2e-34 Score=341.91 Aligned_cols=416 Identities=28% Similarity=0.493 Sum_probs=329.7
Q ss_pred CCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccC
Q psy5768 9 TQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTA 88 (652)
Q Consensus 9 ~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~ 88 (652)
....|+.+.++.......+. .+..+.++++|+.++++||+| .....|.++.++|.....+...+ ...++
T Consensus 415 ~~~~ir~~~~~~~~~~~p~~-------~~~~~~~~d~d~~~~~i~~~d--~~~~~i~~~~~~~~~~~~~~~~g-~~~~~- 483 (877)
T KOG1215|consen 415 NRHDIRRISLDCSDVSRPLE-------GIKNAVALDFDVLNNRIYWAD--LSDEKICRASQDGSSECELCGDG-LCIPE- 483 (877)
T ss_pred cCccceecccCCCcceEEcc-------CCccceEEEEEecCCEEEEEe--ccCCeEeeeccCCCccceEeccC-ccccC-
Confidence 35566666666553333332 337899999999999999999 99999999999998777788888 88899
Q ss_pred CCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEE
Q psy5768 89 CNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTI 168 (652)
Q Consensus 89 ~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~ 168 (652)
+||+||+.+++||+|.....|++.+++|..+.+++...+..|+.+++||..|+|||+||+..++|+|+.|||+.+..
T Consensus 484 ---~lavD~~~~~~y~tDe~~~~i~v~~~~g~~~~vl~~~~l~~~r~~~v~p~~g~~~wtd~~~~~~i~ra~~dg~~~~~ 560 (877)
T KOG1215|consen 484 ---GLAVDWIGDNIYWTDEGNCLIEVADLDGSSRKVLVSKDLDLPRSIAVDPEKGLMFWTDWGQPPRIERASLDGSERAV 560 (877)
T ss_pred ---cEEEEeccCCceecccCCceeEEEEccCCceeEEEecCCCCccceeeccccCeeEEecCCCCchhhhhcCCCCCceE
Confidence 99999999999999999999999999999999999998899999999999999999999977799999999999999
Q ss_pred EEeecccCceeEEEeccCCEEEE-----------EeCCCCcEEEEEecCCCCceEEE-----------------------
Q psy5768 169 LAQEIIMPIKDITLDLKFFSAFY-----------RNLSKGNIHIISLSNLSDVSTIS----------------------- 214 (652)
Q Consensus 169 ~~~~~~~~p~gl~lD~~~~~ly~-----------~d~~g~~~~~i~~~~~~~~~~~~----------------------- 214 (652)
++..++.||+||++|..++++|| ++++|++++.+......++..++
T Consensus 561 l~~~~~~~p~glt~d~~~~~~yw~d~~~~~~i~~~~~~g~~r~~~~~~~~~~p~~~~~~~~~iyw~d~~~~~~~~~~~~~ 640 (877)
T KOG1215|consen 561 LVTNGILWPNGLTIDYETDRLYWADAKLDYTIESANMDGQNRRVVDSEDLPHPFGLSVFEDYIYWTDWSNRAISRAEKHK 640 (877)
T ss_pred EEeCCccCCCcceEEeecceeEEEcccCCcceeeeecCCCceEEeccccCCCceEEEEecceeEEeeccccceEeeeccc
Confidence 99999999999999999999999 35677777622222222222211
Q ss_pred ------eecCCCCCcceeeee-e-eccCCCCCCCCCCCCCCcccceecCCCceEEEeCCc-cccCCCcccccceEEEEee
Q psy5768 215 ------MKPYGDSYLKDIKIY-S-KDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGV-VAQDGKSCSEYDAFIMYSR 285 (652)
Q Consensus 215 ------~~~~~~~~~~~i~v~-~-~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~-l~~dg~~C~~~~~~Ll~s~ 285 (652)
+..... .+..+..+ + ..++...|+|...|++|+|+|+..|.+.+|+|+.|+ |..++++|..+..+++++.
T Consensus 641 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~n~C~~~n~~c~~KOG~~p~~~~c~c~~~~~l~~~~~~C~~~~~~~~~~~ 719 (877)
T KOG1215|consen 641 GSDSRTSRSNLA-QPLDIILVHHSSSRPTGVNPCESSNGGCSQLCLPRPQGSTCACPEGYRLSPDGKSCSSPEGYLLITS 719 (877)
T ss_pred CCcceeeecccC-cccceEEEeccccCCCCCCcccccCCCCCeeeecCCCCCeeeCCCCCeecCCCCeecCccccccccc
Confidence 111111 34444444 3 333488999999999999999998886699999996 7789999999999999999
Q ss_pred ecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEE
Q psy5768 286 VNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYE 364 (652)
Q Consensus 286 ~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvD 364 (652)
...+..+.+ +..... ...+.. . +..++..+|++...+...+...++.....++ +....++++|+|
T Consensus 720 ~~~~~~~~~-~~~~~~--~~~~~~--------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 785 (877)
T KOG1215|consen 720 RTGIPCISL-DSELSP--DQPLED--------G---DTIDRLEYWTDVRVGVAAVSSQNCAPGYDLVGEGEPPPEGSAVD 785 (877)
T ss_pred ccccceeec-CcccCC--CcccCC--------C---cccccceecccccceeeEEEecCCCCccccccccCCCCCCceee
Confidence 999998887 332211 111110 1 7788899999987777666666554333333 777799999999
Q ss_pred ccCCEEEEEeCCCCeEEEEEcCCCCCc---cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEE
Q psy5768 365 YVHNYLYWTCNNDATINKIDLDSPKAQ---RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESI 441 (652)
Q Consensus 365 w~~~~LYwtd~~~~~I~~~~~~~~~~~---~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l 441 (652)
+..+.|||+......|.+..+++.... ....+.......|+++.+.|..++++|++| ...+.|.++.+++.++..+
T Consensus 786 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 864 (877)
T KOG1215|consen 786 EAEDTLYWTCSATSFIEVSGLDGERKCRRRPEGVVDFDNPVPPRTTGVEPEKSLLFWTNW-EPGPKIPRSALDGSERLVL 864 (877)
T ss_pred hhhcceEEEeecccEEEEEEEeeecccccccccccccCCCCCCcceeeccccceeccCCc-cccceeeecccccccccce
Confidence 999999999999998888877663221 122333456788999999999999999999 4457999999999999999
Q ss_pred EEcCCCCCceEEE
Q psy5768 442 ITTDITMPNALAL 454 (652)
Q Consensus 442 ~~~~l~~P~glai 454 (652)
+...+..|+|+++
T Consensus 865 ~~~~~~~~~~~~~ 877 (877)
T KOG1215|consen 865 FKSLLSCPNALAL 877 (877)
T ss_pred eccCCCCccCCCC
Confidence 9888889998863
No 5
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.75 E-value=4.9e-16 Score=156.68 Aligned_cols=213 Identities=23% Similarity=0.322 Sum_probs=159.2
Q ss_pred EEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEE
Q psy5768 316 IIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVV 395 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~ 395 (652)
+.++.||..++.|||+|...++|++++.++...+.+. ...|.|++++...+.||+++.. .+.++++.... .+.+
T Consensus 2 ~Egp~~d~~~g~l~~~D~~~~~i~~~~~~~~~~~~~~--~~~~~G~~~~~~~g~l~v~~~~--~~~~~d~~~g~--~~~~ 75 (246)
T PF08450_consen 2 GEGPVWDPRDGRLYWVDIPGGRIYRVDPDTGEVEVID--LPGPNGMAFDRPDGRLYVADSG--GIAVVDPDTGK--VTVL 75 (246)
T ss_dssp EEEEEEETTTTEEEEEETTTTEEEEEETTTTEEEEEE--SSSEEEEEEECTTSEEEEEETT--CEEEEETTTTE--EEEE
T ss_pred CcceEEECCCCEEEEEEcCCCEEEEEECCCCeEEEEe--cCCCceEEEEccCCEEEEEEcC--ceEEEecCCCc--EEEE
Confidence 4688999999999999999999999999876544433 2239999999777999999964 44455654321 3444
Q ss_pred EEe---C-CCCCceEEEEeCCCCEEEEEecCCCC------CceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEe
Q psy5768 396 VRL---G-QHDKPRGIDIDSCDSRIYWTNWNSHL------PSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGD 465 (652)
Q Consensus 396 ~~~---~-~~~~P~~Iavdp~~g~Lywtd~~~~~------~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D 465 (652)
... . ....|.++++|| .|.||+|+.+... ++|+|...+|. . ..+...+..||||+++++++.||++|
T Consensus 76 ~~~~~~~~~~~~~ND~~vd~-~G~ly~t~~~~~~~~~~~~g~v~~~~~~~~-~-~~~~~~~~~pNGi~~s~dg~~lyv~d 152 (246)
T PF08450_consen 76 ADLPDGGVPFNRPNDVAVDP-DGNLYVTDSGGGGASGIDPGSVYRIDPDGK-V-TVVADGLGFPNGIAFSPDGKTLYVAD 152 (246)
T ss_dssp EEEETTCSCTEEEEEEEE-T-TS-EEEEEECCBCTTCGGSEEEEEEETTSE-E-EEEEEEESSEEEEEEETTSSEEEEEE
T ss_pred eeccCCCcccCCCceEEEcC-CCCEEEEecCCCccccccccceEEECCCCe-E-EEEecCcccccceEECCcchheeecc
Confidence 443 1 568899999999 5889999976432 46999998843 3 33345688999999999999999999
Q ss_pred CCCCeEEEEecCC--C---ceEEEecCC--CCceeEEEEe-CCEEEEEcCCCCeEEEEEccCCceEEEEecccCCcceeE
Q psy5768 466 ARLDKIERCDYDG--T---NRIVLSKIS--PLHPFDMAVY-GEFIFWTDWVIHAVLRANKYTGEEVYTLRKNIRRPMGIV 537 (652)
Q Consensus 466 ~~~~~I~~~~ldG--~---~~~~l~~~~--~~~p~glav~-~~~lYwtd~~~~~I~~~~k~~g~~~~~~~~~~~~p~~i~ 537 (652)
...++|.+++++. . +++++.... ...|-||+++ +++||++++..+.|.++++. |+....+......|..++
T Consensus 153 s~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~-G~~~~~i~~p~~~~t~~~ 231 (246)
T PF08450_consen 153 SFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADWGGGRIVVFDPD-GKLLREIELPVPRPTNCA 231 (246)
T ss_dssp TTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEETTTEEEEEETT-SCEEEEEE-SSSSEEEEE
T ss_pred cccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEcCCCEEEEECCC-ccEEEEEcCCCCCEEEEE
Confidence 9999999999974 2 345554432 2359999998 57999999999999999996 887777765555666555
Q ss_pred E
Q psy5768 538 A 538 (652)
Q Consensus 538 ~ 538 (652)
.
T Consensus 232 f 232 (246)
T PF08450_consen 232 F 232 (246)
T ss_dssp E
T ss_pred E
Confidence 4
No 6
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=99.75 E-value=3.3e-15 Score=177.43 Aligned_cols=299 Identities=14% Similarity=0.181 Sum_probs=196.6
Q ss_pred CCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCC------------CcCCccCCCCcEEEEccCCcE
Q psy5768 35 STLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQK------------KYPAVTACNLHIAVDWIAQNI 102 (652)
Q Consensus 35 ~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~------------~~~~p~~~~~~lavDw~~~~l 102 (652)
+++..|.+|++|+.++.||++| ..+++|++++++|.....+...+ .+..|. |||+|..++.|
T Consensus 565 s~l~~P~gvavd~~~g~lyVaD--s~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~----GIavd~~gn~L 638 (1057)
T PLN02919 565 SPLKFPGKLAIDLLNNRLFISD--SNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQ----GLAYNAKKNLL 638 (1057)
T ss_pred ccCCCCceEEEECCCCeEEEEE--CCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCc----EEEEeCCCCEE
Confidence 4678899999999999999999 89999999999987554444322 133466 99999877789
Q ss_pred EEEeCCCCEEEEEEcCCCcEEEEEeC----------------CCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCc
Q psy5768 103 YWSDPKENVIEVARLTGQYRYVLISG----------------GVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQ 166 (652)
Q Consensus 103 Y~~d~~~~~I~v~~~dg~~~~~l~~~----------------~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~ 166 (652)
|++|...++|.++++.+...+++... .+..|.++++||.+|.||++|++ +..|.+.+..+...
T Consensus 639 YVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~-~~~I~v~d~~~g~v 717 (1057)
T PLN02919 639 YVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAG-QHQIWEYNISDGVT 717 (1057)
T ss_pred EEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECC-CCeEEEEECCCCeE
Confidence 99999999999999987766666432 15689999999999999999986 55788777644333
Q ss_pred EEEEe--------------ecccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeec
Q psy5768 167 TILAQ--------------EIIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKD 232 (652)
Q Consensus 167 ~~~~~--------------~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~ 232 (652)
..+.. ..+..|+||++++.+++||++|...+.++.+-...+. ... ..
T Consensus 718 ~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~-~~~----------------~~-- 778 (1057)
T PLN02919 718 RVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGG-SRL----------------LA-- 778 (1057)
T ss_pred EEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCc-EEE----------------EE--
Confidence 22211 1257899999999999999999875554444221110 000 00
Q ss_pred cCCCCCCCCCCCCCCcccceecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccc
Q psy5768 233 AQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTM 312 (652)
Q Consensus 233 ~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~ 312 (652)
|+. + .. .+. ..-+... ++... ...
T Consensus 779 ------------gg~-------~----------~~-~~~--------l~~fG~~---------dG~g~---------~~~ 802 (1057)
T PLN02919 779 ------------GGD-------P----------TF-SDN--------LFKFGDH---------DGVGS---------EVL 802 (1057)
T ss_pred ------------ecc-------c----------cc-Ccc--------cccccCC---------CCchh---------hhh
Confidence 000 0 00 000 0000000 10000 011
Q ss_pred cceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-e-------------ccCceeeeEEEccCCEEEEEeCCCC
Q psy5768 313 MKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-E-------------RQGSVEGLAYEYVHNYLYWTCNNDA 378 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~-------------~~~~~~glAvDw~~~~LYwtd~~~~ 378 (652)
+.++.++++|.. +.||++|..+++|.+++.++.....+. . .+..|.|||+|..+ +||++|..++
T Consensus 803 l~~P~Gvavd~d-G~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG-~lyVaDt~Nn 880 (1057)
T PLN02919 803 LQHPLGVLCAKD-GQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENG-RLFVADTNNS 880 (1057)
T ss_pred ccCCceeeEeCC-CcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCC-CEEEEECCCC
Confidence 234678888865 469999999999999998765544443 2 23479999999865 6999999999
Q ss_pred eEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEE
Q psy5768 379 TINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWT 419 (652)
Q Consensus 379 ~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywt 419 (652)
+|.++++..........+.+.....|. ++.+...+||.+
T Consensus 881 ~Irvid~~~~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~ 919 (1057)
T PLN02919 881 LIRYLDLNKGEAAEILTLELKGVQPPR--PKSKSLKRLRRR 919 (1057)
T ss_pred EEEEEECCCCccceeEeeccccccCCC--Ccccchhhhhhc
Confidence 999999876431111223334555555 333333556655
No 7
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=99.70 E-value=7.2e-15 Score=174.56 Aligned_cols=210 Identities=18% Similarity=0.206 Sum_probs=158.7
Q ss_pred cccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEee-c-------------cCceeeeEEEccCCEEEEEeCC
Q psy5768 311 TMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLE-R-------------QGSVEGLAYEYVHNYLYWTCNN 376 (652)
Q Consensus 311 ~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~-~-------------~~~~~glAvDw~~~~LYwtd~~ 376 (652)
..+..+.++++|..++.||++|..+++|++++++|.....+.. + +..|.|||+|..++.||++|..
T Consensus 565 s~l~~P~gvavd~~~g~lyVaDs~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt~ 644 (1057)
T PLN02919 565 SPLKFPGKLAIDLLNNRLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADTE 644 (1057)
T ss_pred ccCCCCceEEEECCCCeEEEEECCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeCC
Confidence 3456778899999999999999999999999998864333322 1 3469999999988999999999
Q ss_pred CCeEEEEEcCCCCCccEEEEEe---------------CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEE
Q psy5768 377 DATINKIDLDSPKAQRIVVVRL---------------GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESI 441 (652)
Q Consensus 377 ~~~I~~~~~~~~~~~~~~~~~~---------------~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l 441 (652)
+++|.++++.+.. ..++... ..+..|.+|+++|..+.||++|.+.+ +|++.+..+....++
T Consensus 645 n~~Ir~id~~~~~--V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~--~I~v~d~~~g~v~~~ 720 (1057)
T PLN02919 645 NHALREIDFVNET--VRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQH--QIWEYNISDGVTRVF 720 (1057)
T ss_pred CceEEEEecCCCE--EEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCC--eEEEEECCCCeEEEE
Confidence 9999999875421 1222110 12568999999999999999998865 777776644332222
Q ss_pred EE--------------cCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEec---------------------
Q psy5768 442 IT--------------TDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSK--------------------- 486 (652)
Q Consensus 442 ~~--------------~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~--------------------- 486 (652)
.. ..+..|+||++++.+++||++|...++|..+++++.....+..
T Consensus 721 ~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~ 800 (1057)
T PLN02919 721 SGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSE 800 (1057)
T ss_pred ecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccCCCCchhh
Confidence 11 1356899999999999999999999999999998654433321
Q ss_pred CCCCceeEEEEeC-CEEEEEcCCCCeEEEEEccCCceEE
Q psy5768 487 ISPLHPFDMAVYG-EFIFWTDWVIHAVLRANKYTGEEVY 524 (652)
Q Consensus 487 ~~~~~p~glav~~-~~lYwtd~~~~~I~~~~k~~g~~~~ 524 (652)
..+.+|.||++.. +.||++|+.+++|.+++..++....
T Consensus 801 ~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~t 839 (1057)
T PLN02919 801 VLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTT 839 (1057)
T ss_pred hhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEEE
Confidence 0145899999984 5899999999999999987665443
No 8
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.63 E-value=3.6e-13 Score=135.82 Aligned_cols=231 Identities=20% Similarity=0.152 Sum_probs=161.4
Q ss_pred eeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCC
Q psy5768 40 ISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTG 119 (652)
Q Consensus 40 ~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg 119 (652)
+.++.||+.++.|||+| ...++|+|+++++...+.+ ... .|. |++++...++||+++.. .+.+.+++.
T Consensus 2 ~Egp~~d~~~g~l~~~D--~~~~~i~~~~~~~~~~~~~-~~~---~~~----G~~~~~~~g~l~v~~~~--~~~~~d~~~ 69 (246)
T PF08450_consen 2 GEGPVWDPRDGRLYWVD--IPGGRIYRVDPDTGEVEVI-DLP---GPN----GMAFDRPDGRLYVADSG--GIAVVDPDT 69 (246)
T ss_dssp EEEEEEETTTTEEEEEE--TTTTEEEEEETTTTEEEEE-ESS---SEE----EEEEECTTSEEEEEETT--CEEEEETTT
T ss_pred CcceEEECCCCEEEEEE--cCCCEEEEEECCCCeEEEE-ecC---CCc----eEEEEccCCEEEEEEcC--ceEEEecCC
Confidence 46889999999999999 8899999999998754443 332 377 99999767999999953 344448776
Q ss_pred CcEEEEEeC-----CCCCceeEEEcCCCCeEEEEecCCC-------CeEEEEeCCCCCcEEEEeecccCceeEEEeccCC
Q psy5768 120 QYRYVLISG-----GVDQPSALAVDPESGYLFWSESGKI-------PLIARAGLDGKKQTILAQEIIMPIKDITLDLKFF 187 (652)
Q Consensus 120 ~~~~~l~~~-----~~~~P~~iavd~~~g~lywtd~~~~-------~~I~~~~ldg~~~~~~~~~~~~~p~gl~lD~~~~ 187 (652)
...+.+... .+..|.++++|| +|.||+|+.+.. +.|.|...+|+ ...+...+..||||++++.++
T Consensus 70 g~~~~~~~~~~~~~~~~~~ND~~vd~-~G~ly~t~~~~~~~~~~~~g~v~~~~~~~~--~~~~~~~~~~pNGi~~s~dg~ 146 (246)
T PF08450_consen 70 GKVTVLADLPDGGVPFNRPNDVAVDP-DGNLYVTDSGGGGASGIDPGSVYRIDPDGK--VTVVADGLGFPNGIAFSPDGK 146 (246)
T ss_dssp TEEEEEEEEETTCSCTEEEEEEEE-T-TS-EEEEEECCBCTTCGGSEEEEEEETTSE--EEEEEEEESSEEEEEEETTSS
T ss_pred CcEEEEeeccCCCcccCCCceEEEcC-CCCEEEEecCCCccccccccceEEECCCCe--EEEEecCcccccceEECCcch
Confidence 555666653 467899999999 688999997531 56999988844 333455689999999999999
Q ss_pred EEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCcccceecCCCceEEEeCCcc
Q psy5768 188 SAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGVV 267 (652)
Q Consensus 188 ~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l 267 (652)
.||+++...+.+..+-. ...
T Consensus 147 ~lyv~ds~~~~i~~~~~-----------~~~------------------------------------------------- 166 (246)
T PF08450_consen 147 TLYVADSFNGRIWRFDL-----------DAD------------------------------------------------- 166 (246)
T ss_dssp EEEEEETTTTEEEEEEE-----------ETT-------------------------------------------------
T ss_pred heeecccccceeEEEec-----------ccc-------------------------------------------------
Confidence 99999987544322211 000
Q ss_pred ccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCc
Q psy5768 268 AQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSN 347 (652)
Q Consensus 268 ~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~ 347 (652)
+. .++....+ +++ .. ....+-++++|. .+.||+++...++|.+++.+|..
T Consensus 167 ---~~---------~~~~~~~~--~~~-~~--------------~~g~pDG~~vD~-~G~l~va~~~~~~I~~~~p~G~~ 216 (246)
T PF08450_consen 167 ---GG---------ELSNRRVF--IDF-PG--------------GPGYPDGLAVDS-DGNLWVADWGGGRIVVFDPDGKL 216 (246)
T ss_dssp ---TC---------CEEEEEEE--EE--SS--------------SSCEEEEEEEBT-TS-EEEEEETTTEEEEEETTSCE
T ss_pred ---cc---------ceeeeeeE--EEc-CC--------------CCcCCCcceEcC-CCCEEEEEcCCCEEEEECCCccE
Confidence 00 00001000 122 00 012367888998 56799999999999999999876
Q ss_pred ceEEeeccCceeeeEEE-ccCCEEEEEeC
Q psy5768 348 HRVLLERQGSVEGLAYE-YVHNYLYWTCN 375 (652)
Q Consensus 348 ~~~i~~~~~~~~glAvD-w~~~~LYwtd~ 375 (652)
...+......|..+|+- ...+.||+|.+
T Consensus 217 ~~~i~~p~~~~t~~~fgg~~~~~L~vTta 245 (246)
T PF08450_consen 217 LREIELPVPRPTNCAFGGPDGKTLYVTTA 245 (246)
T ss_dssp EEEEE-SSSSEEEEEEESTTSSEEEEEEB
T ss_pred EEEEcCCCCCEEEEEEECCCCCEEEEEeC
Confidence 66666556699999993 56788999864
No 9
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.54 E-value=1.1e-10 Score=123.54 Aligned_cols=303 Identities=15% Similarity=0.169 Sum_probs=193.2
Q ss_pred CEEEEEEcCC---CcEEEEEeCCCCCceeEEEcCCCCeEEEEecC--CCCeEEEEeCCCC-CcEEEEeec---ccCceeE
Q psy5768 110 NVIEVARLTG---QYRYVLISGGVDQPSALAVDPESGYLFWSESG--KIPLIARAGLDGK-KQTILAQEI---IMPIKDI 180 (652)
Q Consensus 110 ~~I~v~~~dg---~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~--~~~~I~~~~ldg~-~~~~~~~~~---~~~p~gl 180 (652)
+-|.++++|. +...+-.......|..|++||.+.+||.++.. ..+.|....++.. ....++... -..|..|
T Consensus 13 ~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i 92 (345)
T PF10282_consen 13 GGIYVFRFDEETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHI 92 (345)
T ss_dssp TEEEEEEEETTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEESSSCEEEE
T ss_pred CcEEEEEEcCCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeeccCCCCcEEE
Confidence 5666666632 21111111246899999999999999999986 4678888887765 344333332 4678999
Q ss_pred EEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCcccceecCCCceE
Q psy5768 181 TLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLYNGVSAVC 260 (652)
Q Consensus 181 ~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C 260 (652)
++|+.++.||+++|.++....+-.+. . + .+...
T Consensus 93 ~~~~~g~~l~vany~~g~v~v~~l~~-----------~-g-~l~~~---------------------------------- 125 (345)
T PF10282_consen 93 AVDPDGRFLYVANYGGGSVSVFPLDD-----------D-G-SLGEV---------------------------------- 125 (345)
T ss_dssp EECTTSSEEEEEETTTTEEEEEEECT-----------T-S-EEEEE----------------------------------
T ss_pred EEecCCCEEEEEEccCCeEEEEEccC-----------C-c-cccee----------------------------------
Confidence 99999999999999877754432211 0 0 00000
Q ss_pred EEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEE
Q psy5768 261 ACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINS 340 (652)
Q Consensus 261 ~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~ 340 (652)
..++.-.+. .. +... .....+..+.+++..+.||.+|....+|+.
T Consensus 126 ------------------~~~~~~~g~-----g~-~~~r-----------q~~~h~H~v~~~pdg~~v~v~dlG~D~v~~ 170 (345)
T PF10282_consen 126 ------------------VQTVRHEGS-----GP-NPDR-----------QEGPHPHQVVFSPDGRFVYVPDLGADRVYV 170 (345)
T ss_dssp ------------------EEEEESEEE-----ES-STTT-----------TSSTCEEEEEE-TTSSEEEEEETTTTEEEE
T ss_pred ------------------eeecccCCC-----CC-cccc-----------cccccceeEEECCCCCEEEEEecCCCEEEE
Confidence 000000000 00 1101 111346788999999999999999999999
Q ss_pred EeccCCc--ce---EE-eeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeC-------CCCCceEE
Q psy5768 341 VFFNGSN--HR---VL-LERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLG-------QHDKPRGI 407 (652)
Q Consensus 341 ~~~~g~~--~~---~i-~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~-------~~~~P~~I 407 (652)
+.++... .+ .+ +..-..|..|++++.++.+|+++...++|.+++++...+..+.+-... ....|.+|
T Consensus 171 ~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i 250 (345)
T PF10282_consen 171 YDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEI 250 (345)
T ss_dssp EEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEE
T ss_pred EEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeE
Confidence 9987543 21 11 134458999999999999999999999999999884343322222111 11378999
Q ss_pred EEeCCCCEEEEEecCCCCCceEEEeecCCC-ceEE---EEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCc--e
Q psy5768 408 DIDSCDSRIYWTNWNSHLPSIQRAFFSGFG-TESI---ITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTN--R 481 (652)
Q Consensus 408 avdp~~g~Lywtd~~~~~~~I~r~~ldG~~-~~~l---~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~--~ 481 (652)
+++|...+||+++.+.+ .|-...+|... ...+ +...-.+|.+|++|+.+++||.+....+.|..+++|... .
T Consensus 251 ~ispdg~~lyvsnr~~~--sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~~tG~l 328 (345)
T PF10282_consen 251 AISPDGRFLYVSNRGSN--SISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDIDPDTGKL 328 (345)
T ss_dssp EE-TTSSEEEEEECTTT--EEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEE
T ss_pred EEecCCCEEEEEeccCC--EEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEeCCCCcE
Confidence 99999999999999865 67777775432 2222 233456799999999999999999999999988876432 2
Q ss_pred EEEec-CCCCceeEEE
Q psy5768 482 IVLSK-ISPLHPFDMA 496 (652)
Q Consensus 482 ~~l~~-~~~~~p~gla 496 (652)
+.+.. ..+..|..|.
T Consensus 329 ~~~~~~~~~~~p~ci~ 344 (345)
T PF10282_consen 329 TPVGSSVPIPSPVCIV 344 (345)
T ss_dssp EEEEEEEESSSEEEEE
T ss_pred EEecccccCCCCEEEe
Confidence 32221 1246666554
No 10
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.53 E-value=6.8e-11 Score=124.70 Aligned_cols=300 Identities=16% Similarity=0.186 Sum_probs=188.9
Q ss_pred CcEEEEeCCCCEEEEEEcC--CCcE--EEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeec--
Q psy5768 100 QNIYWSDPKENVIEVARLT--GQYR--YVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEI-- 173 (652)
Q Consensus 100 ~~lY~~d~~~~~I~v~~~d--g~~~--~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~-- 173 (652)
..+|++....+.|.+.+++ |+.. ..+ . ....|..|+++|...+||.+.+. ...|....+++.....++...
T Consensus 2 ~~~y~~~~~~~~I~~~~~~~~g~l~~~~~~-~-~~~~~~~l~~spd~~~lyv~~~~-~~~i~~~~~~~~g~l~~~~~~~~ 78 (330)
T PRK11028 2 QIVYIASPESQQIHVWNLNHEGALTLLQVV-D-VPGQVQPMVISPDKRHLYVGVRP-EFRVLSYRIADDGALTFAAESPL 78 (330)
T ss_pred eEEEEEcCCCCCEEEEEECCCCceeeeeEE-e-cCCCCccEEECCCCCEEEEEECC-CCcEEEEEECCCCceEEeeeecC
Confidence 4689999888999999885 4322 222 1 23679999999988899999874 456766666643333333221
Q ss_pred ccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCccccee
Q psy5768 174 IMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLY 253 (652)
Q Consensus 174 ~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~ 253 (652)
...|.+|++++.+++||...+..+....+ +...
T Consensus 79 ~~~p~~i~~~~~g~~l~v~~~~~~~v~v~---------------------------~~~~-------------------- 111 (330)
T PRK11028 79 PGSPTHISTDHQGRFLFSASYNANCVSVS---------------------------PLDK-------------------- 111 (330)
T ss_pred CCCceEEEECCCCCEEEEEEcCCCeEEEE---------------------------EECC--------------------
Confidence 34789999999999999987753332221 1000
Q ss_pred cCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeec
Q psy5768 254 NGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDI 333 (652)
Q Consensus 254 ~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~ 333 (652)
++.. .. .+.. + .....+.++++++..+.+|.++.
T Consensus 112 ----------------~g~~------------~~---~~~~------------~---~~~~~~~~~~~~p~g~~l~v~~~ 145 (330)
T PRK11028 112 ----------------DGIP------------VA---PIQI------------I---EGLEGCHSANIDPDNRTLWVPCL 145 (330)
T ss_pred ----------------CCCC------------CC---ceee------------c---cCCCcccEeEeCCCCCEEEEeeC
Confidence 0000 00 0000 0 00012345667888888999998
Q ss_pred ccccEEEEeccCCcc------eEE-eeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeC-------
Q psy5768 334 QKGTINSVFFNGSNH------RVL-LERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLG------- 399 (652)
Q Consensus 334 ~~~~I~~~~~~g~~~------~~i-~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~------- 399 (652)
..++|..++++.... ..+ ...-..|.++++++.++.||.++...++|.+.+++...+..+.+....
T Consensus 146 ~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~ 225 (330)
T PRK11028 146 KEDRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFS 225 (330)
T ss_pred CCCEEEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCC
Confidence 888888888764211 111 122347999999999999999999899999999864222122222211
Q ss_pred CCCCceEEEEeCCCCEEEEEecCCCCCceEEEee--cCCCceEEEEc-CCCCCceEEEecCCCEEEEEeCCCCeEEEEec
Q psy5768 400 QHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFF--SGFGTESIITT-DITMPNALALDHQAEKLFWGDARLDKIERCDY 476 (652)
Q Consensus 400 ~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~l--dG~~~~~l~~~-~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~l 476 (652)
....|.+|+++|..++||.++.+.+ .|....+ ++...+.+-.. ....|.++++++.+++||.+....+.|..+.+
T Consensus 226 ~~~~~~~i~~~pdg~~lyv~~~~~~--~I~v~~i~~~~~~~~~~~~~~~~~~p~~~~~~~dg~~l~va~~~~~~v~v~~~ 303 (330)
T PRK11028 226 DTRWAADIHITPDGRHLYACDRTAS--LISVFSVSEDGSVLSFEGHQPTETQPRGFNIDHSGKYLIAAGQKSHHISVYEI 303 (330)
T ss_pred CCccceeEEECCCCCEEEEecCCCC--eEEEEEEeCCCCeEEEeEEEeccccCCceEECCCCCEEEEEEccCCcEEEEEE
Confidence 1124557999999999999987654 5555554 44322222211 13479999999999999999988888888877
Q ss_pred CCC--ceEEEecC-CCCceeEEEE
Q psy5768 477 DGT--NRIVLSKI-SPLHPFDMAV 497 (652)
Q Consensus 477 dG~--~~~~l~~~-~~~~p~glav 497 (652)
+.. ..+.+... ....|.++++
T Consensus 304 ~~~~g~l~~~~~~~~g~~P~~~~~ 327 (330)
T PRK11028 304 DGETGLLTELGRYAVGQGPMWVSV 327 (330)
T ss_pred cCCCCcEEEccccccCCCceEEEE
Confidence 643 22222111 1467888776
No 11
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.45 E-value=8e-10 Score=116.53 Aligned_cols=307 Identities=13% Similarity=0.150 Sum_probs=192.7
Q ss_pred eEEEecCCCCeEEEEecC--CCe--eEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEE
Q psy5768 2 FIAVSSPTQSKIVVCNLE--GEY--QTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETV 77 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~--g~~--~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v 77 (652)
|++|+....+.|.+++++ |+. ..++. ....+..|+++|..++||++. ...+.|..+..+......+
T Consensus 3 ~~y~~~~~~~~I~~~~~~~~g~l~~~~~~~--------~~~~~~~l~~spd~~~lyv~~--~~~~~i~~~~~~~~g~l~~ 72 (330)
T PRK11028 3 IVYIASPESQQIHVWNLNHEGALTLLQVVD--------VPGQVQPMVISPDKRHLYVGV--RPEFRVLSYRIADDGALTF 72 (330)
T ss_pred EEEEEcCCCCCEEEEEECCCCceeeeeEEe--------cCCCCccEEECCCCCEEEEEE--CCCCcEEEEEECCCCceEE
Confidence 689999889999999985 331 12221 224678899999999999998 6678887777653322222
Q ss_pred EeCC-CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcC--CCcEEEEEe-CCCCCceeEEEcCCCCeEEEEecCCC
Q psy5768 78 VSQK-KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLT--GQYRYVLIS-GGVDQPSALAVDPESGYLFWSESGKI 153 (652)
Q Consensus 78 ~~~~-~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~d--g~~~~~l~~-~~~~~P~~iavd~~~g~lywtd~~~~ 153 (652)
+... ....|. +|+++..++.||.+....+.|.+++++ |.....+.. .....|..++++|...++|.++.+ .
T Consensus 73 ~~~~~~~~~p~----~i~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~-~ 147 (330)
T PRK11028 73 AAESPLPGSPT----HISTDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLK-E 147 (330)
T ss_pred eeeecCCCCce----EEEECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCC-C
Confidence 2221 023566 999999888999998888999998875 432221111 234679999999988899999986 4
Q ss_pred CeEEEEeCCCCCcEE-----EEe-ecccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceee
Q psy5768 154 PLIARAGLDGKKQTI-----LAQ-EIIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIK 227 (652)
Q Consensus 154 ~~I~~~~ldg~~~~~-----~~~-~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~ 227 (652)
..|...+++...... .+. ..-..|.++++++.+++||+++...+....+... ...
T Consensus 148 ~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~-----------~~~-------- 208 (330)
T PRK11028 148 DRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLK-----------DPH-------- 208 (330)
T ss_pred CEEEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEe-----------CCC--------
Confidence 578888776422110 011 1124689999999999999987653433222110 000
Q ss_pred eeeeccCCCCCCCCCCCCCCcccceecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceee
Q psy5768 228 IYSKDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESI 307 (652)
Q Consensus 228 v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~ 307 (652)
+ .+ ..+..+.. -+....
T Consensus 209 -------------------------------------------~-------~~------~~~~~~~~-~p~~~~------ 225 (330)
T PRK11028 209 -------------------------------------------G-------EI------ECVQTLDM-MPADFS------ 225 (330)
T ss_pred -------------------------------------------C-------CE------EEEEEEec-CCCcCC------
Confidence 0 00 00000110 000100
Q ss_pred eeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCc-ceEEe---eccCceeeeEEEccCCEEEEEeCCCCeEEEE
Q psy5768 308 RNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSN-HRVLL---ERQGSVEGLAYEYVHNYLYWTCNNDATINKI 383 (652)
Q Consensus 308 ~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~-~~~i~---~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~ 383 (652)
..+.+.++.+++..++||.++...+.|..++++... ..+++ .....|.++++++.++.||.++...++|.++
T Consensus 226 ----~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~~~~~~~p~~~~~~~dg~~l~va~~~~~~v~v~ 301 (330)
T PRK11028 226 ----DTRWAADIHITPDGRHLYACDRTASLISVFSVSEDGSVLSFEGHQPTETQPRGFNIDHSGKYLIAAGQKSHHISVY 301 (330)
T ss_pred ----CCccceeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEEEeccccCCceEECCCCCEEEEEEccCCcEEEE
Confidence 002234677888888899998877888877764322 11222 2224789999999999999999888999998
Q ss_pred EcCCCCCccEEEEEeCCCCCceEEEE
Q psy5768 384 DLDSPKAQRIVVVRLGQHDKPRGIDI 409 (652)
Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~P~~Iav 409 (652)
+++...+....+-.......|.+|++
T Consensus 302 ~~~~~~g~l~~~~~~~~g~~P~~~~~ 327 (330)
T PRK11028 302 EIDGETGLLTELGRYAVGQGPMWVSV 327 (330)
T ss_pred EEcCCCCcEEEccccccCCCceEEEE
Confidence 87654332222222334678888887
No 12
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.42 E-value=4.5e-09 Score=111.40 Aligned_cols=308 Identities=16% Similarity=0.154 Sum_probs=194.2
Q ss_pred EEEecCCC---CeEEEEec--CCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccC-CcceEEEEEcCCC--cc
Q psy5768 3 IAVSSPTQ---SKIVVCNL--EGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTK-QVVTIEMAFMDGT--KR 74 (652)
Q Consensus 3 i~v~~~~~---~~I~~~~~--~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~-~~~~I~~~~~dgs--~~ 74 (652)
++|.+++. +.|+++.+ +...+..+... ....+|.-|++++.+++||.++... ..+.|..+..+.. ..
T Consensus 2 ~~vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~-----~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L 76 (345)
T PF10282_consen 2 LYVGSYTNGKGGGIYVFRFDEETGTLTLVQTV-----AEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTL 76 (345)
T ss_dssp EEEEECCSSSSTEEEEEEEETTTTEEEEEEEE-----EESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEE
T ss_pred EEEEcCCCCCCCcEEEEEEcCCCCCceEeeee-----cCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCccee
Confidence 68888886 89988877 43334333221 1457899999999999999999221 4567766555432 22
Q ss_pred EEE---EeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcC--CCcEEE--EEe----------CCCCCceeEE
Q psy5768 75 ETV---VSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLT--GQYRYV--LIS----------GGVDQPSALA 137 (652)
Q Consensus 75 ~~v---~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~d--g~~~~~--l~~----------~~~~~P~~ia 137 (652)
+.+ ...+ ..|- .|++|...+.||+++...+.|.+++++ |+.... ++. ..-.+|+.+.
T Consensus 77 ~~~~~~~~~g--~~p~----~i~~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~ 150 (345)
T PF10282_consen 77 TLLNSVPSGG--SSPC----HIAVDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVV 150 (345)
T ss_dssp EEEEEEEESS--SCEE----EEEECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEE
T ss_pred EEeeeeccCC--CCcE----EEEEecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEE
Confidence 222 2222 4566 999999999999999999999998876 443333 221 1246899999
Q ss_pred EcCCCCeEEEEecCCCCeEEEEeCCCCC--cEE---EEeecccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceE
Q psy5768 138 VDPESGYLFWSESGKIPLIARAGLDGKK--QTI---LAQEIIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVST 212 (652)
Q Consensus 138 vd~~~g~lywtd~~~~~~I~~~~ldg~~--~~~---~~~~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~ 212 (652)
++|.+.+||.+|.|. .+|....++... ... +....-..|+.+++++.++++|++.-..+....+..
T Consensus 151 ~~pdg~~v~v~dlG~-D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~-------- 221 (345)
T PF10282_consen 151 FSPDGRFVYVPDLGA-DRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDY-------- 221 (345)
T ss_dssp E-TTSSEEEEEETTT-TEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEE--------
T ss_pred ECCCCCEEEEEecCC-CEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEee--------
Confidence 999999999999984 589999887655 211 111124569999999999999998765443322211
Q ss_pred EEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCcccceecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEE
Q psy5768 213 ISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSI 292 (652)
Q Consensus 213 ~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i 292 (652)
... ++ . -..+..+
T Consensus 222 ---~~~---------------------------------------------------~g-------~------~~~~~~~ 234 (345)
T PF10282_consen 222 ---DPS---------------------------------------------------DG-------S------LTEIQTI 234 (345)
T ss_dssp ---ETT---------------------------------------------------TT-------E------EEEEEEE
T ss_pred ---ccc---------------------------------------------------CC-------c------eeEEEEe
Confidence 000 00 0 0011122
Q ss_pred ecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCC--cceEEe---eccCceeeeEEEccC
Q psy5768 293 HMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGS--NHRVLL---ERQGSVEGLAYEYVH 367 (652)
Q Consensus 293 ~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~--~~~~i~---~~~~~~~glAvDw~~ 367 (652)
.. .+.... ....+.+|.+++..+.||+++...+.|..+.++.. ..+.+- .+-..|.+|++|..+
T Consensus 235 ~~-~~~~~~----------~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g 303 (345)
T PF10282_consen 235 ST-LPEGFT----------GENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDG 303 (345)
T ss_dssp ES-CETTSC----------SSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTS
T ss_pred ee-cccccc----------ccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCC
Confidence 21 111110 01246778889999999999999999988888542 222221 334569999999999
Q ss_pred CEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEE
Q psy5768 368 NYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGID 408 (652)
Q Consensus 368 ~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Ia 408 (652)
+.||.++...+.|.+++++...+....+........|..|+
T Consensus 304 ~~l~Va~~~s~~v~vf~~d~~tG~l~~~~~~~~~~~p~ci~ 344 (345)
T PF10282_consen 304 RYLYVANQDSNTVSVFDIDPDTGKLTPVGSSVPIPSPVCIV 344 (345)
T ss_dssp SEEEEEETTTTEEEEEEEETTTTEEEEEEEEEESSSEEEEE
T ss_pred CEEEEEecCCCeEEEEEEeCCCCcEEEecccccCCCCEEEe
Confidence 99999999999999998875544322222222445666554
No 13
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=99.33 E-value=1.1e-10 Score=119.84 Aligned_cols=204 Identities=24% Similarity=0.288 Sum_probs=143.2
Q ss_pred EEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEE
Q psy5768 316 IIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVV 395 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~ 395 (652)
..+..||...+.|||+|...++|.+.+......++....-..+.++.+| ..++|.-++.+.. ..+.+. +...++
T Consensus 27 gEgP~w~~~~~~L~w~DI~~~~i~r~~~~~g~~~~~~~p~~~~~~~~~d-~~g~Lv~~~~g~~---~~~~~~--~~~~t~ 100 (307)
T COG3386 27 GEGPVWDPDRGALLWVDILGGRIHRLDPETGKKRVFPSPGGFSSGALID-AGGRLIACEHGVR---LLDPDT--GGKITL 100 (307)
T ss_pred ccCccCcCCCCEEEEEeCCCCeEEEecCCcCceEEEECCCCcccceeec-CCCeEEEEccccE---EEeccC--CceeEE
Confidence 4567789999999999999999999998744444444333346777777 4667766654432 233221 112133
Q ss_pred EEe----CCCCCceEEEEeCCCCEEEEEecC---------CCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEE
Q psy5768 396 VRL----GQHDKPRGIDIDSCDSRIYWTNWN---------SHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLF 462 (652)
Q Consensus 396 ~~~----~~~~~P~~Iavdp~~g~Lywtd~~---------~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LY 462 (652)
+.. ....+|....++| .|.+|+++.+ ...+.++|...+|.. +.++...+..||||+++++++.||
T Consensus 101 ~~~~~~~~~~~r~ND~~v~p-dG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~-~~l~~~~~~~~NGla~SpDg~tly 178 (307)
T COG3386 101 LAEPEDGLPLNRPNDGVVDP-DGRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGV-VRLLDDDLTIPNGLAFSPDGKTLY 178 (307)
T ss_pred eccccCCCCcCCCCceeEcC-CCCEEEeCCCccccCccccCCcceEEEEcCCCCE-EEeecCcEEecCceEECCCCCEEE
Confidence 322 2447899999999 5999999988 234579999886544 444555588999999999999999
Q ss_pred EEeCCCCeEEEEecC---C--Cce--EEEecCCCCceeEEEEeCC-EEE-EEcCCCCeEEEEEccCCceEEEEec
Q psy5768 463 WGDARLDKIERCDYD---G--TNR--IVLSKISPLHPFDMAVYGE-FIF-WTDWVIHAVLRANKYTGEEVYTLRK 528 (652)
Q Consensus 463 w~D~~~~~I~~~~ld---G--~~~--~~l~~~~~~~p~glav~~~-~lY-wtd~~~~~I~~~~k~~g~~~~~~~~ 528 (652)
++|....+|.+++++ | .++ .+........|-|++++.+ .|| ++-|....|.+.++. |+....+..
T Consensus 179 ~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l~~~i~l 252 (307)
T COG3386 179 VADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKLLGEIKL 252 (307)
T ss_pred EEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCCCEEEecccCCceEEEECCC-CcEEEEEEC
Confidence 999999999999998 3 122 2222223478999999955 555 455666689999986 776665553
No 14
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.22 E-value=1.8e-07 Score=96.19 Aligned_cols=298 Identities=13% Similarity=0.064 Sum_probs=170.3
Q ss_pred eEEEecCCCCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC
Q psy5768 2 FIAVSSPTQSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 80 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~ 80 (652)
+++|+...++.|.+++++ ++....+... ..+.++++++.++.+|.+. ...+.|+.++..+......+..
T Consensus 2 ~~~~s~~~d~~v~~~d~~t~~~~~~~~~~--------~~~~~l~~~~dg~~l~~~~--~~~~~v~~~d~~~~~~~~~~~~ 71 (300)
T TIGR03866 2 KAYVSNEKDNTISVIDTATLEVTRTFPVG--------QRPRGITLSKDGKLLYVCA--SDSDTIQVIDLATGEVIGTLPS 71 (300)
T ss_pred cEEEEecCCCEEEEEECCCCceEEEEECC--------CCCCceEECCCCCEEEEEE--CCCCeEEEEECCCCcEEEeccC
Confidence 578888888999999986 4444444321 2467899999888899887 6678899888775432222222
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEe
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAG 160 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ 160 (652)
+ ..+. .++++..++.+|.+....+.|.+.++........+.. ...|.+++++|...+++.+... ...+...+
T Consensus 72 ~--~~~~----~~~~~~~g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~-~~~~~~~~~~~dg~~l~~~~~~-~~~~~~~d 143 (300)
T TIGR03866 72 G--PDPE----LFALHPNGKILYIANEDDNLVTVIDIETRKVLAEIPV-GVEPEGMAVSPDGKIVVNTSET-TNMAHFID 143 (300)
T ss_pred C--CCcc----EEEECCCCCEEEEEcCCCCeEEEEECCCCeEEeEeeC-CCCcceEEECCCCCEEEEEecC-CCeEEEEe
Confidence 2 3356 8888887778888877678999999876433222222 2468999999955555544332 22333333
Q ss_pred CCCCCcEEEEeecccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCC
Q psy5768 161 LDGKKQTILAQEIIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPC 240 (652)
Q Consensus 161 ldg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C 240 (652)
+........... -..|..+++++.+++||.....++..+ +|+..
T Consensus 144 ~~~~~~~~~~~~-~~~~~~~~~s~dg~~l~~~~~~~~~v~---------------------------i~d~~-------- 187 (300)
T TIGR03866 144 TKTYEIVDNVLV-DQRPRFAEFTADGKELWVSSEIGGTVS---------------------------VIDVA-------- 187 (300)
T ss_pred CCCCeEEEEEEc-CCCccEEEECCCCCEEEEEcCCCCEEE---------------------------EEEcC--------
Confidence 332211111111 124556666655555544322111111 11100
Q ss_pred CCCCCCCcccceecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEE
Q psy5768 241 GVNNGGCAELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELS 320 (652)
Q Consensus 241 ~~~ng~Cs~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~ 320 (652)
+...+.++.. ...... .....+.++.
T Consensus 188 --------------------------------------------~~~~~~~~~~-~~~~~~---------~~~~~~~~i~ 213 (300)
T TIGR03866 188 --------------------------------------------TRKVIKKITF-EIPGVH---------PEAVQPVGIK 213 (300)
T ss_pred --------------------------------------------cceeeeeeee-cccccc---------cccCCccceE
Confidence 0000111111 000000 0001234566
Q ss_pred EEcCCCeEEEeecccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCC
Q psy5768 321 YDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQ 400 (652)
Q Consensus 321 ~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~ 400 (652)
|++..+.+|++....++|..+++........+..-..+.+|++.+.++.||.+....+.|.+.++.+. +.+-.+..
T Consensus 214 ~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~----~~~~~~~~ 289 (300)
T TIGR03866 214 LTKDGKTAFVALGPANRVAVVDAKTYEVLDYLLVGQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAAL----KVIKSIKV 289 (300)
T ss_pred ECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEeCCCcceEEECCCCCEEEEEcCCCCeEEEEECCCC----cEEEEEEc
Confidence 77766777776655566777766533222222222368899999999999988877889999998763 22222334
Q ss_pred CCCceEEEEeC
Q psy5768 401 HDKPRGIDIDS 411 (652)
Q Consensus 401 ~~~P~~Iavdp 411 (652)
...|.+||+.|
T Consensus 290 ~~~~~~~~~~~ 300 (300)
T TIGR03866 290 GRLPWGVVVRP 300 (300)
T ss_pred ccccceeEeCC
Confidence 68999999875
No 15
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=99.18 E-value=1.9e-07 Score=94.05 Aligned_cols=297 Identities=14% Similarity=0.164 Sum_probs=196.1
Q ss_pred CcEEEEeCC---CCEEEEEEcCCCc---EEEEEeCCCCCceeEEEcCCCCeEEEEecC-CCCeEEEEeCCCC-CcEEEEe
Q psy5768 100 QNIYWSDPK---ENVIEVARLTGQY---RYVLISGGVDQPSALAVDPESGYLFWSESG-KIPLIARAGLDGK-KQTILAQ 171 (652)
Q Consensus 100 ~~lY~~d~~---~~~I~v~~~dg~~---~~~l~~~~~~~P~~iavd~~~g~lywtd~~-~~~~I~~~~ldg~-~~~~~~~ 171 (652)
.++|+.-.. .+-|.+++++.+. .....-..+.+|..|+++|....||....- ..+.|.....|+. .+..+++
T Consensus 3 ~~~YiGtyT~~~s~gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln 82 (346)
T COG2706 3 QTVYIGTYTKRESQGIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLN 82 (346)
T ss_pred eEEEEeeecccCCCceEEEEEeCcccccchhhhccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEee
Confidence 356765544 6779998888432 212222257899999999988899988643 3567888888865 6666665
Q ss_pred ec---ccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCc
Q psy5768 172 EI---IMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCA 248 (652)
Q Consensus 172 ~~---~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs 248 (652)
.. -..|.-+++|..++.||.++|.++...+.-. ..+ |
T Consensus 83 ~~~~~g~~p~yvsvd~~g~~vf~AnY~~g~v~v~p~-----------~~d--------------------------G--- 122 (346)
T COG2706 83 RQTLPGSPPCYVSVDEDGRFVFVANYHSGSVSVYPL-----------QAD--------------------------G--- 122 (346)
T ss_pred ccccCCCCCeEEEECCCCCEEEEEEccCceEEEEEc-----------ccC--------------------------C---
Confidence 54 3456999999999999999998765433211 000 0
Q ss_pred ccceecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeE
Q psy5768 249 ELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTL 328 (652)
Q Consensus 249 ~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~l 328 (652)
-+. ..+-.+.- ++.. |.+. .....+....+++..+.|
T Consensus 123 -----------------~l~------------------~~v~~~~h-~g~~---p~~r----Q~~~h~H~a~~tP~~~~l 159 (346)
T COG2706 123 -----------------SLQ------------------PVVQVVKH-TGSG---PHER----QESPHVHSANFTPDGRYL 159 (346)
T ss_pred -----------------ccc------------------cceeeeec-CCCC---CCcc----ccCCccceeeeCCCCCEE
Confidence 000 00000000 0000 0000 000224556778888899
Q ss_pred EEeecccccEEEEeccCCcc----eEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCcc---EEEEEe---
Q psy5768 329 FYSDIQKGTINSVFFNGSNH----RVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQR---IVVVRL--- 398 (652)
Q Consensus 329 ywsd~~~~~I~~~~~~g~~~----~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~---~~~~~~--- 398 (652)
+..|....+|+..+++.+.. +..+..-.+|.-|++-+-++..|....-+++|.+...+...+.. +++..+
T Consensus 160 ~v~DLG~Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~d 239 (346)
T COG2706 160 VVPDLGTDRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPED 239 (346)
T ss_pred EEeecCCceEEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccc
Confidence 99999999988887763211 12224445899999999999999999999999999988754432 222222
Q ss_pred -CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEE--EcCCCCCceEEEecCCCEEEEEeCCCCeEEEEe
Q psy5768 399 -GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESII--TTDITMPNALALDHQAEKLFWGDARLDKIERCD 475 (652)
Q Consensus 399 -~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~--~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ 475 (652)
.....-.+|.|.|..++||.++.+.+.-.+++..-+|..-+.+- .+...+|..+.|++.++.|+.+....+.|..+.
T Consensus 240 F~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~ 319 (346)
T COG2706 240 FTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNITVFE 319 (346)
T ss_pred cCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEeccCCcCCccceeCCCCCEEEEEccCCCcEEEEE
Confidence 22345567999999999999999977555566666655432222 234557999999999999999998888887777
Q ss_pred cCCC
Q psy5768 476 YDGT 479 (652)
Q Consensus 476 ldG~ 479 (652)
.|..
T Consensus 320 ~d~~ 323 (346)
T COG2706 320 RDKE 323 (346)
T ss_pred EcCC
Confidence 7653
No 16
>KOG4659|consensus
Probab=99.16 E-value=3.5e-09 Score=119.45 Aligned_cols=148 Identities=20% Similarity=0.196 Sum_probs=107.0
Q ss_pred CCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEE
Q psy5768 36 TLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVA 115 (652)
Q Consensus 36 ~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~ 115 (652)
.|=.|+++|+-+ +|.||+-| .+-|.|+..||+-. +|++-+ +..|. +---||+|++.+-||++|....+|.+.
T Consensus 363 ~L~aPvala~a~-DGSl~VGD----fNyIRRI~~dg~v~-tIl~L~-~t~~s-h~Yy~AvsPvdgtlyvSdp~s~qv~rv 434 (1899)
T KOG4659|consen 363 SLFAPVALAYAP-DGSLIVGD----FNYIRRISQDGQVS-TILTLG-LTDTS-HSYYIAVSPVDGTLYVSDPLSKQVWRV 434 (1899)
T ss_pred eeeceeeEEEcC-CCcEEEcc----chheeeecCCCceE-EEEEec-CCCcc-ceeEEEecCcCceEEecCCCcceEEEe
Confidence 345789999986 69999988 67999999999864 444444 33333 223799999999999999999888776
Q ss_pred E-cCCC----cEEEEEeC--------------------CCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEE--
Q psy5768 116 R-LTGQ----YRYVLISG--------------------GVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTI-- 168 (652)
Q Consensus 116 ~-~dg~----~~~~l~~~--------------------~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~-- 168 (652)
. +.++ +-+++..+ .|..|++||+|. .|.||++|.. .|...+-+|--.+.
T Consensus 435 ~sl~~~d~~~N~evvaG~Ge~Clp~desCGDGalA~dA~L~~PkGIa~dk-~g~lYfaD~t---~IR~iD~~giIstlig 510 (1899)
T KOG4659|consen 435 SSLEPQDSRNNYEVVAGDGEVCLPADESCGDGALAQDAQLIFPKGIAFDK-MGNLYFADGT---RIRVIDTTGIISTLIG 510 (1899)
T ss_pred ccCCccccccCeeEEeccCcCccccccccCcchhcccceeccCCceeEcc-CCcEEEeccc---EEEEeccCceEEEecc
Confidence 3 3321 22333321 467899999996 8999999964 56655544322221
Q ss_pred ----------------EEeecccCceeEEEeccCCEEEEEeCC
Q psy5768 169 ----------------LAQEIIMPIKDITLDLKFFSAFYRNLS 195 (652)
Q Consensus 169 ----------------~~~~~~~~p~gl~lD~~~~~ly~~d~~ 195 (652)
++.-.+.||..||+|+-++.||+.|-+
T Consensus 511 ~~~~~~~p~~C~~~~kl~~~~leWPT~LaV~Pmdnsl~Vld~n 553 (1899)
T KOG4659|consen 511 TTPDQHPPRTCAQITKLVDLQLEWPTSLAVDPMDNSLLVLDTN 553 (1899)
T ss_pred CCCCccCccccccccchhheeeecccceeecCCCCeEEEeecc
Confidence 222237899999999999999998765
No 17
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=99.09 E-value=2.5e-07 Score=99.19 Aligned_cols=295 Identities=16% Similarity=0.117 Sum_probs=190.5
Q ss_pred EEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCc
Q psy5768 13 IVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLH 92 (652)
Q Consensus 13 I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~ 92 (652)
+.+++..+..+...++. .+.|.++++++....+|+++ .....+.....--...+.....+ ...|. +
T Consensus 13 ~~v~~~~~~~~~~~~~~-------~~~~~~v~~~~~g~~~~v~~--~~~~~~~~~~~~~n~~~~~~~~g-~~~p~----~ 78 (381)
T COG3391 13 VSVINTGTNKVTAAISL-------GRGPGGVAVNPDGTQVYVAN--SGSNDVSVIDATSNTVTQSLSVG-GVYPA----G 78 (381)
T ss_pred eEEEeecccEEEEEeec-------CCCCceeEEcCccCEEEEEe--ecCceeeecccccceeeeeccCC-Ccccc----c
Confidence 66666666555555442 24899999999988999999 55555555544411122223333 45567 9
Q ss_pred EEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCC-CCeEEEEeCCCCCcEEEEe
Q psy5768 93 IAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGK-IPLIARAGLDGKKQTILAQ 171 (652)
Q Consensus 93 lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~-~~~I~~~~ldg~~~~~~~~ 171 (652)
++++..+.++|.+....+.|.+++.........+.-+ ..|.+++++|..+++|.++.+. ...|...+-........+.
T Consensus 79 i~v~~~~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG-~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~ 157 (381)
T COG3391 79 VAVNPAGNKVYVTTGDSNTVSVIDTATNTVLGSIPVG-LGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIP 157 (381)
T ss_pred eeeCCCCCeEEEecCCCCeEEEEcCcccceeeEeeec-cCCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEEEe
Confidence 9999999999999999999999996655444333333 3999999999999999999863 5667776554433332222
Q ss_pred ecccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCcccc
Q psy5768 172 EIIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELC 251 (652)
Q Consensus 172 ~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC 251 (652)
. -..|.++++|+.++++|..+.+.+....+ ...
T Consensus 158 v-G~~P~~~a~~p~g~~vyv~~~~~~~v~vi-------------~~~--------------------------------- 190 (381)
T COG3391 158 V-GNTPTGVAVDPDGNKVYVTNSDDNTVSVI-------------DTS--------------------------------- 190 (381)
T ss_pred c-CCCcceEEECCCCCeEEEEecCCCeEEEE-------------eCC---------------------------------
Confidence 2 23689999999999999998543222111 000
Q ss_pred eecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEe
Q psy5768 252 LYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYS 331 (652)
Q Consensus 252 ~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lyws 331 (652)
...+.+ .- .... + .....+..+++++...++|.+
T Consensus 191 ----------------------------------~~~v~~-~~-~~~~-------~---~~~~~P~~i~v~~~g~~~yV~ 224 (381)
T COG3391 191 ----------------------------------GNSVVR-GS-VGSL-------V---GVGTGPAGIAVDPDGNRVYVA 224 (381)
T ss_pred ----------------------------------Ccceec-cc-cccc-------c---ccCCCCceEEECCCCCEEEEE
Confidence 000110 00 0000 0 001234567778888889999
Q ss_pred eccc--ccEEEEeccCCcceEE-e--eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCC---
Q psy5768 332 DIQK--GTINSVFFNGSNHRVL-L--ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDK--- 403 (652)
Q Consensus 332 d~~~--~~I~~~~~~g~~~~~i-~--~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~--- 403 (652)
+..+ +.+.+++......... . ... .|.+++++|.+..+|.++...+.+.+++.... ..+........
T Consensus 225 ~~~~~~~~v~~id~~~~~v~~~~~~~~~~-~~~~v~~~p~g~~~yv~~~~~~~V~vid~~~~----~v~~~~~~~~~~~~ 299 (381)
T COG3391 225 NDGSGSNNVLKIDTATGNVTATDLPVGSG-APRGVAVDPAGKAAYVANSQGGTVSVIDGATD----RVVKTGPTGNEALG 299 (381)
T ss_pred eccCCCceEEEEeCCCceEEEeccccccC-CCCceeECCCCCEEEEEecCCCeEEEEeCCCC----ceeeeecccccccc
Confidence 8876 6888888665443332 1 444 79999999999999999998899999986542 23333233333
Q ss_pred -ceEEEEeCCCCEEEEEe
Q psy5768 404 -PRGIDIDSCDSRIYWTN 420 (652)
Q Consensus 404 -P~~Iavdp~~g~Lywtd 420 (652)
|..+++.+.....|.+.
T Consensus 300 ~~~~~~~~~~~~~~~~~~ 317 (381)
T COG3391 300 EPVSIAISPLYDTNYVSV 317 (381)
T ss_pred cceeccceeeccccccee
Confidence 77888877555555443
No 18
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.06 E-value=2.7e-06 Score=87.51 Aligned_cols=135 Identities=13% Similarity=0.104 Sum_probs=93.4
Q ss_pred ceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeC------CCCCceEEEEeCCCCEEEEEecCCCCCceEE
Q psy5768 357 SVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLG------QHDKPRGIDIDSCDSRIYWTNWNSHLPSIQR 430 (652)
Q Consensus 357 ~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~------~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r 430 (652)
.|..+++++.++.||.+....+.|.+.++.... ....+... ....|.+|+++|...++|++..+.. +|..
T Consensus 158 ~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~--~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~--~i~v 233 (300)
T TIGR03866 158 RPRFAEFTADGKELWVSSEIGGTVSVIDVATRK--VIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPAN--RVAV 233 (300)
T ss_pred CccEEEECCCCCEEEEEcCCCCEEEEEEcCcce--eeeeeeecccccccccCCccceEECCCCCEEEEEcCCCC--eEEE
Confidence 577899999999998876667888888876422 11111111 1235788999998888998865543 5666
Q ss_pred EeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEE
Q psy5768 431 AFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAV 497 (652)
Q Consensus 431 ~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav 497 (652)
.++........+.. -..|.++++.+.+++||.+....+.|..+|+++.....-+..+ ..|++|++
T Consensus 234 ~d~~~~~~~~~~~~-~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~~~~~~~~~~-~~~~~~~~ 298 (300)
T TIGR03866 234 VDAKTYEVLDYLLV-GQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAALKVIKSIKVG-RLPWGVVV 298 (300)
T ss_pred EECCCCcEEEEEEe-CCCcceEEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEcc-cccceeEe
Confidence 66653332222221 2468899999999999998777789999999987754444444 88999985
No 19
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=99.05 E-value=1.2e-07 Score=101.33 Aligned_cols=250 Identities=14% Similarity=0.176 Sum_probs=146.4
Q ss_pred cCCccCCCCcEEEEccCCcEEEEeC-----------CC-CEEEEEEc---CCCc-EEEEEeCCCCCceeEEEcCCCCeEE
Q psy5768 83 YPAVTACNLHIAVDWIAQNIYWSDP-----------KE-NVIEVARL---TGQY-RYVLISGGVDQPSALAVDPESGYLF 146 (652)
Q Consensus 83 ~~~p~~~~~~lavDw~~~~lY~~d~-----------~~-~~I~v~~~---dg~~-~~~l~~~~~~~P~~iavd~~~g~ly 146 (652)
+..|. +||+|. .++||+++. .. ++|.+... ||.. ...++..++..|++|++.+ +| ||
T Consensus 13 ~~~P~----~ia~d~-~G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~-~G-ly 85 (367)
T TIGR02604 13 LRNPI----AVCFDE-RGRLWVAEGITYSRPAGRQGPLGDRILILEDADGDGKYDKSNVFAEELSMVTGLAVAV-GG-VY 85 (367)
T ss_pred cCCCc----eeeECC-CCCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCCCCCcceeEEeecCCCCccceeEec-CC-EE
Confidence 77788 999996 688999974 22 38887754 5654 3345566789999999987 66 99
Q ss_pred EEecCCCCeEEEE-eCCCC-----CcEEEEeec-------ccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEE
Q psy5768 147 WSESGKIPLIARA-GLDGK-----KQTILAQEI-------IMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTI 213 (652)
Q Consensus 147 wtd~~~~~~I~~~-~ldg~-----~~~~~~~~~-------~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~ 213 (652)
+++. +.|.+. ..+|. .+++++..- .+.+++|++++ +++||+..-+.++....
T Consensus 86 V~~~---~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gp-DG~LYv~~G~~~~~~~~----------- 150 (367)
T TIGR02604 86 VATP---PDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGP-DGWLYFNHGNTLASKVT----------- 150 (367)
T ss_pred EeCC---CeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECC-CCCEEEecccCCCceec-----------
Confidence 9963 478777 34442 333444321 24488999987 46899876542221100
Q ss_pred EeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCcccceecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEe
Q psy5768 214 SMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIH 293 (652)
Q Consensus 214 ~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~ 293 (652)
.+... .. + .......|.+++
T Consensus 151 --~~~~~------------------~~-----------------------------~-----------~~~~~g~i~r~~ 170 (367)
T TIGR02604 151 --RPGTS------------------DE-----------------------------S-----------RQGLGGGLFRYN 170 (367)
T ss_pred --cCCCc------------------cC-----------------------------c-----------ccccCceEEEEe
Confidence 00000 00 0 001122445555
Q ss_pred cCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEecc--C----------C---------cceE--
Q psy5768 294 MTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFN--G----------S---------NHRV-- 350 (652)
Q Consensus 294 l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~--g----------~---------~~~~-- 350 (652)
. ++... +.+ ..+++++.+++||+ .+.+|.+|.......+++.- | . ..+.
T Consensus 171 p-dg~~~----e~~--a~G~rnp~Gl~~d~-~G~l~~tdn~~~~~~~i~~~~~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 242 (367)
T TIGR02604 171 P-DGGKL----RVV--AHGFQNPYGHSVDS-WGDVFFCDNDDPPLCRVTPVAEGGRNGYQSFNGRRYDHADRGADHEVPT 242 (367)
T ss_pred c-CCCeE----EEE--ecCcCCCccceECC-CCCEEEEccCCCceeEEcccccccccCCCCCCCcccccccccccccccc
Confidence 4 33221 222 24567778888886 56778877644433333210 0 0 0000
Q ss_pred -------------E-e-eccCceeeeEEE-------ccCCEEEEEeCCCCeEEEEEcCCCCCc--c--EEEEEe-CCCCC
Q psy5768 351 -------------L-L-ERQGSVEGLAYE-------YVHNYLYWTCNNDATINKIDLDSPKAQ--R--IVVVRL-GQHDK 403 (652)
Q Consensus 351 -------------i-~-~~~~~~~glAvD-------w~~~~LYwtd~~~~~I~~~~~~~~~~~--~--~~~~~~-~~~~~ 403 (652)
. . ..-..|.|+++- .-.++||+++...+.|.++.++..... . ...+.. ....+
T Consensus 243 ~~~~~~~~~~~~~~~~~g~~~ap~G~~~y~g~~fp~~~~g~~fv~~~~~~~v~~~~l~~~g~~~~~~~~~~l~~~~~~~r 322 (367)
T TIGR02604 243 GEWRQDDRGVETVGDVAGGGTAPCGIAFYRGDALPEEYRGLLLVGDAHGQLIVRYSLEPKGAGFKGERPEFLRSNDTWFR 322 (367)
T ss_pred cccccccccccccccccCCCccccEEEEeCCCcCCHHHCCCEEeeeccCCEEEEEEeecCCCccEeecCceEecCCCccc
Confidence 0 0 111268899887 456789999999999999988632111 1 122222 12269
Q ss_pred ceEEEEeCCCCEEEEEecCC
Q psy5768 404 PRGIDIDSCDSRIYWTNWNS 423 (652)
Q Consensus 404 P~~Iavdp~~g~Lywtd~~~ 423 (652)
|++|+++| .|.||++||..
T Consensus 323 p~dv~~~p-DG~Lyv~d~~~ 341 (367)
T TIGR02604 323 PVNVTVGP-DGALYVSDWYD 341 (367)
T ss_pred ccceeECC-CCCEEEEEecc
Confidence 99999999 68899999864
No 20
>KOG4659|consensus
Probab=99.04 E-value=2.2e-08 Score=113.20 Aligned_cols=242 Identities=20% Similarity=0.247 Sum_probs=161.2
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe---eccCceeeeEEEccCCEEEEEeCCCCeEEEEE-cCCCC
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL---ERQGSVEGLAYEYVHNYLYWTCNNDATINKID-LDSPK 389 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~---~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~-~~~~~ 389 (652)
-.|+|++|-+ .+.||+-|. +-|.|+..+|....++- +....-.-||+|++.+.||.+|....+|+++. +...+
T Consensus 365 ~aPvala~a~-DGSl~VGDf--NyIRRI~~dg~v~tIl~L~~t~~sh~Yy~AvsPvdgtlyvSdp~s~qv~rv~sl~~~d 441 (1899)
T KOG4659|consen 365 FAPVALAYAP-DGSLIVGDF--NYIRRISQDGQVSTILTLGLTDTSHSYYIAVSPVDGTLYVSDPLSKQVWRVSSLEPQD 441 (1899)
T ss_pred eceeeEEEcC-CCcEEEccc--hheeeecCCCceEEEEEecCCCccceeEEEecCcCceEEecCCCcceEEEeccCCccc
Confidence 3478888754 578999986 67999999987655444 33445667999999999999999999998874 32211
Q ss_pred Cc--cEEEE-------------------EeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEE-------
Q psy5768 390 AQ--RIVVV-------------------RLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESI------- 441 (652)
Q Consensus 390 ~~--~~~~~-------------------~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l------- 441 (652)
.. .+.+. ....+..|+|||+|. .|.||++|.- .|...+-+|--.+.+
T Consensus 442 ~~~N~evvaG~Ge~Clp~desCGDGalA~dA~L~~PkGIa~dk-~g~lYfaD~t----~IR~iD~~giIstlig~~~~~~ 516 (1899)
T KOG4659|consen 442 SRNNYEVVAGDGEVCLPADESCGDGALAQDAQLIFPKGIAFDK-MGNLYFADGT----RIRVIDTTGIISTLIGTTPDQH 516 (1899)
T ss_pred cccCeeEEeccCcCccccccccCcchhcccceeccCCceeEcc-CCcEEEeccc----EEEEeccCceEEEeccCCCCcc
Confidence 11 12222 113567999999997 8999999954 555555554322221
Q ss_pred -----------EEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEec-------------------CCCCc
Q psy5768 442 -----------ITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSK-------------------ISPLH 491 (652)
Q Consensus 442 -----------~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~-------------------~~~~~ 491 (652)
+.-.+.||..||+|+-.+.||+.|. +.|.+++.++.-+..... ..+..
T Consensus 517 ~p~~C~~~~kl~~~~leWPT~LaV~Pmdnsl~Vld~--nvvlrit~~~rV~Ii~GrP~hC~~a~~t~~~skla~H~tl~~ 594 (1899)
T KOG4659|consen 517 PPRTCAQITKLVDLQLEWPTSLAVDPMDNSLLVLDT--NVVLRITVVHRVRIILGRPTHCDLANATSSASKLADHRTLLI 594 (1899)
T ss_pred CccccccccchhheeeecccceeecCCCCeEEEeec--ceEEEEccCccEEEEcCCccccccCCCchhhhhhhhhhhhhh
Confidence 1223679999999999999999985 678888888765522210 01234
Q ss_pred eeEEEEe-CCEEEEEcCCCCeEEEEEccCCceEEEEecccCCcceeEEEeccCCCCCCCCCCCCCCCCccccccCCC---
Q psy5768 492 PFDMAVY-GEFIFWTDWVIHAVLRANKYTGEEVYTLRKNIRRPMGIVAISDNLDACAKTPCRHLNGNCDDICKLDET--- 567 (652)
Q Consensus 492 p~glav~-~~~lYwtd~~~~~I~~~~k~~g~~~~~~~~~~~~p~~i~~~~~~~~~~~~~~C~~~ng~Cs~lCl~~~~--- 567 (652)
|.+|++- .+-||+++....+|-|+-+.+... .|.++. ....||...+..| .-|+....
T Consensus 595 ~r~Iavg~~G~lyvaEsD~rriNrvr~~~tdg------------~i~ila-----Ga~S~C~C~~~~~-cdcfs~~~~~A 656 (1899)
T KOG4659|consen 595 QRDIAVGTDGALYVAESDGRRINRVRKLSTDG------------TISILA-----GAKSPCSCDVAAC-CDCFSLRDVAA 656 (1899)
T ss_pred hhceeecCCceEEEEeccchhhhheEEeccCc------------eEEEec-----CCCCCCCcccccC-Cccccccchhh
Confidence 6788886 679999999888887776643221 123332 1346776666665 23432211
Q ss_pred -------CceeeeccCceeeccC
Q psy5768 568 -------GQVVCSCFTGKVLMED 583 (652)
Q Consensus 568 -------~~~~C~Cp~g~~l~~d 583 (652)
-...|.-|+|-+...|
T Consensus 657 t~A~lnsp~alaVsPdg~v~IAD 679 (1899)
T KOG4659|consen 657 TQAKLNSPYALAVSPDGDVIIAD 679 (1899)
T ss_pred hccccCCcceEEECCCCcEEEec
Confidence 1467899999877666
No 21
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=98.99 E-value=1.2e-09 Score=76.95 Aligned_cols=42 Identities=26% Similarity=0.552 Sum_probs=39.4
Q ss_pred CeEEEEecCCCCeEEEEeCCCCCcEEEEeecccCceeEEEec
Q psy5768 143 GYLFWSESGKIPLIARAGLDGKKQTILAQEIIMPIKDITLDL 184 (652)
Q Consensus 143 g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~~~~p~gl~lD~ 184 (652)
|+|||||++..+.|++++|||+++++++.+.+..|.|||||+
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 689999998667999999999999999999999999999985
No 22
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.98 E-value=2.2e-10 Score=76.73 Aligned_cols=35 Identities=46% Similarity=1.133 Sum_probs=30.4
Q ss_pred CCCCCCCCcccceecCCCceEEEeCCc-cccCCCcc
Q psy5768 240 CGVNNGGCAELCLYNGVSAVCACAHGV-VAQDGKSC 274 (652)
Q Consensus 240 C~~~ng~Cs~lC~~~~~~~~C~C~~G~-l~~dg~~C 274 (652)
|..+||+|+|+|+..+.+|+|+|+.|| |.+|+++|
T Consensus 1 C~~~NGgC~h~C~~~~g~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 1 CSVNNGGCSHICVNTPGSYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp CTTGGGGSSSEEEEETTSEEEE-STTEEE-TTSSSE
T ss_pred CCCCCCCcCCCCccCCCceEeECCCCCEECcCCCCC
Confidence 677899999999998889999999998 77899988
No 23
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=98.98 E-value=8.8e-10 Score=77.60 Aligned_cols=42 Identities=38% Similarity=0.889 Sum_probs=39.4
Q ss_pred CEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEec
Q psy5768 414 SRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDH 456 (652)
Q Consensus 414 g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~ 456 (652)
++|||||++.. ++|++++|||+++++++..++.+|.|||||+
T Consensus 1 ~~iYWtD~~~~-~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQD-PSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTT-EEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEECCCC-cEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 58999999976 6999999999999999999999999999995
No 24
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=98.97 E-value=4.3e-07 Score=95.43 Aligned_cols=148 Identities=20% Similarity=0.171 Sum_probs=102.8
Q ss_pred CCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC------CCcCCccCCCCcEEEEc---cCCcEEEEeC
Q psy5768 37 LSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ------KKYPAVTACNLHIAVDW---IAQNIYWSDP 107 (652)
Q Consensus 37 ~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~------~~~~~p~~~~~~lavDw---~~~~lY~~d~ 107 (652)
|++|++|++.|. ++||+++ . .++|+++..+|+....+... + ...+- |||++. .++.||++.+
T Consensus 1 L~~P~~~a~~pd-G~l~v~e--~-~G~i~~~~~~g~~~~~v~~~~~v~~~~-~~gll----gia~~p~f~~n~~lYv~~t 71 (331)
T PF07995_consen 1 LNNPRSMAFLPD-GRLLVAE--R-SGRIWVVDKDGSLKTPVADLPEVFADG-ERGLL----GIAFHPDFASNGYLYVYYT 71 (331)
T ss_dssp ESSEEEEEEETT-SCEEEEE--T-TTEEEEEETTTEECEEEEE-TTTBTST-TBSEE----EEEE-TTCCCC-EEEEEEE
T ss_pred CCCceEEEEeCC-CcEEEEe--C-CceEEEEeCCCcCcceecccccccccc-cCCcc----cceeccccCCCCEEEEEEE
Confidence 468999999986 8999999 5 89999999888763333221 2 23344 999998 3578998776
Q ss_pred CC--------CEEEEEEcCCC-----cEEEEEeC------CCCCceeEEEcCCCCeEEEEecC------------CCCeE
Q psy5768 108 KE--------NVIEVARLTGQ-----YRYVLISG------GVDQPSALAVDPESGYLFWSESG------------KIPLI 156 (652)
Q Consensus 108 ~~--------~~I~v~~~dg~-----~~~~l~~~------~~~~P~~iavd~~~g~lywtd~~------------~~~~I 156 (652)
.. .+|.+..++.. ..++++.. ....-..|+++| .|+||++-.. ..++|
T Consensus 72 ~~~~~~~~~~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~i 150 (331)
T PF07995_consen 72 NADEDGGDNDNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKI 150 (331)
T ss_dssp EE-TSSSSEEEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEE
T ss_pred cccCCCCCcceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCC-CCcEEEEeCCCCCcccccccccccceE
Confidence 32 46888877543 23444432 234567899999 7899998421 14689
Q ss_pred EEEeCCCCC------------cEEEEeecccCceeEEEeccCCEEEEEeC
Q psy5768 157 ARAGLDGKK------------QTILAQEIIMPIKDITLDLKFFSAFYRNL 194 (652)
Q Consensus 157 ~~~~ldg~~------------~~~~~~~~~~~p~gl~lD~~~~~ly~~d~ 194 (652)
.|.+.||+- ...++..++..|.|+++|+.+++||..+-
T Consensus 151 lri~~dG~~p~dnP~~~~~~~~~~i~A~GlRN~~~~~~d~~tg~l~~~d~ 200 (331)
T PF07995_consen 151 LRIDPDGSIPADNPFVGDDGADSEIYAYGLRNPFGLAFDPNTGRLWAADN 200 (331)
T ss_dssp EEEETTSSB-TTSTTTTSTTSTTTEEEE--SEEEEEEEETTTTEEEEEEE
T ss_pred EEecccCcCCCCCccccCCCceEEEEEeCCCccccEEEECCCCcEEEEcc
Confidence 999999872 23456778999999999999999999874
No 25
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=98.97 E-value=1.8e-07 Score=100.27 Aligned_cols=204 Identities=16% Similarity=0.150 Sum_probs=150.3
Q ss_pred EEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeC--CCCeEEEEEcCCCCCcc
Q psy5768 316 IIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCN--NDATINKIDLDSPKAQR 392 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~--~~~~I~~~~~~~~~~~~ 392 (652)
+.+++++....++|..+...+.|..++.+.......+ -+. .|.++|+|+.++.+|.++. ..+++.+++-.. .
T Consensus 76 p~~i~v~~~~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG~-~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t----~ 150 (381)
T COG3391 76 PAGVAVNPAGNKVYVTTGDSNTVSVIDTATNTVLGSIPVGL-GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAAT----N 150 (381)
T ss_pred ccceeeCCCCCeEEEecCCCCeEEEEcCcccceeeEeeecc-CCceEEECCCCCEEEEEecccCCceEEEEeCCC----C
Confidence 4567788888999999988899999985543322222 233 8999999999999999999 468999998654 2
Q ss_pred EEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceE-E---EEcCCCCCceEEEecCCCEEEEEeCCC
Q psy5768 393 IVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTES-I---ITTDITMPNALALDHQAEKLFWGDARL 468 (652)
Q Consensus 393 ~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~-l---~~~~l~~P~glaiD~~~~~LYw~D~~~ 468 (652)
..+........|.++|++|...++|.++...+ .|...+..+..... - .-.....|.++++++.+.++|.++...
T Consensus 151 ~~~~~~~vG~~P~~~a~~p~g~~vyv~~~~~~--~v~vi~~~~~~v~~~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~~ 228 (381)
T COG3391 151 KVTATIPVGNTPTGVAVDPDGNKVYVTNSDDN--TVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDGS 228 (381)
T ss_pred eEEEEEecCCCcceEEECCCCCeEEEEecCCC--eEEEEeCCCcceeccccccccccCCCCceEEECCCCCEEEEEeccC
Confidence 33333445568999999999999999996543 66666655543331 0 012345799999999999999999887
Q ss_pred --CeEEEEecCCCceEEE--ecCCCCceeEEEEe--CCEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 469 --DKIERCDYDGTNRIVL--SKISPLHPFDMAVY--GEFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 469 --~~I~~~~ldG~~~~~l--~~~~~~~p~glav~--~~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
..+..++......... ..... .|+++++. +.++|.++...+.+..++..+......+.
T Consensus 229 ~~~~v~~id~~~~~v~~~~~~~~~~-~~~~v~~~p~g~~~yv~~~~~~~V~vid~~~~~v~~~~~ 292 (381)
T COG3391 229 GSNNVLKIDTATGNVTATDLPVGSG-APRGVAVDPAGKAAYVANSQGGTVSVIDGATDRVVKTGP 292 (381)
T ss_pred CCceEEEEeCCCceEEEeccccccC-CCCceeECCCCCEEEEEecCCCeEEEEeCCCCceeeeec
Confidence 5888888877665554 22234 79999986 67999999888999999877665555444
No 26
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.97 E-value=2.1e-10 Score=76.86 Aligned_cols=35 Identities=37% Similarity=0.685 Sum_probs=30.0
Q ss_pred CCCCCCCCccccccCCCCceeeeccCceeeccC-Ccc
Q psy5768 551 CRHLNGNCDDICKLDETGQVVCSCFTGKVLMED-NRS 586 (652)
Q Consensus 551 C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~l~~d-~~C 586 (652)
|..+||+|+|+|++.|. +++|+||.||+|.+| ++|
T Consensus 1 C~~~NGgC~h~C~~~~g-~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 1 CSVNNGGCSHICVNTPG-SYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp CTTGGGGSSSEEEEETT-SEEEE-STTEEE-TTSSSE
T ss_pred CCCCCCCcCCCCccCCC-ceEeECCCCCEECcCCCCC
Confidence 67789999999999977 599999999999999 776
No 27
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=98.95 E-value=8e-07 Score=88.42 Aligned_cols=199 Identities=17% Similarity=0.168 Sum_probs=123.2
Q ss_pred cceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCc
Q psy5768 313 MKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQ 391 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~ 391 (652)
..++.|++|++.+++||-+.-..+.|+.++++|.-.+.+- .+.+.++||++- -++.+..++...+++.++.++.....
T Consensus 21 ~~e~SGLTy~pd~~tLfaV~d~~~~i~els~~G~vlr~i~l~g~~D~EgI~y~-g~~~~vl~~Er~~~L~~~~~~~~~~~ 99 (248)
T PF06977_consen 21 LDELSGLTYNPDTGTLFAVQDEPGEIYELSLDGKVLRRIPLDGFGDYEGITYL-GNGRYVLSEERDQRLYIFTIDDDTTS 99 (248)
T ss_dssp -S-EEEEEEETTTTEEEEEETTTTEEEEEETT--EEEEEE-SS-SSEEEEEE--STTEEEEEETTTTEEEEEEE----TT
T ss_pred cCCccccEEcCCCCeEEEEECCCCEEEEEcCCCCEEEEEeCCCCCCceeEEEE-CCCEEEEEEcCCCcEEEEEEeccccc
Confidence 3568899999999999999888899999999987666654 778899999994 23444556667888988888432211
Q ss_pred --cEEE--EEe--C--CCCCceEEEEeCCCCEEEEEecCCCCCceEEEee--cCCCceEEEEc-------CCCCCceEEE
Q psy5768 392 --RIVV--VRL--G--QHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFF--SGFGTESIITT-------DITMPNALAL 454 (652)
Q Consensus 392 --~~~~--~~~--~--~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~l--dG~~~~~l~~~-------~l~~P~glai 454 (652)
...+ +.+ . .....-|||.||.++.||.+....+ ..|+.... .+....+.... .+..|+||++
T Consensus 100 ~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~~P-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~ 178 (248)
T PF06977_consen 100 LDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAKERKP-KRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSY 178 (248)
T ss_dssp --EEEEEEEE---S---SS--EEEEEETTTTEEEEEEESSS-EEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEE
T ss_pred cchhhceEEecccccCCCcceEEEEEcCCCCEEEEEeCCCC-hhhEEEccccCccceeeccccccccccceeccccceEE
Confidence 2211 111 1 2234689999999999999864432 25676665 23232222211 2567999999
Q ss_pred ecCCCEEEEEeCCCCeEEEEecCCCceEEEecC--------CCCceeEEEEe-CCEEEEEcCCCCeEEE
Q psy5768 455 DHQAEKLFWGDARLDKIERCDYDGTNRIVLSKI--------SPLHPFDMAVY-GEFIFWTDWVIHAVLR 514 (652)
Q Consensus 455 D~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~--------~~~~p~glav~-~~~lYwtd~~~~~I~~ 514 (652)
|+.++.||.......+|-.++.+|.-...+.-. .+.+|-|||++ ++.||.+.- -+..++
T Consensus 179 ~p~t~~lliLS~es~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE-pNlfy~ 246 (248)
T PF06977_consen 179 DPRTGHLLILSDESRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE-PNLFYR 246 (248)
T ss_dssp ETTTTEEEEEETTTTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET-TTEEEE
T ss_pred cCCCCeEEEEECCCCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC-CceEEE
Confidence 999999999999999999999999865544221 25689999998 569998863 334444
No 28
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=98.94 E-value=3.5e-07 Score=96.11 Aligned_cols=222 Identities=18% Similarity=0.217 Sum_probs=141.6
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-------eccCceeeeEEEc---cCCEEEEEeCCC------
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-------ERQGSVEGLAYEY---VHNYLYWTCNND------ 377 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-------~~~~~~~glAvDw---~~~~LYwtd~~~------ 377 (652)
.+|.+|++.+. ++||+++. .++|+++..+|.....+. .+.....|||+++ .++.||++-...
T Consensus 2 ~~P~~~a~~pd-G~l~v~e~-~G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t~~~~~~~~ 79 (331)
T PF07995_consen 2 NNPRSMAFLPD-GRLLVAER-SGRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYTNADEDGGD 79 (331)
T ss_dssp SSEEEEEEETT-SCEEEEET-TTEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEEEE-TSSSS
T ss_pred CCceEEEEeCC-CcEEEEeC-CceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEEcccCCCCC
Confidence 46889999987 79999998 899999997776523222 2445789999998 368899876632
Q ss_pred --CeEEEEEcCCCCCc---cEEEEEe-----CCCCCceEEEEeCCCCEEEEEecC-----------CCCCceEEEeecCC
Q psy5768 378 --ATINKIDLDSPKAQ---RIVVVRL-----GQHDKPRGIDIDSCDSRIYWTNWN-----------SHLPSIQRAFFSGF 436 (652)
Q Consensus 378 --~~I~~~~~~~~~~~---~~~~~~~-----~~~~~P~~Iavdp~~g~Lywtd~~-----------~~~~~I~r~~ldG~ 436 (652)
.+|.+..++..... .++++.. .....-..|+++| .|+||++-.. ...++|.|.+.||+
T Consensus 80 ~~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~ilri~~dG~ 158 (331)
T PF07995_consen 80 NDNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKILRIDPDGS 158 (331)
T ss_dssp EEEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEEEEEETTSS
T ss_pred cceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCC-CCcEEEEeCCCCCcccccccccccceEEEecccCc
Confidence 46777777543111 2333322 1345667899999 6799998422 12469999999997
Q ss_pred C------------ceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEec--CCCc----------------------
Q psy5768 437 G------------TESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDY--DGTN---------------------- 480 (652)
Q Consensus 437 ~------------~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~l--dG~~---------------------- 480 (652)
- ...++...++.|.+|++|+.+++||.+|.+.+..+.++. .|.+
T Consensus 159 ~p~dnP~~~~~~~~~~i~A~GlRN~~~~~~d~~tg~l~~~d~G~~~~dein~i~~G~nYGWP~~~~~~~~~~~~~~~~~~ 238 (331)
T PF07995_consen 159 IPADNPFVGDDGADSEIYAYGLRNPFGLAFDPNTGRLWAADNGPDGWDEINRIEPGGNYGWPYCEGGPKYSGPPIGDAPS 238 (331)
T ss_dssp B-TTSTTTTSTTSTTTEEEE--SEEEEEEEETTTTEEEEEEE-SSSSEEEEEE-TT-B--TTTBSSSCSTTSS-ECTGSS
T ss_pred CCCCCccccCCCceEEEEEeCCCccccEEEECCCCcEEEEccCCCCCcEEEEeccCCcCCCCCCcCCCCCCCCccccccC
Confidence 1 235667799999999999999999999976654444432 2211
Q ss_pred ------eEEEecCCCCceeEEEEe--------CCEEEEEcCCCCeEEEEEccCCceE---EEEecccC-CcceeEEE
Q psy5768 481 ------RIVLSKISPLHPFDMAVY--------GEFIFWTDWVIHAVLRANKYTGEEV---YTLRKNIR-RPMGIVAI 539 (652)
Q Consensus 481 ------~~~l~~~~~~~p~glav~--------~~~lYwtd~~~~~I~~~~k~~g~~~---~~~~~~~~-~p~~i~~~ 539 (652)
+..... ....|-|++++ .+.++++++..+.|+++...++..+ ..+..... +|.+|.+-
T Consensus 239 ~~~~~~P~~~~~-~~~ap~G~~~y~g~~fp~~~g~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~~~v~~~ 314 (331)
T PF07995_consen 239 CPGFVPPVFAYP-PHSAPTGIIFYRGSAFPEYRGDLFVADYGGGRIWRLDLDEDGSVTEEEEFLGGFGGRPRDVAQG 314 (331)
T ss_dssp -TTS---SEEET-TT--EEEEEEE-SSSSGGGTTEEEEEETTTTEEEEEEEETTEEEEEEEEECTTSSS-EEEEEEE
T ss_pred CCCcCccceeec-CccccCceEEECCccCccccCcEEEecCCCCEEEEEeeecCCCccceEEccccCCCCceEEEEc
Confidence 011111 12468899987 5679999999999999987655332 22233333 55565543
No 29
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=98.88 E-value=9.4e-06 Score=84.79 Aligned_cols=280 Identities=11% Similarity=0.134 Sum_probs=163.5
Q ss_pred cCCcEEEEeCC----CCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecC--------CCCeEEEEeCCCCC
Q psy5768 98 IAQNIYWSDPK----ENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESG--------KIPLIARAGLDGKK 165 (652)
Q Consensus 98 ~~~~lY~~d~~----~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~--------~~~~I~~~~ldg~~ 165 (652)
-..++|+.|.. .++|.++|.+.....-.+..+ ..|+++ +.|.+..||.+... ....|...+..--.
T Consensus 11 ~~~~v~V~d~~~~~~~~~v~ViD~~~~~v~g~i~~G-~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~ 88 (352)
T TIGR02658 11 DARRVYVLDPGHFAATTQVYTIDGEAGRVLGMTDGG-FLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHL 88 (352)
T ss_pred CCCEEEEECCcccccCceEEEEECCCCEEEEEEEcc-CCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCc
Confidence 46789999986 489999998876555555544 689996 99999999999861 12356665543222
Q ss_pred cE-EEEee------cccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCC
Q psy5768 166 QT-ILAQE------IIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTN 238 (652)
Q Consensus 166 ~~-~~~~~------~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n 238 (652)
.. .+... ....|..+++.+.+++||+.+.+..+.-.++- ..+.. + ...|.+
T Consensus 89 ~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD-~~~~k-v----------v~ei~v---------- 146 (352)
T TIGR02658 89 PIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVD-LEGKA-F----------VRMMDV---------- 146 (352)
T ss_pred EEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEE-CCCCc-E----------EEEEeC----------
Confidence 11 11110 02356699999999999998877443333221 11111 1 111111
Q ss_pred CCCCCCCCCcccceecCCCceEEEeCCcc-----ccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeecccc
Q psy5768 239 PCGVNNGGCAELCLYNGVSAVCACAHGVV-----AQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMM 313 (652)
Q Consensus 239 ~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l-----~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~ 313 (652)
++|.++=.....++.=.|..|-+ ..+|+ .. .....+-+..+ . +..
T Consensus 147 ------p~~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~--------~~------~~~~~vf~~~~-~---~v~------ 196 (352)
T TIGR02658 147 ------PDCYHIFPTANDTFFMHCRDGSLAKVGYGTKGN--------PK------IKPTEVFHPED-E---YLI------ 196 (352)
T ss_pred ------CCCcEEEEecCCccEEEeecCceEEEEecCCCc--------eE------EeeeeeecCCc-c---ccc------
Confidence 45666555445567778887732 12222 00 00000000100 0 011
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEE-----e-ec----cCceee---eEEEccCCEEEEEe-CCC--
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVL-----L-ER----QGSVEG---LAYEYVHNYLYWTC-NND-- 377 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i-----~-~~----~~~~~g---lAvDw~~~~LYwtd-~~~-- 377 (652)
.++ .|....++.+|.... +.|+.+++.+...+.. + .. --.|.| +|+...++.||.+. ...
T Consensus 197 ~rP---~~~~~dg~~~~vs~e-G~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~ 272 (352)
T TIGR02658 197 NHP---AYSNKSGRLVWPTYT-GKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKW 272 (352)
T ss_pred cCC---ceEcCCCcEEEEecC-CeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccc
Confidence 122 344445777777766 8999998766433221 1 11 114555 99999999999953 222
Q ss_pred ------CeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCC-EEEEEecCCCCCceEEEeecCCCceEEEE
Q psy5768 378 ------ATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDS-RIYWTNWNSHLPSIQRAFFSGFGTESIIT 443 (652)
Q Consensus 378 ------~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g-~Lywtd~~~~~~~I~r~~ldG~~~~~l~~ 443 (652)
+.|+++++.. ++.+-.......|.+|++.|... +||.+++..+ .|... |....+.+-+
T Consensus 273 thk~~~~~V~ViD~~t----~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~s~--~VsVi--D~~t~k~i~~ 337 (352)
T TIGR02658 273 THKTASRFLFVVDAKT----GKRLRKIELGHEIDSINVSQDAKPLLYALSTGDK--TLYIF--DAETGKELSS 337 (352)
T ss_pred cccCCCCEEEEEECCC----CeEEEEEeCCCceeeEEECCCCCeEEEEeCCCCC--cEEEE--ECcCCeEEee
Confidence 5899999755 34444444567999999999999 9999987644 44433 3444444443
No 30
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.86 E-value=1.6e-09 Score=73.56 Aligned_cols=36 Identities=39% Similarity=0.910 Sum_probs=33.3
Q ss_pred cccCCCceeeccCeecCCccCCCCCCCCCCCCCCCC
Q psy5768 591 TVCSEHDFKCSDGMCIPFNQTCDRVYNCHDKSDEGI 626 (652)
Q Consensus 591 ~~C~~~~f~C~~g~Ci~~~~~Cd~~~dC~d~sde~~ 626 (652)
+.|.+.+|+|.++.||+..|+|||..||.|||||..
T Consensus 1 ~~C~~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~ 36 (37)
T PF00057_consen 1 PTCPPGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQN 36 (37)
T ss_dssp SSSSTTEEEETTSSEEEGGGTTSSSCSSSSSTTTSS
T ss_pred CcCcCCeeEcCCCCEEChHHcCCCCCCCCCCccccc
Confidence 358889999999999999999999999999999964
No 31
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=98.85 E-value=3.7e-05 Score=77.74 Aligned_cols=294 Identities=15% Similarity=0.147 Sum_probs=177.7
Q ss_pred CeEEEecCC---CCeEEEEecCCCeeE-EEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCC-ccE
Q psy5768 1 MFIAVSSPT---QSKIVVCNLEGEYQT-TILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGT-KRE 75 (652)
Q Consensus 1 ~~i~v~~~~---~~~I~~~~~~g~~~~-~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs-~~~ 75 (652)
+.++|.+++ +..|++++++.+.-. .+... ...+.+|.-|+++++.++||...-....+.|-.+..|.. .+-
T Consensus 3 ~~~YiGtyT~~~s~gI~v~~ld~~~g~l~~~~~----v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~L 78 (346)
T COG2706 3 QTVYIGTYTKRESQGIYVFNLDTKTGELSLLQL----VAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRL 78 (346)
T ss_pred eEEEEeeecccCCCceEEEEEeCcccccchhhh----ccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeE
Confidence 468999999 999999998843221 11110 125689999999999999999882222677766666643 444
Q ss_pred EEEeCC--CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEc--CCCcEEE---EEeCC--------CCCceeEEEcC
Q psy5768 76 TVVSQK--KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARL--TGQYRYV---LISGG--------VDQPSALAVDP 140 (652)
Q Consensus 76 ~v~~~~--~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~--dg~~~~~---l~~~~--------~~~P~~iavd~ 140 (652)
+++... .-..|. -+++|..++-||.+....+.|.|.-+ +|....+ +...+ -.+++..-++|
T Consensus 79 t~ln~~~~~g~~p~----yvsvd~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP 154 (346)
T COG2706 79 TFLNRQTLPGSPPC----YVSVDEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTP 154 (346)
T ss_pred EEeeccccCCCCCe----EEEECCCCCEEEEEEccCceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCC
Confidence 444332 012245 99999999999999999999988865 5654433 21111 12367888999
Q ss_pred CCCeEEEEecCCCCeEEEEeCCCCCcEE----EEeecccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEee
Q psy5768 141 ESGYLFWSESGKIPLIARAGLDGKKQTI----LAQEIIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMK 216 (652)
Q Consensus 141 ~~g~lywtd~~~~~~I~~~~ldg~~~~~----~~~~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~ 216 (652)
.+.+|+..|-|.. +|...+++...... .+.. -.-|+-|++.+..+..|.+.--.+.+
T Consensus 155 ~~~~l~v~DLG~D-ri~~y~~~dg~L~~~~~~~v~~-G~GPRHi~FHpn~k~aY~v~EL~stV----------------- 215 (346)
T COG2706 155 DGRYLVVPDLGTD-RIFLYDLDDGKLTPADPAEVKP-GAGPRHIVFHPNGKYAYLVNELNSTV----------------- 215 (346)
T ss_pred CCCEEEEeecCCc-eEEEEEcccCccccccccccCC-CCCcceEEEcCCCcEEEEEeccCCEE-----------------
Confidence 8889999998854 66666665211111 1111 12355555555555555532210111
Q ss_pred cCCCCCcceeeeeeeccCCCCCCCCCCCCCCcccceecCCCceEEEeCCccccCCCcccccceEEEEeeecceeEEecCC
Q psy5768 217 PYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTD 296 (652)
Q Consensus 217 ~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~ 296 (652)
.||.- | +. .| .-..+-.++. -
T Consensus 216 ----------~v~~y------------~----------~~-------~g-------------------~~~~lQ~i~t-l 236 (346)
T COG2706 216 ----------DVLEY------------N----------PA-------VG-------------------KFEELQTIDT-L 236 (346)
T ss_pred ----------EEEEE------------c----------CC-------Cc-------------------eEEEeeeecc-C
Confidence 00000 0 00 00 0011112221 1
Q ss_pred CCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEecc--CCcceEEe---eccCceeeeEEEccCCEEE
Q psy5768 297 KSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFN--GSNHRVLL---ERQGSVEGLAYEYVHNYLY 371 (652)
Q Consensus 297 ~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~--g~~~~~i~---~~~~~~~glAvDw~~~~LY 371 (652)
++++. +-....+|...+..+.||.+|...+.|....++ |+..+.+. +....|.+..++.-++.|+
T Consensus 237 P~dF~----------g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~teg~~PR~F~i~~~g~~Li 306 (346)
T COG2706 237 PEDFT----------GTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLELVGITPTEGQFPRDFNINPSGRFLI 306 (346)
T ss_pred ccccC----------CCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEeccCCcCCccceeCCCCCEEE
Confidence 22211 113456777788888999999888877665544 33322222 2333599999999999999
Q ss_pred EEeCCCCeEEEEEcCCCCC
Q psy5768 372 WTCNNDATINKIDLDSPKA 390 (652)
Q Consensus 372 wtd~~~~~I~~~~~~~~~~ 390 (652)
.+....+.|.+...+...+
T Consensus 307 aa~q~sd~i~vf~~d~~TG 325 (346)
T COG2706 307 AANQKSDNITVFERDKETG 325 (346)
T ss_pred EEccCCCcEEEEEEcCCCc
Confidence 9999999999988876544
No 32
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.81 E-value=2.4e-09 Score=71.91 Aligned_cols=34 Identities=41% Similarity=0.945 Sum_probs=31.7
Q ss_pred cCCCceeeccCeecCCccCCCCCCCCCCCCCCCC
Q psy5768 593 CSEHDFKCSDGMCIPFNQTCDRVYNCHDKSDEGI 626 (652)
Q Consensus 593 C~~~~f~C~~g~Ci~~~~~Cd~~~dC~d~sde~~ 626 (652)
|.+.+|+|.++.||+..++|||++||.|||||..
T Consensus 1 C~~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~ 34 (35)
T cd00112 1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEEN 34 (35)
T ss_pred CCCCeEEcCCCCeeCHHHcCCCccCCCCCccccc
Confidence 5668999999999999999999999999999974
No 33
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.75 E-value=9.6e-05 Score=78.76 Aligned_cols=341 Identities=12% Similarity=0.103 Sum_probs=178.3
Q ss_pred CcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeecccCcee
Q psy5768 100 QNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEIIMPIKD 179 (652)
Q Consensus 100 ~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~~~~p~g 179 (652)
+.+|+++...+.+.+.|.+.....--+..+...+..++..|...++|.++. .+.|...++.-.....-+..+ ..|.|
T Consensus 6 ~l~~V~~~~~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~r--dg~vsviD~~~~~~v~~i~~G-~~~~~ 82 (369)
T PF02239_consen 6 NLFYVVERGSGSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANR--DGTVSVIDLATGKVVATIKVG-GNPRG 82 (369)
T ss_dssp GEEEEEEGGGTEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEET--TSEEEEEETTSSSEEEEEE-S-SEEEE
T ss_pred cEEEEEecCCCEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcC--CCeEEEEECCcccEEEEEecC-CCcce
Confidence 445678989999999998765544444443233455777887779999974 467888887655433333333 57999
Q ss_pred EEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCCcccceecCCCce
Q psy5768 180 ITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGCAELCLYNGVSAV 259 (652)
Q Consensus 180 l~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~ 259 (652)
+++...++.||..++..+....+-....
T Consensus 83 i~~s~DG~~~~v~n~~~~~v~v~D~~tl---------------------------------------------------- 110 (369)
T PF02239_consen 83 IAVSPDGKYVYVANYEPGTVSVIDAETL---------------------------------------------------- 110 (369)
T ss_dssp EEE--TTTEEEEEEEETTEEEEEETTT-----------------------------------------------------
T ss_pred EEEcCCCCEEEEEecCCCceeEeccccc----------------------------------------------------
Confidence 9999999999988776443322200000
Q ss_pred EEEeCCccccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEE
Q psy5768 260 CACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTIN 339 (652)
Q Consensus 260 C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~ 339 (652)
..+..|+. ...... . ....+.|+...+.....+++-.+.+.|.
T Consensus 111 ---------------------------e~v~~I~~-~~~~~~----~-----~~~Rv~aIv~s~~~~~fVv~lkd~~~I~ 153 (369)
T PF02239_consen 111 ---------------------------EPVKTIPT-GGMPVD----G-----PESRVAAIVASPGRPEFVVNLKDTGEIW 153 (369)
T ss_dssp ----------------------------EEEEEE---EE-TT----T-----S---EEEEEE-SSSSEEEEEETTTTEEE
T ss_pred ---------------------------cceeeccc-cccccc----c-----cCCCceeEEecCCCCEEEEEEccCCeEE
Confidence 00111111 000000 0 0011223322222222233334456666
Q ss_pred EEeccCCcce--EEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCc----eEEEEeCCC
Q psy5768 340 SVFFNGSNHR--VLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKP----RGIDIDSCD 413 (652)
Q Consensus 340 ~~~~~g~~~~--~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P----~~Iavdp~~ 413 (652)
.++....... ..+..-..|++..+|+.++.+|.+....+.|-++++.. .+.+........| .+-..||..
T Consensus 154 vVdy~d~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~sn~i~viD~~~----~k~v~~i~~g~~p~~~~~~~~php~~ 229 (369)
T PF02239_consen 154 VVDYSDPKNLKVTTIKVGRFPHDGGFDPDGRYFLVAANGSNKIAVIDTKT----GKLVALIDTGKKPHPGPGANFPHPGF 229 (369)
T ss_dssp EEETTTSSCEEEEEEE--TTEEEEEE-TTSSEEEEEEGGGTEEEEEETTT----TEEEEEEE-SSSBEETTEEEEEETTT
T ss_pred EEEeccccccceeeecccccccccccCcccceeeecccccceeEEEeecc----ceEEEEeeccccccccccccccCCCc
Confidence 6654332211 12233457999999999999999888888999999754 2333333223333 334488977
Q ss_pred CEEEEEecCCCCCceEEEeec------CCCceEEEEcC-CCCCceEEEecCCCEEEEE---eCCCCeEEEEecCCCceEE
Q psy5768 414 SRIYWTNWNSHLPSIQRAFFS------GFGTESIITTD-ITMPNALALDHQAEKLFWG---DARLDKIERCDYDGTNRIV 483 (652)
Q Consensus 414 g~Lywtd~~~~~~~I~r~~ld------G~~~~~l~~~~-l~~P~glaiD~~~~~LYw~---D~~~~~I~~~~ldG~~~~~ 483 (652)
|.+ |+..+.....|--...+ -...+++-.-. ...|..+...+.++.||.. ....+.|..+|..-.....
T Consensus 230 g~v-w~~~~~~~~~~~~ig~~~v~v~d~~~wkvv~~I~~~G~glFi~thP~s~~vwvd~~~~~~~~~v~viD~~tl~~~~ 308 (369)
T PF02239_consen 230 GPV-WATSGLGYFAIPLIGTDPVSVHDDYAWKVVKTIPTQGGGLFIKTHPDSRYVWVDTFLNPDADTVQVIDKKTLKVVK 308 (369)
T ss_dssp EEE-EEEEBSSSSEEEEEE--TTT-STTTBTSEEEEEE-SSSS--EE--TT-SEEEEE-TT-SSHT-EEEEECCGTEEEE
T ss_pred ceE-EeeccccceecccccCCccccchhhcCeEEEEEECCCCcceeecCCCCccEEeeccCCCCCceEEEEECcCcceeE
Confidence 765 54443322122112111 12222333222 2455677777777777665 2556789998887653222
Q ss_pred EecC-CCCceeEEEEe--CCEEEEEcCCCC-eEEEEEccCCceEEEEecccCCcceeEEE
Q psy5768 484 LSKI-SPLHPFDMAVY--GEFIFWTDWVIH-AVLRANKYTGEEVYTLRKNIRRPMGIVAI 539 (652)
Q Consensus 484 l~~~-~~~~p~glav~--~~~lYwtd~~~~-~I~~~~k~~g~~~~~~~~~~~~p~~i~~~ 539 (652)
-+.. ....+..+.+. +.++|++.|..+ +|...|..|.+..+.+. ...|.|+..+
T Consensus 309 ~i~~~~~~~~~h~ef~~dG~~v~vS~~~~~~~i~v~D~~Tl~~~~~i~--~~tP~G~f~~ 366 (369)
T PF02239_consen 309 TITPGPGKRVVHMEFNPDGKEVWVSVWDGNGAIVVYDAKTLKEKKRIP--VPTPTGKFNV 366 (369)
T ss_dssp -HHHHHT--EEEEEE-TTSSEEEEEEE--TTEEEEEETTTTEEEEEEE----SEEEEEEH
T ss_pred EEeccCCCcEeccEECCCCCEEEEEEecCCCEEEEEECCCcEEEEEEE--eeCCCeEecc
Confidence 1211 11236677775 569999999999 99999999998888877 6678887543
No 34
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=98.73 E-value=1.7e-06 Score=92.39 Aligned_cols=198 Identities=18% Similarity=0.196 Sum_probs=130.0
Q ss_pred ccceEEEEEEEcCCCeEEEeecc-----------c-ccEEEEec-cCCcc----eEEeeccCceeeeEEEccCCEEEEEe
Q psy5768 312 MMKNIIELSYDYKRKTLFYSDIQ-----------K-GTINSVFF-NGSNH----RVLLERQGSVEGLAYEYVHNYLYWTC 374 (652)
Q Consensus 312 ~~~~~~~v~~D~~~~~lywsd~~-----------~-~~I~~~~~-~g~~~----~~i~~~~~~~~glAvDw~~~~LYwtd 374 (652)
.+.++++|++|.. ++||+++.. . .+|.++.. +|... +++.+++..|.||++...+ ||+++
T Consensus 12 ~~~~P~~ia~d~~-G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~~G--lyV~~ 88 (367)
T TIGR02604 12 LLRNPIAVCFDER-GRLWVAEGITYSRPAGRQGPLGDRILILEDADGDGKYDKSNVFAEELSMVTGLAVAVGG--VYVAT 88 (367)
T ss_pred ccCCCceeeECCC-CCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCCCCCcceeEEeecCCCCccceeEecCC--EEEeC
Confidence 3577899999976 679999742 2 37888763 33222 3444888999999998644 99986
Q ss_pred CCCCeEEEEE-cCCCC--C-ccEEEEEe-CC-----CCCceEEEEeCCCCEEEEEecCC-----------------CCCc
Q psy5768 375 NNDATINKID-LDSPK--A-QRIVVVRL-GQ-----HDKPRGIDIDSCDSRIYWTNWNS-----------------HLPS 427 (652)
Q Consensus 375 ~~~~~I~~~~-~~~~~--~-~~~~~~~~-~~-----~~~P~~Iavdp~~g~Lywtd~~~-----------------~~~~ 427 (652)
. ..|.++. .++.. . .++.++.. .. ...+.+++++| .|+||+++... ..+.
T Consensus 89 ~--~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gp-DG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~ 165 (367)
T TIGR02604 89 P--PDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGP-DGWLYFNHGNTLASKVTRPGTSDESRQGLGGG 165 (367)
T ss_pred C--CeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECC-CCCEEEecccCCCceeccCCCccCcccccCce
Confidence 4 4677773 33311 1 13344431 11 23488999999 68999987631 1247
Q ss_pred eEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecC------------CC---------ceEE---
Q psy5768 428 IQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYD------------GT---------NRIV--- 483 (652)
Q Consensus 428 I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ld------------G~---------~~~~--- 483 (652)
|+|.+.||+..+++ ...+..|+||++|. .+.||.+|.......++..- |. ..+.
T Consensus 166 i~r~~pdg~~~e~~-a~G~rnp~Gl~~d~-~G~l~~tdn~~~~~~~i~~~~~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 243 (367)
T TIGR02604 166 LFRYNPDGGKLRVV-AHGFQNPYGHSVDS-WGDVFFCDNDDPPLCRVTPVAEGGRNGYQSFNGRRYDHADRGADHEVPTG 243 (367)
T ss_pred EEEEecCCCeEEEE-ecCcCCCccceECC-CCCEEEEccCCCceeEEcccccccccCCCCCCCccccccccccccccccc
Confidence 99999999887654 56789999999997 58889998754443433311 10 0000
Q ss_pred ------------Eec-CCCCceeEEEEeC---------CEEEEEcCCCCeEEEEEc
Q psy5768 484 ------------LSK-ISPLHPFDMAVYG---------EFIFWTDWVIHAVLRANK 517 (652)
Q Consensus 484 ------------l~~-~~~~~p~glav~~---------~~lYwtd~~~~~I~~~~k 517 (652)
... .....|-|++++. +.+++++|..+.|+++..
T Consensus 244 ~~~~~~~~~~~~~~~~g~~~ap~G~~~y~g~~fp~~~~g~~fv~~~~~~~v~~~~l 299 (367)
T TIGR02604 244 EWRQDDRGVETVGDVAGGGTAPCGIAFYRGDALPEEYRGLLLVGDAHGQLIVRYSL 299 (367)
T ss_pred ccccccccccccccccCCCccccEEEEeCCCcCCHHHCCCEEeeeccCCEEEEEEe
Confidence 000 0113688999872 578999999999988765
No 35
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.71 E-value=5.7e-06 Score=85.20 Aligned_cols=143 Identities=22% Similarity=0.320 Sum_probs=101.6
Q ss_pred EEcCCCeEEEeecc-----------cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcC---
Q psy5768 321 YDYKRKTLFYSDIQ-----------KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLD--- 386 (652)
Q Consensus 321 ~D~~~~~lywsd~~-----------~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~--- 386 (652)
.|+. +++|+++.. .+++||++..|...+.+...+..|.|||+++.++.||++|+..++|++++++
T Consensus 118 v~pd-G~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d~~~ 196 (307)
T COG3386 118 VDPD-GRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPAT 196 (307)
T ss_pred EcCC-CCEEEeCCCccccCccccCCcceEEEEcCCCCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEecCccc
Confidence 3443 678887766 2579999876554444435588999999999999999999999999999987
Q ss_pred CCCCccEEEEEe-CCCCCceEEEEeCCCCEEE-EEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEe-cCCCEEEE
Q psy5768 387 SPKAQRIVVVRL-GQHDKPRGIDIDSCDSRIY-WTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALD-HQAEKLFW 463 (652)
Q Consensus 387 ~~~~~~~~~~~~-~~~~~P~~Iavdp~~g~Ly-wtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD-~~~~~LYw 463 (652)
+...+++..+.. ..-..|=++++|. .|.|| .+-|+. .+|.+.+.+|.....+. ....+|..+++- ...++||.
T Consensus 197 g~~~~~~~~~~~~~~~G~PDG~~vDa-dG~lw~~a~~~g--~~v~~~~pdG~l~~~i~-lP~~~~t~~~FgG~~~~~L~i 272 (307)
T COG3386 197 GPIGGRRGFVDFDEEPGLPDGMAVDA-DGNLWVAAVWGG--GRVVRFNPDGKLLGEIK-LPVKRPTNPAFGGPDLNTLYI 272 (307)
T ss_pred CccCCcceEEEccCCCCCCCceEEeC-CCCEEEecccCC--ceEEEECCCCcEEEEEE-CCCCCCccceEeCCCcCEEEE
Confidence 333223323322 2448999999998 68887 555543 38999999976554443 344678877775 33688998
Q ss_pred EeCCC
Q psy5768 464 GDARL 468 (652)
Q Consensus 464 ~D~~~ 468 (652)
+-+..
T Consensus 273 Ts~~~ 277 (307)
T COG3386 273 TSARS 277 (307)
T ss_pred EecCC
Confidence 86654
No 36
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.63 E-value=2e-08 Score=66.63 Aligned_cols=32 Identities=47% Similarity=1.113 Sum_probs=30.0
Q ss_pred cCCCceeeccCeecCCccCCCCCCCCCCCCCC
Q psy5768 593 CSEHDFKCSDGMCIPFNQTCDRVYNCHDKSDE 624 (652)
Q Consensus 593 C~~~~f~C~~g~Ci~~~~~Cd~~~dC~d~sde 624 (652)
|.+.+|+|.++.||+..++|||.+||.|+|||
T Consensus 2 C~~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CPPGEFQCDNGRCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CCCCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence 56679999999999999999999999999998
No 37
>KOG4499|consensus
Probab=98.62 E-value=6.3e-06 Score=78.35 Aligned_cols=198 Identities=19% Similarity=0.297 Sum_probs=121.0
Q ss_pred EEEEEEcCCCeEEEeecccccEEE----------EeccCC-cceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEc
Q psy5768 317 IELSYDYKRKTLFYSDIQKGTINS----------VFFNGS-NHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDL 385 (652)
Q Consensus 317 ~~v~~D~~~~~lywsd~~~~~I~~----------~~~~g~-~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~ 385 (652)
.+..+|..++.|||+|...+.|.| +.+++. ....++...+.|+..|+-- +.+ ...+.+
T Consensus 18 Egp~w~~~~~sLl~VDi~ag~v~r~D~~qn~v~ra~ie~p~~ag~ilpv~~~~q~~~v~~-G~k----------f~i~nw 86 (310)
T KOG4499|consen 18 EGPHWDVERQSLLYVDIEAGEVHRYDIEQNKVYRAKIEGPPSAGFILPVEGGPQEFAVGC-GSK----------FVIVNW 86 (310)
T ss_pred CCCceEEecceEEEEEeccCceehhhhhhhheEEEEEecCcceeEEEEecCCCceEEEee-cce----------EEEEEc
Confidence 456688889999999987665554 445443 2223334444555555432 111 112233
Q ss_pred CCCCCccEEEEE---eC---CCCCceEEEEeCCCCEEEEEecCC------CCCceEEEeecCCCceEEEEcCCCCCceEE
Q psy5768 386 DSPKAQRIVVVR---LG---QHDKPRGIDIDSCDSRIYWTNWNS------HLPSIQRAFFSGFGTESIITTDITMPNALA 453 (652)
Q Consensus 386 ~~~~~~~~~~~~---~~---~~~~P~~Iavdp~~g~Lywtd~~~------~~~~I~r~~ldG~~~~~l~~~~l~~P~gla 453 (652)
++.........+ +. .-.+-..=-|||..+| |.--... ....-.+..+-|-..+.+. ..+.-||||+
T Consensus 87 d~~~~~a~v~~t~~ev~~d~kknR~NDgkvdP~Gry-y~GtMad~~~~le~~~g~Ly~~~~~h~v~~i~-~~v~IsNgl~ 164 (310)
T KOG4499|consen 87 DGVSESAKVYRTLFEVQPDRKKNRLNDGKVDPDGRY-YGGTMADFGDDLEPIGGELYSWLAGHQVELIW-NCVGISNGLA 164 (310)
T ss_pred ccccceeeeeeeccccCchHHhcccccCccCCCCce-eeeeeccccccccccccEEEEeccCCCceeee-hhccCCcccc
Confidence 332111111111 00 0122334457785444 6532211 1112334444454555444 4677899999
Q ss_pred EecCCCEEEEEeCCCCeEEEEecCC-----CceEEEecCC------CCceeEEEEe-CCEEEEEcCCCCeEEEEEccCCc
Q psy5768 454 LDHQAEKLFWGDARLDKIERCDYDG-----TNRIVLSKIS------PLHPFDMAVY-GEFIFWTDWVIHAVLRANKYTGE 521 (652)
Q Consensus 454 iD~~~~~LYw~D~~~~~I~~~~ldG-----~~~~~l~~~~------~~~p~glav~-~~~lYwtd~~~~~I~~~~k~~g~ 521 (652)
-|.+.++.|++|+-...|...++|- ++|++++.-. ...|-|++++ ++.||++-|..++|+++|..||+
T Consensus 165 Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~~V~~~dp~tGK 244 (310)
T KOG4499|consen 165 WDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGGTVQKVDPTTGK 244 (310)
T ss_pred ccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCcEEEEECCCCCc
Confidence 9999999999999999997777652 5787776532 3468999998 68999999999999999999998
Q ss_pred eEEEEe
Q psy5768 522 EVYTLR 527 (652)
Q Consensus 522 ~~~~~~ 527 (652)
-...+.
T Consensus 245 ~L~eik 250 (310)
T KOG4499|consen 245 ILLEIK 250 (310)
T ss_pred EEEEEE
Confidence 766655
No 38
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=98.62 E-value=0.00033 Score=73.38 Aligned_cols=276 Identities=10% Similarity=0.090 Sum_probs=158.3
Q ss_pred CCCEEEEEeccCC----cceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeC---------CCCEEEE
Q psy5768 48 VKGKMFWSNVTKQ----VVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDP---------KENVIEV 114 (652)
Q Consensus 48 ~~~~lyw~d~~~~----~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~---------~~~~I~v 114 (652)
...++|++| .. .++|+.++.+.....--+..| ..|. ++ +...++.||.+.+ ..+.|.+
T Consensus 11 ~~~~v~V~d--~~~~~~~~~v~ViD~~~~~v~g~i~~G--~~P~----~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v 81 (352)
T TIGR02658 11 DARRVYVLD--PGHFAATTQVYTIDGEAGRVLGMTDGG--FLPN----PV-VASDGSFFAHASTVYSRIARGKRTDYVEV 81 (352)
T ss_pred CCCEEEEEC--CcccccCceEEEEECCCCEEEEEEEcc--CCCc----ee-ECCCCCEEEEEeccccccccCCCCCEEEE
Confidence 457899999 54 278888887765433345555 4577 86 9888999999999 8899999
Q ss_pred EEcCCC-cEEEEEeC------CCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeecccCceeEEEeccCC
Q psy5768 115 ARLTGQ-YRYVLISG------GVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEIIMPIKDITLDLKFF 187 (652)
Q Consensus 115 ~~~dg~-~~~~l~~~------~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~~~~p~gl~lD~~~~ 187 (652)
+|..-. ....+.-. -...|..+++.|...+||++++...+.+-..++.......-+.- | ...
T Consensus 82 ~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~v----p-------~~~ 150 (352)
T TIGR02658 82 IDPQTHLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDV----P-------DCY 150 (352)
T ss_pred EECccCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEEEEeC----C-------CCc
Confidence 998643 33333221 13456699999988899999876566788887765443322221 1 122
Q ss_pred EEEEE--------eCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccC-CCCCCCCCCCCCCcccceecCCCc
Q psy5768 188 SAFYR--------NLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQ-TGTNPCGVNNGGCAELCLYNGVSA 258 (652)
Q Consensus 188 ~ly~~--------d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q-~~~n~C~~~ng~Cs~lC~~~~~~~ 258 (652)
.+|.. -.||...+.-+-..++ . ......+|+.... ...+| .
T Consensus 151 ~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~---~---------~~~~~~vf~~~~~~v~~rP-~----------------- 200 (352)
T TIGR02658 151 HIFPTANDTFFMHCRDGSLAKVGYGTKGN---P---------KIKPTEVFHPEDEYLINHP-A----------------- 200 (352)
T ss_pred EEEEecCCccEEEeecCceEEEEecCCCc---e---------EEeeeeeecCCccccccCC-c-----------------
Confidence 23322 1232222111111111 0 1122334443211 01111 0
Q ss_pred eEEEeCCccccCCCcccccceEEEEeeecceeEEecCCCC-CCCCCceeeeecc---cc--ceEEEEEEEcCCCeEEEe-
Q psy5768 259 VCACAHGVVAQDGKSCSEYDAFIMYSRVNRIDSIHMTDKS-DLNSPFESIRNST---MM--KNIIELSYDYKRKTLFYS- 331 (652)
Q Consensus 259 ~C~C~~G~l~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~-~~~~p~~~~~~~~---~~--~~~~~v~~D~~~~~lyws- 331 (652)
|...|| ..++++....+..+++.... ....++..+.... +. .-...++++...+++|+.
T Consensus 201 -------~~~~dg-------~~~~vs~eG~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~ 266 (352)
T TIGR02658 201 -------YSNKSG-------RLVWPTYTGKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLA 266 (352)
T ss_pred -------eEcCCC-------cEEEEecCCeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEe
Confidence 001122 34555666777777762211 1111222221100 00 112238999999999994
Q ss_pred ecc--------cccEEEEeccCCcceEEeeccCceeeeEEEccCC-EEEEEeCCCCeEEEEEcCC
Q psy5768 332 DIQ--------KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHN-YLYWTCNNDATINKIDLDS 387 (652)
Q Consensus 332 d~~--------~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~-~LYwtd~~~~~I~~~~~~~ 387 (652)
... .+.|..++.....+..-+.--..|.+|++...++ .||.++...+.|.++++..
T Consensus 267 ~~~~~~thk~~~~~V~ViD~~t~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t 331 (352)
T TIGR02658 267 DQRAKWTHKTASRFLFVVDAKTGKRLRKIELGHEIDSINVSQDAKPLLYALSTGDKTLYIFDAET 331 (352)
T ss_pred cCCccccccCCCCEEEEEECCCCeEEEEEeCCCceeeEEECCCCCeEEEEeCCCCCcEEEEECcC
Confidence 322 2578888876543333333345899999999999 9999999999999999765
No 39
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=98.52 E-value=2.6e-05 Score=77.65 Aligned_cols=176 Identities=16% Similarity=0.131 Sum_probs=104.1
Q ss_pred EEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCC--Cc--cEE--E
Q psy5768 4 AVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDG--TK--RET--V 77 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dg--s~--~~~--v 77 (652)
+...-....|+..+++|+..+++.-. .+..+.+|+|- .++++..++ ...++++.+..+. .. +.. -
T Consensus 37 faV~d~~~~i~els~~G~vlr~i~l~------g~~D~EgI~y~-g~~~~vl~~--Er~~~L~~~~~~~~~~~~~~~~~~~ 107 (248)
T PF06977_consen 37 FAVQDEPGEIYELSLDGKVLRRIPLD------GFGDYEGITYL-GNGRYVLSE--ERDQRLYIFTIDDDTTSLDRADVQK 107 (248)
T ss_dssp EEEETTTTEEEEEETT--EEEEEE-S------S-SSEEEEEE--STTEEEEEE--TTTTEEEEEEE----TT--EEEEEE
T ss_pred EEEECCCCEEEEEcCCCCEEEEEeCC------CCCCceeEEEE-CCCEEEEEE--cCCCcEEEEEEeccccccchhhceE
Confidence 33434578899999999998887642 36789999995 667777788 6678888877743 21 111 1
Q ss_pred EeCC----CcCCccCCCCcEEEEccCCcEEEEeCCCC-EEEEEEc--CCCcEEEEEe-------CCCCCceeEEEcCCCC
Q psy5768 78 VSQK----KYPAVTACNLHIAVDWIAQNIYWSDPKEN-VIEVARL--TGQYRYVLIS-------GGVDQPSALAVDPESG 143 (652)
Q Consensus 78 ~~~~----~~~~p~~~~~~lavDw~~~~lY~~d~~~~-~I~v~~~--dg~~~~~l~~-------~~~~~P~~iavd~~~g 143 (652)
++.+ .....+ |||+|..+++||.+..... .|...+. .+....+... ..+..|.++++||.+|
T Consensus 108 ~~l~~~~~~N~G~E----Gla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~ 183 (248)
T PF06977_consen 108 ISLGFPNKGNKGFE----GLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTG 183 (248)
T ss_dssp EE---S---SS--E----EEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTT
T ss_pred EecccccCCCcceE----EEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCC
Confidence 1211 133478 9999999999999864332 4555554 2222222221 1456899999999999
Q ss_pred eEEEEecCCCCeEEEEeCCCCCcEEEE--e------ecccCceeEEEeccCCEEEEEeC
Q psy5768 144 YLFWSESGKIPLIARAGLDGKKQTILA--Q------EIIMPIKDITLDLKFFSAFYRNL 194 (652)
Q Consensus 144 ~lywtd~~~~~~I~~~~ldg~~~~~~~--~------~~~~~p~gl~lD~~~~~ly~~d~ 194 (652)
.||.-+.. ...|...+.+|.-...+- . ..+.+|-|||+|. +++||+++-
T Consensus 184 ~lliLS~e-s~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~-~G~LYIvsE 240 (248)
T PF06977_consen 184 HLLILSDE-SRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDP-DGNLYIVSE 240 (248)
T ss_dssp EEEEEETT-TTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-T-T--EEEEET
T ss_pred eEEEEECC-CCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECC-CCCEEEEcC
Confidence 99999764 668888888888544332 2 1267899999996 679999875
No 40
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=98.42 E-value=0.00037 Score=75.20 Aligned_cols=149 Identities=13% Similarity=0.102 Sum_probs=103.4
Q ss_pred CCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEE------e-CCCcCCccCCCCcEEEEcc------CCcE
Q psy5768 36 TLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVV------S-QKKYPAVTACNLHIAVDWI------AQNI 102 (652)
Q Consensus 36 ~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~------~-~~~~~~p~~~~~~lavDw~------~~~l 102 (652)
.|..|++|++.+ +++||++. ...++|+++..++...+.+. . .+ ...+ +|||+++- ++.|
T Consensus 28 GL~~Pw~maflP-DG~llVtE--R~~G~I~~v~~~~~~~~~~~~l~~v~~~~g-e~GL----lglal~PdF~~~~~n~~l 99 (454)
T TIGR03606 28 GLNKPWALLWGP-DNQLWVTE--RATGKILRVNPETGEVKVVFTLPEIVNDAQ-HNGL----LGLALHPDFMQEKGNPYV 99 (454)
T ss_pred CCCCceEEEEcC-CCeEEEEE--ecCCEEEEEeCCCCceeeeecCCceeccCC-CCce----eeEEECCCccccCCCcEE
Confidence 689999999997 47999999 65789999987654332221 1 12 2334 49999743 4679
Q ss_pred EEEeC---------CCCEEEEEEcCCC-----cEEEEEeCC----CCCceeEEEcCCCCeEEEEecCC------------
Q psy5768 103 YWSDP---------KENVIEVARLTGQ-----YRYVLISGG----VDQPSALAVDPESGYLFWSESGK------------ 152 (652)
Q Consensus 103 Y~~d~---------~~~~I~v~~~dg~-----~~~~l~~~~----~~~P~~iavd~~~g~lywtd~~~------------ 152 (652)
|++.+ ...+|.+..++.. ..++++... ...-..|+++| .|+||++-...
T Consensus 100 Yvsyt~~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgP-DG~LYVs~GD~g~~~~~n~~~~~ 178 (454)
T TIGR03606 100 YISYTYKNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGP-DGKIYYTIGEQGRNQGANFFLPN 178 (454)
T ss_pred EEEEeccCCCCCccCCcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECC-CCcEEEEECCCCCCCcccccCcc
Confidence 99852 2567988887631 234555431 23356899998 78999974211
Q ss_pred -------------------CCeEEEEeCCCCC----------cEEEEeecccCceeEEEeccCCEEEEEeC
Q psy5768 153 -------------------IPLIARAGLDGKK----------QTILAQEIIMPIKDITLDLKFFSAFYRNL 194 (652)
Q Consensus 153 -------------------~~~I~~~~ldg~~----------~~~~~~~~~~~p~gl~lD~~~~~ly~~d~ 194 (652)
.++|.|.+.||+- +..+...++..|.||++|+ +++||..+-
T Consensus 179 ~aQ~~~~~~~~~~~d~~~~~GkILRin~DGsiP~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~ 248 (454)
T TIGR03606 179 QAQHTPTQQELNGKDYHAYMGKVLRLNLDGSIPKDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQ 248 (454)
T ss_pred hhccccccccccccCcccCceEEEEEcCCCCCCCCCCccCCCcceEEEEeccccceeEECC-CCCEEEEec
Confidence 2379999999973 2356777789999999998 799988664
No 41
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.41 E-value=3.6e-05 Score=76.12 Aligned_cols=190 Identities=15% Similarity=0.162 Sum_probs=134.2
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEE-EeCCCCeEEEEEcCCCCCc
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYW-TCNNDATINKIDLDSPKAQ 391 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYw-td~~~~~I~~~~~~~~~~~ 391 (652)
.++.++.|++.+++||-+-.....|..+..+|.-...+- .++..|++|++ ++++.|. +|...+++..+.++.....
T Consensus 86 ~nvS~LTynp~~rtLFav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Iey--ig~n~fvi~dER~~~l~~~~vd~~t~~ 163 (316)
T COG3204 86 ANVSSLTYNPDTRTLFAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEY--IGGNQFVIVDERDRALYLFTVDADTTV 163 (316)
T ss_pred ccccceeeCCCcceEEEecCCCceEEEEecCCceEEEecccccCChhHeEE--ecCCEEEEEehhcceEEEEEEcCCccE
Confidence 457799999999999998777788999999997666665 78889999886 6788776 5666778888877643211
Q ss_pred ---cEEEEEeCC----CCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEc--------CCCCCceEEEec
Q psy5768 392 ---RIVVVRLGQ----HDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITT--------DITMPNALALDH 456 (652)
Q Consensus 392 ---~~~~~~~~~----~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~--------~l~~P~glaiD~ 456 (652)
...-+.++. ..--.|+|-||..+.||++-...+ .+|+...++-+.-..-+.. -+...+||..|.
T Consensus 164 ~~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr~P-~~I~~~~~~~~~l~~~~~~~~~~~~~~f~~DvSgl~~~~ 242 (316)
T COG3204 164 ISAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKERNP-IGIFEVTQSPSSLSVHASLDPTADRDLFVLDVSGLEFNA 242 (316)
T ss_pred EeccceEEeccccCCCCcCceeeecCCCCceEEEEEccCC-cEEEEEecCCcccccccccCcccccceEeeccccceecC
Confidence 111122221 234568999999999999865543 3777777544221111110 134567999999
Q ss_pred CCCEEEEEeCCCCeEEEEecCCCceEEEecC--------CCCceeEEEEe-CCEEEEEc
Q psy5768 457 QAEKLFWGDARLDKIERCDYDGTNRIVLSKI--------SPLHPFDMAVY-GEFIFWTD 506 (652)
Q Consensus 457 ~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~--------~~~~p~glav~-~~~lYwtd 506 (652)
.++.|++.-.....+--++++|.-+..+.-. .+++|-|||.+ ++.||.+.
T Consensus 243 ~~~~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g~lYIvS 301 (316)
T COG3204 243 ITNSLLVLSDESRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMDDDGNLYIVS 301 (316)
T ss_pred CCCcEEEEecCCceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEECCCCCEEEEe
Confidence 9999999888888888899999876655322 25788999998 56888764
No 42
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.37 E-value=1e-06 Score=62.40 Aligned_cols=42 Identities=40% Similarity=0.737 Sum_probs=37.9
Q ss_pred EEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCce
Q psy5768 440 SIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNR 481 (652)
Q Consensus 440 ~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~ 481 (652)
+++...+..|+||++|+.+++|||+|.....|++++++|.++
T Consensus 2 ~~~~~~~~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~g~~~ 43 (43)
T smart00135 2 TLLSEGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43 (43)
T ss_pred EEEECCCCCcCEEEEeecCCEEEEEeCCCCEEEEEeCCCCCC
Confidence 456668899999999999999999999999999999999763
No 43
>KOG1520|consensus
Probab=98.28 E-value=4.6e-05 Score=78.38 Aligned_cols=146 Identities=17% Similarity=0.236 Sum_probs=106.0
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEee-c----cCceeeeEEEccCCEEEEEeCCC----CeEEEEE
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLE-R----QGSVEGLAYEYVHNYLYWTCNND----ATINKID 384 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~-~----~~~~~glAvDw~~~~LYwtd~~~----~~I~~~~ 384 (652)
+.|.|++|+.+++.||++|..-| ++.+...|...+.+.. . ..-..+|.||. .+.+||||+.. ..+..+-
T Consensus 115 GRPLGl~f~~~ggdL~VaDAYlG-L~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~a~ 192 (376)
T KOG1520|consen 115 GRPLGIRFDKKGGDLYVADAYLG-LLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVFAA 192 (376)
T ss_pred CCcceEEeccCCCeEEEEeccee-eEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccccccchhheEEee
Confidence 45789999999999999998755 7788888766555542 1 22467899999 99999999865 2343344
Q ss_pred cCCCCCc---------cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCc---eEEEEcCCCCCceE
Q psy5768 385 LDSPKAQ---------RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGT---ESIITTDITMPNAL 452 (652)
Q Consensus 385 ~~~~~~~---------~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~---~~l~~~~l~~P~gl 452 (652)
+.+...+ ..+-+.+.++.-|.|+|+.|.+.++-+++... .+|.|.++.|... +++++.--..|--|
T Consensus 193 l~g~~~GRl~~YD~~tK~~~VLld~L~F~NGlaLS~d~sfvl~~Et~~--~ri~rywi~g~k~gt~EvFa~~LPG~PDNI 270 (376)
T KOG1520|consen 193 LEGDPTGRLFRYDPSTKVTKVLLDGLYFPNGLALSPDGSFVLVAETTT--ARIKRYWIKGPKAGTSEVFAEGLPGYPDNI 270 (376)
T ss_pred ecCCCccceEEecCcccchhhhhhcccccccccCCCCCCEEEEEeecc--ceeeeeEecCCccCchhhHhhcCCCCCcce
Confidence 4442211 12223457889999999999999999999874 3999999999877 66665345678889
Q ss_pred EEecCCCEEEEEe
Q psy5768 453 ALDHQAEKLFWGD 465 (652)
Q Consensus 453 aiD~~~~~LYw~D 465 (652)
..|..++ ||+-
T Consensus 271 R~~~~G~--fWVa 281 (376)
T KOG1520|consen 271 RRDSTGH--FWVA 281 (376)
T ss_pred eECCCCC--EEEE
Confidence 9985443 5553
No 44
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=98.27 E-value=7.7e-05 Score=80.42 Aligned_cols=155 Identities=17% Similarity=0.250 Sum_probs=108.2
Q ss_pred cccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEE------e--eccCceeeeEEEcc------CCEEEEEeC-
Q psy5768 311 TMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVL------L--ERQGSVEGLAYEYV------HNYLYWTCN- 375 (652)
Q Consensus 311 ~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i------~--~~~~~~~glAvDw~------~~~LYwtd~- 375 (652)
+++..+.+|+|.+. ++||+++...++|+++..++...+.+ + .+.+.+.|||+++. ++.||++-+
T Consensus 27 ~GL~~Pw~maflPD-G~llVtER~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~~lYvsyt~ 105 (454)
T TIGR03606 27 SGLNKPWALLWGPD-NQLWVTERATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNPYVYISYTY 105 (454)
T ss_pred CCCCCceEEEEcCC-CeEEEEEecCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCcEEEEEEec
Confidence 56778899999874 68999998779999997654332222 1 14567899999954 578999752
Q ss_pred --------CCCeEEEEEcCCCCC--c-cEEEEEe---CCCCCceEEEEeCCCCEEEEEecCC------------------
Q psy5768 376 --------NDATINKIDLDSPKA--Q-RIVVVRL---GQHDKPRGIDIDSCDSRIYWTNWNS------------------ 423 (652)
Q Consensus 376 --------~~~~I~~~~~~~~~~--~-~~~~~~~---~~~~~P~~Iavdp~~g~Lywtd~~~------------------ 423 (652)
...+|.+..++.... . .+.++.. .....-..|+++| .|+||++-...
T Consensus 106 ~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgP-DG~LYVs~GD~g~~~~~n~~~~~~aQ~~~ 184 (454)
T TIGR03606 106 KNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGP-DGKIYYTIGEQGRNQGANFFLPNQAQHTP 184 (454)
T ss_pred cCCCCCccCCcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECC-CCcEEEEECCCCCCCcccccCcchhcccc
Confidence 245788888753211 1 2334321 1123456799999 68999973321
Q ss_pred ------------CCCceEEEeecCCC----------ceEEEEcCCCCCceEEEecCCCEEEEEeCCC
Q psy5768 424 ------------HLPSIQRAFFSGFG----------TESIITTDITMPNALALDHQAEKLFWGDARL 468 (652)
Q Consensus 424 ------------~~~~I~r~~ldG~~----------~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~ 468 (652)
..++|.|.+.||+- +..|....++.|.||++|+ +++||.+|.+.
T Consensus 185 ~~~~~~~~d~~~~~GkILRin~DGsiP~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~Gp 250 (454)
T TIGR03606 185 TQQELNGKDYHAYMGKVLRLNLDGSIPKDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQGP 250 (454)
T ss_pred ccccccccCcccCceEEEEEcCCCCCCCCCCccCCCcceEEEEeccccceeEECC-CCCEEEEecCC
Confidence 13489999999973 3357777899999999998 79999998654
No 45
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.22 E-value=0.0072 Score=64.49 Aligned_cols=335 Identities=10% Similarity=0.033 Sum_probs=164.7
Q ss_pred EEEecCCCCeEEEEecCCC-eeEEEecCCCCCCCCCCC-eeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC
Q psy5768 3 IAVSSPTQSKIVVCNLEGE-YQTTILSNESNDTSTLSK-ISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 80 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~-~~~~~~~~~~~~~~~~~~-~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~ 80 (652)
.+|+....+.|.++|.+.. .+.+|... .. ..++.+.+...++|+++ . .+.|..+++.-....--+..
T Consensus 8 ~~V~~~~~~~v~viD~~t~~~~~~i~~~--------~~~h~~~~~s~Dgr~~yv~~--r-dg~vsviD~~~~~~v~~i~~ 76 (369)
T PF02239_consen 8 FYVVERGSGSVAVIDGATNKVVARIPTG--------GAPHAGLKFSPDGRYLYVAN--R-DGTVSVIDLATGKVVATIKV 76 (369)
T ss_dssp EEEEEGGGTEEEEEETTT-SEEEEEE-S--------TTEEEEEE-TT-SSEEEEEE--T-TSEEEEEETTSSSEEEEEE-
T ss_pred EEEEecCCCEEEEEECCCCeEEEEEcCC--------CCceeEEEecCCCCEEEEEc--C-CCeEEEEECCcccEEEEEec
Confidence 5688889999999998854 44555431 23 45577888888999998 5 47899999875543333455
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCC-CcEEEEEeCCC----C--CceeEEEcCCCCeEEEEecCCC
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTG-QYRYVLISGGV----D--QPSALAVDPESGYLFWSESGKI 153 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg-~~~~~l~~~~~----~--~P~~iavd~~~g~lywtd~~~~ 153 (652)
| ..|. |+|+...++.+|.++...+.+.++|... +..+.+-..+. . ++.+|.-.|. +..|+......
T Consensus 77 G--~~~~----~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~-~~~fVv~lkd~ 149 (369)
T PF02239_consen 77 G--GNPR----GIAVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPG-RPEFVVNLKDT 149 (369)
T ss_dssp S--SEEE----EEEE--TTTEEEEEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SS-SSEEEEEETTT
T ss_pred C--CCcc----eEEEcCCCCEEEEEecCCCceeEeccccccceeecccccccccccCCCceeEEecCC-CCEEEEEEccC
Confidence 5 3478 9999988999999998899999998654 33444332211 2 3345555554 44454444346
Q ss_pred CeEEEEeCCCCCcEEEE-eecccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcce---eeee
Q psy5768 154 PLIARAGLDGKKQTILA-QEIIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKD---IKIY 229 (652)
Q Consensus 154 ~~I~~~~ldg~~~~~~~-~~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~---i~v~ 229 (652)
+.|+..+.......... -+.-..|.+..+|+..+.+|..... .+. ..+.+..+...+..+..+. .|.. -.+.
T Consensus 150 ~~I~vVdy~d~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~-sn~-i~viD~~~~k~v~~i~~g~--~p~~~~~~~~p 225 (369)
T PF02239_consen 150 GEIWVVDYSDPKNLKVTTIKVGRFPHDGGFDPDGRYFLVAANG-SNK-IAVIDTKTGKLVALIDTGK--KPHPGPGANFP 225 (369)
T ss_dssp TEEEEEETTTSSCEEEEEEE--TTEEEEEE-TTSSEEEEEEGG-GTE-EEEEETTTTEEEEEEE-SS--SBEETTEEEEE
T ss_pred CeEEEEEeccccccceeeecccccccccccCcccceeeecccc-cce-eEEEeeccceEEEEeeccc--ccccccccccc
Confidence 78998886544333221 1123679999999877666664332 332 2222211111111111110 1111 1111
Q ss_pred eeccCCCCCCCCCCCCCCcccceecCCCceEEEeCCccccCCCcccccceEE--EEeeecceeEEecCCCCCCCCCceee
Q psy5768 230 SKDAQTGTNPCGVNNGGCAELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFI--MYSRVNRIDSIHMTDKSDLNSPFESI 307 (652)
Q Consensus 230 ~~~~q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~L--l~s~~~~i~~i~l~~~~~~~~p~~~~ 307 (652)
|+.. +.. .-..| .+ .+. ++.. ..+..++ ..+- ..++.+
T Consensus 226 hp~~-----------------------g~v-w~~~~----~~-------~~~~~~ig~-~~v~v~d---~~~w-kvv~~I 265 (369)
T PF02239_consen 226 HPGF-----------------------GPV-WATSG----LG-------YFAIPLIGT-DPVSVHD---DYAW-KVVKTI 265 (369)
T ss_dssp ETTT-----------------------EEE-EEEEB----SS-------SSEEEEEE---TTT-ST---TTBT-SEEEEE
T ss_pred CCCc-----------------------ceE-Eeecc----cc-------ceecccccC-Cccccch---hhcC-eEEEEE
Confidence 2100 000 00000 00 000 0000 0000000 0010 011222
Q ss_pred eeccccceEEEEEEEcCCCeEEEe---ecccccEEEEeccCCcceEEeeccC--ceeeeEEEccCCEEEEEeCCCC-eEE
Q psy5768 308 RNSTMMKNIIELSYDYKRKTLFYS---DIQKGTINSVFFNGSNHRVLLERQG--SVEGLAYEYVHNYLYWTCNNDA-TIN 381 (652)
Q Consensus 308 ~~~~~~~~~~~v~~D~~~~~lyws---d~~~~~I~~~~~~g~~~~~i~~~~~--~~~glAvDw~~~~LYwtd~~~~-~I~ 381 (652)
... +....+..++....||.. +...+.|..++...-....-+.... .+..+.+...++.+|++....+ +|.
T Consensus 266 ~~~---G~glFi~thP~s~~vwvd~~~~~~~~~v~viD~~tl~~~~~i~~~~~~~~~h~ef~~dG~~v~vS~~~~~~~i~ 342 (369)
T PF02239_consen 266 PTQ---GGGLFIKTHPDSRYVWVDTFLNPDADTVQVIDKKTLKVVKTITPGPGKRVVHMEFNPDGKEVWVSVWDGNGAIV 342 (369)
T ss_dssp E-S---SSS--EE--TT-SEEEEE-TT-SSHT-EEEEECCGTEEEE-HHHHHT--EEEEEE-TTSSEEEEEEE--TTEEE
T ss_pred ECC---CCcceeecCCCCccEEeeccCCCCCceEEEEECcCcceeEEEeccCCCcEeccEECCCCCEEEEEEecCCCEEE
Confidence 211 222456667777767665 3556778888866532111112112 4889999999999999999888 999
Q ss_pred EEEcCCCCCccEEEEEeCCCCCceEE
Q psy5768 382 KIDLDSPKAQRIVVVRLGQHDKPRGI 407 (652)
Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~P~~I 407 (652)
+++.... +.+-... +..|.|+
T Consensus 343 v~D~~Tl----~~~~~i~-~~tP~G~ 363 (369)
T PF02239_consen 343 VYDAKTL----KEKKRIP-VPTPTGK 363 (369)
T ss_dssp EEETTTT----EEEEEEE---SEEEE
T ss_pred EEECCCc----EEEEEEE-eeCCCeE
Confidence 9997653 3333333 6677775
No 46
>KOG1520|consensus
Probab=98.21 E-value=1.6e-05 Score=81.72 Aligned_cols=173 Identities=18% Similarity=0.170 Sum_probs=113.9
Q ss_pred eEEEeecccccEEEEecc----CCcce-----EEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEE
Q psy5768 327 TLFYSDIQKGTINSVFFN----GSNHR-----VLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVR 397 (652)
Q Consensus 327 ~lywsd~~~~~I~~~~~~----g~~~~-----~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~ 397 (652)
-|+|+-...+-|-++... .+... ....--++|-|||+|-.++.||++|.-.+ +.+++..|.. .+.+.
T Consensus 77 il~~~g~~~Gwv~~~~~~~s~~~~~~~~~~~~~~e~~CGRPLGl~f~~~ggdL~VaDAYlG-L~~V~p~g~~---a~~l~ 152 (376)
T KOG1520|consen 77 ILKYTGNDDGWVKFADTKDSTNRSQCCDPGSFETEPLCGRPLGIRFDKKGGDLYVADAYLG-LLKVGPEGGL---AELLA 152 (376)
T ss_pred eEEEeccCceEEEEEeccccccccccCCCcceecccccCCcceEEeccCCCeEEEEeccee-eEEECCCCCc---ceecc
Confidence 357766555666555541 11111 11123479999999999999999998765 5566655432 22222
Q ss_pred eCCC----CCceEEEEeCCCCEEEEEecCCC---------------CCceEEEeecCCCceEEEEcCCCCCceEEEecCC
Q psy5768 398 LGQH----DKPRGIDIDSCDSRIYWTNWNSH---------------LPSIQRAFFSGFGTESIITTDITMPNALALDHQA 458 (652)
Q Consensus 398 ~~~~----~~P~~Iavdp~~g~Lywtd~~~~---------------~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~ 458 (652)
.... .-..++.|++ .|.+||||.... .+++.|.+..-...++|. +++..|||+++.+++
T Consensus 153 ~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD~~tK~~~VLl-d~L~F~NGlaLS~d~ 230 (376)
T KOG1520|consen 153 DEAEGKPFKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVFAALEGDPTGRLFRYDPSTKVTKVLL-DGLYFPNGLALSPDG 230 (376)
T ss_pred ccccCeeeeecCceeEcC-CCeEEEeccccccchhheEEeeecCCCccceEEecCcccchhhhh-hcccccccccCCCCC
Confidence 2222 2345689999 999999997642 234444433333333443 478899999999999
Q ss_pred CEEEEEeCCCCeEEEEecCCCce---EEEecCCCCceeEEEEeCCEEEEE
Q psy5768 459 EKLFWGDARLDKIERCDYDGTNR---IVLSKISPLHPFDMAVYGEFIFWT 505 (652)
Q Consensus 459 ~~LYw~D~~~~~I~~~~ldG~~~---~~l~~~~~~~p~glav~~~~lYwt 505 (652)
..+-+++....+|.++=+.|... +++.++-...|--|...++-=||.
T Consensus 231 sfvl~~Et~~~ri~rywi~g~k~gt~EvFa~~LPG~PDNIR~~~~G~fWV 280 (376)
T KOG1520|consen 231 SFVLVAETTTARIKRYWIKGPKAGTSEVFAEGLPGYPDNIRRDSTGHFWV 280 (376)
T ss_pred CEEEEEeeccceeeeeEecCCccCchhhHhhcCCCCCcceeECCCCCEEE
Confidence 99999999999999999999765 555554456788888775443444
No 47
>PRK04792 tolB translocation protein TolB; Provisional
Probab=98.11 E-value=0.0014 Score=72.13 Aligned_cols=204 Identities=11% Similarity=0.076 Sum_probs=128.5
Q ss_pred EEEEEEEcCCCeEEEeecc--cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCC--eEEEEEcCCCCCc
Q psy5768 316 IIELSYDYKRKTLFYSDIQ--KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDA--TINKIDLDSPKAQ 391 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~--~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~--~I~~~~~~~~~~~ 391 (652)
+....+.+..++|+|+... ...|+.+++.+...+.+...-+.....++.+.++.|+++....+ .|+++++.+..
T Consensus 220 ~~~p~wSPDG~~La~~s~~~g~~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~-- 297 (448)
T PRK04792 220 LMSPAWSPDGRKLAYVSFENRKAEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKA-- 297 (448)
T ss_pred ccCceECCCCCEEEEEEecCCCcEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCC--
Confidence 3456778888888887543 34688888877655444422223346788888999988754333 58888876532
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCC--C
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARL--D 469 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~--~ 469 (652)
.+.+. .........+.+|...+|+++......+.|++.++++...+.+.. .-.+..+.++.++++.||++.... .
T Consensus 298 ~~~lt--~~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~-~g~~~~~~~~SpDG~~l~~~~~~~g~~ 374 (448)
T PRK04792 298 LTRIT--RHRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTF-EGEQNLGGSITPDGRSMIMVNRTNGKF 374 (448)
T ss_pred eEECc--cCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEec-CCCCCcCeeECCCCCEEEEEEecCCce
Confidence 22221 122345667888988888877543334689999998766555432 222344668888899999986533 4
Q ss_pred eEEEEecCCCceEEEecCC-CCceeEEEEeCCEEEEEcCCCC--eEEEEEccCCceEEEE
Q psy5768 470 KIERCDYDGTNRIVLSKIS-PLHPFDMAVYGEFIFWTDWVIH--AVLRANKYTGEEVYTL 526 (652)
Q Consensus 470 ~I~~~~ldG~~~~~l~~~~-~~~p~glav~~~~lYwtd~~~~--~I~~~~k~~g~~~~~~ 526 (652)
.|+.+++++...+.+.... ...| +.+-++.+|+++....+ .++.++. +|...+.+
T Consensus 375 ~I~~~dl~~g~~~~lt~~~~d~~p-s~spdG~~I~~~~~~~g~~~l~~~~~-~G~~~~~l 432 (448)
T PRK04792 375 NIARQDLETGAMQVLTSTRLDESP-SVAPNGTMVIYSTTYQGKQVLAAVSI-DGRFKARL 432 (448)
T ss_pred EEEEEECCCCCeEEccCCCCCCCc-eECCCCCEEEEEEecCCceEEEEEEC-CCCceEEC
Confidence 7889999988766654432 1233 34445678877654433 3666665 56554444
No 48
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.10 E-value=6.4e-06 Score=58.22 Aligned_cols=41 Identities=41% Similarity=0.693 Sum_probs=36.4
Q ss_pred EEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCc
Q psy5768 76 TVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQY 121 (652)
Q Consensus 76 ~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~ 121 (652)
+++..+ +..|. |||+||..++|||+|...+.|.+++++|..
T Consensus 2 ~~~~~~-~~~~~----~la~d~~~~~lYw~D~~~~~I~~~~~~g~~ 42 (43)
T smart00135 2 TLLSEG-LGHPN----GLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42 (43)
T ss_pred EEEECC-CCCcC----EEEEeecCCEEEEEeCCCCEEEEEeCCCCC
Confidence 345566 88899 999999999999999999999999999974
No 49
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=98.06 E-value=0.0018 Score=66.45 Aligned_cols=175 Identities=14% Similarity=0.090 Sum_probs=106.3
Q ss_pred CeEEEEecCCC-eeEEEecCCCCCCCCCCCeeEEEEECCC-----CEEEEEeccCCcceEEEEEcCC-CccEEEE-----
Q psy5768 11 SKIVVCNLEGE-YQTTILSNESNDTSTLSKISSIAVWPVK-----GKMFWSNVTKQVVTIEMAFMDG-TKRETVV----- 78 (652)
Q Consensus 11 ~~I~~~~~~g~-~~~~~~~~~~~~~~~~~~~~~v~~d~~~-----~~lyw~d~~~~~~~I~~~~~dg-s~~~~v~----- 78 (652)
-+|+.+|+... .++++.- .......-+....+.+|... +++|++| .+...|..+++.. ..++++-
T Consensus 34 pKLv~~Dl~t~~li~~~~~-p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD--~~~~glIV~dl~~~~s~Rv~~~~~~~ 110 (287)
T PF03022_consen 34 PKLVAFDLKTNQLIRRYPF-PPDIAPPDSFLNDLVVDVRDGNCDDGFAYITD--SGGPGLIVYDLATGKSWRVLHNSFSP 110 (287)
T ss_dssp -EEEEEETTTTCEEEEEE---CCCS-TCGGEEEEEEECTTTTS-SEEEEEEE--TTTCEEEEEETTTTEEEEEETCGCTT
T ss_pred cEEEEEECCCCcEEEEEEC-ChHHcccccccceEEEEccCCCCcceEEEEeC--CCcCcEEEEEccCCcEEEEecCCcce
Confidence 59999999955 4444431 11122245678889999855 5999999 7777888887664 3333321
Q ss_pred --------eCC-CcCCccCCCCcEEEEccC---CcEEEEeCCCCEEEEEEcC----CC---------cEEEEEeCCCCCc
Q psy5768 79 --------SQK-KYPAVTACNLHIAVDWIA---QNIYWSDPKENVIEVARLT----GQ---------YRYVLISGGVDQP 133 (652)
Q Consensus 79 --------~~~-~~~~p~~~~~~lavDw~~---~~lY~~d~~~~~I~v~~~d----g~---------~~~~l~~~~~~~P 133 (652)
-.+ .+..+. ...|+|+..+. +.|||.--...++.+.... .+ ..+.+... ..+-
T Consensus 111 ~p~~~~~~i~g~~~~~~d-g~~gial~~~~~d~r~LYf~~lss~~ly~v~T~~L~~~~~~~~~~~~~~v~~lG~k-~~~s 188 (287)
T PF03022_consen 111 DPDAGPFTIGGESFQWPD-GIFGIALSPISPDGRWLYFHPLSSRKLYRVPTSVLRDPSLSDAQALASQVQDLGDK-GSQS 188 (287)
T ss_dssp S-SSEEEEETTEEEEETT-SEEEEEE-TTSTTS-EEEEEETT-SEEEEEEHHHHCSTT--HHH-HHHT-EEEEE----SE
T ss_pred eccccceeccCceEecCC-CccccccCCCCCCccEEEEEeCCCCcEEEEEHHHhhCccccccccccccceecccc-CCCC
Confidence 111 011111 12378887644 4699988777778777641 11 12333332 2466
Q ss_pred eeEEEcCCCCeEEEEecCCCCeEEEEeCCC----CCcEEEEee-c-ccCceeEEEec-cCCEEEEE
Q psy5768 134 SALAVDPESGYLFWSESGKIPLIARAGLDG----KKQTILAQE-I-IMPIKDITLDL-KFFSAFYR 192 (652)
Q Consensus 134 ~~iavd~~~g~lywtd~~~~~~I~~~~ldg----~~~~~~~~~-~-~~~p~gl~lD~-~~~~ly~~ 192 (652)
.++++|+ +|.||+++.. ...|.+.+.++ .+...++.. . +.||.+++++. ..+.||+.
T Consensus 189 ~g~~~D~-~G~ly~~~~~-~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~ 252 (287)
T PF03022_consen 189 DGMAIDP-NGNLYFTDVE-QNAIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVL 252 (287)
T ss_dssp CEEEEET-TTEEEEEECC-CTEEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEE
T ss_pred ceEEECC-CCcEEEecCC-CCeEEEEeCCCCcCccchheeEEcCceeeccceeeeccccCceEEEE
Confidence 7999998 9999999986 66899999888 344444443 3 89999999997 46788874
No 50
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=98.00 E-value=0.0023 Score=65.66 Aligned_cols=148 Identities=17% Similarity=0.194 Sum_probs=95.5
Q ss_pred CceeeeEEEccC-----CEEEEEeCCCCeEEEEEcCCCCCccEEE------------EEe-----CCCCCceEEEEeC--
Q psy5768 356 GSVEGLAYEYVH-----NYLYWTCNNDATINKIDLDSPKAQRIVV------------VRL-----GQHDKPRGIDIDS-- 411 (652)
Q Consensus 356 ~~~~glAvDw~~-----~~LYwtd~~~~~I~~~~~~~~~~~~~~~------------~~~-----~~~~~P~~Iavdp-- 411 (652)
.....|+||-.. +.+|+||.....|.|+++...+. ++.+ +.. .......+||+.|
T Consensus 61 s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~~~~s-~Rv~~~~~~~~p~~~~~~i~g~~~~~~dg~~gial~~~~ 139 (287)
T PF03022_consen 61 SFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLATGKS-WRVLHNSFSPDPDAGPFTIGGESFQWPDGIFGIALSPIS 139 (287)
T ss_dssp GGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETTTTEE-EEEETCGCTTS-SSEEEEETTEEEEETTSEEEEEE-TTS
T ss_pred cccceEEEEccCCCCcceEEEEeCCCcCcEEEEEccCCcE-EEEecCCcceeccccceeccCceEecCCCccccccCCCC
Confidence 356789999754 58999999988999998865221 1111 000 1112357788877
Q ss_pred -CCCEEEEEecCCCCCceEEEeec----CCC---------ceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecC
Q psy5768 412 -CDSRIYWTNWNSHLPSIQRAFFS----GFG---------TESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYD 477 (652)
Q Consensus 412 -~~g~Lywtd~~~~~~~I~r~~ld----G~~---------~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ld 477 (652)
..+.|||.-..+. +++++..+ .+. .+.+. .......|+++|. ++.||+++...+.|.+.+.+
T Consensus 140 ~d~r~LYf~~lss~--~ly~v~T~~L~~~~~~~~~~~~~~v~~lG-~k~~~s~g~~~D~-~G~ly~~~~~~~aI~~w~~~ 215 (287)
T PF03022_consen 140 PDGRWLYFHPLSSR--KLYRVPTSVLRDPSLSDAQALASQVQDLG-DKGSQSDGMAIDP-NGNLYFTDVEQNAIGCWDPD 215 (287)
T ss_dssp TTS-EEEEEETT-S--EEEEEEHHHHCSTT--HHH-HHHT-EEEE-E---SECEEEEET-TTEEEEEECCCTEEEEEETT
T ss_pred CCccEEEEEeCCCC--cEEEEEHHHhhCccccccccccccceecc-ccCCCCceEEECC-CCcEEEecCCCCeEEEEeCC
Confidence 4568999976543 67777652 211 12222 2223456999997 89999999999999999999
Q ss_pred C----CceEEEecC-C-CCceeEEEEeC---CEEEEEcCC
Q psy5768 478 G----TNRIVLSKI-S-PLHPFDMAVYG---EFIFWTDWV 508 (652)
Q Consensus 478 G----~~~~~l~~~-~-~~~p~glav~~---~~lYwtd~~ 508 (652)
+ .+.+++... . +..|-++++.+ ++||+...+
T Consensus 216 ~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snr 255 (287)
T PF03022_consen 216 GPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNR 255 (287)
T ss_dssp TSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S
T ss_pred CCcCccchheeEEcCceeeccceeeeccccCceEEEEECc
Confidence 8 455555543 3 67899999988 899997633
No 51
>PRK05137 tolB translocation protein TolB; Provisional
Probab=97.91 E-value=0.0059 Score=66.95 Aligned_cols=199 Identities=10% Similarity=0.031 Sum_probs=125.9
Q ss_pred eEEEEEEEcCCCeEEEeec--ccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCC
Q psy5768 315 NIIELSYDYKRKTLFYSDI--QKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKA 390 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~--~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~ 390 (652)
.+....+.+..++|+++.. ....|+..++.+...+.+....+...+.++.+.++.|+++-.. ...|+++++.+..
T Consensus 203 ~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~- 281 (435)
T PRK05137 203 LVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVGNFPGMTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRSGT- 281 (435)
T ss_pred CeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEeecCCCcccCcEECCCCCEEEEEEecCCCceEEEEECCCCc-
Confidence 4567778888888888754 3357999998776555554333344567888889998876543 3468888886532
Q ss_pred ccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCC--
Q psy5768 391 QRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARL-- 468 (652)
Q Consensus 391 ~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~-- 468 (652)
.+.+ . .........+..|...+|+++......+.|++.+++|...+.+.... ..-...++.+++++|+++....
T Consensus 282 -~~~L-t-~~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~-~~~~~~~~SpdG~~ia~~~~~~~~ 357 (435)
T PRK05137 282 -TTRL-T-DSPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGG-GRYSTPVWSPRGDLIAFTKQGGGQ 357 (435)
T ss_pred -eEEc-c-CCCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCC-CcccCeEECCCCCEEEEEEcCCCc
Confidence 2222 1 12233456788888777766543323468999999988776655322 2234567888899998875433
Q ss_pred CeEEEEecCCCceEEEecCCCCceeEEEE--eCCEEEEEcCCC-----CeEEEEEccCC
Q psy5768 469 DKIERCDYDGTNRIVLSKISPLHPFDMAV--YGEFIFWTDWVI-----HAVLRANKYTG 520 (652)
Q Consensus 469 ~~I~~~~ldG~~~~~l~~~~~~~p~glav--~~~~lYwtd~~~-----~~I~~~~k~~g 520 (652)
..|..++++|...+.+.... ...++++ ++.+||++-... ..++.++..++
T Consensus 358 ~~i~~~d~~~~~~~~lt~~~--~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~ 414 (435)
T PRK05137 358 FSIGVMKPDGSGERILTSGF--LVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGR 414 (435)
T ss_pred eEEEEEECCCCceEeccCCC--CCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCCC
Confidence 47899999887766654422 2333444 456887754322 36888887544
No 52
>PRK04922 tolB translocation protein TolB; Provisional
Probab=97.86 E-value=0.0045 Score=67.81 Aligned_cols=204 Identities=15% Similarity=0.142 Sum_probs=125.3
Q ss_pred EEEEEEEcCCCeEEEeecc--cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCCc
Q psy5768 316 IIELSYDYKRKTLFYSDIQ--KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKAQ 391 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~--~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~~ 391 (652)
+....+.+..++|+++... ...|+..++++...+.+....+.....++.+.++.|+++-.. ...|.++++.+..
T Consensus 206 v~~p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~~g~~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~-- 283 (433)
T PRK04922 206 ILSPAWSPDGKKLAYVSFERGRSAIYVQDLATGQRELVASFRGINGAPSFSPDGRRLALTLSRDGNPEIYVMDLGSRQ-- 283 (433)
T ss_pred cccccCCCCCCEEEEEecCCCCcEEEEEECCCCCEEEeccCCCCccCceECCCCCEEEEEEeCCCCceEEEEECCCCC--
Confidence 4566778888888887643 346898898776555554222233467888889999876433 3469888886532
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCC--CC
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDAR--LD 469 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~--~~ 469 (652)
.+.+. .........+.+|...+|+++......+.|+..++++...+.+.... ......++.++++.|+++... ..
T Consensus 284 ~~~lt--~~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~~~~~~ 360 (433)
T PRK04922 284 LTRLT--NHFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHGSGGQY 360 (433)
T ss_pred eEECc--cCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEECCCCce
Confidence 22221 12223356788998777777643222358999998876655544322 344568888899999997543 23
Q ss_pred eEEEEecCCCceEEEecCC-CCceeEEEEeCCEEEEEcCC--CCeEEEEEccCCceEEEE
Q psy5768 470 KIERCDYDGTNRIVLSKIS-PLHPFDMAVYGEFIFWTDWV--IHAVLRANKYTGEEVYTL 526 (652)
Q Consensus 470 ~I~~~~ldG~~~~~l~~~~-~~~p~glav~~~~lYwtd~~--~~~I~~~~k~~g~~~~~~ 526 (652)
.|..+++++...+.+.... ...| ..+-++.+|+++... ...|+.++. +|...+.+
T Consensus 361 ~I~v~d~~~g~~~~Lt~~~~~~~p-~~spdG~~i~~~s~~~g~~~L~~~~~-~g~~~~~l 418 (433)
T PRK04922 361 RIAVMDLSTGSVRTLTPGSLDESP-SFAPNGSMVLYATREGGRGVLAAVST-DGRVRQRL 418 (433)
T ss_pred eEEEEECCCCCeEECCCCCCCCCc-eECCCCCEEEEEEecCCceEEEEEEC-CCCceEEc
Confidence 6899999887766554432 1222 233346677776432 345777766 34433333
No 53
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.86 E-value=0.01 Score=64.63 Aligned_cols=204 Identities=15% Similarity=0.080 Sum_probs=124.4
Q ss_pred EEEEEEEcCCCeEEEeeccc--ccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCC--CeEEEEEcCCCCCc
Q psy5768 316 IIELSYDYKRKTLFYSDIQK--GTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNND--ATINKIDLDSPKAQ 391 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~--~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~--~~I~~~~~~~~~~~ 391 (652)
....++.+..++|+|+.... ..|+..++.+...+.+...-+....+++.+.++.||++.... ..|..+++.+..
T Consensus 192 ~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~~~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~~~~-- 269 (417)
T TIGR02800 192 ILSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKVASFPGMNGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDGKQ-- 269 (417)
T ss_pred eecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEECCCCCccEEEEECCCCC--
Confidence 45566788888888876543 568888887654444433223445678888888898875533 468888876532
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCC--C
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARL--D 469 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~--~ 469 (652)
.+.+.. ........+..|...+|+++......+.|+..++++...+.+.. .......+++.+.++.|+++.... .
T Consensus 270 ~~~l~~--~~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~-~~~~~~~~~~spdg~~i~~~~~~~~~~ 346 (417)
T TIGR02800 270 LTRLTN--GPGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTF-RGGYNASPSWSPDGDLIAFVHREGGGF 346 (417)
T ss_pred EEECCC--CCCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeec-CCCCccCeEECCCCCEEEEEEccCCce
Confidence 222221 11222345677877788776543334589999998876554443 223456778888899999986543 4
Q ss_pred eEEEEecCCCceEEEecCC-CCceeEEEEeCCEEEEEcCCCC-eEEEEEccCCceEEE
Q psy5768 470 KIERCDYDGTNRIVLSKIS-PLHPFDMAVYGEFIFWTDWVIH-AVLRANKYTGEEVYT 525 (652)
Q Consensus 470 ~I~~~~ldG~~~~~l~~~~-~~~p~glav~~~~lYwtd~~~~-~I~~~~k~~g~~~~~ 525 (652)
.|..+++++...+.+.... ...| ..+-++.+|+++....+ ....+...+|.....
T Consensus 347 ~i~~~d~~~~~~~~l~~~~~~~~p-~~spdg~~l~~~~~~~~~~~l~~~~~~g~~~~~ 403 (417)
T TIGR02800 347 NIAVMDLDGGGERVLTDTGLDESP-SFAPNGRMILYATTRGGRGVLGLVSTDGRFRAR 403 (417)
T ss_pred EEEEEeCCCCCeEEccCCCCCCCc-eECCCCCEEEEEEeCCCcEEEEEEECCCceeeE
Confidence 7899999887666655432 1222 33445678888765443 233333344544433
No 54
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.84 E-value=0.012 Score=64.29 Aligned_cols=202 Identities=13% Similarity=0.118 Sum_probs=125.4
Q ss_pred eEEEEEEEcCCCeEEEeecc--cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCC
Q psy5768 315 NIIELSYDYKRKTLFYSDIQ--KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKA 390 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~~--~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~ 390 (652)
.+....+.+..++|+|+... ...|+..++++...+.+...-+.....++.+.++.|+++-.. ...|+++++++..
T Consensus 200 ~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~- 278 (430)
T PRK00178 200 PILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQ- 278 (430)
T ss_pred ceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEEEEEccCCCceEEEEECCCCC-
Confidence 34667788888888776543 346999998876555554322233457787888889876543 3468888887632
Q ss_pred ccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCC--
Q psy5768 391 QRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARL-- 468 (652)
Q Consensus 391 ~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~-- 468 (652)
.+. +. .........+..|....|+++......+.|++.++++...+.+.... ......++.++++.|+++....
T Consensus 279 -~~~-lt-~~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~-~~~~~~~~Spdg~~i~~~~~~~~~ 354 (430)
T PRK00178 279 -LSR-VT-NHPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVG-NYNARPRLSADGKTLVMVHRQDGN 354 (430)
T ss_pred -eEE-cc-cCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEEEEccCCc
Confidence 222 21 12223445677887777877643333468999998877655554221 2233467788899999986533
Q ss_pred CeEEEEecCCCceEEEecCCC-CceeEEEEeCCEEEEEcCCC--CeEEEEEccCCceE
Q psy5768 469 DKIERCDYDGTNRIVLSKISP-LHPFDMAVYGEFIFWTDWVI--HAVLRANKYTGEEV 523 (652)
Q Consensus 469 ~~I~~~~ldG~~~~~l~~~~~-~~p~glav~~~~lYwtd~~~--~~I~~~~k~~g~~~ 523 (652)
..|..+++++...+.+..... ..| .++-++.+|+++.... ..|+.++. +|...
T Consensus 355 ~~l~~~dl~tg~~~~lt~~~~~~~p-~~spdg~~i~~~~~~~g~~~l~~~~~-~g~~~ 410 (430)
T PRK00178 355 FHVAAQDLQRGSVRILTDTSLDESP-SVAPNGTMLIYATRQQGRGVLMLVSI-NGRVR 410 (430)
T ss_pred eEEEEEECCCCCEEEccCCCCCCCc-eECCCCCEEEEEEecCCceEEEEEEC-CCCce
Confidence 368899999877766654321 223 3444567888876543 34666665 34433
No 55
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=97.81 E-value=9e-05 Score=61.10 Aligned_cols=73 Identities=21% Similarity=0.256 Sum_probs=56.7
Q ss_pred EEEEeCCCCEEEEEecCC---------------CCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCe
Q psy5768 406 GIDIDSCDSRIYWTNWNS---------------HLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDK 470 (652)
Q Consensus 406 ~Iavdp~~g~Lywtd~~~---------------~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~ 470 (652)
+|+|++..|.+||||... ..+++.+.++.....+++. .++..|||+++..++.-|++++....+
T Consensus 2 dldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~-~~L~fpNGVals~d~~~vlv~Et~~~R 80 (89)
T PF03088_consen 2 DLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLL-DGLYFPNGVALSPDESFVLVAETGRYR 80 (89)
T ss_dssp EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEE-EEESSEEEEEE-TTSSEEEEEEGGGTE
T ss_pred ceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEeh-hCCCccCeEEEcCCCCEEEEEeccCce
Confidence 689999889999998642 2468888888876666665 479999999999999999999999999
Q ss_pred EEEEecCCC
Q psy5768 471 IERCDYDGT 479 (652)
Q Consensus 471 I~~~~ldG~ 479 (652)
|.+.-+.|.
T Consensus 81 i~rywl~Gp 89 (89)
T PF03088_consen 81 ILRYWLKGP 89 (89)
T ss_dssp EEEEESSST
T ss_pred EEEEEEeCC
Confidence 999988873
No 56
>PRK03629 tolB translocation protein TolB; Provisional
Probab=97.81 E-value=0.013 Score=64.14 Aligned_cols=206 Identities=10% Similarity=0.056 Sum_probs=126.7
Q ss_pred eEEEEEEEcCCCeEEEeec--ccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCC--CeEEEEEcCCCCC
Q psy5768 315 NIIELSYDYKRKTLFYSDI--QKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNND--ATINKIDLDSPKA 390 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~--~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~--~~I~~~~~~~~~~ 390 (652)
......+.+..++|.|+.. ....|+..++++...+.+...-+....+++.+.++.|+++.... ..|+++++++..
T Consensus 200 ~~~~p~wSPDG~~la~~s~~~g~~~i~i~dl~~G~~~~l~~~~~~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~- 278 (429)
T PRK03629 200 PLMSPAWSPDGSKLAYVTFESGRSALVIQTLANGAVRQVASFPRHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQ- 278 (429)
T ss_pred ceeeeEEcCCCCEEEEEEecCCCcEEEEEECCCCCeEEccCCCCCcCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCC-
Confidence 3557788888888877643 23468888887765555543223345678999999999975433 368888886532
Q ss_pred ccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCC--C
Q psy5768 391 QRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDAR--L 468 (652)
Q Consensus 391 ~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~--~ 468 (652)
.+. +. .........+..|....|+++......+.|++.+++|...+.+.. ........++.+++++|+++... .
T Consensus 279 -~~~-lt-~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~-~~~~~~~~~~SpDG~~Ia~~~~~~g~ 354 (429)
T PRK03629 279 -IRQ-VT-DGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITW-EGSQNQDADVSSDGKFMVMVSSNGGQ 354 (429)
T ss_pred -EEE-cc-CCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeec-CCCCccCEEECCCCCEEEEEEccCCC
Confidence 222 22 222355678889987777655332224689999999876665533 22234567788889999887543 3
Q ss_pred CeEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCCC--eEEEEEccCCceEEEE
Q psy5768 469 DKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVIH--AVLRANKYTGEEVYTL 526 (652)
Q Consensus 469 ~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~~--~I~~~~k~~g~~~~~~ 526 (652)
..|..+++++...+.+..........++-++.+|+++....+ .++.++. +|...+.+
T Consensus 355 ~~I~~~dl~~g~~~~Lt~~~~~~~p~~SpDG~~i~~~s~~~~~~~l~~~~~-~G~~~~~l 413 (429)
T PRK03629 355 QHIAKQDLATGGVQVLTDTFLDETPSIAPNGTMVIYSSSQGMGSVLNLVST-DGRFKARL 413 (429)
T ss_pred ceEEEEECCCCCeEEeCCCCCCCCceECCCCCEEEEEEcCCCceEEEEEEC-CCCCeEEC
Confidence 468889998877766654321122233445667777654332 3455554 45544444
No 57
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes and uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=97.79 E-value=0.0073 Score=60.54 Aligned_cols=213 Identities=15% Similarity=0.168 Sum_probs=134.0
Q ss_pred cccceEEEEEEEcCCCeEEEeecccccEEEEecc-----CCcceEEe--e------ccCceeeeEEEccCCE--------
Q psy5768 311 TMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFN-----GSNHRVLL--E------RQGSVEGLAYEYVHNY-------- 369 (652)
Q Consensus 311 ~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~-----g~~~~~i~--~------~~~~~~glAvDw~~~~-------- 369 (652)
..+.|+.+|++.+.. -++++|..++.....+.+ |.....++ . ....|.|+.+.-....
T Consensus 20 p~L~N~WGia~~p~~-~~WVadngT~~~TlYdg~~~~~~g~~~~L~vtiP~~~~~~~~~~PTGiVfN~~~~F~vt~~g~~ 98 (336)
T TIGR03118 20 PGLRNAWGLSYRPGG-PFWVANTGTGTATLYVGNPDTQPLVQDPLVVVIPAPPPLAAEGTPTGQVFNGSDTFVVSGEGIT 98 (336)
T ss_pred ccccccceeEecCCC-CEEEecCCcceEEeecCCcccccCCccceEEEecCCCCCCCCCCccEEEEeCCCceEEcCCCcc
Confidence 346789999998854 566677777776666665 43333333 1 2347999998744332
Q ss_pred -----EEEEeCCCCeEEEEEcCCCCCc---cEEEEEe--CCCCCceEEEEeCC--CCEEEEEecCCCCCceEEEeecCCC
Q psy5768 370 -----LYWTCNNDATINKIDLDSPKAQ---RIVVVRL--GQHDKPRGIDIDSC--DSRIYWTNWNSHLPSIQRAFFSGFG 437 (652)
Q Consensus 370 -----LYwtd~~~~~I~~~~~~~~~~~---~~~~~~~--~~~~~P~~Iavdp~--~g~Lywtd~~~~~~~I~r~~ldG~~ 437 (652)
||.|+ .++|..-...- ... ...++.. ....--+++||-+. ..+||-+|... ++|... |++-
T Consensus 99 ~~a~Fif~tE--dGTisaW~p~v-~~t~~~~~~~~~d~s~~gavYkGLAi~~~~~~~~LYaadF~~--g~IDVF--d~~f 171 (336)
T TIGR03118 99 GPSRFLFVTE--DGTLSGWAPAL-GTTRMTRAEIVVDASQQGNVYKGLAVGPTGGGDYLYAANFRQ--GRIDVF--KGSF 171 (336)
T ss_pred cceeEEEEeC--CceEEeecCcC-CcccccccEEEEccCCCcceeeeeEEeecCCCceEEEeccCC--CceEEe--cCcc
Confidence 33333 34444322110 000 1112221 12455678888754 67999999963 488754 4444
Q ss_pred ceEEEEcC-----C---CCCceEEEecCCCEEEEEe-------------CCCCeEEEEecCCCceEEEecCC-CCceeEE
Q psy5768 438 TESIITTD-----I---TMPNALALDHQAEKLFWGD-------------ARLDKIERCDYDGTNRIVLSKIS-PLHPFDM 495 (652)
Q Consensus 438 ~~~l~~~~-----l---~~P~glaiD~~~~~LYw~D-------------~~~~~I~~~~ldG~~~~~l~~~~-~~~p~gl 495 (652)
..+-+... + ..|.+|.- .+++||++= .+.+.|...+++|.-.+.+.+.. +..|+||
T Consensus 172 ~~~~~~g~F~DP~iPagyAPFnIqn--ig~~lyVtYA~qd~~~~d~v~G~G~G~VdvFd~~G~l~~r~as~g~LNaPWG~ 249 (336)
T TIGR03118 172 RPPPLPGSFIDPALPAGYAPFNVQN--LGGTLYVTYAQQDADRNDEVAGAGLGYVNVFTLNGQLLRRVASSGRLNAPWGL 249 (336)
T ss_pred ccccCCCCccCCCCCCCCCCcceEE--ECCeEEEEEEecCCcccccccCCCcceEEEEcCCCcEEEEeccCCcccCCcee
Confidence 33222111 1 13555543 379999872 24568999999999887776554 8999999
Q ss_pred EE-------eCCEEEEEcCCCCeEEEEEccCCceEEEEecccCCc
Q psy5768 496 AV-------YGEFIFWTDWVIHAVLRANKYTGEEVYTLRKNIRRP 533 (652)
Q Consensus 496 av-------~~~~lYwtd~~~~~I~~~~k~~g~~~~~~~~~~~~p 533 (652)
++ +.+.|.+-+...++|-..|..+|+.+-.|......|
T Consensus 250 a~APa~FG~~sg~lLVGNFGDG~InaFD~~sG~~~g~L~~~~G~p 294 (336)
T TIGR03118 250 AIAPESFGSLSGALLVGNFGDGTINAYDPQSGAQLGQLLDPDNHP 294 (336)
T ss_pred eeChhhhCCCCCCeEEeecCCceeEEecCCCCceeeeecCCCCCe
Confidence 98 357899999999999999998898777776544444
No 58
>PRK02889 tolB translocation protein TolB; Provisional
Probab=97.77 E-value=0.016 Score=63.47 Aligned_cols=205 Identities=15% Similarity=0.101 Sum_probs=121.7
Q ss_pred EEEEEEEcCCCeEEEeeccc--ccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCCc
Q psy5768 316 IIELSYDYKRKTLFYSDIQK--GTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKAQ 391 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~--~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~~ 391 (652)
+....+.+..++|+++.... ..|+..++.+.....+....+.....++.+.++.|+++-.. ...|+++++++..
T Consensus 198 v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~-- 275 (427)
T PRK02889 198 IISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVVANFKGSNSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSG-- 275 (427)
T ss_pred cccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEEccCCCceEEEEECCCCC--
Confidence 45677888888888775433 45888888766554443222344567888889999886433 3468888876532
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCC--C
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARL--D 469 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~--~ 469 (652)
.+.+ . .........+..|...+|+++......+.|+...+++...+.+.... ......++.++++.|+++.... .
T Consensus 276 ~~~l-t-~~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~s~~~g~~ 352 (427)
T PRK02889 276 LRRL-T-QSSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFTG-SYNTSPRISPDGKLLAYISRVGGAF 352 (427)
T ss_pred cEEC-C-CCCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecCC-CCcCceEECCCCCEEEEEEccCCcE
Confidence 2222 1 11223345678898777776543223468999998876655444222 2233567888899998876433 3
Q ss_pred eEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCC--CeEEEEEccCCceEEEE
Q psy5768 470 KIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVI--HAVLRANKYTGEEVYTL 526 (652)
Q Consensus 470 ~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~--~~I~~~~k~~g~~~~~~ 526 (652)
.|..+++++...+.+...........+-++.+||++-... ..++.++. +|...+.+
T Consensus 353 ~I~v~d~~~g~~~~lt~~~~~~~p~~spdg~~l~~~~~~~g~~~l~~~~~-~g~~~~~l 410 (427)
T PRK02889 353 KLYVQDLATGQVTALTDTTRDESPSFAPNGRYILYATQQGGRSVLAAVSS-DGRIKQRL 410 (427)
T ss_pred EEEEEECCCCCeEEccCCCCccCceECCCCCEEEEEEecCCCEEEEEEEC-CCCceEEe
Confidence 6899999877766665432111112223455666654322 23556665 45544444
No 59
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.74 E-value=0.014 Score=63.50 Aligned_cols=203 Identities=11% Similarity=0.045 Sum_probs=126.0
Q ss_pred EEEEEEEcCCCe-EEEeecc--cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCC
Q psy5768 316 IIELSYDYKRKT-LFYSDIQ--KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKA 390 (652)
Q Consensus 316 ~~~v~~D~~~~~-lywsd~~--~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~ 390 (652)
.....+.+..++ +|++... ...|+.+++.+...+.+...-+.....++.+.++.|.++... ...|+++++++..
T Consensus 190 ~~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl~tg~~~~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~- 268 (419)
T PRK04043 190 NIFPKWANKEQTAFYYTSYGERKPTLYKYNLYTGKKEKIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKT- 268 (419)
T ss_pred eEeEEECCCCCcEEEEEEccCCCCEEEEEECCCCcEEEEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCc-
Confidence 345667777665 7776543 467999998877666665322223334566678888887643 3579988886542
Q ss_pred ccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCC--
Q psy5768 391 QRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARL-- 468 (652)
Q Consensus 391 ~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~-- 468 (652)
.+.+..... .-..-...|....||++......+.|++++++|...+.+...... + ..+++++++|.++-...
T Consensus 269 -~~~LT~~~~--~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~~--~-~~~SPDG~~Ia~~~~~~~~ 342 (419)
T PRK04043 269 -LTQITNYPG--IDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGKN--N-SSVSTYKNYIVYSSRETNN 342 (419)
T ss_pred -EEEcccCCC--ccCccEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCCc--C-ceECCCCCEEEEEEcCCCc
Confidence 222221111 122346888777888876544457999999998877655543322 2 37888899998885432
Q ss_pred ------CeEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCC--CeEEEEEccCCceEEEE
Q psy5768 469 ------DKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVI--HAVLRANKYTGEEVYTL 526 (652)
Q Consensus 469 ------~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~--~~I~~~~k~~g~~~~~~ 526 (652)
..|..++++|...+.|...........+-++..|+++.... ..+..++. +|.....+
T Consensus 343 ~~~~~~~~I~v~d~~~g~~~~LT~~~~~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l-~g~~~~~l 407 (419)
T PRK04043 343 EFGKNTFNLYLISTNSDYIRRLTANGVNQFPRFSSDGGSIMFIKYLGNQSALGIIRL-NYNKSFLF 407 (419)
T ss_pred ccCCCCcEEEEEECCCCCeEECCCCCCcCCeEECCCCCEEEEEEccCCcEEEEEEec-CCCeeEEe
Confidence 47999999988877776543222223444567787765432 24667776 45444444
No 60
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=97.73 E-value=0.003 Score=61.95 Aligned_cols=207 Identities=10% Similarity=0.026 Sum_probs=126.8
Q ss_pred CCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCC
Q psy5768 325 RKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDK 403 (652)
Q Consensus 325 ~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~ 403 (652)
++.+++++..++.|-+.+......+.+- ..-.+|.+|.+++.+ .++++|+.. .|.+++-......+-.+-......+
T Consensus 72 dG~VWft~qg~gaiGhLdP~tGev~~ypLg~Ga~Phgiv~gpdg-~~Witd~~~-aI~R~dpkt~evt~f~lp~~~a~~n 149 (353)
T COG4257 72 DGAVWFTAQGTGAIGHLDPATGEVETYPLGSGASPHGIVVGPDG-SAWITDTGL-AIGRLDPKTLEVTRFPLPLEHADAN 149 (353)
T ss_pred CCceEEecCccccceecCCCCCceEEEecCCCCCCceEEECCCC-CeeEecCcc-eeEEecCcccceEEeecccccCCCc
Confidence 4779999999999999986654444443 444589999999854 677888876 7887764322211111111112244
Q ss_pred ceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCc--eEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCce
Q psy5768 404 PRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGT--ESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNR 481 (652)
Q Consensus 404 P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~--~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~ 481 (652)
-...++|+ .|.|+||....-.+ .||-... ++.-.-.-..|+||.+.++ +.+|++....+.|.++|--....
T Consensus 150 let~vfD~-~G~lWFt~q~G~yG-----rLdPa~~~i~vfpaPqG~gpyGi~atpd-Gsvwyaslagnaiaridp~~~~a 222 (353)
T COG4257 150 LETAVFDP-WGNLWFTGQIGAYG-----RLDPARNVISVFPAPQGGGPYGICATPD-GSVWYASLAGNAIARIDPFAGHA 222 (353)
T ss_pred ccceeeCC-CccEEEeeccccce-----ecCcccCceeeeccCCCCCCcceEECCC-CcEEEEeccccceEEcccccCCc
Confidence 45678888 79999997532111 3333222 2221222346999999865 67888888888898888644344
Q ss_pred EEEecCC--CCceeEEEEe-CCEEEEEcCCCCeEEEEEccCCceEEEEeccc-CCcceeEEEe
Q psy5768 482 IVLSKIS--PLHPFDMAVY-GEFIFWTDWVIHAVLRANKYTGEEVYTLRKNI-RRPMGIVAIS 540 (652)
Q Consensus 482 ~~l~~~~--~~~p~glav~-~~~lYwtd~~~~~I~~~~k~~g~~~~~~~~~~-~~p~~i~~~~ 540 (652)
+++..-. -..-..+-.+ -+++..|+|.+++++|++..+-+-...-..+. .+|+.+.|-.
T Consensus 223 ev~p~P~~~~~gsRriwsdpig~~wittwg~g~l~rfdPs~~sW~eypLPgs~arpys~rVD~ 285 (353)
T COG4257 223 EVVPQPNALKAGSRRIWSDPIGRAWITTWGTGSLHRFDPSVTSWIEYPLPGSKARPYSMRVDR 285 (353)
T ss_pred ceecCCCcccccccccccCccCcEEEeccCCceeeEeCcccccceeeeCCCCCCCcceeeecc
Confidence 4443211 1122334444 46899999999999999986554322212222 5677776654
No 61
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.68 E-value=0.0043 Score=61.72 Aligned_cols=177 Identities=15% Similarity=0.091 Sum_probs=114.8
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEE-----
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETV----- 77 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v----- 77 (652)
+|......-.|..++.+|+..+++... -++.|.+|.|- .+|..-.+| ....+++....+-......
T Consensus 100 LFav~n~p~~iVElt~~GdlirtiPL~------g~~DpE~Ieyi-g~n~fvi~d--ER~~~l~~~~vd~~t~~~~~~~~~ 170 (316)
T COG3204 100 LFAVTNKPAAIVELTKEGDLIRTIPLT------GFSDPETIEYI-GGNQFVIVD--ERDRALYLFTVDADTTVISAKVQK 170 (316)
T ss_pred EEEecCCCceEEEEecCCceEEEeccc------ccCChhHeEEe-cCCEEEEEe--hhcceEEEEEEcCCccEEeccceE
Confidence 444545567889999999999999864 36888999996 456666688 6677787777664311111
Q ss_pred EeCC----CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCc--EEEEEeC--------CCCCceeEEEcCCCC
Q psy5768 78 VSQK----KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQY--RYVLISG--------GVDQPSALAVDPESG 143 (652)
Q Consensus 78 ~~~~----~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~--~~~l~~~--------~~~~P~~iavd~~~g 143 (652)
++-+ ....-+ |+|.|...+++|++-. ++-|..+..++.. ..+-+.. .+....+++.|+.+|
T Consensus 171 i~L~~~~k~N~GfE----GlA~d~~~~~l~~aKE-r~P~~I~~~~~~~~~l~~~~~~~~~~~~~~f~~DvSgl~~~~~~~ 245 (316)
T COG3204 171 IPLGTTNKKNKGFE----GLAWDPVDHRLFVAKE-RNPIGIFEVTQSPSSLSVHASLDPTADRDLFVLDVSGLEFNAITN 245 (316)
T ss_pred EeccccCCCCcCce----eeecCCCCceEEEEEc-cCCcEEEEEecCCcccccccccCcccccceEeeccccceecCCCC
Confidence 1111 134578 9999999999999874 3334433333322 1111110 134567899999999
Q ss_pred eEEEEecCCCCeEEEEeCCCCCcEEE--Eee------cccCceeEEEeccCCEEEEEeCC
Q psy5768 144 YLFWSESGKIPLIARAGLDGKKQTIL--AQE------IIMPIKDITLDLKFFSAFYRNLS 195 (652)
Q Consensus 144 ~lywtd~~~~~~I~~~~ldg~~~~~~--~~~------~~~~p~gl~lD~~~~~ly~~d~~ 195 (652)
.|++-... ...+...+++|.-+..+ ... ++.+|.|||+|. .+.||++.-.
T Consensus 246 ~LLVLS~E-Sr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamDd-~g~lYIvSEP 303 (316)
T COG3204 246 SLLVLSDE-SRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMDD-DGNLYIVSEP 303 (316)
T ss_pred cEEEEecC-CceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEECC-CCCEEEEecC
Confidence 98887654 44677777877643322 222 278999999995 5778887544
No 62
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.60 E-value=0.017 Score=62.70 Aligned_cols=189 Identities=11% Similarity=0.081 Sum_probs=114.5
Q ss_pred CeEEEeecc---cccEEEEeccCCcceEEeeccCceeeeEEEccCCE-EEEEeCC--CCeEEEEEcCCCCCccEEEEEeC
Q psy5768 326 KTLFYSDIQ---KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNY-LYWTCNN--DATINKIDLDSPKAQRIVVVRLG 399 (652)
Q Consensus 326 ~~lywsd~~---~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~-LYwtd~~--~~~I~~~~~~~~~~~~~~~~~~~ 399 (652)
..+|++... ..+|+.++.+|...+.+..+ +....-+..+.++. +|++... ...|+++++.+.. ++.+..
T Consensus 156 r~~~v~~~~~~~~~~l~~~d~dg~~~~~~~~~-~~~~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl~tg~--~~~lt~-- 230 (419)
T PRK04043 156 RKVVFSKYTGPKKSNIVLADYTLTYQKVIVKG-GLNIFPKWANKEQTAFYYTSYGERKPTLYKYNLYTGK--KEKIAS-- 230 (419)
T ss_pred eEEEEEEccCCCcceEEEECCCCCceeEEccC-CCeEeEEECCCCCcEEEEEEccCCCCEEEEEECCCCc--EEEEec--
Confidence 445555421 34677777778776666543 23334556666774 7776544 4679999986532 333332
Q ss_pred CCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCC--CCeEEEEecC
Q psy5768 400 QHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDAR--LDKIERCDYD 477 (652)
Q Consensus 400 ~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~--~~~I~~~~ld 477 (652)
.......-++.|....|.++......+.|+..+++|...+.|....- .-......+++++||++... ...|+.++++
T Consensus 231 ~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~-~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~ 309 (419)
T PRK04043 231 SQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPG-IDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLN 309 (419)
T ss_pred CCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccCCC-ccCccEECCCCCEEEEEECCCCCceEEEEECC
Confidence 22223345688887788877654445699999998887665543221 11234678889999998643 3479999999
Q ss_pred CCceEEEecCCCCceeEEEEeCCEEEEEcCCC--------CeEEEEEccCCc
Q psy5768 478 GTNRIVLSKISPLHPFDMAVYGEFIFWTDWVI--------HAVLRANKYTGE 521 (652)
Q Consensus 478 G~~~~~l~~~~~~~p~glav~~~~lYwtd~~~--------~~I~~~~k~~g~ 521 (652)
|...+.+...+...+ .+.-++++|.++.... ..|+.++..+|.
T Consensus 310 ~g~~~rlt~~g~~~~-~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~ 360 (419)
T PRK04043 310 SGSVEQVVFHGKNNS-SVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDY 360 (419)
T ss_pred CCCeEeCccCCCcCc-eECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCC
Confidence 887755554332232 4555677777765432 356666665554
No 63
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=97.59 E-value=0.054 Score=53.48 Aligned_cols=266 Identities=14% Similarity=0.176 Sum_probs=148.9
Q ss_pred cEEEEccCCcEEEEeCCCCEEEEEEcC-CCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcE--E
Q psy5768 92 HIAVDWIAQNIYWSDPKENVIEVARLT-GQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQT--I 168 (652)
Q Consensus 92 ~lavDw~~~~lY~~d~~~~~I~v~~~d-g~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~--~ 168 (652)
.+|.+. .+-+++++...+.|-..|+. |+...+-+.. -..|++|.++| .|...+||.+. .|.|.+-.--..+ .
T Consensus 66 dvapap-dG~VWft~qg~gaiGhLdP~tGev~~ypLg~-Ga~Phgiv~gp-dg~~Witd~~~--aI~R~dpkt~evt~f~ 140 (353)
T COG4257 66 DVAPAP-DGAVWFTAQGTGAIGHLDPATGEVETYPLGS-GASPHGIVVGP-DGSAWITDTGL--AIGRLDPKTLEVTRFP 140 (353)
T ss_pred ccccCC-CCceEEecCccccceecCCCCCceEEEecCC-CCCCceEEECC-CCCeeEecCcc--eeEEecCcccceEEee
Confidence 888886 56677888889999999875 4444433333 47999999999 78999999763 6777643211111 1
Q ss_pred EEee-cccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCCCCCCCCCCCC
Q psy5768 169 LAQE-IIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGTNPCGVNNGGC 247 (652)
Q Consensus 169 ~~~~-~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~n~C~~~ng~C 247 (652)
+-.+ ....-+-..+|.. ++|+++.-.|-.- .+.-.-..|+||..- | -++=
T Consensus 141 lp~~~a~~nlet~vfD~~-G~lWFt~q~G~yG------------------rLdPa~~~i~vfpaP-q---------G~gp 191 (353)
T COG4257 141 LPLEHADANLETAVFDPW-GNLWFTGQIGAYG------------------RLDPARNVISVFPAP-Q---------GGGP 191 (353)
T ss_pred cccccCCCcccceeeCCC-ccEEEeeccccce------------------ecCcccCceeeeccC-C---------CCCC
Confidence 1111 1122334455543 3444433222110 011122345566321 1 0000
Q ss_pred cccceecCCCceEEEeCCccccCCCcccccceEEEEe--eecceeEEecCCCCCCCCCceeeeeccccce-EEEEEEEcC
Q psy5768 248 AELCLYNGVSAVCACAHGVVAQDGKSCSEYDAFIMYS--RVNRIDSIHMTDKSDLNSPFESIRNSTMMKN-IIELSYDYK 324 (652)
Q Consensus 248 s~lC~~~~~~~~C~C~~G~l~~dg~~C~~~~~~Ll~s--~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~-~~~v~~D~~ 324 (652)
. -.|+-|.| .+.|+ .++.|-+|+-.++. . +.+..++.+.+ ...+-.|+.
T Consensus 192 y---------Gi~atpdG--------------svwyaslagnaiaridp~~~~--a---ev~p~P~~~~~gsRriwsdpi 243 (353)
T COG4257 192 Y---------GICATPDG--------------SVWYASLAGNAIARIDPFAGH--A---EVVPQPNALKAGSRRIWSDPI 243 (353)
T ss_pred c---------ceEECCCC--------------cEEEEeccccceEEcccccCC--c---ceecCCCcccccccccccCcc
Confidence 0 12332333 34555 46667777642221 1 12222222111 122333543
Q ss_pred CCeEEEeecccccEEEEeccCCcceE-Ee-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCC
Q psy5768 325 RKTLFYSDIQKGTINSVFFNGSNHRV-LL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHD 402 (652)
Q Consensus 325 ~~~lywsd~~~~~I~~~~~~g~~~~~-i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~ 402 (652)
+++..++...+++++++...+.=.. -+ ....+|+.|-||-.+ .++..|...+.|.+.+-... +-+++- ..-.
T Consensus 244 -g~~wittwg~g~l~rfdPs~~sW~eypLPgs~arpys~rVD~~g-rVW~sea~agai~rfdpeta---~ftv~p-~pr~ 317 (353)
T COG4257 244 -GRAWITTWGTGSLHRFDPSVTSWIEYPLPGSKARPYSMRVDRHG-RVWLSEADAGAIGRFDPETA---RFTVLP-IPRP 317 (353)
T ss_pred -CcEEEeccCCceeeEeCcccccceeeeCCCCCCCcceeeeccCC-cEEeeccccCceeecCcccc---eEEEec-CCCC
Confidence 6788888888999999866543221 12 344589999999654 45556888999998875432 334443 3345
Q ss_pred CceEEEEeCCCCEEEEEecCCCC
Q psy5768 403 KPRGIDIDSCDSRIYWTNWNSHL 425 (652)
Q Consensus 403 ~P~~Iavdp~~g~Lywtd~~~~~ 425 (652)
++.-+.+++..|.+++++.+...
T Consensus 318 n~gn~ql~gr~ge~W~~e~gvd~ 340 (353)
T COG4257 318 NSGNIQLDGRPGELWFTEAGVDA 340 (353)
T ss_pred CCCceeccCCCCceeecccCcce
Confidence 66789999999999999877653
No 64
>PRK05137 tolB translocation protein TolB; Provisional
Probab=97.56 E-value=0.021 Score=62.70 Aligned_cols=172 Identities=9% Similarity=0.024 Sum_probs=112.1
Q ss_pred CeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCC
Q psy5768 11 SKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACN 90 (652)
Q Consensus 11 ~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~ 90 (652)
..|+++|.+|.....+... -..+...++.|.+++|+++....+...|+.+++.+...+.+. .. -+...
T Consensus 182 ~~l~~~d~dg~~~~~lt~~-------~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~-~~-~g~~~--- 249 (435)
T PRK05137 182 KRLAIMDQDGANVRYLTDG-------SSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVG-NF-PGMTF--- 249 (435)
T ss_pred eEEEEECCCCCCcEEEecC-------CCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEee-cC-CCccc---
Confidence 4899999999888777642 135678889999999888762234578999999876655543 22 12233
Q ss_pred CcEEEEccCCcEEEEeC--CCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEe-cCCCCeEEEEeCCCCCcE
Q psy5768 91 LHIAVDWIAQNIYWSDP--KENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSE-SGKIPLIARAGLDGKKQT 167 (652)
Q Consensus 91 ~~lavDw~~~~lY~~d~--~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd-~~~~~~I~~~~ldg~~~~ 167 (652)
..++...+++|+++-. +...|.+.++++...+.+... .......+..|...+|+++. ....+.|++.+++|...+
T Consensus 250 -~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~~-~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~ 327 (435)
T PRK05137 250 -APRFSPDGRKVVMSLSQGGNTDIYTMDLRSGTTTRLTDS-PAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPR 327 (435)
T ss_pred -CcEECCCCCEEEEEEecCCCceEEEEECCCCceEEccCC-CCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeE
Confidence 5566665677777643 345699999887766555432 22345567788666676654 333568999999887766
Q ss_pred EEEeecccCceeEEEeccCCEEEEEeCCCC
Q psy5768 168 ILAQEIIMPIKDITLDLKFFSAFYRNLSKG 197 (652)
Q Consensus 168 ~~~~~~~~~p~gl~lD~~~~~ly~~d~~g~ 197 (652)
.+.... .........+.+++|+++..+++
T Consensus 328 ~lt~~~-~~~~~~~~SpdG~~ia~~~~~~~ 356 (435)
T PRK05137 328 RISFGG-GRYSTPVWSPRGDLIAFTKQGGG 356 (435)
T ss_pred EeecCC-CcccCeEECCCCCEEEEEEcCCC
Confidence 554322 22234567777788877765543
No 65
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.53 E-value=0.02 Score=62.96 Aligned_cols=179 Identities=12% Similarity=0.080 Sum_probs=112.7
Q ss_pred cEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCC
Q psy5768 337 TINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDS 414 (652)
Q Consensus 337 ~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g 414 (652)
+|+.++.+|...+.+...-......++.+.++.|+|+... ...|+++++.+.. .+.+.... ....+.+..|...
T Consensus 199 ~l~i~d~dG~~~~~l~~~~~~~~~p~wSPDG~~La~~s~~~g~~~L~~~dl~tg~--~~~lt~~~--g~~~~~~wSPDG~ 274 (448)
T PRK04792 199 QLMIADYDGYNEQMLLRSPEPLMSPAWSPDGRKLAYVSFENRKAEIFVQDIYTQV--REKVTSFP--GINGAPRFSPDGK 274 (448)
T ss_pred EEEEEeCCCCCceEeecCCCcccCceECCCCCEEEEEEecCCCcEEEEEECCCCC--eEEecCCC--CCcCCeeECCCCC
Confidence 4555667776666665444455667888889999887543 3468888886532 22222111 2224678999888
Q ss_pred EEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeC--CCCeEEEEecCCCceEEEecCCCCce
Q psy5768 415 RIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDA--RLDKIERCDYDGTNRIVLSKISPLHP 492 (652)
Q Consensus 415 ~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~--~~~~I~~~~ldG~~~~~l~~~~~~~p 492 (652)
.|+++........|+..++++...+.+.. ........++.+++++|+++-. +...|+.+++++...+.+.... ...
T Consensus 275 ~La~~~~~~g~~~Iy~~dl~tg~~~~lt~-~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g-~~~ 352 (448)
T PRK04792 275 KLALVLSKDGQPEIYVVDIATKALTRITR-HRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFEG-EQN 352 (448)
T ss_pred EEEEEEeCCCCeEEEEEECCCCCeEECcc-CCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecCC-CCC
Confidence 88886433334589999998876655443 2223456678888888988643 3357999999876655554322 222
Q ss_pred eEEEE--eCCEEEEEcCCCC--eEEEEEccCCc
Q psy5768 493 FDMAV--YGEFIFWTDWVIH--AVLRANKYTGE 521 (652)
Q Consensus 493 ~glav--~~~~lYwtd~~~~--~I~~~~k~~g~ 521 (652)
.+.++ ++++||++....+ .|+.++..+|.
T Consensus 353 ~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~ 385 (448)
T PRK04792 353 LGGSITPDGRSMIMVNRTNGKFNIARQDLETGA 385 (448)
T ss_pred cCeeECCCCCEEEEEEecCCceEEEEEECCCCC
Confidence 33343 4678888765443 57777776654
No 66
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=97.50 E-value=0.00095 Score=55.10 Aligned_cols=70 Identities=17% Similarity=0.180 Sum_probs=57.3
Q ss_pred EEEEEcCCCeEEEeecc-----------------cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeE
Q psy5768 318 ELSYDYKRKTLFYSDIQ-----------------KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATI 380 (652)
Q Consensus 318 ~v~~D~~~~~lywsd~~-----------------~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I 380 (652)
++|++..++.||++|.. +|++.+.++.....++++.++.-|.|+|+...+..|.+++....+|
T Consensus 2 dldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~~L~fpNGVals~d~~~vlv~Et~~~Ri 81 (89)
T PF03088_consen 2 DLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLDGLYFPNGVALSPDESFVLVAETGRYRI 81 (89)
T ss_dssp EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEEEESSEEEEEE-TTSSEEEEEEGGGTEE
T ss_pred ceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehhCCCccCeEEEcCCCCEEEEEeccCceE
Confidence 47788888999999852 3889999998877778889999999999999999999999999999
Q ss_pred EEEEcCC
Q psy5768 381 NKIDLDS 387 (652)
Q Consensus 381 ~~~~~~~ 387 (652)
.+.-+.|
T Consensus 82 ~rywl~G 88 (89)
T PF03088_consen 82 LRYWLKG 88 (89)
T ss_dssp EEEESSS
T ss_pred EEEEEeC
Confidence 9988766
No 67
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.47 E-value=0.1 Score=56.93 Aligned_cols=202 Identities=9% Similarity=0.030 Sum_probs=117.8
Q ss_pred EEEcCCCe--E-EEeecc-cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEE--EcCCCC-C
Q psy5768 320 SYDYKRKT--L-FYSDIQ-KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKI--DLDSPK-A 390 (652)
Q Consensus 320 ~~D~~~~~--l-ywsd~~-~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~--~~~~~~-~ 390 (652)
.+.+..++ + |.+... ...|+..+++|...+.+...-+.....++.+.++.|.++... ...|.+. ++.... +
T Consensus 191 ~wSPDG~~~~~~y~S~~~g~~~I~~~~l~~g~~~~lt~~~g~~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~~~~g~~g 270 (428)
T PRK01029 191 TWMHIGSGFPYLYVSYKLGVPKIFLGSLENPAGKKILALQGNQLMPTFSPRKKLLAFISDRYGNPDLFIQSFSLETGAIG 270 (428)
T ss_pred eEccCCCceEEEEEEccCCCceEEEEECCCCCceEeecCCCCccceEECCCCCEEEEEECCCCCcceeEEEeecccCCCC
Confidence 45555443 3 344432 356999999887766665333344566888889999887643 2345554 333211 1
Q ss_pred ccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCC-ceEEEEcCCCCCceEEEecCCCEEEEEeCC--
Q psy5768 391 QRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFG-TESIITTDITMPNALALDHQAEKLFWGDAR-- 467 (652)
Q Consensus 391 ~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~-~~~l~~~~l~~P~glaiD~~~~~LYw~D~~-- 467 (652)
..+.+.. .........+..|...+|+++......+.|+++.+++.. ....+...-......+..+++++|+++...
T Consensus 271 ~~~~lt~-~~~~~~~~p~wSPDG~~Laf~s~~~g~~~ly~~~~~~~g~~~~~lt~~~~~~~~p~wSPDG~~Laf~~~~~g 349 (428)
T PRK01029 271 KPRRLLN-EAFGTQGNPSFSPDGTRLVFVSNKDGRPRIYIMQIDPEGQSPRLLTKKYRNSSCPAWSPDGKKIAFCSVIKG 349 (428)
T ss_pred cceEeec-CCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECcccccceEEeccCCCCccceeECCCCCEEEEEEcCCC
Confidence 1222221 222333467889987777776533334589998887533 222222222233567788889999987543
Q ss_pred CCeEEEEecCCCceEEEecCCCCceeEEEE--eCCEEEEEcC--CCCeEEEEEccCCceE
Q psy5768 468 LDKIERCDYDGTNRIVLSKISPLHPFDMAV--YGEFIFWTDW--VIHAVLRANKYTGEEV 523 (652)
Q Consensus 468 ~~~I~~~~ldG~~~~~l~~~~~~~p~glav--~~~~lYwtd~--~~~~I~~~~k~~g~~~ 523 (652)
...|..+++++...+.+.... ....+.++ ++.+||++.. ....|+.++..+++..
T Consensus 350 ~~~I~v~dl~~g~~~~Lt~~~-~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~~ 408 (428)
T PRK01029 350 VRQICVYDLATGRDYQLTTSP-ENKESPSWAIDSLHLVYSAGNSNESELYLISLITKKTR 408 (428)
T ss_pred CcEEEEEECCCCCeEEccCCC-CCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEE
Confidence 357999999988777765432 23334444 3567876533 3456888887666543
No 68
>PRK03629 tolB translocation protein TolB; Provisional
Probab=97.45 E-value=0.034 Score=60.87 Aligned_cols=171 Identities=12% Similarity=0.047 Sum_probs=111.6
Q ss_pred CeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCC
Q psy5768 11 SKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACN 90 (652)
Q Consensus 11 ~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~ 90 (652)
..|.++|.+|.....+... -......+++|.+.+|.|+....+...|+..++++...+.+.... ....
T Consensus 179 ~~l~~~d~dg~~~~~lt~~-------~~~~~~p~wSPDG~~la~~s~~~g~~~i~i~dl~~G~~~~l~~~~--~~~~--- 246 (429)
T PRK03629 179 YELRVSDYDGYNQFVVHRS-------PQPLMSPAWSPDGSKLAYVTFESGRSALVIQTLANGAVRQVASFP--RHNG--- 246 (429)
T ss_pred eeEEEEcCCCCCCEEeecC-------CCceeeeEEcCCCCEEEEEEecCCCcEEEEEECCCCCeEEccCCC--CCcC---
Confidence 3799999999887777532 135677889999998887642234568999998876655553221 2234
Q ss_pred CcEEEEccCCcEEEEeC--CCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEE-ecCCCCeEEEEeCCCCCcE
Q psy5768 91 LHIAVDWIAQNIYWSDP--KENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWS-ESGKIPLIARAGLDGKKQT 167 (652)
Q Consensus 91 ~~lavDw~~~~lY~~d~--~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywt-d~~~~~~I~~~~ldg~~~~ 167 (652)
.+++.+.++.|+++.. +...|.+.++++...+.+.... ......+..|...+|+++ +.+..+.|++.++++....
T Consensus 247 -~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~~-~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~ 324 (429)
T PRK03629 247 -APAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDGR-SNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQ 324 (429)
T ss_pred -CeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCCC-CCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCCeE
Confidence 5677777788988743 3346989998876555554432 345677888876667554 4433568999998877665
Q ss_pred EEEeecccCceeEEEeccCCEEEEEeCCC
Q psy5768 168 ILAQEIIMPIKDITLDLKFFSAFYRNLSK 196 (652)
Q Consensus 168 ~~~~~~~~~p~gl~lD~~~~~ly~~d~~g 196 (652)
.+.... ......++.+.+++|+++..++
T Consensus 325 ~lt~~~-~~~~~~~~SpDG~~Ia~~~~~~ 352 (429)
T PRK03629 325 RITWEG-SQNQDADVSSDGKFMVMVSSNG 352 (429)
T ss_pred EeecCC-CCccCEEECCCCCEEEEEEccC
Confidence 543322 2234566777777777765543
No 69
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.43 E-value=0.036 Score=60.29 Aligned_cols=179 Identities=11% Similarity=0.035 Sum_probs=110.5
Q ss_pred ccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCC--CeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCC
Q psy5768 336 GTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNND--ATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCD 413 (652)
Q Consensus 336 ~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~--~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~ 413 (652)
..|+..+.+|...+.+...-......++.+.++.|+|+.... ..|.+.++.+. ....+. .......+++..|..
T Consensus 170 ~~l~~~d~~g~~~~~l~~~~~~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g---~~~~~~-~~~~~~~~~~~spDg 245 (417)
T TIGR02800 170 YELQVADYDGANPQTITRSREPILSPAWSPDGQKLAYVSFESGKPEIYVQDLATG---QREKVA-SFPGMNGAPAFSPDG 245 (417)
T ss_pred ceEEEEcCCCCCCEEeecCCCceecccCCCCCCEEEEEEcCCCCcEEEEEECCCC---CEEEee-cCCCCccceEECCCC
Confidence 346666666665555553333455667888899999987543 56888887642 122222 122344568899987
Q ss_pred CEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeC--CCCeEEEEecCCCceEEEecCCCCc
Q psy5768 414 SRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDA--RLDKIERCDYDGTNRIVLSKISPLH 491 (652)
Q Consensus 414 g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~--~~~~I~~~~ldG~~~~~l~~~~~~~ 491 (652)
..|+++........|+..++++...+.+.... ......++.+++++|+++.. +...|+.+++++...+.+.... ..
T Consensus 246 ~~l~~~~~~~~~~~i~~~d~~~~~~~~l~~~~-~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~~-~~ 323 (417)
T TIGR02800 246 SKLAVSLSKDGNPDIYVMDLDGKQLTRLTNGP-GIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFRG-GY 323 (417)
T ss_pred CEEEEEECCCCCccEEEEECCCCCEEECCCCC-CCCCCEEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCC-CC
Confidence 78888754433458999998876655543321 22234566777888988743 3347999999887766554332 23
Q ss_pred eeEEEE--eCCEEEEEcCCC--CeEEEEEccCC
Q psy5768 492 PFDMAV--YGEFIFWTDWVI--HAVLRANKYTG 520 (652)
Q Consensus 492 p~glav--~~~~lYwtd~~~--~~I~~~~k~~g 520 (652)
...+++ ++.+|+++.... ..|+.++..++
T Consensus 324 ~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~ 356 (417)
T TIGR02800 324 NASPSWSPDGDLIAFVHREGGGFNIAVMDLDGG 356 (417)
T ss_pred ccCeEECCCCCEEEEEEccCCceEEEEEeCCCC
Confidence 334444 466888887543 35777776554
No 70
>PRK04922 tolB translocation protein TolB; Provisional
Probab=97.36 E-value=0.038 Score=60.53 Aligned_cols=179 Identities=13% Similarity=0.075 Sum_probs=110.6
Q ss_pred cEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCC
Q psy5768 337 TINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDS 414 (652)
Q Consensus 337 ~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g 414 (652)
.|+.++.+|.....+..+-....+.++.+.++.|+++... ...|.+.++.+.. .+.+... .....+.++.|...
T Consensus 185 ~l~i~D~~g~~~~~lt~~~~~v~~p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~--~~~l~~~--~g~~~~~~~SpDG~ 260 (433)
T PRK04922 185 ALQVADSDGYNPQTILRSAEPILSPAWSPDGKKLAYVSFERGRSAIYVQDLATGQ--RELVASF--RGINGAPSFSPDGR 260 (433)
T ss_pred EEEEECCCCCCceEeecCCCccccccCCCCCCEEEEEecCCCCcEEEEEECCCCC--EEEeccC--CCCccCceECCCCC
Confidence 3555566666555555444456677888889999988643 3468888886532 2222211 12334678999888
Q ss_pred EEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCC--CCeEEEEecCCCceEEEecCCCCce
Q psy5768 415 RIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDAR--LDKIERCDYDGTNRIVLSKISPLHP 492 (652)
Q Consensus 415 ~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~--~~~I~~~~ldG~~~~~l~~~~~~~p 492 (652)
+|+++-.....+.|+..++++...+.+... .......++.+++++|+++... ...|+.+++++...+.+.... ...
T Consensus 261 ~l~~~~s~~g~~~Iy~~d~~~g~~~~lt~~-~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g-~~~ 338 (433)
T PRK04922 261 RLALTLSRDGNPEIYVMDLGSRQLTRLTNH-FGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQG-NYN 338 (433)
T ss_pred EEEEEEeCCCCceEEEEECCCCCeEECccC-CCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCC-CCc
Confidence 888775433345899999988765554332 2223467888888888877432 346999999876665554322 222
Q ss_pred eEEEE--eCCEEEEEcCCC--CeEEEEEccCCc
Q psy5768 493 FDMAV--YGEFIFWTDWVI--HAVLRANKYTGE 521 (652)
Q Consensus 493 ~glav--~~~~lYwtd~~~--~~I~~~~k~~g~ 521 (652)
..+++ ++++|+++.... ..|+..+..+|.
T Consensus 339 ~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~ 371 (433)
T PRK04922 339 ARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGS 371 (433)
T ss_pred cCEEECCCCCEEEEEECCCCceeEEEEECCCCC
Confidence 23444 467888875433 257777765554
No 71
>PRK02889 tolB translocation protein TolB; Provisional
Probab=97.35 E-value=0.045 Score=59.88 Aligned_cols=181 Identities=10% Similarity=0.050 Sum_probs=109.0
Q ss_pred ccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCC
Q psy5768 336 GTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCD 413 (652)
Q Consensus 336 ~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~ 413 (652)
..|+.++.+|...+.+...-......++.+.++.|+++... ...|++.++.+.. ...+. .......+.+..|..
T Consensus 176 ~~L~~~D~dG~~~~~l~~~~~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~---~~~l~-~~~g~~~~~~~SPDG 251 (427)
T PRK02889 176 YQLQISDADGQNAQSALSSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGR---RRVVA-NFKGSNSAPAWSPDG 251 (427)
T ss_pred cEEEEECCCCCCceEeccCCCCcccceEcCCCCEEEEEEccCCCcEEEEEECCCCC---EEEee-cCCCCccceEECCCC
Confidence 34666666666555554333345567888888898887643 3468888886532 22222 111334578899988
Q ss_pred CEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeC--CCCeEEEEecCCCceEEEecCC-CC
Q psy5768 414 SRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDA--RLDKIERCDYDGTNRIVLSKIS-PL 490 (652)
Q Consensus 414 g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~--~~~~I~~~~ldG~~~~~l~~~~-~~ 490 (652)
..|+++-.......|+..+++|...+.+.... ......+..+++++|+++.. +...|+.+++++...+.+.... ..
T Consensus 252 ~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~-~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~g~~~ 330 (427)
T PRK02889 252 RTLAVALSRDGNSQIYTVNADGSGLRRLTQSS-GIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFTGSYN 330 (427)
T ss_pred CEEEEEEccCCCceEEEEECCCCCcEECCCCC-CCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecCCCCc
Confidence 88887643333468999999887765553321 22345678888888887643 3457888888876655443222 11
Q ss_pred ceeEEEEeCCEEEEEcCCCC--eEEEEEccCCc
Q psy5768 491 HPFDMAVYGEFIFWTDWVIH--AVLRANKYTGE 521 (652)
Q Consensus 491 ~p~glav~~~~lYwtd~~~~--~I~~~~k~~g~ 521 (652)
..-.+.-++.+|+++....+ .|+..+..+|+
T Consensus 331 ~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~ 363 (427)
T PRK02889 331 TSPRISPDGKLLAYISRVGGAFKLYVQDLATGQ 363 (427)
T ss_pred CceEECCCCCEEEEEEccCCcEEEEEEECCCCC
Confidence 11123335778877654332 57777765554
No 72
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.27 E-value=0.066 Score=58.61 Aligned_cols=180 Identities=10% Similarity=0.091 Sum_probs=109.7
Q ss_pred cEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCC
Q psy5768 337 TINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDS 414 (652)
Q Consensus 337 ~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g 414 (652)
+|+.++.+|...+.+...-......++.+.++.|+++... ...|++.++.+.. .+.+... .....+.+..|...
T Consensus 180 ~l~~~d~~g~~~~~l~~~~~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~--~~~l~~~--~g~~~~~~~SpDG~ 255 (430)
T PRK00178 180 TLQRSDYDGARAVTLLQSREPILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGR--REQITNF--EGLNGAPAWSPDGS 255 (430)
T ss_pred EEEEECCCCCCceEEecCCCceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCC--EEEccCC--CCCcCCeEECCCCC
Confidence 3555566666555555443345667888889998776543 3468888886532 2332211 12234678899888
Q ss_pred EEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCC--CCeEEEEecCCCceEEEecCC-CCc
Q psy5768 415 RIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDAR--LDKIERCDYDGTNRIVLSKIS-PLH 491 (652)
Q Consensus 415 ~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~--~~~I~~~~ldG~~~~~l~~~~-~~~ 491 (652)
+|+++-.....+.|+..++++...+.+... -......+..+++++||+.-.. ...|+.+++++...+.+.... ...
T Consensus 256 ~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~-~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~~~~~ 334 (430)
T PRK00178 256 KLAFVLSKDGNPEIYVMDLASRQLSRVTNH-PAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVGNYNA 334 (430)
T ss_pred EEEEEEccCCCceEEEEECCCCCeEEcccC-CCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCCCCcc
Confidence 888765433345899999988766554432 2233456777888888887432 347999999877665554322 111
Q ss_pred eeEEEEeCCEEEEEcCCCC--eEEEEEccCCc
Q psy5768 492 PFDMAVYGEFIFWTDWVIH--AVLRANKYTGE 521 (652)
Q Consensus 492 p~glav~~~~lYwtd~~~~--~I~~~~k~~g~ 521 (652)
.-.+.-++++|+++....+ .|+.++..+|.
T Consensus 335 ~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~ 366 (430)
T PRK00178 335 RPRLSADGKTLVMVHRQDGNFHVAAQDLQRGS 366 (430)
T ss_pred ceEECCCCCEEEEEEccCCceEEEEEECCCCC
Confidence 1223335678888765433 57777776654
No 73
>PRK01742 tolB translocation protein TolB; Provisional
Probab=97.25 E-value=0.062 Score=58.80 Aligned_cols=201 Identities=14% Similarity=0.125 Sum_probs=117.5
Q ss_pred EEEEEEEcCCCeEEEeecc--cccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCC--eEEEEEcCCCCCc
Q psy5768 316 IIELSYDYKRKTLFYSDIQ--KGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDA--TINKIDLDSPKAQ 391 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~--~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~--~I~~~~~~~~~~~ 391 (652)
+..+.+.+..++|.++... ...|+..++.+...+.+....+....+++.+.++.|+++....+ .|+.+++++..
T Consensus 206 v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~-- 283 (429)
T PRK01742 206 LMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKVVASFRGHNGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGT-- 283 (429)
T ss_pred cccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEEEecCCCccCceeECCCCCEEEEEEecCCcEEEEEEECCCCC--
Confidence 4567788888888877543 34688888876554444422223346788888999988754333 57777775532
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeE
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKI 471 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I 471 (652)
.+. +. .......+.+..|...+|+++......+.|++...+|...+.+ ... . ...++.++++.|+.+.. +.|
T Consensus 284 ~~~-lt-~~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~~~-~--~~~~~SpDG~~ia~~~~--~~i 355 (429)
T PRK01742 284 PSQ-LT-SGAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGASLV-GGR-G--YSAQISADGKTLVMING--DNV 355 (429)
T ss_pred eEe-ec-cCCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCeEEe-cCC-C--CCccCCCCCCEEEEEcC--CCE
Confidence 222 22 2233456788999877777764333346999988887765443 221 1 23567778888888754 567
Q ss_pred EEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCC-CeEEEEEccCCceEEEE
Q psy5768 472 ERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVI-HAVLRANKYTGEEVYTL 526 (652)
Q Consensus 472 ~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~-~~I~~~~k~~g~~~~~~ 526 (652)
..+++.+...+.+.........+.+-++.+|+++.... ..++.+.-.+|+..+.+
T Consensus 356 ~~~Dl~~g~~~~lt~~~~~~~~~~sPdG~~i~~~s~~g~~~~l~~~~~~G~~~~~l 411 (429)
T PRK01742 356 VKQDLTSGSTEVLSSTFLDESPSISPNGIMIIYSSTQGLGKVLQLVSADGRFKARL 411 (429)
T ss_pred EEEECCCCCeEEecCCCCCCCceECCCCCEEEEEEcCCCceEEEEEECCCCceEEc
Confidence 77888766554443322111112223456777765432 23333322355554444
No 74
>PRK02888 nitrous-oxide reductase; Validated
Probab=97.18 E-value=0.014 Score=64.73 Aligned_cols=185 Identities=11% Similarity=0.068 Sum_probs=111.5
Q ss_pred CCCeEEEeecccccEEEEeccCCcceEEe--eccCceeeeEEE-------------------ccCCEEEEEeCCCCeEEE
Q psy5768 324 KRKTLFYSDIQKGTINSVFFNGSNHRVLL--ERQGSVEGLAYE-------------------YVHNYLYWTCNNDATINK 382 (652)
Q Consensus 324 ~~~~lywsd~~~~~I~~~~~~g~~~~~i~--~~~~~~~glAvD-------------------w~~~~LYwtd~~~~~I~~ 382 (652)
..+.||.-|..+.+|-|++++--....|+ .......|+++. ..++.|+-++...+.+.+
T Consensus 140 dGr~~findk~n~Rvari~l~~~~~~~i~~iPn~~~~Hg~~~~~~p~t~yv~~~~e~~~PlpnDGk~l~~~~ey~~~vSv 219 (635)
T PRK02888 140 DGRYLFINDKANTRVARIRLDVMKCDKITELPNVQGIHGLRPQKIPRTGYVFCNGEFRIPLPNDGKDLDDPKKYRSLFTA 219 (635)
T ss_pred ceeEEEEecCCCcceEEEECccEeeceeEeCCCccCccccCccccCCccEEEeCcccccccCCCCCEeecccceeEEEEE
Confidence 35778999999999999998864444444 555566666665 234445545445566777
Q ss_pred EEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEE
Q psy5768 383 IDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLF 462 (652)
Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LY 462 (652)
++.... +.+-......+|+.++++|..+++|+|..+...+. ....|+-..+..++.-+ ++...++-..++..|
T Consensus 220 ID~etm----eV~~qV~Vdgnpd~v~~spdGk~afvTsyNsE~G~-tl~em~a~e~d~~vvfn--i~~iea~vkdGK~~~ 292 (635)
T PRK02888 220 VDAETM----EVAWQVMVDGNLDNVDTDYDGKYAFSTCYNSEEGV-TLAEMMAAERDWVVVFN--IARIEEAVKAGKFKT 292 (635)
T ss_pred EECccc----eEEEEEEeCCCcccceECCCCCEEEEeccCcccCc-ceeeeccccCceEEEEc--hHHHHHhhhCCCEEE
Confidence 776543 22222233469999999999999999975433211 22333333332333222 222122222355666
Q ss_pred EEeCCCCeEEEEecCC-----CceEEEecCCCCceeEEEEe--CCEEEEEcCCCCeEEEEEccC
Q psy5768 463 WGDARLDKIERCDYDG-----TNRIVLSKISPLHPFDMAVY--GEFIFWTDWVIHAVLRANKYT 519 (652)
Q Consensus 463 w~D~~~~~I~~~~ldG-----~~~~~l~~~~~~~p~glav~--~~~lYwtd~~~~~I~~~~k~~ 519 (652)
+. .+++..+|... ......+.. ...|.|+++. +.++|.+...++.|..++..+
T Consensus 293 V~---gn~V~VID~~t~~~~~~~v~~yIPV-GKsPHGV~vSPDGkylyVanklS~tVSVIDv~k 352 (635)
T PRK02888 293 IG---GSKVPVVDGRKAANAGSALTRYVPV-PKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRK 352 (635)
T ss_pred EC---CCEEEEEECCccccCCcceEEEEEC-CCCccceEECCCCCEEEEeCCCCCcEEEEEChh
Confidence 52 35666666543 222222332 3789999986 679999999999988887654
No 75
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.17 E-value=0.00034 Score=49.17 Aligned_cols=39 Identities=26% Similarity=0.673 Sum_probs=32.2
Q ss_pred CCCCCCCCCCCCc--ccceecCCCceEEEeCCcc-ccCCCcc
Q psy5768 236 GTNPCGVNNGGCA--ELCLYNGVSAVCACAHGVV-AQDGKSC 274 (652)
Q Consensus 236 ~~n~C~~~ng~Cs--~lC~~~~~~~~C~C~~G~l-~~dg~~C 274 (652)
++|||......|. +.|+...++|+|.|+.||. ..++++|
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTTC 42 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCcC
Confidence 4689998878897 7999999899999999995 4566655
No 76
>PRK01742 tolB translocation protein TolB; Provisional
Probab=97.16 E-value=0.064 Score=58.72 Aligned_cols=178 Identities=11% Similarity=0.087 Sum_probs=109.5
Q ss_pred ccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCC--CCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCC
Q psy5768 336 GTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNN--DATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCD 413 (652)
Q Consensus 336 ~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~--~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~ 413 (652)
..|+..+.+|...+.+..+.......++.+.++.|+++... ...|.+.++.+.. .+.+.... ..-.+++..|..
T Consensus 184 ~~i~i~d~dg~~~~~lt~~~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~--~~~l~~~~--g~~~~~~wSPDG 259 (429)
T PRK01742 184 YEVRVADYDGFNQFIVNRSSQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGA--RKVVASFR--GHNGAPAFSPDG 259 (429)
T ss_pred EEEEEECCCCCCceEeccCCCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCc--eEEEecCC--CccCceeECCCC
Confidence 35666666776655555444456778899999999887543 3468888875422 23332222 223468899988
Q ss_pred CEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeC--CCCeEEEEecCCCceEEEecCCCCc
Q psy5768 414 SRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDA--RLDKIERCDYDGTNRIVLSKISPLH 491 (652)
Q Consensus 414 g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~--~~~~I~~~~ldG~~~~~l~~~~~~~ 491 (652)
.+|+++........|+..++++...+.+.... ......++.+++++|+++-. +...|+.++.+|...+.+ ... ..
T Consensus 260 ~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~~-~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~~~-~~ 336 (429)
T PRK01742 260 SRLAFASSKDGVLNIYVMGANGGTPSQLTSGA-GNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGASLV-GGR-GY 336 (429)
T ss_pred CEEEEEEecCCcEEEEEEECCCCCeEeeccCC-CCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCeEEe-cCC-CC
Confidence 88888754333357898888877665554322 23457888888888887643 345788888888766554 211 11
Q ss_pred eeEEEEeCCEEEEEcCCCCeEEEEEccCCce
Q psy5768 492 PFDMAVYGEFIFWTDWVIHAVLRANKYTGEE 522 (652)
Q Consensus 492 p~glav~~~~lYwtd~~~~~I~~~~k~~g~~ 522 (652)
...+.-++++|+.+.. ..+++.+..+|+.
T Consensus 337 ~~~~SpDG~~ia~~~~--~~i~~~Dl~~g~~ 365 (429)
T PRK01742 337 SAQISADGKTLVMING--DNVVKQDLTSGST 365 (429)
T ss_pred CccCCCCCCEEEEEcC--CCEEEEECCCCCe
Confidence 1112224567777653 4566677666653
No 77
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.04 E-value=0.13 Score=51.01 Aligned_cols=176 Identities=18% Similarity=0.105 Sum_probs=105.2
Q ss_pred EEEecCCCCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCC
Q psy5768 3 IAVSSPTQSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQK 81 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~ 81 (652)
++++....+.|.++++. ++....+.. ....+..+++++.+..++... ..+.|..+++........+...
T Consensus 107 ~~~~~~~~~~i~~~~~~~~~~~~~~~~-------~~~~i~~~~~~~~~~~l~~~~---~~~~i~i~d~~~~~~~~~~~~~ 176 (289)
T cd00200 107 ILSSSSRDKTIKVWDVETGKCLTTLRG-------HTDWVNSVAFSPDGTFVASSS---QDGTIKLWDLRTGKCVATLTGH 176 (289)
T ss_pred EEEEecCCCeEEEEECCCcEEEEEecc-------CCCcEEEEEEcCcCCEEEEEc---CCCcEEEEEccccccceeEecC
Confidence 44554457888889887 444444432 224689999998866655443 4678888887643322233333
Q ss_pred CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeC
Q psy5768 82 KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGL 161 (652)
Q Consensus 82 ~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~l 161 (652)
...+. .++++..++.++.+.. .+.|.+.++........+......+..++.+|. +.++.+-. ..+.|...++
T Consensus 177 -~~~i~----~~~~~~~~~~l~~~~~-~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~-~~~~i~i~~~ 248 (289)
T cd00200 177 -TGEVN----SVAFSPDGEKLLSSSS-DGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPD-GYLLASGS-EDGTIRVWDL 248 (289)
T ss_pred -ccccc----eEEECCCcCEEEEecC-CCcEEEEECCCCceecchhhcCCceEEEEEcCC-CcEEEEEc-CCCcEEEEEc
Confidence 33466 7787765556666654 788988888753333323222346788999986 66655543 2456777776
Q ss_pred CCCCcEEEEeecccCceeEEEeccCCEEEEEeCCC
Q psy5768 162 DGKKQTILAQEIIMPIKDITLDLKFFSAFYRNLSK 196 (652)
Q Consensus 162 dg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~g 196 (652)
........+...-..+..+++++.+..|+....+|
T Consensus 249 ~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~d~ 283 (289)
T cd00200 249 RTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADG 283 (289)
T ss_pred CCceeEEEccccCCcEEEEEECCCCCEEEEecCCC
Confidence 63332222222223567888887766776665553
No 78
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=97.04 E-value=0.00042 Score=64.11 Aligned_cols=53 Identities=42% Similarity=0.727 Sum_probs=41.9
Q ss_pred CCCceeeccCe--ecCCccCCCCCCCCCCCCCCCCCCCCCCCCCCCeeecCCC----CcccCC
Q psy5768 594 SEHDFKCSDGM--CIPFNQTCDRVYNCHDKSDEGILYCAMRDCRPGYFKCDNN----KCILSS 650 (652)
Q Consensus 594 ~~~~f~C~~g~--Ci~~~~~Cd~~~dC~d~sde~~~~C~~~~C~~~~f~C~~~----~Ci~~~ 650 (652)
.++.|.|.+|. =|+.....|+..||+|||||.+ ...|+.+.|.|.|. +-||.+
T Consensus 34 ~~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPG----TsAC~~~~FyC~N~g~~p~~i~~s 92 (176)
T PF12999_consen 34 ENGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPG----TSACSNGKFYCENKGHIPRYIPSS 92 (176)
T ss_pred CCCceEecCCCCceecHHHccCcceeCCCCCCccc----cccCcCceEeeccCCCCCceeehh
Confidence 34579999873 3899999999999999999975 34578889999873 456654
No 79
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.95 E-value=0.47 Score=46.87 Aligned_cols=202 Identities=14% Similarity=0.060 Sum_probs=119.0
Q ss_pred EEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEE
Q psy5768 316 IIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIV 394 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~ 394 (652)
+..+.+.+..+.|+.+.. .+.|+..+++.......+ .....+..+++... +.++.+....+.|.+.++... +.
T Consensus 54 i~~~~~~~~~~~l~~~~~-~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~----~~ 127 (289)
T cd00200 54 VRDVAASADGTYLASGSS-DKTIRLWDLETGECVRTLTGHTSYVSSVAFSPD-GRILSSSSRDKTIKVWDVETG----KC 127 (289)
T ss_pred eeEEEECCCCCEEEEEcC-CCeEEEEEcCcccceEEEeccCCcEEEEEEcCC-CCEEEEecCCCeEEEEECCCc----EE
Confidence 447777776656665554 577887777653222222 33336777887766 455556555788888887632 22
Q ss_pred EEEe-CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEE
Q psy5768 395 VVRL-GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIER 473 (652)
Q Consensus 395 ~~~~-~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~ 473 (652)
+... .....+..++++|...+|+..... ..|...++........+...-.....+++++.++.|+.+.. .+.|..
T Consensus 128 ~~~~~~~~~~i~~~~~~~~~~~l~~~~~~---~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~-~~~i~i 203 (289)
T cd00200 128 LTTLRGHTDWVNSVAFSPDGTFVASSSQD---GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSS-DGTIKL 203 (289)
T ss_pred EEEeccCCCcEEEEEEcCcCCEEEEEcCC---CcEEEEEccccccceeEecCccccceEEECCCcCEEEEecC-CCcEEE
Confidence 2222 233568899999986666655422 26666666533332333323335678999888778877765 677888
Q ss_pred EecCCCceEEEecCCCCceeEEEEeC-CEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 474 CDYDGTNRIVLSKISPLHPFDMAVYG-EFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 474 ~~ldG~~~~~l~~~~~~~p~glav~~-~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
.++...............+.++++.. +.++.+....+.|...+..+++....+.
T Consensus 204 ~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~ 258 (289)
T cd00200 204 WDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS 258 (289)
T ss_pred EECCCCceecchhhcCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCCceeEEEcc
Confidence 88875443333322223556777764 4555555456677766665555444443
No 80
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.90 E-value=0.0005 Score=41.41 Aligned_cols=22 Identities=41% Similarity=1.001 Sum_probs=18.6
Q ss_pred CceEEEeCCc-cccCCCcccccc
Q psy5768 257 SAVCACAHGV-VAQDGKSCSEYD 278 (652)
Q Consensus 257 ~~~C~C~~G~-l~~dg~~C~~~~ 278 (652)
+|+|.|+.|| |.+|+++|.+.+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDID 23 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCC
Confidence 5899999998 668999998643
No 81
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=96.89 E-value=0.39 Score=50.77 Aligned_cols=142 Identities=16% Similarity=0.195 Sum_probs=79.2
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEEC----CCCE---EEEEeccCC--cceEEEEEcCC--
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWP----VKGK---MFWSNVTKQ--VVTIEMAFMDG-- 71 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~----~~~~---lyw~d~~~~--~~~I~~~~~dg-- 71 (652)
++++.-....++++|++|+.+..+.. .++-.||+-+ .+++ +..+| .. .++|..+.+|+
T Consensus 70 lIigTdK~~GL~VYdL~Gk~lq~~~~---------Gr~NNVDvrygf~l~g~~vDlavas~--R~~g~n~l~~f~id~~~ 138 (381)
T PF02333_consen 70 LIIGTDKKGGLYVYDLDGKELQSLPV---------GRPNNVDVRYGFPLNGKTVDLAVASD--RSDGRNSLRLFRIDPDT 138 (381)
T ss_dssp EEEEEETTTEEEEEETTS-EEEEE-S---------S-EEEEEEEEEEEETTEEEEEEEEEE---CCCT-EEEEEEEETTT
T ss_pred eEEEEeCCCCEEEEcCCCcEEEeecC---------CCcceeeeecceecCCceEEEEEEec--CcCCCCeEEEEEecCCC
Confidence 45555567899999999998876643 2333333321 2222 35566 43 24444444443
Q ss_pred CccEEEE------eCCCcCCccCCCCcEEE--EccCCcEEE-EeCCCCEEEEEEc----CCCcEEEEEeC--CCCCceeE
Q psy5768 72 TKRETVV------SQKKYPAVTACNLHIAV--DWIAQNIYW-SDPKENVIEVARL----TGQYRYVLISG--GVDQPSAL 136 (652)
Q Consensus 72 s~~~~v~------~~~~~~~p~~~~~~lav--Dw~~~~lY~-~d~~~~~I~v~~~----dg~~~~~l~~~--~~~~P~~i 136 (652)
.....+. ... +..|. |+++ +..++.+|. +....+.++...+ +|...-.++.. .-.+|.++
T Consensus 139 g~L~~v~~~~~p~~~~-~~e~y----Glcly~~~~~g~~ya~v~~k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGC 213 (381)
T PF02333_consen 139 GELTDVTDPAAPIATD-LSEPY----GLCLYRSPSTGALYAFVNGKDGRVEQYELTDDGDGKVSATLVREFKVGSQPEGC 213 (381)
T ss_dssp TEEEE-CBTTC-EE-S-SSSEE----EEEEEE-TTT--EEEEEEETTSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEE
T ss_pred CcceEcCCCCcccccc-cccce----eeEEeecCCCCcEEEEEecCCceEEEEEEEeCCCCcEeeEEEEEecCCCcceEE
Confidence 2222221 233 45567 8888 556778885 3445576666544 34333233332 23589999
Q ss_pred EEcCCCCeEEEEecCCCCeEEEEeCC
Q psy5768 137 AVDPESGYLFWSESGKIPLIARAGLD 162 (652)
Q Consensus 137 avd~~~g~lywtd~~~~~~I~~~~ld 162 (652)
++|...|+||..+- ...||+...+
T Consensus 214 VVDDe~g~LYvgEE--~~GIW~y~Ae 237 (381)
T PF02333_consen 214 VVDDETGRLYVGEE--DVGIWRYDAE 237 (381)
T ss_dssp EEETTTTEEEEEET--TTEEEEEESS
T ss_pred EEecccCCEEEecC--ccEEEEEecC
Confidence 99999999999985 4689999876
No 82
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes and uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=96.85 E-value=0.18 Score=50.93 Aligned_cols=177 Identities=15% Similarity=0.154 Sum_probs=108.5
Q ss_pred eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcC-----CCCCccEEEEEeC------CCCCceEEEEeCCCCE------
Q psy5768 353 ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLD-----SPKAQRIVVVRLG------QHDKPRGIDIDSCDSR------ 415 (652)
Q Consensus 353 ~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~-----~~~~~~~~~~~~~------~~~~P~~Iavdp~~g~------ 415 (652)
..+..|-|||+.+. +.++++|..++.....+.+ +.. ...++... ....|.+|++....++
T Consensus 20 p~L~N~WGia~~p~-~~~WVadngT~~~TlYdg~~~~~~g~~--~~L~vtiP~~~~~~~~~~PTGiVfN~~~~F~vt~~g 96 (336)
T TIGR03118 20 PGLRNAWGLSYRPG-GPFWVANTGTGTATLYVGNPDTQPLVQ--DPLVVVIPAPPPLAAEGTPTGQVFNGSDTFVVSGEG 96 (336)
T ss_pred ccccccceeEecCC-CCEEEecCCcceEEeecCCcccccCCc--cceEEEecCCCCCCCCCCccEEEEeCCCceEEcCCC
Confidence 56779999999984 4677788888888888876 322 22333332 2358999999965444
Q ss_pred -------EEEEecCCCCCceEEEe--ecCC---CceEEEEcCC--CCCceEEEecC--CCEEEEEeCCCCeEEEEecCCC
Q psy5768 416 -------IYWTNWNSHLPSIQRAF--FSGF---GTESIITTDI--TMPNALALDHQ--AEKLFWGDARLDKIERCDYDGT 479 (652)
Q Consensus 416 -------Lywtd~~~~~~~I~r~~--ldG~---~~~~l~~~~l--~~P~glaiD~~--~~~LYw~D~~~~~I~~~~ldG~ 479 (652)
||.|+-+ .|--.+ ++-+ ...+++.... .--.||+|-.- .++||-+|-..++|... |++
T Consensus 97 ~~~~a~Fif~tEdG----TisaW~p~v~~t~~~~~~~~~d~s~~gavYkGLAi~~~~~~~~LYaadF~~g~IDVF--d~~ 170 (336)
T TIGR03118 97 ITGPSRFLFVTEDG----TLSGWAPALGTTRMTRAEIVVDASQQGNVYKGLAVGPTGGGDYLYAANFRQGRIDVF--KGS 170 (336)
T ss_pred cccceeEEEEeCCc----eEEeecCcCCcccccccEEEEccCCCcceeeeeEEeecCCCceEEEeccCCCceEEe--cCc
Confidence 6666654 332222 2222 1223333221 12348888744 68999999999999886 444
Q ss_pred ceEEEecCC--------CCceeEEEEeCCEEEEEcCCC-------------CeEEEEEccCCceEEEEec--ccCCccee
Q psy5768 480 NRIVLSKIS--------PLHPFDMAVYGEFIFWTDWVI-------------HAVLRANKYTGEEVYTLRK--NIRRPMGI 536 (652)
Q Consensus 480 ~~~~l~~~~--------~~~p~glav~~~~lYwtd~~~-------------~~I~~~~k~~g~~~~~~~~--~~~~p~~i 536 (652)
.+++-+... ...||+|...++.||+|=... +.|-..+. +|+.++.+.. .+..|.+|
T Consensus 171 f~~~~~~g~F~DP~iPagyAPFnIqnig~~lyVtYA~qd~~~~d~v~G~G~G~VdvFd~-~G~l~~r~as~g~LNaPWG~ 249 (336)
T TIGR03118 171 FRPPPLPGSFIDPALPAGYAPFNVQNLGGTLYVTYAQQDADRNDEVAGAGLGYVNVFTL-NGQLLRRVASSGRLNAPWGL 249 (336)
T ss_pred cccccCCCCccCCCCCCCCCCcceEEECCeEEEEEEecCCcccccccCCCcceEEEEcC-CCcEEEEeccCCcccCCcee
Confidence 443332211 256999999999999974222 22333333 4666665543 23778888
Q ss_pred EEE
Q psy5768 537 VAI 539 (652)
Q Consensus 537 ~~~ 539 (652)
++-
T Consensus 250 a~A 252 (336)
T TIGR03118 250 AIA 252 (336)
T ss_pred eeC
Confidence 764
No 83
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=96.79 E-value=0.18 Score=53.23 Aligned_cols=117 Identities=14% Similarity=0.264 Sum_probs=71.2
Q ss_pred CCCCCceEEEEe--CCCCEEEEEecCCCCCceEEEee--cCCCc--eEEEEc-C-CCCCceEEEecCCCEEEEEeCCCCe
Q psy5768 399 GQHDKPRGIDID--SCDSRIYWTNWNSHLPSIQRAFF--SGFGT--ESIITT-D-ITMPNALALDHQAEKLFWGDARLDK 470 (652)
Q Consensus 399 ~~~~~P~~Iavd--p~~g~Lywtd~~~~~~~I~r~~l--dG~~~--~~l~~~-~-l~~P~glaiD~~~~~LYw~D~~~~~ 470 (652)
..+..|.|+++. |..|.+|..-.+.. +.++...| ++... -.++.+ . -.+|.|+++|...++||.++... -
T Consensus 153 ~~~~e~yGlcly~~~~~g~~ya~v~~k~-G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~~-G 230 (381)
T PF02333_consen 153 TDLSEPYGLCLYRSPSTGALYAFVNGKD-GRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEEDV-G 230 (381)
T ss_dssp -SSSSEEEEEEEE-TTT--EEEEEEETT-SEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETTT-E
T ss_pred cccccceeeEEeecCCCCcEEEEEecCC-ceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCcc-E
Confidence 355678999985 56676665443332 35665555 34332 223321 1 24788999999999999999875 6
Q ss_pred EEEEecC---CCceEEEecCC----CCceeEEEEe-----CCEEEEEcCCCCeEEEEEc
Q psy5768 471 IERCDYD---GTNRIVLSKIS----PLHPFDMAVY-----GEFIFWTDWVIHAVLRANK 517 (652)
Q Consensus 471 I~~~~ld---G~~~~~l~~~~----~~~p~glav~-----~~~lYwtd~~~~~I~~~~k 517 (652)
|++++.+ +..++.+.... ....-||+++ .+||..++.+.++....+.
T Consensus 231 IW~y~Aep~~~~~~~~v~~~~g~~l~aDvEGlaly~~~~g~gYLivSsQG~~sf~Vy~r 289 (381)
T PF02333_consen 231 IWRYDAEPEGGNDRTLVASADGDGLVADVEGLALYYGSDGKGYLIVSSQGDNSFAVYDR 289 (381)
T ss_dssp EEEEESSCCC-S--EEEEEBSSSSB-S-EEEEEEEE-CCC-EEEEEEEGGGTEEEEEES
T ss_pred EEEEecCCCCCCcceeeecccccccccCccceEEEecCCCCeEEEEEcCCCCeEEEEec
Confidence 8999987 34555553221 2457899987 2599999998887554444
No 84
>KOG1219|consensus
Probab=96.78 E-value=0.0013 Score=79.46 Aligned_cols=68 Identities=29% Similarity=0.672 Sum_probs=43.8
Q ss_pred CCCCCCCCCCCCCCCccccccCCCCceeeeccCceeeccCCcccCcccccCCCceeecc-CeecCCc--cCCCCCCCCCC
Q psy5768 544 DACAKTPCRHLNGNCDDICKLDETGQVVCSCFTGKVLMEDNRSCTINTVCSEHDFKCSD-GMCIPFN--QTCDRVYNCHD 620 (652)
Q Consensus 544 ~~~~~~~C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~l~~d~~C~~~~~~C~~~~f~C~~-g~Ci~~~--~~Cd~~~dC~d 620 (652)
.+|..|||+ ||| .|...|.|.|+|.||.-|.. ++|+.....|.++ .|.+ |.||+.. +.| .|+.
T Consensus 3865 d~C~~npCq--hgG---~C~~~~~ggy~CkCpsqysG---~~CEi~~epC~sn--PC~~GgtCip~~n~f~C----nC~~ 3930 (4289)
T KOG1219|consen 3865 DPCNDNPCQ--HGG---TCISQPKGGYKCKCPSQYSG---NHCEIDLEPCASN--PCLTGGTCIPFYNGFLC----NCPN 3930 (4289)
T ss_pred cccccCccc--CCC---EecCCCCCceEEeCcccccC---cccccccccccCC--CCCCCCEEEecCCCeeE----eCCC
Confidence 345556665 232 57778999999999988744 6888766667543 3555 4788754 334 5665
Q ss_pred CCCCC
Q psy5768 621 KSDEG 625 (652)
Q Consensus 621 ~sde~ 625 (652)
|.-..
T Consensus 3931 gyTG~ 3935 (4289)
T KOG1219|consen 3931 GYTGK 3935 (4289)
T ss_pred CccCc
Confidence 55544
No 85
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=96.77 E-value=0.001 Score=61.54 Aligned_cols=56 Identities=30% Similarity=0.555 Sum_probs=44.1
Q ss_pred ceeeeccCceeeccCCcccCcccccCCCceeeccC----eecCCccCCCCCCC---CCCCCCCCCCCCCC
Q psy5768 569 QVVCSCFTGKVLMEDNRSCTINTVCSEHDFKCSDG----MCIPFNQTCDRVYN---CHDKSDEGILYCAM 631 (652)
Q Consensus 569 ~~~C~Cp~g~~l~~d~~C~~~~~~C~~~~f~C~~g----~Ci~~~~~Cd~~~d---C~d~sde~~~~C~~ 631 (652)
.-.|-||+| +| .|..+.|....|.|.|. ..||...+=||+.| |=|||||....|.+
T Consensus 55 DdyCDC~DG----SD---EPGTsAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CCDGSDE~~~~C~N 117 (176)
T PF12999_consen 55 DDYCDCPDG----SD---EPGTSACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICCDGSDESGGKCPN 117 (176)
T ss_pred CcceeCCCC----CC---ccccccCcCceEeeccCCCCCceeehhhhcCCcCcccccCCCCCCCCCCCcc
Confidence 567999999 55 12235687779999983 68999999999999 99999996655654
No 86
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=96.73 E-value=0.1 Score=55.37 Aligned_cols=161 Identities=19% Similarity=0.137 Sum_probs=99.6
Q ss_pred CceeeeEEEccCCEEEEEeCCC-------------CeEEEEEcCCCCCc-----cEEEEEeCCCCCceEEEEeCCCCEEE
Q psy5768 356 GSVEGLAYEYVHNYLYWTCNND-------------ATINKIDLDSPKAQ-----RIVVVRLGQHDKPRGIDIDSCDSRIY 417 (652)
Q Consensus 356 ~~~~glAvDw~~~~LYwtd~~~-------------~~I~~~~~~~~~~~-----~~~~~~~~~~~~P~~Iavdp~~g~Ly 417 (652)
+.-..|++++.+ +||.+-... ++|.+++.++.-.. ...+. ...+.+|.+++.||..|.||
T Consensus 177 H~g~~l~f~pDG-~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a~~~~~d~p~~~~~i~-s~G~RN~qGl~w~P~tg~Lw 254 (399)
T COG2133 177 HFGGRLVFGPDG-KLYVTTGSNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPNSEIW-SYGHRNPQGLAWHPVTGALW 254 (399)
T ss_pred cCcccEEECCCC-cEEEEeCCCCCcccccCccccccceeeeccCcccccCCCCCCcceE-EeccCCccceeecCCCCcEE
Confidence 345669999988 999987655 34444443331100 12222 34678999999999999999
Q ss_pred EEecCC---CCC---------------ceE-------EEeecCCCceEEEEcC-----CCCCceEEEecCC------CEE
Q psy5768 418 WTNWNS---HLP---------------SIQ-------RAFFSGFGTESIITTD-----ITMPNALALDHQA------EKL 461 (652)
Q Consensus 418 wtd~~~---~~~---------------~I~-------r~~ldG~~~~~l~~~~-----l~~P~glaiD~~~------~~L 461 (652)
.++.+. ..+ -++ +..+++.....+.... -..|.||++=.-+ +.|
T Consensus 255 ~~e~g~d~~~~~Deln~i~~G~nYGWP~~~~G~~~~g~~~~~~~~~~~~~~p~~~~~~h~ApsGmaFy~G~~fP~~r~~l 334 (399)
T COG2133 255 TTEHGPDALRGPDELNSIRPGKNYGWPYAYFGQNYDGRAIPDGTVVAGAIQPVYTWAPHIAPSGMAFYTGDLFPAYRGDL 334 (399)
T ss_pred EEecCCCcccCcccccccccCCccCCceeccCcccCccccCCCcccccccCCceeeccccccceeEEecCCcCccccCcE
Confidence 999886 211 111 1112222221111111 1246788876322 688
Q ss_pred EEEeCCCCeEEEEecCCCceEE---EecC-CCCceeEEEEe-CCEEEEEcCC-CCeEEEEEcc
Q psy5768 462 FWGDARLDKIERCDYDGTNRIV---LSKI-SPLHPFDMAVY-GEFIFWTDWV-IHAVLRANKY 518 (652)
Q Consensus 462 Yw~D~~~~~I~~~~ldG~~~~~---l~~~-~~~~p~glav~-~~~lYwtd~~-~~~I~~~~k~ 518 (652)
|.+..+.-.+.+.+.+|..+.+ ++.. ....|.++++. ++.||++|-. ++.|+|+...
T Consensus 335 fV~~hgsw~~~~~~~~g~~~~~~~~fl~~d~~gR~~dV~v~~DGallv~~D~~~g~i~Rv~~~ 397 (399)
T COG2133 335 FVGAHGSWPVLRLRPDGNYKVVLTGFLSGDLGGRPRDVAVAPDGALLVLTDQGDGRILRVSYA 397 (399)
T ss_pred EEEeecceeEEEeccCCCcceEEEEEEecCCCCcccceEECCCCeEEEeecCCCCeEEEecCC
Confidence 8888777778889999873332 2332 12589999997 5688888776 6799998753
No 87
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.51 E-value=0.0015 Score=39.34 Aligned_cols=20 Identities=30% Similarity=0.461 Sum_probs=18.0
Q ss_pred ceeeeccCceeeccC-CcccC
Q psy5768 569 QVVCSCFTGKVLMED-NRSCT 588 (652)
Q Consensus 569 ~~~C~Cp~g~~l~~d-~~C~~ 588 (652)
+|+|.|+.||.|..| ++|.+
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~D 21 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCED 21 (24)
T ss_pred CEEeeCCCCCcCCCCCCcccc
Confidence 489999999999988 88876
No 88
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=96.32 E-value=1.9 Score=45.85 Aligned_cols=135 Identities=12% Similarity=0.057 Sum_probs=94.6
Q ss_pred eEEEeecccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceE
Q psy5768 327 TLFYSDIQKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRG 406 (652)
Q Consensus 327 ~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~ 406 (652)
.+-..+.....|...+.+|...+.+..+++.++.|+++..++.+-.++ .+..|++++++..+ ..++.-+.-.-..+
T Consensus 373 ~~vigt~dgD~l~iyd~~~~e~kr~e~~lg~I~av~vs~dGK~~vvaN-dr~el~vididngn---v~~idkS~~~lItd 448 (668)
T COG4946 373 GDVIGTNDGDKLGIYDKDGGEVKRIEKDLGNIEAVKVSPDGKKVVVAN-DRFELWVIDIDNGN---VRLIDKSEYGLITD 448 (668)
T ss_pred ceEEeccCCceEEEEecCCceEEEeeCCccceEEEEEcCCCcEEEEEc-CceEEEEEEecCCC---eeEecccccceeEE
Confidence 344444555577777888887778889999999999999888887765 45789999997642 44444455677788
Q ss_pred EEEeCCCCEEEEEecCC-CCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeC
Q psy5768 407 IDIDSCDSRIYWTNWNS-HLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDA 466 (652)
Q Consensus 407 Iavdp~~g~Lywtd~~~-~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~ 466 (652)
++++|..+++=++-... -...|.-..|+|...-.+. +.-..-.+-|+|++++.||+...
T Consensus 449 f~~~~nsr~iAYafP~gy~tq~Iklydm~~~Kiy~vT-T~ta~DfsPaFD~d~ryLYfLs~ 508 (668)
T COG4946 449 FDWHPNSRWIAYAFPEGYYTQSIKLYDMDGGKIYDVT-TPTAYDFSPAFDPDGRYLYFLSA 508 (668)
T ss_pred EEEcCCceeEEEecCcceeeeeEEEEecCCCeEEEec-CCcccccCcccCCCCcEEEEEec
Confidence 99999887775553211 0127888899986543333 33334456789999999999754
No 89
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.29 E-value=0.0026 Score=42.82 Aligned_cols=34 Identities=44% Similarity=0.886 Sum_probs=25.3
Q ss_pred CCCCCCCCcc--cceecCCCceEEEeCCccccCCCcc
Q psy5768 240 CGVNNGGCAE--LCLYNGVSAVCACAHGVVAQDGKSC 274 (652)
Q Consensus 240 C~~~ng~Cs~--lC~~~~~~~~C~C~~G~l~~dg~~C 274 (652)
|..+|++|+. .|...+.+|.|.|..||.. ||..|
T Consensus 1 C~~~~~~C~~nA~C~~~~~~~~C~C~~Gy~G-dG~~C 36 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGGSYTCTCKPGYEG-DGFFC 36 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TTSEEEEE-CEEEC-CSTCE
T ss_pred CCCCCCCCCCCcEeecCCCCEEeECCCCCcc-CCcCC
Confidence 6677889986 8998888999999999854 77665
No 90
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=96.26 E-value=1.1 Score=44.85 Aligned_cols=176 Identities=14% Similarity=0.127 Sum_probs=102.7
Q ss_pred ecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEeeccCceeeeEEEc
Q psy5768 286 VNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQGSVEGLAYEY 365 (652)
Q Consensus 286 ~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw 365 (652)
.+.|+++++..++ .......+. --...|++.- +++||---...+..+..+.+.-....-++-.+.--||+.|
T Consensus 67 ~S~l~~~d~~tg~----~~~~~~l~~-~~FgEGit~~--~d~l~qLTWk~~~~f~yd~~tl~~~~~~~y~~EGWGLt~d- 138 (264)
T PF05096_consen 67 QSSLRKVDLETGK----VLQSVPLPP-RYFGEGITIL--GDKLYQLTWKEGTGFVYDPNTLKKIGTFPYPGEGWGLTSD- 138 (264)
T ss_dssp EEEEEEEETTTSS----EEEEEE-TT-T--EEEEEEE--TTEEEEEESSSSEEEEEETTTTEEEEEEE-SSS--EEEEC-
T ss_pred cEEEEEEECCCCc----EEEEEECCc-cccceeEEEE--CCEEEEEEecCCeEEEEccccceEEEEEecCCcceEEEcC-
Confidence 3568888873322 111222211 1123455543 6788888888888888887643222222333466889977
Q ss_pred cCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCc----eEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEE
Q psy5768 366 VHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKP----RGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESI 441 (652)
Q Consensus 366 ~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P----~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l 441 (652)
+..||.+|+ +.+|...+.......+. +.......| .-|..- +|+||---|.++ .|.|.++.-......
T Consensus 139 -g~~Li~SDG-S~~L~~~dP~~f~~~~~--i~V~~~g~pv~~LNELE~i--~G~IyANVW~td--~I~~Idp~tG~V~~~ 210 (264)
T PF05096_consen 139 -GKRLIMSDG-SSRLYFLDPETFKEVRT--IQVTDNGRPVSNLNELEYI--NGKIYANVWQTD--RIVRIDPETGKVVGW 210 (264)
T ss_dssp -SSCEEEE-S-SSEEEEE-TTT-SEEEE--EE-EETTEE---EEEEEEE--TTEEEEEETTSS--EEEEEETTT-BEEEE
T ss_pred -CCEEEEECC-ccceEEECCcccceEEE--EEEEECCEECCCcEeEEEE--cCEEEEEeCCCC--eEEEEeCCCCeEEEE
Confidence 778999987 67888888655432122 222222333 334443 699988888754 899998876665555
Q ss_pred EEc-C--------------CCCCceEEEecCCCEEEEEeCCCCeEEEEecC
Q psy5768 442 ITT-D--------------ITMPNALALDHQAEKLFWGDARLDKIERCDYD 477 (652)
Q Consensus 442 ~~~-~--------------l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ld 477 (652)
+.- . ..--||||.|+.+++||++=-.-.+++.+.+.
T Consensus 211 iDls~L~~~~~~~~~~~~~~dVLNGIAyd~~~~~l~vTGK~Wp~lyeV~l~ 261 (264)
T PF05096_consen 211 IDLSGLRPEVGRDKSRQPDDDVLNGIAYDPETDRLFVTGKLWPKLYEVKLV 261 (264)
T ss_dssp EE-HHHHHHHTSTTST--TTS-EEEEEEETTTTEEEEEETT-SEEEEEEEE
T ss_pred EEhhHhhhcccccccccccCCeeEeEeEeCCCCEEEEEeCCCCceEEEEEE
Confidence 421 1 11258999999999999997777788877664
No 91
>KOG4499|consensus
Probab=96.26 E-value=0.069 Score=51.48 Aligned_cols=94 Identities=16% Similarity=0.187 Sum_probs=68.0
Q ss_pred eEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcC---C--CcEEEEEeC------CCC
Q psy5768 63 TIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLT---G--QYRYVLISG------GVD 131 (652)
Q Consensus 63 ~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~d---g--~~~~~l~~~------~~~ 131 (652)
.+++..+++. .+ ++-.. +..+. |||-|...+..|++|+..-.|...++| | +++++++.- .-.
T Consensus 140 ~Ly~~~~~h~-v~-~i~~~-v~IsN----gl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~ 212 (310)
T KOG4499|consen 140 ELYSWLAGHQ-VE-LIWNC-VGISN----GLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESL 212 (310)
T ss_pred EEEEeccCCC-ce-eeehh-ccCCc----cccccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCC
Confidence 3455555443 22 23344 67788 999998889999999988889878754 2 567787764 234
Q ss_pred CceeEEEcCCCCeEEEEecCCCCeEEEEeCC-CCC
Q psy5768 132 QPSALAVDPESGYLFWSESGKIPLIARAGLD-GKK 165 (652)
Q Consensus 132 ~P~~iavd~~~g~lywtd~~~~~~I~~~~ld-g~~ 165 (652)
.|.+++||. .|+||++-|. .++|.+.++. |+-
T Consensus 213 ~PDGm~ID~-eG~L~Va~~n-g~~V~~~dp~tGK~ 245 (310)
T KOG4499|consen 213 EPDGMTIDT-EGNLYVATFN-GGTVQKVDPTTGKI 245 (310)
T ss_pred CCCcceEcc-CCcEEEEEec-CcEEEEECCCCCcE
Confidence 699999996 9999999985 5688888764 443
No 92
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=96.21 E-value=1.9 Score=44.77 Aligned_cols=263 Identities=16% Similarity=0.229 Sum_probs=132.0
Q ss_pred CCcEEEEeC----CCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecC----CCC----eEEEEeCCCCCc
Q psy5768 99 AQNIYWSDP----KENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESG----KIP----LIARAGLDGKKQ 166 (652)
Q Consensus 99 ~~~lY~~d~----~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~----~~~----~I~~~~ldg~~~ 166 (652)
.+++|+.|. ...++.+.|.|.....-.+..++.. .+++.|.+..+|.++.- .++ .|+..+. +..
T Consensus 2 ~~rvyV~D~~~~~~~~rv~viD~d~~k~lGmi~~g~~~--~~~~spdgk~~y~a~T~~sR~~rG~RtDvv~~~D~--~TL 77 (342)
T PF06433_consen 2 AHRVYVQDPVFFHMTSRVYVIDADSGKLLGMIDTGFLG--NVALSPDGKTIYVAETFYSRGTRGERTDVVEIWDT--QTL 77 (342)
T ss_dssp TTEEEEEE-GGGGSSEEEEEEETTTTEEEEEEEEESSE--EEEE-TTSSEEEEEEEEEEETTEEEEEEEEEEEET--TTT
T ss_pred CcEEEEECCccccccceEEEEECCCCcEEEEeecccCC--ceeECCCCCEEEEEEEEEeccccccceeEEEEEec--CcC
Confidence 467888876 3457888887766665666654333 37788988899987621 111 1233322 222
Q ss_pred EE----EEee-----cccCceeEEEeccCCEEEEEeCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeeccCCCC
Q psy5768 167 TI----LAQE-----IIMPIKDITLDLKFFSAFYRNLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDAQTGT 237 (652)
Q Consensus 167 ~~----~~~~-----~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~ 237 (652)
.. .+.. ....++-+++...++++|+.+.....-..| .+-..+...
T Consensus 78 ~~~~EI~iP~k~R~~~~~~~~~~~ls~dgk~~~V~N~TPa~SVtV--------------------------VDl~~~kvv 131 (342)
T PF06433_consen 78 SPTGEIEIPPKPRAQVVPYKNMFALSADGKFLYVQNFTPATSVTV--------------------------VDLAAKKVV 131 (342)
T ss_dssp EEEEEEEETTS-B--BS--GGGEEE-TTSSEEEEEEESSSEEEEE--------------------------EETTTTEEE
T ss_pred cccceEecCCcchheecccccceEEccCCcEEEEEccCCCCeEEE--------------------------EECCCCcee
Confidence 11 1111 023455566666677777754432211111 111110000
Q ss_pred CCCCCCCCCCcccceecCCCceEEEeCCcc-----ccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccc
Q psy5768 238 NPCGVNNGGCAELCLYNGVSAVCACAHGVV-----AQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTM 312 (652)
Q Consensus 238 n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~l-----~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~ 312 (652)
.+- ...||.++==..+.++.=.|..|.+ .++|+. . ..... +. +..+ . | .+..
T Consensus 132 ~ei--~~PGC~~iyP~~~~~F~~lC~DGsl~~v~Ld~~Gk~-----~----~~~t~---~F--~~~~-d-p--~f~~--- 188 (342)
T PF06433_consen 132 GEI--DTPGCWLIYPSGNRGFSMLCGDGSLLTVTLDADGKE-----A----QKSTK---VF--DPDD-D-P--LFEH--- 188 (342)
T ss_dssp EEE--EGTSEEEEEEEETTEEEEEETTSCEEEEEETSTSSE-----E----EEEEE---ES--STTT-S----B-S----
T ss_pred eee--cCCCEEEEEecCCCceEEEecCCceEEEEECCCCCE-----e----Eeecc---cc--CCCC-c-c--cccc---
Confidence 000 1246777533333478889998852 234531 0 11111 11 1111 0 1 1211
Q ss_pred cceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe--ec--------cCceee---eEEEccCCEEEEEeCC---
Q psy5768 313 MKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL--ER--------QGSVEG---LAYEYVHNYLYWTCNN--- 376 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~--~~--------~~~~~g---lAvDw~~~~LYwtd~~--- 376 (652)
-+|+..++++||... .|.|+.+++.|...+..- +- --+|.| +|++...+.||...-.
T Consensus 189 ------~~~~~~~~~~~F~Sy-~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvLMh~g~~ 261 (342)
T PF06433_consen 189 ------PAYSRDGGRLYFVSY-EGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVLMHQGGE 261 (342)
T ss_dssp -------EEETTTTEEEEEBT-TSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEEEEE--T
T ss_pred ------cceECCCCeEEEEec-CCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEEecCCCC
Confidence 135666788999776 488999999887644322 11 114555 8999999999985421
Q ss_pred -C-----CeEEEEEcCCCCCccEEE---------EEeCCCCCceEEEEeCCCCEEEEEecCC
Q psy5768 377 -D-----ATINKIDLDSPKAQRIVV---------VRLGQHDKPRGIDIDSCDSRIYWTNWNS 423 (652)
Q Consensus 377 -~-----~~I~~~~~~~~~~~~~~~---------~~~~~~~~P~~Iavdp~~g~Lywtd~~~ 423 (652)
+ ..||++++...+ +... +..++.++|.=++++...+.|++-|...
T Consensus 262 gsHKdpgteVWv~D~~t~k--rv~Ri~l~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~t 321 (342)
T PF06433_consen 262 GSHKDPGTEVWVYDLKTHK--RVARIPLEHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAAT 321 (342)
T ss_dssp T-TTS-EEEEEEEETTTTE--EEEEEEEEEEESEEEEESSSS-EEEEEETTTTEEEEEETTT
T ss_pred CCccCCceEEEEEECCCCe--EEEEEeCCCccceEEEccCCCcEEEEEcCCCCeEEEEeCcC
Confidence 1 258999886532 1111 2224456666666666666676666553
No 93
>PRK01029 tolB translocation protein TolB; Provisional
Probab=96.11 E-value=1.5 Score=47.87 Aligned_cols=170 Identities=8% Similarity=0.048 Sum_probs=97.6
Q ss_pred CeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCE---EEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCcc
Q psy5768 11 SKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGK---MFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVT 87 (652)
Q Consensus 11 ~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~---lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~ 87 (652)
..|.++|.+|...+.+..... ....| +|.|...+ +|++. ..+...|+..+++|...+.+.... ....
T Consensus 165 ~~l~~~d~dG~~~~~lt~~~~----~~~sP---~wSPDG~~~~~~y~S~-~~g~~~I~~~~l~~g~~~~lt~~~--g~~~ 234 (428)
T PRK01029 165 GELWSVDYDGQNLRPLTQEHS----LSITP---TWMHIGSGFPYLYVSY-KLGVPKIFLGSLENPAGKKILALQ--GNQL 234 (428)
T ss_pred ceEEEEcCCCCCceEcccCCC----Ccccc---eEccCCCceEEEEEEc-cCCCceEEEEECCCCCceEeecCC--CCcc
Confidence 489999999988777654211 22334 58888764 45555 224568999999988766664322 2223
Q ss_pred CCCCcEEEEccCCcEEEEeCC--CCEEEEE--EcCC---CcEEEEEeCCCCCceeEEEcCCCCeEEEEe-cCCCCeEEEE
Q psy5768 88 ACNLHIAVDWIAQNIYWSDPK--ENVIEVA--RLTG---QYRYVLISGGVDQPSALAVDPESGYLFWSE-SGKIPLIARA 159 (652)
Q Consensus 88 ~~~~~lavDw~~~~lY~~d~~--~~~I~v~--~~dg---~~~~~l~~~~~~~P~~iavd~~~g~lywtd-~~~~~~I~~~ 159 (652)
..++.+.+++|.|+-.. ...|.+. ++++ ...+.+...........+..|...+|+|+. .+..+.|++.
T Consensus 235 ----~p~wSPDG~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSPDG~~Laf~s~~~g~~~ly~~ 310 (428)
T PRK01029 235 ----MPTFSPRKKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFSPDGTRLVFVSNKDGRPRIYIM 310 (428)
T ss_pred ----ceEECCCCCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEECCCCCEEEEEECCCCCceEEEE
Confidence 45555557777776532 2345553 3332 222333333222334568888666677765 3335678888
Q ss_pred eCCCC--CcEEEEeecccCceeEEEeccCCEEEEEeCC
Q psy5768 160 GLDGK--KQTILAQEIIMPIKDITLDLKFFSAFYRNLS 195 (652)
Q Consensus 160 ~ldg~--~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~ 195 (652)
.+++. ....+... .......+..+.+++|+++..+
T Consensus 311 ~~~~~g~~~~~lt~~-~~~~~~p~wSPDG~~Laf~~~~ 347 (428)
T PRK01029 311 QIDPEGQSPRLLTKK-YRNSSCPAWSPDGKKIAFCSVI 347 (428)
T ss_pred ECcccccceEEeccC-CCCccceeECCCCCEEEEEEcC
Confidence 87643 23333222 1233456677777778776544
No 94
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=96.06 E-value=0.14 Score=54.40 Aligned_cols=123 Identities=19% Similarity=0.173 Sum_probs=77.3
Q ss_pred CCCeeEEEEECCCCEEEEEeccCCcce------EEEEEcCCC----ccEEE-------------EeCC-CcCCccCC---
Q psy5768 37 LSKISSIAVWPVKGKMFWSNVTKQVVT------IEMAFMDGT----KRETV-------------VSQK-KYPAVTAC--- 89 (652)
Q Consensus 37 ~~~~~~v~~d~~~~~lyw~d~~~~~~~------I~~~~~dgs----~~~~v-------------~~~~-~~~~p~~~--- 89 (652)
+++|.|+++||.++.||.+| .+... +.+. ..|. ..... .... ....-.|.
T Consensus 238 ~RN~qGl~w~P~tg~Lw~~e--~g~d~~~~~Deln~i-~~G~nYGWP~~~~G~~~~g~~~~~~~~~~~~~~p~~~~~~h~ 314 (399)
T COG2133 238 HRNPQGLAWHPVTGALWTTE--HGPDALRGPDELNSI-RPGKNYGWPYAYFGQNYDGRAIPDGTVVAGAIQPVYTWAPHI 314 (399)
T ss_pred cCCccceeecCCCCcEEEEe--cCCCcccCccccccc-ccCCccCCceeccCcccCccccCCCcccccccCCceeecccc
Confidence 58999999999999999999 55411 2221 1111 00000 0000 00011122
Q ss_pred -CCcEEEEccC------CcEEEEeCCCCEEEEEEcCCCcE---EEEEeC-CCCCceeEEEcCCCCeEEEEecCCCCeEEE
Q psy5768 90 -NLHIAVDWIA------QNIYWSDPKENVIEVARLTGQYR---YVLISG-GVDQPSALAVDPESGYLFWSESGKIPLIAR 158 (652)
Q Consensus 90 -~~~lavDw~~------~~lY~~d~~~~~I~v~~~dg~~~---~~l~~~-~~~~P~~iavd~~~g~lywtd~~~~~~I~~ 158 (652)
|+|||+=--+ +.+|++......+.+.+++|..+ ..++.. .-.+|+++++.| .|.||++|-..+.+|.|
T Consensus 315 ApsGmaFy~G~~fP~~r~~lfV~~hgsw~~~~~~~~g~~~~~~~~fl~~d~~gR~~dV~v~~-DGallv~~D~~~g~i~R 393 (399)
T COG2133 315 APSGMAFYTGDLFPAYRGDLFVGAHGSWPVLRLRPDGNYKVVLTGFLSGDLGGRPRDVAVAP-DGALLVLTDQGDGRILR 393 (399)
T ss_pred ccceeEEecCCcCccccCcEEEEeecceeEEEeccCCCcceEEEEEEecCCCCcccceEECC-CCeEEEeecCCCCeEEE
Confidence 2599985322 67888888888888899999843 333442 226999999998 78888887654668999
Q ss_pred EeCCC
Q psy5768 159 AGLDG 163 (652)
Q Consensus 159 ~~ldg 163 (652)
....+
T Consensus 394 v~~~~ 398 (399)
T COG2133 394 VSYAG 398 (399)
T ss_pred ecCCC
Confidence 87654
No 95
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=96.01 E-value=0.021 Score=36.08 Aligned_cols=26 Identities=12% Similarity=0.231 Sum_probs=23.4
Q ss_pred CceeeeEEEccCCEEEEEeCCCCeEEE
Q psy5768 356 GSVEGLAYEYVHNYLYWTCNNDATINK 382 (652)
Q Consensus 356 ~~~~glAvDw~~~~LYwtd~~~~~I~~ 382 (652)
..|.|||+| ..++||.+|..+++|.+
T Consensus 2 ~~P~gvav~-~~g~i~VaD~~n~rV~v 27 (28)
T PF01436_consen 2 NYPHGVAVD-SDGNIYVADSGNHRVQV 27 (28)
T ss_dssp SSEEEEEEE-TTSEEEEEECCCTEEEE
T ss_pred cCCcEEEEe-CCCCEEEEECCCCEEEE
Confidence 479999999 78999999999999875
No 96
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=95.96 E-value=0.0064 Score=42.67 Aligned_cols=37 Identities=30% Similarity=0.535 Sum_probs=28.1
Q ss_pred CCCCCCCCCCc--cccccCCCCceeeeccCceeeccC-Ccc
Q psy5768 549 TPCRHLNGNCD--DICKLDETGQVVCSCFTGKVLMED-NRS 586 (652)
Q Consensus 549 ~~C~~~ng~Cs--~lCl~~~~~~~~C~Cp~g~~l~~d-~~C 586 (652)
+.|......|+ +.|+.+++ +|+|.|+.||.+..+ ++|
T Consensus 3 dEC~~~~~~C~~~~~C~N~~G-sy~C~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 3 DECAEGPHNCPENGTCVNTEG-SYSCSCPPGYELNDDGTTC 42 (42)
T ss_dssp STTTTTSSSSSTTSEEEEETT-EEEEEESTTEEECTTSSEE
T ss_pred cccCCCCCcCCCCCEEEcCCC-CEEeeCCCCcEECCCCCcC
Confidence 56766556675 67888776 699999999997666 554
No 97
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=95.91 E-value=1.1 Score=45.76 Aligned_cols=177 Identities=15% Similarity=0.125 Sum_probs=108.3
Q ss_pred ceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCc--cEEEE---EeCCCCCceEEEEeCCCCEEEEEecCCC-CCceEE
Q psy5768 357 SVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQ--RIVVV---RLGQHDKPRGIDIDSCDSRIYWTNWNSH-LPSIQR 430 (652)
Q Consensus 357 ~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~--~~~~~---~~~~~~~P~~Iavdp~~g~Lywtd~~~~-~~~I~r 430 (652)
.+..||+ ..+.|++.+..-.-+-.++-.-+-.. ....+ .-++--+-.|+|+.- ..--|+|.-+.. .+.=+|
T Consensus 104 diHdia~--~~~~l~fVNT~fSCLatl~~~~SF~P~WkPpFIs~la~eDRCHLNGlA~~~-g~p~yVTa~~~sD~~~gWR 180 (335)
T TIGR03032 104 DAHDLAL--GAGRLLFVNTLFSCLATVSPDYSFVPLWKPPFISKLAPEDRCHLNGMALDD-GEPRYVTALSQSDVADGWR 180 (335)
T ss_pred chhheee--cCCcEEEEECcceeEEEECCCCccccccCCccccccCccCceeecceeeeC-CeEEEEEEeeccCCccccc
Confidence 5677777 57788888877666665554322110 00111 112334667888875 445677765432 122233
Q ss_pred Eeec-C------CCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeCCEEE
Q psy5768 431 AFFS-G------FGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIF 503 (652)
Q Consensus 431 ~~ld-G------~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lY 503 (652)
-... | ..-++ +.+++..|.+-..- +++||+.|.+.+.+.++|.+....+.+..-. ..|.||++.++++|
T Consensus 181 ~~~~~gG~vidv~s~ev-l~~GLsmPhSPRWh--dgrLwvldsgtGev~~vD~~~G~~e~Va~vp-G~~rGL~f~G~llv 256 (335)
T TIGR03032 181 EGRRDGGCVIDIPSGEV-VASGLSMPHSPRWY--QGKLWLLNSGRGELGYVDPQAGKFQPVAFLP-GFTRGLAFAGDFAF 256 (335)
T ss_pred ccccCCeEEEEeCCCCE-EEcCccCCcCCcEe--CCeEEEEECCCCEEEEEcCCCCcEEEEEECC-CCCcccceeCCEEE
Confidence 3221 1 11122 23466677766665 7999999999999999999844444554433 68999999999999
Q ss_pred EEcCCCC-------------------eEEEEEccCCceEEEEec--ccCCcceeEEEe
Q psy5768 504 WTDWVIH-------------------AVLRANKYTGEEVYTLRK--NIRRPMGIVAIS 540 (652)
Q Consensus 504 wtd~~~~-------------------~I~~~~k~~g~~~~~~~~--~~~~p~~i~~~~ 540 (652)
+.=.+.+ .|+.+|..+|..+..+.. .+...+++.+..
T Consensus 257 VgmSk~R~~~~f~glpl~~~l~~~~CGv~vidl~tG~vv~~l~feg~v~EifdV~vLP 314 (335)
T TIGR03032 257 VGLSKLRESRVFGGLPIEERLDALGCGVAVIDLNSGDVVHWLRFEGVIEEIYDVAVLP 314 (335)
T ss_pred EEeccccCCCCcCCCchhhhhhhhcccEEEEECCCCCEEEEEEeCCceeEEEEEEEec
Confidence 8754332 266778888887766553 235667776663
No 98
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=95.63 E-value=0.024 Score=35.80 Aligned_cols=24 Identities=33% Similarity=0.436 Sum_probs=21.3
Q ss_pred CcEEEEccCCcEEEEeCCCCEEEEE
Q psy5768 91 LHIAVDWIAQNIYWSDPKENVIEVA 115 (652)
Q Consensus 91 ~~lavDw~~~~lY~~d~~~~~I~v~ 115 (652)
.|||+| ..++||++|...++|.++
T Consensus 5 ~gvav~-~~g~i~VaD~~n~rV~vf 28 (28)
T PF01436_consen 5 HGVAVD-SDGNIYVADSGNHRVQVF 28 (28)
T ss_dssp EEEEEE-TTSEEEEEECCCTEEEEE
T ss_pred cEEEEe-CCCCEEEEECCCCEEEEC
Confidence 399999 699999999999999864
No 99
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=95.31 E-value=0.57 Score=49.65 Aligned_cols=130 Identities=17% Similarity=0.268 Sum_probs=90.3
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCC
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKK 82 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~ 82 (652)
.++.+..++.+-+++.+|..+.++.. .+.++.+|.+++..+++..++ ....|+.++++..+.+.+=...
T Consensus 374 ~vigt~dgD~l~iyd~~~~e~kr~e~-------~lg~I~av~vs~dGK~~vvaN---dr~el~vididngnv~~idkS~- 442 (668)
T COG4946 374 DVIGTNDGDKLGIYDKDGGEVKRIEK-------DLGNIEAVKVSPDGKKVVVAN---DRFELWVIDIDNGNVRLIDKSE- 442 (668)
T ss_pred eEEeccCCceEEEEecCCceEEEeeC-------CccceEEEEEcCCCcEEEEEc---CceEEEEEEecCCCeeEecccc-
Confidence 46778888999999999998888876 688999999999888888877 6778999999877655553222
Q ss_pred cCCccCCCCcEEEEccCCcEEEEeC-----CCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEec
Q psy5768 83 YPAVTACNLHIAVDWIAQNIYWSDP-----KENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSES 150 (652)
Q Consensus 83 ~~~p~~~~~~lavDw~~~~lY~~d~-----~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~ 150 (652)
.+... ++ +|--+-=+++.+ -++.|...+.+|....-+ .+....-.+-|.||...|||+-..
T Consensus 443 ~~lIt----df--~~~~nsr~iAYafP~gy~tq~Iklydm~~~Kiy~v-TT~ta~DfsPaFD~d~ryLYfLs~ 508 (668)
T COG4946 443 YGLIT----DF--DWHPNSRWIAYAFPEGYYTQSIKLYDMDGGKIYDV-TTPTAYDFSPAFDPDGRYLYFLSA 508 (668)
T ss_pred cceeE----EE--EEcCCceeEEEecCcceeeeeEEEEecCCCeEEEe-cCCcccccCcccCCCCcEEEEEec
Confidence 33344 33 333332223332 356788999988533222 222344457789999999999753
No 100
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=95.28 E-value=1.6 Score=44.63 Aligned_cols=99 Identities=13% Similarity=0.123 Sum_probs=59.7
Q ss_pred CeEEEecCCCCeEEEEecC--CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEE----------
Q psy5768 1 MFIAVSSPTQSKIVVCNLE--GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAF---------- 68 (652)
Q Consensus 1 ~~i~v~~~~~~~I~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~---------- 68 (652)
+.++++++..++++++..+ |. ...+.. .+.++.|++.+ .++||.+- ..+|.+..
T Consensus 18 ~Sla~sTYQagkL~~ig~~~~g~-l~~~~r-------~F~r~MGl~~~--~~~l~~~t----~~qiw~f~~~~n~l~~~~ 83 (335)
T TIGR03032 18 LSLAVTTYQAGKLFFIGLQPNGE-LDVFER-------TFPRPMGLAVS--PQSLTLGT----RYQLWRFANVDNLLPAGQ 83 (335)
T ss_pred eEEEEEeeecceEEEEEeCCCCc-EEEEee-------ccCccceeeee--CCeEEEEE----cceeEEcccccccccccc
Confidence 3689999999999998655 43 333332 57899999885 68999876 34666651
Q ss_pred cCCC------ccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCc
Q psy5768 69 MDGT------KRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQY 121 (652)
Q Consensus 69 ~dgs------~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~ 121 (652)
..+. .+...+ +|++. .- .||+ ..+.++++++....+...+.+-++
T Consensus 84 ~~~~~D~~yvPr~~~~-TGdid-iH----dia~--~~~~l~fVNT~fSCLatl~~~~SF 134 (335)
T TIGR03032 84 THPGYDRLYVPRASYV-TGDID-AH----DLAL--GAGRLLFVNTLFSCLATVSPDYSF 134 (335)
T ss_pred cCCCCCeEEeeeeeee-ccCcc-hh----heee--cCCcEEEEECcceeEEEECCCCcc
Confidence 1121 122222 22122 12 5666 466788877766666655554443
No 101
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.24 E-value=0.017 Score=39.46 Aligned_cols=36 Identities=28% Similarity=0.744 Sum_probs=27.5
Q ss_pred CCCCCCCCCCCcc--cceecCCCceEEEeCCccccCCCccc
Q psy5768 237 TNPCGVNNGGCAE--LCLYNGVSAVCACAHGVVAQDGKSCS 275 (652)
Q Consensus 237 ~n~C~~~ng~Cs~--lC~~~~~~~~C~C~~G~l~~dg~~C~ 275 (652)
.++|... ..|.+ .|+...++|.|.|+.||. +|+.|.
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~g~~~C~C~~g~~--~g~~C~ 39 (39)
T smart00179 2 IDECASG-NPCQNGGTCVNTVGSYRCECPPGYT--DGRNCE 39 (39)
T ss_pred cccCcCC-CCcCCCCEeECCCCCeEeECCCCCc--cCCcCC
Confidence 4677653 46776 899877789999999986 676663
No 102
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=95.08 E-value=0.02 Score=37.14 Aligned_cols=28 Identities=25% Similarity=0.526 Sum_probs=22.0
Q ss_pred CCCCccccccCCCCceeeeccCceeeccCC
Q psy5768 555 NGNCDDICKLDETGQVVCSCFTGKVLMEDN 584 (652)
Q Consensus 555 ng~Cs~lCl~~~~~~~~C~Cp~g~~l~~d~ 584 (652)
...|+..|-|... ..|.||+||.|.++.
T Consensus 5 ~t~CpA~CDpn~~--~~C~CPeGyIlde~~ 32 (34)
T PF09064_consen 5 QTECPADCDPNSP--GQCFCPEGYILDEGS 32 (34)
T ss_pred cccCCCccCCCCC--CceeCCCceEecCCc
Confidence 3568888987654 589999999997663
No 103
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=94.99 E-value=0.014 Score=57.74 Aligned_cols=39 Identities=26% Similarity=0.603 Sum_probs=32.9
Q ss_pred CCCCCCCCCCCCCcccceecCCCceEEEeCCc-cccCCCc
Q psy5768 235 TGTNPCGVNNGGCAELCLYNGVSAVCACAHGV-VAQDGKS 273 (652)
Q Consensus 235 ~~~n~C~~~ng~Cs~lC~~~~~~~~C~C~~G~-l~~dg~~ 273 (652)
.+.++|...+..|.|.|...+++|.|.|+.|| +.+|+++
T Consensus 185 ~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~~~~ 224 (224)
T cd01475 185 VVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224 (224)
T ss_pred cCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCCCCC
Confidence 36789988888999999998889999999998 5567653
No 104
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=94.84 E-value=5 Score=40.46 Aligned_cols=221 Identities=14% Similarity=0.242 Sum_probs=118.4
Q ss_pred cceEEEEee-ecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEeecc
Q psy5768 277 YDAFIMYSR-VNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQ 355 (652)
Q Consensus 277 ~~~~Ll~s~-~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~ 355 (652)
.+++.+++. ...++-+++.++.. |. ++.. -.. .-.+--++...+..|++|...+ +..+++......++....
T Consensus 95 se~yvyvad~ssGL~IvDIS~P~s---P~-~~~~-lnt-~gyaygv~vsGn~aYVadlddg-fLivdvsdpssP~lagry 167 (370)
T COG5276 95 SEEYVYVADWSSGLRIVDISTPDS---PT-LIGF-LNT-DGYAYGVYVSGNYAYVADLDDG-FLIVDVSDPSSPQLAGRY 167 (370)
T ss_pred cccEEEEEcCCCceEEEeccCCCC---cc-eecc-ccC-CceEEEEEecCCEEEEeeccCc-EEEEECCCCCCceeeeee
Confidence 456777765 44566666644433 21 1111 001 0123334566788999998544 444555443334444222
Q ss_pred ----CceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEE
Q psy5768 356 ----GSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRA 431 (652)
Q Consensus 356 ----~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~ 431 (652)
...+.+|+. ++.-|.+....+ +..+++.... ..+++. ...-.|..-.+.+...+.|.++.++ .+.-.
T Consensus 168 a~~~~d~~~v~IS--Gn~AYvA~~d~G-L~ivDVSnp~--sPvli~-~~n~g~g~~sv~vsdnr~y~vvy~e---gvliv 238 (370)
T COG5276 168 ALPGGDTHDVAIS--GNYAYVAWRDGG-LTIVDVSNPH--SPVLIG-SYNTGPGTYSVSVSDNRAYLVVYDE---GVLIV 238 (370)
T ss_pred ccCCCCceeEEEe--cCeEEEEEeCCC-eEEEEccCCC--CCeEEE-EEecCCceEEEEecCCeeEEEEccc---ceEEE
Confidence 233567775 788888876543 5667776543 222332 2222333333444457788888764 34445
Q ss_pred eecCCCceEEEE-cCCCCCceE-EEecCCCEEEEEeCCCCeEEEEecCCCceEEEec---CCCCceeEEEEeCCEEEEEc
Q psy5768 432 FFSGFGTESIIT-TDITMPNAL-ALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSK---ISPLHPFDMAVYGEFIFWTD 506 (652)
Q Consensus 432 ~ldG~~~~~l~~-~~l~~P~gl-aiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~---~~~~~p~glav~~~~lYwtd 506 (652)
..+|...-.++. -+-..|.++ ++-..+++.|.+|...+ +-.++..-..-..+.. ....+-.||.++++++|.+|
T Consensus 239 d~s~~ssp~~~gsyet~~p~~~s~v~Vs~~~~Yvadga~g-l~~idisnp~spfl~ss~~t~g~~a~gi~ay~~y~yiad 317 (370)
T COG5276 239 DVSGPSSPTVFGSYETSNPVSISTVPVSGEYAYVADGAKG-LPIIDISNPPSPFLSSSLDTAGYQAAGIRAYGNYNYIAD 317 (370)
T ss_pred ecCCCCCceEeeccccCCcccccceecccceeeeeccccC-ceeEeccCCCCCchhccccCCCccccceEEecCeeEecc
Confidence 555554334442 233445544 33345899999986433 2223333222122222 11236789999999999999
Q ss_pred CCCCeEEE
Q psy5768 507 WVIHAVLR 514 (652)
Q Consensus 507 ~~~~~I~~ 514 (652)
..++.|.-
T Consensus 318 kn~g~vV~ 325 (370)
T COG5276 318 KNTGAVVD 325 (370)
T ss_pred CCceEEEe
Confidence 88877643
No 105
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=94.40 E-value=3 Score=43.74 Aligned_cols=111 Identities=16% Similarity=0.276 Sum_probs=68.5
Q ss_pred ccCceeeeEEEccCCEEEEEeCCCCe------EEEEEcCCCCCc--c-----EEEEEeCC-------CCCceEEEEeCCC
Q psy5768 354 RQGSVEGLAYEYVHNYLYWTCNNDAT------INKIDLDSPKAQ--R-----IVVVRLGQ-------HDKPRGIDIDSCD 413 (652)
Q Consensus 354 ~~~~~~glAvDw~~~~LYwtd~~~~~------I~~~~~~~~~~~--~-----~~~~~~~~-------~~~P~~Iavdp~~ 413 (652)
.++...||++|. ....||+-+..+. +....+....+. . ...+.... ...+.+|++ +..
T Consensus 18 ~~GGlSgl~~~~-~~~~~~avSD~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~G~~~~~~~~D~Egi~~-~~~ 95 (326)
T PF13449_consen 18 PFGGLSGLDYDP-DDGRFYAVSDRGPNKGPPRFYTFRIDYDQGGIGGVTILDMIPLRDPDGQPFPKNGLDPEGIAV-PPD 95 (326)
T ss_pred ccCcEeeEEEeC-CCCEEEEEECCCCCCCCCcEEEEEeeccCCCccceEeccceeccCCCCCcCCcCCCChhHeEE-ecC
Confidence 445678888886 3445666555555 665555431111 1 11111111 126679999 678
Q ss_pred CEEEEEecCCC----CCceEEEeecCCCceEE-EEcCC---------CC----CceEEEecCCCEEEEEeC
Q psy5768 414 SRIYWTNWNSH----LPSIQRAFFSGFGTESI-ITTDI---------TM----PNALALDHQAEKLFWGDA 466 (652)
Q Consensus 414 g~Lywtd~~~~----~~~I~r~~ldG~~~~~l-~~~~l---------~~----P~glaiD~~~~~LYw~D~ 466 (652)
|.+||++.+.. .|.|.+..++|...+.+ +...+ .. ..|||+.+++++||.+-.
T Consensus 96 g~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~~E 166 (326)
T PF13449_consen 96 GSFWISSEGGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAAME 166 (326)
T ss_pred CCEEEEeCCccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEEEC
Confidence 99999987741 16999999999886555 22221 11 338999999888998743
No 106
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=93.86 E-value=0.034 Score=37.75 Aligned_cols=20 Identities=45% Similarity=1.047 Sum_probs=17.7
Q ss_pred CCCCCCeeecCCCCcccCCC
Q psy5768 632 RDCRPGYFKCDNNKCILSSH 651 (652)
Q Consensus 632 ~~C~~~~f~C~~~~Ci~~~~ 651 (652)
++|++++|+|.+++||+.++
T Consensus 1 ~~C~~~~f~C~~~~CI~~~~ 20 (37)
T PF00057_consen 1 PTCPPGEFRCGNGQCIPKSW 20 (37)
T ss_dssp SSSSTTEEEETTSSEEEGGG
T ss_pred CcCcCCeeEcCCCCEEChHH
Confidence 36899999999999999865
No 107
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=93.85 E-value=5.7 Score=42.06 Aligned_cols=157 Identities=16% Similarity=0.157 Sum_probs=79.7
Q ss_pred CeEEEeecc-cccEEEEeccCCcceEEeecc-CceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCC
Q psy5768 326 KTLFYSDIQ-KGTINSVFFNGSNHRVLLERQ-GSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDK 403 (652)
Q Consensus 326 ~~lywsd~~-~~~I~~~~~~g~~~~~i~~~~-~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~ 403 (652)
+.||.+|.. ...++.++++....+.+..+- ....|..+-..++.||+.... .++.++++..++ .+.+......-.
T Consensus 49 kllF~s~~dg~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~-~~l~~vdL~T~e--~~~vy~~p~~~~ 125 (386)
T PF14583_consen 49 KLLFASDFDGNRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNG-RSLRRVDLDTLE--ERVVYEVPDDWK 125 (386)
T ss_dssp EEEEEE-TTSS-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETT-TEEEEEETTT----EEEEEE--TTEE
T ss_pred EEEEEeccCCCcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECC-CeEEEEECCcCc--EEEEEECCcccc
Confidence 345555543 346888888876666665322 223466677889999876533 578999998764 345555444433
Q ss_pred ceEEEE-eCCCCEEEEE------------ecC--------CCCCceEEEeecCCCceEEEEcCCCCCceEEEecCC-CEE
Q psy5768 404 PRGIDI-DSCDSRIYWT------------NWN--------SHLPSIQRAFFSGFGTESIITTDITMPNALALDHQA-EKL 461 (652)
Q Consensus 404 P~~Iav-dp~~g~Lywt------------d~~--------~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~-~~L 461 (652)
..+-.+ +. .+.++.. +|. ....+|.++.+.+..++++..++ .|-.-+-..+.. ..|
T Consensus 126 g~gt~v~n~-d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~~~-~wlgH~~fsP~dp~li 203 (386)
T PF14583_consen 126 GYGTWVANS-DCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFEDT-DWLGHVQFSPTDPTLI 203 (386)
T ss_dssp EEEEEEE-T-TSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEEES-S-EEEEEEETTEEEEE
T ss_pred cccceeeCC-CccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEecC-ccccCcccCCCCCCEE
Confidence 333333 43 3333322 111 11237999999998888888765 344444444332 344
Q ss_pred EEEeC-----CCCeEEEEecCCCceEEEecC
Q psy5768 462 FWGDA-----RLDKIERCDYDGTNRIVLSKI 487 (652)
Q Consensus 462 Yw~D~-----~~~~I~~~~ldG~~~~~l~~~ 487 (652)
-++-. -..+|+.++.||++.+.+...
T Consensus 204 ~fCHEGpw~~Vd~RiW~i~~dg~~~~~v~~~ 234 (386)
T PF14583_consen 204 MFCHEGPWDLVDQRIWTINTDGSNVKKVHRR 234 (386)
T ss_dssp EEEE-S-TTTSS-SEEEEETTS---EESS--
T ss_pred EEeccCCcceeceEEEEEEcCCCcceeeecC
Confidence 44432 234899999999998877553
No 108
>PRK02888 nitrous-oxide reductase; Validated
Probab=93.58 E-value=3.2 Score=46.59 Aligned_cols=147 Identities=13% Similarity=0.119 Sum_probs=87.7
Q ss_pred ceeeeEEEccCCEEEEEeC---CCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEee
Q psy5768 357 SVEGLAYEYVHNYLYWTCN---NDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFF 433 (652)
Q Consensus 357 ~~~glAvDw~~~~LYwtd~---~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~l 433 (652)
+|.++++++.++.+|+|.. ....+..++... +..++.. ..++.-++-+...+.|+. + + ++ .-+
T Consensus 236 npd~v~~spdGk~afvTsyNsE~G~tl~em~a~e----~d~~vvf---ni~~iea~vkdGK~~~V~--g-n--~V--~VI 301 (635)
T PRK02888 236 NLDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAE----RDWVVVF---NIARIEEAVKAGKFKTIG--G-S--KV--PVV 301 (635)
T ss_pred CcccceECCCCCEEEEeccCcccCcceeeecccc----CceEEEE---chHHHHHhhhCCCEEEEC--C-C--EE--EEE
Confidence 8899999999999999963 223444444321 1111111 111111222334455552 1 1 33 234
Q ss_pred cCCC-----ceEEEEc-CCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCc---------eEEEecC-C-CCceeEEE
Q psy5768 434 SGFG-----TESIITT-DITMPNALALDHQAEKLFWGDARLDKIERCDYDGTN---------RIVLSKI-S-PLHPFDMA 496 (652)
Q Consensus 434 dG~~-----~~~l~~~-~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~---------~~~l~~~-~-~~~p~gla 496 (652)
|+.. ..++... .-..|.|++++++++++|.+....+.+..+|+.-.. +.+++.. . ...|...+
T Consensus 302 D~~t~~~~~~~v~~yIPVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~vvaevevGlGPLHTa 381 (635)
T PRK02888 302 DGRKAANAGSALTRYVPVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAVVAEPELGLGPLHTA 381 (635)
T ss_pred ECCccccCCcceEEEEECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceEEEeeccCCCcceEE
Confidence 4544 3333222 235899999999999999999888999999987532 2333222 1 36798999
Q ss_pred EeCC-EEEEEcCCCCeEEEEEc
Q psy5768 497 VYGE-FIFWTDWVIHAVLRANK 517 (652)
Q Consensus 497 v~~~-~lYwtd~~~~~I~~~~k 517 (652)
++++ +.|.|=.-...|.+-|.
T Consensus 382 FDg~G~aytslf~dsqv~kwn~ 403 (635)
T PRK02888 382 FDGRGNAYTTLFLDSQIVKWNI 403 (635)
T ss_pred ECCCCCEEEeEeecceeEEEeh
Confidence 9854 88888766666666553
No 109
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=93.55 E-value=0.047 Score=54.03 Aligned_cols=35 Identities=29% Similarity=0.700 Sum_probs=30.3
Q ss_pred CCCCCCCCCCCccccccCCCCceeeeccCceeeccC
Q psy5768 548 KTPCRHLNGNCDDICKLDETGQVVCSCFTGKVLMED 583 (652)
Q Consensus 548 ~~~C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~l~~d 583 (652)
.++|...+..|.|.|...++ +|.|.|+.||.+..|
T Consensus 187 ~~~C~~~~~~c~~~C~~~~g-~~~c~c~~g~~~~~~ 221 (224)
T cd01475 187 PDLCATLSHVCQQVCISTPG-SYLCACTEGYALLED 221 (224)
T ss_pred chhhcCCCCCccceEEcCCC-CEEeECCCCccCCCC
Confidence 47888778889999997665 699999999999877
No 110
>KOG0291|consensus
Probab=93.41 E-value=11 Score=42.82 Aligned_cols=179 Identities=13% Similarity=0.056 Sum_probs=0.0
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcC-CCccEEEEeCC
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMD-GTKRETVVSQK 81 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~d-gs~~~~v~~~~ 81 (652)
.+++--..++|.++|....++..-.+. ..+..+++.|. ..+++.++- .-.+++.-+++. ..+.+++....
T Consensus 364 ~iaTG~eDgKVKvWn~~SgfC~vTFte------Hts~Vt~v~f~-~~g~~llss--SLDGtVRAwDlkRYrNfRTft~P~ 434 (893)
T KOG0291|consen 364 LIATGAEDGKVKVWNTQSGFCFVTFTE------HTSGVTAVQFT-ARGNVLLSS--SLDGTVRAWDLKRYRNFRTFTSPE 434 (893)
T ss_pred EEEeccCCCcEEEEeccCceEEEEecc------CCCceEEEEEE-ecCCEEEEe--ecCCeEEeeeecccceeeeecCCC
Q ss_pred CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeC
Q psy5768 82 KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGL 161 (652)
Q Consensus 82 ~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~l 161 (652)
--.-. .+|+|..+..+...+...=.|.+-+.......-++++--....+|.++|.+..|+=..|...-+||-..-
T Consensus 435 -p~Qfs----cvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDkTVRiW~if~ 509 (893)
T KOG0291|consen 435 -PIQFS----CVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDKTVRIWDIFS 509 (893)
T ss_pred -ceeee----EEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccccceEEEEEeec
Q ss_pred CCCCcEEEEeecccCceeEEEeccCCEEEEEeCCCC
Q psy5768 162 DGKKQTILAQEIIMPIKDITLDLKFFSAFYRNLSKG 197 (652)
Q Consensus 162 dg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~g~ 197 (652)
......++-... ...++++-+.+..|-+..++|.
T Consensus 510 s~~~vEtl~i~s--dvl~vsfrPdG~elaVaTldgq 543 (893)
T KOG0291|consen 510 SSGTVETLEIRS--DVLAVSFRPDGKELAVATLDGQ 543 (893)
T ss_pred cCceeeeEeecc--ceeEEEEcCCCCeEEEEEecce
No 111
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=93.40 E-value=9.5 Score=38.30 Aligned_cols=159 Identities=18% Similarity=0.157 Sum_probs=98.0
Q ss_pred ceeeeEEEccCCEEEEEeCCCC--eEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeec
Q psy5768 357 SVEGLAYEYVHNYLYWTCNNDA--TINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFS 434 (652)
Q Consensus 357 ~~~glAvDw~~~~LYwtd~~~~--~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ld 434 (652)
-..||.++ -.+.||-+....+ +|.++++...+ ......+....--.||++. +..||---|.+. .....+.+
T Consensus 46 FTQGL~~~-~~g~LyESTG~yG~S~l~~~d~~tg~--~~~~~~l~~~~FgEGit~~--~d~l~qLTWk~~--~~f~yd~~ 118 (264)
T PF05096_consen 46 FTQGLEFL-DDGTLYESTGLYGQSSLRKVDLETGK--VLQSVPLPPRYFGEGITIL--GDKLYQLTWKEG--TGFVYDPN 118 (264)
T ss_dssp EEEEEEEE-ETTEEEEEECSTTEEEEEEEETTTSS--EEEEEE-TTT--EEEEEEE--TTEEEEEESSSS--EEEEEETT
T ss_pred cCccEEec-CCCEEEEeCCCCCcEEEEEEECCCCc--EEEEEECCccccceeEEEE--CCEEEEEEecCC--eEEEEccc
Confidence 35788885 2678998887665 68888876532 2333344555566789888 478999889764 55555554
Q ss_pred CCCceEEEEcC-CCCCceEEEecCCCEEEEEeCCCCeEEEEecCCC-ceEEE-ecCC---CCceeEEEEeCCEEEEEcCC
Q psy5768 435 GFGTESIITTD-ITMPNALALDHQAEKLFWGDARLDKIERCDYDGT-NRIVL-SKIS---PLHPFDMAVYGEFIFWTDWV 508 (652)
Q Consensus 435 G~~~~~l~~~~-l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~-~~~~l-~~~~---~~~p~glav~~~~lYwtd~~ 508 (652)
. -+.+-+-. ....-||+-| +++||.+|. +++|...|...- ..+.+ +... +..---|...+++||---|.
T Consensus 119 t--l~~~~~~~y~~EGWGLt~d--g~~Li~SDG-S~~L~~~dP~~f~~~~~i~V~~~g~pv~~LNELE~i~G~IyANVW~ 193 (264)
T PF05096_consen 119 T--LKKIGTFPYPGEGWGLTSD--GKRLIMSDG-SSRLYFLDPETFKEVRTIQVTDNGRPVSNLNELEYINGKIYANVWQ 193 (264)
T ss_dssp T--TEEEEEEE-SSS--EEEEC--SSCEEEE-S-SSEEEEE-TTT-SEEEEEE-EETTEE---EEEEEEETTEEEEEETT
T ss_pred c--ceEEEEEecCCcceEEEcC--CCEEEEECC-ccceEEECCcccceEEEEEEEECCEECCCcEeEEEEcCEEEEEeCC
Confidence 3 22222211 1233488866 778888885 678888886542 22222 1111 23344577889999999999
Q ss_pred CCeEEEEEccCCceEEEEe
Q psy5768 509 IHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 509 ~~~I~~~~k~~g~~~~~~~ 527 (652)
++.|.++|+.+|.....+.
T Consensus 194 td~I~~Idp~tG~V~~~iD 212 (264)
T PF05096_consen 194 TDRIVRIDPETGKVVGWID 212 (264)
T ss_dssp SSEEEEEETTT-BEEEEEE
T ss_pred CCeEEEEeCCCCeEEEEEE
Confidence 9999999999999877664
No 112
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=93.32 E-value=9.1 Score=37.80 Aligned_cols=172 Identities=17% Similarity=0.236 Sum_probs=93.9
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEecCCC-CCCCCCCCe---eEEEEECCCCEEEEEeccCCcceEEEEEcCCCc--cEE
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILSNES-NDTSTLSKI---SSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTK--RET 76 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~~~~-~~~~~~~~~---~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~--~~~ 76 (652)
.++++-.....+++|+.|+.+..+.+-.- +...+-..+ .+|++-.. +| +.+.+|..+..|+.. .+.
T Consensus 69 ~vItt~Kk~Gl~VYDLsGkqLqs~~~Gk~NNVDLrygF~LgG~~idiaaA------Sd--R~~~~i~~y~Idp~~~~L~s 140 (364)
T COG4247 69 LVITTVKKAGLRVYDLSGKQLQSVNPGKYNNVDLRYGFQLGGQSIDIAAA------SD--RQNDKIVFYKIDPNPQYLES 140 (364)
T ss_pred eEEEeeccCCeEEEecCCCeeeecCCCcccccccccCcccCCeEEEEEec------cc--ccCCeEEEEEeCCCccceee
Confidence 45666677889999999998766543210 000000000 11222111 44 445666665555542 233
Q ss_pred E------EeCCCcCCccCCCCcEEE--EccCCcEEE-EeCCCCEEEEEEc----CCCcEEEEEeC--CCCCceeEEEcCC
Q psy5768 77 V------VSQKKYPAVTACNLHIAV--DWIAQNIYW-SDPKENVIEVARL----TGQYRYVLISG--GVDQPSALAVDPE 141 (652)
Q Consensus 77 v------~~~~~~~~p~~~~~~lav--Dw~~~~lY~-~d~~~~~I~v~~~----dg~~~~~l~~~--~~~~P~~iavd~~ 141 (652)
| ++.+ +..+- |+++ +..++-.|+ .....+.|....+ +|+.+.-++.. .-.+-.+++.|-.
T Consensus 141 itD~n~p~ss~-~s~~Y----Gl~lyrs~ktgd~yvfV~~~qG~~~Qy~l~d~gnGkv~~k~vR~fk~~tQTEG~VaDdE 215 (364)
T COG4247 141 ITDSNAPYSSS-SSSAY----GLALYRSPKTGDYYVFVNRRQGDIAQYKLIDQGNGKVGTKLVRQFKIPTQTEGMVADDE 215 (364)
T ss_pred ccCCCCccccC-cccce----eeEEEecCCcCcEEEEEecCCCceeEEEEEecCCceEcceeeEeeecCCcccceeeccc
Confidence 2 2334 55566 8888 677776764 3334566666554 33333222222 1235669999999
Q ss_pred CCeEEEEecCCCCeEEEEeCC---CCCcEEEEeecccCceeEEEeccCCEEEE
Q psy5768 142 SGYLFWSESGKIPLIARAGLD---GKKQTILAQEIIMPIKDITLDLKFFSAFY 191 (652)
Q Consensus 142 ~g~lywtd~~~~~~I~~~~ld---g~~~~~~~~~~~~~p~gl~lD~~~~~ly~ 191 (652)
.|.||..+- .-.||+...+ |...+. +.. +.....|+-|.++-.||+
T Consensus 216 tG~LYIaeE--dvaiWK~~Aep~~G~~g~~-idr-~~d~~~LtdDvEGltiYy 264 (364)
T COG4247 216 TGFLYIAEE--DVAIWKYEAEPNRGNTGRL-IDR-IKDLSYLTDDVEGLTIYY 264 (364)
T ss_pred cceEEEeec--cceeeecccCCCCCCccch-hhh-hcCchhhcccccccEEEE
Confidence 999999974 4578887654 332222 222 122245677777777777
No 113
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=93.23 E-value=8.4 Score=41.95 Aligned_cols=178 Identities=17% Similarity=0.131 Sum_probs=94.9
Q ss_pred cEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCC--eEEEEEcCCCCCccEEEEEeCCCCCceEE----EEe
Q psy5768 337 TINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDA--TINKIDLDSPKAQRIVVVRLGQHDKPRGI----DID 410 (652)
Q Consensus 337 ~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~--~I~~~~~~~~~~~~~~~~~~~~~~~P~~I----avd 410 (652)
+|+..+++.+..+.+++-.+.-..-++-+.+++|-++....+ .|.++++++.. ... +.+..++ .+.
T Consensus 219 ~i~~~~l~~g~~~~i~~~~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~--~~~------Lt~~~gi~~~Ps~s 290 (425)
T COG0823 219 RIYYLDLNTGKRPVILNFNGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKN--LPR------LTNGFGINTSPSWS 290 (425)
T ss_pred eEEEEeccCCccceeeccCCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCc--cee------cccCCccccCccCC
Confidence 466666666666666643333344566666788877765444 58888887643 111 2222333 345
Q ss_pred CCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCe--EEEEecCCCc-eEEEecC
Q psy5768 411 SCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDK--IERCDYDGTN-RIVLSKI 487 (652)
Q Consensus 411 p~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~--I~~~~ldG~~-~~~l~~~ 487 (652)
|...+|+++.-....|.|++++++|...+.+.. ......--.+.+++++|-+.....+. |...++.... -+.+...
T Consensus 291 pdG~~ivf~Sdr~G~p~I~~~~~~g~~~~riT~-~~~~~~~p~~SpdG~~i~~~~~~~g~~~i~~~~~~~~~~~~~lt~~ 369 (425)
T COG0823 291 PDGSKIVFTSDRGGRPQIYLYDLEGSQVTRLTF-SGGGNSNPVWSPDGDKIVFESSSGGQWDIDKNDLASGGKIRILTST 369 (425)
T ss_pred CCCCEEEEEeCCCCCcceEEECCCCCceeEeec-cCCCCcCccCCCCCCEEEEEeccCCceeeEEeccCCCCcEEEcccc
Confidence 666777766433345799999999998855543 33333334455556666666533233 4444443222 3333332
Q ss_pred CCCceeEEEEeCCEE-EEEcCCCCeEEEEEccCCceE
Q psy5768 488 SPLHPFDMAVYGEFI-FWTDWVIHAVLRANKYTGEEV 523 (652)
Q Consensus 488 ~~~~p~glav~~~~l-Ywtd~~~~~I~~~~k~~g~~~ 523 (652)
....+-..+..+..| |.+.+..+.+...-..+|...
T Consensus 370 ~~~e~ps~~~ng~~i~~~s~~~~~~~l~~~s~~g~~~ 406 (425)
T COG0823 370 YLNESPSWAPNGRMIMFSSGQGGGSVLSLVSLDGRVS 406 (425)
T ss_pred ccCCCCCcCCCCceEEEeccCCCCceEEEeeccceeE
Confidence 333444455555544 444444444443333344433
No 114
>smart00181 EGF Epidermal growth factor-like domain.
Probab=92.88 E-value=0.098 Score=34.71 Aligned_cols=25 Identities=28% Similarity=0.558 Sum_probs=19.9
Q ss_pred CCCcc-ccccCCCCceeeeccCceeec
Q psy5768 556 GNCDD-ICKLDETGQVVCSCFTGKVLM 581 (652)
Q Consensus 556 g~Cs~-lCl~~~~~~~~C~Cp~g~~l~ 581 (652)
..|.+ .|+..++ +++|.|+.||.+.
T Consensus 6 ~~C~~~~C~~~~~-~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNGTCINTPG-SYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCCEEECCCC-CeEeECCCCCccC
Confidence 45766 8997754 7999999999874
No 115
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=92.81 E-value=0.021 Score=52.89 Aligned_cols=81 Identities=23% Similarity=0.477 Sum_probs=49.6
Q ss_pred CCCceeeeccCceeeccCCcccCcccccCC--C-ceeecc-CeecCCcc---CCCCCCCCCCCCCCCCCCCCCCCCCCCe
Q psy5768 566 ETGQVVCSCFTGKVLMEDNRSCTINTVCSE--H-DFKCSD-GMCIPFNQ---TCDRVYNCHDKSDEGILYCAMRDCRPGY 638 (652)
Q Consensus 566 ~~~~~~C~Cp~g~~l~~d~~C~~~~~~C~~--~-~f~C~~-g~Ci~~~~---~Cd~~~dC~d~sde~~~~C~~~~C~~~~ 638 (652)
.++.+.|.|.+||+|.+..+|.+ ...|.. + .-.|++ +.|+.... .=....+|..|..-....|-...|..
T Consensus 16 MSNHfEC~Cnegfvl~~EntCE~-kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vCvp~~C~~-- 92 (197)
T PF06247_consen 16 MSNHFECKCNEGFVLKNENTCEE-KVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVCVPNKCNN-- 92 (197)
T ss_dssp ESSEEEEEESTTEEEEETTEEEE-----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSEEEGGGSS--
T ss_pred ccCceEEEcCCCcEEcccccccc-ceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeEchhhcCc--
Confidence 45589999999999987789987 456754 1 345876 67885431 11122268888776666776667754
Q ss_pred eecCCCCcccC
Q psy5768 639 FKCDNNKCILS 649 (652)
Q Consensus 639 f~C~~~~Ci~~ 649 (652)
+.|.+|+||-.
T Consensus 93 ~~Cg~GKCI~d 103 (197)
T PF06247_consen 93 KDCGSGKCILD 103 (197)
T ss_dssp ---TTEEEEEE
T ss_pred eecCCCeEEec
Confidence 67888999853
No 116
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=92.69 E-value=0.054 Score=36.26 Aligned_cols=18 Identities=50% Similarity=1.119 Sum_probs=15.9
Q ss_pred CCCCeeecCCCCcccCCC
Q psy5768 634 CRPGYFKCDNNKCILSSH 651 (652)
Q Consensus 634 C~~~~f~C~~~~Ci~~~~ 651 (652)
|++++|+|.+++||+.++
T Consensus 1 C~~~~f~C~~~~Ci~~~~ 18 (35)
T cd00112 1 CPPNEFRCANGRCIPSSW 18 (35)
T ss_pred CCCCeEEcCCCCeeCHHH
Confidence 678999999999999765
No 117
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=92.62 E-value=5.9 Score=44.33 Aligned_cols=62 Identities=26% Similarity=0.417 Sum_probs=45.2
Q ss_pred cCCccCCCCcEEEEccCCcEEEEeCCCC-------------------EEEEEEcCCC-------c-EEEEEeC-------
Q psy5768 83 YPAVTACNLHIAVDWIAQNIYWSDPKEN-------------------VIEVARLTGQ-------Y-RYVLISG------- 128 (652)
Q Consensus 83 ~~~p~~~~~~lavDw~~~~lY~~d~~~~-------------------~I~v~~~dg~-------~-~~~l~~~------- 128 (652)
+.+|+ ++++++.++.||++-+... +|.+..+++. . ...+..+
T Consensus 349 f~RpE----gi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~ 424 (524)
T PF05787_consen 349 FDRPE----GITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASG 424 (524)
T ss_pred ccCcc----CeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCcccccc
Confidence 78899 9999999999999865433 7988887764 1 1222222
Q ss_pred ---------CCCCceeEEEcCCCCeEEEEe
Q psy5768 129 ---------GVDQPSALAVDPESGYLFWSE 149 (652)
Q Consensus 129 ---------~~~~P~~iavd~~~g~lywtd 149 (652)
.+..|..|++|| .|.|+..+
T Consensus 425 ~~~~~~~~~~f~sPDNL~~d~-~G~LwI~e 453 (524)
T PF05787_consen 425 NGSNKCDDNGFASPDNLAFDP-DGNLWIQE 453 (524)
T ss_pred cccCcccCCCcCCCCceEECC-CCCEEEEe
Confidence 367899999999 56666654
No 118
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=92.62 E-value=15 Score=38.35 Aligned_cols=195 Identities=14% Similarity=0.155 Sum_probs=110.8
Q ss_pred EcCCCeEEEeec-ccccEEEEeccCCcceEEeeccCcee-eeEEEccCCEEEEEeCCCCeEEEEEcCCCCCc---cEEEE
Q psy5768 322 DYKRKTLFYSDI-QKGTINSVFFNGSNHRVLLERQGSVE-GLAYEYVHNYLYWTCNNDATINKIDLDSPKAQ---RIVVV 396 (652)
Q Consensus 322 D~~~~~lywsd~-~~~~I~~~~~~g~~~~~i~~~~~~~~-glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~---~~~~~ 396 (652)
-..++.+|+.+. ...+|..+++.... ++..+..|. .+.+=+-.+..+ +--+.+++..+.++.. |. +.+-+
T Consensus 103 s~dgk~~~V~N~TPa~SVtVVDl~~~k---vv~ei~~PGC~~iyP~~~~~F~-~lC~DGsl~~v~Ld~~-Gk~~~~~t~~ 177 (342)
T PF06433_consen 103 SADGKFLYVQNFTPATSVTVVDLAAKK---VVGEIDTPGCWLIYPSGNRGFS-MLCGDGSLLTVTLDAD-GKEAQKSTKV 177 (342)
T ss_dssp -TTSSEEEEEEESSSEEEEEEETTTTE---EEEEEEGTSEEEEEEEETTEEE-EEETTSCEEEEEETST-SSEEEEEEEE
T ss_pred ccCCcEEEEEccCCCCeEEEEECCCCc---eeeeecCCCEEEEEecCCCceE-EEecCCceEEEEECCC-CCEeEeeccc
Confidence 344566676664 34567777766532 221221221 123444455544 4445677877777632 22 11111
Q ss_pred EeCCCCCceE--EEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCC--------CC-Cc---eEEEecCCCEEE
Q psy5768 397 RLGQHDKPRG--IDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDI--------TM-PN---ALALDHQAEKLF 462 (652)
Q Consensus 397 ~~~~~~~P~~--Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l--------~~-P~---glaiD~~~~~LY 462 (652)
......|.- =+.....+.+||+... +.|+.+.+.|...+..-...+ .| |- -+|++...+|||
T Consensus 178 -F~~~~dp~f~~~~~~~~~~~~~F~Sy~---G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rly 253 (342)
T PF06433_consen 178 -FDPDDDPLFEHPAYSRDGGRLYFVSYE---GNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLY 253 (342)
T ss_dssp -SSTTTS-B-S--EEETTTTEEEEEBTT---SEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEE
T ss_pred -cCCCCcccccccceECCCCeEEEEecC---CEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEE
Confidence 122233321 1223346778887765 599999999987655543222 22 43 399999999999
Q ss_pred EEeC----CC-----CeEEEEecCCCceEEEecCCCCcee-EEEEeCC---EEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 463 WGDA----RL-----DKIERCDYDGTNRIVLSKISPLHPF-DMAVYGE---FIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 463 w~D~----~~-----~~I~~~~ldG~~~~~l~~~~~~~p~-glav~~~---~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
+.=. ++ ..|+.+|+....|..-+. +.+|- +|++..+ .||-++...+.+...+..+|+.+..+.
T Consensus 254 vLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~--l~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tGk~~~~~~ 329 (342)
T PF06433_consen 254 VLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIP--LEHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAATGKLVRSIE 329 (342)
T ss_dssp EEEEE--TT-TTS-EEEEEEEETTTTEEEEEEE--EEEEESEEEEESSSS-EEEEEETTTTEEEEEETTT--EEEEE-
T ss_pred EEecCCCCCCccCCceEEEEEECCCCeEEEEEe--CCCccceEEEccCCCcEEEEEcCCCCeEEEEeCcCCcEEeehh
Confidence 8721 11 257888777665554443 34554 7888643 888888888899999999998877776
No 119
>smart00181 EGF Epidermal growth factor-like domain.
Probab=92.60 E-value=0.1 Score=34.63 Aligned_cols=29 Identities=28% Similarity=0.757 Sum_probs=22.2
Q ss_pred CCCCCCCCCcc-cceecCCCceEEEeCCccc
Q psy5768 239 PCGVNNGGCAE-LCLYNGVSAVCACAHGVVA 268 (652)
Q Consensus 239 ~C~~~ng~Cs~-lC~~~~~~~~C~C~~G~l~ 268 (652)
+|..+ ..|.+ .|+..+++|+|.|+.||..
T Consensus 1 ~C~~~-~~C~~~~C~~~~~~~~C~C~~g~~g 30 (35)
T smart00181 1 ECASG-GPCSNGTCINTPGSYTCSCPPGYTG 30 (35)
T ss_pred CCCCc-CCCCCCEEECCCCCeEeECCCCCcc
Confidence 35443 56887 8998777999999999843
No 120
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=92.59 E-value=0.38 Score=39.52 Aligned_cols=42 Identities=12% Similarity=0.244 Sum_probs=32.8
Q ss_pred cCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEec
Q psy5768 434 SGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDY 476 (652)
Q Consensus 434 dG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~l 476 (652)
||+..++ +...+..||||++|+..+.||+++...+.|..+..
T Consensus 42 d~~~~~~-va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~ 83 (86)
T PF01731_consen 42 DGKEVKV-VASGFSFANGIAISPDKKYLYVASSLAHSIHVYKR 83 (86)
T ss_pred eCCEeEE-eeccCCCCceEEEcCCCCEEEEEeccCCeEEEEEe
Confidence 3444333 44578899999999999999999998888877654
No 121
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=91.77 E-value=0.1 Score=34.33 Aligned_cols=19 Identities=53% Similarity=1.160 Sum_probs=16.1
Q ss_pred CCCCCeeecCCCCcccCCC
Q psy5768 633 DCRPGYFKCDNNKCILSSH 651 (652)
Q Consensus 633 ~C~~~~f~C~~~~Ci~~~~ 651 (652)
.|+.++|+|.++.||+.++
T Consensus 1 ~C~~~~f~C~~~~Ci~~~~ 19 (33)
T smart00192 1 TCPPGEFQCDNGRCIPLSW 19 (33)
T ss_pred CCCCCeEECCCCCEECchh
Confidence 3777899999999999875
No 122
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=91.45 E-value=0.59 Score=38.37 Aligned_cols=40 Identities=18% Similarity=0.208 Sum_probs=34.1
Q ss_pred CcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEc
Q psy5768 346 SNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDL 385 (652)
Q Consensus 346 ~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~ 385 (652)
+...++.+++..|.||++|..++.||.++...+.|.+...
T Consensus 44 ~~~~~va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~ 83 (86)
T PF01731_consen 44 KEVKVVASGFSFANGIAISPDKKYLYVASSLAHSIHVYKR 83 (86)
T ss_pred CEeEEeeccCCCCceEEEcCCCCEEEEEeccCCeEEEEEe
Confidence 3345566888999999999999999999999999988764
No 123
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=91.14 E-value=13 Score=38.98 Aligned_cols=106 Identities=13% Similarity=0.129 Sum_probs=64.2
Q ss_pred cceEEEEEEEcCCCeEEEe-eccc----ccEEEEeccCC-----cce-----EEee--c--c----CceeeeEEEccCCE
Q psy5768 313 MKNIIELSYDYKRKTLFYS-DIQK----GTINSVFFNGS-----NHR-----VLLE--R--Q----GSVEGLAYEYVHNY 369 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lyws-d~~~----~~I~~~~~~g~-----~~~-----~i~~--~--~----~~~~glAvDw~~~~ 369 (652)
++...|++|+...+++|.. |... .++++..+... ..+ .+.. + + -.++||++ ...+.
T Consensus 19 ~GGlSgl~~~~~~~~~~avSD~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~G~~~~~~~~D~Egi~~-~~~g~ 97 (326)
T PF13449_consen 19 FGGLSGLDYDPDDGRFYAVSDRGPNKGPPRFYTFRIDYDQGGIGGVTILDMIPLRDPDGQPFPKNGLDPEGIAV-PPDGS 97 (326)
T ss_pred cCcEeeEEEeCCCCEEEEEECCCCCCCCCcEEEEEeeccCCCccceEeccceeccCCCCCcCCcCCCChhHeEE-ecCCC
Confidence 4566799999766654332 3322 23776665431 111 1111 1 1 17889999 67889
Q ss_pred EEEEeCCC------CeEEEEEcCCCCCccEEE-EE------------eCCCCCceEEEEeCCCCEEEEEec
Q psy5768 370 LYWTCNND------ATINKIDLDSPKAQRIVV-VR------------LGQHDKPRGIDIDSCDSRIYWTNW 421 (652)
Q Consensus 370 LYwtd~~~------~~I~~~~~~~~~~~~~~~-~~------------~~~~~~P~~Iavdp~~g~Lywtd~ 421 (652)
+||++... ..|.+++.+|.-. +.+ +- .....-..+||+.|....||..-.
T Consensus 98 ~~is~E~~~~~~~~p~I~~~~~~G~~~--~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~~E 166 (326)
T PF13449_consen 98 FWISSEGGRTGGIPPRIRRFDLDGRVI--RRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAAME 166 (326)
T ss_pred EEEEeCCccCCCCCCEEEEECCCCccc--ceEccccccccccCccccccCCCCeEEEEECCCCCEEEEEEC
Confidence 99999999 9999999876432 111 10 112345678999997666777643
No 124
>KOG0315|consensus
Probab=91.11 E-value=17 Score=35.89 Aligned_cols=173 Identities=13% Similarity=0.069 Sum_probs=111.6
Q ss_pred EEEecCCCCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEe-C
Q psy5768 3 IAVSSPTQSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVS-Q 80 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~-~ 80 (652)
|+|+.....-|+.+-.. |.=.++|- . +=+++-.+.+.|.++.|-. ..+..|..++++..+...+.+ .
T Consensus 12 iLvsA~YDhTIRfWqa~tG~C~rTiq-h------~dsqVNrLeiTpdk~~LAa----a~~qhvRlyD~~S~np~Pv~t~e 80 (311)
T KOG0315|consen 12 ILVSAGYDHTIRFWQALTGICSRTIQ-H------PDSQVNRLEITPDKKDLAA----AGNQHVRLYDLNSNNPNPVATFE 80 (311)
T ss_pred EEEeccCcceeeeeehhcCeEEEEEe-c------CccceeeEEEcCCcchhhh----ccCCeeEEEEccCCCCCceeEEe
Confidence 78888888899888655 54444444 3 2256777888876665543 356778888887654322221 1
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEe
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAG 160 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ 160 (652)
+....+. .+.+-- .+++-++-+..+.+.+-++..-.+....+.. .....++++|..+.|+..|. .+.|+.-+
T Consensus 81 ~h~kNVt----aVgF~~-dgrWMyTgseDgt~kIWdlR~~~~qR~~~~~-spVn~vvlhpnQteLis~dq--sg~irvWD 152 (311)
T KOG0315|consen 81 GHTKNVT----AVGFQC-DGRWMYTGSEDGTVKIWDLRSLSCQRNYQHN-SPVNTVVLHPNQTELISGDQ--SGNIRVWD 152 (311)
T ss_pred ccCCceE----EEEEee-cCeEEEecCCCceEEEEeccCcccchhccCC-CCcceEEecCCcceEEeecC--CCcEEEEE
Confidence 2134566 667764 4666677777888887787764444444332 45678999999999999984 45677766
Q ss_pred CCCC-CcEEEEeecccCceeEEEeccCCEEEEEeC
Q psy5768 161 LDGK-KQTILAQEIIMPIKDITLDLKFFSAFYRNL 194 (652)
Q Consensus 161 ldg~-~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~ 194 (652)
|-.. ....++.+.......|++++.+.+|--++-
T Consensus 153 l~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nn 187 (311)
T KOG0315|consen 153 LGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANN 187 (311)
T ss_pred ccCCccccccCCCCCcceeeEEEcCCCcEEEEecC
Confidence 6433 344556666667788899887766644433
No 125
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=91.06 E-value=0.16 Score=33.23 Aligned_cols=29 Identities=28% Similarity=0.619 Sum_probs=20.2
Q ss_pred CCCCCCCCCCCCccccccCCCCceeeeccCceee
Q psy5768 547 AKTPCRHLNGNCDDICKLDETGQVVCSCFTGKVL 580 (652)
Q Consensus 547 ~~~~C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~l 580 (652)
..+||.. +| .|+....++|+|.|+.||..
T Consensus 2 ~~~~C~n-~g----~C~~~~~~~y~C~C~~G~~G 30 (32)
T PF00008_consen 2 SSNPCQN-GG----TCIDLPGGGYTCECPPGYTG 30 (32)
T ss_dssp TTTSSTT-TE----EEEEESTSEEEEEEBTTEES
T ss_pred CCCcCCC-Ce----EEEeCCCCCEEeECCCCCcc
Confidence 3467763 23 56666645899999999864
No 126
>KOG0273|consensus
Probab=90.41 E-value=7.9 Score=41.33 Aligned_cols=125 Identities=16% Similarity=0.202 Sum_probs=85.9
Q ss_pred EecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEc-CCCccEEEEeCCCc
Q psy5768 5 VSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFM-DGTKRETVVSQKKY 83 (652)
Q Consensus 5 v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~-dgs~~~~v~~~~~~ 83 (652)
+.-.-++.++.++.+|....++.-++ ..+.+|-+.-++.+|.=.+ ..+++..++. .|+- .+.++ +
T Consensus 251 atG~~~G~~riw~~~G~l~~tl~~Hk-------gPI~slKWnk~G~yilS~~---vD~ttilwd~~~g~~-~q~f~---~ 316 (524)
T KOG0273|consen 251 ATGSEDGEARIWNKDGNLISTLGQHK-------GPIFSLKWNKKGTYILSGG---VDGTTILWDAHTGTV-KQQFE---F 316 (524)
T ss_pred EEeecCcEEEEEecCchhhhhhhccC-------CceEEEEEcCCCCEEEecc---CCccEEEEeccCceE-EEeee---e
Confidence 33345678889999999888887654 4778888887777777554 3445544443 3443 33333 3
Q ss_pred CCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEE
Q psy5768 84 PAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWS 148 (652)
Q Consensus 84 ~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywt 148 (652)
+... ++-|||+++.=|.+-...+.|.|+..+++.-..-+.+-.....+|.-|| .|.|.-|
T Consensus 317 ~s~~----~lDVdW~~~~~F~ts~td~~i~V~kv~~~~P~~t~~GH~g~V~alk~n~-tg~LLaS 376 (524)
T KOG0273|consen 317 HSAP----ALDVDWQSNDEFATSSTDGCIHVCKVGEDRPVKTFIGHHGEVNALKWNP-TGSLLAS 376 (524)
T ss_pred ccCC----ccceEEecCceEeecCCCceEEEEEecCCCcceeeecccCceEEEEECC-CCceEEE
Confidence 4444 7899999999999988899999999988754333333346777888998 4555554
No 127
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=90.32 E-value=1.2 Score=30.49 Aligned_cols=41 Identities=22% Similarity=0.304 Sum_probs=30.5
Q ss_pred cCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEe
Q psy5768 366 VHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDID 410 (652)
Q Consensus 366 ~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavd 410 (652)
.++.||.++...++|.+++... .+.+-.......|++|+++
T Consensus 2 d~~~lyv~~~~~~~v~~id~~~----~~~~~~i~vg~~P~~i~~~ 42 (42)
T TIGR02276 2 DGTKLYVTNSGSNTVSVIDTAT----NKVIATIPVGGYPFGVAVS 42 (42)
T ss_pred CCCEEEEEeCCCCEEEEEECCC----CeEEEEEECCCCCceEEeC
Confidence 4678999999999999998754 2333334456889999875
No 128
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=89.43 E-value=3.2 Score=46.46 Aligned_cols=69 Identities=17% Similarity=0.212 Sum_probs=47.9
Q ss_pred CCCCeeEEEEECCCCEEEEEeccCC-----------------cceEEEEEcCCC-------ccEEEEeCC----------
Q psy5768 36 TLSKISSIAVWPVKGKMFWSNVTKQ-----------------VVTIEMAFMDGT-------KRETVVSQK---------- 81 (652)
Q Consensus 36 ~~~~~~~v~~d~~~~~lyw~d~~~~-----------------~~~I~~~~~dgs-------~~~~v~~~~---------- 81 (652)
.+.+|.+|.++|.+++||++-.... .+.|+|+.+++. ..+.++..+
T Consensus 348 ~f~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~ 427 (524)
T PF05787_consen 348 PFDRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGS 427 (524)
T ss_pred cccCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCccccccccc
Confidence 5789999999999999999862222 238999998865 334443322
Q ss_pred ------CcCCccCCCCcEEEEccCCcEEE-EeCCC
Q psy5768 82 ------KYPAVTACNLHIAVDWIAQNIYW-SDPKE 109 (652)
Q Consensus 82 ------~~~~p~~~~~~lavDw~~~~lY~-~d~~~ 109 (652)
.+..|. +|++|.. ++|++ +|...
T Consensus 428 ~~~~~~~f~sPD----NL~~d~~-G~LwI~eD~~~ 457 (524)
T PF05787_consen 428 NKCDDNGFASPD----NLAFDPD-GNLWIQEDGGG 457 (524)
T ss_pred CcccCCCcCCCC----ceEECCC-CCEEEEeCCCC
Confidence 266789 9999984 55655 55443
No 129
>KOG0266|consensus
Probab=88.96 E-value=19 Score=39.68 Aligned_cols=130 Identities=16% Similarity=0.208 Sum_probs=86.8
Q ss_pred EEEecCCCCeEEEEec-CC-CeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC
Q psy5768 3 IAVSSPTQSKIVVCNL-EG-EYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 80 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~-~g-~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~ 80 (652)
++++......|.++++ +. ..+.++..+ ...+.+++|++.. .++.+= ...+.|+.+++.+......+..
T Consensus 217 ~l~s~s~D~tiriwd~~~~~~~~~~l~gH-------~~~v~~~~f~p~g-~~i~Sg--s~D~tvriWd~~~~~~~~~l~~ 286 (456)
T KOG0266|consen 217 YLLSGSDDKTLRIWDLKDDGRNLKTLKGH-------STYVTSVAFSPDG-NLLVSG--SDDGTVRIWDVRTGECVRKLKG 286 (456)
T ss_pred EEEEecCCceEEEeeccCCCeEEEEecCC-------CCceEEEEecCCC-CEEEEe--cCCCcEEEEeccCCeEEEeeec
Confidence 5677778889999999 44 455666653 2577999999998 555555 5778999999886443444455
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcE--EEEEeCCCCCc---eeEEEcCCCCeEEEEec
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYR--YVLISGGVDQP---SALAVDPESGYLFWSES 150 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~--~~l~~~~~~~P---~~iavd~~~g~lywtd~ 150 (652)
. ..... ++++.. .++++++-+..+.|.+-|..+... ...+.. ...| +.+..+|. |...|+-+
T Consensus 287 h-s~~is----~~~f~~-d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~-~~~~~~~~~~~fsp~-~~~ll~~~ 353 (456)
T KOG0266|consen 287 H-SDGIS----GLAFSP-DGNLLVSASYDGTIRVWDLETGSKLCLKLLSG-AENSAPVTSVQFSPN-GKYLLSAS 353 (456)
T ss_pred c-CCceE----EEEECC-CCCEEEEcCCCccEEEEECCCCceeeeecccC-CCCCCceeEEEECCC-CcEEEEec
Confidence 4 55677 888986 556666667788999999887662 222322 2344 67777774 44445543
No 130
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=88.74 E-value=0.38 Score=31.54 Aligned_cols=24 Identities=25% Similarity=0.599 Sum_probs=18.7
Q ss_pred CCCc--ccceecCCCceEEEeCCccc
Q psy5768 245 GGCA--ELCLYNGVSAVCACAHGVVA 268 (652)
Q Consensus 245 g~Cs--~lC~~~~~~~~C~C~~G~l~ 268 (652)
..|. +.|...+.+|+|.|+.||..
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g 31 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTG 31 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCcc
Confidence 4555 67887777899999999854
No 131
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=88.62 E-value=0.24 Score=33.33 Aligned_cols=29 Identities=28% Similarity=0.614 Sum_probs=20.7
Q ss_pred CCCCCCCCcc--ccccCCCCceeeeccCceee
Q psy5768 551 CRHLNGNCDD--ICKLDETGQVVCSCFTGKVL 580 (652)
Q Consensus 551 C~~~ng~Cs~--lCl~~~~~~~~C~Cp~g~~l 580 (652)
|..+|++|+. .|...+. +++|.|..||..
T Consensus 1 C~~~~~~C~~nA~C~~~~~-~~~C~C~~Gy~G 31 (36)
T PF12947_consen 1 CLENNGGCHPNATCTNTGG-SYTCTCKPGYEG 31 (36)
T ss_dssp TTTGGGGS-TTCEEEE-TT-SEEEEE-CEEEC
T ss_pred CCCCCCCCCCCcEeecCCC-CEEeECCCCCcc
Confidence 5667788853 4888877 799999999976
No 132
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=88.58 E-value=0.46 Score=32.11 Aligned_cols=22 Identities=23% Similarity=0.576 Sum_probs=16.8
Q ss_pred CCcc--ccccCCCCceeeeccCcee
Q psy5768 557 NCDD--ICKLDETGQVVCSCFTGKV 579 (652)
Q Consensus 557 ~Cs~--lCl~~~~~~~~C~Cp~g~~ 579 (652)
.|.+ .|+..++ +++|.|+.||.
T Consensus 10 ~C~~~~~C~~~~g-~~~C~C~~g~~ 33 (39)
T smart00179 10 PCQNGGTCVNTVG-SYRCECPPGYT 33 (39)
T ss_pred CcCCCCEeECCCC-CeEeECCCCCc
Confidence 3544 7886665 59999999987
No 133
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=88.09 E-value=27 Score=34.03 Aligned_cols=61 Identities=13% Similarity=0.125 Sum_probs=43.3
Q ss_pred CCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeE-EEEeCCEEEEEcCCCCeEEEEEccCCce
Q psy5768 458 AEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFD-MAVYGEFIFWTDWVIHAVLRANKYTGEE 522 (652)
Q Consensus 458 ~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~g-lav~~~~lYwtd~~~~~I~~~~k~~g~~ 522 (652)
++++|.+..... +..+++....+.. .. ....+.+ ....++.||..+ ..+.|+.++..+|+.
T Consensus 173 ~~~v~~~~~~g~-~~~~d~~tg~~~w-~~-~~~~~~~~~~~~~~~l~~~~-~~~~l~~~d~~tG~~ 234 (238)
T PF13360_consen 173 DGRVYVSSGDGR-VVAVDLATGEKLW-SK-PISGIYSLPSVDGGTLYVTS-SDGRLYALDLKTGKV 234 (238)
T ss_dssp TTEEEEECCTSS-EEEEETTTTEEEE-EE-CSS-ECECEECCCTEEEEEE-TTTEEEEEETTTTEE
T ss_pred CCEEEEEcCCCe-EEEEECCCCCEEE-Ee-cCCCccCCceeeCCEEEEEe-CCCEEEEEECCCCCE
Confidence 569999876554 4555777666442 22 2455666 667789999999 789999999999874
No 134
>KOG0285|consensus
Probab=87.76 E-value=37 Score=35.24 Aligned_cols=101 Identities=18% Similarity=0.199 Sum_probs=66.6
Q ss_pred cccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCC
Q psy5768 311 TMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPK 389 (652)
Q Consensus 311 ~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~ 389 (652)
.+++.+-.+++|+. +.-|.+-.....|.-.++........+ .-+..++|+||....-.||-+ ...+.|.--+|...
T Consensus 149 gHlgWVr~vavdP~-n~wf~tgs~DrtikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~-gedk~VKCwDLe~n- 225 (460)
T KOG0285|consen 149 GHLGWVRSVAVDPG-NEWFATGSADRTIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSA-GEDKQVKCWDLEYN- 225 (460)
T ss_pred hccceEEEEeeCCC-ceeEEecCCCceeEEEEcccCeEEEeecchhheeeeeeecccCceEEEe-cCCCeeEEEechhh-
Confidence 45678889999998 456667777788888887765444444 356789999999877777654 33455655565431
Q ss_pred CccEEEEE--eCCCCCceEEEEeCCCCEEEE
Q psy5768 390 AQRIVVVR--LGQHDKPRGIDIDSCDSRIYW 418 (652)
Q Consensus 390 ~~~~~~~~--~~~~~~P~~Iavdp~~g~Lyw 418 (652)
.++. .+.+...+.++++|.-..|+-
T Consensus 226 ----kvIR~YhGHlS~V~~L~lhPTldvl~t 252 (460)
T KOG0285|consen 226 ----KVIRHYHGHLSGVYCLDLHPTLDVLVT 252 (460)
T ss_pred ----hhHHHhccccceeEEEeccccceeEEe
Confidence 1111 134556677888887666654
No 135
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=87.75 E-value=2.4 Score=28.98 Aligned_cols=40 Identities=20% Similarity=0.258 Sum_probs=29.5
Q ss_pred CCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEE
Q psy5768 457 QAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAV 497 (652)
Q Consensus 457 ~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav 497 (652)
.+++||.++...+.|..+|........-+.. ..+|.+|++
T Consensus 2 d~~~lyv~~~~~~~v~~id~~~~~~~~~i~v-g~~P~~i~~ 41 (42)
T TIGR02276 2 DGTKLYVTNSGSNTVSVIDTATNKVIATIPV-GGYPFGVAV 41 (42)
T ss_pred CCCEEEEEeCCCCEEEEEECCCCeEEEEEEC-CCCCceEEe
Confidence 4789999999999999999865443333333 378988875
No 136
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=87.69 E-value=15 Score=40.01 Aligned_cols=144 Identities=14% Similarity=0.131 Sum_probs=84.0
Q ss_pred CCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCC-CcCCcc
Q psy5768 9 TQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQK-KYPAVT 87 (652)
Q Consensus 9 ~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~-~~~~p~ 87 (652)
.+-.|+++|+.++....+-...+ .-..|. +.|.+.+|+++.-+.+...|++++++|+..+.+...+ ....|.
T Consensus 260 g~~~iy~~dl~~~~~~~Lt~~~g----i~~~Ps---~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~riT~~~~~~~~p~ 332 (425)
T COG0823 260 GSPDIYLMDLDGKNLPRLTNGFG----INTSPS---WSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTRLTFSGGGNSNPV 332 (425)
T ss_pred CCccEEEEcCCCCcceecccCCc----cccCcc---CCCCCCEEEEEeCCCCCcceEEECCCCCceeEeeccCCCCcCcc
Confidence 45788999999998766443222 112444 8888999888663445678999999999866654433 011222
Q ss_pred CCCCcEEEEccCCcEEEEeCCCCE--EEEEEcCC-CcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCC
Q psy5768 88 ACNLHIAVDWIAQNIYWSDPKENV--IEVARLTG-QYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGK 164 (652)
Q Consensus 88 ~~~~~lavDw~~~~lY~~d~~~~~--I~v~~~dg-~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~ 164 (652)
+..-++.|-+.....+. |...++.. ...+.+-......+...+.+- ...||.+..+..+.+.-..++|.
T Consensus 333 -------~SpdG~~i~~~~~~~g~~~i~~~~~~~~~~~~~lt~~~~~e~ps~~~ng-~~i~~~s~~~~~~~l~~~s~~g~ 404 (425)
T COG0823 333 -------WSPDGDKIVFESSSGGQWDIDKNDLASGGKIRILTSTYLNESPSWAPNG-RMIMFSSGQGGGSVLSLVSLDGR 404 (425)
T ss_pred -------CCCCCCEEEEEeccCCceeeEEeccCCCCcEEEccccccCCCCCcCCCC-ceEEEeccCCCCceEEEeeccce
Confidence 12223444444322333 55555532 224555555566677777764 45666666554556666666666
Q ss_pred CcE
Q psy5768 165 KQT 167 (652)
Q Consensus 165 ~~~ 167 (652)
...
T Consensus 405 ~~~ 407 (425)
T COG0823 405 VSR 407 (425)
T ss_pred eEE
Confidence 554
No 137
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=87.40 E-value=36 Score=34.61 Aligned_cols=181 Identities=14% Similarity=0.160 Sum_probs=103.9
Q ss_pred EEEEcCCCeEEEeecccccEEEEeccCCcceEEeec--c-CceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEE
Q psy5768 319 LSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLER--Q-GSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVV 395 (652)
Q Consensus 319 v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~--~-~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~ 395 (652)
.|.-..++++|++|..++ +.-+++..-...++... . +-..++++ -++.+|.+|-..+ ...+++..+.. ..+
T Consensus 90 ~Dv~vse~yvyvad~ssG-L~IvDIS~P~sP~~~~~lnt~gyaygv~v--sGn~aYVadlddg-fLivdvsdpss--P~l 163 (370)
T COG5276 90 ADVRVSEEYVYVADWSSG-LRIVDISTPDSPTLIGFLNTDGYAYGVYV--SGNYAYVADLDDG-FLIVDVSDPSS--PQL 163 (370)
T ss_pred heeEecccEEEEEcCCCc-eEEEeccCCCCcceeccccCCceEEEEEe--cCCEEEEeeccCc-EEEEECCCCCC--cee
Confidence 455667899999997654 55556554333333322 1 33455555 4899999997544 44566654432 122
Q ss_pred EEe--CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEE
Q psy5768 396 VRL--GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIER 473 (652)
Q Consensus 396 ~~~--~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~ 473 (652)
... .....-..++|.- .+-|.+.|+. .+......--..-+++..-=..|..-++-...+|.|.++...+ +.-
T Consensus 164 agrya~~~~d~~~v~ISG--n~AYvA~~d~---GL~ivDVSnp~sPvli~~~n~g~g~~sv~vsdnr~y~vvy~eg-vli 237 (370)
T COG5276 164 AGRYALPGGDTHDVAISG--NYAYVAWRDG---GLTIVDVSNPHSPVLIGSYNTGPGTYSVSVSDNRAYLVVYDEG-VLI 237 (370)
T ss_pred eeeeccCCCCceeEEEec--CeEEEEEeCC---CeEEEEccCCCCCeEEEEEecCCceEEEEecCCeeEEEEcccc-eEE
Confidence 211 1112224577764 6677777763 3333444333333444321123444555566889999987654 556
Q ss_pred EecCCCceEEEe-cCCCCceeEE---EEeCCEEEEEcCCCCe
Q psy5768 474 CDYDGTNRIVLS-KISPLHPFDM---AVYGEFIFWTDWVIHA 511 (652)
Q Consensus 474 ~~ldG~~~~~l~-~~~~~~p~gl---av~~~~lYwtd~~~~~ 511 (652)
++.+|...-+++ .-....|.++ .+-+++.|..|-.++-
T Consensus 238 vd~s~~ssp~~~gsyet~~p~~~s~v~Vs~~~~Yvadga~gl 279 (370)
T COG5276 238 VDVSGPSSPTVFGSYETSNPVSISTVPVSGEYAYVADGAKGL 279 (370)
T ss_pred EecCCCCCceEeeccccCCcccccceecccceeeeeccccCc
Confidence 777775533333 3335678777 6779999999977664
No 138
>KOG2397|consensus
Probab=87.27 E-value=0.62 Score=49.78 Aligned_cols=51 Identities=31% Similarity=0.679 Sum_probs=41.0
Q ss_pred ceeeeccCceeeccCCcccCcccccCCCceeecc-C---eecCCccCCCCCCCCCCCCCCCC
Q psy5768 569 QVVCSCFTGKVLMEDNRSCTINTVCSEHDFKCSD-G---MCIPFNQTCDRVYNCHDKSDEGI 626 (652)
Q Consensus 569 ~~~C~Cp~g~~l~~d~~C~~~~~~C~~~~f~C~~-g---~Ci~~~~~Cd~~~dC~d~sde~~ 626 (652)
...|.|++| .| .+-...|....|.|.| | .-|+..-.=||+.||-|||||-.
T Consensus 61 Dd~CDC~DG----sD---EPGtsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~ 115 (480)
T KOG2397|consen 61 DDSCDCLDG----SD---EPGTSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYL 115 (480)
T ss_pred cccccCCCC----CC---CCccccCCCCceeeeecCCCceeeechhccCcccccccCCCCcc
Confidence 678999999 44 0224568888999997 2 68888888899999999999975
No 139
>KOG1219|consensus
Probab=87.08 E-value=0.68 Score=57.73 Aligned_cols=56 Identities=32% Similarity=0.679 Sum_probs=37.5
Q ss_pred CCCCCCCCCCCCCCCCccccccCCCCceeeeccCceeeccCCcccCc-ccccCCCceeecc-CeecCCc
Q psy5768 543 LDACAKTPCRHLNGNCDDICKLDETGQVVCSCFTGKVLMEDNRSCTI-NTVCSEHDFKCSD-GMCIPFN 609 (652)
Q Consensus 543 ~~~~~~~~C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~l~~d~~C~~~-~~~C~~~~f~C~~-g~Ci~~~ 609 (652)
..+|..|||.. || .|.+.++| +.|.||.||.. ++|... ...|.. -.|.+ |.|++..
T Consensus 3903 ~epC~snPC~~--Gg---tCip~~n~-f~CnC~~gyTG---~~Ce~~Gi~eCs~--n~C~~gg~C~n~~ 3960 (4289)
T KOG1219|consen 3903 LEPCASNPCLT--GG---TCIPFYNG-FLCNCPNGYTG---KRCEARGISECSK--NVCGTGGQCINIP 3960 (4289)
T ss_pred cccccCCCCCC--CC---EEEecCCC-eeEeCCCCccC---ceeeccccccccc--ccccCCceeeccC
Confidence 45666677763 33 57887774 99999999865 678764 445652 34666 4888653
No 140
>KOG1446|consensus
Probab=86.61 E-value=40 Score=34.35 Aligned_cols=140 Identities=18% Similarity=0.173 Sum_probs=91.4
Q ss_pred EEecCCCCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCC
Q psy5768 4 AVSSPTQSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKK 82 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~ 82 (652)
++++...+-|+.+|.. |+.++++-... -.+.-+-|-.....+.-+. +..+..|+-.++.-.-....+...
T Consensus 29 litss~dDsl~LYd~~~g~~~~ti~skk-------yG~~~~~Fth~~~~~i~sS-tk~d~tIryLsl~dNkylRYF~GH- 99 (311)
T KOG1446|consen 29 LITSSEDDSLRLYDSLSGKQVKTINSKK-------YGVDLACFTHHSNTVIHSS-TKEDDTIRYLSLHDNKYLRYFPGH- 99 (311)
T ss_pred EEEecCCCeEEEEEcCCCceeeEeeccc-------ccccEEEEecCCceEEEcc-CCCCCceEEEEeecCceEEEcCCC-
Confidence 3445566788888655 66666665431 1122222333334433333 145678888777654433444555
Q ss_pred cCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeC
Q psy5768 83 YPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGL 161 (652)
Q Consensus 83 ~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~l 161 (652)
-..+. +|++-++. ..+.+-+..+.|..-|+.-+.+..++. +..+--.|.|| .|.+|-+-.++. .|...++
T Consensus 100 ~~~V~----sL~~sP~~-d~FlS~S~D~tvrLWDlR~~~cqg~l~--~~~~pi~AfDp-~GLifA~~~~~~-~IkLyD~ 169 (311)
T KOG1446|consen 100 KKRVN----SLSVSPKD-DTFLSSSLDKTVRLWDLRVKKCQGLLN--LSGRPIAAFDP-EGLIFALANGSE-LIKLYDL 169 (311)
T ss_pred CceEE----EEEecCCC-CeEEecccCCeEEeeEecCCCCceEEe--cCCCcceeECC-CCcEEEEecCCC-eEEEEEe
Confidence 56677 99999866 688888888899999999888888876 45666789999 789998877654 6766654
No 141
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=86.52 E-value=0.68 Score=30.81 Aligned_cols=30 Identities=23% Similarity=0.598 Sum_probs=21.8
Q ss_pred CCCCCCCCCCC--cccceecCCCceEEEeCCcc
Q psy5768 237 TNPCGVNNGGC--AELCLYNGVSAVCACAHGVV 267 (652)
Q Consensus 237 ~n~C~~~ng~C--s~lC~~~~~~~~C~C~~G~l 267 (652)
.++|... ..| .+.|.....+|.|.|+.||.
T Consensus 2 ~~~C~~~-~~C~~~~~C~~~~~~~~C~C~~g~~ 33 (38)
T cd00054 2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYT 33 (38)
T ss_pred cccCCCC-CCcCCCCEeECCCCCeEeECCCCCc
Confidence 4667642 356 35788777789999999984
No 142
>KOG0310|consensus
Probab=86.23 E-value=46 Score=35.85 Aligned_cols=153 Identities=10% Similarity=0.022 Sum_probs=102.9
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEc
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARL 117 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~ 117 (652)
..+..+-|.+.++.++.+- .....+..+++++...+.-++.. -.+++ ..++-..+++|.++-+..+.|..-+.
T Consensus 111 apv~~~~f~~~d~t~l~s~--sDd~v~k~~d~s~a~v~~~l~~h-tDYVR----~g~~~~~~~hivvtGsYDg~vrl~Dt 183 (487)
T KOG0310|consen 111 APVHVTKFSPQDNTMLVSG--SDDKVVKYWDLSTAYVQAELSGH-TDYVR----CGDISPANDHIVVTGSYDGKVRLWDT 183 (487)
T ss_pred CceeEEEecccCCeEEEec--CCCceEEEEEcCCcEEEEEecCC-cceeE----eeccccCCCeEEEecCCCceEEEEEe
Confidence 4667788999999999887 56667777888877654345555 67788 88888889999999998999988887
Q ss_pred CCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEE-EEeecccCceeEEEeccCCEEEEEeCCC
Q psy5768 118 TGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTI-LAQEIIMPIKDITLDLKFFSAFYRNLSK 196 (652)
Q Consensus 118 dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~-~~~~~~~~p~gl~lD~~~~~ly~~d~~g 196 (652)
.-.- ..+.+-+-..|-.-.+--..|-++.|-.| +.+..-+|-+..... ....-....+.|.+-..+.||+=..+|
T Consensus 184 R~~~-~~v~elnhg~pVe~vl~lpsgs~iasAgG--n~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s~~~rLlS~sLD- 259 (487)
T KOG0310|consen 184 RSLT-SRVVELNHGCPVESVLALPSGSLIASAGG--NSVKVWDLTTGGQLLTSMFNHNKTVTCLRLASDSTRLLSGSLD- 259 (487)
T ss_pred ccCC-ceeEEecCCCceeeEEEcCCCCEEEEcCC--CeEEEEEecCCceehhhhhcccceEEEEEeecCCceEeecccc-
Confidence 6552 23333344566665555567888887655 455555655322221 111113456777777778899888887
Q ss_pred CcEEE
Q psy5768 197 GNIHI 201 (652)
Q Consensus 197 ~~~~~ 201 (652)
++.++
T Consensus 260 ~~VKV 264 (487)
T KOG0310|consen 260 RHVKV 264 (487)
T ss_pred cceEE
Confidence 44444
No 143
>KOG2106|consensus
Probab=85.71 E-value=45 Score=36.26 Aligned_cols=146 Identities=12% Similarity=0.084 Sum_probs=91.2
Q ss_pred EEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEc----C
Q psy5768 370 LYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITT----D 445 (652)
Q Consensus 370 LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~----~ 445 (652)
||+-- ..+.|..-++.+ .-.++.....+.-.++|.+|. ..+|.|-......+|++ + .+.+.+. .
T Consensus 342 i~vGT-trN~iL~Gt~~~----~f~~~v~gh~delwgla~hps-~~q~~T~gqdk~v~lW~---~---~k~~wt~~~~d~ 409 (626)
T KOG2106|consen 342 ILVGT-TRNFILQGTLEN----GFTLTVQGHGDELWGLATHPS-KNQLLTCGQDKHVRLWN---D---HKLEWTKIIEDP 409 (626)
T ss_pred EEEee-ccceEEEeeecC----CceEEEEecccceeeEEcCCC-hhheeeccCcceEEEcc---C---CceeEEEEecCc
Confidence 66543 345566555543 233344455678899999995 45667654332234443 1 1222211 1
Q ss_pred ----CCCCce-EEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeCC--EEEEEcCCCCeEEEEEcc
Q psy5768 446 ----ITMPNA-LALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYGE--FIFWTDWVIHAVLRANKY 518 (652)
Q Consensus 446 ----l~~P~g-laiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~--~lYwtd~~~~~I~~~~k~ 518 (652)
--.|.| ||+-..++++++.|..+..+-.+..++....++.-.....-+++.-.++ |||-.+...+...|+.|-
T Consensus 410 ~~~~~fhpsg~va~Gt~~G~w~V~d~e~~~lv~~~~d~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs~~g~~y~r~~k~ 489 (626)
T KOG2106|consen 410 AECADFHPSGVVAVGTATGRWFVLDTETQDLVTIHTDNEQLSVVRYSPDGAFLAVGSHDNHIYIYRVSANGRKYSRVGKC 489 (626)
T ss_pred eeEeeccCcceEEEeeccceEEEEecccceeEEEEecCCceEEEEEcCCCCEEEEecCCCeEEEEEECCCCcEEEEeeee
Confidence 124654 6777888999999998877777777755555554444456677776676 566678888888999999
Q ss_pred CCceEEEEe
Q psy5768 519 TGEEVYTLR 527 (652)
Q Consensus 519 ~g~~~~~~~ 527 (652)
.|+.++.+.
T Consensus 490 ~gs~ithLD 498 (626)
T KOG2106|consen 490 SGSPITHLD 498 (626)
T ss_pred cCceeEEee
Confidence 897666554
No 144
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=85.70 E-value=0.8 Score=29.92 Aligned_cols=21 Identities=19% Similarity=0.275 Sum_probs=16.5
Q ss_pred cccccCCCCceeeeccCceeec
Q psy5768 560 DICKLDETGQVVCSCFTGKVLM 581 (652)
Q Consensus 560 ~lCl~~~~~~~~C~Cp~g~~l~ 581 (652)
..|...+. .++|.|+.||...
T Consensus 12 ~~C~~~~~-~~~C~C~~g~~g~ 32 (36)
T cd00053 12 GTCVNTPG-SYRCVCPPGYTGD 32 (36)
T ss_pred CEEecCCC-CeEeECCCCCccc
Confidence 56777665 6999999998764
No 145
>PTZ00421 coronin; Provisional
Probab=84.66 E-value=72 Score=35.56 Aligned_cols=145 Identities=7% Similarity=0.033 Sum_probs=78.7
Q ss_pred EEEecCCCCeEEEEecCCCe--------eEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCcc
Q psy5768 3 IAVSSPTQSKIVVCNLEGEY--------QTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKR 74 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~--------~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~ 74 (652)
++++....+.|.++++...- ...+.. .-..+..|+|+|..+.++++- ...+.|..+++.....
T Consensus 90 ~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~g-------H~~~V~~l~f~P~~~~iLaSg--s~DgtVrIWDl~tg~~ 160 (493)
T PTZ00421 90 KLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQG-------HTKKVGIVSFHPSAMNVLASA--GADMVVNVWDVERGKA 160 (493)
T ss_pred EEEEEeCCCEEEEEecCCCccccccCcceEEecC-------CCCcEEEEEeCcCCCCEEEEE--eCCCEEEEEECCCCeE
Confidence 34444556777888775321 112221 235788999999876566665 4678888898875433
Q ss_pred EEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCC-CcEEEEEeCCCCCceeEEEcCCCCeEEEEecC--
Q psy5768 75 ETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTG-QYRYVLISGGVDQPSALAVDPESGYLFWSESG-- 151 (652)
Q Consensus 75 ~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg-~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~-- 151 (652)
...+... ...+. ++++.+ .+++.++-+..+.|.+.|+.. +....+....-.....+...+..+.|+-+-+.
T Consensus 161 ~~~l~~h-~~~V~----sla~sp-dG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~s 234 (493)
T PTZ00421 161 VEVIKCH-SDQIT----SLEWNL-DGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKS 234 (493)
T ss_pred EEEEcCC-CCceE----EEEEEC-CCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCcceEEEEcCCCCeEEEEecCCC
Confidence 3333333 34466 777776 455666666778898888764 32222221111122234444545555543221
Q ss_pred CCCeEEEEeCC
Q psy5768 152 KIPLIARAGLD 162 (652)
Q Consensus 152 ~~~~I~~~~ld 162 (652)
....|..-++.
T Consensus 235 ~Dr~VklWDlr 245 (493)
T PTZ00421 235 QQRQIMLWDTR 245 (493)
T ss_pred CCCeEEEEeCC
Confidence 23345555543
No 146
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=83.76 E-value=0.74 Score=30.10 Aligned_cols=28 Identities=25% Similarity=0.615 Sum_probs=19.4
Q ss_pred CCCcccceecCCCceEEEeCCccccCCCc
Q psy5768 245 GGCAELCLYNGVSAVCACAHGVVAQDGKS 273 (652)
Q Consensus 245 g~Cs~lC~~~~~~~~C~C~~G~l~~dg~~ 273 (652)
..|...|-+.. ...|.||.||++.++.-
T Consensus 6 t~CpA~CDpn~-~~~C~CPeGyIlde~~~ 33 (34)
T PF09064_consen 6 TECPADCDPNS-PGQCFCPEGYILDEGSM 33 (34)
T ss_pred ccCCCccCCCC-CCceeCCCceEecCCcc
Confidence 45777776533 36899999997755543
No 147
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=83.39 E-value=42 Score=35.49 Aligned_cols=93 Identities=10% Similarity=0.099 Sum_probs=58.4
Q ss_pred CceEEEEe--CCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCC-ceEEEecCCCEEEEEeCC----CCeEEEEe
Q psy5768 403 KPRGIDID--SCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMP-NALALDHQAEKLFWGDAR----LDKIERCD 475 (652)
Q Consensus 403 ~P~~Iavd--p~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P-~glaiD~~~~~LYw~D~~----~~~I~~~~ 475 (652)
....+..- ...++|++++ ...-.+|+...++|...+.|...+. .. .-+.+|..+++||+.-.. ...+++++
T Consensus 236 ~~~~~~~~~~~~~~~l~~s~-~~G~~hly~~~~~~~~~~~lT~G~~-~V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~v~ 313 (353)
T PF00930_consen 236 VYDPPHFLGPDGNEFLWISE-RDGYRHLYLYDLDGGKPRQLTSGDW-EVTSILGWDEDNNRIYFTANGDNPGERHLYRVS 313 (353)
T ss_dssp SSSEEEE-TTTSSEEEEEEE-TTSSEEEEEEETTSSEEEESS-SSS--EEEEEEEECTSSEEEEEESSGGTTSBEEEEEE
T ss_pred eecccccccCCCCEEEEEEE-cCCCcEEEEEcccccceeccccCce-eecccceEcCCCCEEEEEecCCCCCceEEEEEE
Confidence 33444443 3445555555 3334589999999987554433221 22 258899999999998654 45899999
Q ss_pred cC-CCceEEEecCCCCceeEEEEe
Q psy5768 476 YD-GTNRIVLSKISPLHPFDMAVY 498 (652)
Q Consensus 476 ld-G~~~~~l~~~~~~~p~glav~ 498 (652)
++ |...+.|......| +.+++-
T Consensus 314 ~~~~~~~~~LT~~~~~~-~~~~~S 336 (353)
T PF00930_consen 314 LDSGGEPKCLTCEDGDH-YSASFS 336 (353)
T ss_dssp TTETTEEEESSTTSSTT-EEEEE-
T ss_pred eCCCCCeEeccCCCCCc-eEEEEC
Confidence 99 88887776654445 455554
No 148
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=82.41 E-value=1.1 Score=29.21 Aligned_cols=18 Identities=22% Similarity=0.523 Sum_probs=14.4
Q ss_pred cceecC-CCceEEEeCCcc
Q psy5768 250 LCLYNG-VSAVCACAHGVV 267 (652)
Q Consensus 250 lC~~~~-~~~~C~C~~G~l 267 (652)
.|+... .+|+|.|+.||.
T Consensus 11 ~C~~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 11 TCIDLPGGGYTCECPPGYT 29 (32)
T ss_dssp EEEEESTSEEEEEEBTTEE
T ss_pred EEEeCCCCCEEeECCCCCc
Confidence 577555 689999999984
No 149
>KOG1225|consensus
Probab=82.09 E-value=2.4 Score=46.76 Aligned_cols=13 Identities=23% Similarity=0.253 Sum_probs=9.5
Q ss_pred ceeeeccCceeec
Q psy5768 569 QVVCSCFTGKVLM 581 (652)
Q Consensus 569 ~~~C~Cp~g~~l~ 581 (652)
.-+|.|+.||...
T Consensus 264 ~G~CIC~~Gf~G~ 276 (525)
T KOG1225|consen 264 EGRCICPPGFTGD 276 (525)
T ss_pred CCeEeCCCCCcCC
Confidence 4578888888753
No 150
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=81.69 E-value=50 Score=31.59 Aligned_cols=129 Identities=9% Similarity=0.049 Sum_probs=77.0
Q ss_pred CeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeC-CCCeEEEEEcCCCCCccEEEEEeCCCCC
Q psy5768 326 KTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCN-NDATINKIDLDSPKAQRIVVVRLGQHDK 403 (652)
Q Consensus 326 ~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~-~~~~I~~~~~~~~~~~~~~~~~~~~~~~ 403 (652)
++-|+.+ ..|++++..+...+.+. ...+.+..++..+.++.+....+ ...+|...++.+ +.+.... ...
T Consensus 32 ~ks~~~~---~~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~~~~v~lyd~~~-----~~i~~~~-~~~ 102 (194)
T PF08662_consen 32 GKSYYGE---FELFYLNEKNIPVESIELKKEGPIHDVAWSPNGNEFAVIYGSMPAKVTLYDVKG-----KKIFSFG-TQP 102 (194)
T ss_pred cceEEee---EEEEEEecCCCccceeeccCCCceEEEEECcCCCEEEEEEccCCcccEEEcCcc-----cEeEeec-CCC
Confidence 4445443 34666665554444443 33335888999988888776654 344677677643 2333332 244
Q ss_pred ceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEe
Q psy5768 404 PRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGD 465 (652)
Q Consensus 404 P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D 465 (652)
...|.-+|...+|..+..+...+.|+-.+++ ..+.+....-...+.++-+++++.|..+.
T Consensus 103 ~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~--~~~~i~~~~~~~~t~~~WsPdGr~~~ta~ 162 (194)
T PF08662_consen 103 RNTISWSPDGRFLVLAGFGNLNGDLEFWDVR--KKKKISTFEHSDATDVEWSPDGRYLATAT 162 (194)
T ss_pred ceEEEECCCCCEEEEEEccCCCcEEEEEECC--CCEEeeccccCcEEEEEEcCCCCEEEEEE
Confidence 5679999988888888766544566665555 44455443333455677777766666554
No 151
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=80.74 E-value=44 Score=32.02 Aligned_cols=60 Identities=10% Similarity=0.104 Sum_probs=39.9
Q ss_pred eEEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcC
Q psy5768 2 FIAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMD 70 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~d 70 (652)
|+++.......|.++|.+++....+.. ...-.|.++|.+++|..+......+.|.-++.+
T Consensus 74 favi~g~~~~~v~lyd~~~~~i~~~~~---------~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~ 133 (194)
T PF08662_consen 74 FAVIYGSMPAKVTLYDVKGKKIFSFGT---------QPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVR 133 (194)
T ss_pred EEEEEccCCcccEEEcCcccEeEeecC---------CCceEEEECCCCCEEEEEEccCCCcEEEEEECC
Confidence 444444445588899998777766653 233468899998888887622224567777776
No 152
>KOG0279|consensus
Probab=80.67 E-value=67 Score=32.36 Aligned_cols=174 Identities=13% Similarity=0.026 Sum_probs=102.3
Q ss_pred eEEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCc-cEEEEeC
Q psy5768 2 FIAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTK-RETVVSQ 80 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~-~~~v~~~ 80 (652)
.|+++......|++.+++......=.+.++ -.+.-+.+.++++.+..++ +++- .-.+.++.+++.++. .+.++ .
T Consensus 29 ~~l~sasrDk~ii~W~L~~dd~~~G~~~r~-~~GHsH~v~dv~~s~dg~~-alS~--swD~~lrlWDl~~g~~t~~f~-G 103 (315)
T KOG0279|consen 29 DILVSASRDKTIIVWKLTSDDIKYGVPVRR-LTGHSHFVSDVVLSSDGNF-ALSA--SWDGTLRLWDLATGESTRRFV-G 103 (315)
T ss_pred ceEEEcccceEEEEEEeccCccccCceeee-eeccceEecceEEccCCce-EEec--cccceEEEEEecCCcEEEEEE-e
Confidence 367777778888888887663322111110 0112357788999866554 4465 566788889998753 34443 3
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCC-CCCceeEEEcCCCCeEEEEecCCCCeEEEE
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGG-VDQPSALAVDPESGYLFWSESGKIPLIARA 159 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~-~~~P~~iavd~~~g~lywtd~~~~~~I~~~ 159 (652)
. ...+. ++|++..+..| ++-+..+.|..-+.-|.-..++.... -+-..-+...|.+...|....+....+..=
T Consensus 104 H-~~dVl----sva~s~dn~qi-vSGSrDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvW 177 (315)
T KOG0279|consen 104 H-TKDVL----SVAFSTDNRQI-VSGSRDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVW 177 (315)
T ss_pred c-CCceE----EEEecCCCcee-ecCCCcceeeeeeecccEEEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEE
Confidence 3 34566 99999766655 66666778888887777666665544 455666777776544444443333344555
Q ss_pred eCCCCCcEEEEeecccCceeEEEeccC
Q psy5768 160 GLDGKKQTILAQEIIMPIKDITLDLKF 186 (652)
Q Consensus 160 ~ldg~~~~~~~~~~~~~p~gl~lD~~~ 186 (652)
+|++-......-..-+..+.+++.+.+
T Consensus 178 nl~~~~l~~~~~gh~~~v~t~~vSpDG 204 (315)
T KOG0279|consen 178 NLRNCQLRTTFIGHSGYVNTVTVSPDG 204 (315)
T ss_pred ccCCcchhhccccccccEEEEEECCCC
Confidence 566554332221112345666666543
No 153
>KOG4649|consensus
Probab=80.64 E-value=65 Score=32.20 Aligned_cols=58 Identities=14% Similarity=0.159 Sum_probs=38.5
Q ss_pred eeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEee
Q psy5768 359 EGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFF 433 (652)
Q Consensus 359 ~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~l 433 (652)
.=+|||..+++|||--.-..+|+- ...+. .+ -+++--.+|.||+-+.... .+|+....
T Consensus 34 ~~~avd~~sG~~~We~ilg~RiE~----------sa~vv-gd-----fVV~GCy~g~lYfl~~~tG-s~~w~f~~ 91 (354)
T KOG4649|consen 34 IVIAVDPQSGNLIWEAILGVRIEC----------SAIVV-GD-----FVVLGCYSGGLYFLCVKTG-SQIWNFVI 91 (354)
T ss_pred eEEEecCCCCcEEeehhhCceeee----------eeEEE-CC-----EEEEEEccCcEEEEEecch-hheeeeee
Confidence 446999999999997655555541 22221 22 2778888999999987653 26665443
No 154
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=80.29 E-value=90 Score=33.64 Aligned_cols=193 Identities=13% Similarity=0.130 Sum_probs=106.8
Q ss_pred CCCeEEEeecc----cccEEEEecc--CCcc-eEEe--eccC------ceeeeEEEccCCEEEEEeCCCC----eEEEEE
Q psy5768 324 KRKTLFYSDIQ----KGTINSVFFN--GSNH-RVLL--ERQG------SVEGLAYEYVHNYLYWTCNNDA----TINKID 384 (652)
Q Consensus 324 ~~~~lywsd~~----~~~I~~~~~~--g~~~-~~i~--~~~~------~~~glAvDw~~~~LYwtd~~~~----~I~~~~ 384 (652)
..+..||.-.. ...++|.... +... ++++ ..+. ...++++.+.++.|-++-...| .|.+.+
T Consensus 77 ~g~~~y~~~~~~~~~~~~~~r~~~~~~~~~~~evllD~n~l~~~~~~~~~~~~~~Spdg~~la~~~s~~G~e~~~l~v~D 156 (414)
T PF02897_consen 77 RGGYYYYSRNQGGKNYPVLYRRKTDEEDGPEEEVLLDPNELAKDGGYVSLGGFSVSPDGKRLAYSLSDGGSEWYTLRVFD 156 (414)
T ss_dssp ETTEEEEEEE-SS-SS-EEEEEETTS-TS-C-EEEEEGGGGSTTSS-EEEEEEEETTTSSEEEEEEEETTSSEEEEEEEE
T ss_pred ECCeEEEEEEcCCCceEEEEEEecccCCCCceEEEEcchHhhccCceEEeeeeeECCCCCEEEEEecCCCCceEEEEEEE
Confidence 35667765322 2345666554 2333 5565 2221 2235788888888877643333 477788
Q ss_pred cCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCC--------CCceEEEeecCCCce--EEEEcCCCCCc---e
Q psy5768 385 LDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSH--------LPSIQRAFFSGFGTE--SIITTDITMPN---A 451 (652)
Q Consensus 385 ~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~--------~~~I~r~~ldG~~~~--~l~~~~l~~P~---g 451 (652)
+... +.+-..-......+++-.+....+|++.+... ..+|++..+.....+ .+... -..+. +
T Consensus 157 l~tg----~~l~d~i~~~~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt~~~~d~lvfe~-~~~~~~~~~ 231 (414)
T PF02897_consen 157 LETG----KFLPDGIENPKFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGTPQSEDELVFEE-PDEPFWFVS 231 (414)
T ss_dssp TTTT----EEEEEEEEEEESEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS-GGG-EEEEC--TTCTTSEEE
T ss_pred CCCC----cCcCCcccccccceEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCCChHhCeeEEee-cCCCcEEEE
Confidence 7542 22211001122334788887777888877653 235787777554443 44433 23333 6
Q ss_pred EEEecCCCEEEEEeC-C-C-CeEEEEecCCC-----ceEEEecCCCCceeEEEEeCCEEEE-EcCC--CCeEEEEEccCC
Q psy5768 452 LALDHQAEKLFWGDA-R-L-DKIERCDYDGT-----NRIVLSKISPLHPFDMAVYGEFIFW-TDWV--IHAVLRANKYTG 520 (652)
Q Consensus 452 laiD~~~~~LYw~D~-~-~-~~I~~~~ldG~-----~~~~l~~~~~~~p~glav~~~~lYw-td~~--~~~I~~~~k~~g 520 (652)
+..+.+++.|++.-. + . ..++.++++.. ..+.+.......-+.+...++.+|+ |+.. +++|.+++..+.
T Consensus 232 ~~~s~d~~~l~i~~~~~~~~s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~ 311 (414)
T PF02897_consen 232 VSRSKDGRYLFISSSSGTSESEVYLLDLDDGGSPDAKPKLLSPREDGVEYYVDHHGDRLYILTNDDAPNGRLVAVDLADP 311 (414)
T ss_dssp EEE-TTSSEEEEEEESSSSEEEEEEEECCCTTTSS-SEEEEEESSSS-EEEEEEETTEEEEEE-TT-TT-EEEEEETTST
T ss_pred EEecCcccEEEEEEEccccCCeEEEEeccccCCCcCCcEEEeCCCCceEEEEEccCCEEEEeeCCCCCCcEEEEeccccc
Confidence 777777888876432 2 2 46888888764 4555554443455667777887775 6543 457888887665
Q ss_pred c
Q psy5768 521 E 521 (652)
Q Consensus 521 ~ 521 (652)
.
T Consensus 312 ~ 312 (414)
T PF02897_consen 312 S 312 (414)
T ss_dssp S
T ss_pred c
Confidence 5
No 155
>KOG4289|consensus
Probab=79.89 E-value=2.3 Score=51.02 Aligned_cols=12 Identities=25% Similarity=0.487 Sum_probs=10.3
Q ss_pred ceeeeccCceee
Q psy5768 569 QVVCSCFTGKVL 580 (652)
Q Consensus 569 ~~~C~Cp~g~~l 580 (652)
.|+|.||.||..
T Consensus 1738 GY~C~C~~g~~G 1749 (2531)
T KOG4289|consen 1738 GYTCECPPGYTG 1749 (2531)
T ss_pred ceeEECCCcccC
Confidence 799999999864
No 156
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=79.67 E-value=27 Score=36.91 Aligned_cols=104 Identities=14% Similarity=0.225 Sum_probs=65.2
Q ss_pred CCCeeEEEEE-CCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCC----CCE
Q psy5768 37 LSKISSIAVW-PVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPK----ENV 111 (652)
Q Consensus 37 ~~~~~~v~~d-~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~----~~~ 111 (652)
+.......+- +.++.++|.-...+-..|+.++.+|..... ++.|+..--+ -+++|..++.||++-.. ..+
T Consensus 234 v~~~~~~~~~~~~~~~~l~~s~~~G~~hly~~~~~~~~~~~-lT~G~~~V~~----i~~~d~~~~~iyf~a~~~~p~~r~ 308 (353)
T PF00930_consen 234 VDVYDPPHFLGPDGNEFLWISERDGYRHLYLYDLDGGKPRQ-LTSGDWEVTS----ILGWDEDNNRIYFTANGDNPGERH 308 (353)
T ss_dssp SSSSSEEEE-TTTSSEEEEEEETTSSEEEEEEETTSSEEEE-SS-SSS-EEE----EEEEECTSSEEEEEESSGGTTSBE
T ss_pred eeeecccccccCCCCEEEEEEEcCCCcEEEEEcccccceec-cccCceeecc----cceEcCCCCEEEEEecCCCCCceE
Confidence 3333444554 566666665534567789999999987554 4555233224 57889999999998764 558
Q ss_pred EEEEEcC-CCcEEEEEeCCCCCceeEEEcCCCCeEE
Q psy5768 112 IEVARLT-GQYRYVLISGGVDQPSALAVDPESGYLF 146 (652)
Q Consensus 112 I~v~~~d-g~~~~~l~~~~~~~P~~iavd~~~g~ly 146 (652)
+.+.+++ |...+.|-...... ..+.+.|...++.
T Consensus 309 lY~v~~~~~~~~~~LT~~~~~~-~~~~~Spdg~y~v 343 (353)
T PF00930_consen 309 LYRVSLDSGGEPKCLTCEDGDH-YSASFSPDGKYYV 343 (353)
T ss_dssp EEEEETTETTEEEESSTTSSTT-EEEEE-TTSSEEE
T ss_pred EEEEEeCCCCCeEeccCCCCCc-eEEEECCCCCEEE
Confidence 9999999 76665554432222 4888888544443
No 157
>KOG4289|consensus
Probab=79.62 E-value=1.9 Score=51.72 Aligned_cols=70 Identities=26% Similarity=0.603 Sum_probs=40.5
Q ss_pred CCCCCCCCCCCCCccccccCCCCceeeeccCceeeccCCcccCc--ccccCCCceeecc-CeecCCc---cCCCCCCCCC
Q psy5768 546 CAKTPCRHLNGNCDDICKLDETGQVVCSCFTGKVLMEDNRSCTI--NTVCSEHDFKCSD-GMCIPFN---QTCDRVYNCH 619 (652)
Q Consensus 546 ~~~~~C~~~ng~Cs~lCl~~~~~~~~C~Cp~g~~l~~d~~C~~~--~~~C~~~~f~C~~-g~Ci~~~---~~Cd~~~dC~ 619 (652)
|..+||.. ||.| ....+ .|+|.|.+||.. .+|+-. ..+|.|+ -|.| |.|.... +.| +|+
T Consensus 1242 CYs~pC~n-ng~C----~srEg-gYtCeCrpg~tG---ehCEvs~~agrCvpG--vC~nggtC~~~~nggf~c----~Cp 1306 (2531)
T KOG4289|consen 1242 CYSGPCGN-NGRC----RSREG-GYTCECRPGFTG---EHCEVSARAGRCVPG--VCKNGGTCVNLLNGGFCC----HCP 1306 (2531)
T ss_pred hhcCCCCC-CCce----EEecC-ceeEEecCCccc---cceeeecccCccccc--eecCCCEEeecCCCceec----cCC
Confidence 34466653 4444 33444 699999999865 456531 2344433 3555 3677543 333 788
Q ss_pred CCCCCCCCCCCC
Q psy5768 620 DKSDEGILYCAM 631 (652)
Q Consensus 620 d~sde~~~~C~~ 631 (652)
-|.+|++ +|+-
T Consensus 1307 ~ge~e~p-rC~v 1317 (2531)
T KOG4289|consen 1307 YGEFEDP-RCEV 1317 (2531)
T ss_pred CcccCCC-ceEE
Confidence 8877754 7853
No 158
>PRK13616 lipoprotein LpqB; Provisional
Probab=79.40 E-value=1.2e+02 Score=34.60 Aligned_cols=186 Identities=12% Similarity=0.070 Sum_probs=95.8
Q ss_pred eEEEEEEEcCCCeEEEeec-------ccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeC-----------C
Q psy5768 315 NIIELSYDYKRKTLFYSDI-------QKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCN-----------N 376 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~-------~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~-----------~ 376 (652)
.+...++.+..+++.++.. ...+|+.+...|.. +.+..+. ....-.++..++.|+++.. .
T Consensus 351 ~vsspaiSpdG~~vA~v~~~~~~~~d~~s~Lwv~~~gg~~-~~lt~g~-~~t~PsWspDG~~lw~v~dg~~~~~v~~~~~ 428 (591)
T PRK13616 351 NITSAALSRSGRQVAAVVTLGRGAPDPASSLWVGPLGGVA-VQVLEGH-SLTRPSWSLDADAVWVVVDGNTVVRVIRDPA 428 (591)
T ss_pred CcccceECCCCCEEEEEEeecCCCCCcceEEEEEeCCCcc-eeeecCC-CCCCceECCCCCceEEEecCcceEEEeccCC
Confidence 3445556666666666541 23467777665543 3333222 1233355655565555422 2
Q ss_pred CCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEE---eecCCCceEE-----EEcCCCC
Q psy5768 377 DATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRA---FFSGFGTESI-----ITTDITM 448 (652)
Q Consensus 377 ~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~---~ldG~~~~~l-----~~~~l~~ 448 (652)
.+.|.++.+++.. .+. .....+..+.+.|...+|.+.-.+ +|+.+ ..++.. ..+ +...+..
T Consensus 429 ~gql~~~~vd~ge--~~~----~~~g~Issl~wSpDG~RiA~i~~g----~v~Va~Vvr~~~G~-~~l~~~~~l~~~l~~ 497 (591)
T PRK13616 429 TGQLARTPVDASA--VAS----RVPGPISELQLSRDGVRAAMIIGG----KVYLAVVEQTEDGQ-YALTNPREVGPGLGD 497 (591)
T ss_pred CceEEEEeccCch--hhh----ccCCCcCeEEECCCCCEEEEEECC----EEEEEEEEeCCCCc-eeecccEEeecccCC
Confidence 2345444444321 010 112358889999988887776532 55552 223332 233 1122222
Q ss_pred -CceEEEecCCCEEEEEeCC-CCeEEEEecCCCceEEEecCCCCc-eeEEEEeCCEEEEEcCCCCeEEEEE
Q psy5768 449 -PNALALDHQAEKLFWGDAR-LDKIERCDYDGTNRIVLSKISPLH-PFDMAVYGEFIFWTDWVIHAVLRAN 516 (652)
Q Consensus 449 -P~glaiD~~~~~LYw~D~~-~~~I~~~~ldG~~~~~l~~~~~~~-p~glav~~~~lYwtd~~~~~I~~~~ 516 (652)
+..++--. ++.|+..-.. ...++.+.+||...+.+....+.. ..+|+-..+.||.+|.. .+.+..
T Consensus 498 ~~~~l~W~~-~~~L~V~~~~~~~~v~~v~vDG~~~~~~~~~n~~~~v~~vaa~~~~iyv~~~~--g~~~l~ 565 (591)
T PRK13616 498 TAVSLDWRT-GDSLVVGRSDPEHPVWYVNLDGSNSDALPSRNLSAPVVAVAASPSTVYVTDAR--AVLQLP 565 (591)
T ss_pred ccccceEec-CCEEEEEecCCCCceEEEecCCccccccCCCCccCceEEEecCCceEEEEcCC--ceEEec
Confidence 23333322 3446655333 245899999998877543332233 35666667789999744 355554
No 159
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=79.16 E-value=92 Score=33.05 Aligned_cols=106 Identities=13% Similarity=0.136 Sum_probs=62.4
Q ss_pred CCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceE-EEecCCCCc
Q psy5768 413 DSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRI-VLSKISPLH 491 (652)
Q Consensus 413 ~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~-~l~~~~~~~ 491 (652)
.+++|.+.+. +.+...++. +-+.+.......+..++++ +++||..+ ..+.+..++.+..... .........
T Consensus 241 ~~~vy~~~~~---g~l~a~d~~--tG~~~W~~~~~~~~~p~~~--~~~vyv~~-~~G~l~~~d~~tG~~~W~~~~~~~~~ 312 (377)
T TIGR03300 241 GGQVYAVSYQ---GRVAALDLR--SGRVLWKRDASSYQGPAVD--DNRLYVTD-ADGVVVALDRRSGSELWKNDELKYRQ 312 (377)
T ss_pred CCEEEEEEcC---CEEEEEECC--CCcEEEeeccCCccCceEe--CCEEEEEC-CCCeEEEEECCCCcEEEccccccCCc
Confidence 5789988764 256665553 2234444333344555665 78999886 4567888888533221 110101111
Q ss_pred eeEEEEeCCEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 492 PFDMAVYGEFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 492 p~glav~~~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
..+.++.+++||..+ .++.|+.++..+|+....+.
T Consensus 313 ~ssp~i~g~~l~~~~-~~G~l~~~d~~tG~~~~~~~ 347 (377)
T TIGR03300 313 LTAPAVVGGYLVVGD-FEGYLHWLSREDGSFVARLK 347 (377)
T ss_pred cccCEEECCEEEEEe-CCCEEEEEECCCCCEEEEEE
Confidence 123345788999887 45788888988887655443
No 160
>KOG2397|consensus
Probab=78.80 E-value=1.4 Score=47.08 Aligned_cols=45 Identities=42% Similarity=0.678 Sum_probs=36.2
Q ss_pred CceeeccC-eecCCccCCCCCCCCCCCCCCCCCCCCCCCCCCCeeecCCC
Q psy5768 596 HDFKCSDG-MCIPFNQTCDRVYNCHDKSDEGILYCAMRDCRPGYFKCDNN 644 (652)
Q Consensus 596 ~~f~C~~g-~Ci~~~~~Cd~~~dC~d~sde~~~~C~~~~C~~~~f~C~~~ 644 (652)
..|.|.+| .=|+....=|+..||.|||||.. ...|+.+.|.|.|.
T Consensus 43 ~~~~CLdgs~~i~f~qlNDd~CDC~DGsDEPG----tsACpngkF~C~N~ 88 (480)
T KOG2397|consen 43 SMFKCLDGSKTISFSQLNDDSCDCLDGSDEPG----TSACPNGKFYCVNQ 88 (480)
T ss_pred cceeeccCCcccCHHHhccccccCCCCCCCCc----cccCCCCceeeeec
Confidence 37889876 56677777789999999999964 45789999999873
No 161
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=78.75 E-value=67 Score=31.22 Aligned_cols=105 Identities=14% Similarity=0.170 Sum_probs=60.5
Q ss_pred CCEEEEEecCCCCCceEEEee-cCCCceEEEEcCCCCCce--EEEecCCCEEEEEeCCCCeEEEEe-cCCCceEEEecCC
Q psy5768 413 DSRIYWTNWNSHLPSIQRAFF-SGFGTESIITTDITMPNA--LALDHQAEKLFWGDARLDKIERCD-YDGTNRIVLSKIS 488 (652)
Q Consensus 413 ~g~Lywtd~~~~~~~I~r~~l-dG~~~~~l~~~~l~~P~g--laiD~~~~~LYw~D~~~~~I~~~~-ldG~~~~~l~~~~ 488 (652)
.+++|.++.. ..|...+. +| +++....+..+.. .+++ +++||..... +.|..+| -+|.-........
T Consensus 36 ~~~v~~~~~~---~~l~~~d~~tG---~~~W~~~~~~~~~~~~~~~--~~~v~v~~~~-~~l~~~d~~tG~~~W~~~~~~ 106 (238)
T PF13360_consen 36 GGRVYVASGD---GNLYALDAKTG---KVLWRFDLPGPISGAPVVD--GGRVYVGTSD-GSLYALDAKTGKVLWSIYLTS 106 (238)
T ss_dssp TTEEEEEETT---SEEEEEETTTS---EEEEEEECSSCGGSGEEEE--TTEEEEEETT-SEEEEEETTTSCEEEEEEE-S
T ss_pred CCEEEEEcCC---CEEEEEECCCC---CEEEEeeccccccceeeec--ccccccccce-eeeEecccCCcceeeeecccc
Confidence 5667776432 25655554 44 3343333322221 2333 7888887643 3788888 4566555532211
Q ss_pred -----CCceeEEEEeCCEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 489 -----PLHPFDMAVYGEFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 489 -----~~~p~glav~~~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
...+...++.++.+|.... .+.|+.+|..+|+..-...
T Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~-~g~l~~~d~~tG~~~w~~~ 149 (238)
T PF13360_consen 107 SPPAGVRSSSSPAVDGDRLYVGTS-SGKLVALDPKTGKLLWKYP 149 (238)
T ss_dssp SCTCSTB--SEEEEETTEEEEEET-CSEEEEEETTTTEEEEEEE
T ss_pred ccccccccccCceEecCEEEEEec-cCcEEEEecCCCcEEEEee
Confidence 2334566777888888875 6789999988887654444
No 162
>KOG0293|consensus
Probab=78.65 E-value=96 Score=32.99 Aligned_cols=145 Identities=14% Similarity=0.017 Sum_probs=85.8
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcC-CccCCCCcEEEEccCCcEEEEeCCCCEEEEEE
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYP-AVTACNLHIAVDWIAQNIYWSDPKENVIEVAR 116 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~-~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~ 116 (652)
..+..|.+.|.+.+|.-- .....+.+.+.+-+.....++.+ ++ .+. ..|--+.+.+ +++-+....|...+
T Consensus 270 ~~V~yi~wSPDdryLlaC---g~~e~~~lwDv~tgd~~~~y~~~-~~~S~~----sc~W~pDg~~-~V~Gs~dr~i~~wd 340 (519)
T KOG0293|consen 270 QPVSYIMWSPDDRYLLAC---GFDEVLSLWDVDTGDLRHLYPSG-LGFSVS----SCAWCPDGFR-FVTGSPDRTIIMWD 340 (519)
T ss_pred CceEEEEECCCCCeEEec---CchHheeeccCCcchhhhhcccC-cCCCcc----eeEEccCCce-eEecCCCCcEEEec
Confidence 466778999988887743 24456777777765544444443 21 122 4444333333 56666678899999
Q ss_pred cCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeecccCceeEEEeccCCEEEEEeCC
Q psy5768 117 LTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEIIMPIKDITLDLKFFSAFYRNLS 195 (652)
Q Consensus 117 ~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~ 195 (652)
+||....---........++|+-+...+|+-.. ..++|...+........++++. .....++|. .++++..+++.
T Consensus 341 lDgn~~~~W~gvr~~~v~dlait~Dgk~vl~v~--~d~~i~l~~~e~~~dr~lise~-~~its~~iS-~d~k~~LvnL~ 415 (519)
T KOG0293|consen 341 LDGNILGNWEGVRDPKVHDLAITYDGKYVLLVT--VDKKIRLYNREARVDRGLISEE-QPITSFSIS-KDGKLALVNLQ 415 (519)
T ss_pred CCcchhhcccccccceeEEEEEcCCCcEEEEEe--cccceeeechhhhhhhcccccc-CceeEEEEc-CCCcEEEEEcc
Confidence 999752211111234577999998777888776 3456666655443333344443 345677775 45566566554
No 163
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=77.80 E-value=1.1e+02 Score=33.06 Aligned_cols=196 Identities=11% Similarity=0.116 Sum_probs=106.4
Q ss_pred EEEEEcCCCeEEEe-eccc---ccEEEEeccCCcceEEeeccCceeeeEEEccCC--EEEEEeCCC----------CeEE
Q psy5768 318 ELSYDYKRKTLFYS-DIQK---GTINSVFFNGSNHRVLLERQGSVEGLAYEYVHN--YLYWTCNND----------ATIN 381 (652)
Q Consensus 318 ~v~~D~~~~~lyws-d~~~---~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~--~LYwtd~~~----------~~I~ 381 (652)
++.+.+..++|-++ |... ..|+..++... +.+...+..+.+=.+-|..+ .+|++.... ..|.
T Consensus 128 ~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg--~~l~d~i~~~~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~ 205 (414)
T PF02897_consen 128 GFSVSPDGKRLAYSLSDGGSEWYTLRVFDLETG--KFLPDGIENPKFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVY 205 (414)
T ss_dssp EEEETTTSSEEEEEEEETTSSEEEEEEEETTTT--EEEEEEEEEEESEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEE
T ss_pred eeeECCCCCEEEEEecCCCCceEEEEEEECCCC--cCcCCcccccccceEEEeCCCCEEEEEEeCcccccccCCCCcEEE
Confidence 45556666665555 3322 23555555443 22323333333322455544 788876432 3467
Q ss_pred EEEcCCCCCccEEEEEeCCCCCc---eEEEEeCCCCEEEEEecCCCC-CceEEEeecCC----CceEEEEcCCCCCceEE
Q psy5768 382 KIDLDSPKAQRIVVVRLGQHDKP---RGIDIDSCDSRIYWTNWNSHL-PSIQRAFFSGF----GTESIITTDITMPNALA 453 (652)
Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~P---~~Iavdp~~g~Lywtd~~~~~-~~I~r~~ldG~----~~~~l~~~~l~~P~gla 453 (652)
+..+.... ....++. .....+ .++.+.+...+|+++-..... ..++...++.. ..-.++..... -..-.
T Consensus 206 ~~~~gt~~-~~d~lvf-e~~~~~~~~~~~~~s~d~~~l~i~~~~~~~~s~v~~~d~~~~~~~~~~~~~l~~~~~-~~~~~ 282 (414)
T PF02897_consen 206 RHKLGTPQ-SEDELVF-EEPDEPFWFVSVSRSKDGRYLFISSSSGTSESEVYLLDLDDGGSPDAKPKLLSPRED-GVEYY 282 (414)
T ss_dssp EEETTS-G-GG-EEEE-C-TTCTTSEEEEEE-TTSSEEEEEEESSSSEEEEEEEECCCTTTSS-SEEEEEESSS-S-EEE
T ss_pred EEECCCCh-HhCeeEE-eecCCCcEEEEEEecCcccEEEEEEEccccCCeEEEEeccccCCCcCCcEEEeCCCC-ceEEE
Confidence 77764322 1222332 333333 478889999999987665443 47888888764 22233322221 12334
Q ss_pred EecCCCEEEEE-e--CCCCeEEEEecCCCc---eE-EEecCC-CCceeEEEEeCCEEEEEcCCCC--eEEEEEcc
Q psy5768 454 LDHQAEKLFWG-D--ARLDKIERCDYDGTN---RI-VLSKIS-PLHPFDMAVYGEFIFWTDWVIH--AVLRANKY 518 (652)
Q Consensus 454 iD~~~~~LYw~-D--~~~~~I~~~~ldG~~---~~-~l~~~~-~~~p~glav~~~~lYwtd~~~~--~I~~~~k~ 518 (652)
++..++++|+. + +...+|.+++++... .. +++... .....++.+++++|++...... .|...+..
T Consensus 283 v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~~~~l~~~~~~~~~Lvl~~~~~~~~~l~v~~~~ 357 (414)
T PF02897_consen 283 VDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDEDVSLEDVSLFKDYLVLSYRENGSSRLRVYDLD 357 (414)
T ss_dssp EEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SSSEEEEEEEEETTEEEEEEEETTEEEEEEEETT
T ss_pred EEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCCceeEEEEEEECCEEEEEEEECCccEEEEEECC
Confidence 56668888875 2 455789999998765 23 555433 2356888899999998876554 45556654
No 164
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=76.63 E-value=66 Score=31.12 Aligned_cols=69 Identities=16% Similarity=0.233 Sum_probs=42.2
Q ss_pred EEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEe------------CCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeC
Q psy5768 94 AVDWIAQNIYWSDPKENVIEVARLTGQYRYVLIS------------GGVDQPSALAVDPESGYLFWSESGKIPLIARAGL 161 (652)
Q Consensus 94 avDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~------------~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~l 161 (652)
-+.|+.+.||---=.+.+|.+.+++.......+. ...+.+.+||-||..+.+|.|--- -|.+.-..+
T Consensus 180 ELE~VdG~lyANVw~t~~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~-wp~lfEVk~ 258 (262)
T COG3823 180 ELEWVDGELYANVWQTTRIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKL-WPLLFEVKL 258 (262)
T ss_pred ceeeeccEEEEeeeeecceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCc-CceeEEEEe
Confidence 3457777776433345678887776443333332 134578999999999999998421 245555444
Q ss_pred CC
Q psy5768 162 DG 163 (652)
Q Consensus 162 dg 163 (652)
++
T Consensus 259 ~~ 260 (262)
T COG3823 259 DE 260 (262)
T ss_pred cC
Confidence 43
No 165
>KOG1446|consensus
Probab=76.29 E-value=95 Score=31.71 Aligned_cols=187 Identities=16% Similarity=0.098 Sum_probs=100.3
Q ss_pred cccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeC
Q psy5768 333 IQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDS 411 (652)
Q Consensus 333 ~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp 411 (652)
.....|+-.++.....--.+ ..-..+.+|++.+.. ..|.+-+..++|.-=++...+ ...++. +..+-..|.||
T Consensus 77 k~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~-d~FlS~S~D~tvrLWDlR~~~--cqg~l~---~~~~pi~AfDp 150 (311)
T KOG1446|consen 77 KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKD-DTFLSSSLDKTVRLWDLRVKK--CQGLLN---LSGRPIAAFDP 150 (311)
T ss_pred CCCCceEEEEeecCceEEEcCCCCceEEEEEecCCC-CeEEecccCCeEEeeEecCCC--CceEEe---cCCCcceeECC
Confidence 34455555555432221222 223367788888877 667777777777766665332 344443 45566789999
Q ss_pred CCCEEEEEecCCCCCceEEEeecCCCceEEEE-c--CCCCCceEEEecCCCEEEEEeCCCCeEEEEe-cCCCceEEEecC
Q psy5768 412 CDSRIYWTNWNSHLPSIQRAFFSGFGTESIIT-T--DITMPNALALDHQAEKLFWGDARLDKIERCD-YDGTNRIVLSKI 487 (652)
Q Consensus 412 ~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~-~--~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~-ldG~~~~~l~~~ 487 (652)
.|.+|-+-.++...+++-.++=+..+-..+. . ....-+.|.+.+.++.|...... +.|..+| ++|.-...+-..
T Consensus 151 -~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT~~-s~~~~lDAf~G~~~~tfs~~ 228 (311)
T KOG1446|consen 151 -EGLIFALANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLSTNA-SFIYLLDAFDGTVKSTFSGY 228 (311)
T ss_pred -CCcEEEEecCCCeEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEEeCC-CcEEEEEccCCcEeeeEeec
Confidence 7888888777654466655554444332221 1 12233566666666666555433 3344433 456533333211
Q ss_pred --CCCceeEEEEe-CCEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 488 --SPLHPFDMAVY-GEFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 488 --~~~~p~glav~-~~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
...-|.+-++. ++....+-...++|..-+..+|..+..+.
T Consensus 229 ~~~~~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~~v~~~~ 271 (311)
T KOG1446|consen 229 PNAGNLPLSATFTPDSKFVLSGSDDGTIHVWNLETGKKVAVLR 271 (311)
T ss_pred cCCCCcceeEEECCCCcEEEEecCCCcEEEEEcCCCcEeeEec
Confidence 12344333332 33444444566777666667777666555
No 166
>PTZ00420 coronin; Provisional
Probab=76.06 E-value=1.5e+02 Score=33.75 Aligned_cols=103 Identities=14% Similarity=0.055 Sum_probs=62.4
Q ss_pred EEEecCCCCeEEEEecCCC-e-eE-------EEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCc
Q psy5768 3 IAVSSPTQSKIVVCNLEGE-Y-QT-------TILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTK 73 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~-~-~~-------~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~ 73 (652)
++++....+.|.++++... . .. .+.. .-..+..|+|+|....++.+- ...+.|..+++....
T Consensus 89 lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~g-------H~~~V~sVaf~P~g~~iLaSg--S~DgtIrIWDl~tg~ 159 (568)
T PTZ00420 89 ILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKG-------HKKKISIIDWNPMNYYIMCSS--GFDSFVNIWDIENEK 159 (568)
T ss_pred EEEEEeCCCeEEEEECCCCCccccccccceEEeec-------CCCcEEEEEECCCCCeEEEEE--eCCCeEEEEECCCCc
Confidence 3444456677888887521 1 11 1211 235788999999887777664 456788888886443
Q ss_pred cEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCc
Q psy5768 74 RETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQY 121 (652)
Q Consensus 74 ~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~ 121 (652)
....+. . -..+. ++++++. +.+..+-...+.|.+.++....
T Consensus 160 ~~~~i~-~-~~~V~----Slswspd-G~lLat~s~D~~IrIwD~Rsg~ 200 (568)
T PTZ00420 160 RAFQIN-M-PKKLS----SLKWNIK-GNLLSGTCVGKHMHIIDPRKQE 200 (568)
T ss_pred EEEEEe-c-CCcEE----EEEECCC-CCEEEEEecCCEEEEEECCCCc
Confidence 222122 2 23456 7888764 4555555556788888887543
No 167
>KOG4227|consensus
Probab=75.52 E-value=61 Score=34.01 Aligned_cols=128 Identities=12% Similarity=0.131 Sum_probs=75.3
Q ss_pred CceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe---eccCceeeeEEEccCCEEEEEeCCCCe
Q psy5768 303 PFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL---ERQGSVEGLAYEYVHNYLYWTCNNDAT 379 (652)
Q Consensus 303 p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~---~~~~~~~glAvDw~~~~LYwtd~~~~~ 379 (652)
|+..... .+-.++..++||..+.+||=... .+.+..-++..+..--++ ..-+.+.+|.+.+. .|++.+.+..+.
T Consensus 96 PI~~~~~-~H~SNIF~L~F~~~N~~~~SG~~-~~~VI~HDiEt~qsi~V~~~~~~~~~VY~m~~~P~-DN~~~~~t~~~~ 172 (609)
T KOG4227|consen 96 PIGVMEH-PHRSNIFSLEFDLENRFLYSGER-WGTVIKHDIETKQSIYVANENNNRGDVYHMDQHPT-DNTLIVVTRAKL 172 (609)
T ss_pred CceeccC-ccccceEEEEEccCCeeEecCCC-cceeEeeecccceeeeeecccCcccceeecccCCC-CceEEEEecCce
Confidence 5544433 34478999999998888875443 344544455443322222 22347889999887 788887777777
Q ss_pred EEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEee
Q psy5768 380 INKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFF 433 (652)
Q Consensus 380 I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~l 433 (652)
+..++..........++.......-....++|..-.|..+......+.++-..|
T Consensus 173 V~~~D~Rd~~~~~~~~~~AN~~~~F~t~~F~P~~P~Li~~~~~~~G~~~~D~R~ 226 (609)
T KOG4227|consen 173 VSFIDNRDRQNPISLVLPANSGKNFYTAEFHPETPALILVNSETGGPNVFDRRM 226 (609)
T ss_pred EEEEeccCCCCCCceeeecCCCccceeeeecCCCceeEEeccccCCCCceeecc
Confidence 777776432211233333334455566677777777766654443444444433
No 168
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=74.71 E-value=85 Score=30.40 Aligned_cols=65 Identities=17% Similarity=0.206 Sum_probs=42.2
Q ss_pred CCCEEEEEecCCCCCceEEEeecCCCceEEEE------------cCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCC
Q psy5768 412 CDSRIYWTNWNSHLPSIQRAFFSGFGTESIIT------------TDITMPNALALDHQAEKLFWGDARLDKIERCDYDG 478 (652)
Q Consensus 412 ~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~------------~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG 478 (652)
..|.||---|-.. +|.|...+......-+. .+..-+||||.|++.+|+|.+-..-..+.-+.+++
T Consensus 184 VdG~lyANVw~t~--~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~wp~lfEVk~~~ 260 (262)
T COG3823 184 VDGELYANVWQTT--RIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKLWPLLFEVKLDE 260 (262)
T ss_pred eccEEEEeeeeec--ceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCcCceeEEEEecC
Confidence 3577776666654 77777666444333221 12345899999999999999866556666655554
No 169
>KOG0641|consensus
Probab=74.62 E-value=59 Score=31.52 Aligned_cols=106 Identities=16% Similarity=0.186 Sum_probs=63.2
Q ss_pred EEecCCCCeEEEEecCCCeeEEEecCCCCC-CCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCc-cEEEEeCC
Q psy5768 4 AVSSPTQSKIVVCNLEGEYQTTILSNESND-TSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTK-RETVVSQK 81 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~-~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~-~~~v~~~~ 81 (652)
|++-+....|+++|+.-......+.+...+ ...-+.+.+|++||. ++|..+- .....-+.++..|.- ++.+ ...
T Consensus 197 ~~sgsqdktirfwdlrv~~~v~~l~~~~~~~glessavaav~vdps-grll~sg--~~dssc~lydirg~r~iq~f-~ph 272 (350)
T KOG0641|consen 197 FASGSQDKTIRFWDLRVNSCVNTLDNDFHDGGLESSAVAAVAVDPS-GRLLASG--HADSSCMLYDIRGGRMIQRF-HPH 272 (350)
T ss_pred EEccCCCceEEEEeeeccceeeeccCcccCCCcccceeEEEEECCC-cceeeec--cCCCceEEEEeeCCceeeee-CCC
Confidence 455555677888899866554443333333 334578899999985 7888777 666777777776653 3333 233
Q ss_pred CcCCccCCCCcEEEEccCCcEE-EEeCCCCEEEEEEcCCC
Q psy5768 82 KYPAVTACNLHIAVDWIAQNIY-WSDPKENVIEVARLTGQ 120 (652)
Q Consensus 82 ~~~~p~~~~~~lavDw~~~~lY-~~d~~~~~I~v~~~dg~ 120 (652)
...++ .+.+.+ +--| .+-+....|.+.++.|.
T Consensus 273 -sadir----~vrfsp--~a~yllt~syd~~ikltdlqgd 305 (350)
T KOG0641|consen 273 -SADIR----CVRFSP--GAHYLLTCSYDMKIKLTDLQGD 305 (350)
T ss_pred -cccee----EEEeCC--CceEEEEecccceEEEeecccc
Confidence 34455 555554 2233 23344566777777664
No 170
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=73.26 E-value=1.7e+02 Score=33.04 Aligned_cols=95 Identities=14% Similarity=0.188 Sum_probs=50.3
Q ss_pred EEEEEcCCCeEEEeecccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCC-CeEEEEEcCCCCCccEEEE
Q psy5768 318 ELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNND-ATINKIDLDSPKAQRIVVV 396 (652)
Q Consensus 318 ~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~-~~I~~~~~~~~~~~~~~~~ 396 (652)
.++||++.+.|||.-.+..- .++..+. -.++..-.=+|+|-.++++-|.-... +-++ +.+... ...++
T Consensus 238 ~~s~D~~~~lvy~~tGnp~p-----~~~~~r~--gdnl~~~s~vAld~~TG~~~W~~Q~~~~D~w--D~d~~~--~p~l~ 306 (527)
T TIGR03075 238 TGSYDPETNLIYFGTGNPSP-----WNSHLRP--GDNLYTSSIVARDPDTGKIKWHYQTTPHDEW--DYDGVN--EMILF 306 (527)
T ss_pred ceeEcCCCCeEEEeCCCCCC-----CCCCCCC--CCCccceeEEEEccccCCEEEeeeCCCCCCc--cccCCC--CcEEE
Confidence 56899999999997643211 1121110 01222334478888899998865432 2233 222221 22333
Q ss_pred EeCCCCCce-EEEEeCCCCEEEEEecCC
Q psy5768 397 RLGQHDKPR-GIDIDSCDSRIYWTNWNS 423 (652)
Q Consensus 397 ~~~~~~~P~-~Iavdp~~g~Lywtd~~~ 423 (652)
.+....+++ .++.-.++|++|.-|...
T Consensus 307 d~~~~G~~~~~v~~~~K~G~~~vlDr~t 334 (527)
T TIGR03075 307 DLKKDGKPRKLLAHADRNGFFYVLDRTN 334 (527)
T ss_pred EeccCCcEEEEEEEeCCCceEEEEECCC
Confidence 332223333 344556788888888653
No 171
>KOG4328|consensus
Probab=72.75 E-value=1.2e+02 Score=32.57 Aligned_cols=156 Identities=16% Similarity=0.125 Sum_probs=89.9
Q ss_pred EEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEee---ccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCcc
Q psy5768 316 IIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLE---RQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQR 392 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~---~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~ 392 (652)
+..|.|.+.+-.-+++....|.|.-.++++...++++. .-....++.+.-..+.+|+.+.- +...++++.......
T Consensus 237 Vs~l~F~P~n~s~i~ssSyDGtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~-G~f~~iD~R~~~s~~ 315 (498)
T KOG4328|consen 237 VSGLKFSPANTSQIYSSSYDGTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNV-GNFNVIDLRTDGSEY 315 (498)
T ss_pred ccceEecCCChhheeeeccCceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeecc-cceEEEEeecCCccc
Confidence 45677777665444454556888888888766665552 22245566666677788877654 355666654322111
Q ss_pred EEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEc-CC-CCCceEEEecCCCEEEEEeCCCCe
Q psy5768 393 IVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITT-DI-TMPNALALDHQAEKLFWGDARLDK 470 (652)
Q Consensus 393 ~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~-~l-~~P~glaiD~~~~~LYw~D~~~~~ 470 (652)
..+ .+. -.+..+|+++|...+++.|-.-.+..+||-+.-=+..+..++.+ .- ...++.-+.+.+++|.=+ ...+.
T Consensus 316 ~~~-~lh-~kKI~sv~~NP~~p~~laT~s~D~T~kIWD~R~l~~K~sp~lst~~HrrsV~sAyFSPs~gtl~TT-~~D~~ 392 (498)
T KOG4328|consen 316 ENL-RLH-KKKITSVALNPVCPWFLATASLDQTAKIWDLRQLRGKASPFLSTLPHRRSVNSAYFSPSGGTLLTT-CQDNE 392 (498)
T ss_pred hhh-hhh-hcccceeecCCCCchheeecccCcceeeeehhhhcCCCCcceecccccceeeeeEEcCCCCceEee-ccCCc
Confidence 111 111 13889999999999999887665555788765433333223322 21 223556666667774333 33344
Q ss_pred EEEEe
Q psy5768 471 IERCD 475 (652)
Q Consensus 471 I~~~~ 475 (652)
|...+
T Consensus 393 IRv~d 397 (498)
T KOG4328|consen 393 IRVFD 397 (498)
T ss_pred eEEee
Confidence 44443
No 172
>KOG1274|consensus
Probab=71.91 E-value=94 Score=36.36 Aligned_cols=143 Identities=15% Similarity=0.071 Sum_probs=87.6
Q ss_pred CeeEEEEECCCCEEEEEeccCCcceEEEEEcCCC--ccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEE
Q psy5768 39 KISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGT--KRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVAR 116 (652)
Q Consensus 39 ~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs--~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~ 116 (652)
....|.||+.+++|+..+ .++.|..+.-... ..++|-..+ ..+. ++|.+- +.+.+-+..+.|.+..
T Consensus 15 G~t~i~~d~~gefi~tcg---sdg~ir~~~~~sd~e~P~ti~~~g--~~v~----~ia~~s---~~f~~~s~~~tv~~y~ 82 (933)
T KOG1274|consen 15 GLTLICYDPDGEFICTCG---SDGDIRKWKTNSDEEEPETIDISG--ELVS----SIACYS---NHFLTGSEQNTVLRYK 82 (933)
T ss_pred ceEEEEEcCCCCEEEEec---CCCceEEeecCCcccCCchhhccC--ceeE----EEeecc---cceEEeeccceEEEee
Confidence 357899999999999877 4566665553322 233332122 2234 666652 3556666778888887
Q ss_pred cCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeC-CCCCcEEEEeecccCceeEEEeccCCEEEEEeCC
Q psy5768 117 LTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGL-DGKKQTILAQEIIMPIKDITLDLKFFSAFYRNLS 195 (652)
Q Consensus 117 ~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~l-dg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~ 195 (652)
.+...-..++....-.-+.++++- +|.+...-. ..-.|...++ |++...++.... ....+|.+|+.++.|-.++.+
T Consensus 83 fps~~~~~iL~Rftlp~r~~~v~g-~g~~iaags-dD~~vK~~~~~D~s~~~~lrgh~-apVl~l~~~p~~~fLAvss~d 159 (933)
T KOG1274|consen 83 FPSGEEDTILARFTLPIRDLAVSG-SGKMIAAGS-DDTAVKLLNLDDSSQEKVLRGHD-APVLQLSYDPKGNFLAVSSCD 159 (933)
T ss_pred CCCCCccceeeeeeccceEEEEec-CCcEEEeec-CceeEEEEeccccchheeecccC-CceeeeeEcCCCCEEEEEecC
Confidence 665545555554445667889986 555544322 2345666654 555555544332 455788999888888888877
Q ss_pred C
Q psy5768 196 K 196 (652)
Q Consensus 196 g 196 (652)
|
T Consensus 160 G 160 (933)
T KOG1274|consen 160 G 160 (933)
T ss_pred c
Confidence 4
No 173
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=71.33 E-value=1.5e+02 Score=31.74 Aligned_cols=104 Identities=13% Similarity=0.114 Sum_probs=61.1
Q ss_pred CCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecC-CCCc
Q psy5768 413 DSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKI-SPLH 491 (652)
Q Consensus 413 ~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~-~~~~ 491 (652)
.+.+|++... +.+...++. +-+++....+..+..++++ +++||..+. .+.+..++.......--... ....
T Consensus 256 ~~~vy~~~~~---g~l~ald~~--tG~~~W~~~~~~~~~~~~~--~~~vy~~~~-~g~l~ald~~tG~~~W~~~~~~~~~ 327 (394)
T PRK11138 256 GGVVYALAYN---GNLVALDLR--SGQIVWKREYGSVNDFAVD--GGRIYLVDQ-NDRVYALDTRGGVELWSQSDLLHRL 327 (394)
T ss_pred CCEEEEEEcC---CeEEEEECC--CCCEEEeecCCCccCcEEE--CCEEEEEcC-CCeEEEEECCCCcEEEcccccCCCc
Confidence 5788887653 255555443 2234454444444455555 789999874 46788888854322110110 0011
Q ss_pred eeEEEEeCCEEEEEcCCCCeEEEEEccCCceEEE
Q psy5768 492 PFDMAVYGEFIFWTDWVIHAVLRANKYTGEEVYT 525 (652)
Q Consensus 492 p~glav~~~~lYwtd~~~~~I~~~~k~~g~~~~~ 525 (652)
.-+.++.+++||..+. .+.|+.++..+|+..-.
T Consensus 328 ~~sp~v~~g~l~v~~~-~G~l~~ld~~tG~~~~~ 360 (394)
T PRK11138 328 LTAPVLYNGYLVVGDS-EGYLHWINREDGRFVAQ 360 (394)
T ss_pred ccCCEEECCEEEEEeC-CCEEEEEECCCCCEEEE
Confidence 2334567899999874 56788889888875443
No 174
>KOG0289|consensus
Probab=71.00 E-value=1.5e+02 Score=31.73 Aligned_cols=143 Identities=10% Similarity=0.096 Sum_probs=84.0
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEc
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARL 117 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~ 117 (652)
..+.++..+|.++++.|++ ...-.+..--.+|+...++.....--... ..++-+ .++|+-+-...+.+.+.++
T Consensus 304 ~~V~~ls~h~tgeYllsAs--~d~~w~Fsd~~~g~~lt~vs~~~s~v~~t----s~~fHp-DgLifgtgt~d~~vkiwdl 376 (506)
T KOG0289|consen 304 EPVTGLSLHPTGEYLLSAS--NDGTWAFSDISSGSQLTVVSDETSDVEYT----SAAFHP-DGLIFGTGTPDGVVKIWDL 376 (506)
T ss_pred ccceeeeeccCCcEEEEec--CCceEEEEEccCCcEEEEEeeccccceeE----EeeEcC-CceEEeccCCCceEEEEEc
Confidence 4678999999999999998 44334444445566544443321011133 556654 5778888887888888888
Q ss_pred CCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCC-CcEEEEeecccCceeEEEeccCCEE
Q psy5768 118 TGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGK-KQTILAQEIIMPIKDITLDLKFFSA 189 (652)
Q Consensus 118 dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~-~~~~~~~~~~~~p~gl~lD~~~~~l 189 (652)
.......-+.+--...+.|+.-. |||--.|... ...|...+|.-. +..++........+.+.+|..+..|
T Consensus 377 ks~~~~a~Fpght~~vk~i~FsE-NGY~Lat~ad-d~~V~lwDLRKl~n~kt~~l~~~~~v~s~~fD~SGt~L 447 (506)
T KOG0289|consen 377 KSQTNVAKFPGHTGPVKAISFSE-NGYWLATAAD-DGSVKLWDLRKLKNFKTIQLDEKKEVNSLSFDQSGTYL 447 (506)
T ss_pred CCccccccCCCCCCceeEEEecc-CceEEEEEec-CCeEEEEEehhhcccceeeccccccceeEEEcCCCCeE
Confidence 65432222222223566788764 8876666543 334665555422 2333333333456788888765554
No 175
>smart00284 OLF Olfactomedin-like domains.
Probab=68.82 E-value=1.3e+02 Score=30.15 Aligned_cols=139 Identities=14% Similarity=0.124 Sum_probs=79.0
Q ss_pred CCEEEEEeccCCcceEEEEEcCCCcc--EEEEeC-C-----CcCCccCCCCcEEEEccCC--cEEEEeCCCCEEEEEEcC
Q psy5768 49 KGKMFWSNVTKQVVTIEMAFMDGTKR--ETVVSQ-K-----KYPAVTACNLHIAVDWIAQ--NIYWSDPKENVIEVARLT 118 (652)
Q Consensus 49 ~~~lyw~d~~~~~~~I~~~~~dgs~~--~~v~~~-~-----~~~~p~~~~~~lavDw~~~--~lY~~d~~~~~I~v~~~d 118 (652)
++.||+.- .....|.|.++..... +.++.. + ....-.+..+++|+|. ++ -||-+....+.|.+..+|
T Consensus 83 ngslYY~~--~~s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE-~GLWvIYat~~~~g~ivvSkLn 159 (255)
T smart00284 83 NGSLYFNK--FNSHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDE-NGLWVIYATEQNAGKIVISKLN 159 (255)
T ss_pred CceEEEEe--cCCccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEcC-CceEEEEeccCCCCCEEEEeeC
Confidence 69999977 6678899999986543 233321 1 0000123446899996 44 466666677889888887
Q ss_pred CCcEEEE--EeCCCCCc---eeEEEcCCCCeEEEEecC--CCCeEEEE-eCCC-CCcEE--EEeecccCceeEEEeccCC
Q psy5768 119 GQYRYVL--ISGGVDQP---SALAVDPESGYLFWSESG--KIPLIARA-GLDG-KKQTI--LAQEIIMPIKDITLDLKFF 187 (652)
Q Consensus 119 g~~~~~l--~~~~~~~P---~~iavd~~~g~lywtd~~--~~~~I~~~-~ldg-~~~~~--~~~~~~~~p~gl~lD~~~~ 187 (652)
-....+. ..+...++ .++.|+ |.||.++.. ...+|.-+ +... +.... ...........|...+.++
T Consensus 160 p~tL~ve~tW~T~~~k~sa~naFmvC---GvLY~~~s~~~~~~~I~yayDt~t~~~~~~~i~f~n~y~~~s~l~YNP~d~ 236 (255)
T smart00284 160 PATLTIENTWITTYNKRSASNAFMIC---GILYVTRSLGSKGEKVFYAYDTNTGKEGHLDIPFENMYEYISMLDYNPNDR 236 (255)
T ss_pred cccceEEEEEEcCCCcccccccEEEe---eEEEEEccCCCCCcEEEEEEECCCCccceeeeeeccccccceeceeCCCCC
Confidence 5433322 22333333 255563 899999853 24456554 3332 22221 1122233445577777888
Q ss_pred EEEEEe
Q psy5768 188 SAFYRN 193 (652)
Q Consensus 188 ~ly~~d 193 (652)
+||.-|
T Consensus 237 ~LY~wd 242 (255)
T smart00284 237 KLYAWN 242 (255)
T ss_pred eEEEEe
Confidence 888643
No 176
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=67.53 E-value=1.4e+02 Score=29.85 Aligned_cols=174 Identities=16% Similarity=0.151 Sum_probs=94.3
Q ss_pred eccCceeeeEE--EccCCEEEE-EeCCCCeEEEEEcC-CCCCc--cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCC
Q psy5768 353 ERQGSVEGLAY--EYVHNYLYW-TCNNDATINKIDLD-SPKAQ--RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLP 426 (652)
Q Consensus 353 ~~~~~~~glAv--Dw~~~~LYw-td~~~~~I~~~~~~-~~~~~--~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~ 426 (652)
+++..+.||++ +.+++-.|. .....+.|....+- +.++. .+.+-.++-..+..+++.|-..|+||.++.. .
T Consensus 150 s~~s~~YGl~lyrs~ktgd~yvfV~~~qG~~~Qy~l~d~gnGkv~~k~vR~fk~~tQTEG~VaDdEtG~LYIaeEd---v 226 (364)
T COG4247 150 SSSSSAYGLALYRSPKTGDYYVFVNRRQGDIAQYKLIDQGNGKVGTKLVRQFKIPTQTEGMVADDETGFLYIAEED---V 226 (364)
T ss_pred cCcccceeeEEEecCCcCcEEEEEecCCCceeEEEEEecCCceEcceeeEeeecCCcccceeeccccceEEEeecc---c
Confidence 56678899887 445444443 33334666665552 22222 2333333344577899999999999999855 2
Q ss_pred ceEEEeec--CCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeCCEEEE
Q psy5768 427 SIQRAFFS--GFGTESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFW 504 (652)
Q Consensus 427 ~I~r~~ld--G~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYw 504 (652)
.||+...+ +.+...++. .+..-..|+-|.++-.||+..-+.+.+.-. --|.+.-.....+ ++.-|+
T Consensus 227 aiWK~~Aep~~G~~g~~id-r~~d~~~LtdDvEGltiYy~pnGkGYL~aS-SQGnNtya~y~Re----------G~N~YV 294 (364)
T COG4247 227 AIWKYEAEPNRGNTGRLID-RIKDLSYLTDDVEGLTIYYGPNGKGYLLAS-SQGNNTYAAYTRE----------GNNDYV 294 (364)
T ss_pred eeeecccCCCCCCccchhh-hhcCchhhcccccccEEEEcCCCcEEEEEe-cCCCceEEEEEee----------CCCceE
Confidence 67776554 223333332 222225678888888888877655543221 1233322222211 222232
Q ss_pred EcC---CCCeEEEEEccCCceEEEEecccCCcceeEEEec
Q psy5768 505 TDW---VIHAVLRANKYTGEEVYTLRKNIRRPMGIVAISD 541 (652)
Q Consensus 505 td~---~~~~I~~~~k~~g~~~~~~~~~~~~p~~i~~~~~ 541 (652)
... .++.|--+...+|..+.-+..+...|||+.+-..
T Consensus 295 gsF~vt~n~~iDg~setDG~DV~~~~LGa~~p~G~FVaQD 334 (364)
T COG4247 295 GSFGVTNNGAIDGVSETDGADVVNVPLGANFPFGLFVAQD 334 (364)
T ss_pred EEEeeccCCccccccccCCcceeccccCCCCcceeEEecc
Confidence 221 1233433444455555555555578999988753
No 177
>KOG3514|consensus
Probab=66.62 E-value=54 Score=38.95 Aligned_cols=79 Identities=15% Similarity=0.157 Sum_probs=50.5
Q ss_pred EEEEeCC-CcCCccCCCCcEEEEccCCcEEE-EeCCCCEEEEEE-----cCCCcEEEEEeCCCCCceeEEEcCCCCeEEE
Q psy5768 75 ETVVSQK-KYPAVTACNLHIAVDWIAQNIYW-SDPKENVIEVAR-----LTGQYRYVLISGGVDQPSALAVDPESGYLFW 147 (652)
Q Consensus 75 ~~v~~~~-~~~~p~~~~~~lavDw~~~~lY~-~d~~~~~I~v~~-----~dg~~~~~l~~~~~~~P~~iavd~~~g~lyw 147 (652)
..++..+ ....+. -+|+.-+.++||. .|-+.+.|.+-. -||....+.++. ..+--.+.||.. |
T Consensus 475 lil~~~g~~~~~~d----~~A~ELldghlyl~ldlGSG~iklras~rkv~DGeWhhv~l~R-~gR~gsvsVd~~-----~ 544 (1591)
T KOG3514|consen 475 LILFHGGPQANATD----YFAIELLDGHLYLLLDLGSGVIKLRASSRKVNDGEWHHVDLQR-DGRTGSVSVDAI-----K 544 (1591)
T ss_pred eEEEccCccccccc----EEEEEEeCCeEEEEEecCCceEEeeeecccccCCceEEEEeec-cCccceEEEeee-----e
Confidence 4444544 133455 8999999999996 577788776543 368888777764 355567777753 7
Q ss_pred EecCCCCeEEEEeCCC
Q psy5768 148 SESGKIPLIARAGLDG 163 (652)
Q Consensus 148 td~~~~~~I~~~~ldg 163 (652)
+|+...+.=+...||+
T Consensus 545 ~df~tpG~s~iL~ld~ 560 (1591)
T KOG3514|consen 545 TDFSTPGDSEILDLDD 560 (1591)
T ss_pred cCccCCCcceeEeecC
Confidence 7775444334444554
No 178
>KOG3914|consensus
Probab=65.10 E-value=1.9e+02 Score=30.60 Aligned_cols=165 Identities=10% Similarity=-0.009 Sum_probs=97.2
Q ss_pred eeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeec-CCC
Q psy5768 359 EGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFS-GFG 437 (652)
Q Consensus 359 ~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ld-G~~ 437 (652)
...+.-.-++.||.++..... .++...+...+.+.+.....-.+|.+|.........-++|.....-.+.....+ |..
T Consensus 66 ~~~~~s~~~~llAv~~~~K~~-~~f~~~~~~~~~kl~~~~~v~~~~~ai~~~~~~~sv~v~dkagD~~~~di~s~~~~~~ 144 (390)
T KOG3914|consen 66 ALVLTSDSGRLVAVATSSKQR-AVFDYRENPKGAKLLDVSCVPKRPTAISFIREDTSVLVADKAGDVYSFDILSADSGRC 144 (390)
T ss_pred cccccCCCceEEEEEeCCCce-EEEEEecCCCcceeeeEeecccCcceeeeeeccceEEEEeecCCceeeeeecccccCc
Confidence 333444445566666654432 233333322111222222334688888887777777777754332233333333 444
Q ss_pred ceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEe-cCCCCceeEEEEeCCEEEEEcCCCCeEEEEE
Q psy5768 438 TESIITTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLS-KISPLHPFDMAVYGEFIFWTDWVIHAVLRAN 516 (652)
Q Consensus 438 ~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~-~~~~~~p~glav~~~~lYwtd~~~~~I~~~~ 516 (652)
+..+-. +.....+++.++.+.|.-+|. ..+|....+.+.....-. -+.-.-.-.|++..+++.|+-.+.+.|+.=+
T Consensus 145 ~~~lGh--vSml~dVavS~D~~~IitaDR-DEkIRvs~ypa~f~IesfclGH~eFVS~isl~~~~~LlS~sGD~tlr~Wd 221 (390)
T KOG3914|consen 145 EPILGH--VSMLLDVAVSPDDQFIITADR-DEKIRVSRYPATFVIESFCLGHKEFVSTISLTDNYLLLSGSGDKTLRLWD 221 (390)
T ss_pred chhhhh--hhhhheeeecCCCCEEEEecC-CceEEEEecCcccchhhhccccHhheeeeeeccCceeeecCCCCcEEEEe
Confidence 433332 334567888888899988885 467888888876532211 1111224678999999999999999998888
Q ss_pred ccCCceEEEEe
Q psy5768 517 KYTGEEVYTLR 527 (652)
Q Consensus 517 k~~g~~~~~~~ 527 (652)
-.+|+....+.
T Consensus 222 ~~sgk~L~t~d 232 (390)
T KOG3914|consen 222 ITSGKLLDTCD 232 (390)
T ss_pred cccCCcccccc
Confidence 87888765443
No 179
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=64.38 E-value=1.6e+02 Score=29.50 Aligned_cols=147 Identities=16% Similarity=0.104 Sum_probs=78.4
Q ss_pred CCeEEEeecccccEEEEeccCCcce-E-EeeccC------------ceeeeEEEccCCE-EEEEeCCCCeEEEEEcCCCC
Q psy5768 325 RKTLFYSDIQKGTINSVFFNGSNHR-V-LLERQG------------SVEGLAYEYVHNY-LYWTCNNDATINKIDLDSPK 389 (652)
Q Consensus 325 ~~~lywsd~~~~~I~~~~~~g~~~~-~-i~~~~~------------~~~glAvDw~~~~-LYwtd~~~~~I~~~~~~~~~ 389 (652)
++.||+--..+..|.|.++...... . .+++.. .-.++|+|..+=- ||-+....+.|.+..++-..
T Consensus 78 ngslYY~~~~s~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~~t 157 (250)
T PF02191_consen 78 NGSLYYNKYNSRNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDPET 157 (250)
T ss_pred CCcEEEEecCCceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCccc
Confidence 4778888888888999888765433 2 222211 3367999965432 23334444567776665432
Q ss_pred CccEEEEEeCCCCCceE-EEEeCCCCEEEEEecCCCC-CceEEEeecC-CCceEEEE----cCCCCCceEEEecCCCEEE
Q psy5768 390 AQRIVVVRLGQHDKPRG-IDIDSCDSRIYWTNWNSHL-PSIQRAFFSG-FGTESIIT----TDITMPNALALDHQAEKLF 462 (652)
Q Consensus 390 ~~~~~~~~~~~~~~P~~-Iavdp~~g~Lywtd~~~~~-~~I~r~~ldG-~~~~~l~~----~~l~~P~glaiD~~~~~LY 462 (652)
........ +...++.. =|.- .-|.||.++..... .+|.-+ .|- ++...-+. ........|..++.+++||
T Consensus 158 L~v~~tw~-T~~~k~~~~naFm-vCGvLY~~~s~~~~~~~I~ya-fDt~t~~~~~~~i~f~~~~~~~~~l~YNP~dk~LY 234 (250)
T PF02191_consen 158 LSVEQTWN-TSYPKRSAGNAFM-VCGVLYATDSYDTRDTEIFYA-FDTYTGKEEDVSIPFPNPYGNISMLSYNPRDKKLY 234 (250)
T ss_pred CceEEEEE-eccCchhhcceee-EeeEEEEEEECCCCCcEEEEE-EECCCCceeceeeeeccccCceEeeeECCCCCeEE
Confidence 11111111 12222222 1222 14899999876542 345433 232 22222221 2233456788999999999
Q ss_pred EEeCCCCeEEEE
Q psy5768 463 WGDARLDKIERC 474 (652)
Q Consensus 463 w~D~~~~~I~~~ 474 (652)
.-|.+.-.++.+
T Consensus 235 ~wd~G~~v~Y~v 246 (250)
T PF02191_consen 235 AWDNGYQVTYDV 246 (250)
T ss_pred EEECCeEEEEEE
Confidence 888665444443
No 180
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=63.91 E-value=26 Score=37.71 Aligned_cols=62 Identities=16% Similarity=0.196 Sum_probs=32.0
Q ss_pred CceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEc------------------C-CCCCceEEEecCCCEEEE
Q psy5768 403 KPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITT------------------D-ITMPNALALDHQAEKLFW 463 (652)
Q Consensus 403 ~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~------------------~-l~~P~glaiD~~~~~LYw 463 (652)
-+..|.|....++||++.|... .|...++.-...-.++.. . -..|+-|.+..+++||||
T Consensus 313 LitDI~iSlDDrfLYvs~W~~G--dvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlYv 390 (461)
T PF05694_consen 313 LITDILISLDDRFLYVSNWLHG--DVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLYV 390 (461)
T ss_dssp ----EEE-TTS-EEEEEETTTT--EEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEEE
T ss_pred ceEeEEEccCCCEEEEEcccCC--cEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEEE
Confidence 4578888899999999999853 566555543322222211 1 135888899999999999
Q ss_pred EeC
Q psy5768 464 GDA 466 (652)
Q Consensus 464 ~D~ 466 (652)
+.+
T Consensus 391 TnS 393 (461)
T PF05694_consen 391 TNS 393 (461)
T ss_dssp E--
T ss_pred Eee
Confidence 975
No 181
>smart00284 OLF Olfactomedin-like domains.
Probab=63.39 E-value=1.7e+02 Score=29.41 Aligned_cols=149 Identities=15% Similarity=0.083 Sum_probs=79.4
Q ss_pred EEEEEcCCCeEEEeecccccEEEEeccCCcce--EEeec------------cCceeeeEEEccCCE-EEEEeCCCCeEEE
Q psy5768 318 ELSYDYKRKTLFYSDIQKGTINSVFFNGSNHR--VLLER------------QGSVEGLAYEYVHNY-LYWTCNNDATINK 382 (652)
Q Consensus 318 ~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~--~i~~~------------~~~~~glAvDw~~~~-LYwtd~~~~~I~~ 382 (652)
.+.|+ +.||+.-..+..|.|.++...... .+++. -..-.++|+|..+=- ||-|....+.|.+
T Consensus 79 ~VVYn---gslYY~~~~s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g~ivv 155 (255)
T smart00284 79 VVVYN---GSLYFNKFNSHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAGKIVI 155 (255)
T ss_pred EEEEC---ceEEEEecCCccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCCCEEE
Confidence 34554 889998878888999998865442 22221 113367999975532 3434455577877
Q ss_pred EEcCCCCCccEEEEEe-CCCCCce-EEEEeCCCCEEEEEecC-CCCCceEEEeecCCCceEEEEcCCC----CCceEEEe
Q psy5768 383 IDLDSPKAQRIVVVRL-GQHDKPR-GIDIDSCDSRIYWTNWN-SHLPSIQRAFFSGFGTESIITTDIT----MPNALALD 455 (652)
Q Consensus 383 ~~~~~~~~~~~~~~~~-~~~~~P~-~Iavdp~~g~Lywtd~~-~~~~~I~r~~ldG~~~~~l~~~~l~----~P~glaiD 455 (652)
..|+-.. ....-.. +...++. +=|.-- -|.||.++.. ....+|.-+.=--++.+..+...+. .-..|...
T Consensus 156 SkLnp~t--L~ve~tW~T~~~k~sa~naFmv-CGvLY~~~s~~~~~~~I~yayDt~t~~~~~~~i~f~n~y~~~s~l~YN 232 (255)
T smart00284 156 SKLNPAT--LTIENTWITTYNKRSASNAFMI-CGILYVTRSLGSKGEKVFYAYDTNTGKEGHLDIPFENMYEYISMLDYN 232 (255)
T ss_pred EeeCccc--ceEEEEEEcCCCcccccccEEE-eeEEEEEccCCCCCcEEEEEEECCCCccceeeeeeccccccceeceeC
Confidence 7765322 1221111 1122221 111221 4889999853 2234666553322222222222222 23357788
Q ss_pred cCCCEEEEEeCCCCeEE
Q psy5768 456 HQAEKLFWGDARLDKIE 472 (652)
Q Consensus 456 ~~~~~LYw~D~~~~~I~ 472 (652)
+.+++||.=|.+.-.++
T Consensus 233 P~d~~LY~wdng~~l~Y 249 (255)
T smart00284 233 PNDRKLYAWNNGHLVHY 249 (255)
T ss_pred CCCCeEEEEeCCeEEEE
Confidence 88999998776543333
No 182
>PTZ00421 coronin; Provisional
Probab=62.13 E-value=2.6e+02 Score=31.15 Aligned_cols=103 Identities=13% Similarity=0.124 Sum_probs=64.6
Q ss_pred cceEEEEEEEcCCCeEEEeecccccEEEEeccCCcc--------eEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEE
Q psy5768 313 MKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNH--------RVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKID 384 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~--------~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~ 384 (652)
...+.+++|++.++.++.+-...+.|...++..... ..+......+..|++.+.+.++..+-+..+.|..-+
T Consensus 75 ~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWD 154 (493)
T PTZ00421 75 EGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWD 154 (493)
T ss_pred CCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEE
Confidence 356789999986655666666667787777643211 112123346778888887777777777778888888
Q ss_pred cCCCCCccEEEEEeC-CCCCceEEEEeCCCCEEEEEe
Q psy5768 385 LDSPKAQRIVVVRLG-QHDKPRGIDIDSCDSRIYWTN 420 (652)
Q Consensus 385 ~~~~~~~~~~~~~~~-~~~~P~~Iavdp~~g~Lywtd 420 (652)
+... ..+..+. ......+|+.+|. |.++.+-
T Consensus 155 l~tg----~~~~~l~~h~~~V~sla~spd-G~lLatg 186 (493)
T PTZ00421 155 VERG----KAVEVIKCHSDQITSLEWNLD-GSLLCTT 186 (493)
T ss_pred CCCC----eEEEEEcCCCCceEEEEEECC-CCEEEEe
Confidence 7642 2222222 3355788999995 4444443
No 183
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=61.60 E-value=7.8 Score=28.64 Aligned_cols=17 Identities=29% Similarity=0.593 Sum_probs=14.7
Q ss_pred eeccCceeeccCCcccC
Q psy5768 572 CSCFTGKVLMEDNRSCT 588 (652)
Q Consensus 572 C~Cp~g~~l~~d~~C~~ 588 (652)
|.|+.||++..+++|++
T Consensus 35 C~C~~G~v~~~~~~CV~ 51 (55)
T PF01826_consen 35 CFCPPGYVRNDNGRCVP 51 (55)
T ss_dssp EEETTTEEEETTSEEEE
T ss_pred CCCCCCeeEcCCCCEEc
Confidence 99999999876678886
No 184
>PLN00181 protein SPA1-RELATED; Provisional
Probab=60.75 E-value=3.5e+02 Score=32.15 Aligned_cols=157 Identities=11% Similarity=0.082 Sum_probs=82.6
Q ss_pred eEEEEEEEcCCCeEEEeecccccEEEEeccCC-----cc---eEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcC
Q psy5768 315 NIIELSYDYKRKTLFYSDIQKGTINSVFFNGS-----NH---RVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLD 386 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~-----~~---~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~ 386 (652)
.+.+++|++..+.| .+-...+.|...+++.. .. .........+.+++.....+.+..+-...+.|.+-++.
T Consensus 485 ~V~~i~fs~dg~~l-atgg~D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~Dg~v~lWd~~ 563 (793)
T PLN00181 485 LVCAIGFDRDGEFF-ATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVA 563 (793)
T ss_pred cEEEEEECCCCCEE-EEEeCCCEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeCCCeEEEEECC
Confidence 46789999866544 44445567776664321 10 01112223456666655444445555667888887875
Q ss_pred CCCCccEEEEEe-CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEe
Q psy5768 387 SPKAQRIVVVRL-GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGD 465 (652)
Q Consensus 387 ~~~~~~~~~~~~-~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D 465 (652)
.. +.+... .......+++++|..+.+++|-.... .|...++........+... .....+.+....+.++.+-
T Consensus 564 ~~----~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg--~v~iWd~~~~~~~~~~~~~-~~v~~v~~~~~~g~~latg 636 (793)
T PLN00181 564 RS----QLVTEMKEHEKRVWSIDYSSADPTLLASGSDDG--SVKLWSINQGVSIGTIKTK-ANICCVQFPSESGRSLAFG 636 (793)
T ss_pred CC----eEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCC--EEEEEECCCCcEEEEEecC-CCeEEEEEeCCCCCEEEEE
Confidence 42 222222 23355788999987777776654432 4555555432222122211 2234455533344444444
Q ss_pred CCCCeEEEEecCCC
Q psy5768 466 ARLDKIERCDYDGT 479 (652)
Q Consensus 466 ~~~~~I~~~~ldG~ 479 (652)
...+.|...|+...
T Consensus 637 s~dg~I~iwD~~~~ 650 (793)
T PLN00181 637 SADHKVYYYDLRNP 650 (793)
T ss_pred eCCCeEEEEECCCC
Confidence 55667777777543
No 185
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=60.51 E-value=75 Score=35.37 Aligned_cols=69 Identities=14% Similarity=0.091 Sum_probs=47.9
Q ss_pred CCCCeeEEEEECCCCEEEEEeccCC--------------cceEEEEEcCCC-------ccEEEEeCC---Cc--------
Q psy5768 36 TLSKISSIAVWPVKGKMFWSNVTKQ--------------VVTIEMAFMDGT-------KRETVVSQK---KY-------- 83 (652)
Q Consensus 36 ~~~~~~~v~~d~~~~~lyw~d~~~~--------------~~~I~~~~~dgs-------~~~~v~~~~---~~-------- 83 (652)
.+.+|..|++++.++.||++..+.. .+.|+|+...+. ..+.++..+ .+
T Consensus 415 ~mdRpE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~ 494 (616)
T COG3211 415 PMDRPEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANI 494 (616)
T ss_pred cccCccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccCc
Confidence 4678999999999999999873222 136899887654 344444433 12
Q ss_pred -----CCccCCCCcEEEEccCCcEEEEeCC
Q psy5768 84 -----PAVTACNLHIAVDWIAQNIYWSDPK 108 (652)
Q Consensus 84 -----~~p~~~~~~lavDw~~~~lY~~d~~ 108 (652)
..|. +|++|..++.+..+|..
T Consensus 495 ~~~~f~~PD----nl~fD~~GrLWi~TDg~ 520 (616)
T COG3211 495 NANWFNSPD----NLAFDPWGRLWIQTDGS 520 (616)
T ss_pred ccccccCCC----ceEECCCCCEEEEecCC
Confidence 3388 99999877766667754
No 186
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=58.65 E-value=2.6e+02 Score=29.93 Aligned_cols=61 Identities=18% Similarity=0.049 Sum_probs=39.3
Q ss_pred CCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCCCeEEEEEccCCce
Q psy5768 458 AEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVIHAVLRANKYTGEE 522 (652)
Q Consensus 458 ~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~~~I~~~~k~~g~~ 522 (652)
+++||.+.. .+.+..+|...... +-......+..+++.+++||..+. .+.++.++..+|+.
T Consensus 256 ~~~vy~~~~-~g~l~ald~~tG~~--~W~~~~~~~~~~~~~~~~vy~~~~-~g~l~ald~~tG~~ 316 (394)
T PRK11138 256 GGVVYALAY-NGNLVALDLRSGQI--VWKREYGSVNDFAVDGGRIYLVDQ-NDRVYALDTRGGVE 316 (394)
T ss_pred CCEEEEEEc-CCeEEEEECCCCCE--EEeecCCCccCcEEECCEEEEEcC-CCeEEEEECCCCcE
Confidence 578888764 35777777753322 112122334456778999999874 56788888888763
No 187
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=58.06 E-value=6.2 Score=20.10 Aligned_cols=9 Identities=33% Similarity=0.962 Sum_probs=6.5
Q ss_pred eEEEeCCcc
Q psy5768 259 VCACAHGVV 267 (652)
Q Consensus 259 ~C~C~~G~l 267 (652)
.|.|++||.
T Consensus 1 ~C~C~~G~~ 9 (13)
T PF12661_consen 1 TCQCPPGWT 9 (13)
T ss_dssp EEEE-TTEE
T ss_pred CccCcCCCc
Confidence 599999984
No 188
>KOG4328|consensus
Probab=56.60 E-value=2.9e+02 Score=29.92 Aligned_cols=146 Identities=14% Similarity=0.159 Sum_probs=88.5
Q ss_pred ceEEEEEEEcCCC-eEEEeecccccEEEEeccC--CcceEEe---eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCC
Q psy5768 314 KNIIELSYDYKRK-TLFYSDIQKGTINSVFFNG--SNHRVLL---ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDS 387 (652)
Q Consensus 314 ~~~~~v~~D~~~~-~lywsd~~~~~I~~~~~~g--~~~~~i~---~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~ 387 (652)
+.+.+++|++..+ ++..+-...|.|.-.++.+ ...+.+. ..-+.+.+|.+.+.+-.-..+.+..|+|.-.++.+
T Consensus 187 ~Rit~l~fHPt~~~~lva~GdK~G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~F~P~n~s~i~ssSyDGtiR~~D~~~ 266 (498)
T KOG4328|consen 187 RRITSLAFHPTENRKLVAVGDKGGQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLKFSPANTSQIYSSSYDGTIRLQDFEG 266 (498)
T ss_pred cceEEEEecccCcceEEEEccCCCcEEEEecCCCCCccCceEEeccCCccccceEecCCChhheeeeccCceeeeeeecc
Confidence 5678999999887 6666655568888888863 2222222 22235677888877766666667778888888765
Q ss_pred CCCccEEEEEeC-CCCCceEEEEeCCCCE-EEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEE
Q psy5768 388 PKAQRIVVVRLG-QHDKPRGIDIDSCDSR-IYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWG 464 (652)
Q Consensus 388 ~~~~~~~~~~~~-~~~~P~~Iavdp~~g~-Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~ 464 (652)
.. .++++... ......++-+....+. ||..+||. -.+.-.+++|+....+..... ...++++.+....++-+
T Consensus 267 ~i--~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G~--f~~iD~R~~~s~~~~~~lh~k-KI~sv~~NP~~p~~laT 340 (498)
T KOG4328|consen 267 NI--SEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVGN--FNVIDLRTDGSEYENLRLHKK-KITSVALNPVCPWFLAT 340 (498)
T ss_pred hh--hHHHhhcCccceeeeeccccCCCccEEEeecccc--eEEEEeecCCccchhhhhhhc-ccceeecCCCCchheee
Confidence 32 23333321 2223344444444554 44456662 266667888886554443332 67889999886554443
No 189
>KOG0272|consensus
Probab=56.00 E-value=2.6e+02 Score=29.90 Aligned_cols=111 Identities=13% Similarity=0.081 Sum_probs=67.1
Q ss_pred EEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCc
Q psy5768 14 VVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLH 92 (652)
Q Consensus 14 ~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~ 92 (652)
+++|+. |.....+-. .++.+.+|+|+|+ |+..-|- .+.+++..+++.+..-.-.+... ...++ .
T Consensus 328 RvWDlRtgr~im~L~g-------H~k~I~~V~fsPN-Gy~lATg--s~Dnt~kVWDLR~r~~ly~ipAH-~nlVS----~ 392 (459)
T KOG0272|consen 328 RVWDLRTGRCIMFLAG-------HIKEILSVAFSPN-GYHLATG--SSDNTCKVWDLRMRSELYTIPAH-SNLVS----Q 392 (459)
T ss_pred heeecccCcEEEEecc-------cccceeeEeECCC-ceEEeec--CCCCcEEEeeecccccceecccc-cchhh----h
Confidence 556665 433333332 5689999999986 6666676 67788888888766433344444 44567 8
Q ss_pred EEEEccCCcEEEEeCCCCEEEEEEcCC-CcEEEEEeCCCCCceeEEEcC
Q psy5768 93 IAVDWIAQNIYWSDPKENVIEVARLTG-QYRYVLISGGVDQPSALAVDP 140 (652)
Q Consensus 93 lavDw~~~~lY~~d~~~~~I~v~~~dg-~~~~~l~~~~~~~P~~iavd~ 140 (652)
+-+++..+....|-+..+.+.+-+..+ +..+.+..- -.+.-++.+-+
T Consensus 393 Vk~~p~~g~fL~TasyD~t~kiWs~~~~~~~ksLaGH-e~kV~s~Dis~ 440 (459)
T KOG0272|consen 393 VKYSPQEGYFLVTASYDNTVKIWSTRTWSPLKSLAGH-EGKVISLDISP 440 (459)
T ss_pred eEecccCCeEEEEcccCcceeeecCCCcccchhhcCC-ccceEEEEecc
Confidence 888887777777766666665554433 333333321 23444555554
No 190
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=55.17 E-value=8.5 Score=36.12 Aligned_cols=44 Identities=23% Similarity=0.553 Sum_probs=29.0
Q ss_pred ccccCCC----CceeeeccCceeeccCCcccCcccccCCCceeeccCeecCCc
Q psy5768 561 ICKLDET----GQVVCSCFTGKVLMEDNRSCTINTVCSEHDFKCSDGMCIPFN 609 (652)
Q Consensus 561 lCl~~~~----~~~~C~Cp~g~~l~~d~~C~~~~~~C~~~~f~C~~g~Ci~~~ 609 (652)
.|+..+. ..+.|.|-.||.+..+ .|.+ ..|. .+.|++|.||-..
T Consensus 57 ~C~~~~~~~~~~~~~C~C~~gY~~~~~-vCvp--~~C~--~~~Cg~GKCI~d~ 104 (197)
T PF06247_consen 57 KCINQANKGEERAYKCDCINGYILKQG-VCVP--NKCN--NKDCGSGKCILDP 104 (197)
T ss_dssp EEEE-SSTTSSTSEEEEE-TTEEESSS-SEEE--GGGS--S---TTEEEEEEE
T ss_pred hhhcCCCcccceeEEEecccCceeeCC-eEch--hhcC--ceecCCCeEEecC
Confidence 4666553 3899999999999765 7875 5675 6889999999543
No 191
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=55.01 E-value=2.8e+02 Score=29.31 Aligned_cols=105 Identities=13% Similarity=0.031 Sum_probs=52.0
Q ss_pred CCEEEEEecCCCCCceEEEee-cCCCceEEEEcC-CCCCceEEEecCCCEEEEEeCCCCeEEEEecC-CCceEEEecC--
Q psy5768 413 DSRIYWTNWNSHLPSIQRAFF-SGFGTESIITTD-ITMPNALALDHQAEKLFWGDARLDKIERCDYD-GTNRIVLSKI-- 487 (652)
Q Consensus 413 ~g~Lywtd~~~~~~~I~r~~l-dG~~~~~l~~~~-l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ld-G~~~~~l~~~-- 487 (652)
.|.||+.++.. +++..++ +|+..-..-... ..+..+..+. ..++|+.. ..+.+..++-+ |..+-..-..
T Consensus 111 ~G~i~~g~~~g---~~y~ld~~~G~~~W~~~~~~~~~~~~~~v~~--~~~v~~~s-~~g~~~al~~~tG~~~W~~~~~~~ 184 (370)
T COG1520 111 DGKIYVGSWDG---KLYALDASTGTLVWSRNVGGSPYYASPPVVG--DGTVYVGT-DDGHLYALNADTGTLKWTYETPAP 184 (370)
T ss_pred CCeEEEecccc---eEEEEECCCCcEEEEEecCCCeEEecCcEEc--CcEEEEec-CCCeEEEEEccCCcEEEEEecCCc
Confidence 56777777653 5555565 454322211111 1111122222 45666553 33456666665 5443332111
Q ss_pred -CCCceeEEEEeCCEEEEEcCC-CCeEEEEEccCCceE
Q psy5768 488 -SPLHPFDMAVYGEFIFWTDWV-IHAVLRANKYTGEEV 523 (652)
Q Consensus 488 -~~~~p~glav~~~~lYwtd~~-~~~I~~~~k~~g~~~ 523 (652)
......+.++.++.+|+.... +..++.++..+|...
T Consensus 185 ~~~~~~~~~~~~~~~vy~~~~~~~~~~~a~~~~~G~~~ 222 (370)
T COG1520 185 LSLSIYGSPAIASGTVYVGSDGYDGILYALNAEDGTLK 222 (370)
T ss_pred cccccccCceeecceEEEecCCCcceEEEEEccCCcEe
Confidence 112222333677888887553 446788888777643
No 192
>KOG0303|consensus
Probab=53.53 E-value=3.1e+02 Score=29.25 Aligned_cols=153 Identities=10% Similarity=0.107 Sum_probs=86.3
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC-CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEE
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ-KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVAR 116 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~-~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~ 116 (652)
+++--|+|+|....+..+- .....|..++..-.. .+++- . -..+. ++.+.+ .+.+..|-...++|.+.+
T Consensus 132 rrVg~V~wHPtA~NVLlsa--g~Dn~v~iWnv~tge--ali~l~h-pd~i~----S~sfn~-dGs~l~TtckDKkvRv~d 201 (472)
T KOG0303|consen 132 RRVGLVQWHPTAPNVLLSA--GSDNTVSIWNVGTGE--ALITLDH-PDMVY----SMSFNR-DGSLLCTTCKDKKVRVID 201 (472)
T ss_pred eeEEEEeecccchhhHhhc--cCCceEEEEeccCCc--eeeecCC-CCeEE----EEEecc-CCceeeeecccceeEEEc
Confidence 4555688999888888877 677888888876432 22221 2 23344 777776 556777777888899998
Q ss_pred cCC-CcEEEE-EeCCCCCceeEEEcCCCCeEEEEecCCCC--eEEEEeCCCCCcEEEEeecccCceeE---EEeccCCEE
Q psy5768 117 LTG-QYRYVL-ISGGVDQPSALAVDPESGYLFWSESGKIP--LIARAGLDGKKQTILAQEIIMPIKDI---TLDLKFFSA 189 (652)
Q Consensus 117 ~dg-~~~~~l-~~~~~~~P~~iavd~~~g~lywtd~~~~~--~I~~~~ldg~~~~~~~~~~~~~p~gl---~lD~~~~~l 189 (652)
+.. +....- ...+...+|.|-+ .+|.++-|-+.... .|..-+.+.-+.. +....+..-+|+ -.|..++.|
T Consensus 202 pr~~~~v~e~~~heG~k~~Raifl--~~g~i~tTGfsr~seRq~aLwdp~nl~eP-~~~~elDtSnGvl~PFyD~dt~iv 278 (472)
T KOG0303|consen 202 PRRGTVVSEGVAHEGAKPARAIFL--ASGKIFTTGFSRMSERQIALWDPNNLEEP-IALQELDTSNGVLLPFYDPDTSIV 278 (472)
T ss_pred CCCCcEeeecccccCCCcceeEEe--ccCceeeeccccccccceeccCcccccCc-ceeEEeccCCceEEeeecCCCCEE
Confidence 753 322222 2235667788887 47776655432111 1111111100000 111112233343 247778888
Q ss_pred EEEeCCCCcEEEEE
Q psy5768 190 FYRNLSKGNIHIIS 203 (652)
Q Consensus 190 y~~d~~g~~~~~i~ 203 (652)
|.+.-..+++|..-
T Consensus 279 Yl~GKGD~~IRYyE 292 (472)
T KOG0303|consen 279 YLCGKGDSSIRYFE 292 (472)
T ss_pred EEEecCCcceEEEE
Confidence 88877667776543
No 193
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=53.33 E-value=1.1e+02 Score=34.24 Aligned_cols=112 Identities=11% Similarity=0.049 Sum_probs=66.6
Q ss_pred cccceEEEEEEEcCCCeEEEeeccc----------------ccEEEEeccCC---c---c-eEEe-----ec--------
Q psy5768 311 TMMKNIIELSYDYKRKTLFYSDIQK----------------GTINSVFFNGS---N---H-RVLL-----ER-------- 354 (652)
Q Consensus 311 ~~~~~~~~v~~D~~~~~lywsd~~~----------------~~I~~~~~~g~---~---~-~~i~-----~~-------- 354 (652)
..+..+..+++.+.++++|++.... +.|+|....+. . . ++++ ..
T Consensus 414 T~mdRpE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~ 493 (616)
T COG3211 414 TPMDRPEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASAN 493 (616)
T ss_pred ccccCccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccC
Confidence 3456788999999999999986543 45887764332 1 1 2222 11
Q ss_pred -----cCceeeeEEEccCCEEEEEeCCCCe-------EEEEEcCCCCCc-cEEEEEeCCCCCceEEEEeCCCCEEEEEec
Q psy5768 355 -----QGSVEGLAYEYVHNYLYWTCNNDAT-------INKIDLDSPKAQ-RIVVVRLGQHDKPRGIDIDSCDSRIYWTNW 421 (652)
Q Consensus 355 -----~~~~~glAvDw~~~~LYwtd~~~~~-------I~~~~~~~~~~~-~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~ 421 (652)
..+|.+|+||..++..--||....+ +..+...+...+ .+..++........+++..|..+.||+.-.
T Consensus 494 ~~~~~f~~PDnl~fD~~GrLWi~TDg~~s~~~~~~~G~~~m~~~~p~~g~~~rf~t~P~g~E~tG~~FspD~~TlFV~vQ 573 (616)
T COG3211 494 INANWFNSPDNLAFDPWGRLWIQTDGSGSTLRNRFRGVTQMLTPDPKTGTIKRFLTGPIGCEFTGPCFSPDGKTLFVNVQ 573 (616)
T ss_pred cccccccCCCceEECCCCCEEEEecCCCCccCcccccccccccCCCccceeeeeccCCCcceeecceeCCCCceEEEEec
Confidence 2349999999999887778765431 111111111111 122222223345678899998889998764
Q ss_pred C
Q psy5768 422 N 422 (652)
Q Consensus 422 ~ 422 (652)
+
T Consensus 574 H 574 (616)
T COG3211 574 H 574 (616)
T ss_pred C
Confidence 4
No 194
>KOG0272|consensus
Probab=52.76 E-value=3.2e+02 Score=29.28 Aligned_cols=103 Identities=5% Similarity=-0.039 Sum_probs=57.3
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeE
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKI 471 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I 471 (652)
+..++..+......+++.+| +||..-|-.+.+..+|++..|--.--++....+ -.+.+-.++..++...+-++.+.+
T Consensus 336 r~im~L~gH~k~I~~V~fsP-NGy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~n--lVS~Vk~~p~~g~fL~TasyD~t~ 412 (459)
T KOG0272|consen 336 RCIMFLAGHIKEILSVAFSP-NGYHLATGSSDNTCKVWDLRMRSELYTIPAHSN--LVSQVKYSPQEGYFLVTASYDNTV 412 (459)
T ss_pred cEEEEecccccceeeEeECC-CceEEeecCCCCcEEEeeecccccceecccccc--hhhheEecccCCeEEEEcccCcce
Confidence 44555555667888999999 899988877766567877766544222222222 345666776556555554444433
Q ss_pred EEEecCC-CceEEEecCCCCceeEEEEe
Q psy5768 472 ERCDYDG-TNRIVLSKISPLHPFDMAVY 498 (652)
Q Consensus 472 ~~~~ldG-~~~~~l~~~~~~~p~glav~ 498 (652)
....-.+ +-.+.+.. .-...+++.+.
T Consensus 413 kiWs~~~~~~~ksLaG-He~kV~s~Dis 439 (459)
T KOG0272|consen 413 KIWSTRTWSPLKSLAG-HEGKVISLDIS 439 (459)
T ss_pred eeecCCCcccchhhcC-CccceEEEEec
Confidence 3322222 22333322 22455666655
No 195
>PTZ00420 coronin; Provisional
Probab=52.71 E-value=4e+02 Score=30.33 Aligned_cols=103 Identities=11% Similarity=0.167 Sum_probs=65.9
Q ss_pred cceEEEEEEEcCCCeEEEeecccccEEEEeccCCc--ce------EEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEE
Q psy5768 313 MKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSN--HR------VLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKI 383 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~--~~------~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~ 383 (652)
...+.+++|++..+.++.+-...+.|...++.... .. ..+ .....+..+++.+.+.++..+-+..++|.+-
T Consensus 74 ~~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIW 153 (568)
T PTZ00420 74 TSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIW 153 (568)
T ss_pred CCCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEE
Confidence 35678899998766666666666778777764221 11 112 2334678899999888887777777888888
Q ss_pred EcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEe
Q psy5768 384 DLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTN 420 (652)
Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd 420 (652)
++... ..+..........+++.+|.. .++.+.
T Consensus 154 Dl~tg----~~~~~i~~~~~V~SlswspdG-~lLat~ 185 (568)
T PTZ00420 154 DIENE----KRAFQINMPKKLSSLKWNIKG-NLLSGT 185 (568)
T ss_pred ECCCC----cEEEEEecCCcEEEEEECCCC-CEEEEE
Confidence 87542 222222233567899999954 444443
No 196
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=52.40 E-value=12 Score=25.39 Aligned_cols=19 Identities=26% Similarity=0.469 Sum_probs=16.1
Q ss_pred CCceeEEEcCCCCeEEEEec
Q psy5768 131 DQPSALAVDPESGYLFWSES 150 (652)
Q Consensus 131 ~~P~~iavd~~~g~lywtd~ 150 (652)
..|.+|++|+ +|.+|.+=+
T Consensus 13 ~~~~~IavD~-~GNiYv~G~ 31 (38)
T PF06739_consen 13 DYGNGIAVDS-NGNIYVTGY 31 (38)
T ss_pred eeEEEEEECC-CCCEEEEEe
Confidence 5799999997 799999854
No 197
>KOG0279|consensus
Probab=51.05 E-value=2.8e+02 Score=28.12 Aligned_cols=219 Identities=7% Similarity=-0.000 Sum_probs=116.8
Q ss_pred CCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcc-eEEeeccCceeeeEEEccCCEEEEEeCCC
Q psy5768 299 DLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNH-RVLLERQGSVEGLAYEYVHNYLYWTCNND 377 (652)
Q Consensus 299 ~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~-~~i~~~~~~~~glAvDw~~~~LYwtd~~~ 377 (652)
.+..|.+.+...++ .+..++.-.. +..+.+-...+.++.-++.++.. +.+...-..+-++|++..++.| .+-+..
T Consensus 51 ~~G~~~r~~~GHsH--~v~dv~~s~d-g~~alS~swD~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qi-vSGSrD 126 (315)
T KOG0279|consen 51 KYGVPVRRLTGHSH--FVSDVVLSSD-GNFALSASWDGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQI-VSGSRD 126 (315)
T ss_pred ccCceeeeeeccce--EecceEEccC-CceEEeccccceEEEEEecCCcEEEEEEecCCceEEEEecCCCcee-ecCCCc
Confidence 34445555543222 2333333332 33455555567777778776433 3333444578999999988887 455666
Q ss_pred CeEEEEEcCCCCCccEEEEEeCC-CCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEec
Q psy5768 378 ATINKIDLDSPKAQRIVVVRLGQ-HDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDH 456 (652)
Q Consensus 378 ~~I~~~~~~~~~~~~~~~~~~~~-~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~ 456 (652)
.+|..-+.-|. .+..+.... .+...-+.+.|.....++...+... .+...++++-..+.-.--.-...+.+++.+
T Consensus 127 kTiklwnt~g~---ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~Dk-tvKvWnl~~~~l~~~~~gh~~~v~t~~vSp 202 (315)
T KOG0279|consen 127 KTIKLWNTLGV---CKYTIHEDSHREWVSCVRFSPNESNPIIVSASWDK-TVKVWNLRNCQLRTTFIGHSGYVNTVTVSP 202 (315)
T ss_pred ceeeeeeeccc---EEEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCc-eEEEEccCCcchhhccccccccEEEEEECC
Confidence 67766665432 222222111 3455667788876454544433332 555566766554332222334567888887
Q ss_pred CCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 457 QAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 457 ~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
++. |.-.-.+.+.+.-.|++-...-.-+. ......+|++-.+.....-....+|..-+..++..+..+.
T Consensus 203 DGs-lcasGgkdg~~~LwdL~~~k~lysl~-a~~~v~sl~fspnrywL~~at~~sIkIwdl~~~~~v~~l~ 271 (315)
T KOG0279|consen 203 DGS-LCASGGKDGEAMLWDLNEGKNLYSLE-AFDIVNSLCFSPNRYWLCAATATSIKIWDLESKAVVEELK 271 (315)
T ss_pred CCC-EEecCCCCceEEEEEccCCceeEecc-CCCeEeeEEecCCceeEeeccCCceEEEeccchhhhhhcc
Confidence 644 44444556778888887544321122 2234466666655443333344455555655554444333
No 198
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=50.65 E-value=2.7e+02 Score=27.87 Aligned_cols=139 Identities=16% Similarity=0.115 Sum_probs=81.1
Q ss_pred CCEEEEEeccCCcceEEEEEcCCCccE-EE-EeCC------CcCCccCCCCcEEEEccCC--cEEEEeCCCCEEEEEEcC
Q psy5768 49 KGKMFWSNVTKQVVTIEMAFMDGTKRE-TV-VSQK------KYPAVTACNLHIAVDWIAQ--NIYWSDPKENVIEVARLT 118 (652)
Q Consensus 49 ~~~lyw~d~~~~~~~I~~~~~dgs~~~-~v-~~~~------~~~~p~~~~~~lavDw~~~--~lY~~d~~~~~I~v~~~d 118 (652)
++.||+-- .+...|.|.++...... .. +... .+.......+.+|+|. +| -||-+....+.|.+..+|
T Consensus 78 ngslYY~~--~~s~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE-~GLWvIYat~~~~g~ivvskld 154 (250)
T PF02191_consen 78 NGSLYYNK--YNSRNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDE-NGLWVIYATEDNNGNIVVSKLD 154 (250)
T ss_pred CCcEEEEe--cCCceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEEEcC-CCEEEEEecCCCCCcEEEEeeC
Confidence 68899887 67889999999865433 22 2111 0111223347999995 55 455566666678888777
Q ss_pred CCcE--EEEEeCCCCCce---eEEEcCCCCeEEEEecCC--CCeEEEE-eCC-CCCcEE--EEeecccCceeEEEeccCC
Q psy5768 119 GQYR--YVLISGGVDQPS---ALAVDPESGYLFWSESGK--IPLIARA-GLD-GKKQTI--LAQEIIMPIKDITLDLKFF 187 (652)
Q Consensus 119 g~~~--~~l~~~~~~~P~---~iavd~~~g~lywtd~~~--~~~I~~~-~ld-g~~~~~--~~~~~~~~p~gl~lD~~~~ 187 (652)
-... +....+.+.++. ++.+ =|.||.++... ..+|.-+ ++. ++...+ ...........|..++.++
T Consensus 155 ~~tL~v~~tw~T~~~k~~~~naFmv---CGvLY~~~s~~~~~~~I~yafDt~t~~~~~~~i~f~~~~~~~~~l~YNP~dk 231 (250)
T PF02191_consen 155 PETLSVEQTWNTSYPKRSAGNAFMV---CGVLYATDSYDTRDTEIFYAFDTYTGKEEDVSIPFPNPYGNISMLSYNPRDK 231 (250)
T ss_pred cccCceEEEEEeccCchhhcceeeE---eeEEEEEEECCCCCcEEEEEEECCCCceeceeeeeccccCceEeeeECCCCC
Confidence 5422 222223333333 5555 48999999753 2455544 333 332221 1222255667888889999
Q ss_pred EEEEEe
Q psy5768 188 SAFYRN 193 (652)
Q Consensus 188 ~ly~~d 193 (652)
+||.-|
T Consensus 232 ~LY~wd 237 (250)
T PF02191_consen 232 KLYAWD 237 (250)
T ss_pred eEEEEE
Confidence 999754
No 199
>KOG0308|consensus
Probab=50.24 E-value=2.5e+02 Score=31.87 Aligned_cols=142 Identities=14% Similarity=0.093 Sum_probs=88.1
Q ss_pred eEEEecCCCCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCC-ccEEEEe
Q psy5768 2 FIAVSSPTQSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGT-KRETVVS 79 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs-~~~~v~~ 79 (652)
+|+|+-.++.-|+++|.. ++....+..+ -.++..|-++..+.++. +- ...++|..+++... -..++.-
T Consensus 184 t~ivsGgtek~lr~wDprt~~kimkLrGH-------TdNVr~ll~~dDGt~~l-s~--sSDgtIrlWdLgqQrCl~T~~v 253 (735)
T KOG0308|consen 184 TIIVSGGTEKDLRLWDPRTCKKIMKLRGH-------TDNVRVLLVNDDGTRLL-SA--SSDGTIRLWDLGQQRCLATYIV 253 (735)
T ss_pred eEEEecCcccceEEeccccccceeeeecc-------ccceEEEEEcCCCCeEe-ec--CCCceEEeeeccccceeeeEEe
Confidence 578887777888888765 4444445432 25778888887666666 44 57788988888754 2233222
Q ss_pred CCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEE
Q psy5768 80 QKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARA 159 (652)
Q Consensus 80 ~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~ 159 (652)
. -+.+- .++.+..-..+|..+. .+.|.+.++......+++-..-.....++++....-+ |.-. ..+.|.|-
T Consensus 254 H--~e~VW----aL~~~~sf~~vYsG~r-d~~i~~Tdl~n~~~~tlick~daPv~~l~~~~~~~~~-WvtT-tds~I~rW 324 (735)
T KOG0308|consen 254 H--KEGVW----ALQSSPSFTHVYSGGR-DGNIYRTDLRNPAKSTLICKEDAPVLKLHLHEHDNSV-WVTT-TDSSIKRW 324 (735)
T ss_pred c--cCceE----EEeeCCCcceEEecCC-CCcEEecccCCchhheEeecCCCchhhhhhccccCCc-eeee-ccccceec
Confidence 1 22355 7777766778888774 6788888887744445555444455577777544444 5432 24566665
Q ss_pred eCC
Q psy5768 160 GLD 162 (652)
Q Consensus 160 ~ld 162 (652)
.+.
T Consensus 325 ~~~ 327 (735)
T KOG0308|consen 325 KLE 327 (735)
T ss_pred CCc
Confidence 543
No 200
>KOG2048|consensus
Probab=49.50 E-value=4.5e+02 Score=29.98 Aligned_cols=156 Identities=10% Similarity=0.074 Sum_probs=92.0
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEeec----cCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCC
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLER----QGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPK 389 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~----~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~ 389 (652)
.++.--+.-+..+.|-++-...-+||+++.++......++. +.....+.+--.+.+|+...-....++..++.+..
T Consensus 383 ~nIs~~aiSPdg~~Ia~st~~~~~iy~L~~~~~vk~~~v~~~~~~~~~a~~i~ftid~~k~~~~s~~~~~le~~el~~ps 462 (691)
T KOG2048|consen 383 ENISCAAISPDGNLIAISTVSRTKIYRLQPDPNVKVINVDDVPLALLDASAISFTIDKNKLFLVSKNIFSLEEFELETPS 462 (691)
T ss_pred cceeeeccCCCCCEEEEeeccceEEEEeccCcceeEEEeccchhhhccceeeEEEecCceEEEEecccceeEEEEecCcc
Confidence 45555566777888888888888999999887333333322 12334444444467777766555677777776644
Q ss_pred CccEEEEEe---CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEE-cCCCCCceEEEe-cCCCEEEEE
Q psy5768 390 AQRIVVVRL---GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIIT-TDITMPNALALD-HQAEKLFWG 464 (652)
Q Consensus 390 ~~~~~~~~~---~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~-~~l~~P~glaiD-~~~~~LYw~ 464 (652)
. +.+... ..-....-|++.|...||-..+. .+.|...++.+.....+.. -+ ...+++++- ...++|-.+
T Consensus 463 ~--kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~t---~g~I~v~nl~~~~~~~l~~rln-~~vTa~~~~~~~~~~lvva 536 (691)
T KOG2048|consen 463 F--KELKSIQSQAKCPSISRLVVSSDGNYIAAIST---RGQIFVYNLETLESHLLKVRLN-IDVTAAAFSPFVRNRLVVA 536 (691)
T ss_pred h--hhhhccccccCCCcceeEEEcCCCCEEEEEec---cceEEEEEcccceeecchhccC-cceeeeeccccccCcEEEE
Confidence 2 222211 12234466899998888888763 2488888888776655542 11 122344444 345666666
Q ss_pred eCCCCeEEEEec
Q psy5768 465 DARLDKIERCDY 476 (652)
Q Consensus 465 D~~~~~I~~~~l 476 (652)
++.. .+.-.++
T Consensus 537 ts~n-Qv~efdi 547 (691)
T KOG2048|consen 537 TSNN-QVFEFDI 547 (691)
T ss_pred ecCC-eEEEEec
Confidence 5543 3444444
No 201
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=48.95 E-value=1.1e+02 Score=34.53 Aligned_cols=97 Identities=16% Similarity=0.101 Sum_probs=50.9
Q ss_pred cEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEe
Q psy5768 92 HIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQ 171 (652)
Q Consensus 92 ~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~ 171 (652)
.+++|...+.|||--.+-.- ..+..| -...+..-.-+|||+.+|.+-|.=.....-++ ++|.....+|+.
T Consensus 238 ~~s~D~~~~lvy~~tGnp~p-----~~~~~r---~gdnl~~~s~vAld~~TG~~~W~~Q~~~~D~w--D~d~~~~p~l~d 307 (527)
T TIGR03075 238 TGSYDPETNLIYFGTGNPSP-----WNSHLR---PGDNLYTSSIVARDPDTGKIKWHYQTTPHDEW--DYDGVNEMILFD 307 (527)
T ss_pred ceeEcCCCCeEEEeCCCCCC-----CCCCCC---CCCCccceeEEEEccccCCEEEeeeCCCCCCc--cccCCCCcEEEE
Confidence 45888888888886543211 223333 11223445678999999999997322111122 333333333332
Q ss_pred ec-ccC-ceeEEEeccCCEEEEEeCCCCc
Q psy5768 172 EI-IMP-IKDITLDLKFFSAFYRNLSKGN 198 (652)
Q Consensus 172 ~~-~~~-p~gl~lD~~~~~ly~~d~~g~~ 198 (652)
-. -.. -..++.-.+++.+|+.|...|.
T Consensus 308 ~~~~G~~~~~v~~~~K~G~~~vlDr~tG~ 336 (527)
T TIGR03075 308 LKKDGKPRKLLAHADRNGFFYVLDRTNGK 336 (527)
T ss_pred eccCCcEEEEEEEeCCCceEEEEECCCCc
Confidence 11 011 1223334477888888876553
No 202
>KOG0285|consensus
Probab=48.86 E-value=3.4e+02 Score=28.50 Aligned_cols=162 Identities=15% Similarity=0.104 Sum_probs=92.9
Q ss_pred eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeC-CCCCceEEEEeCCCCEEEEEecCCCCCceEEE
Q psy5768 353 ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLG-QHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRA 431 (652)
Q Consensus 353 ~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~-~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~ 431 (652)
..++-+..+|||+. +.-|.|.+..++|...++... +..++++ -....+++||.+..-|||=+--+ ..|...
T Consensus 149 gHlgWVr~vavdP~-n~wf~tgs~DrtikIwDlatg----~LkltltGhi~~vr~vavS~rHpYlFs~ged---k~VKCw 220 (460)
T KOG0285|consen 149 GHLGWVRSVAVDPG-NEWFATGSADRTIKIWDLATG----QLKLTLTGHIETVRGVAVSKRHPYLFSAGED---KQVKCW 220 (460)
T ss_pred hccceEEEEeeCCC-ceeEEecCCCceeEEEEcccC----eEEEeecchhheeeeeeecccCceEEEecCC---CeeEEE
Confidence 34567889999986 667778888888888888652 3334443 45788999999999999976433 145544
Q ss_pred eecCCCceEEE--EcCCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCcee-EEEEe--CCEEEEEc
Q psy5768 432 FFSGFGTESII--TTDITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPF-DMAVY--GEFIFWTD 506 (652)
Q Consensus 432 ~ldG~~~~~l~--~~~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~-glav~--~~~lYwtd 506 (652)
+|.-. ++|- .-.+.....|++-+.-++|+ +-.....|..-|+........+.+. ..|. .+... +..||- -
T Consensus 221 DLe~n--kvIR~YhGHlS~V~~L~lhPTldvl~-t~grDst~RvWDiRtr~~V~~l~GH-~~~V~~V~~~~~dpqvit-~ 295 (460)
T KOG0285|consen 221 DLEYN--KVIRHYHGHLSGVYCLDLHPTLDVLV-TGGRDSTIRVWDIRTRASVHVLSGH-TNPVASVMCQPTDPQVIT-G 295 (460)
T ss_pred echhh--hhHHHhccccceeEEEeccccceeEE-ecCCcceEEEeeecccceEEEecCC-CCcceeEEeecCCCceEE-e
Confidence 44321 2221 11244455666665555554 4344445666677665554444433 3343 23222 334443 3
Q ss_pred CCCCeEEEEEccCCceEEEEe
Q psy5768 507 WVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 507 ~~~~~I~~~~k~~g~~~~~~~ 527 (652)
....+|.--+...|+...++.
T Consensus 296 S~D~tvrlWDl~agkt~~tlt 316 (460)
T KOG0285|consen 296 SHDSTVRLWDLRAGKTMITLT 316 (460)
T ss_pred cCCceEEEeeeccCceeEeee
Confidence 344455555555666555443
No 203
>KOG0294|consensus
Probab=48.38 E-value=3.3e+02 Score=28.14 Aligned_cols=180 Identities=17% Similarity=0.123 Sum_probs=0.0
Q ss_pred CeEEEecCCCCeEEEEecCCCe-------------eEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEE
Q psy5768 1 MFIAVSSPTQSKIVVCNLEGEY-------------QTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMA 67 (652)
Q Consensus 1 ~~i~v~~~~~~~I~~~~~~g~~-------------~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~ 67 (652)
|.|+|.+ ....|+.|.++-+. ...+..+.+ .+++||++ ++-..+- ....+|..+
T Consensus 2 m~iIvGt-YE~~i~Gf~l~~~~~~~~~s~~~~l~~lF~~~aH~~-------sitavAVs---~~~~aSG--ssDetI~IY 68 (362)
T KOG0294|consen 2 MEIIVGT-YEHVILGFKLDPEPKGCTDSVKPTLKPLFAFSAHAG-------SITALAVS---GPYVASG--SSDETIHIY 68 (362)
T ss_pred eeEEEee-eeeEEEEEEeccCccccccccceeeecccccccccc-------ceeEEEec---ceeEecc--CCCCcEEEE
Q ss_pred EcCCCccEEEEeCCCcCCccCCCCcEEE-EccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEE
Q psy5768 68 FMDGTKRETVVSQKKYPAVTACNLHIAV-DWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLF 146 (652)
Q Consensus 68 ~~dgs~~~~v~~~~~~~~p~~~~~~lav-Dw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~ly 146 (652)
++--.-....+... .+..+ .+-+ -..+.....+-+..+.|.+.+.+.=...--+..--.+..+|+++| .|+|=
T Consensus 69 Dm~k~~qlg~ll~H-agsit----aL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~~Vt~lsiHP-S~KLA 142 (362)
T KOG0294|consen 69 DMRKRKQLGILLSH-AGSIT----ALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKGQVTDLSIHP-SGKLA 142 (362)
T ss_pred eccchhhhcceecc-ccceE----EEEecCCcchhheeeecCCCcEEEEEcCCeEEeeeecccccccceeEecC-CCceE
Q ss_pred EEecCCCCeEEEEe-CCCCCcEEEEeecccCceeEEEeccCCEEEEEeCCCCcEEEE
Q psy5768 147 WSESGKIPLIARAG-LDGKKQTILAQEIIMPIKDITLDLKFFSAFYRNLSKGNIHII 202 (652)
Q Consensus 147 wtd~~~~~~I~~~~-ldg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~i 202 (652)
.+=.+ ...+..-+ +.|..-.++--.. .+.-+..++.+.+.|++-.++-.+.++
T Consensus 143 LsVg~-D~~lr~WNLV~Gr~a~v~~L~~--~at~v~w~~~Gd~F~v~~~~~i~i~q~ 196 (362)
T KOG0294|consen 143 LSVGG-DQVLRTWNLVRGRVAFVLNLKN--KATLVSWSPQGDHFVVSGRNKIDIYQL 196 (362)
T ss_pred EEEcC-CceeeeehhhcCccceeeccCC--cceeeEEcCCCCEEEEEeccEEEEEec
No 204
>KOG1225|consensus
Probab=47.89 E-value=30 Score=38.38 Aligned_cols=71 Identities=24% Similarity=0.474 Sum_probs=37.7
Q ss_pred ceeeeccCceeeccC--CcccCcccccCCCceeeccCeecCC-c---cCC---------CC-------CCCCCCCCCCCC
Q psy5768 569 QVVCSCFTGKVLMED--NRSCTINTVCSEHDFKCSDGMCIPF-N---QTC---------DR-------VYNCHDKSDEGI 626 (652)
Q Consensus 569 ~~~C~Cp~g~~l~~d--~~C~~~~~~C~~~~f~C~~g~Ci~~-~---~~C---------d~-------~~dC~d~sde~~ 626 (652)
...|.|+.+|..... ..|.. .|.- .++|.+|+||-. . ..| .+ ...|.++.- .
T Consensus 233 ~~ic~c~~~~~g~~c~~~~C~~---~c~~-~g~c~~G~CIC~~Gf~G~dC~e~~Cp~~cs~~g~~~~g~CiC~~g~~--G 306 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCSTIYCPG---GCTG-RGQCVEGRCICPPGFTGDDCDELVCPVDCSGGGVCVDGECICNPGYS--G 306 (525)
T ss_pred CceeecCCceeCCccccccCCC---CCcc-cceEeCCeEeCCCCCcCCCCCcccCCcccCCCceecCCEeecCCCcc--c
Confidence 358999999866433 23322 2322 266766766631 1 112 22 122332222 3
Q ss_pred CCCCCCCCCCCeeecC-CCCccc
Q psy5768 627 LYCAMRDCRPGYFKCD-NNKCIL 648 (652)
Q Consensus 627 ~~C~~~~C~~~~f~C~-~~~Ci~ 648 (652)
..|..+.|+.+ |. .|+||+
T Consensus 307 ~dCs~~~cpad---C~g~G~Ci~ 326 (525)
T KOG1225|consen 307 KDCSIRRCPAD---CSGHGKCID 326 (525)
T ss_pred cccccccCCcc---CCCCCcccC
Confidence 34877778877 86 478884
No 205
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=47.34 E-value=3.7e+02 Score=28.38 Aligned_cols=61 Identities=15% Similarity=0.118 Sum_probs=39.7
Q ss_pred CCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeCCEEEEEcCCCCeEEEEEccCCce
Q psy5768 458 AEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYGEFIFWTDWVIHAVLRANKYTGEE 522 (652)
Q Consensus 458 ~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~~~lYwtd~~~~~I~~~~k~~g~~ 522 (652)
+++||.... .+.+..++....... -......+..+++.+++||..+ ..+.|+.+++.+|+.
T Consensus 241 ~~~vy~~~~-~g~l~a~d~~tG~~~--W~~~~~~~~~p~~~~~~vyv~~-~~G~l~~~d~~tG~~ 301 (377)
T TIGR03300 241 GGQVYAVSY-QGRVAALDLRSGRVL--WKRDASSYQGPAVDDNRLYVTD-ADGVVVALDRRSGSE 301 (377)
T ss_pred CCEEEEEEc-CCEEEEEECCCCcEE--EeeccCCccCceEeCCEEEEEC-CCCeEEEEECCCCcE
Confidence 578888764 457888887432221 1111233445667899999987 467899999888863
No 206
>KOG2110|consensus
Probab=47.31 E-value=3.7e+02 Score=28.36 Aligned_cols=147 Identities=11% Similarity=0.100 Sum_probs=88.3
Q ss_pred eEEEecCCCCeEEEEecCCCe-eEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC
Q psy5768 2 FIAVSSPTQSKIVVCNLEGEY-QTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 80 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~g~~-~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~ 80 (652)
+++|+= .+.|++.|+.... ..+|.... ++-....+++.+..+.++=+-+ ....+.|+.++...-.....+..
T Consensus 99 RLvV~L--ee~IyIydI~~MklLhTI~t~~----~n~~gl~AlS~n~~n~ylAyp~-s~t~GdV~l~d~~nl~~v~~I~a 171 (391)
T KOG2110|consen 99 RLVVCL--EESIYIYDIKDMKLLHTIETTP----PNPKGLCALSPNNANCYLAYPG-STTSGDVVLFDTINLQPVNTINA 171 (391)
T ss_pred eEEEEE--cccEEEEecccceeehhhhccC----CCccceEeeccCCCCceEEecC-CCCCceEEEEEcccceeeeEEEe
Confidence 455654 4569999988553 34554421 1334456666665555666544 13467788888765433333445
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEE-EEEE-cCCCcEEEEEeCC-CCCceeEEEcCCCCeEEEEecCCCCeEE
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVI-EVAR-LTGQYRYVLISGG-VDQPSALAVDPESGYLFWSESGKIPLIA 157 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I-~v~~-~dg~~~~~l~~~~-~~~P~~iavd~~~g~lywtd~~~~~~I~ 157 (652)
. -+... .||++. .|.+.-|-+.++.| .|+. .+|+....+-.+- .-+-..|+.+|...+|-.+. ....|.
T Consensus 172 H-~~~lA----alafs~-~G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS--~TeTVH 243 (391)
T KOG2110|consen 172 H-KGPLA----ALAFSP-DGTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASS--NTETVH 243 (391)
T ss_pred c-CCcee----EEEECC-CCCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEec--CCCeEE
Confidence 5 45677 899996 66777777777775 4554 5676555544431 22345888998777665554 344677
Q ss_pred EEeCCC
Q psy5768 158 RAGLDG 163 (652)
Q Consensus 158 ~~~ldg 163 (652)
.++|+-
T Consensus 244 iFKL~~ 249 (391)
T KOG2110|consen 244 IFKLEK 249 (391)
T ss_pred EEEecc
Confidence 777663
No 207
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=46.99 E-value=3.9e+02 Score=28.61 Aligned_cols=76 Identities=7% Similarity=-0.022 Sum_probs=40.1
Q ss_pred EECCCCEEEE-EeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEE
Q psy5768 45 VWPVKGKMFW-SNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRY 123 (652)
Q Consensus 45 ~d~~~~~lyw-~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~ 123 (652)
+.....+|.+ +| ..+...++.++++....+++.... -. ..+|..+-..++.||+... ...+.+.+++....+
T Consensus 43 ft~dG~kllF~s~-~dg~~nly~lDL~t~~i~QLTdg~-g~----~~~g~~~s~~~~~~~Yv~~-~~~l~~vdL~T~e~~ 115 (386)
T PF14583_consen 43 FTDDGRKLLFASD-FDGNRNLYLLDLATGEITQLTDGP-GD----NTFGGFLSPDDRALYYVKN-GRSLRRVDLDTLEER 115 (386)
T ss_dssp B-TTS-EEEEEE--TTSS-EEEEEETTT-EEEE---SS--B-----TTT-EE-TTSSEEEEEET-TTEEEEEETTT--EE
T ss_pred cCCCCCEEEEEec-cCCCcceEEEEcccCEEEECccCC-CC----CccceEEecCCCeEEEEEC-CCeEEEEECCcCcEE
Confidence 3344545555 55 335668999999988777764321 11 1225666667888877653 357888999887665
Q ss_pred EEEe
Q psy5768 124 VLIS 127 (652)
Q Consensus 124 ~l~~ 127 (652)
++..
T Consensus 116 ~vy~ 119 (386)
T PF14583_consen 116 VVYE 119 (386)
T ss_dssp EEEE
T ss_pred EEEE
Confidence 5543
No 208
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=45.17 E-value=1.6e+02 Score=26.35 Aligned_cols=57 Identities=16% Similarity=0.303 Sum_probs=28.1
Q ss_pred EEEECCCCE-EEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEe
Q psy5768 43 IAVWPVKGK-MFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSD 106 (652)
Q Consensus 43 v~~d~~~~~-lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d 106 (652)
++||..++. +|+-|+..+.+.|....+.+.....++-.| .-... -+|+.+..+||+-
T Consensus 76 laYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGG-ncsi~------Gfd~~G~e~fWtV 133 (136)
T PF14781_consen 76 LAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGG-NCSIQ------GFDYEGNEIFWTV 133 (136)
T ss_pred EEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECc-eEEEE------EeCCCCcEEEEEe
Confidence 445544433 555553334445544444443333333344 22222 3577788888874
No 209
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=43.60 E-value=94 Score=33.68 Aligned_cols=61 Identities=10% Similarity=0.043 Sum_probs=34.3
Q ss_pred CceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeec-------------------ccCceeEEEeccCCEEEEE
Q psy5768 132 QPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEI-------------------IMPIKDITLDLKFFSAFYR 192 (652)
Q Consensus 132 ~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~-------------------~~~p~gl~lD~~~~~ly~~ 192 (652)
-+.+|.|-....+||++.|. .+.|...++.......++..- .+.|+=|.+...+.||||.
T Consensus 313 LitDI~iSlDDrfLYvs~W~-~GdvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlYvT 391 (461)
T PF05694_consen 313 LITDILISLDDRFLYVSNWL-HGDVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLYVT 391 (461)
T ss_dssp ----EEE-TTS-EEEEEETT-TTEEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEEEE
T ss_pred ceEeEEEccCCCEEEEEccc-CCcEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEEEE
Confidence 35788888889999999997 668888887654443333221 2357788899999999995
Q ss_pred e
Q psy5768 193 N 193 (652)
Q Consensus 193 d 193 (652)
+
T Consensus 392 n 392 (461)
T PF05694_consen 392 N 392 (461)
T ss_dssp -
T ss_pred e
Confidence 3
No 210
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=43.60 E-value=3.8e+02 Score=27.54 Aligned_cols=61 Identities=15% Similarity=0.313 Sum_probs=40.9
Q ss_pred EEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCC
Q psy5768 407 IDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLD 469 (652)
Q Consensus 407 Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~ 469 (652)
=||+-...+||+--|-. +|.+++-.-+|. .++.....-......-++...=+|.|.++..+
T Consensus 40 NAV~~vDd~IyFGGWVH-APa~y~gk~~g~-~~IdF~NKYSHVH~yd~e~~~VrLLWkesih~ 100 (339)
T PF09910_consen 40 NAVEWVDDFIYFGGWVH-APAVYEGKGDGR-ATIDFRNKYSHVHEYDTENDSVRLLWKESIHD 100 (339)
T ss_pred eeeeeecceEEEeeeec-CCceeeeccCCc-eEEEEeeccceEEEEEcCCCeEEEEEecccCC
Confidence 45666679999999974 577888877776 44444444444555555555668888876554
No 211
>KOG0319|consensus
Probab=43.57 E-value=5.7e+02 Score=29.52 Aligned_cols=147 Identities=10% Similarity=-0.032 Sum_probs=80.8
Q ss_pred eEEEccCCEEEEEeCCCCeEEEEEcCCCCCccE-EEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCce
Q psy5768 361 LAYEYVHNYLYWTCNNDATINKIDLDSPKAQRI-VVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTE 439 (652)
Q Consensus 361 lAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~-~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~ 439 (652)
++++..++.||-+.. +.|..+++.... .. .....+......+++|.|.+.+||.+-.+. -+....+.-....
T Consensus 25 ~~~s~nG~~L~t~~~--d~Vi~idv~t~~--~~l~s~~~ed~d~ita~~l~~d~~~L~~a~rs~---llrv~~L~tgk~i 97 (775)
T KOG0319|consen 25 VAWSSNGQHLYTACG--DRVIIIDVATGS--IALPSGSNEDEDEITALALTPDEEVLVTASRSQ---LLRVWSLPTGKLI 97 (775)
T ss_pred eeECCCCCEEEEecC--ceEEEEEccCCc--eecccCCccchhhhheeeecCCccEEEEeeccc---eEEEEEcccchHh
Confidence 889999999997763 467777765422 11 111123446778899999988888775442 2333334332111
Q ss_pred EEEEcCCCCCc-eEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEE-eCCEEE---EEcCCCCeEEE
Q psy5768 440 SIITTDITMPN-ALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAV-YGEFIF---WTDWVIHAVLR 514 (652)
Q Consensus 440 ~l~~~~l~~P~-glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav-~~~~lY---wtd~~~~~I~~ 514 (652)
-....--..|. .+++|+.+ .|.=.-.-.+++-..|+.+..-..-+.+. +.+.+... .++..| .+....+.|+.
T Consensus 98 rswKa~He~Pvi~ma~~~~g-~LlAtggaD~~v~VWdi~~~~~th~fkG~-gGvVssl~F~~~~~~~lL~sg~~D~~v~v 175 (775)
T KOG0319|consen 98 RSWKAIHEAPVITMAFDPTG-TLLATGGADGRVKVWDIKNGYCTHSFKGH-GGVVSSLLFHPHWNRWLLASGATDGTVRV 175 (775)
T ss_pred HhHhhccCCCeEEEEEcCCC-ceEEeccccceEEEEEeeCCEEEEEecCC-CceEEEEEeCCccchhheeecCCCceEEE
Confidence 11111112454 78888776 33322223356777888887766655543 44444444 455555 33333444443
Q ss_pred EE
Q psy5768 515 AN 516 (652)
Q Consensus 515 ~~ 516 (652)
-|
T Consensus 176 wn 177 (775)
T KOG0319|consen 176 WN 177 (775)
T ss_pred EE
Confidence 33
No 212
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=43.54 E-value=1.4e+02 Score=29.62 Aligned_cols=60 Identities=8% Similarity=0.135 Sum_probs=41.9
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-----e-ccCceeeeEEEccCCEEEEEe
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-----E-RQGSVEGLAYEYVHNYLYWTC 374 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-----~-~~~~~~glAvDw~~~~LYwtd 374 (652)
..+++|||-+.+++||=.. ..++||.++........+- . -.+...|+.+++..++|.+..
T Consensus 27 e~l~GID~Rpa~G~LYgl~-~~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs 92 (236)
T PF14339_consen 27 ESLVGIDFRPANGQLYGLG-STGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVS 92 (236)
T ss_pred CeEEEEEeecCCCCEEEEe-CCCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEc
Confidence 4678999999999999984 4589999997654433331 1 113467778887777666653
No 213
>PLN00181 protein SPA1-RELATED; Provisional
Probab=43.41 E-value=6.3e+02 Score=29.98 Aligned_cols=114 Identities=10% Similarity=0.054 Sum_probs=67.1
Q ss_pred eEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccE
Q psy5768 315 NIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRI 393 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~ 393 (652)
.+.++++++..+.+..+-...+.|...++........+ .....+.+|++.+..+.++++-+..+.|.+-++... .
T Consensus 534 ~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~----~ 609 (793)
T PLN00181 534 KLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQG----V 609 (793)
T ss_pred ceeeEEeccCCCCEEEEEeCCCeEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCC----c
Confidence 35567777655555555555677777776643322222 333467788888766777788888888888887542 1
Q ss_pred EEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeec
Q psy5768 394 VVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFS 434 (652)
Q Consensus 394 ~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ld 434 (652)
.+...........++..+..|.++.+-... +.|...++.
T Consensus 610 ~~~~~~~~~~v~~v~~~~~~g~~latgs~d--g~I~iwD~~ 648 (793)
T PLN00181 610 SIGTIKTKANICCVQFPSESGRSLAFGSAD--HKVYYYDLR 648 (793)
T ss_pred EEEEEecCCCeEEEEEeCCCCCEEEEEeCC--CeEEEEECC
Confidence 222222234556677766666666654432 255555553
No 214
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=43.36 E-value=1.4e+02 Score=29.50 Aligned_cols=37 Identities=11% Similarity=0.122 Sum_probs=29.5
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEE
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETV 77 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v 77 (652)
.+.++|||.|.+++||=.. ..++||.++........+
T Consensus 27 e~l~GID~Rpa~G~LYgl~---~~g~lYtIn~~tG~aT~v 63 (236)
T PF14339_consen 27 ESLVGIDFRPANGQLYGLG---STGRLYTINPATGAATPV 63 (236)
T ss_pred CeEEEEEeecCCCCEEEEe---CCCcEEEEECCCCeEEEe
Confidence 6899999999999999875 578999999875443333
No 215
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=41.96 E-value=5.3e+02 Score=30.86 Aligned_cols=61 Identities=3% Similarity=0.098 Sum_probs=37.3
Q ss_pred EEecCCCCeEEEEecCCCeeEEE-ecCCCCCCCCCCCeeEEEEECCCCEEEE-EeccC--CcceEEEEEcCCC
Q psy5768 4 AVSSPTQSKIVVCNLEGEYQTTI-LSNESNDTSTLSKISSIAVWPVKGKMFW-SNVTK--QVVTIEMAFMDGT 72 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~g~~~~~~-~~~~~~~~~~~~~~~~v~~d~~~~~lyw-~d~~~--~~~~I~~~~~dgs 72 (652)
||.+ ..++|.++|.+|...+++ .... ..+..=++.|..++|=+ +-... +...||+.+++.+
T Consensus 323 fv~~-~~~~L~~~D~dG~n~~~ve~~~~-------~~i~sP~~SPDG~~vAY~ts~e~~~g~s~vYv~~L~t~ 387 (912)
T TIGR02171 323 FRND-VTGNLAYIDYTKGASRAVEIEDT-------ISVYHPDISPDGKKVAFCTGIEGLPGKSSVYVRNLNAS 387 (912)
T ss_pred EEEc-CCCeEEEEecCCCCceEEEecCC-------CceecCcCCCCCCEEEEEEeecCCCCCceEEEEehhcc
Confidence 3444 445999999999888776 4321 23344457777777766 43112 3456777777644
No 216
>KOG0315|consensus
Probab=41.83 E-value=3.7e+02 Score=26.88 Aligned_cols=130 Identities=9% Similarity=0.112 Sum_probs=75.5
Q ss_pred EEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEeeccCcee
Q psy5768 280 FIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQGSVE 359 (652)
Q Consensus 280 ~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~ 359 (652)
+|..+....||-.++... +.. |+..+. .+.+|+.++.|....+.+|-. .+.+.+..-++..-..+.++.....+.
T Consensus 54 ~LAaa~~qhvRlyD~~S~-np~-Pv~t~e--~h~kNVtaVgF~~dgrWMyTg-seDgt~kIWdlR~~~~qR~~~~~spVn 128 (311)
T KOG0315|consen 54 DLAAAGNQHVRLYDLNSN-NPN-PVATFE--GHTKNVTAVGFQCDGRWMYTG-SEDGTVKIWDLRSLSCQRNYQHNSPVN 128 (311)
T ss_pred hhhhccCCeeEEEEccCC-CCC-ceeEEe--ccCCceEEEEEeecCeEEEec-CCCceEEEEeccCcccchhccCCCCcc
Confidence 444455667787787322 212 444443 234788999998877766654 445666555554433344443334455
Q ss_pred eeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEE
Q psy5768 360 GLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIY 417 (652)
Q Consensus 360 glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Ly 417 (652)
.+.+.+--..|+..|. .+.|++=++.... -...++- +.....++++|+|...+|-
T Consensus 129 ~vvlhpnQteLis~dq-sg~irvWDl~~~~-c~~~liP-e~~~~i~sl~v~~dgsml~ 183 (311)
T KOG0315|consen 129 TVVLHPNQTELISGDQ-SGNIRVWDLGENS-CTHELIP-EDDTSIQSLTVMPDGSMLA 183 (311)
T ss_pred eEEecCCcceEEeecC-CCcEEEEEccCCc-cccccCC-CCCcceeeEEEcCCCcEEE
Confidence 6666666667776654 5778887874321 1222332 3456778999999654443
No 217
>KOG0308|consensus
Probab=41.79 E-value=5.8e+02 Score=29.08 Aligned_cols=174 Identities=15% Similarity=0.066 Sum_probs=97.5
Q ss_pred EEecCCCCeEEEEecCCC-e--eEEE--ecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCC-CccEEE
Q psy5768 4 AVSSPTQSKIVVCNLEGE-Y--QTTI--LSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDG-TKRETV 77 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~g~-~--~~~~--~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dg-s~~~~v 77 (652)
+++--=..+|+++|.+.. . +..+ ++......+....+.++|... ++.|+++- ...+-|..++.-- ..+.-+
T Consensus 133 vaSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~-t~t~ivsG--gtek~lr~wDprt~~kimkL 209 (735)
T KOG0308|consen 133 VASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQ-TGTIIVSG--GTEKDLRLWDPRTCKKIMKL 209 (735)
T ss_pred EEecCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCC-cceEEEec--CcccceEEeccccccceeee
Confidence 333333578999988733 1 1111 111111112445677888874 45777765 3344444454432 222223
Q ss_pred EeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEE
Q psy5768 78 VSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIA 157 (652)
Q Consensus 78 ~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~ 157 (652)
... ...++ .|-++-.+.++ .+-+..+.|.+-++....+..-..---+...++..+|.-.++|..+. ...|.
T Consensus 210 -rGH-TdNVr----~ll~~dDGt~~-ls~sSDgtIrlWdLgqQrCl~T~~vH~e~VWaL~~~~sf~~vYsG~r--d~~i~ 280 (735)
T KOG0308|consen 210 -RGH-TDNVR----VLLVNDDGTRL-LSASSDGTIRLWDLGQQRCLATYIVHKEGVWALQSSPSFTHVYSGGR--DGNIY 280 (735)
T ss_pred -ecc-ccceE----EEEEcCCCCeE-eecCCCceEEeeeccccceeeeEEeccCceEEEeeCCCcceEEecCC--CCcEE
Confidence 344 56677 77777655555 44456788998899887764433322244889999988888887753 46799
Q ss_pred EEeCCCCCcEEEEeecccCceeEEEeccCCEE
Q psy5768 158 RAGLDGKKQTILAQEIIMPIKDITLDLKFFSA 189 (652)
Q Consensus 158 ~~~ldg~~~~~~~~~~~~~p~gl~lD~~~~~l 189 (652)
|.+|......+++=+.-....-+.++...+-+
T Consensus 281 ~Tdl~n~~~~tlick~daPv~~l~~~~~~~~~ 312 (735)
T KOG0308|consen 281 RTDLRNPAKSTLICKEDAPVLKLHLHEHDNSV 312 (735)
T ss_pred ecccCCchhheEeecCCCchhhhhhccccCCc
Confidence 99887643333332222333445555444444
No 218
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=41.42 E-value=1.1e+02 Score=21.31 Aligned_cols=30 Identities=17% Similarity=0.180 Sum_probs=21.6
Q ss_pred eeEEEEeCCEEEEEcCCCCeEEEEEccCCce
Q psy5768 492 PFDMAVYGEFIFWTDWVIHAVLRANKYTGEE 522 (652)
Q Consensus 492 p~glav~~~~lYwtd~~~~~I~~~~k~~g~~ 522 (652)
..++++.++++|.+++..+ +..+|..+.+.
T Consensus 4 a~~v~v~g~yaYva~~~~G-l~IvDISnPs~ 33 (42)
T PF08309_consen 4 ARDVAVSGNYAYVADGNNG-LVIVDISNPSN 33 (42)
T ss_pred EEEEEEECCEEEEEeCCCC-EEEEECCCCCC
Confidence 4678999999999987755 55566654433
No 219
>KOG0286|consensus
Probab=38.34 E-value=4.6e+02 Score=26.89 Aligned_cols=171 Identities=13% Similarity=0.080 Sum_probs=0.0
Q ss_pred EEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECC-----------------------------------
Q psy5768 4 AVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPV----------------------------------- 48 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~----------------------------------- 48 (652)
+|+.+..+++.++|.--.+..-.++. +-.=+...||.|.
T Consensus 70 ivSaSqDGklIvWDs~TtnK~haipl------~s~WVMtCA~sPSg~~VAcGGLdN~Csiy~ls~~d~~g~~~v~r~l~g 143 (343)
T KOG0286|consen 70 IVSASQDGKLIVWDSFTTNKVHAIPL------PSSWVMTCAYSPSGNFVACGGLDNKCSIYPLSTRDAEGNVRVSRELAG 143 (343)
T ss_pred EEeeccCCeEEEEEcccccceeEEec------CceeEEEEEECCCCCeEEecCcCceeEEEecccccccccceeeeeecC
Q ss_pred ------------CCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEE
Q psy5768 49 ------------KGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVAR 116 (652)
Q Consensus 49 ------------~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~ 116 (652)
++.|. |- .+..+.-.++..-..+.+.+... .+.+- +|++-+.+.|.|++-+-.+....-|
T Consensus 144 HtgylScC~f~dD~~il-T~--SGD~TCalWDie~g~~~~~f~GH-~gDV~----slsl~p~~~ntFvSg~cD~~aklWD 215 (343)
T KOG0286|consen 144 HTGYLSCCRFLDDNHIL-TG--SGDMTCALWDIETGQQTQVFHGH-TGDVM----SLSLSPSDGNTFVSGGCDKSAKLWD 215 (343)
T ss_pred ccceeEEEEEcCCCceE-ec--CCCceEEEEEcccceEEEEecCC-cccEE----EEecCCCCCCeEEecccccceeeee
Q ss_pred cCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeec--ccCceeEEEeccCCEEE
Q psy5768 117 LTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEI--IMPIKDITLDLKFFSAF 190 (652)
Q Consensus 117 ~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~--~~~p~gl~lD~~~~~ly 190 (652)
.....+...+.+-...-.++...| +|+-|.|-.. .......+|.......+++.. +...+++++...++.||
T Consensus 216 ~R~~~c~qtF~ghesDINsv~ffP-~G~afatGSD-D~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLf 289 (343)
T KOG0286|consen 216 VRSGQCVQTFEGHESDINSVRFFP-SGDAFATGSD-DATCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLF 289 (343)
T ss_pred ccCcceeEeecccccccceEEEcc-CCCeeeecCC-CceeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEE
No 220
>KOG0263|consensus
Probab=38.25 E-value=6.9e+02 Score=28.92 Aligned_cols=166 Identities=16% Similarity=0.153 Sum_probs=92.4
Q ss_pred eEEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCC
Q psy5768 2 FIAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQK 81 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~ 81 (652)
..+++++...-|+.++++-.....+- +| .+..++.|.|.|. |+-|.|- ....+-..+..|-....-|+-..
T Consensus 464 rfLlScSED~svRLWsl~t~s~~V~y--~G----H~~PVwdV~F~P~-GyYFata--s~D~tArLWs~d~~~PlRifagh 534 (707)
T KOG0263|consen 464 RFLLSCSEDSSVRLWSLDTWSCLVIY--KG----HLAPVWDVQFAPR-GYYFATA--SHDQTARLWSTDHNKPLRIFAGH 534 (707)
T ss_pred cceeeccCCcceeeeecccceeEEEe--cC----CCcceeeEEecCC-ceEEEec--CCCceeeeeecccCCchhhhccc
Confidence 35666667778888888765444333 33 5667889999976 6655554 33333344555544333343344
Q ss_pred CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEc-CCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEe
Q psy5768 82 KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARL-TGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAG 160 (652)
Q Consensus 82 ~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~-dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ 160 (652)
+..+. .+++-+ +.+.-.+-+....+.+-+- .|..+++ +.+-.....++++-|...|| +..+..+.|..-+
T Consensus 535 -lsDV~----cv~FHP-Ns~Y~aTGSsD~tVRlWDv~~G~~VRi-F~GH~~~V~al~~Sp~Gr~L--aSg~ed~~I~iWD 605 (707)
T KOG0263|consen 535 -LSDVD----CVSFHP-NSNYVATGSSDRTVRLWDVSTGNSVRI-FTGHKGPVTALAFSPCGRYL--ASGDEDGLIKIWD 605 (707)
T ss_pred -ccccc----eEEECC-cccccccCCCCceEEEEEcCCCcEEEE-ecCCCCceEEEEEcCCCceE--eecccCCcEEEEE
Confidence 66677 788876 4443344444444544443 3554444 45444567789999855444 4444456676666
Q ss_pred CCCCCcE-EEEeecccCceeEEEeccC
Q psy5768 161 LDGKKQT-ILAQEIIMPIKDITLDLKF 186 (652)
Q Consensus 161 ldg~~~~-~~~~~~~~~p~gl~lD~~~ 186 (652)
+.+..+. .+... -.....|++...+
T Consensus 606 l~~~~~v~~l~~H-t~ti~SlsFS~dg 631 (707)
T KOG0263|consen 606 LANGSLVKQLKGH-TGTIYSLSFSRDG 631 (707)
T ss_pred cCCCcchhhhhcc-cCceeEEEEecCC
Confidence 6654433 22222 2334555554433
No 221
>KOG0269|consensus
Probab=37.40 E-value=6.1e+02 Score=29.52 Aligned_cols=124 Identities=10% Similarity=0.133 Sum_probs=81.6
Q ss_pred cccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcc-eEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCC
Q psy5768 311 TMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNH-RVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPK 389 (652)
Q Consensus 311 ~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~-~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~ 389 (652)
++.+.+.-++|+..+-.|.++-...+.|.-.++..... .+....-+++.++++-+-.++.|.+-...|.+..-++...+
T Consensus 131 EH~Rs~~~ldfh~tep~iliSGSQDg~vK~~DlR~~~S~~t~~~nSESiRDV~fsp~~~~~F~s~~dsG~lqlWDlRqp~ 210 (839)
T KOG0269|consen 131 EHERSANKLDFHSTEPNILISGSQDGTVKCWDLRSKKSKSTFRSNSESIRDVKFSPGYGNKFASIHDSGYLQLWDLRQPD 210 (839)
T ss_pred hhccceeeeeeccCCccEEEecCCCceEEEEeeecccccccccccchhhhceeeccCCCceEEEecCCceEEEeeccCch
Confidence 45577889999999999999988888888877654332 33335666889999999999999998888888877875543
Q ss_pred CccEEEEEeCCCCCc-eEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceE
Q psy5768 390 AQRIVVVRLGQHDKP-RGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTES 440 (652)
Q Consensus 390 ~~~~~~~~~~~~~~P-~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~ 440 (652)
. -...+..-..| .-+-.+|.+ -|.+..|+.. .|....|++.....
T Consensus 211 r---~~~k~~AH~GpV~c~nwhPnr--~~lATGGRDK-~vkiWd~t~~~~~~ 256 (839)
T KOG0269|consen 211 R---CEKKLTAHNGPVLCLNWHPNR--EWLATGGRDK-MVKIWDMTDSRAKP 256 (839)
T ss_pred h---HHHHhhcccCceEEEeecCCC--ceeeecCCCc-cEEEEeccCCCccc
Confidence 1 11111111122 235566733 4455555543 67667777654433
No 222
>KOG0318|consensus
Probab=37.39 E-value=6.2e+02 Score=28.12 Aligned_cols=186 Identities=17% Similarity=0.130 Sum_probs=92.1
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEe-ccCCcceEEe---eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCC
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVF-FNGSNHRVLL---ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPK 389 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~-~~g~~~~~i~---~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~ 389 (652)
+.+..+-|.+. +..|.+-...++|+-.+ ..|...-++- ..-+++.+|+--+.+..+- |-+...++..=++....
T Consensus 191 kFV~~VRysPD-G~~Fat~gsDgki~iyDGktge~vg~l~~~~aHkGsIfalsWsPDs~~~~-T~SaDkt~KIWdVs~~s 268 (603)
T KOG0318|consen 191 KFVNCVRYSPD-GSRFATAGSDGKIYIYDGKTGEKVGELEDSDAHKGSIFALSWSPDSTQFL-TVSADKTIKIWDVSTNS 268 (603)
T ss_pred cceeeEEECCC-CCeEEEecCCccEEEEcCCCccEEEEecCCCCccccEEEEEECCCCceEE-EecCCceEEEEEeeccc
Confidence 44567778776 66777777777777665 2222222222 1223566666665566553 44433433333332211
Q ss_pred CccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCC---------CceEEEEcCCCCCceEEEecCCCE
Q psy5768 390 AQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGF---------GTESIITTDITMPNALALDHQAEK 460 (652)
Q Consensus 390 ~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~---------~~~~l~~~~l~~P~glaiD~~~~~ 460 (652)
- .++....+. +-|..-| ..|+. ..|....++|. ....++.-..+..++|++..++..
T Consensus 269 l-v~t~~~~~~-------v~dqqvG-~lWqk-----d~lItVSl~G~in~ln~~d~~~~~~i~GHnK~ITaLtv~~d~~~ 334 (603)
T KOG0318|consen 269 L-VSTWPMGST-------VEDQQVG-CLWQK-----DHLITVSLSGTINYLNPSDPSVLKVISGHNKSITALTVSPDGKT 334 (603)
T ss_pred e-EEEeecCCc-------hhceEEE-EEEeC-----CeEEEEEcCcEEEEecccCCChhheecccccceeEEEEcCCCCE
Confidence 0 111111111 1111111 12321 13333333332 122222223456679999988888
Q ss_pred EEEEeCCCCeEEEEecC-CCceEEEecCCCCceeEEEEeC-CEEEEEcCCCCeEEEEEc
Q psy5768 461 LFWGDARLDKIERCDYD-GTNRIVLSKISPLHPFDMAVYG-EFIFWTDWVIHAVLRANK 517 (652)
Q Consensus 461 LYw~D~~~~~I~~~~ld-G~~~~~l~~~~~~~p~glav~~-~~lYwtd~~~~~I~~~~k 517 (652)
||-++. .+.|..-+.. |..-+...........+|+..+ +.||-.-|.. .+.+++.
T Consensus 335 i~Sgsy-DG~I~~W~~~~g~~~~~~g~~h~nqI~~~~~~~~~~~~t~g~Dd-~l~~~~~ 391 (603)
T KOG0318|consen 335 IYSGSY-DGHINSWDSGSGTSDRLAGKGHTNQIKGMAASESGELFTIGWDD-TLRVISL 391 (603)
T ss_pred EEeecc-CceEEEEecCCccccccccccccceEEEEeecCCCcEEEEecCC-eEEEEec
Confidence 887764 4556655553 3333332233346678899887 7887777664 4555543
No 223
>PRK13616 lipoprotein LpqB; Provisional
Probab=37.35 E-value=6.8e+02 Score=28.63 Aligned_cols=144 Identities=9% Similarity=0.022 Sum_probs=72.1
Q ss_pred CCCeeEEEEECCCCEEEEEec---------cCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeC
Q psy5768 37 LSKISSIAVWPVKGKMFWSNV---------TKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDP 107 (652)
Q Consensus 37 ~~~~~~v~~d~~~~~lyw~d~---------~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~ 107 (652)
+..|. |++..+.|++..- ......|+...+++..... . - -..+. .+++-..+.+|.++-.
T Consensus 399 ~t~Ps---WspDG~~lw~v~dg~~~~~v~~~~~~gql~~~~vd~ge~~~--~-~-~g~Is----sl~wSpDG~RiA~i~~ 467 (591)
T PRK13616 399 LTRPS---WSLDADAVWVVVDGNTVVRVIRDPATGQLARTPVDASAVAS--R-V-PGPIS----ELQLSRDGVRAAMIIG 467 (591)
T ss_pred CCCce---ECCCCCceEEEecCcceEEEeccCCCceEEEEeccCchhhh--c-c-CCCcC----eEEECCCCCEEEEEEC
Confidence 45555 8887666666530 0122344444444432221 0 0 12355 7777666777777653
Q ss_pred CCCEEEE---EE-cCCCcEEE----EEeCCCCC-ceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeecccCce
Q psy5768 108 KENVIEV---AR-LTGQYRYV----LISGGVDQ-PSALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEIIMPIK 178 (652)
Q Consensus 108 ~~~~I~v---~~-~dg~~~~~----l~~~~~~~-P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~~~~p~ 178 (652)
++|.+ .. .+|. .+. -+...+.. +..++=- ..+.|+....+....+++..+||.....+...++..|
T Consensus 468 --g~v~Va~Vvr~~~G~-~~l~~~~~l~~~l~~~~~~l~W~-~~~~L~V~~~~~~~~v~~v~vDG~~~~~~~~~n~~~~- 542 (591)
T PRK13616 468 --GKVYLAVVEQTEDGQ-YALTNPREVGPGLGDTAVSLDWR-TGDSLVVGRSDPEHPVWYVNLDGSNSDALPSRNLSAP- 542 (591)
T ss_pred --CEEEEEEEEeCCCCc-eeecccEEeecccCCccccceEe-cCCEEEEEecCCCCceEEEecCCccccccCCCCccCc-
Confidence 46665 33 3443 222 01112222 3444321 1344555433334568889999988765333333332
Q ss_pred eEEEeccCCEEEEEeCCC
Q psy5768 179 DITLDLKFFSAFYRNLSK 196 (652)
Q Consensus 179 gl~lD~~~~~ly~~d~~g 196 (652)
..+|....+.||..|-+|
T Consensus 543 v~~vaa~~~~iyv~~~~g 560 (591)
T PRK13616 543 VVAVAASPSTVYVTDARA 560 (591)
T ss_pred eEEEecCCceEEEEcCCc
Confidence 344444456899887764
No 224
>KOG0273|consensus
Probab=35.81 E-value=6.2e+02 Score=27.67 Aligned_cols=113 Identities=14% Similarity=0.164 Sum_probs=65.5
Q ss_pred EEEEEEEcCCCeEEEeecccccEEEEe-ccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEE
Q psy5768 316 IIELSYDYKRKTLFYSDIQKGTINSVF-FNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIV 394 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~~~I~~~~-~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~ 394 (652)
+.++-.......|.=.+.+ +++...+ ..|+..+.. .+....+|.|||++..=|.+....+.|.|+.++.... ..+
T Consensus 279 I~slKWnk~G~yilS~~vD-~ttilwd~~~g~~~q~f--~~~s~~~lDVdW~~~~~F~ts~td~~i~V~kv~~~~P-~~t 354 (524)
T KOG0273|consen 279 IFSLKWNKKGTYILSGGVD-GTTILWDAHTGTVKQQF--EFHSAPALDVDWQSNDEFATSSTDGCIHVCKVGEDRP-VKT 354 (524)
T ss_pred eEEEEEcCCCCEEEeccCC-ccEEEEeccCceEEEee--eeccCCccceEEecCceEeecCCCceEEEEEecCCCc-cee
Confidence 4455554444444444443 3333333 333322222 2334458999999999999999999999999877532 233
Q ss_pred EEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecC
Q psy5768 395 VVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSG 435 (652)
Q Consensus 395 ~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG 435 (652)
++ .......+|-.+| +|.|.-|-......+|+...-++
T Consensus 355 ~~--GH~g~V~alk~n~-tg~LLaS~SdD~TlkiWs~~~~~ 392 (524)
T KOG0273|consen 355 FI--GHHGEVNALKWNP-TGSLLASCSDDGTLKIWSMGQSN 392 (524)
T ss_pred ee--cccCceEEEEECC-CCceEEEecCCCeeEeeecCCCc
Confidence 22 3456678888999 45555554433334666544333
No 225
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=35.43 E-value=2e+02 Score=24.49 Aligned_cols=12 Identities=33% Similarity=0.642 Sum_probs=10.1
Q ss_pred ceeeeccCceee
Q psy5768 569 QVVCSCFTGKVL 580 (652)
Q Consensus 569 ~~~C~Cp~g~~l 580 (652)
...|.|++||+.
T Consensus 97 ~~~C~Cl~GF~P 108 (110)
T PF00954_consen 97 SPKCSCLPGFEP 108 (110)
T ss_pred CCceECCCCcCC
Confidence 567999999975
No 226
>KOG0646|consensus
Probab=35.36 E-value=6.2e+02 Score=27.55 Aligned_cols=27 Identities=22% Similarity=0.193 Sum_probs=19.9
Q ss_pred cCceeEEEeccCCEEEEEeCCCCcEEEE
Q psy5768 175 MPIKDITLDLKFFSAFYRNLSKGNIHII 202 (652)
Q Consensus 175 ~~p~gl~lD~~~~~ly~~d~~g~~~~~i 202 (652)
..++.+++|+...++|.=..+ |++-++
T Consensus 218 ~si~av~lDpae~~~yiGt~~-G~I~~~ 244 (476)
T KOG0646|consen 218 SSIKAVALDPAERVVYIGTEE-GKIFQN 244 (476)
T ss_pred CcceeEEEcccccEEEecCCc-ceEEee
Confidence 357999999999999886665 444443
No 227
>KOG1036|consensus
Probab=35.21 E-value=5.2e+02 Score=26.61 Aligned_cols=157 Identities=16% Similarity=0.116 Sum_probs=81.1
Q ss_pred CCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccC
Q psy5768 9 TQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTA 88 (652)
Q Consensus 9 ~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~ 88 (652)
-.+.|+++|+.+.....+..+. ..+..|-+.+..+.|. +- .-.++|..++.-+. ..+.
T Consensus 73 ~dg~vr~~Dln~~~~~~igth~-------~~i~ci~~~~~~~~vI-sg--sWD~~ik~wD~R~~--~~~~---------- 130 (323)
T KOG1036|consen 73 LDGQVRRYDLNTGNEDQIGTHD-------EGIRCIEYSYEVGCVI-SG--SWDKTIKFWDPRNK--VVVG---------- 130 (323)
T ss_pred cCceEEEEEecCCcceeeccCC-------CceEEEEeeccCCeEE-Ec--ccCccEEEEecccc--cccc----------
Confidence 4578888888888777777643 4667777777767665 33 34567766665441 1111
Q ss_pred CCCcEEEEccCCcEEEEeCCCCEEEE---------EEcCCCcE-EEEEeCCCC-CceeEEEcCCCCeEEEEecCCCCeEE
Q psy5768 89 CNLHIAVDWIAQNIYWSDPKENVIEV---------ARLTGQYR-YVLISGGVD-QPSALAVDPESGYLFWSESGKIPLIA 157 (652)
Q Consensus 89 ~~~~lavDw~~~~lY~~d~~~~~I~v---------~~~dg~~~-~~l~~~~~~-~P~~iavd~~~g~lywtd~~~~~~I~ 157 (652)
++| ..+++|..|...+++.| .|+.--.. ...-...++ +-|.+++-| ++-=|+... ..+++.
T Consensus 131 -----~~d-~~kkVy~~~v~g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~p-n~eGy~~sS-ieGRVa 202 (323)
T KOG1036|consen 131 -----TFD-QGKKVYCMDVSGNRLVVGTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVP-NGEGYVVSS-IEGRVA 202 (323)
T ss_pred -----ccc-cCceEEEEeccCCEEEEeecCceEEEEEcccccchhhhccccceeEEEEEEEec-CCCceEEEe-ecceEE
Confidence 112 13345555544444443 33321000 000011222 457888888 666666544 256776
Q ss_pred EEeCCCC----CcE-EE-----Eee--c-ccCceeEEEeccCCEEEEEeCC
Q psy5768 158 RAGLDGK----KQT-IL-----AQE--I-IMPIKDITLDLKFFSAFYRNLS 195 (652)
Q Consensus 158 ~~~ldg~----~~~-~~-----~~~--~-~~~p~gl~lD~~~~~ly~~d~~ 195 (652)
.-.+|-+ .+. .+ ... . +...|.|++.+..+.++=-+.|
T Consensus 203 vE~~d~s~~~~skkyaFkCHr~~~~~~~~~yPVNai~Fhp~~~tfaTgGsD 253 (323)
T KOG1036|consen 203 VEYFDDSEEAQSKKYAFKCHRLSEKDTEIIYPVNAIAFHPIHGTFATGGSD 253 (323)
T ss_pred EEccCCchHHhhhceeEEeeecccCCceEEEEeceeEeccccceEEecCCC
Confidence 6666655 111 11 011 1 3344777877766666543333
No 228
>KOG2096|consensus
Probab=34.78 E-value=5.4e+02 Score=26.70 Aligned_cols=164 Identities=21% Similarity=0.150 Sum_probs=82.3
Q ss_pred CCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCC-cceEE--EEEcCCCccEEE----EeC
Q psy5768 8 PTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQ-VVTIE--MAFMDGTKRETV----VSQ 80 (652)
Q Consensus 8 ~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~-~~~I~--~~~~dgs~~~~v----~~~ 80 (652)
+.+..|..+++.|..+..+.++ ..+-..-++.|.+.+|-.+- .. .-.++ .+..||+..++. ++.
T Consensus 206 s~dt~i~lw~lkGq~L~~idtn-------q~~n~~aavSP~GRFia~~g--FTpDVkVwE~~f~kdG~fqev~rvf~LkG 276 (420)
T KOG2096|consen 206 SLDTKICLWDLKGQLLQSIDTN-------QSSNYDAAVSPDGRFIAVSG--FTPDVKVWEPIFTKDGTFQEVKRVFSLKG 276 (420)
T ss_pred cCCCcEEEEecCCceeeeeccc-------cccccceeeCCCCcEEEEec--CCCCceEEEEEeccCcchhhhhhhheecc
Confidence 3567788999999988888764 23345567888877766554 22 22332 255778764442 122
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCc-----EEEEEeC------CCCCceeEEEcCCCCeEEEEe
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQY-----RYVLISG------GVDQPSALAVDPESGYLFWSE 149 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~-----~~~l~~~------~~~~P~~iavd~~~g~lywtd 149 (652)
. ...+. .+|+...+.++. +-+..+.+.+.+.|=.+ .++|-.+ .-..|-.|++.| +|.++-..
T Consensus 277 H-~saV~----~~aFsn~S~r~v-tvSkDG~wriwdtdVrY~~~qDpk~Lk~g~~pl~aag~~p~RL~lsP-~g~~lA~s 349 (420)
T KOG2096|consen 277 H-QSAVL----AAAFSNSSTRAV-TVSKDGKWRIWDTDVRYEAGQDPKILKEGSAPLHAAGSEPVRLELSP-SGDSLAVS 349 (420)
T ss_pred c-hhhee----eeeeCCCcceeE-EEecCCcEEEeeccceEecCCCchHhhcCCcchhhcCCCceEEEeCC-CCcEEEee
Confidence 2 22234 566654444442 22233344433333111 1111111 235688899998 67777666
Q ss_pred cCCCCeEEEEe-CCCCCcEEEEeecccCceeEEEeccCCEE
Q psy5768 150 SGKIPLIARAG-LDGKKQTILAQEIIMPIKDITLDLKFFSA 189 (652)
Q Consensus 150 ~~~~~~I~~~~-ldg~~~~~~~~~~~~~p~gl~lD~~~~~l 189 (652)
.|+. |.... -+|.....+-.---.-+..|+.++.+..|
T Consensus 350 ~gs~--l~~~~se~g~~~~~~e~~h~~~Is~is~~~~g~~~ 388 (420)
T KOG2096|consen 350 FGSD--LKVFASEDGKDYPELEDIHSTTISSISYSSDGKYI 388 (420)
T ss_pred cCCc--eEEEEcccCccchhHHHhhcCceeeEEecCCCcEE
Confidence 6543 33322 23443322110001234556666555444
No 229
>KOG0276|consensus
Probab=33.73 E-value=7.6e+02 Score=28.11 Aligned_cols=99 Identities=14% Similarity=0.122 Sum_probs=59.4
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEc
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARL 117 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~ 117 (652)
.++.+|++||.+-.+.-+= -++.+..++-+-. +.+..- + +.+.|+--|.=-.-.++..+-+...+|.++++
T Consensus 14 dRVKsVd~HPtePw~la~L---ynG~V~IWnyetq---tmVksf--e-V~~~PvRa~kfiaRknWiv~GsDD~~IrVfny 84 (794)
T KOG0276|consen 14 DRVKSVDFHPTEPWILAAL---YNGDVQIWNYETQ---TMVKSF--E-VSEVPVRAAKFIARKNWIVTGSDDMQIRVFNY 84 (794)
T ss_pred CceeeeecCCCCceEEEee---ecCeeEEEecccc---eeeeee--e-ecccchhhheeeeccceEEEecCCceEEEEec
Confidence 5778899999888777554 3456665655421 221110 0 11112122222224577788888899999999
Q ss_pred CCCcEEEEEeCCCCCceeEEEcCCCCeE
Q psy5768 118 TGQYRYVLISGGVDQPSALAVDPESGYL 145 (652)
Q Consensus 118 dg~~~~~l~~~~~~~P~~iavd~~~g~l 145 (652)
+.-.+...+..--..-|.|||+|+.=|+
T Consensus 85 nt~ekV~~FeAH~DyIR~iavHPt~P~v 112 (794)
T KOG0276|consen 85 NTGEKVKTFEAHSDYIRSIAVHPTLPYV 112 (794)
T ss_pred ccceeeEEeeccccceeeeeecCCCCeE
Confidence 8654444444334677899999976555
No 230
>KOG0649|consensus
Probab=33.53 E-value=5e+02 Score=25.92 Aligned_cols=125 Identities=13% Similarity=0.121 Sum_probs=70.9
Q ss_pred CCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEE
Q psy5768 34 TSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIE 113 (652)
Q Consensus 34 ~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~ 113 (652)
...+..+-++-+||.++.|+++- +...||.+++.....+-.+... ..+.- .++.-..+..| .+-+..+.+.
T Consensus 111 ~~evPeINam~ldP~enSi~~Ag---GD~~~y~~dlE~G~i~r~~rGH-tDYvH----~vv~R~~~~qi-lsG~EDGtvR 181 (325)
T KOG0649|consen 111 AVEVPEINAMWLDPSENSILFAG---GDGVIYQVDLEDGRIQREYRGH-TDYVH----SVVGRNANGQI-LSGAEDGTVR 181 (325)
T ss_pred cccCCccceeEeccCCCcEEEec---CCeEEEEEEecCCEEEEEEcCC-cceee----eeeecccCcce-eecCCCccEE
Confidence 44667889999999999999886 7889999998755555555555 55555 55553334444 3334445555
Q ss_pred EEEcCCCcEEEEEeC----CCCCce----eEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEE
Q psy5768 114 VARLTGQYRYVLISG----GVDQPS----ALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILA 170 (652)
Q Consensus 114 v~~~dg~~~~~l~~~----~~~~P~----~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~ 170 (652)
+-+.......-++.. .+.+|. -.|++.....|.- |..|.+....|..+..+.++
T Consensus 182 vWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvC---GgGp~lslwhLrsse~t~vf 243 (325)
T KOG0649|consen 182 VWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVC---GGGPKLSLWHLRSSESTCVF 243 (325)
T ss_pred EEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEe---cCCCceeEEeccCCCceEEE
Confidence 555543333233322 122222 1233333334432 34556666666666665544
No 231
>KOG0268|consensus
Probab=33.52 E-value=6e+02 Score=26.85 Aligned_cols=154 Identities=12% Similarity=0.137 Sum_probs=88.1
Q ss_pred ccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEee
Q psy5768 354 RQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFF 433 (652)
Q Consensus 354 ~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~l 433 (652)
+...+..+.+.++--.|.-+-...+.|...++.....-.+.++ ..++.+|+-.| .++.|.+..... .++-.+|
T Consensus 186 G~Dti~svkfNpvETsILas~~sDrsIvLyD~R~~~Pl~KVi~----~mRTN~IswnP-eafnF~~a~ED~--nlY~~Dm 258 (433)
T KOG0268|consen 186 GADSISSVKFNPVETSILASCASDRSIVLYDLRQASPLKKVIL----TMRTNTICWNP-EAFNFVAANEDH--NLYTYDM 258 (433)
T ss_pred CCCceeEEecCCCcchheeeeccCCceEEEecccCCccceeee----eccccceecCc-cccceeeccccc--cceehhh
Confidence 4445666777777777776666777888777754322123333 36789999999 999998865443 5555554
Q ss_pred cCCCceEEEEcCCCCCc-eEEEecC-CCEEEEEeCCCCeEEEEecCCCc-eEEEecCCCCceeEEEEeCC--EEEE-EcC
Q psy5768 434 SGFGTESIITTDITMPN-ALALDHQ-AEKLFWGDARLDKIERCDYDGTN-RIVLSKISPLHPFDMAVYGE--FIFW-TDW 507 (652)
Q Consensus 434 dG~~~~~l~~~~l~~P~-glaiD~~-~~~LYw~D~~~~~I~~~~ldG~~-~~~l~~~~~~~p~glav~~~--~lYw-td~ 507 (652)
---.+-.-+-. ...+ =|.+|+. +++=|++-+...+|.-+..+... |.+.....++|.|++..--| ||+- +|-
T Consensus 259 R~l~~p~~v~~--dhvsAV~dVdfsptG~EfvsgsyDksIRIf~~~~~~SRdiYhtkRMq~V~~Vk~S~Dskyi~SGSdd 336 (433)
T KOG0268|consen 259 RNLSRPLNVHK--DHVSAVMDVDFSPTGQEFVSGSYDKSIRIFPVNHGHSRDIYHTKRMQHVFCVKYSMDSKYIISGSDD 336 (433)
T ss_pred hhhcccchhhc--ccceeEEEeccCCCcchhccccccceEEEeecCCCcchhhhhHhhhheeeEEEEeccccEEEecCCC
Confidence 33222111111 1111 2445533 45556666666677777777544 44444434789999987644 3322 233
Q ss_pred CCCeEEEEE
Q psy5768 508 VIHAVLRAN 516 (652)
Q Consensus 508 ~~~~I~~~~ 516 (652)
.+=++|+++
T Consensus 337 ~nvRlWka~ 345 (433)
T KOG0268|consen 337 GNVRLWKAK 345 (433)
T ss_pred cceeeeecc
Confidence 333455554
No 232
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=33.00 E-value=57 Score=38.58 Aligned_cols=75 Identities=20% Similarity=0.402 Sum_probs=48.1
Q ss_pred ceeeeccCceeeccC-CcccCcccccCCCceeec----cCeec----------CCccCCCCCCCCCCCCCCCCCCCCCCC
Q psy5768 569 QVVCSCFTGKVLMED-NRSCTINTVCSEHDFKCS----DGMCI----------PFNQTCDRVYNCHDKSDEGILYCAMRD 633 (652)
Q Consensus 569 ~~~C~Cp~g~~l~~d-~~C~~~~~~C~~~~f~C~----~g~Ci----------~~~~~Cd~~~dC~d~sde~~~~C~~~~ 633 (652)
..+|.|.+||....| .+|... ..|++..-.|+ .|+|+ +....| ...|++++...+..| .
T Consensus 681 ~~~C~C~~g~~p~~~~~~C~~~-~~C~~~~~gC~~C~~~g~C~~C~~~~~~vq~~~~~C--~~~C~~~~~~~~~vC---~ 754 (800)
T PTZ00214 681 VRRCWCERGFLPALDRSGCVLP-TECPPDMPSCAACDESGRCLLCVTSGHNVQVDQRTC--AEGCGARASSNQGVC---M 754 (800)
T ss_pred cceeEecCCcccccCCCccccc-cCCCcccccccccCCCCceeeccccCcccccCCCcc--ccCCCCCccccCCeE---E
Confidence 467999999998888 778763 34654322232 23443 233344 456888876666667 6
Q ss_pred CCCCeeecCCCCcccCC
Q psy5768 634 CRPGYFKCDNNKCILSS 650 (652)
Q Consensus 634 C~~~~f~C~~~~Ci~~~ 650 (652)
|..+.+. ..+.|++..
T Consensus 755 C~~g~~l-~~~~c~~~~ 770 (800)
T PTZ00214 755 CELDAVL-TKGVCVPAK 770 (800)
T ss_pred eCCccee-cCCeeEecc
Confidence 7888777 467888753
No 233
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=32.34 E-value=80 Score=28.29 Aligned_cols=55 Identities=25% Similarity=0.591 Sum_probs=31.0
Q ss_pred EEEEcCCC-eEEEeecccc--cEEEEeccCCcceEE-eeccCceeeeEEEccCCEEEEEeC
Q psy5768 319 LSYDYKRK-TLFYSDIQKG--TINSVFFNGSNHRVL-LERQGSVEGLAYEYVHNYLYWTCN 375 (652)
Q Consensus 319 v~~D~~~~-~lywsd~~~~--~I~~~~~~g~~~~~i-~~~~~~~~glAvDw~~~~LYwtd~ 375 (652)
++||..++ .+||-+...+ .|.--.+.+...+.+ +.+.-++.| +|+.+..+|||-.
T Consensus 76 laYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGGncsi~G--fd~~G~e~fWtVt 134 (136)
T PF14781_consen 76 LAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGGNCSIQG--FDYEGNEIFWTVT 134 (136)
T ss_pred EEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECceEEEEE--eCCCCcEEEEEec
Confidence 56666655 3777776543 232223433222333 344445555 7888999999854
No 234
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=32.22 E-value=9.2e+02 Score=28.60 Aligned_cols=99 Identities=17% Similarity=0.079 Sum_probs=54.5
Q ss_pred eEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCC-CCeEEEEecCCCceEE
Q psy5768 405 RGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDAR-LDKIERCDYDGTNRIV 483 (652)
Q Consensus 405 ~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~-~~~I~~~~ldG~~~~~ 483 (652)
..+++||+.|.+||--.+ ..+-. .|..|+.. .++..-.=+|||..++++-|.-.. .+-++ |+|....-+
T Consensus 378 ~~~s~D~~~glvy~ptGn-~~pd~-----~g~~r~~~--~n~y~~slvALD~~TGk~~W~~Q~~~hD~W--D~D~~~~p~ 447 (764)
T TIGR03074 378 SVASYDEKLGLVYLPMGN-QTPDQ-----WGGDRTPA--DEKYSSSLVALDATTGKERWVFQTVHHDLW--DMDVPAQPS 447 (764)
T ss_pred CceEEcCCCCeEEEeCCC-ccccc-----cCCccccC--cccccceEEEEeCCCCceEEEecccCCccc--cccccCCce
Confidence 458999999999995432 22222 24344221 122233458999999999997543 23343 556554444
Q ss_pred EecCC----CCceeEEEEe-CCEEEEEcCCCCeEE
Q psy5768 484 LSKIS----PLHPFDMAVY-GEFIFWTDWVIHAVL 513 (652)
Q Consensus 484 l~~~~----~~~p~glav~-~~~lYwtd~~~~~I~ 513 (652)
++... ...|.-+... .+++|..|..++...
T Consensus 448 L~d~~~~~G~~~~~v~~~~K~G~~~vlDr~tG~~l 482 (764)
T TIGR03074 448 LVDLPDADGTTVPALVAPTKQGQIYVLDRRTGEPI 482 (764)
T ss_pred EEeeecCCCcEeeEEEEECCCCEEEEEECCCCCEE
Confidence 44311 1122222222 457888887776543
No 235
>KOG3509|consensus
Probab=31.36 E-value=42 Score=39.98 Aligned_cols=59 Identities=24% Similarity=0.553 Sum_probs=45.3
Q ss_pred ccccCCCceeeccCeecCCccCCCCCCCCCCCCCCCCCCCC--CCCCCCCeeecCCC-Cccc
Q psy5768 590 NTVCSEHDFKCSDGMCIPFNQTCDRVYNCHDKSDEGILYCA--MRDCRPGYFKCDNN-KCIL 648 (652)
Q Consensus 590 ~~~C~~~~f~C~~g~Ci~~~~~Cd~~~dC~d~sde~~~~C~--~~~C~~~~f~C~~~-~Ci~ 648 (652)
.+.|.+++|+|.+++|.-..|.||.+.+|..++++....|+ ..+|.+.++.|.+- ||-+
T Consensus 29 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~c~~~~~~~~ 90 (964)
T KOG3509|consen 29 GSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAKPSASDCKPTETQCRDRLRCNP 90 (964)
T ss_pred cccCCcchhccCCccccCchhhhccccccCCCCCcCCccccccccccCCcccccccchhcCC
Confidence 35678899999999999999999999999999977663342 24677777777653 4433
No 236
>KOG0649|consensus
Probab=30.66 E-value=4.7e+02 Score=26.10 Aligned_cols=82 Identities=11% Similarity=-0.028 Sum_probs=49.1
Q ss_pred CCCCCceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeEEEEeC-CEEEEEcCCCCeEEEEEccCCceE
Q psy5768 445 DITMPNALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFDMAVYG-EFIFWTDWVIHAVLRANKYTGEEV 523 (652)
Q Consensus 445 ~l~~P~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~glav~~-~~lYwtd~~~~~I~~~~k~~g~~~ 523 (652)
.+...|+|.+|+.++-|+.+- +...|+.+|+.....+..+.+.......++.-+ .-=..+-...+++..-+..+++.+
T Consensus 113 evPeINam~ldP~enSi~~Ag-GD~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~qilsG~EDGtvRvWd~kt~k~v 191 (325)
T KOG0649|consen 113 EVPEINAMWLDPSENSILFAG-GDGVIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQILSGAEDGTVRVWDTKTQKHV 191 (325)
T ss_pred cCCccceeEeccCCCcEEEec-CCeEEEEEEecCCEEEEEEcCCcceeeeeeecccCcceeecCCCccEEEEecccccee
Confidence 345678999999999999986 677899999976555555554323333343311 111223334455544455566655
Q ss_pred EEEe
Q psy5768 524 YTLR 527 (652)
Q Consensus 524 ~~~~ 527 (652)
+++.
T Consensus 192 ~~ie 195 (325)
T KOG0649|consen 192 SMIE 195 (325)
T ss_pred EEec
Confidence 5554
No 237
>KOG1272|consensus
Probab=30.22 E-value=74 Score=34.32 Aligned_cols=130 Identities=12% Similarity=0.119 Sum_probs=63.1
Q ss_pred eEEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEEC---------CCCEEEEEeccCCcceEEEEEcCCC
Q psy5768 2 FIAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWP---------VKGKMFWSNVTKQVVTIEMAFMDGT 72 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~---------~~~~lyw~d~~~~~~~I~~~~~dgs 72 (652)
||+|+. ..-++++|.+|.....+-... .+..++|-| ..+++=|-| .+.+.|.....
T Consensus 184 ~~AVAQ--K~y~yvYD~~GtElHClk~~~--------~v~rLeFLPyHfLL~~~~~~G~L~Y~D--VS~GklVa~~~--- 248 (545)
T KOG1272|consen 184 FFAVAQ--KKYVYVYDNNGTELHCLKRHI--------RVARLEFLPYHFLLVAASEAGFLKYQD--VSTGKLVASIR--- 248 (545)
T ss_pred HHHhhh--hceEEEecCCCcEEeehhhcC--------chhhhcccchhheeeecccCCceEEEe--echhhhhHHHH---
Confidence 566773 567888999998887776432 333344443 334444444 32333322222
Q ss_pred ccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcE--EEEEeCCCCCceeEEEcCCCCeEEEEec
Q psy5768 73 KRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYR--YVLISGGVDQPSALAVDPESGYLFWSES 150 (652)
Q Consensus 73 ~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~--~~l~~~~~~~P~~iavd~~~g~lywtd~ 150 (652)
++ .+... -|..++-+..| -+-...+.+..-++..+-- +.+. -....++||||+...|| .|.
T Consensus 249 -------t~-~G~~~----vm~qNP~NaVi-h~GhsnGtVSlWSP~skePLvKiLc--H~g~V~siAv~~~G~YM-aTt- 311 (545)
T KOG1272|consen 249 -------TG-AGRTD----VMKQNPYNAVI-HLGHSNGTVSLWSPNSKEPLVKILC--HRGPVSSIAVDRGGRYM-ATT- 311 (545)
T ss_pred -------cc-CCccc----hhhcCCccceE-EEcCCCceEEecCCCCcchHHHHHh--cCCCcceEEECCCCcEE-eec-
Confidence 22 23333 33333323222 2333455565555543321 1111 12467899999966666 342
Q ss_pred CCCCeEEEEeCCC
Q psy5768 151 GKIPLIARAGLDG 163 (652)
Q Consensus 151 ~~~~~I~~~~ldg 163 (652)
|....+..-++-.
T Consensus 312 G~Dr~~kIWDlR~ 324 (545)
T KOG1272|consen 312 GLDRKVKIWDLRN 324 (545)
T ss_pred ccccceeEeeecc
Confidence 3334444444443
No 238
>KOG3658|consensus
Probab=30.08 E-value=77 Score=35.87 Aligned_cols=16 Identities=44% Similarity=0.578 Sum_probs=13.2
Q ss_pred CCCCCeeecCCCCcccC
Q psy5768 633 DCRPGYFKCDNNKCILS 649 (652)
Q Consensus 633 ~C~~~~f~C~~~~Ci~~ 649 (652)
.|. ..+.|.||+|++.
T Consensus 565 ~C~-~~~~C~~G~C~gs 580 (764)
T KOG3658|consen 565 VCN-ETGVCINGKCIGS 580 (764)
T ss_pred ccc-ccceEeCCcCccH
Confidence 576 7789999999874
No 239
>KOG4378|consensus
Probab=29.61 E-value=8e+02 Score=27.09 Aligned_cols=156 Identities=10% Similarity=0.073 Sum_probs=90.4
Q ss_pred CceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecC
Q psy5768 356 GSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSG 435 (652)
Q Consensus 356 ~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG 435 (652)
..+.++.+.|....|- +-+..+.|.+..+.... +.+-+........|-+-.+|.+++|..+-... +.+.-.+..|
T Consensus 122 stvt~v~YN~~DeyiA-svs~gGdiiih~~~t~~--~tt~f~~~sgqsvRll~ys~skr~lL~~asd~--G~VtlwDv~g 196 (673)
T KOG4378|consen 122 STVTYVDYNNTDEYIA-SVSDGGDIIIHGTKTKQ--KTTTFTIDSGQSVRLLRYSPSKRFLLSIASDK--GAVTLWDVQG 196 (673)
T ss_pred ceeEEEEecCCcceeE-EeccCCcEEEEecccCc--cccceecCCCCeEEEeecccccceeeEeeccC--CeEEEEeccC
Confidence 3567777777554432 22233455554443211 22333333334446677788888887765442 3555555666
Q ss_pred CCceEEEEcCCCCC-ceEEEecCCCEEEEEeCCCCeEEEEecCCCceEEEecCCCCceeE-EEEe-CCEEEEEcCCCCeE
Q psy5768 436 FGTESIITTDITMP-NALALDHQAEKLFWGDARLDKIERCDYDGTNRIVLSKISPLHPFD-MAVY-GEFIFWTDWVIHAV 512 (652)
Q Consensus 436 ~~~~~l~~~~l~~P-~glaiD~~~~~LYw~D~~~~~I~~~~ldG~~~~~l~~~~~~~p~g-lav~-~~~lYwtd~~~~~I 512 (652)
.....-....-..| .||.+.+.+..|++.-....+|..+|........-+ ...||+. +++. .+++..+-..++.|
T Consensus 197 ~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~s~~~l--~y~~Plstvaf~~~G~~L~aG~s~G~~ 274 (673)
T KOG4378|consen 197 MSPIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQASTDRL--TYSHPLSTVAFSECGTYLCAGNSKGEL 274 (673)
T ss_pred CCcccchhhhccCCcCcceecCCccceEEEecccceEEEeeccccccccee--eecCCcceeeecCCceEEEeecCCceE
Confidence 55433333223346 499999999999999888888988887632211111 1367865 4444 35777777777788
Q ss_pred EEEEcc
Q psy5768 513 LRANKY 518 (652)
Q Consensus 513 ~~~~k~ 518 (652)
+.-+..
T Consensus 275 i~YD~R 280 (673)
T KOG4378|consen 275 IAYDMR 280 (673)
T ss_pred EEEecc
Confidence 776654
No 240
>KOG0319|consensus
Probab=29.52 E-value=9.4e+02 Score=27.88 Aligned_cols=460 Identities=10% Similarity=0.071 Sum_probs=206.4
Q ss_pred CCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCC-ccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEE
Q psy5768 35 STLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGT-KRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIE 113 (652)
Q Consensus 35 ~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs-~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~ 113 (652)
+....+.++++++.+++||.+- ....+..+.+.-. ........+ ..|. +.||+|.-+ .|.-+-...+++.
T Consensus 60 ed~d~ita~~l~~d~~~L~~a~---rs~llrv~~L~tgk~irswKa~H--e~Pv---i~ma~~~~g-~LlAtggaD~~v~ 130 (775)
T KOG0319|consen 60 EDEDEITALALTPDEEVLVTAS---RSQLLRVWSLPTGKLIRSWKAIH--EAPV---ITMAFDPTG-TLLATGGADGRVK 130 (775)
T ss_pred cchhhhheeeecCCccEEEEee---ccceEEEEEcccchHhHhHhhcc--CCCe---EEEEEcCCC-ceEEeccccceEE
Confidence 3557889999999988888765 3455555666533 111111111 1122 489999855 6666666788999
Q ss_pred EEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCc-EEEEeecccCceeEEEeccCCEEEEE
Q psy5768 114 VARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQ-TILAQEIIMPIKDITLDLKFFSAFYR 192 (652)
Q Consensus 114 v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~-~~~~~~~~~~p~gl~lD~~~~~ly~~ 192 (652)
|-|+.+.++..-+.+--.....+...|.-.+.-.........+..-++..+.. ......-.....+|++-..+.-+.-+
T Consensus 131 VWdi~~~~~th~fkG~gGvVssl~F~~~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~H~S~vtsL~~~~d~~~~ls~ 210 (775)
T KOG0319|consen 131 VWDIKNGYCTHSFKGHGGVVSSLLFHPHWNRWLLASGATDGTVRVWNLNDKRTCLHTMILHKSAVTSLAFSEDSLELLSV 210 (775)
T ss_pred EEEeeCCEEEEEecCCCceEEEEEeCCccchhheeecCCCceEEEEEcccCchHHHHHHhhhhheeeeeeccCCceEEEe
Confidence 99999999988887644556677777653331111112344455445442211 11111113455677776655444433
Q ss_pred eCCCCcEEEEEecCCCCceEEEeecCCCCCcceeeeeeecc-CCCCCCCCCCCCCCcccceecCCCceEEE----eCCcc
Q psy5768 193 NLSKGNIHIISLSNLSDVSTISMKPYGDSYLKDIKIYSKDA-QTGTNPCGVNNGGCAELCLYNGVSAVCAC----AHGVV 267 (652)
Q Consensus 193 d~~g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~-q~~~n~C~~~ng~Cs~lC~~~~~~~~C~C----~~G~l 267 (652)
..| +.|..-.........+.|. .....++...+... +. ..-...-|+-.-+|...+.+..|.= +++--
T Consensus 211 ~RD----kvi~vwd~~~~~~l~~lp~-ye~~E~vv~l~~~~~~~--~~~~~TaG~~g~~~~~d~es~~~~~~~~~~~~~e 283 (775)
T KOG0319|consen 211 GRD----KVIIVWDLVQYKKLKTLPL-YESLESVVRLREELGGK--GEYIITAGGSGVVQYWDSESGKCVYKQRQSDSEE 283 (775)
T ss_pred ccC----cEEEEeehhhhhhhheech-hhheeeEEEechhcCCc--ceEEEEecCCceEEEEecccchhhhhhccCCchh
Confidence 222 1111100000000000000 00011111111100 00 0000000111112221111111100 00000
Q ss_pred ccCCCcccccceEEEEeeecceeEEecCCCCCCCCCceeeeeccccceEEEEEEEcC-CCeEEEeecccccEEEEeccCC
Q psy5768 268 AQDGKSCSEYDAFIMYSRVNRIDSIHMTDKSDLNSPFESIRNSTMMKNIIELSYDYK-RKTLFYSDIQKGTINSVFFNGS 346 (652)
Q Consensus 268 ~~dg~~C~~~~~~Ll~s~~~~i~~i~l~~~~~~~~p~~~~~~~~~~~~~~~v~~D~~-~~~lywsd~~~~~I~~~~~~g~ 346 (652)
..+...|.....+|+++....|.-++. + .+. +...+.. .-..+..|-|=.. .+.|.++ .++..+..+.+.+.
T Consensus 284 ~~~~~~~~~~~~~l~vtaeQnl~l~d~-~--~l~-i~k~ivG--~ndEI~Dm~~lG~e~~~laVA-TNs~~lr~y~~~~~ 356 (775)
T KOG0319|consen 284 IDHLLAIESMSQLLLVTAEQNLFLYDE-D--ELT-IVKQIVG--YNDEILDMKFLGPEESHLAVA-TNSPELRLYTLPTS 356 (775)
T ss_pred hhcceeccccCceEEEEccceEEEEEc-c--ccE-EehhhcC--CchhheeeeecCCccceEEEE-eCCCceEEEecCCC
Confidence 112223334456677776666666643 1 111 1111111 0112233333222 2344443 33455665677777
Q ss_pred cceEEeeccCceeeeEEE-ccCCEEEEEeCCCCe--EEEEEcCCCCCccEEEEE--eCCCCCceEEEEeCCCCEEEEEec
Q psy5768 347 NHRVLLERQGSVEGLAYE-YVHNYLYWTCNNDAT--INKIDLDSPKAQRIVVVR--LGQHDKPRGIDIDSCDSRIYWTNW 421 (652)
Q Consensus 347 ~~~~i~~~~~~~~glAvD-w~~~~LYwtd~~~~~--I~~~~~~~~~~~~~~~~~--~~~~~~P~~Iavdp~~g~Lywtd~ 421 (652)
..+.+...-+.+ |++| |..+.|..|-+...+ +|+++-+.. ....+. .+......++|... .|.=|+...
T Consensus 357 ~c~ii~GH~e~v--lSL~~~~~g~llat~sKD~svilWr~~~~~~---~~~~~a~~~gH~~svgava~~~-~~asffvsv 430 (775)
T KOG0319|consen 357 YCQIIPGHTEAV--LSLDVWSSGDLLATGSKDKSVILWRLNNNCS---KSLCVAQANGHTNSVGAVAGSK-LGASFFVSV 430 (775)
T ss_pred ceEEEeCchhhe--eeeeecccCcEEEEecCCceEEEEEecCCcc---hhhhhhhhcccccccceeeecc-cCccEEEEe
Confidence 666443322233 4555 777778888776665 455521111 111111 12234566788844 455444444
Q ss_pred CCCCCceEEEeecCCC--ceEEEE--c-----CCCCCceEEEecCCCEEEEEeC--CCCeEEEEecCCCceEEEecCCCC
Q psy5768 422 NSHLPSIQRAFFSGFG--TESIIT--T-----DITMPNALALDHQAEKLFWGDA--RLDKIERCDYDGTNRIVLSKISPL 490 (652)
Q Consensus 422 ~~~~~~I~r~~ldG~~--~~~l~~--~-----~l~~P~glaiD~~~~~LYw~D~--~~~~I~~~~ldG~~~~~l~~~~~~ 490 (652)
++. ..|....+.++. +..++- . .-+..|+++|.+. ++|.-+-+ ++.+|+... .......+.+.-.
T Consensus 431 S~D-~tlK~W~l~~s~~~~~~~~~~~~~t~~aHdKdIN~Vaia~n-dkLiAT~SqDktaKiW~le--~~~l~~vLsGH~R 506 (775)
T KOG0319|consen 431 SQD-CTLKLWDLPKSKETAFPIVLTCRYTERAHDKDINCVAIAPN-DKLIATGSQDKTAKIWDLE--QLRLLGVLSGHTR 506 (775)
T ss_pred cCC-ceEEEecCCCcccccccceehhhHHHHhhcccccceEecCC-CceEEecccccceeeeccc--CceEEEEeeCCcc
Confidence 433 255555665522 111111 0 1245789999955 55655433 344566554 2222333333334
Q ss_pred ceeEEEEe-CCEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 491 HPFDMAVY-GEFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 491 ~p~glav~-~~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
..+.+.+- .+.+.-|-....+|..=+..+.+=.+++.
T Consensus 507 Gvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSClkT~e 544 (775)
T KOG0319|consen 507 GVWCVSFSKNDQLLATCSGDKTVKIWSISTFSCLKTFE 544 (775)
T ss_pred ceEEEEeccccceeEeccCCceEEEEEeccceeeeeec
Confidence 55666665 46788887777754332333344344443
No 241
>KOG4260|consensus
Probab=29.34 E-value=25 Score=34.94 Aligned_cols=34 Identities=21% Similarity=0.428 Sum_probs=28.3
Q ss_pred CCCCCCCCCCCCCc--ccceecCCCceEEEeCCccc
Q psy5768 235 TGTNPCGVNNGGCA--ELCLYNGVSAVCACAHGVVA 268 (652)
Q Consensus 235 ~~~n~C~~~ng~Cs--~lC~~~~~~~~C~C~~G~l~ 268 (652)
.++|+|..-...|. |+|+++.++|+|.+.+||..
T Consensus 234 vDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~ 269 (350)
T KOG4260|consen 234 VDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK 269 (350)
T ss_pred ccHHHHhcCCCCCChhheeecCCCceEecccccccC
Confidence 67899987666776 69999888999999999854
No 242
>KOG0281|consensus
Probab=28.43 E-value=5.1e+02 Score=27.15 Aligned_cols=191 Identities=14% Similarity=0.107 Sum_probs=101.2
Q ss_pred eEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCccE
Q psy5768 315 NIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQRI 393 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~ 393 (652)
.+.-+.|| +...++-...++|..-+.+.-....++ ..-+++--|.+| .+ +..+-+...+|.+-++...+. .+
T Consensus 199 gVYClQYD---D~kiVSGlrDnTikiWD~n~~~c~~~L~GHtGSVLCLqyd--~r-viisGSSDsTvrvWDv~tge~-l~ 271 (499)
T KOG0281|consen 199 GVYCLQYD---DEKIVSGLRDNTIKIWDKNSLECLKILTGHTGSVLCLQYD--ER-VIVSGSSDSTVRVWDVNTGEP-LN 271 (499)
T ss_pred ceEEEEec---chhhhcccccCceEEeccccHHHHHhhhcCCCcEEeeecc--ce-EEEecCCCceEEEEeccCCch-hh
Confidence 34455555 223344444455555544433222222 233455555555 33 667777777888878764321 12
Q ss_pred EEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCC----ceEEEEcCCCCCceEEEecCCCEEEEEeCCCC
Q psy5768 394 VVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFG----TESIITTDITMPNALALDHQAEKLFWGDARLD 469 (652)
Q Consensus 394 ~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~----~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~ 469 (652)
+++ ..-+..-++.+. +|+|.-.... . .|....|+... +++|+.. ....| .+|++.+ +.+.-++..
T Consensus 272 tli--hHceaVLhlrf~--ng~mvtcSkD-r--siaVWdm~sps~it~rrVLvGH-rAaVN--vVdfd~k-yIVsASgDR 340 (499)
T KOG0281|consen 272 TLI--HHCEAVLHLRFS--NGYMVTCSKD-R--SIAVWDMASPTDITLRRVLVGH-RAAVN--VVDFDDK-YIVSASGDR 340 (499)
T ss_pred HHh--hhcceeEEEEEe--CCEEEEecCC-c--eeEEEeccCchHHHHHHHHhhh-hhhee--eeccccc-eEEEecCCc
Confidence 222 122333444444 5777654433 2 56666666554 3344431 12233 3455555 334444555
Q ss_pred eEEEEecCCCceEEEecCCCCceeEEEE--eCCEEEEEcCCCCeEEEEEccCCceEEEE
Q psy5768 470 KIERCDYDGTNRIVLSKISPLHPFDMAV--YGEFIFWTDWVIHAVLRANKYTGEEVYTL 526 (652)
Q Consensus 470 ~I~~~~ldG~~~~~l~~~~~~~p~glav--~~~~lYwtd~~~~~I~~~~k~~g~~~~~~ 526 (652)
.|..-+.+...-...+. .|-.|||- |.+.+.++....++|...+...|.-.+++
T Consensus 341 TikvW~~st~efvRtl~---gHkRGIAClQYr~rlvVSGSSDntIRlwdi~~G~cLRvL 396 (499)
T KOG0281|consen 341 TIKVWSTSTCEFVRTLN---GHKRGIACLQYRDRLVVSGSSDNTIRLWDIECGACLRVL 396 (499)
T ss_pred eEEEEeccceeeehhhh---cccccceehhccCeEEEecCCCceEEEEeccccHHHHHH
Confidence 67766666543222222 56677774 78999999999999988787777644433
No 243
>PF08954 DUF1900: Domain of unknown function (DUF1900); InterPro: IPR015049 This domain is predominantly found in the structural protein coronin, and is duplicated in some sequences. It has no known function []. ; PDB: 2B4E_A 2AQ5_A.
Probab=28.37 E-value=1.6e+02 Score=26.47 Aligned_cols=54 Identities=13% Similarity=0.167 Sum_probs=30.8
Q ss_pred eEEEccCCEEEEEeCCCCeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCC
Q psy5768 361 LAYEYVHNYLYWTCNNDATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDS 414 (652)
Q Consensus 361 lAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g 414 (652)
--+|..++.||.+-.+.++|....+.........+-.......-+|+++-|++.
T Consensus 16 P~yD~dt~llyl~gKGD~~ir~yEv~~~~p~l~~l~~~~s~~~~~G~~~lPK~~ 69 (136)
T PF08954_consen 16 PFYDEDTNLLYLAGKGDGNIRYYEVSDESPYLHYLSEYRSPEPQKGFAFLPKRA 69 (136)
T ss_dssp EEE-TTT-EEEEEETT-S-EEEEEE-SSTTSEEEEEEE--SS--SEEEE--GGG
T ss_pred eeEcCCCCEEEEEeccCcEEEEEEEcCCCCceEEccccccCCCeEeeEecCccc
Confidence 358999999999999999999999876543333333333334458999999654
No 244
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=28.34 E-value=8.5e+02 Score=26.97 Aligned_cols=33 Identities=21% Similarity=0.328 Sum_probs=25.3
Q ss_pred EEEEeCCEEEEEcCCCCeEEEEEccCCceEEEEe
Q psy5768 494 DMAVYGEFIFWTDWVIHAVLRANKYTGEEVYTLR 527 (652)
Q Consensus 494 glav~~~~lYwtd~~~~~I~~~~k~~g~~~~~~~ 527 (652)
.+++.++.||..+ ..+.|+.+++.+|+..-...
T Consensus 401 ~~~~~g~~v~~g~-~dG~l~ald~~tG~~lW~~~ 433 (488)
T cd00216 401 SLATAGNLVFAGA-ADGYFRAFDATTGKELWKFR 433 (488)
T ss_pred ceEecCCeEEEEC-CCCeEEEEECCCCceeeEEE
Confidence 3567788999987 57789999999998654443
No 245
>KOG1274|consensus
Probab=28.22 E-value=1.1e+03 Score=28.16 Aligned_cols=141 Identities=16% Similarity=0.240 Sum_probs=74.3
Q ss_pred cCCCCeEEEEecCCCeeEEEecCCC-CCCCCC----CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCC
Q psy5768 7 SPTQSKIVVCNLEGEYQTTILSNES-NDTSTL----SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQK 81 (652)
Q Consensus 7 ~~~~~~I~~~~~~g~~~~~~~~~~~-~~~~~~----~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~ 81 (652)
|+.+..|..++.+|... .+-.+.- -++..+ ..+.+++.. .+++. +- ...+.|.|+..+...-.+++..=
T Consensus 22 d~~gefi~tcgsdg~ir-~~~~~sd~e~P~ti~~~g~~v~~ia~~--s~~f~-~~--s~~~tv~~y~fps~~~~~iL~Rf 95 (933)
T KOG1274|consen 22 DPDGEFICTCGSDGDIR-KWKTNSDEEEPETIDISGELVSSIACY--SNHFL-TG--SEQNTVLRYKFPSGEEDTILARF 95 (933)
T ss_pred cCCCCEEEEecCCCceE-EeecCCcccCCchhhccCceeEEEeec--ccceE-Ee--eccceEEEeeCCCCCccceeeee
Confidence 45556677777666432 2221111 011111 234455543 33333 33 45788999988876655555432
Q ss_pred CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcC-CCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEe
Q psy5768 82 KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLT-GQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAG 160 (652)
Q Consensus 82 ~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~d-g~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ 160 (652)
.-..+ .++|+- +|++-.+-+..-.|.+.+.+ ++..+++.. --.....|.+||.+.+|-.++- .+.+...+
T Consensus 96 -tlp~r----~~~v~g-~g~~iaagsdD~~vK~~~~~D~s~~~~lrg-h~apVl~l~~~p~~~fLAvss~--dG~v~iw~ 166 (933)
T KOG1274|consen 96 -TLPIR----DLAVSG-SGKMIAAGSDDTAVKLLNLDDSSQEKVLRG-HDAPVLQLSYDPKGNFLAVSSC--DGKVQIWD 166 (933)
T ss_pred -eccce----EEEEec-CCcEEEeecCceeEEEEeccccchheeecc-cCCceeeeeEcCCCCEEEEEec--CceEEEEE
Confidence 12245 889985 44444444444456666654 455555443 2345668888887666655543 34555555
Q ss_pred CC
Q psy5768 161 LD 162 (652)
Q Consensus 161 ld 162 (652)
++
T Consensus 167 ~~ 168 (933)
T KOG1274|consen 167 LQ 168 (933)
T ss_pred cc
Confidence 44
No 246
>KOG0275|consensus
Probab=28.08 E-value=3.9e+02 Score=27.55 Aligned_cols=91 Identities=12% Similarity=0.069 Sum_probs=57.0
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCC
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKK 82 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~ 82 (652)
++||- -...|++++.+|..++.|-+..+ .-...+.-.+.|+++.+|-.- +.+.+|.+..-....+....-.
T Consensus 408 ~iVCN-rsntv~imn~qGQvVrsfsSGkR----EgGdFi~~~lSpkGewiYcig---ED~vlYCF~~~sG~LE~tl~Vh- 478 (508)
T KOG0275|consen 408 FIVCN-RSNTVYIMNMQGQVVRSFSSGKR----EGGDFINAILSPKGEWIYCIG---EDGVLYCFSVLSGKLERTLPVH- 478 (508)
T ss_pred EEEEc-CCCeEEEEeccceEEeeeccCCc----cCCceEEEEecCCCcEEEEEc---cCcEEEEEEeecCceeeeeecc-
Confidence 45664 34568899999999988875332 335677888999999999765 6677887765433333333222
Q ss_pred cCCccCCCCcEEEEccCCcEEEEeC
Q psy5768 83 YPAVTACNLHIAVDWIAQNIYWSDP 107 (652)
Q Consensus 83 ~~~p~~~~~~lavDw~~~~lY~~d~ 107 (652)
-..|- |||--+ .+|+.-+.+
T Consensus 479 EkdvI----Gl~HHP-HqNllAsYs 498 (508)
T KOG0275|consen 479 EKDVI----GLTHHP-HQNLLASYS 498 (508)
T ss_pred ccccc----ccccCc-ccchhhhhc
Confidence 12333 776654 555554443
No 247
>KOG4441|consensus
Probab=27.13 E-value=9.7e+02 Score=27.26 Aligned_cols=158 Identities=15% Similarity=0.130 Sum_probs=0.0
Q ss_pred eeEEEccCCEEEEEeCCC------CeEEEEEcCCCCCccEEEEEeCCCCCceE-EEEeCCCCEEEEE---ecCCCCCceE
Q psy5768 360 GLAYEYVHNYLYWTCNND------ATINKIDLDSPKAQRIVVVRLGQHDKPRG-IDIDSCDSRIYWT---NWNSHLPSIQ 429 (652)
Q Consensus 360 glAvDw~~~~LYwtd~~~------~~I~~~~~~~~~~~~~~~~~~~~~~~P~~-Iavdp~~g~Lywt---d~~~~~~~I~ 429 (652)
|+++ +++.||.+-+.. .++++.+... ..=.....+..+|. .++-...|.||.. |....-..||
T Consensus 327 ~~~~--~~~~lYv~GG~~~~~~~l~~ve~YD~~~-----~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg~~~l~svE 399 (571)
T KOG4441|consen 327 GVAV--LNGKLYVVGGYDSGSDRLSSVERYDPRT-----NQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDGEKSLNSVE 399 (571)
T ss_pred cEEE--ECCEEEEEccccCCCcccceEEEecCCC-----CceeccCCccCccccceeEEECCEEEEEeccccccccccEE
Q ss_pred EEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeC------CCCeEEEEecCCCceEEEecCC-CCceeEEEEeCCEE
Q psy5768 430 RAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDA------RLDKIERCDYDGTNRIVLSKIS-PLHPFDMAVYGEFI 502 (652)
Q Consensus 430 r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~------~~~~I~~~~ldG~~~~~l~~~~-~~~p~glav~~~~l 502 (652)
+-+.....-..+..-.. .-.+.++-...++||.+-. ...+++++|.....=+.+..-. ...-+|+++.+++|
T Consensus 400 ~YDp~~~~W~~va~m~~-~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g~a~~~~~i 478 (571)
T KOG4441|consen 400 CYDPVTNKWTPVAPMLT-RRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFGVAVLNGKI 478 (571)
T ss_pred EecCCCCcccccCCCCc-ceeeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCceeecCCcccccccceEEEECCEE
Q ss_pred EEEcCCCCe-----EEEEEccCCceEEE
Q psy5768 503 FWTDWVIHA-----VLRANKYTGEEVYT 525 (652)
Q Consensus 503 Ywtd~~~~~-----I~~~~k~~g~~~~~ 525 (652)
|..--..+. |++.++.+.+-..+
T Consensus 479 YvvGG~~~~~~~~~VE~ydp~~~~W~~v 506 (571)
T KOG4441|consen 479 YVVGGFDGTSALSSVERYDPETNQWTMV 506 (571)
T ss_pred EEECCccCCCccceEEEEcCCCCceeEc
No 248
>KOG1272|consensus
Probab=26.97 E-value=1.9e+02 Score=31.32 Aligned_cols=144 Identities=15% Similarity=0.167 Sum_probs=82.7
Q ss_pred eEEEecCCCCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC
Q psy5768 2 FIAVSSPTQSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 80 (652)
Q Consensus 2 ~i~v~~~~~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~ 80 (652)
|++|+....+...--|.. |+.+..|-+. ......+.-.|.+.-|-.- ..++++..+++.....-+-+-.
T Consensus 222 fLL~~~~~~G~L~Y~DVS~GklVa~~~t~-------~G~~~vm~qNP~NaVih~G---hsnGtVSlWSP~skePLvKiLc 291 (545)
T KOG1272|consen 222 FLLVAASEAGFLKYQDVSTGKLVASIRTG-------AGRTDVMKQNPYNAVIHLG---HSNGTVSLWSPNSKEPLVKILC 291 (545)
T ss_pred heeeecccCCceEEEeechhhhhHHHHcc-------CCccchhhcCCccceEEEc---CCCceEEecCCCCcchHHHHHh
Confidence 567777777777655554 5555444432 2355666677776666544 3678888888776532221223
Q ss_pred CCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEe
Q psy5768 81 KKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAG 160 (652)
Q Consensus 81 ~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ 160 (652)
. .+.++ +||||. +++.-.|-.....+.+-|+......--..+ -..-..+++. ..|.| -..+|..-.|++-.
T Consensus 292 H-~g~V~----siAv~~-~G~YMaTtG~Dr~~kIWDlR~~~ql~t~~t-p~~a~~ls~S-qkglL-A~~~G~~v~iw~d~ 362 (545)
T KOG1272|consen 292 H-RGPVS----SIAVDR-GGRYMATTGLDRKVKIWDLRNFYQLHTYRT-PHPASNLSLS-QKGLL-ALSYGDHVQIWKDA 362 (545)
T ss_pred c-CCCcc----eEEECC-CCcEEeecccccceeEeeeccccccceeec-CCCccccccc-cccce-eeecCCeeeeehhh
Confidence 3 45578 999996 555555655666788888876542111111 1122244443 34554 44566666677766
Q ss_pred CCCC
Q psy5768 161 LDGK 164 (652)
Q Consensus 161 ldg~ 164 (652)
++|+
T Consensus 363 ~~~s 366 (545)
T KOG1272|consen 363 LKGS 366 (545)
T ss_pred hcCC
Confidence 6654
No 249
>KOG2106|consensus
Probab=26.90 E-value=9e+02 Score=26.79 Aligned_cols=102 Identities=12% Similarity=0.111 Sum_probs=48.5
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCC-ccEEEEeCCCcCCccCCCCc-EEEEccCCcEEEEeCCCCEEEEE
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGT-KRETVVSQKKYPAVTACNLH-IAVDWIAQNIYWSDPKENVIEVA 115 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs-~~~~v~~~~~~~~p~~~~~~-lavDw~~~~lY~~d~~~~~I~v~ 115 (652)
.+.+++|.+|.++. |.+- ...+.+..++ +-. ....++... .....++|.| ||+-..++++++.|..+......
T Consensus 369 delwgla~hps~~q-~~T~--gqdk~v~lW~-~~k~~wt~~~~d~-~~~~~fhpsg~va~Gt~~G~w~V~d~e~~~lv~~ 443 (626)
T KOG2106|consen 369 DELWGLATHPSKNQ-LLTC--GQDKHVRLWN-DHKLEWTKIIEDP-AECADFHPSGVVAVGTATGRWFVLDTETQDLVTI 443 (626)
T ss_pred cceeeEEcCCChhh-eeec--cCcceEEEcc-CCceeEEEEecCc-eeEeeccCcceEEEeeccceEEEEecccceeEEE
Confidence 47799999987554 4455 3344444444 222 122222222 2223333434 45566677777777655444434
Q ss_pred EcCCCcEEEEEeCCCCCceeEEEcCCCCeEE
Q psy5768 116 RLTGQYRYVLISGGVDQPSALAVDPESGYLF 146 (652)
Q Consensus 116 ~~dg~~~~~l~~~~~~~P~~iavd~~~g~ly 146 (652)
..+++...++.-. ..-.-|||-..++.||
T Consensus 444 ~~d~~~ls~v~ys--p~G~~lAvgs~d~~iy 472 (626)
T KOG2106|consen 444 HTDNEQLSVVRYS--PDGAFLAVGSHDNHIY 472 (626)
T ss_pred EecCCceEEEEEc--CCCCEEEEecCCCeEE
Confidence 4443333222211 1122455555555444
No 250
>KOG0640|consensus
Probab=26.84 E-value=3.1e+02 Score=28.19 Aligned_cols=101 Identities=13% Similarity=0.173 Sum_probs=61.9
Q ss_pred EEEecCCCCeEEEEecCCCeeE-EEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCC
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQT-TILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQK 81 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~-~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~ 81 (652)
|+++.+....|-.||..-.... .+.. ......+.+|.++|.+++|.+.. ....+..++.+-. +-+++..
T Consensus 186 ILiS~srD~tvKlFDfsK~saKrA~K~-----~qd~~~vrsiSfHPsGefllvgT---dHp~~rlYdv~T~--Qcfvsan 255 (430)
T KOG0640|consen 186 ILISGSRDNTVKLFDFSKTSAKRAFKV-----FQDTEPVRSISFHPSGEFLLVGT---DHPTLRLYDVNTY--QCFVSAN 255 (430)
T ss_pred eEEeccCCCeEEEEecccHHHHHHHHH-----hhccceeeeEeecCCCceEEEec---CCCceeEEeccce--eEeeecC
Confidence 7777777777877776644332 2221 11346778999999999998754 3445544544322 2333322
Q ss_pred ----CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcC
Q psy5768 82 ----KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLT 118 (652)
Q Consensus 82 ----~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~d 118 (652)
-.+.+. .+-+. .+++||++-+..+.|...|--
T Consensus 256 Pd~qht~ai~----~V~Ys-~t~~lYvTaSkDG~IklwDGV 291 (430)
T KOG0640|consen 256 PDDQHTGAIT----QVRYS-STGSLYVTASKDGAIKLWDGV 291 (430)
T ss_pred ccccccccee----EEEec-CCccEEEEeccCCcEEeeccc
Confidence 022344 55555 378999999988888776543
No 251
>KOG1407|consensus
Probab=26.58 E-value=6.8e+02 Score=25.29 Aligned_cols=170 Identities=14% Similarity=0.068 Sum_probs=0.0
Q ss_pred EEecCCCCeEEEEecCCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCc
Q psy5768 4 AVSSPTQSKIVVCNLEGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKY 83 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~ 83 (652)
+++......+.++++++.....-..+++ .-.....+.+|+...-+|.+- .+...|.+++.-.+.-...+...
T Consensus 35 lasgs~dktv~v~n~e~~r~~~~~~~~g----h~~svdql~w~~~~~d~~ata--s~dk~ir~wd~r~~k~~~~i~~~-- 106 (313)
T KOG1407|consen 35 LASGSFDKTVSVWNLERDRFRKELVYRG----HTDSVDQLCWDPKHPDLFATA--SGDKTIRIWDIRSGKCTARIETK-- 106 (313)
T ss_pred eeecccCCceEEEEecchhhhhhhcccC----CCcchhhheeCCCCCcceEEe--cCCceEEEEEeccCcEEEEeecc--
Q ss_pred CCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCC
Q psy5768 84 PAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDG 163 (652)
Q Consensus 84 ~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg 163 (652)
-+ -+-+-|...-=|++-.++.-...+=-.-+++.+--.........++-. ..+.||+...| .+.|+....-.
T Consensus 107 --~e----ni~i~wsp~g~~~~~~~kdD~it~id~r~~~~~~~~~~~~e~ne~~w~-~~nd~Fflt~G-lG~v~ILsyps 178 (313)
T KOG1407|consen 107 --GE----NINITWSPDGEYIAVGNKDDRITFIDARTYKIVNEEQFKFEVNEISWN-NSNDLFFLTNG-LGCVEILSYPS 178 (313)
T ss_pred --Cc----ceEEEEcCCCCEEEEecCcccEEEEEecccceeehhcccceeeeeeec-CCCCEEEEecC-CceEEEEeccc
Q ss_pred CCcEEEEeecccCceeEEEeccCCEE
Q psy5768 164 KKQTILAQEIIMPIKDITLDLKFFSA 189 (652)
Q Consensus 164 ~~~~~~~~~~~~~p~gl~lD~~~~~l 189 (652)
-.+..-++.--..-.-|.+|+.++.+
T Consensus 179 Lkpv~si~AH~snCicI~f~p~Gryf 204 (313)
T KOG1407|consen 179 LKPVQSIKAHPSNCICIEFDPDGRYF 204 (313)
T ss_pred cccccccccCCcceEEEEECCCCceE
No 252
>KOG0646|consensus
Probab=26.25 E-value=7.7e+02 Score=26.87 Aligned_cols=52 Identities=17% Similarity=0.105 Sum_probs=35.1
Q ss_pred CCeEEEEecCCCe-eEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCC
Q psy5768 10 QSKIVVCNLEGEY-QTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGT 72 (652)
Q Consensus 10 ~~~I~~~~~~g~~-~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs 72 (652)
...|.++++-+.. ..++.. -..+.++++||.+.++|.-. ..+.|+...+.+.
T Consensus 197 D~t~k~wdlS~g~LLlti~f--------p~si~av~lDpae~~~yiGt---~~G~I~~~~~~~~ 249 (476)
T KOG0646|consen 197 DRTIKLWDLSLGVLLLTITF--------PSSIKAVALDPAERVVYIGT---EEGKIFQNLLFKL 249 (476)
T ss_pred CceEEEEEeccceeeEEEec--------CCcceeEEEcccccEEEecC---CcceEEeeehhcC
Confidence 3455666776443 333332 15789999999999999765 5678887776654
No 253
>KOG0284|consensus
Probab=26.14 E-value=1.8e+02 Score=30.98 Aligned_cols=102 Identities=9% Similarity=0.035 Sum_probs=72.1
Q ss_pred CCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEc
Q psy5768 38 SKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARL 117 (652)
Q Consensus 38 ~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~ 117 (652)
+.+.++.+.+.+..+.=.| ..+.|..+.++-.+...+.... ...++ ++|+-. ++.-|.+-+..+.|.+-+.
T Consensus 139 s~Vr~m~ws~~g~wmiSgD---~gG~iKyWqpnmnnVk~~~ahh-~eaIR----dlafSp-nDskF~t~SdDg~ikiWdf 209 (464)
T KOG0284|consen 139 SPVRTMKWSHNGTWMISGD---KGGMIKYWQPNMNNVKIIQAHH-AEAIR----DLAFSP-NDSKFLTCSDDGTIKIWDF 209 (464)
T ss_pred ccceeEEEccCCCEEEEcC---CCceEEecccchhhhHHhhHhh-hhhhh----eeccCC-CCceeEEecCCCeEEEEec
Confidence 4678888988877777555 5677777877766655554444 56788 999998 8888998888888888765
Q ss_pred CCCcEEEEEeCCCCCceeEEEcCCCCeEEEE
Q psy5768 118 TGQYRYVLISGGVDQPSALAVDPESGYLFWS 148 (652)
Q Consensus 118 dg~~~~~l~~~~~~~P~~iavd~~~g~lywt 148 (652)
--...+.++.+---.|+.++=+|+.|.|+-.
T Consensus 210 ~~~kee~vL~GHgwdVksvdWHP~kgLiasg 240 (464)
T KOG0284|consen 210 RMPKEERVLRGHGWDVKSVDWHPTKGLIASG 240 (464)
T ss_pred cCCchhheeccCCCCcceeccCCccceeEEc
Confidence 4333333334433568888888888877643
No 254
>KOG0289|consensus
Probab=25.90 E-value=8.7e+02 Score=26.32 Aligned_cols=193 Identities=13% Similarity=0.073 Sum_probs=102.3
Q ss_pred eEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEee--ccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCcc
Q psy5768 315 NIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLE--RQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQR 392 (652)
Q Consensus 315 ~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~--~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~ 392 (652)
.+.++..++.++++.|++.+..-++.-.-+|....++.. ..-.....++-+ .++||-+-...+.+.+.++...
T Consensus 305 ~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHp-DgLifgtgt~d~~vkiwdlks~---- 379 (506)
T KOG0289|consen 305 PVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHP-DGLIFGTGTPDGVVKIWDLKSQ---- 379 (506)
T ss_pred cceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcC-CceEEeccCCCceEEEEEcCCc----
Confidence 356788889999999988765555544445544333332 111356667765 4578888777777777787542
Q ss_pred EEEEEeCC-CCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCeE
Q psy5768 393 IVVVRLGQ-HDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTDITMPNALALDHQAEKLFWGDARLDKI 471 (652)
Q Consensus 393 ~~~~~~~~-~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~I 471 (652)
..+..+.. ....++|++.. +||--.+.......+++-.+-+- +.+++...+-...+.+.+|..+..|-.+ ...=.|
T Consensus 380 ~~~a~Fpght~~vk~i~FsE-NGY~Lat~add~~V~lwDLRKl~-n~kt~~l~~~~~v~s~~fD~SGt~L~~~-g~~l~V 456 (506)
T KOG0289|consen 380 TNVAKFPGHTGPVKAISFSE-NGYWLATAADDGSVKLWDLRKLK-NFKTIQLDEKKEVNSLSFDQSGTYLGIA-GSDLQV 456 (506)
T ss_pred cccccCCCCCCceeEEEecc-CceEEEEEecCCeEEEEEehhhc-ccceeeccccccceeEEEcCCCCeEEee-cceeEE
Confidence 23333332 34557788876 77755555443322444333222 3344443333346789999877766554 222233
Q ss_pred EEEecCCCceEEEecCC-CC-ceeEEEEeCCEEEEEcCCCCeEEEE
Q psy5768 472 ERCDYDGTNRIVLSKIS-PL-HPFDMAVYGEFIFWTDWVIHAVLRA 515 (652)
Q Consensus 472 ~~~~ldG~~~~~l~~~~-~~-~p~glav~~~~lYwtd~~~~~I~~~ 515 (652)
+.+.-....=..+.... .. ..-++.+-+...|..+-...++.++
T Consensus 457 y~~~k~~k~W~~~~~~~~~sg~st~v~Fg~~aq~l~s~smd~~l~~ 502 (506)
T KOG0289|consen 457 YICKKKTKSWTEIKELADHSGLSTGVRFGEHAQYLASTSMDAILRL 502 (506)
T ss_pred EEEecccccceeeehhhhcccccceeeecccceEEeeccchhheEE
Confidence 33332222211221111 11 2234444455666666565555444
No 255
>KOG0268|consensus
Probab=25.89 E-value=3.4e+02 Score=28.57 Aligned_cols=143 Identities=9% Similarity=0.091 Sum_probs=77.6
Q ss_pred cceEEEEEEEcCCCeEEEeecccccEEEEeccC-CcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCc
Q psy5768 313 MKNIIELSYDYKRKTLFYSDIQKGTINSVFFNG-SNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQ 391 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g-~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~ 391 (652)
..++..+-|++..-.|.-+-...+.|...++-. +..+.++... .+.+|+..+ ....|.+-.+...+...++..++
T Consensus 187 ~Dti~svkfNpvETsILas~~sDrsIvLyD~R~~~Pl~KVi~~m-RTN~IswnP-eafnF~~a~ED~nlY~~DmR~l~-- 262 (433)
T KOG0268|consen 187 ADSISSVKFNPVETSILASCASDRSIVLYDLRQASPLKKVILTM-RTNTICWNP-EAFNFVAANEDHNLYTYDMRNLS-- 262 (433)
T ss_pred CCceeEEecCCCcchheeeeccCCceEEEecccCCccceeeeec-cccceecCc-cccceeeccccccceehhhhhhc--
Confidence 345666777777777776666667777777543 2233343333 556777777 55556555556666666654432
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCC-ceEEEEcCCCCCceEEEecCCCEEE
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFG-TESIITTDITMPNALALDHQAEKLF 462 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~-~~~l~~~~l~~P~glaiD~~~~~LY 462 (652)
+..-+.........++...| .|.=|++-.-.. .|.-...+... |.+..+..+....++....+++.++
T Consensus 263 ~p~~v~~dhvsAV~dVdfsp-tG~EfvsgsyDk--sIRIf~~~~~~SRdiYhtkRMq~V~~Vk~S~Dskyi~ 331 (433)
T KOG0268|consen 263 RPLNVHKDHVSAVMDVDFSP-TGQEFVSGSYDK--SIRIFPVNHGHSRDIYHTKRMQHVFCVKYSMDSKYII 331 (433)
T ss_pred ccchhhcccceeEEEeccCC-Ccchhccccccc--eEEEeecCCCcchhhhhHhhhheeeEEEEeccccEEE
Confidence 11111112233445667777 566677654443 33333444433 4444344566666666665555444
No 256
>KOG0284|consensus
Probab=25.60 E-value=6.2e+02 Score=27.16 Aligned_cols=102 Identities=7% Similarity=0.127 Sum_probs=60.8
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe--eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCc
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL--ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQ 391 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~--~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~ 391 (652)
..+.++.|....+.|.-.. ....+..+++. +-.+... ..-..+..++--++...||-+-...+.|..-.+....
T Consensus 265 ntVl~~~f~~n~N~Llt~s-kD~~~kv~DiR-~mkEl~~~r~Hkkdv~~~~WhP~~~~lftsgg~Dgsvvh~~v~~~~-- 340 (464)
T KOG0284|consen 265 NTVLAVKFNPNGNWLLTGS-KDQSCKVFDIR-TMKELFTYRGHKKDVTSLTWHPLNESLFTSGGSDGSVVHWVVGLEE-- 340 (464)
T ss_pred ceEEEEEEcCCCCeeEEcc-CCceEEEEehh-HhHHHHHhhcchhhheeeccccccccceeeccCCCceEEEeccccc--
Confidence 3467888988875554443 33456655554 1111111 2233567778888999999888888887766654322
Q ss_pred cEEEEEeCCCCCceEEEEeCCCCEEEEEe
Q psy5768 392 RIVVVRLGQHDKPRGIDIDSCDSRIYWTN 420 (652)
Q Consensus 392 ~~~~~~~~~~~~P~~Iavdp~~g~Lywtd 420 (652)
....+.........+++.+| -|+|+-|-
T Consensus 341 p~~~i~~AHd~~iwsl~~hP-lGhil~tg 368 (464)
T KOG0284|consen 341 PLGEIPPAHDGEIWSLAYHP-LGHILATG 368 (464)
T ss_pred cccCCCcccccceeeeeccc-cceeEeec
Confidence 11222223345678889998 57777663
No 257
>KOG0303|consensus
Probab=25.49 E-value=8.6e+02 Score=26.07 Aligned_cols=159 Identities=15% Similarity=0.159 Sum_probs=87.6
Q ss_pred cceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCcc
Q psy5768 313 MKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQR 392 (652)
Q Consensus 313 ~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~ 392 (652)
.+.+.-|++++.-.-+..+....+.|..-+......-.-+..-.-+..|.+.+.+. ++.|-....+|.+++...
T Consensus 131 ~rrVg~V~wHPtA~NVLlsag~Dn~v~iWnv~tgeali~l~hpd~i~S~sfn~dGs-~l~TtckDKkvRv~dpr~----- 204 (472)
T KOG0303|consen 131 QRRVGLVQWHPTAPNVLLSAGSDNTVSIWNVGTGEALITLDHPDMVYSMSFNRDGS-LLCTTCKDKKVRVIDPRR----- 204 (472)
T ss_pred ceeEEEEeecccchhhHhhccCCceEEEEeccCCceeeecCCCCeEEEEEeccCCc-eeeeecccceeEEEcCCC-----
Confidence 35566677777777777777666777766665332111113333567888887655 456666677888887532
Q ss_pred EEEEEe----CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCc-eEEEEcCCCCCceEEE---ecCCCEEEEE
Q psy5768 393 IVVVRL----GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGT-ESIITTDITMPNALAL---DHQAEKLFWG 464 (652)
Q Consensus 393 ~~~~~~----~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~-~~l~~~~l~~P~glai---D~~~~~LYw~ 464 (652)
.+++.. .....+|+|-+.. |.++-|-...-+ .-+.+--|-.+- +-+...++..-+|+-+ |.+++.||.+
T Consensus 205 ~~~v~e~~~heG~k~~Raifl~~--g~i~tTGfsr~s-eRq~aLwdp~nl~eP~~~~elDtSnGvl~PFyD~dt~ivYl~ 281 (472)
T KOG0303|consen 205 GTVVSEGVAHEGAKPARAIFLAS--GKIFTTGFSRMS-ERQIALWDPNNLEEPIALQELDTSNGVLLPFYDPDTSIVYLC 281 (472)
T ss_pred CcEeeecccccCCCcceeEEecc--Cceeeecccccc-ccceeccCcccccCcceeEEeccCCceEEeeecCCCCEEEEE
Confidence 122211 2335566666654 556655433211 111111122211 1123334445555544 7778889988
Q ss_pred eCCCCeEEEEecCCCc
Q psy5768 465 DARLDKIERCDYDGTN 480 (652)
Q Consensus 465 D~~~~~I~~~~ldG~~ 480 (652)
-.+.+.|..+.+.-..
T Consensus 282 GKGD~~IRYyEit~d~ 297 (472)
T KOG0303|consen 282 GKGDSSIRYFEITNEP 297 (472)
T ss_pred ecCCcceEEEEecCCC
Confidence 8777788777776554
No 258
>KOG0270|consensus
Probab=24.77 E-value=9.2e+02 Score=26.16 Aligned_cols=162 Identities=14% Similarity=0.142 Sum_probs=0.0
Q ss_pred cccceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe-eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCC
Q psy5768 311 TMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL-ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPK 389 (652)
Q Consensus 311 ~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~-~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~ 389 (652)
.+-+.+.++++++.+..+..+-...+++...+.......-.. .-.+.++-+|.|+-.-+.|++....|++.-+++....
T Consensus 284 ~~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~~~s~~~wk~~g~VEkv~w~~~se~~f~~~tddG~v~~~D~R~~~ 363 (463)
T KOG0270|consen 284 HHGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDPSNSGKEWKFDGEVEKVAWDPHSENSFFVSTDDGTVYYFDIRNPG 363 (463)
T ss_pred hcCCceeEEEecCCCceEEEeccccceEEeeeccCccccCceEEeccceEEEEecCCCceeEEEecCCceEEeeecCCCC
Q ss_pred CccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCceEEEEcC--CCCCceEEEecCCCEEEEEeCC
Q psy5768 390 AQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGTESIITTD--ITMPNALALDHQAEKLFWGDAR 467 (652)
Q Consensus 390 ~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~~~l~~~~--l~~P~glaiD~~~~~LYw~D~~ 467 (652)
...-..........+|.+.+..-.|-.|..... .+...++++.+.+...... +..-.-++.|+...-+|.+-..
T Consensus 364 --~~vwt~~AHd~~ISgl~~n~~~p~~l~t~s~d~--~Vklw~~~~~~~~~v~~~~~~~~rl~c~~~~~~~a~~la~GG~ 439 (463)
T KOG0270|consen 364 --KPVWTLKAHDDEISGLSVNIQTPGLLSTASTDK--VVKLWKFDVDSPKSVKEHSFKLGRLHCFALDPDVAFTLAFGGE 439 (463)
T ss_pred --CceeEEEeccCCcceEEecCCCCcceeeccccc--eEEEEeecCCCCcccccccccccceeecccCCCcceEEEecCc
Q ss_pred CCeEEEEec
Q psy5768 468 LDKIERCDY 476 (652)
Q Consensus 468 ~~~I~~~~l 476 (652)
...+...++
T Consensus 440 k~~~~vwd~ 448 (463)
T KOG0270|consen 440 KAVLRVWDI 448 (463)
T ss_pred cceEEEeec
No 259
>KOG0772|consensus
Probab=24.76 E-value=9.1e+02 Score=26.85 Aligned_cols=63 Identities=8% Similarity=0.061 Sum_probs=45.5
Q ss_pred CCCeEEEEecCCCee-----EEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeC
Q psy5768 9 TQSKIVVCNLEGEYQ-----TTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQ 80 (652)
Q Consensus 9 ~~~~I~~~~~~g~~~-----~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~ 80 (652)
-..-|..+|+.|..- +.|.| ..-..+..+.|.+..+.|.+.. +...+..++.||......+..
T Consensus 187 ~Dy~v~~wDf~gMdas~~~fr~l~P------~E~h~i~sl~ys~Tg~~iLvvs---g~aqakl~DRdG~~~~e~~KG 254 (641)
T KOG0772|consen 187 LDYTVKFWDFQGMDASMRSFRQLQP------CETHQINSLQYSVTGDQILVVS---GSAQAKLLDRDGFEIVEFSKG 254 (641)
T ss_pred ccceEEEEecccccccchhhhccCc------ccccccceeeecCCCCeEEEEe---cCcceeEEccCCceeeeeecc
Confidence 345677888888632 23333 3457889999999999998876 777888899999976655443
No 260
>PF04885 Stig1: Stigma-specific protein, Stig1; InterPro: IPR006969 This family represents the Stig1 cysteine rich plant protein.The tobacco stigma-specific gene, STIG1 is developmentally regulated and expressed specifically in the stigmatic secretory zone. Pistils of transgenic STIG1-barnase tobacco plants undergo normal development, but lack the stigmatic secretory zone and are female sterile. Pollen grains are unable to penetrate the surface of the ablated pistils. Application of stigmatic exudate from wild-type pistils to the ablated surface increases the efficiency of pollen tube germination and growth and restores the capacity of pollen tubes to penetrate the style []. The function of STIG1 is unknown.
Probab=24.37 E-value=92 Score=28.00 Aligned_cols=50 Identities=28% Similarity=0.691 Sum_probs=28.4
Q ss_pred CcccCcccccCCCceeeccCeecCCccCCCCCCCCCCCCCCCCCCCCCCCCCCCeeecCCCCcc
Q psy5768 584 NRSCTINTVCSEHDFKCSDGMCIPFNQTCDRVYNCHDKSDEGILYCAMRDCRPGYFKCDNNKCI 647 (652)
Q Consensus 584 ~~C~~~~~~C~~~~f~C~~g~Ci~~~~~Cd~~~dC~d~sde~~~~C~~~~C~~~~f~C~~~~Ci 647 (652)
..|..-...|..++ .|=+|.|+... -...+|+ . |.+ .|++++ .|..|+|-
T Consensus 85 ~nCG~Cg~~C~~g~-~cC~G~Cvd~~---~d~~~CG------~--Cg~-~C~~G~-~C~~G~C~ 134 (136)
T PF04885_consen 85 NNCGACGNKCPYGQ-TCCGGQCVDLN---SDPRHCG------A--CGN-KCPPGQ-KCVYGMCG 134 (136)
T ss_pred cccHhhcCCCCCCc-eecCCEeECCC---CCccccC------C--CCC-cCCCcC-CcCCeECC
Confidence 44444344676665 44468898875 2333454 2 533 676654 47777773
No 261
>KOG4611|consensus
Probab=23.90 E-value=78 Score=32.65 Aligned_cols=20 Identities=30% Similarity=0.971 Sum_probs=13.7
Q ss_pred CCCCCCCCCCCeeecCCCCccc
Q psy5768 627 LYCAMRDCRPGYFKCDNNKCIL 648 (652)
Q Consensus 627 ~~C~~~~C~~~~f~C~~~~Ci~ 648 (652)
..|.+ |.+++++=+||-|..
T Consensus 98 afcgn--casgfyrndngyctk 117 (747)
T KOG4611|consen 98 AFCGN--CASGFYRNDNGYCTK 117 (747)
T ss_pred ccccc--ccccceECCCccccc
Confidence 45644 888888877877653
No 262
>KOG0276|consensus
Probab=23.74 E-value=1.1e+03 Score=26.84 Aligned_cols=104 Identities=21% Similarity=0.189 Sum_probs=78.5
Q ss_pred EEecCCCCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCc-cEEEEeCC
Q psy5768 4 AVSSPTQSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTK-RETVVSQK 81 (652)
Q Consensus 4 ~v~~~~~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~-~~~v~~~~ 81 (652)
+|+-++...|++++.. ++.+.+|-.+. ..+..|++||..-++. +. .....|..++.++.- -+..+...
T Consensus 70 iv~GsDD~~IrVfnynt~ekV~~FeAH~-------DyIR~iavHPt~P~vL-ts--SDDm~iKlW~we~~wa~~qtfeGH 139 (794)
T KOG0276|consen 70 IVTGSDDMQIRVFNYNTGEKVKTFEAHS-------DYIRSIAVHPTLPYVL-TS--SDDMTIKLWDWENEWACEQTFEGH 139 (794)
T ss_pred EEEecCCceEEEEecccceeeEEeeccc-------cceeeeeecCCCCeEE-ec--CCccEEEEeeccCceeeeeEEcCc
Confidence 5666788999999877 66677777542 5889999999876665 55 577899999998864 45556666
Q ss_pred CcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcE
Q psy5768 82 KYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYR 122 (652)
Q Consensus 82 ~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~ 122 (652)
-+.+- .+|+.+...+-|.+-+-...|.|-++.....
T Consensus 140 -~HyVM----qv~fnPkD~ntFaS~sLDrTVKVWslgs~~~ 175 (794)
T KOG0276|consen 140 -EHYVM----QVAFNPKDPNTFASASLDRTVKVWSLGSPHP 175 (794)
T ss_pred -ceEEE----EEEecCCCccceeeeeccccEEEEEcCCCCC
Confidence 78888 9999998888887777666777766655544
No 263
>KOG0196|consensus
Probab=23.58 E-value=75 Score=36.93 Aligned_cols=56 Identities=34% Similarity=0.825 Sum_probs=35.4
Q ss_pred eeeeccCceeeccC-CcccCcccccCCCceeeccC--eecCCccCCCCCCCCCCCC---CCCCCCCCCCCCCCCeeecC
Q psy5768 570 VVCSCFTGKVLMED-NRSCTINTVCSEHDFKCSDG--MCIPFNQTCDRVYNCHDKS---DEGILYCAMRDCRPGYFKCD 642 (652)
Q Consensus 570 ~~C~Cp~g~~l~~d-~~C~~~~~~C~~~~f~C~~g--~Ci~~~~~Cd~~~dC~d~s---de~~~~C~~~~C~~~~f~C~ 642 (652)
..|.|..||.-... ..|.+ |.++.|+=..| .|.+ |+-+| -|....| +|..++|+=.
T Consensus 259 G~C~C~aGye~~~~~~~C~a----Cp~G~yK~~~~~~~C~~----------CP~~S~s~~ega~~C---~C~~gyyRA~ 320 (996)
T KOG0196|consen 259 GGCVCKAGYEEAENGKACQA----CPPGTYKASQGDSLCLP----------CPPNSHSSSEGATSC---TCENGYYRAD 320 (996)
T ss_pred CceeecCCCCcccCCCccee----CCCCcccCCCCCCCCCC----------CCCCCCCCCCCCCcc---cccCCcccCC
Confidence 46999999988666 78875 88777775432 3432 43333 2444456 6777777664
No 264
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=23.44 E-value=1.4e+02 Score=19.99 Aligned_cols=24 Identities=13% Similarity=-0.013 Sum_probs=16.8
Q ss_pred EEEEeCCEEEEEcCCCCeEEEEEcc
Q psy5768 494 DMAVYGEFIFWTDWVIHAVLRANKY 518 (652)
Q Consensus 494 glav~~~~lYwtd~~~~~I~~~~k~ 518 (652)
++++.++.||..+. .+.++.++..
T Consensus 16 ~~~v~~g~vyv~~~-dg~l~ald~~ 39 (40)
T PF13570_consen 16 SPAVAGGRVYVGTG-DGNLYALDAA 39 (40)
T ss_dssp --EECTSEEEEE-T-TSEEEEEETT
T ss_pred CCEEECCEEEEEcC-CCEEEEEeCC
Confidence 45888999999986 6778877754
No 265
>PHA02887 EGF-like protein; Provisional
Probab=22.37 E-value=60 Score=28.05 Aligned_cols=38 Identities=24% Similarity=0.502 Sum_probs=23.7
Q ss_pred CCCCCCCCC-CCCCcc-cceecCC--CceEEEeCCccccCCCccc
Q psy5768 235 TGTNPCGVN-NGGCAE-LCLYNGV--SAVCACAHGVVAQDGKSCS 275 (652)
Q Consensus 235 ~~~n~C~~~-ng~Cs~-lC~~~~~--~~~C~C~~G~l~~dg~~C~ 275 (652)
....||... ++-|=| .|...+. .+.|.|+.|| .|..|.
T Consensus 81 ~hf~pC~~eyk~YCiHG~C~yI~dL~epsCrC~~GY---tG~RCE 122 (126)
T PHA02887 81 MFFEKCKNDFNDFCINGECMNIIDLDEKFCICNKGY---TGIRCD 122 (126)
T ss_pred cCccccChHhhCEeeCCEEEccccCCCceeECCCCc---ccCCCC
Confidence 446677542 334444 4554333 6899999999 466675
No 266
>PRK10115 protease 2; Provisional
Probab=22.34 E-value=1.3e+03 Score=26.98 Aligned_cols=190 Identities=9% Similarity=0.016 Sum_probs=0.0
Q ss_pred CeEEEecCCCCeEEEEec---CCCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcC-CCccEE
Q psy5768 1 MFIAVSSPTQSKIVVCNL---EGEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMD-GTKRET 76 (652)
Q Consensus 1 ~~i~v~~~~~~~I~~~~~---~g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~d-gs~~~~ 76 (652)
++|-+.+.....+..++. +++....+.. .......+. +..+...+.+|....+.+|.++.+. -...+.
T Consensus 237 l~i~~~~~~~~~~~l~~~~~~~~~~~~~~~~-------~~~~~~~~~-~~~~~ly~~tn~~~~~~~l~~~~~~~~~~~~~ 308 (686)
T PRK10115 237 VVIHLASATTSEVLLLDAELADAEPFVFLPR-------RKDHEYSLD-HYQHRFYLRSNRHGKNFGLYRTRVRDEQQWEE 308 (686)
T ss_pred EEEEEECCccccEEEEECcCCCCCceEEEEC-------CCCCEEEEE-eCCCEEEEEEcCCCCCceEEEecCCCcccCeE
Q ss_pred EEeCCCcCCccCCCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCcEEEEEeCCCCCceeEEEc--CCCCeEE--EEecCC
Q psy5768 77 VVSQKKYPAVTACNLHIAVDWIAQNIYWSDPKENVIEVARLTGQYRYVLISGGVDQPSALAVD--PESGYLF--WSESGK 152 (652)
Q Consensus 77 v~~~~~~~~p~~~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~~~~l~~~~~~~P~~iavd--~~~g~ly--wtd~~~ 152 (652)
++...+-..++ ++++.--.=-+...+.+..++.+.++++.....+.-........+... +..+.|+ ++++..
T Consensus 309 l~~~~~~~~i~----~~~~~~~~l~~~~~~~g~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ss~~~ 384 (686)
T PRK10115 309 LIPPRENIMLE----GFTLFTDWLVVEERQRGLTSLRQINRKTREVIGIAFDDPAYVTWIAYNPEPETSRLRYGYSSMTT 384 (686)
T ss_pred EECCCCCCEEE----EEEEECCEEEEEEEeCCEEEEEEEcCCCCceEEecCCCCceEeeecccCCCCCceEEEEEecCCC
Q ss_pred CCeEEEEeCCCCCcEEEEeecccCceeEEEeccCCEEEEEeCCCCcEEE-EEe
Q psy5768 153 IPLIARAGLDGKKQTILAQEIIMPIKDITLDLKFFSAFYRNLSKGNIHI-ISL 204 (652)
Q Consensus 153 ~~~I~~~~ldg~~~~~~~~~~~~~p~gl~lD~~~~~ly~~d~~g~~~~~-i~~ 204 (652)
.+.+.+.++++...+.+.......-..- +....++.+...||..+.. ++.
T Consensus 385 P~~~y~~d~~~~~~~~l~~~~~~~~~~~--~~~~e~v~~~s~DG~~Ip~~l~~ 435 (686)
T PRK10115 385 PDTLFELDMDTGERRVLKQTEVPGFDAA--NYRSEHLWITARDGVEVPVSLVY 435 (686)
T ss_pred CCEEEEEECCCCcEEEEEecCCCCcCcc--ccEEEEEEEECCCCCEEEEEEEE
No 267
>KOG0277|consensus
Probab=22.05 E-value=8.2e+02 Score=24.64 Aligned_cols=82 Identities=15% Similarity=0.245 Sum_probs=57.5
Q ss_pred CCceeeeeccccceEEEEEEEcCCCeEEEeecccccEEEEeccC-CcceEEeeccCceeeeEEEccCCEEEEEeCCCCeE
Q psy5768 302 SPFESIRNSTMMKNIIELSYDYKRKTLFYSDIQKGTINSVFFNG-SNHRVLLERQGSVEGLAYEYVHNYLYWTCNNDATI 380 (652)
Q Consensus 302 ~p~~~~~~~~~~~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g-~~~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I 380 (652)
.|+..+. ++-+.+.++++....++.+.+..-.++|.--..+- ...++.......+.+.++.+...+||-.-++.+..
T Consensus 95 ~Pi~~~k--EH~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~~sp~~~nlfas~Sgd~~l 172 (311)
T KOG0277|consen 95 KPIHKFK--EHKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNSVQTFNGHNSCIYQAAFSPHIPNLFASASGDGTL 172 (311)
T ss_pred cchhHHH--hhhhheEEeccccccceeEEeeccCCceEeecCCCCcceEeecCCccEEEEEecCCCCCCeEEEccCCceE
Confidence 3554443 45577899999999999999988888887666543 22333334445678888999999999887777765
Q ss_pred EEEEc
Q psy5768 381 NKIDL 385 (652)
Q Consensus 381 ~~~~~ 385 (652)
..-++
T Consensus 173 ~lwdv 177 (311)
T KOG0277|consen 173 RLWDV 177 (311)
T ss_pred EEEEe
Confidence 54454
No 268
>KOG4649|consensus
Probab=21.64 E-value=8.5e+02 Score=24.66 Aligned_cols=53 Identities=19% Similarity=-0.016 Sum_probs=27.6
Q ss_pred eEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcEEEEeecccCceeEEEeccCCEE
Q psy5768 135 ALAVDPESGYLFWSESGKIPLIARAGLDGKKQTILAQEIIMPIKDITLDLKFFSA 189 (652)
Q Consensus 135 ~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~~~~~~~~~~p~gl~lD~~~~~l 189 (652)
-++.-|..|.|-|.+. .+...++.++.+-+..+.+.+...|.=|+....++++
T Consensus 242 f~~~~p~~ghL~w~~~--~g~t~~vy~~p~l~F~~h~~~~S~~~ll~~~s~dgkv 294 (354)
T KOG4649|consen 242 FCAPLPIAGHLLWATQ--SGTTLHVYLSPKLRFDLHSPGISYPKLLRRSSGDGKV 294 (354)
T ss_pred EEEeccccceEEEEec--CCcEEEEEeCcccceeccCCCCcchhhhhhhcCCCcE
Confidence 5667777888888863 2244555555544444333333334444444434443
No 269
>KOG0266|consensus
Probab=21.63 E-value=1.1e+03 Score=25.85 Aligned_cols=155 Identities=12% Similarity=0.108 Sum_probs=86.1
Q ss_pred EEEEEEEcCCCeEEEeecccccEEEEeccCCc---ceEEeeccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCcc
Q psy5768 316 IIELSYDYKRKTLFYSDIQKGTINSVFFNGSN---HRVLLERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQR 392 (652)
Q Consensus 316 ~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~---~~~i~~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~~ 392 (652)
+..++|.+..+. +++-...+.|....+.+.. ...+......+.++++-+.++ .-.+-+...+|.+-++... .
T Consensus 162 v~~~~fs~~g~~-l~~~~~~~~i~~~~~~~~~~~~~~~l~~h~~~v~~~~fs~d~~-~l~s~s~D~tiriwd~~~~---~ 236 (456)
T KOG0266|consen 162 VTCVDFSPDGRA-LAAASSDGLIRIWKLEGIKSNLLRELSGHTRGVSDVAFSPDGS-YLLSGSDDKTLRIWDLKDD---G 236 (456)
T ss_pred eEEEEEcCCCCe-EEEccCCCcEEEeecccccchhhccccccccceeeeEECCCCc-EEEEecCCceEEEeeccCC---C
Confidence 444555555544 3333333444444442222 111223344678888888776 3345556677777776221 1
Q ss_pred EEEEEe-CCCCCceEEEEeCCCCEEEEEecCCCCCceEEEeec-CCCceEEEEcCCCCCceEEEecCCCEEEEEeCCCCe
Q psy5768 393 IVVVRL-GQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFS-GFGTESIITTDITMPNALALDHQAEKLFWGDARLDK 470 (652)
Q Consensus 393 ~~~~~~-~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ld-G~~~~~l~~~~l~~P~glaiD~~~~~LYw~D~~~~~ 470 (652)
..+..+ +-.....+++++|.. .+..+-.... .|.-.++. |+-.+++.. .-...+++++..++..|.-+ ...+.
T Consensus 237 ~~~~~l~gH~~~v~~~~f~p~g-~~i~Sgs~D~--tvriWd~~~~~~~~~l~~-hs~~is~~~f~~d~~~l~s~-s~d~~ 311 (456)
T KOG0266|consen 237 RNLKTLKGHSTYVTSVAFSPDG-NLLVSGSDDG--TVRIWDVRTGECVRKLKG-HSDGISGLAFSPDGNLLVSA-SYDGT 311 (456)
T ss_pred eEEEEecCCCCceEEEEecCCC-CEEEEecCCC--cEEEEeccCCeEEEeeec-cCCceEEEEECCCCCEEEEc-CCCcc
Confidence 222222 344666899999976 6666554443 44444444 333344433 33456788888776666555 66788
Q ss_pred EEEEecCCCc
Q psy5768 471 IERCDYDGTN 480 (652)
Q Consensus 471 I~~~~ldG~~ 480 (652)
|...|+.+..
T Consensus 312 i~vwd~~~~~ 321 (456)
T KOG0266|consen 312 IRVWDLETGS 321 (456)
T ss_pred EEEEECCCCc
Confidence 9999988766
No 270
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=21.63 E-value=8.7e+02 Score=24.74 Aligned_cols=27 Identities=19% Similarity=0.166 Sum_probs=21.7
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEecC
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILSN 29 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~~ 29 (652)
+++.+.+++.|++||+.|.....|.+.
T Consensus 57 lLa~a~S~G~i~vfdl~g~~lf~I~p~ 83 (282)
T PF15492_consen 57 LLAYAESTGTIRVFDLMGSELFVIPPA 83 (282)
T ss_pred EEEEEcCCCeEEEEecccceeEEcCcc
Confidence 455566889999999999998888763
No 271
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=21.61 E-value=1.3e+03 Score=26.69 Aligned_cols=163 Identities=11% Similarity=0.118 Sum_probs=88.6
Q ss_pred eeeeEEEccCCEEEEEeCCC----CeEEEEEcCCCCCccEEEEEeCCCCCceEEEEeCCCCEEEEEecCCCCCceEEEee
Q psy5768 358 VEGLAYEYVHNYLYWTCNND----ATINKIDLDSPKAQRIVVVRLGQHDKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFF 433 (652)
Q Consensus 358 ~~glAvDw~~~~LYwtd~~~----~~I~~~~~~~~~~~~~~~~~~~~~~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~l 433 (652)
..+.+....++.||++.... .++++-.+.+.....+.+....+-..-.++--.....+|+..-.+.....|.-..+
T Consensus 176 ~~~~~Wa~d~~~lfYt~~d~~~rp~kv~~h~~gt~~~~d~lvyeE~d~~f~~~v~~s~s~~yi~i~~~~~~tsE~~ll~a 255 (682)
T COG1770 176 SGSFAWAADGKTLFYTRLDENHRPDKVWRHRLGTPGSSDELVYEEKDDRFFLSVGRSRSEAYIVISLGSHITSEVRLLDA 255 (682)
T ss_pred ccceEEecCCCeEEEEEEcCCCCcceEEEEecCCCCCcceEEEEcCCCcEEEEeeeccCCceEEEEcCCCcceeEEEEec
Confidence 56677777788899987543 46777776552222333443233233334444455677777654444445665555
Q ss_pred cCCC--ceEEEEcCCCCCceEEEe--cCCCEEEEEe---CCCCeEEEEec--C-CCceEEEecCCCCceeEEEEeCCEEE
Q psy5768 434 SGFG--TESIITTDITMPNALALD--HQAEKLFWGD---ARLDKIERCDY--D-GTNRIVLSKISPLHPFDMAVYGEFIF 503 (652)
Q Consensus 434 dG~~--~~~l~~~~l~~P~glaiD--~~~~~LYw~D---~~~~~I~~~~l--d-G~~~~~l~~~~~~~p~glav~~~~lY 503 (652)
+-.. .+++.. .++|+..+ +-+++.|.-- ...-+|.+.-. + -..+..+.........++.+|.++|.
T Consensus 256 ~~p~~~p~vv~p----r~~g~eY~~eh~~d~f~i~sN~~gknf~l~~ap~~~~~~~w~~~I~h~~~~~l~~~~~f~~~lV 331 (682)
T COG1770 256 DDPEAEPKVVLP----RENGVEYSVEHGGDRFYILSNADGKNFKLVRAPVSADKSNWRELIPHREDVRLEGVDLFADHLV 331 (682)
T ss_pred CCCCCceEEEEE----cCCCcEEeeeecCcEEEEEecCCCcceEEEEccCCCChhcCeeeeccCCCceeeeeeeeccEEE
Confidence 4332 333332 45666554 4455555542 22334555544 1 12233333333344567788999999
Q ss_pred EEcCCCC--eEEEEEccCCceEE
Q psy5768 504 WTDWVIH--AVLRANKYTGEEVY 524 (652)
Q Consensus 504 wtd~~~~--~I~~~~k~~g~~~~ 524 (652)
|.....+ .|...+..+|+...
T Consensus 332 l~eR~~glp~v~v~~~~~~~~~~ 354 (682)
T COG1770 332 LLERQEGLPRVVVRDRKTGEERG 354 (682)
T ss_pred EEecccCCceEEEEecCCCceee
Confidence 9876654 56666666666544
No 272
>KOG3658|consensus
Probab=21.28 E-value=80 Score=35.76 Aligned_cols=17 Identities=41% Similarity=0.784 Sum_probs=12.4
Q ss_pred cccCCCceeeccCeecCC
Q psy5768 591 TVCSEHDFKCSDGMCIPF 608 (652)
Q Consensus 591 ~~C~~~~f~C~~g~Ci~~ 608 (652)
..|. ..+.|.+|.|++.
T Consensus 564 t~C~-~~~~C~~G~C~gs 580 (764)
T KOG3658|consen 564 TVCN-ETGVCINGKCIGS 580 (764)
T ss_pred Cccc-ccceEeCCcCccH
Confidence 3454 5678999999985
No 273
>COG4222 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.18 E-value=1e+03 Score=25.67 Aligned_cols=64 Identities=14% Similarity=0.086 Sum_probs=44.1
Q ss_pred CCceEEEEeCCCCEEEEEec----CCCCCceEEEeecCCCceEEEEcC----------CC---CCceEEEecCCCEEEEE
Q psy5768 402 DKPRGIDIDSCDSRIYWTNW----NSHLPSIQRAFFSGFGTESIITTD----------IT---MPNALALDHQAEKLFWG 464 (652)
Q Consensus 402 ~~P~~Iavdp~~g~Lywtd~----~~~~~~I~r~~ldG~~~~~l~~~~----------l~---~P~glaiD~~~~~LYw~ 464 (652)
..|.++++.+....+.++.. ....|-|.+.+++|+..+.+.... +. .=.||++.+...+||=+
T Consensus 138 ~~~~~ralt~~d~~~~s~~~~~igdefgP~l~~f~~~Gk~~~~~~~~~~~~~~~~p~g~~~n~gfEglait~d~~~L~~~ 217 (391)
T COG4222 138 EDPEGRALTPADFDVESSQGAWIGDEFGPYLLEFDANGKLVRVLEVPVRFLPPDNPKGLRNNLGFEGLAITPDGKKLYAL 217 (391)
T ss_pred cCchhhcccCCCcceeeccccccccccCcceEEECCCCccccccccccccCcCCCccccccccceeeEEecCCCceEEEE
Confidence 45667788886666666654 233689999999998877665311 11 12379999999999976
Q ss_pred e
Q psy5768 465 D 465 (652)
Q Consensus 465 D 465 (652)
=
T Consensus 218 l 218 (391)
T COG4222 218 L 218 (391)
T ss_pred E
Confidence 3
No 274
>KOG0772|consensus
Probab=20.77 E-value=6.5e+02 Score=27.90 Aligned_cols=67 Identities=13% Similarity=0.100 Sum_probs=45.9
Q ss_pred EEEecCCCCeEEEEecCCCeeEEEec------CCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCC
Q psy5768 3 IAVSSPTQSKIVVCNLEGEYQTTILS------NESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGT 72 (652)
Q Consensus 3 i~v~~~~~~~I~~~~~~g~~~~~~~~------~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs 72 (652)
|+|.+ +...+.++|.+|..+..+.- +-.+..+....++...|+|.+...|.+- ...+++..++.+.+
T Consensus 229 iLvvs-g~aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~--s~DgtlRiWdv~~~ 301 (641)
T KOG0772|consen 229 ILVVS-GSAQAKLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTC--SYDGTLRIWDVNNT 301 (641)
T ss_pred EEEEe-cCcceeEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEe--cCCCcEEEEecCCc
Confidence 34443 55666788999987766531 1123344556777888999999999887 66777777877765
No 275
>KOG4378|consensus
Probab=20.61 E-value=1.2e+03 Score=25.88 Aligned_cols=156 Identities=12% Similarity=0.131 Sum_probs=89.8
Q ss_pred ceEEEEEEEcCCCeEEEeecccccEEEEeccCCcceEEe--eccCceeeeEEEccCCEEEEEeCCCCeEEEEEcCCCCCc
Q psy5768 314 KNIIELSYDYKRKTLFYSDIQKGTINSVFFNGSNHRVLL--ERQGSVEGLAYEYVHNYLYWTCNNDATINKIDLDSPKAQ 391 (652)
Q Consensus 314 ~~~~~v~~D~~~~~lywsd~~~~~I~~~~~~g~~~~~i~--~~~~~~~glAvDw~~~~LYwtd~~~~~I~~~~~~~~~~~ 391 (652)
..+..|+|++....|--.... |.|....+......+-+ .....+.=|.+....+.|.-+-+..+.+..-++.|..
T Consensus 122 stvt~v~YN~~DeyiAsvs~g-Gdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g~s-- 198 (673)
T KOG4378|consen 122 STVTYVDYNNTDEYIASVSDG-GDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQGMS-- 198 (673)
T ss_pred ceeEEEEecCCcceeEEeccC-CcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccCCeEEEEeccCCC--
Confidence 457889998877654433332 44443333222222222 2233455678888889999888889999988988753
Q ss_pred cEEEEEe-CCC-CCceEEEEeCCCCEEEEEecCCCCCceEEEeecCCCc-eEEEEcCCCCC-ceEEEecCCCEEEEEeCC
Q psy5768 392 RIVVVRL-GQH-DKPRGIDIDSCDSRIYWTNWNSHLPSIQRAFFSGFGT-ESIITTDITMP-NALALDHQAEKLFWGDAR 467 (652)
Q Consensus 392 ~~~~~~~-~~~-~~P~~Iavdp~~g~Lywtd~~~~~~~I~r~~ldG~~~-~~l~~~~l~~P-~glaiD~~~~~LYw~D~~ 467 (652)
..... +.- .--++|.+.|.+-.|+++-.-.. +|.-.+...... ..|+- ..| ..+++-.. +....+-..
T Consensus 199 --p~~~~~~~HsAP~~gicfspsne~l~vsVG~Dk--ki~~yD~~s~~s~~~l~y---~~Plstvaf~~~-G~~L~aG~s 270 (673)
T KOG4378|consen 199 --PIFHASEAHSAPCRGICFSPSNEALLVSVGYDK--KINIYDIRSQASTDRLTY---SHPLSTVAFSEC-GTYLCAGNS 270 (673)
T ss_pred --cccchhhhccCCcCcceecCCccceEEEecccc--eEEEeecccccccceeee---cCCcceeeecCC-ceEEEeecC
Confidence 22222 122 34478999999999998865433 565444432111 11111 233 35555533 444444455
Q ss_pred CCeEEEEecCCCc
Q psy5768 468 LDKIERCDYDGTN 480 (652)
Q Consensus 468 ~~~I~~~~ldG~~ 480 (652)
.++|.-+|+.+..
T Consensus 271 ~G~~i~YD~R~~k 283 (673)
T KOG4378|consen 271 KGELIAYDMRSTK 283 (673)
T ss_pred CceEEEEecccCC
Confidence 6778888888753
No 276
>KOG0282|consensus
Probab=20.45 E-value=1.2e+03 Score=25.72 Aligned_cols=168 Identities=10% Similarity=0.003 Sum_probs=82.9
Q ss_pred CCeEEEEecC-CCeeEEEecCCCCCCCCCCCeeEEEEECCCCEEEEEeccCCcceEEEEEcCCCccEEEEeCCCcCCccC
Q psy5768 10 QSKIVVCNLE-GEYQTTILSNESNDTSTLSKISSIAVWPVKGKMFWSNVTKQVVTIEMAFMDGTKRETVVSQKKYPAVTA 88 (652)
Q Consensus 10 ~~~I~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~v~~d~~~~~lyw~d~~~~~~~I~~~~~dgs~~~~v~~~~~~~~p~~ 88 (652)
...|..+|.+ |.....|-.. ..|.-|-++|.+..+|++- ..+++|..+++.......-+... ++...
T Consensus 279 D~~lKlwDtETG~~~~~f~~~--------~~~~cvkf~pd~~n~fl~G--~sd~ki~~wDiRs~kvvqeYd~h-Lg~i~- 346 (503)
T KOG0282|consen 279 DRFLKLWDTETGQVLSRFHLD--------KVPTCVKFHPDNQNIFLVG--GSDKKIRQWDIRSGKVVQEYDRH-LGAIL- 346 (503)
T ss_pred ceeeeeeccccceEEEEEecC--------CCceeeecCCCCCcEEEEe--cCCCcEEEEeccchHHHHHHHhh-hhhee-
Confidence 4556666666 5555555543 4678899999998999988 88899999987655322222333 55555
Q ss_pred CCCcEEEEccCCcEEEEeCCCCEEEEEEcCCCc-EEEEEeCCCCCceeEEEcCCCCeEEEEecCCCCeEEEEeCCCCCcE
Q psy5768 89 CNLHIAVDWIAQNIYWSDPKENVIEVARLTGQY-RYVLISGGVDQPSALAVDPESGYLFWSESGKIPLIARAGLDGKKQT 167 (652)
Q Consensus 89 ~~~~lavDw~~~~lY~~d~~~~~I~v~~~dg~~-~~~l~~~~~~~P~~iavd~~~g~lywtd~~~~~~I~~~~ldg~~~~ 167 (652)
.|.+=. .++=|++-+..+.+.+-+.+-.. .+-++......=..+++.| +|..|-...- ...|..+...-..+.
T Consensus 347 ---~i~F~~-~g~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P-~~~~~~aQs~-dN~i~ifs~~~~~r~ 420 (503)
T KOG0282|consen 347 ---DITFVD-EGRRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHP-NGKWFAAQSM-DNYIAIFSTVPPFRL 420 (503)
T ss_pred ---eeEEcc-CCceEeeeccCccEEEEEcCCCccchhhcchhhccCcceecCC-CCCeehhhcc-CceEEEEeccccccc
Confidence 554422 22223333333333333222110 0111111122233678888 4555555432 345555554333332
Q ss_pred EEEe-----ecccCceeEEEeccCCEEEEEeCC
Q psy5768 168 ILAQ-----EIIMPIKDITLDLKFFSAFYRNLS 195 (652)
Q Consensus 168 ~~~~-----~~~~~p~gl~lD~~~~~ly~~d~~ 195 (652)
..-. .--+.+..+...+.++.|.-=|.+
T Consensus 421 nkkK~feGh~vaGys~~v~fSpDG~~l~SGdsd 453 (503)
T KOG0282|consen 421 NKKKRFEGHSVAGYSCQVDFSPDGRTLCSGDSD 453 (503)
T ss_pred CHhhhhcceeccCceeeEEEcCCCCeEEeecCC
Confidence 1100 002345555555555444333333
Done!