Query psy6570
Match_columns 713
No_of_seqs 586 out of 3937
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 21:30:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6570.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6570hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 100.0 9.1E-34 2E-38 297.3 18.6 238 2-247 1042-1283(1289)
2 KOG1214|consensus 99.9 8.2E-24 1.8E-28 223.0 16.9 205 6-215 998-1211(1289)
3 KOG1215|consensus 99.9 3.4E-21 7.4E-26 227.7 24.0 254 2-262 454-710 (877)
4 PF08450 SGL: SMP-30/Gluconola 99.8 6.7E-17 1.5E-21 162.7 20.9 200 1-210 16-233 (246)
5 KOG4289|consensus 99.7 3.9E-17 8.5E-22 180.7 16.1 120 496-628 1707-1838(2531)
6 PLN02919 haloacid dehalogenase 99.7 1E-15 2.2E-20 181.2 25.4 189 1-193 584-837 (1057)
7 KOG4289|consensus 99.7 1.9E-16 4.2E-21 175.3 16.0 87 365-451 1218-1317(2531)
8 KOG0994|consensus 99.7 2.4E-15 5.2E-20 164.3 19.0 309 240-586 792-1145(1758)
9 KOG1225|consensus 99.7 2.1E-15 4.5E-20 159.8 16.5 129 494-667 233-365 (525)
10 KOG1217|consensus 99.6 5.9E-14 1.3E-18 156.6 27.4 295 284-620 100-422 (487)
11 KOG1217|consensus 99.6 3.9E-14 8.5E-19 158.0 22.5 254 368-666 109-389 (487)
12 PLN02919 haloacid dehalogenase 99.6 6.5E-14 1.4E-18 166.0 25.1 188 1-193 640-892 (1057)
13 KOG1219|consensus 99.6 1.1E-15 2.4E-20 175.2 9.2 155 548-712 3860-4020(4289)
14 KOG0994|consensus 99.6 3.1E-14 6.7E-19 155.8 14.3 253 369-628 782-1147(1758)
15 KOG1219|consensus 99.5 1.7E-14 3.7E-19 165.8 11.2 112 506-628 3865-3978(4289)
16 KOG1225|consensus 99.5 2.2E-13 4.8E-18 144.5 15.0 175 292-485 159-342 (525)
17 KOG1215|consensus 99.5 1.5E-12 3.1E-17 154.6 20.1 207 6-216 416-625 (877)
18 PF08450 SGL: SMP-30/Gluconola 99.4 6.6E-12 1.4E-16 126.4 20.1 150 28-191 2-166 (246)
19 COG3386 Gluconolactonase [Carb 99.4 1.4E-11 2.9E-16 125.4 20.6 200 1-209 41-262 (307)
20 KOG1226|consensus 99.4 1.4E-12 3E-17 140.1 10.2 141 471-644 479-637 (783)
21 PF10282 Lactonase: Lactonase, 99.2 5.6E-09 1.2E-13 110.4 24.0 184 2-189 104-322 (345)
22 COG3391 Uncharacterized conser 99.1 2.1E-08 4.7E-13 107.0 22.5 187 5-195 94-289 (381)
23 PF10282 Lactonase: Lactonase, 99.0 3.1E-08 6.6E-13 104.8 23.0 205 6-214 61-304 (345)
24 COG3386 Gluconolactonase [Carb 99.0 2E-08 4.3E-13 102.5 20.3 151 24-181 109-278 (307)
25 KOG4659|consensus 99.0 3.6E-09 7.9E-14 118.5 15.7 177 6-191 383-624 (1899)
26 TIGR02604 Piru_Ver_Nterm putat 99.0 1.1E-08 2.4E-13 108.9 19.0 159 19-187 5-211 (367)
27 KOG1520|consensus 99.0 2.3E-09 4.9E-14 108.2 10.5 147 23-177 112-281 (376)
28 PRK11028 6-phosphogluconolacto 99.0 7.1E-08 1.5E-12 101.7 22.3 181 5-189 55-258 (330)
29 PRK11028 6-phosphogluconolacto 99.0 6.3E-08 1.4E-12 102.1 21.7 182 2-189 7-205 (330)
30 COG3391 Uncharacterized conser 98.9 6.1E-08 1.3E-12 103.6 20.6 183 25-214 73-265 (381)
31 KOG4659|consensus 98.9 8.3E-09 1.8E-13 115.7 13.7 154 24-188 473-690 (1899)
32 KOG1226|consensus 98.9 5.2E-09 1.1E-13 113.1 10.2 136 512-672 468-623 (783)
33 PF06977 SdiA-regulated: SdiA- 98.8 6.3E-07 1.4E-11 88.3 20.4 176 6-186 43-247 (248)
34 KOG4260|consensus 98.8 6.1E-09 1.3E-13 98.0 5.9 141 333-479 132-304 (350)
35 PF00058 Ldl_recept_b: Low-den 98.8 1.9E-08 4.2E-13 69.4 5.8 42 84-125 1-42 (42)
36 TIGR02604 Piru_Ver_Nterm putat 98.7 4.3E-07 9.3E-12 96.8 18.3 151 27-181 125-342 (367)
37 TIGR03866 PQQ_ABC_repeats PQQ- 98.7 3.6E-06 7.7E-11 87.1 24.6 196 5-209 93-298 (300)
38 PF06977 SdiA-regulated: SdiA- 98.7 1.5E-06 3.2E-11 85.8 18.6 184 24-213 20-233 (248)
39 KOG1836|consensus 98.7 3.5E-07 7.6E-12 110.5 16.9 243 371-628 697-1022(1705)
40 KOG1836|consensus 98.7 3.9E-07 8.5E-12 110.1 17.3 273 285-589 697-1023(1705)
41 PF00058 Ldl_recept_b: Low-den 98.6 8.7E-08 1.9E-12 66.1 5.9 41 38-81 1-42 (42)
42 PF07995 GSDH: Glucose / Sorbo 98.6 9.5E-07 2.1E-11 92.4 16.0 148 25-181 1-203 (331)
43 PF03088 Str_synth: Strictosid 98.6 2.2E-07 4.7E-12 75.2 7.5 73 75-148 1-89 (89)
44 TIGR03866 PQQ_ABC_repeats PQQ- 98.6 1.5E-05 3.3E-10 82.3 23.2 182 2-192 6-190 (300)
45 KOG4499|consensus 98.5 4.5E-06 9.7E-11 78.1 16.3 129 72-203 109-255 (310)
46 COG4257 Vgb Streptogramin lyas 98.5 7.3E-06 1.6E-10 78.6 17.8 207 1-216 77-290 (353)
47 COG2706 3-carboxymuconate cycl 98.5 3.5E-05 7.7E-10 77.0 23.0 156 28-187 147-319 (346)
48 TIGR03606 non_repeat_PQQ dehyd 98.5 1E-05 2.2E-10 86.4 20.6 156 18-180 22-250 (454)
49 KOG4260|consensus 98.5 9.9E-08 2.1E-12 90.0 4.3 149 499-664 132-304 (350)
50 COG4257 Vgb Streptogramin lyas 98.4 1.9E-05 4.2E-10 75.8 17.0 187 2-199 120-314 (353)
51 KOG1520|consensus 98.4 6.4E-06 1.4E-10 83.6 14.7 116 72-191 115-251 (376)
52 PF07995 GSDH: Glucose / Sorbo 98.4 2.3E-05 5E-10 82.1 18.8 184 5-191 20-291 (331)
53 TIGR02658 TTQ_MADH_Hv methylam 98.4 6.9E-05 1.5E-09 77.8 21.8 183 5-197 75-338 (352)
54 PF03022 MRJP: Major royal jel 98.3 3.9E-05 8.5E-10 78.2 19.3 166 7-178 34-254 (287)
55 TIGR02658 TTQ_MADH_Hv methylam 98.3 0.00013 2.8E-09 75.7 22.6 202 7-214 27-311 (352)
56 PF03088 Str_synth: Strictosid 98.3 2.5E-06 5.4E-11 69.1 7.7 74 29-104 1-88 (89)
57 COG2706 3-carboxymuconate cycl 98.3 0.00019 4.1E-09 71.9 21.2 183 6-190 15-222 (346)
58 PF02239 Cytochrom_D1: Cytochr 98.2 5.7E-05 1.2E-09 80.1 18.4 187 2-194 11-207 (369)
59 smart00135 LY Low-density lipo 98.2 3E-06 6.6E-11 59.2 5.9 41 109-149 2-42 (43)
60 smart00135 LY Low-density lipo 98.2 3.4E-06 7.4E-11 58.9 5.4 40 66-106 3-42 (43)
61 COG3204 Uncharacterized protei 98.2 0.00015 3.3E-09 70.8 17.7 176 7-188 108-311 (316)
62 PF03022 MRJP: Major royal jel 98.1 0.00025 5.5E-09 72.3 18.1 179 28-211 3-243 (287)
63 KOG4499|consensus 98.0 0.00013 2.8E-09 68.5 14.1 124 6-134 138-273 (310)
64 COG3204 Uncharacterized protei 98.0 0.00047 1E-08 67.5 17.8 162 24-191 84-265 (316)
65 TIGR03118 PEPCTERM_chp_1 conse 97.9 0.00058 1.3E-08 67.3 17.2 187 19-210 16-252 (336)
66 PRK04792 tolB translocation pr 97.9 0.0011 2.4E-08 72.7 21.6 179 7-190 242-426 (448)
67 PRK04043 tolB translocation pr 97.9 0.0017 3.8E-08 70.2 21.8 182 6-194 212-405 (419)
68 PRK04792 tolB translocation pr 97.8 0.0019 4.1E-08 70.9 21.5 180 8-191 199-384 (448)
69 PRK05137 tolB translocation pr 97.8 0.0023 5.1E-08 70.1 21.7 182 6-192 225-415 (435)
70 PRK04922 tolB translocation pr 97.8 0.0024 5.1E-08 70.0 21.7 180 7-191 228-413 (433)
71 PRK04922 tolB translocation pr 97.8 0.0021 4.5E-08 70.4 21.1 180 8-191 185-370 (433)
72 PF06247 Plasmod_Pvs28: Plasmo 97.7 4.4E-06 9.5E-11 75.3 -1.0 130 517-667 11-163 (197)
73 TIGR03118 PEPCTERM_chp_1 conse 97.7 0.0034 7.5E-08 62.0 18.7 123 74-202 140-292 (336)
74 PF00008 EGF: EGF-like domain 97.7 1.8E-05 4E-10 50.8 2.0 30 555-584 1-31 (32)
75 PRK05137 tolB translocation pr 97.7 0.0042 9E-08 68.1 21.9 182 6-191 181-368 (435)
76 PRK02889 tolB translocation pr 97.7 0.0051 1.1E-07 67.2 22.0 181 7-191 176-362 (427)
77 PRK03629 tolB translocation pr 97.7 0.0059 1.3E-07 66.6 22.2 180 7-191 223-408 (429)
78 PRK03629 tolB translocation pr 97.7 0.0063 1.4E-07 66.4 22.3 181 7-191 179-365 (429)
79 PHA03099 epidermal growth fact 97.7 8.2E-05 1.8E-09 62.3 5.6 34 634-672 51-84 (139)
80 PRK04043 tolB translocation pr 97.7 0.0055 1.2E-07 66.4 21.1 180 6-192 168-360 (419)
81 PRK00178 tolB translocation pr 97.7 0.0058 1.3E-07 66.9 21.7 180 8-191 180-365 (430)
82 TIGR03606 non_repeat_PQQ dehyd 97.6 0.003 6.4E-08 67.9 18.3 111 65-177 23-163 (454)
83 PF07645 EGF_CA: Calcium-bindi 97.6 4.5E-05 9.8E-10 52.7 2.6 32 219-250 1-34 (42)
84 PF02239 Cytochrom_D1: Cytochr 97.6 0.0033 7.2E-08 66.7 18.0 148 5-156 56-212 (369)
85 COG2133 Glucose/sorbosone dehy 97.6 0.0015 3.3E-08 68.4 14.8 160 27-189 178-397 (399)
86 TIGR02800 propeller_TolB tol-p 97.6 0.01 2.2E-07 64.7 22.2 168 7-179 214-385 (417)
87 PRK00178 tolB translocation pr 97.6 0.0095 2.1E-07 65.3 21.9 179 7-190 223-407 (430)
88 cd00200 WD40 WD40 domain, foun 97.6 0.016 3.4E-07 58.3 22.0 183 6-197 72-257 (289)
89 PRK02889 tolB translocation pr 97.5 0.011 2.5E-07 64.5 21.8 178 7-189 220-403 (427)
90 TIGR02800 propeller_TolB tol-p 97.5 0.01 2.2E-07 64.7 21.6 181 6-190 169-355 (417)
91 PF00008 EGF: EGF-like domain 97.5 5.7E-05 1.2E-09 48.5 2.2 30 223-252 1-31 (32)
92 PF01436 NHL: NHL repeat; Int 97.5 0.00021 4.5E-09 44.4 4.1 28 115-143 1-28 (28)
93 smart00179 EGF_CA Calcium-bind 97.4 0.0002 4.3E-09 48.6 4.1 36 552-587 2-39 (39)
94 PF06247 Plasmod_Pvs28: Plasmo 97.4 3.4E-05 7.3E-10 69.7 0.2 128 315-445 7-160 (197)
95 PRK01029 tolB translocation pr 97.4 0.025 5.5E-07 61.6 22.3 185 7-193 211-407 (428)
96 PF05096 Glu_cyclase_2: Glutam 97.4 0.007 1.5E-07 59.6 15.8 131 6-145 109-260 (264)
97 PRK02888 nitrous-oxide reducta 97.4 0.011 2.5E-07 65.0 18.6 184 3-188 148-403 (635)
98 PRK01742 tolB translocation pr 97.4 0.018 3.9E-07 63.0 20.8 176 7-191 184-363 (429)
99 PF05096 Glu_cyclase_2: Glutam 97.3 0.03 6.5E-07 55.2 19.0 161 28-197 47-211 (264)
100 PRK01029 tolB translocation pr 97.3 0.037 8E-07 60.3 21.8 183 6-192 164-362 (428)
101 PF07645 EGF_CA: Calcium-bindi 97.3 0.00017 3.7E-09 49.8 2.1 31 552-582 2-34 (42)
102 cd00200 WD40 WD40 domain, foun 97.2 0.058 1.3E-06 54.1 21.6 175 6-191 30-209 (289)
103 COG2133 Glucose/sorbosone dehy 97.2 0.0041 8.9E-08 65.2 12.7 127 19-147 232-398 (399)
104 COG4946 Uncharacterized protei 97.2 0.023 4.9E-07 59.0 17.3 126 49-178 380-508 (668)
105 KOG1218|consensus 97.2 0.068 1.5E-06 55.8 21.5 111 441-582 96-208 (316)
106 TIGR03032 conserved hypothetic 97.1 0.033 7E-07 55.8 16.8 184 22-216 99-319 (335)
107 PF12947 EGF_3: EGF domain; I 97.1 0.00032 6.9E-09 46.2 1.9 27 227-253 7-33 (36)
108 TIGR03032 conserved hypothetic 97.1 0.053 1.1E-06 54.4 17.8 180 2-195 23-237 (335)
109 cd00054 EGF_CA Calcium-binding 97.0 0.00097 2.1E-08 44.7 4.0 34 553-586 3-37 (38)
110 PRK01742 tolB translocation pr 97.0 0.1 2.2E-06 57.1 21.6 162 7-178 228-391 (429)
111 smart00179 EGF_CA Calcium-bind 97.0 0.00087 1.9E-08 45.4 3.4 31 220-250 2-33 (39)
112 KOG1218|consensus 97.0 0.066 1.4E-06 55.9 19.3 146 328-480 48-209 (316)
113 KOG1446|consensus 97.0 0.26 5.6E-06 49.0 21.3 185 6-199 79-272 (311)
114 PF01436 NHL: NHL repeat; Int 96.9 0.0015 3.3E-08 40.4 4.0 28 25-56 1-28 (28)
115 PF14670 FXa_inhibition: Coagu 96.9 0.00049 1.1E-08 45.2 1.8 21 231-251 9-29 (36)
116 PF13449 Phytase-like: Esteras 96.9 0.06 1.3E-06 56.4 18.0 61 73-134 86-165 (326)
117 PF02333 Phytase: Phytase; In 96.7 0.13 2.8E-06 53.8 18.4 182 6-190 77-291 (381)
118 COG4946 Uncharacterized protei 96.7 0.061 1.3E-06 56.0 15.0 123 6-134 381-507 (668)
119 cd00053 EGF Epidermal growth f 96.6 0.0029 6.2E-08 41.7 3.8 30 557-586 5-35 (36)
120 smart00181 EGF Epidermal growt 96.6 0.0029 6.4E-08 41.6 3.7 28 558-586 6-34 (35)
121 cd00054 EGF_CA Calcium-binding 96.5 0.0032 6.9E-08 42.1 3.4 32 220-251 2-34 (38)
122 PRK02888 nitrous-oxide reducta 96.5 0.12 2.5E-06 57.3 16.9 138 6-146 214-404 (635)
123 PF06433 Me-amine-dh_H: Methyl 96.4 0.34 7.4E-06 49.7 18.7 182 6-198 117-329 (342)
124 PF07974 EGF_2: EGF-like domai 96.4 0.0046 1E-07 39.5 3.4 26 559-586 7-32 (32)
125 PF12947 EGF_3: EGF domain; I 96.4 0.0018 3.9E-08 42.6 1.6 26 559-584 7-32 (36)
126 cd00053 EGF Epidermal growth f 96.3 0.0042 9.1E-08 40.9 3.2 28 420-447 5-32 (36)
127 PF13360 PQQ_2: PQQ-like domai 96.3 0.34 7.4E-06 47.9 18.4 171 5-193 44-234 (238)
128 PF12661 hEGF: Human growth fa 96.3 0.0019 4.2E-08 31.9 0.9 13 656-668 1-13 (13)
129 PF02333 Phytase: Phytase; In 96.3 0.12 2.5E-06 54.2 14.9 123 23-149 153-293 (381)
130 COG3823 Glutamine cyclotransfe 96.2 0.092 2E-06 49.1 11.8 114 25-147 130-260 (262)
131 smart00181 EGF Epidermal growt 96.1 0.0059 1.3E-07 40.1 3.1 28 421-449 6-34 (35)
132 PF07974 EGF_2: EGF-like domai 96.1 0.0079 1.7E-07 38.4 3.3 25 391-415 8-32 (32)
133 smart00051 DSL delta serrate l 96.0 0.0089 1.9E-07 45.1 3.7 46 614-668 18-63 (63)
134 COG5276 Uncharacterized conser 96.0 1.7 3.7E-05 43.1 19.9 137 36-182 137-279 (370)
135 PF13449 Phytase-like: Esteras 95.9 0.27 5.9E-06 51.4 15.8 116 27-147 86-252 (326)
136 PF12661 hEGF: Human growth fa 95.9 0.0048 1E-07 30.5 1.3 12 615-626 2-13 (13)
137 PF12662 cEGF: Complement Clr- 95.9 0.0064 1.4E-07 35.7 1.9 19 612-630 1-23 (24)
138 KOG0318|consensus 95.8 1.1 2.5E-05 47.6 19.5 115 22-148 402-519 (603)
139 PF05787 DUF839: Bacterial pro 95.7 0.11 2.3E-06 57.7 12.4 71 22-93 346-457 (524)
140 KOG0279|consensus 95.7 1.7 3.6E-05 42.8 18.6 184 6-197 84-270 (315)
141 COG5276 Uncharacterized conser 95.7 1.1 2.4E-05 44.4 17.3 147 30-185 176-325 (370)
142 PF01731 Arylesterase: Arylest 95.7 0.032 6.9E-07 45.1 5.9 35 111-145 49-83 (86)
143 COG0823 TolB Periplasmic compo 95.6 0.83 1.8E-05 49.4 18.3 124 51-177 218-344 (425)
144 PF13360 PQQ_2: PQQ-like domai 95.5 2 4.4E-05 42.3 20.1 172 6-192 2-194 (238)
145 KOG3512|consensus 95.3 0.049 1.1E-06 56.5 7.3 133 351-505 279-436 (592)
146 PF08662 eIF2A: Eukaryotic tra 95.3 1.1 2.4E-05 42.9 16.4 121 7-134 39-162 (194)
147 KOG0291|consensus 95.1 2.7 5.9E-05 46.9 19.9 74 117-191 437-510 (893)
148 PF05787 DUF839: Bacterial pro 95.1 0.49 1.1E-05 52.5 14.9 164 5-177 219-453 (524)
149 PTZ00421 coronin; Provisional 95.0 7.2 0.00016 43.3 24.2 159 27-192 127-293 (493)
150 PRK11138 outer membrane biogen 94.9 1.7 3.7E-05 46.9 18.5 134 37-187 256-392 (394)
151 PF06433 Me-amine-dh_H: Methyl 94.9 1.2 2.5E-05 45.9 15.7 137 5-147 155-321 (342)
152 KOG1446|consensus 94.8 4.8 0.0001 40.3 21.2 182 5-195 34-223 (311)
153 smart00051 DSL delta serrate l 94.8 0.038 8.2E-07 41.7 3.6 47 573-626 17-63 (63)
154 PF12662 cEGF: Complement Clr- 94.7 0.026 5.7E-07 33.2 2.0 18 572-589 1-22 (24)
155 COG0823 TolB Periplasmic compo 94.7 1.5 3.2E-05 47.5 17.0 136 7-145 218-357 (425)
156 KOG4378|consensus 94.7 1.5 3.2E-05 46.4 15.9 173 28-209 124-301 (673)
157 KOG0273|consensus 94.7 2.6 5.6E-05 44.4 17.6 178 31-215 280-466 (524)
158 KOG0268|consensus 94.6 0.33 7.3E-06 49.1 10.7 207 4-221 86-335 (433)
159 KOG0285|consensus 94.6 1.4 3E-05 44.7 14.8 169 21-200 147-318 (460)
160 COG4247 Phy 3-phytase (myo-ino 94.5 2.8 6E-05 40.7 16.0 112 22-138 149-270 (364)
161 PF14583 Pectate_lyase22: Olig 94.3 2.4 5.1E-05 44.5 16.7 144 6-155 59-233 (386)
162 PLN00181 protein SPA1-RELATED; 94.2 11 0.00023 45.0 24.6 153 28-188 535-689 (793)
163 PF01731 Arylesterase: Arylest 94.2 0.14 3.1E-06 41.4 5.9 49 7-59 36-84 (86)
164 PF07433 DUF1513: Protein of u 94.2 6.6 0.00014 40.0 19.0 178 4-189 74-285 (305)
165 KOG4649|consensus 94.2 3.3 7E-05 40.4 15.8 67 24-106 29-95 (354)
166 TIGR03300 assembly_YfgL outer 93.9 4.5 9.7E-05 43.3 19.0 132 37-186 241-376 (377)
167 PF14583 Pectate_lyase22: Olig 93.8 1.8 3.9E-05 45.4 14.6 116 38-158 49-188 (386)
168 PRK11138 outer membrane biogen 93.7 5.4 0.00012 43.0 19.2 103 83-194 256-358 (394)
169 PF12955 DUF3844: Domain of un 93.5 0.12 2.6E-06 42.8 4.4 45 655-705 33-90 (103)
170 PRK13616 lipoprotein LpqB; Pro 93.4 6 0.00013 44.8 19.3 168 8-187 380-565 (591)
171 PTZ00382 Variant-specific surf 93.4 0.18 3.9E-06 41.8 5.4 6 659-664 42-47 (96)
172 KOG0310|consensus 93.4 2 4.3E-05 45.4 13.8 156 28-192 71-228 (487)
173 PF14339 DUF4394: Domain of un 93.2 8.9 0.00019 37.4 19.6 161 25-193 26-217 (236)
174 KOG0266|consensus 93.2 13 0.00028 40.9 21.3 116 26-149 204-321 (456)
175 PF00930 DPPIV_N: Dipeptidyl p 93.1 1.6 3.4E-05 46.3 13.5 104 52-155 211-325 (353)
176 PF08662 eIF2A: Eukaryotic tra 93.0 7 0.00015 37.4 16.6 92 52-146 40-133 (194)
177 KOG0294|consensus 93.0 11 0.00024 37.9 18.3 133 5-147 61-198 (362)
178 PF01034 Syndecan: Syndecan do 92.9 0.025 5.3E-07 41.8 -0.3 33 679-711 8-40 (64)
179 TIGR02276 beta_rpt_yvtn 40-res 92.9 0.48 1E-05 32.2 6.2 41 36-80 2-42 (42)
180 KOG0289|consensus 92.9 6.2 0.00013 41.3 16.3 153 28-187 350-503 (506)
181 KOG0289|consensus 92.7 14 0.00031 38.7 18.6 117 74-194 306-424 (506)
182 TIGR03300 assembly_YfgL outer 92.6 15 0.00033 39.1 20.5 104 83-195 241-344 (377)
183 KOG0315|consensus 92.6 11 0.00024 36.7 19.1 133 5-145 59-196 (311)
184 KOG3914|consensus 92.3 3.9 8.5E-05 42.3 14.1 159 28-195 65-229 (390)
185 COG3823 Glutamine cyclotransfe 92.3 7.4 0.00016 36.8 14.6 169 5-186 66-256 (262)
186 KOG0286|consensus 92.2 13 0.00029 37.0 18.4 156 52-210 167-324 (343)
187 KOG0315|consensus 92.2 12 0.00026 36.4 18.4 158 26-191 125-290 (311)
188 KOG0266|consensus 92.0 14 0.0003 40.7 19.6 133 6-146 224-364 (456)
189 PF02897 Peptidase_S9_N: Proly 92.0 20 0.00042 38.9 20.7 156 31-189 175-357 (414)
190 KOG3512|consensus 92.0 0.35 7.6E-06 50.5 6.3 24 320-343 286-309 (592)
191 PTZ00420 coronin; Provisional 91.8 27 0.00058 39.4 24.0 157 26-190 126-294 (568)
192 PLN00181 protein SPA1-RELATED; 91.7 23 0.00051 42.1 22.5 156 26-189 484-648 (793)
193 KOG2139|consensus 91.7 13 0.00029 38.1 16.7 116 73-189 282-431 (445)
194 COG4247 Phy 3-phytase (myo-ino 91.6 12 0.00027 36.4 15.7 97 49-146 122-234 (364)
195 PF14670 FXa_inhibition: Coagu 91.5 0.12 2.7E-06 34.0 1.7 19 564-582 10-28 (36)
196 KOG0273|consensus 91.5 13 0.00028 39.5 16.9 76 24-103 315-391 (524)
197 KOG0318|consensus 91.4 11 0.00023 40.6 16.4 103 24-134 319-424 (603)
198 TIGR02276 beta_rpt_yvtn 40-res 91.4 0.79 1.7E-05 31.1 5.8 42 81-124 1-42 (42)
199 PRK13616 lipoprotein LpqB; Pro 91.3 16 0.00035 41.5 19.2 158 25-193 349-531 (591)
200 KOG0291|consensus 90.9 26 0.00056 39.6 19.1 125 71-200 350-477 (893)
201 PHA02887 EGF-like protein; Pro 90.7 0.27 5.8E-06 41.1 3.2 33 634-671 92-124 (126)
202 KOG0268|consensus 90.5 1.8 4E-05 44.0 9.5 143 25-176 187-331 (433)
203 PF02897 Peptidase_S9_N: Proly 90.3 16 0.00034 39.6 17.9 143 6-153 201-365 (414)
204 KOG4441|consensus 90.1 15 0.00033 41.6 17.8 165 8-178 302-483 (571)
205 smart00180 EGF_Lam Laminin-typ 90.1 0.33 7.3E-06 34.1 3.0 29 462-505 12-40 (46)
206 KOG0319|consensus 90.1 3.6 7.9E-05 45.8 12.0 169 31-210 25-201 (775)
207 PHA02713 hypothetical protein; 90.1 11 0.00023 42.8 16.6 165 7-178 320-520 (557)
208 KOG1539|consensus 90.1 8.9 0.00019 43.5 15.1 135 5-147 468-607 (910)
209 smart00284 OLF Olfactomedin-li 89.7 9.8 0.00021 37.8 13.8 124 6-136 93-244 (255)
210 PF02191 OLF: Olfactomedin-lik 89.5 21 0.00046 35.5 16.3 139 36-180 29-190 (250)
211 KOG2110|consensus 89.5 28 0.00061 35.9 19.5 144 36-187 96-246 (391)
212 PF15102 TMEM154: TMEM154 prot 89.5 0.37 7.9E-06 42.6 3.3 14 684-697 60-73 (146)
213 KOG4441|consensus 89.5 7.4 0.00016 44.1 14.6 168 6-178 348-530 (571)
214 KOG2106|consensus 89.4 26 0.00056 37.7 17.0 156 28-196 332-496 (626)
215 COG3211 PhoX Predicted phospha 89.2 3.1 6.8E-05 45.3 10.5 114 22-135 413-573 (616)
216 KOG0650|consensus 89.0 2.6 5.6E-05 45.8 9.7 124 20-148 516-639 (733)
217 KOG0276|consensus 89.0 25 0.00055 38.8 17.0 138 2-145 30-170 (794)
218 KOG4328|consensus 88.9 4 8.6E-05 42.9 10.7 112 74-188 282-398 (498)
219 PF12946 EGF_MSP1_1: MSP1 EGF 88.9 0.2 4.3E-06 32.8 0.9 25 558-582 5-30 (37)
220 PHA02887 EGF-like protein; Pro 88.7 0.43 9.4E-06 39.9 3.0 30 315-344 93-123 (126)
221 PTZ00214 high cysteine membran 88.6 0.28 6E-06 57.0 2.6 21 612-632 681-705 (800)
222 PTZ00421 coronin; Provisional 88.5 45 0.00098 37.0 23.5 116 27-147 77-199 (493)
223 PF00930 DPPIV_N: Dipeptidyl p 88.5 5.8 0.00013 42.0 12.4 100 29-131 238-343 (353)
224 KOG4328|consensus 88.4 13 0.00027 39.3 13.9 183 4-192 254-453 (498)
225 PF02191 OLF: Olfactomedin-lik 88.4 22 0.00048 35.4 15.5 136 37-179 78-239 (250)
226 PF00053 Laminin_EGF: Laminin 88.3 0.38 8.2E-06 34.3 2.2 32 461-507 11-42 (49)
227 smart00284 OLF Olfactomedin-li 88.3 29 0.00062 34.5 16.0 134 37-178 83-243 (255)
228 PTZ00420 coronin; Provisional 88.2 51 0.0011 37.3 22.6 119 71-193 74-201 (568)
229 KOG2110|consensus 88.0 26 0.00056 36.1 15.5 139 7-147 106-249 (391)
230 TIGR03075 PQQ_enz_alc_DH PQQ-d 88.0 51 0.0011 37.0 20.6 32 120-151 238-284 (527)
231 TIGR03075 PQQ_enz_alc_DH PQQ-d 88.0 25 0.00053 39.5 17.4 17 119-135 390-406 (527)
232 PF05694 SBP56: 56kDa selenium 88.0 9.6 0.00021 40.6 12.9 128 50-178 221-393 (461)
233 KOG0272|consensus 87.9 9.9 0.00022 39.7 12.6 179 24-213 260-442 (459)
234 KOG0285|consensus 87.8 7.3 0.00016 39.8 11.4 132 71-207 151-283 (460)
235 cd00055 EGF_Lam Laminin-type e 87.6 0.6 1.3E-05 33.4 2.9 23 471-506 20-42 (50)
236 KOG0279|consensus 87.4 32 0.0007 34.1 17.9 159 27-191 17-182 (315)
237 COG1520 FOG: WD40-like repeat 87.2 16 0.00034 39.0 14.9 144 36-192 67-220 (370)
238 PHA03099 epidermal growth fact 87.1 0.54 1.2E-05 40.0 2.7 32 314-345 51-83 (139)
239 KOG0282|consensus 86.9 6.6 0.00014 41.6 10.9 202 7-215 280-487 (503)
240 KOG3607|consensus 86.9 1.1 2.4E-05 51.3 6.0 32 634-674 630-661 (716)
241 PF00053 Laminin_EGF: Laminin 86.6 0.39 8.5E-06 34.2 1.5 23 396-418 12-34 (49)
242 cd00055 EGF_Lam Laminin-type e 86.1 0.85 1.9E-05 32.7 3.0 22 397-418 14-35 (50)
243 KOG1274|consensus 86.1 18 0.00038 41.7 14.4 152 28-191 16-170 (933)
244 KOG2111|consensus 85.7 43 0.00094 33.9 16.8 139 6-148 112-258 (346)
245 KOG0292|consensus 85.7 67 0.0015 37.3 18.4 146 8-176 231-383 (1202)
246 PF01102 Glycophorin_A: Glycop 85.6 0.28 6E-06 42.3 0.3 29 682-710 66-94 (122)
247 PHA02713 hypothetical protein; 85.2 73 0.0016 36.1 20.5 161 8-177 273-470 (557)
248 KOG0270|consensus 85.1 55 0.0012 34.6 16.6 177 6-188 265-448 (463)
249 PF07433 DUF1513: Protein of u 85.0 49 0.0011 33.9 16.5 159 33-193 58-251 (305)
250 KOG0293|consensus 84.7 56 0.0012 34.3 16.6 112 29-147 273-385 (519)
251 PF12946 EGF_MSP1_1: MSP1 EGF 84.5 0.7 1.5E-05 30.3 1.7 29 223-251 2-31 (37)
252 PHA02790 Kelch-like protein; P 84.5 67 0.0015 35.6 18.5 161 8-177 288-453 (480)
253 cd00216 PQQ_DH Dehydrogenases 84.3 74 0.0016 35.4 19.7 30 166-196 402-431 (488)
254 KOG0296|consensus 84.2 55 0.0012 33.8 18.8 165 24-197 63-228 (399)
255 COG1520 FOG: WD40-like repeat 84.2 54 0.0012 34.9 17.2 130 5-148 76-220 (370)
256 KOG0293|consensus 83.8 30 0.00064 36.3 13.6 165 26-200 313-481 (519)
257 KOG2919|consensus 83.8 35 0.00077 34.6 13.8 119 23-147 156-282 (406)
258 PF10647 Gmad1: Lipoprotein Lp 83.3 52 0.0011 32.9 17.2 172 8-185 49-233 (253)
259 KOG0646|consensus 82.5 65 0.0014 34.3 15.7 75 117-193 219-311 (476)
260 PF02009 Rifin_STEVOR: Rifin/s 82.5 0.32 6.9E-06 49.3 -0.7 21 691-711 265-285 (299)
261 KOG0310|consensus 82.2 77 0.0017 34.0 18.8 113 29-147 114-226 (487)
262 KOG0286|consensus 82.0 59 0.0013 32.6 17.3 124 24-153 185-311 (343)
263 PF09910 DUF2139: Uncharacteri 81.7 63 0.0014 32.7 16.6 112 31-147 40-182 (339)
264 KOG2048|consensus 81.5 61 0.0013 36.2 15.7 157 24-188 381-547 (691)
265 KOG0272|consensus 81.0 50 0.0011 34.8 14.1 111 26-143 304-415 (459)
266 cd01475 vWA_Matrilin VWA_Matri 80.7 1.2 2.7E-05 43.7 2.7 34 218-251 185-218 (224)
267 PF05694 SBP56: 56kDa selenium 80.6 5.7 0.00012 42.2 7.5 62 73-135 313-393 (461)
268 KOG2111|consensus 80.6 70 0.0015 32.5 19.9 154 26-189 95-256 (346)
269 KOG4378|consensus 80.5 35 0.00076 36.6 13.0 114 30-150 169-284 (673)
270 KOG2106|consensus 80.0 94 0.002 33.6 16.9 92 7-104 349-447 (626)
271 KOG0772|consensus 79.6 46 0.001 36.0 13.6 160 23-188 165-346 (641)
272 KOG0270|consensus 79.6 77 0.0017 33.6 15.0 61 29-92 247-307 (463)
273 TIGR01478 STEVOR variant surfa 79.2 0.63 1.4E-05 45.8 0.1 24 689-712 265-288 (295)
274 KOG0277|consensus 79.2 68 0.0015 31.6 16.3 69 118-190 150-222 (311)
275 KOG0973|consensus 79.2 30 0.00064 40.6 13.1 146 25-179 129-283 (942)
276 KOG0271|consensus 79.1 82 0.0018 32.8 14.7 170 29-208 119-295 (480)
277 PTZ00370 STEVOR; Provisional 78.4 0.67 1.4E-05 45.7 0.0 24 689-712 261-284 (296)
278 KOG0278|consensus 78.4 71 0.0015 31.3 14.8 107 72-188 185-296 (334)
279 smart00180 EGF_Lam Laminin-typ 78.3 2.5 5.4E-05 29.6 2.9 22 396-417 12-33 (46)
280 KOG0640|consensus 78.0 60 0.0013 32.8 13.1 169 22-200 213-394 (430)
281 PF01414 DSL: Delta serrate li 78.0 0.71 1.5E-05 34.9 0.0 14 655-668 50-63 (63)
282 PF01683 EB: EB module; Inter 77.8 3.4 7.4E-05 29.7 3.6 29 622-664 18-46 (52)
283 KOG0281|consensus 77.5 13 0.00027 37.9 8.4 148 31-197 241-396 (499)
284 KOG1036|consensus 77.4 86 0.0019 31.7 14.7 52 5-60 73-125 (323)
285 PF06024 DUF912: Nucleopolyhed 77.3 3.4 7.4E-05 34.7 3.9 28 685-712 65-92 (101)
286 TIGR03548 mutarot_permut cycli 77.0 97 0.0021 32.2 16.6 95 7-104 88-195 (323)
287 KOG1407|consensus 76.9 81 0.0018 31.2 15.9 115 24-145 19-135 (313)
288 KOG2139|consensus 76.7 36 0.00078 35.1 11.4 116 28-149 198-314 (445)
289 PF01414 DSL: Delta serrate li 76.2 0.98 2.1E-05 34.1 0.3 47 329-382 17-63 (63)
290 KOG1274|consensus 76.2 1.6E+02 0.0035 34.3 18.8 96 49-147 74-169 (933)
291 KOG0264|consensus 76.1 1.1E+02 0.0024 32.4 15.1 164 49-214 198-372 (422)
292 KOG0294|consensus 76.0 95 0.0021 31.5 16.2 169 6-189 106-281 (362)
293 PRK10115 protease 2; Provision 75.8 1.7E+02 0.0036 34.2 18.6 115 36-155 278-403 (686)
294 KOG0263|consensus 75.3 1.5E+02 0.0032 33.8 16.7 173 6-188 472-648 (707)
295 PF01299 Lamp: Lysosome-associ 74.9 2.8 6E-05 43.4 3.4 17 685-701 275-291 (306)
296 PF00954 S_locus_glycop: S-loc 74.9 12 0.00026 31.9 6.9 31 219-250 76-107 (110)
297 KOG2048|consensus 74.6 1.5E+02 0.0033 33.3 18.9 99 28-133 28-128 (691)
298 KOG3516|consensus 74.2 29 0.00063 41.4 11.3 25 222-246 957-981 (1306)
299 PF10647 Gmad1: Lipoprotein Lp 74.2 99 0.0021 30.9 18.8 113 27-144 25-142 (253)
300 KOG0263|consensus 73.6 1.6E+02 0.0034 33.6 16.4 161 28-199 454-617 (707)
301 KOG3607|consensus 73.4 6.4 0.00014 45.3 6.1 32 593-628 626-657 (716)
302 PF02439 Adeno_E3_CR2: Adenovi 73.1 1.9 4.2E-05 28.3 1.1 20 682-701 5-24 (38)
303 COG3211 PhoX Predicted phospha 73.1 31 0.00066 38.0 10.6 64 70-134 415-517 (616)
304 PTZ00046 rifin; Provisional 73.1 1.4 3E-05 45.4 0.6 29 684-712 316-345 (358)
305 PF01683 EB: EB module; Inter 72.6 6.4 0.00014 28.3 3.9 20 390-411 27-46 (52)
306 PF05935 Arylsulfotrans: Aryls 72.5 1E+02 0.0022 34.1 15.2 116 36-156 157-311 (477)
307 TIGR01477 RIFIN variant surfac 72.4 1.5 3.2E-05 45.1 0.7 29 684-712 311-340 (353)
308 KOG0303|consensus 72.3 1.3E+02 0.0029 31.5 14.6 140 4-149 151-297 (472)
309 KOG0281|consensus 71.4 21 0.00046 36.3 8.3 99 84-191 330-430 (499)
310 PRK14131 N-acetylneuraminic ac 71.2 1.4E+02 0.003 31.8 15.6 137 36-177 37-228 (376)
311 PHA03098 kelch-like protein; P 70.8 1.9E+02 0.004 32.6 18.2 166 7-177 311-494 (534)
312 KOG0288|consensus 70.8 90 0.0019 32.9 12.8 126 3-134 318-450 (459)
313 KOG0308|consensus 70.7 89 0.0019 34.9 13.3 175 5-188 138-326 (735)
314 PF04478 Mid2: Mid2 like cell 70.7 2.6 5.7E-05 37.6 1.7 17 681-697 50-66 (154)
315 PF01102 Glycophorin_A: Glycop 70.6 1.3 2.8E-05 38.3 -0.2 34 680-713 60-94 (122)
316 PRK10115 protease 2; Provision 69.9 2.2E+02 0.0049 33.2 21.1 154 30-188 176-345 (686)
317 PF05454 DAG1: Dystroglycan (D 69.4 1.5 3.3E-05 44.2 0.0 32 681-712 145-177 (290)
318 KOG1517|consensus 69.2 1.1E+02 0.0025 36.3 14.3 159 28-191 1166-1335(1387)
319 KOG1407|consensus 69.1 1.2E+02 0.0027 29.9 18.1 56 73-130 149-204 (313)
320 TIGR03547 muta_rot_YjhT mutatr 69.0 1.5E+02 0.0033 31.0 15.2 137 36-177 16-207 (346)
321 PF12955 DUF3844: Domain of un 68.9 12 0.00025 31.3 5.0 24 597-620 13-40 (103)
322 PF05935 Arylsulfotrans: Aryls 68.9 1.9E+02 0.0042 32.0 16.3 158 36-198 112-310 (477)
323 KOG0649|consensus 68.6 81 0.0017 30.8 11.1 134 70-205 113-253 (325)
324 PF03178 CPSF_A: CPSF A subuni 68.4 1.1E+02 0.0024 31.7 13.8 129 51-187 62-200 (321)
325 KOG2055|consensus 68.3 1.7E+02 0.0038 31.3 16.0 113 29-148 261-376 (514)
326 PF09910 DUF2139: Uncharacteri 66.8 1.5E+02 0.0033 30.0 13.7 51 49-105 18-68 (339)
327 KOG3567|consensus 66.2 17 0.00038 38.6 6.8 51 137-189 444-496 (501)
328 TIGR02171 Fb_sc_TIGR02171 Fibr 66.1 35 0.00076 40.0 9.8 179 7-187 329-547 (912)
329 KOG0284|consensus 66.1 33 0.00072 35.8 8.6 76 116-194 181-257 (464)
330 KOG3567|consensus 65.1 2.1 4.6E-05 45.1 0.0 53 95-148 445-498 (501)
331 KOG0283|consensus 64.5 2.7E+02 0.0058 32.1 16.5 158 22-188 406-575 (712)
332 PF03302 VSP: Giardia variant- 63.3 7 0.00015 42.0 3.5 34 679-712 362-396 (397)
333 KOG3881|consensus 63.2 2E+02 0.0043 30.2 15.7 109 74-187 205-317 (412)
334 KOG0319|consensus 63.2 1.4E+02 0.0031 33.9 13.2 134 7-146 40-179 (775)
335 KOG0275|consensus 63.2 1.3E+02 0.0028 30.5 11.8 90 76-170 397-489 (508)
336 COG4222 Uncharacterized protei 63.2 1.1E+02 0.0023 32.7 12.0 62 73-134 139-218 (391)
337 KOG0301|consensus 62.9 1.5E+02 0.0033 33.4 13.3 127 51-188 81-207 (745)
338 COG3490 Uncharacterized protei 62.8 1.8E+02 0.0038 29.4 13.7 134 53-192 51-192 (366)
339 KOG1408|consensus 62.7 1.5E+02 0.0033 33.6 13.3 96 119-215 600-697 (1080)
340 TIGR02171 Fb_sc_TIGR02171 Fibr 62.7 45 0.00098 39.1 9.9 99 36-139 318-424 (912)
341 PF06739 SBBP: Beta-propeller 62.0 12 0.00027 24.9 3.3 20 72-92 13-32 (38)
342 PF12191 stn_TNFRSF12A: Tumour 61.7 3.8 8.2E-05 35.1 0.9 10 702-711 99-108 (129)
343 COG1770 PtrB Protease II [Amin 61.0 3E+02 0.0064 31.4 16.8 150 25-179 128-289 (682)
344 PF14761 HPS3_N: Hermansky-Pud 61.0 1.6E+02 0.0035 28.4 15.5 152 23-179 15-206 (215)
345 KOG0269|consensus 60.7 1E+02 0.0023 35.1 11.8 101 49-150 108-211 (839)
346 KOG2315|consensus 60.7 2.7E+02 0.0057 30.8 14.8 120 7-134 251-373 (566)
347 KOG0308|consensus 60.7 79 0.0017 35.3 10.7 111 26-144 214-325 (735)
348 KOG2321|consensus 60.4 93 0.002 34.3 11.0 116 29-153 137-265 (703)
349 KOG1036|consensus 60.4 2E+02 0.0043 29.2 13.9 148 27-188 15-162 (323)
350 cd00216 PQQ_DH Dehydrogenases 59.9 2.8E+02 0.0061 30.8 20.9 32 120-151 221-269 (488)
351 KOG0283|consensus 59.1 2.9E+02 0.0062 31.9 15.0 111 72-188 410-531 (712)
352 KOG4649|consensus 58.8 2E+02 0.0043 28.6 15.9 34 29-66 139-172 (354)
353 PF09064 Tme5_EGF_like: Thromb 58.5 7.6 0.00016 24.9 1.6 25 224-251 4-28 (34)
354 TIGR03074 PQQ_membr_DH membran 57.9 1.9E+02 0.004 34.2 14.0 100 76-184 379-482 (764)
355 KOG1273|consensus 57.7 2.3E+02 0.0049 29.0 15.0 69 74-145 156-225 (405)
356 KOG3545|consensus 57.6 2E+02 0.0043 28.4 11.9 103 37-147 77-203 (249)
357 KOG0306|consensus 57.5 3.2E+02 0.0068 31.5 14.6 109 72-188 509-621 (888)
358 KOG0918|consensus 57.1 2E+02 0.0044 30.4 12.3 30 118-147 314-343 (476)
359 PF14991 MLANA: Protein melan- 57.1 1.9 4E-05 36.1 -1.6 25 686-710 26-50 (118)
360 KOG0292|consensus 56.7 2.6E+02 0.0057 32.8 14.0 154 24-189 8-165 (1202)
361 PF01299 Lamp: Lysosome-associ 56.7 9.3 0.0002 39.5 3.0 10 82-91 107-116 (306)
362 PF15416 DUF4623: Domain of un 56.6 1.3E+02 0.0029 30.8 10.6 158 32-210 119-292 (442)
363 cd01475 vWA_Matrilin VWA_Matri 56.6 8.9 0.00019 37.6 2.8 18 565-582 200-217 (224)
364 KOG0313|consensus 56.6 2.6E+02 0.0056 29.3 14.4 140 22-181 256-400 (423)
365 KOG3516|consensus 56.0 8.9 0.00019 45.4 2.9 44 552-595 545-589 (1306)
366 PF14517 Tachylectin: Tachylec 55.8 2.1E+02 0.0045 28.0 14.8 157 20-189 28-206 (229)
367 KOG0278|consensus 55.4 2.2E+02 0.0047 28.1 13.7 127 4-140 162-291 (334)
368 PF14870 PSII_BNR: Photosynthe 55.3 2.5E+02 0.0055 28.9 17.5 142 50-198 123-269 (302)
369 KOG0640|consensus 55.2 1.8E+02 0.0039 29.5 11.2 111 25-144 172-289 (430)
370 PF03178 CPSF_A: CPSF A subuni 55.1 2.1E+02 0.0045 29.6 13.0 129 7-144 62-200 (321)
371 KOG3914|consensus 54.3 1.4E+02 0.0031 31.3 10.8 115 24-145 106-222 (390)
372 KOG0647|consensus 54.2 2.5E+02 0.0055 28.5 12.3 84 69-153 25-109 (347)
373 KOG3514|consensus 52.5 78 0.0017 37.5 9.3 30 222-251 625-655 (1591)
374 KOG3514|consensus 51.7 9.5 0.00021 44.5 2.2 35 417-451 625-660 (1591)
375 PF05808 Podoplanin: Podoplani 51.6 4.9 0.00011 36.2 0.0 26 682-707 131-156 (162)
376 KOG0646|consensus 51.6 2.5E+02 0.0054 30.2 12.2 31 72-104 218-248 (476)
377 PF00954 S_locus_glycop: S-loc 50.5 17 0.00037 31.0 3.2 30 553-583 78-108 (110)
378 KOG2055|consensus 50.0 3.6E+02 0.0078 29.1 13.8 37 25-65 213-249 (514)
379 PF05808 Podoplanin: Podoplani 50.0 5.4 0.00012 36.0 0.0 35 678-712 123-157 (162)
380 KOG1897|consensus 49.6 3.9E+02 0.0085 31.9 14.2 140 42-189 798-942 (1096)
381 PF13908 Shisa: Wnt and FGF in 48.8 9.2 0.0002 36.0 1.4 18 683-700 78-95 (179)
382 KOG0772|consensus 48.4 3E+02 0.0064 30.2 12.2 117 24-145 213-346 (641)
383 PRK13684 Ycf48-like protein; P 47.9 3.5E+02 0.0075 28.3 18.3 149 27-187 174-329 (334)
384 PRK13684 Ycf48-like protein; P 47.9 3.5E+02 0.0075 28.3 16.6 140 50-196 151-294 (334)
385 PF12877 DUF3827: Domain of un 46.9 8.1 0.00018 42.8 0.7 16 73-88 75-90 (684)
386 COG3292 Predicted periplasmic 46.7 2.2E+02 0.0047 31.7 11.1 70 25-104 164-234 (671)
387 KOG0918|consensus 45.8 77 0.0017 33.3 7.4 28 30-60 316-343 (476)
388 KOG0299|consensus 45.1 4.3E+02 0.0093 28.5 14.0 111 24-140 326-450 (479)
389 KOG1645|consensus 45.0 1E+02 0.0022 32.4 8.1 73 72-147 194-267 (463)
390 TIGR03547 muta_rot_YjhT mutatr 44.9 3.8E+02 0.0083 27.9 16.6 38 7-44 85-125 (346)
391 PF08374 Protocadherin: Protoc 44.9 7.8 0.00017 36.7 0.2 22 681-702 39-60 (221)
392 KOG4283|consensus 44.8 3.5E+02 0.0076 27.4 14.9 53 7-62 25-77 (397)
393 KOG1408|consensus 44.6 3E+02 0.0065 31.4 11.9 101 25-133 37-140 (1080)
394 KOG1539|consensus 44.4 5.8E+02 0.013 29.8 18.2 118 50-170 223-344 (910)
395 PHA03265 envelope glycoprotein 44.0 6.5 0.00014 39.9 -0.5 35 675-709 342-376 (402)
396 PF04841 Vps16_N: Vps16, N-ter 43.9 4.5E+02 0.0098 28.4 15.4 107 22-137 36-156 (410)
397 KOG0282|consensus 43.7 2.9E+02 0.0064 29.8 11.3 87 51-140 280-366 (503)
398 KOG1334|consensus 43.6 2.5E+02 0.0054 30.5 10.8 65 23-90 230-300 (559)
399 COG5167 VID27 Protein involved 43.0 3.2E+02 0.007 29.9 11.5 71 28-103 419-498 (776)
400 PF15102 TMEM154: TMEM154 prot 42.3 11 0.00025 33.4 0.8 16 686-701 58-74 (146)
401 KOG3509|consensus 42.1 1.2E+02 0.0027 36.1 9.2 36 218-253 404-439 (964)
402 PF11134 Phage_stabilise: Phag 41.8 4.8E+02 0.01 28.1 12.5 70 122-193 140-213 (469)
403 PF05297 Herpes_LMP1: Herpesvi 41.5 8.7 0.00019 37.8 0.0 16 693-708 60-75 (381)
404 PF04863 EGF_alliinase: Alliin 41.3 11 0.00023 27.2 0.4 22 652-673 33-54 (56)
405 KOG0645|consensus 41.0 3.9E+02 0.0084 26.8 15.1 124 71-196 14-142 (312)
406 KOG0277|consensus 39.9 3.9E+02 0.0084 26.6 12.7 37 72-108 148-185 (311)
407 KOG0276|consensus 39.8 6E+02 0.013 28.7 17.5 154 26-189 14-171 (794)
408 KOG2096|consensus 39.4 4.4E+02 0.0096 27.0 12.7 96 33-131 193-294 (420)
409 KOG0650|consensus 39.3 2.5E+02 0.0054 31.3 10.2 70 116-189 567-637 (733)
410 KOG1273|consensus 38.8 4.5E+02 0.0098 27.0 15.8 153 28-190 26-184 (405)
411 PHA02790 Kelch-like protein; P 38.0 6E+02 0.013 28.1 15.8 116 8-135 332-454 (480)
412 PTZ00208 65 kDa invariant surf 37.8 31 0.00067 36.1 3.2 13 699-711 403-415 (436)
413 PF08309 LVIVD: LVIVD repeat; 37.7 1.2E+02 0.0026 20.8 5.0 29 162-191 3-31 (42)
414 KOG1445|consensus 37.2 2.7E+02 0.0059 31.2 10.1 27 123-149 270-296 (1012)
415 KOG3621|consensus 37.1 2.1E+02 0.0046 32.5 9.5 59 117-179 126-189 (726)
416 KOG3881|consensus 36.9 5.3E+02 0.012 27.2 14.4 74 117-192 204-280 (412)
417 PF14283 DUF4366: Domain of un 36.7 41 0.00089 32.6 3.7 14 697-710 172-185 (218)
418 KOG0196|consensus 36.3 46 0.00099 38.3 4.5 12 368-379 258-269 (996)
419 PF14339 DUF4394: Domain of un 36.1 4.3E+02 0.0094 26.0 13.4 74 72-148 27-105 (236)
420 PF00780 CNH: CNH domain; Int 35.8 4.5E+02 0.0099 26.1 15.0 146 35-195 5-171 (275)
421 KOG0639|consensus 35.8 3.8E+02 0.0081 29.3 10.6 73 118-193 512-585 (705)
422 PHA03098 kelch-like protein; P 35.4 6.8E+02 0.015 28.0 14.8 80 7-90 358-444 (534)
423 KOG0275|consensus 35.4 5E+02 0.011 26.5 13.3 88 119-207 396-485 (508)
424 KOG0321|consensus 35.1 5.2E+02 0.011 29.2 11.9 201 1-209 116-368 (720)
425 PF04639 Baculo_E56: Baculovir 34.0 52 0.0011 32.8 3.9 18 693-710 286-303 (305)
426 PF07213 DAP10: DAP10 membrane 33.9 48 0.001 26.1 2.9 15 682-696 32-46 (79)
427 PHA03240 envelope glycoprotein 33.7 15 0.00033 34.6 0.2 26 683-709 210-235 (258)
428 KOG1524|consensus 33.2 4.5E+02 0.0098 29.0 10.8 41 5-45 124-165 (737)
429 PF07204 Orthoreo_P10: Orthore 33.0 13 0.00028 30.1 -0.3 18 693-710 52-69 (98)
430 PLN00033 photosystem II stabil 33.0 6.5E+02 0.014 27.1 15.8 98 51-153 259-363 (398)
431 TIGR03548 mutarot_permut cycli 32.5 5.7E+02 0.012 26.3 16.3 96 82-177 71-178 (323)
432 KOG0274|consensus 32.5 7.7E+02 0.017 27.8 18.7 153 24-191 248-402 (537)
433 KOG0305|consensus 32.3 7.3E+02 0.016 27.4 17.2 117 26-147 302-420 (484)
434 PF14870 PSII_BNR: Photosynthe 32.1 5.8E+02 0.013 26.3 15.3 55 29-89 116-170 (302)
435 KOG1310|consensus 31.8 2.8E+02 0.0061 30.6 9.1 113 28-145 53-177 (758)
436 KOG0196|consensus 31.4 52 0.0011 37.9 3.9 9 437-445 260-268 (996)
437 KOG1445|consensus 30.8 6.2E+02 0.013 28.6 11.5 74 24-103 676-750 (1012)
438 KOG0973|consensus 30.8 1E+03 0.022 28.6 15.1 66 72-140 130-195 (942)
439 PF12301 CD99L2: CD99 antigen 30.6 22 0.00048 32.8 0.8 33 678-710 109-141 (169)
440 PF04478 Mid2: Mid2 like cell 30.5 16 0.00035 32.7 -0.1 27 685-711 50-77 (154)
441 KOG0284|consensus 30.1 4.2E+02 0.009 28.2 9.7 91 51-144 160-250 (464)
442 PF15390 DUF4613: Domain of un 29.5 3.2E+02 0.007 30.6 9.2 59 19-80 332-393 (671)
443 PF02480 Herpes_gE: Alphaherpe 29.5 18 0.00039 39.3 0.0 9 203-211 179-187 (439)
444 KOG0307|consensus 29.4 2E+02 0.0044 34.4 8.2 166 26-199 117-294 (1049)
445 PLN02153 epithiospecifier prot 29.4 6.7E+02 0.014 26.1 19.4 166 7-177 50-258 (341)
446 TIGR03074 PQQ_membr_DH membran 29.2 1E+03 0.022 28.2 15.5 15 119-133 378-392 (764)
447 KOG3653|consensus 29.2 58 0.0013 35.1 3.6 6 655-660 116-121 (534)
448 PF13131 DUF3951: Protein of u 29.2 42 0.00091 23.7 1.7 8 687-694 6-13 (53)
449 KOG0641|consensus 28.9 5.4E+02 0.012 24.9 18.4 74 72-148 232-305 (350)
450 KOG0269|consensus 28.2 5.2E+02 0.011 29.9 10.7 52 96-147 111-165 (839)
451 COG5437 Predicted secreted pro 27.9 1.4E+02 0.0031 25.7 4.9 59 58-122 46-105 (138)
452 PF14269 Arylsulfotran_2: Aryl 27.4 6.9E+02 0.015 25.6 15.3 36 73-110 145-181 (299)
453 PF05337 CSF-1: Macrophage col 27.2 21 0.00045 35.4 0.0 19 693-711 236-254 (285)
454 KOG1517|consensus 26.3 1.2E+03 0.027 28.3 13.5 123 25-154 1208-1341(1387)
455 PF02009 Rifin_STEVOR: Rifin/s 26.2 21 0.00045 36.5 -0.2 25 685-709 262-286 (299)
456 PF06084 Cytomega_TRL10: Cytom 25.9 59 0.0013 27.2 2.4 7 614-620 21-27 (150)
457 KOG0645|consensus 25.8 6.9E+02 0.015 25.1 18.5 160 25-189 14-180 (312)
458 PF05393 Hum_adeno_E3A: Human 25.7 41 0.00088 26.9 1.3 23 689-711 39-61 (94)
459 PHA03164 hypothetical protein; 25.3 78 0.0017 24.4 2.7 11 688-698 65-75 (88)
460 PF05131 Pep3_Vps18: Pep3/Vps1 24.7 5.2E+02 0.011 23.3 8.7 72 26-106 34-106 (147)
461 PF15176 LRR19-TM: Leucine-ric 24.5 87 0.0019 25.9 3.0 17 680-696 14-30 (102)
462 PF08268 FBA_3: F-box associat 24.2 2.9E+02 0.0062 24.0 6.8 55 120-177 1-61 (129)
463 PF05345 He_PIG: Putative Ig d 24.1 85 0.0018 22.3 2.7 23 114-136 9-31 (49)
464 KOG4532|consensus 23.7 7.6E+02 0.016 24.8 14.3 32 74-107 161-192 (344)
465 KOG0274|consensus 23.5 1.1E+03 0.024 26.6 17.9 141 49-199 309-451 (537)
466 CHL00114 psbX photosystem II p 22.8 1.3E+02 0.0028 20.2 3.0 12 685-696 8-19 (39)
467 KOG1272|consensus 22.8 3.6E+02 0.0077 29.2 7.8 30 117-147 295-324 (545)
468 PF10873 DUF2668: Protein of u 22.3 60 0.0013 28.7 1.9 24 678-701 59-82 (155)
469 PF02404 SCF: Stem cell factor 22.1 30 0.00064 33.9 0.0 30 683-712 215-244 (273)
470 KOG3621|consensus 22.1 2E+02 0.0044 32.7 6.2 115 27-150 126-250 (726)
471 PF04901 RAMP: Receptor activi 21.9 30 0.00065 29.6 0.0 23 689-711 89-111 (113)
472 KOG4532|consensus 21.8 8.3E+02 0.018 24.6 15.2 29 28-60 161-189 (344)
473 KOG2280|consensus 21.7 5.7E+02 0.012 29.6 9.6 61 27-92 85-155 (829)
474 KOG4818|consensus 21.3 94 0.002 32.2 3.3 29 680-708 326-354 (362)
475 KOG4714|consensus 21.3 5.3E+02 0.012 25.8 8.1 76 25-104 179-255 (319)
476 KOG3509|consensus 21.2 1.7E+02 0.0037 35.0 5.7 72 552-626 406-478 (964)
477 KOG1538|consensus 21.2 1.2E+03 0.026 26.7 11.6 114 6-127 153-275 (1081)
478 KOG0305|consensus 20.9 1.2E+03 0.025 25.9 13.6 127 6-134 322-450 (484)
479 COG3361 Uncharacterized conser 20.5 2.5E+02 0.0054 26.3 5.5 74 72-156 41-114 (240)
480 PF09472 MtrF: Tetrahydrometha 20.5 74 0.0016 24.1 1.8 20 683-702 43-62 (64)
481 PF00558 Vpu: Vpu protein; In 20.1 52 0.0011 26.2 0.9 7 687-693 7-13 (81)
482 PF08553 VID27: VID27 cytoplas 20.1 1.5E+03 0.032 26.9 14.6 170 1-176 456-636 (794)
483 PF05262 Borrelia_P83: Borreli 20.1 8.6E+02 0.019 26.9 10.5 64 129-192 408-472 (489)
484 PF09826 Beta_propel: Beta pro 20.0 1.3E+03 0.027 26.0 13.6 168 34-207 107-319 (521)
No 1
>KOG1214|consensus
Probab=100.00 E-value=9.1e-34 Score=297.27 Aligned_cols=238 Identities=26% Similarity=0.541 Sum_probs=210.2
Q ss_pred CcccCCceeEEEccCcc-c-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEE
Q psy6570 2 ASISSGNVTRVKREMNL-K-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIAL 79 (713)
Q Consensus 2 ad~~~~~I~~~~~~~~~-~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iav 79 (713)
+|+...+|.|..+.|.+ + .+.++|..|+||||||++++|||+|+ ...+|.+..|||+.+++|+..+|..|++|++
T Consensus 1042 tDv~g~SI~rasL~G~Ep~ti~n~~L~SPEGiAVDh~~Rn~ywtDS---~lD~IevA~LdG~~rkvLf~tdLVNPR~iv~ 1118 (1289)
T KOG1214|consen 1042 TDVAGRSISRASLEGAEPETIVNSGLISPEGIAVDHIRRNMYWTDS---VLDKIEVALLDGSERKVLFYTDLVNPRAIVV 1118 (1289)
T ss_pred eecCCCccccccccCCCCceeecccCCCccceeeeeccceeeeecc---ccchhheeecCCceeeEEEeecccCcceEEe
Confidence 57778899999999865 3 44779999999999999999999999 8999999999999999999999999999999
Q ss_pred cCCCCcEEEEccCC-CCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCC
Q psy6570 80 EPLSGRMFWTELGI-KPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTED 158 (713)
Q Consensus 80 D~~~~~ly~td~~~-~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~ 158 (713)
|+..|.||||||+. +++|++.+|||+++++|+.+++..|+||+||+..+.|-|+|.+++|+.-+.++|..|++++..
T Consensus 1119 D~~rgnLYwtDWnRenPkIets~mDG~NrRilin~DigLPNGLtfdpfs~~LCWvDAGt~rleC~~p~g~gRR~i~~~-- 1196 (1289)
T KOG1214|consen 1119 DPIRGNLYWTDWNRENPKIETSSMDGENRRILINTDIGLPNGLTFDPFSKLLCWVDAGTKRLECTLPDGTGRRVIQNN-- 1196 (1289)
T ss_pred ecccCceeeccccccCCcceeeccCCccceEEeecccCCCCCceeCcccceeeEEecCCcceeEecCCCCcchhhhhc--
Confidence 99999999999984 589999999999999999999999999999999999999999999999999999999998876
Q ss_pred CCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcc-eeeeccccccccEEEEeeccccCCccCCCCCCCCCCCCeeecc
Q psy6570 159 NGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDF-NVLANNLNRASDVLILQENKQAHNVTNHCDDKPCHQSALCINL 237 (713)
Q Consensus 159 ~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~i~v~~~~~q~~~~~~~C~~~~C~~~~~C~~~ 237 (713)
+.+|++|..+++++|||||..++|..++++++... ..+.....+.++|..+. ++....+++|+.+..+..|+|+..
T Consensus 1197 -LqYPF~itsy~~~fY~TDWk~n~vvsv~~~~~~~td~~~p~~~s~lyGItav~--~~Cp~gstpCSedNGGCqHLCLpg 1273 (1289)
T KOG1214|consen 1197 -LQYPFSITSYADHFYHTDWKRNGVVSVNKHSGQFTDEYLPEQRSHLYGITAVY--PYCPTGSTPCSEDNGGCQHLCLPG 1273 (1289)
T ss_pred -ccCceeeeeccccceeeccccCceEEeeccccccccccccccccceEEEEecc--ccCCCCCCcccccCCcceeecccC
Confidence 78999999999999999999999999999876543 33444555566665554 444578899998876558999977
Q ss_pred CCCceeeeCC
Q psy6570 238 PSSHTCLCPD 247 (713)
Q Consensus 238 ~g~~~C~C~~ 247 (713)
.....|.||+
T Consensus 1274 qngavcecpd 1283 (1289)
T KOG1214|consen 1274 QNGAVCECPD 1283 (1289)
T ss_pred cCCccccCCc
Confidence 7778888875
No 2
>KOG1214|consensus
Probab=99.91 E-value=8.2e-24 Score=222.95 Aligned_cols=205 Identities=22% Similarity=0.336 Sum_probs=177.9
Q ss_pred CCceeEEEccCcc------cE-EecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEE
Q psy6570 6 SGNVTRVKREMNL------KT-VLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIA 78 (713)
Q Consensus 6 ~~~I~~~~~~~~~------~~-~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~ia 78 (713)
..+|.++.+++.. ++ +.-...-|.||+||-+.++|||+|. ....|.+.+|+|...+++++.+|..|.|||
T Consensus 998 g~~I~~lplng~~~~K~~ak~~l~~p~~IiVGidfDC~e~mvyWtDv---~g~SI~rasL~G~Ep~ti~n~~L~SPEGiA 1074 (1289)
T KOG1214|consen 998 GQQIGYLPLNGTRLQKDAAKTLLSLPGSIIVGIDFDCRERMVYWTDV---AGRSISRASLEGAEPETIVNSGLISPEGIA 1074 (1289)
T ss_pred cceEEEeecCcchhchhhhhceEecccceeeeeecccccceEEEeec---CCCccccccccCCCCceeecccCCCcccee
Confidence 4677788777643 11 2334556899999999999999999 788999999999999999999999999999
Q ss_pred EcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEec
Q psy6570 79 LEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHT 156 (713)
Q Consensus 79 vD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~ 156 (713)
||+..+.|||||+... +|+++.|||+.+++|+.++|-.|.+|++|+..+.|||+|| .+.+|.+.++||.++++|+..
T Consensus 1075 VDh~~Rn~ywtDS~lD-~IevA~LdG~~rkvLf~tdLVNPR~iv~D~~rgnLYwtDWnRenPkIets~mDG~NrRilin~ 1153 (1289)
T KOG1214|consen 1075 VDHIRRNMYWTDSVLD-KIEVALLDGSERKVLFYTDLVNPRAIVVDPIRGNLYWTDWNRENPKIETSSMDGENRRILINT 1153 (1289)
T ss_pred eeeccceeeeeccccc-hhheeecCCceeeEEEeecccCcceEEeecccCceeeccccccCCcceeeccCCccceEEeec
Confidence 9999999999999888 9999999999999999999999999999999999999998 466899999999999999988
Q ss_pred CCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccEEEEeeccc
Q psy6570 157 EDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDVLILQENKQ 215 (713)
Q Consensus 157 ~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q 215 (713)
...+++.+.++.+...|-|+|.+++++.-+...+..+++ +..++..|++|.-+....+
T Consensus 1154 DigLPNGLtfdpfs~~LCWvDAGt~rleC~~p~g~gRR~-i~~~LqYPF~itsy~~~fY 1211 (1289)
T KOG1214|consen 1154 DIGLPNGLTFDPFSKLLCWVDAGTKRLECTLPDGTGRRV-IQNNLQYPFSITSYADHFY 1211 (1289)
T ss_pred ccCCCCCceeCcccceeeEEecCCcceeEecCCCCcchh-hhhcccCceeeeeccccce
Confidence 777777777777788999999999999999887665544 4467888999887777665
No 3
>KOG1215|consensus
Probab=99.88 E-value=3.4e-21 Score=227.68 Aligned_cols=254 Identities=36% Similarity=0.684 Sum_probs=212.0
Q ss_pred CcccCCceeEEEccCcc--cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEE
Q psy6570 2 ASISSGNVTRVKREMNL--KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIAL 79 (713)
Q Consensus 2 ad~~~~~I~~~~~~~~~--~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iav 79 (713)
+|....+|.+...++.. .+...++-.|.+||+||+.+.+||+|. ....|.+.+++|..+.+++...+..|+.+++
T Consensus 454 ~d~~~~~i~~~~~~~~~~~~~~~~g~~~~~~lavD~~~~~~y~tDe---~~~~i~v~~~~g~~~~vl~~~~l~~~r~~~v 530 (877)
T KOG1215|consen 454 ADLSDEKICRASQDGSSECELCGDGLCIPEGLAVDWIGDNIYWTDE---GNCLIEVADLDGSSRKVLVSKDLDLPRSIAV 530 (877)
T ss_pred EeccCCeEeeeccCCCccceEeccCccccCcEEEEeccCCceeccc---CCceeEEEEccCCceeEEEecCCCCccceee
Confidence 35566777777777643 334678889999999999999999999 7899999999999988999888899999999
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCC-cEEEEeCCCCceeEEEecCC
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKAR-TIESINLNGKDRFVVYHTED 158 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~-~I~~~~~~g~~~~~~~~~~~ 158 (713)
||..+++||+||+..++|.|+.|||..+.+++..++.+|+||++|...+++||+|.... .|.+++++|..++ +... .
T Consensus 531 ~p~~g~~~wtd~~~~~~i~ra~~dg~~~~~l~~~~~~~p~glt~d~~~~~~yw~d~~~~~~i~~~~~~g~~r~-~~~~-~ 608 (877)
T KOG1215|consen 531 DPEKGLMFWTDWGQPPRIERASLDGSERAVLVTNGILWPNGLTIDYETDRLYWADAKLDYTIESANMDGQNRR-VVDS-E 608 (877)
T ss_pred ccccCeeEEecCCCCchhhhhcCCCCCceEEEeCCccCCCcceEEeecceeEEEcccCCcceeeeecCCCceE-Eecc-c
Confidence 99999999999996559999999999999999988999999999999999999999888 7999999999998 2222 2
Q ss_pred CCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccEEEEeeccccCCccCCCCCCCCCCCCeeeccC
Q psy6570 159 NGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDVLILQENKQAHNVTNHCDDKPCHQSALCINLP 238 (713)
Q Consensus 159 ~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~~~~~~~C~~~~C~~~~~C~~~~ 238 (713)
.+.+|++++++++++||++|....+.+..+..+.....+......+..+.+++...+.....|+|+.+.....++|+..|
T Consensus 609 ~~~~p~~~~~~~~~iyw~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~C~~~n~~c~~KOG~~p 688 (877)
T KOG1215|consen 609 DLPHPFGLSVFEDYIYWTDWSNRAISRAEKHKGSDSRTSRSNLAQPLDIILVHHSSSRPTGVNPCESSNGGCSQLCLPRP 688 (877)
T ss_pred cCCCceEEEEecceeEEeeccccceEeeecccCCcceeeecccCcccceEEEeccccCCCCCCcccccCCCCCeeeecCC
Confidence 48899999999999999999999888888776655123444567777777776655556788999987333389999999
Q ss_pred CCceeeeCCCCccccccCCCCCcc
Q psy6570 239 SSHTCLCPDHLTEELNVTSGKMSC 262 (713)
Q Consensus 239 g~~~C~C~~G~~~~~~~~~~~c~C 262 (713)
...+|.|+.|+. +..+...|.+
T Consensus 689 ~~~~c~c~~~~~--l~~~~~~C~~ 710 (877)
T KOG1215|consen 689 QGSTCACPEGYR--LSPDGKSCSS 710 (877)
T ss_pred CCCeeeCCCCCe--ecCCCCeecC
Confidence 988999999987 4455555655
No 4
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.75 E-value=6.7e-17 Score=162.72 Aligned_cols=200 Identities=23% Similarity=0.337 Sum_probs=148.9
Q ss_pred CCcccCCceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcC-----CCCCcc
Q psy6570 1 MASISSGNVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNT-----GLNEPY 75 (713)
Q Consensus 1 vad~~~~~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~-----~~~~p~ 75 (713)
|+|+..++|+++++++....++. +..|.|++++..++.||+++. ..+.+++++....+.+... .+..|+
T Consensus 16 ~~D~~~~~i~~~~~~~~~~~~~~-~~~~~G~~~~~~~g~l~v~~~-----~~~~~~d~~~g~~~~~~~~~~~~~~~~~~N 89 (246)
T PF08450_consen 16 WVDIPGGRIYRVDPDTGEVEVID-LPGPNGMAFDRPDGRLYVADS-----GGIAVVDPDTGKVTVLADLPDGGVPFNRPN 89 (246)
T ss_dssp EEETTTTEEEEEETTTTEEEEEE-SSSEEEEEEECTTSEEEEEET-----TCEEEEETTTTEEEEEEEEETTCSCTEEEE
T ss_pred EEEcCCCEEEEEECCCCeEEEEe-cCCCceEEEEccCCEEEEEEc-----CceEEEecCCCcEEEEeeccCCCcccCCCc
Confidence 46888999999999987655433 333999999966799999997 3344558776666655543 578899
Q ss_pred eEEEcCCCCcEEEEccCCC-------CeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 76 DIALEPLSGRMFWTELGIK-------PRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 76 ~iavD~~~~~ly~td~~~~-------~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
++++|+ +|.||+++.... ++|++++.+++ .+.+ ...+..|+||+++++++.|||+|+..++|++++++..
T Consensus 90 D~~vd~-~G~ly~t~~~~~~~~~~~~g~v~~~~~~~~-~~~~-~~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~ 166 (246)
T PF08450_consen 90 DVAVDP-DGNLYVTDSGGGGASGIDPGSVYRIDPDGK-VTVV-ADGLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDAD 166 (246)
T ss_dssp EEEE-T-TS-EEEEEECCBCTTCGGSEEEEEEETTSE-EEEE-EEEESSEEEEEEETTSSEEEEEETTTTEEEEEEEETT
T ss_pred eEEEcC-CCCEEEEecCCCccccccccceEEECCCCe-EEEE-ecCcccccceEECCcchheeecccccceeEEEecccc
Confidence 999997 688999986532 46999999944 3333 3468899999999999999999999999999998643
Q ss_pred -----ceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccEEEE
Q psy6570 149 -----DRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDVLIL 210 (713)
Q Consensus 149 -----~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~ 210 (713)
.++++.........|.||+++ +++||++.+..++|+++++. +.....+......|+.+.+-
T Consensus 167 ~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~-G~~~~~i~~p~~~~t~~~fg 233 (246)
T PF08450_consen 167 GGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADWGGGRIVVFDPD-GKLLREIELPVPRPTNCAFG 233 (246)
T ss_dssp TCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEETTTEEEEEETT-SCEEEEEE-SSSSEEEEEEE
T ss_pred ccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEcCCCEEEEECCC-ccEEEEEcCCCCCEEEEEEE
Confidence 234554443333469999998 68999999999999999988 55555565666678877764
No 5
>KOG4289|consensus
Probab=99.73 E-value=3.9e-17 Score=180.73 Aligned_cols=120 Identities=29% Similarity=0.716 Sum_probs=79.3
Q ss_pred cccCCCCcc-CCCCCCCCCCCCcEEcCCCC--CeeccCCCCCCCCCCcccCCCCCCCCC---CCCCCCCCCCCC----cE
Q psy6570 496 KCMCSPGYS-GKKCDTCTCLNGGTCIPNSK--NNVCKCPSQYTGRRCECAVGDTSCASL---ANKCTPNYCSNN----GT 565 (713)
Q Consensus 496 ~C~C~~G~~-g~~C~~~~C~~~g~C~~~~~--~~~C~C~~g~~G~~C~~~~~~~~c~~~---~~~C~~~~C~~~----~~ 565 (713)
.|.|.+ |. -+.|...||.+.|+|+..++ +|+|.|++||.|..||... +..|+.. ...|.+..|... ..
T Consensus 1707 sC~c~~-~~C~~vC~lnpc~~~g~Cv~sp~a~GY~C~C~~g~~G~~Ce~~~-dq~CPrGWWG~P~CgpC~CavsKgfdp~ 1784 (2531)
T KOG4289|consen 1707 SCPCDP-YNCVDVCSLNPCENQGTCVRSPGAHGYTCECPPGYTGPYCELRA-DQPCPRGWWGFPTCGPCNCAVSKGFDPD 1784 (2531)
T ss_pred cccCCC-CCccchhcccccccCceeecCCCCCceeEECCCcccCcchhhhc-cCCCCCcccCCCCccCccccccCCCCCC
Confidence 566766 31 34566778999999987655 8999999999999999765 5556543 334555555432 36
Q ss_pred EeecCCCceeeCCCCCcCC--CCCcCCCCCCCCCCCCCCCeEecCCCCcceeecCCCcccCCCCc
Q psy6570 566 CVLIEGKPSCKCLPPYSGK--QCTEREDSPSCHNYCDNAGLCSYSKQGKPVCTCVNGWSGITCSE 628 (713)
Q Consensus 566 C~~~~g~~~C~C~~G~~G~--~C~~~~~~~~C~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C~~ 628 (713)
|..+.| +|+|++.++-. .|..+ .|.. =+..-.|. .+++|.|++|-.|+.|..
T Consensus 1785 CnKt~G--~CqCKe~hy~~~~~Cl~C----dC~~-Gs~Sr~C~----adGqC~C~pgaiGRqCdr 1838 (2531)
T KOG4289|consen 1785 CNKTNG--QCQCKENHYRPIGSCLPC----DCYF-GSDSRECD----ADGQCPCKPGAIGRQCDR 1838 (2531)
T ss_pred ccccCc--ceeeccccccCCCcceee----cccc-CCCccccc----CCCcCCCCCccccccccc
Confidence 877766 69999987622 25432 2210 01123454 335799999999998864
No 6
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=99.71 E-value=1e-15 Score=181.21 Aligned_cols=189 Identities=17% Similarity=0.281 Sum_probs=147.0
Q ss_pred CCcccCCceeEEEccCcccEEec---------------CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEE
Q psy6570 1 MASISSGNVTRVKREMNLKTVLS---------------NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRT 65 (713)
Q Consensus 1 vad~~~~~I~~~~~~~~~~~~~~---------------~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~ 65 (713)
|||..+++|+++++++.....+. .+..|.||++|..++.|||+|. .+++|.++++.+..+++
T Consensus 584 VaDs~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt---~n~~Ir~id~~~~~V~t 660 (1057)
T PLN02919 584 ISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADT---ENHALREIDFVNETVRT 660 (1057)
T ss_pred EEECCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeC---CCceEEEEecCCCEEEE
Confidence 67889999999999886533221 2567999999988888999999 78899999998777666
Q ss_pred EEcC----------------CCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEe--------------CCC
Q psy6570 66 LLNT----------------GLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVD--------------NNI 115 (713)
Q Consensus 66 l~~~----------------~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~--------------~~~ 115 (713)
+... .+..|.+|++|+.++.||++|.+++ +|+++++.+....++.. ..+
T Consensus 661 lag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~-~I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~ 739 (1057)
T PLN02919 661 LAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQH-QIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSF 739 (1057)
T ss_pred EeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCC-eEEEEECCCCeEEEEecCCccccCCCCccccccc
Confidence 5321 1568999999998999999999888 99999886554433321 135
Q ss_pred CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEec-------------------CCCCccceeeeee-CCeEEE
Q psy6570 116 QWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHT-------------------EDNGYKPYKLEVF-EDNLYF 175 (713)
Q Consensus 116 ~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~-------------------~~~~~~p~~i~~~-~~~ly~ 175 (713)
..|+||++++.+++||++|..+++|++++++.....++... ...+.+|.+|+++ ++.||+
T Consensus 740 ~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYV 819 (1057)
T PLN02919 740 AQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYV 819 (1057)
T ss_pred cCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEE
Confidence 78999999998889999999999999999876444333210 0124578999987 468999
Q ss_pred EeCCCCcEEEEcccCCCc
Q psy6570 176 STYRTNNILKINKFGNSD 193 (713)
Q Consensus 176 td~~~~~i~~~~~~~~~~ 193 (713)
+|+.+++|.+++..++..
T Consensus 820 ADs~N~rIrviD~~tg~v 837 (1057)
T PLN02919 820 ADSYNHKIKKLDPATKRV 837 (1057)
T ss_pred EECCCCEEEEEECCCCeE
Confidence 999999999999765543
No 7
>KOG4289|consensus
Probab=99.70 E-value=1.9e-16 Score=175.35 Aligned_cols=87 Identities=41% Similarity=1.045 Sum_probs=59.3
Q ss_pred CCCCceeeCCCCcccCCCCccC--CC-CCCCCCCEEEc--CCCeeeCCCCCccCCCC------cCCCCCCCCCcEeccC-
Q psy6570 365 PEPTYKCHCAPSYTGARCESRI--CE-NKCHNGGTCIA--TTQTCVCPPGFTGDTCQ------QCLNLKCQNGGVCVNK- 432 (713)
Q Consensus 365 ~~~~~~C~C~~G~~g~~C~~~~--C~-~~C~~~~~C~~--~~~~C~C~~g~~g~~C~------~C~~~~C~~~~~C~~~- 432 (713)
+.++++|.|++||+|+.|+..+ |- .+|.++|+|.. ++|+|.|.+||+|+.|| .|.+..|.++++|++.
T Consensus 1218 pvnglrCrCPpGFTgd~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~tGehCEvs~~agrCvpGvC~nggtC~~~~ 1297 (2531)
T KOG4289|consen 1218 PVNGLRCRCPPGFTGDYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFTGEHCEVSARAGRCVPGVCKNGGTCVNLL 1297 (2531)
T ss_pred ccCceeEeCCCCCCcccccchhHhhhcCCCCCCCceEEecCceeEEecCCccccceeeecccCccccceecCCCEEeecC
Confidence 3446777777777777776543 53 67777777766 67777777777777777 5777777777777664
Q ss_pred CCCccccCCCC-CcCCCCcc
Q psy6570 433 TTGLECDCPKF-YYGKNCQY 451 (713)
Q Consensus 433 ~~~~~C~C~~G-~~g~~C~~ 451 (713)
++++.|.|+.| |++..|+.
T Consensus 1298 nggf~c~Cp~ge~e~prC~v 1317 (2531)
T KOG4289|consen 1298 NGGFCCHCPYGEFEDPRCEV 1317 (2531)
T ss_pred CCceeccCCCcccCCCceEE
Confidence 44577777776 33555553
No 8
>KOG0994|consensus
Probab=99.66 E-value=2.4e-15 Score=164.31 Aligned_cols=309 Identities=26% Similarity=0.598 Sum_probs=172.9
Q ss_pred Cceee-eCCCCccccccCCCCCcccCCCCCCCCCccCCCCCCCCCCcccCCCCCCCccc-cCCCCccCC-CCCCCC---C
Q psy6570 240 SHTCL-CPDHLTEELNVTSGKMSCKVAPARTCYLDCNHGNLPSSHTCLCPDHLTEELNV-TSGKMSCKV-APARTC---Y 313 (713)
Q Consensus 240 ~~~C~-C~~G~~~~~~~~~~~c~C~~~~~~~~~~~~~~~~~~~~~~c~C~~~~~~~~c~-C~~g~~~~~-~~~~~C---~ 313 (713)
+.+|. |.+|+.|.+..+...|.|..- +.-...| ..-+..|.|..+..+..|. |.+||+|.. |.+-.| +
T Consensus 792 GR~CdqCApGtyGFGPsGCk~CdC~~~--Gs~~~~C----d~~tGQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqCNgHA 865 (1758)
T KOG0994|consen 792 GRRCDQCAPGTYGFGPSGCKACDCNSI--GSLDKYC----DKITGQCQCRPGTYGRQCNQCQPGYWGFPECRPCQCNGHA 865 (1758)
T ss_pred cccccccCCcccCcCCccCcccccccc--ccccccc----cccccceeeccccchhhccccCCCccCCCcCccccccCcc
Confidence 34444 555665555555555555431 1111122 1235678998888777776 999999863 444444 3
Q ss_pred CCCC--CCeeec---CCCCCCee-ecCCCcccCCc--cccCCCCCCCCCC--------ceeeCCCCCCCCCceeeCCCCc
Q psy6570 314 LDCN--HGTCEF---DDDFDPHC-ICQENFYGTYC--EKVNNSMCPCLNQ--------GMCYPDLTHPEPTYKCHCAPSY 377 (713)
Q Consensus 314 ~~C~--~~~C~~---~~~~~~~C-~C~~g~~G~~C--~~~~c~~~~C~~~--------~~C~~~~~~~~~~~~C~C~~G~ 377 (713)
+.|. .|.|+. ...+ +.| .|..||+|+.= ..+-|.++||..+ ..|..... .....|.|.+||
T Consensus 866 ~~Cd~~tGaCi~CqD~T~G-~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~--t~~ivC~C~~GY 942 (1758)
T KOG0994|consen 866 DTCDPITGACIDCQDSTTG-HSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTR--TQQIVCHCQEGY 942 (1758)
T ss_pred cccCccccccccccccccc-cchhhhhccccCCcccCCCCCCCCCCCCCCCccchhcccccccccc--ccceeeecccCc
Confidence 4555 345544 3333 667 79999998732 2233666677642 24654322 224679999999
Q ss_pred ccCCCCccCCC-----CCCCCCCEEEcCCCeeeCCCCCc---cCCCCcCCCCCCCCCcEeccCCCCcccc-CCCCCcCC-
Q psy6570 378 TGARCESRICE-----NKCHNGGTCIATTQTCVCPPGFT---GDTCQQCLNLKCQNGGVCVNKTTGLECD-CPKFYYGK- 447 (713)
Q Consensus 378 ~g~~C~~~~C~-----~~C~~~~~C~~~~~~C~C~~g~~---g~~C~~C~~~~C~~~~~C~~~~~~~~C~-C~~G~~g~- 447 (713)
+|.+|+. |. ++=. +|+|. .|.|...-. -..|.. ....| -.|.....+.+|+ |.+||+|.
T Consensus 943 ~G~RCe~--CA~~~fGnP~~-GGtCq----~CeC~~NiD~~d~~aCD~-~TG~C---LkCL~hTeG~hCe~Ck~Gf~GdA 1011 (1758)
T KOG0994|consen 943 SGSRCEI--CADNHFGNPSE-GGTCQ----KCECSNNIDLYDPGACDV-ATGAC---LKCLYHTEGDHCEHCKDGFYGDA 1011 (1758)
T ss_pred cccchhh--hcccccCCccc-CCccc----cccccCCcCccCCCccch-hhchh---hhhhhcccccchhhccccchhHH
Confidence 9999874 32 2221 44443 244443321 112220 00001 1244445566784 99999985
Q ss_pred ---CCccCCCCCCCCCCeeecCCCCCeecCCCCccCCCCCCccccCCCCCCcccCCCCcc----CCCCCCCCCCCCc--E
Q psy6570 448 ---NCQYSQCKNYCVNGECSITDSGPKCMCSPGYSGKKCDTCTCLNGDSGPKCMCSPGYS----GKKCDTCTCLNGG--T 518 (713)
Q Consensus 448 ---~C~~~~C~~~~~~~~C~~~~~~~~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~G~~----g~~C~~~~C~~~g--~ 518 (713)
+|+...|...-.+.+|.-..-+.+|.|.|...|..|+. |.+.++ |..|+++.|...+ +
T Consensus 1012 ~~q~CqrC~Cn~LGTn~~~~CDr~tGQCpClpNv~G~~CDq-------------CA~N~w~laSG~GCe~C~Cd~~~~pq 1078 (1758)
T KOG0994|consen 1012 LRQNCQRCVCNFLGTNSTCHCDRFTGQCPCLPNVQGVRCDQ-------------CAENHWNLASGEGCEPCNCDPIGGPQ 1078 (1758)
T ss_pred HHhhhhhheccccccCCccccccccCcCCCCcccccccccc-------------cccchhccccCCCCCccCCCccCCcc
Confidence 56655565432222333333344999999999999986 555553 6677777775422 4
Q ss_pred EcCCCCCeeccCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCc----EEeecCCCceeeCCCCCcCCCC
Q psy6570 519 CIPNSKNNVCKCPSQYTGRRCECAVGDTSCASLANKCTPNYCSNNG----TCVLIEGKPSCKCLPPYSGKQC 586 (713)
Q Consensus 519 C~~~~~~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~C~~~~C~~~~----~C~~~~g~~~C~C~~G~~G~~C 586 (713)
|..- +++|+|.+||-|..|......++-.+ ...|....|...| +|....| +|.|.+|..|..|
T Consensus 1079 CN~f--tGQCqCkpGfGGR~C~qCqel~WGdP-~~~C~aCdCd~rG~~tpQCdr~tG--~C~C~~Gv~G~rC 1145 (1758)
T KOG0994|consen 1079 CNEF--TGQCQCKPGFGGRTCSQCQELYWGDP-NEKCRACDCDPRGIETPQCDRATG--RCVCRPGVGGPRC 1145 (1758)
T ss_pred cccc--ccceeccCCCCCcchhHHHHhhcCCC-CCCceecCCCCCCCCCCCccccCC--ceeecCCCCCcch
Confidence 5433 67999999999999863211111000 1234444454444 3554433 5777777777666
No 9
>KOG1225|consensus
Probab=99.65 E-value=2.1e-15 Score=159.76 Aligned_cols=129 Identities=38% Similarity=0.961 Sum_probs=96.6
Q ss_pred CCcccCCCCccCCCCCCCCCCC----CcEEcCCCCCeeccCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCcEEeec
Q psy6570 494 GPKCMCSPGYSGKKCDTCTCLN----GGTCIPNSKNNVCKCPSQYTGRRCECAVGDTSCASLANKCTPNYCSNNGTCVLI 569 (713)
Q Consensus 494 ~~~C~C~~G~~g~~C~~~~C~~----~g~C~~~~~~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~C~~~~C~~~~~C~~~ 569 (713)
...|.|+.+|+|..|....|.+ ++.|++ ++|+|++||+|..|+.. .|... |+.++.|++
T Consensus 233 ~~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~----G~CIC~~Gf~G~dC~e~-----------~Cp~~-cs~~g~~~~- 295 (525)
T KOG1225|consen 233 DGICECPEGYFGPLCSTIYCPGGCTGRGQCVE----GRCICPPGFTGDDCDEL-----------VCPVD-CSGGGVCVD- 295 (525)
T ss_pred CceeecCCceeCCccccccCCCCCcccceEeC----CeEeCCCCCcCCCCCcc-----------cCCcc-cCCCceecC-
Confidence 3467777777777776555543 345653 58899999999888743 35444 777777773
Q ss_pred CCCceeeCCCCCcCCCCCcCCCCCCCCCCCCCCCeEecCCCCcceeecCCCcccCCCCcCCCCCCCCCCCCeecCCCCcc
Q psy6570 570 EGKPSCKCLPPYSGKQCTEREDSPSCHNYCDNAGLCSYSKQGKPVCTCVNGWSGITCSERVSCAHFCFNGGTCREQNYSL 649 (713)
Q Consensus 570 ~g~~~C~C~~G~~G~~C~~~~~~~~C~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C~~~~~C~~~C~~~~~C~~~~~~~ 649 (713)
| +|.|++||+|+.|+. ..|+.+|.++|.|++. +|.|.+||+|..|+.. .|.+++.|++
T Consensus 296 -g--~CiC~~g~~G~dCs~----~~cpadC~g~G~Ci~G-----~C~C~~Gy~G~~C~~~-----~C~~~g~cv~----- 353 (525)
T KOG1225|consen 296 -G--ECICNPGYSGKDCSI----RRCPADCSGHGKCIDG-----ECLCDEGYTGELCIQR-----ACSGGGQCVN----- 353 (525)
T ss_pred -C--EeecCCCcccccccc----ccCCccCCCCCcccCC-----ceEeCCCCcCCccccc-----ccCCCceecc-----
Confidence 2 799999999998876 4588889889999833 3999999999888875 2778888887
Q ss_pred CCCCCceeeCCCCcccCC
Q psy6570 650 DPDLKPICICPRGYAGVR 667 (713)
Q Consensus 650 ~~~~~~~C~C~~Gy~G~~ 667 (713)
+ |+|..||.|.+
T Consensus 354 -----g-C~C~~Gw~G~d 365 (525)
T KOG1225|consen 354 -----G-CKCKKGWRGPD 365 (525)
T ss_pred -----C-ceeccCccCCC
Confidence 6 89999998887
No 10
>KOG1217|consensus
Probab=99.64 E-value=5.9e-14 Score=156.56 Aligned_cols=295 Identities=33% Similarity=0.766 Sum_probs=213.7
Q ss_pred CcccCCCCCCCccccCCCCccCCCCCC-CCC-CC---CCCCeeecCC--CCCCeeecCCCcccCCcccc--CCC--CCCC
Q psy6570 284 TCLCPDHLTEELNVTSGKMSCKVAPAR-TCY-LD---CNHGTCEFDD--DFDPHCICQENFYGTYCEKV--NNS--MCPC 352 (713)
Q Consensus 284 ~c~C~~~~~~~~c~C~~g~~~~~~~~~-~C~-~~---C~~~~C~~~~--~~~~~C~C~~g~~G~~C~~~--~c~--~~~C 352 (713)
...+......+.|.|..||.+..++.. +|. .+ +..+.|.... ...+.|.|..||.+..|+.. .|. ..+|
T Consensus 100 ~~~~~~~~~~~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c 179 (487)
T KOG1217|consen 100 CGECVDCVGSYECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPC 179 (487)
T ss_pred CccccCCCCCceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCc
Confidence 345556777888999999999887765 573 22 3455777542 24578999999999998864 465 4469
Q ss_pred CCCceeeCCCCCCCCCceeeCCCCcccCCCCccCCCCCCCCCCEEEcCCCeeeCCCCCccCCCC----cCCCCCCCCCcE
Q psy6570 353 LNQGMCYPDLTHPEPTYKCHCAPSYTGARCESRICENKCHNGGTCIATTQTCVCPPGFTGDTCQ----QCLNLKCQNGGV 428 (713)
Q Consensus 353 ~~~~~C~~~~~~~~~~~~C~C~~G~~g~~C~~~~C~~~C~~~~~C~~~~~~C~C~~g~~g~~C~----~C~~~~C~~~~~ 428 (713)
.+.+.|.+... +|.|.|++||.+..|+.. .+++.|+.. ..|.+.+|+.+..|+ +|... + +.
T Consensus 180 ~~~~~C~~~~~----~~~C~c~~~~~~~~~~~~------~~~~~c~~~-~~~~~~~g~~~~~c~~~~~~~~~~---~-~~ 244 (487)
T KOG1217|consen 180 QNGGTCVNTGG----SYLCSCPPGYTGSTCETT------GNGGTCVDS-VACSCPPGARGPECEVSIVECASG---D-GT 244 (487)
T ss_pred CCCcccccCCC----CeeEeCCCCccCCcCcCC------CCCceEecc-eeccCCCCCCCCCcccccccccCC---C-Cc
Confidence 99999988765 599999999999988754 455677763 789999999988877 23322 4 89
Q ss_pred eccCCCCccccCCCCCcCCCC----ccCCCCCC--C-CCCeeecCCCCCeecCCCCccCCCCCCccccCCCCCCcccCCC
Q psy6570 429 CVNKTTGLECDCPKFYYGKNC----QYSQCKNY--C-VNGECSITDSGPKCMCSPGYSGKKCDTCTCLNGDSGPKCMCSP 501 (713)
Q Consensus 429 C~~~~~~~~C~C~~G~~g~~C----~~~~C~~~--~-~~~~C~~~~~~~~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~ 501 (713)
|.+..++|.|.|++||.+..+ .+++|... + +++.|.+..+.|.|.|++||.|..| ..|.. ...|..
T Consensus 245 c~~~~~~~~C~~~~g~~~~~~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~--~~~~~-----~~~C~~ 317 (487)
T KOG1217|consen 245 CVNTVGSYTCRCPEGYTGDACVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLC--TECVD-----VDECSP 317 (487)
T ss_pred ccccCCceeeeCCCCccccccceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCC--ccccc-----cccccc
Confidence 999999999999999998873 36677664 3 4689999998899999999999987 11111 112332
Q ss_pred CccCCCCCCCCCCCCcEEc--CCCCCeeccCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCcEEee-cCCCceeeCC
Q psy6570 502 GYSGKKCDTCTCLNGGTCI--PNSKNNVCKCPSQYTGRRCECAVGDTSCASLANKCTPNYCSNNGTCVL-IEGKPSCKCL 578 (713)
Q Consensus 502 G~~g~~C~~~~C~~~g~C~--~~~~~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~C~~~~C~~~~~C~~-~~g~~~C~C~ 578 (713)
++.+ .+|.+++.|. .....+.|.|..+|.|..|+.. .++|...+|..++.|.+ ..+++.|.|+
T Consensus 318 ~~~~-----~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~---------~~~C~~~~~~~~~~c~~~~~~~~~c~~~ 383 (487)
T KOG1217|consen 318 RNAG-----GPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDS---------NDECASSPCCPGGTCVNETPGSYRCACP 383 (487)
T ss_pred cccC-----CcCCCCcccccCCCCCCCCcCCCCCCCCCccccC---------CccccCCccccCCEeccCCCCCeEecCC
Confidence 2222 3466767772 3334678999999999999844 13677777888999999 6889999999
Q ss_pred CCCcCC---CCCcCCCCCCCCCCCCCCCeEecCCCCcceeecCCC
Q psy6570 579 PPYSGK---QCTEREDSPSCHNYCDNAGLCSYSKQGKPVCTCVNG 620 (713)
Q Consensus 579 ~G~~G~---~C~~~~~~~~C~~~C~~~g~C~~~~~g~~~C~C~~G 620 (713)
.+|.+. ......+.++|.. .+.|.+.. +++.|. .++
T Consensus 384 ~~~~~~~~~~~~~~~~~~~c~~----~~~c~~~~-~~~~c~-~~~ 422 (487)
T KOG1217|consen 384 AGFAGKANGDGVGCEDIDECSG----CGDCVNGP-GGGACT-PPG 422 (487)
T ss_pred CccccCCccccccccccccccC----CcceeccC-CCCccc-cCc
Confidence 999984 2222222355533 45676664 778888 773
No 11
>KOG1217|consensus
Probab=99.61 E-value=3.9e-14 Score=157.98 Aligned_cols=254 Identities=36% Similarity=0.923 Sum_probs=168.4
Q ss_pred CceeeCCCCcccCCCCcc-CCCC-C--CCCCCEEEc-----CCCeeeCCCCCccCCCC----cCC--CCCCCCCcEeccC
Q psy6570 368 TYKCHCAPSYTGARCESR-ICEN-K--CHNGGTCIA-----TTQTCVCPPGFTGDTCQ----QCL--NLKCQNGGVCVNK 432 (713)
Q Consensus 368 ~~~C~C~~G~~g~~C~~~-~C~~-~--C~~~~~C~~-----~~~~C~C~~g~~g~~C~----~C~--~~~C~~~~~C~~~ 432 (713)
.+.|.|++||.+..|+.. .|.. + +...+.|.. ..+.|.|..||.+..++ +|. ..+|.+++.|.+.
T Consensus 109 ~~~c~c~~g~~~~~~~~~~~C~~~~~~~~~~~~c~~~~~~~~~~~c~C~~g~~~~~~~~~~~~C~~~~~~c~~~~~C~~~ 188 (487)
T KOG1217|consen 109 SYECTCPPGYQGTPCEGECECVTGPGVCCIDGSCSNGPGSVGPFRCSCTEGYEGEPCETDLDECIQYSSPCQNGGTCVNT 188 (487)
T ss_pred CceeeCCCccccCcCCcceeecCCCCCeeCchhhcCCCCCCCceeeeeCCCcccccccccccccccCCCCcCCCcccccC
Confidence 566777777776666543 2431 1 123344443 24667777777766655 454 2347777777777
Q ss_pred CCCccccCCCCCcCCCCccCCCCCCCCCCeeecCCCCCeecCCCCccCCCCCCccccCCCCCCcccCCCCccCCCCCCCC
Q psy6570 433 TTGLECDCPKFYYGKNCQYSQCKNYCVNGECSITDSGPKCMCSPGYSGKKCDTCTCLNGDSGPKCMCSPGYSGKKCDTCT 512 (713)
Q Consensus 433 ~~~~~C~C~~G~~g~~C~~~~C~~~~~~~~C~~~~~~~~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~G~~g~~C~~~~ 512 (713)
.++|.|.|++||.+..|+.. ...+.|.+. +.|.+.+|+.+..|... ...
T Consensus 189 ~~~~~C~c~~~~~~~~~~~~-----~~~~~c~~~---~~~~~~~g~~~~~c~~~-----------------------~~~ 237 (487)
T KOG1217|consen 189 GGSYLCSCPPGYTGSTCETT-----GNGGTCVDS---VACSCPPGARGPECEVS-----------------------IVE 237 (487)
T ss_pred CCCeeEeCCCCccCCcCcCC-----CCCceEecc---eeccCCCCCCCCCcccc-----------------------ccc
Confidence 77777777777777766532 122334333 34555555555554431 112
Q ss_pred CCCC-cEEcCCCCCeeccCCCCCCCCCC--cccCCCCCCCCCCCCCCCCC-CCCCcEEeecCCCceeeCCCCCcCCCCCc
Q psy6570 513 CLNG-GTCIPNSKNNVCKCPSQYTGRRC--ECAVGDTSCASLANKCTPNY-CSNNGTCVLIEGKPSCKCLPPYSGKQCTE 588 (713)
Q Consensus 513 C~~~-g~C~~~~~~~~C~C~~g~~G~~C--~~~~~~~~c~~~~~~C~~~~-C~~~~~C~~~~g~~~C~C~~G~~G~~C~~ 588 (713)
|..+ ++|.+..+.++|.|++||.+..+ ..+ .++|.... |.++++|++..+.|.|.|++||.|..|..
T Consensus 238 ~~~~~~~c~~~~~~~~C~~~~g~~~~~~~~~~~---------~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~~~~ 308 (487)
T KOG1217|consen 238 CASGDGTCVNTVGSYTCRCPEGYTGDACVTCVD---------VDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRLCTE 308 (487)
T ss_pred ccCCCCcccccCCceeeeCCCCccccccceeee---------ccccCCCCccCCCCeeecCCCcceeeCCCCCCCCCCcc
Confidence 3322 78888888899999999998874 111 46777653 88899999999889999999999999822
Q ss_pred CCCCCCC-----CCCCCCCCeE-ecCCCCcceeecCCCcccCCCCcCC-CCC-CCCCCCCeecCCCCccCCCCCceeeCC
Q psy6570 589 REDSPSC-----HNYCDNAGLC-SYSKQGKPVCTCVNGWSGITCSERV-SCA-HFCFNGGTCREQNYSLDPDLKPICICP 660 (713)
Q Consensus 589 ~~~~~~C-----~~~C~~~g~C-~~~~~g~~~C~C~~G~~G~~C~~~~-~C~-~~C~~~~~C~~~~~~~~~~~~~~C~C~ 660 (713)
..+..+| ...|.+++.| .....+.+.|.|..||.|..|+... +|. ..|..++.|... ..+.+.|.|+
T Consensus 309 ~~~~~~C~~~~~~~~c~~g~~C~~~~~~~~~~C~c~~~~~g~~C~~~~~~C~~~~~~~~~~c~~~-----~~~~~~c~~~ 383 (487)
T KOG1217|consen 309 CVDVDECSPRNAGGPCANGGTCNTLGSFGGFRCACGPGFTGRRCEDSNDECASSPCCPGGTCVNE-----TPGSYRCACP 383 (487)
T ss_pred ccccccccccccCCcCCCCcccccCCCCCCCCcCCCCCCCCCccccCCccccCCccccCCEeccC-----CCCCeEecCC
Confidence 2222455 3458887788 2222356789999999999999874 787 457888888862 3456899999
Q ss_pred CCcccC
Q psy6570 661 RGYAGV 666 (713)
Q Consensus 661 ~Gy~G~ 666 (713)
.+|.+.
T Consensus 384 ~~~~~~ 389 (487)
T KOG1217|consen 384 AGFAGK 389 (487)
T ss_pred CccccC
Confidence 999985
No 12
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=99.61 E-value=6.5e-14 Score=166.02 Aligned_cols=188 Identities=16% Similarity=0.193 Sum_probs=143.4
Q ss_pred CCcccCCceeEEEccCcc-cEEe-----------------cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce
Q psy6570 1 MASISSGNVTRVKREMNL-KTVL-----------------SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK 62 (713)
Q Consensus 1 vad~~~~~I~~~~~~~~~-~~~~-----------------~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~ 62 (713)
|||..+++|+++++.+.. +++. ..+..|.+|++|+.++.|||+|. ..++|.++++.+..
T Consensus 640 VaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~---~~~~I~v~d~~~g~ 716 (1057)
T PLN02919 640 VADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMA---GQHQIWEYNISDGV 716 (1057)
T ss_pred EEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEEC---CCCeEEEEECCCCe
Confidence 467778889999876643 2221 12678999999998899999999 78999999986655
Q ss_pred EEEEEc--------------CCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEe----------------
Q psy6570 63 KRTLLN--------------TGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVD---------------- 112 (713)
Q Consensus 63 ~~~l~~--------------~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~---------------- 112 (713)
..++.. ..+..|.+|++++.+++||++|..++ +|+++++++.....++.
T Consensus 717 v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~-~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~d 795 (1057)
T PLN02919 717 TRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESS-SIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHD 795 (1057)
T ss_pred EEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCC-eEEEEECCCCcEEEEEecccccCcccccccCCC
Confidence 443321 12568999999998888999999888 99999988654433321
Q ss_pred -----CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecC-----------CCCccceeeeee-CCeEEE
Q psy6570 113 -----NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTE-----------DNGYKPYKLEVF-EDNLYF 175 (713)
Q Consensus 113 -----~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~-----------~~~~~p~~i~~~-~~~ly~ 175 (713)
..+..|.||++|+ ++.||++|..+++|++++.++.....+.... ..+..|.+|+++ +++||+
T Consensus 796 G~g~~~~l~~P~Gvavd~-dG~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyV 874 (1057)
T PLN02919 796 GVGSEVLLQHPLGVLCAK-DGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFV 874 (1057)
T ss_pred CchhhhhccCCceeeEeC-CCcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEE
Confidence 1245799999995 5679999999999999999877665554321 135679999987 568999
Q ss_pred EeCCCCcEEEEcccCCCc
Q psy6570 176 STYRTNNILKINKFGNSD 193 (713)
Q Consensus 176 td~~~~~i~~~~~~~~~~ 193 (713)
+|..+++|.+++...+..
T Consensus 875 aDt~Nn~Irvid~~~~~~ 892 (1057)
T PLN02919 875 ADTNNSLIRYLDLNKGEA 892 (1057)
T ss_pred EECCCCEEEEEECCCCcc
Confidence 999999999999876543
No 13
>KOG1219|consensus
Probab=99.61 E-value=1.1e-15 Score=175.23 Aligned_cols=155 Identities=30% Similarity=0.624 Sum_probs=116.3
Q ss_pred CCCCCCCCCCCCCCCCcEEeecC-CCceeeCCCCCcCCCCCcCCCCCCC-CCCCCCCCeEecCCCCcceeecCCCcccCC
Q psy6570 548 CASLANKCTPNYCSNNGTCVLIE-GKPSCKCLPPYSGKQCTEREDSPSC-HNYCDNAGLCSYSKQGKPVCTCVNGWSGIT 625 (713)
Q Consensus 548 c~~~~~~C~~~~C~~~~~C~~~~-g~~~C~C~~G~~G~~C~~~~~~~~C-~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~ 625 (713)
|....+.|..+||.++|+|...+ |+|+|.|++-|.|+.|+.. ...| .++|..+|+|+... ++|.|.|+.||+|..
T Consensus 3860 C~l~~d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~--~epC~snPC~~GgtCip~~-n~f~CnC~~gyTG~~ 3936 (4289)
T KOG1219|consen 3860 CSLLTDPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID--LEPCASNPCLTGGTCIPFY-NGFLCNCPNGYTGKR 3936 (4289)
T ss_pred ccccccccccCcccCCCEecCCCCCceEEeCcccccCcccccc--cccccCCCCCCCCEEEecC-CCeeEeCCCCccCce
Confidence 54446889999999999999875 6899999999999999864 5788 88999999998776 889999999999999
Q ss_pred CCcC--CCCC-CCCCCCCeecCCCCccCCCCCceeeCCCCcccCCCCccccccccccccc-cchhHHHHHHHHHHHHHHh
Q psy6570 626 CSER--VSCA-HFCFNGGTCREQNYSLDPDLKPICICPRGYAGVRCQTLVHYISKKQSYV-NSHISSILILILLLITVGG 701 (713)
Q Consensus 626 C~~~--~~C~-~~C~~~~~C~~~~~~~~~~~~~~C~C~~Gy~G~~C~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 701 (713)
|+.. ++|. ++|.++|.|.+ ..++|.|.|.+||.|..|....+.....-.+. .+-+++|++++++ |++++
T Consensus 3937 Ce~~Gi~eCs~n~C~~gg~C~n------~~gsf~CncT~g~~gr~c~~~~pni~~~~~~~gkaEli~I~V~l~~-ifilv 4009 (4289)
T KOG1219|consen 3937 CEARGISECSKNVCGTGGQCIN------IPGSFHCNCTPGILGRTCCAEKPNILSTVLWLGKAELIIIIVLLAL-IFILV 4009 (4289)
T ss_pred eecccccccccccccCCceeec------cCCceEeccChhHhcccCccccCccccccchhcccceeehhHHHHH-HHHHH
Confidence 9963 5698 89999999987 55679999999999999966655544433331 1122222222222 33333
Q ss_pred heeeEEEEecC
Q psy6570 702 IGYYIFRIKMS 712 (713)
Q Consensus 702 ~~~~~~~~~~~ 712 (713)
+++|+.|+|.+
T Consensus 4010 vlf~~crKk~~ 4020 (4289)
T KOG1219|consen 4010 VLFWKCRKKNS 4020 (4289)
T ss_pred HHHHhhhhhcc
Confidence 35666665543
No 14
>KOG0994|consensus
Probab=99.56 E-value=3.1e-14 Score=155.81 Aligned_cols=253 Identities=30% Similarity=0.798 Sum_probs=152.5
Q ss_pred ceeeCCCCcccCCCCccC----------CC-CCCCC----CCEEEcCCCeeeCCCCCccCCCCcC----------CCCCC
Q psy6570 369 YKCHCAPSYTGARCESRI----------CE-NKCHN----GGTCIATTQTCVCPPGFTGDTCQQC----------LNLKC 423 (713)
Q Consensus 369 ~~C~C~~G~~g~~C~~~~----------C~-~~C~~----~~~C~~~~~~C~C~~g~~g~~C~~C----------~~~~C 423 (713)
..|+|.|+..|.+|+... |. -.|.. +..|...++.|.|.+|-.|..|..| .+..|
T Consensus 782 GqCqCkPnVVGR~CdqCApGtyGFGPsGCk~CdC~~~Gs~~~~Cd~~tGQC~C~~g~ygrqCnqCqpG~WgFPeCr~CqC 861 (1758)
T KOG0994|consen 782 GQCQCKPNVVGRRCDQCAPGTYGFGPSGCKACDCNSIGSLDKYCDKITGQCQCRPGTYGRQCNQCQPGYWGFPECRPCQC 861 (1758)
T ss_pred ceecccCccccccccccCCcccCcCCccCccccccccccccccccccccceeeccccchhhccccCCCccCCCcCccccc
Confidence 479999999998887421 11 11222 2345557889999888888777643 33345
Q ss_pred CCCc-----------EeccCCCCccc-cCCCCCcCC-------CCccCCCCCCC-----CCCeeecCC--CCCeecCCCC
Q psy6570 424 QNGG-----------VCVNKTTGLEC-DCPKFYYGK-------NCQYSQCKNYC-----VNGECSITD--SGPKCMCSPG 477 (713)
Q Consensus 424 ~~~~-----------~C~~~~~~~~C-~C~~G~~g~-------~C~~~~C~~~~-----~~~~C~~~~--~~~~C~C~~G 477 (713)
+.++ .|.+..+++.| .|..||+|+ .|+...|.... ....|...+ ....|.|.+|
T Consensus 862 NgHA~~Cd~~tGaCi~CqD~T~G~~CdrCl~GyyGdP~lg~g~~CrPCpCP~gp~Sg~~~A~sC~~d~~t~~ivC~C~~G 941 (1758)
T KOG0994|consen 862 NGHADTCDPITGACIDCQDSTTGHSCDRCLDGYYGDPRLGSGIGCRPCPCPDGPASGRQHADSCYLDTRTQQIVCHCQEG 941 (1758)
T ss_pred cCcccccCccccccccccccccccchhhhhccccCCcccCCCCCCCCCCCCCCCccchhccccccccccccceeeecccC
Confidence 4443 25566778889 599999864 57766666542 112344322 2258999999
Q ss_pred ccCCCCCCc-----------------cccCC----------------------CCCCcc-cCCCCccCC----CCCCCCC
Q psy6570 478 YSGKKCDTC-----------------TCLNG----------------------DSGPKC-MCSPGYSGK----KCDTCTC 513 (713)
Q Consensus 478 ~~g~~C~~~-----------------~C~~~----------------------~~~~~C-~C~~G~~g~----~C~~~~C 513 (713)
|+|.+|+.+ .|.+. ..+..| .|.+||+|+ .|..+.|
T Consensus 942 Y~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~Ck~Gf~GdA~~q~CqrC~C 1021 (1758)
T KOG0994|consen 942 YSGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEHCKDGFYGDALRQNCQRCVC 1021 (1758)
T ss_pred ccccchhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhhccccchhHHHHhhhhhhec
Confidence 999888643 23332 223455 588999985 4666656
Q ss_pred CCC-----cEEcCCCCCeeccCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCc--EEeecCCCceeeCCCCCcCCCC
Q psy6570 514 LNG-----GTCIPNSKNNVCKCPSQYTGRRCECAVGDTSCASLANKCTPNYCSNNG--TCVLIEGKPSCKCLPPYSGKQC 586 (713)
Q Consensus 514 ~~~-----g~C~~~~~~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~C~~~~C~~~~--~C~~~~g~~~C~C~~G~~G~~C 586 (713)
.-- +.|.. -+++|.|.+...|.+|+......+-...+..|++..|...+ +|...+| +|+|.+||-|+.|
T Consensus 1022 n~LGTn~~~~CDr--~tGQCpClpNv~G~~CDqCA~N~w~laSG~GCe~C~Cd~~~~pqCN~ftG--QCqCkpGfGGR~C 1097 (1758)
T KOG0994|consen 1022 NFLGTNSTCHCDR--FTGQCPCLPNVQGVRCDQCAENHWNLASGEGCEPCNCDPIGGPQCNEFTG--QCQCKPGFGGRTC 1097 (1758)
T ss_pred cccccCCcccccc--ccCcCCCCcccccccccccccchhccccCCCCCccCCCccCCcccccccc--ceeccCCCCCcch
Confidence 422 23433 36789999999999987332222222224456655565432 6776665 7999999999998
Q ss_pred CcCCC------CCCC-CCCCCCCC----eEecCCCCcceeecCCCcccCCCCc
Q psy6570 587 TERED------SPSC-HNYCDNAG----LCSYSKQGKPVCTCVNGWSGITCSE 628 (713)
Q Consensus 587 ~~~~~------~~~C-~~~C~~~g----~C~~~~~g~~~C~C~~G~~G~~C~~ 628 (713)
....+ ...| .-.|+..| +|.. .+++|.|.+|..|..|..
T Consensus 1098 ~qCqel~WGdP~~~C~aCdCd~rG~~tpQCdr---~tG~C~C~~Gv~G~rCdq 1147 (1758)
T KOG0994|consen 1098 SQCQELYWGDPNEKCRACDCDPRGIETPQCDR---ATGRCVCRPGVGGPRCDQ 1147 (1758)
T ss_pred hHHHHhhcCCCCCCceecCCCCCCCCCCCccc---cCCceeecCCCCCcchhh
Confidence 75421 1123 12233322 3432 234577777777776643
No 15
>KOG1219|consensus
Probab=99.54 E-value=1.7e-14 Score=165.76 Aligned_cols=112 Identities=38% Similarity=0.888 Sum_probs=101.8
Q ss_pred CCCCCCCCCCCcEEcCCCC-CeeccCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCcEEeecCCCceeeCCCCCcCC
Q psy6570 506 KKCDTCTCLNGGTCIPNSK-NNVCKCPSQYTGRRCECAVGDTSCASLANKCTPNYCSNNGTCVLIEGKPSCKCLPPYSGK 584 (713)
Q Consensus 506 ~~C~~~~C~~~g~C~~~~~-~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~ 584 (713)
+.|...||+++|+|...+. .|.|.|++-|.|..||.. ...|.++||..+|+|+...+.|.|.|+.||+|.
T Consensus 3865 d~C~~npCqhgG~C~~~~~ggy~CkCpsqysG~~CEi~---------~epC~snPC~~GgtCip~~n~f~CnC~~gyTG~ 3935 (4289)
T KOG1219|consen 3865 DPCNDNPCQHGGTCISQPKGGYKCKCPSQYSGNHCEID---------LEPCASNPCLTGGTCIPFYNGFLCNCPNGYTGK 3935 (4289)
T ss_pred cccccCcccCCCEecCCCCCceEEeCcccccCcccccc---------cccccCCCCCCCCEEEecCCCeeEeCCCCccCc
Confidence 5688889999999998754 899999999999999977 689999999999999999999999999999999
Q ss_pred CCCcCCCCCCC-CCCCCCCCeEecCCCCcceeecCCCcccCCCCc
Q psy6570 585 QCTEREDSPSC-HNYCDNAGLCSYSKQGKPVCTCVNGWSGITCSE 628 (713)
Q Consensus 585 ~C~~~~~~~~C-~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C~~ 628 (713)
+|+.. .+++| .+.|.++|+|.+.. |+|.|.|.+||.|+.|..
T Consensus 3936 ~Ce~~-Gi~eCs~n~C~~gg~C~n~~-gsf~CncT~g~~gr~c~~ 3978 (4289)
T KOG1219|consen 3936 RCEAR-GISECSKNVCGTGGQCINIP-GSFHCNCTPGILGRTCCA 3978 (4289)
T ss_pred eeecc-cccccccccccCCceeeccC-CceEeccChhHhcccCcc
Confidence 99874 35789 68899999999887 999999999999998844
No 16
>KOG1225|consensus
Probab=99.50 E-value=2.2e-13 Score=144.50 Aligned_cols=175 Identities=31% Similarity=0.782 Sum_probs=125.1
Q ss_pred CCCccccCCCCccCCCCCCCCCCCCC-CCeeecCCCCCCeeecCCCcccCCccccCCCCCCCCCCc-----eeeCCCCCC
Q psy6570 292 TEELNVTSGKMSCKVAPARTCYLDCN-HGTCEFDDDFDPHCICQENFYGTYCEKVNNSMCPCLNQG-----MCYPDLTHP 365 (713)
Q Consensus 292 ~~~~c~C~~g~~~~~~~~~~C~~~C~-~~~C~~~~~~~~~C~C~~g~~G~~C~~~~c~~~~C~~~~-----~C~~~~~~~ 365 (713)
..+.|.+.+++.+..+....+...+. ++.+.. +.+.+..+|+|..|....+.. ++...+ .+.......
T Consensus 159 ~~~~c~~~~~~~~~~~g~~~~~~~~~~hg~~~~-----~~~l~~~~~s~~~~~~~~~~~-~~~~~~r~~~~~~~~~~~~~ 232 (525)
T KOG1225|consen 159 KNGVCSLKPNPFGAECGQYKCPNDGSGHGRYYF-----GNCLSGISASGETCNQLGCND-DCFRTGRCREGRCFCTAGFF 232 (525)
T ss_pred hcccccccCCccccccceecCCcCCCCCcccee-----cccccccCcchhhhhcccCCc-cceeccccccCccccccccc
Confidence 34456666766666554444433333 334442 467888888888775433221 222222 221111111
Q ss_pred CCCceeeCCCCcccCCCCccCCCCCCCCCCEEEcCCCeeeCCCCCccCCCCc--CCCCCCCCCcEeccCCCCccccCCCC
Q psy6570 366 EPTYKCHCAPSYTGARCESRICENKCHNGGTCIATTQTCVCPPGFTGDTCQQ--CLNLKCQNGGVCVNKTTGLECDCPKF 443 (713)
Q Consensus 366 ~~~~~C~C~~G~~g~~C~~~~C~~~C~~~~~C~~~~~~C~C~~g~~g~~C~~--C~~~~C~~~~~C~~~~~~~~C~C~~G 443 (713)
.+.|.|+.+|.|..|+...|...|.+++.|.. +.|.|++||+|..|.+ |... |+.++.|++. .|+|++|
T Consensus 233 --~~ic~c~~~~~g~~c~~~~C~~~c~~~g~c~~--G~CIC~~Gf~G~dC~e~~Cp~~-cs~~g~~~~g----~CiC~~g 303 (525)
T KOG1225|consen 233 --DGICECPEGYFGPLCSTIYCPGGCTGRGQCVE--GRCICPPGFTGDDCDELVCPVD-CSGGGVCVDG----ECICNPG 303 (525)
T ss_pred --CceeecCCceeCCccccccCCCCCcccceEeC--CeEeCCCCCcCCCCCcccCCcc-cCCCceecCC----EeecCCC
Confidence 23799999999999997779999998899987 8999999999999994 7766 9999999887 8999999
Q ss_pred CcCCCCccCCCCCCC-CCCeeecCCCCCeecCCCCccCCCCCC
Q psy6570 444 YYGKNCQYSQCKNYC-VNGECSITDSGPKCMCSPGYSGKKCDT 485 (713)
Q Consensus 444 ~~g~~C~~~~C~~~~-~~~~C~~~~~~~~C~C~~G~~g~~C~~ 485 (713)
|.|..|++..|...| .+|.|+ .| +|.|.+||+|..|..
T Consensus 304 ~~G~dCs~~~cpadC~g~G~Ci--~G--~C~C~~Gy~G~~C~~ 342 (525)
T KOG1225|consen 304 YSGKDCSIRRCPADCSGHGKCI--DG--ECLCDEGYTGELCIQ 342 (525)
T ss_pred ccccccccccCCccCCCCCccc--CC--ceEeCCCCcCCcccc
Confidence 999999988888775 468888 33 799999998888876
No 17
>KOG1215|consensus
Probab=99.47 E-value=1.5e-12 Score=154.56 Aligned_cols=207 Identities=20% Similarity=0.364 Sum_probs=172.9
Q ss_pred CCceeEEEccCc-ccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCC
Q psy6570 6 SGNVTRVKREMN-LKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSG 84 (713)
Q Consensus 6 ~~~I~~~~~~~~-~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~ 84 (713)
...|.++.++.. ....+..+..+..+.+|..++.+||+|. ...+|.....++.....+...++-.|.+||+|+..+
T Consensus 416 ~~~ir~~~~~~~~~~~p~~~~~~~~~~d~d~~~~~i~~~d~---~~~~i~~~~~~~~~~~~~~~~g~~~~~~lavD~~~~ 492 (877)
T KOG1215|consen 416 RHDIRRISLDCSDVSRPLEGIKNAVALDFDVLNNRIYWADL---SDEKICRASQDGSSECELCGDGLCIPEGLAVDWIGD 492 (877)
T ss_pred CccceecccCCCcceEEccCCccceEEEEEecCCEEEEEec---cCCeEeeeccCCCccceEeccCccccCcEEEEeccC
Confidence 345556666654 2333555578888999999999999999 789999999999887777778889999999999999
Q ss_pred cEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCC-CCcEEEEeCCCCceeEEEecCCCCccc
Q psy6570 85 RMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPK-ARTIESINLNGKDRFVVYHTEDNGYKP 163 (713)
Q Consensus 85 ~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~-~~~I~~~~~~g~~~~~~~~~~~~~~~p 163 (713)
.+||+|.+.. .|.+.+++|+.+.+++...+..|..+++|+..+.+||+|+. ..+|.+..+||..+..+.......++.
T Consensus 493 ~~y~tDe~~~-~i~v~~~~g~~~~vl~~~~l~~~r~~~v~p~~g~~~wtd~~~~~~i~ra~~dg~~~~~l~~~~~~~p~g 571 (877)
T KOG1215|consen 493 NIYWTDEGNC-LIEVADLDGSSRKVLVSKDLDLPRSIAVDPEKGLMFWTDWGQPPRIERASLDGSERAVLVTNGILWPNG 571 (877)
T ss_pred CceecccCCc-eeEEEEccCCceeEEEecCCCCccceeeccccCeeEEecCCCCchhhhhcCCCCCceEEEeCCccCCCc
Confidence 9999999888 89999999999999998888999999999999999999987 568999999999999998876555566
Q ss_pred eeeeeeCCeEEEEeCCCC-cEEEEcccCCCcceeeeccccccccEEEEeecccc
Q psy6570 164 YKLEVFEDNLYFSTYRTN-NILKINKFGNSDFNVLANNLNRASDVLILQENKQA 216 (713)
Q Consensus 164 ~~i~~~~~~ly~td~~~~-~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q~ 216 (713)
+.+++..+.+||.|.... .|.+++..+.....+....+..|..+.++....++
T Consensus 572 lt~d~~~~~~yw~d~~~~~~i~~~~~~g~~r~~~~~~~~~~p~~~~~~~~~iyw 625 (877)
T KOG1215|consen 572 LTIDYETDRLYWADAKLDYTIESANMDGQNRRVVDSEDLPHPFGLSVFEDYIYW 625 (877)
T ss_pred ceEEeecceeEEEcccCCcceeeeecCCCceEEeccccCCCceEEEEecceeEE
Confidence 666677899999999888 78888877766654556678889999887776663
No 18
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.45 E-value=6.6e-12 Score=126.38 Aligned_cols=150 Identities=20% Similarity=0.262 Sum_probs=116.6
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNK 107 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~ 107 (713)
|+|+++|..++.|||+|. ..++|+++++++...+++... .|.+++++..++.||+++.. .+.+.+++....
T Consensus 2 ~Egp~~d~~~g~l~~~D~---~~~~i~~~~~~~~~~~~~~~~---~~~G~~~~~~~g~l~v~~~~---~~~~~d~~~g~~ 72 (246)
T PF08450_consen 2 GEGPVWDPRDGRLYWVDI---PGGRIYRVDPDTGEVEVIDLP---GPNGMAFDRPDGRLYVADSG---GIAVVDPDTGKV 72 (246)
T ss_dssp EEEEEEETTTTEEEEEET---TTTEEEEEETTTTEEEEEESS---SEEEEEEECTTSEEEEEETT---CEEEEETTTTEE
T ss_pred CcceEEECCCCEEEEEEc---CCCEEEEEECCCCeEEEEecC---CCceEEEEccCCEEEEEEcC---ceEEEecCCCcE
Confidence 689999998999999999 789999999999877665433 39999999767999999965 455557776666
Q ss_pred EEEEeC-----CCCCCeeEEEeCCCCeEEEEcCCC--------CcEEEEeCCCCceeEEEecCCCCccceeeeee--CCe
Q psy6570 108 FNLVDN-----NIQWPTGITIDYPSQRLYWADPKA--------RTIESINLNGKDRFVVYHTEDNGYKPYKLEVF--EDN 172 (713)
Q Consensus 108 ~~l~~~-----~~~~p~glavd~~~~~LY~~d~~~--------~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~--~~~ 172 (713)
+.++.. .+..|+.+++|+ .++||+++... ++|++++.++. .+.+... +..|.||++. ++.
T Consensus 73 ~~~~~~~~~~~~~~~~ND~~vd~-~G~ly~t~~~~~~~~~~~~g~v~~~~~~~~-~~~~~~~---~~~pNGi~~s~dg~~ 147 (246)
T PF08450_consen 73 TVLADLPDGGVPFNRPNDVAVDP-DGNLYVTDSGGGGASGIDPGSVYRIDPDGK-VTVVADG---LGFPNGIAFSPDGKT 147 (246)
T ss_dssp EEEEEEETTCSCTEEEEEEEE-T-TS-EEEEEECCBCTTCGGSEEEEEEETTSE-EEEEEEE---ESSEEEEEEETTSSE
T ss_pred EEEeeccCCCcccCCCceEEEcC-CCCEEEEecCCCccccccccceEEECCCCe-EEEEecC---cccccceEECCcchh
Confidence 666643 577899999996 57799998644 56999999944 4444444 5567777765 668
Q ss_pred EEEEeCCCCcEEEEcccCC
Q psy6570 173 LYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 173 ly~td~~~~~i~~~~~~~~ 191 (713)
||+++...++|++++....
T Consensus 148 lyv~ds~~~~i~~~~~~~~ 166 (246)
T PF08450_consen 148 LYVADSFNGRIWRFDLDAD 166 (246)
T ss_dssp EEEEETTTTEEEEEEEETT
T ss_pred eeecccccceeEEEecccc
Confidence 9999999999999998643
No 19
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=99.42 E-value=1.4e-11 Score=125.44 Aligned_cols=200 Identities=21% Similarity=0.325 Sum_probs=136.3
Q ss_pred CCcccCCceeEEEcc-CcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcC----CCCCcc
Q psy6570 1 MASISSGNVTRVKRE-MNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNT----GLNEPY 75 (713)
Q Consensus 1 vad~~~~~I~~~~~~-~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~----~~~~p~ 75 (713)
++|+..++|+++++. +..++.......+.++.+| ..++|..++. ... +... ..+..++.+... .+.+|+
T Consensus 41 w~DI~~~~i~r~~~~~g~~~~~~~p~~~~~~~~~d-~~g~Lv~~~~---g~~-~~~~-~~~~~~t~~~~~~~~~~~~r~N 114 (307)
T COG3386 41 WVDILGGRIHRLDPETGKKRVFPSPGGFSSGALID-AGGRLIACEH---GVR-LLDP-DTGGKITLLAEPEDGLPLNRPN 114 (307)
T ss_pred EEeCCCCeEEEecCCcCceEEEECCCCcccceeec-CCCeEEEEcc---ccE-EEec-cCCceeEEeccccCCCCcCCCC
Confidence 468999999999997 4456655666668888888 5578887776 222 2222 234332333321 268899
Q ss_pred eEEEcCCCCcEEEEccC-----C-----CCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 76 DIALEPLSGRMFWTELG-----I-----KPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 76 ~iavD~~~~~ly~td~~-----~-----~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
++.+|| .|.+|+++.. . .++|+|++.+|+.. .++...+..|||||++++++.||++|+..++|+++++
T Consensus 115 D~~v~p-dG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~~-~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~ 192 (307)
T COG3386 115 DGVVDP-DGRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVV-RLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDL 192 (307)
T ss_pred ceeEcC-CCCEEEeCCCccccCccccCCcceEEEEcCCCCEE-EeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEec
Confidence 999997 5999999877 1 15799999866544 4444458899999999999999999999999999998
Q ss_pred C---CC--ceeEEEecCCCCccceeeeeeCC-eEE-EEeCCCCcEEEEcccCCCcceeeeccccccccEEE
Q psy6570 146 N---GK--DRFVVYHTEDNGYKPYKLEVFED-NLY-FSTYRTNNILKINKFGNSDFNVLANNLNRASDVLI 209 (713)
Q Consensus 146 ~---g~--~~~~~~~~~~~~~~p~~i~~~~~-~ly-~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v 209 (713)
+ |. .+............|.|+.++.+ +|| ++-+...+|.+++++ +.....+......++.+.+
T Consensus 193 d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l~~~i~lP~~~~t~~~F 262 (307)
T COG3386 193 DPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKLLGEIKLPVKRPTNPAF 262 (307)
T ss_pred CcccCccCCcceEEEccCCCCCCCceEEeCCCCEEEecccCCceEEEECCC-CcEEEEEECCCCCCccceE
Confidence 7 31 22222322223568999999854 555 334445589999988 5555555444444444433
No 20
>KOG1226|consensus
Probab=99.38 E-value=1.4e-12 Score=140.13 Aligned_cols=141 Identities=30% Similarity=0.804 Sum_probs=104.2
Q ss_pred eecCCCCccCCCCCCccccCCCCCCcccCCCCccC-----CCCC----CCCCCCCcEEcCCCCCeeccCCCCCC----CC
Q psy6570 471 KCMCSPGYSGKKCDTCTCLNGDSGPKCMCSPGYSG-----KKCD----TCTCLNGGTCIPNSKNNVCKCPSQYT----GR 537 (713)
Q Consensus 471 ~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~G~~g-----~~C~----~~~C~~~g~C~~~~~~~~C~C~~g~~----G~ 537 (713)
+|.|.+||.|+.|+- +.+-.. +.|. ..+|+..|.|. -++|.|.+... |.
T Consensus 479 ~C~C~~G~~G~~CEC--------------~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~----CGqC~C~~~~~~~i~G~ 540 (783)
T KOG1226|consen 479 QCRCDEGWLGKKCEC--------------STDELSSSEEEDKCRENSDSPVCSGRGDCV----CGQCVCHKPDNGKIYGK 540 (783)
T ss_pred ceecCCCCCCCcccC--------------CccccCcHhHHhhccCCCCCCCcCCCCcEe----CCceEecCCCCCceeee
Confidence 789999999998752 211111 2232 23788999987 46899998776 88
Q ss_pred CCcccCCCCCCCCCCCCCCCCCCCCCcEEeecCCCceeeCCCCCcCCCCCcCCCCCCC----CCCCCCCCeEecCCCCcc
Q psy6570 538 RCECAVGDTSCASLANKCTPNYCSNNGTCVLIEGKPSCKCLPPYSGKQCTEREDSPSC----HNYCDNAGLCSYSKQGKP 613 (713)
Q Consensus 538 ~C~~~~~~~~c~~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~C----~~~C~~~g~C~~~~~g~~ 613 (713)
.||++ +..|... ....|+++|.|. -.+|.|.+||+|..|....+.+.| ...|+..|+|....
T Consensus 541 fCECD--nfsC~r~----~g~lC~g~G~C~----CG~CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~Cg~---- 606 (783)
T KOG1226|consen 541 FCECD--NFSCERH----KGVLCGGHGRCE----CGRCVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCECGR---- 606 (783)
T ss_pred eeecc--Ccccccc----cCcccCCCCeEe----CCcEEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeCCc----
Confidence 99876 2222211 123699999997 347999999999999887777888 34699999998665
Q ss_pred eeecCCC-cccCCCCcCCCCCCCCCCCCeecC
Q psy6570 614 VCTCVNG-WSGITCSERVSCAHFCFNGGTCRE 644 (713)
Q Consensus 614 ~C~C~~G-~~G~~C~~~~~C~~~C~~~~~C~~ 644 (713)
|.|... |.|..|+..+.|..+|..+..|+.
T Consensus 607 -C~C~~~~~sG~~CE~cptc~~~C~~~~~Cve 637 (783)
T KOG1226|consen 607 -CKCTDPPYSGEFCEKCPTCPDPCAENKSCVE 637 (783)
T ss_pred -eEcCCCCcCcchhhcCCCCCCcccccccchh
Confidence 999876 999999998888888877777754
No 21
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.17 E-value=5.6e-09 Score=110.40 Aligned_cols=184 Identities=16% Similarity=0.192 Sum_probs=127.2
Q ss_pred CcccCCceeEEEccC--cccEE---e-----------cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce--E
Q psy6570 2 ASISSGNVTRVKREM--NLKTV---L-----------SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK--K 63 (713)
Q Consensus 2 ad~~~~~I~~~~~~~--~~~~~---~-----------~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~--~ 63 (713)
|+...+.|..++++. ..... + ....+|+.+.+++.++.||++|. ..++|.+++++... .
T Consensus 104 any~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dl---G~D~v~~~~~~~~~~~l 180 (345)
T PF10282_consen 104 ANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDL---GADRVYVYDIDDDTGKL 180 (345)
T ss_dssp EETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEET---TTTEEEEEEE-TTS-TE
T ss_pred EEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEec---CCCEEEEEEEeCCCceE
Confidence 455667777666654 22111 1 23567899999999999999999 78999999987644 2
Q ss_pred ---EEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC-CCCcEEEEe---C------CCCCCeeEEEeCCCCeE
Q psy6570 64 ---RTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID-GKNKFNLVD---N------NIQWPTGITIDYPSQRL 130 (713)
Q Consensus 64 ---~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d-G~~~~~l~~---~------~~~~p~glavd~~~~~L 130 (713)
..+....-..|+.|+++|..+++|++....+ .|.+++++ .......+. . ....|.+|+|++++++|
T Consensus 181 ~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~-~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~l 259 (345)
T PF10282_consen 181 TPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSN-TVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFL 259 (345)
T ss_dssp EEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTT-EEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEE
T ss_pred EEeeccccccCCCCcEEEEcCCcCEEEEecCCCC-cEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEE
Confidence 1222233478999999998899999987767 88888887 222222221 1 12378999999999999
Q ss_pred EEEcCCCCcEEEEeCCCC--ceeEEEecCCCCccceeeee--eCCeEEEEeCCCCcEEEEccc
Q psy6570 131 YWADPKARTIESINLNGK--DRFVVYHTEDNGYKPYKLEV--FEDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 131 Y~~d~~~~~I~~~~~~g~--~~~~~~~~~~~~~~p~~i~~--~~~~ly~td~~~~~i~~~~~~ 189 (713)
|++....+.|..++++.. ..+.+.........|.+|++ ++++||+++...+.|..++.+
T Consensus 260 yvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d 322 (345)
T PF10282_consen 260 YVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDID 322 (345)
T ss_dssp EEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEE
T ss_pred EEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEe
Confidence 999999999999998543 33333322223456877777 588999999998888877653
No 22
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=99.06 E-value=2.1e-08 Score=107.04 Aligned_cols=187 Identities=20% Similarity=0.256 Sum_probs=135.5
Q ss_pred cCCceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 5 SSGNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
..+.|..++.......- ......|.+|++|+.++.||+++.+. ..+.|.+++.........+..+ ..|.++++||..
T Consensus 94 ~~~~v~vid~~~~~~~~~~~vG~~P~~~~~~~~~~~vYV~n~~~-~~~~vsvid~~t~~~~~~~~vG-~~P~~~a~~p~g 171 (381)
T COG3391 94 DSNTVSVIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVANAGN-GNNTVSVIDAATNKVTATIPVG-NTPTGVAVDPDG 171 (381)
T ss_pred CCCeEEEEcCcccceeeEeeeccCCceEEECCCCCEEEEEeccc-CCceEEEEeCCCCeEEEEEecC-CCcceEEECCCC
Confidence 35677777755543222 33333999999999999999999832 2588999998765544434333 478999999999
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEE-E---EeCCCCCCeeEEEeCCCCeEEEEcCCC--CcEEEEeCCCCceeEEEecC
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFN-L---VDNNIQWPTGITIDYPSQRLYWADPKA--RTIESINLNGKDRFVVYHTE 157 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~-l---~~~~~~~p~glavd~~~~~LY~~d~~~--~~I~~~~~~g~~~~~~~~~~ 157 (713)
..+|+++...+ +|..++..+..... - .......|.+++|++...++|+++..+ ++|.+++.............
T Consensus 172 ~~vyv~~~~~~-~v~vi~~~~~~v~~~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~~~v~~~~~~~ 250 (381)
T COG3391 172 NKVYVTNSDDN-TVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKIDTATGNVTATDLPV 250 (381)
T ss_pred CeEEEEecCCC-eEEEEeCCCcceeccccccccccCCCCceEEECCCCCEEEEEeccCCCceEEEEeCCCceEEEecccc
Confidence 99999997766 89999877765442 0 123467899999999999999999877 68999888776655542232
Q ss_pred CCCccceeeee--eCCeEEEEeCCCCcEEEEcccCCCcce
Q psy6570 158 DNGYKPYKLEV--FEDNLYFSTYRTNNILKINKFGNSDFN 195 (713)
Q Consensus 158 ~~~~~p~~i~~--~~~~ly~td~~~~~i~~~~~~~~~~~~ 195 (713)
..+ .|.++.+ ++..+|+++...+.|..++........
T Consensus 251 ~~~-~~~~v~~~p~g~~~yv~~~~~~~V~vid~~~~~v~~ 289 (381)
T COG3391 251 GSG-APRGVAVDPAGKAAYVANSQGGTVSVIDGATDRVVK 289 (381)
T ss_pred ccC-CCCceeECCCCCEEEEEecCCCeEEEEeCCCCceee
Confidence 234 6766665 477899998888889888865544433
No 23
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.05 E-value=3.1e-08 Score=104.79 Aligned_cols=205 Identities=15% Similarity=0.178 Sum_probs=134.0
Q ss_pred CCceeEEEccC---cccEE---ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC--CceEEE--EE--------
Q psy6570 6 SGNVTRVKREM---NLKTV---LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE--GRKKRT--LL-------- 67 (713)
Q Consensus 6 ~~~I~~~~~~~---~~~~~---~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~--G~~~~~--l~-------- 67 (713)
.+.|..+..+. ..+.+ ......|-.|++|+.++.||+++. ..+.|.+++++ |+.... ++
T Consensus 61 ~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~~g~~l~vany---~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~ 137 (345)
T PF10282_consen 61 SGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDPDGRFLYVANY---GGGSVSVFPLDDDGSLGEVVQTVRHEGSGPN 137 (345)
T ss_dssp TTEEEEEEEETTTTEEEEEEEEEESSSCEEEEEECTTSSEEEEEET---TTTEEEEEEECTTSEEEEEEEEEESEEEESS
T ss_pred CCCEEEEEECCCcceeEEeeeeccCCCCcEEEEEecCCCEEEEEEc---cCCeEEEEEccCCcccceeeeecccCCCCCc
Confidence 45555554443 22222 336678899999999999999998 67888777765 444333 22
Q ss_pred --cCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC--cEE---EEeCCCCCCeeEEEeCCCCeEEEEcCCCCcE
Q psy6570 68 --NTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN--KFN---LVDNNIQWPTGITIDYPSQRLYWADPKARTI 140 (713)
Q Consensus 68 --~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~--~~~---l~~~~~~~p~glavd~~~~~LY~~d~~~~~I 140 (713)
+....+|+.+.++|.+++||++|.+.+ +|++++++... ... +....-..|..|+|++..+++|++....+.|
T Consensus 138 ~~rq~~~h~H~v~~~pdg~~v~v~dlG~D-~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v 216 (345)
T PF10282_consen 138 PDRQEGPHPHQVVFSPDGRFVYVPDLGAD-RVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTV 216 (345)
T ss_dssp TTTTSSTCEEEEEE-TTSSEEEEEETTTT-EEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEE
T ss_pred ccccccccceeEEECCCCCEEEEEecCCC-EEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcE
Confidence 123467899999999999999999988 99999998665 211 2123456799999999999999999999999
Q ss_pred EEEeCC--CCceeEEEecC------CCCccceeeeee--CCeEEEEeCCCCcEEEEcccCC-Ccce---eeecccccccc
Q psy6570 141 ESINLN--GKDRFVVYHTE------DNGYKPYKLEVF--EDNLYFSTYRTNNILKINKFGN-SDFN---VLANNLNRASD 206 (713)
Q Consensus 141 ~~~~~~--g~~~~~~~~~~------~~~~~p~~i~~~--~~~ly~td~~~~~i~~~~~~~~-~~~~---~~~~~~~~~~~ 206 (713)
..++++ ....+.+.... .....|.+|.+. +.+||+++...+.|..++.... ..++ .+......|.+
T Consensus 217 ~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~ 296 (345)
T PF10282_consen 217 SVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRH 296 (345)
T ss_dssp EEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEE
T ss_pred EEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccE
Confidence 999887 22222221111 111246666665 7789999999999887776322 2222 22233455777
Q ss_pred EEEEeecc
Q psy6570 207 VLILQENK 214 (713)
Q Consensus 207 i~v~~~~~ 214 (713)
+.+....+
T Consensus 297 ~~~s~~g~ 304 (345)
T PF10282_consen 297 FAFSPDGR 304 (345)
T ss_dssp EEE-TTSS
T ss_pred EEEeCCCC
Confidence 76644433
No 24
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=99.04 E-value=2e-08 Score=102.46 Aligned_cols=151 Identities=21% Similarity=0.311 Sum_probs=112.2
Q ss_pred CCCCCceEEEeccCCeEEEeecC-----C---CCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCC
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAG-----G---RSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKP 95 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~-----~---~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~ 95 (713)
.+..|+.+.+|+. +.+|+++.. . +..+.|++++++|...+.+ ...+..|+|||++|.+..||++|+..+
T Consensus 109 ~~~r~ND~~v~pd-G~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~~~l~-~~~~~~~NGla~SpDg~tly~aDT~~~- 185 (307)
T COG3386 109 PLNRPNDGVVDPD-GRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVVRLL-DDDLTIPNGLAFSPDGKTLYVADTPAN- 185 (307)
T ss_pred CcCCCCceeEcCC-CCEEEeCCCccccCccccCCcceEEEEcCCCCEEEee-cCcEEecCceEECCCCCEEEEEeCCCC-
Confidence 4577999999976 899999986 1 2356799999877665544 355899999999999999999999888
Q ss_pred eEEEEecC---CC--CcEEEE--eCCCCCCeeEEEeCCCCeEE-EEcCCCCcEEEEeCCCCceeEEEecCCCCccceeee
Q psy6570 96 RISGASID---GK--NKFNLV--DNNIQWPTGITIDYPSQRLY-WADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLE 167 (713)
Q Consensus 96 ~I~~~~~d---G~--~~~~l~--~~~~~~p~glavd~~~~~LY-~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~ 167 (713)
+|++++++ |. .+...+ ...-..|.|+++| .++.|| ++.+...+|.+++.+|.....+... ...|..++
T Consensus 186 ~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vD-adG~lw~~a~~~g~~v~~~~pdG~l~~~i~lP---~~~~t~~~ 261 (307)
T COG3386 186 RIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVD-ADGNLWVAAVWGGGRVVRFNPDGKLLGEIKLP---VKRPTNPA 261 (307)
T ss_pred eEEEEecCcccCccCCcceEEEccCCCCCCCceEEe-CCCCEEEecccCCceEEEECCCCcEEEEEECC---CCCCccce
Confidence 99999887 22 122222 2234789999999 677888 4455556999999998776666543 24566666
Q ss_pred eeC---CeEEEEeCCCC
Q psy6570 168 VFE---DNLYFSTYRTN 181 (713)
Q Consensus 168 ~~~---~~ly~td~~~~ 181 (713)
+-+ +.||+|....+
T Consensus 262 FgG~~~~~L~iTs~~~~ 278 (307)
T COG3386 262 FGGPDLNTLYITSARSG 278 (307)
T ss_pred EeCCCcCEEEEEecCCC
Confidence 654 78999976553
No 25
>KOG4659|consensus
Probab=99.04 E-value=3.6e-09 Score=118.45 Aligned_cols=177 Identities=20% Similarity=0.297 Sum_probs=123.7
Q ss_pred CCceeEEEccCcccEEec-CCCCC---ceEEEeccCCeEEEeecCCCCCCeEEEEe-cCCce----EEEEEc--------
Q psy6570 6 SGNVTRVKREMNLKTVLS-NLHDP---RGVAVDWVGKNLYWTDAGGRSSNNIMVST-LEGRK----KRTLLN-------- 68 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~-~~~~p---~gla~D~~~~~ly~td~~~~~~~~I~~~~-~~G~~----~~~l~~-------- 68 (713)
-+-|.|+..+|+..+|++ ++..| .-||+||..+.||++|. ..++|+++. +.++. -+++..
T Consensus 383 fNyIRRI~~dg~v~tIl~L~~t~~sh~Yy~AvsPvdgtlyvSdp---~s~qv~rv~sl~~~d~~~N~evvaG~Ge~Clp~ 459 (1899)
T KOG4659|consen 383 FNYIRRISQDGQVSTILTLGLTDTSHSYYIAVSPVDGTLYVSDP---LSKQVWRVSSLEPQDSRNNYEVVAGDGEVCLPA 459 (1899)
T ss_pred chheeeecCCCceEEEEEecCCCccceeEEEecCcCceEEecCC---CcceEEEeccCCccccccCeeEEeccCcCcccc
Confidence 345778999998766633 33444 56999999999999999 667777643 44332 122221
Q ss_pred ------------CCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEE------------------eCCCCCC
Q psy6570 69 ------------TGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLV------------------DNNIQWP 118 (713)
Q Consensus 69 ------------~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~------------------~~~~~~p 118 (713)
..|..|+||||| ..|.|||+|.- +|.+++.+|-....+- ...+.||
T Consensus 460 desCGDGalA~dA~L~~PkGIa~d-k~g~lYfaD~t---~IR~iD~~giIstlig~~~~~~~p~~C~~~~kl~~~~leWP 535 (1899)
T KOG4659|consen 460 DESCGDGALAQDAQLIFPKGIAFD-KMGNLYFADGT---RIRVIDTTGIISTLIGTTPDQHPPRTCAQITKLVDLQLEWP 535 (1899)
T ss_pred ccccCcchhcccceeccCCceeEc-cCCcEEEeccc---EEEEeccCceEEEeccCCCCccCccccccccchhheeeecc
Confidence 126789999999 68999999954 6888888874433322 1247899
Q ss_pred eeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecC-----------------CCCccceeeeee-CCeEEEEeCCC
Q psy6570 119 TGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTE-----------------DNGYKPYKLEVF-EDNLYFSTYRT 180 (713)
Q Consensus 119 ~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~-----------------~~~~~p~~i~~~-~~~ly~td~~~ 180 (713)
+.|||||.++.||+.| ++.|++++.++..+.++.... ..+..+.+|++- .+.||+++...
T Consensus 536 T~LaV~Pmdnsl~Vld--~nvvlrit~~~rV~Ii~GrP~hC~~a~~t~~~skla~H~tl~~~r~Iavg~~G~lyvaEsD~ 613 (1899)
T KOG4659|consen 536 TSLAVDPMDNSLLVLD--TNVVLRITVVHRVRIILGRPTHCDLANATSSASKLADHRTLLIQRDIAVGTDGALYVAESDG 613 (1899)
T ss_pred cceeecCCCCeEEEee--cceEEEEccCccEEEEcCCccccccCCCchhhhhhhhhhhhhhhhceeecCCceEEEEeccc
Confidence 9999999999999999 677888888776552221110 013345677775 67899999888
Q ss_pred CcEEEEcccCC
Q psy6570 181 NNILKINKFGN 191 (713)
Q Consensus 181 ~~i~~~~~~~~ 191 (713)
.+|-++.+-+.
T Consensus 614 rriNrvr~~~t 624 (1899)
T KOG4659|consen 614 RRINRVRKLST 624 (1899)
T ss_pred hhhhheEEecc
Confidence 88877776544
No 26
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=99.03 E-value=1.1e-08 Score=108.90 Aligned_cols=159 Identities=16% Similarity=0.197 Sum_probs=112.2
Q ss_pred cEEec--CCCCCceEEEeccCCeEEEeecCC---------CCCCeEEEEec---CCce-EEEEEcCCCCCcceEEEcCCC
Q psy6570 19 KTVLS--NLHDPRGVAVDWVGKNLYWTDAGG---------RSSNNIMVSTL---EGRK-KRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 19 ~~~~~--~~~~p~gla~D~~~~~ly~td~~~---------~~~~~I~~~~~---~G~~-~~~l~~~~~~~p~~iavD~~~ 83 (713)
+++++ .+.+|.+|++|.. ++||+++... ....+|.++.. ||.. ...++.+++..|.+|++.+ +
T Consensus 5 ~l~A~~p~~~~P~~ia~d~~-G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~-~ 82 (367)
T TIGR02604 5 TLFAAEPLLRNPIAVCFDER-GRLWVAEGITYSRPAGRQGPLGDRILILEDADGDGKYDKSNVFAEELSMVTGLAVAV-G 82 (367)
T ss_pred EEEECCCccCCCceeeECCC-CCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCCCCCcceeEEeecCCCCccceeEec-C
Confidence 34444 4999999999965 8899998521 01238888765 4554 3356667789999999985 5
Q ss_pred CcEEEEccCCCCeEEEE-ecCCC-----CcEEEEeC---C----CCCCeeEEEeCCCCeEEEEcCC--------------
Q psy6570 84 GRMFWTELGIKPRISGA-SIDGK-----NKFNLVDN---N----IQWPTGITIDYPSQRLYWADPK-------------- 136 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~-~~dG~-----~~~~l~~~---~----~~~p~glavd~~~~~LY~~d~~-------------- 136 (713)
+ ||+++.. +|+++ +.+|. .+++|+.. . .+.+++|++++ +++||+++..
T Consensus 83 G-lyV~~~~---~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gp-DG~LYv~~G~~~~~~~~~~~~~~~ 157 (367)
T TIGR02604 83 G-VYVATPP---DILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGP-DGWLYFNHGNTLASKVTRPGTSDE 157 (367)
T ss_pred C-EEEeCCC---eEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECC-CCCEEEecccCCCceeccCCCccC
Confidence 5 9998743 68877 44442 33444431 1 34588999996 5799998752
Q ss_pred -----CCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEc
Q psy6570 137 -----ARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKIN 187 (713)
Q Consensus 137 -----~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~ 187 (713)
.+.|++++.+|+..+++... +.+|.+|+++ .+.||++|.......++.
T Consensus 158 ~~~~~~g~i~r~~pdg~~~e~~a~G---~rnp~Gl~~d~~G~l~~tdn~~~~~~~i~ 211 (367)
T TIGR02604 158 SRQGLGGGLFRYNPDGGKLRVVAHG---FQNPYGHSVDSWGDVFFCDNDDPPLCRVT 211 (367)
T ss_pred cccccCceEEEEecCCCeEEEEecC---cCCCccceECCCCCEEEEccCCCceeEEc
Confidence 14699999999888777654 7889999986 678999987666555554
No 27
>KOG1520|consensus
Probab=98.98 E-value=2.3e-09 Score=108.19 Aligned_cols=147 Identities=18% Similarity=0.241 Sum_probs=111.4
Q ss_pred cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC----CCCcceEEEcCCCCcEEEEccCCC----
Q psy6570 23 SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG----LNEPYDIALEPLSGRMFWTELGIK---- 94 (713)
Q Consensus 23 ~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~----~~~p~~iavD~~~~~ly~td~~~~---- 94 (713)
..-.+|-|||++..++.||++|. .--++++++.|...+.+.... +.-.+++.||+ +|.|||||+...
T Consensus 112 ~~CGRPLGl~f~~~ggdL~VaDA----YlGL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk~~~r 186 (376)
T KOG1520|consen 112 PLCGRPLGIRFDKKGGDLYVADA----YLGLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSSKYDRR 186 (376)
T ss_pred cccCCcceEEeccCCCeEEEEec----ceeeEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccccccchh
Confidence 34578999999999999999997 556888899988766555432 44567999998 999999997642
Q ss_pred ------------CeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCce---eEEEecCCC
Q psy6570 95 ------------PRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDR---FVVYHTEDN 159 (713)
Q Consensus 95 ------------~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~---~~~~~~~~~ 159 (713)
+|+.+.+...+..++|+ .++..|+|||++++.+.|.+++....||.++-+.|... ++++.. .
T Consensus 187 d~~~a~l~g~~~GRl~~YD~~tK~~~VLl-d~L~F~NGlaLS~d~sfvl~~Et~~~ri~rywi~g~k~gt~EvFa~~--L 263 (376)
T KOG1520|consen 187 DFVFAALEGDPTGRLFRYDPSTKVTKVLL-DGLYFPNGLALSPDGSFVLVAETTTARIKRYWIKGPKAGTSEVFAEG--L 263 (376)
T ss_pred heEEeeecCCCccceEEecCcccchhhhh-hcccccccccCCCCCCEEEEEeeccceeeeeEecCCccCchhhHhhc--C
Confidence 34555555444444444 48999999999999999999999999999999999766 666653 2
Q ss_pred CccceeeeeeCCeEEEEe
Q psy6570 160 GYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 160 ~~~p~~i~~~~~~ly~td 177 (713)
...|..|...++-=||..
T Consensus 264 PG~PDNIR~~~~G~fWVa 281 (376)
T KOG1520|consen 264 PGYPDNIRRDSTGHFWVA 281 (376)
T ss_pred CCCCcceeECCCCCEEEE
Confidence 457888887754445543
No 28
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=98.98 E-value=7.1e-08 Score=101.73 Aligned_cols=181 Identities=14% Similarity=0.115 Sum_probs=119.3
Q ss_pred cCCceeEEEcc--CcccEE--ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC--CceEEEEE-cCCCCCcceE
Q psy6570 5 SSGNVTRVKRE--MNLKTV--LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE--GRKKRTLL-NTGLNEPYDI 77 (713)
Q Consensus 5 ~~~~I~~~~~~--~~~~~~--~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~--G~~~~~l~-~~~~~~p~~i 77 (713)
..+.|..++.+ +..+.+ ......|..|++++.++.||++.. ..+.|.+++++ |...+.+. ......|.++
T Consensus 55 ~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~~l~v~~~---~~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~ 131 (330)
T PRK11028 55 PEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGRFLFSASY---NANCVSVSPLDKDGIPVAPIQIIEGLEGCHSA 131 (330)
T ss_pred CCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCCEEEEEEc---CCCeEEEEEECCCCCCCCceeeccCCCcccEe
Confidence 35666555554 222222 233457999999999999999987 67888888775 33222221 1234679999
Q ss_pred EEcCCCCcEEEEccCCCCeEEEEecCCCCcE------EEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC--Cc
Q psy6570 78 ALEPLSGRMFWTELGIKPRISGASIDGKNKF------NLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG--KD 149 (713)
Q Consensus 78 avD~~~~~ly~td~~~~~~I~~~~~dG~~~~------~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g--~~ 149 (713)
+++|..++||+++.+.+ +|.+.+++..... .+....-..|.+++|++.+++||+++...+.|..++++. ..
T Consensus 132 ~~~p~g~~l~v~~~~~~-~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~ 210 (330)
T PRK11028 132 NIDPDNRTLWVPCLKED-RIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDPHGE 210 (330)
T ss_pred EeCCCCCEEEEeeCCCC-EEEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeCCCCC
Confidence 99998899999998877 8999888642211 111112356899999999999999999889998888863 22
Q ss_pred eeEEEecC---C---CCccceeee--eeCCeEEEEeCCCCcEEEEccc
Q psy6570 150 RFVVYHTE---D---NGYKPYKLE--VFEDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 150 ~~~~~~~~---~---~~~~p~~i~--~~~~~ly~td~~~~~i~~~~~~ 189 (713)
.+.+.... . ...++.+|. .++.+||+++...+.|..++..
T Consensus 211 ~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~ 258 (330)
T PRK11028 211 IECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVS 258 (330)
T ss_pred EEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEe
Confidence 22222111 0 112343444 4466899998877888777653
No 29
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=98.97 E-value=6.3e-08 Score=102.11 Aligned_cols=182 Identities=13% Similarity=0.086 Sum_probs=124.3
Q ss_pred CcccCCceeEEEccC--cccEE--ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC--CceEEEEEc-CCCCCc
Q psy6570 2 ASISSGNVTRVKREM--NLKTV--LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE--GRKKRTLLN-TGLNEP 74 (713)
Q Consensus 2 ad~~~~~I~~~~~~~--~~~~~--~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~--G~~~~~l~~-~~~~~p 74 (713)
|+...+.|+.++++. ..+.+ +.....|..|++++.++.||++.. ..+.|..++.+ |+.. .+.. .....|
T Consensus 7 ~~~~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~---~~~~i~~~~~~~~g~l~-~~~~~~~~~~p 82 (330)
T PRK11028 7 ASPESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVR---PEFRVLSYRIADDGALT-FAAESPLPGSP 82 (330)
T ss_pred EcCCCCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEEC---CCCcEEEEEECCCCceE-EeeeecCCCCc
Confidence 455678899988853 22222 334568999999999899999987 56778777765 3322 1111 123579
Q ss_pred ceEEEcCCCCcEEEEccCCCCeEEEEecC--CCCcEEEE-eCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCcee
Q psy6570 75 YDIALEPLSGRMFWTELGIKPRISGASID--GKNKFNLV-DNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRF 151 (713)
Q Consensus 75 ~~iavD~~~~~ly~td~~~~~~I~~~~~d--G~~~~~l~-~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~ 151 (713)
..|+++|.+++||++....+ +|..++++ |...+.+. ......|.++++++.+++||+++...++|..++++.....
T Consensus 83 ~~i~~~~~g~~l~v~~~~~~-~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~g~l 161 (330)
T PRK11028 83 THISTDHQGRFLFSASYNAN-CVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSDDGHL 161 (330)
T ss_pred eEEEECCCCCEEEEEEcCCC-eEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCEEEEEEECCCCcc
Confidence 99999999999999987655 77777765 43322221 1234578999999999999999999999999998642211
Q ss_pred -----EEEecCCCCccceeeeee--CCeEEEEeCCCCcEEEEccc
Q psy6570 152 -----VVYHTEDNGYKPYKLEVF--EDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 152 -----~~~~~~~~~~~p~~i~~~--~~~ly~td~~~~~i~~~~~~ 189 (713)
...... ....|..+++. +.+||+++...+.|..++..
T Consensus 162 ~~~~~~~~~~~-~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~ 205 (330)
T PRK11028 162 VAQEPAEVTTV-EGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLK 205 (330)
T ss_pred cccCCCceecC-CCCCCceEEECCCCCEEEEEecCCCEEEEEEEe
Confidence 001111 13456666665 56899999888888877764
No 30
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=98.95 E-value=6.1e-08 Score=103.60 Aligned_cols=183 Identities=20% Similarity=0.236 Sum_probs=131.0
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCC-CCeEEEEecC
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGI-KPRISGASID 103 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~-~~~I~~~~~d 103 (713)
...|.+++++..+.++|+++. ..+.|.+++.+.......+..+ ..|.+|++|+.++.||+++.+. +..|.+++..
T Consensus 73 ~~~p~~i~v~~~~~~vyv~~~---~~~~v~vid~~~~~~~~~~~vG-~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~ 148 (381)
T COG3391 73 GVYPAGVAVNPAGNKVYVTTG---DSNTVSVIDTATNTVLGSIPVG-LGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAA 148 (381)
T ss_pred CccccceeeCCCCCeEEEecC---CCCeEEEEcCcccceeeEeeec-cCCceEEECCCCCEEEEEecccCCceEEEEeCC
Confidence 378999999999999999998 7899999996665544433333 3899999999999999999852 3388888877
Q ss_pred CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEe---cCCCCccceeeee--eCCeEEEEeC
Q psy6570 104 GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYH---TEDNGYKPYKLEV--FEDNLYFSTY 178 (713)
Q Consensus 104 G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~---~~~~~~~p~~i~~--~~~~ly~td~ 178 (713)
.......+. .-..|.++++|+...++|+++...++|..++..+..... .. .......|.++++ ++.++|+++.
T Consensus 149 t~~~~~~~~-vG~~P~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~ 226 (381)
T COG3391 149 TNKVTATIP-VGNTPTGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVAND 226 (381)
T ss_pred CCeEEEEEe-cCCCcceEEECCCCCeEEEEecCCCeEEEEeCCCcceec-cccccccccCCCCceEEECCCCCEEEEEec
Confidence 554443332 233689999999999999999999999999977765542 11 1113556777776 4678999998
Q ss_pred CC--CcEEEEcccCCCccee--eeccccccccEEEEeecc
Q psy6570 179 RT--NNILKINKFGNSDFNV--LANNLNRASDVLILQENK 214 (713)
Q Consensus 179 ~~--~~i~~~~~~~~~~~~~--~~~~~~~~~~i~v~~~~~ 214 (713)
.+ +.|.+++...+..... ..... .+.++.+....+
T Consensus 227 ~~~~~~v~~id~~~~~v~~~~~~~~~~-~~~~v~~~p~g~ 265 (381)
T COG3391 227 GSGSNNVLKIDTATGNVTATDLPVGSG-APRGVAVDPAGK 265 (381)
T ss_pred cCCCceEEEEeCCCceEEEeccccccC-CCCceeECCCCC
Confidence 87 6888888765544333 22222 455554444333
No 31
>KOG4659|consensus
Probab=98.94 E-value=8.3e-09 Score=115.67 Aligned_cols=154 Identities=19% Similarity=0.322 Sum_probs=111.4
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcC------------------CCCCcceEEEcCCCCc
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNT------------------GLNEPYDIALEPLSGR 85 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~------------------~~~~p~~iavD~~~~~ 85 (713)
.|..|.||||| ..+.||++|. ..|.++|.+|-...++... .|.+|..|||||.++.
T Consensus 473 ~L~~PkGIa~d-k~g~lYfaD~-----t~IR~iD~~giIstlig~~~~~~~p~~C~~~~kl~~~~leWPT~LaV~Pmdns 546 (1899)
T KOG4659|consen 473 QLIFPKGIAFD-KMGNLYFADG-----TRIRVIDTTGIISTLIGTTPDQHPPRTCAQITKLVDLQLEWPTSLAVDPMDNS 546 (1899)
T ss_pred eeccCCceeEc-cCCcEEEecc-----cEEEEeccCceEEEeccCCCCccCccccccccchhheeeecccceeecCCCCe
Confidence 47789999999 5689999997 5788888888655544321 2678999999999999
Q ss_pred EEEEccCCCCeEEEEecCCCCcEEEEe-------------------CCCCCCeeEEEeCCCCeEEEEcCCCCc---EEEE
Q psy6570 86 MFWTELGIKPRISGASIDGKNKFNLVD-------------------NNIQWPTGITIDYPSQRLYWADPKART---IESI 143 (713)
Q Consensus 86 ly~td~~~~~~I~~~~~dG~~~~~l~~-------------------~~~~~p~glavd~~~~~LY~~d~~~~~---I~~~ 143 (713)
||+-|.+ -|+++..++..+..+.. ..+..+..|||. ..+.||++++...+ |+.+
T Consensus 547 l~Vld~n---vvlrit~~~rV~Ii~GrP~hC~~a~~t~~~skla~H~tl~~~r~Iavg-~~G~lyvaEsD~rriNrvr~~ 622 (1899)
T KOG4659|consen 547 LLVLDTN---VVLRITVVHRVRIILGRPTHCDLANATSSASKLADHRTLLIQRDIAVG-TDGALYVAESDGRRINRVRKL 622 (1899)
T ss_pred EEEeecc---eEEEEccCccEEEEcCCccccccCCCchhhhhhhhhhhhhhhhceeec-CCceEEEEeccchhhhheEEe
Confidence 9999965 78888888766522211 124457899999 57899999986655 4555
Q ss_pred eCCCCceeEEEec-----------------------CCCCccceeeeee-CCeEEEEeCCCCcEEEEcc
Q psy6570 144 NLNGKDRFVVYHT-----------------------EDNGYKPYKLEVF-EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 144 ~~~g~~~~~~~~~-----------------------~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~ 188 (713)
..||. ..++... ...+..|.+|+|. ++.||++|.++-+|..+.+
T Consensus 623 ~tdg~-i~ilaGa~S~C~C~~~~~cdcfs~~~~~At~A~lnsp~alaVsPdg~v~IAD~gN~rIr~Vs~ 690 (1899)
T KOG4659|consen 623 STDGT-ISILAGAKSPCSCDVAACCDCFSLRDVAATQAKLNSPYALAVSPDGDVIIADSGNSRIRKVSA 690 (1899)
T ss_pred ccCce-EEEecCCCCCCCcccccCCccccccchhhhccccCCcceEEECCCCcEEEecCCchhhhhhhh
Confidence 55552 2222111 1136778999987 7789999999888877653
No 32
>KOG1226|consensus
Probab=98.90 E-value=5.2e-09 Score=113.10 Aligned_cols=136 Identities=32% Similarity=0.834 Sum_probs=98.5
Q ss_pred CCCCCcEEcCCCCCeeccCCCCCCCCCCcccCCCCCCCCCCCCCC----CCCCCCCcEEeecCCCceeeCCCCCc----C
Q psy6570 512 TCLNGGTCIPNSKNNVCKCPSQYTGRRCECAVGDTSCASLANKCT----PNYCSNNGTCVLIEGKPSCKCLPPYS----G 583 (713)
Q Consensus 512 ~C~~~g~C~~~~~~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~C~----~~~C~~~~~C~~~~g~~~C~C~~G~~----G 583 (713)
.|+.+|+.+ -++|.|.+||.|..||+..+...-....+.|. ..+|++.|.|. -.+|.|++... |
T Consensus 468 ~C~g~G~~~----CG~C~C~~G~~G~~CEC~~~~~ss~~~~~~Cr~~~~~~vCSgrG~C~----CGqC~C~~~~~~~i~G 539 (783)
T KOG1226|consen 468 LCHGNGTFV----CGQCRCDEGWLGKKCECSTDELSSSEEEDKCRENSDSPVCSGRGDCV----CGQCVCHKPDNGKIYG 539 (783)
T ss_pred ccCCCCcEE----ecceecCCCCCCCcccCCccccCcHhHHhhccCCCCCCCcCCCCcEe----CCceEecCCCCCceee
Confidence 354445443 35899999999999997754433221133443 22799999998 34799999877 8
Q ss_pred CCCCcCCCCCCC----CCCCCCCCeEecCCCCcceeecCCCcccCCCCcC---CCCC----CCCCCCCeecCCCCccCCC
Q psy6570 584 KQCTEREDSPSC----HNYCDNAGLCSYSKQGKPVCTCVNGWSGITCSER---VSCA----HFCFNGGTCREQNYSLDPD 652 (713)
Q Consensus 584 ~~C~~~~~~~~C----~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C~~~---~~C~----~~C~~~~~C~~~~~~~~~~ 652 (713)
+.|+.. ...| ...|.++|.|.-.. |.|.+||+|..|.-. +.|. ..|+..|+|.-
T Consensus 540 ~fCECD--nfsC~r~~g~lC~g~G~C~CG~-----CvC~~GwtG~~C~C~~std~C~~~~G~iCSGrG~C~C-------- 604 (783)
T KOG1226|consen 540 KFCECD--NFSCERHKGVLCGGHGRCECGR-----CVCNPGWTGSACNCPLSTDTCESSDGQICSGRGTCEC-------- 604 (783)
T ss_pred eeeecc--CcccccccCcccCCCCeEeCCc-----EEcCCCCccCCCCCCCCCccccCCCCceeCCCceeeC--------
Confidence 888864 3456 34699999998665 999999999998743 2364 35888888877
Q ss_pred CCceeeCCCC-cccCCCCccc
Q psy6570 653 LKPICICPRG-YAGVRCQTLV 672 (713)
Q Consensus 653 ~~~~C~C~~G-y~G~~C~~~~ 672 (713)
++|+|.+. |.|..||...
T Consensus 605 --g~C~C~~~~~sG~~CE~cp 623 (783)
T KOG1226|consen 605 --GRCKCTDPPYSGEFCEKCP 623 (783)
T ss_pred --CceEcCCCCcCcchhhcCC
Confidence 78999876 9999988643
No 33
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=98.81 E-value=6.3e-07 Score=88.34 Aligned_cols=176 Identities=14% Similarity=0.187 Sum_probs=103.3
Q ss_pred CCceeEEEccCcc-cEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC--CceE--E---EEE--cC--CCC
Q psy6570 6 SGNVTRVKREMNL-KTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE--GRKK--R---TLL--NT--GLN 72 (713)
Q Consensus 6 ~~~I~~~~~~~~~-~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~--G~~~--~---~l~--~~--~~~ 72 (713)
.+.|+.++++|.. +.+ +.+...|+||++- .++.+.+++. ..++|+++..+ ++.. . .+. .. +-.
T Consensus 43 ~~~i~els~~G~vlr~i~l~g~~D~EgI~y~-g~~~~vl~~E---r~~~L~~~~~~~~~~~~~~~~~~~~~l~~~~~~N~ 118 (248)
T PF06977_consen 43 PGEIYELSLDGKVLRRIPLDGFGDYEGITYL-GNGRYVLSEE---RDQRLYIFTIDDDTTSLDRADVQKISLGFPNKGNK 118 (248)
T ss_dssp TTEEEEEETT--EEEEEE-SS-SSEEEEEE--STTEEEEEET---TTTEEEEEEE----TT--EEEEEEEE---S---SS
T ss_pred CCEEEEEcCCCCEEEEEeCCCCCCceeEEEE-CCCEEEEEEc---CCCcEEEEEEeccccccchhhceEEecccccCCCc
Confidence 5677888888754 223 6778899999995 4456666665 57788877773 2221 1 111 11 123
Q ss_pred CcceEEEcCCCCcEEEEccCCCCeEEEEec--CCCCcEEEEe-------CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEE
Q psy6570 73 EPYDIALEPLSGRMFWTELGIKPRISGASI--DGKNKFNLVD-------NNIQWPTGITIDYPSQRLYWADPKARTIESI 143 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~~~~~~I~~~~~--dG~~~~~l~~-------~~~~~p~glavd~~~~~LY~~d~~~~~I~~~ 143 (713)
...||+.|+.+++||++......+|+.++. .+....+... ..+..|.+|++|+.++.||+-...+++|..+
T Consensus 119 G~EGla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~es~~l~~~ 198 (248)
T PF06977_consen 119 GFEGLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSDESRLLLEL 198 (248)
T ss_dssp --EEEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEEEEEETTTTEEEEE
T ss_pred ceEEEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeEEEEECCCCeEEEE
Confidence 367999999999999986543335777765 2222222221 1356799999999999999999999999999
Q ss_pred eCCCCceeEEEecC------CCCccceeeeee-CCeEEEEeCCCCcEEEE
Q psy6570 144 NLNGKDRFVVYHTE------DNGYKPYKLEVF-EDNLYFSTYRTNNILKI 186 (713)
Q Consensus 144 ~~~g~~~~~~~~~~------~~~~~p~~i~~~-~~~ly~td~~~~~i~~~ 186 (713)
+.+|..+..+.-.. ..+++|-||+++ ++.||++.- .+..+++
T Consensus 199 d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE-pNlfy~f 247 (248)
T PF06977_consen 199 DRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE-PNLFYRF 247 (248)
T ss_dssp -TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET-TTEEEEE
T ss_pred CCCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC-CceEEEe
Confidence 99998665544332 235789999998 678999874 4555554
No 34
>KOG4260|consensus
Probab=98.81 E-value=6.1e-09 Score=98.02 Aligned_cols=141 Identities=29% Similarity=0.714 Sum_probs=95.8
Q ss_pred cCCCcccCCccccC-CCCCCCCCCceeeCCCCCCCCCceeeCCCCcccCCCCcc--------------CC---CCCCCCC
Q psy6570 333 CQENFYGTYCEKVN-NSMCPCLNQGMCYPDLTHPEPTYKCHCAPSYTGARCESR--------------IC---ENKCHNG 394 (713)
Q Consensus 333 C~~g~~G~~C~~~~-c~~~~C~~~~~C~~~~~~~~~~~~C~C~~G~~g~~C~~~--------------~C---~~~C~~~ 394 (713)
|++|.+|..|..-. =...||..+|.|... +...|+.+|.|.+||+|+.|... +| ...|.
T Consensus 132 Cp~gtyGpdCl~Cpggser~C~GnG~C~Gd-GsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C~-- 208 (350)
T KOG4260|consen 132 CPDGTYGPDCLQCPGGSERPCFGNGSCHGD-GSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGCL-- 208 (350)
T ss_pred cCCCCcCCccccCCCCCcCCcCCCCcccCC-CCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhhh--
Confidence 77888888776522 234579999999765 34577889999999999988532 12 22332
Q ss_pred CEEEc-CCCeee-CCCCCccC--CCC---cCC--CCCCCCCcEeccCCCCccccCCCCCcCCCCccCCCCC---C--CCC
Q psy6570 395 GTCIA-TTQTCV-CPPGFTGD--TCQ---QCL--NLKCQNGGVCVNKTTGLECDCPKFYYGKNCQYSQCKN---Y--CVN 460 (713)
Q Consensus 395 ~~C~~-~~~~C~-C~~g~~g~--~C~---~C~--~~~C~~~~~C~~~~~~~~C~C~~G~~g~~C~~~~C~~---~--~~~ 460 (713)
+.|.. .+-.|. |..||.-+ .|. +|. +.||..+..|+|+.|+|.|.+.+||.+. .++|.. . ..+
T Consensus 209 ~~Csg~~~k~C~kCkkGW~lde~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g---~d~C~~~~d~~~~kn 285 (350)
T KOG4260|consen 209 GVCSGESSKGCSKCKKGWKLDEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKG---VDECQFCADVCASKN 285 (350)
T ss_pred cccCCCCCCChhhhcccceecccccccHHHHhcCCCCCChhheeecCCCceEecccccccCC---hHHhhhhhhhcccCC
Confidence 24443 334554 88898754 344 554 4578889999999999999999999862 333332 1 356
Q ss_pred CeeecCCCCCeecCCCCcc
Q psy6570 461 GECSITDSGPKCMCSPGYS 479 (713)
Q Consensus 461 ~~C~~~~~~~~C~C~~G~~ 479 (713)
..|.++.+.|+|.|..|+.
T Consensus 286 ~~c~ni~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 286 RPCMNIDGQYRCVCFSGLI 304 (350)
T ss_pred CCcccCCccEEEEecccce
Confidence 6778888888887777763
No 35
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=98.76 E-value=1.9e-08 Score=69.38 Aligned_cols=42 Identities=33% Similarity=0.885 Sum_probs=39.2
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeC
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDY 125 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~ 125 (713)
++|||||++..++|+++++||+.+++++..++..|.|||||+
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 589999998777999999999999999999999999999994
No 36
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=98.74 E-value=4.3e-07 Score=96.77 Aligned_cols=151 Identities=17% Similarity=0.223 Sum_probs=100.0
Q ss_pred CCceEEEeccCCeEEEeecCCC----------------CCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEc
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGR----------------SSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTE 90 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~----------------~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td 90 (713)
.+.+|++++. ++||+++.... ..+.|++++++|...+++. .++..|.+|++|+ .|.||++|
T Consensus 125 ~~~~l~~gpD-G~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a-~G~rnp~Gl~~d~-~G~l~~td 201 (367)
T TIGR02604 125 SLNSLAWGPD-GWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVA-HGFQNPYGHSVDS-WGDVFFCD 201 (367)
T ss_pred cccCceECCC-CCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEEe-cCcCCCccceECC-CCCEEEEc
Confidence 3779999974 79999887310 1257999999998876554 6789999999997 78999998
Q ss_pred cCCCCeEEEEecC------------CC---------CcEE---------------EE-eCCCCCCeeEEEe-------CC
Q psy6570 91 LGIKPRISGASID------------GK---------NKFN---------------LV-DNNIQWPTGITID-------YP 126 (713)
Q Consensus 91 ~~~~~~I~~~~~d------------G~---------~~~~---------------l~-~~~~~~p~glavd-------~~ 126 (713)
.... ...++..- +. .... +. ......|.|+++- .-
T Consensus 202 n~~~-~~~~i~~~~~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ap~G~~~y~g~~fp~~~ 280 (367)
T TIGR02604 202 NDDP-PLCRVTPVAEGGRNGYQSFNGRRYDHADRGADHEVPTGEWRQDDRGVETVGDVAGGGTAPCGIAFYRGDALPEEY 280 (367)
T ss_pred cCCC-ceeEEcccccccccCCCCCCCcccccccccccccccccccccccccccccccccCCCccccEEEEeCCCcCCHHH
Confidence 7543 22322210 10 0000 00 0112368999997 34
Q ss_pred CCeEEEEcCCCCcEEEEeCC--CCce----eEEEecCCCCccceeeeee-CCeEEEEeCCCC
Q psy6570 127 SQRLYWADPKARTIESINLN--GKDR----FVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTN 181 (713)
Q Consensus 127 ~~~LY~~d~~~~~I~~~~~~--g~~~----~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~ 181 (713)
.+.||++++..++|+++.++ |... ..++.....+..|..|.+. ++.||++||...
T Consensus 281 ~g~~fv~~~~~~~v~~~~l~~~g~~~~~~~~~~l~~~~~~~rp~dv~~~pDG~Lyv~d~~~~ 342 (367)
T TIGR02604 281 RGLLLVGDAHGQLIVRYSLEPKGAGFKGERPEFLRSNDTWFRPVNVTVGPDGALYVSDWYDR 342 (367)
T ss_pred CCCEEeeeccCCEEEEEEeecCCCccEeecCceEecCCCcccccceeECCCCCEEEEEeccC
Confidence 57899999999999999875 4322 2233322234578888876 678999997553
No 37
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=98.73 E-value=3.6e-06 Score=87.06 Aligned_cols=196 Identities=13% Similarity=0.076 Sum_probs=124.5
Q ss_pred cCCceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 5 SSGNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
..+.|..+++.+...+. +.....|.++++++.+..|+++.. ....+..++++.......+.. ...|..+++++..
T Consensus 93 ~~~~l~~~d~~~~~~~~~~~~~~~~~~~~~~~dg~~l~~~~~---~~~~~~~~d~~~~~~~~~~~~-~~~~~~~~~s~dg 168 (300)
T TIGR03866 93 DDNLVTVIDIETRKVLAEIPVGVEPEGMAVSPDGKIVVNTSE---TTNMAHFIDTKTYEIVDNVLV-DQRPRFAEFTADG 168 (300)
T ss_pred CCCeEEEEECCCCeEEeEeeCCCCcceEEECCCCCEEEEEec---CCCeEEEEeCCCCeEEEEEEc-CCCccEEEECCCC
Confidence 35667777776543222 233346899999987777766654 334566667665433222222 2568899999877
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEEEEe-------CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEec
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFNLVD-------NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHT 156 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~l~~-------~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~ 156 (713)
.+||++..... .|...++........+. .....|.++++++..+.+|++....++|..+++...........
T Consensus 169 ~~l~~~~~~~~-~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~~ 247 (300)
T TIGR03866 169 KELWVSSEIGG-TVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLLV 247 (300)
T ss_pred CEEEEEcCCCC-EEEEEEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEe
Confidence 77777654333 78888876543222111 01235788999998889999887778899888865433222222
Q ss_pred CCCCccceeeee--eCCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccEEE
Q psy6570 157 EDNGYKPYKLEV--FEDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDVLI 209 (713)
Q Consensus 157 ~~~~~~p~~i~~--~~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v 209 (713)
...+..+++ ++.+||.+....+.|..++..++..+..+..+ ..|.+|++
T Consensus 248 ---~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~~~~~~~~~~-~~~~~~~~ 298 (300)
T TIGR03866 248 ---GQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAALKVIKSIKVG-RLPWGVVV 298 (300)
T ss_pred ---CCCcceEEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEcc-cccceeEe
Confidence 224555555 46678888777788999998877766666554 66777765
No 38
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=98.68 E-value=1.5e-06 Score=85.76 Aligned_cols=184 Identities=16% Similarity=0.182 Sum_probs=108.0
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
...++.||++++.+++||.+.. ....|+.++++|+..+.+--.+...+.+|++- .++.+.+++...+ +|.++..+
T Consensus 20 ~~~e~SGLTy~pd~~tLfaV~d---~~~~i~els~~G~vlr~i~l~g~~D~EgI~y~-g~~~~vl~~Er~~-~L~~~~~~ 94 (248)
T PF06977_consen 20 ILDELSGLTYNPDTGTLFAVQD---EPGEIYELSLDGKVLRRIPLDGFGDYEGITYL-GNGRYVLSEERDQ-RLYIFTID 94 (248)
T ss_dssp --S-EEEEEEETTTTEEEEEET---TTTEEEEEETT--EEEEEE-SS-SSEEEEEE--STTEEEEEETTTT-EEEEEEE-
T ss_pred ccCCccccEEcCCCCeEEEEEC---CCCEEEEEcCCCCEEEEEeCCCCCCceeEEEE-CCCEEEEEEcCCC-cEEEEEEe
Confidence 3446999999999999987766 57889999999988877766778889999996 4555555564444 78887774
Q ss_pred C--CCcEE--E--EeCC-----CCCCeeEEEeCCCCeEEEEcCCC-CcEEEEeC--CCCceeEEEe-----cCCCCccce
Q psy6570 104 G--KNKFN--L--VDNN-----IQWPTGITIDYPSQRLYWADPKA-RTIESINL--NGKDRFVVYH-----TEDNGYKPY 164 (713)
Q Consensus 104 G--~~~~~--l--~~~~-----~~~p~glavd~~~~~LY~~d~~~-~~I~~~~~--~g~~~~~~~~-----~~~~~~~p~ 164 (713)
. +.... + +... -...-|||+|+.+++||++.... .+|+.++. .+....+... ....+..+.
T Consensus 95 ~~~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S 174 (248)
T PF06977_consen 95 DDTTSLDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLS 174 (248)
T ss_dssp ---TT--EEEEEEEE---S---SS--EEEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---
T ss_pred ccccccchhhceEEecccccCCCcceEEEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceecccc
Confidence 3 22111 1 1111 12247999999999999996444 36788776 3333222221 112356778
Q ss_pred eeeee--CCeEEEEeCCCCcEEEEcccCCCcceeee--c-------cccccccEEEEeec
Q psy6570 165 KLEVF--EDNLYFSTYRTNNILKINKFGNSDFNVLA--N-------NLNRASDVLILQEN 213 (713)
Q Consensus 165 ~i~~~--~~~ly~td~~~~~i~~~~~~~~~~~~~~~--~-------~~~~~~~i~v~~~~ 213 (713)
+|+++ .++||+....+.+|..++..|. .+..+. . .+..|++|++....
T Consensus 175 ~l~~~p~t~~lliLS~es~~l~~~d~~G~-~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G 233 (248)
T PF06977_consen 175 GLSYDPRTGHLLILSDESRLLLELDRQGR-VVSSLSLDRGFHGLSKDIPQPEGIAFDPDG 233 (248)
T ss_dssp EEEEETTTTEEEEEETTTTEEEEE-TT---EEEEEE-STTGGG-SS---SEEEEEE-TT-
T ss_pred ceEEcCCCCeEEEEECCCCeEEEECCCCC-EEEEEEeCCcccCcccccCCccEEEECCCC
Confidence 88887 5689999999999999996544 333222 1 24668888876543
No 39
>KOG1836|consensus
Probab=98.68 E-value=3.5e-07 Score=110.48 Aligned_cols=243 Identities=30% Similarity=0.788 Sum_probs=130.7
Q ss_pred eeCCCCcccCCCCccC-------------CC---CCCCCC-CEEEcCCCeeeCCCCCccCCCC----------------c
Q psy6570 371 CHCAPSYTGARCESRI-------------CE---NKCHNG-GTCIATTQTCVCPPGFTGDTCQ----------------Q 417 (713)
Q Consensus 371 C~C~~G~~g~~C~~~~-------------C~---~~C~~~-~~C~~~~~~C~C~~g~~g~~C~----------------~ 417 (713)
|.|++||+|..|+... |. -.|..+ ..|...++.|.|.+--.|..|+ .
T Consensus 697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G~~C~~C~~GfYg~~~~~~~~d 776 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFGGQCAQCVDGFYGLPDLGTSGD 776 (1705)
T ss_pred ccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCCCchhhhcCCCCCccccCCCCC
Confidence 7778888777776320 11 112222 3455455555544444444433 2
Q ss_pred CCCCCCCCCcEeccCC--CCcccc-CCCCCcCCCCccCCCCCC------CCC---CeeecCCCCCeecCCCCccC---CC
Q psy6570 418 CLNLKCQNGGVCVNKT--TGLECD-CPKFYYGKNCQYSQCKNY------CVN---GECSITDSGPKCMCSPGYSG---KK 482 (713)
Q Consensus 418 C~~~~C~~~~~C~~~~--~~~~C~-C~~G~~g~~C~~~~C~~~------~~~---~~C~~~~~~~~C~C~~G~~g---~~ 482 (713)
|.+.+|.+.+.|..+. ....|. |++||+|..|+. |... -.+ ..|. .|.|....-- ..
T Consensus 777 C~~C~Cp~~~~~~~~~~~~~~iCk~Cp~gytG~rCe~--c~dgyfg~p~~~~~~~~~c~------~c~c~~n~dp~~~g~ 848 (1705)
T KOG1836|consen 777 CQPCPCPNGGACGQTPEILEVVCKNCPPGYTGLRCEE--CADGYFGNPLGHDGDVRPCQ------SCQCNFNVDPNAFGN 848 (1705)
T ss_pred CccCCCCCChhhcCcCcccceecCCCCCCCccccccc--CCCccccCCCCCCCCcccCc------cceeccccCcccccc
Confidence 6666788888887654 346788 999999998862 2221 001 0111 2222221110 01
Q ss_pred CCC-----ccccCCCCCCcc-cCCCCccCCC--------CCCCCCCCCc------EEcCCCCCeeccCCCCCCCCCCccc
Q psy6570 483 CDT-----CTCLNGDSGPKC-MCSPGYSGKK--------CDTCTCLNGG------TCIPNSKNNVCKCPSQYTGRRCECA 542 (713)
Q Consensus 483 C~~-----~~C~~~~~~~~C-~C~~G~~g~~--------C~~~~C~~~g------~C~~~~~~~~C~C~~g~~G~~C~~~ 542 (713)
|+. -.|.....+..| .|.+||.|+. |..+-|...+ .|.+. +++|.|.+.-.|..|...
T Consensus 849 c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~--tGQcec~~~v~g~~c~~c 926 (1705)
T KOG1836|consen 849 CNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPV--TGQCECKPNVEGRDCLYC 926 (1705)
T ss_pred ccccccceeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccCCCc--ccceeccCCCCccccccc
Confidence 110 023333334444 5889998853 4444443322 24332 678999998888887532
Q ss_pred CCCCCCCCCCCCCCCCCCCCCc----EEeecCCCceeeCCCCCcCCCCCcCCCC------CCC-CCCCCCCC----eEec
Q psy6570 543 VGDTSCASLANKCTPNYCSNNG----TCVLIEGKPSCKCLPPYSGKQCTEREDS------PSC-HNYCDNAG----LCSY 607 (713)
Q Consensus 543 ~~~~~c~~~~~~C~~~~C~~~~----~C~~~~g~~~C~C~~G~~G~~C~~~~~~------~~C-~~~C~~~g----~C~~ 607 (713)
.....-......|.+..|+..| .|.. ++.+|.|.+|.+|..|.....- ..| .-.|...| +|..
T Consensus 927 ~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~--~tGqc~c~~gVtgqrc~qc~~~~~~~~~~gc~~c~c~~~Gs~~~qc~~ 1004 (1705)
T KOG1836|consen 927 FKGFFNLNSGVGCEPCNCDPTGSESSDCDV--GTGQCYCRPGVTGQRCDQCETYHFGFQTEGCGLCECDPLGSRGFQCDP 1004 (1705)
T ss_pred cccccccCCCCCcccccccccccccccccc--cCCceeeecCccccccCccccCcccccccCCcceecccCCcccceecc
Confidence 1111111123456666676554 5664 4457999999999998764211 112 12244444 4643
Q ss_pred CCCCcceeecCCCcccCCCCc
Q psy6570 608 SKQGKPVCTCVNGWSGITCSE 628 (713)
Q Consensus 608 ~~~g~~~C~C~~G~~G~~C~~ 628 (713)
. ..+|.|++|+.|..|..
T Consensus 1005 ~---~G~c~c~~~~~g~~c~~ 1022 (1705)
T KOG1836|consen 1005 E---DGQCPCRPGFEGRRCDQ 1022 (1705)
T ss_pred c---CCeeeecCCCCCccccc
Confidence 2 35799999999877653
No 40
>KOG1836|consensus
Probab=98.68 E-value=3.9e-07 Score=110.07 Aligned_cols=273 Identities=27% Similarity=0.609 Sum_probs=152.8
Q ss_pred cccCCCCCCCccc-cCCCCccCCCC------CCCC-----CCCCCC--C--eeecCCCCCCee-ecCCCcccCCcc--cc
Q psy6570 285 CLCPDHLTEELNV-TSGKMSCKVAP------ARTC-----YLDCNH--G--TCEFDDDFDPHC-ICQENFYGTYCE--KV 345 (713)
Q Consensus 285 c~C~~~~~~~~c~-C~~g~~~~~~~------~~~C-----~~~C~~--~--~C~~~~~~~~~C-~C~~g~~G~~C~--~~ 345 (713)
|.|+.++.+..|. |.+||+...-. --.| .+.|.. | .|+....+ ..| +|..||+|..=. ..
T Consensus 697 c~C~~g~tG~~Ce~C~~gfrr~~~~~~~~~~c~~C~cngh~~~Cd~~tG~C~C~~~t~G-~~C~~C~~GfYg~~~~~~~~ 775 (1705)
T KOG1836|consen 697 CTCPVGYTGQFCESCAPGFRRLSPQLGPFCPCIPCDCNGHSNICDPRTGQCKCKHNTFG-GQCAQCVDGFYGLPDLGTSG 775 (1705)
T ss_pred ccCCCCcccchhhhcchhhhcccccCCCCCcccccccCCccccccCCCCceecccCCCC-CchhhhcCCCCCccccCCCC
Confidence 7888888777776 88887643211 1111 123332 2 23322222 344 677777765322 12
Q ss_pred CCCCCCCCCCceeeCCCCCCCCCceee-CCCCcccCCCCccC--CC-CCCCCC---CEEEcCCCeeeCCCCCccCCCCcC
Q psy6570 346 NNSMCPCLNQGMCYPDLTHPEPTYKCH-CAPSYTGARCESRI--CE-NKCHNG---GTCIATTQTCVCPPGFTGDTCQQC 418 (713)
Q Consensus 346 ~c~~~~C~~~~~C~~~~~~~~~~~~C~-C~~G~~g~~C~~~~--C~-~~C~~~---~~C~~~~~~C~C~~g~~g~~C~~C 418 (713)
+|..++|.+++.|..... .....|. |++||+|.+|+... .. ++=.++ ..| -.|.|.....-.
T Consensus 776 dC~~C~Cp~~~~~~~~~~--~~~~iCk~Cp~gytG~rCe~c~dgyfg~p~~~~~~~~~c----~~c~c~~n~dp~----- 844 (1705)
T KOG1836|consen 776 DCQPCPCPNGGACGQTPE--ILEVVCKNCPPGYTGLRCEECADGYFGNPLGHDGDVRPC----QSCQCNFNVDPN----- 844 (1705)
T ss_pred CCccCCCCCChhhcCcCc--ccceecCCCCCCCcccccccCCCccccCCCCCCCCcccC----ccceeccccCcc-----
Confidence 388899999998877643 3367898 99999999997521 00 010010 011 012222111000
Q ss_pred CCCCCCCC-c---EeccCCCCccc-cCCCCCcCCC--------CccCCCCCC---CCCCeeecCCCCCeecCCCCccCCC
Q psy6570 419 LNLKCQNG-G---VCVNKTTGLEC-DCPKFYYGKN--------CQYSQCKNY---CVNGECSITDSGPKCMCSPGYSGKK 482 (713)
Q Consensus 419 ~~~~C~~~-~---~C~~~~~~~~C-~C~~G~~g~~--------C~~~~C~~~---~~~~~C~~~~~~~~C~C~~G~~g~~ 482 (713)
....|... + .|+....+..| .|.+||+|.. |....|... .....|....| +|.|.+.-.|..
T Consensus 845 ~~g~c~~~tg~c~~ci~nT~g~~cd~c~~g~~gd~l~~~p~~~c~~c~c~p~gs~~~~~~c~~~tG--Qcec~~~v~g~~ 922 (1705)
T KOG1836|consen 845 AFGNCNRLTGECLKCIHNTAGEYCDLCKEGYFGDPLAPNPEDKCFACGCVPAGSELPSLTCNPVTG--QCECKPNVEGRD 922 (1705)
T ss_pred ccccccccccceeeccCCcccccccccccCccccccCCCcCCccccccCccCCcccccccCCCccc--ceeccCCCCccc
Confidence 00122221 2 23444445566 4888888753 322222221 11233555554 889999988888
Q ss_pred CCCccccCCCCCCcccCCCCccC----CCCCCCCCCCCc----EEcCCCCCeeccCCCCCCCCCCcccCCCCCCCCCCCC
Q psy6570 483 CDTCTCLNGDSGPKCMCSPGYSG----KKCDTCTCLNGG----TCIPNSKNNVCKCPSQYTGRRCECAVGDTSCASLANK 554 (713)
Q Consensus 483 C~~~~C~~~~~~~~C~C~~G~~g----~~C~~~~C~~~g----~C~~~~~~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~ 554 (713)
|.. |.+||++ ..|.++.|..-| .|.. ++++|.|.+|.+|.+|........-.. ...
T Consensus 923 c~~-------------c~~g~fnl~s~~gC~~c~c~~~gs~~~~c~~--~tGqc~c~~gVtgqrc~qc~~~~~~~~-~~g 986 (1705)
T KOG1836|consen 923 CLY-------------CFKGFFNLNSGVGCEPCNCDPTGSESSDCDV--GTGQCYCRPGVTGQRCDQCETYHFGFQ-TEG 986 (1705)
T ss_pred ccc-------------ccccccccCCCCCcccccccccccccccccc--cCCceeeecCccccccCccccCccccc-ccC
Confidence 875 6777774 457777776543 4543 477999999999999873321111111 234
Q ss_pred CCCCCCCCCc----EEeecCCCceeeCCCCCcCCCCCcC
Q psy6570 555 CTPNYCSNNG----TCVLIEGKPSCKCLPPYSGKQCTER 589 (713)
Q Consensus 555 C~~~~C~~~~----~C~~~~g~~~C~C~~G~~G~~C~~~ 589 (713)
|....|...| +|....| +|.|++++.|..|...
T Consensus 987 c~~c~c~~~Gs~~~qc~~~~G--~c~c~~~~~g~~c~~c 1023 (1705)
T KOG1836|consen 987 CGLCECDPLGSRGFQCDPEDG--QCPCRPGFEGRRCDQC 1023 (1705)
T ss_pred CcceecccCCcccceecccCC--eeeecCCCCCcccccc
Confidence 5455566666 6876554 7999999999887654
No 41
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=98.63 E-value=8.7e-08 Score=66.13 Aligned_cols=41 Identities=34% Similarity=0.656 Sum_probs=38.3
Q ss_pred CeEEEeecCCCCCC-eEEEEecCCceEEEEEcCCCCCcceEEEcC
Q psy6570 38 KNLYWTDAGGRSSN-NIMVSTLEGRKKRTLLNTGLNEPYDIALEP 81 (713)
Q Consensus 38 ~~ly~td~~~~~~~-~I~~~~~~G~~~~~l~~~~~~~p~~iavD~ 81 (713)
++|||+|. ..+ .|.+.+++|+.+++++...+..|.+|||||
T Consensus 1 ~~iYWtD~---~~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDW---SQDPSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEET---TTTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEEC---CCCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 58999999 677 999999999999999999999999999996
No 42
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=98.61 E-value=9.5e-07 Score=92.45 Aligned_cols=148 Identities=19% Similarity=0.219 Sum_probs=100.3
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEc------CCCCCcceEEEcC---CCCcEEEEccCC--
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLN------TGLNEPYDIALEP---LSGRMFWTELGI-- 93 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~------~~~~~p~~iavD~---~~~~ly~td~~~-- 93 (713)
|++|.+|++.+. ++||+++. .++|.+++.+|.....+.. .+.....+||++| .+++||++-...
T Consensus 1 L~~P~~~a~~pd-G~l~v~e~----~G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t~~~~ 75 (331)
T PF07995_consen 1 LNNPRSMAFLPD-GRLLVAER----SGRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYTNADE 75 (331)
T ss_dssp ESSEEEEEEETT-SCEEEEET----TTEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEEEE-T
T ss_pred CCCceEEEEeCC-CcEEEEeC----CceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEEcccC
Confidence 578999999987 79999986 7899999988876233222 2245678999998 468888876521
Q ss_pred -----CCeEEEEecCCC-----CcEEEEe------CCCCCCeeEEEeCCCCeEEEEc-------------CCCCcEEEEe
Q psy6570 94 -----KPRISGASIDGK-----NKFNLVD------NNIQWPTGITIDYPSQRLYWAD-------------PKARTIESIN 144 (713)
Q Consensus 94 -----~~~I~~~~~dG~-----~~~~l~~------~~~~~p~glavd~~~~~LY~~d-------------~~~~~I~~~~ 144 (713)
..+|.|..++.. ..++|+. ...+....|+|++ .++|||+- ...++|.|++
T Consensus 76 ~~~~~~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~ilri~ 154 (331)
T PF07995_consen 76 DGGDNDNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKILRID 154 (331)
T ss_dssp SSSSEEEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEEEEEE
T ss_pred CCCCcceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCC-CCcEEEEeCCCCCcccccccccccceEEEec
Confidence 137888877654 2333432 2345667899997 45999983 2346799999
Q ss_pred CCCC-------------ceeEEEecCCCCccceeeeeeC--CeEEEEeCCCC
Q psy6570 145 LNGK-------------DRFVVYHTEDNGYKPYKLEVFE--DNLYFSTYRTN 181 (713)
Q Consensus 145 ~~g~-------------~~~~~~~~~~~~~~p~~i~~~~--~~ly~td~~~~ 181 (713)
.+|+ ..+++... +++|.+++++. +.||.+|.+..
T Consensus 155 ~dG~~p~dnP~~~~~~~~~~i~A~G---lRN~~~~~~d~~tg~l~~~d~G~~ 203 (331)
T PF07995_consen 155 PDGSIPADNPFVGDDGADSEIYAYG---LRNPFGLAFDPNTGRLWAADNGPD 203 (331)
T ss_dssp TTSSB-TTSTTTTSTTSTTTEEEE-----SEEEEEEEETTTTEEEEEEE-SS
T ss_pred ccCcCCCCCccccCCCceEEEEEeC---CCccccEEEECCCCcEEEEccCCC
Confidence 9987 44555544 88999999984 68999886543
No 43
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=98.57 E-value=2.2e-07 Score=75.24 Aligned_cols=73 Identities=18% Similarity=0.288 Sum_probs=57.7
Q ss_pred ceEEEcCCCCcEEEEccCC----------------CCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCC
Q psy6570 75 YDIALEPLSGRMFWTELGI----------------KPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKAR 138 (713)
Q Consensus 75 ~~iavD~~~~~ly~td~~~----------------~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~ 138 (713)
++|+|++.+|.|||||+.. .+||++.++..+..++|+. ++..|+||++++++..|++++....
T Consensus 1 ndldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~-~L~fpNGVals~d~~~vlv~Et~~~ 79 (89)
T PF03088_consen 1 NDLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLD-GLYFPNGVALSPDESFVLVAETGRY 79 (89)
T ss_dssp -EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEE-EESSEEEEEE-TTSSEEEEEEGGGT
T ss_pred CceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehh-CCCccCeEEEcCCCCEEEEEeccCc
Confidence 4789998779999998643 2689999998876666665 7999999999999999999999999
Q ss_pred cEEEEeCCCC
Q psy6570 139 TIESINLNGK 148 (713)
Q Consensus 139 ~I~~~~~~g~ 148 (713)
||.++.+.|.
T Consensus 80 Ri~rywl~Gp 89 (89)
T PF03088_consen 80 RILRYWLKGP 89 (89)
T ss_dssp EEEEEESSST
T ss_pred eEEEEEEeCC
Confidence 9999998873
No 44
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=98.55 E-value=1.5e-05 Score=82.27 Aligned_cols=182 Identities=14% Similarity=0.115 Sum_probs=117.0
Q ss_pred CcccCCceeEEEccCccc-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEc
Q psy6570 2 ASISSGNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALE 80 (713)
Q Consensus 2 ad~~~~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD 80 (713)
+...++.|..+++.+... ..+.....|.++++++.++.||++.. ..+.|.+.++++......+.. ...|..++++
T Consensus 6 s~~~d~~v~~~d~~t~~~~~~~~~~~~~~~l~~~~dg~~l~~~~~---~~~~v~~~d~~~~~~~~~~~~-~~~~~~~~~~ 81 (300)
T TIGR03866 6 SNEKDNTISVIDTATLEVTRTFPVGQRPRGITLSKDGKLLYVCAS---DSDTIQVIDLATGEVIGTLPS-GPDPELFALH 81 (300)
T ss_pred EecCCCEEEEEECCCCceEEEEECCCCCCceEECCCCCEEEEEEC---CCCeEEEEECCCCcEEEeccC-CCCccEEEEC
Confidence 445678899999876432 23444566899999988888998876 578899999875443322322 2457889999
Q ss_pred CCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCC
Q psy6570 81 PLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNG 160 (713)
Q Consensus 81 ~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~ 160 (713)
+..+.||++..... .|...++........+. ....|.++++++.+..|+++......+..++............ .
T Consensus 82 ~~g~~l~~~~~~~~-~l~~~d~~~~~~~~~~~-~~~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~~~~~~~~~~---~ 156 (300)
T TIGR03866 82 PNGKILYIANEDDN-LVTVIDIETRKVLAEIP-VGVEPEGMAVSPDGKIVVNTSETTNMAHFIDTKTYEIVDNVLV---D 156 (300)
T ss_pred CCCCEEEEEcCCCC-eEEEEECCCCeEEeEee-CCCCcceEEECCCCCEEEEEecCCCeEEEEeCCCCeEEEEEEc---C
Confidence 88888888865444 78888887543222222 1235789999988878777766555566667654332221111 1
Q ss_pred ccceeeee--eCCeEEEEeCCCCcEEEEcccCCC
Q psy6570 161 YKPYKLEV--FEDNLYFSTYRTNNILKINKFGNS 192 (713)
Q Consensus 161 ~~p~~i~~--~~~~ly~td~~~~~i~~~~~~~~~ 192 (713)
..|..+++ ++.+||++....+.|..++...+.
T Consensus 157 ~~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~ 190 (300)
T TIGR03866 157 QRPRFAEFTADGKELWVSSEIGGTVSVIDVATRK 190 (300)
T ss_pred CCccEEEECCCCCEEEEEcCCCCEEEEEEcCcce
Confidence 23444444 455676665556778777765443
No 45
>KOG4499|consensus
Probab=98.54 E-value=4.5e-06 Score=78.09 Aligned_cols=129 Identities=15% Similarity=0.243 Sum_probs=86.6
Q ss_pred CCcceEEEcCCCCcEEEEccCC--------CCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEE
Q psy6570 72 NEPYDIALEPLSGRMFWTELGI--------KPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESI 143 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~--------~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~ 143 (713)
++-++--|||. |+.|.--+.. .+.+++--+++ ....+. ..+.-|+||++|.+.+..|++|+.+..|..+
T Consensus 109 nR~NDgkvdP~-Gryy~GtMad~~~~le~~~g~Ly~~~~~h-~v~~i~-~~v~IsNgl~Wd~d~K~fY~iDsln~~V~a~ 185 (310)
T KOG4499|consen 109 NRLNDGKVDPD-GRYYGGTMADFGDDLEPIGGELYSWLAGH-QVELIW-NCVGISNGLAWDSDAKKFYYIDSLNYEVDAY 185 (310)
T ss_pred cccccCccCCC-CceeeeeeccccccccccccEEEEeccCC-Cceeee-hhccCCccccccccCcEEEEEccCceEEeee
Confidence 45566778875 4446532111 12333333333 333333 4677899999999999999999999999766
Q ss_pred eCC--C---CceeEEEecCC----CCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceeeeccccc
Q psy6570 144 NLN--G---KDRFVVYHTED----NGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVLANNLNR 203 (713)
Q Consensus 144 ~~~--g---~~~~~~~~~~~----~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~ 203 (713)
++| + +++++++.... ....|.|++++ +++||++.+..++|+++++..+.....+.....+
T Consensus 186 dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~~V~~~dp~tGK~L~eiklPt~q 255 (310)
T KOG4499|consen 186 DYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGGTVQKVDPTTGKILLEIKLPTPQ 255 (310)
T ss_pred ecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCcEEEEECCCCCcEEEEEEcCCCc
Confidence 643 2 45666654322 23467888887 7899999999999999999888776655544333
No 46
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=98.53 E-value=7.3e-06 Score=78.58 Aligned_cols=207 Identities=12% Similarity=0.062 Sum_probs=135.4
Q ss_pred CCcccCCceeEEEccCcc-cEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcC---CCCCcc
Q psy6570 1 MASISSGNVTRVKREMNL-KTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNT---GLNEPY 75 (713)
Q Consensus 1 vad~~~~~I~~~~~~~~~-~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~---~~~~p~ 75 (713)
|+++..+.|-++++.+.+ +++ +..-..|++|.+++. +..+++|. .. .|.|++.+....+.+-.. .-..-+
T Consensus 77 ft~qg~gaiGhLdP~tGev~~ypLg~Ga~Phgiv~gpd-g~~Witd~---~~-aI~R~dpkt~evt~f~lp~~~a~~nle 151 (353)
T COG4257 77 FTAQGTGAIGHLDPATGEVETYPLGSGASPHGIVVGPD-GSAWITDT---GL-AIGRLDPKTLEVTRFPLPLEHADANLE 151 (353)
T ss_pred EecCccccceecCCCCCceEEEecCCCCCCceEEECCC-CCeeEecC---cc-eeEEecCcccceEEeecccccCCCccc
Confidence 355667888888887644 444 667888999999986 77888998 33 899988765444433211 122344
Q ss_pred eEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEe
Q psy6570 76 DIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYH 155 (713)
Q Consensus 76 ~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~ 155 (713)
..++|+ .|+|+||.... .--|.+......+++-...-..|+||.+.+ ++.||++....+.|.++|......+++..
T Consensus 152 t~vfD~-~G~lWFt~q~G--~yGrLdPa~~~i~vfpaPqG~gpyGi~atp-dGsvwyaslagnaiaridp~~~~aev~p~ 227 (353)
T COG4257 152 TAVFDP-WGNLWFTGQIG--AYGRLDPARNVISVFPAPQGGGPYGICATP-DGSVWYASLAGNAIARIDPFAGHAEVVPQ 227 (353)
T ss_pred ceeeCC-CccEEEeeccc--cceecCcccCceeeeccCCCCCCcceEECC-CCcEEEEeccccceEEcccccCCcceecC
Confidence 578995 78888887531 222444444444444444556799999995 68999999988999999975545555443
Q ss_pred cCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCccee-eeccccccccEEEEeecccc
Q psy6570 156 TEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNV-LANNLNRASDVLILQENKQA 216 (713)
Q Consensus 156 ~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~i~v~~~~~q~ 216 (713)
..........|..+ .+++++++|.++.+.++++....=.+. |.....++.++.|..+-+.+
T Consensus 228 P~~~~~gsRriwsdpig~~wittwg~g~l~rfdPs~~sW~eypLPgs~arpys~rVD~~grVW 290 (353)
T COG4257 228 PNALKAGSRRIWSDPIGRAWITTWGTGSLHRFDPSVTSWIEYPLPGSKARPYSMRVDRHGRVW 290 (353)
T ss_pred CCcccccccccccCccCcEEEeccCCceeeEeCcccccceeeeCCCCCCCcceeeeccCCcEE
Confidence 22111222445555 578999999999999999865542222 23334556677666655544
No 47
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=98.52 E-value=3.5e-05 Score=76.98 Aligned_cols=156 Identities=16% Similarity=0.188 Sum_probs=108.4
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEc---CCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLN---TGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~---~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
++..-++|.++.|++.|- +.++|..++++-........ ..-..|+-|++.|..+..|+...-++ .|.+...+.
T Consensus 147 ~H~a~~tP~~~~l~v~DL---G~Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL~s-tV~v~~y~~ 222 (346)
T COG2706 147 VHSANFTPDGRYLVVPDL---GTDRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNELNS-TVDVLEYNP 222 (346)
T ss_pred cceeeeCCCCCEEEEeec---CCceEEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEeccCC-EEEEEEEcC
Confidence 677889999999999999 78999998886332221111 23467999999999889999886655 888888876
Q ss_pred C-CcEEEEe---------CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC--CCCceeEEEecCCCCccceeeee--eC
Q psy6570 105 K-NKFNLVD---------NNIQWPTGITIDYPSQRLYWADPKARTIESINL--NGKDRFVVYHTEDNGYKPYKLEV--FE 170 (713)
Q Consensus 105 ~-~~~~l~~---------~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~--~g~~~~~~~~~~~~~~~p~~i~~--~~ 170 (713)
. .+...+. .+-.+...|.|++++..||+++.+.+.|..+.+ ++.....+.........|..+++ .+
T Consensus 223 ~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~teg~~PR~F~i~~~g 302 (346)
T COG2706 223 AVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLELVGITPTEGQFPRDFNINPSG 302 (346)
T ss_pred CCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEeccCCcCCccceeCCCC
Confidence 4 2211111 234466789999999999999998888776665 45555555444443445655555 47
Q ss_pred CeEEEEeCCCCcEEEEc
Q psy6570 171 DNLYFSTYRTNNILKIN 187 (713)
Q Consensus 171 ~~ly~td~~~~~i~~~~ 187 (713)
+.|+++...++.|.++.
T Consensus 303 ~~Liaa~q~sd~i~vf~ 319 (346)
T COG2706 303 RFLIAANQKSDNITVFE 319 (346)
T ss_pred CEEEEEccCCCcEEEEE
Confidence 78888876666654443
No 48
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=98.51 E-value=1e-05 Score=86.40 Aligned_cols=156 Identities=17% Similarity=0.209 Sum_probs=105.5
Q ss_pred ccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEE------c-CCCCCcceEEEcCC------CC
Q psy6570 18 LKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLL------N-TGLNEPYDIALEPL------SG 84 (713)
Q Consensus 18 ~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~------~-~~~~~p~~iavD~~------~~ 84 (713)
.++++++|..|++|++.+. ++||+++. ..++|.+++.++...+.+. . .+...+.+||++|. ++
T Consensus 22 ~~~va~GL~~Pw~maflPD-G~llVtER---~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~ 97 (454)
T TIGR03606 22 KKVLLSGLNKPWALLWGPD-NQLWVTER---ATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNP 97 (454)
T ss_pred EEEEECCCCCceEEEEcCC-CeEEEEEe---cCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCc
Confidence 4567889999999999975 68999996 4689999887654433221 1 13456789999964 46
Q ss_pred cEEEEccC--------CCCeEEEEecCCC-----CcEEEEeC----CCCCCeeEEEeCCCCeEEEEc--C----------
Q psy6570 85 RMFWTELG--------IKPRISGASIDGK-----NKFNLVDN----NIQWPTGITIDYPSQRLYWAD--P---------- 135 (713)
Q Consensus 85 ~ly~td~~--------~~~~I~~~~~dG~-----~~~~l~~~----~~~~p~glavd~~~~~LY~~d--~---------- 135 (713)
+||++-.. ...+|.|+.++.. ..++|+.. ..+.-..|+|+++ ++||++- .
T Consensus 98 ~lYvsyt~~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgPD-G~LYVs~GD~g~~~~~n~~~ 176 (454)
T TIGR03606 98 YVYISYTYKNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGPD-GKIYYTIGEQGRNQGANFFL 176 (454)
T ss_pred EEEEEEeccCCCCCccCCcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECCC-CcEEEEECCCCCCCcccccC
Confidence 79987311 1238999887632 23444431 2345668999964 6899973 2
Q ss_pred --------------------CCCcEEEEeCCCCc----------eeEEEecCCCCccceeeeee-CCeEEEEeCCC
Q psy6570 136 --------------------KARTIESINLNGKD----------RFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRT 180 (713)
Q Consensus 136 --------------------~~~~I~~~~~~g~~----------~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~ 180 (713)
..++|.|++.||+- +..+... .+++|.+|+++ .+.||.++.+.
T Consensus 177 ~~~aQ~~~~~~~~~~~d~~~~~GkILRin~DGsiP~dNPf~~g~~~eIyA~--G~RNp~Gla~dp~G~Lw~~e~Gp 250 (454)
T TIGR03606 177 PNQAQHTPTQQELNGKDYHAYMGKVLRLNLDGSIPKDNPSINGVVSHIFTY--GHRNPQGLAFTPDGTLYASEQGP 250 (454)
T ss_pred cchhccccccccccccCcccCceEEEEEcCCCCCCCCCCccCCCcceEEEE--eccccceeEECCCCCEEEEecCC
Confidence 12379999999962 1123332 47899999987 67899988764
No 49
>KOG4260|consensus
Probab=98.50 E-value=9.9e-08 Score=90.04 Aligned_cols=149 Identities=24% Similarity=0.555 Sum_probs=93.6
Q ss_pred CCCCccCCCCCC------CCCCCCcEEcCC---CCCeeccCCCCCCCCCCcccCCCC-CCCCC--CCCCCCC--CCCCCc
Q psy6570 499 CSPGYSGKKCDT------CTCLNGGTCIPN---SKNNVCKCPSQYTGRRCECAVGDT-SCASL--ANKCTPN--YCSNNG 564 (713)
Q Consensus 499 C~~G~~g~~C~~------~~C~~~g~C~~~---~~~~~C~C~~g~~G~~C~~~~~~~-~c~~~--~~~C~~~--~C~~~~ 564 (713)
|++|.+|..|.. .+|..+|.|.-. .|++.|.|..||.|+.|..-...+ .-... .--|... .| .+
T Consensus 132 Cp~gtyGpdCl~Cpggser~C~GnG~C~GdGsR~GsGkCkC~~GY~Gp~C~~Cg~eyfes~Rne~~lvCt~Ch~~C--~~ 209 (350)
T KOG4260|consen 132 CPDGTYGPDCLQCPGGSERPCFGNGSCHGDGSREGSGKCKCETGYTGPLCRYCGIEYFESSRNEQHLVCTACHEGC--LG 209 (350)
T ss_pred cCCCCcCCccccCCCCCcCCcCCCCcccCCCCCCCCCcccccCCCCCccccccchHHHHhhcccccchhhhhhhhh--hc
Confidence 445555554432 368888888643 358899999999999986321100 00000 0012111 12 12
Q ss_pred EEeecCCCcee-eCCCCCcC--CCCCcCCCCCCC---CCCCCCCCeEecCCCCcceeecCCCcccCCCCcCCCCC---CC
Q psy6570 565 TCVLIEGKPSC-KCLPPYSG--KQCTEREDSPSC---HNYCDNAGLCSYSKQGKPVCTCVNGWSGITCSERVSCA---HF 635 (713)
Q Consensus 565 ~C~~~~g~~~C-~C~~G~~G--~~C~~~~~~~~C---~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C~~~~~C~---~~ 635 (713)
.|.... +-.| .|..||.- ..|.++ ++| +.+|..+..|+|+. |+|.|.+++||.+. .++|. ..
T Consensus 210 ~Csg~~-~k~C~kCkkGW~lde~gCvDv---nEC~~ep~~c~~~qfCvNte-GSf~C~dk~Gy~~g----~d~C~~~~d~ 280 (350)
T KOG4260|consen 210 VCSGES-SKGCSKCKKGWKLDEEGCVDV---NECQNEPAPCKAHQFCVNTE-GSFKCEDKEGYKKG----VDECQFCADV 280 (350)
T ss_pred ccCCCC-CCChhhhcccceecccccccH---HHHhcCCCCCChhheeecCC-CceEecccccccCC----hHHhhhhhhh
Confidence 444322 2345 69999974 358776 888 67899999999987 99999999999873 34442 33
Q ss_pred C-CCCCeecCCCCccCCCCCceeeCCCCcc
Q psy6570 636 C-FNGGTCREQNYSLDPDLKPICICPRGYA 664 (713)
Q Consensus 636 C-~~~~~C~~~~~~~~~~~~~~C~C~~Gy~ 664 (713)
| ..+..|.+ .++.++|+|..|+.
T Consensus 281 ~~~kn~~c~n------i~~~~r~v~f~~~~ 304 (350)
T KOG4260|consen 281 CASKNRPCMN------IDGQYRCVCFSGLI 304 (350)
T ss_pred cccCCCCccc------CCccEEEEecccce
Confidence 4 23455654 66779999999874
No 50
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=98.40 E-value=1.9e-05 Score=75.76 Aligned_cols=187 Identities=13% Similarity=0.100 Sum_probs=120.2
Q ss_pred CcccCCceeEEEccCcc-cEE-e---cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcce
Q psy6570 2 ASISSGNVTRVKREMNL-KTV-L---SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYD 76 (713)
Q Consensus 2 ad~~~~~I~~~~~~~~~-~~~-~---~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~ 76 (713)
+|... .|.|++.++.+ +.. + ....+-+...||. .++|+++.. .+.--+.+.-....++.-...-..|.|
T Consensus 120 td~~~-aI~R~dpkt~evt~f~lp~~~a~~nlet~vfD~-~G~lWFt~q----~G~yGrLdPa~~~i~vfpaPqG~gpyG 193 (353)
T COG4257 120 TDTGL-AIGRLDPKTLEVTRFPLPLEHADANLETAVFDP-WGNLWFTGQ----IGAYGRLDPARNVISVFPAPQGGGPYG 193 (353)
T ss_pred ecCcc-eeEEecCcccceEEeecccccCCCcccceeeCC-CccEEEeec----cccceecCcccCceeeeccCCCCCCcc
Confidence 44444 78888886643 222 1 2233345578995 478888876 233335565555555544344578999
Q ss_pred EEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEe-CC-CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEE
Q psy6570 77 IALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVD-NN-IQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVY 154 (713)
Q Consensus 77 iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~-~~-~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~ 154 (713)
|.+.| +|.+|++....+ .|-+++.-....+++.. .. -+....|..|+ -++++.+++.+.++++++..-..-....
T Consensus 194 i~atp-dGsvwyaslagn-aiaridp~~~~aev~p~P~~~~~gsRriwsdp-ig~~wittwg~g~l~rfdPs~~sW~eyp 270 (353)
T COG4257 194 ICATP-DGSVWYASLAGN-AIARIDPFAGHAEVVPQPNALKAGSRRIWSDP-IGRAWITTWGTGSLHRFDPSVTSWIEYP 270 (353)
T ss_pred eEECC-CCcEEEEecccc-ceEEcccccCCcceecCCCcccccccccccCc-cCcEEEeccCCceeeEeCcccccceeee
Confidence 99996 789999987666 78777654434444432 12 22345567774 6799999999999999998665433222
Q ss_pred ecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceeeec
Q psy6570 155 HTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVLAN 199 (713)
Q Consensus 155 ~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~~~ 199 (713)
.. ....+|+++-|+ .++++.+++..+.|.|++.+ .....++..
T Consensus 271 LP-gs~arpys~rVD~~grVW~sea~agai~rfdpe-ta~ftv~p~ 314 (353)
T COG4257 271 LP-GSKARPYSMRVDRHGRVWLSEADAGAIGRFDPE-TARFTVLPI 314 (353)
T ss_pred CC-CCCCCcceeeeccCCcEEeeccccCceeecCcc-cceEEEecC
Confidence 22 235678889988 56777789999999999976 333344433
No 51
>KOG1520|consensus
Probab=98.40 E-value=6.4e-06 Score=83.64 Aligned_cols=116 Identities=14% Similarity=0.123 Sum_probs=84.9
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC----CCCCCeeEEEeCCCCeEEEEcCC-----------
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN----NIQWPTGITIDYPSQRLYWADPK----------- 136 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~----~~~~p~glavd~~~~~LY~~d~~----------- 136 (713)
.+|.||+++..+|.||++|..-. ++.++..|...+.+... .+...++|.||+ ++.|||+|+.
T Consensus 115 GRPLGl~f~~~ggdL~VaDAYlG--L~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~a 191 (376)
T KOG1520|consen 115 GRPLGIRFDKKGGDLYVADAYLG--LLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVFA 191 (376)
T ss_pred CCcceEEeccCCCeEEEEeccee--eEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccccccchhheEEe
Confidence 78999999999889999997644 89999998876655532 355679999997 9999999853
Q ss_pred ------CCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCC
Q psy6570 137 ------ARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 137 ------~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~ 191 (713)
++|+.++|...+..+++.... .+++..++..+++.+.+++....+|.++-..+.
T Consensus 192 ~l~g~~~GRl~~YD~~tK~~~VLld~L-~F~NGlaLS~d~sfvl~~Et~~~ri~rywi~g~ 251 (376)
T KOG1520|consen 192 ALEGDPTGRLFRYDPSTKVTKVLLDGL-YFPNGLALSPDGSFVLVAETTTARIKRYWIKGP 251 (376)
T ss_pred eecCCCccceEEecCcccchhhhhhcc-cccccccCCCCCCEEEEEeeccceeeeeEecCC
Confidence 335555555555555555442 244555555678899999988888888865443
No 52
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=98.37 E-value=2.3e-05 Score=82.09 Aligned_cols=184 Identities=19% Similarity=0.286 Sum_probs=111.4
Q ss_pred cCCceeEEEccCcc-cEE-------ecCCCCCceEEEec---cCCeEEEeecCC-----CCCCeEEEEecCCc-----eE
Q psy6570 5 SSGNVTRVKREMNL-KTV-------LSNLHDPRGVAVDW---VGKNLYWTDAGG-----RSSNNIMVSTLEGR-----KK 63 (713)
Q Consensus 5 ~~~~I~~~~~~~~~-~~~-------~~~~~~p~gla~D~---~~~~ly~td~~~-----~~~~~I~~~~~~G~-----~~ 63 (713)
..++|++++.++.. ..+ ........|||+++ .++.||++-... ....+|.++.++.. ..
T Consensus 20 ~~G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t~~~~~~~~~~~~v~r~~~~~~~~~~~~~ 99 (331)
T PF07995_consen 20 RSGRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYTNADEDGGDNDNRVVRFTLSDGDGDLSSE 99 (331)
T ss_dssp TTTEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEEEE-TSSSSEEEEEEEEEEETTSCEEEEE
T ss_pred CCceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEEcccCCCCCcceeeEEEeccCCccccccc
Confidence 37888888866654 222 22345578999998 357899877610 11257888877654 23
Q ss_pred EEEEc---C---CCCCcceEEEcCCCCcEEEEccC------------CCCeEEEEecCCCC------------cEEEEeC
Q psy6570 64 RTLLN---T---GLNEPYDIALEPLSGRMFWTELG------------IKPRISGASIDGKN------------KFNLVDN 113 (713)
Q Consensus 64 ~~l~~---~---~~~~p~~iavD~~~~~ly~td~~------------~~~~I~~~~~dG~~------------~~~l~~~ 113 (713)
++|+. . ....-..|+++| +|+|||+--. ..++|.|++.||+. ...++..
T Consensus 100 ~~l~~~~p~~~~~~H~g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~ilri~~dG~~p~dnP~~~~~~~~~~i~A~ 178 (331)
T PF07995_consen 100 EVLVTGLPDTSSGNHNGGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKILRIDPDGSIPADNPFVGDDGADSEIYAY 178 (331)
T ss_dssp EEEEEEEES-CSSSS-EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEEEEEETTSSB-TTSTTTTSTTSTTTEEEE
T ss_pred eEEEEEeCCCCCCCCCCccccCCC-CCcEEEEeCCCCCcccccccccccceEEEecccCcCCCCCccccCCCceEEEEEe
Confidence 33332 1 234456799997 6799998311 13689999999972 2345566
Q ss_pred CCCCCeeEEEeCCCCeEEEEcCCCC---cEEEEeCCCCc--------------e------------eEEEecCCCCccce
Q psy6570 114 NIQWPTGITIDYPSQRLYWADPKAR---TIESINLNGKD--------------R------------FVVYHTEDNGYKPY 164 (713)
Q Consensus 114 ~~~~p~glavd~~~~~LY~~d~~~~---~I~~~~~~g~~--------------~------------~~~~~~~~~~~~p~ 164 (713)
.++.|.+|++|+.+++||.+|.+.. .|.++.. |.+ . ..+.... ....|.
T Consensus 179 GlRN~~~~~~d~~tg~l~~~d~G~~~~dein~i~~-G~nYGWP~~~~~~~~~~~~~~~~~~~~~~~~P~~~~~-~~~ap~ 256 (331)
T PF07995_consen 179 GLRNPFGLAFDPNTGRLWAADNGPDGWDEINRIEP-GGNYGWPYCEGGPKYSGPPIGDAPSCPGFVPPVFAYP-PHSAPT 256 (331)
T ss_dssp --SEEEEEEEETTTTEEEEEEE-SSSSEEEEEE-T-T-B--TTTBSSSCSTTSS-ECTGSS-TTS---SEEET-TT--EE
T ss_pred CCCccccEEEECCCCcEEEEccCCCCCcEEEEecc-CCcCCCCCCcCCCCCCCCccccccCCCCcCccceeec-CccccC
Confidence 8999999999998899999995443 3444431 210 0 0011110 124567
Q ss_pred eeeee--------CCeEEEEeCCCCcEEEEcccCC
Q psy6570 165 KLEVF--------EDNLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 165 ~i~~~--------~~~ly~td~~~~~i~~~~~~~~ 191 (713)
|+.+. .+.++++++..++|+++....+
T Consensus 257 G~~~y~g~~fp~~~g~~~~~~~~~~~i~~~~~~~~ 291 (331)
T PF07995_consen 257 GIIFYRGSAFPEYRGDLFVADYGGGRIWRLDLDED 291 (331)
T ss_dssp EEEEE-SSSSGGGTTEEEEEETTTTEEEEEEEETT
T ss_pred ceEEECCccCccccCcEEEecCCCCEEEEEeeecC
Confidence 77766 5679999999999999987644
No 53
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=98.36 E-value=6.9e-05 Score=77.77 Aligned_cols=183 Identities=16% Similarity=0.204 Sum_probs=109.2
Q ss_pred cCCceeEEEccCcccEE-ecC--------CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC----------------
Q psy6570 5 SSGNVTRVKREMNLKTV-LSN--------LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE---------------- 59 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~~-~~~--------~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~---------------- 59 (713)
..+.|..+|+.+.+.+- +.. ...|..+++.+.++.||+++.. ..+.|.++|+.
T Consensus 75 ~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~--p~~~V~VvD~~~~kvv~ei~vp~~~~v 152 (352)
T TIGR02658 75 RTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFS--PSPAVGVVDLEGKAFVRMMDVPDCYHI 152 (352)
T ss_pred CCCEEEEEECccCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCC--CCCEEEEEECCCCcEEEEEeCCCCcEE
Confidence 56778888877754332 222 3445599999999999999873 13444444433
Q ss_pred --------------CceEE---------EEEcCC---------CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc
Q psy6570 60 --------------GRKKR---------TLLNTG---------LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNK 107 (713)
Q Consensus 60 --------------G~~~~---------~l~~~~---------~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~ 107 (713)
|+... ...... +.+| ++.+.++.++|.... +.|+.+++.+...
T Consensus 153 y~t~e~~~~~~~~Dg~~~~v~~d~~g~~~~~~~~vf~~~~~~v~~rP---~~~~~dg~~~~vs~e--G~V~~id~~~~~~ 227 (352)
T TIGR02658 153 FPTANDTFFMHCRDGSLAKVGYGTKGNPKIKPTEVFHPEDEYLINHP---AYSNKSGRLVWPTYT--GKIFQIDLSSGDA 227 (352)
T ss_pred EEecCCccEEEeecCceEEEEecCCCceEEeeeeeecCCccccccCC---ceEcCCCcEEEEecC--CeEEEEecCCCcc
Confidence 33322 100000 1233 222324555555444 3899998766543
Q ss_pred EEEEe-----CC----CCCCee---EEEeCCCCeEEEEc-C--------CCCcEEEEeCCCCceeEEEecCCCCccceee
Q psy6570 108 FNLVD-----NN----IQWPTG---ITIDYPSQRLYWAD-P--------KARTIESINLNGKDRFVVYHTEDNGYKPYKL 166 (713)
Q Consensus 108 ~~l~~-----~~----~~~p~g---lavd~~~~~LY~~d-~--------~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i 166 (713)
..... .. -..|.| +++++++++||++. . ..++|+.+|.....+...+.. ...|.+|
T Consensus 228 ~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~v---G~~~~~i 304 (352)
T TIGR02658 228 KFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIEL---GHEIDSI 304 (352)
T ss_pred eecceeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEeC---CCceeeE
Confidence 33221 11 225666 99999999999953 2 236899999865544433333 2345555
Q ss_pred ee--eCC-eEEEEeCCCCcEEEEcccCCCcceee
Q psy6570 167 EV--FED-NLYFSTYRTNNILKINKFGNSDFNVL 197 (713)
Q Consensus 167 ~~--~~~-~ly~td~~~~~i~~~~~~~~~~~~~~ 197 (713)
++ ++. .||.++...+.|..++...+..+..+
T Consensus 305 avS~Dgkp~lyvtn~~s~~VsViD~~t~k~i~~i 338 (352)
T TIGR02658 305 NVSQDAKPLLYALSTGDKTLYIFDAETGKELSSV 338 (352)
T ss_pred EECCCCCeEEEEeCCCCCcEEEEECcCCeEEeee
Confidence 55 567 89999988999999998766655554
No 54
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=98.35 E-value=3.9e-05 Score=78.23 Aligned_cols=166 Identities=18% Similarity=0.197 Sum_probs=106.1
Q ss_pred CceeEEEccCccc--EE-e-----cCCCCCceEEEeccC-----CeEEEeecCCCCCCeEEEEecCCce-EEEEEcCC--
Q psy6570 7 GNVTRVKREMNLK--TV-L-----SNLHDPRGVAVDWVG-----KNLYWTDAGGRSSNNIMVSTLEGRK-KRTLLNTG-- 70 (713)
Q Consensus 7 ~~I~~~~~~~~~~--~~-~-----~~~~~p~gla~D~~~-----~~ly~td~~~~~~~~I~~~~~~G~~-~~~l~~~~-- 70 (713)
.+|..+++.+... ++ + ........|+||... ..+|++|. ....|.++++.... .+++...-
T Consensus 34 pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~---~~~glIV~dl~~~~s~Rv~~~~~~~ 110 (287)
T PF03022_consen 34 PKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDS---GGPGLIVYDLATGKSWRVLHNSFSP 110 (287)
T ss_dssp -EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEET---TTCEEEEEETTTTEEEEEETCGCTT
T ss_pred cEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCC---CcCcEEEEEccCCcEEEEecCCcce
Confidence 5788888887542 11 1 122334669999754 48999999 66789999987544 44442211
Q ss_pred ----------------CCCcceEEEcC---CCCcEEEEccCCCCeEEEEecC----CC---------CcEEEEeCCCCCC
Q psy6570 71 ----------------LNEPYDIALEP---LSGRMFWTELGIKPRISGASID----GK---------NKFNLVDNNIQWP 118 (713)
Q Consensus 71 ----------------~~~p~~iavD~---~~~~ly~td~~~~~~I~~~~~d----G~---------~~~~l~~~~~~~p 118 (713)
.....||++.+ ..+.|||.-.... +++++..+ .+ ..+.+.. .....
T Consensus 111 ~p~~~~~~i~g~~~~~~dg~~gial~~~~~d~r~LYf~~lss~-~ly~v~T~~L~~~~~~~~~~~~~~v~~lG~-k~~~s 188 (287)
T PF03022_consen 111 DPDAGPFTIGGESFQWPDGIFGIALSPISPDGRWLYFHPLSSR-KLYRVPTSVLRDPSLSDAQALASQVQDLGD-KGSQS 188 (287)
T ss_dssp S-SSEEEEETTEEEEETTSEEEEEE-TTSTTS-EEEEEETT-S-EEEEEEHHHHCSTT--HHH-HHHT-EEEEE----SE
T ss_pred eccccceeccCceEecCCCccccccCCCCCCccEEEEEeCCCC-cEEEEEHHHhhCccccccccccccceeccc-cCCCC
Confidence 11244678866 3457999876655 78887543 11 1222322 23456
Q ss_pred eeEEEeCCCCeEEEEcCCCCcEEEEeCCC----CceeEEEecCCCCccceeeeeeC---CeEEEEeC
Q psy6570 119 TGITIDYPSQRLYWADPKARTIESINLNG----KDRFVVYHTEDNGYKPYKLEVFE---DNLYFSTY 178 (713)
Q Consensus 119 ~glavd~~~~~LY~~d~~~~~I~~~~~~g----~~~~~~~~~~~~~~~p~~i~~~~---~~ly~td~ 178 (713)
.|+++|+ ++.||+++...+.|.+.+.++ .+..+++.....+..|.++.++. ++||+...
T Consensus 189 ~g~~~D~-~G~ly~~~~~~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~sn 254 (287)
T PF03022_consen 189 DGMAIDP-NGNLYFTDVEQNAIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSN 254 (287)
T ss_dssp CEEEEET-TTEEEEEECCCTEEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-
T ss_pred ceEEECC-CCcEEEecCCCCeEEEEeCCCCcCccchheeEEcCceeeccceeeeccccCceEEEEEC
Confidence 8999995 899999999999999999998 45566666655588999999988 88998763
No 55
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=98.32 E-value=0.00013 Score=75.71 Aligned_cols=202 Identities=14% Similarity=0.189 Sum_probs=120.4
Q ss_pred CceeEEEccCcccE-EecCCCCCceEEEeccCCeEEEeecC------CCCCCeEEEEecCCceEE-EEEc-C-----CCC
Q psy6570 7 GNVTRVKREMNLKT-VLSNLHDPRGVAVDWVGKNLYWTDAG------GRSSNNIMVSTLEGRKKR-TLLN-T-----GLN 72 (713)
Q Consensus 7 ~~I~~~~~~~~~~~-~~~~~~~p~gla~D~~~~~ly~td~~------~~~~~~I~~~~~~G~~~~-~l~~-~-----~~~ 72 (713)
++|+.+|.++.+.+ .+..-..|+++ +.+.++.||++... .+..+.|.++|....... .|.. . ...
T Consensus 27 ~~v~ViD~~~~~v~g~i~~G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~ 105 (352)
T TIGR02658 27 TQVYTIDGEAGRVLGMTDGGFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEGPRFLVGT 105 (352)
T ss_pred ceEEEEECCCCEEEEEEEccCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCCchhhccC
Confidence 88999998875422 25667889997 99999999999871 123578999998764433 2221 1 134
Q ss_pred CcceEEEcCCCCcEEEEccCCCCeEEEEecC------------------------------CCCcE---------EEEeC
Q psy6570 73 EPYDIALEPLSGRMFWTELGIKPRISGASID------------------------------GKNKF---------NLVDN 113 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~~~~~~I~~~~~d------------------------------G~~~~---------~l~~~ 113 (713)
.|..+++.+.+++||++++.....|.++++. |+..+ .....
T Consensus 106 ~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp~~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~~~~~~~ 185 (352)
T TIGR02658 106 YPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVPDCYHIFPTANDTFFMHCRDGSLAKVGYGTKGNPKIKPT 185 (352)
T ss_pred ccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEEEEeCCCCcEEEEecCCccEEEeecCceEEEEecCCCceEEeee
Confidence 5669999998899999886522244444433 32222 11000
Q ss_pred C---------CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEe-----cCC-------CCccceeeeeeCCe
Q psy6570 114 N---------IQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYH-----TED-------NGYKPYKLEVFEDN 172 (713)
Q Consensus 114 ~---------~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~-----~~~-------~~~~p~~i~~~~~~ 172 (713)
. +.+| ++.+..++++|.... +.|+.+++.+........ ... ....|+++..++++
T Consensus 186 ~vf~~~~~~v~~rP---~~~~~dg~~~~vs~e-G~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~ 261 (352)
T TIGR02658 186 EVFHPEDEYLINHP---AYSNKSGRLVWPTYT-GKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDR 261 (352)
T ss_pred eeecCCccccccCC---ceEcCCCcEEEEecC-CeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCE
Confidence 1 1233 223324455555444 899999976653322211 110 11223445555789
Q ss_pred EEEEeC---------CCCcEEEEcccCCCcceeeeccccccccEEEEeecc
Q psy6570 173 LYFSTY---------RTNNILKINKFGNSDFNVLANNLNRASDVLILQENK 214 (713)
Q Consensus 173 ly~td~---------~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~ 214 (713)
||++.. ..+.|+.++.........+..+ ..+.+|++....+
T Consensus 262 lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~vG-~~~~~iavS~Dgk 311 (352)
T TIGR02658 262 IYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIELG-HEIDSINVSQDAK 311 (352)
T ss_pred EEEEecCCccccccCCCCEEEEEECCCCeEEEEEeCC-CceeeEEECCCCC
Confidence 999542 2368999998777666665554 5666777666555
No 56
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=98.31 E-value=2.5e-06 Score=69.10 Aligned_cols=74 Identities=26% Similarity=0.366 Sum_probs=58.5
Q ss_pred ceEEEeccCCeEEEeecCCC--------------CCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCC
Q psy6570 29 RGVAVDWVGKNLYWTDAGGR--------------SSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIK 94 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~--------------~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~ 94 (713)
.+|+|+..++.|||||++.+ ..++++++++..+..++++ .+|..|+||++.+....|++++....
T Consensus 1 ndldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~-~~L~fpNGVals~d~~~vlv~Et~~~ 79 (89)
T PF03088_consen 1 NDLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLL-DGLYFPNGVALSPDESFVLVAETGRY 79 (89)
T ss_dssp -EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEE-EEESSEEEEEE-TTSSEEEEEEGGGT
T ss_pred CceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEeh-hCCCccCeEEEcCCCCEEEEEeccCc
Confidence 37899988899999998642 3689999999998887777 56899999999999889999999888
Q ss_pred CeEEEEecCC
Q psy6570 95 PRISGASIDG 104 (713)
Q Consensus 95 ~~I~~~~~dG 104 (713)
||.|.-+.|
T Consensus 80 -Ri~rywl~G 88 (89)
T PF03088_consen 80 -RILRYWLKG 88 (89)
T ss_dssp -EEEEEESSS
T ss_pred -eEEEEEEeC
Confidence 999998887
No 57
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=98.27 E-value=0.00019 Score=71.92 Aligned_cols=183 Identities=13% Similarity=0.109 Sum_probs=121.9
Q ss_pred CCceeEEEccCccc-----EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCc-eEEEEEc-C--CCCCcce
Q psy6570 6 SGNVTRVKREMNLK-----TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR-KKRTLLN-T--GLNEPYD 76 (713)
Q Consensus 6 ~~~I~~~~~~~~~~-----~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~-~~~~l~~-~--~~~~p~~ 76 (713)
+.-|+++++++... .++..+.+|.=|++++..++||...... ..+.|..+..|.. .+..++. . ....|.-
T Consensus 15 s~gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~~-~~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~y 93 (346)
T COG2706 15 SQGIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPG-EEGGVAAYRIDPDDGRLTFLNRQTLPGSPPCY 93 (346)
T ss_pred CCceEEEEEeCcccccchhhhccccCCCceEEECCCCCEEEEEEecC-CcCcEEEEEEcCCCCeEEEeeccccCCCCCeE
Confidence 67788888885432 2267889999999999999999988621 2567777776653 2223332 1 2355689
Q ss_pred EEEcCCCCcEEEEccCCCCeEEEEec--CCCCcEEEE---eCC--------CCCCeeEEEeCCCCeEEEEcCCCCcEEEE
Q psy6570 77 IALEPLSGRMFWTELGIKPRISGASI--DGKNKFNLV---DNN--------IQWPTGITIDYPSQRLYWADPKARTIESI 143 (713)
Q Consensus 77 iavD~~~~~ly~td~~~~~~I~~~~~--dG~~~~~l~---~~~--------~~~p~glavd~~~~~LY~~d~~~~~I~~~ 143 (713)
|++|+.+++||.+..... .|.+..+ ||.....+- ... ...+...-++|+.+.|++.|.+..+|..+
T Consensus 94 vsvd~~g~~vf~AnY~~g-~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~Dri~~y 172 (346)
T COG2706 94 VSVDEDGRFVFVANYHSG-SVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGTDRIFLY 172 (346)
T ss_pred EEECCCCCEEEEEEccCc-eEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCCceEEEE
Confidence 999998889999987765 6666544 565443321 111 12256788899999999999999999999
Q ss_pred eCCCCceeEEEec-CCCCccceeeeee--CCeEEEEeCCCCcEEEEcccC
Q psy6570 144 NLNGKDRFVVYHT-EDNGYKPYKLEVF--EDNLYFSTYRTNNILKINKFG 190 (713)
Q Consensus 144 ~~~g~~~~~~~~~-~~~~~~p~~i~~~--~~~ly~td~~~~~i~~~~~~~ 190 (713)
+++-...+..... ......|.=|.+. +...|++...+++|..+....
T Consensus 173 ~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~ 222 (346)
T COG2706 173 DLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNP 222 (346)
T ss_pred EcccCccccccccccCCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcC
Confidence 9873322221111 1123456666765 567899888888877665443
No 58
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.24 E-value=5.7e-05 Score=80.09 Aligned_cols=187 Identities=13% Similarity=0.098 Sum_probs=111.6
Q ss_pred CcccCCceeEEEccCcccE-EecCCCCC-ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEE
Q psy6570 2 ASISSGNVTRVKREMNLKT-VLSNLHDP-RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIAL 79 (713)
Q Consensus 2 ad~~~~~I~~~~~~~~~~~-~~~~~~~p-~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iav 79 (713)
++..+++|..++..+.+.+ .+.....+ .++++.+.++.+|+++. .+.|.++|+.......-+..+ ..|.+|++
T Consensus 11 ~~~~~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~r----dg~vsviD~~~~~~v~~i~~G-~~~~~i~~ 85 (369)
T PF02239_consen 11 VERGSGSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANR----DGTVSVIDLATGKVVATIKVG-GNPRGIAV 85 (369)
T ss_dssp EEGGGTEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEET----TSEEEEEETTSSSEEEEEE-S-SEEEEEEE
T ss_pred EecCCCEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcC----CCeEEEEECCcccEEEEEecC-CCcceEEE
Confidence 4566888999998875422 23334444 55778888889999864 578999999765544334444 67999999
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCC-------CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeE
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNN-------IQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFV 152 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~-------~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~ 152 (713)
.+..++||+++...+ .+..++...-.....+... -.++.+|.-.+.+...+++-...++|+.+++.......
T Consensus 86 s~DG~~~~v~n~~~~-~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~d~~~~~ 164 (369)
T PF02239_consen 86 SPDGKYVYVANYEPG-TVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVDYSDPKNLK 164 (369)
T ss_dssp --TTTEEEEEEEETT-EEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEETTTSSCEE
T ss_pred cCCCCEEEEEecCCC-ceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEEeccccccc
Confidence 988889999987766 7888776543333222211 11334666665555566667788999999986643222
Q ss_pred EEe-cCCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcc
Q psy6570 153 VYH-TEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDF 194 (713)
Q Consensus 153 ~~~-~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~ 194 (713)
+.. .....++-.+++.++.++|++....+.|..++...+...
T Consensus 165 ~~~i~~g~~~~D~~~dpdgry~~va~~~sn~i~viD~~~~k~v 207 (369)
T PF02239_consen 165 VTTIKVGRFPHDGGFDPDGRYFLVAANGSNKIAVIDTKTGKLV 207 (369)
T ss_dssp EEEEE--TTEEEEEE-TTSSEEEEEEGGGTEEEEEETTTTEEE
T ss_pred eeeecccccccccccCcccceeeecccccceeEEEeeccceEE
Confidence 211 111233334444446678887777778888876544333
No 59
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.23 E-value=3e-06 Score=59.20 Aligned_cols=41 Identities=34% Similarity=0.722 Sum_probs=37.1
Q ss_pred EEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 109 NLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 109 ~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
+++...+..|+|||+|+.+++|||+|.....|++++++|..
T Consensus 2 ~~~~~~~~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~g~~ 42 (43)
T smart00135 2 TLLSEGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42 (43)
T ss_pred EEEECCCCCcCEEEEeecCCEEEEEeCCCCEEEEEeCCCCC
Confidence 45556889999999999999999999999999999999975
No 60
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.19 E-value=3.4e-06 Score=58.92 Aligned_cols=40 Identities=40% Similarity=0.851 Sum_probs=35.9
Q ss_pred EEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC
Q psy6570 66 LLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN 106 (713)
Q Consensus 66 l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~ 106 (713)
++..++..|++||+|+.+++|||+|+... .|++++++|+.
T Consensus 3 ~~~~~~~~~~~la~d~~~~~lYw~D~~~~-~I~~~~~~g~~ 42 (43)
T smart00135 3 LLSEGLGHPNGLAVDWIEGRLYWTDWGLD-VIEVANLDGTN 42 (43)
T ss_pred EEECCCCCcCEEEEeecCCEEEEEeCCCC-EEEEEeCCCCC
Confidence 44567899999999999999999999986 99999999975
No 61
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.16 E-value=0.00015 Score=70.82 Aligned_cols=176 Identities=14% Similarity=0.193 Sum_probs=114.1
Q ss_pred CceeEEEccCcc-cEE-ecCCCCCceEEEeccCCeEE-EeecCCCCCCeEEEEecCCceEEEEEc------CC----CCC
Q psy6570 7 GNVTRVKREMNL-KTV-LSNLHDPRGVAVDWVGKNLY-WTDAGGRSSNNIMVSTLEGRKKRTLLN------TG----LNE 73 (713)
Q Consensus 7 ~~I~~~~~~~~~-~~~-~~~~~~p~gla~D~~~~~ly-~td~~~~~~~~I~~~~~~G~~~~~l~~------~~----~~~ 73 (713)
..|..++++|+. +++ +.++..|++|++- ++..| ++|. ...+++.+..+-........ .. -..
T Consensus 108 ~~iVElt~~GdlirtiPL~g~~DpE~Ieyi--g~n~fvi~dE---R~~~l~~~~vd~~t~~~~~~~~~i~L~~~~k~N~G 182 (316)
T COG3204 108 AAIVELTKEGDLIRTIPLTGFSDPETIEYI--GGNQFVIVDE---RDRALYLFTVDADTTVISAKVQKIPLGTTNKKNKG 182 (316)
T ss_pred ceEEEEecCCceEEEecccccCChhHeEEe--cCCEEEEEeh---hcceEEEEEEcCCccEEeccceEEeccccCCCCcC
Confidence 356677777764 344 7789999999985 44444 4666 56777776655432111111 10 123
Q ss_pred cceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC----C----CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 74 PYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN----N----IQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 74 p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~----~----~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
-.|||.|+.+++||++...+.-+|+....+-+....-+.. . +..-.||.+|+.++.|++-...++++..++.
T Consensus 183 fEGlA~d~~~~~l~~aKEr~P~~I~~~~~~~~~l~~~~~~~~~~~~~~f~~DvSgl~~~~~~~~LLVLS~ESr~l~Evd~ 262 (316)
T COG3204 183 FEGLAWDPVDHRLFVAKERNPIGIFEVTQSPSSLSVHASLDPTADRDLFVLDVSGLEFNAITNSLLVLSDESRRLLEVDL 262 (316)
T ss_pred ceeeecCCCCceEEEEEccCCcEEEEEecCCcccccccccCcccccceEeeccccceecCCCCcEEEEecCCceEEEEec
Confidence 4589999999999999866554677766433221111110 0 2345789999999999998888999999999
Q ss_pred CCCceeEEEec------CCCCccceeeeee-CCeEEEEeCCCCcEEEEcc
Q psy6570 146 NGKDRFVVYHT------EDNGYKPYKLEVF-EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 146 ~g~~~~~~~~~------~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~ 188 (713)
+|..+..+... ....+++.||+.+ ++.||++. ..+..+++.+
T Consensus 263 ~G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g~lYIvS-EPnlfy~F~~ 311 (316)
T COG3204 263 SGEVIELLSLTKGNHGLSSDIPQAEGIAMDDDGNLYIVS-EPNLFYRFTP 311 (316)
T ss_pred CCCeeeeEEeccCCCCCcccCCCcceeEECCCCCEEEEe-cCCcceeccc
Confidence 99865544321 1146778999997 66888875 4455666654
No 62
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=98.06 E-value=0.00025 Score=72.33 Aligned_cols=179 Identities=16% Similarity=0.199 Sum_probs=109.7
Q ss_pred CceEEEeccCCeEEEeecCCC---------CCCeEEEEecCCceEE--EEEcCC----CCCcceEEEcCCC-----CcEE
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGR---------SSNNIMVSTLEGRKKR--TLLNTG----LNEPYDIALEPLS-----GRMF 87 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~---------~~~~I~~~~~~G~~~~--~l~~~~----~~~p~~iavD~~~-----~~ly 87 (713)
+.++.+|. .++||+.|.+.. ...+|..+++...... ..+... ....++|+||... +++|
T Consensus 3 V~~v~iD~-~~rLWVlD~G~~~~~~~~~~~~~pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aY 81 (287)
T PF03022_consen 3 VQRVQIDE-CGRLWVLDSGRPNGLQPPKQVCPPKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAY 81 (287)
T ss_dssp EEEEEE-T-TSEEEEEE-CCHSSSSTTGHTS--EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEE
T ss_pred ccEEEEcC-CCCEEEEeCCCcCCCCCCCCCCCcEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEE
Confidence 46789995 689999998641 1258999999875532 223222 2345679999855 5899
Q ss_pred EEccCCCCeEEEEecCCCCcEEEEeCC----------------C---CCCeeEEEeC---CCCeEEEEcCCCCcEEEEeC
Q psy6570 88 WTELGIKPRISGASIDGKNKFNLVDNN----------------I---QWPTGITIDY---PSQRLYWADPKARTIESINL 145 (713)
Q Consensus 88 ~td~~~~~~I~~~~~dG~~~~~l~~~~----------------~---~~p~glavd~---~~~~LY~~d~~~~~I~~~~~ 145 (713)
+||.+.. .|.++++.......+.... + ....||++.+ .+++|||.-....+++++..
T Consensus 82 ItD~~~~-glIV~dl~~~~s~Rv~~~~~~~~p~~~~~~i~g~~~~~~dg~~gial~~~~~d~r~LYf~~lss~~ly~v~T 160 (287)
T PF03022_consen 82 ITDSGGP-GLIVYDLATGKSWRVLHNSFSPDPDAGPFTIGGESFQWPDGIFGIALSPISPDGRWLYFHPLSSRKLYRVPT 160 (287)
T ss_dssp EEETTTC-EEEEEETTTTEEEEEETCGCTTS-SSEEEEETTEEEEETTSEEEEEE-TTSTTS-EEEEEETT-SEEEEEEH
T ss_pred EeCCCcC-cEEEEEccCCcEEEEecCCcceeccccceeccCceEecCCCccccccCCCCCCccEEEEEeCCCCcEEEEEH
Confidence 9998876 7888887654333332210 1 1245788876 44689999988889999883
Q ss_pred ----CCC---------ceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCC---Ccceeeecc---ccccc
Q psy6570 146 ----NGK---------DRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGN---SDFNVLANN---LNRAS 205 (713)
Q Consensus 146 ----~g~---------~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~---~~~~~~~~~---~~~~~ 205 (713)
+.+ ..+.+... .....|++++ .+.||+++...+.|.+.+..+. ....+++.. +..|.
T Consensus 161 ~~L~~~~~~~~~~~~~~v~~lG~k---~~~s~g~~~D~~G~ly~~~~~~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd 237 (287)
T PF03022_consen 161 SVLRDPSLSDAQALASQVQDLGDK---GSQSDGMAIDPNGNLYFTDVEQNAIGCWDPDGPYTPENFEILAQDPRTLQWPD 237 (287)
T ss_dssp HHHCSTT--HHH-HHHT-EEEEE------SECEEEEETTTEEEEEECCCTEEEEEETTTSB-GCCEEEEEE-CC-GSSEE
T ss_pred HHhhCccccccccccccceecccc---CCCCceEEECCCCcEEEecCCCCeEEEEeCCCCcCccchheeEEcCceeeccc
Confidence 221 12233222 2244666766 7799999999999999998752 233344332 56677
Q ss_pred cEEEEe
Q psy6570 206 DVLILQ 211 (713)
Q Consensus 206 ~i~v~~ 211 (713)
++++.+
T Consensus 238 ~~~i~~ 243 (287)
T PF03022_consen 238 GLKIDP 243 (287)
T ss_dssp EEEE-T
T ss_pred eeeecc
Confidence 787777
No 63
>KOG4499|consensus
Probab=98.05 E-value=0.00013 Score=68.53 Aligned_cols=124 Identities=10% Similarity=0.132 Sum_probs=86.1
Q ss_pred CCceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEe--cCC---ceEEEEEcC------CCCCc
Q psy6570 6 SGNVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVST--LEG---RKKRTLLNT------GLNEP 74 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~--~~G---~~~~~l~~~------~~~~p 74 (713)
.+.+++..++++.+.+...+.-|.||++|.....+|++|+ .+..|..++ ..+ ..+++++.- ....|
T Consensus 138 ~g~Ly~~~~~h~v~~i~~~v~IsNgl~Wd~d~K~fY~iDs---ln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~P 214 (310)
T KOG4499|consen 138 GGELYSWLAGHQVELIWNCVGISNGLAWDSDAKKFYYIDS---LNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEP 214 (310)
T ss_pred ccEEEEeccCCCceeeehhccCCccccccccCcEEEEEcc---CceEEeeeecCCCcccccCcceeEEeccCCCcCCCCC
Confidence 4566777777788888999999999999999999999999 778886555 333 224444431 23568
Q ss_pred ceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCC-CCeEEEEc
Q psy6570 75 YDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYP-SQRLYWAD 134 (713)
Q Consensus 75 ~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~-~~~LY~~d 134 (713)
.||+||. .|+||++-++.. +|+++++....+..-+.....+.+..++--. -+.||++-
T Consensus 215 DGm~ID~-eG~L~Va~~ng~-~V~~~dp~tGK~L~eiklPt~qitsccFgGkn~d~~yvT~ 273 (310)
T KOG4499|consen 215 DGMTIDT-EGNLYVATFNGG-TVQKVDPTTGKILLEIKLPTPQITSCCFGGKNLDILYVTT 273 (310)
T ss_pred CcceEcc-CCcEEEEEecCc-EEEEECCCCCcEEEEEEcCCCceEEEEecCCCccEEEEEe
Confidence 9999995 899999999877 9999988755443333323334455555422 13566653
No 64
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.01 E-value=0.00047 Score=67.50 Aligned_cols=162 Identities=16% Similarity=0.144 Sum_probs=108.0
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEE-EccCCCCeEEEEec
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFW-TELGIKPRISGASI 102 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~-td~~~~~~I~~~~~ 102 (713)
......+|++++.++.||.+-. ....|..++++|...+++-..++..|.+|+.- .+..|+ +|.... +++.+.+
T Consensus 84 ~~~nvS~LTynp~~rtLFav~n---~p~~iVElt~~GdlirtiPL~g~~DpE~Ieyi--g~n~fvi~dER~~-~l~~~~v 157 (316)
T COG3204 84 ETANVSSLTYNPDTRTLFAVTN---KPAAIVELTKEGDLIRTIPLTGFSDPETIEYI--GGNQFVIVDERDR-ALYLFTV 157 (316)
T ss_pred ccccccceeeCCCcceEEEecC---CCceEEEEecCCceEEEecccccCChhHeEEe--cCCEEEEEehhcc-eEEEEEE
Confidence 3445789999999999998765 56789999999999988887889999999995 455555 454444 7887777
Q ss_pred CCCCcEEEEeC----------CCCCCeeEEEeCCCCeEEEEcCCC-CcEEEEeCCCCceeEEEecCC------CCcccee
Q psy6570 103 DGKNKFNLVDN----------NIQWPTGITIDYPSQRLYWADPKA-RTIESINLNGKDRFVVYHTED------NGYKPYK 165 (713)
Q Consensus 103 dG~~~~~l~~~----------~~~~p~glavd~~~~~LY~~d~~~-~~I~~~~~~g~~~~~~~~~~~------~~~~p~~ 165 (713)
|-......+.. .-..-.|||.|+.+++||++-..+ -+|+.++..-+...+-..... .+..-.|
T Consensus 158 d~~t~~~~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr~P~~I~~~~~~~~~l~~~~~~~~~~~~~~f~~DvSg 237 (316)
T COG3204 158 DADTTVISAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKERNPIGIFEVTQSPSSLSVHASLDPTADRDLFVLDVSG 237 (316)
T ss_pred cCCccEEeccceEEeccccCCCCcCceeeecCCCCceEEEEEccCCcEEEEEecCCcccccccccCcccccceEeecccc
Confidence 65433222211 122346999999999999996543 457766633221111111100 1223345
Q ss_pred eeee--CCeEEEEeCCCCcEEEEcccCC
Q psy6570 166 LEVF--EDNLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 166 i~~~--~~~ly~td~~~~~i~~~~~~~~ 191 (713)
+.++ .++|++-...++.+..++..+.
T Consensus 238 l~~~~~~~~LLVLS~ESr~l~Evd~~G~ 265 (316)
T COG3204 238 LEFNAITNSLLVLSDESRRLLEVDLSGE 265 (316)
T ss_pred ceecCCCCcEEEEecCCceEEEEecCCC
Confidence 5655 6788888888888888887655
No 65
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes and uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=97.95 E-value=0.00058 Score=67.31 Aligned_cols=187 Identities=13% Similarity=0.120 Sum_probs=108.0
Q ss_pred cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC-----CceEEEEEc--C-----CCCCcceEEEcCCCC--
Q psy6570 19 KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE-----GRKKRTLLN--T-----GLNEPYDIALEPLSG-- 84 (713)
Q Consensus 19 ~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~-----G~~~~~l~~--~-----~~~~p~~iavD~~~~-- 84 (713)
..+...|.+|+||++.+. +.++|+|. ..+....++.+ |....+++. . ....|.|+++....+
T Consensus 16 ~~tDp~L~N~WGia~~p~-~~~WVadn---gT~~~TlYdg~~~~~~g~~~~L~vtiP~~~~~~~~~~PTGiVfN~~~~F~ 91 (336)
T TIGR03118 16 QIVDPGLRNAWGLSYRPG-GPFWVANT---GTGTATLYVGNPDTQPLVQDPLVVVIPAPPPLAAEGTPTGQVFNGSDTFV 91 (336)
T ss_pred cccCccccccceeEecCC-CCEEEecC---CcceEEeecCCcccccCCccceEEEecCCCCCCCCCCccEEEEeCCCceE
Confidence 444678999999999874 68888998 78888888877 433222222 1 235799999985433
Q ss_pred -----------cEEEEccCCCCeEEEEecCCC---CcEEEEeC--CCCCCeeEEEeCC--CCeEEEEcCCCCcEEEEeCC
Q psy6570 85 -----------RMFWTELGIKPRISGASIDGK---NKFNLVDN--NIQWPTGITIDYP--SQRLYWADPKARTIESINLN 146 (713)
Q Consensus 85 -----------~ly~td~~~~~~I~~~~~dG~---~~~~l~~~--~~~~p~glavd~~--~~~LY~~d~~~~~I~~~~~~ 146 (713)
+||.|+.+.- .=|+-..+-+ ...+++.. ....=.||||-.. .++||-+|-.+++|.+++-.
T Consensus 92 vt~~g~~~~a~Fif~tEdGTi-saW~p~v~~t~~~~~~~~~d~s~~gavYkGLAi~~~~~~~~LYaadF~~g~IDVFd~~ 170 (336)
T TIGR03118 92 VSGEGITGPSRFLFVTEDGTL-SGWAPALGTTRMTRAEIVVDASQQGNVYKGLAVGPTGGGDYLYAANFRQGRIDVFKGS 170 (336)
T ss_pred EcCCCcccceeEEEEeCCceE-EeecCcCCcccccccEEEEccCCCcceeeeeEEeecCCCceEEEeccCCCceEEecCc
Confidence 2555554422 1122222222 11223321 1223368888744 67999999999999998643
Q ss_pred CCceeE---EEecC-CCCccceeeeeeCCeEEEEeCC-------------CCcEEEEcccCCCcceeeec-cccccccEE
Q psy6570 147 GKDRFV---VYHTE-DNGYKPYKLEVFEDNLYFSTYR-------------TNNILKINKFGNSDFNVLAN-NLNRASDVL 208 (713)
Q Consensus 147 g~~~~~---~~~~~-~~~~~p~~i~~~~~~ly~td~~-------------~~~i~~~~~~~~~~~~~~~~-~~~~~~~i~ 208 (713)
-..+.+ +.... -....|++|...+++||++=.. .+.|-+++..+.-.+.+... .+..|-+|+
T Consensus 171 f~~~~~~g~F~DP~iPagyAPFnIqnig~~lyVtYA~qd~~~~d~v~G~G~G~VdvFd~~G~l~~r~as~g~LNaPWG~a 250 (336)
T TIGR03118 171 FRPPPLPGSFIDPALPAGYAPFNVQNLGGTLYVTYAQQDADRNDEVAGAGLGYVNVFTLNGQLLRRVASSGRLNAPWGLA 250 (336)
T ss_pred cccccCCCCccCCCCCCCCCCcceEEECCeEEEEEEecCCcccccccCCCcceEEEEcCCCcEEEEeccCCcccCCceee
Confidence 322211 00110 0234678899899999998321 23455666554433333222 366666666
Q ss_pred EE
Q psy6570 209 IL 210 (713)
Q Consensus 209 v~ 210 (713)
+-
T Consensus 251 ~A 252 (336)
T TIGR03118 251 IA 252 (336)
T ss_pred eC
Confidence 53
No 66
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.94 E-value=0.0011 Score=72.74 Aligned_cols=179 Identities=12% Similarity=0.068 Sum_probs=111.0
Q ss_pred CceeEEEccCcccEEecCCC-CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLKTVLSNLH-DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~-~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|+.+++.+.....+..+. ....+++.+.++.|+++.... ....|+++++++...+.+... .......++.|..+.
T Consensus 242 ~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~~-g~~~Iy~~dl~tg~~~~lt~~-~~~~~~p~wSpDG~~ 319 (448)
T PRK04792 242 AEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSKD-GQPEIYVVDIATKALTRITRH-RAIDTEPSWHPDGKS 319 (448)
T ss_pred cEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeCC-CCeEEEEEECCCCCeEECccC-CCCccceEECCCCCE
Confidence 45778887765433333332 234678888888888764311 345799999988776655533 234556778887777
Q ss_pred EEEEc-cCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCC--CCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWTE-LGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPK--ARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~td-~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~--~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|+++. ......|+++++++...+.+.. ......+.++++++++||++... ..+|+++++++...+.+..... ...
T Consensus 320 I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~-~g~~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~~~lt~~~~-d~~ 397 (448)
T PRK04792 320 LIFTSERGGKPQIYRVNLASGKVSRLTF-EGEQNLGGSITPDGRSMIMVNRTNGKFNIARQDLETGAMQVLTSTRL-DES 397 (448)
T ss_pred EEEEECCCCCceEEEEECCCCCEEEEec-CCCCCcCeeECCCCCEEEEEEecCCceEEEEEECCCCCeEEccCCCC-CCC
Confidence 87764 3334589999998766555532 22234457899889999887643 3478999998876665543211 122
Q ss_pred ceeeeeeCCeEEEEeCCCC--cEEEEcccC
Q psy6570 163 PYKLEVFEDNLYFSTYRTN--NILKINKFG 190 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~~--~i~~~~~~~ 190 (713)
| .++.++..|+++....+ .++.++..+
T Consensus 398 p-s~spdG~~I~~~~~~~g~~~l~~~~~~G 426 (448)
T PRK04792 398 P-SVAPNGTMVIYSTTYQGKQVLAAVSIDG 426 (448)
T ss_pred c-eECCCCCEEEEEEecCCceEEEEEECCC
Confidence 3 45556777777654333 355565543
No 67
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.89 E-value=0.0017 Score=70.20 Aligned_cols=182 Identities=10% Similarity=-0.003 Sum_probs=112.9
Q ss_pred CCceeEEEccCcccEEecCCCC-CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCC
Q psy6570 6 SGNVTRVKREMNLKTVLSNLHD-PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSG 84 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~~-p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~ 84 (713)
...|+.+++.+.+...+..... ....++.+.+++|+++.... ....|+++++++...+.|..... .-....+.|..+
T Consensus 212 ~~~Iyv~dl~tg~~~~lt~~~g~~~~~~~SPDG~~la~~~~~~-g~~~Iy~~dl~~g~~~~LT~~~~-~d~~p~~SPDG~ 289 (419)
T PRK04043 212 KPTLYKYNLYTGKKEKIASSQGMLVVSDVSKDGSKLLLTMAPK-GQPDIYLYDTNTKTLTQITNYPG-IDVNGNFVEDDK 289 (419)
T ss_pred CCEEEEEECCCCcEEEEecCCCcEEeeEECCCCCEEEEEEccC-CCcEEEEEECCCCcEEEcccCCC-ccCccEECCCCC
Confidence 4568888887655433333332 22356777777887765421 35689999998877666654321 122346777777
Q ss_pred cEEEEc-cCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCC--------CcEEEEeCCCCceeEEEe
Q psy6570 85 RMFWTE-LGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKA--------RTIESINLNGKDRFVVYH 155 (713)
Q Consensus 85 ~ly~td-~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~--------~~I~~~~~~g~~~~~~~~ 155 (713)
.|||+. ....+.|+++++++...+.+...... ..+++|++++|.++.... ..|+.+++++...+.|..
T Consensus 290 ~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~~---~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~LT~ 366 (419)
T PRK04043 290 RIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGKN---NSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRLTA 366 (419)
T ss_pred EEEEEECCCCCceEEEEECCCCCeEeCccCCCc---CceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEECCC
Confidence 787765 33446899999998777666543222 248898899887775332 479999998887666654
Q ss_pred cCCCCccceeeeeeCCeEEEEeCC--CCcEEEEcccCCCcc
Q psy6570 156 TEDNGYKPYKLEVFEDNLYFSTYR--TNNILKINKFGNSDF 194 (713)
Q Consensus 156 ~~~~~~~p~~i~~~~~~ly~td~~--~~~i~~~~~~~~~~~ 194 (713)
.. ......++.++..|+++... ...+..++..+....
T Consensus 367 ~~--~~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l~g~~~~ 405 (419)
T PRK04043 367 NG--VNQFPRFSSDGGSIMFIKYLGNQSALGIIRLNYNKSF 405 (419)
T ss_pred CC--CcCCeEECCCCCEEEEEEccCCcEEEEEEecCCCeeE
Confidence 32 22223355667777776533 233667776654443
No 68
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.85 E-value=0.0019 Score=70.91 Aligned_cols=180 Identities=10% Similarity=0.004 Sum_probs=110.7
Q ss_pred ceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcE
Q psy6570 8 NVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRM 86 (713)
Q Consensus 8 ~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~l 86 (713)
+|+.++.++.. +++...-......++.+.+++|+|+.... ....|++.++++...+.+.... ......++.|....|
T Consensus 199 ~l~i~d~dG~~~~~l~~~~~~~~~p~wSPDG~~La~~s~~~-g~~~L~~~dl~tg~~~~lt~~~-g~~~~~~wSPDG~~L 276 (448)
T PRK04792 199 QLMIADYDGYNEQMLLRSPEPLMSPAWSPDGRKLAYVSFEN-RKAEIFVQDIYTQVREKVTSFP-GINGAPRFSPDGKKL 276 (448)
T ss_pred EEEEEeCCCCCceEeecCCCcccCceECCCCCEEEEEEecC-CCcEEEEEECCCCCeEEecCCC-CCcCCeeECCCCCEE
Confidence 45556676643 44444333345678888888888875421 3457999999876665554322 223467888888878
Q ss_pred EEEc-cCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEecCCCCccc
Q psy6570 87 FWTE-LGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHTEDNGYKP 163 (713)
Q Consensus 87 y~td-~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~~~~~~~p 163 (713)
+++. ......|+++++++...+.+.. ........++++++++|+++.. ....|+++++++...+.+..... ....
T Consensus 277 a~~~~~~g~~~Iy~~dl~tg~~~~lt~-~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g~-~~~~ 354 (448)
T PRK04792 277 ALVLSKDGQPEIYVVDIATKALTRITR-HRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFEGE-QNLG 354 (448)
T ss_pred EEEEeCCCCeEEEEEECCCCCeEECcc-CCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecCCC-CCcC
Confidence 7753 3333479999998876665543 2234466788888888877643 34579999998765554432211 1112
Q ss_pred eeeeeeCCeEEEEeCCC--CcEEEEcccCC
Q psy6570 164 YKLEVFEDNLYFSTYRT--NNILKINKFGN 191 (713)
Q Consensus 164 ~~i~~~~~~ly~td~~~--~~i~~~~~~~~ 191 (713)
..+..++++||.+.... ..|+.++..++
T Consensus 355 ~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g 384 (448)
T PRK04792 355 GSITPDGRSMIMVNRTNGKFNIARQDLETG 384 (448)
T ss_pred eeECCCCCEEEEEEecCCceEEEEEECCCC
Confidence 24455677887775433 35666665544
No 69
>PRK05137 tolB translocation protein TolB; Provisional
Probab=97.82 E-value=0.0023 Score=70.09 Aligned_cols=182 Identities=9% Similarity=0.069 Sum_probs=112.1
Q ss_pred CCceeEEEccCcccEEecCCC-CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCC
Q psy6570 6 SGNVTRVKREMNLKTVLSNLH-DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSG 84 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~-~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~ 84 (713)
...|+.+++.+.....+.... .....++.+.+++|+++-... ....|+++++++...+.+.... ......++.|..+
T Consensus 225 ~~~i~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~-g~~~Iy~~d~~~~~~~~Lt~~~-~~~~~~~~spDG~ 302 (435)
T PRK05137 225 RPRVYLLDLETGQRELVGNFPGMTFAPRFSPDGRKVVMSLSQG-GNTDIYTMDLRSGTTTRLTDSP-AIDTSPSYSPDGS 302 (435)
T ss_pred CCEEEEEECCCCcEEEeecCCCcccCcEECCCCCEEEEEEecC-CCceEEEEECCCCceEEccCCC-CccCceeEcCCCC
Confidence 356777787765433333333 335677888888887664311 3457999999887766655332 3344577887777
Q ss_pred cEEEE-ccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCC--CCcEEEEeCCCCceeEEEecCCCCc
Q psy6570 85 RMFWT-ELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPK--ARTIESINLNGKDRFVVYHTEDNGY 161 (713)
Q Consensus 85 ~ly~t-d~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~--~~~I~~~~~~g~~~~~~~~~~~~~~ 161 (713)
+|+++ +....+.|++++++|...+.+.... ..-..+++.+++++|+++... ..+|+.+++++...+.+.... ..
T Consensus 303 ~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~-~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~~~lt~~~--~~ 379 (435)
T PRK05137 303 QIVFESDRSGSPQLYVMNADGSNPRRISFGG-GRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGSGERILTSGF--LV 379 (435)
T ss_pred EEEEEECCCCCCeEEEEECCCCCeEEeecCC-CcccCeEECCCCCEEEEEEcCCCceEEEEEECCCCceEeccCCC--CC
Confidence 67665 4444458999999987776665422 223457888889998887543 347999999887665554321 11
Q ss_pred cceeeeeeCCeEEEEeCCC-----CcEEEEcccCCC
Q psy6570 162 KPYKLEVFEDNLYFSTYRT-----NNILKINKFGNS 192 (713)
Q Consensus 162 ~p~~i~~~~~~ly~td~~~-----~~i~~~~~~~~~ 192 (713)
....++.++..||++.... ..++.++..++.
T Consensus 380 ~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~ 415 (435)
T PRK05137 380 EGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRN 415 (435)
T ss_pred CCCeECCCCCEEEEEEccCCCCCcceEEEEECCCCc
Confidence 2223444566777654322 357777765543
No 70
>PRK04922 tolB translocation protein TolB; Provisional
Probab=97.82 E-value=0.0024 Score=70.00 Aligned_cols=180 Identities=12% Similarity=0.132 Sum_probs=110.1
Q ss_pred CceeEEEccCcccEEecCCC-CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLKTVLSNLH-DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~-~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|+++++.+.....+.... ....+++.+.+++|+++-... ....|++.++++...+.+... ......+++.|..+.
T Consensus 228 ~~l~~~dl~~g~~~~l~~~~g~~~~~~~SpDG~~l~~~~s~~-g~~~Iy~~d~~~g~~~~lt~~-~~~~~~~~~spDG~~ 305 (433)
T PRK04922 228 SAIYVQDLATGQRELVASFRGINGAPSFSPDGRRLALTLSRD-GNPEIYVMDLGSRQLTRLTNH-FGIDTEPTWAPDGKS 305 (433)
T ss_pred cEEEEEECCCCCEEEeccCCCCccCceECCCCCEEEEEEeCC-CCceEEEEECCCCCeEECccC-CCCccceEECCCCCE
Confidence 45788887765433333322 234678888888887764311 345799999988766655432 223456788887777
Q ss_pred EEEE-ccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCC--CCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWT-ELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPK--ARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~t-d~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~--~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|+++ +....+.|+++++++...+.+.. .......+++++++++|+++... ..+|+.+++++...+.+.... ....
T Consensus 306 l~f~sd~~g~~~iy~~dl~~g~~~~lt~-~g~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~-~~~~ 383 (433)
T PRK04922 306 IYFTSDRGGRPQIYRVAASGGSAERLTF-QGNYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGS-LDES 383 (433)
T ss_pred EEEEECCCCCceEEEEECCCCCeEEeec-CCCCccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCC-CCCC
Confidence 7665 43334579999988766555543 22334568999999999887543 336999998776655443321 1112
Q ss_pred ceeeeeeCCeEEEEeCC--CCcEEEEcccCC
Q psy6570 163 PYKLEVFEDNLYFSTYR--TNNILKINKFGN 191 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~--~~~i~~~~~~~~ 191 (713)
| .++.++..|+++... ...|+.++..++
T Consensus 384 p-~~spdG~~i~~~s~~~g~~~L~~~~~~g~ 413 (433)
T PRK04922 384 P-SFAPNGSMVLYATREGGRGVLAAVSTDGR 413 (433)
T ss_pred c-eECCCCCEEEEEEecCCceEEEEEECCCC
Confidence 2 344456667666432 345667766543
No 71
>PRK04922 tolB translocation protein TolB; Provisional
Probab=97.81 E-value=0.0021 Score=70.45 Aligned_cols=180 Identities=12% Similarity=0.040 Sum_probs=110.0
Q ss_pred ceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcE
Q psy6570 8 NVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRM 86 (713)
Q Consensus 8 ~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~l 86 (713)
+|..++.++.. +.+...-.....+++.+.++.|+++.... ....|++.++++...+.+.... .....+++.|..++|
T Consensus 185 ~l~i~D~~g~~~~~lt~~~~~v~~p~wSpDg~~la~~s~~~-~~~~l~~~dl~~g~~~~l~~~~-g~~~~~~~SpDG~~l 262 (433)
T PRK04922 185 ALQVADSDGYNPQTILRSAEPILSPAWSPDGKKLAYVSFER-GRSAIYVQDLATGQRELVASFR-GINGAPSFSPDGRRL 262 (433)
T ss_pred EEEEECCCCCCceEeecCCCccccccCCCCCCEEEEEecCC-CCcEEEEEECCCCCEEEeccCC-CCccCceECCCCCEE
Confidence 46666766643 33443333445678888888888875421 3457999999876665554322 233467888877788
Q ss_pred EEEc-cCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEecCCCCccc
Q psy6570 87 FWTE-LGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHTEDNGYKP 163 (713)
Q Consensus 87 y~td-~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~~~~~~~p 163 (713)
+++. ......|++.++++...+.+.. .......+++++++++|+++.. +...|+.+++++...+.+..... ....
T Consensus 263 ~~~~s~~g~~~Iy~~d~~~g~~~~lt~-~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~-~~~~ 340 (433)
T PRK04922 263 ALTLSRDGNPEIYVMDLGSRQLTRLTN-HFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQGN-YNAR 340 (433)
T ss_pred EEEEeCCCCceEEEEECCCCCeEECcc-CCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCCC-CccC
Confidence 7763 3333489999998876655543 2223456789988887777642 34569999987765554432211 1122
Q ss_pred eeeeeeCCeEEEEeCCC--CcEEEEcccCC
Q psy6570 164 YKLEVFEDNLYFSTYRT--NNILKINKFGN 191 (713)
Q Consensus 164 ~~i~~~~~~ly~td~~~--~~i~~~~~~~~ 191 (713)
..+..++++|+++.... ..|+.++..++
T Consensus 341 ~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g 370 (433)
T PRK04922 341 ASVSPDGKKIAMVHGSGGQYRIAVMDLSTG 370 (433)
T ss_pred EEECCCCCEEEEEECCCCceeEEEEECCCC
Confidence 34445677887765432 24666665444
No 72
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.74 E-value=4.4e-06 Score=75.33 Aligned_cols=130 Identities=25% Similarity=0.635 Sum_probs=78.5
Q ss_pred cEEcCCCCCeeccCCCCCC---CCCCcccCCCCCCCCCCCCCCC-----CCCCCCcEEeecC-----CCceeeCCCCCc-
Q psy6570 517 GTCIPNSKNNVCKCPSQYT---GRRCECAVGDTSCASLANKCTP-----NYCSNNGTCVLIE-----GKPSCKCLPPYS- 582 (713)
Q Consensus 517 g~C~~~~~~~~C~C~~g~~---G~~C~~~~~~~~c~~~~~~C~~-----~~C~~~~~C~~~~-----g~~~C~C~~G~~- 582 (713)
|..+..+..+.|.|.+||. -+.||.. .+|.. .+|...++|++.. ..|.|.|.+||+
T Consensus 11 G~LiQMSNHfEC~Cnegfvl~~EntCE~k----------v~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~ 80 (197)
T PF06247_consen 11 GYLIQMSNHFECKCNEGFVLKNENTCEEK----------VECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYIL 80 (197)
T ss_dssp EEEEEESSEEEEEESTTEEEEETTEEEE--------------SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEE
T ss_pred CEEEEccCceEEEcCCCcEEccccccccc----------eecCcccccCccccchhhhhcCCCcccceeEEEecccCcee
Confidence 5667777789999999996 4456533 23432 3688899999875 478999999997
Q ss_pred -CCCCCcCCCCCCC-CCCCCCCCeEecCCC--CcceeecCCCcc---cCCCCcC--CCCCCCCCCCCeecCCCCccCCCC
Q psy6570 583 -GKQCTEREDSPSC-HNYCDNAGLCSYSKQ--GKPVCTCVNGWS---GITCSER--VSCAHFCFNGGTCREQNYSLDPDL 653 (713)
Q Consensus 583 -G~~C~~~~~~~~C-~~~C~~~g~C~~~~~--g~~~C~C~~G~~---G~~C~~~--~~C~~~C~~~~~C~~~~~~~~~~~ 653 (713)
...|.. +.| ...|. .|.|+-... ....|+|.-|+. ...|... ..|...|..+..|.. .+.
T Consensus 81 ~~~vCvp----~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LKCk~nE~CK~------~~~ 149 (197)
T PF06247_consen 81 KQGVCVP----NKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLKCKENEECKL------VDG 149 (197)
T ss_dssp SSSSEEE----GGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE--------TTTEEEEE------ETT
T ss_pred eCCeEch----hhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeeecCCCcceee------eCc
Confidence 335654 456 45676 589975431 245899999997 2234432 236666777778876 445
Q ss_pred CceeeCCCCcccCC
Q psy6570 654 KPICICPRGYAGVR 667 (713)
Q Consensus 654 ~~~C~C~~Gy~G~~ 667 (713)
-|+|.|.+|+.++.
T Consensus 150 ~Y~C~~~~~~~~~~ 163 (197)
T PF06247_consen 150 YYKCVCKEGFPGDG 163 (197)
T ss_dssp EEEEEE-TT-EEET
T ss_pred EEEeecCCCCCCCC
Confidence 58999999997664
No 73
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes and uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=97.74 E-value=0.0034 Score=62.02 Aligned_cols=123 Identities=14% Similarity=0.089 Sum_probs=85.2
Q ss_pred cceEEEcCC--CCcEEEEccCCCCeEEEEecCCCCcEEEEe-----C---CCCCCeeEEEeCCCCeEEEEc---------
Q psy6570 74 PYDIALEPL--SGRMFWTELGIKPRISGASIDGKNKFNLVD-----N---NIQWPTGITIDYPSQRLYWAD--------- 134 (713)
Q Consensus 74 p~~iavD~~--~~~ly~td~~~~~~I~~~~~dG~~~~~l~~-----~---~~~~p~glavd~~~~~LY~~d--------- 134 (713)
=+||||-.. ..+||-+|..+. +|.+++- +..++.+. . .-..|.+|.-- .++|||+-
T Consensus 140 YkGLAi~~~~~~~~LYaadF~~g-~IDVFd~--~f~~~~~~g~F~DP~iPagyAPFnIqni--g~~lyVtYA~qd~~~~d 214 (336)
T TIGR03118 140 YKGLAVGPTGGGDYLYAANFRQG-RIDVFKG--SFRPPPLPGSFIDPALPAGYAPFNVQNL--GGTLYVTYAQQDADRND 214 (336)
T ss_pred eeeeEEeecCCCceEEEeccCCC-ceEEecC--ccccccCCCCccCCCCCCCCCCcceEEE--CCeEEEEEEecCCcccc
Confidence 346666533 678999999766 9998743 33322221 1 12246666554 79999982
Q ss_pred ----CCCCcEEEEeCCCCceeEEEecCCCCccceeeeee-------CCeEEEEeCCCCcEEEEcccCCCcceeeecccc
Q psy6570 135 ----PKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-------EDNLYFSTYRTNNILKINKFGNSDFNVLANNLN 202 (713)
Q Consensus 135 ----~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-------~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~ 202 (713)
.+.+.|-+++++|..++.+.+.. .+..|-||++. .+.|.+-+.+.++|-.++...+..+..|.....
T Consensus 215 ~v~G~G~G~VdvFd~~G~l~~r~as~g-~LNaPWG~a~APa~FG~~sg~lLVGNFGDG~InaFD~~sG~~~g~L~~~~G 292 (336)
T TIGR03118 215 EVAGAGLGYVNVFTLNGQLLRRVASSG-RLNAPWGLAIAPESFGSLSGALLVGNFGDGTINAYDPQSGAQLGQLLDPDN 292 (336)
T ss_pred cccCCCcceEEEEcCCCcEEEEeccCC-cccCCceeeeChhhhCCCCCCeEEeecCCceeEEecCCCCceeeeecCCCC
Confidence 23457999999999888876554 37888888772 578999999999999999876766555554433
No 74
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.73 E-value=1.8e-05 Score=50.82 Aligned_cols=30 Identities=47% Similarity=1.142 Sum_probs=22.3
Q ss_pred CCCCCCCCCcEEeecC-CCceeeCCCCCcCC
Q psy6570 555 CTPNYCSNNGTCVLIE-GKPSCKCLPPYSGK 584 (713)
Q Consensus 555 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~G~ 584 (713)
|.+++|.++|+|+... +.|+|.|++||+|+
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 3455777778888777 77888888888775
No 75
>PRK05137 tolB translocation protein TolB; Provisional
Probab=97.73 E-value=0.0042 Score=68.14 Aligned_cols=182 Identities=8% Similarity=0.025 Sum_probs=112.1
Q ss_pred CCceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCC
Q psy6570 6 SGNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSG 84 (713)
Q Consensus 6 ~~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~ 84 (713)
..+|+.++.++.. +.+.........+++.+.+++|+++.... ....|++.++++...+.+.... ......++.|..+
T Consensus 181 ~~~l~~~d~dg~~~~~lt~~~~~v~~p~wSpDG~~lay~s~~~-g~~~i~~~dl~~g~~~~l~~~~-g~~~~~~~SPDG~ 258 (435)
T PRK05137 181 IKRLAIMDQDGANVRYLTDGSSLVLTPRFSPNRQEITYMSYAN-GRPRVYLLDLETGQRELVGNFP-GMTFAPRFSPDGR 258 (435)
T ss_pred ceEEEEECCCCCCcEEEecCCCCeEeeEECCCCCEEEEEEecC-CCCEEEEEECCCCcEEEeecCC-CcccCcEECCCCC
Confidence 3467777877754 33333334456677888877777664321 3468999999877666554322 2345678888777
Q ss_pred cEEEEcc-CCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEecCCCCc
Q psy6570 85 RMFWTEL-GIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHTEDNGY 161 (713)
Q Consensus 85 ~ly~td~-~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~~~~~~ 161 (713)
.|+++.. .....|+++++++...+.|.. ........++++++.+|+++.. +...|+.+++++...+.+.......
T Consensus 259 ~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~-~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~~~~- 336 (435)
T PRK05137 259 KVVMSLSQGGNTDIYTMDLRSGTTTRLTD-SPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGGGRY- 336 (435)
T ss_pred EEEEEEecCCCceEEEEECCCCceEEccC-CCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCCCcc-
Confidence 7776643 333589999998877666643 2223456788888888776642 3457999999987666654332111
Q ss_pred cceeeeeeCCeEEEEeCCC--CcEEEEcccCC
Q psy6570 162 KPYKLEVFEDNLYFSTYRT--NNILKINKFGN 191 (713)
Q Consensus 162 ~p~~i~~~~~~ly~td~~~--~~i~~~~~~~~ 191 (713)
....+..++++|+++.... ..|+.++..++
T Consensus 337 ~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~ 368 (435)
T PRK05137 337 STPVWSPRGDLIAFTKQGGGQFSIGVMKPDGS 368 (435)
T ss_pred cCeEECCCCCEEEEEEcCCCceEEEEEECCCC
Confidence 1123445567777665332 34666665433
No 76
>PRK02889 tolB translocation protein TolB; Provisional
Probab=97.70 E-value=0.0051 Score=67.16 Aligned_cols=181 Identities=13% Similarity=0.040 Sum_probs=108.9
Q ss_pred CceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
.+|+.++.++.. +.+...-.....+++.+.+++|+++.... ....|++.++++...+.+.... ......++.|..+.
T Consensus 176 ~~L~~~D~dG~~~~~l~~~~~~v~~p~wSPDG~~la~~s~~~-~~~~I~~~dl~~g~~~~l~~~~-g~~~~~~~SPDG~~ 253 (427)
T PRK02889 176 YQLQISDADGQNAQSALSSPEPIISPAWSPDGTKLAYVSFES-KKPVVYVHDLATGRRRVVANFK-GSNSAPAWSPDGRT 253 (427)
T ss_pred cEEEEECCCCCCceEeccCCCCcccceEcCCCCEEEEEEccC-CCcEEEEEECCCCCEEEeecCC-CCccceEECCCCCE
Confidence 457777776643 33333333345678888888887765421 3457999999876665554222 33457888887778
Q ss_pred EEEE-ccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc-C-CCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWT-ELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD-P-KARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~t-d~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d-~-~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|+++ +.....+|+++++++...+.+.. ........++++++.+|+++. . +...|+.+++++...+.+..... ...
T Consensus 254 la~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~g~-~~~ 331 (427)
T PRK02889 254 LAVALSRDGNSQIYTVNADGSGLRRLTQ-SSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFTGS-YNT 331 (427)
T ss_pred EEEEEccCCCceEEEEECCCCCcEECCC-CCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecCCC-CcC
Confidence 8775 33334589999998877665543 222335578998888877653 2 45579999988765544432211 111
Q ss_pred ceeeeeeCCeEEEEeCCCC--cEEEEcccCC
Q psy6570 163 PYKLEVFEDNLYFSTYRTN--NILKINKFGN 191 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~~--~i~~~~~~~~ 191 (713)
...+..++.+|+++....+ .|+.++..++
T Consensus 332 ~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g 362 (427)
T PRK02889 332 SPRISPDGKLLAYISRVGGAFKLYVQDLATG 362 (427)
T ss_pred ceEECCCCCEEEEEEccCCcEEEEEEECCCC
Confidence 2234556777776653322 4666665443
No 77
>PRK03629 tolB translocation protein TolB; Provisional
Probab=97.69 E-value=0.0059 Score=66.65 Aligned_cols=180 Identities=11% Similarity=0.052 Sum_probs=109.2
Q ss_pred CceeEEEccCcccEEecCC-CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLKTVLSNL-HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~-~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|+.+++.+.....+..+ .....+++.+.+.+|+++.... ....|+++++++...+.+.... ......++.|..+.
T Consensus 223 ~~i~i~dl~~G~~~~l~~~~~~~~~~~~SPDG~~La~~~~~~-g~~~I~~~d~~tg~~~~lt~~~-~~~~~~~wSPDG~~ 300 (429)
T PRK03629 223 SALVIQTLANGAVRQVASFPRHNGAPAFSPDGSKLAFALSKT-GSLNLYVMDLASGQIRQVTDGR-SNNTEPTWFPDSQN 300 (429)
T ss_pred cEEEEEECCCCCeEEccCCCCCcCCeEECCCCCEEEEEEcCC-CCcEEEEEECCCCCEEEccCCC-CCcCceEECCCCCE
Confidence 4566677665432222222 2345678999888898874411 2346999999887666665432 34567888887776
Q ss_pred EEE-EccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFW-TELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~-td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|++ ++.....+|+++++++...+.+.. .......+++.+++++|+++.. ....|+.+++++...+.+.... ...
T Consensus 301 I~f~s~~~g~~~Iy~~d~~~g~~~~lt~-~~~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~~~--~~~ 377 (429)
T PRK03629 301 LAYTSDQAGRPQVYKVNINGGAPQRITW-EGSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLTDTF--LDE 377 (429)
T ss_pred EEEEeCCCCCceEEEEECCCCCeEEeec-CCCCccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEeCCCC--CCC
Confidence 755 444434589999999876665543 2233456888988888887653 3346888998877666554321 112
Q ss_pred ceeeeeeCCeEEEEeCCCC--cEEEEcccCC
Q psy6570 163 PYKLEVFEDNLYFSTYRTN--NILKINKFGN 191 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~~--~i~~~~~~~~ 191 (713)
...++.++..|+.+....+ .++.++..+.
T Consensus 378 ~p~~SpDG~~i~~~s~~~~~~~l~~~~~~G~ 408 (429)
T PRK03629 378 TPSIAPNGTMVIYSSSQGMGSVLNLVSTDGR 408 (429)
T ss_pred CceECCCCCEEEEEEcCCCceEEEEEECCCC
Confidence 2235555666766654322 2444554433
No 78
>PRK03629 tolB translocation protein TolB; Provisional
Probab=97.69 E-value=0.0063 Score=66.44 Aligned_cols=181 Identities=12% Similarity=0.011 Sum_probs=111.4
Q ss_pred CceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
.+|+.++.++.. +.+...-.....+++.+.+..|.++.... ....|++.++++...+.+.... .....+++.|...+
T Consensus 179 ~~l~~~d~dg~~~~~lt~~~~~~~~p~wSPDG~~la~~s~~~-g~~~i~i~dl~~G~~~~l~~~~-~~~~~~~~SPDG~~ 256 (429)
T PRK03629 179 YELRVSDYDGYNQFVVHRSPQPLMSPAWSPDGSKLAYVTFES-GRSALVIQTLANGAVRQVASFP-RHNGAPAFSPDGSK 256 (429)
T ss_pred eeEEEEcCCCCCCEEeecCCCceeeeEEcCCCCEEEEEEecC-CCcEEEEEECCCCCeEEccCCC-CCcCCeEECCCCCE
Confidence 367788877754 44444444456788998888877654311 3467999998876665554322 23446889998888
Q ss_pred EEEEcc-CCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEE-cC-CCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWTEL-GIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWA-DP-KARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~td~-~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~-d~-~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|+|+.. .....|+..++++...+.+... -.....+++++++++|+++ +. +..+|+++++++...+.+..... ...
T Consensus 257 La~~~~~~g~~~I~~~d~~tg~~~~lt~~-~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~~~-~~~ 334 (429)
T PRK03629 257 LAFALSKTGSLNLYVMDLASGQIRQVTDG-RSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITWEGS-QNQ 334 (429)
T ss_pred EEEEEcCCCCcEEEEEECCCCCEEEccCC-CCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeecCCC-Ccc
Confidence 988743 2234799999988766655432 2244678899888877554 43 34589999998876655543211 111
Q ss_pred ceeeeeeCCeEEEEeCC--CCcEEEEcccCC
Q psy6570 163 PYKLEVFEDNLYFSTYR--TNNILKINKFGN 191 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~--~~~i~~~~~~~~ 191 (713)
...+..++.+|+++... ...|+.++..++
T Consensus 335 ~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g 365 (429)
T PRK03629 335 DADVSSDGKFMVMVSSNGGQQHIAKQDLATG 365 (429)
T ss_pred CEEECCCCCEEEEEEccCCCceEEEEECCCC
Confidence 22334456666665432 234556665443
No 79
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=97.68 E-value=8.2e-05 Score=62.30 Aligned_cols=34 Identities=35% Similarity=0.886 Sum_probs=25.6
Q ss_pred CCCCCCCeecCCCCccCCCCCceeeCCCCcccCCCCccc
Q psy6570 634 HFCFNGGTCREQNYSLDPDLKPICICPRGYAGVRCQTLV 672 (713)
Q Consensus 634 ~~C~~~~~C~~~~~~~~~~~~~~C~C~~Gy~G~~C~~~~ 672 (713)
+.|.++ +|.. +.....+.|.|..||+|.+||...
T Consensus 51 ~YClHG-~C~y----I~dl~~~~CrC~~GYtGeRCEh~d 84 (139)
T PHA03099 51 GYCLHG-DCIH----ARDIDGMYCRCSHGYTGIRCQHVV 84 (139)
T ss_pred CEeECC-EEEe----eccCCCceeECCCCccccccccee
Confidence 457774 8876 233445899999999999998765
No 80
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.66 E-value=0.0055 Score=66.35 Aligned_cols=180 Identities=12% Similarity=0.058 Sum_probs=111.2
Q ss_pred CCceeEEEccCcc-cEEecCCCCCceEEEeccCCe-EEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 6 SGNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKN-LYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 6 ~~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~-ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
..+|+.++.+|.. +++... ..-....+.+.+++ +|++.... ....|++.++.+...+.|.... ......++.|..
T Consensus 168 ~~~l~~~d~dg~~~~~~~~~-~~~~~p~wSpDG~~~i~y~s~~~-~~~~Iyv~dl~tg~~~~lt~~~-g~~~~~~~SPDG 244 (419)
T PRK04043 168 KSNIVLADYTLTYQKVIVKG-GLNIFPKWANKEQTAFYYTSYGE-RKPTLYKYNLYTGKKEKIASSQ-GMLVVSDVSKDG 244 (419)
T ss_pred cceEEEECCCCCceeEEccC-CCeEeEEECCCCCcEEEEEEccC-CCCEEEEEECCCCcEEEEecCC-CcEEeeEECCCC
Confidence 4578888888754 434333 33345666777664 77665521 2468999999887777766422 222345677767
Q ss_pred CcEEEEcc-CCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc--CCCCcEEEEeCCCCceeEEEecCCCC
Q psy6570 84 GRMFWTEL-GIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD--PKARTIESINLNGKDRFVVYHTEDNG 160 (713)
Q Consensus 84 ~~ly~td~-~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d--~~~~~I~~~~~~g~~~~~~~~~~~~~ 160 (713)
++|+++.. ..++.|+.+++++...+.|..... .-....+.|++++||++. .+...|+++++++...+.+.....
T Consensus 245 ~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~-~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~-- 321 (419)
T PRK04043 245 SKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPG-IDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGK-- 321 (419)
T ss_pred CEEEEEEccCCCcEEEEEECCCCcEEEcccCCC-ccCccEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCC--
Confidence 67776543 334589999998887666643221 123457888888888875 344589999999877654443221
Q ss_pred ccceeeeeeCCeEEEEeCCC--------CcEEEEcccCCC
Q psy6570 161 YKPYKLEVFEDNLYFSTYRT--------NNILKINKFGNS 192 (713)
Q Consensus 161 ~~p~~i~~~~~~ly~td~~~--------~~i~~~~~~~~~ 192 (713)
.. ..++.++++|.++.... ..|+.++..++.
T Consensus 322 ~~-~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~ 360 (419)
T PRK04043 322 NN-SSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDY 360 (419)
T ss_pred cC-ceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCC
Confidence 12 25666677776664332 356666655443
No 81
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.66 E-value=0.0058 Score=66.94 Aligned_cols=180 Identities=13% Similarity=0.080 Sum_probs=109.1
Q ss_pred ceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcE
Q psy6570 8 NVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRM 86 (713)
Q Consensus 8 ~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~l 86 (713)
+|..++.++.. +.+.........+++.+.+++|+++.... ....|++.++++...+.+.... ......++.|..++|
T Consensus 180 ~l~~~d~~g~~~~~l~~~~~~~~~p~wSpDG~~la~~s~~~-~~~~l~~~~l~~g~~~~l~~~~-g~~~~~~~SpDG~~l 257 (430)
T PRK00178 180 TLQRSDYDGARAVTLLQSREPILSPRWSPDGKRIAYVSFEQ-KRPRIFVQNLDTGRREQITNFE-GLNGAPAWSPDGSKL 257 (430)
T ss_pred EEEEECCCCCCceEEecCCCceeeeeECCCCCEEEEEEcCC-CCCEEEEEECCCCCEEEccCCC-CCcCCeEECCCCCEE
Confidence 46666777644 33333333346678888888886654311 3457999999876666554322 233467888877788
Q ss_pred EEEcc-CCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEecCCCCccc
Q psy6570 87 FWTEL-GIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHTEDNGYKP 163 (713)
Q Consensus 87 y~td~-~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~~~~~~~p 163 (713)
+++-. .....|++.++++...+.+.. ........++++++++||++.. +...|+++++++...+.+..... ....
T Consensus 258 a~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~~-~~~~ 335 (430)
T PRK00178 258 AFVLSKDGNPEIYVMDLASRQLSRVTN-HPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVGN-YNAR 335 (430)
T ss_pred EEEEccCCCceEEEEECCCCCeEEccc-CCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCCC-Cccc
Confidence 77543 333489999999877665543 2223456788888888877642 34579999987766554432211 1122
Q ss_pred eeeeeeCCeEEEEeCCCC--cEEEEcccCC
Q psy6570 164 YKLEVFEDNLYFSTYRTN--NILKINKFGN 191 (713)
Q Consensus 164 ~~i~~~~~~ly~td~~~~--~i~~~~~~~~ 191 (713)
..+..++++|+++....+ .|+.++..++
T Consensus 336 ~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg 365 (430)
T PRK00178 336 PRLSADGKTLVMVHRQDGNFHVAAQDLQRG 365 (430)
T ss_pred eEECCCCCEEEEEEccCCceEEEEEECCCC
Confidence 334556777877754332 3666665544
No 82
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=97.64 E-value=0.003 Score=67.89 Aligned_cols=111 Identities=14% Similarity=0.205 Sum_probs=70.4
Q ss_pred EEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEE------e-CCCCCCeeEEEeCC------CCeEE
Q psy6570 65 TLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLV------D-NNIQWPTGITIDYP------SQRLY 131 (713)
Q Consensus 65 ~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~------~-~~~~~p~glavd~~------~~~LY 131 (713)
.++..+|..|++|++.| +++||+++.... +|++++.++...+.++ . .....+.||||+|. +++||
T Consensus 23 ~~va~GL~~Pw~maflP-DG~llVtER~~G-~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~~lY 100 (454)
T TIGR03606 23 KVLLSGLNKPWALLWGP-DNQLWVTERATG-KILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNPYVY 100 (454)
T ss_pred EEEECCCCCceEEEEcC-CCeEEEEEecCC-EEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCcEEE
Confidence 34557899999999997 679999996434 8988876654333222 1 13456789999965 45799
Q ss_pred EEcC---------CCCcEEEEeCCCC-----ceeEEEecC--CCCccceeeeee-CCeEEEEe
Q psy6570 132 WADP---------KARTIESINLNGK-----DRFVVYHTE--DNGYKPYKLEVF-EDNLYFST 177 (713)
Q Consensus 132 ~~d~---------~~~~I~~~~~~g~-----~~~~~~~~~--~~~~~p~~i~~~-~~~ly~td 177 (713)
++-+ ...+|.|+.++.. ..++++... ...+..-.|.+. +++||++-
T Consensus 101 vsyt~~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgPDG~LYVs~ 163 (454)
T TIGR03606 101 ISYTYKNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGPDGKIYYTI 163 (454)
T ss_pred EEEeccCCCCCccCCcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECCCCcEEEEE
Confidence 8842 2468999887632 233444321 112334455554 56899964
No 83
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.60 E-value=4.5e-05 Score=52.68 Aligned_cols=32 Identities=25% Similarity=0.708 Sum_probs=28.8
Q ss_pred ccCCCCCC--CCCCCCeeeccCCCceeeeCCCCc
Q psy6570 219 VTNHCDDK--PCHQSALCINLPSSHTCLCPDHLT 250 (713)
Q Consensus 219 ~~~~C~~~--~C~~~~~C~~~~g~~~C~C~~G~~ 250 (713)
.+|||+.. +|..++.|+|++|+|+|.|++||.
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 36899875 598899999999999999999998
No 84
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.60 E-value=0.0033 Score=66.75 Aligned_cols=148 Identities=16% Similarity=0.154 Sum_probs=92.6
Q ss_pred cCCceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce-EEEEEcCCC------CCcce
Q psy6570 5 SSGNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK-KRTLLNTGL------NEPYD 76 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~-~~~l~~~~~------~~p~~ 76 (713)
.++.|..+|+...+.+- +.....|.++++.+.++.||+++. ..+.|.++|..... .+.+-..++ .++.+
T Consensus 56 rdg~vsviD~~~~~~v~~i~~G~~~~~i~~s~DG~~~~v~n~---~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~a 132 (369)
T PF02239_consen 56 RDGTVSVIDLATGKVVATIKVGGNPRGIAVSPDGKYVYVANY---EPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAA 132 (369)
T ss_dssp TTSEEEEEETTSSSEEEEEE-SSEEEEEEE--TTTEEEEEEE---ETTEEEEEETTT--EEEEEE--EE-TTTS---EEE
T ss_pred CCCeEEEEECCcccEEEEEecCCCcceEEEcCCCCEEEEEec---CCCceeEeccccccceeecccccccccccCCCcee
Confidence 35788888888754322 566778999999999999999998 78899999876543 333322211 23446
Q ss_pred EEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEE-EeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEe
Q psy6570 77 IALEPLSGRMFWTELGIKPRISGASIDGKNKFNL-VDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYH 155 (713)
Q Consensus 77 iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l-~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~ 155 (713)
|...+.+..++++-.... +|+.++........+ ....-..|.++.+|+...++|++....+.|..++........+..
T Consensus 133 Iv~s~~~~~fVv~lkd~~-~I~vVdy~d~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~sn~i~viD~~~~k~v~~i~ 211 (369)
T PF02239_consen 133 IVASPGRPEFVVNLKDTG-EIWVVDYSDPKNLKVTTIKVGRFPHDGGFDPDGRYFLVAANGSNKIAVIDTKTGKLVALID 211 (369)
T ss_dssp EEE-SSSSEEEEEETTTT-EEEEEETTTSSCEEEEEEE--TTEEEEEE-TTSSEEEEEEGGGTEEEEEETTTTEEEEEEE
T ss_pred EEecCCCCEEEEEEccCC-eEEEEEeccccccceeeecccccccccccCcccceeeecccccceeEEEeeccceEEEEee
Confidence 666654444444444444 999998765433222 222346799999999888888888788899999987665554443
Q ss_pred c
Q psy6570 156 T 156 (713)
Q Consensus 156 ~ 156 (713)
.
T Consensus 212 ~ 212 (369)
T PF02239_consen 212 T 212 (369)
T ss_dssp -
T ss_pred c
Confidence 3
No 85
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=97.59 E-value=0.0015 Score=68.36 Aligned_cols=160 Identities=17% Similarity=0.233 Sum_probs=101.3
Q ss_pred CCceEEEeccCCeEEEeecCC----------CCCCeEEEEec--------CCceEEEEEcCCCCCcceEEEcCCCCcEEE
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGG----------RSSNNIMVSTL--------EGRKKRTLLNTGLNEPYDIALEPLSGRMFW 88 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~----------~~~~~I~~~~~--------~G~~~~~l~~~~~~~p~~iavD~~~~~ly~ 88 (713)
.-.-|++++.+ +||++=... ...++|.+.+. ++... .+...++..|.||+++|.++.||.
T Consensus 178 ~g~~l~f~pDG-~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a~~~~~d~p~~~~-~i~s~G~RN~qGl~w~P~tg~Lw~ 255 (399)
T COG2133 178 FGGRLVFGPDG-KLYVTTGSNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPNS-EIWSYGHRNPQGLAWHPVTGALWT 255 (399)
T ss_pred CcccEEECCCC-cEEEEeCCCCCcccccCccccccceeeeccCcccccCCCCCCc-ceEEeccCCccceeecCCCCcEEE
Confidence 34679999886 999985432 11244444443 34332 345567899999999999999999
Q ss_pred EccCCC---C--eEEEE---------------ecCC------CCcEEEEe-----CCCCCCeeEEEeCCC------CeEE
Q psy6570 89 TELGIK---P--RISGA---------------SIDG------KNKFNLVD-----NNIQWPTGITIDYPS------QRLY 131 (713)
Q Consensus 89 td~~~~---~--~I~~~---------------~~dG------~~~~~l~~-----~~~~~p~glavd~~~------~~LY 131 (713)
++.+.. + .|.++ ..+| .....+.. ...-.|.||+|-.-+ +.||
T Consensus 256 ~e~g~d~~~~~Deln~i~~G~nYGWP~~~~G~~~~g~~~~~~~~~~~~~~p~~~~~~h~ApsGmaFy~G~~fP~~r~~lf 335 (399)
T COG2133 256 TEHGPDALRGPDELNSIRPGKNYGWPYAYFGQNYDGRAIPDGTVVAGAIQPVYTWAPHIAPSGMAFYTGDLFPAYRGDLF 335 (399)
T ss_pred EecCCCcccCcccccccccCCccCCceeccCcccCccccCCCcccccccCCceeeccccccceeEEecCCcCccccCcEE
Confidence 997762 0 01111 1111 11111111 112346899987322 6899
Q ss_pred EEcCCCCcEEEEeCCCCceeE---EEecCCCCccceeeeee-CCeEEEEeCC-CCcEEEEccc
Q psy6570 132 WADPKARTIESINLNGKDRFV---VYHTEDNGYKPYKLEVF-EDNLYFSTYR-TNNILKINKF 189 (713)
Q Consensus 132 ~~d~~~~~I~~~~~~g~~~~~---~~~~~~~~~~p~~i~~~-~~~ly~td~~-~~~i~~~~~~ 189 (713)
++......+.+++.+|..+.. ++.. .....|.+|.+. ++.||+++.. +++|+|+...
T Consensus 336 V~~hgsw~~~~~~~~g~~~~~~~~fl~~-d~~gR~~dV~v~~DGallv~~D~~~g~i~Rv~~~ 397 (399)
T COG2133 336 VGAHGSWPVLRLRPDGNYKVVLTGFLSG-DLGGRPRDVAVAPDGALLVLTDQGDGRILRVSYA 397 (399)
T ss_pred EEeecceeEEEeccCCCcceEEEEEEec-CCCCcccceEECCCCeEEEeecCCCCeEEEecCC
Confidence 999888888889999984333 3332 223688888876 6789998766 6699999764
No 86
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.57 E-value=0.01 Score=64.74 Aligned_cols=168 Identities=15% Similarity=0.101 Sum_probs=104.0
Q ss_pred CceeEEEccCccc-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|+.+++.+... .+...-.....+++.+.++.||++.... ....|++.++++...+.+.... ......++.+..++
T Consensus 214 ~~i~v~d~~~g~~~~~~~~~~~~~~~~~spDg~~l~~~~~~~-~~~~i~~~d~~~~~~~~l~~~~-~~~~~~~~s~dg~~ 291 (417)
T TIGR02800 214 PEIYVQDLATGQREKVASFPGMNGAPAFSPDGSKLAVSLSKD-GNPDIYVMDLDGKQLTRLTNGP-GIDTEPSWSPDGKS 291 (417)
T ss_pred cEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEECCC-CCccEEEEECCCCCEEECCCCC-CCCCCEEECCCCCE
Confidence 4677777775433 2322223345678888878888765411 3457999999876655554322 22334567776677
Q ss_pred EEEE-ccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCC--CcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWT-ELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKA--RTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~t-d~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~--~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|+|+ +......|+++++++...+.+.. .......+++++.+++|+++.... .+|+.+++++...+.+..... ...
T Consensus 292 l~~~s~~~g~~~iy~~d~~~~~~~~l~~-~~~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~~~l~~~~~-~~~ 369 (417)
T TIGR02800 292 IAFTSDRGGSPQIYMMDADGGEVRRLTF-RGGYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGERVLTDTGL-DES 369 (417)
T ss_pred EEEEECCCCCceEEEEECCCCCEEEeec-CCCCccCeEECCCCCEEEEEEccCCceEEEEEeCCCCCeEEccCCCC-CCC
Confidence 7665 43334589999998876555543 334456788998888999987543 478999988766555543211 112
Q ss_pred ceeeeeeCCeEEEEeCC
Q psy6570 163 PYKLEVFEDNLYFSTYR 179 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~ 179 (713)
..++.++..|+++...
T Consensus 370 -p~~spdg~~l~~~~~~ 385 (417)
T TIGR02800 370 -PSFAPNGRMILYATTR 385 (417)
T ss_pred -ceECCCCCEEEEEEeC
Confidence 2445567777776544
No 87
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.57 E-value=0.0095 Score=65.26 Aligned_cols=179 Identities=12% Similarity=0.103 Sum_probs=108.8
Q ss_pred CceeEEEccCcccEEecCCC-CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLKTVLSNLH-DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~-~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|+++++++.....+.... ....+++.+.+++|+++-... ....|+++++++...+.+.... ......++.|..+.
T Consensus 223 ~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la~~~~~~-g~~~Iy~~d~~~~~~~~lt~~~-~~~~~~~~spDg~~ 300 (430)
T PRK00178 223 PRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLAFVLSKD-GNPEIYVMDLASRQLSRVTNHP-AIDTEPFWGKDGRT 300 (430)
T ss_pred CEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEEEEEccC-CCceEEEEECCCCCeEEcccCC-CCcCCeEECCCCCE
Confidence 45777777754433322222 233577888888887764311 3457999999887766554322 33445677777777
Q ss_pred EEEEc-cCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCC--CcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWTE-LGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKA--RTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~td-~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~--~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
||++. ....+.|+++++++...+.+... .......++++++++|+++.... ..|+.+++++...+.+..... ...
T Consensus 301 i~f~s~~~g~~~iy~~d~~~g~~~~lt~~-~~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~~~~-~~~ 378 (430)
T PRK00178 301 LYFTSDRGGKPQIYKVNVNGGRAERVTFV-GNYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTDTSL-DES 378 (430)
T ss_pred EEEEECCCCCceEEEEECCCCCEEEeecC-CCCccceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEccCCCC-CCC
Confidence 76654 33345899999887665555422 22334568888899998886433 368899988876665543211 122
Q ss_pred ceeeeeeCCeEEEEeCCC--CcEEEEcccC
Q psy6570 163 PYKLEVFEDNLYFSTYRT--NNILKINKFG 190 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~--~~i~~~~~~~ 190 (713)
| .++.++..|+++.... ..|+.++..+
T Consensus 379 p-~~spdg~~i~~~~~~~g~~~l~~~~~~g 407 (430)
T PRK00178 379 P-SVAPNGTMLIYATRQQGRGVLMLVSING 407 (430)
T ss_pred c-eECCCCCEEEEEEecCCceEEEEEECCC
Confidence 3 4555667777775432 3466666543
No 88
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.55 E-value=0.016 Score=58.28 Aligned_cols=183 Identities=11% Similarity=-0.009 Sum_probs=104.9
Q ss_pred CCceeEEEccCcc--cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 6 SGNVTRVKREMNL--KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 6 ~~~I~~~~~~~~~--~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
++.|...+..... ..+......+..+++.+. +.++++.. ..+.|.+.++........+......+..+++++.+
T Consensus 72 ~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 147 (289)
T cd00200 72 DKTIRLWDLETGECVRTLTGHTSYVSSVAFSPD-GRILSSSS---RDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDG 147 (289)
T ss_pred CCeEEEEEcCcccceEEEeccCCcEEEEEEcCC-CCEEEEec---CCCeEEEEECCCcEEEEEeccCCCcEEEEEEcCcC
Confidence 5666666666532 122222335677888765 45555554 46789999987433333333333568899999875
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccc
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKP 163 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p 163 (713)
.+|+... ... .|...++........+.........|++++.+..|+++.. .+.|..+++........... .....
T Consensus 148 ~~l~~~~-~~~-~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~-~~~i~i~d~~~~~~~~~~~~--~~~~i 222 (289)
T cd00200 148 TFVASSS-QDG-TIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSS-DGTIKLWDLSTGKCLGTLRG--HENGV 222 (289)
T ss_pred CEEEEEc-CCC-cEEEEEccccccceeEecCccccceEEECCCcCEEEEecC-CCcEEEEECCCCceecchhh--cCCce
Confidence 5555544 223 6777777644333333333446788999977767777765 67788888765332222211 12234
Q ss_pred eeeeeeC-CeEEEEeCCCCcEEEEcccCCCcceee
Q psy6570 164 YKLEVFE-DNLYFSTYRTNNILKINKFGNSDFNVL 197 (713)
Q Consensus 164 ~~i~~~~-~~ly~td~~~~~i~~~~~~~~~~~~~~ 197 (713)
..+.+.. +.++++....+.|..++...+.....+
T Consensus 223 ~~~~~~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~ 257 (289)
T cd00200 223 NSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257 (289)
T ss_pred EEEEEcCCCcEEEEEcCCCcEEEEEcCCceeEEEc
Confidence 5566554 455555544677777776544433333
No 89
>PRK02889 tolB translocation protein TolB; Provisional
Probab=97.54 E-value=0.011 Score=64.47 Aligned_cols=178 Identities=12% Similarity=0.091 Sum_probs=105.2
Q ss_pred CceeEEEccCcccEEecCC-CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLKTVLSNL-HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~-~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|+.+++.+.....+... .....+++.+.+++|+++-.. .....|+++++++...+.+.... ......++.|..++
T Consensus 220 ~~I~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~-~g~~~Iy~~d~~~~~~~~lt~~~-~~~~~~~wSpDG~~ 297 (427)
T PRK02889 220 PVVYVHDLATGRRRVVANFKGSNSAPAWSPDGRTLAVALSR-DGNSQIYTVNADGSGLRRLTQSS-GIDTEPFFSPDGRS 297 (427)
T ss_pred cEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEEcc-CCCceEEEEECCCCCcEECCCCC-CCCcCeEEcCCCCE
Confidence 4577888776543333322 234567888888888775331 13467999998877665554322 23345678877777
Q ss_pred EEEE-ccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCC--CcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWT-ELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKA--RTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~t-d~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~--~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|+++ +......|+++++++...+.+.... ......++++++++|+++.... ..|+.+++++...+.+.... ...
T Consensus 298 l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~~~--~~~ 374 (427)
T PRK02889 298 IYFTSDRGGAPQIYRMPASGGAAQRVTFTG-SYNTSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALTDTT--RDE 374 (427)
T ss_pred EEEEecCCCCcEEEEEECCCCceEEEecCC-CCcCceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEccCCC--Ccc
Confidence 7665 4334458999998876655544222 2234578998888988875433 36899998776655554321 112
Q ss_pred ceeeeeeCCeEEEEeCCC--CcEEEEccc
Q psy6570 163 PYKLEVFEDNLYFSTYRT--NNILKINKF 189 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~--~~i~~~~~~ 189 (713)
...++.++..|+++.... ..++.++..
T Consensus 375 ~p~~spdg~~l~~~~~~~g~~~l~~~~~~ 403 (427)
T PRK02889 375 SPSFAPNGRYILYATQQGGRSVLAAVSSD 403 (427)
T ss_pred CceECCCCCEEEEEEecCCCEEEEEEECC
Confidence 223444566666654322 224455543
No 90
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.54 E-value=0.01 Score=64.71 Aligned_cols=181 Identities=13% Similarity=0.041 Sum_probs=107.4
Q ss_pred CCceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCC
Q psy6570 6 SGNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSG 84 (713)
Q Consensus 6 ~~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~ 84 (713)
...|+.++.++.. +.+...-.....+++.+.++.|+|+.... ....|++.++++...+.+.... .....+++.|..+
T Consensus 169 ~~~l~~~d~~g~~~~~l~~~~~~~~~p~~Spdg~~la~~~~~~-~~~~i~v~d~~~g~~~~~~~~~-~~~~~~~~spDg~ 246 (417)
T TIGR02800 169 RYELQVADYDGANPQTITRSREPILSPAWSPDGQKLAYVSFES-GKPEIYVQDLATGQREKVASFP-GMNGAPAFSPDGS 246 (417)
T ss_pred cceEEEEcCCCCCCEEeecCCCceecccCCCCCCEEEEEEcCC-CCcEEEEEECCCCCEEEeecCC-CCccceEECCCCC
Confidence 3457777776543 33333222345567888888888876522 2367999998765544443221 3345688888777
Q ss_pred cEEEEccC-CCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEecCCCCc
Q psy6570 85 RMFWTELG-IKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHTEDNGY 161 (713)
Q Consensus 85 ~ly~td~~-~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~~~~~~ 161 (713)
.||++... ....|+..++++...+.+... .......++.+++++|+++.. ....|+.+++++...+.+..... ..
T Consensus 247 ~l~~~~~~~~~~~i~~~d~~~~~~~~l~~~-~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~~~-~~ 324 (417)
T TIGR02800 247 KLAVSLSKDGNPDIYVMDLDGKQLTRLTNG-PGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFRGG-YN 324 (417)
T ss_pred EEEEEECCCCCccEEEEECCCCCEEECCCC-CCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCCC-Cc
Confidence 78776432 334799999987665555432 122335677877888877642 34579999998776555543221 11
Q ss_pred cceeeeeeCCeEEEEeCCC--CcEEEEcccC
Q psy6570 162 KPYKLEVFEDNLYFSTYRT--NNILKINKFG 190 (713)
Q Consensus 162 ~p~~i~~~~~~ly~td~~~--~~i~~~~~~~ 190 (713)
....+..++.+|+++.... ..|+.++..+
T Consensus 325 ~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~ 355 (417)
T TIGR02800 325 ASPSWSPDGDLIAFVHREGGGFNIAVMDLDG 355 (417)
T ss_pred cCeEECCCCCEEEEEEccCCceEEEEEeCCC
Confidence 1223344566777776543 3566666544
No 91
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=97.53 E-value=5.7e-05 Score=48.51 Aligned_cols=30 Identities=40% Similarity=1.031 Sum_probs=27.0
Q ss_pred CCCCCCCCCCeeeccC-CCceeeeCCCCccc
Q psy6570 223 CDDKPCHQSALCINLP-SSHTCLCPDHLTEE 252 (713)
Q Consensus 223 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~~~ 252 (713)
|..+||.++++|++.. ++|+|.|++||+|.
T Consensus 1 C~~~~C~n~g~C~~~~~~~y~C~C~~G~~G~ 31 (32)
T PF00008_consen 1 CSSNPCQNGGTCIDLPGGGYTCECPPGYTGK 31 (32)
T ss_dssp TTTTSSTTTEEEEEESTSEEEEEEBTTEEST
T ss_pred CCCCcCCCCeEEEeCCCCCEEeECCCCCccC
Confidence 5677999999999999 89999999999863
No 92
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=97.47 E-value=0.00021 Score=44.37 Aligned_cols=28 Identities=21% Similarity=0.604 Sum_probs=25.2
Q ss_pred CCCCeeEEEeCCCCeEEEEcCCCCcEEEE
Q psy6570 115 IQWPTGITIDYPSQRLYWADPKARTIESI 143 (713)
Q Consensus 115 ~~~p~glavd~~~~~LY~~d~~~~~I~~~ 143 (713)
+..|.|||+| .++.||++|.++++|+++
T Consensus 1 f~~P~gvav~-~~g~i~VaD~~n~rV~vf 28 (28)
T PF01436_consen 1 FNYPHGVAVD-SDGNIYVADSGNHRVQVF 28 (28)
T ss_dssp BSSEEEEEEE-TTSEEEEEECCCTEEEEE
T ss_pred CcCCcEEEEe-CCCCEEEEECCCCEEEEC
Confidence 4679999999 789999999999999875
No 93
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=97.43 E-value=0.0002 Score=48.62 Aligned_cols=36 Identities=42% Similarity=1.038 Sum_probs=26.3
Q ss_pred CCCCCC-CCCCCCcEEeecCCCceeeCCCCCc-CCCCC
Q psy6570 552 ANKCTP-NYCSNNGTCVLIEGKPSCKCLPPYS-GKQCT 587 (713)
Q Consensus 552 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~-G~~C~ 587 (713)
+++|.. .+|.++++|++..++|.|.|++||. |..|+
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~~g~~C~ 39 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCccCCcCC
Confidence 345665 5677777888888888888888887 76663
No 94
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=97.43 E-value=3.4e-05 Score=69.72 Aligned_cols=128 Identities=26% Similarity=0.594 Sum_probs=75.2
Q ss_pred CCCCCeeecCCCCCCeeecCCCcc---cCCccccC-CC-----CCCCCCCceeeCCCC-CCCCCceeeCCCCcc--cCCC
Q psy6570 315 DCNHGTCEFDDDFDPHCICQENFY---GTYCEKVN-NS-----MCPCLNQGMCYPDLT-HPEPTYKCHCAPSYT--GARC 382 (713)
Q Consensus 315 ~C~~~~C~~~~~~~~~C~C~~g~~---G~~C~~~~-c~-----~~~C~~~~~C~~~~~-~~~~~~~C~C~~G~~--g~~C 382 (713)
.|.||..+....- |.|.|++||. -++||... |. ..+|..-+.|..... ..+..|.|.|.+||. ...|
T Consensus 7 ~CKNG~LiQMSNH-fEC~Cnegfvl~~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~vC 85 (197)
T PF06247_consen 7 ICKNGYLIQMSNH-FECKCNEGFVLKNENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQGVC 85 (197)
T ss_dssp --BTEEEEEESSE-EEEEESTTEEEEETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSSSE
T ss_pred cccCCEEEEccCc-eEEEcCCCcEEccccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCCeE
Confidence 3666666654333 7999999997 35676533 43 347888899987654 345689999999998 4467
Q ss_pred CccCCC-CCCCCCCEEEc-----CCCeeeCCCCCcc---CCCC-----cCCCCCCCCCcEeccCCCCccccCCCCCc
Q psy6570 383 ESRICE-NKCHNGGTCIA-----TTQTCVCPPGFTG---DTCQ-----QCLNLKCQNGGVCVNKTTGLECDCPKFYY 445 (713)
Q Consensus 383 ~~~~C~-~~C~~~~~C~~-----~~~~C~C~~g~~g---~~C~-----~C~~~~C~~~~~C~~~~~~~~C~C~~G~~ 445 (713)
.+..|. ..|. .|.|+. ....|+|.-|+.- ..|. +|... |..+..|....+-|+|.|.+||.
T Consensus 86 vp~~C~~~~Cg-~GKCI~d~~~~~~~~CSC~IGkV~~dn~kCtk~G~T~C~LK-Ck~nE~CK~~~~~Y~C~~~~~~~ 160 (197)
T PF06247_consen 86 VPNKCNNKDCG-SGKCILDPDNPNNPTCSCNIGKVPDDNKKCTKTGETKCSLK-CKENEECKLVDGYYKCVCKEGFP 160 (197)
T ss_dssp EEGGGSS---T-TEEEEEEEGGGSEEEEEE-TEEETTTTTESEEEE---------TTTEEEEEETTEEEEEE-TT-E
T ss_pred chhhcCceecC-CCeEEecCCCCCCceeEeeeceEeccCCcccCCCccceeee-cCCCcceeeeCcEEEeecCCCCC
Confidence 776776 4576 689986 2348999999871 1232 33322 55556666666666666666664
No 95
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.42 E-value=0.025 Score=61.58 Aligned_cols=185 Identities=14% Similarity=0.131 Sum_probs=104.0
Q ss_pred CceeEEEccCccc-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEE--ecCC---ceEEEEEcCCCCCcceEEEc
Q psy6570 7 GNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVS--TLEG---RKKRTLLNTGLNEPYDIALE 80 (713)
Q Consensus 7 ~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~--~~~G---~~~~~l~~~~~~~p~~iavD 80 (713)
..|+.+++++... .+...-......++.+.+++|+++-... ....|++. ++++ ...+.+...........++.
T Consensus 211 ~~I~~~~l~~g~~~~lt~~~g~~~~p~wSPDG~~Laf~s~~~-g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wS 289 (428)
T PRK01029 211 PKIFLGSLENPAGKKILALQGNQLMPTFSPRKKLLAFISDRY-GNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFS 289 (428)
T ss_pred ceEEEEECCCCCceEeecCCCCccceEECCCCCEEEEEECCC-CCcceeEEEeecccCCCCcceEeecCCCCCcCCeEEC
Confidence 4678888876543 3333223344577888888888765311 22345543 4332 23334443322334467888
Q ss_pred CCCCcEEEEc-cCCCCeEEEEecCCCC-cEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCceeEEEec
Q psy6570 81 PLSGRMFWTE-LGIKPRISGASIDGKN-KFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRFVVYHT 156 (713)
Q Consensus 81 ~~~~~ly~td-~~~~~~I~~~~~dG~~-~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~~~~~~ 156 (713)
|....|+|+. .....+|+++++++.. ....+..........++.|++++|+++.. +...|+.+++++...+.+...
T Consensus 290 PDG~~Laf~s~~~g~~~ly~~~~~~~g~~~~~lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~ 369 (428)
T PRK01029 290 PDGTRLVFVSNKDGRPRIYIMQIDPEGQSPRLLTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTTS 369 (428)
T ss_pred CCCCEEEEEECCCCCceEEEEECcccccceEEeccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccCC
Confidence 8777676654 4444589999886432 22222222234456789999998888753 345799999988777666533
Q ss_pred CCCCccceeeeeeCCeEEEEeC--CCCcEEEEcccCCCc
Q psy6570 157 EDNGYKPYKLEVFEDNLYFSTY--RTNNILKINKFGNSD 193 (713)
Q Consensus 157 ~~~~~~p~~i~~~~~~ly~td~--~~~~i~~~~~~~~~~ 193 (713)
.... .....+.++..|+++.. ....|+.++..++..
T Consensus 370 ~~~~-~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~ 407 (428)
T PRK01029 370 PENK-ESPSWAIDSLHLVYSAGNSNESELYLISLITKKT 407 (428)
T ss_pred CCCc-cceEECCCCCEEEEEECCCCCceEEEEECCCCCE
Confidence 2111 11223334566766532 345577777665543
No 96
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=97.40 E-value=0.007 Score=59.55 Aligned_cols=131 Identities=17% Similarity=0.211 Sum_probs=83.2
Q ss_pred CCceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCc-eEEEE-Ec-C--CCCCcceEEE
Q psy6570 6 SGNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR-KKRTL-LN-T--GLNEPYDIAL 79 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~-~~~~l-~~-~--~~~~p~~iav 79 (713)
++..+.+++++-+.+- .+-..+..||+.| +.+||.+|. ..+|+..+++.- ..+.+ +. . .+...+-|..
T Consensus 109 ~~~~f~yd~~tl~~~~~~~y~~EGWGLt~d--g~~Li~SDG----S~~L~~~dP~~f~~~~~i~V~~~g~pv~~LNELE~ 182 (264)
T PF05096_consen 109 EGTGFVYDPNTLKKIGTFPYPGEGWGLTSD--GKRLIMSDG----SSRLYFLDPETFKEVRTIQVTDNGRPVSNLNELEY 182 (264)
T ss_dssp SSEEEEEETTTTEEEEEEE-SSS--EEEEC--SSCEEEE-S----SSEEEEE-TTT-SEEEEEE-EETTEE---EEEEEE
T ss_pred CCeEEEEccccceEEEEEecCCcceEEEcC--CCEEEEECC----ccceEEECCcccceEEEEEEEECCEECCCcEeEEE
Confidence 4566677776533222 4445688999976 678999996 688999987642 22222 22 1 1455666777
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC---------------CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEe
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN---------------NIQWPTGITIDYPSQRLYWADPKARTIESIN 144 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~---------------~~~~p~glavd~~~~~LY~~d~~~~~I~~~~ 144 (713)
.+|.||---|..+ +|.++++........+.. ....-+|||.|+..++||++-..-.+++.+.
T Consensus 183 --i~G~IyANVW~td-~I~~Idp~tG~V~~~iDls~L~~~~~~~~~~~~~~dVLNGIAyd~~~~~l~vTGK~Wp~lyeV~ 259 (264)
T PF05096_consen 183 --INGKIYANVWQTD-RIVRIDPETGKVVGWIDLSGLRPEVGRDKSRQPDDDVLNGIAYDPETDRLFVTGKLWPKLYEVK 259 (264)
T ss_dssp --ETTEEEEEETTSS-EEEEEETTT-BEEEEEE-HHHHHHHTSTTST--TTS-EEEEEEETTTTEEEEEETT-SEEEEEE
T ss_pred --EcCEEEEEeCCCC-eEEEEeCCCCeEEEEEEhhHhhhcccccccccccCCeeEeEeEeCCCCEEEEEeCCCCceEEEE
Confidence 5788887777777 999999987666555531 1234699999999999999998888888876
Q ss_pred C
Q psy6570 145 L 145 (713)
Q Consensus 145 ~ 145 (713)
+
T Consensus 260 l 260 (264)
T PF05096_consen 260 L 260 (264)
T ss_dssp E
T ss_pred E
Confidence 5
No 97
>PRK02888 nitrous-oxide reductase; Validated
Probab=97.38 E-value=0.011 Score=64.96 Aligned_cols=184 Identities=14% Similarity=0.105 Sum_probs=108.6
Q ss_pred cccCCceeEEEccCcc---cEEecCCCCCceEEEe--ccCCeEEEeecCC--------------CCCCeEEEEecCCceE
Q psy6570 3 SISSGNVTRVKREMNL---KTVLSNLHDPRGVAVD--WVGKNLYWTDAGG--------------RSSNNIMVSTLEGRKK 63 (713)
Q Consensus 3 d~~~~~I~~~~~~~~~---~~~~~~~~~p~gla~D--~~~~~ly~td~~~--------------~~~~~I~~~~~~G~~~ 63 (713)
|-.+.||-||+++.-+ .+.++.....+|+++. +.++.+|-.-... +..+.+.++|.+...+
T Consensus 148 dk~n~Rvari~l~~~~~~~i~~iPn~~~~Hg~~~~~~p~t~yv~~~~e~~~PlpnDGk~l~~~~ey~~~vSvID~etmeV 227 (635)
T PRK02888 148 DKANTRVARIRLDVMKCDKITELPNVQGIHGLRPQKIPRTGYVFCNGEFRIPLPNDGKDLDDPKKYRSLFTAVDAETMEV 227 (635)
T ss_pred cCCCcceEEEECccEeeceeEeCCCccCccccCccccCCccEEEeCcccccccCCCCCEeecccceeEEEEEEECccceE
Confidence 4568899999988733 2236777778888887 4444444221100 0234455555553222
Q ss_pred E-EEEcCCCCCcceEEEcCCCCcEEEEccCC---C----------CeEEEEe-----------------------cCCCC
Q psy6570 64 R-TLLNTGLNEPYDIALEPLSGRMFWTELGI---K----------PRISGAS-----------------------IDGKN 106 (713)
Q Consensus 64 ~-~l~~~~~~~p~~iavD~~~~~ly~td~~~---~----------~~I~~~~-----------------------~dG~~ 106 (713)
. ++. .+ .+|..+++++.++++|++..+. . ..+..++ +|+..
T Consensus 228 ~~qV~-Vd-gnpd~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d~~vvfni~~iea~vkdGK~~~V~gn~V~VID~~t 305 (635)
T PRK02888 228 AWQVM-VD-GNLDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERDWVVVFNIARIEEAVKAGKFKTIGGSKVPVVDGRK 305 (635)
T ss_pred EEEEE-eC-CCcccceECCCCCEEEEeccCcccCcceeeeccccCceEEEEchHHHHHhhhCCCEEEECCCEEEEEECCc
Confidence 1 222 12 4899999999888999985221 0 0111111 11221
Q ss_pred -----cEEEE-eCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc---------eeEEEecCCCCccceeeeeeC-
Q psy6570 107 -----KFNLV-DNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD---------RFVVYHTEDNGYKPYKLEVFE- 170 (713)
Q Consensus 107 -----~~~l~-~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~---------~~~~~~~~~~~~~p~~i~~~~- 170 (713)
...+. ..--+.|.||+++|++++||++...++.|..+++.... +..+.........|+-.++++
T Consensus 306 ~~~~~~~v~~yIPVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~vvaevevGlGPLHTaFDg~ 385 (635)
T PRK02888 306 AANAGSALTRYVPVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAVVAEPELGLGPLHTAFDGR 385 (635)
T ss_pred cccCCcceEEEEECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceEEEeeccCCCcceEEECCC
Confidence 11111 11235799999999999999999999999999976532 222333222245777777774
Q ss_pred CeEEEEeCCCCcEEEEcc
Q psy6570 171 DNLYFSTYRTNNILKINK 188 (713)
Q Consensus 171 ~~ly~td~~~~~i~~~~~ 188 (713)
++.|.+-.....|.+++.
T Consensus 386 G~aytslf~dsqv~kwn~ 403 (635)
T PRK02888 386 GNAYTTLFLDSQIVKWNI 403 (635)
T ss_pred CCEEEeEeecceeEEEeh
Confidence 478887666666766664
No 98
>PRK01742 tolB translocation protein TolB; Provisional
Probab=97.38 E-value=0.018 Score=62.97 Aligned_cols=176 Identities=12% Similarity=0.069 Sum_probs=108.3
Q ss_pred CceeEEEccCccc-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
.+|+..+.++... .+.........+++.+.+++|+++.... ....|++.++++...+.+.... ..-..+++.|...+
T Consensus 184 ~~i~i~d~dg~~~~~lt~~~~~v~~p~wSPDG~~la~~s~~~-~~~~i~i~dl~tg~~~~l~~~~-g~~~~~~wSPDG~~ 261 (429)
T PRK01742 184 YEVRVADYDGFNQFIVNRSSQPLMSPAWSPDGSKLAYVSFEN-KKSQLVVHDLRSGARKVVASFR-GHNGAPAFSPDGSR 261 (429)
T ss_pred EEEEEECCCCCCceEeccCCCccccceEcCCCCEEEEEEecC-CCcEEEEEeCCCCceEEEecCC-CccCceeECCCCCE
Confidence 4566667776433 3333333457788999888887765421 2457999998876555444221 22346788887777
Q ss_pred EEEEc-cCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc--CCCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWTE-LGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD--PKARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~td-~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d--~~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
|+++. ......|+..++++...+.+.. .......+++++++.+|+++- .+..+|+.++.++...+.+ ... . .
T Consensus 262 La~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~~~--~-~ 336 (429)
T PRK01742 262 LAFASSKDGVLNIYVMGANGGTPSQLTS-GAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGASLV-GGR--G-Y 336 (429)
T ss_pred EEEEEecCCcEEEEEEECCCCCeEeecc-CCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCeEEe-cCC--C-C
Confidence 88864 2333478999998776666543 223356789998888777663 2456899998887766554 221 1 2
Q ss_pred ceeeeeeCCeEEEEeCCCCcEEEEcccCC
Q psy6570 163 PYKLEVFEDNLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~~~i~~~~~~~~ 191 (713)
...+..++.+|+.+.. ..|++++..++
T Consensus 337 ~~~~SpDG~~ia~~~~--~~i~~~Dl~~g 363 (429)
T PRK01742 337 SAQISADGKTLVMING--DNVVKQDLTSG 363 (429)
T ss_pred CccCCCCCCEEEEEcC--CCEEEEECCCC
Confidence 2234455667766543 45666665444
No 99
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=97.31 E-value=0.03 Score=55.20 Aligned_cols=161 Identities=13% Similarity=0.062 Sum_probs=99.6
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceE-EEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKK-RTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN 106 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~-~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~ 106 (713)
-+||.++ .++.||-+-. ..+..+|.++++++... +..--..-.--.||++- ++.||.-.|..+ ..++.+++.-.
T Consensus 47 TQGL~~~-~~g~LyESTG-~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~--~d~l~qLTWk~~-~~f~yd~~tl~ 121 (264)
T PF05096_consen 47 TQGLEFL-DDGTLYESTG-LYGQSSLRKVDLETGKVLQSVPLPPRYFGEGITIL--GDKLYQLTWKEG-TGFVYDPNTLK 121 (264)
T ss_dssp EEEEEEE-ETTEEEEEEC-STTEEEEEEEETTTSSEEEEEE-TTT--EEEEEEE--TTEEEEEESSSS-EEEEEETTTTE
T ss_pred CccEEec-CCCEEEEeCC-CCCcEEEEEEECCCCcEEEEEECCccccceeEEEE--CCEEEEEEecCC-eEEEEccccce
Confidence 4789885 3578997765 22345799999886443 23222333456789984 789999999887 88888887432
Q ss_pred cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc-eeEEEec--CCCCccceeeeeeCCeEEEEeCCCCcE
Q psy6570 107 KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD-RFVVYHT--EDNGYKPYKLEVFEDNLYFSTYRTNNI 183 (713)
Q Consensus 107 ~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~-~~~~~~~--~~~~~~p~~i~~~~~~ly~td~~~~~i 183 (713)
...-+.- .....||+-| ++.||.+|. +.+|+..+...-. ...+... ...+..-.-|.+.++.||---|.++.|
T Consensus 122 ~~~~~~y-~~EGWGLt~d--g~~Li~SDG-S~~L~~~dP~~f~~~~~i~V~~~g~pv~~LNELE~i~G~IyANVW~td~I 197 (264)
T PF05096_consen 122 KIGTFPY-PGEGWGLTSD--GKRLIMSDG-SSRLYFLDPETFKEVRTIQVTDNGRPVSNLNELEYINGKIYANVWQTDRI 197 (264)
T ss_dssp EEEEEE--SSS--EEEEC--SSCEEEE-S-SSEEEEE-TTT-SEEEEEE-EETTEE---EEEEEEETTEEEEEETTSSEE
T ss_pred EEEEEec-CCcceEEEcC--CCEEEEECC-ccceEEECCcccceEEEEEEEECCEECCCcEeEEEEcCEEEEEeCCCCeE
Confidence 2222221 2356899977 788999994 7889998876432 2222211 111222234666788888888999999
Q ss_pred EEEcccCCCcceee
Q psy6570 184 LKINKFGNSDFNVL 197 (713)
Q Consensus 184 ~~~~~~~~~~~~~~ 197 (713)
.++++.++.....+
T Consensus 198 ~~Idp~tG~V~~~i 211 (264)
T PF05096_consen 198 VRIDPETGKVVGWI 211 (264)
T ss_dssp EEEETTT-BEEEEE
T ss_pred EEEeCCCCeEEEEE
Confidence 99999887665554
No 100
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.29 E-value=0.037 Score=60.33 Aligned_cols=183 Identities=10% Similarity=0.048 Sum_probs=102.3
Q ss_pred CCceeEEEccCccc-EEecCCCCCceEEEeccCCe---EEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcC
Q psy6570 6 SGNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKN---LYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEP 81 (713)
Q Consensus 6 ~~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~---ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~ 81 (713)
..+|+.++.+|... .+..........++.+.++. +|++... ....|++.+++|...+.|.... ......++.|
T Consensus 164 ~~~l~~~d~dG~~~~~lt~~~~~~~sP~wSPDG~~~~~~y~S~~~--g~~~I~~~~l~~g~~~~lt~~~-g~~~~p~wSP 240 (428)
T PRK01029 164 QGELWSVDYDGQNLRPLTQEHSLSITPTWMHIGSGFPYLYVSYKL--GVPKIFLGSLENPAGKKILALQ-GNQLMPTFSP 240 (428)
T ss_pred cceEEEEcCCCCCceEcccCCCCcccceEccCCCceEEEEEEccC--CCceEEEEECCCCCceEeecCC-CCccceEECC
Confidence 45788888887543 33222222234467776654 4566542 3568999999988777665432 3344578888
Q ss_pred CCCcEEEEccC-CCCeEEEE--ecCC---CCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCC--cee
Q psy6570 82 LSGRMFWTELG-IKPRISGA--SIDG---KNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGK--DRF 151 (713)
Q Consensus 82 ~~~~ly~td~~-~~~~I~~~--~~dG---~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~--~~~ 151 (713)
...+|.|+... ....|+.. ++++ ...+.+...........+++|++++|+++.. +..+|+.+++++. ..+
T Consensus 241 DG~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSPDG~~Laf~s~~~g~~~ly~~~~~~~g~~~~ 320 (428)
T PRK01029 241 RKKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFSPDGTRLVFVSNKDGRPRIYIMQIDPEGQSPR 320 (428)
T ss_pred CCCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECcccccceE
Confidence 77788776532 23356664 3332 2333444333333456799998887777642 3457999888642 233
Q ss_pred EEEecCCCCccceeeeeeCCeEEEEeCC--CCcEEEEcccCCC
Q psy6570 152 VVYHTEDNGYKPYKLEVFEDNLYFSTYR--TNNILKINKFGNS 192 (713)
Q Consensus 152 ~~~~~~~~~~~p~~i~~~~~~ly~td~~--~~~i~~~~~~~~~ 192 (713)
.+..... .........++++|+++... ...|+.++..++.
T Consensus 321 ~lt~~~~-~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~ 362 (428)
T PRK01029 321 LLTKKYR-NSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGR 362 (428)
T ss_pred EeccCCC-CccceeECCCCCEEEEEEcCCCCcEEEEEECCCCC
Confidence 3322211 11122344556677666432 2356677665543
No 101
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.26 E-value=0.00017 Score=49.80 Aligned_cols=31 Identities=45% Similarity=1.023 Sum_probs=23.9
Q ss_pred CCCCCC--CCCCCCcEEeecCCCceeeCCCCCc
Q psy6570 552 ANKCTP--NYCSNNGTCVLIEGKPSCKCLPPYS 582 (713)
Q Consensus 552 ~~~C~~--~~C~~~~~C~~~~g~~~C~C~~G~~ 582 (713)
+++|.. +.|..++.|+|+.|+|+|.|++||.
T Consensus 2 idEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~ 34 (42)
T PF07645_consen 2 IDECAEGPHNCPENGTCVNTEGSYSCSCPPGYE 34 (42)
T ss_dssp SSTTTTTSSSSSTTSEEEEETTEEEEEESTTEE
T ss_pred ccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcE
Confidence 466663 3587788888888888888888886
No 102
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.24 E-value=0.058 Score=54.11 Aligned_cols=175 Identities=16% Similarity=0.111 Sum_probs=95.7
Q ss_pred CCceeEEEccCccc-EEecCCCCC-ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 6 SGNVTRVKREMNLK-TVLSNLHDP-RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 6 ~~~I~~~~~~~~~~-~~~~~~~~p-~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
++.|..++...... ........+ ..+.+.+.++.|+.+. ..+.|.+.+++.......+......+..+++++.
T Consensus 30 ~g~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~----~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~- 104 (289)
T cd00200 30 DGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGS----SDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPD- 104 (289)
T ss_pred CcEEEEEEeeCCCcEEEEecCCcceeEEEECCCCCEEEEEc----CCCeEEEEEcCcccceEEEeccCCcEEEEEEcCC-
Confidence 45666666554321 112222233 3677776655566554 3678888888764322223232346788999876
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc-eeEEEecCCCCcc
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD-RFVVYHTEDNGYK 162 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~-~~~~~~~~~~~~~ 162 (713)
+.++++..... .|...++........+......+..+++++.+..|+.+. ..+.|..+++.... ...+... ...
T Consensus 105 ~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~-~~~~i~i~d~~~~~~~~~~~~~---~~~ 179 (289)
T cd00200 105 GRILSSSSRDK-TIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSS-QDGTIKLWDLRTGKCVATLTGH---TGE 179 (289)
T ss_pred CCEEEEecCCC-eEEEEECCCcEEEEEeccCCCcEEEEEEcCcCCEEEEEc-CCCcEEEEEccccccceeEecC---ccc
Confidence 45555554333 788888773333333333344578899997755555443 46678888876332 3333211 123
Q ss_pred ceeeeeeCC--eEEEEeCCCCcEEEEcccCC
Q psy6570 163 PYKLEVFED--NLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 163 p~~i~~~~~--~ly~td~~~~~i~~~~~~~~ 191 (713)
...+.+..+ .|+.+. ..+.|..++...+
T Consensus 180 i~~~~~~~~~~~l~~~~-~~~~i~i~d~~~~ 209 (289)
T cd00200 180 VNSVAFSPDGEKLLSSS-SDGTIKLWDLSTG 209 (289)
T ss_pred cceEEECCCcCEEEEec-CCCcEEEEECCCC
Confidence 445555533 455544 3667777766543
No 103
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=97.22 E-value=0.0041 Score=65.17 Aligned_cols=127 Identities=17% Similarity=0.207 Sum_probs=81.6
Q ss_pred cEEecCCCCCceEEEeccCCeEEEeecCC---CCCCeEEEE---------------ecCCc------eEEEEEc-----C
Q psy6570 19 KTVLSNLHDPRGVAVDWVGKNLYWTDAGG---RSSNNIMVS---------------TLEGR------KKRTLLN-----T 69 (713)
Q Consensus 19 ~~~~~~~~~p~gla~D~~~~~ly~td~~~---~~~~~I~~~---------------~~~G~------~~~~l~~-----~ 69 (713)
++...++.+|.|+++++.++.||.++.+. +....|.+. +.+|. ....++. .
T Consensus 232 ~i~s~G~RN~qGl~w~P~tg~Lw~~e~g~d~~~~~Deln~i~~G~nYGWP~~~~G~~~~g~~~~~~~~~~~~~~p~~~~~ 311 (399)
T COG2133 232 EIWSYGHRNPQGLAWHPVTGALWTTEHGPDALRGPDELNSIRPGKNYGWPYAYFGQNYDGRAIPDGTVVAGAIQPVYTWA 311 (399)
T ss_pred ceEEeccCCccceeecCCCCcEEEEecCCCcccCcccccccccCCccCCceeccCcccCccccCCCcccccccCCceeec
Confidence 45678999999999999999999999865 222222221 11111 1111111 1
Q ss_pred CCCCcceEEEcCCC------CcEEEEccCCCCeEEEEecCCCCcEE---EEeCC-CCCCeeEEEeCCCCeEEEEcC-CCC
Q psy6570 70 GLNEPYDIALEPLS------GRMFWTELGIKPRISGASIDGKNKFN---LVDNN-IQWPTGITIDYPSQRLYWADP-KAR 138 (713)
Q Consensus 70 ~~~~p~~iavD~~~------~~ly~td~~~~~~I~~~~~dG~~~~~---l~~~~-~~~p~glavd~~~~~LY~~d~-~~~ 138 (713)
....|.+|++=.-+ +.||++..+.. .+.+.+++|..+.+ ++... ..+|.+|++.+ ++.||++|- ..+
T Consensus 312 ~h~ApsGmaFy~G~~fP~~r~~lfV~~hgsw-~~~~~~~~g~~~~~~~~fl~~d~~gR~~dV~v~~-DGallv~~D~~~g 389 (399)
T COG2133 312 PHIAPSGMAFYTGDLFPAYRGDLFVGAHGSW-PVLRLRPDGNYKVVLTGFLSGDLGGRPRDVAVAP-DGALLVLTDQGDG 389 (399)
T ss_pred cccccceeEEecCCcCccccCcEEEEeecce-eEEEeccCCCcceEEEEEEecCCCCcccceEECC-CCeEEEeecCCCC
Confidence 13446788884211 57888887766 67888999884333 33322 25899999996 456666654 577
Q ss_pred cEEEEeCCC
Q psy6570 139 TIESINLNG 147 (713)
Q Consensus 139 ~I~~~~~~g 147 (713)
+|+|+.+++
T Consensus 390 ~i~Rv~~~~ 398 (399)
T COG2133 390 RILRVSYAG 398 (399)
T ss_pred eEEEecCCC
Confidence 999998765
No 104
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=97.19 E-value=0.023 Score=59.02 Aligned_cols=126 Identities=13% Similarity=0.115 Sum_probs=92.6
Q ss_pred CCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCC
Q psy6570 49 SSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQ 128 (713)
Q Consensus 49 ~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~ 128 (713)
....|.+++.+|...+.+. .++....+|.+++....+.+++. +..||++++|....+.+-.+.-....++++++...
T Consensus 380 dgD~l~iyd~~~~e~kr~e-~~lg~I~av~vs~dGK~~vvaNd--r~el~vididngnv~~idkS~~~lItdf~~~~nsr 456 (668)
T COG4946 380 DGDKLGIYDKDGGEVKRIE-KDLGNIEAVKVSPDGKKVVVAND--RFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSR 456 (668)
T ss_pred CCceEEEEecCCceEEEee-CCccceEEEEEcCCCcEEEEEcC--ceEEEEEEecCCCeeEecccccceeEEEEEcCCce
Confidence 4568999999998877655 67899999999986666777663 33799999998887777666667788999998888
Q ss_pred eEEEEcC---CCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeC
Q psy6570 129 RLYWADP---KARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTY 178 (713)
Q Consensus 129 ~LY~~d~---~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~ 178 (713)
+|-++-+ .+..|..++++|...-.+.+... ....-+++.++.+||....
T Consensus 457 ~iAYafP~gy~tq~Iklydm~~~Kiy~vTT~ta-~DfsPaFD~d~ryLYfLs~ 508 (668)
T COG4946 457 WIAYAFPEGYYTQSIKLYDMDGGKIYDVTTPTA-YDFSPAFDPDGRYLYFLSA 508 (668)
T ss_pred eEEEecCcceeeeeEEEEecCCCeEEEecCCcc-cccCcccCCCCcEEEEEec
Confidence 8877753 45678999999976655544322 2222345566788988643
No 105
>KOG1218|consensus
Probab=97.15 E-value=0.068 Score=55.78 Aligned_cols=111 Identities=31% Similarity=0.733 Sum_probs=59.2
Q ss_pred CCCCcCCCCc-cCCCCCCCCCCeeecCCCCCeecCCCCccCCCCCCccccCCCCCCcccCCCCccCCCCCCCCCCCCcEE
Q psy6570 441 PKFYYGKNCQ-YSQCKNYCVNGECSITDSGPKCMCSPGYSGKKCDTCTCLNGDSGPKCMCSPGYSGKKCDTCTCLNGGTC 519 (713)
Q Consensus 441 ~~G~~g~~C~-~~~C~~~~~~~~C~~~~~~~~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~G~~g~~C~~~~C~~~g~C 519 (713)
..+|.|..|+ ..+|...+...+|.+... .|.+..+|.+..|.. +++.|..|... |.+...+
T Consensus 96 ~~~~~g~~C~~~~~~~~~c~~~~C~~~~~--~c~~~~~~~~~~C~~---------------~~~~g~~C~~~-c~~~~~~ 157 (316)
T KOG1218|consen 96 LNGYEGPQCESPCPCGDGCAEKTCANPRR--ECRCGGGYIGEQCGE---------------ENLVGLKCQRD-CQCTGGC 157 (316)
T ss_pred CCCCCcccccCCCCcCCcccccccCCCcc--ceecCCcCccccccc---------------cCCCCCCccCC-CCCcccc
Confidence 4677777776 233332211123333322 467777777766543 35666666542 2121222
Q ss_pred cCCCCCeeccCCCCCCCCCCcccCCCCCCCCCCCCCC-CCCCCCCcEEeecCCCceeeCCCCCc
Q psy6570 520 IPNSKNNVCKCPSQYTGRRCECAVGDTSCASLANKCT-PNYCSNNGTCVLIEGKPSCKCLPPYS 582 (713)
Q Consensus 520 ~~~~~~~~C~C~~g~~G~~C~~~~~~~~c~~~~~~C~-~~~C~~~~~C~~~~g~~~C~C~~G~~ 582 (713)
.. ....|.|.+||.|..|+... ..|. ...|.+++.|....+ .+.+.+++.
T Consensus 158 ~~--~~~~c~c~~g~~g~~~~~~~---------~~c~~~~~~~~g~~C~~~~~--~~~~~~~~~ 208 (316)
T KOG1218|consen 158 DC--KNGICTCQPGFVGVFCVESC---------SGCSPLTACENGAKCNRSTG--SCLCYPGPS 208 (316)
T ss_pred CC--CCCceeccCCcccccccccC---------CCcCCCcccCCCCeeecccc--ccccCCCCc
Confidence 21 24578899999999887542 1133 234666668876654 355555554
No 106
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=97.09 E-value=0.033 Score=55.81 Aligned_cols=184 Identities=11% Similarity=0.057 Sum_probs=106.2
Q ss_pred ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEE----EE----cCCCCCcceEEEcCCCCcEEEEccCC
Q psy6570 22 LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRT----LL----NTGLNEPYDIALEPLSGRMFWTELGI 93 (713)
Q Consensus 22 ~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~----l~----~~~~~~p~~iavD~~~~~ly~td~~~ 93 (713)
.++.-..+.||+ .++.+++.+. .-.-+...+.+-+.+-. ++ .++-=+-+|||++ ...--|+|-.+.
T Consensus 99 ~TGdidiHdia~--~~~~l~fVNT---~fSCLatl~~~~SF~P~WkPpFIs~la~eDRCHLNGlA~~-~g~p~yVTa~~~ 172 (335)
T TIGR03032 99 VTGDIDAHDLAL--GAGRLLFVNT---LFSCLATVSPDYSFVPLWKPPFISKLAPEDRCHLNGMALD-DGEPRYVTALSQ 172 (335)
T ss_pred eccCcchhheee--cCCcEEEEEC---cceeEEEECCCCccccccCCccccccCccCceeecceeee-CCeEEEEEEeec
Confidence 445555666666 3556666665 44445555444332210 11 1111235689997 355567775432
Q ss_pred C--CeEEEEec-CCCCcE-----EEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcccee
Q psy6570 94 K--PRISGASI-DGKNKF-----NLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYK 165 (713)
Q Consensus 94 ~--~~I~~~~~-dG~~~~-----~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~ 165 (713)
. +.-||-.. +|.... .++.+++..|.+.-+. +++||+.|++.++|.+++.+....+.+... ...|.|
T Consensus 173 sD~~~gWR~~~~~gG~vidv~s~evl~~GLsmPhSPRWh--dgrLwvldsgtGev~~vD~~~G~~e~Va~v---pG~~rG 247 (335)
T TIGR03032 173 SDVADGWREGRRDGGCVIDIPSGEVVASGLSMPHSPRWY--QGKLWLLNSGRGELGYVDPQAGKFQPVAFL---PGFTRG 247 (335)
T ss_pred cCCcccccccccCCeEEEEeCCCCEEEcCccCCcCCcEe--CCeEEEEECCCCEEEEEcCCCCcEEEEEEC---CCCCcc
Confidence 1 12333222 221111 1233477888888888 899999999999999999984444455444 347889
Q ss_pred eeeeCCeEEEEeCCC-------------------CcEEEEcccCCCcceee--eccccccccEEEEeecccc
Q psy6570 166 LEVFEDNLYFSTYRT-------------------NNILKINKFGNSDFNVL--ANNLNRASDVLILQENKQA 216 (713)
Q Consensus 166 i~~~~~~ly~td~~~-------------------~~i~~~~~~~~~~~~~~--~~~~~~~~~i~v~~~~~q~ 216 (713)
|++.++.+++.-+.. -.|+.++..++..+..+ ...+...+++++....+++
T Consensus 248 L~f~G~llvVgmSk~R~~~~f~glpl~~~l~~~~CGv~vidl~tG~vv~~l~feg~v~EifdV~vLPg~r~P 319 (335)
T TIGR03032 248 LAFAGDFAFVGLSKLRESRVFGGLPIEERLDALGCGVAVIDLNSGDVVHWLRFEGVIEEIYDVAVLPGVRRP 319 (335)
T ss_pred cceeCCEEEEEeccccCCCCcCCCchhhhhhhhcccEEEEECCCCCEEEEEEeCCceeEEEEEEEecCCCCc
Confidence 998888777764331 12566666666655543 2345666677776666653
No 107
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=97.08 E-value=0.00032 Score=46.20 Aligned_cols=27 Identities=33% Similarity=0.603 Sum_probs=22.2
Q ss_pred CCCCCCeeeccCCCceeeeCCCCcccc
Q psy6570 227 PCHQSALCINLPSSHTCLCPDHLTEEL 253 (713)
Q Consensus 227 ~C~~~~~C~~~~g~~~C~C~~G~~~~~ 253 (713)
.|+.++.|++++++|+|.|++||.|+.
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~GdG 33 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGDG 33 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECCS
T ss_pred CCCCCcEeecCCCCEEeECCCCCccCC
Confidence 699999999999999999999999764
No 108
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=97.05 E-value=0.053 Score=54.36 Aligned_cols=180 Identities=14% Similarity=0.105 Sum_probs=105.4
Q ss_pred CcccCCceeEEEc--cCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEE----------ecCCceEEEEE--
Q psy6570 2 ASISSGNVTRVKR--EMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVS----------TLEGRKKRTLL-- 67 (713)
Q Consensus 2 ad~~~~~I~~~~~--~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~----------~~~G~~~~~l~-- 67 (713)
+....++++.+.. ++...+....+.+|-||++. .++||++-. ..|+++ ...+......+
T Consensus 23 sTYQagkL~~ig~~~~g~l~~~~r~F~r~MGl~~~--~~~l~~~t~-----~qiw~f~~~~n~l~~~~~~~~~D~~yvPr 95 (335)
T TIGR03032 23 TTYQAGKLFFIGLQPNGELDVFERTFPRPMGLAVS--PQSLTLGTR-----YQLWRFANVDNLLPAGQTHPGYDRLYVPR 95 (335)
T ss_pred EeeecceEEEEEeCCCCcEEEEeeccCccceeeee--CCeEEEEEc-----ceeEEcccccccccccccCCCCCeEEeee
Confidence 4566788887754 45555667889999999996 578887754 445554 11222222111
Q ss_pred ---cCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE--------EEEeCCCCCCeeEEEeCCCCeEEEEcC-
Q psy6570 68 ---NTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKF--------NLVDNNIQWPTGITIDYPSQRLYWADP- 135 (713)
Q Consensus 68 ---~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~--------~l~~~~~~~p~glavd~~~~~LY~~d~- 135 (713)
..+.-..++|++ .++.|++.+..-. -|-+.+.+-+.+- .++..+-=.-||||++ +++--|++--
T Consensus 96 ~~~~TGdidiHdia~--~~~~l~fVNT~fS-CLatl~~~~SF~P~WkPpFIs~la~eDRCHLNGlA~~-~g~p~yVTa~~ 171 (335)
T TIGR03032 96 ASYVTGDIDAHDLAL--GAGRLLFVNTLFS-CLATVSPDYSFVPLWKPPFISKLAPEDRCHLNGMALD-DGEPRYVTALS 171 (335)
T ss_pred eeeeccCcchhheee--cCCcEEEEECcce-eEEEECCCCccccccCCccccccCccCceeecceeee-CCeEEEEEEee
Confidence 123455778888 3556666554333 4555555443221 1111222245899998 4556676641
Q ss_pred --CCCcEEEEe-CCCC------ceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcce
Q psy6570 136 --KARTIESIN-LNGK------DRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDFN 195 (713)
Q Consensus 136 --~~~~I~~~~-~~g~------~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~ 195 (713)
....=||-+ .+|. .-+++.+. +..|.+-..++++||+.|++.+.|.++++..+....
T Consensus 172 ~sD~~~gWR~~~~~gG~vidv~s~evl~~G---LsmPhSPRWhdgrLwvldsgtGev~~vD~~~G~~e~ 237 (335)
T TIGR03032 172 QSDVADGWREGRRDGGCVIDIPSGEVVASG---LSMPHSPRWYQGKLWLLNSGRGELGYVDPQAGKFQP 237 (335)
T ss_pred ccCCcccccccccCCeEEEEeCCCCEEEcC---ccCCcCCcEeCCeEEEEECCCCEEEEEcCCCCcEEE
Confidence 111222222 2232 12334443 678888889999999999999999999987554433
No 109
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=97.02 E-value=0.00097 Score=44.73 Aligned_cols=34 Identities=44% Similarity=1.097 Sum_probs=23.9
Q ss_pred CCCCC-CCCCCCcEEeecCCCceeeCCCCCcCCCC
Q psy6570 553 NKCTP-NYCSNNGTCVLIEGKPSCKCLPPYSGKQC 586 (713)
Q Consensus 553 ~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~G~~C 586 (713)
++|.. .+|.+++.|.+..+.|.|.|++||.|..|
T Consensus 3 ~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g~~C 37 (38)
T cd00054 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37 (38)
T ss_pred ccCCCCCCcCCCCEeECCCCCeEeECCCCCcCCcC
Confidence 45554 56766777777777777777777777665
No 110
>PRK01742 tolB translocation protein TolB; Provisional
Probab=96.98 E-value=0.1 Score=57.05 Aligned_cols=162 Identities=12% Similarity=0.088 Sum_probs=96.6
Q ss_pred CceeEEEccCcccEEecCC-CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLKTVLSNL-HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~-~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|+.+++.+.....+..+ ..-..+++.+.++.|+++-... ..-.|+++++++...+.+... ......+++.|....
T Consensus 228 ~~i~i~dl~tg~~~~l~~~~g~~~~~~wSPDG~~La~~~~~~-g~~~Iy~~d~~~~~~~~lt~~-~~~~~~~~wSpDG~~ 305 (429)
T PRK01742 228 SQLVVHDLRSGARKVVASFRGHNGAPAFSPDGSRLAFASSKD-GVLNIYVMGANGGTPSQLTSG-AGNNTEPSWSPDGQS 305 (429)
T ss_pred cEEEEEeCCCCceEEEecCCCccCceeECCCCCEEEEEEecC-CcEEEEEEECCCCCeEeeccC-CCCcCCEEECCCCCE
Confidence 3577777765443222222 2234678888888888864310 234688888887766655533 234567888887776
Q ss_pred EEEE-ccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccce
Q psy6570 86 MFWT-ELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPY 164 (713)
Q Consensus 86 ly~t-d~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~ 164 (713)
|+++ +.....+|++++.++...+.+ ... . ..+++.+++++|+++.. ..|.++++.+...+.+.... .....
T Consensus 306 i~f~s~~~g~~~I~~~~~~~~~~~~l-~~~-~--~~~~~SpDG~~ia~~~~--~~i~~~Dl~~g~~~~lt~~~--~~~~~ 377 (429)
T PRK01742 306 ILFTSDRSGSPQVYRMSASGGGASLV-GGR-G--YSAQISADGKTLVMING--DNVVKQDLTSGSTEVLSSTF--LDESP 377 (429)
T ss_pred EEEEECCCCCceEEEEECCCCCeEEe-cCC-C--CCccCCCCCCEEEEEcC--CCEEEEECCCCCeEEecCCC--CCCCc
Confidence 7765 333445899998887765444 211 1 34678888888888754 56778888765544443221 11222
Q ss_pred eeeeeCCeEEEEeC
Q psy6570 165 KLEVFEDNLYFSTY 178 (713)
Q Consensus 165 ~i~~~~~~ly~td~ 178 (713)
.++.++..|+.+..
T Consensus 378 ~~sPdG~~i~~~s~ 391 (429)
T PRK01742 378 SISPNGIMIIYSST 391 (429)
T ss_pred eECCCCCEEEEEEc
Confidence 34445666666643
No 111
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=96.98 E-value=0.00087 Score=45.40 Aligned_cols=31 Identities=32% Similarity=0.847 Sum_probs=28.1
Q ss_pred cCCCCC-CCCCCCCeeeccCCCceeeeCCCCc
Q psy6570 220 TNHCDD-KPCHQSALCINLPSSHTCLCPDHLT 250 (713)
Q Consensus 220 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~ 250 (713)
+++|.. .+|.+++.|++..++|+|.|++||.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~ 33 (39)
T smart00179 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33 (39)
T ss_pred cccCcCCCCcCCCCEeECCCCCeEeECCCCCc
Confidence 578887 7998888999999999999999997
No 112
>KOG1218|consensus
Probab=96.97 E-value=0.066 Score=55.88 Aligned_cols=146 Identities=31% Similarity=0.660 Sum_probs=77.6
Q ss_pred CCeeecCCCcccCCccccC---CCCCCCCCCceeeCCCCCCCCCceeeC-CCCcccCCCCccC-CCCCCCCCCEEEcCCC
Q psy6570 328 DPHCICQENFYGTYCEKVN---NSMCPCLNQGMCYPDLTHPEPTYKCHC-APSYTGARCESRI-CENKCHNGGTCIATTQ 402 (713)
Q Consensus 328 ~~~C~C~~g~~G~~C~~~~---c~~~~C~~~~~C~~~~~~~~~~~~C~C-~~G~~g~~C~~~~-C~~~C~~~~~C~~~~~ 402 (713)
..+|.+..+|.|..|.... .....|.....|....... .+...| ..+|.|..|+... |...|.. .+|.+...
T Consensus 48 ~~~~~~~~~~~~~~c~~~~~~~~~~~~c~~~~~c~~~~~~~--~~~~~~~~~~~~g~~C~~~~~~~~~c~~-~~C~~~~~ 124 (316)
T KOG1218|consen 48 SGECGLGYGFVGSVCRIECVCGNAGGGCSQPCRCKNGGTCV--SSTGYCHLNGYEGPQCESPCPCGDGCAE-KTCANPRR 124 (316)
T ss_pred ceeEecccccCCCccccccccCCCCCcccCccccCCCCccc--CCCCcccCCCCCcccccCCCCcCCcccc-cccCCCcc
Confidence 3578888899888876543 1222333333443332211 223345 6888888887653 3322333 45665333
Q ss_pred eeeCCCCCccCCCCc--CCCCCCC----CCcEeccCCCCccccCCCCCcCCCCccCC--CCCC--CCC-CeeecCCCCCe
Q psy6570 403 TCVCPPGFTGDTCQQ--CLNLKCQ----NGGVCVNKTTGLECDCPKFYYGKNCQYSQ--CKNY--CVN-GECSITDSGPK 471 (713)
Q Consensus 403 ~C~C~~g~~g~~C~~--C~~~~C~----~~~~C~~~~~~~~C~C~~G~~g~~C~~~~--C~~~--~~~-~~C~~~~~~~~ 471 (713)
.|.+..+|.+..|.+ -....|. +...+.... -.|.|.+||.|..|+... |... +.+ +.|....+ .
T Consensus 125 ~c~~~~~~~~~~C~~~~~~g~~C~~~c~~~~~~~~~~--~~c~c~~g~~g~~~~~~~~~c~~~~~~~~g~~C~~~~~--~ 200 (316)
T KOG1218|consen 125 ECRCGGGYIGEQCGEENLVGLKCQRDCQCTGGCDCKN--GICTCQPGFVGVFCVESCSGCSPLTACENGAKCNRSTG--S 200 (316)
T ss_pred ceecCCcCccccccccCCCCCCccCCCCCccccCCCC--CceeccCCcccccccccCCCcCCCcccCCCCeeecccc--c
Confidence 677888887776653 1111122 111222111 268899999988886332 4433 444 47766655 4
Q ss_pred ecCCCCccC
Q psy6570 472 CMCSPGYSG 480 (713)
Q Consensus 472 C~C~~G~~g 480 (713)
+.+.+++.+
T Consensus 201 ~~~~~~~~~ 209 (316)
T KOG1218|consen 201 CLCYPGPSG 209 (316)
T ss_pred cccCCCCcc
Confidence 555555543
No 113
>KOG1446|consensus
Probab=96.95 E-value=0.26 Score=48.97 Aligned_cols=185 Identities=14% Similarity=0.104 Sum_probs=114.4
Q ss_pred CCceeEEEccCcccEE-ecC-CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 6 SGNVTRVKREMNLKTV-LSN-LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~-~~~-~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
+..|+-+++..+.-+- ..+ -....+|.+.|.+ ..|++-+ ..+.|...|+.-..-..++. +..+--.|+|| .
T Consensus 79 d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~-d~FlS~S---~D~tvrLWDlR~~~cqg~l~--~~~~pi~AfDp-~ 151 (311)
T KOG1446|consen 79 DDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKD-DTFLSSS---LDKTVRLWDLRVKKCQGLLN--LSGRPIAAFDP-E 151 (311)
T ss_pred CCceEEEEeecCceEEEcCCCCceEEEEEecCCC-CeEEecc---cCCeEEeeEecCCCCceEEe--cCCCcceeECC-C
Confidence 4566666666654222 333 3446788898885 6777776 56788888888655555553 34566789997 6
Q ss_pred CcEEEEccCCCCeEEEEecC---CCCcEEEEe--CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEe-CCCCceeEEEecC
Q psy6570 84 GRMFWTELGIKPRISGASID---GKNKFNLVD--NNIQWPTGITIDYPSQRLYWADPKARTIESIN-LNGKDRFVVYHTE 157 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~d---G~~~~~l~~--~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~-~~g~~~~~~~~~~ 157 (713)
|.||.+-.+.+ .|...++. ....+++.. .....-+.|.+.+.++.|.++.. .+.|+.++ ++|.....+....
T Consensus 152 GLifA~~~~~~-~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT~-~s~~~~lDAf~G~~~~tfs~~~ 229 (311)
T KOG1446|consen 152 GLIFALANGSE-LIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLSTN-ASFIYLLDAFDGTVKSTFSGYP 229 (311)
T ss_pred CcEEEEecCCC-eEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEEeC-CCcEEEEEccCCcEeeeEeecc
Confidence 88888877766 66665543 333333322 23445589999988888877764 45566666 5777555554433
Q ss_pred CCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceeeec
Q psy6570 158 DNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVLAN 199 (713)
Q Consensus 158 ~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~~~ 199 (713)
.....|....+. ++...++-...++|...+...+.++..+..
T Consensus 230 ~~~~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~~v~~~~~ 272 (311)
T KOG1446|consen 230 NAGNLPLSATFTPDSKFVLSGSDDGTIHVWNLETGKKVAVLRG 272 (311)
T ss_pred CCCCcceeEEECCCCcEEEEecCCCcEEEEEcCCCcEeeEecC
Confidence 333344333332 455555556778888887766666555544
No 114
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=96.95 E-value=0.0015 Score=40.44 Aligned_cols=28 Identities=36% Similarity=0.605 Sum_probs=24.3
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEE
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVS 56 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~ 56 (713)
+..|.|||+| .++.||++|. .+++|.++
T Consensus 1 f~~P~gvav~-~~g~i~VaD~---~n~rV~vf 28 (28)
T PF01436_consen 1 FNYPHGVAVD-SDGNIYVADS---GNHRVQVF 28 (28)
T ss_dssp BSSEEEEEEE-TTSEEEEEEC---CCTEEEEE
T ss_pred CcCCcEEEEe-CCCCEEEEEC---CCCEEEEC
Confidence 4689999999 6799999999 78888864
No 115
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=96.93 E-value=0.00049 Score=45.18 Aligned_cols=21 Identities=38% Similarity=0.838 Sum_probs=19.0
Q ss_pred CCeeeccCCCceeeeCCCCcc
Q psy6570 231 SALCINLPSSHTCLCPDHLTE 251 (713)
Q Consensus 231 ~~~C~~~~g~~~C~C~~G~~~ 251 (713)
+++|++.+++|+|.|++||..
T Consensus 9 ~h~C~~~~g~~~C~C~~Gy~L 29 (36)
T PF14670_consen 9 SHICVNTPGSYRCSCPPGYKL 29 (36)
T ss_dssp SSEEEEETTSEEEE-STTEEE
T ss_pred CCCCccCCCceEeECCCCCEE
Confidence 899999999999999999984
No 116
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=96.89 E-value=0.06 Score=56.36 Aligned_cols=61 Identities=25% Similarity=0.362 Sum_probs=45.9
Q ss_pred CcceEEEcCCCCcEEEEccCC-----CCeEEEEecCCCCcEEE-EeCC-------------CCCCeeEEEeCCCCeEEEE
Q psy6570 73 EPYDIALEPLSGRMFWTELGI-----KPRISGASIDGKNKFNL-VDNN-------------IQWPTGITIDYPSQRLYWA 133 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~~~-----~~~I~~~~~dG~~~~~l-~~~~-------------~~~p~glavd~~~~~LY~~ 133 (713)
.+.+|++ +.++.+||++.+. .++|++++++|...+.+ +... -....+||+.+++.+||.+
T Consensus 86 D~Egi~~-~~~g~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~ 164 (326)
T PF13449_consen 86 DPEGIAV-PPDGSFWISSEGGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAA 164 (326)
T ss_pred ChhHeEE-ecCCCEEEEeCCccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEE
Confidence 5669999 5789999999876 14999999999886665 2221 1234689999888878887
Q ss_pred c
Q psy6570 134 D 134 (713)
Q Consensus 134 d 134 (713)
-
T Consensus 165 ~ 165 (326)
T PF13449_consen 165 M 165 (326)
T ss_pred E
Confidence 4
No 117
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=96.71 E-value=0.13 Score=53.80 Aligned_cols=182 Identities=22% Similarity=0.303 Sum_probs=91.4
Q ss_pred CCceeEEEccCcccEEecCCCCCceEEEec----cCCe---EEEeecCCCCCCeEEEEecCC--ceEEEEE------cCC
Q psy6570 6 SGNVTRVKREMNLKTVLSNLHDPRGVAVDW----VGKN---LYWTDAGGRSSNNIMVSTLEG--RKKRTLL------NTG 70 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~~p~gla~D~----~~~~---ly~td~~~~~~~~I~~~~~~G--~~~~~l~------~~~ 70 (713)
.+-++.++++|+....+. ..+|..+.+-. .++. +..++.. ...++|..+.++. ...+.+. ...
T Consensus 77 ~~GL~VYdL~Gk~lq~~~-~Gr~NNVDvrygf~l~g~~vDlavas~R~-~g~n~l~~f~id~~~g~L~~v~~~~~p~~~~ 154 (381)
T PF02333_consen 77 KGGLYVYDLDGKELQSLP-VGRPNNVDVRYGFPLNGKTVDLAVASDRS-DGRNSLRLFRIDPDTGELTDVTDPAAPIATD 154 (381)
T ss_dssp TTEEEEEETTS-EEEEE--SS-EEEEEEEEEEEETTEEEEEEEEEE-C-CCT-EEEEEEEETTTTEEEE-CBTTC-EE-S
T ss_pred CCCEEEEcCCCcEEEeec-CCCcceeeeecceecCCceEEEEEEecCc-CCCCeEEEEEecCCCCcceEcCCCCcccccc
Confidence 445777888886533332 34554443321 1121 2334431 0124544444432 2333332 134
Q ss_pred CCCcceEEE--cCCCCcEEEEccCCCCeEEEEec--CCCC--cEEEEe--CCCCCCeeEEEeCCCCeEEEEcCCCCcEEE
Q psy6570 71 LNEPYDIAL--EPLSGRMFWTELGIKPRISGASI--DGKN--KFNLVD--NNIQWPTGITIDYPSQRLYWADPKARTIES 142 (713)
Q Consensus 71 ~~~p~~iav--D~~~~~ly~td~~~~~~I~~~~~--dG~~--~~~l~~--~~~~~p~glavd~~~~~LY~~d~~~~~I~~ 142 (713)
+..|.||++ ++.++.+|..-.+..+.++...+ ++.. .-+++. ..-..|.|+++|...++||+++.. .-||+
T Consensus 155 ~~e~yGlcly~~~~~g~~ya~v~~k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~-~GIW~ 233 (381)
T PF02333_consen 155 LSEPYGLCLYRSPSTGALYAFVNGKDGRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEED-VGIWR 233 (381)
T ss_dssp SSSEEEEEEEE-TTT--EEEEEEETTSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETT-TEEEE
T ss_pred cccceeeEEeecCCCCcEEEEEecCCceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCc-cEEEE
Confidence 566888887 56677777654333335554433 3433 222331 123478999999999999999976 46999
Q ss_pred EeCC---CCceeEEEec-CC-CCccceeeeee-----CCeEEEEeCCCCcEEEEcccC
Q psy6570 143 INLN---GKDRFVVYHT-ED-NGYKPYKLEVF-----EDNLYFSTYRTNNILKINKFG 190 (713)
Q Consensus 143 ~~~~---g~~~~~~~~~-~~-~~~~p~~i~~~-----~~~ly~td~~~~~i~~~~~~~ 190 (713)
++.+ +..++.+... .. ......||+++ .++|.+++.+.+....++..+
T Consensus 234 y~Aep~~~~~~~~v~~~~g~~l~aDvEGlaly~~~~g~gYLivSsQG~~sf~Vy~r~~ 291 (381)
T PF02333_consen 234 YDAEPEGGNDRTLVASADGDGLVADVEGLALYYGSDGKGYLIVSSQGDNSFAVYDREG 291 (381)
T ss_dssp EESSCCC-S--EEEEEBSSSSB-S-EEEEEEEE-CCC-EEEEEEEGGGTEEEEEESST
T ss_pred EecCCCCCCcceeeecccccccccCccceEEEecCCCCeEEEEEcCCCCeEEEEecCC
Confidence 9976 3444555332 11 22355677764 357888888877766666543
No 118
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=96.65 E-value=0.061 Score=56.01 Aligned_cols=123 Identities=14% Similarity=0.165 Sum_probs=93.6
Q ss_pred CCceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCC
Q psy6570 6 SGNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSG 84 (713)
Q Consensus 6 ~~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~ 84 (713)
.+.|-.++.++.+ +.+..++.....+++++.+..+.+++. ...|++++++....+.+-.+.-....++++.|..+
T Consensus 381 gD~l~iyd~~~~e~kr~e~~lg~I~av~vs~dGK~~vvaNd----r~el~vididngnv~~idkS~~~lItdf~~~~nsr 456 (668)
T COG4946 381 GDKLGIYDKDGGEVKRIEKDLGNIEAVKVSPDGKKVVVAND----RFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSR 456 (668)
T ss_pred CceEEEEecCCceEEEeeCCccceEEEEEcCCCcEEEEEcC----ceEEEEEEecCCCeeEecccccceeEEEEEcCCce
Confidence 3467777888755 677889999999999988777887775 68899999998888887777677788999998877
Q ss_pred cEEEEccC---CCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc
Q psy6570 85 RMFWTELG---IKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 85 ~ly~td~~---~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d 134 (713)
+|-++--. +. .|...+|+|...-.+.. ...+-..-|+|+++..||+-.
T Consensus 457 ~iAYafP~gy~tq-~Iklydm~~~Kiy~vTT-~ta~DfsPaFD~d~ryLYfLs 507 (668)
T COG4946 457 WIAYAFPEGYYTQ-SIKLYDMDGGKIYDVTT-PTAYDFSPAFDPDGRYLYFLS 507 (668)
T ss_pred eEEEecCcceeee-eEEEEecCCCeEEEecC-CcccccCcccCCCCcEEEEEe
Confidence 77555321 23 78899999865544432 334445679999999999974
No 119
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.63 E-value=0.0029 Score=41.72 Aligned_cols=30 Identities=50% Similarity=1.059 Sum_probs=21.5
Q ss_pred CCCCCCCcEEeecCCCceeeCCCCCcCC-CC
Q psy6570 557 PNYCSNNGTCVLIEGKPSCKCLPPYSGK-QC 586 (713)
Q Consensus 557 ~~~C~~~~~C~~~~g~~~C~C~~G~~G~-~C 586 (713)
..+|.+++.|++..+.|+|.|+.||.|. .|
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~~~C 35 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCcccCCc
Confidence 3456667778777777788888888776 44
No 120
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.61 E-value=0.0029 Score=41.60 Aligned_cols=28 Identities=46% Similarity=1.096 Sum_probs=20.2
Q ss_pred CCCCCCcEEeecCCCceeeCCCCCcC-CCC
Q psy6570 558 NYCSNNGTCVLIEGKPSCKCLPPYSG-KQC 586 (713)
Q Consensus 558 ~~C~~~~~C~~~~g~~~C~C~~G~~G-~~C 586 (713)
.+|.++ +|++..++|+|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 456666 7777777778888888877 554
No 121
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=96.49 E-value=0.0032 Score=42.13 Aligned_cols=32 Identities=31% Similarity=0.819 Sum_probs=28.5
Q ss_pred cCCCCC-CCCCCCCeeeccCCCceeeeCCCCcc
Q psy6570 220 TNHCDD-KPCHQSALCINLPSSHTCLCPDHLTE 251 (713)
Q Consensus 220 ~~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~~ 251 (713)
+++|.. .+|.+++.|++.+++|+|.|++||.|
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~g 34 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCcC
Confidence 477887 78988899999999999999999985
No 122
>PRK02888 nitrous-oxide reductase; Validated
Probab=96.47 E-value=0.12 Score=57.28 Aligned_cols=138 Identities=12% Similarity=0.011 Sum_probs=86.3
Q ss_pred CCceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCC-----------------------------------
Q psy6570 6 SGNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRS----------------------------------- 49 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~----------------------------------- 49 (713)
.+.+..||.++.+..- +.-..+|+.+++++.++.+|++......
T Consensus 214 ~~~vSvID~etmeV~~qV~Vdgnpd~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d~~vvfni~~iea~vkdGK~~~V 293 (635)
T PRK02888 214 RSLFTAVDAETMEVAWQVMVDGNLDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERDWVVVFNIARIEEAVKAGKFKTI 293 (635)
T ss_pred eEEEEEEECccceEEEEEEeCCCcccceECCCCCEEEEeccCcccCcceeeeccccCceEEEEchHHHHHhhhCCCEEEE
Confidence 4556677777532111 3334589999999998899988521100
Q ss_pred -CCeEEEEecCC-----ceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC---------cEEEEe-C
Q psy6570 50 -SNNIMVSTLEG-----RKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN---------KFNLVD-N 113 (713)
Q Consensus 50 -~~~I~~~~~~G-----~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~---------~~~l~~-~ 113 (713)
.++|.++|... ......+..+ ..|+||+++|...+||++..... .+.++++.... +.+++. .
T Consensus 294 ~gn~V~VID~~t~~~~~~~v~~yIPVG-KsPHGV~vSPDGkylyVanklS~-tVSVIDv~k~k~~~~~~~~~~~~vvaev 371 (635)
T PRK02888 294 GGSKVPVVDGRKAANAGSALTRYVPVP-KNPHGVNTSPDGKYFIANGKLSP-TVTVIDVRKLDDLFDGKIKPRDAVVAEP 371 (635)
T ss_pred CCCEEEEEECCccccCCcceEEEEECC-CCccceEECCCCCEEEEeCCCCC-cEEEEEChhhhhhhhccCCccceEEEee
Confidence 12344444332 1122223233 78999999999999999987666 78887775422 122322 1
Q ss_pred C-CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC
Q psy6570 114 N-IQWPTGITIDYPSQRLYWADPKARTIESINLN 146 (713)
Q Consensus 114 ~-~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~ 146 (713)
. -..|...+||.. ++.|.+-.-...|.+.+++
T Consensus 372 evGlGPLHTaFDg~-G~aytslf~dsqv~kwn~~ 404 (635)
T PRK02888 372 ELGLGPLHTAFDGR-GNAYTTLFLDSQIVKWNIE 404 (635)
T ss_pred ccCCCcceEEECCC-CCEEEeEeecceeEEEehH
Confidence 1 346889999954 6799987766777777754
No 123
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=96.42 E-value=0.34 Score=49.65 Aligned_cols=182 Identities=15% Similarity=0.139 Sum_probs=95.7
Q ss_pred CCceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC--CceEEEEEc--CCCCCcc--eEEE
Q psy6570 6 SGNVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE--GRKKRTLLN--TGLNEPY--DIAL 79 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~--G~~~~~l~~--~~~~~p~--~iav 79 (713)
..+|..+|+..++ ++..+.-|.-.-+=|..+.=|.+-- ..+++..+.+| |+..+.... .....|. .=++
T Consensus 117 a~SVtVVDl~~~k--vv~ei~~PGC~~iyP~~~~~F~~lC---~DGsl~~v~Ld~~Gk~~~~~t~~F~~~~dp~f~~~~~ 191 (342)
T PF06433_consen 117 ATSVTVVDLAAKK--VVGEIDTPGCWLIYPSGNRGFSMLC---GDGSLLTVTLDADGKEAQKSTKVFDPDDDPLFEHPAY 191 (342)
T ss_dssp SEEEEEEETTTTE--EEEEEEGTSEEEEEEEETTEEEEEE---TTSCEEEEEETSTSSEEEEEEEESSTTTS-B-S--EE
T ss_pred CCeEEEEECCCCc--eeeeecCCCEEEEEecCCCceEEEe---cCCceEEEEECCCCCEeEeeccccCCCCcccccccce
Confidence 3345555555432 2233444443322222222233444 45677766665 554322211 1112221 1233
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCC---------CCCCe---eEEEeCCCCeEEEEc----CCCC-----
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNN---------IQWPT---GITIDYPSQRLYWAD----PKAR----- 138 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~---------~~~p~---glavd~~~~~LY~~d----~~~~----- 138 (713)
+..++++||.... +.|+.+++.|...+....-. -.+|. -+|+++..++||+.. .+++
T Consensus 192 ~~~~~~~~F~Sy~--G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvLMh~g~~gsHKdpgt 269 (342)
T PF06433_consen 192 SRDGGRLYFVSYE--GNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVLMHQGGEGSHKDPGT 269 (342)
T ss_dssp ETTTTEEEEEBTT--SEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEEEEE--TT-TTS-EE
T ss_pred ECCCCeEEEEecC--CEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEEecCCCCCCccCCce
Confidence 3456678887653 37999999998755443211 12343 389999999999974 1222
Q ss_pred cEEEEeCCCCceeEEEecCCCCccc-eeeeeeC--C-eEEEEeCCCCcEEEEcccCCCcceeee
Q psy6570 139 TIESINLNGKDRFVVYHTEDNGYKP-YKLEVFE--D-NLYFSTYRTNNILKINKFGNSDFNVLA 198 (713)
Q Consensus 139 ~I~~~~~~g~~~~~~~~~~~~~~~p-~~i~~~~--~-~ly~td~~~~~i~~~~~~~~~~~~~~~ 198 (713)
.||.+|+....|..-+.. .+| .+|.+.+ . .||.++...+.+..++..++..+..+.
T Consensus 270 eVWv~D~~t~krv~Ri~l----~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tGk~~~~~~ 329 (342)
T PF06433_consen 270 EVWVYDLKTHKRVARIPL----EHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAATGKLVRSIE 329 (342)
T ss_dssp EEEEEETTTTEEEEEEEE----EEEESEEEEESSSS-EEEEEETTTTEEEEEETTT--EEEEE-
T ss_pred EEEEEECCCCeEEEEEeC----CCccceEEEccCCCcEEEEEcCCCCeEEEEeCcCCcEEeehh
Confidence 499999876654333321 222 3566542 2 677777777888889887776555544
No 124
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.41 E-value=0.0046 Score=39.46 Aligned_cols=26 Identities=42% Similarity=0.936 Sum_probs=15.4
Q ss_pred CCCCCcEEeecCCCceeeCCCCCcCCCC
Q psy6570 559 YCSNNGTCVLIEGKPSCKCLPPYSGKQC 586 (713)
Q Consensus 559 ~C~~~~~C~~~~g~~~C~C~~G~~G~~C 586 (713)
.|+++|+|+.. ..+|.|.+||+|..|
T Consensus 7 ~C~~~G~C~~~--~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 7 ICSGHGTCVSP--CGRCVCDSGYTGPDC 32 (32)
T ss_pred ccCCCCEEeCC--CCEEECCCCCcCCCC
Confidence 46666666644 235666666666554
No 125
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=96.40 E-value=0.0018 Score=42.64 Aligned_cols=26 Identities=38% Similarity=0.865 Sum_probs=20.3
Q ss_pred CCCCCcEEeecCCCceeeCCCCCcCC
Q psy6570 559 YCSNNGTCVLIEGKPSCKCLPPYSGK 584 (713)
Q Consensus 559 ~C~~~~~C~~~~g~~~C~C~~G~~G~ 584 (713)
.|+.+++|+++.++|.|.|++||.|+
T Consensus 7 ~C~~nA~C~~~~~~~~C~C~~Gy~Gd 32 (36)
T PF12947_consen 7 GCHPNATCTNTGGSYTCTCKPGYEGD 32 (36)
T ss_dssp GS-TTCEEEE-TTSEEEEE-CEEECC
T ss_pred CCCCCcEeecCCCCEEeECCCCCccC
Confidence 58889999999999999999999875
No 126
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=96.34 E-value=0.0042 Score=40.88 Aligned_cols=28 Identities=43% Similarity=0.946 Sum_probs=19.9
Q ss_pred CCCCCCCcEeccCCCCccccCCCCCcCC
Q psy6570 420 NLKCQNGGVCVNKTTGLECDCPKFYYGK 447 (713)
Q Consensus 420 ~~~C~~~~~C~~~~~~~~C~C~~G~~g~ 447 (713)
..+|.+++.|++..++|.|.|+.||.|.
T Consensus 5 ~~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4567667777777777777777777766
No 127
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=96.31 E-value=0.34 Score=47.95 Aligned_cols=171 Identities=13% Similarity=0.168 Sum_probs=92.8
Q ss_pred cCCceeEEEccCcccEEecCCCCCce--EEEeccCCeEEEeecCCCCCCeEEEEe-cCCceEEEEEcCC-----CCCcce
Q psy6570 5 SSGNVTRVKREMNLKTVLSNLHDPRG--VAVDWVGKNLYWTDAGGRSSNNIMVST-LEGRKKRTLLNTG-----LNEPYD 76 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~~~~~~~~p~g--la~D~~~~~ly~td~~~~~~~~I~~~~-~~G~~~~~l~~~~-----~~~p~~ 76 (713)
..+.|+.++..+.+.+-...+..+.. .++ .++.||+... .++|..++ .+|+..-.+.... +..+..
T Consensus 44 ~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~--~~~~v~v~~~----~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~ 117 (238)
T PF13360_consen 44 GDGNLYALDAKTGKVLWRFDLPGPISGAPVV--DGGRVYVGTS----DGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSS 117 (238)
T ss_dssp TTSEEEEEETTTSEEEEEEECSSCGGSGEEE--ETTEEEEEET----TSEEEEEETTTSCEEEEEEE-SSCTCSTB--SE
T ss_pred CCCEEEEEECCCCCEEEEeeccccccceeee--cccccccccc----eeeeEecccCCcceeeeeccccccccccccccC
Confidence 56777888874433222122223211 233 4688888875 34888998 6676655432211 223444
Q ss_pred EEEcCCCCcEEEEccCCCCeEEEEecC-CCCcEEEEeCCCC--C--------CeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 77 IALEPLSGRMFWTELGIKPRISGASID-GKNKFNLVDNNIQ--W--------PTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 77 iavD~~~~~ly~td~~~~~~I~~~~~d-G~~~~~l~~~~~~--~--------p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
++++ ++.||+.... . .|..+++. |+.+-..-..... . ...+.++ +++||++..... +..+++
T Consensus 118 ~~~~--~~~~~~~~~~-g-~l~~~d~~tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~g~-~~~~d~ 190 (238)
T PF13360_consen 118 PAVD--GDRLYVGTSS-G-KLVALDPKTGKLLWKYPVGEPRGSSPISSFSDINGSPVIS--DGRVYVSSGDGR-VVAVDL 190 (238)
T ss_dssp EEEE--TTEEEEEETC-S-EEEEEETTTTEEEEEEESSTT-SS--EEEETTEEEEEECC--TTEEEEECCTSS-EEEEET
T ss_pred ceEe--cCEEEEEecc-C-cEEEEecCCCcEEEEeecCCCCCCcceeeecccccceEEE--CCEEEEEcCCCe-EEEEEC
Confidence 5554 6677777642 2 68888855 4332222111101 0 1223333 569998885544 555576
Q ss_pred CCCceeEEEecCCCCcccee-eeeeCCeEEEEeCCCCcEEEEcccCCCc
Q psy6570 146 NGKDRFVVYHTEDNGYKPYK-LEVFEDNLYFSTYRTNNILKINKFGNSD 193 (713)
Q Consensus 146 ~g~~~~~~~~~~~~~~~p~~-i~~~~~~ly~td~~~~~i~~~~~~~~~~ 193 (713)
....+. .... .....+ ....++.||+.+ ..+.|+.+++.++..
T Consensus 191 ~tg~~~-w~~~---~~~~~~~~~~~~~~l~~~~-~~~~l~~~d~~tG~~ 234 (238)
T PF13360_consen 191 ATGEKL-WSKP---ISGIYSLPSVDGGTLYVTS-SDGRLYALDLKTGKV 234 (238)
T ss_dssp TTTEEE-EEEC---SS-ECECEECCCTEEEEEE-TTTEEEEEETTTTEE
T ss_pred CCCCEE-EEec---CCCccCCceeeCCEEEEEe-CCCEEEEEECCCCCE
Confidence 655533 3222 223444 566788999998 678899998876643
No 128
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=96.29 E-value=0.0019 Score=31.90 Aligned_cols=13 Identities=46% Similarity=1.410 Sum_probs=10.9
Q ss_pred eeeCCCCcccCCC
Q psy6570 656 ICICPRGYAGVRC 668 (713)
Q Consensus 656 ~C~C~~Gy~G~~C 668 (713)
+|+|++||+|.+|
T Consensus 1 ~C~C~~G~~G~~C 13 (13)
T PF12661_consen 1 TCQCPPGWTGPNC 13 (13)
T ss_dssp EEEE-TTEETTTT
T ss_pred CccCcCCCcCCCC
Confidence 5999999999987
No 129
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=96.28 E-value=0.12 Score=54.20 Aligned_cols=123 Identities=16% Similarity=0.260 Sum_probs=73.4
Q ss_pred cCCCCCceEEEe--ccCCeEEEeecCCCCCCeEEEEec----CCceEEEEEcC--CCCCcceEEEcCCCCcEEEEccCCC
Q psy6570 23 SNLHDPRGVAVD--WVGKNLYWTDAGGRSSNNIMVSTL----EGRKKRTLLNT--GLNEPYDIALEPLSGRMFWTELGIK 94 (713)
Q Consensus 23 ~~~~~p~gla~D--~~~~~ly~td~~~~~~~~I~~~~~----~G~~~~~l~~~--~~~~p~~iavD~~~~~ly~td~~~~ 94 (713)
+.+..|.||++= +.++.+|..-.+ ..+.+..+.| +|...-.+++. --.+|.|+++|...+.||+.+...
T Consensus 153 ~~~~e~yGlcly~~~~~g~~ya~v~~--k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~~- 229 (381)
T PF02333_consen 153 TDLSEPYGLCLYRSPSTGALYAFVNG--KDGRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEEDV- 229 (381)
T ss_dssp -SSSSEEEEEEEE-TTT--EEEEEEE--TTSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETTT-
T ss_pred cccccceeeEEeecCCCCcEEEEEec--CCceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCcc-
Confidence 456678899874 445666654321 2344444333 34332233331 136899999999999999999653
Q ss_pred CeEEEEecC---CCCcEEEEeC---CC-CCCeeEEEeC---CCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 95 PRISGASID---GKNKFNLVDN---NI-QWPTGITIDY---PSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 95 ~~I~~~~~d---G~~~~~l~~~---~~-~~p~glavd~---~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
-||++..+ +..++.+... .+ .-..||+|-. ..++|.+++.+.++...++..+..
T Consensus 230 -GIW~y~Aep~~~~~~~~v~~~~g~~l~aDvEGlaly~~~~g~gYLivSsQG~~sf~Vy~r~~~~ 293 (381)
T PF02333_consen 230 -GIWRYDAEPEGGNDRTLVASADGDGLVADVEGLALYYGSDGKGYLIVSSQGDNSFAVYDREGPN 293 (381)
T ss_dssp -EEEEEESSCCC-S--EEEEEBSSSSB-S-EEEEEEEE-CCC-EEEEEEEGGGTEEEEEESSTT-
T ss_pred -EEEEEecCCCCCCcceeeecccccccccCccceEEEecCCCCeEEEEEcCCCCeEEEEecCCCC
Confidence 59999886 3334444221 12 3567899853 246899999998888888877753
No 130
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=96.15 E-value=0.092 Score=49.06 Aligned_cols=114 Identities=16% Similarity=0.179 Sum_probs=68.5
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCc--eEEEEEc-C--CCCCcceEEEcCCCCcEEEEccCCCCeEEE
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR--KKRTLLN-T--GLNEPYDIALEPLSGRMFWTELGIKPRISG 99 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~--~~~~l~~-~--~~~~p~~iavD~~~~~ly~td~~~~~~I~~ 99 (713)
-.+-.||+.| ..+|..+|. +..+...++..- ..++.+. + ++...+-|. +.+|.||---|... +|.|
T Consensus 130 ~GeGWgLt~d--~~~LimsdG----satL~frdP~tfa~~~~v~VT~~g~pv~~LNELE--~VdG~lyANVw~t~-~I~r 200 (262)
T COG3823 130 EGEGWGLTSD--DKNLIMSDG----SATLQFRDPKTFAELDTVQVTDDGVPVSKLNELE--WVDGELYANVWQTT-RIAR 200 (262)
T ss_pred CCcceeeecC--CcceEeeCC----ceEEEecCHHHhhhcceEEEEECCeeccccccee--eeccEEEEeeeeec-ceEE
Confidence 3445677776 344665664 344544444321 1111111 1 123333443 36788887767666 8999
Q ss_pred EecCCCCcEEEEe------------CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 100 ASIDGKNKFNLVD------------NNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 100 ~~~dG~~~~~l~~------------~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
++.+.......+. .+.+.++|||.|+..+++|++-..-..++.+.+++
T Consensus 201 I~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~wp~lfEVk~~~ 260 (262)
T COG3823 201 IDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKLWPLLFEVKLDE 260 (262)
T ss_pred EcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCcCceeEEEEecC
Confidence 9988665554442 23457899999999999999987666666665543
No 131
>smart00181 EGF Epidermal growth factor-like domain.
Probab=96.14 E-value=0.0059 Score=40.10 Aligned_cols=28 Identities=43% Similarity=1.043 Sum_probs=17.9
Q ss_pred CCCCCCcEeccCCCCccccCCCCCcC-CCC
Q psy6570 421 LKCQNGGVCVNKTTGLECDCPKFYYG-KNC 449 (713)
Q Consensus 421 ~~C~~~~~C~~~~~~~~C~C~~G~~g-~~C 449 (713)
.+|.++ .|++..++|.|.|++||.| ..|
T Consensus 6 ~~C~~~-~C~~~~~~~~C~C~~g~~g~~~C 34 (35)
T smart00181 6 GPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34 (35)
T ss_pred CCCCCC-EEECCCCCeEeECCCCCccCCcc
Confidence 456666 6766666677777777766 444
No 132
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=96.10 E-value=0.0079 Score=38.39 Aligned_cols=25 Identities=44% Similarity=1.214 Sum_probs=13.9
Q ss_pred CCCCCEEEcCCCeeeCCCCCccCCC
Q psy6570 391 CHNGGTCIATTQTCVCPPGFTGDTC 415 (713)
Q Consensus 391 C~~~~~C~~~~~~C~C~~g~~g~~C 415 (713)
|+++|+|+...++|.|++||+|+.|
T Consensus 8 C~~~G~C~~~~g~C~C~~g~~G~~C 32 (32)
T PF07974_consen 8 CSGHGTCVSPCGRCVCDSGYTGPDC 32 (32)
T ss_pred cCCCCEEeCCCCEEECCCCCcCCCC
Confidence 5555666543356666666665543
No 133
>smart00051 DSL delta serrate ligand.
Probab=95.97 E-value=0.0089 Score=45.11 Aligned_cols=46 Identities=26% Similarity=0.432 Sum_probs=29.1
Q ss_pred eeecCCCcccCCCCcCCCCCCCCCCCCeecCCCCccCCCCCceeeCCCCcccCCC
Q psy6570 614 VCTCVNGWSGITCSERVSCAHFCFNGGTCREQNYSLDPDLKPICICPRGYAGVRC 668 (713)
Q Consensus 614 ~C~C~~G~~G~~C~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~C~C~~Gy~G~~C 668 (713)
.=.|+++|.|..|+....+.+.+..+.+|.. .+.|+|.+||+|.+|
T Consensus 18 rv~C~~~~yG~~C~~~C~~~~d~~~~~~Cd~---------~G~~~C~~Gw~G~~C 63 (63)
T smart00051 18 RVTCDENYYGEGCNKFCRPRDDFFGHYTCDE---------NGNKGCLEGWMGPYC 63 (63)
T ss_pred EeeCCCCCcCCccCCEeCcCccccCCccCCc---------CCCEecCCCCcCCCC
Confidence 4467777777777653333334455566643 267888888888776
No 134
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=95.95 E-value=1.7 Score=43.13 Aligned_cols=137 Identities=11% Similarity=0.138 Sum_probs=79.0
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCceEEEE-EcCC--CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEe
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRKKRTL-LNTG--LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVD 112 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l-~~~~--~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~ 112 (713)
.++..|++|. ..+ +.+.++.....-+| .+-. ...-+.++| ..++-|++.++.. +...+......-+++.
T Consensus 137 sGn~aYVadl---ddg-fLivdvsdpssP~lagrya~~~~d~~~v~I--SGn~AYvA~~d~G--L~ivDVSnp~sPvli~ 208 (370)
T COG5276 137 SGNYAYVADL---DDG-FLIVDVSDPSSPQLAGRYALPGGDTHDVAI--SGNYAYVAWRDGG--LTIVDVSNPHSPVLIG 208 (370)
T ss_pred cCCEEEEeec---cCc-EEEEECCCCCCceeeeeeccCCCCceeEEE--ecCeEEEEEeCCC--eEEEEccCCCCCeEEE
Confidence 5678999997 333 44445443332222 2211 122356787 4778898887655 5566665555555554
Q ss_pred CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceee---eeeCCeEEEEeCCCCc
Q psy6570 113 NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKL---EVFEDNLYFSTYRTNN 182 (713)
Q Consensus 113 ~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i---~~~~~~ly~td~~~~~ 182 (713)
.--..|.--++.+..++.|.++...+ +..++.++.....++..-. ...|.++ .+.+++.|++|..++.
T Consensus 209 ~~n~g~g~~sv~vsdnr~y~vvy~eg-vlivd~s~~ssp~~~gsye-t~~p~~~s~v~Vs~~~~Yvadga~gl 279 (370)
T COG5276 209 SYNTGPGTYSVSVSDNRAYLVVYDEG-VLIVDVSGPSSPTVFGSYE-TSNPVSISTVPVSGEYAYVADGAKGL 279 (370)
T ss_pred EEecCCceEEEEecCCeeEEEEcccc-eEEEecCCCCCceEeeccc-cCCcccccceecccceeeeeccccCc
Confidence 32222344444555789999986655 5667777755333332221 3355554 7889999999876553
No 135
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=95.90 E-value=0.27 Score=51.45 Aligned_cols=116 Identities=17% Similarity=0.166 Sum_probs=73.3
Q ss_pred CCceEEEeccCCeEEEeecCCCCC------CeEEEEecCCceEEEE-EcCCC-------------CCcceEEEcCCCCcE
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSS------NNIMVSTLEGRKKRTL-LNTGL-------------NEPYDIALEPLSGRM 86 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~------~~I~~~~~~G~~~~~l-~~~~~-------------~~p~~iavD~~~~~l 86 (713)
.++||++ ..++.+||++. .. .+|.+++++|+..+.+ +...+ ....+||+.+..+.|
T Consensus 86 D~Egi~~-~~~g~~~is~E---~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l 161 (326)
T PF13449_consen 86 DPEGIAV-PPDGSFWISSE---GGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTL 161 (326)
T ss_pred ChhHeEE-ecCCCEEEEeC---CccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEE
Confidence 7789999 57799999998 66 8999999999886655 22221 235589999876668
Q ss_pred EEEccCC-------C-------CeEEEEecCC--CCcEEEE-e-C------CCCCCeeEEEeCCCCeEEEEcCC------
Q psy6570 87 FWTELGI-------K-------PRISGASIDG--KNKFNLV-D-N------NIQWPTGITIDYPSQRLYWADPK------ 136 (713)
Q Consensus 87 y~td~~~-------~-------~~I~~~~~dG--~~~~~l~-~-~------~~~~p~glavd~~~~~LY~~d~~------ 136 (713)
|.+.... . .+|.+.+... .....++ . . ....+..|+.-+ +++|++.+..
T Consensus 162 ~~~~E~~l~~d~~~~~~~~~~~~ri~~~d~~~~~~~~~~~~y~ld~~~~~~~~~~isd~~al~-d~~lLvLER~~~~~~~ 240 (326)
T PF13449_consen 162 FAAMESPLKQDGPRANPDNGSPLRILRYDPKTPGEPVAEYAYPLDPPPTAPGDNGISDIAALP-DGRLLVLERDFSPGTG 240 (326)
T ss_pred EEEECccccCCCcccccccCceEEEEEecCCCCCccceEEEEeCCccccccCCCCceeEEEEC-CCcEEEEEccCCCCcc
Confidence 7764221 1 2555665542 2222222 1 1 234455555553 5668888743
Q ss_pred -CCcEEEEeCCC
Q psy6570 137 -ARTIESINLNG 147 (713)
Q Consensus 137 -~~~I~~~~~~g 147 (713)
..+|+++++..
T Consensus 241 ~~~ri~~v~l~~ 252 (326)
T PF13449_consen 241 NYKRIYRVDLSD 252 (326)
T ss_pred ceEEEEEEEccc
Confidence 34688888754
No 136
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=95.89 E-value=0.0048 Score=30.51 Aligned_cols=12 Identities=50% Similarity=1.625 Sum_probs=6.6
Q ss_pred eecCCCcccCCC
Q psy6570 615 CTCVNGWSGITC 626 (713)
Q Consensus 615 C~C~~G~~G~~C 626 (713)
|.|++||+|..|
T Consensus 2 C~C~~G~~G~~C 13 (13)
T PF12661_consen 2 CQCPPGWTGPNC 13 (13)
T ss_dssp EEE-TTEETTTT
T ss_pred ccCcCCCcCCCC
Confidence 566666666554
No 137
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=95.87 E-value=0.0064 Score=35.73 Aligned_cols=19 Identities=26% Similarity=0.780 Sum_probs=9.7
Q ss_pred cceeecCCCcc----cCCCCcCC
Q psy6570 612 KPVCTCVNGWS----GITCSERV 630 (713)
Q Consensus 612 ~~~C~C~~G~~----G~~C~~~~ 630 (713)
+|.|+|++||. |..|.+++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDID 23 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCC
Confidence 35566666654 34455443
No 138
>KOG0318|consensus
Probab=95.85 E-value=1.1 Score=47.59 Aligned_cols=115 Identities=15% Similarity=0.094 Sum_probs=72.2
Q ss_pred ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEe-cCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEE
Q psy6570 22 LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVST-LEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGA 100 (713)
Q Consensus 22 ~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~-~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~ 100 (713)
+....+|.+||+...+..+.++-. .+ |.++. +.+- .+ + .-...|.++|+.|....+-+- +..+.|+..
T Consensus 402 ~~lg~QP~~lav~~d~~~avv~~~----~~-iv~l~~~~~~-~~--~-~~~y~~s~vAv~~~~~~vaVG--G~Dgkvhvy 470 (603)
T KOG0318|consen 402 VKLGSQPKGLAVLSDGGTAVVACI----SD-IVLLQDQTKV-SS--I-PIGYESSAVAVSPDGSEVAVG--GQDGKVHVY 470 (603)
T ss_pred eecCCCceeEEEcCCCCEEEEEec----Cc-EEEEecCCcc-ee--e-ccccccceEEEcCCCCEEEEe--cccceEEEE
Confidence 456678999999977666666654 23 44433 3331 11 1 123578999999865544443 344478888
Q ss_pred ecCCCCcEE--EEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 101 SIDGKNKFN--LVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 101 ~~dG~~~~~--l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
.+.|..+.. +.......++.|++.|+..+|-..|. +++|..++....
T Consensus 471 sl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da-~rkvv~yd~~s~ 519 (603)
T KOG0318|consen 471 SLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDA-SRKVVLYDVASR 519 (603)
T ss_pred EecCCcccceeeeecccCCceEEEECCCCcEEEEecc-CCcEEEEEcccC
Confidence 888865433 22345667899999998888887785 345555555443
No 139
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=95.72 E-value=0.11 Score=57.70 Aligned_cols=71 Identities=18% Similarity=0.358 Sum_probs=49.6
Q ss_pred ecCCCCCceEEEeccCCeEEEeecCCC----------------CCCeEEEEecCCc-------eEEEEEc----------
Q psy6570 22 LSNLHDPRGVAVDWVGKNLYWTDAGGR----------------SSNNIMVSTLEGR-------KKRTLLN---------- 68 (713)
Q Consensus 22 ~~~~~~p~gla~D~~~~~ly~td~~~~----------------~~~~I~~~~~~G~-------~~~~l~~---------- 68 (713)
.+.+.+|++|++++.++.||++.+++. ..+.|+++.+++. ...+++.
T Consensus 346 AT~f~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~ 425 (524)
T PF05787_consen 346 ATPFDRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGN 425 (524)
T ss_pred cccccCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCccccccc
Confidence 457899999999999999999876432 1247999988765 2222221
Q ss_pred -------CCCCCcceEEEcCCCCcEEE-EccCC
Q psy6570 69 -------TGLNEPYDIALEPLSGRMFW-TELGI 93 (713)
Q Consensus 69 -------~~~~~p~~iavD~~~~~ly~-td~~~ 93 (713)
..+..|-.|++|+. |+||+ +|.+.
T Consensus 426 ~~~~~~~~~f~sPDNL~~d~~-G~LwI~eD~~~ 457 (524)
T PF05787_consen 426 GSNKCDDNGFASPDNLAFDPD-GNLWIQEDGGG 457 (524)
T ss_pred ccCcccCCCcCCCCceEECCC-CCEEEEeCCCC
Confidence 12678999999985 55555 55443
No 140
>KOG0279|consensus
Probab=95.72 E-value=1.7 Score=42.76 Aligned_cols=184 Identities=10% Similarity=0.020 Sum_probs=114.8
Q ss_pred CCceeEEEccCcc--cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-CCCcceEEEcCC
Q psy6570 6 SGNVTRVKREMNL--KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-LNEPYDIALEPL 82 (713)
Q Consensus 6 ~~~I~~~~~~~~~--~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-~~~p~~iavD~~ 82 (713)
++.++..|+.+.+ +.+.-.-....++||++.++.| ++-+ ....|...+.-|.-+.++.... -....-+.+.|.
T Consensus 84 D~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qi-vSGS---rDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~ 159 (315)
T KOG0279|consen 84 DGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQI-VSGS---RDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPN 159 (315)
T ss_pred cceEEEEEecCCcEEEEEEecCCceEEEEecCCCcee-ecCC---CcceeeeeeecccEEEEEecCCCcCcEEEEEEcCC
Confidence 4455555666543 2234455667889999876666 4666 6778888888887776666554 567888999988
Q ss_pred CCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 83 SGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 83 ~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
....|+...+....+.+-++++-..+.-+.....+-+.+++.|+ +.|-.+-...+.++.-+++-... +.... ....
T Consensus 160 ~~~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSpD-GslcasGgkdg~~~LwdL~~~k~--lysl~-a~~~ 235 (315)
T KOG0279|consen 160 ESNPIIVSASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSPD-GSLCASGGKDGEAMLWDLNEGKN--LYSLE-AFDI 235 (315)
T ss_pred CCCcEEEEccCCceEEEEccCCcchhhccccccccEEEEEECCC-CCEEecCCCCceEEEEEccCCce--eEecc-CCCe
Confidence 75666655454436777788877666555556667789999965 45555555666777777754432 22221 1233
Q ss_pred ceeeeeeCCeEEEEeCCCCcEEEEcccCCCcceee
Q psy6570 163 PYKLEVFEDNLYFSTYRTNNILKINKFGNSDFNVL 197 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~~ 197 (713)
-.++.+..++.+........|...+...+..+..+
T Consensus 236 v~sl~fspnrywL~~at~~sIkIwdl~~~~~v~~l 270 (315)
T KOG0279|consen 236 VNSLCFSPNRYWLCAATATSIKIWDLESKAVVEEL 270 (315)
T ss_pred EeeEEecCCceeEeeccCCceEEEeccchhhhhhc
Confidence 35555555555444445556777776655544443
No 141
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=95.66 E-value=1.1 Score=44.43 Aligned_cols=147 Identities=14% Similarity=0.238 Sum_probs=78.5
Q ss_pred eEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEE
Q psy6570 30 GVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFN 109 (713)
Q Consensus 30 gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~ 109 (713)
.+++ .++.-|++.. . +-+.+.+......-+++..--..|.--.+-+...+.|.++.... +...+.++...-+
T Consensus 176 ~v~I--SGn~AYvA~~---d-~GL~ivDVSnp~sPvli~~~n~g~g~~sv~vsdnr~y~vvy~eg--vlivd~s~~ssp~ 247 (370)
T COG5276 176 DVAI--SGNYAYVAWR---D-GGLTIVDVSNPHSPVLIGSYNTGPGTYSVSVSDNRAYLVVYDEG--VLIVDVSGPSSPT 247 (370)
T ss_pred eEEE--ecCeEEEEEe---C-CCeEEEEccCCCCCeEEEEEecCCceEEEEecCCeeEEEEcccc--eEEEecCCCCCce
Confidence 4555 3567787776 2 33444444333322333221122333333335678888876544 6667777765444
Q ss_pred EEe-CCCCCCeeE-EEeCCCCeEEEEcCCCCcEEEEeCCC-CceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEE
Q psy6570 110 LVD-NNIQWPTGI-TIDYPSQRLYWADPKARTIESINLNG-KDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILK 185 (713)
Q Consensus 110 l~~-~~~~~p~gl-avd~~~~~LY~~d~~~~~I~~~~~~g-~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~ 185 (713)
++. -+...|.++ ++-..+.++|++|...+ +..++... ....+..+-.....+..+|.+.++++|++|...+.|.-
T Consensus 248 ~~gsyet~~p~~~s~v~Vs~~~~Yvadga~g-l~~idisnp~spfl~ss~~t~g~~a~gi~ay~~y~yiadkn~g~vV~ 325 (370)
T COG5276 248 VFGSYETSNPVSISTVPVSGEYAYVADGAKG-LPIIDISNPPSPFLSSSLDTAGYQAAGIRAYGNYNYIADKNTGAVVD 325 (370)
T ss_pred EeeccccCCcccccceecccceeeeeccccC-ceeEeccCCCCCchhccccCCCccccceEEecCeeEeccCCceEEEe
Confidence 442 234445554 22334889999996554 22333322 22222222122234678999999999999988665543
No 142
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=95.65 E-value=0.032 Score=45.07 Aligned_cols=35 Identities=26% Similarity=0.362 Sum_probs=30.5
Q ss_pred EeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 111 VDNNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 111 ~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
+...+..|+||++|+..+.||+++...+.|..+..
T Consensus 49 va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~ 83 (86)
T PF01731_consen 49 VASGFSFANGIAISPDKKYLYVASSLAHSIHVYKR 83 (86)
T ss_pred eeccCCCCceEEEcCCCCEEEEEeccCCeEEEEEe
Confidence 34578999999999999999999999998888764
No 143
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=95.58 E-value=0.83 Score=49.43 Aligned_cols=124 Identities=16% Similarity=0.161 Sum_probs=76.2
Q ss_pred CeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEc-cCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCe
Q psy6570 51 NNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTE-LGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQR 129 (713)
Q Consensus 51 ~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td-~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~ 129 (713)
.+|++.+++.....+++... ..-...++-|.+.+|.++. ......|+.++++++.+..|.. ....-..=.+.|++.+
T Consensus 218 ~~i~~~~l~~g~~~~i~~~~-g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~-~~gi~~~Ps~spdG~~ 295 (425)
T COG0823 218 PRIYYLDLNTGKRPVILNFN-GNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTN-GFGINTSPSWSPDGSK 295 (425)
T ss_pred ceEEEEeccCCccceeeccC-CccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceeccc-CCccccCccCCCCCCE
Confidence 46888888877776666422 2333455556566666554 3345689999999988766543 2222235567777888
Q ss_pred EEEEcC--CCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEe
Q psy6570 130 LYWADP--KARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 130 LY~~d~--~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td 177 (713)
||++-. +...|++++.+|+..+.+......-.. -.+..++++|-+..
T Consensus 296 ivf~Sdr~G~p~I~~~~~~g~~~~riT~~~~~~~~-p~~SpdG~~i~~~~ 344 (425)
T COG0823 296 IVFTSDRGGRPQIYLYDLEGSQVTRLTFSGGGNSN-PVWSPDGDKIVFES 344 (425)
T ss_pred EEEEeCCCCCcceEEECCCCCceeEeeccCCCCcC-ccCCCCCCEEEEEe
Confidence 877743 455799999999887665544221111 22344555555444
No 144
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=95.54 E-value=2 Score=42.33 Aligned_cols=172 Identities=13% Similarity=0.164 Sum_probs=91.7
Q ss_pred CCceeEEEccCcccEEecCCC--CCceE--EEeccCCeEEEeecCCCCCCeEEEEec-CCceEEEEEcC-CCCCcceEEE
Q psy6570 6 SGNVTRVKREMNLKTVLSNLH--DPRGV--AVDWVGKNLYWTDAGGRSSNNIMVSTL-EGRKKRTLLNT-GLNEPYDIAL 79 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~--~p~gl--a~D~~~~~ly~td~~~~~~~~I~~~~~-~G~~~~~l~~~-~~~~p~~iav 79 (713)
++.|..+++.+..++=...+. ....+ ++. .+++||+++. .+.|+.++. +|+..-..... .+..+ .++
T Consensus 2 ~g~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~-~~~~v~~~~~----~~~l~~~d~~tG~~~W~~~~~~~~~~~--~~~ 74 (238)
T PF13360_consen 2 DGTLSALDPRTGKELWSYDLGPGIGGPVATAVP-DGGRVYVASG----DGNLYALDAKTGKVLWRFDLPGPISGA--PVV 74 (238)
T ss_dssp TSEEEEEETTTTEEEEEEECSSSCSSEEETEEE-ETTEEEEEET----TSEEEEEETTTSEEEEEEECSSCGGSG--EEE
T ss_pred CCEEEEEECCCCCEEEEEECCCCCCCccceEEE-eCCEEEEEcC----CCEEEEEECCCCCEEEEeeccccccce--eee
Confidence 578899998654332222221 22333 442 4689999854 688999997 67543332211 11222 244
Q ss_pred cCCCCcEEEEccCCCCeEEEEe-cCCCCcEEEEeCC-----CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC-CCceeE
Q psy6570 80 EPLSGRMFWTELGIKPRISGAS-IDGKNKFNLVDNN-----IQWPTGITIDYPSQRLYWADPKARTIESINLN-GKDRFV 152 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~-~dG~~~~~l~~~~-----~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~-g~~~~~ 152 (713)
.++.||+.... . +|+.++ .+|+.+-.+.... +..+..++++ +++||+... .+.|+.+++. |..+-.
T Consensus 75 --~~~~v~v~~~~-~-~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~g~l~~~d~~tG~~~w~ 147 (238)
T PF13360_consen 75 --DGGRVYVGTSD-G-SLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVD--GDRLYVGTS-SGKLVALDPKTGKLLWK 147 (238)
T ss_dssp --ETTEEEEEETT-S-EEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEE--TTEEEEEET-CSEEEEEETTTTEEEEE
T ss_pred --cccccccccce-e-eeEecccCCcceeeeeccccccccccccccCceEe--cCEEEEEec-cCcEEEEecCCCcEEEE
Confidence 36788887733 3 788888 6676665532211 2234445555 677877765 6678888864 433222
Q ss_pred EEecCCCCcc--------ceeeeeeCCeEEEEeCCCCcEEEEcccCCC
Q psy6570 153 VYHTEDNGYK--------PYKLEVFEDNLYFSTYRTNNILKINKFGNS 192 (713)
Q Consensus 153 ~~~~~~~~~~--------p~~i~~~~~~ly~td~~~~~i~~~~~~~~~ 192 (713)
.......... ...+.+.++.||++.... .+..++...+.
T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g-~~~~~d~~tg~ 194 (238)
T PF13360_consen 148 YPVGEPRGSSPISSFSDINGSPVISDGRVYVSSGDG-RVVAVDLATGE 194 (238)
T ss_dssp EESSTT-SS--EEEETTEEEEEECCTTEEEEECCTS-SEEEEETTTTE
T ss_pred eecCCCCCCcceeeecccccceEEECCEEEEEcCCC-eEEEEECCCCC
Confidence 2111101111 123334467888886544 35666655444
No 145
>KOG3512|consensus
Probab=95.31 E-value=0.049 Score=56.54 Aligned_cols=133 Identities=26% Similarity=0.697 Sum_probs=70.2
Q ss_pred CCCCC-ceeeCCCCCCCCCceeeCCCCcccCCCCccC---CCCCCCCCCEEEc--CCCeeeCCCCCccCCCC---cCC--
Q psy6570 351 PCLNQ-GMCYPDLTHPEPTYKCHCAPSYTGARCESRI---CENKCHNGGTCIA--TTQTCVCPPGFTGDTCQ---QCL-- 419 (713)
Q Consensus 351 ~C~~~-~~C~~~~~~~~~~~~C~C~~G~~g~~C~~~~---C~~~C~~~~~C~~--~~~~C~C~~g~~g~~C~---~C~-- 419 (713)
.|..+ ..|+..... .++|.|.-+-+|+.|+... ...+-.. ++=.. .--.|.|.. .+.+|. |+.
T Consensus 279 KCNgHAs~Cv~d~~~---~ltCdC~HNTaGPdCgrCKpfy~dRPW~r-aT~~~a~~c~ac~Cn~--harrcrfn~Ely~l 352 (592)
T KOG3512|consen 279 KCNGHASRCVMDESS---HLTCDCEHNTAGPDCGRCKPFYYDRPWGR-ATALPANECVACNCNG--HARRCRFNMELYRL 352 (592)
T ss_pred eecCccceeeeccCC---ceEEecccCCCCCCcccccccccCCCccc-cccCCCccccccccch--hhhhcccchhhhcc
Confidence 35444 348766442 5899999999999886421 1111110 00000 001233322 122222 221
Q ss_pred CCCCCCCcEecc---CCCCcccc-CCCCCcCC---------CCccCCCCCC-CCCCeeecCCCCCeecCCCCccCCCCCC
Q psy6570 420 NLKCQNGGVCVN---KTTGLECD-CPKFYYGK---------NCQYSQCKNY-CVNGECSITDSGPKCMCSPGYSGKKCDT 485 (713)
Q Consensus 420 ~~~C~~~~~C~~---~~~~~~C~-C~~G~~g~---------~C~~~~C~~~-~~~~~C~~~~~~~~C~C~~G~~g~~C~~ 485 (713)
+..++ +++|.| ...+..|. |.+||+-+ .|....|.+. ..+.+|..+.| +|.|.+|.+|..|+.
T Consensus 353 Sgr~S-ggvClnCrHnTaGrhChyCreGyyRd~s~pl~hrkaCk~CdChpVGs~gktCNq~tG--qCpCkeGvtG~tCnr 429 (592)
T KOG3512|consen 353 SGRRS-GGVCLNCRHNTAGRHCHYCREGYYRDGSKPLTHRKACKACDCHPVGSAGKTCNQTTG--QCPCKEGVTGLTCNR 429 (592)
T ss_pred cCccc-cceEeecccCCCCcccccccCccccCCCCCCchhhhhhhcCCcccccccccccccCC--cccCCCCCccccccc
Confidence 11222 345542 22345674 99999732 3443344443 34556776666 899999999999886
Q ss_pred ccccCCCCCCcccCCCCccC
Q psy6570 486 CTCLNGDSGPKCMCSPGYSG 505 (713)
Q Consensus 486 ~~C~~~~~~~~C~C~~G~~g 505 (713)
|.+||.-
T Consensus 430 -------------Ca~gyqq 436 (592)
T KOG3512|consen 430 -------------CAPGYQQ 436 (592)
T ss_pred -------------ccchhhc
Confidence 7788863
No 146
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=95.30 E-value=1.1 Score=42.86 Aligned_cols=121 Identities=11% Similarity=0.018 Sum_probs=77.0
Q ss_pred CceeEEEccCcc-cEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCC
Q psy6570 7 GNVTRVKREMNL-KTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSG 84 (713)
Q Consensus 7 ~~I~~~~~~~~~-~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~ 84 (713)
..|++++..+.. ..+ +..-.....+++.|.+.++.+.... ...+|..++++++....+. -...+.|...|.++
T Consensus 39 ~~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~--~~~~v~lyd~~~~~i~~~~---~~~~n~i~wsP~G~ 113 (194)
T PF08662_consen 39 FELFYLNEKNIPVESIELKKEGPIHDVAWSPNGNEFAVIYGS--MPAKVTLYDVKGKKIFSFG---TQPRNTISWSPDGR 113 (194)
T ss_pred EEEEEEecCCCccceeeccCCCceEEEEECcCCCEEEEEEcc--CCcccEEEcCcccEeEeec---CCCceEEEECCCCC
Confidence 345566544422 222 2222347889999988887776531 3458889998866555443 24566899999888
Q ss_pred cEEEEccCC-CCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc
Q psy6570 85 RMFWTELGI-KPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 85 ~ly~td~~~-~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d 134 (713)
+|..+..++ .+.|...+.+ ..+.+........+.++++|++.+|..+.
T Consensus 114 ~l~~~g~~n~~G~l~~wd~~--~~~~i~~~~~~~~t~~~WsPdGr~~~ta~ 162 (194)
T PF08662_consen 114 FLVLAGFGNLNGDLEFWDVR--KKKKISTFEHSDATDVEWSPDGRYLATAT 162 (194)
T ss_pred EEEEEEccCCCcEEEEEECC--CCEEeeccccCcEEEEEEcCCCCEEEEEE
Confidence 888877553 2467777766 44444444445568889998777766655
No 147
>KOG0291|consensus
Probab=95.10 E-value=2.7 Score=46.90 Aligned_cols=74 Identities=7% Similarity=-0.025 Sum_probs=45.0
Q ss_pred CCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCC
Q psy6570 117 WPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 117 ~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~ 191 (713)
+-.-||+|+.+..+.-.+...-.|++-++.......+++........+.+...+..|+ +.....+|+..+.+..
T Consensus 437 QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~La-S~SWDkTVRiW~if~s 510 (893)
T KOG0291|consen 437 QFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLA-SGSWDKTVRIWDIFSS 510 (893)
T ss_pred eeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEE-eccccceEEEEEeecc
Confidence 3467899987777766666777788877765555555554332222233445555554 4445567887777655
No 148
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=95.07 E-value=0.49 Score=52.53 Aligned_cols=164 Identities=14% Similarity=0.092 Sum_probs=90.1
Q ss_pred cCCceeEEEccCcc-c-EEec--CCCCCceEEE---eccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceE
Q psy6570 5 SSGNVTRVKREMNL-K-TVLS--NLHDPRGVAV---DWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDI 77 (713)
Q Consensus 5 ~~~~I~~~~~~~~~-~-~~~~--~~~~p~gla~---D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~i 77 (713)
..|.|..|++.... . +... +--.=+++++ ++.+..+|.+|-+ ....|+++-.+......- .-...
T Consensus 219 ~~GwvvEvdp~~~~~~p~K~tAlGRf~HE~a~v~~~~~~~~vvY~gDD~--~~~~lYkFVs~~~~~~~~------~~~~~ 290 (524)
T PF05787_consen 219 RYGWVVEVDPFDPSSTPVKRTALGRFAHEAAAVVLADPGRVVVYMGDDG--RNGYLYKFVSDKPWDPGD------RAANR 290 (524)
T ss_pred ccceEEEeCCCCCCCCccceeecccccccceeEEeecCCeEEEEEEecC--CCCeEEEEecCCCCCCcc------cchhh
Confidence 34566677665421 1 1122 2223466777 7777789999873 456788777665432110 00001
Q ss_pred EEcCCCCcEEEEccCCCCeEEEEecCCC------------CcEE----------EEeCCCCCCeeEEEeCCCCeEEEEcC
Q psy6570 78 ALEPLSGRMFWTELGIKPRISGASIDGK------------NKFN----------LVDNNIQWPTGITIDYPSQRLYWADP 135 (713)
Q Consensus 78 avD~~~~~ly~td~~~~~~I~~~~~dG~------------~~~~----------l~~~~~~~p~glavd~~~~~LY~~d~ 135 (713)
.+. ..|.||++.+.....+.=+.|.-. ...+ +-.+.+.+|.+|++++.+++||++.+
T Consensus 291 ~ll-~~GtLyaak~~~~g~~~Wv~L~~~~~~l~~~~~~~~~a~v~~~tr~aA~~~GAT~f~RpEgi~~~p~~g~vY~a~T 369 (524)
T PF05787_consen 291 DLL-DEGTLYAAKFNQDGTGEWVPLGHGQGGLTAKNGFADQADVLIETRRAADAVGATPFDRPEGITVNPDDGEVYFALT 369 (524)
T ss_pred hhh-hCCEeceEEECCCCcEEEEECCCcccccccCCCCCChHHhhhhhhhccccCccccccCccCeeEeCCCCEEEEEEe
Confidence 111 356666665443323333322110 0000 11245889999999999999999965
Q ss_pred CCC-------------------cEEEEeCCCC-------ceeEEEec---------------CCCCccceeeeee-CCeE
Q psy6570 136 KAR-------------------TIESINLNGK-------DRFVVYHT---------------EDNGYKPYKLEVF-EDNL 173 (713)
Q Consensus 136 ~~~-------------------~I~~~~~~g~-------~~~~~~~~---------------~~~~~~p~~i~~~-~~~l 173 (713)
... .|+++..++. ...+++.. ...+..|..|.++ .++|
T Consensus 370 ~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~~~~~~~~f~sPDNL~~d~~G~L 449 (524)
T PF05787_consen 370 NNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGSNKCDDNGFASPDNLAFDPDGNL 449 (524)
T ss_pred cCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCcccccccccCcccCCCcCCCCceEECCCCCE
Confidence 433 7999988765 22333322 1235678888877 4466
Q ss_pred EEEe
Q psy6570 174 YFST 177 (713)
Q Consensus 174 y~td 177 (713)
|+..
T Consensus 450 wI~e 453 (524)
T PF05787_consen 450 WIQE 453 (524)
T ss_pred EEEe
Confidence 6654
No 149
>PTZ00421 coronin; Provisional
Probab=95.05 E-value=7.2 Score=43.28 Aligned_cols=159 Identities=9% Similarity=-0.012 Sum_probs=81.6
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN 106 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~ 106 (713)
....|++.+..+.++++-. ..+.|.+.++........+.........|++.+ ++.++++-.... .|...++....
T Consensus 127 ~V~~l~f~P~~~~iLaSgs---~DgtVrIWDl~tg~~~~~l~~h~~~V~sla~sp-dG~lLatgs~Dg-~IrIwD~rsg~ 201 (493)
T PTZ00421 127 KVGIVSFHPSAMNVLASAG---ADMVVNVWDVERGKAVEVIKCHSDQITSLEWNL-DGSLLCTTSKDK-KLNIIDPRDGT 201 (493)
T ss_pred cEEEEEeCcCCCCEEEEEe---CCCEEEEEECCCCeEEEEEcCCCCceEEEEEEC-CCCEEEEecCCC-EEEEEECCCCc
Confidence 3567888776656666655 567888988875443333333335577899987 455556554444 67777765332
Q ss_pred c-EEEEeCCCCCCeeEEEeCCCCeEEEEc---CCCCcEEEEeCCCCcee-EEEecCCCCccceeee-e--eCCeEEEEeC
Q psy6570 107 K-FNLVDNNIQWPTGITIDYPSQRLYWAD---PKARTIESINLNGKDRF-VVYHTEDNGYKPYKLE-V--FEDNLYFSTY 178 (713)
Q Consensus 107 ~-~~l~~~~~~~p~glavd~~~~~LY~~d---~~~~~I~~~~~~g~~~~-~~~~~~~~~~~p~~i~-~--~~~~ly~td~ 178 (713)
. ..+....-.....+.+.+..+.|+.+- ...+.|...|+...... .+.... ......+. + +...||.+..
T Consensus 202 ~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~~d--~~~~~~~~~~d~d~~~L~lggk 279 (493)
T PTZ00421 202 IVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLD--QSSALFIPFFDEDTNLLYIGSK 279 (493)
T ss_pred EEEEEecCCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeCCCCCCceeEeccC--CCCceEEEEEcCCCCEEEEEEe
Confidence 2 222111111223455555555555443 22355666666432211 111110 11112122 2 2445666554
Q ss_pred CCCcEEEEcccCCC
Q psy6570 179 RTNNILKINKFGNS 192 (713)
Q Consensus 179 ~~~~i~~~~~~~~~ 192 (713)
+.+.|+.++...+.
T Consensus 280 gDg~Iriwdl~~~~ 293 (493)
T PTZ00421 280 GEGNIRCFELMNER 293 (493)
T ss_pred CCCeEEEEEeeCCc
Confidence 56677777765544
No 150
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=94.92 E-value=1.7 Score=46.90 Aligned_cols=134 Identities=15% Similarity=0.184 Sum_probs=68.6
Q ss_pred CCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCC-C
Q psy6570 37 GKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNN-I 115 (713)
Q Consensus 37 ~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~-~ 115 (713)
++.||++.. .+.+..+++.... ++....+..+..++++ +++||+.+.. ++|+.++..... .+.... +
T Consensus 256 ~~~vy~~~~----~g~l~ald~~tG~--~~W~~~~~~~~~~~~~--~~~vy~~~~~--g~l~ald~~tG~--~~W~~~~~ 323 (394)
T PRK11138 256 GGVVYALAY----NGNLVALDLRSGQ--IVWKREYGSVNDFAVD--GGRIYLVDQN--DRVYALDTRGGV--ELWSQSDL 323 (394)
T ss_pred CCEEEEEEc----CCeEEEEECCCCC--EEEeecCCCccCcEEE--CCEEEEEcCC--CeEEEEECCCCc--EEEccccc
Confidence 466777654 3566666654322 2233333444556664 6788888743 268877775332 122111 1
Q ss_pred --CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEc
Q psy6570 116 --QWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKIN 187 (713)
Q Consensus 116 --~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~ 187 (713)
......++. +++||+.+. .+.|+.++.+......-..... ........+.+++||+.+. .+.|+.++
T Consensus 324 ~~~~~~sp~v~--~g~l~v~~~-~G~l~~ld~~tG~~~~~~~~~~-~~~~s~P~~~~~~l~v~t~-~G~l~~~~ 392 (394)
T PRK11138 324 LHRLLTAPVLY--NGYLVVGDS-EGYLHWINREDGRFVAQQKVDS-SGFLSEPVVADDKLLIQAR-DGTVYAIT 392 (394)
T ss_pred CCCcccCCEEE--CCEEEEEeC-CCEEEEEECCCCCEEEEEEcCC-CcceeCCEEECCEEEEEeC-CceEEEEe
Confidence 111233343 688988874 4678888864332211111110 0111223356789999864 45666654
No 151
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=94.89 E-value=1.2 Score=45.86 Aligned_cols=137 Identities=17% Similarity=0.225 Sum_probs=82.5
Q ss_pred cCCceeEEEccC--cccEE----ecCCCCC--ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCC-----
Q psy6570 5 SSGNVTRVKREM--NLKTV----LSNLHDP--RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGL----- 71 (713)
Q Consensus 5 ~~~~I~~~~~~~--~~~~~----~~~~~~p--~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~----- 71 (713)
.+|++..+.++. +.... ......| .--+++..++++||.-. .+.|+.+++.|...+..-.-.+
T Consensus 155 ~DGsl~~v~Ld~~Gk~~~~~t~~F~~~~dp~f~~~~~~~~~~~~~F~Sy----~G~v~~~dlsg~~~~~~~~~~~~t~~e 230 (342)
T PF06433_consen 155 GDGSLLTVTLDADGKEAQKSTKVFDPDDDPLFEHPAYSRDGGRLYFVSY----EGNVYSADLSGDSAKFGKPWSLLTDAE 230 (342)
T ss_dssp TTSCEEEEEETSTSSEEEEEEEESSTTTS-B-S--EEETTTTEEEEEBT----TSEEEEEEETTSSEEEEEEEESS-HHH
T ss_pred cCCceEEEEECCCCCEeEeeccccCCCCcccccccceECCCCeEEEEec----CCEEEEEeccCCcccccCcccccCccc
Confidence 367888887773 32211 2222222 22334456678888765 7899999999987655432111
Q ss_pred ----CCcce---EEEcCCCCcEEEEccC----CC----CeEEEEecCCCCcEEEEeCCCCCC-eeEEEeCCCC-eEEEEc
Q psy6570 72 ----NEPYD---IALEPLSGRMFWTELG----IK----PRISGASIDGKNKFNLVDNNIQWP-TGITIDYPSQ-RLYWAD 134 (713)
Q Consensus 72 ----~~p~~---iavD~~~~~ly~td~~----~~----~~I~~~~~dG~~~~~l~~~~~~~p-~glavd~~~~-~LY~~d 134 (713)
=+|.| +|+++..++||+.... .+ ..||++++....|..-+ .+..| .+|+|..+.+ +||-++
T Consensus 231 ~~~~WrPGG~Q~~A~~~~~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri--~l~~~~~Si~Vsqd~~P~L~~~~ 308 (342)
T PF06433_consen 231 KADGWRPGGWQLIAYHAASGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARI--PLEHPIDSIAVSQDDKPLLYALS 308 (342)
T ss_dssp HHTTEEE-SSS-EEEETTTTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEE--EEEEEESEEEEESSSS-EEEEEE
T ss_pred cccCcCCcceeeeeeccccCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEE--eCCCccceEEEccCCCcEEEEEc
Confidence 12443 8999999999997421 11 36888888765543333 23223 3788876655 677677
Q ss_pred CCCCcEEEEeCCC
Q psy6570 135 PKARTIESINLNG 147 (713)
Q Consensus 135 ~~~~~I~~~~~~g 147 (713)
...+.|..+|...
T Consensus 309 ~~~~~l~v~D~~t 321 (342)
T PF06433_consen 309 AGDGTLDVYDAAT 321 (342)
T ss_dssp TTTTEEEEEETTT
T ss_pred CCCCeEEEEeCcC
Confidence 7778888888643
No 152
>KOG1446|consensus
Probab=94.85 E-value=4.8 Score=40.29 Aligned_cols=182 Identities=13% Similarity=0.070 Sum_probs=99.5
Q ss_pred cCCceeEEEccCc--ccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCC
Q psy6570 5 SSGNVTRVKREMN--LKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPL 82 (713)
Q Consensus 5 ~~~~I~~~~~~~~--~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~ 82 (713)
.++.|+.++.... .+++...-..+.-+.|-.....+..+-. +....|...++..+.....+..--...+.|.+.|.
T Consensus 34 ~dDsl~LYd~~~g~~~~ti~skkyG~~~~~Fth~~~~~i~sSt--k~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~ 111 (311)
T KOG1446|consen 34 EDDSLRLYDSLSGKQVKTINSKKYGVDLACFTHHSNTVIHSST--KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPK 111 (311)
T ss_pred CCCeEEEEEcCCCceeeEeecccccccEEEEecCCceEEEccC--CCCCceEEEEeecCceEEEcCCCCceEEEEEecCC
Confidence 3455666654432 2333333344555555443333322222 24567888887654433333333477889999997
Q ss_pred CCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC----CCceeEEEecCC
Q psy6570 83 SGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLN----GKDRFVVYHTED 158 (713)
Q Consensus 83 ~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~----g~~~~~~~~~~~ 158 (713)
+ ..|.+.+..+ .|..-++.-..-+.++ .+..+.-.|+|| ++.||.+-.+...|..+|+. |...+..+....
T Consensus 112 ~-d~FlS~S~D~-tvrLWDlR~~~cqg~l--~~~~~pi~AfDp-~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~ 186 (311)
T KOG1446|consen 112 D-DTFLSSSLDK-TVRLWDLRVKKCQGLL--NLSGRPIAAFDP-EGLIFALANGSELIKLYDLRSFDKGPFTTFSITDND 186 (311)
T ss_pred C-CeEEecccCC-eEEeeEecCCCCceEE--ecCCCcceeECC-CCcEEEEecCCCeEEEEEecccCCCCceeEccCCCC
Confidence 6 7888776555 6666666655555554 344556789995 66777766666688887753 222222222111
Q ss_pred CCccceeee--eeCCeEEEEeCCCCcEEEEcccCCCcce
Q psy6570 159 NGYKPYKLE--VFEDNLYFSTYRTNNILKINKFGNSDFN 195 (713)
Q Consensus 159 ~~~~p~~i~--~~~~~ly~td~~~~~i~~~~~~~~~~~~ 195 (713)
...-..|. .++..|.++. ..+.++.++.+.|....
T Consensus 187 -~~ew~~l~FS~dGK~iLlsT-~~s~~~~lDAf~G~~~~ 223 (311)
T KOG1446|consen 187 -EAEWTDLEFSPDGKSILLST-NASFIYLLDAFDGTVKS 223 (311)
T ss_pred -ccceeeeEEcCCCCEEEEEe-CCCcEEEEEccCCcEee
Confidence 12223333 3455566654 44567777777665433
No 153
>smart00051 DSL delta serrate ligand.
Probab=94.75 E-value=0.038 Score=41.72 Aligned_cols=47 Identities=21% Similarity=0.416 Sum_probs=34.0
Q ss_pred ceeeCCCCCcCCCCCcCCCCCCCCCCCCCCCeEecCCCCcceeecCCCcccCCC
Q psy6570 573 PSCKCLPPYSGKQCTEREDSPSCHNYCDNAGLCSYSKQGKPVCTCVNGWSGITC 626 (713)
Q Consensus 573 ~~C~C~~G~~G~~C~~~~~~~~C~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C 626 (713)
+.=.|.++|.|..|.... .+.+.+.++..|... ..|+|.+||+|..|
T Consensus 17 ~rv~C~~~~yG~~C~~~C---~~~~d~~~~~~Cd~~----G~~~C~~Gw~G~~C 63 (63)
T smart00051 17 IRVTCDENYYGEGCNKFC---RPRDDFFGHYTCDEN----GNKGCLEGWMGPYC 63 (63)
T ss_pred EEeeCCCCCcCCccCCEe---CcCccccCCccCCcC----CCEecCCCCcCCCC
Confidence 456799999999997541 123446677888532 45999999999876
No 154
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=94.72 E-value=0.026 Score=33.18 Aligned_cols=18 Identities=33% Similarity=1.008 Sum_probs=15.1
Q ss_pred CceeeCCCCCc----CCCCCcC
Q psy6570 572 KPSCKCLPPYS----GKQCTER 589 (713)
Q Consensus 572 ~~~C~C~~G~~----G~~C~~~ 589 (713)
+|+|.|++||. |+.|+++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DI 22 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDI 22 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccC
Confidence 58999999997 6678775
No 155
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=94.71 E-value=1.5 Score=47.54 Aligned_cols=136 Identities=13% Similarity=0.149 Sum_probs=83.2
Q ss_pred CceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
.+|+..++++.. .++++.-..-...++-+.+.+|.++.... ..-.|++++++++....|.... ..-..=.+.|....
T Consensus 218 ~~i~~~~l~~g~~~~i~~~~g~~~~P~fspDG~~l~f~~~rd-g~~~iy~~dl~~~~~~~Lt~~~-gi~~~Ps~spdG~~ 295 (425)
T COG0823 218 PRIYYLDLNTGKRPVILNFNGNNGAPAFSPDGSKLAFSSSRD-GSPDIYLMDLDGKNLPRLTNGF-GINTSPSWSPDGSK 295 (425)
T ss_pred ceEEEEeccCCccceeeccCCccCCccCCCCCCEEEEEECCC-CCccEEEEcCCCCcceecccCC-ccccCccCCCCCCE
Confidence 457888888644 44445444445566777777877776532 4567999999998866644321 11113334455556
Q ss_pred EEE-EccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCc--EEEEeC
Q psy6570 86 MFW-TELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKART--IESINL 145 (713)
Q Consensus 86 ly~-td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~--I~~~~~ 145 (713)
|+| +|.+..++|++++++|+..+.+... ......-.+++++++|-+.....+. |...++
T Consensus 296 ivf~Sdr~G~p~I~~~~~~g~~~~riT~~-~~~~~~p~~SpdG~~i~~~~~~~g~~~i~~~~~ 357 (425)
T COG0823 296 IVFTSDRGGRPQIYLYDLEGSQVTRLTFS-GGGNSNPVWSPDGDKIVFESSSGGQWDIDKNDL 357 (425)
T ss_pred EEEEeCCCCCcceEEECCCCCceeEeecc-CCCCcCccCCCCCCEEEEEeccCCceeeEEecc
Confidence 555 5666667999999999988666542 3333355677667766665533333 444444
No 156
>KOG4378|consensus
Probab=94.71 E-value=1.5 Score=46.37 Aligned_cols=173 Identities=8% Similarity=0.041 Sum_probs=102.8
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN 106 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~ 106 (713)
..++.+.|... |++..+ ..+.|.+..+..+.+.+-+..+ -...+-|-+.+..+.|..+-.... .|..-+..|..
T Consensus 124 vt~v~YN~~De--yiAsvs--~gGdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~G-~VtlwDv~g~s 198 (673)
T KOG4378|consen 124 VTYVDYNNTDE--YIASVS--DGGDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDKG-AVTLWDVQGMS 198 (673)
T ss_pred eEEEEecCCcc--eeEEec--cCCcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccCC-eEEEEeccCCC
Confidence 34555555443 333321 3456666666655544333322 222345667777777777654433 67777777765
Q ss_pred cEEEEeCC-CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccce-eeeee-CCeEEEEeCCCCcE
Q psy6570 107 KFNLVDNN-IQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPY-KLEVF-EDNLYFSTYRTNNI 183 (713)
Q Consensus 107 ~~~l~~~~-~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~-~i~~~-~~~ly~td~~~~~i 183 (713)
..--+... .....||.+.|.+..|+++-....+|+.+|........-+. ..+|+ .+++. .+.++.+-..+++|
T Consensus 199 p~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~s~~~l~----y~~Plstvaf~~~G~~L~aG~s~G~~ 274 (673)
T KOG4378|consen 199 PIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQASTDRLT----YSHPLSTVAFSECGTYLCAGNSKGEL 274 (673)
T ss_pred cccchhhhccCCcCcceecCCccceEEEecccceEEEeecccccccceee----ecCCcceeeecCCceEEEeecCCceE
Confidence 54433322 33347999999999999999888999998875432211111 23554 45554 45666777788999
Q ss_pred EEEcccCCC-cceeeeccccccccEEE
Q psy6570 184 LKINKFGNS-DFNVLANNLNRASDVLI 209 (713)
Q Consensus 184 ~~~~~~~~~-~~~~~~~~~~~~~~i~v 209 (713)
+.++..+.. +++++...-..++.|++
T Consensus 275 i~YD~R~~k~Pv~v~sah~~sVt~vaf 301 (673)
T KOG4378|consen 275 IAYDMRSTKAPVAVRSAHDASVTRVAF 301 (673)
T ss_pred EEEecccCCCCceEeeecccceeEEEe
Confidence 999987653 45555554444555543
No 157
>KOG0273|consensus
Probab=94.69 E-value=2.6 Score=44.42 Aligned_cols=178 Identities=9% Similarity=0.005 Sum_probs=96.2
Q ss_pred EEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEE
Q psy6570 31 VAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNL 110 (713)
Q Consensus 31 la~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l 110 (713)
+++-|...--|+.-.+ ..+++...+.-....++.+ .+...-++.|||.+..=|.+..-.. .|++..++++....-
T Consensus 280 ~slKWnk~G~yilS~~--vD~ttilwd~~~g~~~q~f--~~~s~~~lDVdW~~~~~F~ts~td~-~i~V~kv~~~~P~~t 354 (524)
T KOG0273|consen 280 FSLKWNKKGTYILSGG--VDGTTILWDAHTGTVKQQF--EFHSAPALDVDWQSNDEFATSSTDG-CIHVCKVGEDRPVKT 354 (524)
T ss_pred EEEEEcCCCCEEEecc--CCccEEEEeccCceEEEee--eeccCCccceEEecCceEeecCCCc-eEEEEEecCCCccee
Confidence 4444544444443332 3445555554332333322 1333347889998888888765433 789888888776655
Q ss_pred EeCCCCCCeeEEEeCCCCeEEEE-cCCCCcEEEEeCCCCc-------eeEEEecCCCCccceeeeeeCC-eEEEEeCCCC
Q psy6570 111 VDNNIQWPTGITIDYPSQRLYWA-DPKARTIESINLNGKD-------RFVVYHTEDNGYKPYKLEVFED-NLYFSTYRTN 181 (713)
Q Consensus 111 ~~~~~~~p~glavd~~~~~LY~~-d~~~~~I~~~~~~g~~-------~~~~~~~~~~~~~p~~i~~~~~-~ly~td~~~~ 181 (713)
+...-+..++|-+++....|--+ |-.+-+||.+..++.. +.+.... -....|..=-...+ .|.|+ ....
T Consensus 355 ~~GH~g~V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~~~~~l~~Hskei~t~~-wsp~g~v~~n~~~~~~l~sa-s~ds 432 (524)
T KOG0273|consen 355 FIGHHGEVNALKWNPTGSLLASCSDDGTLKIWSMGQSNSVHDLQAHSKEIYTIK-WSPTGPVTSNPNMNLMLASA-SFDS 432 (524)
T ss_pred eecccCceEEEEECCCCceEEEecCCCeeEeeecCCCcchhhhhhhccceeeEe-ecCCCCccCCCcCCceEEEe-ecCC
Confidence 54466677899999655444433 3345567765544321 1111100 00111111111122 33443 4556
Q ss_pred cEEEEcccCCCcceeeeccccccccEEEEeeccc
Q psy6570 182 NILKINKFGNSDFNVLANNLNRASDVLILQENKQ 215 (713)
Q Consensus 182 ~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q 215 (713)
.|...+...+.....+..+...++.++..+..++
T Consensus 433 tV~lwdv~~gv~i~~f~kH~~pVysvafS~~g~y 466 (524)
T KOG0273|consen 433 TVKLWDVESGVPIHTLMKHQEPVYSVAFSPNGRY 466 (524)
T ss_pred eEEEEEccCCceeEeeccCCCceEEEEecCCCcE
Confidence 6777777777777777777777777777666555
No 158
>KOG0268|consensus
Probab=94.63 E-value=0.33 Score=49.07 Aligned_cols=207 Identities=10% Similarity=0.061 Sum_probs=113.3
Q ss_pred ccCCceeEEEccCcc--cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-----------
Q psy6570 4 ISSGNVTRVKREMNL--KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG----------- 70 (713)
Q Consensus 4 ~~~~~I~~~~~~~~~--~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~----------- 70 (713)
..+|.|...++.... .++....+-..||.+|. +..+++.|. ..|-+..++|....++....
T Consensus 86 s~DG~VkiWnlsqR~~~~~f~AH~G~V~Gi~v~~-~~~~tvgdD-----KtvK~wk~~~~p~~tilg~s~~~gIdh~~~~ 159 (433)
T KOG0268|consen 86 SCDGEVKIWNLSQRECIRTFKAHEGLVRGICVTQ-TSFFTVGDD-----KTVKQWKIDGPPLHTILGKSVYLGIDHHRKN 159 (433)
T ss_pred ccCceEEEEehhhhhhhheeecccCceeeEEecc-cceEEecCC-----cceeeeeccCCcceeeecccccccccccccc
Confidence 457788888888754 44445555678999985 677777765 33333333333333322110
Q ss_pred ---------------------------CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC-cEEEEeCCCCCCeeEE
Q psy6570 71 ---------------------------LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN-KFNLVDNNIQWPTGIT 122 (713)
Q Consensus 71 ---------------------------~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~-~~~l~~~~~~~p~gla 122 (713)
......+.+.|..-.|..+-.... .|...++.-.. .+.++. -..+++|+
T Consensus 160 ~~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~svkfNpvETsILas~~sDr-sIvLyD~R~~~Pl~KVi~--~mRTN~Is 236 (433)
T KOG0268|consen 160 SVFATCGEQIDIWDEQRDNPVSSMSWGADSISSVKFNPVETSILASCASDR-SIVLYDLRQASPLKKVIL--TMRTNTIC 236 (433)
T ss_pred ccccccCceeeecccccCCccceeecCCCceeEEecCCCcchheeeeccCC-ceEEEecccCCccceeee--ecccccee
Confidence 111223333333333333332222 34444443222 222221 23579999
Q ss_pred EeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceee-ecc
Q psy6570 123 IDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVL-ANN 200 (713)
Q Consensus 123 vd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~-~~~ 200 (713)
+.| +...|++-.....++.+|+---.+.+-... ......+.+++. -+.-|++......|..+....+....+. ...
T Consensus 237 wnP-eafnF~~a~ED~nlY~~DmR~l~~p~~v~~-dhvsAV~dVdfsptG~EfvsgsyDksIRIf~~~~~~SRdiYhtkR 314 (433)
T KOG0268|consen 237 WNP-EAFNFVAANEDHNLYTYDMRNLSRPLNVHK-DHVSAVMDVDFSPTGQEFVSGSYDKSIRIFPVNHGHSRDIYHTKR 314 (433)
T ss_pred cCc-cccceeeccccccceehhhhhhcccchhhc-ccceeEEEeccCCCcchhccccccceEEEeecCCCcchhhhhHhh
Confidence 998 899999988888899988754332221111 112333444443 4556788777777777766555444433 334
Q ss_pred ccccccEEEEeeccccCCccC
Q psy6570 201 LNRASDVLILQENKQAHNVTN 221 (713)
Q Consensus 201 ~~~~~~i~v~~~~~q~~~~~~ 221 (713)
..+++.++..+++++.-+.+|
T Consensus 315 Mq~V~~Vk~S~Dskyi~SGSd 335 (433)
T KOG0268|consen 315 MQHVFCVKYSMDSKYIISGSD 335 (433)
T ss_pred hheeeEEEEeccccEEEecCC
Confidence 666777777777776544444
No 159
>KOG0285|consensus
Probab=94.56 E-value=1.4 Score=44.74 Aligned_cols=169 Identities=11% Similarity=0.094 Sum_probs=97.8
Q ss_pred EecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEE
Q psy6570 21 VLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGA 100 (713)
Q Consensus 21 ~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~ 100 (713)
+.-.++..+.|||||. +..|.+-+ ....|-+.|+.....+..+..-....++++|.+..-+||-+..+ . .|.--
T Consensus 147 i~gHlgWVr~vavdP~-n~wf~tgs---~DrtikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~ged-k-~VKCw 220 (460)
T KOG0285|consen 147 ISGHLGWVRSVAVDPG-NEWFATGS---ADRTIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAGED-K-QVKCW 220 (460)
T ss_pred hhhccceEEEEeeCCC-ceeEEecC---CCceeEEEEcccCeEEEeecchhheeeeeeecccCceEEEecCC-C-eeEEE
Confidence 3345677899999987 66777766 67788899988766655554456789999998877777766433 2 45555
Q ss_pred ecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcccee-eeee--CCeEEEEe
Q psy6570 101 SIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYK-LEVF--EDNLYFST 177 (713)
Q Consensus 101 ~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~-i~~~--~~~ly~td 177 (713)
++.-.....-....+..-..|++.|.-+.|+ +-.....|+.-|+.......++.. ...|.. +... +..|| +-
T Consensus 221 DLe~nkvIR~YhGHlS~V~~L~lhPTldvl~-t~grDst~RvWDiRtr~~V~~l~G---H~~~V~~V~~~~~dpqvi-t~ 295 (460)
T KOG0285|consen 221 DLEYNKVIRHYHGHLSGVYCLDLHPTLDVLV-TGGRDSTIRVWDIRTRASVHVLSG---HTNPVASVMCQPTDPQVI-TG 295 (460)
T ss_pred echhhhhHHHhccccceeEEEeccccceeEE-ecCCcceEEEeeecccceEEEecC---CCCcceeEEeecCCCceE-Ee
Confidence 6543322222233556667788887665555 443444444444433333222222 223322 2221 34444 33
Q ss_pred CCCCcEEEEcccCCCcceeeecc
Q psy6570 178 YRTNNILKINKFGNSDFNVLANN 200 (713)
Q Consensus 178 ~~~~~i~~~~~~~~~~~~~~~~~ 200 (713)
.....|.-.+...+..+..+...
T Consensus 296 S~D~tvrlWDl~agkt~~tlt~h 318 (460)
T KOG0285|consen 296 SHDSTVRLWDLRAGKTMITLTHH 318 (460)
T ss_pred cCCceEEEeeeccCceeEeeecc
Confidence 44556766676666665555444
No 160
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=94.48 E-value=2.8 Score=40.69 Aligned_cols=112 Identities=16% Similarity=0.138 Sum_probs=59.6
Q ss_pred ecCCCCCceEEEe--ccCCeEEEeecCCCCCCeEEEEec----CCceEEEEEcCC--CCCcceEEEcCCCCcEEEEccCC
Q psy6570 22 LSNLHDPRGVAVD--WVGKNLYWTDAGGRSSNNIMVSTL----EGRKKRTLLNTG--LNEPYDIALEPLSGRMFWTELGI 93 (713)
Q Consensus 22 ~~~~~~p~gla~D--~~~~~ly~td~~~~~~~~I~~~~~----~G~~~~~l~~~~--~~~p~~iavD~~~~~ly~td~~~ 93 (713)
.+.+..|.||++- ++++.+|+--.+ ..+.|..+.+ +|+....+++.- -.+-.||++|...|.||.++...
T Consensus 149 ss~~s~~YGl~lyrs~ktgd~yvfV~~--~qG~~~Qy~l~d~gnGkv~~k~vR~fk~~tQTEG~VaDdEtG~LYIaeEdv 226 (364)
T COG4247 149 SSSSSSAYGLALYRSPKTGDYYVFVNR--RQGDIAQYKLIDQGNGKVGTKLVRQFKIPTQTEGMVADDETGFLYIAEEDV 226 (364)
T ss_pred ccCcccceeeEEEecCCcCcEEEEEec--CCCceeEEEEEecCCceEcceeeEeeecCCcccceeeccccceEEEeeccc
Confidence 4567778888875 344444443221 2355554443 344444444321 24667999999999999998543
Q ss_pred CCeEEEEecC--CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCC
Q psy6570 94 KPRISGASID--GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKAR 138 (713)
Q Consensus 94 ~~~I~~~~~d--G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~ 138 (713)
.||+...+ +.....++. .+.--..|+=|...-.||+...+.+
T Consensus 227 --aiWK~~Aep~~G~~g~~id-r~~d~~~LtdDvEGltiYy~pnGkG 270 (364)
T COG4247 227 --AIWKYEAEPNRGNTGRLID-RIKDLSYLTDDVEGLTIYYGPNGKG 270 (364)
T ss_pred --eeeecccCCCCCCccchhh-hhcCchhhcccccccEEEEcCCCcE
Confidence 58887654 222222221 1111133555544445666554433
No 161
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=94.35 E-value=2.4 Score=44.50 Aligned_cols=144 Identities=12% Similarity=0.132 Sum_probs=75.3
Q ss_pred CCceeEEEccCcccEEecCC--CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCC-cce-EEEcC
Q psy6570 6 SGNVTRVKREMNLKTVLSNL--HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNE-PYD-IALEP 81 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~--~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~-p~~-iavD~ 81 (713)
...++.+++.+.+.+=++.. ....|..+-+.++.||+... ...|.++++++...++|....-.. ..+ .+++.
T Consensus 59 ~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~----~~~l~~vdL~T~e~~~vy~~p~~~~g~gt~v~n~ 134 (386)
T PF14583_consen 59 NRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKN----GRSLRRVDLDTLEERVVYEVPDDWKGYGTWVANS 134 (386)
T ss_dssp S-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEET----TTEEEEEETTT--EEEEEE--TTEEEEEEEEE-T
T ss_pred CcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEEC----CCeEEEEECCcCcEEEEEECCcccccccceeeCC
Confidence 45677888887543223322 23345666678888876654 368899999988776665433111 112 33343
Q ss_pred CCCcEEEE------cc---------------CCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCe-EEEEc-----
Q psy6570 82 LSGRMFWT------EL---------------GIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQR-LYWAD----- 134 (713)
Q Consensus 82 ~~~~ly~t------d~---------------~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~-LY~~d----- 134 (713)
++.+++. |+ ....+|.++++.+..+++++... .+-.-+-+.|.... |-+..
T Consensus 135 -d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~~~-~wlgH~~fsP~dp~li~fCHEGpw~ 212 (386)
T PF14583_consen 135 -DCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFEDT-DWLGHVQFSPTDPTLIMFCHEGPWD 212 (386)
T ss_dssp -TSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEEES-S-EEEEEEETTEEEEEEEEE-S-TT
T ss_pred -CccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEecC-ccccCcccCCCCCCEEEEeccCCcc
Confidence 3444432 11 11247999999988888887643 34456777766543 44432
Q ss_pred CCCCcEEEEeCCCCceeEEEe
Q psy6570 135 PKARTIESINLNGKDRFVVYH 155 (713)
Q Consensus 135 ~~~~~I~~~~~~g~~~~~~~~ 155 (713)
....|||.++.||++...+..
T Consensus 213 ~Vd~RiW~i~~dg~~~~~v~~ 233 (386)
T PF14583_consen 213 LVDQRIWTINTDGSNVKKVHR 233 (386)
T ss_dssp TSS-SEEEEETTS---EESS-
T ss_pred eeceEEEEEEcCCCcceeeec
Confidence 234699999999998877753
No 162
>PLN00181 protein SPA1-RELATED; Provisional
Probab=94.23 E-value=11 Score=45.00 Aligned_cols=153 Identities=8% Similarity=-0.026 Sum_probs=83.1
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNK 107 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~ 107 (713)
..++++.+..+.+..+-. ..+.|.+.++........+.........|++++.++.++++-.... .|...++.....
T Consensus 535 v~~l~~~~~~~~~las~~---~Dg~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg-~v~iWd~~~~~~ 610 (793)
T PLN00181 535 LSGICWNSYIKSQVASSN---FEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDG-SVKLWSINQGVS 610 (793)
T ss_pred eeeEEeccCCCCEEEEEe---CCCeEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCC-EEEEEECCCCcE
Confidence 455666544334334443 4678888887754433333233355778999877777777765444 677767654333
Q ss_pred EEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCce--eEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEE
Q psy6570 108 FNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDR--FVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILK 185 (713)
Q Consensus 108 ~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~--~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~ 185 (713)
...+.. ......+++.+..+.++++-...+.|...++..... ..+... ...-..+.+..+..+++-...+.|..
T Consensus 611 ~~~~~~-~~~v~~v~~~~~~g~~latgs~dg~I~iwD~~~~~~~~~~~~~h---~~~V~~v~f~~~~~lvs~s~D~~iki 686 (793)
T PLN00181 611 IGTIKT-KANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGH---SKTVSYVRFVDSSTLVSSSTDNTLKL 686 (793)
T ss_pred EEEEec-CCCeEEEEEeCCCCCEEEEEeCCCeEEEEECCCCCccceEecCC---CCCEEEEEEeCCCEEEEEECCCEEEE
Confidence 222222 234456677555666666666777888888754321 122111 11223444444445555555556665
Q ss_pred Ecc
Q psy6570 186 INK 188 (713)
Q Consensus 186 ~~~ 188 (713)
++.
T Consensus 687 Wd~ 689 (793)
T PLN00181 687 WDL 689 (793)
T ss_pred EeC
Confidence 554
No 163
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=94.19 E-value=0.14 Score=41.35 Aligned_cols=49 Identities=27% Similarity=0.299 Sum_probs=38.4
Q ss_pred CceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC
Q psy6570 7 GNVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE 59 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~ 59 (713)
+.|.-++.. +.++++.++..|.||++|+.++.||+++. ..+.|.++..+
T Consensus 36 ~~Vvyyd~~-~~~~va~g~~~aNGI~~s~~~k~lyVa~~---~~~~I~vy~~~ 84 (86)
T PF01731_consen 36 GNVVYYDGK-EVKVVASGFSFANGIAISPDKKYLYVASS---LAHSIHVYKRH 84 (86)
T ss_pred ceEEEEeCC-EeEEeeccCCCCceEEEcCCCCEEEEEec---cCCeEEEEEec
Confidence 444445543 34677899999999999999999999999 77888887653
No 164
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=94.19 E-value=6.6 Score=39.98 Aligned_cols=178 Identities=10% Similarity=0.072 Sum_probs=109.4
Q ss_pred ccCCceeEEEccCcccEE---ecCCCCCceEEEeccCCeEEEeecCCCC--------------CCeEEEE-ecCCceEEE
Q psy6570 4 ISSGNVTRVKREMNLKTV---LSNLHDPRGVAVDWVGKNLYWTDAGGRS--------------SNNIMVS-TLEGRKKRT 65 (713)
Q Consensus 4 ~~~~~I~~~~~~~~~~~~---~~~~~~p~gla~D~~~~~ly~td~~~~~--------------~~~I~~~-~~~G~~~~~ 65 (713)
...|.|-..+.....+.+ .+.-..|+-|.+.+.+..|.|++.+... .-.|.++ ..+|+....
T Consensus 74 ~g~G~IgVyd~~~~~~ri~E~~s~GIGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q 153 (305)
T PF07433_consen 74 TGRGVIGVYDAARGYRRIGEFPSHGIGPHELLLMPDGETLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQ 153 (305)
T ss_pred CCcEEEEEEECcCCcEEEeEecCCCcChhhEEEcCCCCEEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeee
Confidence 346677777777433333 4566679999999988899999876521 1223344 445655544
Q ss_pred EEc-CC--CCCcceEEEcCCCCcEEEEccCC------CCeEEEEecCCCCcEEEEeC------C-CCCCeeEEEeCCCCe
Q psy6570 66 LLN-TG--LNEPYDIALEPLSGRMFWTELGI------KPRISGASIDGKNKFNLVDN------N-IQWPTGITIDYPSQR 129 (713)
Q Consensus 66 l~~-~~--~~~p~~iavD~~~~~ly~td~~~------~~~I~~~~~dG~~~~~l~~~------~-~~~p~glavd~~~~~ 129 (713)
... .. ....+-|+++. +|.+.+..... .+-|...+.++. . .++.. . -++.-.||++...+.
T Consensus 154 ~~Lp~~~~~lSiRHLa~~~-~G~V~~a~Q~qg~~~~~~PLva~~~~g~~-~-~~~~~p~~~~~~l~~Y~gSIa~~~~g~~ 230 (305)
T PF07433_consen 154 VELPPDLHQLSIRHLAVDG-DGTVAFAMQYQGDPGDAPPLVALHRRGGA-L-RLLPAPEEQWRRLNGYIGSIAADRDGRL 230 (305)
T ss_pred eecCccccccceeeEEecC-CCcEEEEEecCCCCCccCCeEEEEcCCCc-c-eeccCChHHHHhhCCceEEEEEeCCCCE
Confidence 322 11 23578899996 57776664221 233444444433 2 22221 1 256788999988888
Q ss_pred EEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEccc
Q psy6570 130 LYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 130 LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~ 189 (713)
|.++-+..+++..++.+.... +... .+...-||+...+.+.|+ .+.+.++++...
T Consensus 231 ia~tsPrGg~~~~~d~~tg~~--~~~~--~l~D~cGva~~~~~f~~s-sG~G~~~~~~~~ 285 (305)
T PF07433_consen 231 IAVTSPRGGRVAVWDAATGRL--LGSV--PLPDACGVAPTDDGFLVS-SGQGQLIRLSPD 285 (305)
T ss_pred EEEECCCCCEEEEEECCCCCE--eecc--ccCceeeeeecCCceEEe-CCCccEEEccCc
Confidence 999999999999986543322 2211 366778899886664444 566777777654
No 165
>KOG4649|consensus
Probab=94.19 E-value=3.3 Score=40.43 Aligned_cols=67 Identities=21% Similarity=0.185 Sum_probs=44.4
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
+-..-.-+|+|+.+++|||-.. -..||.-.- .++ .+ -+++--.++.|||-+..+...+|.+..-
T Consensus 29 gSHs~~~~avd~~sG~~~We~i---lg~RiE~sa-------~vv-gd-----fVV~GCy~g~lYfl~~~tGs~~w~f~~~ 92 (354)
T KOG4649|consen 29 GSHSGIVIAVDPQSGNLIWEAI---LGVRIECSA-------IVV-GD-----FVVLGCYSGGLYFLCVKTGSQIWNFVIL 92 (354)
T ss_pred ecCCceEEEecCCCCcEEeehh---hCceeeeee-------EEE-CC-----EEEEEEccCcEEEEEecchhheeeeeeh
Confidence 3344456899999999999776 445554321 112 11 1666668899999998877678877655
Q ss_pred CCC
Q psy6570 104 GKN 106 (713)
Q Consensus 104 G~~ 106 (713)
++.
T Consensus 93 ~~v 95 (354)
T KOG4649|consen 93 ETV 95 (354)
T ss_pred hhh
Confidence 543
No 166
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=93.93 E-value=4.5 Score=43.28 Aligned_cols=132 Identities=13% Similarity=0.107 Sum_probs=64.9
Q ss_pred CCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC-CC
Q psy6570 37 GKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN-NI 115 (713)
Q Consensus 37 ~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~-~~ 115 (713)
++.||++.. .+.+..+++.... ++..........++++ +++||+.+.. +.|+.++++.... +... .+
T Consensus 241 ~~~vy~~~~----~g~l~a~d~~tG~--~~W~~~~~~~~~p~~~--~~~vyv~~~~--G~l~~~d~~tG~~--~W~~~~~ 308 (377)
T TIGR03300 241 GGQVYAVSY----QGRVAALDLRSGR--VLWKRDASSYQGPAVD--DNRLYVTDAD--GVVVALDRRSGSE--LWKNDEL 308 (377)
T ss_pred CCEEEEEEc----CCEEEEEECCCCc--EEEeeccCCccCceEe--CCEEEEECCC--CeEEEEECCCCcE--EEccccc
Confidence 456676654 4566667764322 1222222334455554 6788887632 3688887753221 1111 11
Q ss_pred C--CCeeEEEeCCCCeEEEEcCCCCcEEEEeCC-CCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEE
Q psy6570 116 Q--WPTGITIDYPSQRLYWADPKARTIESINLN-GKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKI 186 (713)
Q Consensus 116 ~--~p~glavd~~~~~LY~~d~~~~~I~~~~~~-g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~ 186 (713)
. .....++. +++||+.+ ..+.|+.++.+ |..+-.+..... .......+.+++||+... .+.|+.+
T Consensus 309 ~~~~~ssp~i~--g~~l~~~~-~~G~l~~~d~~tG~~~~~~~~~~~--~~~~sp~~~~~~l~v~~~-dG~l~~~ 376 (377)
T TIGR03300 309 KYRQLTAPAVV--GGYLVVGD-FEGYLHWLSREDGSFVARLKTDGS--GIASPPVVVGDGLLVQTR-DGDLYAF 376 (377)
T ss_pred cCCccccCEEE--CCEEEEEe-CCCEEEEEECCCCCEEEEEEcCCC--ccccCCEEECCEEEEEeC-CceEEEe
Confidence 1 11222343 57888876 34678888764 433322221110 111222455678888754 3455544
No 167
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=93.84 E-value=1.8 Score=45.38 Aligned_cols=116 Identities=16% Similarity=0.232 Sum_probs=61.7
Q ss_pred CeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCC
Q psy6570 38 KNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQW 117 (713)
Q Consensus 38 ~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~ 117 (713)
..||.++.. ....++.++|+....+.|.........|..+.+.++.|||...+ . +|++++++....++|....-.+
T Consensus 49 kllF~s~~d--g~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~-~-~l~~vdL~T~e~~~vy~~p~~~ 124 (386)
T PF14583_consen 49 KLLFASDFD--GNRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNG-R-SLRRVDLDTLEERVVYEVPDDW 124 (386)
T ss_dssp EEEEEE-TT--SS-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETT-T-EEEEEETTT--EEEEEE--TTE
T ss_pred EEEEEeccC--CCcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECC-C-eEEEEECCcCcEEEEEECCccc
Confidence 345556642 46779999999988888876433334466666788888765422 3 7999999988777766433222
Q ss_pred -CeeEE-EeCCCCeEEEE-c---------------------CCCCcEEEEeCCCCceeEEEecCC
Q psy6570 118 -PTGIT-IDYPSQRLYWA-D---------------------PKARTIESINLNGKDRFVVYHTED 158 (713)
Q Consensus 118 -p~gla-vd~~~~~LY~~-d---------------------~~~~~I~~~~~~g~~~~~~~~~~~ 158 (713)
..|-. ++. +.++++. . .-..+|.++++.+..++++.....
T Consensus 125 ~g~gt~v~n~-d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~~~~ 188 (386)
T PF14583_consen 125 KGYGTWVANS-DCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFEDTD 188 (386)
T ss_dssp EEEEEEEE-T-TSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEEESS
T ss_pred ccccceeeCC-CccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEecCc
Confidence 12222 343 3344432 1 012379999999888888876644
No 168
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=93.73 E-value=5.4 Score=43.02 Aligned_cols=103 Identities=9% Similarity=0.086 Sum_probs=55.7
Q ss_pred CCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 83 SGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 83 ~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
++.||++... +.++.+++.... ++....+..+..++++ +++||+.+. .++|+.++...... +..........
T Consensus 256 ~~~vy~~~~~--g~l~ald~~tG~--~~W~~~~~~~~~~~~~--~~~vy~~~~-~g~l~ald~~tG~~-~W~~~~~~~~~ 327 (394)
T PRK11138 256 GGVVYALAYN--GNLVALDLRSGQ--IVWKREYGSVNDFAVD--GGRIYLVDQ-NDRVYALDTRGGVE-LWSQSDLLHRL 327 (394)
T ss_pred CCEEEEEEcC--CeEEEEECCCCC--EEEeecCCCccCcEEE--CCEEEEEcC-CCeEEEEECCCCcE-EEcccccCCCc
Confidence 5677776643 256666654221 2333334444456665 789998874 56788888754322 11111100111
Q ss_pred ceeeeeeCCeEEEEeCCCCcEEEEcccCCCcc
Q psy6570 163 PYKLEVFEDNLYFSTYRTNNILKINKFGNSDF 194 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~ 194 (713)
..+..+.+++||+.+. .+.|+.++...+..+
T Consensus 328 ~~sp~v~~g~l~v~~~-~G~l~~ld~~tG~~~ 358 (394)
T PRK11138 328 LTAPVLYNGYLVVGDS-EGYLHWINREDGRFV 358 (394)
T ss_pred ccCCEEECCEEEEEeC-CCEEEEEECCCCCEE
Confidence 1223456788888764 456777777655443
No 169
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=93.50 E-value=0.12 Score=42.78 Aligned_cols=45 Identities=18% Similarity=0.213 Sum_probs=24.4
Q ss_pred ceeeCCC-------------CcccCCCCccccccccccccccchhHHHHHHHHHHHHHHhheee
Q psy6570 655 PICICPR-------------GYAGVRCQTLVHYISKKQSYVNSHISSILILILLLITVGGIGYY 705 (713)
Q Consensus 655 ~~C~C~~-------------Gy~G~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 705 (713)
|.|.|.+ .|.|..|+..-- +....+++++.+++|++++.++.|+|
T Consensus 33 ~~C~C~~T~~~~~~~~~ktt~W~G~aCqKkDv------S~~F~L~~~~ti~lv~~~~~~I~lL~ 90 (103)
T PF12955_consen 33 FACKCKPTVVKTGSGKGKTTHWGGPACQKKDV------SVPFWLFAGFTIALVVLVAGAIGLLF 90 (103)
T ss_pred EEEEeeccccccccccCceeeecccccccccc------cchhhHHHHHHHHHHHHHHHHHHHHH
Confidence 6788877 466777765321 22234455555555555444444433
No 170
>PRK13616 lipoprotein LpqB; Provisional
Probab=93.44 E-value=6 Score=44.84 Aligned_cols=168 Identities=9% Similarity=0.058 Sum_probs=89.1
Q ss_pred ceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCC--------CCCCeEEEEecCCceEEEEEcCCCCCcceEEE
Q psy6570 8 NVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGG--------RSSNNIMVSTLEGRKKRTLLNTGLNEPYDIAL 79 (713)
Q Consensus 8 ~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~--------~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iav 79 (713)
+|+.++.++..+.+..+. .-....+++.++.|+++..+. .....|++..+++...+. .. -..+..|++
T Consensus 380 ~Lwv~~~gg~~~~lt~g~-~~t~PsWspDG~~lw~v~dg~~~~~v~~~~~~gql~~~~vd~ge~~~--~~-~g~Issl~w 455 (591)
T PRK13616 380 SLWVGPLGGVAVQVLEGH-SLTRPSWSLDADAVWVVVDGNTVVRVIRDPATGQLARTPVDASAVAS--RV-PGPISELQL 455 (591)
T ss_pred EEEEEeCCCcceeeecCC-CCCCceECCCCCceEEEecCcceEEEeccCCCceEEEEeccCchhhh--cc-CCCcCeEEE
Confidence 555555544433332222 233456777766666553210 012344444444433221 11 135788999
Q ss_pred cCCCCcEEEEccCCCCeEEE---EecCCCCcEEE-----EeCCCCC-CeeEEEeCCCCeEEEEcC-CCCcEEEEeCCCCc
Q psy6570 80 EPLSGRMFWTELGIKPRISG---ASIDGKNKFNL-----VDNNIQW-PTGITIDYPSQRLYWADP-KARTIESINLNGKD 149 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~---~~~dG~~~~~l-----~~~~~~~-p~glavd~~~~~LY~~d~-~~~~I~~~~~~g~~ 149 (713)
.|...+|.+.-.+ +|++ ...++.. ..| +...+.. +..+++-. ++.|++... ....++++++||..
T Consensus 456 SpDG~RiA~i~~g---~v~Va~Vvr~~~G~-~~l~~~~~l~~~l~~~~~~l~W~~-~~~L~V~~~~~~~~v~~v~vDG~~ 530 (591)
T PRK13616 456 SRDGVRAAMIIGG---KVYLAVVEQTEDGQ-YALTNPREVGPGLGDTAVSLDWRT-GDSLVVGRSDPEHPVWYVNLDGSN 530 (591)
T ss_pred CCCCCEEEEEECC---EEEEEEEEeCCCCc-eeecccEEeecccCCccccceEec-CCEEEEEecCCCCceEEEecCCcc
Confidence 8877777776532 6666 3433333 333 2223433 46777774 445666543 34569999999987
Q ss_pred eeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEc
Q psy6570 150 RFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKIN 187 (713)
Q Consensus 150 ~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~ 187 (713)
.+.+..... .....+|+...+.||++|.. .++.+.
T Consensus 531 ~~~~~~~n~-~~~v~~vaa~~~~iyv~~~~--g~~~l~ 565 (591)
T PRK13616 531 SDALPSRNL-SAPVVAVAASPSTVYVTDAR--AVLQLP 565 (591)
T ss_pred ccccCCCCc-cCceEEEecCCceEEEEcCC--ceEEec
Confidence 665332211 22335566666789998754 355554
No 171
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=93.43 E-value=0.18 Score=41.81 Aligned_cols=6 Identities=33% Similarity=1.287 Sum_probs=3.0
Q ss_pred CCCCcc
Q psy6570 659 CPRGYA 664 (713)
Q Consensus 659 C~~Gy~ 664 (713)
|.+||.
T Consensus 42 C~~GY~ 47 (96)
T PTZ00382 42 CNSGFS 47 (96)
T ss_pred CcCCcc
Confidence 555553
No 172
>KOG0310|consensus
Probab=93.36 E-value=2 Score=45.40 Aligned_cols=156 Identities=10% Similarity=-0.014 Sum_probs=87.1
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNK 107 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~ 107 (713)
..++.|-.. ++|+-+-. ..+.|.+++++.+.....+...-...+-+-+-|..+.++.+-.... .+..-++++...
T Consensus 71 v~s~~fR~D-G~LlaaGD---~sG~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d~t~l~s~sDd~-v~k~~d~s~a~v 145 (487)
T KOG0310|consen 71 VYSVDFRSD-GRLLAAGD---ESGHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQDNTMLVSGSDDK-VVKYWDLSTAYV 145 (487)
T ss_pred eeEEEeecC-CeEEEccC---CcCcEEEeccccHHHHHHHhhccCceeEEEecccCCeEEEecCCCc-eEEEEEcCCcEE
Confidence 445555433 44544333 4677888886553221112122122334566677777887765444 444556665554
Q ss_pred EEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcccee--eeeeCCeEEEEeCCCCcEEE
Q psy6570 108 FNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYK--LEVFEDNLYFSTYRTNNILK 185 (713)
Q Consensus 108 ~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~--i~~~~~~ly~td~~~~~i~~ 185 (713)
+.-+...-.+-.-+++.+.++.|+++-.+.+.|...+..... ..+.+.. ...|.. +.+-.+.++.+. +++.|..
T Consensus 146 ~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~-~~v~eln--hg~pVe~vl~lpsgs~iasA-gGn~vkV 221 (487)
T KOG0310|consen 146 QAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLT-SRVVELN--HGCPVESVLALPSGSLIASA-GGNSVKV 221 (487)
T ss_pred EEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccCC-ceeEEec--CCCceeeEEEcCCCCEEEEc-CCCeEEE
Confidence 433444555678889999999999999999988887765443 2222221 223432 222344565553 5566777
Q ss_pred EcccCCC
Q psy6570 186 INKFGNS 192 (713)
Q Consensus 186 ~~~~~~~ 192 (713)
++..++.
T Consensus 222 WDl~~G~ 228 (487)
T KOG0310|consen 222 WDLTTGG 228 (487)
T ss_pred EEecCCc
Confidence 7766443
No 173
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=93.22 E-value=8.9 Score=37.40 Aligned_cols=161 Identities=13% Similarity=0.119 Sum_probs=89.3
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-----CCCcceEEEcCCCCcEEEE-ccCCCCeEE
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-----LNEPYDIALEPLSGRMFWT-ELGIKPRIS 98 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-----~~~p~~iavD~~~~~ly~t-d~~~~~~I~ 98 (713)
.....||.+-+.+++||-.. ..++|+.++........+.... ...+.++.++|.-.+|.+. +.+++ +
T Consensus 26 ge~l~GID~Rpa~G~LYgl~----~~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs~~GqN---l 98 (236)
T PF14339_consen 26 GESLVGIDFRPANGQLYGLG----STGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVSNTGQN---L 98 (236)
T ss_pred CCeEEEEEeecCCCCEEEEe----CCCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEccCCcE---E
Confidence 45678899999999999774 4689999998765554442111 2346788888888888776 44444 5
Q ss_pred EEecCCCCcEEEEeCCCC----------CC--eeEEEeC-----C-CCeEEEEcCCCCcEEEEe-CCCCceeEEEecCCC
Q psy6570 99 GASIDGKNKFNLVDNNIQ----------WP--TGITIDY-----P-SQRLYWADPKARTIESIN-LNGKDRFVVYHTEDN 159 (713)
Q Consensus 99 ~~~~dG~~~~~l~~~~~~----------~p--~glavd~-----~-~~~LY~~d~~~~~I~~~~-~~g~~~~~~~~~~~~ 159 (713)
|++.|.... +.....+. .| .+.|... . .-.||-.|.....+++.. .+-.....+......
T Consensus 99 R~npdtGav-~~~Dg~L~y~~gd~~~G~~p~v~aaAYTNs~~g~~t~TtLy~ID~~~~~Lv~Q~ppN~GtL~~vG~LGvd 177 (236)
T PF14339_consen 99 RLNPDTGAV-TIVDGNLAYAAGDMNAGTTPGVTAAAYTNSFAGATTSTTLYDIDTTLDALVTQNPPNDGTLNTVGPLGVD 177 (236)
T ss_pred EECCCCCCc-eeccCccccCCCccccCCCCceEEEEEecccCCCccceEEEEEecCCCeEEEecCCCCCcEEeeeccccc
Confidence 666653221 11121222 22 2233321 1 457898998888887773 333333333322222
Q ss_pred CccceeeeeeC----CeEEEEeC--CCCcEEEEcccCCCc
Q psy6570 160 GYKPYKLEVFE----DNLYFSTY--RTNNILKINKFGNSD 193 (713)
Q Consensus 160 ~~~p~~i~~~~----~~ly~td~--~~~~i~~~~~~~~~~ 193 (713)
.....++++.. ...-|+-. ....++++++..+..
T Consensus 178 ~~~~~gFDI~~~~~~~~~a~a~~~~~~~~LY~vdL~TG~a 217 (236)
T PF14339_consen 178 AAGDAGFDIAGDGNGGNAAYAVLGVGGSGLYTVDLTTGAA 217 (236)
T ss_pred cCcccceeeecCCCcceEEEEEecCCCcEEEEEECCCccc
Confidence 22334555542 22222221 125677777765543
No 174
>KOG0266|consensus
Probab=93.18 E-value=13 Score=40.92 Aligned_cols=116 Identities=13% Similarity=0.087 Sum_probs=72.8
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEec-CC-ceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTL-EG-RKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~-~G-~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
....++++-+.+. +.++-. ....|.+.+. +. ...+++. .-......+++.|.. .++++-.... .|..-++.
T Consensus 204 ~~v~~~~fs~d~~-~l~s~s---~D~tiriwd~~~~~~~~~~l~-gH~~~v~~~~f~p~g-~~i~Sgs~D~-tvriWd~~ 276 (456)
T KOG0266|consen 204 RGVSDVAFSPDGS-YLLSGS---DDKTLRIWDLKDDGRNLKTLK-GHSTYVTSVAFSPDG-NLLVSGSDDG-TVRIWDVR 276 (456)
T ss_pred cceeeeEECCCCc-EEEEec---CCceEEEeeccCCCeEEEEec-CCCCceEEEEecCCC-CEEEEecCCC-cEEEEecc
Confidence 3456777776544 334444 4566777666 33 4455554 334567899999876 6767655544 56666666
Q ss_pred CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 104 GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 104 G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
.......+......-+++++.++...|..+ ...+.|...++....
T Consensus 277 ~~~~~~~l~~hs~~is~~~f~~d~~~l~s~-s~d~~i~vwd~~~~~ 321 (456)
T KOG0266|consen 277 TGECVRKLKGHSDGISGLAFSPDGNLLVSA-SYDGTIRVWDLETGS 321 (456)
T ss_pred CCeEEEeeeccCCceEEEEECCCCCEEEEc-CCCccEEEEECCCCc
Confidence 544444444455567889999655555544 668888888887766
No 175
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=93.06 E-value=1.6 Score=46.32 Aligned_cols=104 Identities=13% Similarity=0.198 Sum_probs=62.5
Q ss_pred eEEEEecCCceEEEEEcC---C-CCCcceEEEc-CC-CCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeC
Q psy6570 52 NIMVSTLEGRKKRTLLNT---G-LNEPYDIALE-PL-SGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDY 125 (713)
Q Consensus 52 ~I~~~~~~G~~~~~l~~~---~-~~~p~~iavD-~~-~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~ 125 (713)
.|.+++......+++..+ + +.....+.+- +. ++.|++++.....+|+.++++|...+.|.......-.-+.+|.
T Consensus 211 ~l~~~d~~tg~~~~~~~e~~~~Wv~~~~~~~~~~~~~~~~l~~s~~~G~~hly~~~~~~~~~~~lT~G~~~V~~i~~~d~ 290 (353)
T PF00930_consen 211 DLVLCDASTGETRVVLEETSDGWVDVYDPPHFLGPDGNEFLWISERDGYRHLYLYDLDGGKPRQLTSGDWEVTSILGWDE 290 (353)
T ss_dssp EEEEEEECTTTCEEEEEEESSSSSSSSSEEEE-TTTSSEEEEEEETTSSEEEEEEETTSSEEEESS-SSS-EEEEEEEEC
T ss_pred EEEEEECCCCceeEEEEecCCcceeeecccccccCCCCEEEEEEEcCCCcEEEEEcccccceeccccCceeecccceEcC
Confidence 356667644333333322 1 3333445442 33 4445555543445899999999876655443433334688999
Q ss_pred CCCeEEEEcC----CCCcEEEEeCC-CCceeEEEe
Q psy6570 126 PSQRLYWADP----KARTIESINLN-GKDRFVVYH 155 (713)
Q Consensus 126 ~~~~LY~~d~----~~~~I~~~~~~-g~~~~~~~~ 155 (713)
.+++||++-. ....|++++++ +...+.|..
T Consensus 291 ~~~~iyf~a~~~~p~~r~lY~v~~~~~~~~~~LT~ 325 (353)
T PF00930_consen 291 DNNRIYFTANGDNPGERHLYRVSLDSGGEPKCLTC 325 (353)
T ss_dssp TSSEEEEEESSGGTTSBEEEEEETTETTEEEESST
T ss_pred CCCEEEEEecCCCCCceEEEEEEeCCCCCeEeccC
Confidence 9999999854 45679999999 776665543
No 176
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=93.02 E-value=7 Score=37.36 Aligned_cols=92 Identities=12% Similarity=0.064 Sum_probs=58.3
Q ss_pred eEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEE
Q psy6570 52 NIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLY 131 (713)
Q Consensus 52 ~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY 131 (713)
.|++++..+.....+-...-....+++..|.+..+.+.......+|...++++.....+ .-...+.|.++|.+.+|.
T Consensus 40 ~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~~i~~~---~~~~~n~i~wsP~G~~l~ 116 (194)
T PF08662_consen 40 ELFYLNEKNIPVESIELKKEGPIHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGKKIFSF---GTQPRNTISWSPDGRFLV 116 (194)
T ss_pred EEEEEecCCCccceeeccCCCceEEEEECcCCCEEEEEEccCCcccEEEcCcccEeEee---cCCCceEEEECCCCCEEE
Confidence 46666666555444432222347899999977776665433233677777765444433 234457899999988888
Q ss_pred EEcCC--CCcEEEEeCC
Q psy6570 132 WADPK--ARTIESINLN 146 (713)
Q Consensus 132 ~~d~~--~~~I~~~~~~ 146 (713)
.+..+ .+.|...+.+
T Consensus 117 ~~g~~n~~G~l~~wd~~ 133 (194)
T PF08662_consen 117 LAGFGNLNGDLEFWDVR 133 (194)
T ss_pred EEEccCCCcEEEEEECC
Confidence 88744 3567777766
No 177
>KOG0294|consensus
Probab=92.96 E-value=11 Score=37.86 Aligned_cols=133 Identities=14% Similarity=0.126 Sum_probs=73.1
Q ss_pred cCCceeEEEccCcc--cEEecCCCCCceEEEeccCC--eEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEc
Q psy6570 5 SSGNVTRVKREMNL--KTVLSNLHDPRGVAVDWVGK--NLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALE 80 (713)
Q Consensus 5 ~~~~I~~~~~~~~~--~~~~~~~~~p~gla~D~~~~--~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD 80 (713)
.+.+|+.+|+.... ..++..-.....|.|++... +|. +-. ..+.|.+.+.+.-....-+..--.+.++|+|.
T Consensus 61 sDetI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shLl-S~s---dDG~i~iw~~~~W~~~~slK~H~~~Vt~lsiH 136 (362)
T KOG0294|consen 61 SDETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHLL-SGS---DDGHIIIWRVGSWELLKSLKAHKGQVTDLSIH 136 (362)
T ss_pred CCCcEEEEeccchhhhcceeccccceEEEEecCCcchhhee-eec---CCCcEEEEEcCCeEEeeeecccccccceeEec
Confidence 56788888887644 34455566677777775432 333 333 46777777654422111122234569999999
Q ss_pred CCCCcEEEEccCCCCeEEEEecC-CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 81 PLSGRMFWTELGIKPRISGASID-GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 81 ~~~~~ly~td~~~~~~I~~~~~d-G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
| .++|-.+-.+.. .+..-++- |..-.++ .--+.+.-|.+++.+++.|+.- .++|..+.++.
T Consensus 137 P-S~KLALsVg~D~-~lr~WNLV~Gr~a~v~--~L~~~at~v~w~~~Gd~F~v~~--~~~i~i~q~d~ 198 (362)
T KOG0294|consen 137 P-SGKLALSVGGDQ-VLRTWNLVRGRVAFVL--NLKNKATLVSWSPQGDHFVVSG--RNKIDIYQLDN 198 (362)
T ss_pred C-CCceEEEEcCCc-eeeeehhhcCccceee--ccCCcceeeEEcCCCCEEEEEe--ccEEEEEeccc
Confidence 8 466666654433 44444442 3222222 1234566688887776666554 34444444443
No 178
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=92.95 E-value=0.025 Score=41.83 Aligned_cols=33 Identities=18% Similarity=0.106 Sum_probs=0.8
Q ss_pred cccccchhHHHHHHHHHHHHHHhheeeEEEEec
Q psy6570 679 QSYVNSHISSILILILLLITVGGIGYYIFRIKM 711 (713)
Q Consensus 679 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 711 (713)
+...++.+++++++++++++|++++++|+|+|.
T Consensus 8 ~~vlaavIaG~Vvgll~ailLIlf~iyR~rkkd 40 (64)
T PF01034_consen 8 SEVLAAVIAGGVVGLLFAILLILFLIYRMRKKD 40 (64)
T ss_dssp -----------------------------S---
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 344577888888888888888888888877764
No 179
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=92.88 E-value=0.48 Score=32.23 Aligned_cols=41 Identities=27% Similarity=0.429 Sum_probs=30.0
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEc
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALE 80 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD 80 (713)
.+++||+++. ..+.|.++++........+..+ ..|.+|+++
T Consensus 2 d~~~lyv~~~---~~~~v~~id~~~~~~~~~i~vg-~~P~~i~~~ 42 (42)
T TIGR02276 2 DGTKLYVTNS---GSNTVSVIDTATNKVIATIPVG-GYPFGVAVS 42 (42)
T ss_pred CCCEEEEEeC---CCCEEEEEECCCCeEEEEEECC-CCCceEEeC
Confidence 4678999998 7899999998654443334343 789999874
No 180
>KOG0289|consensus
Probab=92.86 E-value=6.2 Score=41.28 Aligned_cols=153 Identities=11% Similarity=0.020 Sum_probs=87.2
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCC-CC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDG-KN 106 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG-~~ 106 (713)
-..++|.+. +.||.+-. ..+.|.+.++.......-+.........|++. .+|+...+..... .|...++.- ..
T Consensus 350 ~ts~~fHpD-gLifgtgt---~d~~vkiwdlks~~~~a~Fpght~~vk~i~Fs-ENGY~Lat~add~-~V~lwDLRKl~n 423 (506)
T KOG0289|consen 350 YTSAAFHPD-GLIFGTGT---PDGVVKIWDLKSQTNVAKFPGHTGPVKAISFS-ENGYWLATAADDG-SVKLWDLRKLKN 423 (506)
T ss_pred eEEeeEcCC-ceEEeccC---CCceEEEEEcCCccccccCCCCCCceeEEEec-cCceEEEEEecCC-eEEEEEehhhcc
Confidence 455677654 77887776 66778888887654322222223456678887 5776655554433 344444432 13
Q ss_pred cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEE
Q psy6570 107 KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKI 186 (713)
Q Consensus 107 ~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~ 186 (713)
.+++........+.+.||..+.+|-.+ ...-+|+.+......-+.+...........++.+-+...|..+.+.++++++
T Consensus 424 ~kt~~l~~~~~v~s~~fD~SGt~L~~~-g~~l~Vy~~~k~~k~W~~~~~~~~~sg~st~v~Fg~~aq~l~s~smd~~l~~ 502 (506)
T KOG0289|consen 424 FKTIQLDEKKEVNSLSFDQSGTYLGIA-GSDLQVYICKKKTKSWTEIKELADHSGLSTGVRFGEHAQYLASTSMDAILRL 502 (506)
T ss_pred cceeeccccccceeEEEcCCCCeEEee-cceeEEEEEecccccceeeehhhhcccccceeeecccceEEeeccchhheEE
Confidence 334443344456889999777666655 3333555555444444444333222223456666677778887777777665
Q ss_pred c
Q psy6570 187 N 187 (713)
Q Consensus 187 ~ 187 (713)
-
T Consensus 503 ~ 503 (506)
T KOG0289|consen 503 Y 503 (506)
T ss_pred e
Confidence 3
No 181
>KOG0289|consensus
Probab=92.67 E-value=14 Score=38.71 Aligned_cols=117 Identities=8% Similarity=-0.014 Sum_probs=64.5
Q ss_pred cceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC-CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeE
Q psy6570 74 PYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN-NIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFV 152 (713)
Q Consensus 74 p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~-~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~ 152 (713)
..++.+.|.+.+|.|++.... .++..--+|....++... .--.-+.++|.| ++.||.+-...+.|...++.... .
T Consensus 306 V~~ls~h~tgeYllsAs~d~~-w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHp-DgLifgtgt~d~~vkiwdlks~~--~ 381 (506)
T KOG0289|consen 306 VTGLSLHPTGEYLLSASNDGT-WAFSDISSGSQLTVVSDETSDVEYTSAAFHP-DGLIFGTGTPDGVVKIWDLKSQT--N 381 (506)
T ss_pred ceeeeeccCCcEEEEecCCce-EEEEEccCCcEEEEEeeccccceeEEeeEcC-CceEEeccCCCceEEEEEcCCcc--c
Confidence 468999999999999885533 333333344444333331 111236778885 67888888878877777765433 3
Q ss_pred EEecCCCCccceeeeeeCCeE-EEEeCCCCcEEEEcccCCCcc
Q psy6570 153 VYHTEDNGYKPYKLEVFEDNL-YFSTYRTNNILKINKFGNSDF 194 (713)
Q Consensus 153 ~~~~~~~~~~p~~i~~~~~~l-y~td~~~~~i~~~~~~~~~~~ 194 (713)
+..+...-..-..|.+.++-. .++....+.|+-++.+-....
T Consensus 382 ~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwDLRKl~n~ 424 (506)
T KOG0289|consen 382 VAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWDLRKLKNF 424 (506)
T ss_pred cccCCCCCCceeEEEeccCceEEEEEecCCeEEEEEehhhccc
Confidence 333322112224566654432 233344444666665444333
No 182
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=92.59 E-value=15 Score=39.15 Aligned_cols=104 Identities=12% Similarity=0.108 Sum_probs=54.7
Q ss_pred CCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 83 SGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 83 ~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
++.||++... . .++.+++.... ++..........++++ +++||+.+ ..++|+.++.+.....--... .....
T Consensus 241 ~~~vy~~~~~-g-~l~a~d~~tG~--~~W~~~~~~~~~p~~~--~~~vyv~~-~~G~l~~~d~~tG~~~W~~~~-~~~~~ 312 (377)
T TIGR03300 241 GGQVYAVSYQ-G-RVAALDLRSGR--VLWKRDASSYQGPAVD--DNRLYVTD-ADGVVVALDRRSGSELWKNDE-LKYRQ 312 (377)
T ss_pred CCEEEEEEcC-C-EEEEEECCCCc--EEEeeccCCccCceEe--CCEEEEEC-CCCeEEEEECCCCcEEEcccc-ccCCc
Confidence 4677776543 2 56666663221 2222223334555665 78999886 457888888753322111101 00111
Q ss_pred ceeeeeeCCeEEEEeCCCCcEEEEcccCCCcce
Q psy6570 163 PYKLEVFEDNLYFSTYRTNNILKINKFGNSDFN 195 (713)
Q Consensus 163 p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~ 195 (713)
-....+.+++||+.+ ..+.|+.++...+....
T Consensus 313 ~ssp~i~g~~l~~~~-~~G~l~~~d~~tG~~~~ 344 (377)
T TIGR03300 313 LTAPAVVGGYLVVGD-FEGYLHWLSREDGSFVA 344 (377)
T ss_pred cccCEEECCEEEEEe-CCCEEEEEECCCCCEEE
Confidence 122234677888876 44667777766554433
No 183
>KOG0315|consensus
Probab=92.55 E-value=11 Score=36.69 Aligned_cols=133 Identities=12% Similarity=0.078 Sum_probs=79.6
Q ss_pred cCCceeEEEccCccc----EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEc
Q psy6570 5 SSGNVTRVKREMNLK----TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALE 80 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~----~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD 80 (713)
....|+.+|+..... ++...-++...+.|...++.+| +-+ ..+.+.+-++.......++... ...+.|++.
T Consensus 59 ~~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMy-Tgs---eDgt~kIWdlR~~~~qR~~~~~-spVn~vvlh 133 (311)
T KOG0315|consen 59 GNQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMY-TGS---EDGTVKIWDLRSLSCQRNYQHN-SPVNTVVLH 133 (311)
T ss_pred cCCeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEE-ecC---CCceEEEEeccCcccchhccCC-CCcceEEec
Confidence 345566666665331 2233336677788887766666 444 5677777777664433334332 456789999
Q ss_pred CCCCcEEEEccCCCCeEEEEecCCCC-cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 81 PLSGRMFWTELGIKPRISGASIDGKN-KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 81 ~~~~~ly~td~~~~~~I~~~~~dG~~-~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
|..+.|+..|...+ |++-++.... ...++.........|++++++.+|--+ ...++.+.-++
T Consensus 134 pnQteLis~dqsg~--irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~-nnkG~cyvW~l 196 (311)
T KOG0315|consen 134 PNQTELISGDQSGN--IRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAA-NNKGNCYVWRL 196 (311)
T ss_pred CCcceEEeecCCCc--EEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEe-cCCccEEEEEc
Confidence 99999999996644 6666654432 233444444556789999776655443 34455444443
No 184
>KOG3914|consensus
Probab=92.33 E-value=3.9 Score=42.26 Aligned_cols=159 Identities=12% Similarity=0.020 Sum_probs=98.5
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceE--EEEEc-CCCCCcceEEEcCCCCcEEEEccCCC-CeEEEEecC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKK--RTLLN-TGLNEPYDIALEPLSGRMFWTELGIK-PRISGASID 103 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~--~~l~~-~~~~~p~~iavD~~~~~ly~td~~~~-~~I~~~~~d 103 (713)
|..+++.+..+.|++++. ..++.+++.+++.. +.+.. .--.+|..|.++.....+-++|.... ..+...+.+
T Consensus 65 ~~~~~~s~~~~llAv~~~----~K~~~~f~~~~~~~~~kl~~~~~v~~~~~ai~~~~~~~sv~v~dkagD~~~~di~s~~ 140 (390)
T KOG3914|consen 65 PALVLTSDSGRLVAVATS----SKQRAVFDYRENPKGAKLLDVSCVPKRPTAISFIREDTSVLVADKAGDVYSFDILSAD 140 (390)
T ss_pred ccccccCCCceEEEEEeC----CCceEEEEEecCCCcceeeeEeecccCcceeeeeeccceEEEEeecCCceeeeeeccc
Confidence 445555556677888886 34444555544432 21111 12367888888877778888875433 133333333
Q ss_pred -CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeE-EEecCCCCccceeeeeeCCeEEEEeCCCC
Q psy6570 104 -GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFV-VYHTEDNGYKPYKLEVFEDNLYFSTYRTN 181 (713)
Q Consensus 104 -G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~-~~~~~~~~~~p~~i~~~~~~ly~td~~~~ 181 (713)
|..+..+ ..+..-..++|.++..+|..+|. ...|+...+.+..... +... .-.....|++..+++.|+..+.+
T Consensus 141 ~~~~~~~l--GhvSml~dVavS~D~~~IitaDR-DEkIRvs~ypa~f~IesfclG--H~eFVS~isl~~~~~LlS~sGD~ 215 (390)
T KOG3914|consen 141 SGRCEPIL--GHVSMLLDVAVSPDDQFIITADR-DEKIRVSRYPATFVIESFCLG--HKEFVSTISLTDNYLLLSGSGDK 215 (390)
T ss_pred ccCcchhh--hhhhhhheeeecCCCCEEEEecC-CceEEEEecCcccchhhhccc--cHhheeeeeeccCceeeecCCCC
Confidence 4444333 24555678999998888988885 4567777777754321 1111 11234567888889999999999
Q ss_pred cEEEEcccCCCcce
Q psy6570 182 NILKINKFGNSDFN 195 (713)
Q Consensus 182 ~i~~~~~~~~~~~~ 195 (713)
.|+..+...+....
T Consensus 216 tlr~Wd~~sgk~L~ 229 (390)
T KOG3914|consen 216 TLRLWDITSGKLLD 229 (390)
T ss_pred cEEEEecccCCccc
Confidence 99988877666553
No 185
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=92.32 E-value=7.4 Score=36.85 Aligned_cols=169 Identities=8% Similarity=0.023 Sum_probs=83.4
Q ss_pred cCCceeEEEccCcccEEecCCCC----CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-CCCcceEEE
Q psy6570 5 SSGNVTRVKREMNLKTVLSNLHD----PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-LNEPYDIAL 79 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~~~~~~~~----p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-~~~p~~iav 79 (713)
...+|++.++.+..+.....+.. -+||.. .++.+|..-+ ..+.-++++.+.-. .+.+.. -..-+||+-
T Consensus 66 g~S~ir~~~L~~gq~~~s~~l~~~~~FgEGit~--~gd~~y~LTw---~egvaf~~d~~t~~--~lg~~~y~GeGWgLt~ 138 (262)
T COG3823 66 GFSKIRVSDLTTGQEIFSEKLAPDTVFGEGITK--LGDYFYQLTW---KEGVAFKYDADTLE--ELGRFSYEGEGWGLTS 138 (262)
T ss_pred ccceeEEEeccCceEEEEeecCCccccccceee--ccceEEEEEe---ccceeEEEChHHhh--hhcccccCCcceeeec
Confidence 35677788888654443333331 255543 4577777777 55666666665422 221111 234567777
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCC-C-CcEEEEe---CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEE
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDG-K-NKFNLVD---NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVY 154 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG-~-~~~~l~~---~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~ 154 (713)
|. ..|.-+| ++. .+...++.. . ..++.+. ..+..-+-|..- .+.||---+.+.+|.|++++...+....
T Consensus 139 d~--~~Limsd-Gsa-tL~frdP~tfa~~~~v~VT~~g~pv~~LNELE~V--dG~lyANVw~t~~I~rI~p~sGrV~~wi 212 (262)
T COG3823 139 DD--KNLIMSD-GSA-TLQFRDPKTFAELDTVQVTDDGVPVSKLNELEWV--DGELYANVWQTTRIARIDPDSGRVVAWI 212 (262)
T ss_pred CC--cceEeeC-Cce-EEEecCHHHhhhcceEEEEECCeecccccceeee--ccEEEEeeeeecceEEEcCCCCcEEEEE
Confidence 63 3354444 333 233222211 0 0111111 112222333332 5778776778888888888765544433
Q ss_pred ecCC----------CCccceeeeee--CCeEEEEeCCCCcEEEE
Q psy6570 155 HTED----------NGYKPYKLEVF--EDNLYFSTYRTNNILKI 186 (713)
Q Consensus 155 ~~~~----------~~~~p~~i~~~--~~~ly~td~~~~~i~~~ 186 (713)
.... ...-+.||+++ .+++|+|-..=..++.+
T Consensus 213 dlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~wp~lfEV 256 (262)
T COG3823 213 DLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKLWPLLFEV 256 (262)
T ss_pred EccCCchhcCccccccccccceeecCcCCeEEEecCcCceeEEE
Confidence 2211 12234566665 56888875433344444
No 186
>KOG0286|consensus
Probab=92.24 E-value=13 Score=36.97 Aligned_cols=156 Identities=13% Similarity=0.120 Sum_probs=92.8
Q ss_pred eEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCC-CCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeE
Q psy6570 52 NIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGI-KPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRL 130 (713)
Q Consensus 52 ~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~-~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~L 130 (713)
....-+.........+..-.....+|++-|.+++.|++..-. ..+||-+. +|.-++++.. .-.-.+.+.+-| ++.-
T Consensus 167 TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R-~~~c~qtF~g-hesDINsv~ffP-~G~a 243 (343)
T KOG0286|consen 167 TCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVR-SGQCVQTFEG-HESDINSVRFFP-SGDA 243 (343)
T ss_pred eEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccccceeeeecc-CcceeEeecc-cccccceEEEcc-CCCe
Confidence 333444433333333333345677888888889999986432 23555443 3445555542 333456788874 5677
Q ss_pred EEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccEEE
Q psy6570 131 YWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDVLI 209 (713)
Q Consensus 131 Y~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v 209 (713)
|.+-+.....+.+|+.......+++.......-.++++. .++|.++-+....+...+.-.+..+.+|..+-++++.|.+
T Consensus 244 fatGSDD~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLfagy~d~~c~vWDtlk~e~vg~L~GHeNRvScl~~ 323 (343)
T KOG0286|consen 244 FATGSDDATCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLFAGYDDFTCNVWDTLKGERVGVLAGHENRVSCLGV 323 (343)
T ss_pred eeecCCCceeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEEeeecCCceeEeeccccceEEEeeccCCeeEEEEE
Confidence 778777888888887665444444443322233556654 5677777677777777776666666667666555555544
Q ss_pred E
Q psy6570 210 L 210 (713)
Q Consensus 210 ~ 210 (713)
.
T Consensus 324 s 324 (343)
T KOG0286|consen 324 S 324 (343)
T ss_pred C
Confidence 3
No 187
>KOG0315|consensus
Probab=92.24 E-value=12 Score=36.43 Aligned_cols=158 Identities=10% Similarity=-0.054 Sum_probs=85.1
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCc-eEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR-KKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~-~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
.-...|.+.+....|+..|. .+.|.+-|+... -...++.+.......|+|+|...+|--+... ++.++-+|-+
T Consensus 125 spVn~vvlhpnQteLis~dq----sg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnk--G~cyvW~l~~ 198 (311)
T KOG0315|consen 125 SPVNTVVLHPNQTELISGDQ----SGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNK--GNCYVWRLLN 198 (311)
T ss_pred CCcceEEecCCcceEEeecC----CCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCC--ccEEEEEccC
Confidence 33467888888899999986 688999998765 3556676766778889999855444333322 2444444332
Q ss_pred -CCcEEEEe-----CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC-ceeEEEecCCCCccceeeeeeCCeEEEEe
Q psy6570 105 -KNKFNLVD-----NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK-DRFVVYHTEDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 105 -~~~~~l~~-----~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~-~~~~~~~~~~~~~~p~~i~~~~~~ly~td 177 (713)
.....|.. ..-.+..-.-++|+.+.| .+-+....+...+.++. ..++.+....+....-.+..++.+| +|.
T Consensus 199 ~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~l-at~ssdktv~iwn~~~~~kle~~l~gh~rWvWdc~FS~dg~Yl-vTa 276 (311)
T KOG0315|consen 199 HQTASELEPVHKFQAHNGHILRCLLSPDVKYL-ATCSSDKTVKIWNTDDFFKLELVLTGHQRWVWDCAFSADGEYL-VTA 276 (311)
T ss_pred CCccccceEhhheecccceEEEEEECCCCcEE-EeecCCceEEEEecCCceeeEEEeecCCceEEeeeeccCccEE-Eec
Confidence 22222221 112223334566655444 34444556666777776 3333344433333333444444444 343
Q ss_pred CCCCcEEEEcccCC
Q psy6570 178 YRTNNILKINKFGN 191 (713)
Q Consensus 178 ~~~~~i~~~~~~~~ 191 (713)
...+.+...+...+
T Consensus 277 ssd~~~rlW~~~~~ 290 (311)
T KOG0315|consen 277 SSDHTARLWDLSAG 290 (311)
T ss_pred CCCCceeecccccC
Confidence 34444444444333
No 188
>KOG0266|consensus
Probab=92.02 E-value=14 Score=40.68 Aligned_cols=133 Identities=17% Similarity=0.136 Sum_probs=77.9
Q ss_pred CCceeEEEc-cC--cccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCC-ceEEEEEcCCCCCcceEEEcC
Q psy6570 6 SGNVTRVKR-EM--NLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEG-RKKRTLLNTGLNEPYDIALEP 81 (713)
Q Consensus 6 ~~~I~~~~~-~~--~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G-~~~~~l~~~~~~~p~~iavD~ 81 (713)
+..|...+. +. ..+++..-.....+++|.+.+ +++++-. ..+.|.+.+..+ +..+++. .......++++.+
T Consensus 224 D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g-~~i~Sgs---~D~tvriWd~~~~~~~~~l~-~hs~~is~~~f~~ 298 (456)
T KOG0266|consen 224 DKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDG-NLLVSGS---DDGTVRIWDVRTGECVRKLK-GHSDGISGLAFSP 298 (456)
T ss_pred CceEEEeeccCCCeEEEEecCCCCceEEEEecCCC-CEEEEec---CCCcEEEEeccCCeEEEeee-ccCCceEEEEECC
Confidence 445555555 22 224444444556889999987 6666665 578888888876 4444443 3345778899986
Q ss_pred CCCcEEEEccCCCCeEEEEecCCCCcE--EEEeCCCC--CCeeEEEeCCCCeEEEEcCCCCcEEEEeCC
Q psy6570 82 LSGRMFWTELGIKPRISGASIDGKNKF--NLVDNNIQ--WPTGITIDYPSQRLYWADPKARTIESINLN 146 (713)
Q Consensus 82 ~~~~ly~td~~~~~~I~~~~~dG~~~~--~l~~~~~~--~p~glavd~~~~~LY~~d~~~~~I~~~~~~ 146 (713)
++.++++..... .|..-++.....+ ..+..... .-.-+.+++....|+.+- ..+.+...++.
T Consensus 299 -d~~~l~s~s~d~-~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~-~d~~~~~w~l~ 364 (456)
T KOG0266|consen 299 -DGNLLVSASYDG-TIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSAS-LDRTLKLWDLR 364 (456)
T ss_pred -CCCEEEEcCCCc-cEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEec-CCCeEEEEEcc
Confidence 566666664434 7888888776632 22221211 226677776555555443 33444444444
No 189
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=92.00 E-value=20 Score=38.92 Aligned_cols=156 Identities=13% Similarity=0.200 Sum_probs=84.1
Q ss_pred EEEeccCCeEEEeecCCC-C------CCeEEEEecCCceE--EEEEcCCCCCcc---eEEEcCCCCcEEEEccC-CC-Ce
Q psy6570 31 VAVDWVGKNLYWTDAGGR-S------SNNIMVSTLEGRKK--RTLLNTGLNEPY---DIALEPLSGRMFWTELG-IK-PR 96 (713)
Q Consensus 31 la~D~~~~~ly~td~~~~-~------~~~I~~~~~~G~~~--~~l~~~~~~~p~---~iavD~~~~~ly~td~~-~~-~~ 96 (713)
+++-..+..||++..... . ..+|++..+..... +.|+... ..+. ++.+.+.+++|+++... .. ..
T Consensus 175 ~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt~~~~d~lvfe~~-~~~~~~~~~~~s~d~~~l~i~~~~~~~~s~ 253 (414)
T PF02897_consen 175 VSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGTPQSEDELVFEEP-DEPFWFVSVSRSKDGRYLFISSSSGTSESE 253 (414)
T ss_dssp EEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS-GGG-EEEEC-T-TCTTSEEEEEE-TTSSEEEEEEESSSSEEE
T ss_pred EEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCCChHhCeeEEeec-CCCcEEEEEEecCcccEEEEEEEccccCCe
Confidence 566555557777765431 2 45688888876543 3455332 3343 77788777888875433 22 47
Q ss_pred EEEEecCCC----CcEEEEeCCCCCCeeEEEeCCCCeEEE-Ec--CCCCcEEEEeCCCCce---e-EEEecCCCCcccee
Q psy6570 97 ISGASIDGK----NKFNLVDNNIQWPTGITIDYPSQRLYW-AD--PKARTIESINLNGKDR---F-VVYHTEDNGYKPYK 165 (713)
Q Consensus 97 I~~~~~dG~----~~~~l~~~~~~~p~glavd~~~~~LY~-~d--~~~~~I~~~~~~g~~~---~-~~~~~~~~~~~p~~ 165 (713)
|+.++++.. ....++......-.. .++..++.+|+ ++ ....+|.+++++.... . ++..... ...-.+
T Consensus 254 v~~~d~~~~~~~~~~~~~l~~~~~~~~~-~v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~-~~~l~~ 331 (414)
T PF02897_consen 254 VYLLDLDDGGSPDAKPKLLSPREDGVEY-YVDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDE-DVSLED 331 (414)
T ss_dssp EEEEECCCTTTSS-SEEEEEESSSS-EE-EEEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SS-SEEEEE
T ss_pred EEEEeccccCCCcCCcEEEeCCCCceEE-EEEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCC-ceeEEE
Confidence 888888864 233333323222222 23333777776 44 4567899999877542 3 4443322 224567
Q ss_pred eeeeCCeEEEEeCC--CCcEEEEccc
Q psy6570 166 LEVFEDNLYFSTYR--TNNILKINKF 189 (713)
Q Consensus 166 i~~~~~~ly~td~~--~~~i~~~~~~ 189 (713)
+.+++++|++.... ..+|..++..
T Consensus 332 ~~~~~~~Lvl~~~~~~~~~l~v~~~~ 357 (414)
T PF02897_consen 332 VSLFKDYLVLSYRENGSSRLRVYDLD 357 (414)
T ss_dssp EEEETTEEEEEEEETTEEEEEEEETT
T ss_pred EEEECCEEEEEEEECCccEEEEEECC
Confidence 77889998887543 3345555544
No 190
>KOG3512|consensus
Probab=92.00 E-value=0.35 Score=50.48 Aligned_cols=24 Identities=29% Similarity=0.684 Sum_probs=16.1
Q ss_pred eeecCCCCCCeeecCCCcccCCcc
Q psy6570 320 TCEFDDDFDPHCICQENFYGTYCE 343 (713)
Q Consensus 320 ~C~~~~~~~~~C~C~~g~~G~~C~ 343 (713)
.|+.+..+...|.|..+-+|..|+
T Consensus 286 ~Cv~d~~~~ltCdC~HNTaGPdCg 309 (592)
T KOG3512|consen 286 RCVMDESSHLTCDCEHNTAGPDCG 309 (592)
T ss_pred eeeeccCCceEEecccCCCCCCcc
Confidence 566655555677777777777665
No 191
>PTZ00420 coronin; Provisional
Probab=91.77 E-value=27 Score=39.44 Aligned_cols=157 Identities=7% Similarity=-0.026 Sum_probs=79.3
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~ 105 (713)
.....|++.+....++.+-. ..+.|.+.++........+.. ...+..|++++. |.++.+..... .|...++...
T Consensus 126 ~~V~sVaf~P~g~~iLaSgS---~DgtIrIWDl~tg~~~~~i~~-~~~V~Slswspd-G~lLat~s~D~-~IrIwD~Rsg 199 (568)
T PTZ00420 126 KKISIIDWNPMNYYIMCSSG---FDSFVNIWDIENEKRAFQINM-PKKLSSLKWNIK-GNLLSGTCVGK-HMHIIDPRKQ 199 (568)
T ss_pred CcEEEEEECCCCCeEEEEEe---CCCeEEEEECCCCcEEEEEec-CCcEEEEEECCC-CCEEEEEecCC-EEEEEECCCC
Confidence 34677888887666666655 467888888875443322222 245778999974 55555543333 6777777644
Q ss_pred CcEEEEeCCCCCCee--EE---EeCCCCeEEEEcCCC---CcEEEEeCCC--CceeEEEecCCCCcccee--eeeeCCeE
Q psy6570 106 NKFNLVDNNIQWPTG--IT---IDYPSQRLYWADPKA---RTIESINLNG--KDRFVVYHTEDNGYKPYK--LEVFEDNL 173 (713)
Q Consensus 106 ~~~~l~~~~~~~p~g--la---vd~~~~~LY~~d~~~---~~I~~~~~~g--~~~~~~~~~~~~~~~p~~--i~~~~~~l 173 (713)
....-+......-.. +. +.+..++|..+-... +.|...++.. ..+..+ .... ...++- .+.+.+.+
T Consensus 200 ~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~~~-~ld~-~~~~L~p~~D~~tg~l 277 (568)
T PTZ00420 200 EIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTTSALVTM-SIDN-ASAPLIPHYDESTGLI 277 (568)
T ss_pred cEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCCCceEEE-EecC-CccceEEeeeCCCCCE
Confidence 332222211111111 11 123445555543222 3466666553 222222 1110 111111 11224678
Q ss_pred EEEeCCCCcEEEEcccC
Q psy6570 174 YFSTYRTNNILKINKFG 190 (713)
Q Consensus 174 y~td~~~~~i~~~~~~~ 190 (713)
|++-.+.+.|+.++...
T Consensus 278 ~lsGkGD~tIr~~e~~~ 294 (568)
T PTZ00420 278 YLIGKGDGNCRYYQHSL 294 (568)
T ss_pred EEEEECCCeEEEEEccC
Confidence 88877777777776543
No 192
>PLN00181 protein SPA1-RELATED; Provisional
Probab=91.74 E-value=23 Score=42.14 Aligned_cols=156 Identities=6% Similarity=-0.076 Sum_probs=82.5
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCc-----eE--EEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEE
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR-----KK--RTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRIS 98 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~-----~~--~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~ 98 (713)
....+|+|++.++.| .+-. ..+.|.+.++... .. ..+.........+++.++..+.++.+-.... .|.
T Consensus 484 ~~V~~i~fs~dg~~l-atgg---~D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~Dg-~v~ 558 (793)
T PLN00181 484 NLVCAIGFDRDGEFF-ATAG---VNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEG-VVQ 558 (793)
T ss_pred CcEEEEEECCCCCEE-EEEe---CCCEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeCCC-eEE
Confidence 346778999765544 4444 4677777775431 10 0111111234567777765454444443333 666
Q ss_pred EEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeee--CCeEEEE
Q psy6570 99 GASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF--EDNLYFS 176 (713)
Q Consensus 99 ~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~--~~~ly~t 176 (713)
..++........+.........|++++.++.++++-...+.|...++........+.. ......+.+. .+.++++
T Consensus 559 lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~---~~~v~~v~~~~~~g~~lat 635 (793)
T PLN00181 559 VWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKT---KANICCVQFPSESGRSLAF 635 (793)
T ss_pred EEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEEEec---CCCeEEEEEeCCCCCEEEE
Confidence 6676644333333333445678999876677777766677777777754332222211 1122334332 3445555
Q ss_pred eCCCCcEEEEccc
Q psy6570 177 TYRTNNILKINKF 189 (713)
Q Consensus 177 d~~~~~i~~~~~~ 189 (713)
-...+.|+.++..
T Consensus 636 gs~dg~I~iwD~~ 648 (793)
T PLN00181 636 GSADHKVYYYDLR 648 (793)
T ss_pred EeCCCeEEEEECC
Confidence 5556666666654
No 193
>KOG2139|consensus
Probab=91.71 E-value=13 Score=38.07 Aligned_cols=116 Identities=13% Similarity=0.088 Sum_probs=74.2
Q ss_pred CcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc---------EEEEeCC-------------CCCCeeEEEeCCCCeE
Q psy6570 73 EPYDIALEPLSGRMFWTELGIKPRISGASIDGKNK---------FNLVDNN-------------IQWPTGITIDYPSQRL 130 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~---------~~l~~~~-------------~~~p~glavd~~~~~L 130 (713)
+...-.-+|..++|.++-.+.. +|++...+++.. ++++..+ -..+.-||.||.+.+|
T Consensus 282 rvqtacWspcGsfLLf~~sgsp-~lysl~f~~~~~~~~~~~~~k~~lliaDL~e~ti~ag~~l~cgeaq~lawDpsGeyL 360 (445)
T KOG2139|consen 282 RVQTACWSPCGSFLLFACSGSP-RLYSLTFDGEDSVFLRPQSIKRVLLIADLQEVTICAGQRLCCGEAQCLAWDPSGEYL 360 (445)
T ss_pred ceeeeeecCCCCEEEEEEcCCc-eEEEEeecCCCccccCcccceeeeeeccchhhhhhcCcccccCccceeeECCCCCEE
Confidence 6667788998999999887766 999988876432 2222111 2246779999999999
Q ss_pred EEEcCCCCcE-------EEEeCCCCceeEE-EecCCCCccceeeeee----CCeEEEEeCCCCcEEEEccc
Q psy6570 131 YWADPKARTI-------ESINLNGKDRFVV-YHTEDNGYKPYKLEVF----EDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 131 Y~~d~~~~~I-------~~~~~~g~~~~~~-~~~~~~~~~p~~i~~~----~~~ly~td~~~~~i~~~~~~ 189 (713)
-++-.+..+| .+++...+....+ .-...+..+|..|.+. ++.|....|.+++|.++..+
T Consensus 361 av~fKg~~~v~~~k~~i~~fdtr~sp~vels~cg~i~ge~P~~IsF~pl~n~g~lLsiaWsTGriq~ypl~ 431 (445)
T KOG2139|consen 361 AVIFKGQSFVLLCKLHISRFDTRKSPPVELSYCGMIGGEYPAYISFGPLKNEGRLLSIAWSTGRIQRYPLT 431 (445)
T ss_pred EEEEcCCchhhhhhhhhhhhcccccCceEEEecccccCCCCceEEeeecccCCcEEEEEeccCceEeeeeE
Confidence 9987666643 3333332222222 2222234557555542 66788888999998888653
No 194
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=91.64 E-value=12 Score=36.42 Aligned_cols=97 Identities=19% Similarity=0.266 Sum_probs=56.3
Q ss_pred CCCeEEEEecCCce--EEEEEc------CCCCCcceEEE--cCCCCcEEEEccCCCCeEEEEec----CCCCcEEEEe-C
Q psy6570 49 SSNNIMVSTLEGRK--KRTLLN------TGLNEPYDIAL--EPLSGRMFWTELGIKPRISGASI----DGKNKFNLVD-N 113 (713)
Q Consensus 49 ~~~~I~~~~~~G~~--~~~l~~------~~~~~p~~iav--D~~~~~ly~td~~~~~~I~~~~~----dG~~~~~l~~-~ 113 (713)
.+++|..+..|+.. .+.+.. ..+..|.|+++ ++.++.+|+--.+..+.|....+ +|..+..++. -
T Consensus 122 ~~~~i~~y~Idp~~~~L~sitD~n~p~ss~~s~~YGl~lyrs~ktgd~yvfV~~~qG~~~Qy~l~d~gnGkv~~k~vR~f 201 (364)
T COG4247 122 QNDKIVFYKIDPNPQYLESITDSNAPYSSSSSSAYGLALYRSPKTGDYYVFVNRRQGDIAQYKLIDQGNGKVGTKLVRQF 201 (364)
T ss_pred cCCeEEEEEeCCCccceeeccCCCCccccCcccceeeEEEecCCcCcEEEEEecCCCceeEEEEEecCCceEcceeeEee
Confidence 56777776666543 333332 23566777776 45556555543333335655544 2333333332 1
Q ss_pred CC-CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC
Q psy6570 114 NI-QWPTGITIDYPSQRLYWADPKARTIESINLN 146 (713)
Q Consensus 114 ~~-~~p~glavd~~~~~LY~~d~~~~~I~~~~~~ 146 (713)
.+ .+..|+..|...+.||+++.. -.||++..+
T Consensus 202 k~~tQTEG~VaDdEtG~LYIaeEd-vaiWK~~Ae 234 (364)
T COG4247 202 KIPTQTEGMVADDETGFLYIAEED-VAIWKYEAE 234 (364)
T ss_pred ecCCcccceeeccccceEEEeecc-ceeeecccC
Confidence 11 245799999999999999854 458887754
No 195
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=91.54 E-value=0.12 Score=33.96 Aligned_cols=19 Identities=37% Similarity=0.822 Sum_probs=15.0
Q ss_pred cEEeecCCCceeeCCCCCc
Q psy6570 564 GTCVLIEGKPSCKCLPPYS 582 (713)
Q Consensus 564 ~~C~~~~g~~~C~C~~G~~ 582 (713)
..|++.+++|+|.|++||.
T Consensus 10 h~C~~~~g~~~C~C~~Gy~ 28 (36)
T PF14670_consen 10 HICVNTPGSYRCSCPPGYK 28 (36)
T ss_dssp SEEEEETTSEEEE-STTEE
T ss_pred CCCccCCCceEeECCCCCE
Confidence 4788888888888888886
No 196
>KOG0273|consensus
Probab=91.54 E-value=13 Score=39.46 Aligned_cols=76 Identities=14% Similarity=0.209 Sum_probs=50.3
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccC-CCCeEEEEec
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELG-IKPRISGASI 102 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~-~~~~I~~~~~ 102 (713)
.+....+|.|||+++.-|++-. ....|.++.++++....-+..-.+...+|..||. +.|.-+-+. ..-+||.+.-
T Consensus 315 ~~~s~~~lDVdW~~~~~F~ts~---td~~i~V~kv~~~~P~~t~~GH~g~V~alk~n~t-g~LLaS~SdD~TlkiWs~~~ 390 (524)
T KOG0273|consen 315 EFHSAPALDVDWQSNDEFATSS---TDGCIHVCKVGEDRPVKTFIGHHGEVNALKWNPT-GSLLASCSDDGTLKIWSMGQ 390 (524)
T ss_pred eeccCCccceEEecCceEeecC---CCceEEEEEecCCCcceeeecccCceEEEEECCC-CceEEEecCCCeeEeeecCC
Confidence 4555568999999999998877 6788888888876543333234477889999975 455444322 2226666443
Q ss_pred C
Q psy6570 103 D 103 (713)
Q Consensus 103 d 103 (713)
+
T Consensus 391 ~ 391 (524)
T KOG0273|consen 391 S 391 (524)
T ss_pred C
Confidence 3
No 197
>KOG0318|consensus
Probab=91.44 E-value=11 Score=40.59 Aligned_cols=103 Identities=17% Similarity=0.091 Sum_probs=64.0
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEE-cCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEec
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLL-NTGLNEPYDIALEPLSGRMFWTELGIKPRISGASI 102 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~-~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~ 102 (713)
..+....|++.+.+..||=.+. .+.|..-+........+. ....++..+|+.. ..+.||-..+.. .|.++++
T Consensus 319 HnK~ITaLtv~~d~~~i~Sgsy----DG~I~~W~~~~g~~~~~~g~~h~nqI~~~~~~-~~~~~~t~g~Dd--~l~~~~~ 391 (603)
T KOG0318|consen 319 HNKSITALTVSPDGKTIYSGSY----DGHINSWDSGSGTSDRLAGKGHTNQIKGMAAS-ESGELFTIGWDD--TLRVISL 391 (603)
T ss_pred cccceeEEEEcCCCCEEEeecc----CceEEEEecCCccccccccccccceEEEEeec-CCCcEEEEecCC--eEEEEec
Confidence 3445677888877777774443 677777665433322222 2334678899987 457787777654 4777766
Q ss_pred CCCC--cEEEEeCCCCCCeeEEEeCCCCeEEEEc
Q psy6570 103 DGKN--KFNLVDNNIQWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 103 dG~~--~~~l~~~~~~~p~glavd~~~~~LY~~d 134 (713)
.+.. ...++. --.+|.+||+..+++.+.++-
T Consensus 392 ~~~~~t~~~~~~-lg~QP~~lav~~d~~~avv~~ 424 (603)
T KOG0318|consen 392 KDNGYTKSEVVK-LGSQPKGLAVLSDGGTAVVAC 424 (603)
T ss_pred ccCcccccceee-cCCCceeEEEcCCCCEEEEEe
Confidence 4432 222222 345799999997776776665
No 198
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=91.38 E-value=0.79 Score=31.11 Aligned_cols=42 Identities=14% Similarity=0.277 Sum_probs=29.0
Q ss_pred CCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEe
Q psy6570 81 PLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITID 124 (713)
Q Consensus 81 ~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd 124 (713)
|.+++||+++++.+ .|..++........-+.. ...|.+|+|+
T Consensus 1 pd~~~lyv~~~~~~-~v~~id~~~~~~~~~i~v-g~~P~~i~~~ 42 (42)
T TIGR02276 1 PDGTKLYVTNSGSN-TVSVIDTATNKVIATIPV-GGYPFGVAVS 42 (42)
T ss_pred CCCCEEEEEeCCCC-EEEEEECCCCeEEEEEEC-CCCCceEEeC
Confidence 34678999998877 888888754433333333 4678998875
No 199
>PRK13616 lipoprotein LpqB; Provisional
Probab=91.28 E-value=16 Score=41.49 Aligned_cols=158 Identities=9% Similarity=0.007 Sum_probs=85.8
Q ss_pred CCCCceEEEeccCCeEEEeecC----CCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccC--------
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAG----GRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELG-------- 92 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~----~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~-------- 92 (713)
+..+..+++.+.++.+.++... .....+|++.+.+|.. +.++... .-..-.++|..+.|++...+
T Consensus 349 ~~~vsspaiSpdG~~vA~v~~~~~~~~d~~s~Lwv~~~gg~~-~~lt~g~--~~t~PsWspDG~~lw~v~dg~~~~~v~~ 425 (591)
T PRK13616 349 MGNITSAALSRSGRQVAAVVTLGRGAPDPASSLWVGPLGGVA-VQVLEGH--SLTRPSWSLDADAVWVVVDGNTVVRVIR 425 (591)
T ss_pred ccCcccceECCCCCEEEEEEeecCCCCCcceEEEEEeCCCcc-eeeecCC--CCCCceECCCCCceEEEecCcceEEEec
Confidence 3466788888888777666521 0124578888876655 3343321 23344566654555544222
Q ss_pred --CCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEE---EeCCCCceeE-----EEecCCCCcc
Q psy6570 93 --IKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIES---INLNGKDRFV-----VYHTEDNGYK 162 (713)
Q Consensus 93 --~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~---~~~~g~~~~~-----~~~~~~~~~~ 162 (713)
..+.|++..+++...+. .--..+..|.+.+++.+|.+.-. ++|+. ...++..+++ +.. .+..
T Consensus 426 ~~~~gql~~~~vd~ge~~~---~~~g~Issl~wSpDG~RiA~i~~--g~v~Va~Vvr~~~G~~~l~~~~~l~~---~l~~ 497 (591)
T PRK13616 426 DPATGQLARTPVDASAVAS---RVPGPISELQLSRDGVRAAMIIG--GKVYLAVVEQTEDGQYALTNPREVGP---GLGD 497 (591)
T ss_pred cCCCceEEEEeccCchhhh---ccCCCcCeEEECCCCCEEEEEEC--CEEEEEEEEeCCCCceeecccEEeec---ccCC
Confidence 12355555555543322 11134788999999998887653 46766 4444443333 211 1222
Q ss_pred -ceeeeee-CCeEEEEeCC-CCcEEEEcccCCCc
Q psy6570 163 -PYKLEVF-EDNLYFSTYR-TNNILKINKFGNSD 193 (713)
Q Consensus 163 -p~~i~~~-~~~ly~td~~-~~~i~~~~~~~~~~ 193 (713)
+..++.. ++.|++.... ...|++++.+|...
T Consensus 498 ~~~~l~W~~~~~L~V~~~~~~~~v~~v~vDG~~~ 531 (591)
T PRK13616 498 TAVSLDWRTGDSLVVGRSDPEHPVWYVNLDGSNS 531 (591)
T ss_pred ccccceEecCCEEEEEecCCCCceEEEecCCccc
Confidence 3556544 5567765433 34478888776543
No 200
>KOG0291|consensus
Probab=90.85 E-value=26 Score=39.63 Aligned_cols=125 Identities=10% Similarity=0.048 Sum_probs=63.1
Q ss_pred CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC-CCc
Q psy6570 71 LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLN-GKD 149 (713)
Q Consensus 71 ~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~-g~~ 149 (713)
+.....|++.| +|.+..|..... +|.+-+.....-.+-+......-+++.+. ..++..++-.-.++|+..|+. ..+
T Consensus 350 ~~~i~~l~YSp-Dgq~iaTG~eDg-KVKvWn~~SgfC~vTFteHts~Vt~v~f~-~~g~~llssSLDGtVRAwDlkRYrN 426 (893)
T KOG0291|consen 350 SDRITSLAYSP-DGQLIATGAEDG-KVKVWNTQSGFCFVTFTEHTSGVTAVQFT-ARGNVLLSSSLDGTVRAWDLKRYRN 426 (893)
T ss_pred ccceeeEEECC-CCcEEEeccCCC-cEEEEeccCceEEEEeccCCCceEEEEEE-ecCCEEEEeecCCeEEeeeecccce
Confidence 44556677765 455555554333 56655555444444444444555777777 344555555555666666653 233
Q ss_pred eeEEEecCCCCccceeeeee--CCeEEEEeCCCCcEEEEcccCCCcceeeecc
Q psy6570 150 RFVVYHTEDNGYKPYKLEVF--EDNLYFSTYRTNNILKINKFGNSDFNVLANN 200 (713)
Q Consensus 150 ~~~~~~~~~~~~~p~~i~~~--~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~ 200 (713)
-+++.... ..+-.-|+++ ++-+...+...-.|+.++...+....++..+
T Consensus 427 fRTft~P~--p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGH 477 (893)
T KOG0291|consen 427 FRTFTSPE--PIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGH 477 (893)
T ss_pred eeeecCCC--ceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCC
Confidence 33333221 1111234454 4444444444445666666666555555544
No 201
>PHA02887 EGF-like protein; Provisional
Probab=90.65 E-value=0.27 Score=41.06 Aligned_cols=33 Identities=39% Similarity=1.013 Sum_probs=25.1
Q ss_pred CCCCCCCeecCCCCccCCCCCceeeCCCCcccCCCCcc
Q psy6570 634 HFCFNGGTCREQNYSLDPDLKPICICPRGYAGVRCQTL 671 (713)
Q Consensus 634 ~~C~~~~~C~~~~~~~~~~~~~~C~C~~Gy~G~~C~~~ 671 (713)
+.|.+ |+|... .......|.|+.||+|.+|+..
T Consensus 92 ~YCiH-G~C~yI----~dL~epsCrC~~GYtG~RCE~v 124 (126)
T PHA02887 92 DFCIN-GECMNI----IDLDEKFCICNKGYTGIRCDEV 124 (126)
T ss_pred CEeeC-CEEEcc----ccCCCceeECCCCcccCCCCcc
Confidence 56774 799862 3344589999999999999864
No 202
>KOG0268|consensus
Probab=90.49 E-value=1.8 Score=43.96 Aligned_cols=143 Identities=12% Similarity=0.108 Sum_probs=87.1
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce-EEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK-KRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~-~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
......+.+.+..-.|..+-. ....|..+|+.... .+.++. . .+++.|...| +.+.|++..... .|+..+|.
T Consensus 187 ~Dti~svkfNpvETsILas~~---sDrsIvLyD~R~~~Pl~KVi~-~-mRTN~IswnP-eafnF~~a~ED~-nlY~~DmR 259 (433)
T KOG0268|consen 187 ADSISSVKFNPVETSILASCA---SDRSIVLYDLRQASPLKKVIL-T-MRTNTICWNP-EAFNFVAANEDH-NLYTYDMR 259 (433)
T ss_pred CCceeEEecCCCcchheeeec---cCCceEEEecccCCccceeee-e-ccccceecCc-cccceeeccccc-cceehhhh
Confidence 334456666666555555443 46678888875433 333332 2 5799999999 889999876655 78888775
Q ss_pred CCCcEEEE-eCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEE
Q psy6570 104 GKNKFNLV-DNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFS 176 (713)
Q Consensus 104 G~~~~~l~-~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~t 176 (713)
--.+-.-+ ......-..+.++| .+.=|++-+....|+.+..+.+.-+.+.... ++.+.+++.+.-+.-|+.
T Consensus 260 ~l~~p~~v~~dhvsAV~dVdfsp-tG~EfvsgsyDksIRIf~~~~~~SRdiYhtk-RMq~V~~Vk~S~Dskyi~ 331 (433)
T KOG0268|consen 260 NLSRPLNVHKDHVSAVMDVDFSP-TGQEFVSGSYDKSIRIFPVNHGHSRDIYHTK-RMQHVFCVKYSMDSKYII 331 (433)
T ss_pred hhcccchhhcccceeEEEeccCC-CcchhccccccceEEEeecCCCcchhhhhHh-hhheeeEEEEeccccEEE
Confidence 43322222 11222334455554 5677888888888999988765544443221 356777777764444443
No 203
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=90.28 E-value=16 Score=39.62 Aligned_cols=143 Identities=15% Similarity=0.194 Sum_probs=79.9
Q ss_pred CCceeEEEccCcc---cEEecCCCCCc---eEEEeccCCeEEEeecCCCCC-CeEEEEecCCc-----eEEEEEcCCCCC
Q psy6570 6 SGNVTRVKREMNL---KTVLSNLHDPR---GVAVDWVGKNLYWTDAGGRSS-NNIMVSTLEGR-----KKRTLLNTGLNE 73 (713)
Q Consensus 6 ~~~I~~~~~~~~~---~~~~~~~~~p~---gla~D~~~~~ly~td~~~~~~-~~I~~~~~~G~-----~~~~l~~~~~~~ 73 (713)
..+|++..+++.. ++|......+. ++.+...++.|++.-... .. ..|++.+++.. ..+.+.. ....
T Consensus 201 ~~~v~~~~~gt~~~~d~lvfe~~~~~~~~~~~~~s~d~~~l~i~~~~~-~~~s~v~~~d~~~~~~~~~~~~~l~~-~~~~ 278 (414)
T PF02897_consen 201 PRQVYRHKLGTPQSEDELVFEEPDEPFWFVSVSRSKDGRYLFISSSSG-TSESEVYLLDLDDGGSPDAKPKLLSP-REDG 278 (414)
T ss_dssp CEEEEEEETTS-GGG-EEEEC-TTCTTSEEEEEE-TTSSEEEEEEESS-SSEEEEEEEECCCTTTSS-SEEEEEE-SSSS
T ss_pred CcEEEEEECCCChHhCeeEEeecCCCcEEEEEEecCcccEEEEEEEcc-ccCCeEEEEeccccCCCcCCcEEEeC-CCCc
Confidence 4567777887643 25556656665 667777777777755432 23 67889998764 3333332 2222
Q ss_pred cceEEEcCCCCcEEE-EccC-CCCeEEEEecCCCCc---E-EEEeCC-CCCCeeEEEeCCCCeEEEEcC--CCCcEEEEe
Q psy6570 74 PYDIALEPLSGRMFW-TELG-IKPRISGASIDGKNK---F-NLVDNN-IQWPTGITIDYPSQRLYWADP--KARTIESIN 144 (713)
Q Consensus 74 p~~iavD~~~~~ly~-td~~-~~~~I~~~~~dG~~~---~-~l~~~~-~~~p~glavd~~~~~LY~~d~--~~~~I~~~~ 144 (713)
... .++..++.||+ |+.+ .+.+|.+++++.... . +++... -....++.+. .++|++... ...+|..++
T Consensus 279 ~~~-~v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~~~~l~~~~~~--~~~Lvl~~~~~~~~~l~v~~ 355 (414)
T PF02897_consen 279 VEY-YVDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDEDVSLEDVSLF--KDYLVLSYRENGSSRLRVYD 355 (414)
T ss_dssp -EE-EEEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SSSEEEEEEEEE--TTEEEEEEEETTEEEEEEEE
T ss_pred eEE-EEEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCCceeEEEEEEE--CCEEEEEEEECCccEEEEEE
Confidence 222 23334666665 5543 346899999887663 3 444322 1234556665 778888764 455789999
Q ss_pred CC-CCceeEE
Q psy6570 145 LN-GKDRFVV 153 (713)
Q Consensus 145 ~~-g~~~~~~ 153 (713)
++ +.....+
T Consensus 356 ~~~~~~~~~~ 365 (414)
T PF02897_consen 356 LDDGKESREI 365 (414)
T ss_dssp TT-TEEEEEE
T ss_pred CCCCcEEeee
Confidence 98 5444443
No 204
>KOG4441|consensus
Probab=90.14 E-value=15 Score=41.58 Aligned_cols=165 Identities=12% Similarity=0.093 Sum_probs=98.7
Q ss_pred ceeEEEccCcccEEecCCCCC---ceEEEeccCCeEEEeecCC---CCCCeEEEEecCCceEEEEEcCCCCCcc-eEEEc
Q psy6570 8 NVTRVKREMNLKTVLSNLHDP---RGVAVDWVGKNLYWTDAGG---RSSNNIMVSTLEGRKKRTLLNTGLNEPY-DIALE 80 (713)
Q Consensus 8 ~I~~~~~~~~~~~~~~~~~~p---~gla~D~~~~~ly~td~~~---~~~~~I~~~~~~G~~~~~l~~~~~~~p~-~iavD 80 (713)
.+..+++..+.......+..| .++++ .++.||++-... .....+.++|+....=.. ...+..++ ++++-
T Consensus 302 ~ve~yd~~~~~w~~~a~m~~~r~~~~~~~--~~~~lYv~GG~~~~~~~l~~ve~YD~~~~~W~~--~a~M~~~R~~~~v~ 377 (571)
T KOG4441|consen 302 SVECYDPKTNEWSSLAPMPSPRCRVGVAV--LNGKLYVVGGYDSGSDRLSSVERYDPRTNQWTP--VAPMNTKRSDFGVA 377 (571)
T ss_pred eeEEecCCcCcEeecCCCCcccccccEEE--ECCEEEEEccccCCCcccceEEEecCCCCceec--cCCccCccccceeE
Confidence 344455555433333333333 44555 578999985422 133668888887654222 23455555 23333
Q ss_pred CCCCcEEEEccCC----CCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC------CCCcEEEEeCCCCce
Q psy6570 81 PLSGRMFWTELGI----KPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP------KARTIESINLNGKDR 150 (713)
Q Consensus 81 ~~~~~ly~td~~~----~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~------~~~~I~~~~~~g~~~ 150 (713)
..+|.||+..-.. ...|++.+.....=..+..... .-.+.++-..+++||++-. ....++++|.....-
T Consensus 378 ~l~g~iYavGG~dg~~~l~svE~YDp~~~~W~~va~m~~-~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W 456 (571)
T KOG4441|consen 378 VLDGKLYAVGGFDGEKSLNSVECYDPVTNKWTPVAPMLT-RRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNTW 456 (571)
T ss_pred EECCEEEEEeccccccccccEEEecCCCCcccccCCCCc-ceeeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCce
Confidence 4789999975221 2368888887655444443222 2244455455899999963 335788888887766
Q ss_pred eEEEecCCCCccceeeeeeCCeEEEEeC
Q psy6570 151 FVVYHTEDNGYKPYKLEVFEDNLYFSTY 178 (713)
Q Consensus 151 ~~~~~~~~~~~~p~~i~~~~~~ly~td~ 178 (713)
+.+..... ....+++++.+++||+.-.
T Consensus 457 ~~~~~M~~-~R~~~g~a~~~~~iYvvGG 483 (571)
T KOG4441|consen 457 TLIAPMNT-RRSGFGVAVLNGKIYVVGG 483 (571)
T ss_pred eecCCccc-ccccceEEEECCEEEEECC
Confidence 66655433 3455788999999999853
No 205
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=90.11 E-value=0.33 Score=34.06 Aligned_cols=29 Identities=41% Similarity=1.240 Sum_probs=21.7
Q ss_pred eeecCCCCCeecCCCCccCCCCCCccccCCCCCCcccCCCCccC
Q psy6570 462 ECSITDSGPKCMCSPGYSGKKCDTCTCLNGDSGPKCMCSPGYSG 505 (713)
Q Consensus 462 ~C~~~~~~~~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~G~~g 505 (713)
.|....| +|.|.++|+|+.|+. |++||+|
T Consensus 12 ~C~~~~G--~C~C~~~~~G~~C~~-------------C~~g~~g 40 (46)
T smart00180 12 TCDPDTG--QCECKPNVTGRRCDR-------------CAPGYYG 40 (46)
T ss_pred cccCCCC--EEECCCCCCCCCCCc-------------CCCCcCC
Confidence 3444333 899999999999885 7788887
No 206
>KOG0319|consensus
Probab=90.09 E-value=3.6 Score=45.78 Aligned_cols=169 Identities=9% Similarity=-0.017 Sum_probs=95.3
Q ss_pred EEEeccCCeEEEeecCCCCCCeEEEEecCCceEE--EEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE
Q psy6570 31 VAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKR--TLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKF 108 (713)
Q Consensus 31 la~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~--~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~ 108 (713)
++++..+.+||-+-. ++|...+.....+. ...........+++|+|.+..||.+-.+.- |...+++.....
T Consensus 25 ~~~s~nG~~L~t~~~-----d~Vi~idv~t~~~~l~s~~~ed~d~ita~~l~~d~~~L~~a~rs~l--lrv~~L~tgk~i 97 (775)
T KOG0319|consen 25 VAWSSNGQHLYTACG-----DRVIIIDVATGSIALPSGSNEDEDEITALALTPDEEVLVTASRSQL--LRVWSLPTGKLI 97 (775)
T ss_pred eeECCCCCEEEEecC-----ceEEEEEccCCceecccCCccchhhhheeeecCCccEEEEeeccce--EEEEEcccchHh
Confidence 899988888885543 56666666544432 122233566789999998888888765533 444455433222
Q ss_pred EEEeC-CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccce-eeeeeCCeEE---EEeCCCCcE
Q psy6570 109 NLVDN-NIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPY-KLEVFEDNLY---FSTYRTNNI 183 (713)
Q Consensus 109 ~l~~~-~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~-~i~~~~~~ly---~td~~~~~i 183 (713)
..... .-.....+++|+.. .|.-+-...++|.+-++.+...+.-+.. .+.+. .+.+....++ .+....+.|
T Consensus 98 rswKa~He~Pvi~ma~~~~g-~LlAtggaD~~v~VWdi~~~~~th~fkG---~gGvVssl~F~~~~~~~lL~sg~~D~~v 173 (775)
T KOG0319|consen 98 RSWKAIHEAPVITMAFDPTG-TLLATGGADGRVKVWDIKNGYCTHSFKG---HGGVVSSLLFHPHWNRWLLASGATDGTV 173 (775)
T ss_pred HhHhhccCCCeEEEEEcCCC-ceEEeccccceEEEEEeeCCEEEEEecC---CCceEEEEEeCCccchhheeecCCCceE
Confidence 22211 22233578999765 4444444567788888887766555544 23333 3444455444 444556667
Q ss_pred EEEcccCCCc-ceeeeccccccccEEEE
Q psy6570 184 LKINKFGNSD-FNVLANNLNRASDVLIL 210 (713)
Q Consensus 184 ~~~~~~~~~~-~~~~~~~~~~~~~i~v~ 210 (713)
+..+...... ..++....+..++|.+.
T Consensus 174 ~vwnl~~~~tcl~~~~~H~S~vtsL~~~ 201 (775)
T KOG0319|consen 174 RVWNLNDKRTCLHTMILHKSAVTSLAFS 201 (775)
T ss_pred EEEEcccCchHHHHHHhhhhheeeeeec
Confidence 7666553333 33334444445554443
No 207
>PHA02713 hypothetical protein; Provisional
Probab=90.06 E-value=11 Score=42.77 Aligned_cols=165 Identities=11% Similarity=0.104 Sum_probs=87.3
Q ss_pred CceeEEEccCcccEEecCCCCCc-eEEEeccCCeEEEeecCC--CCCCeEEEEecCCceEEEEEcCCCCCcc---eEEEc
Q psy6570 7 GNVTRVKREMNLKTVLSNLHDPR-GVAVDWVGKNLYWTDAGG--RSSNNIMVSTLEGRKKRTLLNTGLNEPY---DIALE 80 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~~p~-gla~D~~~~~ly~td~~~--~~~~~I~~~~~~G~~~~~l~~~~~~~p~---~iavD 80 (713)
..+.++++..+.-..+..+..|+ ..++-..+++||+.-... .....+.++++....=..+ ..+..|. ++++
T Consensus 320 ~~v~~Yd~~~n~W~~~~~m~~~R~~~~~~~~~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~~--~~mp~~r~~~~~~~- 396 (557)
T PHA02713 320 NKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKML--PDMPIALSSYGMCV- 396 (557)
T ss_pred ceEEEEECCCCeEeeCCCCcchhhceeEEEECCEEEEECCcCCCCCCceEEEEECCCCeEEEC--CCCCcccccccEEE-
Confidence 45667777665433344555553 222223568999875421 1134588888765432222 2333333 2333
Q ss_pred CCCCcEEEEccCC----------------------CCeEEEEecCCCCcEEEEeCCC-CCCeeEEEeCCCCeEEEEcCC-
Q psy6570 81 PLSGRMFWTELGI----------------------KPRISGASIDGKNKFNLVDNNI-QWPTGITIDYPSQRLYWADPK- 136 (713)
Q Consensus 81 ~~~~~ly~td~~~----------------------~~~I~~~~~dG~~~~~l~~~~~-~~p~glavd~~~~~LY~~d~~- 136 (713)
.+++||+..... ...+++.++....=+.+..... ....++++- +++||+.-..
T Consensus 397 -~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~~v~~m~~~r~~~~~~~~--~~~IYv~GG~~ 473 (557)
T PHA02713 397 -LDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSH--KDDIYVVCDIK 473 (557)
T ss_pred -ECCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCCeEeecCCCCcccccCcEEEE--CCEEEEEeCCC
Confidence 478999974221 1246666665432222221111 112244443 6899998532
Q ss_pred -----CCcEEEEeCCC-CceeEEEecCCCCccceeeeeeCCeEEEEeC
Q psy6570 137 -----ARTIESINLNG-KDRFVVYHTEDNGYKPYKLEVFEDNLYFSTY 178 (713)
Q Consensus 137 -----~~~I~~~~~~g-~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~ 178 (713)
...|++++++. ..-+.+..... ...-.++++.+++||++-.
T Consensus 474 ~~~~~~~~ve~Ydp~~~~~W~~~~~m~~-~r~~~~~~~~~~~iyv~Gg 520 (557)
T PHA02713 474 DEKNVKTCIFRYNTNTYNGWELITTTES-RLSALHTILHDNTIMMLHC 520 (557)
T ss_pred CCCccceeEEEecCCCCCCeeEccccCc-ccccceeEEECCEEEEEee
Confidence 13478888876 44444443322 2344788888999999853
No 208
>KOG1539|consensus
Probab=90.06 E-value=8.9 Score=43.51 Aligned_cols=135 Identities=16% Similarity=0.136 Sum_probs=75.8
Q ss_pred cCCceeEEEccCcccEE-e----cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEE
Q psy6570 5 SSGNVTRVKREMNLKTV-L----SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIAL 79 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~~-~----~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iav 79 (713)
..|.|-++++......- . ..-....|||+|-. +++.++-. ..+.+...+.+++....-++.+ ..+..|+.
T Consensus 468 S~G~Id~fNmQSGi~r~sf~~~~ah~~~V~gla~D~~-n~~~vsa~---~~Gilkfw~f~~k~l~~~l~l~-~~~~~iv~ 542 (910)
T KOG1539|consen 468 SKGTIDRFNMQSGIHRKSFGDSPAHKGEVTGLAVDGT-NRLLVSAG---ADGILKFWDFKKKVLKKSLRLG-SSITGIVY 542 (910)
T ss_pred cCCeEEEEEcccCeeecccccCccccCceeEEEecCC-CceEEEcc---CcceEEEEecCCcceeeeeccC-CCcceeee
Confidence 47888899888654221 1 23345689999965 55556655 5677777777776533222222 23344444
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
....+ |+...-..- .|..+++-.......+....++.+.+++++++++|..+.. ...|+..|+-.
T Consensus 543 hr~s~-l~a~~~ddf-~I~vvD~~t~kvvR~f~gh~nritd~~FS~DgrWlisasm-D~tIr~wDlpt 607 (910)
T KOG1539|consen 543 HRVSD-LLAIALDDF-SIRVVDVVTRKVVREFWGHGNRITDMTFSPDGRWLISASM-DSTIRTWDLPT 607 (910)
T ss_pred eehhh-hhhhhcCce-eEEEEEchhhhhhHHhhccccceeeeEeCCCCcEEEEeec-CCcEEEEeccC
Confidence 32222 222221212 5666655433222223334556789999999999988874 34566666543
No 209
>smart00284 OLF Olfactomedin-like domains.
Probab=89.72 E-value=9.8 Score=37.77 Aligned_cols=124 Identities=15% Similarity=0.131 Sum_probs=67.3
Q ss_pred CCceeEEEccCcccEEecCCCC---------------CceEEEeccCCe--EEEeecCCCCCCeEEEEecCCceEEE--E
Q psy6570 6 SGNVTRVKREMNLKTVLSNLHD---------------PRGVAVDWVGKN--LYWTDAGGRSSNNIMVSTLEGRKKRT--L 66 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~~---------------p~gla~D~~~~~--ly~td~~~~~~~~I~~~~~~G~~~~~--l 66 (713)
+..|.|+++.+........|.. =-.||+|.. +. ||-+.. ..+.|.+..||-....+ .
T Consensus 93 s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~-GLWvIYat~~---~~g~ivvSkLnp~tL~ve~t 168 (255)
T smart00284 93 SHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDEN-GLWVIYATEQ---NAGKIVISKLNPATLTIENT 168 (255)
T ss_pred CccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEcCC-ceEEEEeccC---CCCCEEEEeeCcccceEEEE
Confidence 5678899988765432222211 146888843 32 333343 56778877777544333 3
Q ss_pred EcCCCCCcc---eEEEcCCCCcEEEEccCC--CCeEE-EEecCCCCcEEE---EeCCCCCCeeEEEeCCCCeEEEEcCC
Q psy6570 67 LNTGLNEPY---DIALEPLSGRMFWTELGI--KPRIS-GASIDGKNKFNL---VDNNIQWPTGITIDYPSQRLYWADPK 136 (713)
Q Consensus 67 ~~~~~~~p~---~iavD~~~~~ly~td~~~--~~~I~-~~~~dG~~~~~l---~~~~~~~p~glavd~~~~~LY~~d~~ 136 (713)
......++. +.. .=|.||++++.. ..+|. ..++.......+ +......-..|..+|.+++||.=|-+
T Consensus 169 W~T~~~k~sa~naFm---vCGvLY~~~s~~~~~~~I~yayDt~t~~~~~~~i~f~n~y~~~s~l~YNP~d~~LY~wdng 244 (255)
T smart00284 169 WITTYNKRSASNAFM---ICGILYVTRSLGSKGEKVFYAYDTNTGKEGHLDIPFENMYEYISMLDYNPNDRKLYAWNNG 244 (255)
T ss_pred EEcCCCcccccccEE---EeeEEEEEccCCCCCcEEEEEEECCCCccceeeeeeccccccceeceeCCCCCeEEEEeCC
Confidence 333333332 233 358999998632 22444 345544332221 12233344568888999999987743
No 210
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=89.51 E-value=21 Score=35.53 Aligned_cols=139 Identities=14% Similarity=0.095 Sum_probs=73.8
Q ss_pred cCCeEEEeecCCCCCCeEEEEec------CCceEEE-EEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTL------EGRKKRT-LLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKF 108 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~------~G~~~~~-l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~ 108 (713)
..+++||++.. ..+.|..+.. .++..+. .+... -.-.|.+| .+|.||+--.+.. .|.|.++......
T Consensus 29 ~~~~iy~~~~~--~~~~v~ey~~~~~f~~~~~~~~~~~Lp~~-~~GtG~vV--YngslYY~~~~s~-~IvkydL~t~~v~ 102 (250)
T PF02191_consen 29 DSEKIYVTSGF--SGNTVYEYRNYEDFLRNGRSSRTYKLPYP-WQGTGHVV--YNGSLYYNKYNSR-NIVKYDLTTRSVV 102 (250)
T ss_pred CCCCEEEECcc--CCCEEEEEcCHhHHhhcCCCceEEEEece-eccCCeEE--ECCcEEEEecCCc-eEEEEECcCCcEE
Confidence 46789999874 2345554432 2222222 22211 22234555 5799999887666 9999999876655
Q ss_pred -EEEe--CCCC----------CCeeEEEeCCCCeEEEE-cCCCCcEEEEeCCCCceeEEEecCCCCccceeee--eeCCe
Q psy6570 109 -NLVD--NNIQ----------WPTGITIDYPSQRLYWA-DPKARTIESINLNGKDRFVVYHTEDNGYKPYKLE--VFEDN 172 (713)
Q Consensus 109 -~l~~--~~~~----------~p~glavd~~~~~LY~~-d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~--~~~~~ 172 (713)
.... .... .=-.||+|..+=++..+ ....+.|....+|-....+...-......+..-. +.=+-
T Consensus 103 ~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~~tL~v~~tw~T~~~k~~~~naFmvCGv 182 (250)
T PF02191_consen 103 ARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDPETLSVEQTWNTSYPKRSAGNAFMVCGV 182 (250)
T ss_pred EEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCcccCceEEEEEeccCchhhcceeeEeeE
Confidence 2221 1111 11468888433333333 4444567666666554444433222233332222 23678
Q ss_pred EEEEeCCC
Q psy6570 173 LYFSTYRT 180 (713)
Q Consensus 173 ly~td~~~ 180 (713)
||.++...
T Consensus 183 LY~~~s~~ 190 (250)
T PF02191_consen 183 LYATDSYD 190 (250)
T ss_pred EEEEEECC
Confidence 99998664
No 211
>KOG2110|consensus
Probab=89.48 E-value=28 Score=35.90 Aligned_cols=144 Identities=13% Similarity=0.111 Sum_probs=73.8
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCce-EEEEEcCCCCCcce-EEEcCCCC--cEEEEccCCCCeEEEEecCCCCcEEEE
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRK-KRTLLNTGLNEPYD-IALEPLSG--RMFWTELGIKPRISGASIDGKNKFNLV 111 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~-~~~l~~~~~~~p~~-iavD~~~~--~ly~td~~~~~~I~~~~~dG~~~~~l~ 111 (713)
.+++|.|.-. ..|++++++.=. ..+|. ..-..|.| +|+.+... +|-+-+..+.+.|..++...-.....+
T Consensus 96 Nr~RLvV~Le-----e~IyIydI~~MklLhTI~-t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I 169 (391)
T KOG2110|consen 96 NRKRLVVCLE-----ESIYIYDIKDMKLLHTIE-TTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTI 169 (391)
T ss_pred ccceEEEEEc-----ccEEEEecccceeehhhh-ccCCCccceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEE
Confidence 3445555443 348888876521 22222 11135664 45555555 444434444568888888765555555
Q ss_pred eCCCCCCeeEEEeCCCCeEEEEcCCCCcE-EEEeC-CCCceeEEEecCCCCccceeeeeeCCeEEE-EeCCCCcEEEEc
Q psy6570 112 DNNIQWPTGITIDYPSQRLYWADPKARTI-ESINL-NGKDRFVVYHTEDNGYKPYKLEVFEDNLYF-STYRTNNILKIN 187 (713)
Q Consensus 112 ~~~~~~p~glavd~~~~~LY~~d~~~~~I-~~~~~-~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~-td~~~~~i~~~~ 187 (713)
...-..-..|||+++ +.|.-+-+..++| +++.. +|.....+..... ...-+.|++..+.-|. +...+..|..+.
T Consensus 170 ~aH~~~lAalafs~~-G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~-~~~IySL~Fs~ds~~L~~sS~TeTVHiFK 246 (391)
T KOG2110|consen 170 NAHKGPLAALAFSPD-GTLLATASEKGTVIRVFSVPEGQKLYEFRRGTY-PVSIYSLSFSPDSQFLAASSNTETVHIFK 246 (391)
T ss_pred EecCCceeEEEECCC-CCEEEEeccCceEEEEEEcCCccEeeeeeCCce-eeEEEEEEECCCCCeEEEecCCCeEEEEE
Confidence 544445578999965 5555555566664 44554 4444444433211 2223556665433233 334455554443
No 212
>PF15102 TMEM154: TMEM154 protein family
Probab=89.48 E-value=0.37 Score=42.56 Aligned_cols=14 Identities=29% Similarity=0.646 Sum_probs=5.5
Q ss_pred chhHHHHHHHHHHH
Q psy6570 684 SHISSILILILLLI 697 (713)
Q Consensus 684 ~~~~~~~~~~~~~~ 697 (713)
.++++.++++|||+
T Consensus 60 mIlIP~VLLvlLLl 73 (146)
T PF15102_consen 60 MILIPLVLLVLLLL 73 (146)
T ss_pred EEeHHHHHHHHHHH
Confidence 34444333333333
No 213
>KOG4441|consensus
Probab=89.47 E-value=7.4 Score=44.07 Aligned_cols=168 Identities=13% Similarity=0.097 Sum_probs=101.6
Q ss_pred CCceeEEEccCcccEEecCCCCCc-eEEEeccCCeEEEeecCC--CCCCeEEEEecCCceEEEEEcCCCCCcc-eEEEcC
Q psy6570 6 SGNVTRVKREMNLKTVLSNLHDPR-GVAVDWVGKNLYWTDAGG--RSSNNIMVSTLEGRKKRTLLNTGLNEPY-DIALEP 81 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~~~~p~-gla~D~~~~~ly~td~~~--~~~~~I~~~~~~G~~~~~l~~~~~~~p~-~iavD~ 81 (713)
...+.++++..+.-+.+..+..++ ++++-...+.||+.-... .....|.++++....=..+. .+..++ +.++-.
T Consensus 348 l~~ve~YD~~~~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg~~~l~svE~YDp~~~~W~~va--~m~~~r~~~gv~~ 425 (571)
T KOG4441|consen 348 LSSVERYDPRTNQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDGEKSLNSVECYDPVTNKWTPVA--PMLTRRSGHGVAV 425 (571)
T ss_pred cceEEEecCCCCceeccCCccCccccceeEEECCEEEEEeccccccccccEEEecCCCCcccccC--CCCcceeeeEEEE
Confidence 456778888876544455555553 344555789999875421 23456888888775433322 222222 344444
Q ss_pred CCCcEEEEccCCC-----CeEEEEecCCCCcEEEEeCCC-CCCeeEEEeCCCCeEEEEcC-----CCCcEEEEeCCCCce
Q psy6570 82 LSGRMFWTELGIK-----PRISGASIDGKNKFNLVDNNI-QWPTGITIDYPSQRLYWADP-----KARTIESINLNGKDR 150 (713)
Q Consensus 82 ~~~~ly~td~~~~-----~~I~~~~~dG~~~~~l~~~~~-~~p~glavd~~~~~LY~~d~-----~~~~I~~~~~~g~~~ 150 (713)
.+|+||.+.-... ..+++.++....=+.+..... ..-.|+++- +++||+.-. ...+|+++++....-
T Consensus 426 ~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g~a~~--~~~iYvvGG~~~~~~~~~VE~ydp~~~~W 503 (571)
T KOG4441|consen 426 LGGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFGVAVL--NGKIYVVGGFDGTSALSSVERYDPETNQW 503 (571)
T ss_pred ECCEEEEEcCcCCCccccceEEEEcCCCCceeecCCcccccccceEEEE--CCEEEEECCccCCCccceEEEEcCCCCce
Confidence 7899999863211 367777776544333332211 122345554 899999853 234588999888777
Q ss_pred eEEEecCCCCccceeeeeeCCeEEEEeC
Q psy6570 151 FVVYHTEDNGYKPYKLEVFEDNLYFSTY 178 (713)
Q Consensus 151 ~~~~~~~~~~~~p~~i~~~~~~ly~td~ 178 (713)
+.+.... ......++++.++.||++-.
T Consensus 504 ~~v~~m~-~~rs~~g~~~~~~~ly~vGG 530 (571)
T KOG4441|consen 504 TMVAPMT-SPRSAVGVVVLGGKLYAVGG 530 (571)
T ss_pred eEcccCc-cccccccEEEECCEEEEEec
Confidence 7764332 24566788999999999853
No 214
>KOG2106|consensus
Probab=89.37 E-value=26 Score=37.66 Aligned_cols=156 Identities=16% Similarity=0.186 Sum_probs=88.0
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccC-CCCeEEEEecCCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELG-IKPRISGASIDGKN 106 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~-~~~~I~~~~~dG~~ 106 (713)
|+-|+= ....||+.-+ .+.|+.-++......++.-.+ ..-.+||..|. ..+|.|-.. .+-+||+ +-+.
T Consensus 332 iRtv~e--~~~di~vGTt----rN~iL~Gt~~~~f~~~v~gh~-delwgla~hps-~~q~~T~gqdk~v~lW~---~~k~ 400 (626)
T KOG2106|consen 332 IRTVAE--GKGDILVGTT----RNFILQGTLENGFTLTVQGHG-DELWGLATHPS-KNQLLTCGQDKHVRLWN---DHKL 400 (626)
T ss_pred eeEEec--CCCcEEEeec----cceEEEeeecCCceEEEEecc-cceeeEEcCCC-hhheeeccCcceEEEcc---CCce
Confidence 444443 3344777654 577887777765544433232 57889999985 455665432 2224444 2221
Q ss_pred cEE-EEe-----CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCe--EEEEeC
Q psy6570 107 KFN-LVD-----NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDN--LYFSTY 178 (713)
Q Consensus 107 ~~~-l~~-----~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~--ly~td~ 178 (713)
.-+ ++. .+++...-||+-..+++.++-|..+..+..+..++.....+.-.. ...-+++.-.++. ||..+.
T Consensus 401 ~wt~~~~d~~~~~~fhpsg~va~Gt~~G~w~V~d~e~~~lv~~~~d~~~ls~v~ysp--~G~~lAvgs~d~~iyiy~Vs~ 478 (626)
T KOG2106|consen 401 EWTKIIEDPAECADFHPSGVVAVGTATGRWFVLDTETQDLVTIHTDNEQLSVVRYSP--DGAFLAVGSHDNHIYIYRVSA 478 (626)
T ss_pred eEEEEecCceeEeeccCcceEEEeeccceEEEEecccceeEEEEecCCceEEEEEcC--CCCEEEEecCCCeEEEEEECC
Confidence 111 111 123333356777788899999988887778888855555443221 1122344444554 566777
Q ss_pred CCCcEEEEcccCCCccee
Q psy6570 179 RTNNILKINKFGNSDFNV 196 (713)
Q Consensus 179 ~~~~i~~~~~~~~~~~~~ 196 (713)
.+....++.+-.+..++-
T Consensus 479 ~g~~y~r~~k~~gs~ith 496 (626)
T KOG2106|consen 479 NGRKYSRVGKCSGSPITH 496 (626)
T ss_pred CCcEEEEeeeecCceeEE
Confidence 777777887766644433
No 215
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=89.18 E-value=3.1 Score=45.28 Aligned_cols=114 Identities=17% Similarity=0.149 Sum_probs=65.1
Q ss_pred ecCCCCCceEEEeccCCeEEEeecCCC-------------CCCeEEEEecCCc-------eEEEEEcC------------
Q psy6570 22 LSNLHDPRGVAVDWVGKNLYWTDAGGR-------------SSNNIMVSTLEGR-------KKRTLLNT------------ 69 (713)
Q Consensus 22 ~~~~~~p~gla~D~~~~~ly~td~~~~-------------~~~~I~~~~~~G~-------~~~~l~~~------------ 69 (713)
...+.+|++|++.+..+++|++...+. ..++|++....+. .-.+++..
T Consensus 413 AT~mdRpE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~ 492 (616)
T COG3211 413 ATPMDRPEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASA 492 (616)
T ss_pred CccccCccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCcccccccccc
Confidence 456789999999999999999887542 1245777666543 11122211
Q ss_pred -----CCCCcceEEEcCCCCcEEEEccCCC---CeEEEEe----cCCCCc--EEEEeCCC-CCCeeEEEeCCCCeEEEEc
Q psy6570 70 -----GLNEPYDIALEPLSGRMFWTELGIK---PRISGAS----IDGKNK--FNLVDNNI-QWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 70 -----~~~~p~~iavD~~~~~ly~td~~~~---~~I~~~~----~dG~~~--~~l~~~~~-~~p~glavd~~~~~LY~~d 134 (713)
.+..|.+|++|+..+++..||.... .++.-+. .++... +.++...+ ---.|+++.++.+.||+.-
T Consensus 493 ~~~~~~f~~PDnl~fD~~GrLWi~TDg~~s~~~~~~~G~~~m~~~~p~~g~~~rf~t~P~g~E~tG~~FspD~~TlFV~v 572 (616)
T COG3211 493 NINANWFNSPDNLAFDPWGRLWIQTDGSGSTLRNRFRGVTQMLTPDPKTGTIKRFLTGPIGCEFTGPCFSPDGKTLFVNV 572 (616)
T ss_pred CcccccccCCCceEECCCCCEEEEecCCCCccCcccccccccccCCCccceeeeeccCCCcceeecceeCCCCceEEEEe
Confidence 1445999999986554555564321 1111111 122111 11111111 2346889998888898875
Q ss_pred C
Q psy6570 135 P 135 (713)
Q Consensus 135 ~ 135 (713)
.
T Consensus 573 Q 573 (616)
T COG3211 573 Q 573 (616)
T ss_pred c
Confidence 3
No 216
>KOG0650|consensus
Probab=89.03 E-value=2.6 Score=45.75 Aligned_cols=124 Identities=11% Similarity=0.158 Sum_probs=76.0
Q ss_pred EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEE
Q psy6570 20 TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISG 99 (713)
Q Consensus 20 ~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~ 99 (713)
+++...+....|.+...++.|-.+... ..+..|++.+|.-.....-++..-..|..+.+.|..-+||++... .|..
T Consensus 516 ~~I~~~k~i~~vtWHrkGDYlatV~~~-~~~~~VliHQLSK~~sQ~PF~kskG~vq~v~FHPs~p~lfVaTq~---~vRi 591 (733)
T KOG0650|consen 516 IVIKHPKSIRQVTWHRKGDYLATVMPD-SGNKSVLIHQLSKRKSQSPFRKSKGLVQRVKFHPSKPYLFVATQR---SVRI 591 (733)
T ss_pred EEEecCCccceeeeecCCceEEEeccC-CCcceEEEEecccccccCchhhcCCceeEEEecCCCceEEEEecc---ceEE
Confidence 345555556666665554444333221 134556666665433332222233568889999999999998865 4555
Q ss_pred EecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 100 ASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 100 ~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
.++.-+....-+.++..+...|+|++.++.|++.. ..+++-+++++-+
T Consensus 592 YdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs-~d~k~~WfDldls 639 (733)
T KOG0650|consen 592 YDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGS-YDKKMCWFDLDLS 639 (733)
T ss_pred EehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEec-CCCeeEEEEcccC
Confidence 55543332222235678889999999888888765 4567888888765
No 217
>KOG0276|consensus
Probab=88.99 E-value=25 Score=38.75 Aligned_cols=138 Identities=14% Similarity=0.087 Sum_probs=78.9
Q ss_pred CcccCCceeEEEccCcccEE-ecCCCCC-ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEE
Q psy6570 2 ASISSGNVTRVKREMNLKTV-LSNLHDP-RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIAL 79 (713)
Q Consensus 2 ad~~~~~I~~~~~~~~~~~~-~~~~~~p-~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iav 79 (713)
+...+|++...+-+++..+- +..-..| +.-.|-. ....+++-+ ...+|.+++.+.-.+...+..-....+.|||
T Consensus 30 a~LynG~V~IWnyetqtmVksfeV~~~PvRa~kfia-RknWiv~Gs---DD~~IrVfnynt~ekV~~FeAH~DyIR~iav 105 (794)
T KOG0276|consen 30 AALYNGDVQIWNYETQTMVKSFEVSEVPVRAAKFIA-RKNWIVTGS---DDMQIRVFNYNTGEKVKTFEAHSDYIRSIAV 105 (794)
T ss_pred EeeecCeeEEEecccceeeeeeeecccchhhheeee-ccceEEEec---CCceEEEEecccceeeEEeeccccceeeeee
Confidence 34456666666666544322 1111222 2222222 233444544 5678999999876655555555578999999
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCCCCcEE-EEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDGKNKFN-LVDNNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG~~~~~-l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
.|..-++ .|.+... .|..-+-++.-... .+....++...|+|.|.+..-|.+-.-.+.|-+-++
T Consensus 106 HPt~P~v-LtsSDDm-~iKlW~we~~wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~sLDrTVKVWsl 170 (794)
T KOG0276|consen 106 HPTLPYV-LTSSDDM-TIKLWDWENEWACEQTFEGHEHYVMQVAFNPKDPNTFASASLDRTVKVWSL 170 (794)
T ss_pred cCCCCeE-EecCCcc-EEEEeeccCceeeeeEEcCcceEEEEEEecCCCccceeeeeccccEEEEEc
Confidence 9854333 3333333 56666666654433 334456677889999988888877655554444333
No 218
>KOG4328|consensus
Probab=88.89 E-value=4 Score=42.92 Aligned_cols=112 Identities=11% Similarity=0.041 Sum_probs=58.6
Q ss_pred cceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC--CCCcEEEEeCCCCcee
Q psy6570 74 PYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP--KARTIESINLNGKDRF 151 (713)
Q Consensus 74 p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~--~~~~I~~~~~~g~~~~ 151 (713)
-.++.+...++.+|+.+.-....++-.+++|+....+.... ....+|++.|...+++.+-. .+.+||-+.--+..+.
T Consensus 282 fs~~d~~~e~~~vl~~~~~G~f~~iD~R~~~s~~~~~~lh~-kKI~sv~~NP~~p~~laT~s~D~T~kIWD~R~l~~K~s 360 (498)
T KOG4328|consen 282 FSSLDFSAESRSVLFGDNVGNFNVIDLRTDGSEYENLRLHK-KKITSVALNPVCPWFLATASLDQTAKIWDLRQLRGKAS 360 (498)
T ss_pred eeeccccCCCccEEEeecccceEEEEeecCCccchhhhhhh-cccceeecCCCCchheeecccCcceeeeehhhhcCCCC
Confidence 34555555666666655332436666677777544433212 26789999998888777754 3445665553333332
Q ss_pred EEEecCCCCccceeeeee---CCeEEEEeCCCCcEEEEcc
Q psy6570 152 VVYHTEDNGYKPYKLEVF---EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 152 ~~~~~~~~~~~p~~i~~~---~~~ly~td~~~~~i~~~~~ 188 (713)
.++.... ...+..-++| ++. .++....+.|..++.
T Consensus 361 p~lst~~-HrrsV~sAyFSPs~gt-l~TT~~D~~IRv~ds 398 (498)
T KOG4328|consen 361 PFLSTLP-HRRSVNSAYFSPSGGT-LLTTCQDNEIRVFDS 398 (498)
T ss_pred cceeccc-ccceeeeeEEcCCCCc-eEeeccCCceEEeec
Confidence 2222211 1223322333 445 444455556665553
No 219
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=88.86 E-value=0.2 Score=32.79 Aligned_cols=25 Identities=32% Similarity=0.862 Sum_probs=12.9
Q ss_pred CCCCCCcEEeecC-CCceeeCCCCCc
Q psy6570 558 NYCSNNGTCVLIE-GKPSCKCLPPYS 582 (713)
Q Consensus 558 ~~C~~~~~C~~~~-g~~~C~C~~G~~ 582 (713)
..|..++.|.+.. |++.|.|..||.
T Consensus 5 ~~cP~NA~C~~~~dG~eecrCllgyk 30 (37)
T PF12946_consen 5 TKCPANAGCFRYDDGSEECRCLLGYK 30 (37)
T ss_dssp S---TTEEEEEETTSEEEEEE-TTEE
T ss_pred ccCCCCcccEEcCCCCEEEEeeCCcc
Confidence 3455566666654 666666666664
No 220
>PHA02887 EGF-like protein; Provisional
Probab=88.73 E-value=0.43 Score=39.87 Aligned_cols=30 Identities=30% Similarity=0.720 Sum_probs=22.3
Q ss_pred CCCCCeeec-CCCCCCeeecCCCcccCCccc
Q psy6570 315 DCNHGTCEF-DDDFDPHCICQENFYGTYCEK 344 (713)
Q Consensus 315 ~C~~~~C~~-~~~~~~~C~C~~g~~G~~C~~ 344 (713)
-|.||+|.. .+...+.|.|+.||+|..|+.
T Consensus 93 YCiHG~C~yI~dL~epsCrC~~GYtG~RCE~ 123 (126)
T PHA02887 93 FCINGECMNIIDLDEKFCICNKGYTGIRCDE 123 (126)
T ss_pred EeeCCEEEccccCCCceeECCCCcccCCCCc
Confidence 478888887 333457888888888888875
No 221
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=88.59 E-value=0.28 Score=57.01 Aligned_cols=21 Identities=24% Similarity=0.572 Sum_probs=13.8
Q ss_pred cceeecCCCcc----cCCCCcCCCC
Q psy6570 612 KPVCTCVNGWS----GITCSERVSC 632 (713)
Q Consensus 612 ~~~C~C~~G~~----G~~C~~~~~C 632 (713)
.-+|+|..||. +..|.....|
T Consensus 681 ~~~C~C~~g~~p~~~~~~C~~~~~C 705 (800)
T PTZ00214 681 VRRCWCERGFLPALDRSGCVLPTEC 705 (800)
T ss_pred cceeEecCCcccccCCCccccccCC
Confidence 34688998886 5567654444
No 222
>PTZ00421 coronin; Provisional
Probab=88.51 E-value=45 Score=37.05 Aligned_cols=116 Identities=9% Similarity=0.047 Sum_probs=68.1
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceE-------EEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEE
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKK-------RTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISG 99 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~-------~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~ 99 (713)
...+|++++..+.++++-. ..+.|.+.++..... ...+.........|++.|..+.++++-.... .|..
T Consensus 77 ~V~~v~fsP~d~~~LaSgS---~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~Dg-tVrI 152 (493)
T PTZ00421 77 PIIDVAFNPFDPQKLFTAS---EDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADM-VVNV 152 (493)
T ss_pred CEEEEEEcCCCCCEEEEEe---CCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCC-EEEE
Confidence 3467888774455555555 567788877653211 1112222345678999887666666544333 6777
Q ss_pred EecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 100 ASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 100 ~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
-++........+.........|++.+.. .++++-...+.|...|+..
T Consensus 153 WDl~tg~~~~~l~~h~~~V~sla~spdG-~lLatgs~Dg~IrIwD~rs 199 (493)
T PTZ00421 153 WDVERGKAVEVIKCHSDQITSLEWNLDG-SLLCTTSKDKKLNIIDPRD 199 (493)
T ss_pred EECCCCeEEEEEcCCCCceEEEEEECCC-CEEEEecCCCEEEEEECCC
Confidence 7766443333333333456789998754 4555555677788888753
No 223
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=88.50 E-value=5.8 Score=42.04 Aligned_cols=100 Identities=14% Similarity=0.153 Sum_probs=61.2
Q ss_pred ceEEEe-ccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccC---CCCeEEEEecC-
Q psy6570 29 RGVAVD-WVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELG---IKPRISGASID- 103 (713)
Q Consensus 29 ~gla~D-~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~---~~~~I~~~~~d- 103 (713)
..+.+- ..++.++|.-. ..+-..|.+++++|...+.|......--.-+.+|+.++.|||+... ....|++++++
T Consensus 238 ~~~~~~~~~~~~~l~~s~-~~G~~hly~~~~~~~~~~~lT~G~~~V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~ 316 (353)
T PF00930_consen 238 DPPHFLGPDGNEFLWISE-RDGYRHLYLYDLDGGKPRQLTSGDWEVTSILGWDEDNNRIYFTANGDNPGERHLYRVSLDS 316 (353)
T ss_dssp SEEEE-TTTSSEEEEEEE-TTSSEEEEEEETTSSEEEESS-SSS-EEEEEEEECTSSEEEEEESSGGTTSBEEEEEETTE
T ss_pred cccccccCCCCEEEEEEE-cCCCcEEEEEcccccceeccccCceeecccceEcCCCCEEEEEecCCCCCceEEEEEEeCC
Confidence 344442 34555555444 2246789999999988776554432222358899999999998754 23489999999
Q ss_pred CCCcEEEEeCCCCCC-eeEEEeCCCCeEE
Q psy6570 104 GKNKFNLVDNNIQWP-TGITIDYPSQRLY 131 (713)
Q Consensus 104 G~~~~~l~~~~~~~p-~glavd~~~~~LY 131 (713)
+...+.|.. .... ..++|++..+.+.
T Consensus 317 ~~~~~~LT~--~~~~~~~~~~Spdg~y~v 343 (353)
T PF00930_consen 317 GGEPKCLTC--EDGDHYSASFSPDGKYYV 343 (353)
T ss_dssp TTEEEESST--TSSTTEEEEE-TTSSEEE
T ss_pred CCCeEeccC--CCCCceEEEECCCCCEEE
Confidence 665555532 2222 4788886655443
No 224
>KOG4328|consensus
Probab=88.44 E-value=13 Score=39.35 Aligned_cols=183 Identities=14% Similarity=0.063 Sum_probs=89.7
Q ss_pred ccCCceeEEEccCcc-cEEecC---CCCCceEEEeccCCeEEEeecCCCCCCeEEEEec--CCceEEEEEcCCCCCcceE
Q psy6570 4 ISSGNVTRVKREMNL-KTVLSN---LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTL--EGRKKRTLLNTGLNEPYDI 77 (713)
Q Consensus 4 ~~~~~I~~~~~~~~~-~~~~~~---~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~--~G~~~~~l~~~~~~~p~~i 77 (713)
..+|.|...++++.. +++.+. -..-.++.+-...+.+|+.+. -+...++++ +|+....+.... .....|
T Consensus 254 SyDGtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~----~G~f~~iD~R~~~s~~~~~~lh~-kKI~sv 328 (498)
T KOG4328|consen 254 SYDGTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDN----VGNFNVIDLRTDGSEYENLRLHK-KKITSV 328 (498)
T ss_pred ccCceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeec----ccceEEEEeecCCccchhhhhhh-ccccee
Confidence 446666666666543 222221 111233444445566666654 233334443 444333222222 378999
Q ss_pred EEcCCCCcEEEEccCCC-CeEEEEecCCCCcEEEEe--CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC--C---Cc
Q psy6570 78 ALEPLSGRMFWTELGIK-PRISGASIDGKNKFNLVD--NNIQWPTGITIDYPSQRLYWADPKARTIESINLN--G---KD 149 (713)
Q Consensus 78 avD~~~~~ly~td~~~~-~~I~~~~~dG~~~~~l~~--~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~--g---~~ 149 (713)
++.|...+++.|..... .+||-+.-=+.....++. ..-...+...++|.++.|.-+. ...+|+.++.. + ..
T Consensus 329 ~~NP~~p~~laT~s~D~T~kIWD~R~l~~K~sp~lst~~HrrsV~sAyFSPs~gtl~TT~-~D~~IRv~dss~~sa~~~p 407 (498)
T KOG4328|consen 329 ALNPVCPWFLATASLDQTAKIWDLRQLRGKASPFLSTLPHRRSVNSAYFSPSGGTLLTTC-QDNEIRVFDSSCISAKDEP 407 (498)
T ss_pred ecCCCCchheeecccCcceeeeehhhhcCCCCcceecccccceeeeeEEcCCCCceEeec-cCCceEEeecccccccCCc
Confidence 99999988888765432 257755322222221332 1233456777888888865554 44566666642 1 11
Q ss_pred eeEEEecCC--CCccceeeeee-CCeEEEEeCCCCcEEEEcccCCC
Q psy6570 150 RFVVYHTED--NGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNS 192 (713)
Q Consensus 150 ~~~~~~~~~--~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~ 192 (713)
...+..... ++..|+-..++ ...|+++-.....|-.++..++.
T Consensus 408 ~~~I~Hn~~t~RwlT~fKA~W~P~~~li~vg~~~r~IDv~~~~~~q 453 (498)
T KOG4328|consen 408 LGTIPHNNRTGRWLTPFKAAWDPDYNLIVVGRYPRPIDVFDGNGGQ 453 (498)
T ss_pred cceeeccCcccccccchhheeCCCccEEEEeccCcceeEEcCCCCE
Confidence 122222211 23344444443 23343444445556666655444
No 225
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=88.43 E-value=22 Score=35.42 Aligned_cols=136 Identities=15% Similarity=0.097 Sum_probs=72.7
Q ss_pred CCeEEEeecCCCCCCeEEEEecCCceEE-EEEcCC--C----------CCcceEEEcCCCCc-EEEEccCCCCeEEEEec
Q psy6570 37 GKNLYWTDAGGRSSNNIMVSTLEGRKKR-TLLNTG--L----------NEPYDIALEPLSGR-MFWTELGIKPRISGASI 102 (713)
Q Consensus 37 ~~~ly~td~~~~~~~~I~~~~~~G~~~~-~l~~~~--~----------~~p~~iavD~~~~~-ly~td~~~~~~I~~~~~ 102 (713)
++.||.--. ....|.++++..+.+. .+...+ . ..=.++|+| ++|+ +.++...+++.|.++.+
T Consensus 78 ngslYY~~~---~s~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvD-E~GLWvIYat~~~~g~ivvskl 153 (250)
T PF02191_consen 78 NGSLYYNKY---NSRNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVD-ENGLWVIYATEDNNGNIVVSKL 153 (250)
T ss_pred CCcEEEEec---CCceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEEEc-CCCEEEEEecCCCCCcEEEEee
Confidence 578888777 6889999999987655 322211 1 112479999 5665 22233333336887777
Q ss_pred CCCCcEEE--EeCCCCCC---eeEEEeCCCCeEEEEcCCC---CcE-EEEeCCCCcee-EEEecCCCCccceeeeee--C
Q psy6570 103 DGKNKFNL--VDNNIQWP---TGITIDYPSQRLYWADPKA---RTI-ESINLNGKDRF-VVYHTEDNGYKPYKLEVF--E 170 (713)
Q Consensus 103 dG~~~~~l--~~~~~~~p---~glavd~~~~~LY~~d~~~---~~I-~~~~~~g~~~~-~~~~~~~~~~~p~~i~~~--~ 170 (713)
|-....+. ..+.+.++ +.+.+ =+.||.++... .+| +.+|+.-.... +-+...........|+.. +
T Consensus 154 d~~tL~v~~tw~T~~~k~~~~naFmv---CGvLY~~~s~~~~~~~I~yafDt~t~~~~~~~i~f~~~~~~~~~l~YNP~d 230 (250)
T PF02191_consen 154 DPETLSVEQTWNTSYPKRSAGNAFMV---CGVLYATDSYDTRDTEIFYAFDTYTGKEEDVSIPFPNPYGNISMLSYNPRD 230 (250)
T ss_pred CcccCceEEEEEeccCchhhcceeeE---eeEEEEEEECCCCCcEEEEEEECCCCceeceeeeeccccCceEeeeECCCC
Confidence 76544333 23333332 33444 48999998644 333 45665432222 212221122233334433 6
Q ss_pred CeEEEEeCC
Q psy6570 171 DNLYFSTYR 179 (713)
Q Consensus 171 ~~ly~td~~ 179 (713)
.+||.=|.+
T Consensus 231 k~LY~wd~G 239 (250)
T PF02191_consen 231 KKLYAWDNG 239 (250)
T ss_pred CeEEEEECC
Confidence 778875543
No 226
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=88.33 E-value=0.38 Score=34.31 Aligned_cols=32 Identities=41% Similarity=1.142 Sum_probs=22.9
Q ss_pred CeeecCCCCCeecCCCCccCCCCCCccccCCCCCCcccCCCCccCCC
Q psy6570 461 GECSITDSGPKCMCSPGYSGKKCDTCTCLNGDSGPKCMCSPGYSGKK 507 (713)
Q Consensus 461 ~~C~~~~~~~~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~G~~g~~ 507 (713)
..|....| +|.|.++|+|+.|+. |.+||++..
T Consensus 11 ~~C~~~~G--~C~C~~~~~G~~C~~-------------C~~g~~~~~ 42 (49)
T PF00053_consen 11 QTCDPSTG--QCVCKPGTTGPRCDQ-------------CKPGYFGLP 42 (49)
T ss_dssp SSEEETCE--EESBSTTEESTTS-E-------------E-TTEECST
T ss_pred CcccCCCC--EEeccccccCCcCcC-------------CCCcccccc
Confidence 45666444 899999999999985 778888754
No 227
>smart00284 OLF Olfactomedin-like domains.
Probab=88.29 E-value=29 Score=34.53 Aligned_cols=134 Identities=17% Similarity=0.096 Sum_probs=72.5
Q ss_pred CCeEEEeecCCCCCCeEEEEecCCceEE--EEEcC-C----------CCCcceEEEcCCCCc--EEEEccCCCCeEEEEe
Q psy6570 37 GKNLYWTDAGGRSSNNIMVSTLEGRKKR--TLLNT-G----------LNEPYDIALEPLSGR--MFWTELGIKPRISGAS 101 (713)
Q Consensus 37 ~~~ly~td~~~~~~~~I~~~~~~G~~~~--~l~~~-~----------~~~p~~iavD~~~~~--ly~td~~~~~~I~~~~ 101 (713)
++.||.... ....|.+++|..+.+. .++.. + ...-.++||| ++|+ ||-|.. +++.|..+.
T Consensus 83 ngslYY~~~---~s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvD-E~GLWvIYat~~-~~g~ivvSk 157 (255)
T smart00284 83 NGSLYFNKF---NSHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVD-ENGLWVIYATEQ-NAGKIVISK 157 (255)
T ss_pred CceEEEEec---CCccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEc-CCceEEEEeccC-CCCCEEEEe
Confidence 588998776 6788999999987653 32321 1 1122479999 4665 333433 334788888
Q ss_pred cCCCCcEEEE--eCCCCCC---eeEEEeCCCCeEEEEcCC---CCc-EEEEeCCCCceeEE-EecCCCCccceeeee--e
Q psy6570 102 IDGKNKFNLV--DNNIQWP---TGITIDYPSQRLYWADPK---ART-IESINLNGKDRFVV-YHTEDNGYKPYKLEV--F 169 (713)
Q Consensus 102 ~dG~~~~~l~--~~~~~~p---~glavd~~~~~LY~~d~~---~~~-I~~~~~~g~~~~~~-~~~~~~~~~p~~i~~--~ 169 (713)
+|-....++- .+.+.++ +++.|= +.||++++. ..+ .+.+|...+....+ +........-..|+. .
T Consensus 158 Lnp~tL~ve~tW~T~~~k~sa~naFmvC---GvLY~~~s~~~~~~~I~yayDt~t~~~~~~~i~f~n~y~~~s~l~YNP~ 234 (255)
T smart00284 158 LNPATLTIENTWITTYNKRSASNAFMIC---GILYVTRSLGSKGEKVFYAYDTNTGKEGHLDIPFENMYEYISMLDYNPN 234 (255)
T ss_pred eCcccceEEEEEEcCCCcccccccEEEe---eEEEEEccCCCCCcEEEEEEECCCCccceeeeeeccccccceeceeCCC
Confidence 8765554433 3333322 344444 799999852 223 45566655433222 111111112222443 3
Q ss_pred CCeEEEEeC
Q psy6570 170 EDNLYFSTY 178 (713)
Q Consensus 170 ~~~ly~td~ 178 (713)
+..||.=|.
T Consensus 235 d~~LY~wdn 243 (255)
T smart00284 235 DRKLYAWNN 243 (255)
T ss_pred CCeEEEEeC
Confidence 677887443
No 228
>PTZ00420 coronin; Provisional
Probab=88.22 E-value=51 Score=37.26 Aligned_cols=119 Identities=10% Similarity=-0.002 Sum_probs=66.6
Q ss_pred CCCcceEEEcCCCCcEEEEccCCCCeEEEEecC--CCCc------EEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEE
Q psy6570 71 LNEPYDIALEPLSGRMFWTELGIKPRISGASID--GKNK------FNLVDNNIQWPTGITIDYPSQRLYWADPKARTIES 142 (713)
Q Consensus 71 ~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d--G~~~------~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~ 142 (713)
.....+|++.|..+.++++-.... .|..-++. +... ...+.........|+++|....|+.+-...+.|..
T Consensus 74 ~~~V~~lafsP~~~~lLASgS~Dg-tIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrI 152 (568)
T PTZ00420 74 TSSILDLQFNPCFSEILASGSEDL-TIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNI 152 (568)
T ss_pred CCCEEEEEEcCCCCCEEEEEeCCC-eEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEE
Confidence 356788999987666666654333 55555553 2111 11222234456789999877777666556677877
Q ss_pred EeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCc
Q psy6570 143 INLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSD 193 (713)
Q Consensus 143 ~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~ 193 (713)
.++........+.. ......+++. .+.++.+....+.|..++...+..
T Consensus 153 WDl~tg~~~~~i~~---~~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~ 201 (568)
T PTZ00420 153 WDIENEKRAFQINM---PKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEI 201 (568)
T ss_pred EECCCCcEEEEEec---CCcEEEEEECCCCCEEEEEecCCEEEEEECCCCcE
Confidence 77765432222211 1233455554 445555555556677777655443
No 229
>KOG2110|consensus
Probab=88.04 E-value=26 Score=36.14 Aligned_cols=139 Identities=15% Similarity=0.116 Sum_probs=78.9
Q ss_pred CceeEEEccCcc--cEEecCCCCCce-EEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 7 GNVTRVKREMNL--KTVLSNLHDPRG-VAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 7 ~~I~~~~~~~~~--~~~~~~~~~p~g-la~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
++|+..++...+ .++...-.+|.| +|+.+....-|.+-.+.+..+.|.+++...-.....+..--...-.||+++ +
T Consensus 106 e~IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~~lAalafs~-~ 184 (391)
T KOG2110|consen 106 ESIYIYDIKDMKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKGPLAALAFSP-D 184 (391)
T ss_pred ccEEEEecccceeehhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecCCceeEEEECC-C
Confidence 345566655532 222233356664 456555565666665555778899999876544444443334566799996 6
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEEEEeCC--CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFNLVDNN--IQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~--~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
|.|.-|-+.....|+++......+..-+..+ ......|+|+++...|-. -+.+..|..|.++-
T Consensus 185 G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~-sS~TeTVHiFKL~~ 249 (391)
T KOG2110|consen 185 GTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAA-SSNTETVHIFKLEK 249 (391)
T ss_pred CCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEE-ecCCCeEEEEEecc
Confidence 7777776655534444444322222222222 223467899987775544 34566777666654
No 230
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=88.02 E-value=51 Score=37.05 Aligned_cols=32 Identities=9% Similarity=0.081 Sum_probs=21.4
Q ss_pred eEEEeCCCCeEEEEcCC---------------CCcEEEEeCCCCcee
Q psy6570 120 GITIDYPSQRLYWADPK---------------ARTIESINLNGKDRF 151 (713)
Q Consensus 120 glavd~~~~~LY~~d~~---------------~~~I~~~~~~g~~~~ 151 (713)
.+++|+..+.|||.-.+ +..|..+|++....+
T Consensus 238 ~~s~D~~~~lvy~~tGnp~p~~~~~r~gdnl~~~s~vAld~~TG~~~ 284 (527)
T TIGR03075 238 TGSYDPETNLIYFGTGNPSPWNSHLRPGDNLYTSSIVARDPDTGKIK 284 (527)
T ss_pred ceeEcCCCCeEEEeCCCCCCCCCCCCCCCCccceeEEEEccccCCEE
Confidence 46999999999997522 235777776644443
No 231
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=87.98 E-value=25 Score=39.52 Aligned_cols=17 Identities=6% Similarity=0.045 Sum_probs=14.0
Q ss_pred eeEEEeCCCCeEEEEcC
Q psy6570 119 TGITIDYPSQRLYWADP 135 (713)
Q Consensus 119 ~glavd~~~~~LY~~d~ 135 (713)
.++|+|+..+.||+...
T Consensus 390 ~~~A~Dp~~g~~yvp~~ 406 (527)
T TIGR03075 390 QPMAYSPKTGLFYVPAN 406 (527)
T ss_pred CCceECCCCCEEEEecc
Confidence 46899999999998754
No 232
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=87.96 E-value=9.6 Score=40.59 Aligned_cols=128 Identities=16% Similarity=0.118 Sum_probs=60.8
Q ss_pred CCeEEEEecCCceEEEEEcCCC--CCcceE--EEcCCCCcEEEEccCCCCeEEEEecC--CCCc-EEEEeC---------
Q psy6570 50 SNNIMVSTLEGRKKRTLLNTGL--NEPYDI--ALEPLSGRMFWTELGIKPRISGASID--GKNK-FNLVDN--------- 113 (713)
Q Consensus 50 ~~~I~~~~~~G~~~~~l~~~~~--~~p~~i--avD~~~~~ly~td~~~~~~I~~~~~d--G~~~-~~l~~~--------- 113 (713)
.++|.+.++.....+..+.-+- ..|..| +-||....=|+.-.-.. .|+++..+ |+-. +.++..
T Consensus 221 G~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFvg~aLss-~i~~~~k~~~g~W~a~kVi~ip~~~v~~~~ 299 (461)
T PF05694_consen 221 GHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFVGCALSS-SIWRFYKDDDGEWAAEKVIDIPAKKVEGWI 299 (461)
T ss_dssp --EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEEEEE--E-EEEEEEE-ETTEEEEEEEEEE--EE--SS-
T ss_pred cCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEEEEeccc-eEEEEEEcCCCCeeeeEEEECCCcccCccc
Confidence 3567777777666554443221 123222 22444445566544333 67776553 3211 122210
Q ss_pred --CC--------CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc-eeEEEecCC------------------CCccce
Q psy6570 114 --NI--------QWPTGITIDYPSQRLYWADPKARTIESINLNGKD-RFVVYHTED------------------NGYKPY 164 (713)
Q Consensus 114 --~~--------~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~-~~~~~~~~~------------------~~~~p~ 164 (713)
.+ ..++.|.|+.+.++||++.+..+.|+.+|+.... .+.+.+... ..++-+
T Consensus 300 lp~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMv 379 (461)
T PF05694_consen 300 LPEMLKPFGAVPPLITDILISLDDRFLYVSNWLHGDVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMV 379 (461)
T ss_dssp --GGGGGG-EE------EEE-TTS-EEEEEETTTTEEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----E
T ss_pred ccccccccccCCCceEeEEEccCCCEEEEEcccCCcEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeE
Confidence 11 3468888998899999999999999999987643 333322110 112335
Q ss_pred eeeeeCCeEEEEeC
Q psy6570 165 KLEVFEDNLYFSTY 178 (713)
Q Consensus 165 ~i~~~~~~ly~td~ 178 (713)
.++.++.+||||++
T Consensus 380 qlS~DGkRlYvTnS 393 (461)
T PF05694_consen 380 QLSLDGKRLYVTNS 393 (461)
T ss_dssp EE-TTSSEEEEE--
T ss_pred EEccCCeEEEEEee
Confidence 67788999999974
No 233
>KOG0272|consensus
Probab=87.87 E-value=9.9 Score=39.67 Aligned_cols=179 Identities=16% Similarity=0.112 Sum_probs=99.3
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-CCCcceEEEcCCCCcEEEEccCC-CCeEEEEe
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-LNEPYDIALEPLSGRMFWTELGI-KPRISGAS 101 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-~~~p~~iavD~~~~~ly~td~~~-~~~I~~~~ 101 (713)
.+.+...++|.|.++.|- |-..- ...+|+ |+..+ .+.+..++ .....+||+.+ +|-|..|.... -++||-..
T Consensus 260 H~~RVs~VafHPsG~~L~-TasfD-~tWRlW--D~~tk-~ElL~QEGHs~~v~~iaf~~-DGSL~~tGGlD~~~RvWDlR 333 (459)
T KOG0272|consen 260 HLARVSRVAFHPSGKFLG-TASFD-STWRLW--DLETK-SELLLQEGHSKGVFSIAFQP-DGSLAATGGLDSLGRVWDLR 333 (459)
T ss_pred chhhheeeeecCCCceee-ecccc-cchhhc--ccccc-hhhHhhcccccccceeEecC-CCceeeccCccchhheeecc
Confidence 345566677777655543 33210 123333 33222 22333333 34566899985 67777775332 24566443
Q ss_pred cCCCCcEEE-EeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCce-eEEEecCCCCccceeeeeeCCeEEEEeCC
Q psy6570 102 IDGKNKFNL-VDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDR-FVVYHTEDNGYKPYKLEVFEDNLYFSTYR 179 (713)
Q Consensus 102 ~dG~~~~~l-~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~-~~~~~~~~~~~~p~~i~~~~~~ly~td~~ 179 (713)
+.+.++ +...+....+++++| +++..-+-...+.+..-++.+... .++..... +..-.-+....++..+|...
T Consensus 334 ---tgr~im~L~gH~k~I~~V~fsP-NGy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~n-lVS~Vk~~p~~g~fL~Tasy 408 (459)
T KOG0272|consen 334 ---TGRCIMFLAGHIKEILSVAFSP-NGYHLATGSSDNTCKVWDLRMRSELYTIPAHSN-LVSQVKYSPQEGYFLVTASY 408 (459)
T ss_pred ---cCcEEEEecccccceeeEeECC-CceEEeecCCCCcEEEeeecccccceecccccc-hhhheEecccCCeEEEEccc
Confidence 223333 344677778999995 677776766555544444444322 12221111 11112222246788888888
Q ss_pred CCcEEEEcccCCCcceeeeccccccccEEEEeec
Q psy6570 180 TNNILKINKFGNSDFNVLANNLNRASDVLILQEN 213 (713)
Q Consensus 180 ~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~ 213 (713)
.+.+......+...+..++..-.++..+.+....
T Consensus 409 D~t~kiWs~~~~~~~ksLaGHe~kV~s~Dis~d~ 442 (459)
T KOG0272|consen 409 DNTVKIWSTRTWSPLKSLAGHEGKVISLDISPDS 442 (459)
T ss_pred CcceeeecCCCcccchhhcCCccceEEEEeccCC
Confidence 8888888888888888888877777777665443
No 234
>KOG0285|consensus
Probab=87.84 E-value=7.3 Score=39.77 Aligned_cols=132 Identities=16% Similarity=0.160 Sum_probs=78.8
Q ss_pred CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCce
Q psy6570 71 LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDR 150 (713)
Q Consensus 71 ~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~ 150 (713)
+.+.+.|+|||. +..|.|.+... .|...++.....+.-+...+..-.+++|+...-+||-+- ..+.|--.|+.-..
T Consensus 151 lgWVr~vavdP~-n~wf~tgs~Dr-tikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~g-edk~VKCwDLe~nk- 226 (460)
T KOG0285|consen 151 LGWVRSVAVDPG-NEWFATGSADR-TIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAG-EDKQVKCWDLEYNK- 226 (460)
T ss_pred cceEEEEeeCCC-ceeEEecCCCc-eeEEEEcccCeEEEeecchhheeeeeeecccCceEEEec-CCCeeEEEechhhh-
Confidence 677889999986 55666665544 677777765555444444677789999998777887664 33455555554322
Q ss_pred eEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccE
Q psy6570 151 FVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDV 207 (713)
Q Consensus 151 ~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i 207 (713)
++..-...+...+.|+++ -..+.++......++..+......+.++......+..+
T Consensus 227 -vIR~YhGHlS~V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V~~l~GH~~~V~~V 283 (460)
T KOG0285|consen 227 -VIRHYHGHLSGVYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASVHVLSGHTNPVASV 283 (460)
T ss_pred -hHHHhccccceeEEEeccccceeEEecCCcceEEEeeecccceEEEecCCCCcceeE
Confidence 222221124455666665 23455665555556666666666666665544443443
No 235
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=87.63 E-value=0.6 Score=33.44 Aligned_cols=23 Identities=48% Similarity=1.380 Sum_probs=19.2
Q ss_pred eecCCCCccCCCCCCccccCCCCCCcccCCCCccCC
Q psy6570 471 KCMCSPGYSGKKCDTCTCLNGDSGPKCMCSPGYSGK 506 (713)
Q Consensus 471 ~C~C~~G~~g~~C~~~~C~~~~~~~~C~C~~G~~g~ 506 (713)
+|.|.++|+|..|+. |.+||++.
T Consensus 20 ~C~C~~~~~G~~C~~-------------C~~g~~~~ 42 (50)
T cd00055 20 QCECKPNTTGRRCDR-------------CAPGYYGL 42 (50)
T ss_pred EEeCCCcCCCCCCCC-------------CCCCCccC
Confidence 899999999999885 77888764
No 236
>KOG0279|consensus
Probab=87.44 E-value=32 Score=34.15 Aligned_cols=159 Identities=9% Similarity=0.044 Sum_probs=89.4
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEE-----EEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEe
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRT-----LLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGAS 101 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~-----l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~ 101 (713)
...+|++-..+..+|++-+ ....|.+.+++....+. .+..-.....++++.+ ++..+++-.... .++.-+
T Consensus 17 ~Vt~la~~~~~~~~l~sas---rDk~ii~W~L~~dd~~~G~~~r~~~GHsH~v~dv~~s~-dg~~alS~swD~-~lrlWD 91 (315)
T KOG0279|consen 17 WVTALAIKIKNSDILVSAS---RDKTIIVWKLTSDDIKYGVPVRRLTGHSHFVSDVVLSS-DGNFALSASWDG-TLRLWD 91 (315)
T ss_pred eEEEEEeecCCCceEEEcc---cceEEEEEEeccCccccCceeeeeeccceEecceEEcc-CCceEEeccccc-eEEEEE
Confidence 3456777666667777776 56677776666443221 1112235677888875 566666654434 566667
Q ss_pred cCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecC-CCCccceeeeeeC-CeEEEEeCC
Q psy6570 102 IDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTE-DNGYKPYKLEVFE-DNLYFSTYR 179 (713)
Q Consensus 102 ~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~-~~~~~p~~i~~~~-~~ly~td~~ 179 (713)
+.+...+..+.....--.++||++++.+| ++-+..+.|...+.-|.-...+.... .....-.-+...+ +-++++...
T Consensus 92 l~~g~~t~~f~GH~~dVlsva~s~dn~qi-vSGSrDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~ 170 (315)
T KOG0279|consen 92 LATGESTRRFVGHTKDVLSVAFSTDNRQI-VSGSRDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPNESNPIIVSASW 170 (315)
T ss_pred ecCCcEEEEEEecCCceEEEEecCCCcee-ecCCCcceeeeeeecccEEEEEecCCCcCcEEEEEEcCCCCCcEEEEccC
Confidence 77654444444455556789999655444 56666777777777776555554442 1121222222223 233444444
Q ss_pred CCcEEEEcccCC
Q psy6570 180 TNNILKINKFGN 191 (713)
Q Consensus 180 ~~~i~~~~~~~~ 191 (713)
.+.|...+..+-
T Consensus 171 DktvKvWnl~~~ 182 (315)
T KOG0279|consen 171 DKTVKVWNLRNC 182 (315)
T ss_pred CceEEEEccCCc
Confidence 555666665443
No 237
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=87.23 E-value=16 Score=38.99 Aligned_cols=144 Identities=10% Similarity=0.110 Sum_probs=77.2
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCceEEE---EEc--CCCCCcceEEEcCCCCcEEEEccCCCCeEEEEec-CCCCcEE
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRKKRT---LLN--TGLNEPYDIALEPLSGRMFWTELGIKPRISGASI-DGKNKFN 109 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~~~~---l~~--~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~-dG~~~~~ 109 (713)
.++++|+... .+.|..+++++...+- +.. ..+..| +.+. +|+||+.++.. +++.+++ +|+.+-.
T Consensus 67 ~dg~v~~~~~----~G~i~A~d~~~g~~~W~~~~~~~~~~~~~~--~~~~--~G~i~~g~~~g--~~y~ld~~~G~~~W~ 136 (370)
T COG1520 67 GDGTVYVGTR----DGNIFALNPDTGLVKWSYPLLGAVAQLSGP--ILGS--DGKIYVGSWDG--KLYALDASTGTLVWS 136 (370)
T ss_pred eCCeEEEecC----CCcEEEEeCCCCcEEecccCcCcceeccCc--eEEe--CCeEEEecccc--eEEEEECCCCcEEEE
Confidence 4688998743 5678888887654321 111 112222 3332 78899998764 5888888 6765544
Q ss_pred EEeCC-CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC-CCceeEEEecC-CCCccceeeeeeCCeEEEEeCC-CCcEEE
Q psy6570 110 LVDNN-IQWPTGITIDYPSQRLYWADPKARTIESINLN-GKDRFVVYHTE-DNGYKPYKLEVFEDNLYFSTYR-TNNILK 185 (713)
Q Consensus 110 l~~~~-~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~-g~~~~~~~~~~-~~~~~p~~i~~~~~~ly~td~~-~~~i~~ 185 (713)
.-... ........+. ++.+|+.. ..+.++.++.+ |..+=...... ..........+.++.+|+.... ...++.
T Consensus 137 ~~~~~~~~~~~~~v~~--~~~v~~~s-~~g~~~al~~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~a 213 (370)
T COG1520 137 RNVGGSPYYASPPVVG--DGTVYVGT-DDGHLYALNADTGTLKWTYETPAPLSLSIYGSPAIASGTVYVGSDGYDGILYA 213 (370)
T ss_pred EecCCCeEEecCcEEc--CcEEEEec-CCCeEEEEEccCCcEEEEEecCCccccccccCceeecceEEEecCCCcceEEE
Confidence 43323 1122222332 56777664 45778888776 54332211111 1111222222667788887553 345777
Q ss_pred EcccCCC
Q psy6570 186 INKFGNS 192 (713)
Q Consensus 186 ~~~~~~~ 192 (713)
++...+.
T Consensus 214 ~~~~~G~ 220 (370)
T COG1520 214 LNAEDGT 220 (370)
T ss_pred EEccCCc
Confidence 7764443
No 238
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=87.09 E-value=0.54 Score=40.00 Aligned_cols=32 Identities=31% Similarity=0.629 Sum_probs=23.9
Q ss_pred CCCCCCeeec-CCCCCCeeecCCCcccCCcccc
Q psy6570 314 LDCNHGTCEF-DDDFDPHCICQENFYGTYCEKV 345 (713)
Q Consensus 314 ~~C~~~~C~~-~~~~~~~C~C~~g~~G~~C~~~ 345 (713)
+-|.||.|.. .+-..+.|.|..||+|..|+..
T Consensus 51 ~YClHG~C~yI~dl~~~~CrC~~GYtGeRCEh~ 83 (139)
T PHA03099 51 GYCLHGDCIHARDIDGMYCRCSHGYTGIRCQHV 83 (139)
T ss_pred CEeECCEEEeeccCCCceeECCCCcccccccce
Confidence 3488888877 3344578999999999888754
No 239
>KOG0282|consensus
Probab=86.91 E-value=6.6 Score=41.64 Aligned_cols=202 Identities=12% Similarity=0.022 Sum_probs=105.0
Q ss_pred CceeEEEccCccc-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 7 GNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 7 ~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
..|...|.++... .-.....-|.-+-+.+.+.++|++-. ...+|...|.....+..-....|.....|.+-+. +.
T Consensus 280 ~~lKlwDtETG~~~~~f~~~~~~~cvkf~pd~~n~fl~G~---sd~ki~~wDiRs~kvvqeYd~hLg~i~~i~F~~~-g~ 355 (503)
T KOG0282|consen 280 RFLKLWDTETGQVLSRFHLDKVPTCVKFHPDNQNIFLVGG---SDKKIRQWDIRSGKVVQEYDRHLGAILDITFVDE-GR 355 (503)
T ss_pred eeeeeeccccceEEEEEecCCCceeeecCCCCCcEEEEec---CCCcEEEEeccchHHHHHHHhhhhheeeeEEccC-Cc
Confidence 3344445554432 22566667888999988889998887 6788888887654322212234677778888754 44
Q ss_pred EEEEccCCC-CeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCce---eEEEecCCCCc
Q psy6570 86 MFWTELGIK-PRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDR---FVVYHTEDNGY 161 (713)
Q Consensus 86 ly~td~~~~-~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~---~~~~~~~~~~~ 161 (713)
=|++.+... -+||.....- ..+.++......--.|++.| ++..+.+....++|..+...-..+ +..+.....-.
T Consensus 356 rFissSDdks~riWe~~~~v-~ik~i~~~~~hsmP~~~~~P-~~~~~~aQs~dN~i~ifs~~~~~r~nkkK~feGh~vaG 433 (503)
T KOG0282|consen 356 RFISSSDDKSVRIWENRIPV-PIKNIADPEMHTMPCLTLHP-NGKWFAAQSMDNYIAIFSTVPPFRLNKKKRFEGHSVAG 433 (503)
T ss_pred eEeeeccCccEEEEEcCCCc-cchhhcchhhccCcceecCC-CCCeehhhccCceEEEEecccccccCHhhhhcceeccC
Confidence 455443322 2444433321 22222223333334577775 566677888788888877544322 22222211122
Q ss_pred cceeeeee-CCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccEEEEeeccc
Q psy6570 162 KPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDVLILQENKQ 215 (713)
Q Consensus 162 ~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~~~~~q 215 (713)
.+..+++. ++....+-...++++.++-.....+..+... ..+--.+.+|+...
T Consensus 434 ys~~v~fSpDG~~l~SGdsdG~v~~wdwkt~kl~~~lkah-~~~ci~v~wHP~e~ 487 (503)
T KOG0282|consen 434 YSCQVDFSPDGRTLCSGDSDGKVNFWDWKTTKLVSKLKAH-DQPCIGVDWHPVEP 487 (503)
T ss_pred ceeeEEEcCCCCeEEeecCCccEEEeechhhhhhhccccC-CcceEEEEecCCCc
Confidence 33344433 3344444445666666654333333333332 23333344554443
No 240
>KOG3607|consensus
Probab=86.86 E-value=1.1 Score=51.29 Aligned_cols=32 Identities=25% Similarity=0.490 Sum_probs=23.5
Q ss_pred CCCCCCCeecCCCCccCCCCCceeeCCCCcccCCCCccccc
Q psy6570 634 HFCFNGGTCREQNYSLDPDLKPICICPRGYAGVRCQTLVHY 674 (713)
Q Consensus 634 ~~C~~~~~C~~~~~~~~~~~~~~C~C~~Gy~G~~C~~~~~~ 674 (713)
..|..+|+|.+ ...|+|.+||.+..|+.....
T Consensus 630 ~~C~g~GVCnn---------~~~ChC~~gwapp~C~~~~~~ 661 (716)
T KOG3607|consen 630 TTCNGHGVCNN---------ELNCHCEPGWAPPFCFIFGYG 661 (716)
T ss_pred cccCCCcccCC---------CcceeeCCCCCCCccccccCC
Confidence 44777788866 267888888888888776544
No 241
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=86.59 E-value=0.39 Score=34.23 Aligned_cols=23 Identities=57% Similarity=1.246 Sum_probs=16.1
Q ss_pred EEEcCCCeeeCCCCCccCCCCcC
Q psy6570 396 TCIATTQTCVCPPGFTGDTCQQC 418 (713)
Q Consensus 396 ~C~~~~~~C~C~~g~~g~~C~~C 418 (713)
.|...+++|.|.++|+|..|++|
T Consensus 12 ~C~~~~G~C~C~~~~~G~~C~~C 34 (49)
T PF00053_consen 12 TCDPSTGQCVCKPGTTGPRCDQC 34 (49)
T ss_dssp SEEETCEEESBSTTEESTTS-EE
T ss_pred cccCCCCEEeccccccCCcCcCC
Confidence 56666778888888888887643
No 242
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=86.14 E-value=0.85 Score=32.65 Aligned_cols=22 Identities=41% Similarity=1.040 Sum_probs=16.0
Q ss_pred EEcCCCeeeCCCCCccCCCCcC
Q psy6570 397 CIATTQTCVCPPGFTGDTCQQC 418 (713)
Q Consensus 397 C~~~~~~C~C~~g~~g~~C~~C 418 (713)
|...+++|.|.++|+|..|+.|
T Consensus 14 C~~~~G~C~C~~~~~G~~C~~C 35 (50)
T cd00055 14 CDPGTGQCECKPNTTGRRCDRC 35 (50)
T ss_pred ccCCCCEEeCCCcCCCCCCCCC
Confidence 5445678888888888887644
No 243
>KOG1274|consensus
Probab=86.05 E-value=18 Score=41.73 Aligned_cols=152 Identities=10% Similarity=0.008 Sum_probs=81.3
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCC--ceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEG--RKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G--~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~ 105 (713)
-..|.+|+.+..|+..+. .+.|.+.+-.. ...+++... .....+|+.+. ..|.+.+.++ .|.|...+..
T Consensus 16 ~t~i~~d~~gefi~tcgs----dg~ir~~~~~sd~e~P~ti~~~-g~~v~~ia~~s---~~f~~~s~~~-tv~~y~fps~ 86 (933)
T KOG1274|consen 16 LTLICYDPDGEFICTCGS----DGDIRKWKTNSDEEEPETIDIS-GELVSSIACYS---NHFLTGSEQN-TVLRYKFPSG 86 (933)
T ss_pred eEEEEEcCCCCEEEEecC----CCceEEeecCCcccCCchhhcc-CceeEEEeecc---cceEEeeccc-eEEEeeCCCC
Confidence 456889988888887775 34444443221 222222211 13455677652 2555555556 7899988876
Q ss_pred CcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEE
Q psy6570 106 NKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNIL 184 (713)
Q Consensus 106 ~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~ 184 (713)
....++..-....+.++|+..+.+|- +-...-.|-.++++.......+.... ..-..|+++ ++.+..+....+.|+
T Consensus 87 ~~~~iL~Rftlp~r~~~v~g~g~~ia-agsdD~~vK~~~~~D~s~~~~lrgh~--apVl~l~~~p~~~fLAvss~dG~v~ 163 (933)
T KOG1274|consen 87 EEDTILARFTLPIRDLAVSGSGKMIA-AGSDDTAVKLLNLDDSSQEKVLRGHD--APVLQLSYDPKGNFLAVSSCDGKVQ 163 (933)
T ss_pred CccceeeeeeccceEEEEecCCcEEE-eecCceeEEEEeccccchheeecccC--CceeeeeEcCCCCEEEEEecCceEE
Confidence 66655532222346788885444443 33334456666665444444433311 123566666 334444444566666
Q ss_pred EEcccCC
Q psy6570 185 KINKFGN 191 (713)
Q Consensus 185 ~~~~~~~ 191 (713)
.++...+
T Consensus 164 iw~~~~~ 170 (933)
T KOG1274|consen 164 IWDLQDG 170 (933)
T ss_pred EEEcccc
Confidence 6665433
No 244
>KOG2111|consensus
Probab=85.70 E-value=43 Score=33.91 Aligned_cols=139 Identities=15% Similarity=0.109 Sum_probs=80.6
Q ss_pred CCceeEEEccCcccEE--ecCCCCCceE-EEeccCCeEEEeecCCCCCCeEEEEecCCceE--EEEEcCCCCCcceEEEc
Q psy6570 6 SGNVTRVKREMNLKTV--LSNLHDPRGV-AVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKK--RTLLNTGLNEPYDIALE 80 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~--~~~~~~p~gl-a~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~--~~l~~~~~~~p~~iavD 80 (713)
.++|+.+....+.+.+ +....+|.|| ++++..+.-+.+=.+. ..+.|++.+|.-... ..++..-.....-|++.
T Consensus 112 ~~~I~VytF~~n~k~l~~~et~~NPkGlC~~~~~~~k~~LafPg~-k~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln 190 (346)
T KOG2111|consen 112 ENKIYVYTFPDNPKLLHVIETRSNPKGLCSLCPTSNKSLLAFPGF-KTGQVQIVDLASTKPNAPSIINAHDSDIACVALN 190 (346)
T ss_pred cCeEEEEEcCCChhheeeeecccCCCceEeecCCCCceEEEcCCC-ccceEEEEEhhhcCcCCceEEEcccCceeEEEEc
Confidence 3444544444333222 4455668886 4556666666665543 358899999876555 24444444667778887
Q ss_pred CCCCcEEEEccCCCCeEEEE-e-cCCCCcEEEEe-CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 81 PLSGRMFWTELGIKPRISGA-S-IDGKNKFNLVD-NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 81 ~~~~~ly~td~~~~~~I~~~-~-~dG~~~~~l~~-~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
..|.+.-|.+... .+.|+ + .+|+....+-. .+-....-||+++...+|-++. ..+.|..+.+...
T Consensus 191 -~~Gt~vATaStkG-TLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~~s~LavsS-dKgTlHiF~l~~~ 258 (346)
T KOG2111|consen 191 -LQGTLVATASTKG-TLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPNSSWLAVSS-DKGTLHIFSLRDT 258 (346)
T ss_pred -CCccEEEEeccCc-EEEEEEEcCCCcEeeeeecCCchheEEEEEeCCCccEEEEEc-CCCeEEEEEeecC
Confidence 5677766665444 55554 3 34554444432 1223446789998777776553 5666766666553
No 245
>KOG0292|consensus
Probab=85.67 E-value=67 Score=37.28 Aligned_cols=146 Identities=12% Similarity=0.145 Sum_probs=81.4
Q ss_pred ceeEEEccCcccEE--ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCc
Q psy6570 8 NVTRVKREMNLKTV--LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR 85 (713)
Q Consensus 8 ~I~~~~~~~~~~~~--~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ 85 (713)
++.|++.....++- -...++..++-+++.. .|.++.+ ....|.+.+|+-+.....++..-.+-+-||+.| +.+
T Consensus 231 KlWrmnetKaWEvDtcrgH~nnVssvlfhp~q-~lIlSns---EDksirVwDm~kRt~v~tfrrendRFW~laahP-~lN 305 (1202)
T KOG0292|consen 231 KLWRMNETKAWEVDTCRGHYNNVSSVLFHPHQ-DLILSNS---EDKSIRVWDMTKRTSVQTFRRENDRFWILAAHP-ELN 305 (1202)
T ss_pred eEEEeccccceeehhhhcccCCcceEEecCcc-ceeEecC---CCccEEEEecccccceeeeeccCCeEEEEEecC-Ccc
Confidence 34444444333322 3456678889999764 5556777 678999999987664444444446677788876 566
Q ss_pred EEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCC---CCcc
Q psy6570 86 MFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTED---NGYK 162 (713)
Q Consensus 86 ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~---~~~~ 162 (713)
||-+.... .+.++.++-.. -..+|. .+.||+.. ..+|+.+++....-..+.+... ....
T Consensus 306 LfAAgHDs--Gm~VFkleREr------------pa~~v~--~n~LfYvk--d~~i~~~d~~t~~d~~v~~lr~~g~~~~~ 367 (1202)
T KOG0292|consen 306 LFAAGHDS--GMIVFKLERER------------PAYAVN--GNGLFYVK--DRFIRSYDLRTQKDTAVASLRRPGTLWQP 367 (1202)
T ss_pred eeeeecCC--ceEEEEEcccC------------ceEEEc--CCEEEEEc--cceEEeeeccccccceeEeccCCCcccCC
Confidence 77664332 35555544322 234454 44455544 4677777776543333322211 1233
Q ss_pred ceeeee--eCCeEEEE
Q psy6570 163 PYKLEV--FEDNLYFS 176 (713)
Q Consensus 163 p~~i~~--~~~~ly~t 176 (713)
|..|.+ .++.+.+.
T Consensus 368 ~~smsYNpae~~vlic 383 (1202)
T KOG0292|consen 368 PRSLSYNPAENAVLIC 383 (1202)
T ss_pred cceeeeccccCeEEEE
Confidence 344444 46666665
No 246
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=85.59 E-value=0.28 Score=42.35 Aligned_cols=29 Identities=14% Similarity=0.068 Sum_probs=14.6
Q ss_pred ccchhHHHHHHHHHHHHHHhheeeEEEEe
Q psy6570 682 VNSHISSILILILLLITVGGIGYYIFRIK 710 (713)
Q Consensus 682 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 710 (713)
+.++++++++++|++|++++++++|+|+|
T Consensus 66 i~~Ii~gv~aGvIg~Illi~y~irR~~Kk 94 (122)
T PF01102_consen 66 IIGIIFGVMAGVIGIILLISYCIRRLRKK 94 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHS--
T ss_pred eeehhHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 44556666666666555554444444443
No 247
>PHA02713 hypothetical protein; Provisional
Probab=85.19 E-value=73 Score=36.10 Aligned_cols=161 Identities=10% Similarity=0.059 Sum_probs=85.4
Q ss_pred ceeEEEccCcccEEecCCCCCc---eEEEeccCCeEEEeecCC---CCCCeEEEEecCCceEEEEEcCCCCCcc---eEE
Q psy6570 8 NVTRVKREMNLKTVLSNLHDPR---GVAVDWVGKNLYWTDAGG---RSSNNIMVSTLEGRKKRTLLNTGLNEPY---DIA 78 (713)
Q Consensus 8 ~I~~~~~~~~~~~~~~~~~~p~---gla~D~~~~~ly~td~~~---~~~~~I~~~~~~G~~~~~l~~~~~~~p~---~ia 78 (713)
.+..+++....-..+..+..|+ ++++ .++.||+.-... .....++++++....-..+ ..+..|+ +++
T Consensus 273 ~v~~yd~~~~~W~~l~~mp~~r~~~~~a~--l~~~IYviGG~~~~~~~~~~v~~Yd~~~n~W~~~--~~m~~~R~~~~~~ 348 (557)
T PHA02713 273 CILVYNINTMEYSVISTIPNHIINYASAI--VDNEIIIAGGYNFNNPSLNKVYKINIENKIHVEL--PPMIKNRCRFSLA 348 (557)
T ss_pred CEEEEeCCCCeEEECCCCCccccceEEEE--ECCEEEEEcCCCCCCCccceEEEEECCCCeEeeC--CCCcchhhceeEE
Confidence 3556666655433344455443 4444 578999974311 1135688888766532222 3344444 333
Q ss_pred EcCCCCcEEEEccCC----CCeEEEEecCCCCcEEEEeCCCCCCe-eEEEeCCCCeEEEEcCC-----------------
Q psy6570 79 LEPLSGRMFWTELGI----KPRISGASIDGKNKFNLVDNNIQWPT-GITIDYPSQRLYWADPK----------------- 136 (713)
Q Consensus 79 vD~~~~~ly~td~~~----~~~I~~~~~dG~~~~~l~~~~~~~p~-glavd~~~~~LY~~d~~----------------- 136 (713)
+ .++.||+..... ...+++.++....=..+. .+..|. +.++-..+++||+.-..
T Consensus 349 ~--~~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~~~--~mp~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~ 424 (557)
T PHA02713 349 V--IDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLP--DMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDM 424 (557)
T ss_pred E--ECCEEEEECCcCCCCCCceEEEEECCCCeEEECC--CCCcccccccEEEECCEEEEEeCCCcccccccccccccccc
Confidence 3 578999975332 125777776643222221 222221 11111236899997422
Q ss_pred ------CCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEe
Q psy6570 137 ------ARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 137 ------~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td 177 (713)
...|+++++....-+.+..... .....++++.+++||+.-
T Consensus 425 ~~~~~~~~~ve~YDP~td~W~~v~~m~~-~r~~~~~~~~~~~IYv~G 470 (557)
T PHA02713 425 EEDTHSSNKVIRYDTVNNIWETLPNFWT-GTIRPGVVSHKDDIYVVC 470 (557)
T ss_pred cccccccceEEEECCCCCeEeecCCCCc-ccccCcEEEECCEEEEEe
Confidence 2457778776654444433222 223456788899999984
No 248
>KOG0270|consensus
Probab=85.14 E-value=55 Score=34.64 Aligned_cols=177 Identities=8% Similarity=0.011 Sum_probs=80.8
Q ss_pred CCceeEEEccCcc-c-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCc-eEEEEEcCCCCCcceEEEcCC
Q psy6570 6 SGNVTRVKREMNL-K-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR-KKRTLLNTGLNEPYDIALEPL 82 (713)
Q Consensus 6 ~~~I~~~~~~~~~-~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~-~~~~l~~~~~~~p~~iavD~~ 82 (713)
..+|...+++... . ++...-.....|++.+....+.++-+ ..+++...+..-. ....-+ .-.....-+++|+.
T Consensus 265 D~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs---~D~~V~l~D~R~~~~s~~~w-k~~g~VEkv~w~~~ 340 (463)
T KOG0270|consen 265 DKTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGS---YDGTVALKDCRDPSNSGKEW-KFDGEVEKVAWDPH 340 (463)
T ss_pred CceEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEecc---ccceEEeeeccCccccCceE-EeccceEEEEecCC
Confidence 3445555555432 2 22334556677777777666666555 4555555544310 000000 00123334444444
Q ss_pred CCcEEEEccCCCCeEEEEecC--CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCC--
Q psy6570 83 SGRMFWTELGIKPRISGASID--GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTED-- 158 (713)
Q Consensus 83 ~~~ly~td~~~~~~I~~~~~d--G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~-- 158 (713)
.-..|+...... .++-+++. |+...++.. .-....||.+......+..+......|..-++++...+.+.....
T Consensus 341 se~~f~~~tddG-~v~~~D~R~~~~~vwt~~A-Hd~~ISgl~~n~~~p~~l~t~s~d~~Vklw~~~~~~~~~v~~~~~~~ 418 (463)
T KOG0270|consen 341 SENSFFVSTDDG-TVYYFDIRNPGKPVWTLKA-HDDEISGLSVNIQTPGLLSTASTDKVVKLWKFDVDSPKSVKEHSFKL 418 (463)
T ss_pred CceeEEEecCCc-eEEeeecCCCCCceeEEEe-ccCCcceEEecCCCCcceeeccccceEEEEeecCCCCcccccccccc
Confidence 333333332222 33333332 222222221 222457788887777777776666655555565555444433222
Q ss_pred CCccceeeeeeCCeEEEEeCCCCcEEEEcc
Q psy6570 159 NGYKPYKLEVFEDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 159 ~~~~p~~i~~~~~~ly~td~~~~~i~~~~~ 188 (713)
...+-++...+...+|.....++.+..++.
T Consensus 419 ~rl~c~~~~~~~a~~la~GG~k~~~~vwd~ 448 (463)
T KOG0270|consen 419 GRLHCFALDPDVAFTLAFGGEKAVLRVWDI 448 (463)
T ss_pred cceeecccCCCcceEEEecCccceEEEeec
Confidence 122334444445555555444443444443
No 249
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=84.97 E-value=49 Score=33.85 Aligned_cols=159 Identities=11% Similarity=-0.026 Sum_probs=88.1
Q ss_pred EeccCCeEEEeecCC-CCCCeEEEEecCCceEEE-EEcCCCCCcceEEEcCCCCcEEEEccCCC----------------
Q psy6570 33 VDWVGKNLYWTDAGG-RSSNNIMVSTLEGRKKRT-LLNTGLNEPYDIALEPLSGRMFWTELGIK---------------- 94 (713)
Q Consensus 33 ~D~~~~~ly~td~~~-~~~~~I~~~~~~G~~~~~-l~~~~~~~p~~iavD~~~~~ly~td~~~~---------------- 94 (713)
|.+.++.||.|+..- ...+.|-+.+.....+++ -+...-..|..|.+.|....|.+++-+-.
T Consensus 58 fs~dG~~LytTEnd~~~g~G~IgVyd~~~~~~ri~E~~s~GIGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl~tM~ 137 (305)
T PF07433_consen 58 FSPDGRLLYTTENDYETGRGVIGVYDAARGYRRIGEFPSHGIGPHELLLMPDGETLVVANGGIETHPDSGRAKLNLDTMQ 137 (305)
T ss_pred EcCCCCEEEEeccccCCCcEEEEEEECcCCcEEEeEecCCCcChhhEEEcCCCCEEEEEcCCCccCcccCceecChhhcC
Confidence 667788899887531 246789999998444432 33344467999999987778999864411
Q ss_pred CeEEEE-ecCCCCcEEEEe---CCCCCCeeEEEeCCCCeEEEEcCCCC-------cEEEEeCCCCceeEEEecCC----C
Q psy6570 95 PRISGA-SIDGKNKFNLVD---NNIQWPTGITIDYPSQRLYWADPKAR-------TIESINLNGKDRFVVYHTED----N 159 (713)
Q Consensus 95 ~~I~~~-~~dG~~~~~l~~---~~~~~p~glavd~~~~~LY~~d~~~~-------~I~~~~~~g~~~~~~~~~~~----~ 159 (713)
+.|..+ ..+|........ .......-|+++. .+.+.++....+ -|...+.+ ...+.+..... .
T Consensus 138 psL~~ld~~sG~ll~q~~Lp~~~~~lSiRHLa~~~-~G~V~~a~Q~qg~~~~~~PLva~~~~g-~~~~~~~~p~~~~~~l 215 (305)
T PF07433_consen 138 PSLVYLDARSGALLEQVELPPDLHQLSIRHLAVDG-DGTVAFAMQYQGDPGDAPPLVALHRRG-GALRLLPAPEEQWRRL 215 (305)
T ss_pred CceEEEecCCCceeeeeecCccccccceeeEEecC-CCcEEEEEecCCCCCccCCeEEEEcCC-CcceeccCChHHHHhh
Confidence 223333 333443333211 1222456788884 466766643222 12222222 22322221111 1
Q ss_pred Cccceeeeee--CCeEEEEeCCCCcEEEEcccCCCc
Q psy6570 160 GYKPYKLEVF--EDNLYFSTYRTNNILKINKFGNSD 193 (713)
Q Consensus 160 ~~~p~~i~~~--~~~ly~td~~~~~i~~~~~~~~~~ 193 (713)
..+.-+|++. ++.+.+|....+.+..++...+.-
T Consensus 216 ~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~tg~~ 251 (305)
T PF07433_consen 216 NGYIGSIAADRDGRLIAVTSPRGGRVAVWDAATGRL 251 (305)
T ss_pred CCceEEEEEeCCCCEEEEECCCCCEEEEEECCCCCE
Confidence 1233456664 446888888888888887655543
No 250
>KOG0293|consensus
Probab=84.71 E-value=56 Score=34.33 Aligned_cols=112 Identities=15% Similarity=0.043 Sum_probs=65.0
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNK 107 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~ 107 (713)
.-|++.|..+.|. +-. ....+..-+.+....+.+...+ ...+...+.-| ++.=|++.+-.. .|..-++||...
T Consensus 273 ~yi~wSPDdryLl-aCg---~~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~p-Dg~~~V~Gs~dr-~i~~wdlDgn~~ 346 (519)
T KOG0293|consen 273 SYIMWSPDDRYLL-ACG---FDEVLSLWDVDTGDLRHLYPSGLGFSVSSCAWCP-DGFRFVTGSPDR-TIIMWDLDGNIL 346 (519)
T ss_pred EEEEECCCCCeEE-ecC---chHheeeccCCcchhhhhcccCcCCCcceeEEcc-CCceeEecCCCC-cEEEecCCcchh
Confidence 4466666555443 333 2334555565555544444333 23455666665 455567765444 788899998764
Q ss_pred EEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 108 FNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 108 ~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
..--.........||+.+++++||.... ..+|..++...
T Consensus 347 ~~W~gvr~~~v~dlait~Dgk~vl~v~~-d~~i~l~~~e~ 385 (519)
T KOG0293|consen 347 GNWEGVRDPKVHDLAITYDGKYVLLVTV-DKKIRLYNREA 385 (519)
T ss_pred hcccccccceeEEEEEcCCCcEEEEEec-ccceeeechhh
Confidence 4332333445678999888888887763 34566555544
No 251
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=84.49 E-value=0.7 Score=30.31 Aligned_cols=29 Identities=28% Similarity=0.580 Sum_probs=21.1
Q ss_pred CCCCCCCCCCeeeccC-CCceeeeCCCCcc
Q psy6570 223 CDDKPCHQSALCINLP-SSHTCLCPDHLTE 251 (713)
Q Consensus 223 C~~~~C~~~~~C~~~~-g~~~C~C~~G~~~ 251 (713)
|...+|..++.|++.. |+++|.|..||..
T Consensus 2 C~~~~cP~NA~C~~~~dG~eecrCllgyk~ 31 (37)
T PF12946_consen 2 CIDTKCPANAGCFRYDDGSEECRCLLGYKK 31 (37)
T ss_dssp -SSS---TTEEEEEETTSEEEEEE-TTEEE
T ss_pred ccCccCCCCcccEEcCCCCEEEEeeCCccc
Confidence 5667888899999988 8899999999974
No 252
>PHA02790 Kelch-like protein; Provisional
Probab=84.46 E-value=67 Score=35.63 Aligned_cols=161 Identities=11% Similarity=0.029 Sum_probs=81.8
Q ss_pred ceeEEEccCcccEEecCCCCCce-EEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcc-eEEEcCCCCc
Q psy6570 8 NVTRVKREMNLKTVLSNLHDPRG-VAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPY-DIALEPLSGR 85 (713)
Q Consensus 8 ~I~~~~~~~~~~~~~~~~~~p~g-la~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~-~iavD~~~~~ 85 (713)
.+.++++..+.-..+..+..|+. .+.-..+++||+.-... ....+.++++....=.. ..++..|+ +.++-..+++
T Consensus 288 ~v~~Ydp~~~~W~~~~~m~~~r~~~~~v~~~~~iYviGG~~-~~~sve~ydp~~n~W~~--~~~l~~~r~~~~~~~~~g~ 364 (480)
T PHA02790 288 NAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLYVVGGLP-NPTSVERWFHGDAAWVN--MPSLLKPRCNPAVASINNV 364 (480)
T ss_pred eEEEEECCCCEEEECCCCCchhhcceEEEECCEEEEECCcC-CCCceEEEECCCCeEEE--CCCCCCCCcccEEEEECCE
Confidence 34556655544333445555542 11222568999875421 23457777764322111 13444444 2222225799
Q ss_pred EEEEccCC--CCeEEEEecCCCCcEEEEeCCCCCCe-eEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 86 MFWTELGI--KPRISGASIDGKNKFNLVDNNIQWPT-GITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 86 ly~td~~~--~~~I~~~~~dG~~~~~l~~~~~~~p~-glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
||+..... ...+++.++... .-..+. .+..|. +.+.-..+++||+.- +.+.+++++...-+.+..... ...
T Consensus 365 IYviGG~~~~~~~ve~ydp~~~-~W~~~~-~m~~~r~~~~~~~~~~~IYv~G---G~~e~ydp~~~~W~~~~~m~~-~r~ 438 (480)
T PHA02790 365 IYVIGGHSETDTTTEYLLPNHD-QWQFGP-STYYPHYKSCALVFGRRLFLVG---RNAEFYCESSNTWTLIDDPIY-PRD 438 (480)
T ss_pred EEEecCcCCCCccEEEEeCCCC-EEEeCC-CCCCccccceEEEECCEEEEEC---CceEEecCCCCcEeEcCCCCC-Ccc
Confidence 99985321 124666666533 222221 222221 111112368999986 346777776544444433221 234
Q ss_pred ceeeeeeCCeEEEEe
Q psy6570 163 PYKLEVFEDNLYFST 177 (713)
Q Consensus 163 p~~i~~~~~~ly~td 177 (713)
-.++++.+++||+.-
T Consensus 439 ~~~~~v~~~~IYviG 453 (480)
T PHA02790 439 NPELIIVDNKLLLIG 453 (480)
T ss_pred ccEEEEECCEEEEEC
Confidence 467888899999984
No 253
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=84.29 E-value=74 Score=35.39 Aligned_cols=30 Identities=3% Similarity=0.024 Sum_probs=20.1
Q ss_pred eeeeCCeEEEEeCCCCcEEEEcccCCCccee
Q psy6570 166 LEVFEDNLYFSTYRTNNILKINKFGNSDFNV 196 (713)
Q Consensus 166 i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~ 196 (713)
+.+.++.||+.+ ..+.|+.+++.++..+-.
T Consensus 402 ~~~~g~~v~~g~-~dG~l~ald~~tG~~lW~ 431 (488)
T cd00216 402 LATAGNLVFAGA-ADGYFRAFDATTGKELWK 431 (488)
T ss_pred eEecCCeEEEEC-CCCeEEEEECCCCceeeE
Confidence 455677788876 456788888776655443
No 254
>KOG0296|consensus
Probab=84.21 E-value=55 Score=33.82 Aligned_cols=165 Identities=8% Similarity=0.013 Sum_probs=87.0
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
.-.....++++| +.+|..|-. ..++-++.+.........+..--.....+.+.. .+.|..|. +..+.|.+...+
T Consensus 63 H~~svFavsl~P-~~~l~aTGG---gDD~AflW~~~~ge~~~eltgHKDSVt~~~Fsh-dgtlLATG-dmsG~v~v~~~s 136 (399)
T KOG0296|consen 63 HTDSVFAVSLHP-NNNLVATGG---GDDLAFLWDISTGEFAGELTGHKDSVTCCSFSH-DGTLLATG-DMSGKVLVFKVS 136 (399)
T ss_pred cCCceEEEEeCC-CCceEEecC---CCceEEEEEccCCcceeEecCCCCceEEEEEcc-CceEEEec-CCCccEEEEEcc
Confidence 344467788888 667766655 445545544433332222222224556666653 34444443 233366666665
Q ss_pred CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcccee-eeeeCCeEEEEeCCCCc
Q psy6570 104 GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYK-LEVFEDNLYFSTYRTNN 182 (713)
Q Consensus 104 G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~-i~~~~~~ly~td~~~~~ 182 (713)
.+..+-.+.....--.-|...| ...++.+-...+.||+..+.......+..... .+--.| +..+ ++...+-+..+.
T Consensus 137 tg~~~~~~~~e~~dieWl~WHp-~a~illAG~~DGsvWmw~ip~~~~~kv~~Gh~-~~ct~G~f~pd-GKr~~tgy~dgt 213 (399)
T KOG0296|consen 137 TGGEQWKLDQEVEDIEWLKWHP-RAHILLAGSTDGSVWMWQIPSQALCKVMSGHN-SPCTCGEFIPD-GKRILTGYDDGT 213 (399)
T ss_pred cCceEEEeecccCceEEEEecc-cccEEEeecCCCcEEEEECCCcceeeEecCCC-CCcccccccCC-CceEEEEecCce
Confidence 4443333322333334477776 66777788888999999987644444433311 111122 2223 444444456778
Q ss_pred EEEEcccCCCcceee
Q psy6570 183 ILKINKFGNSDFNVL 197 (713)
Q Consensus 183 i~~~~~~~~~~~~~~ 197 (713)
|...++..+.+...+
T Consensus 214 i~~Wn~ktg~p~~~~ 228 (399)
T KOG0296|consen 214 IIVWNPKTGQPLHKI 228 (399)
T ss_pred EEEEecCCCceeEEe
Confidence 888887766554443
No 255
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=84.20 E-value=54 Score=34.86 Aligned_cols=130 Identities=14% Similarity=0.149 Sum_probs=68.1
Q ss_pred cCCceeEEEccCcccE----Ee---cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEec-CCceEEEEEcCC-CCCcc
Q psy6570 5 SSGNVTRVKREMNLKT----VL---SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTL-EGRKKRTLLNTG-LNEPY 75 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~~----~~---~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~-~G~~~~~l~~~~-~~~p~ 75 (713)
.+++|+.+++++..++ +. ..+..|...+ .++||+.+. .++++.++. +|+.+-..-... .....
T Consensus 76 ~~G~i~A~d~~~g~~~W~~~~~~~~~~~~~~~~~~----~G~i~~g~~----~g~~y~ld~~~G~~~W~~~~~~~~~~~~ 147 (370)
T COG1520 76 RDGNIFALNPDTGLVKWSYPLLGAVAQLSGPILGS----DGKIYVGSW----DGKLYALDASTGTLVWSRNVGGSPYYAS 147 (370)
T ss_pred CCCcEEEEeCCCCcEEecccCcCcceeccCceEEe----CCeEEEecc----cceEEEEECCCCcEEEEEecCCCeEEec
Confidence 4567778887764322 11 2233333332 688999987 448888888 676543322222 01122
Q ss_pred eEEEcCCCCcEEEEccCCCCeEEEEecC-CCCcEEEEe---CCCCCCeeEEEeCCCCeEEEEcCC-CCcEEEEeC-CCC
Q psy6570 76 DIALEPLSGRMFWTELGIKPRISGASID-GKNKFNLVD---NNIQWPTGITIDYPSQRLYWADPK-ARTIESINL-NGK 148 (713)
Q Consensus 76 ~iavD~~~~~ly~td~~~~~~I~~~~~d-G~~~~~l~~---~~~~~p~glavd~~~~~LY~~d~~-~~~I~~~~~-~g~ 148 (713)
..++ .++.+|+.. ... +++.++.+ |+.+-..-. ..+......++ ..+.+|+.... ...++.+++ +|.
T Consensus 148 ~~v~--~~~~v~~~s-~~g-~~~al~~~tG~~~W~~~~~~~~~~~~~~~~~~--~~~~vy~~~~~~~~~~~a~~~~~G~ 220 (370)
T COG1520 148 PPVV--GDGTVYVGT-DDG-HLYALNADTGTLKWTYETPAPLSLSIYGSPAI--ASGTVYVGSDGYDGILYALNAEDGT 220 (370)
T ss_pred CcEE--cCcEEEEec-CCC-eEEEEEccCCcEEEEEecCCccccccccCcee--ecceEEEecCCCcceEEEEEccCCc
Confidence 2233 356777763 112 67777776 555433221 12222222232 36778887653 446777777 554
No 256
>KOG0293|consensus
Probab=83.82 E-value=30 Score=36.26 Aligned_cols=165 Identities=13% Similarity=0.113 Sum_probs=91.4
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~ 105 (713)
..+...++-+.+-+ +++-+ ....|...++||....---........+||+-+.+..|+..... .+|..++....
T Consensus 313 ~S~~sc~W~pDg~~-~V~Gs---~dr~i~~wdlDgn~~~~W~gvr~~~v~dlait~Dgk~vl~v~~d--~~i~l~~~e~~ 386 (519)
T KOG0293|consen 313 FSVSSCAWCPDGFR-FVTGS---PDRTIIMWDLDGNILGNWEGVRDPKVHDLAITYDGKYVLLVTVD--KKIRLYNREAR 386 (519)
T ss_pred CCcceeEEccCCce-eEecC---CCCcEEEecCCcchhhcccccccceeEEEEEcCCCcEEEEEecc--cceeeechhhh
Confidence 44566666665444 45655 56789999999976432111123456789998877777777643 25777766655
Q ss_pred CcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeee-ee---CCeEEEEeCCCC
Q psy6570 106 NKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLE-VF---EDNLYFSTYRTN 181 (713)
Q Consensus 106 ~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~-~~---~~~ly~td~~~~ 181 (713)
..+.++.. -...+.+.|+ .++++.......+.|..-++.- ...+......-..-+-|- -| ++.+..+-...+
T Consensus 387 ~dr~lise-~~~its~~iS-~d~k~~LvnL~~qei~LWDl~e--~~lv~kY~Ghkq~~fiIrSCFgg~~~~fiaSGSED~ 462 (519)
T KOG0293|consen 387 VDRGLISE-EQPITSFSIS-KDGKLALVNLQDQEIHLWDLEE--NKLVRKYFGHKQGHFIIRSCFGGGNDKFIASGSEDS 462 (519)
T ss_pred hhhccccc-cCceeEEEEc-CCCcEEEEEcccCeeEEeecch--hhHHHHhhcccccceEEEeccCCCCcceEEecCCCc
Confidence 55444432 2334678888 4567777776677777777651 111111100000001111 11 223444445566
Q ss_pred cEEEEcccCCCcceeeecc
Q psy6570 182 NILKINKFGNSDFNVLANN 200 (713)
Q Consensus 182 ~i~~~~~~~~~~~~~~~~~ 200 (713)
.|+..++..+..+.++...
T Consensus 463 kvyIWhr~sgkll~~LsGH 481 (519)
T KOG0293|consen 463 KVYIWHRISGKLLAVLSGH 481 (519)
T ss_pred eEEEEEccCCceeEeecCC
Confidence 6777776666666666544
No 257
>KOG2919|consensus
Probab=83.79 E-value=35 Score=34.60 Aligned_cols=119 Identities=14% Similarity=0.086 Sum_probs=73.8
Q ss_pred cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEec--CCceEE---EEEcCC---CCCcceEEEcCCCCcEEEEccCCC
Q psy6570 23 SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTL--EGRKKR---TLLNTG---LNEPYDIALEPLSGRMFWTELGIK 94 (713)
Q Consensus 23 ~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~--~G~~~~---~l~~~~---~~~p~~iavD~~~~~ly~td~~~~ 94 (713)
..+.....|+|.+.+..||-... ..|.+++. -|..-. ++.... ..-..-+|+.|.+-.++-.....+
T Consensus 156 de~taAhsL~Fs~DGeqlfaGyk-----rcirvFdt~RpGr~c~vy~t~~~~k~gq~giisc~a~sP~~~~~~a~gsY~q 230 (406)
T KOG2919|consen 156 DEYTAAHSLQFSPDGEQLFAGYK-----RCIRVFDTSRPGRDCPVYTTVTKGKFGQKGIISCFAFSPMDSKTLAVGSYGQ 230 (406)
T ss_pred HhhhhheeEEecCCCCeEeeccc-----ceEEEeeccCCCCCCcchhhhhcccccccceeeeeeccCCCCcceeeecccc
Confidence 34556788999999888985543 55666665 344422 222211 122445788887775554433333
Q ss_pred CeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 95 PRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 95 ~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
++-...-++.....++...-...+-|.+-+++++||........|..-|+.-
T Consensus 231 -~~giy~~~~~~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~ 282 (406)
T KOG2919|consen 231 -RVGIYNDDGRRPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRY 282 (406)
T ss_pred -eeeeEecCCCCceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehh
Confidence 4555555666555555444555677888888999998887778887766544
No 258
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=83.25 E-value=52 Score=32.88 Aligned_cols=172 Identities=13% Similarity=0.104 Sum_probs=86.8
Q ss_pred ceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEc-CCCC-CcceEEEcCCCCc
Q psy6570 8 NVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLN-TGLN-EPYDIALEPLSGR 85 (713)
Q Consensus 8 ~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~-~~~~-~p~~iavD~~~~~ 85 (713)
.|+....++....+.. ........+|+. +.++..+.+. ...++.+...+|+...+.+. ..+. ....|.|.|...+
T Consensus 49 ~L~~~~~~~~~~~~~~-g~~l~~PS~d~~-g~~W~v~~~~-~~~~~~~~~~~g~~~~~~v~~~~~~~~I~~l~vSpDG~R 125 (253)
T PF10647_consen 49 SLYVGPAGGPVRPVLT-GGSLTRPSWDPD-GWVWTVDDGS-GGVRVVRDSASGTGEPVEVDWPGLRGRITALRVSPDGTR 125 (253)
T ss_pred EEEEEcCCCcceeecc-CCccccccccCC-CCEEEEEcCC-CceEEEEecCCCcceeEEecccccCCceEEEEECCCCcE
Confidence 3444444443333322 224444578876 6776666521 12223332345555443333 2233 6788999987766
Q ss_pred EE-EEccCCCCeEEEEe--cCCCC-cEEEE------eCCCCCCeeEEEeCCCCeEEEEcCCCCcEEE-EeCCCCceeEEE
Q psy6570 86 MF-WTELGIKPRISGAS--IDGKN-KFNLV------DNNIQWPTGITIDYPSQRLYWADPKARTIES-INLNGKDRFVVY 154 (713)
Q Consensus 86 ly-~td~~~~~~I~~~~--~dG~~-~~~l~------~~~~~~p~glavd~~~~~LY~~d~~~~~I~~-~~~~g~~~~~~~ 154 (713)
|- +...+...+|++.. .++.. ...+. ...+.....+++-.....++.+......++. +.++|...+.+.
T Consensus 126 vA~v~~~~~~~~v~va~V~r~~~g~~~~l~~~~~~~~~~~~~v~~v~W~~~~~L~V~~~~~~~~~~~~v~~dG~~~~~l~ 205 (253)
T PF10647_consen 126 VAVVVEDGGGGRVYVAGVVRDGDGVPRRLTGPRRVAPPLLSDVTDVAWSDDSTLVVLGRSAGGPVVRLVSVDGGPSTPLP 205 (253)
T ss_pred EEEEEecCCCCeEEEEEEEeCCCCCcceeccceEecccccCcceeeeecCCCEEEEEeCCCCCceeEEEEccCCcccccC
Confidence 54 44333334677653 23333 22222 1223456788888655555555555555666 888988776652
Q ss_pred ecCCCCccceeeeeeCCeEEEEeCCCCcEEE
Q psy6570 155 HTEDNGYKPYKLEVFEDNLYFSTYRTNNILK 185 (713)
Q Consensus 155 ~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~ 185 (713)
.... ...-.++.-....+|.++. +.+++
T Consensus 206 ~~~~-~~~v~a~~~~~~~~~~t~~--~~~~~ 233 (253)
T PF10647_consen 206 SVNL-GVPVVAVAASPSTVYVTDD--GGVLQ 233 (253)
T ss_pred CCCC-CcceEEeeCCCcEEEEECC--CcEEE
Confidence 2211 1122333334556777753 44554
No 259
>KOG0646|consensus
Probab=82.55 E-value=65 Score=34.34 Aligned_cols=75 Identities=11% Similarity=0.081 Sum_probs=41.1
Q ss_pred CCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc-----------------eeEEEecCC-CCccceeeeeeCCeEEEEeC
Q psy6570 117 WPTGITIDYPSQRLYWADPKARTIESINLNGKD-----------------RFVVYHTED-NGYKPYKLEVFEDNLYFSTY 178 (713)
Q Consensus 117 ~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~-----------------~~~~~~~~~-~~~~p~~i~~~~~~ly~td~ 178 (713)
.++.+++||.+.++|+.. ..+.|+..++.+.. +..+..... ....-++|.. ++.|.++-.
T Consensus 219 si~av~lDpae~~~yiGt-~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh~~~~~ITcLais~-DgtlLlSGd 296 (476)
T KOG0646|consen 219 SIKAVALDPAERVVYIGT-EEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGHENESAITCLAIST-DGTLLLSGD 296 (476)
T ss_pred cceeEEEcccccEEEecC-CcceEEeeehhcCCcccccccccccccccceeeeeccccCCcceeEEEEec-CccEEEeeC
Confidence 568999999888888754 45667766654422 111111100 0111223333 445666666
Q ss_pred CCCcEEEEcccCCCc
Q psy6570 179 RTNNILKINKFGNSD 193 (713)
Q Consensus 179 ~~~~i~~~~~~~~~~ 193 (713)
..+.|...+.....-
T Consensus 297 ~dg~VcvWdi~S~Q~ 311 (476)
T KOG0646|consen 297 EDGKVCVWDIYSKQC 311 (476)
T ss_pred CCCCEEEEecchHHH
Confidence 677777777654433
No 260
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=82.47 E-value=0.32 Score=49.30 Aligned_cols=21 Identities=38% Similarity=0.309 Sum_probs=11.3
Q ss_pred HHHHHHHHHHhheeeEEEEec
Q psy6570 691 ILILLLITVGGIGYYIFRIKM 711 (713)
Q Consensus 691 ~~~~~~~~~~~~~~~~~~~~~ 711 (713)
++++|||+|++.|+||+|||.
T Consensus 265 IliIVLIMvIIYLILRYRRKK 285 (299)
T PF02009_consen 265 ILIIVLIMVIIYLILRYRRKK 285 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 333333555556666666644
No 261
>KOG0310|consensus
Probab=82.18 E-value=77 Score=34.02 Aligned_cols=113 Identities=10% Similarity=0.012 Sum_probs=69.3
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKF 108 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~ 108 (713)
+-+-|.+..+.++.+-. ....+...++++..+..-+...-...+.+++.|.++.|++|....+ .|..-+..... .
T Consensus 114 ~~~~f~~~d~t~l~s~s---Dd~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg-~vrl~DtR~~~-~ 188 (487)
T KOG0310|consen 114 HVTKFSPQDNTMLVSGS---DDKVVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVTGSYDG-KVRLWDTRSLT-S 188 (487)
T ss_pred eEEEecccCCeEEEecC---CCceEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEecCCCc-eEEEEEeccCC-c
Confidence 55667778888888776 6677788888888764333333567888999999999999987655 66655554432 2
Q ss_pred EEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 109 NLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 109 ~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
.++.-+-..|..-.+-...+.++.+-. .+.|...|+.+
T Consensus 189 ~v~elnhg~pVe~vl~lpsgs~iasAg-Gn~vkVWDl~~ 226 (487)
T KOG0310|consen 189 RVVELNHGCPVESVLALPSGSLIASAG-GNSVKVWDLTT 226 (487)
T ss_pred eeEEecCCCceeeEEEcCCCCEEEEcC-CCeEEEEEecC
Confidence 222223334443333333445554432 34455556553
No 262
>KOG0286|consensus
Probab=82.01 E-value=59 Score=32.61 Aligned_cols=124 Identities=14% Similarity=0.102 Sum_probs=72.1
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
......+|++-+.+.+.|++-. -......-++........+...-...+.+.+-| +|.-|.|.+... ....+++.
T Consensus 185 H~gDV~slsl~p~~~ntFvSg~---cD~~aklWD~R~~~c~qtF~ghesDINsv~ffP-~G~afatGSDD~-tcRlyDlR 259 (343)
T KOG0286|consen 185 HTGDVMSLSLSPSDGNTFVSGG---CDKSAKLWDVRSGQCVQTFEGHESDINSVRFFP-SGDAFATGSDDA-TCRLYDLR 259 (343)
T ss_pred CcccEEEEecCCCCCCeEEecc---cccceeeeeccCcceeEeecccccccceEEEcc-CCCeeeecCCCc-eeEEEeec
Confidence 3455677888777889998876 344444555443333333333346688888886 678888876544 56666665
Q ss_pred CCCcEEEEeC--CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEe-CCCCceeEE
Q psy6570 104 GKNKFNLVDN--NIQWPTGITIDYPSQRLYWADPKARTIESIN-LNGKDRFVV 153 (713)
Q Consensus 104 G~~~~~l~~~--~~~~p~glavd~~~~~LY~~d~~~~~I~~~~-~~g~~~~~~ 153 (713)
......++.. .+...+.++|. ..+||.++-.....+.+-| +.+....+|
T Consensus 260 aD~~~a~ys~~~~~~gitSv~FS-~SGRlLfagy~d~~c~vWDtlk~e~vg~L 311 (343)
T KOG0286|consen 260 ADQELAVYSHDSIICGITSVAFS-KSGRLLFAGYDDFTCNVWDTLKGERVGVL 311 (343)
T ss_pred CCcEEeeeccCcccCCceeEEEc-ccccEEEeeecCCceeEeeccccceEEEe
Confidence 4433334332 23345789998 5667766755444444444 334333333
No 263
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=81.66 E-value=63 Score=32.65 Aligned_cols=112 Identities=15% Similarity=0.195 Sum_probs=63.8
Q ss_pred EEEeccCCeEEEeecCC-------C-----------CCCeEEEEecCCceEEEEEcCCC-------CCcceEEEcCCCCc
Q psy6570 31 VAVDWVGKNLYWTDAGG-------R-----------SSNNIMVSTLEGRKKRTLLNTGL-------NEPYDIALEPLSGR 85 (713)
Q Consensus 31 la~D~~~~~ly~td~~~-------~-----------~~~~I~~~~~~G~~~~~l~~~~~-------~~p~~iavD~~~~~ 85 (713)
=|++|..+.||+--|-- + .-..|..+|.+...++.|..+.+ ....+|..||.+..
T Consensus 40 NAV~~vDd~IyFGGWVHAPa~y~gk~~g~~~IdF~NKYSHVH~yd~e~~~VrLLWkesih~~~~WaGEVSdIlYdP~~D~ 119 (339)
T PF09910_consen 40 NAVEWVDDFIYFGGWVHAPAVYEGKGDGRATIDFRNKYSHVHEYDTENDSVRLLWKESIHDKTKWAGEVSDILYDPYEDR 119 (339)
T ss_pred eeeeeecceEEEeeeecCCceeeeccCCceEEEEeeccceEEEEEcCCCeEEEEEecccCCccccccchhheeeCCCcCE
Confidence 46777777888655421 0 01235556655556667766543 34568999999999
Q ss_pred EEEEccC--CCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc----CCCCcEEEEeCCC
Q psy6570 86 MFWTELG--IKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD----PKARTIESINLNG 147 (713)
Q Consensus 86 ly~td~~--~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d----~~~~~I~~~~~~g 147 (713)
||++-.. .+--|++++..+...+.|.. ...+.|..+. +..+++- .+...|+.+|+.-
T Consensus 120 LLlAR~DGh~nLGvy~ldr~~g~~~~L~~--~ps~KG~~~~---D~a~F~i~~~~~g~~~i~~~Dli~ 182 (339)
T PF09910_consen 120 LLLARADGHANLGVYSLDRRTGKAEKLSS--NPSLKGTLVH---DYACFGINNFHKGVSGIHCLDLIS 182 (339)
T ss_pred EEEEecCCcceeeeEEEcccCCceeeccC--CCCcCceEee---eeEEEeccccccCCceEEEEEccC
Confidence 9998543 33456777665555555543 2334555444 2333332 2334566666543
No 264
>KOG2048|consensus
Probab=81.50 E-value=61 Score=36.23 Aligned_cols=157 Identities=10% Similarity=0.070 Sum_probs=88.5
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC---CCCcce--EEEcCCCCcEEEEccCCCCeEE
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG---LNEPYD--IALEPLSGRMFWTELGIKPRIS 98 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~---~~~p~~--iavD~~~~~ly~td~~~~~~I~ 98 (713)
+.....--|+-|.++.|-+.-- ..-+|++...++..+...+..- +..... +.+| +..||+... ....++
T Consensus 381 ~~~nIs~~aiSPdg~~Ia~st~---~~~~iy~L~~~~~vk~~~v~~~~~~~~~a~~i~ftid--~~k~~~~s~-~~~~le 454 (691)
T KOG2048|consen 381 EKENISCAAISPDGNLIAISTV---SRTKIYRLQPDPNVKVINVDDVPLALLDASAISFTID--KNKLFLVSK-NIFSLE 454 (691)
T ss_pred CccceeeeccCCCCCEEEEeec---cceEEEEeccCcceeEEEeccchhhhccceeeEEEec--CceEEEEec-ccceeE
Confidence 3333444456666666666655 5678999998884333333211 111222 3454 344444332 233678
Q ss_pred EEecCCCCcEEEEe---C-CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeE
Q psy6570 99 GASIDGKNKFNLVD---N-NIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNL 173 (713)
Q Consensus 99 ~~~~dG~~~~~l~~---~-~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~l 173 (713)
.+.+++...+.+.. + .-....-|++.+.+++|-+++ ..+.|+.+++.+...+.+...........++..+ .++|
T Consensus 455 ~~el~~ps~kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~-t~g~I~v~nl~~~~~~~l~~rln~~vTa~~~~~~~~~~l 533 (691)
T KOG2048|consen 455 EFELETPSFKELKSIQSQAKCPSISRLVVSSDGNYIAAIS-TRGQIFVYNLETLESHLLKVRLNIDVTAAAFSPFVRNRL 533 (691)
T ss_pred EEEecCcchhhhhccccccCCCcceeEEEcCCCCEEEEEe-ccceEEEEEcccceeecchhccCcceeeeeccccccCcE
Confidence 88887766655542 1 233456789999899988888 6678999999887666655321111122233322 3455
Q ss_pred EEEeCCCCcEEEEcc
Q psy6570 174 YFSTYRTNNILKINK 188 (713)
Q Consensus 174 y~td~~~~~i~~~~~ 188 (713)
-+++ .++.|+.++.
T Consensus 534 vvat-s~nQv~efdi 547 (691)
T KOG2048|consen 534 VVAT-SNNQVFEFDI 547 (691)
T ss_pred EEEe-cCCeEEEEec
Confidence 5554 4456776665
No 265
>KOG0272|consensus
Probab=80.99 E-value=50 Score=34.75 Aligned_cols=111 Identities=11% Similarity=0.055 Sum_probs=63.2
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCC-CeEEEEecCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIK-PRISGASIDG 104 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~-~~I~~~~~dG 104 (713)
....+||+.+. +.|..+-. ....++| -|+.......++..-+....+|+++| +|+..-|..+.+ -+||...+
T Consensus 304 ~~v~~iaf~~D-GSL~~tGG-lD~~~Rv--WDlRtgr~im~L~gH~k~I~~V~fsP-NGy~lATgs~Dnt~kVWDLR~-- 376 (459)
T KOG0272|consen 304 KGVFSIAFQPD-GSLAATGG-LDSLGRV--WDLRTGRCIMFLAGHIKEILSVAFSP-NGYHLATGSSDNTCKVWDLRM-- 376 (459)
T ss_pred cccceeEecCC-CceeeccC-ccchhhe--eecccCcEEEEecccccceeeEeECC-CceEEeecCCCCcEEEeeecc--
Confidence 34677888765 55554433 1122333 34433323333334467788999997 788888876644 24555444
Q ss_pred CCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEE
Q psy6570 105 KNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESI 143 (713)
Q Consensus 105 ~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~ 143 (713)
.....++....+....+-+++..++..++-...+.+..-
T Consensus 377 r~~ly~ipAH~nlVS~Vk~~p~~g~fL~TasyD~t~kiW 415 (459)
T KOG0272|consen 377 RSELYTIPAHSNLVSQVKYSPQEGYFLVTASYDNTVKIW 415 (459)
T ss_pred cccceecccccchhhheEecccCCeEEEEcccCcceeee
Confidence 333333333445567888888777777776655554443
No 266
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=80.67 E-value=1.2 Score=43.67 Aligned_cols=34 Identities=24% Similarity=0.491 Sum_probs=26.6
Q ss_pred CccCCCCCCCCCCCCeeeccCCCceeeeCCCCcc
Q psy6570 218 NVTNHCDDKPCHQSALCINLPSSHTCLCPDHLTE 251 (713)
Q Consensus 218 ~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~~ 251 (713)
...++|...+....+.|++.+|+|.|.|++||+.
T Consensus 185 ~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~ 218 (224)
T cd01475 185 VVPDLCATLSHVCQQVCISTPGSYLCACTEGYAL 218 (224)
T ss_pred cCchhhcCCCCCccceEEcCCCCEEeECCCCccC
Confidence 3567887543223689999999999999999984
No 267
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=80.60 E-value=5.7 Score=42.21 Aligned_cols=62 Identities=15% Similarity=0.118 Sum_probs=35.2
Q ss_pred CcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC-----C--------------CCCCeeEEEeCCCCeEEEE
Q psy6570 73 EPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN-----N--------------IQWPTGITIDYPSQRLYWA 133 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~-----~--------------~~~p~glavd~~~~~LY~~ 133 (713)
.|.+|.|...+++||++.|... .|+..++.....-.++.. . ...|.-|.++.+++||||+
T Consensus 313 LitDI~iSlDDrfLYvs~W~~G-dvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlYvT 391 (461)
T PF05694_consen 313 LITDILISLDDRFLYVSNWLHG-DVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLYVT 391 (461)
T ss_dssp ----EEE-TTS-EEEEEETTTT-EEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEEEE
T ss_pred ceEeEEEccCCCEEEEEcccCC-cEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEEEE
Confidence 4678888888999999999866 888888876544444321 0 1247778888889999999
Q ss_pred cC
Q psy6570 134 DP 135 (713)
Q Consensus 134 d~ 135 (713)
.+
T Consensus 392 nS 393 (461)
T PF05694_consen 392 NS 393 (461)
T ss_dssp --
T ss_pred ee
Confidence 63
No 268
>KOG2111|consensus
Probab=80.56 E-value=70 Score=32.50 Aligned_cols=154 Identities=12% Similarity=0.089 Sum_probs=83.8
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceE-EEcCCCCcEEEEccCC-CCeEEEEecC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDI-ALEPLSGRMFWTELGI-KPRISGASID 103 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~i-avD~~~~~ly~td~~~-~~~I~~~~~d 103 (713)
....+|-++ .++|.++-. ++|+++......+....-+....|+|+ +++|....-+.+--+. .+.|...++.
T Consensus 95 ~~I~~V~l~--r~riVvvl~-----~~I~VytF~~n~k~l~~~et~~NPkGlC~~~~~~~k~~LafPg~k~GqvQi~dL~ 167 (346)
T KOG2111|consen 95 SEIKAVKLR--RDRIVVVLE-----NKIYVYTFPDNPKLLHVIETRSNPKGLCSLCPTSNKSLLAFPGFKTGQVQIVDLA 167 (346)
T ss_pred cceeeEEEc--CCeEEEEec-----CeEEEEEcCCChhheeeeecccCCCceEeecCCCCceEEEcCCCccceEEEEEhh
Confidence 334555554 455555543 578887766444333222345678884 6677666655554342 2588888887
Q ss_pred CCCc--EEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCc-EEEEeC-CCCceeEEEecCCCCccceeeeee--CCeEEEEe
Q psy6570 104 GKNK--FNLVDNNIQWPTGITIDYPSQRLYWADPKART-IESINL-NGKDRFVVYHTEDNGYKPYKLEVF--EDNLYFST 177 (713)
Q Consensus 104 G~~~--~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~-I~~~~~-~g~~~~~~~~~~~~~~~p~~i~~~--~~~ly~td 177 (713)
-... ..++........=++|... +.+.-+-+..+. |+.++. +|.....+..... ...-+-|++. ..+|-++
T Consensus 168 ~~~~~~p~~I~AH~s~Iacv~Ln~~-Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d-~A~iy~iaFSp~~s~Lavs- 244 (346)
T KOG2111|consen 168 STKPNAPSIINAHDSDIACVALNLQ-GTLVATASTKGTLIRIFDTEDGTLLQELRRGVD-RADIYCIAFSPNSSWLAVS- 244 (346)
T ss_pred hcCcCCceEEEcccCceeEEEEcCC-ccEEEEeccCcEEEEEEEcCCCcEeeeeecCCc-hheEEEEEeCCCccEEEEE-
Confidence 6655 2444444445566778744 455445555555 555564 6666666654432 2233445554 3344443
Q ss_pred CCCCcEEEEccc
Q psy6570 178 YRTNNILKINKF 189 (713)
Q Consensus 178 ~~~~~i~~~~~~ 189 (713)
..+++|..+...
T Consensus 245 SdKgTlHiF~l~ 256 (346)
T KOG2111|consen 245 SDKGTLHIFSLR 256 (346)
T ss_pred cCCCeEEEEEee
Confidence 345555555443
No 269
>KOG4378|consensus
Probab=80.49 E-value=35 Score=36.59 Aligned_cols=114 Identities=9% Similarity=0.039 Sum_probs=74.7
Q ss_pred eEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCc-ceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE
Q psy6570 30 GVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEP-YDIALEPLSGRMFWTELGIKPRISGASIDGKNKF 108 (713)
Q Consensus 30 gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p-~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~ 108 (713)
-|.+.+..+.|..+-+ ..+.|...|..|.....-+......| .||.+.|.+..|+++-.... +|+.++...+...
T Consensus 169 ll~ys~skr~lL~~as---d~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dk-ki~~yD~~s~~s~ 244 (673)
T KOG4378|consen 169 LLRYSPSKRFLLSIAS---DKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGYDK-KINIYDIRSQAST 244 (673)
T ss_pred EeecccccceeeEeec---cCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEecccc-eEEEeeccccccc
Confidence 4555666667777666 67788888888765543333333444 48999999999999875555 8888876643322
Q ss_pred -EEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCce
Q psy6570 109 -NLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDR 150 (713)
Q Consensus 109 -~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~ 150 (713)
.|+.. ..-..+||. +.+.+..+-...++|+.+|+.+...
T Consensus 245 ~~l~y~--~Plstvaf~-~~G~~L~aG~s~G~~i~YD~R~~k~ 284 (673)
T KOG4378|consen 245 DRLTYS--HPLSTVAFS-ECGTYLCAGNSKGELIAYDMRSTKA 284 (673)
T ss_pred ceeeec--CCcceeeec-CCceEEEeecCCceEEEEecccCCC
Confidence 22211 112567887 4667777777888999999877543
No 270
>KOG2106|consensus
Probab=79.97 E-value=94 Score=33.64 Aligned_cols=92 Identities=15% Similarity=0.163 Sum_probs=43.1
Q ss_pred CceeEEEccCccc-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce--EEEEEcCC---CCCcce-EEE
Q psy6570 7 GNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK--KRTLLNTG---LNEPYD-IAL 79 (713)
Q Consensus 7 ~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~--~~~l~~~~---~~~p~~-iav 79 (713)
+.|.+-.+..... ++.-...+-.|||..+. ..+|+|-. ..+.+..-+ +-+. .+++..+. --.|.+ ||+
T Consensus 349 N~iL~Gt~~~~f~~~v~gh~delwgla~hps-~~q~~T~g---qdk~v~lW~-~~k~~wt~~~~d~~~~~~fhpsg~va~ 423 (626)
T KOG2106|consen 349 NFILQGTLENGFTLTVQGHGDELWGLATHPS-KNQLLTCG---QDKHVRLWN-DHKLEWTKIIEDPAECADFHPSGVVAV 423 (626)
T ss_pred ceEEEeeecCCceEEEEecccceeeEEcCCC-hhheeecc---CcceEEEcc-CCceeEEEEecCceeEeeccCcceEEE
Confidence 3344444443322 22333446788998875 56667765 444444444 2111 11111110 122333 455
Q ss_pred cCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 80 EPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 80 D~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
-...|++|+.|..+. .+.....|+
T Consensus 424 Gt~~G~w~V~d~e~~-~lv~~~~d~ 447 (626)
T KOG2106|consen 424 GTATGRWFVLDTETQ-DLVTIHTDN 447 (626)
T ss_pred eeccceEEEEecccc-eeEEEEecC
Confidence 555666666665544 444445553
No 271
>KOG0772|consensus
Probab=79.64 E-value=46 Score=35.96 Aligned_cols=160 Identities=12% Similarity=0.042 Sum_probs=90.1
Q ss_pred cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce-----EEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeE
Q psy6570 23 SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK-----KRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRI 97 (713)
Q Consensus 23 ~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~-----~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I 97 (713)
.+-.-...|++|+.+-++| +-+ ....|...++.|-. -+.|........+.+.+.+..+.|.+.. +.. .+
T Consensus 165 hgtk~Vsal~~Dp~GaR~~-sGs---~Dy~v~~wDf~gMdas~~~fr~l~P~E~h~i~sl~ys~Tg~~iLvvs-g~a-qa 238 (641)
T KOG0772|consen 165 HGTKIVSALAVDPSGARFV-SGS---LDYTVKFWDFQGMDASMRSFRQLQPCETHQINSLQYSVTGDQILVVS-GSA-QA 238 (641)
T ss_pred CCceEEEEeeecCCCceee-ecc---ccceEEEEecccccccchhhhccCcccccccceeeecCCCCeEEEEe-cCc-ce
Confidence 3444567899998876665 444 45677777777633 1223333456778899987555555544 433 67
Q ss_pred EEEecCCCCcEEEEeCC------------CCCCeeEEEeCCCCeEEEEcCCCC--cEEEEeCCCCceeEEEecCCCC-c-
Q psy6570 98 SGASIDGKNKFNLVDNN------------IQWPTGITIDYPSQRLYWADPKAR--TIESINLNGKDRFVVYHTEDNG-Y- 161 (713)
Q Consensus 98 ~~~~~dG~~~~~l~~~~------------~~~p~glavd~~~~~LY~~d~~~~--~I~~~~~~g~~~~~~~~~~~~~-~- 161 (713)
..++.||.........+ +..-+--.+.|.+...|.+-...+ |||-++-.-+.++++.....++ .
T Consensus 239 kl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv 318 (641)
T KOG0772|consen 239 KLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRV 318 (641)
T ss_pred eEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEecCCCcEEEEecCCchhheeEEeeccCCCccc
Confidence 78888987665554321 112233457788888888865444 5665554455566665443211 1
Q ss_pred cceeeeee-CCeEEEEeCCCCcEEEEcc
Q psy6570 162 KPYKLEVF-EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 162 ~p~~i~~~-~~~ly~td~~~~~i~~~~~ 188 (713)
.|..-++. ++.++-+.-..+.|...++
T Consensus 319 ~~tsC~~nrdg~~iAagc~DGSIQ~W~~ 346 (641)
T KOG0772|consen 319 PVTSCAWNRDGKLIAAGCLDGSIQIWDK 346 (641)
T ss_pred CceeeecCCCcchhhhcccCCceeeeec
Confidence 22223332 3344444445556655554
No 272
>KOG0270|consensus
Probab=79.63 E-value=77 Score=33.61 Aligned_cols=61 Identities=5% Similarity=0.063 Sum_probs=42.0
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccC
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELG 92 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~ 92 (713)
..|++.+..++|..+-+ ..++|..-+++.......+.........|++.+....+..+...
T Consensus 247 l~Ls~n~~~~nVLaSgs---aD~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~ 307 (463)
T KOG0270|consen 247 LALSWNRNFRNVLASGS---ADKTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSY 307 (463)
T ss_pred HHHHhccccceeEEecC---CCceEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccc
Confidence 35666666677776666 67888888888666555554444677788888877777666543
No 273
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=79.18 E-value=0.63 Score=45.76 Aligned_cols=24 Identities=21% Similarity=0.411 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHhheeeEEEEecC
Q psy6570 689 ILILILLLITVGGIGYYIFRIKMS 712 (713)
Q Consensus 689 ~~~~~~~~~~~~~~~~~~~~~~~~ 712 (713)
++++++|+|+|+++.+|.+|||.+
T Consensus 265 alvllil~vvliiLYiWlyrrRK~ 288 (295)
T TIGR01478 265 ALVLIILTVVLIILYIWLYRRRKK 288 (295)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhcc
Confidence 444444445566666666665543
No 274
>KOG0277|consensus
Probab=79.18 E-value=68 Score=31.56 Aligned_cols=69 Identities=9% Similarity=-0.034 Sum_probs=37.6
Q ss_pred CeeEEEeCCCCeEEEEcCCCC--cEEEEeCCCCceeEEEecCCCCccceeeee--eCCeEEEEeCCCCcEEEEcccC
Q psy6570 118 PTGITIDYPSQRLYWADPKAR--TIESINLNGKDRFVVYHTEDNGYKPYKLEV--FEDNLYFSTYRTNNILKINKFG 190 (713)
Q Consensus 118 p~glavd~~~~~LY~~d~~~~--~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~--~~~~ly~td~~~~~i~~~~~~~ 190 (713)
....++.|....||-+-++.+ +|+-++..|+...+.+.. ...+..+. ++.++.+|....+.|+..+...
T Consensus 150 Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~gk~~~i~ah~----~Eil~cdw~ky~~~vl~Tg~vd~~vr~wDir~ 222 (311)
T KOG0277|consen 150 IYQAAFSPHIPNLFASASGDGTLRLWDVRSPGKFMSIEAHN----SEILCCDWSKYNHNVLATGGVDNLVRGWDIRN 222 (311)
T ss_pred EEEEecCCCCCCeEEEccCCceEEEEEecCCCceeEEEecc----ceeEeecccccCCcEEEecCCCceEEEEehhh
Confidence 345666776777776665544 455555566554422211 11222222 3566777777777777666543
No 275
>KOG0973|consensus
Probab=79.17 E-value=30 Score=40.63 Aligned_cols=146 Identities=9% Similarity=-0.004 Sum_probs=79.8
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
-.....|++++ .+.++++-+ ..++|.+.+...-.+..++..-...+.|+++||.+++|--......-+||+. ..
T Consensus 129 ~~DV~Dv~Wsp-~~~~lvS~s---~DnsViiwn~~tF~~~~vl~~H~s~VKGvs~DP~Gky~ASqsdDrtikvwrt--~d 202 (942)
T KOG0973|consen 129 DSDVLDVNWSP-DDSLLVSVS---LDNSVIIWNAKTFELLKVLRGHQSLVKGVSWDPIGKYFASQSDDRTLKVWRT--SD 202 (942)
T ss_pred CCccceeccCC-CccEEEEec---ccceEEEEccccceeeeeeecccccccceEECCccCeeeeecCCceEEEEEc--cc
Confidence 44466677777 466777766 6788888887655555555555678999999998664433222211144442 22
Q ss_pred CCcEEEEeCCCC------CCeeEEEeCCCCeEEEEcCC---CCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEE
Q psy6570 105 KNKFNLVDNNIQ------WPTGITIDYPSQRLYWADPK---ARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYF 175 (713)
Q Consensus 105 ~~~~~l~~~~~~------~p~glavd~~~~~LY~~d~~---~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~ 175 (713)
-.....+...+. .-.-|.+.|++..|-.+..- ...|..+.-++-.....+.. ...|.-+..|.-+||=
T Consensus 203 w~i~k~It~pf~~~~~~T~f~RlSWSPDG~~las~nA~n~~~~~~~IieR~tWk~~~~LvG---H~~p~evvrFnP~lfe 279 (942)
T KOG0973|consen 203 WGIEKSITKPFEESPLTTFFLRLSWSPDGHHLASPNAVNGGKSTIAIIERGTWKVDKDLVG---HSAPVEVVRFNPKLFE 279 (942)
T ss_pred ceeeEeeccchhhCCCcceeeecccCCCcCeecchhhccCCcceeEEEecCCceeeeeeec---CCCceEEEEeChHHhc
Confidence 222222222211 22346777777777666542 23455555433222222221 3457777777777775
Q ss_pred EeCC
Q psy6570 176 STYR 179 (713)
Q Consensus 176 td~~ 179 (713)
-...
T Consensus 280 ~~~~ 283 (942)
T KOG0973|consen 280 RNNK 283 (942)
T ss_pred cccc
Confidence 5443
No 276
>KOG0271|consensus
Probab=79.09 E-value=82 Score=32.84 Aligned_cols=170 Identities=11% Similarity=0.005 Sum_probs=83.3
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC-C-CC
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID-G-KN 106 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d-G-~~ 106 (713)
..++|.+. +...++-+ +...+..-|++.+........--+....|+..|. |....+....+ .|..-+.. | ..
T Consensus 119 l~~~fsp~-g~~l~tGs---GD~TvR~WD~~TeTp~~t~KgH~~WVlcvawsPD-gk~iASG~~dg-~I~lwdpktg~~~ 192 (480)
T KOG0271|consen 119 LSVQFSPT-GSRLVTGS---GDTTVRLWDLDTETPLFTCKGHKNWVLCVAWSPD-GKKIASGSKDG-SIRLWDPKTGQQI 192 (480)
T ss_pred EEEEecCC-CceEEecC---CCceEEeeccCCCCcceeecCCccEEEEEEECCC-cchhhccccCC-eEEEecCCCCCcc
Confidence 45666663 44445555 4566666777765544333333455667777763 44444443333 44444422 2 22
Q ss_pred cEEEEeCCCCCCeeEEEeCC----CCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcccee-eeeeCCeEEEEeCCCC
Q psy6570 107 KFNLVDNNIQWPTGITIDYP----SQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYK-LEVFEDNLYFSTYRTN 181 (713)
Q Consensus 107 ~~~l~~~~~~~p~glavd~~----~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~-i~~~~~~ly~td~~~~ 181 (713)
-+.| ...-.+.++|++.|. ..+++.+-+..+.|..-++.+......... .-.|.. |..-++.|..+.....
T Consensus 193 g~~l-~gH~K~It~Lawep~hl~p~~r~las~skDg~vrIWd~~~~~~~~~lsg---HT~~VTCvrwGG~gliySgS~Dr 268 (480)
T KOG0271|consen 193 GRAL-RGHKKWITALAWEPLHLVPPCRRLASSSKDGSVRIWDTKLGTCVRTLSG---HTASVTCVRWGGEGLIYSGSQDR 268 (480)
T ss_pred cccc-cCcccceeEEeecccccCCCccceecccCCCCEEEEEccCceEEEEecc---CccceEEEEEcCCceEEecCCCc
Confidence 2222 224456777887653 234555555566666666655433333322 223332 2223334444555666
Q ss_pred cEEEEcccCCCcceeeeccccccccEE
Q psy6570 182 NILKINKFGNSDFNVLANNLNRASDVL 208 (713)
Q Consensus 182 ~i~~~~~~~~~~~~~~~~~~~~~~~i~ 208 (713)
+|...+...+.-...+.......-.|+
T Consensus 269 tIkvw~a~dG~~~r~lkGHahwvN~la 295 (480)
T KOG0271|consen 269 TIKVWRALDGKLCRELKGHAHWVNHLA 295 (480)
T ss_pred eEEEEEccchhHHHhhcccchheeeee
Confidence 776666555544444444444433343
No 277
>PTZ00370 STEVOR; Provisional
Probab=78.41 E-value=0.67 Score=45.72 Aligned_cols=24 Identities=17% Similarity=0.406 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHhheeeEEEEecC
Q psy6570 689 ILILILLLITVGGIGYYIFRIKMS 712 (713)
Q Consensus 689 ~~~~~~~~~~~~~~~~~~~~~~~~ 712 (713)
++++++|+|+|+++.+|.+|||.+
T Consensus 261 alvllil~vvliilYiwlyrrRK~ 284 (296)
T PTZ00370 261 ALVLLILAVVLIILYIWLYRRRKN 284 (296)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhcc
Confidence 444444445566666666665543
No 278
>KOG0278|consensus
Probab=78.39 E-value=71 Score=31.32 Aligned_cols=107 Identities=9% Similarity=0.062 Sum_probs=60.0
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCC---eeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWP---TGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p---~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
..+..+.|.+.++.|-.++.+ .|.--+.+ ....|- ....| ...+|.|. +.+|++-.....++++|++..
T Consensus 185 s~VtSlEvs~dG~ilTia~gs---sV~Fwdak--sf~~lK--s~k~P~nV~SASL~P~-k~~fVaGged~~~~kfDy~Tg 256 (334)
T KOG0278|consen 185 SPVTSLEVSQDGRILTIAYGS---SVKFWDAK--SFGLLK--SYKMPCNVESASLHPK-KEFFVAGGEDFKVYKFDYNTG 256 (334)
T ss_pred CCCcceeeccCCCEEEEecCc---eeEEeccc--ccccee--eccCccccccccccCC-CceEEecCcceEEEEEeccCC
Confidence 457788888655444444432 23322222 222221 12223 34567754 489999988889999998765
Q ss_pred ceeEEEecCCCCccc-eeeeee-CCeEEEEeCCCCcEEEEcc
Q psy6570 149 DRFVVYHTEDNGYKP-YKLEVF-EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 149 ~~~~~~~~~~~~~~p-~~i~~~-~~~ly~td~~~~~i~~~~~ 188 (713)
........ +...| ..+.+. ++.+|-+-...+.|+....
T Consensus 257 eEi~~~nk--gh~gpVhcVrFSPdGE~yAsGSEDGTirlWQt 296 (334)
T KOG0278|consen 257 EEIGSYNK--GHFGPVHCVRFSPDGELYASGSEDGTIRLWQT 296 (334)
T ss_pred ceeeeccc--CCCCceEEEEECCCCceeeccCCCceEEEEEe
Confidence 43333211 22333 444444 6678888777777665543
No 279
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=78.31 E-value=2.5 Score=29.62 Aligned_cols=22 Identities=41% Similarity=0.955 Sum_probs=15.1
Q ss_pred EEEcCCCeeeCCCCCccCCCCc
Q psy6570 396 TCIATTQTCVCPPGFTGDTCQQ 417 (713)
Q Consensus 396 ~C~~~~~~C~C~~g~~g~~C~~ 417 (713)
.|...+++|.|+++++|..|+.
T Consensus 12 ~C~~~~G~C~C~~~~~G~~C~~ 33 (46)
T smart00180 12 TCDPDTGQCECKPNVTGRRCDR 33 (46)
T ss_pred cccCCCCEEECCCCCCCCCCCc
Confidence 4444467788888888877763
No 280
>KOG0640|consensus
Probab=78.00 E-value=60 Score=32.75 Aligned_cols=169 Identities=10% Similarity=0.037 Sum_probs=84.9
Q ss_pred ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC---CCCcceEEEcCCCCcEEEEccCCCCeEE
Q psy6570 22 LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG---LNEPYDIALEPLSGRMFWTELGIKPRIS 98 (713)
Q Consensus 22 ~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~---~~~p~~iavD~~~~~ly~td~~~~~~I~ 98 (713)
+......+.|.+.|.+..|.+.-. ...+..++.+...-.+-.... ......+-.. .++.||+|.+... .|.
T Consensus 213 ~qd~~~vrsiSfHPsGefllvgTd----Hp~~rlYdv~T~QcfvsanPd~qht~ai~~V~Ys-~t~~lYvTaSkDG-~Ik 286 (430)
T KOG0640|consen 213 FQDTEPVRSISFHPSGEFLLVGTD----HPTLRLYDVNTYQCFVSANPDDQHTGAITQVRYS-STGSLYVTASKDG-AIK 286 (430)
T ss_pred hhccceeeeEeecCCCceEEEecC----CCceeEEeccceeEeeecCcccccccceeEEEec-CCccEEEEeccCC-cEE
Confidence 556667789999998888776432 456666776654332222111 2234556666 4889999986544 444
Q ss_pred EEecCCCCc---EEEEeC-CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCC-----CCccceee-ee
Q psy6570 99 GASIDGKNK---FNLVDN-NIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTED-----NGYKPYKL-EV 168 (713)
Q Consensus 99 ~~~~dG~~~---~~l~~~-~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~-----~~~~p~~i-~~ 168 (713)
.. ||-.. +++... +-.......|.. + --|+-.++...+..+.--++.|.+..-.+. +.....++ ..
T Consensus 287 lw--DGVS~rCv~t~~~AH~gsevcSa~Ftk-n-~kyiLsSG~DS~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvFNh 362 (430)
T KOG0640|consen 287 LW--DGVSNRCVRTIGNAHGGSEVCSAVFTK-N-GKYILSSGKDSTVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVFNH 362 (430)
T ss_pred ee--ccccHHHHHHHHhhcCCceeeeEEEcc-C-CeEEeecCCcceeeeeeecCCceEEEEecCCcccchhhhhhhhhcC
Confidence 32 33211 111111 011112233331 2 223333344443333333333333221111 11111222 22
Q ss_pred eCCeEEEEeCCCCcEEEEcccCCCcceeeecc
Q psy6570 169 FEDNLYFSTYRTNNILKINKFGNSDFNVLANN 200 (713)
Q Consensus 169 ~~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~ 200 (713)
.|+++...|..++.+-..+.....++..+..+
T Consensus 363 tEdyVl~pDEas~slcsWdaRtadr~~l~slg 394 (430)
T KOG0640|consen 363 TEDYVLFPDEASNSLCSWDARTADRVALLSLG 394 (430)
T ss_pred ccceEEccccccCceeeccccchhhhhhcccC
Confidence 37888888888888888888777776666554
No 281
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=77.96 E-value=0.71 Score=34.86 Aligned_cols=14 Identities=29% Similarity=0.764 Sum_probs=7.7
Q ss_pred ceeeCCCCcccCCC
Q psy6570 655 PICICPRGYAGVRC 668 (713)
Q Consensus 655 ~~C~C~~Gy~G~~C 668 (713)
+.=+|.+||+|.+|
T Consensus 50 G~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 50 GNKVCLPGWTGPNC 63 (63)
T ss_dssp --EEE-TTEESTTS
T ss_pred CCCCCCCCCcCCCC
Confidence 44567777777766
No 282
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=77.76 E-value=3.4 Score=29.74 Aligned_cols=29 Identities=34% Similarity=0.968 Sum_probs=20.2
Q ss_pred ccCCCCcCCCCCCCCCCCCeecCCCCccCCCCCceeeCCCCcc
Q psy6570 622 SGITCSERVSCAHFCFNGGTCREQNYSLDPDLKPICICPRGYA 664 (713)
Q Consensus 622 ~G~~C~~~~~C~~~C~~~~~C~~~~~~~~~~~~~~C~C~~Gy~ 664 (713)
.|..|+.... |..++.|.+ ++|.|++||.
T Consensus 18 ~g~~C~~~~q----C~~~s~C~~----------g~C~C~~g~~ 46 (52)
T PF01683_consen 18 PGESCESDEQ----CIGGSVCVN----------GRCQCPPGYV 46 (52)
T ss_pred CCCCCCCcCC----CCCcCEEcC----------CEeECCCCCE
Confidence 3555665333 446788876 7999999985
No 283
>KOG0281|consensus
Probab=77.52 E-value=13 Score=37.90 Aligned_cols=148 Identities=15% Similarity=0.115 Sum_probs=80.0
Q ss_pred EEEeccCCeEEEeecCCCCCCeEEEEecC-CceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC---
Q psy6570 31 VAVDWVGKNLYWTDAGGRSSNNIMVSTLE-GRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN--- 106 (713)
Q Consensus 31 la~D~~~~~ly~td~~~~~~~~I~~~~~~-G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~--- 106 (713)
|.+++. +++.++-+ +...|.+-+.+ |+..++++.. -....++.+. +|+| +|-+... .|.+-+|+...
T Consensus 241 LCLqyd-~rviisGS---SDsTvrvWDv~tge~l~tlihH-ceaVLhlrf~--ng~m-vtcSkDr-siaVWdm~sps~it 311 (499)
T KOG0281|consen 241 LCLQYD-ERVIVSGS---SDSTVRVWDVNTGEPLNTLIHH-CEAVLHLRFS--NGYM-VTCSKDR-SIAVWDMASPTDIT 311 (499)
T ss_pred Eeeecc-ceEEEecC---CCceEEEEeccCCchhhHHhhh-cceeEEEEEe--CCEE-EEecCCc-eeEEEeccCchHHH
Confidence 334433 34666665 55666666654 3444455432 2445566663 4544 4444444 56666666543
Q ss_pred -cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC-ceeEEEecCCCCccceeee--eeCCeEEEEeCCCCc
Q psy6570 107 -KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK-DRFVVYHTEDNGYKPYKLE--VFEDNLYFSTYRTNN 182 (713)
Q Consensus 107 -~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~-~~~~~~~~~~~~~~p~~i~--~~~~~ly~td~~~~~ 182 (713)
+++|+. .....+-+-+| +++.++.++...|..-+.+.. ..+++. .+-.||+ -+.++|.++-...+.
T Consensus 312 ~rrVLvG-HrAaVNvVdfd---~kyIVsASgDRTikvW~~st~efvRtl~------gHkRGIAClQYr~rlvVSGSSDnt 381 (499)
T KOG0281|consen 312 LRRVLVG-HRAAVNVVDFD---DKYIVSASGDRTIKVWSTSTCEFVRTLN------GHKRGIACLQYRDRLVVSGSSDNT 381 (499)
T ss_pred HHHHHhh-hhhheeeeccc---cceEEEecCCceEEEEeccceeeehhhh------cccccceehhccCeEEEecCCCce
Confidence 222322 23333333343 446666666677766665433 233332 2334555 468899999888888
Q ss_pred EEEEcccCCCcceee
Q psy6570 183 ILKINKFGNSDFNVL 197 (713)
Q Consensus 183 i~~~~~~~~~~~~~~ 197 (713)
|..++...+.-..++
T Consensus 382 IRlwdi~~G~cLRvL 396 (499)
T KOG0281|consen 382 IRLWDIECGACLRVL 396 (499)
T ss_pred EEEEeccccHHHHHH
Confidence 888887666444443
No 284
>KOG1036|consensus
Probab=77.36 E-value=86 Score=31.71 Aligned_cols=52 Identities=10% Similarity=-0.001 Sum_probs=33.0
Q ss_pred cCCceeEEEccCccc-EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCC
Q psy6570 5 SSGNVTRVKREMNLK-TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEG 60 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~-~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G 60 (713)
..+.|.++|+.+..+ ++.+.....+.|..-+..+.|.-.-| ..+|...|+..
T Consensus 73 ~dg~vr~~Dln~~~~~~igth~~~i~ci~~~~~~~~vIsgsW----D~~ik~wD~R~ 125 (323)
T KOG1036|consen 73 LDGQVRRYDLNTGNEDQIGTHDEGIRCIEYSYEVGCVISGSW----DKTIKFWDPRN 125 (323)
T ss_pred cCceEEEEEecCCcceeeccCCCceEEEEeeccCCeEEEccc----CccEEEEeccc
Confidence 467888889887553 44555556677777666555544444 45676666554
No 285
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=77.32 E-value=3.4 Score=34.67 Aligned_cols=28 Identities=21% Similarity=0.238 Sum_probs=13.0
Q ss_pred hhHHHHHHHHHHHHHHhheeeEEEEecC
Q psy6570 685 HISSILILILLLITVGGIGYYIFRIKMS 712 (713)
Q Consensus 685 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 712 (713)
+++.+.++++++++.++.+|+..|.|.+
T Consensus 65 li~lls~v~IlVily~IyYFVILRer~~ 92 (101)
T PF06024_consen 65 LISLLSFVCILVILYAIYYFVILRERQK 92 (101)
T ss_pred HHHHHHHHHHHHHHhhheEEEEEecccc
Confidence 3333333444444445555555555443
No 286
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=77.04 E-value=97 Score=32.17 Aligned_cols=95 Identities=18% Similarity=0.170 Sum_probs=46.2
Q ss_pred CceeEEEccCccc----EEecCCCCCc-eEEEeccCCeEEEeecC--CCCCCeEEEEecCCceEEEEEcCCCC-Ccc--e
Q psy6570 7 GNVTRVKREMNLK----TVLSNLHDPR-GVAVDWVGKNLYWTDAG--GRSSNNIMVSTLEGRKKRTLLNTGLN-EPY--D 76 (713)
Q Consensus 7 ~~I~~~~~~~~~~----~~~~~~~~p~-gla~D~~~~~ly~td~~--~~~~~~I~~~~~~G~~~~~l~~~~~~-~p~--~ 76 (713)
..+.+++++.... ..+..+..|+ ..+.-..+++||+.-.. ....+.++++++....=+.+. .+. .++ .
T Consensus 88 ~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~~~~--~~p~~~r~~~ 165 (323)
T TIGR03548 88 SSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRNGKPSNKSYLFNLETQEWFELP--DFPGEPRVQP 165 (323)
T ss_pred eeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCCCccCceEEEEcCCCCCeeECC--CCCCCCCCcc
Confidence 4566777765432 2234454442 12222356899987431 012457888887654322221 111 122 2
Q ss_pred EEEcCCCCcEEEEccCCC---CeEEEEecCC
Q psy6570 77 IALEPLSGRMFWTELGIK---PRISGASIDG 104 (713)
Q Consensus 77 iavD~~~~~ly~td~~~~---~~I~~~~~dG 104 (713)
.++ ..++.||+..-... ..+++.++..
T Consensus 166 ~~~-~~~~~iYv~GG~~~~~~~~~~~yd~~~ 195 (323)
T TIGR03548 166 VCV-KLQNELYVFGGGSNIAYTDGYKYSPKK 195 (323)
T ss_pred eEE-EECCEEEEEcCCCCccccceEEEecCC
Confidence 221 24688999753211 1356666654
No 287
>KOG1407|consensus
Probab=76.87 E-value=81 Score=31.19 Aligned_cols=115 Identities=17% Similarity=0.150 Sum_probs=64.3
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce-EEEEEcC-CCCCcceEEEcCCCCcEEEEccCCCCeEEEEe
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK-KRTLLNT-GLNEPYDIALEPLSGRMFWTELGIKPRISGAS 101 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~-~~~l~~~-~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~ 101 (713)
.+...+.|++.-.+.+|- +-+ ....+.+.++++.. ++.+... .......++-||.+..+|.+-++.. .|.+-+
T Consensus 19 ~~~~v~Sv~wn~~g~~la-sgs---~dktv~v~n~e~~r~~~~~~~~gh~~svdql~w~~~~~d~~atas~dk-~ir~wd 93 (313)
T KOG1407|consen 19 HVQKVHSVAWNCDGTKLA-SGS---FDKTVSVWNLERDRFRKELVYRGHTDSVDQLCWDPKHPDLFATASGDK-TIRIWD 93 (313)
T ss_pred hhhcceEEEEcccCceee-ecc---cCCceEEEEecchhhhhhhcccCCCcchhhheeCCCCCcceEEecCCc-eEEEEE
Confidence 345567777775555553 222 34566666666542 1112211 2345678999999999999887766 666666
Q ss_pred cCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 102 IDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 102 ~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
...+.....+.... .-.-|+..|.++.+-+.+. ..+|..++.
T Consensus 94 ~r~~k~~~~i~~~~-eni~i~wsp~g~~~~~~~k-dD~it~id~ 135 (313)
T KOG1407|consen 94 IRSGKCTARIETKG-ENINITWSPDGEYIAVGNK-DDRITFIDA 135 (313)
T ss_pred eccCcEEEEeeccC-cceEEEEcCCCCEEEEecC-cccEEEEEe
Confidence 55444433333222 2245677766666655542 344554443
No 288
>KOG2139|consensus
Probab=76.69 E-value=36 Score=35.08 Aligned_cols=116 Identities=16% Similarity=0.105 Sum_probs=73.6
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEE-EEecCCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRIS-GASIDGKN 106 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~-~~~~dG~~ 106 (713)
...|+....+ .+.++.+- ....|++-+.+......|...++....-|-..|.+..||-+....-.+++ ...+-.+.
T Consensus 198 Vtsmqwn~dg-t~l~tAS~--gsssi~iWdpdtg~~~pL~~~glgg~slLkwSPdgd~lfaAt~davfrlw~e~q~wt~e 274 (445)
T KOG2139|consen 198 VTSMQWNEDG-TILVTASF--GSSSIMIWDPDTGQKIPLIPKGLGGFSLLKWSPDGDVLFAATCDAVFRLWQENQSWTKE 274 (445)
T ss_pred eeEEEEcCCC-CEEeeccc--CcceEEEEcCCCCCcccccccCCCceeeEEEcCCCCEEEEecccceeeeehhcccceec
Confidence 3455555443 33333331 45678888888776666665555666667778777777776554333444 22333333
Q ss_pred cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 107 KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 107 ~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
+..+.. .+-.+-..+|.+.+|.++-.+..+|++...++..
T Consensus 275 rw~lgs---grvqtacWspcGsfLLf~~sgsp~lysl~f~~~~ 314 (445)
T KOG2139|consen 275 RWILGS---GRVQTACWSPCGSFLLFACSGSPRLYSLTFDGED 314 (445)
T ss_pred ceeccC---CceeeeeecCCCCEEEEEEcCCceEEEEeecCCC
Confidence 434432 2556778899999999999999999999987753
No 289
>PF01414 DSL: Delta serrate ligand; InterPro: IPR001774 Ligands of the Delta/Serrate/lag-2 (DSL) family and their receptors, members of the lin-12/Notch family, mediate cell-cell interactions that specify cell fate in invertebrates and vertebrates. In Caenorhabditis elegans, two DSL genes, lag-2 and apx-1, influence different cell fate decisions during development []. Molecular interaction between Notch and Serrate, another EGF-homologous transmembrane protein containing a region of striking similarity to Delta, has been shown and the same two EGF repeats of Notch may also constitute a Serrate binding domain [, ].; GO: 0007154 cell communication, 0016020 membrane; PDB: 2VJ2_A.
Probab=76.20 E-value=0.98 Score=34.12 Aligned_cols=47 Identities=28% Similarity=0.687 Sum_probs=18.4
Q ss_pred CeeecCCCcccCCccccCCCCCCCCCCceeeCCCCCCCCCceeeCCCCcccCCC
Q psy6570 329 PHCICQENFYGTYCEKVNNSMCPCLNQGMCYPDLTHPEPTYKCHCAPSYTGARC 382 (713)
Q Consensus 329 ~~C~C~~g~~G~~C~~~~c~~~~C~~~~~C~~~~~~~~~~~~C~C~~G~~g~~C 382 (713)
+.-.|.+.|+|..|....-....-..+-.|... + .=.|.+||+|+.|
T Consensus 17 ~rv~C~~nyyG~~C~~~C~~~~d~~ghy~Cd~~-G------~~~C~~Gw~G~~C 63 (63)
T PF01414_consen 17 IRVVCDENYYGPNCSKFCKPRDDSFGHYTCDSN-G------NKVCLPGWTGPNC 63 (63)
T ss_dssp ------TTEETTTT-EE---EEETTEEEEE-SS---------EEE-TTEESTTS
T ss_pred EEEECCCCCCCccccCCcCCCcCCcCCcccCCC-C------CCCCCCCCcCCCC
Confidence 456788888888876532111011123345432 1 3467788887765
No 290
>KOG1274|consensus
Probab=76.16 E-value=1.6e+02 Score=34.34 Aligned_cols=96 Identities=13% Similarity=0.093 Sum_probs=53.9
Q ss_pred CCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCC
Q psy6570 49 SSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQ 128 (713)
Q Consensus 49 ~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~ 128 (713)
..+.|.++..+......++..-.-..+.++|+. +|.+..+..... .|..++++....+..+...-..-.+|.+||.+.
T Consensus 74 ~~~tv~~y~fps~~~~~iL~Rftlp~r~~~v~g-~g~~iaagsdD~-~vK~~~~~D~s~~~~lrgh~apVl~l~~~p~~~ 151 (933)
T KOG1274|consen 74 EQNTVLRYKFPSGEEDTILARFTLPIRDLAVSG-SGKMIAAGSDDT-AVKLLNLDDSSQEKVLRGHDAPVLQLSYDPKGN 151 (933)
T ss_pred ccceEEEeeCCCCCccceeeeeeccceEEEEec-CCcEEEeecCce-eEEEEeccccchheeecccCCceeeeeEcCCCC
Confidence 456677777654443333322112356788884 555555443333 577777665544444433333456888887666
Q ss_pred eEEEEcCCCCcEEEEeCCC
Q psy6570 129 RLYWADPKARTIESINLNG 147 (713)
Q Consensus 129 ~LY~~d~~~~~I~~~~~~g 147 (713)
.|-+++ ..+.|+.++++.
T Consensus 152 fLAvss-~dG~v~iw~~~~ 169 (933)
T KOG1274|consen 152 FLAVSS-CDGKVQIWDLQD 169 (933)
T ss_pred EEEEEe-cCceEEEEEccc
Confidence 665554 556677777643
No 291
>KOG0264|consensus
Probab=76.12 E-value=1.1e+02 Score=32.43 Aligned_cols=164 Identities=9% Similarity=0.079 Sum_probs=85.8
Q ss_pred CCCeEEEEecCCceE-------EEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC--CCCcEEEEeCCCCCCe
Q psy6570 49 SSNNIMVSTLEGRKK-------RTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID--GKNKFNLVDNNIQWPT 119 (713)
Q Consensus 49 ~~~~I~~~~~~G~~~-------~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d--G~~~~~l~~~~~~~p~ 119 (713)
..+.|...+++.... ++++..--...++++..+.+..||-+-.... ++..-++. ....+..+...-..-+
T Consensus 198 ~d~~i~lwdi~~~~~~~~~~~p~~~~~~h~~~VeDV~~h~~h~~lF~sv~dd~-~L~iwD~R~~~~~~~~~~~ah~~~vn 276 (422)
T KOG0264|consen 198 DDHTICLWDINAESKEDKVVDPKTIFSGHEDVVEDVAWHPLHEDLFGSVGDDG-KLMIWDTRSNTSKPSHSVKAHSAEVN 276 (422)
T ss_pred CCCcEEEEeccccccCCccccceEEeecCCcceehhhccccchhhheeecCCC-eEEEEEcCCCCCCCcccccccCCcee
Confidence 345566655554332 2223222355677888888888876643333 55544443 2222233333344557
Q ss_pred eEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeee--eCCeEEEEeCCCCcEEEEcccCCCcceee
Q psy6570 120 GITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEV--FEDNLYFSTYRTNNILKINKFGNSDFNVL 197 (713)
Q Consensus 120 glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~--~~~~ly~td~~~~~i~~~~~~~~~~~~~~ 197 (713)
-++|.|.++.|.-+-+..++|...|+..-... +......-..-+.|.. ..+.|+.+....+++...+...-......
T Consensus 277 ~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~~~-lh~~e~H~dev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq~~ 355 (422)
T KOG0264|consen 277 CVAFNPFNEFILATGSADKTVALWDLRNLNKP-LHTFEGHEDEVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQSP 355 (422)
T ss_pred EEEeCCCCCceEEeccCCCcEEEeechhcccC-ceeccCCCcceEEEEeCCCCCceeEecccCCcEEEEeccccccccCh
Confidence 89999999999988888888888887554332 2222111112233332 24556666655666555554322221112
Q ss_pred eccccccccEEEEeecc
Q psy6570 198 ANNLNRASDVLILQENK 214 (713)
Q Consensus 198 ~~~~~~~~~i~v~~~~~ 214 (713)
...-..|-.+.+.|..-
T Consensus 356 eda~dgppEllF~HgGH 372 (422)
T KOG0264|consen 356 EDAEDGPPELLFIHGGH 372 (422)
T ss_pred hhhccCCcceeEEecCc
Confidence 22334455555555433
No 292
>KOG0294|consensus
Probab=75.96 E-value=95 Score=31.55 Aligned_cols=169 Identities=12% Similarity=0.075 Sum_probs=87.6
Q ss_pred CCceeEEEccCc--ccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEec-CCceEEEEEcCCCCCcceEEEcCC
Q psy6570 6 SGNVTRVKREMN--LKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTL-EGRKKRTLLNTGLNEPYDIALEPL 82 (713)
Q Consensus 6 ~~~I~~~~~~~~--~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~-~G~~~~~l~~~~~~~p~~iavD~~ 82 (713)
+|.|...+.+.= ...+...-.+..+|++.|. ++|-.+-. +.+.+...+| .|+...++ .- -..+.-|..+|.
T Consensus 106 DG~i~iw~~~~W~~~~slK~H~~~Vt~lsiHPS-~KLALsVg---~D~~lr~WNLV~Gr~a~v~-~L-~~~at~v~w~~~ 179 (362)
T KOG0294|consen 106 DGHIIIWRVGSWELLKSLKAHKGQVTDLSIHPS-GKLALSVG---GDQVLRTWNLVRGRVAFVL-NL-KNKATLVSWSPQ 179 (362)
T ss_pred CCcEEEEEcCCeEEeeeecccccccceeEecCC-CceEEEEc---CCceeeeehhhcCccceee-cc-CCcceeeEEcCC
Confidence 455554444431 1222334455888999875 67777766 4556655554 34332222 11 144666888876
Q ss_pred CCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC-CceeEEEecCCCCc
Q psy6570 83 SGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG-KDRFVVYHTEDNGY 161 (713)
Q Consensus 83 ~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g-~~~~~~~~~~~~~~ 161 (713)
..+.++.-. + +|....++.......+... ..+.-+.++ ....|++.- .+..|...|-|. .-...+... ..
T Consensus 180 Gd~F~v~~~--~-~i~i~q~d~A~v~~~i~~~-~r~l~~~~l-~~~~L~vG~-d~~~i~~~D~ds~~~~~~~~AH---~~ 250 (362)
T KOG0294|consen 180 GDHFVVSGR--N-KIDIYQLDNASVFREIENP-KRILCATFL-DGSELLVGG-DNEWISLKDTDSDTPLTEFLAH---EN 250 (362)
T ss_pred CCEEEEEec--c-EEEEEecccHhHhhhhhcc-ccceeeeec-CCceEEEec-CCceEEEeccCCCccceeeecc---hh
Confidence 665666543 3 6666666654332222211 234445555 355666553 335566666653 323333222 12
Q ss_pred cceeeeee---CCeEEEEeCCCCcEEEEccc
Q psy6570 162 KPYKLEVF---EDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 162 ~p~~i~~~---~~~ly~td~~~~~i~~~~~~ 189 (713)
.--+|.++ +.++.++....+.|...+++
T Consensus 251 RVK~i~~~~~~~~~~lvTaSSDG~I~vWd~~ 281 (362)
T KOG0294|consen 251 RVKDIASYTNPEHEYLVTASSDGFIKVWDID 281 (362)
T ss_pred heeeeEEEecCCceEEEEeccCceEEEEEcc
Confidence 23445543 34677777777777776654
No 293
>PRK10115 protease 2; Provisional
Probab=75.76 E-value=1.7e+02 Score=34.24 Aligned_cols=115 Identities=10% Similarity=0.098 Sum_probs=62.5
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCC-ceEEEEEcCC-CCCcceEEEcCCCCcEEEEcc-CCCCeEEEEecCCCCcEEEEe
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEG-RKKRTLLNTG-LNEPYDIALEPLSGRMFWTEL-GIKPRISGASIDGKNKFNLVD 112 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G-~~~~~l~~~~-~~~p~~iavD~~~~~ly~td~-~~~~~I~~~~~dG~~~~~l~~ 112 (713)
.++.||+.........+|.+.+++. ..-+.++... -....++++. .+.|+++.. +...+|+++++++.....|.
T Consensus 278 ~~~~ly~~tn~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~i~~~~~~--~~~l~~~~~~~g~~~l~~~~~~~~~~~~l~- 354 (686)
T PRK10115 278 YQHRFYLRSNRHGKNFGLYRTRVRDEQQWEELIPPRENIMLEGFTLF--TDWLVVEERQRGLTSLRQINRKTREVIGIA- 354 (686)
T ss_pred CCCEEEEEEcCCCCCceEEEecCCCcccCeEEECCCCCCEEEEEEEE--CCEEEEEEEeCCEEEEEEEcCCCCceEEec-
Confidence 3467777543222456788888763 2234455442 2346667775 556666643 33347888887765443332
Q ss_pred CCCCCCeeEE---E--eCCCCeEEEEc---CCCCcEEEEeCCCCceeEEEe
Q psy6570 113 NNIQWPTGIT---I--DYPSQRLYWAD---PKARTIESINLNGKDRFVVYH 155 (713)
Q Consensus 113 ~~~~~p~gla---v--d~~~~~LY~~d---~~~~~I~~~~~~g~~~~~~~~ 155 (713)
+..|..++ + ++.++.|+++- .....|++++++....+.+..
T Consensus 355 --~~~~~~~~~~~~~~~~~~~~~~~~~ss~~~P~~~y~~d~~~~~~~~l~~ 403 (686)
T PRK10115 355 --FDDPAYVTWIAYNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVLKQ 403 (686)
T ss_pred --CCCCceEeeecccCCCCCceEEEEEecCCCCCEEEEEECCCCcEEEEEe
Confidence 12232222 2 23345566543 345679999987765555543
No 294
>KOG0263|consensus
Probab=75.30 E-value=1.5e+02 Score=33.82 Aligned_cols=173 Identities=12% Similarity=0.035 Sum_probs=85.9
Q ss_pred CCceeEEEccCcccEE-ecCCCCC-ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 6 SGNVTRVKREMNLKTV-LSNLHDP-RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~-~~~~~~p-~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
+..+....+++..-+| ..+-..| ..+.|.|. +.-|.|-+ .....+++..+- .....++...+....-+.+.|..
T Consensus 472 D~svRLWsl~t~s~~V~y~GH~~PVwdV~F~P~-GyYFatas-~D~tArLWs~d~--~~PlRifaghlsDV~cv~FHPNs 547 (707)
T KOG0263|consen 472 DSSVRLWSLDTWSCLVIYKGHLAPVWDVQFAPR-GYYFATAS-HDQTARLWSTDH--NKPLRIFAGHLSDVDCVSFHPNS 547 (707)
T ss_pred CcceeeeecccceeEEEecCCCcceeeEEecCC-ceEEEecC-CCceeeeeeccc--CCchhhhcccccccceEEECCcc
Confidence 4455555566544333 4444555 34556554 44443433 112234444433 33333333446667778888754
Q ss_pred CcEEEEccCCCCeEEEEec-CCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcc
Q psy6570 84 GRMFWTELGIKPRISGASI-DGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYK 162 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~-dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~ 162 (713)
.++ .|.+... .+..-+. .|..+ .++...-.....|++.|.+.+|- +-...+.|..-|+.+..+....... -..
T Consensus 548 ~Y~-aTGSsD~-tVRlWDv~~G~~V-RiF~GH~~~V~al~~Sp~Gr~La-Sg~ed~~I~iWDl~~~~~v~~l~~H--t~t 621 (707)
T KOG0263|consen 548 NYV-ATGSSDR-TVRLWDVSTGNSV-RIFTGHKGPVTALAFSPCGRYLA-SGDEDGLIKIWDLANGSLVKQLKGH--TGT 621 (707)
T ss_pred ccc-ccCCCCc-eEEEEEcCCCcEE-EEecCCCCceEEEEEcCCCceEe-ecccCCcEEEEEcCCCcchhhhhcc--cCc
Confidence 433 3332222 3333232 23333 34444555567899997554443 3335566766676554332222211 223
Q ss_pred ceeeeee-CCeEEEEeCCCCcEEEEcc
Q psy6570 163 PYKLEVF-EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 163 p~~i~~~-~~~ly~td~~~~~i~~~~~ 188 (713)
-..|.+. ++.++++....+.|...+.
T Consensus 622 i~SlsFS~dg~vLasgg~DnsV~lWD~ 648 (707)
T KOG0263|consen 622 IYSLSFSRDGNVLASGGADNSVRLWDL 648 (707)
T ss_pred eeEEEEecCCCEEEecCCCCeEEEEEc
Confidence 3444443 5567788877787776654
No 295
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=74.92 E-value=2.8 Score=43.40 Aligned_cols=17 Identities=18% Similarity=0.063 Sum_probs=6.5
Q ss_pred hhHHHHHHHHHHHHHHh
Q psy6570 685 HISSILILILLLITVGG 701 (713)
Q Consensus 685 ~~~~~~~~~~~~~~~~~ 701 (713)
+++++++++||||+|++
T Consensus 275 IaVG~~La~lvlivLia 291 (306)
T PF01299_consen 275 IAVGAALAGLVLIVLIA 291 (306)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 33343333333333333
No 296
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=74.89 E-value=12 Score=31.89 Aligned_cols=31 Identities=23% Similarity=0.575 Sum_probs=24.3
Q ss_pred ccCCCCC-CCCCCCCeeeccCCCceeeeCCCCc
Q psy6570 219 VTNHCDD-KPCHQSALCINLPSSHTCLCPDHLT 250 (713)
Q Consensus 219 ~~~~C~~-~~C~~~~~C~~~~g~~~C~C~~G~~ 250 (713)
..++|.. ..|++.++|.. ..+..|.|++||.
T Consensus 76 p~d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~ 107 (110)
T PF00954_consen 76 PKDQCDVYGFCGPNGICNS-NNSPKCSCLPGFE 107 (110)
T ss_pred cccCCCCccccCCccEeCC-CCCCceECCCCcC
Confidence 3468886 58999999954 4466899999997
No 297
>KOG2048|consensus
Probab=74.64 E-value=1.5e+02 Score=33.27 Aligned_cols=99 Identities=16% Similarity=0.084 Sum_probs=58.0
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceE-EEEEcC-CCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKK-RTLLNT-GLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~-~~l~~~-~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~ 105 (713)
...||+....+.|-++.. .+.|...++.-... +.++.. .-....+|+.- .+++||=++.. +.|...++-.-
T Consensus 28 I~slA~s~kS~~lAvsRt----~g~IEiwN~~~~w~~~~vi~g~~drsIE~L~W~-e~~RLFS~g~s--g~i~EwDl~~l 100 (691)
T KOG2048|consen 28 IVSLAYSHKSNQLAVSRT----DGNIEIWNLSNNWFLEPVIHGPEDRSIESLAWA-EGGRLFSSGLS--GSITEWDLHTL 100 (691)
T ss_pred eEEEEEeccCCceeeecc----CCcEEEEccCCCceeeEEEecCCCCceeeEEEc-cCCeEEeecCC--ceEEEEecccC
Confidence 467899888888887765 68898888876553 333322 23456677765 46667665532 23555554433
Q ss_pred CcEEEEeCCCCCCeeEEEeCCCCeEEEE
Q psy6570 106 NKFNLVDNNIQWPTGITIDYPSQRLYWA 133 (713)
Q Consensus 106 ~~~~l~~~~~~~p~glavd~~~~~LY~~ 133 (713)
.++............|++.+.+..|-+.
T Consensus 101 k~~~~~d~~gg~IWsiai~p~~~~l~Ig 128 (691)
T KOG2048|consen 101 KQKYNIDSNGGAIWSIAINPENTILAIG 128 (691)
T ss_pred ceeEEecCCCcceeEEEeCCccceEEee
Confidence 3333333334445566666655555554
No 298
>KOG3516|consensus
Probab=74.17 E-value=29 Score=41.41 Aligned_cols=25 Identities=32% Similarity=0.854 Sum_probs=22.5
Q ss_pred CCCCCCCCCCCeeeccCCCceeeeC
Q psy6570 222 HCDDKPCHQSALCINLPSSHTCLCP 246 (713)
Q Consensus 222 ~C~~~~C~~~~~C~~~~g~~~C~C~ 246 (713)
-|+..+|.+++.|+....+|+|.|.
T Consensus 957 hCss~~C~NGG~Cvery~gytCDCs 981 (1306)
T KOG3516|consen 957 HCSSYPCLNGGHCVERYDGYTCDCS 981 (1306)
T ss_pred ccccccccCCCEEEEecCceeeccc
Confidence 4777799999999999999999996
No 299
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=74.16 E-value=99 Score=30.89 Aligned_cols=113 Identities=17% Similarity=0.152 Sum_probs=69.2
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCC-CeEEEEecCCC
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIK-PRISGASIDGK 105 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~-~~I~~~~~dG~ 105 (713)
.+..+|+.+....+.+.... .....+++...++.....+....+. ...+|+. +.++..+.+.. .++.+...+|+
T Consensus 25 ~~~s~AvS~dg~~~A~v~~~-~~~~~L~~~~~~~~~~~~~~g~~l~---~PS~d~~-g~~W~v~~~~~~~~~~~~~~~g~ 99 (253)
T PF10647_consen 25 DVTSPAVSPDGSRVAAVSEG-DGGRSLYVGPAGGPVRPVLTGGSLT---RPSWDPD-GWVWTVDDGSGGVRVVRDSASGT 99 (253)
T ss_pred cccceEECCCCCeEEEEEEc-CCCCEEEEEcCCCcceeeccCCccc---cccccCC-CCEEEEEcCCCceEEEEecCCCc
Confidence 67888888888777665511 1467788888777666555333344 4478875 77777766544 22333234565
Q ss_pred CcEEEEeCC-CC-CCeeEEEeCCCCeEEEEc--CCCCcEEEEe
Q psy6570 106 NKFNLVDNN-IQ-WPTGITIDYPSQRLYWAD--PKARTIESIN 144 (713)
Q Consensus 106 ~~~~l~~~~-~~-~p~glavd~~~~~LY~~d--~~~~~I~~~~ 144 (713)
...+.+... +. ....|.|+++..||-+.- ....+|+...
T Consensus 100 ~~~~~v~~~~~~~~I~~l~vSpDG~RvA~v~~~~~~~~v~va~ 142 (253)
T PF10647_consen 100 GEPVEVDWPGLRGRITALRVSPDGTRVAVVVEDGGGGRVYVAG 142 (253)
T ss_pred ceeEEecccccCCceEEEEECCCCcEEEEEEecCCCCeEEEEE
Confidence 555544322 22 568899999988865553 3446666654
No 300
>KOG0263|consensus
Probab=73.57 E-value=1.6e+02 Score=33.63 Aligned_cols=161 Identities=12% Similarity=0.072 Sum_probs=79.6
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCC-CCeEEEEecCCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGI-KPRISGASIDGKN 106 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~-~~~I~~~~~dG~~ 106 (713)
..|..|-|.++.| ++-+ ....|..-+|+.....++....+.-.+++.+.| .|+-|.|-++. .+++|..+- ..
T Consensus 454 Vyg~sFsPd~rfL-lScS---ED~svRLWsl~t~s~~V~y~GH~~PVwdV~F~P-~GyYFatas~D~tArLWs~d~--~~ 526 (707)
T KOG0263|consen 454 VYGCSFSPDRRFL-LSCS---EDSSVRLWSLDTWSCLVIYKGHLAPVWDVQFAP-RGYYFATASHDQTARLWSTDH--NK 526 (707)
T ss_pred eeeeeecccccce-eecc---CCcceeeeecccceeEEEecCCCcceeeEEecC-CceEEEecCCCceeeeeeccc--CC
Confidence 4677777764433 4544 445666667776655566653333344577775 45544444332 235665544 23
Q ss_pred cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC-CCCceeEEEecCCCCccceeeeeeC-CeEEEEeCCCCcEE
Q psy6570 107 KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINL-NGKDRFVVYHTEDNGYKPYKLEVFE-DNLYFSTYRTNNIL 184 (713)
Q Consensus 107 ~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~-~g~~~~~~~~~~~~~~~p~~i~~~~-~~ly~td~~~~~i~ 184 (713)
...++...+.--.-++|.|...++. +.+...+++.-+. .|..++++... ...-.+|++.- ++...+-...+.|.
T Consensus 527 PlRifaghlsDV~cv~FHPNs~Y~a-TGSsD~tVRlWDv~~G~~VRiF~GH---~~~V~al~~Sp~Gr~LaSg~ed~~I~ 602 (707)
T KOG0263|consen 527 PLRIFAGHLSDVDCVSFHPNSNYVA-TGSSDRTVRLWDVSTGNSVRIFTGH---KGPVTALAFSPCGRYLASGDEDGLIK 602 (707)
T ss_pred chhhhcccccccceEEECCcccccc-cCCCCceEEEEEcCCCcEEEEecCC---CCceEEEEEcCCCceEeecccCCcEE
Confidence 3333333455545588887554443 2233444555443 45555555321 22224445442 22223333455566
Q ss_pred EEcccCCCcceeeec
Q psy6570 185 KINKFGNSDFNVLAN 199 (713)
Q Consensus 185 ~~~~~~~~~~~~~~~ 199 (713)
..+..++..+..+..
T Consensus 603 iWDl~~~~~v~~l~~ 617 (707)
T KOG0263|consen 603 IWDLANGSLVKQLKG 617 (707)
T ss_pred EEEcCCCcchhhhhc
Confidence 666555544444433
No 301
>KOG3607|consensus
Probab=73.40 E-value=6.4 Score=45.30 Aligned_cols=32 Identities=28% Similarity=0.828 Sum_probs=25.7
Q ss_pred CCCCCCCCCCCeEecCCCCcceeecCCCcccCCCCc
Q psy6570 593 PSCHNYCDNAGLCSYSKQGKPVCTCVNGWSGITCSE 628 (713)
Q Consensus 593 ~~C~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C~~ 628 (713)
..|+..|+.+|+|.+.. .|.|.+||.+..|+.
T Consensus 626 ~~~~~~C~g~GVCnn~~----~ChC~~gwapp~C~~ 657 (716)
T KOG3607|consen 626 SCCPTTCNGHGVCNNEL----NCHCEPGWAPPFCFI 657 (716)
T ss_pred cccccccCCCcccCCCc----ceeeCCCCCCCcccc
Confidence 33555699999997554 699999999999987
No 302
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=73.14 E-value=1.9 Score=28.31 Aligned_cols=20 Identities=10% Similarity=0.092 Sum_probs=9.7
Q ss_pred ccchhHHHHHHHHHHHHHHh
Q psy6570 682 VNSHISSILILILLLITVGG 701 (713)
Q Consensus 682 ~~~~~~~~~~~~~~~~~~~~ 701 (713)
..+.+++++++++++++.++
T Consensus 5 ~IaIIv~V~vg~~iiii~~~ 24 (38)
T PF02439_consen 5 TIAIIVAVVVGMAIIIICMF 24 (38)
T ss_pred hhhHHHHHHHHHHHHHHHHH
Confidence 34455555555555543333
No 303
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=73.10 E-value=31 Score=37.99 Aligned_cols=64 Identities=19% Similarity=0.190 Sum_probs=42.1
Q ss_pred CCCCcceEEEcCCCCcEEEEccCCC---------------CeEEEEecCCC-------CcEEEEe----CC---------
Q psy6570 70 GLNEPYDIALEPLSGRMFWTELGIK---------------PRISGASIDGK-------NKFNLVD----NN--------- 114 (713)
Q Consensus 70 ~~~~p~~iavD~~~~~ly~td~~~~---------------~~I~~~~~dG~-------~~~~l~~----~~--------- 114 (713)
.+.+|.+|++.|..+.+|++..++. ++|+|....+. .-.+++. ..
T Consensus 415 ~mdRpE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~ 494 (616)
T COG3211 415 PMDRPEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANI 494 (616)
T ss_pred cccCccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccCc
Confidence 4789999999999999999865433 45777655432 1112221 11
Q ss_pred ----CCCCeeEEEeCCCCeEEEEc
Q psy6570 115 ----IQWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 115 ----~~~p~glavd~~~~~LY~~d 134 (713)
+..|.+|+||+ .++|++..
T Consensus 495 ~~~~f~~PDnl~fD~-~GrLWi~T 517 (616)
T COG3211 495 NANWFNSPDNLAFDP-WGRLWIQT 517 (616)
T ss_pred ccccccCCCceEECC-CCCEEEEe
Confidence 44599999996 45666653
No 304
>PTZ00046 rifin; Provisional
Probab=73.07 E-value=1.4 Score=45.45 Aligned_cols=29 Identities=24% Similarity=0.259 Sum_probs=14.2
Q ss_pred chhHHHHHHHHHH-HHHHhheeeEEEEecC
Q psy6570 684 SHISSILILILLL-ITVGGIGYYIFRIKMS 712 (713)
Q Consensus 684 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 712 (713)
+++++++++++++ |.|++-|+.|+|||.|
T Consensus 316 aIiaSiiAIvVIVLIMvIIYLILRYRRKKK 345 (358)
T PTZ00046 316 AIIASIVAIVVIVLIMVIIYLILRYRRKKK 345 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcch
Confidence 3444544444444 4444455556665543
No 305
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=72.60 E-value=6.4 Score=28.32 Aligned_cols=20 Identities=40% Similarity=1.326 Sum_probs=13.1
Q ss_pred CCCCCCEEEcCCCeeeCCCCCc
Q psy6570 390 KCHNGGTCIATTQTCVCPPGFT 411 (713)
Q Consensus 390 ~C~~~~~C~~~~~~C~C~~g~~ 411 (713)
.|..++.|++ +.|.|++||.
T Consensus 27 qC~~~s~C~~--g~C~C~~g~~ 46 (52)
T PF01683_consen 27 QCIGGSVCVN--GRCQCPPGYV 46 (52)
T ss_pred CCCCcCEEcC--CEeECCCCCE
Confidence 3446667754 6777777765
No 306
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=72.50 E-value=1e+02 Score=34.13 Aligned_cols=116 Identities=12% Similarity=0.045 Sum_probs=59.4
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC--CCCcceEEEcCCCCcEEEEcc--------C---CCCeEEEEec
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG--LNEPYDIALEPLSGRMFWTEL--------G---IKPRISGASI 102 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~--~~~p~~iavD~~~~~ly~td~--------~---~~~~I~~~~~ 102 (713)
.+++|++... .+|..+++.|+......... ...=+++..+|.+..|+.+.. . ..-.|..++.
T Consensus 157 ~nG~ll~~~~-----~~~~e~D~~G~v~~~~~l~~~~~~~HHD~~~l~nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd~ 231 (477)
T PF05935_consen 157 PNGNLLIGSG-----NRLYEIDLLGKVIWEYDLPGGYYDFHHDIDELPNGNLLILASETKYVDEDKDVDTVEDVIVEVDP 231 (477)
T ss_dssp TTS-EEEEEB-----TEEEEE-TT--EEEEEE--TTEE-B-S-EEE-TTS-EEEEEEETTEE-TS-EE---S-EEEEE-T
T ss_pred CCCCEEEecC-----CceEEEcCCCCEEEeeecCCcccccccccEECCCCCEEEEEeecccccCCCCccEecCEEEEECC
Confidence 4566666543 67888999998654432222 122457777765444444431 1 0125666666
Q ss_pred CCCCcEEEEeCC--------------------------CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEec
Q psy6570 103 DGKNKFNLVDNN--------------------------IQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHT 156 (713)
Q Consensus 103 dG~~~~~l~~~~--------------------------~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~ 156 (713)
+|......-... -.+.++|.+|+.++.|+++-...+.|++++.......-++..
T Consensus 232 tG~vv~~wd~~d~ld~~~~~~~~~~~~~~~~~~~~~~DW~H~Nsi~yd~~dd~iivSsR~~s~V~~Id~~t~~i~Wilg~ 311 (477)
T PF05935_consen 232 TGEVVWEWDFFDHLDPYRDTVLKPYPYGDISGSGGGRDWLHINSIDYDPSDDSIIVSSRHQSAVIKIDYRTGKIKWILGP 311 (477)
T ss_dssp TS-EEEEEEGGGTS-TT--TTGGT--SSSSS-SSTTSBS--EEEEEEETTTTEEEEEETTT-EEEEEE-TTS-EEEEES-
T ss_pred CCCEEEEEehHHhCCcccccccccccccccccCCCCCCccccCccEEeCCCCeEEEEcCcceEEEEEECCCCcEEEEeCC
Confidence 665443322100 023578999988999999999999999999766666555544
No 307
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=72.41 E-value=1.5 Score=45.07 Aligned_cols=29 Identities=24% Similarity=0.270 Sum_probs=14.1
Q ss_pred chhHHHHHHHHHH-HHHHhheeeEEEEecC
Q psy6570 684 SHISSILILILLL-ITVGGIGYYIFRIKMS 712 (713)
Q Consensus 684 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 712 (713)
+++++++++++++ |.|++-|+.|+|||.|
T Consensus 311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKKK 340 (353)
T TIGR01477 311 PIIASIIAILIIVLIMVIIYLILRYRRKKK 340 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcch
Confidence 3444444444433 4444455556665543
No 308
>KOG0303|consensus
Probab=72.29 E-value=1.3e+02 Score=31.53 Aligned_cols=140 Identities=14% Similarity=0.095 Sum_probs=74.6
Q ss_pred ccCCceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEE-EE-EcCCCCCcceEEEc
Q psy6570 4 ISSGNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKR-TL-LNTGLNEPYDIALE 80 (713)
Q Consensus 4 ~~~~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~-~l-~~~~~~~p~~iavD 80 (713)
..++.|...+.++.+..+ +..-....++.+.+. +.++.|-. ...+|.+.+....... .- ...+...++.|-+
T Consensus 151 g~Dn~v~iWnv~tgeali~l~hpd~i~S~sfn~d-Gs~l~Ttc---kDKkvRv~dpr~~~~v~e~~~heG~k~~Raifl- 225 (472)
T KOG0303|consen 151 GSDNTVSIWNVGTGEALITLDHPDMVYSMSFNRD-GSLLCTTC---KDKKVRVIDPRRGTVVSEGVAHEGAKPARAIFL- 225 (472)
T ss_pred cCCceEEEEeccCCceeeecCCCCeEEEEEeccC-Cceeeeec---ccceeEEEcCCCCcEeeecccccCCCcceeEEe-
Confidence 345566666666655444 222223456677755 45556665 5678888887543322 11 2233344445554
Q ss_pred CCCCcEEEEccCCCCeEEEEecCCCCc-EEEEeCCCCCCeeEE---EeCCCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 81 PLSGRMFWTELGIKPRISGASIDGKNK-FNLVDNNIQWPTGIT---IDYPSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 81 ~~~~~ly~td~~~~~~I~~~~~dG~~~-~~l~~~~~~~p~gla---vd~~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
.++.|+-|.....+....+-.|-... +.+....+...+|+- +|++++.||+.-.+.+.|+-+.+....
T Consensus 226 -~~g~i~tTGfsr~seRq~aLwdp~nl~eP~~~~elDtSnGvl~PFyD~dt~ivYl~GKGD~~IRYyEit~d~ 297 (472)
T KOG0303|consen 226 -ASGKIFTTGFSRMSERQIALWDPNNLEEPIALQELDTSNGVLLPFYDPDTSIVYLCGKGDSSIRYFEITNEP 297 (472)
T ss_pred -ccCceeeeccccccccceeccCcccccCcceeEEeccCCceEEeeecCCCCEEEEEecCCcceEEEEecCCC
Confidence 45666666544321111111111111 112222344456655 488889999999888888887776544
No 309
>KOG0281|consensus
Probab=71.39 E-value=21 Score=36.34 Aligned_cols=99 Identities=14% Similarity=-0.012 Sum_probs=56.6
Q ss_pred CcEEEEccCCCCeEEEEecCCCC-cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc-eeEEEecCCCCc
Q psy6570 84 GRMFWTELGIKPRISGASIDGKN-KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD-RFVVYHTEDNGY 161 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~-~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~-~~~~~~~~~~~~ 161 (713)
.++.++.++.. .|.+-+++.-. .+++ ...-.|||--.-+++|.++-+..+.|...+..-.. .+++..... +
T Consensus 330 ~kyIVsASgDR-TikvW~~st~efvRtl----~gHkRGIAClQYr~rlvVSGSSDntIRlwdi~~G~cLRvLeGHEe-L- 402 (499)
T KOG0281|consen 330 DKYIVSASGDR-TIKVWSTSTCEFVRTL----NGHKRGIACLQYRDRLVVSGSSDNTIRLWDIECGACLRVLEGHEE-L- 402 (499)
T ss_pred cceEEEecCCc-eEEEEeccceeeehhh----hcccccceehhccCeEEEecCCCceEEEEeccccHHHHHHhchHH-h-
Confidence 44666666655 66666665332 2222 23457899888899999999888888888875433 222221111 1
Q ss_pred cceeeeeeCCeEEEEeCCCCcEEEEcccCC
Q psy6570 162 KPYKLEVFEDNLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 162 ~p~~i~~~~~~ly~td~~~~~i~~~~~~~~ 191 (713)
-.-|.+++.+ .++-...++|...+....
T Consensus 403 -vRciRFd~kr-IVSGaYDGkikvWdl~aa 430 (499)
T KOG0281|consen 403 -VRCIRFDNKR-IVSGAYDGKIKVWDLQAA 430 (499)
T ss_pred -hhheeecCce-eeeccccceEEEEecccc
Confidence 1223334443 455566677776665443
No 310
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=71.19 E-value=1.4e+02 Score=31.82 Aligned_cols=137 Identities=7% Similarity=0.005 Sum_probs=66.5
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCce--EEEEEcCCCC-Ccc-eEEEcCCCCcEEEEccCC----------CCeEEEEe
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRK--KRTLLNTGLN-EPY-DIALEPLSGRMFWTELGI----------KPRISGAS 101 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~--~~~l~~~~~~-~p~-~iavD~~~~~ly~td~~~----------~~~I~~~~ 101 (713)
.++.||+.-.. ....+++++++... -..+ ..+. .|+ +.++-..++.||+.--.. ...+++.+
T Consensus 37 ~~~~iyv~gG~--~~~~~~~~d~~~~~~~W~~l--~~~p~~~r~~~~~v~~~~~IYV~GG~~~~~~~~~~~~~~~v~~YD 112 (376)
T PRK14131 37 DNNTVYVGLGS--AGTSWYKLDLNAPSKGWTKI--AAFPGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYD 112 (376)
T ss_pred ECCEEEEEeCC--CCCeEEEEECCCCCCCeEEC--CcCCCCCcccceEEEECCEEEEEcCCCCCCCCCceeEcccEEEEe
Confidence 57899997542 34568888886421 1111 1221 233 222222468899974221 12467777
Q ss_pred cCCCCcEEEEeCCCCCC-eeEE-EeCCCCeEEEEcCC---------------------------------------CCcE
Q psy6570 102 IDGKNKFNLVDNNIQWP-TGIT-IDYPSQRLYWADPK---------------------------------------ARTI 140 (713)
Q Consensus 102 ~dG~~~~~l~~~~~~~p-~gla-vd~~~~~LY~~d~~---------------------------------------~~~I 140 (713)
+....=+.+. .....+ .+.+ +-..+++||+.-.. ...+
T Consensus 113 ~~~n~W~~~~-~~~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v 191 (376)
T PRK14131 113 PKTNSWQKLD-TRSPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGKDKTPKDKINDAYFDKKPEDYFFNKEV 191 (376)
T ss_pred CCCCEEEeCC-CCCCCcccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhcccchhhhhhhHHHHhcCChhhcCcCceE
Confidence 6543222221 111111 1111 11136899998432 2457
Q ss_pred EEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEe
Q psy6570 141 ESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 141 ~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td 177 (713)
+++|.....-+.+...........++.+.+++||+..
T Consensus 192 ~~YD~~t~~W~~~~~~p~~~~~~~a~v~~~~~iYv~G 228 (376)
T PRK14131 192 LSYDPSTNQWKNAGESPFLGTAGSAVVIKGNKLWLIN 228 (376)
T ss_pred EEEECCCCeeeECCcCCCCCCCcceEEEECCEEEEEe
Confidence 7777655444333222111122345666788999874
No 311
>PHA03098 kelch-like protein; Provisional
Probab=70.83 E-value=1.9e+02 Score=32.59 Aligned_cols=166 Identities=13% Similarity=0.007 Sum_probs=79.2
Q ss_pred CceeEEEccCcccEEecCCCCCce-EEEeccCCeEEEeecCC--CCCCeEEEEecCCceEEEEEcCCCCCcc-eEEEcCC
Q psy6570 7 GNVTRVKREMNLKTVLSNLHDPRG-VAVDWVGKNLYWTDAGG--RSSNNIMVSTLEGRKKRTLLNTGLNEPY-DIALEPL 82 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~~p~g-la~D~~~~~ly~td~~~--~~~~~I~~~~~~G~~~~~l~~~~~~~p~-~iavD~~ 82 (713)
..++++++.+..-..+..+..|+. .++-..+++||+.-... .....+.++++....=+.+ ..+..|+ +.++-..
T Consensus 311 ~~v~~yd~~~~~W~~~~~~~~~R~~~~~~~~~~~lyv~GG~~~~~~~~~v~~yd~~~~~W~~~--~~lp~~r~~~~~~~~ 388 (534)
T PHA03098 311 NSVVSYDTKTKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREE--PPLIFPRYNPCVVNV 388 (534)
T ss_pred ccEEEEeCCCCeeeECCCCCcccccceEEEECCEEEEEeCCCCCEecceEEEEcCCCCceeeC--CCcCcCCccceEEEE
Confidence 356667766654333444443421 22223468899864311 1234577777765432221 2233333 1122224
Q ss_pred CCcEEEEccCC-----CCeEEEEecCCCCcEEEEeCCCCC-CeeEEEeCCCCeEEEEcCC--------CCcEEEEeCCCC
Q psy6570 83 SGRMFWTELGI-----KPRISGASIDGKNKFNLVDNNIQW-PTGITIDYPSQRLYWADPK--------ARTIESINLNGK 148 (713)
Q Consensus 83 ~~~ly~td~~~-----~~~I~~~~~dG~~~~~l~~~~~~~-p~glavd~~~~~LY~~d~~--------~~~I~~~~~~g~ 148 (713)
++.||+..... ...+++.++....=+.+....... -..+++ .+++||+.-.. ...+++++....
T Consensus 389 ~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~--~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~ 466 (534)
T PHA03098 389 NNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIY--HDGKIYVIGGISYIDNIKVYNIVESYNPVTN 466 (534)
T ss_pred CCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCCccccCceEEE--ECCEEEEECCccCCCCCcccceEEEecCCCC
Confidence 78999975311 125778777643222221111111 112233 36789987532 234778777654
Q ss_pred ceeEEEecCCCCccceeeeeeCCeEEEEe
Q psy6570 149 DRFVVYHTEDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 149 ~~~~~~~~~~~~~~p~~i~~~~~~ly~td 177 (713)
.-+.+..... .....++.+.+++||+..
T Consensus 467 ~W~~~~~~~~-~r~~~~~~~~~~~iyv~G 494 (534)
T PHA03098 467 KWTELSSLNF-PRINASLCIFNNKIYVVG 494 (534)
T ss_pred ceeeCCCCCc-ccccceEEEECCEEEEEc
Confidence 4444332211 112234556688898874
No 312
>KOG0288|consensus
Probab=70.82 E-value=90 Score=32.87 Aligned_cols=126 Identities=10% Similarity=-0.000 Sum_probs=75.1
Q ss_pred cccCCceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-C---CCcceE
Q psy6570 3 SISSGNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-L---NEPYDI 77 (713)
Q Consensus 3 d~~~~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-~---~~p~~i 77 (713)
+...++|.-++..+.. ...++......+|.+.+.+..|... . ..+.+.++++.+...+..+... + ..-..+
T Consensus 318 gH~DkkvRfwD~Rs~~~~~sv~~gg~vtSl~ls~~g~~lLss-s---RDdtl~viDlRt~eI~~~~sA~g~k~asDwtrv 393 (459)
T KOG0288|consen 318 GHFDKKVRFWDIRSADKTRSVPLGGRVTSLDLSMDGLELLSS-S---RDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRV 393 (459)
T ss_pred cccccceEEEeccCCceeeEeecCcceeeEeeccCCeEEeee-c---CCCceeeeecccccEEEEeecccccccccccee
Confidence 4455667777755433 2235556677777777766666644 4 5678888888887776655432 1 223445
Q ss_pred EEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCC--CCeeEEEeCCCCeEEEEc
Q psy6570 78 ALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQ--WPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 78 avD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~--~p~glavd~~~~~LY~~d 134 (713)
++.|. .=|++-.+.+++|+.-+..+.....++...-. ..+.+++++.+..|.-+|
T Consensus 394 vfSpd--~~YvaAGS~dgsv~iW~v~tgKlE~~l~~s~s~~aI~s~~W~~sG~~Llsad 450 (459)
T KOG0288|consen 394 VFSPD--GSYVAAGSADGSVYIWSVFTGKLEKVLSLSTSNAAITSLSWNPSGSGLLSAD 450 (459)
T ss_pred EECCC--CceeeeccCCCcEEEEEccCceEEEEeccCCCCcceEEEEEcCCCchhhccc
Confidence 66553 23454444455788888877776666643322 346677776665555544
No 313
>KOG0308|consensus
Probab=70.75 E-value=89 Score=34.93 Aligned_cols=175 Identities=8% Similarity=0.021 Sum_probs=91.2
Q ss_pred cCCceeEEEccCccc-EE-------ecCCC-----CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCC
Q psy6570 5 SSGNVTRVKREMNLK-TV-------LSNLH-----DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGL 71 (713)
Q Consensus 5 ~~~~I~~~~~~~~~~-~~-------~~~~~-----~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~ 71 (713)
.+++|+..+.+.... ++ ...+. ...+||... ++.++++-. ..+-|...+.....+..-++.--
T Consensus 138 LD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~-t~t~ivsGg---tek~lr~wDprt~~kimkLrGHT 213 (735)
T KOG0308|consen 138 LDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQ-TGTIIVSGG---TEKDLRLWDPRTCKKIMKLRGHT 213 (735)
T ss_pred CCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCC-cceEEEecC---cccceEEeccccccceeeeeccc
Confidence 356777777774321 11 11222 234566553 345666654 45556667766544433333334
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC-Cce
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG-KDR 150 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g-~~~ 150 (713)
...+.|.++....++.-+.+. . .|..-++.-+.-..-+...-.....|..++.-..+|..+. .+.|++.++.. ...
T Consensus 214 dNVr~ll~~dDGt~~ls~sSD-g-tIrlWdLgqQrCl~T~~vH~e~VWaL~~~~sf~~vYsG~r-d~~i~~Tdl~n~~~~ 290 (735)
T KOG0308|consen 214 DNVRVLLVNDDGTRLLSASSD-G-TIRLWDLGQQRCLATYIVHKEGVWALQSSPSFTHVYSGGR-DGNIYRTDLRNPAKS 290 (735)
T ss_pred cceEEEEEcCCCCeEeecCCC-c-eEEeeeccccceeeeEEeccCceEEEeeCCCcceEEecCC-CCcEEecccCCchhh
Confidence 667888888655455443322 2 4555555433322111112223567778877778887774 56688888866 334
Q ss_pred eEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcc
Q psy6570 151 FVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 151 ~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~ 188 (713)
+.+..... ...-+.+..+++.+ |+......|.+...
T Consensus 291 tlick~da-Pv~~l~~~~~~~~~-WvtTtds~I~rW~~ 326 (735)
T KOG0308|consen 291 TLICKEDA-PVLKLHLHEHDNSV-WVTTTDSSIKRWKL 326 (735)
T ss_pred eEeecCCC-chhhhhhccccCCc-eeeeccccceecCC
Confidence 44443321 12223344445556 55555566766643
No 314
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=70.66 E-value=2.6 Score=37.56 Aligned_cols=17 Identities=29% Similarity=0.086 Sum_probs=8.9
Q ss_pred cccchhHHHHHHHHHHH
Q psy6570 681 YVNSHISSILILILLLI 697 (713)
Q Consensus 681 ~~~~~~~~~~~~~~~~~ 697 (713)
.+.+.++++.+++||++
T Consensus 50 IVIGvVVGVGg~ill~i 66 (154)
T PF04478_consen 50 IVIGVVVGVGGPILLGI 66 (154)
T ss_pred EEEEEEecccHHHHHHH
Confidence 44556666555444443
No 315
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=70.59 E-value=1.3 Score=38.28 Aligned_cols=34 Identities=15% Similarity=0.106 Sum_probs=20.7
Q ss_pred ccccchhHHHHHHHHHH-HHHHhheeeEEEEecCC
Q psy6570 680 SYVNSHISSILILILLL-ITVGGIGYYIFRIKMST 713 (713)
Q Consensus 680 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 713 (713)
......+++|+++++.- |+++++++|..||++|+
T Consensus 60 ~fs~~~i~~Ii~gv~aGvIg~Illi~y~irR~~Kk 94 (122)
T PF01102_consen 60 RFSEPAIIGIIFGVMAGVIGIILLISYCIRRLRKK 94 (122)
T ss_dssp SSS-TCHHHHHHHHHHHHHHHHHHHHHHHHHHS--
T ss_pred CccccceeehhHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 44556777777666665 66666777777777664
No 316
>PRK10115 protease 2; Provisional
Probab=69.89 E-value=2.2e+02 Score=33.18 Aligned_cols=154 Identities=8% Similarity=0.086 Sum_probs=74.8
Q ss_pred eEEEeccCCeEEEeecCC--CCCCeEEEEecCCc--eEEEEEcCCCCCcce--EEEcCCCCcEEEEccC-CCCeEEEEec
Q psy6570 30 GVAVDWVGKNLYWTDAGG--RSSNNIMVSTLEGR--KKRTLLNTGLNEPYD--IALEPLSGRMFWTELG-IKPRISGASI 102 (713)
Q Consensus 30 gla~D~~~~~ly~td~~~--~~~~~I~~~~~~G~--~~~~l~~~~~~~p~~--iavD~~~~~ly~td~~-~~~~I~~~~~ 102 (713)
++++.+.++.||++-... .....|++.++... ..+.|.... ..+.. +......+.|++.... ..+.++....
T Consensus 176 ~~~w~~D~~~~~y~~~~~~~~~~~~v~~h~lgt~~~~d~lv~~e~-~~~~~~~~~~s~d~~~l~i~~~~~~~~~~~l~~~ 254 (686)
T PRK10115 176 SFVWANDSWTFYYVRKHPVTLLPYQVWRHTIGTPASQDELVYEEK-DDTFYVSLHKTTSKHYVVIHLASATTSEVLLLDA 254 (686)
T ss_pred EEEEeeCCCEEEEEEecCCCCCCCEEEEEECCCChhHCeEEEeeC-CCCEEEEEEEcCCCCEEEEEEECCccccEEEEEC
Confidence 466655556676653311 13367888888765 344455432 22222 2222234445443222 2235665553
Q ss_pred ---CCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc---CCCCcEEEEeCCCC-ceeEEEecCCCCccceeeeeeCCeEEE
Q psy6570 103 ---DGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD---PKARTIESINLNGK-DRFVVYHTEDNGYKPYKLEVFEDNLYF 175 (713)
Q Consensus 103 ---dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d---~~~~~I~~~~~~g~-~~~~~~~~~~~~~~p~~i~~~~~~ly~ 175 (713)
++. .+.++......- ..+....+.||+.. ..+.+|.+++++.. ..+.++.... .....++.+++++|++
T Consensus 255 ~~~~~~-~~~~~~~~~~~~--~~~~~~~~~ly~~tn~~~~~~~l~~~~~~~~~~~~~l~~~~~-~~~i~~~~~~~~~l~~ 330 (686)
T PRK10115 255 ELADAE-PFVFLPRRKDHE--YSLDHYQHRFYLRSNRHGKNFGLYRTRVRDEQQWEELIPPRE-NIMLEGFTLFTDWLVV 330 (686)
T ss_pred cCCCCC-ceEEEECCCCCE--EEEEeCCCEEEEEEcCCCCCceEEEecCCCcccCeEEECCCC-CCEEEEEEEECCEEEE
Confidence 333 233333222222 22333456777764 24557888887642 2344443321 1234567777888877
Q ss_pred EeCC--CCcEEEEcc
Q psy6570 176 STYR--TNNILKINK 188 (713)
Q Consensus 176 td~~--~~~i~~~~~ 188 (713)
+... ...|+.++.
T Consensus 331 ~~~~~g~~~l~~~~~ 345 (686)
T PRK10115 331 EERQRGLTSLRQINR 345 (686)
T ss_pred EEEeCCEEEEEEEcC
Confidence 6543 334555554
No 317
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=69.43 E-value=1.5 Score=44.16 Aligned_cols=32 Identities=25% Similarity=0.351 Sum_probs=0.0
Q ss_pred cccchhHHHHHH-HHHHHHHHhheeeEEEEecC
Q psy6570 681 YVNSHISSILIL-ILLLITVGGIGYYIFRIKMS 712 (713)
Q Consensus 681 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 712 (713)
+...++.++|++ +|||+++|+.+.||+||+.|
T Consensus 145 yL~T~IpaVVI~~iLLIA~iIa~icyrrkR~GK 177 (290)
T PF05454_consen 145 YLHTFIPAVVIAAILLIAGIIACICYRRKRKGK 177 (290)
T ss_dssp ---------------------------------
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccc
Confidence 333444444443 33345555566666666544
No 318
>KOG1517|consensus
Probab=69.18 E-value=1.1e+02 Score=36.34 Aligned_cols=159 Identities=11% Similarity=0.062 Sum_probs=87.3
Q ss_pred CceEEEecc--CCeEEEeecCCCCCCeEEEEecCCceEEEEEc-CCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 28 PRGVAVDWV--GKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLN-TGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 28 p~gla~D~~--~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~-~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
-.|+.+||. +++||++- ....|.+-|.+-..+..-+. .....+..|.-|..+|.++++..... +|..+++.-
T Consensus 1166 ~~~~v~dWqQ~~G~Ll~tG----d~r~IRIWDa~~E~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDG-svRvyD~R~ 1240 (1387)
T KOG1517|consen 1166 GTGLVVDWQQQSGHLLVTG----DVRSIRIWDAHKEQVVADIPYGSSTLVTALSADLVHGNIIAAGFADG-SVRVYDRRM 1240 (1387)
T ss_pred CCCeeeehhhhCCeEEecC----CeeEEEEEecccceeEeecccCCCccceeecccccCCceEEEeecCC-ceEEeeccc
Confidence 356889994 56788664 25667777777655433222 22356888988988899999876644 666665543
Q ss_pred CCcEEEE--eCCC-CC--CeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCC---CccceeeeeeCCeEEEE
Q psy6570 105 KNKFNLV--DNNI-QW--PTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDN---GYKPYKLEVFEDNLYFS 176 (713)
Q Consensus 105 ~~~~~l~--~~~~-~~--p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~---~~~p~~i~~~~~~ly~t 176 (713)
..+..++ .... .. ..++.+.+.+-.=.|+-...+.|+..++..+....+..-... ...-.+|.+++..=.++
T Consensus 1241 a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~hapiiA 1320 (1387)
T KOG1517|consen 1241 APPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTALTVHEHAPIIA 1320 (1387)
T ss_pred CCccccceeecccCCcccceeEEeecCCCcceeeeccCCeEEEEecccCcccccceeeeccccCccceeeeeccCCCeee
Confidence 3332222 1111 12 356666643333345666778888888777433222221110 11235566765433333
Q ss_pred eCCCCcEEEEcccCC
Q psy6570 177 TYRTNNILKINKFGN 191 (713)
Q Consensus 177 d~~~~~i~~~~~~~~ 191 (713)
......|..++..|.
T Consensus 1321 sGs~q~ikIy~~~G~ 1335 (1387)
T KOG1517|consen 1321 SGSAQLIKIYSLSGE 1335 (1387)
T ss_pred ecCcceEEEEecChh
Confidence 333355666655443
No 319
>KOG1407|consensus
Probab=69.13 E-value=1.2e+02 Score=29.94 Aligned_cols=56 Identities=14% Similarity=0.116 Sum_probs=32.5
Q ss_pred CcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeE
Q psy6570 73 EPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRL 130 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~L 130 (713)
..+.|+.. ..+.|||...+.. .|+.++...-.+..-+...-..-.-|.|||+++++
T Consensus 149 e~ne~~w~-~~nd~Fflt~GlG-~v~ILsypsLkpv~si~AH~snCicI~f~p~Gryf 204 (313)
T KOG1407|consen 149 EVNEISWN-NSNDLFFLTNGLG-CVEILSYPSLKPVQSIKAHPSNCICIEFDPDGRYF 204 (313)
T ss_pred eeeeeeec-CCCCEEEEecCCc-eEEEEeccccccccccccCCcceEEEEECCCCceE
Confidence 34566666 5677888776755 77777766433333332222333567888766544
No 320
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=68.97 E-value=1.5e+02 Score=30.99 Aligned_cols=137 Identities=9% Similarity=0.005 Sum_probs=67.3
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCC-Ccc---eEEEcCCCCcEEEEccCC----------CCeEEEEe
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLN-EPY---DIALEPLSGRMFWTELGI----------KPRISGAS 101 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~-~p~---~iavD~~~~~ly~td~~~----------~~~I~~~~ 101 (713)
.+++||++-.. ....+++++++....+-.....+. .|+ ++++ .++.||+.--.. ...+++.+
T Consensus 16 ~~~~vyv~GG~--~~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~--~~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd 91 (346)
T TIGR03547 16 IGDKVYVGLGS--AGTSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAA--IDGKLYVFGGIGKANSEGSPQVFDDVYRYD 91 (346)
T ss_pred ECCEEEEEccc--cCCeeEEEECCCCCCCceECCCCCCCCcccceEEE--ECCEEEEEeCCCCCCCCCcceecccEEEEE
Confidence 46899997431 335678888642221111112222 222 2333 468999975321 12577777
Q ss_pred cCCCCcEEEEeCCCCCCe-eE-EEeCCCCeEEEEcCC---------------------------------------CCcE
Q psy6570 102 IDGKNKFNLVDNNIQWPT-GI-TIDYPSQRLYWADPK---------------------------------------ARTI 140 (713)
Q Consensus 102 ~dG~~~~~l~~~~~~~p~-gl-avd~~~~~LY~~d~~---------------------------------------~~~I 140 (713)
+....=+.+. .....+. +. ++-..+++||+.-.. .+.|
T Consensus 92 ~~~~~W~~~~-~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 170 (346)
T TIGR03547 92 PKKNSWQKLD-TRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPPEDYFWNKNV 170 (346)
T ss_pred CCCCEEecCC-CCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCChhHcCccceE
Confidence 6543222221 1111111 22 121237899997432 1467
Q ss_pred EEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEe
Q psy6570 141 ESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 141 ~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td 177 (713)
+++|.....-+.+...........++++.+++||+.-
T Consensus 171 ~~YDp~t~~W~~~~~~p~~~r~~~~~~~~~~~iyv~G 207 (346)
T TIGR03547 171 LSYDPSTNQWRNLGENPFLGTAGSAIVHKGNKLLLIN 207 (346)
T ss_pred EEEECCCCceeECccCCCCcCCCceEEEECCEEEEEe
Confidence 7887765444443322111112345667788999874
No 321
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=68.91 E-value=12 Score=31.31 Aligned_cols=24 Identities=25% Similarity=0.513 Sum_probs=12.9
Q ss_pred CCCCCCCeEecCCC----CcceeecCCC
Q psy6570 597 NYCDNAGLCSYSKQ----GKPVCTCVNG 620 (713)
Q Consensus 597 ~~C~~~g~C~~~~~----g~~~C~C~~G 620 (713)
+.|++||.|..... .=|.|.|.+.
T Consensus 13 n~CsgHG~C~~~~~~~~~~C~~C~C~~T 40 (103)
T PF12955_consen 13 NNCSGHGSCVKKYGSGGGDCFACKCKPT 40 (103)
T ss_pred cCCCCCceEeeccCCCccceEEEEeecc
Confidence 44666666655421 2356777663
No 322
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=68.87 E-value=1.9e+02 Score=32.01 Aligned_cols=158 Identities=13% Similarity=0.084 Sum_probs=72.4
Q ss_pred cCCeEEEeec-CCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCC
Q psy6570 36 VGKNLYWTDA-GGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNN 114 (713)
Q Consensus 36 ~~~~ly~td~-~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~ 114 (713)
....||+... .......++.+|.+|..+-.+...... -..+... .+|.|++... . +|..+++.|+.....-...
T Consensus 112 ~~~gl~~~~~~~~~~~~~~~~iD~~G~Vrw~~~~~~~~-~~~~~~l-~nG~ll~~~~--~-~~~e~D~~G~v~~~~~l~~ 186 (477)
T PF05935_consen 112 MEDGLYFVNGNDWDSSSYTYLIDNNGDVRWYLPLDSGS-DNSFKQL-PNGNLLIGSG--N-RLYEIDLLGKVIWEYDLPG 186 (477)
T ss_dssp -TT-EEEEEETT--BEEEEEEEETTS-EEEEE-GGGT---SSEEE--TTS-EEEEEB--T-EEEEE-TT--EEEEEE--T
T ss_pred cCCcEEEEeCCCCCCCceEEEECCCccEEEEEccCccc-cceeeEc-CCCCEEEecC--C-ceEEEcCCCCEEEeeecCC
Confidence 3445555543 111356788899999876554432211 1114444 4777777665 3 8899999988544432211
Q ss_pred -C-CCCeeEEEeCCCCeEEEEc------------CCCCcEEEEeCCCCceeEEEecC-----------------------
Q psy6570 115 -I-QWPTGITIDYPSQRLYWAD------------PKARTIESINLNGKDRFVVYHTE----------------------- 157 (713)
Q Consensus 115 -~-~~p~glavd~~~~~LY~~d------------~~~~~I~~~~~~g~~~~~~~~~~----------------------- 157 (713)
. ..=..+...+.+..|+.+. .....|..++.+|.-+..+....
T Consensus 187 ~~~~~HHD~~~l~nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd~tG~vv~~wd~~d~ld~~~~~~~~~~~~~~~~~~~~ 266 (477)
T PF05935_consen 187 GYYDFHHDIDELPNGNLLILASETKYVDEDKDVDTVEDVIVEVDPTGEVVWEWDFFDHLDPYRDTVLKPYPYGDISGSGG 266 (477)
T ss_dssp TEE-B-S-EEE-TTS-EEEEEEETTEE-TS-EE---S-EEEEE-TTS-EEEEEEGGGTS-TT--TTGGT--SSSSS-SST
T ss_pred cccccccccEECCCCCEEEEEeecccccCCCCccEecCEEEEECCCCCEEEEEehHHhCCcccccccccccccccccCCC
Confidence 0 1124555555444444443 11234666665554333321111
Q ss_pred -CCCccceeeeee--CCeEEEEeCCCCcEEEEcccCCCcceeee
Q psy6570 158 -DNGYKPYKLEVF--EDNLYFSTYRTNNILKINKFGNSDFNVLA 198 (713)
Q Consensus 158 -~~~~~p~~i~~~--~~~ly~td~~~~~i~~~~~~~~~~~~~~~ 198 (713)
..+.|..+|.++ .+.|+++......|++++..++...-++.
T Consensus 267 ~~DW~H~Nsi~yd~~dd~iivSsR~~s~V~~Id~~t~~i~Wilg 310 (477)
T PF05935_consen 267 GRDWLHINSIDYDPSDDSIIVSSRHQSAVIKIDYRTGKIKWILG 310 (477)
T ss_dssp TSBS--EEEEEEETTTTEEEEEETTT-EEEEEE-TTS-EEEEES
T ss_pred CCCccccCccEEeCCCCeEEEEcCcceEEEEEECCCCcEEEEeC
Confidence 012344567766 47899998888899999965554444443
No 323
>KOG0649|consensus
Probab=68.63 E-value=81 Score=30.83 Aligned_cols=134 Identities=9% Similarity=0.049 Sum_probs=0.0
Q ss_pred CCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC-CCCcEEEEeCCCC
Q psy6570 70 GLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADP-KARTIESINLNGK 148 (713)
Q Consensus 70 ~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~-~~~~I~~~~~~g~ 148 (713)
.+...++|-+||..+.|+++. +.. .|+..++.....+..+.....+-..++.-..++.|+-... ++-|||......-
T Consensus 113 evPeINam~ldP~enSi~~Ag-GD~-~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~qilsG~EDGtvRvWd~kt~k~ 190 (325)
T KOG0649|consen 113 EVPEINAMWLDPSENSILFAG-GDG-VIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQILSGAEDGTVRVWDTKTQKH 190 (325)
T ss_pred cCCccceeEeccCCCcEEEec-CCe-EEEEEEecCCEEEEEEcCCcceeeeeeecccCcceeecCCCccEEEEeccccce
Q ss_pred ceeEEEecCCCCccc------eeeeeeCCeEEEEeCCCCcEEEEcccCCCcceeeeccccccc
Q psy6570 149 DRFVVYHTEDNGYKP------YKLEVFEDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRAS 205 (713)
Q Consensus 149 ~~~~~~~~~~~~~~p------~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 205 (713)
...+-.........| .++++.+++|.--...+-.++-+........--+........
T Consensus 191 v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgGp~lslwhLrsse~t~vfpipa~v~~v~ 253 (325)
T KOG0649|consen 191 VSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGGPKLSLWHLRSSESTCVFPIPARVHLVD 253 (325)
T ss_pred eEEeccccChhhcCcccCceeEEEeccCceEEecCCCceeEEeccCCCceEEEecccceeEee
No 324
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=68.40 E-value=1.1e+02 Score=31.66 Aligned_cols=129 Identities=11% Similarity=0.086 Sum_probs=69.6
Q ss_pred CeEEEEecCCc-----eEEEEEcCCC-CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC-cEEEEeCCCCCCeeEEE
Q psy6570 51 NNIMVSTLEGR-----KKRTLLNTGL-NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN-KFNLVDNNIQWPTGITI 123 (713)
Q Consensus 51 ~~I~~~~~~G~-----~~~~l~~~~~-~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~-~~~l~~~~~~~p~glav 123 (713)
++|.++..... ..+.+....+ ..+.+|+. .+++|.++-. + +|...+++.+. .......... ...+.+
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~--~~~~lv~~~g--~-~l~v~~l~~~~~l~~~~~~~~~-~~i~sl 135 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICS--FNGRLVVAVG--N-KLYVYDLDNSKTLLKKAFYDSP-FYITSL 135 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEESS-EEEEEE--ETTEEEEEET--T-EEEEEEEETTSSEEEEEEE-BS-SSEEEE
T ss_pred cEEEEEEEEcccccceEEEEEEEEeecCcceEhhh--hCCEEEEeec--C-EEEEEEccCcccchhhheecce-EEEEEE
Confidence 77888887763 2333322222 33555555 4777666653 2 67777777766 3333322221 144455
Q ss_pred eCCCCeEEEEcCCCC-cEEEEeCCCCceeEEEecCCCCccceeeeee--CCeEEEEeCCCCcEEEEc
Q psy6570 124 DYPSQRLYWADPKAR-TIESINLNGKDRFVVYHTEDNGYKPYKLEVF--EDNLYFSTYRTNNILKIN 187 (713)
Q Consensus 124 d~~~~~LY~~d~~~~-~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~--~~~ly~td~~~~~i~~~~ 187 (713)
...+++|+++|...+ .+.+++.++.....+..... ..+..++.+. ++.+..+|... .|+.+.
T Consensus 136 ~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~-~~~v~~~~~l~d~~~~i~~D~~g-nl~~l~ 200 (321)
T PF03178_consen 136 SVFKNYILVGDAMKSVSLLRYDEENNKLILVARDYQ-PRWVTAAEFLVDEDTIIVGDKDG-NLFVLR 200 (321)
T ss_dssp EEETTEEEEEESSSSEEEEEEETTTE-EEEEEEESS--BEEEEEEEE-SSSEEEEEETTS-EEEEEE
T ss_pred eccccEEEEEEcccCEEEEEEEccCCEEEEEEecCC-CccEEEEEEecCCcEEEEEcCCC-eEEEEE
Confidence 555889999997544 35555665554555554432 3345555543 45677777654 444333
No 325
>KOG2055|consensus
Probab=68.32 E-value=1.7e+02 Score=31.32 Aligned_cols=113 Identities=16% Similarity=0.095 Sum_probs=54.6
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcC-CCC--CcceEEEcCCCCcEEEEccCCCCeEEEEecCCC
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNT-GLN--EPYDIALEPLSGRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~-~~~--~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~ 105 (713)
...+|-+.+....++-. ....++.++|.......+-.. ++. .-.-.+|.+... |++-.+.++.|....+...
T Consensus 261 ~~a~f~p~G~~~i~~s~---rrky~ysyDle~ak~~k~~~~~g~e~~~~e~FeVShd~~--fia~~G~~G~I~lLhakT~ 335 (514)
T KOG2055|consen 261 QKAEFAPNGHSVIFTSG---RRKYLYSYDLETAKVTKLKPPYGVEEKSMERFEVSHDSN--FIAIAGNNGHIHLLHAKTK 335 (514)
T ss_pred ceeeecCCCceEEEecc---cceEEEEeeccccccccccCCCCcccchhheeEecCCCC--eEEEcccCceEEeehhhhh
Confidence 33445555442333444 456677777765444433321 111 122345554444 3333344446666554332
Q ss_pred CcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 106 NKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 106 ~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
....-+. --....+++|+.+..+|| +-...+.||.+++.-.
T Consensus 336 eli~s~K-ieG~v~~~~fsSdsk~l~-~~~~~GeV~v~nl~~~ 376 (514)
T KOG2055|consen 336 ELITSFK-IEGVVSDFTFSSDSKELL-ASGGTGEVYVWNLRQN 376 (514)
T ss_pred hhhheee-eccEEeeEEEecCCcEEE-EEcCCceEEEEecCCc
Confidence 2211111 012346788886665554 4445668888887665
No 326
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=66.76 E-value=1.5e+02 Score=30.04 Aligned_cols=51 Identities=20% Similarity=0.200 Sum_probs=29.6
Q ss_pred CCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCC
Q psy6570 49 SSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 49 ~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~ 105 (713)
...+|+++.+=|...+ ++-. .-=||++.+..|||-.|-..|.+++-+.+|+
T Consensus 18 d~~~iY~felvG~~P~----SGGD--TYNAV~~vDd~IyFGGWVHAPa~y~gk~~g~ 68 (339)
T PF09910_consen 18 DSEKIYRFELVGPPPT----SGGD--TYNAVEWVDDFIYFGGWVHAPAVYEGKGDGR 68 (339)
T ss_pred CceEEEEeeeccCCCC----CCCc--cceeeeeecceEEEeeeecCCceeeeccCCc
Confidence 3567777776664332 1112 2245666788888877765555665555444
No 327
>KOG3567|consensus
Probab=66.21 E-value=17 Score=38.61 Aligned_cols=51 Identities=20% Similarity=0.250 Sum_probs=31.0
Q ss_pred CCcEEEEeCCCCceeEEE-ecCCCCccceeeeeeCC-eEEEEeCCCCcEEEEccc
Q psy6570 137 ARTIESINLNGKDRFVVY-HTEDNGYKPYKLEVFED-NLYFSTYRTNNILKINKF 189 (713)
Q Consensus 137 ~~~I~~~~~~g~~~~~~~-~~~~~~~~p~~i~~~~~-~ly~td~~~~~i~~~~~~ 189 (713)
..+|.++++. ++.++. .....+.-|.+|+++.+ ..|++|..++.+++.+..
T Consensus 444 ~~~ilvi~~~--n~~~l~~~g~~~fylphgl~~dkdgf~~~tdvash~v~k~k~~ 496 (501)
T KOG3567|consen 444 EDTILVIDPN--NAAVLQSSGKNLFYLPHGLSIDKDGFYWVTDVASHQVFKLKPN 496 (501)
T ss_pred cceEEEEcCc--chhhhhhccCCceecCCcceecCCCcEEeecccchhhhhcccc
Confidence 4567777776 333332 22335677899999955 455556667777665543
No 328
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=66.14 E-value=35 Score=39.97 Aligned_cols=179 Identities=13% Similarity=0.127 Sum_probs=83.9
Q ss_pred CceeEEEccCc-ccEE-ecCCCCCceEEEeccCCeEEE-eecCC-CCCCeEEEEecCCceE--EEEEcCCCCCcceEEEc
Q psy6570 7 GNVTRVKREMN-LKTV-LSNLHDPRGVAVDWVGKNLYW-TDAGG-RSSNNIMVSTLEGRKK--RTLLNTGLNEPYDIALE 80 (713)
Q Consensus 7 ~~I~~~~~~~~-~~~~-~~~~~~p~gla~D~~~~~ly~-td~~~-~~~~~I~~~~~~G~~~--~~l~~~~~~~p~~iavD 80 (713)
++|..+|.++. .+++ +..-......++.|.+++|-+ +-... .+...|++.+|+.+.. ..|-.+...-|+=-++.
T Consensus 329 ~~L~~~D~dG~n~~~ve~~~~~~i~sP~~SPDG~~vAY~ts~e~~~g~s~vYv~~L~t~~~~~vkl~ve~aaiprwrv~e 408 (912)
T TIGR02171 329 GNLAYIDYTKGASRAVEIEDTISVYHPDISPDGKKVAFCTGIEGLPGKSSVYVRNLNASGSGLVKLPVENAAIPRWRVLE 408 (912)
T ss_pred CeEEEEecCCCCceEEEecCCCceecCcCCCCCCEEEEEEeecCCCCCceEEEEehhccCCCceEeecccccccceEecC
Confidence 47778887773 3444 333222333456666666655 43311 1244588887765432 22222233334433332
Q ss_pred -CCCCcEEEEccCCC---------CeEEEEecCCCC--cEEEEeCC----CCCCeeEEEeCCCCeEEEE---c----CCC
Q psy6570 81 -PLSGRMFWTELGIK---------PRISGASIDGKN--KFNLVDNN----IQWPTGITIDYPSQRLYWA---D----PKA 137 (713)
Q Consensus 81 -~~~~~ly~td~~~~---------~~I~~~~~dG~~--~~~l~~~~----~~~p~glavd~~~~~LY~~---d----~~~ 137 (713)
-.+-.+||+|.+++ +.-.+-..+|+. .+.|+... +..-..|||. +.+|.-+ + ...
T Consensus 409 ~gdt~ivyv~~a~nn~d~~~~~~~stw~v~f~~gkfg~p~kl~dga~hggvs~~~~lavt--ga~llr~~~~~~~~~~~~ 486 (912)
T TIGR02171 409 NGDTVIVYVSDASNNKDDATFAAYSTWQVPFANGKFGTPKKLFDGAYHGGVSEDLNLAVS--GARLLRAHVANEDVDNGK 486 (912)
T ss_pred CCCeEEEEEcCCCCCcchhhhhhcceEEEEecCCCCCCchhhhccccccccccCCceeee--hhhHhhhhhcccccccCc
Confidence 11225799987765 122233344432 33344322 2233445554 2222211 1 111
Q ss_pred CcEE---------EEeCCCCceeEEEecCCCCccc-eeeeee-CCeEEEEeCCCCcEEEEc
Q psy6570 138 RTIE---------SINLNGKDRFVVYHTEDNGYKP-YKLEVF-EDNLYFSTYRTNNILKIN 187 (713)
Q Consensus 138 ~~I~---------~~~~~g~~~~~~~~~~~~~~~p-~~i~~~-~~~ly~td~~~~~i~~~~ 187 (713)
..|| ++.-||++|++++.+......- .|-.+- .++|+++|...+.|..|.
T Consensus 487 ~~vwyn~eqacn~sl~~d~~~rt~fldfgg~tg~~fvg~~y~~he~~lvads~gklv~~v~ 547 (912)
T TIGR02171 487 DDVWYNGEQACNASLAKDGSKRTLFLDFGGSTGQAFVGQKYGVHERLLVADSKGKLVRAVA 547 (912)
T ss_pred cceeecchhccchhhhccCCcceEEEecCCccchhhccccccceeEEEEecCCCchhhhcc
Confidence 1222 2335788888888764311111 111111 456888888777776664
No 329
>KOG0284|consensus
Probab=66.10 E-value=33 Score=35.83 Aligned_cols=76 Identities=11% Similarity=0.058 Sum_probs=42.6
Q ss_pred CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCcc
Q psy6570 116 QWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDF 194 (713)
Q Consensus 116 ~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~ 194 (713)
.....||+.+ ++..|.+-+..++|..-+..-.....++.. ..-.+..++.+ ...|..+-...+.|.-.++.++.-+
T Consensus 181 eaIRdlafSp-nDskF~t~SdDg~ikiWdf~~~kee~vL~G--HgwdVksvdWHP~kgLiasgskDnlVKlWDprSg~cl 257 (464)
T KOG0284|consen 181 EAIRDLAFSP-NDSKFLTCSDDGTIKIWDFRMPKEERVLRG--HGWDVKSVDWHPTKGLIASGSKDNLVKLWDPRSGSCL 257 (464)
T ss_pred hhhheeccCC-CCceeEEecCCCeEEEEeccCCchhheecc--CCCCcceeccCCccceeEEccCCceeEeecCCCcchh
Confidence 4567899997 778888877777776655433222222222 23455666655 2334444444445666665554433
No 330
>KOG3567|consensus
Probab=65.12 E-value=2.1 Score=45.12 Aligned_cols=53 Identities=17% Similarity=0.185 Sum_probs=36.5
Q ss_pred CeEEEEecCCCCcEE-EEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 95 PRISGASIDGKNKFN-LVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 95 ~~I~~~~~dG~~~~~-l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
.+|.|+++....++. +-...+..|.||.+| .++..|++|...+.+......++
T Consensus 445 ~~ilvi~~~n~~~l~~~g~~~fylphgl~~d-kdgf~~~tdvash~v~k~k~~~~ 498 (501)
T KOG3567|consen 445 DTILVIDPNNAAVLQSSGKNLFYLPHGLSID-KDGFYWVTDVASHQVFKLKPNNK 498 (501)
T ss_pred ceEEEEcCcchhhhhhccCCceecCCcceec-CCCcEEeecccchhhhhcccccc
Confidence 366677666333322 223357789999999 67888889988888877666554
No 331
>KOG0283|consensus
Probab=64.46 E-value=2.7e+02 Score=32.07 Aligned_cols=158 Identities=9% Similarity=-0.005 Sum_probs=78.0
Q ss_pred ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCC-CCcceEEEcCCCCcEEEEccCCCCeEEEE
Q psy6570 22 LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGL-NEPYDIALEPLSGRMFWTELGIKPRISGA 100 (713)
Q Consensus 22 ~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~-~~p~~iavD~~~~~ly~td~~~~~~I~~~ 100 (713)
.........|+|.|.+++.|++-+ -.++|.+-+.-.. +++...++ ....++.+-|. |...+....+. .....
T Consensus 406 F~HndfVTcVaFnPvDDryFiSGS---LD~KvRiWsI~d~--~Vv~W~Dl~~lITAvcy~Pd-Gk~avIGt~~G-~C~fY 478 (712)
T KOG0283|consen 406 FSHNDFVTCVAFNPVDDRYFISGS---LDGKVRLWSISDK--KVVDWNDLRDLITAVCYSPD-GKGAVIGTFNG-YCRFY 478 (712)
T ss_pred EecCCeeEEEEecccCCCcEeecc---cccceEEeecCcC--eeEeehhhhhhheeEEeccC-CceEEEEEecc-EEEEE
Confidence 445666789999999999998876 5666666655442 23332222 33555555554 33222222212 33333
Q ss_pred ecCCCCcEEEEe--------CCCCCCeeEEEeCCCC-eEEEEcCCCCcEEEEeCCCCceeEEEecCCCCc--cceeeeee
Q psy6570 101 SIDGKNKFNLVD--------NNIQWPTGITIDYPSQ-RLYWADPKARTIESINLNGKDRFVVYHTEDNGY--KPYKLEVF 169 (713)
Q Consensus 101 ~~dG~~~~~l~~--------~~~~~p~glavd~~~~-~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~--~p~~i~~~ 169 (713)
+..|-..+.-.. ..-++.+||.+.+..- +|.|+- ...+|+.++..-......+....... .-..+..+
T Consensus 479 ~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTS-nDSrIRI~d~~~~~lv~KfKG~~n~~SQ~~Asfs~D 557 (712)
T KOG0283|consen 479 DTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVLVTS-NDSRIRIYDGRDKDLVHKFKGFRNTSSQISASFSSD 557 (712)
T ss_pred EccCCeEEEeeeEeeccCccccCceeeeeEecCCCCCeEEEec-CCCceEEEeccchhhhhhhcccccCCcceeeeEccC
Confidence 333322111110 1123578888875443 576665 45678887763333322222211111 12234444
Q ss_pred CCeEEEEeCCCCcEEEEcc
Q psy6570 170 EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 170 ~~~ly~td~~~~~i~~~~~ 188 (713)
+.+|.-+. ....|+..+.
T Consensus 558 gk~IVs~s-eDs~VYiW~~ 575 (712)
T KOG0283|consen 558 GKHIVSAS-EDSWVYIWKN 575 (712)
T ss_pred CCEEEEee-cCceEEEEeC
Confidence 55554443 5555665553
No 332
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=63.26 E-value=7 Score=41.99 Aligned_cols=34 Identities=12% Similarity=0.255 Sum_probs=20.2
Q ss_pred cccccchhHHHHHHHHHHHH-HHhheeeEEEEecC
Q psy6570 679 QSYVNSHISSILILILLLIT-VGGIGYYIFRIKMS 712 (713)
Q Consensus 679 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 712 (713)
...+.+.|++|.|++||||. ||.+|.|+|.-|.|
T Consensus 362 s~LstgaIaGIsvavvvvVgglvGfLcWwf~crgk 396 (397)
T PF03302_consen 362 SGLSTGAIAGISVAVVVVVGGLVGFLCWWFICRGK 396 (397)
T ss_pred ccccccceeeeeehhHHHHHHHHHHHhhheeeccc
Confidence 34556677777777665533 44466666655543
No 333
>KOG3881|consensus
Probab=63.22 E-value=2e+02 Score=30.19 Aligned_cols=109 Identities=7% Similarity=0.029 Sum_probs=63.7
Q ss_pred cceEEEcCC-CCcEEEEccCCCCeEEEEecCCCCcEEEEeCCC--CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCce
Q psy6570 74 PYDIALEPL-SGRMFWTELGIKPRISGASIDGKNKFNLVDNNI--QWPTGITIDYPSQRLYWADPKARTIESINLNGKDR 150 (713)
Q Consensus 74 p~~iavD~~-~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~--~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~ 150 (713)
+.+|.+-+. ..+-|.|-...+ .+...+.. ..|+.++...+ +.-..+++.|....||+++. .+.+..||..+..
T Consensus 205 ~tdi~Fl~g~~~~~fat~T~~h-qvR~YDt~-~qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~-~g~l~~FD~r~~k- 280 (412)
T KOG3881|consen 205 ITDIRFLEGSPNYKFATITRYH-QVRLYDTR-HQRRPVAQFDFLENPISSTGLTPSGNFIYTGNT-KGQLAKFDLRGGK- 280 (412)
T ss_pred eccceecCCCCCceEEEEecce-eEEEecCc-ccCcceeEeccccCcceeeeecCCCcEEEEecc-cchhheecccCce-
Confidence 445555431 145566665555 66666666 34444443222 23467899999999999985 3456666665543
Q ss_pred eEEEecC-CCCccceeeeeeCCeEEEEeCCCCcEEEEc
Q psy6570 151 FVVYHTE-DNGYKPYKLEVFEDNLYFSTYRTNNILKIN 187 (713)
Q Consensus 151 ~~~~~~~-~~~~~p~~i~~~~~~ly~td~~~~~i~~~~ 187 (713)
++.... ..-..+.+|..+...=|++..+-.+..||.
T Consensus 281 -l~g~~~kg~tGsirsih~hp~~~~las~GLDRyvRIh 317 (412)
T KOG3881|consen 281 -LLGCGLKGITGSIRSIHCHPTHPVLASCGLDRYVRIH 317 (412)
T ss_pred -eeccccCCccCCcceEEEcCCCceEEeeccceeEEEe
Confidence 332211 123456777777775666666666666664
No 334
>KOG0319|consensus
Probab=63.21 E-value=1.4e+02 Score=33.90 Aligned_cols=134 Identities=14% Similarity=0.068 Sum_probs=72.4
Q ss_pred CceeEEEccCcccEE----ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcc-eEEEcC
Q psy6570 7 GNVTRVKREMNLKTV----LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPY-DIALEP 81 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~----~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~-~iavD~ 81 (713)
++|..++..+....+ .........+++++....||.+-. ...|.+.+++..............|. .|++||
T Consensus 40 d~Vi~idv~t~~~~l~s~~~ed~d~ita~~l~~d~~~L~~a~r----s~llrv~~L~tgk~irswKa~He~Pvi~ma~~~ 115 (775)
T KOG0319|consen 40 DRVIIIDVATGSIALPSGSNEDEDEITALALTPDEEVLVTASR----SQLLRVWSLPTGKLIRSWKAIHEAPVITMAFDP 115 (775)
T ss_pred ceEEEEEccCCceecccCCccchhhhheeeecCCccEEEEeec----cceEEEEEcccchHhHhHhhccCCCeEEEEEcC
Confidence 345555554433212 223344678899988777776553 56677777765432222222124454 699998
Q ss_pred CCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCe-EEEEcCCCCcEEEEeCC
Q psy6570 82 LSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQR-LYWADPKARTIESINLN 146 (713)
Q Consensus 82 ~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~-LY~~d~~~~~I~~~~~~ 146 (713)
.. .|. +..+...++.+-+..+.....-+...-.....|.+.+.-.+ |.++....+.|+..++.
T Consensus 116 ~g-~Ll-AtggaD~~v~VWdi~~~~~th~fkG~gGvVssl~F~~~~~~~lL~sg~~D~~v~vwnl~ 179 (775)
T KOG0319|consen 116 TG-TLL-ATGGADGRVKVWDIKNGYCTHSFKGHGGVVSSLLFHPHWNRWLLASGATDGTVRVWNLN 179 (775)
T ss_pred CC-ceE-EeccccceEEEEEeeCCEEEEEecCCCceEEEEEeCCccchhheeecCCCceEEEEEcc
Confidence 65 343 33344447777788776665554422233345566543322 33444455667776665
No 335
>KOG0275|consensus
Probab=63.19 E-value=1.3e+02 Score=30.49 Aligned_cols=90 Identities=9% Similarity=-0.004 Sum_probs=52.6
Q ss_pred eEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCC--CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC-CCceeE
Q psy6570 76 DIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNN--IQWPTGITIDYPSQRLYWADPKARTIESINLN-GKDRFV 152 (713)
Q Consensus 76 ~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~--~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~-g~~~~~ 152 (713)
.+.+-|.+-.-|+.-...+ .|+.+++.|+..+.+-+.. -.....-+++|.+.+||-+. ....++-+.+. |.-.+.
T Consensus 397 sv~~~PKnpeh~iVCNrsn-tv~imn~qGQvVrsfsSGkREgGdFi~~~lSpkGewiYcig-ED~vlYCF~~~sG~LE~t 474 (508)
T KOG0275|consen 397 SVILLPKNPEHFIVCNRSN-TVYIMNMQGQVVRSFSSGKREGGDFINAILSPKGEWIYCIG-EDGVLYCFSVLSGKLERT 474 (508)
T ss_pred eEEEcCCCCceEEEEcCCC-eEEEEeccceEEeeeccCCccCCceEEEEecCCCcEEEEEc-cCcEEEEEEeecCceeee
Confidence 4455555444344333345 7888899988877775432 22234567888889999765 34667777764 433333
Q ss_pred EEecCCCCccceeeeeeC
Q psy6570 153 VYHTEDNGYKPYKLEVFE 170 (713)
Q Consensus 153 ~~~~~~~~~~p~~i~~~~ 170 (713)
+... ...+.||+.+-
T Consensus 475 l~Vh---EkdvIGl~HHP 489 (508)
T KOG0275|consen 475 LPVH---EKDVIGLTHHP 489 (508)
T ss_pred eecc---cccccccccCc
Confidence 3222 23567777653
No 336
>COG4222 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=63.17 E-value=1.1e+02 Score=32.68 Aligned_cols=62 Identities=21% Similarity=0.098 Sum_probs=40.9
Q ss_pred CcceEEEcCCCCcEEEEcc-----CCCCeEEEEecCCCCcEEEEeC-------------CCCCCeeEEEeCCCCeEEEEc
Q psy6570 73 EPYDIALEPLSGRMFWTEL-----GIKPRISGASIDGKNKFNLVDN-------------NIQWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~-----~~~~~I~~~~~dG~~~~~l~~~-------------~~~~p~glavd~~~~~LY~~d 134 (713)
.+.++++.+....++++.. ...+-|++++++|+..+.+... +-..-.+||+.+..++||-+-
T Consensus 139 ~~~~ralt~~d~~~~s~~~~~igdefgP~l~~f~~~Gk~~~~~~~~~~~~~~~~p~g~~~n~gfEglait~d~~~L~~~l 218 (391)
T COG4222 139 DPEGRALTPADFDVESSQGAWIGDEFGPYLLEFDANGKLVRVLEVPVRFLPPDNPKGLRNNLGFEGLAITPDGKKLYALL 218 (391)
T ss_pred CchhhcccCCCcceeeccccccccccCcceEEECCCCccccccccccccCcCCCccccccccceeeEEecCCCceEEEEE
Confidence 3556666665555555543 3447899999999877665421 111346899999889999763
No 337
>KOG0301|consensus
Probab=62.89 E-value=1.5e+02 Score=33.42 Aligned_cols=127 Identities=12% Similarity=0.069 Sum_probs=58.1
Q ss_pred CeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeE
Q psy6570 51 NNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRL 130 (713)
Q Consensus 51 ~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~L 130 (713)
..|.+..+++.....++..--.+.-+++++ ..+.|.=..|....+||+..-=. ..+ ...-..-..++.= ....
T Consensus 81 ~~i~v~~~~~~~P~~~LkgH~snVC~ls~~-~~~~~iSgSWD~TakvW~~~~l~---~~l-~gH~asVWAv~~l--~e~~ 153 (745)
T KOG0301|consen 81 TTIIVFKLSQAEPLYTLKGHKSNVCSLSIG-EDGTLISGSWDSTAKVWRIGELV---YSL-QGHTASVWAVASL--PENT 153 (745)
T ss_pred ceEEEEecCCCCchhhhhccccceeeeecC-CcCceEecccccceEEecchhhh---ccc-CCcchheeeeeec--CCCc
Confidence 334444444433332222223455677776 34455555566555777653211 111 1111112233333 2338
Q ss_pred EEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcc
Q psy6570 131 YWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 131 Y~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~ 188 (713)
|++-.....|.... .|...+++... ..-..||++..+.-|.+-...+.|...+.
T Consensus 154 ~vTgsaDKtIklWk-~~~~l~tf~gH---tD~VRgL~vl~~~~flScsNDg~Ir~w~~ 207 (745)
T KOG0301|consen 154 YVTGSADKTIKLWK-GGTLLKTFSGH---TDCVRGLAVLDDSHFLSCSNDGSIRLWDL 207 (745)
T ss_pred EEeccCcceeeecc-CCchhhhhccc---hhheeeeEEecCCCeEeecCCceEEEEec
Confidence 88876655554433 35444444332 33446666654433444445555555554
No 338
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=62.81 E-value=1.8e+02 Score=29.42 Aligned_cols=134 Identities=13% Similarity=0.075 Sum_probs=65.9
Q ss_pred EEEEecCCceEEEEEcCCCCCcceEEEcCCCCc-EEEEccCCCCeEEEEecCCCCcEE-EEeCCCCCCee-EEEeCCCCe
Q psy6570 53 IMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGR-MFWTELGIKPRISGASIDGKNKFN-LVDNNIQWPTG-ITIDYPSQR 129 (713)
Q Consensus 53 I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~-ly~td~~~~~~I~~~~~dG~~~~~-l~~~~~~~p~g-lavd~~~~~ 129 (713)
+..++..|+.+..+... .+-.+|+++|...+ ++|+-. ...-.++++.++....+ +.+..-..-.| =.++++..+
T Consensus 51 ~a~~~eaGk~v~~~~lp--aR~Hgi~~~p~~~ravafARr-PGtf~~vfD~~~~~~pv~~~s~~~RHfyGHGvfs~dG~~ 127 (366)
T COG3490 51 AATLSEAGKIVFATALP--ARGHGIAFHPALPRAVAFARR-PGTFAMVFDPNGAQEPVTLVSQEGRHFYGHGVFSPDGRL 127 (366)
T ss_pred EEEEccCCceeeeeecc--cccCCeecCCCCcceEEEEec-CCceEEEECCCCCcCcEEEecccCceeecccccCCCCcE
Confidence 44455666554433211 45678888887665 455432 22245566776654433 33322222222 245667778
Q ss_pred EEEEcC----CCCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCC
Q psy6570 130 LYWADP----KARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNS 192 (713)
Q Consensus 130 LY~~d~----~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~ 192 (713)
||-++. ..+.|-.++.+-. ...+.+.......|..+.+. ++++.+.. ++.|..-..++..
T Consensus 128 LYATEndfd~~rGViGvYd~r~~-fqrvgE~~t~GiGpHev~lm~DGrtlvva--nGGIethpdfgR~ 192 (366)
T COG3490 128 LYATENDFDPNRGVIGVYDAREG-FQRVGEFSTHGIGPHEVTLMADGRTLVVA--NGGIETHPDFGRT 192 (366)
T ss_pred EEeecCCCCCCCceEEEEecccc-cceecccccCCcCcceeEEecCCcEEEEe--CCceecccccCcc
Confidence 888853 3345666666532 23333333334456555543 34444432 2334444334433
No 339
>KOG1408|consensus
Probab=62.75 E-value=1.5e+02 Score=33.59 Aligned_cols=96 Identities=10% Similarity=0.150 Sum_probs=57.5
Q ss_pred eeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCC-CCccceeeeeeCCeEEEEeCC-CCcEEEEcccCCCccee
Q psy6570 119 TGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTED-NGYKPYKLEVFEDNLYFSTYR-TNNILKINKFGNSDFNV 196 (713)
Q Consensus 119 ~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~-~~~~p~~i~~~~~~ly~td~~-~~~i~~~~~~~~~~~~~ 196 (713)
..++|||..+.+. +--....|..++.....++..+.... .-..++-+..+-.-||++..- ...+-.++-..+.-+..
T Consensus 600 YDm~Vdp~~k~v~-t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~atScsdktl~~~Df~sgEcvA~ 678 (1080)
T KOG1408|consen 600 YDMAVDPTSKLVV-TVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILDPSGIYLATSCSDKTLCFVDFVSGECVAQ 678 (1080)
T ss_pred EEeeeCCCcceEE-EEecccceEEEeccccceeeeecccccCCCceEEEEECCCccEEEEeecCCceEEEEeccchhhhh
Confidence 4678886554443 33334567788876555544444322 234566777776677877544 44566666666666666
Q ss_pred eeccccccccEEEEeeccc
Q psy6570 197 LANNLNRASDVLILQENKQ 215 (713)
Q Consensus 197 ~~~~~~~~~~i~v~~~~~q 215 (713)
+..+...++++++..+-+.
T Consensus 679 m~GHsE~VTG~kF~nDCkH 697 (1080)
T KOG1408|consen 679 MTGHSEAVTGVKFLNDCKH 697 (1080)
T ss_pred hcCcchheeeeeecccchh
Confidence 6666666777766655444
No 340
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=62.74 E-value=45 Score=39.10 Aligned_cols=99 Identities=13% Similarity=0.181 Sum_probs=60.1
Q ss_pred cCCeEEEeecCCCCCCeEEEEecCCceEEEE-EcCCCCCcceEEEcCCCCcEEE-EccCC---CCeEEEEecCC--CCcE
Q psy6570 36 VGKNLYWTDAGGRSSNNIMVSTLEGRKKRTL-LNTGLNEPYDIALEPLSGRMFW-TELGI---KPRISGASIDG--KNKF 108 (713)
Q Consensus 36 ~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l-~~~~~~~p~~iavD~~~~~ly~-td~~~---~~~I~~~~~dG--~~~~ 108 (713)
.++..|+++. .++|.++|.||...+++ +... .....-++.|..+.|-+ +.... .+.|++.+|+. +...
T Consensus 318 ~tkiAfv~~~----~~~L~~~D~dG~n~~~ve~~~~-~~i~sP~~SPDG~~vAY~ts~e~~~g~s~vYv~~L~t~~~~~v 392 (912)
T TIGR02171 318 KAKLAFRNDV----TGNLAYIDYTKGASRAVEIEDT-ISVYHPDISPDGKKVAFCTGIEGLPGKSSVYVRNLNASGSGLV 392 (912)
T ss_pred eeeEEEEEcC----CCeEEEEecCCCCceEEEecCC-CceecCcCCCCCCEEEEEEeecCCCCCceEEEEehhccCCCce
Confidence 4567787775 35999999999887766 5432 22233456666666655 44333 56799988874 3433
Q ss_pred EEEeCCCCCCeeEEEeC-CCCeEEEEcCCCCc
Q psy6570 109 NLVDNNIQWPTGITIDY-PSQRLYWADPKART 139 (713)
Q Consensus 109 ~l~~~~~~~p~glavd~-~~~~LY~~d~~~~~ 139 (713)
.|--.+..-|.--.+.. ++-.+||+|..+++
T Consensus 393 kl~ve~aaiprwrv~e~gdt~ivyv~~a~nn~ 424 (912)
T TIGR02171 393 KLPVENAAIPRWRVLENGDTVIVYVSDASNNK 424 (912)
T ss_pred EeecccccccceEecCCCCeEEEEEcCCCCCc
Confidence 44333445565555541 23368999876654
No 341
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=62.05 E-value=12 Score=24.89 Aligned_cols=20 Identities=20% Similarity=0.356 Sum_probs=16.2
Q ss_pred CCcceEEEcCCCCcEEEEccC
Q psy6570 72 NEPYDIALEPLSGRMFWTELG 92 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~ 92 (713)
..+.+|++|+ .|.+|++-..
T Consensus 13 ~~~~~IavD~-~GNiYv~G~T 32 (38)
T PF06739_consen 13 DYGNGIAVDS-NGNIYVTGYT 32 (38)
T ss_pred eeEEEEEECC-CCCEEEEEee
Confidence 5699999995 7889998643
No 342
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=61.71 E-value=3.8 Score=35.07 Aligned_cols=10 Identities=10% Similarity=0.056 Sum_probs=0.0
Q ss_pred heeeEEEEec
Q psy6570 702 IGYYIFRIKM 711 (713)
Q Consensus 702 ~~~~~~~~~~ 711 (713)
++++|.|||+
T Consensus 99 lv~rrcrrr~ 108 (129)
T PF12191_consen 99 LVWRRCRRRE 108 (129)
T ss_dssp ----------
T ss_pred HHHhhhhccc
Confidence 4444444544
No 343
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=61.00 E-value=3e+02 Score=31.39 Aligned_cols=150 Identities=9% Similarity=0.069 Sum_probs=72.9
Q ss_pred CCCCceEEEeccCCeEEEeecCC-CCCCeEEEEecCCceEEEEEcCCCC-CcceEEEcCCCCcEEEEccC---CCCeEEE
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGG-RSSNNIMVSTLEGRKKRTLLNTGLN-EPYDIALEPLSGRMFWTELG---IKPRISG 99 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~-~~~~~I~~~~~~G~~~~~l~~~~~~-~p~~iavD~~~~~ly~td~~---~~~~I~~ 99 (713)
+..-.++++.+.++.|-|+-... ...-.|.+.+|..... ....+. .-.+++....+..||++... ...+|++
T Consensus 128 f~~Lg~~~~s~D~~~la~s~D~~G~e~y~lr~kdL~tg~~---~~d~i~~~~~~~~Wa~d~~~lfYt~~d~~~rp~kv~~ 204 (682)
T COG1770 128 FFSLGAASISPDHNLLAYSVDVLGDEQYTLRFKDLATGEE---LPDEITNTSGSFAWAADGKTLFYTRLDENHRPDKVWR 204 (682)
T ss_pred ceeeeeeeeCCCCceEEEEEecccccEEEEEEEecccccc---cchhhcccccceEEecCCCeEEEEEEcCCCCcceEEE
Confidence 33445677777777777654321 1122344455443221 111122 24456666666777777433 2247888
Q ss_pred EecCC--CCcEEEEeC-CCCCCeeEEEeCCCCeEEEEc--CCCCcEEEEeCCCC--ceeEEEecCCCCccceeeeeeCCe
Q psy6570 100 ASIDG--KNKFNLVDN-NIQWPTGITIDYPSQRLYWAD--PKARTIESINLNGK--DRFVVYHTEDNGYKPYKLEVFEDN 172 (713)
Q Consensus 100 ~~~dG--~~~~~l~~~-~~~~p~glavd~~~~~LY~~d--~~~~~I~~~~~~g~--~~~~~~~~~~~~~~p~~i~~~~~~ 172 (713)
..+++ +.-+.|... +-..-.++--...+..|++.- ..+..|+.++.+-. ..+++.... ...-+.++..+++
T Consensus 205 h~~gt~~~~d~lvyeE~d~~f~~~v~~s~s~~yi~i~~~~~~tsE~~ll~a~~p~~~p~vv~pr~--~g~eY~~eh~~d~ 282 (682)
T COG1770 205 HRLGTPGSSDELVYEEKDDRFFLSVGRSRSEAYIVISLGSHITSEVRLLDADDPEAEPKVVLPRE--NGVEYSVEHGGDR 282 (682)
T ss_pred EecCCCCCcceEEEEcCCCcEEEEeeeccCCceEEEEcCCCcceeEEEEecCCCCCceEEEEEcC--CCcEEeeeecCcE
Confidence 88877 444445432 212222333333455666653 45566777776543 233333221 1223444445666
Q ss_pred EEEEeCC
Q psy6570 173 LYFSTYR 179 (713)
Q Consensus 173 ly~td~~ 179 (713)
+|+....
T Consensus 283 f~i~sN~ 289 (682)
T COG1770 283 FYILSNA 289 (682)
T ss_pred EEEEecC
Confidence 6665433
No 344
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=60.97 E-value=1.6e+02 Score=28.39 Aligned_cols=152 Identities=13% Similarity=0.163 Sum_probs=79.0
Q ss_pred cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEc-CCCCCcceEEEcCCCCcEEEE------------
Q psy6570 23 SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLN-TGLNEPYDIALEPLSGRMFWT------------ 89 (713)
Q Consensus 23 ~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~-~~~~~p~~iavD~~~~~ly~t------------ 89 (713)
+.-..|..+.-- ..+.||++ . ...+|.++++..+....+.. ..+++...|+....+++|--.
T Consensus 15 ~~~~EP~~~c~~-g~d~Lfva-~---~g~~Vev~~l~~~~~~~~~~F~Tv~~V~~l~y~~~GDYlvTlE~k~~~~~~~fv 89 (215)
T PF14761_consen 15 PCEQEPTAVCCG-GPDALFVA-A---SGCKVEVYDLEQEECPLLCTFSTVGRVLQLVYSEAGDYLVTLEEKNKRSPVDFV 89 (215)
T ss_pred ccccCcceeecc-CCceEEEE-c---CCCEEEEEEcccCCCceeEEEcchhheeEEEeccccceEEEEEeecCCccceEE
Confidence 333378776553 23789998 4 46789999988544333322 223444455544433333221
Q ss_pred ----ccCCC---CeEEEEecCC-----------CCcEEEEeCCCC-CCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC--
Q psy6570 90 ----ELGIK---PRISGASIDG-----------KNKFNLVDNNIQ-WPTGITIDYPSQRLYWADPKARTIESINLNGK-- 148 (713)
Q Consensus 90 ----d~~~~---~~I~~~~~dG-----------~~~~~l~~~~~~-~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~-- 148 (713)
+|... ..-.++.+-| +....++...+. .|.-||-=+.+|.|.++-...-.|+++...-.
T Consensus 90 R~Y~NWr~~~~~~~~v~vRiaG~~v~~~~~~~~~~qleiiElPl~~~p~ciaCC~~tG~LlVg~~~~l~lf~l~~~~~~~ 169 (215)
T PF14761_consen 90 RAYFNWRSQKEENSPVRVRIAGHRVTPSFNESSKDQLEIIELPLSEPPLCIACCPVTGNLLVGCGNKLVLFTLKYQTIQS 169 (215)
T ss_pred EEEEEhhhhcccCCcEEEEEcccccccCCCCccccceEEEEecCCCCCCEEEecCCCCCEEEEcCCEEEEEEEEEEEEec
Confidence 22210 0222333434 112333433443 78889999999999998766555665543222
Q ss_pred ceeEEEecCC------CCccceeeeeeCCeEEEEeCC
Q psy6570 149 DRFVVYHTED------NGYKPYKLEVFEDNLYFSTYR 179 (713)
Q Consensus 149 ~~~~~~~~~~------~~~~p~~i~~~~~~ly~td~~ 179 (713)
....+..... ....|..+++-+++|-+.+..
T Consensus 170 ~~~~~lDFe~~l~~~~~~~~p~~v~ic~~yiA~~s~~ 206 (215)
T PF14761_consen 170 EKFSFLDFERSLIDHIDNFKPTQVAICEGYIAVMSDL 206 (215)
T ss_pred ccccEEechhhhhheecCceEEEEEEEeeEEEEecCC
Confidence 1111111110 123577778888866665433
No 345
>KOG0269|consensus
Probab=60.72 E-value=1e+02 Score=35.14 Aligned_cols=101 Identities=10% Similarity=0.061 Sum_probs=66.0
Q ss_pred CCCeEEEEecCCceEEEE---EcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeC
Q psy6570 49 SSNNIMVSTLEGRKKRTL---LNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDY 125 (713)
Q Consensus 49 ~~~~I~~~~~~G~~~~~l---~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~ 125 (713)
.++.|.+.+++-..+..+ +.+--...+-+.+.+..-.|.++.+... .|...++..+.-...+..+......+++.+
T Consensus 108 ~nG~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSGSQDg-~vK~~DlR~~~S~~t~~~nSESiRDV~fsp 186 (839)
T KOG0269|consen 108 TNGVISVWDLNKSIRNKLLTVFNEHERSANKLDFHSTEPNILISGSQDG-TVKCWDLRSKKSKSTFRSNSESIRDVKFSP 186 (839)
T ss_pred CCCcEEEEecCccccchhhhHhhhhccceeeeeeccCCccEEEecCCCc-eEEEEeeecccccccccccchhhhceeecc
Confidence 356677777665332222 2222344666777777777777765433 666667665554444444566678899999
Q ss_pred CCCeEEEEcCCCCcEEEEeCCCCce
Q psy6570 126 PSQRLYWADPKARTIESINLNGKDR 150 (713)
Q Consensus 126 ~~~~LY~~d~~~~~I~~~~~~g~~~ 150 (713)
..+..|++-..++.++..|+.-..+
T Consensus 187 ~~~~~F~s~~dsG~lqlWDlRqp~r 211 (839)
T KOG0269|consen 187 GYGNKFASIHDSGYLQLWDLRQPDR 211 (839)
T ss_pred CCCceEEEecCCceEEEeeccCchh
Confidence 9999999988888888888755443
No 346
>KOG2315|consensus
Probab=60.72 E-value=2.7e+02 Score=30.76 Aligned_cols=120 Identities=9% Similarity=0.017 Sum_probs=69.9
Q ss_pred CceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcc-eEEEcCCCC
Q psy6570 7 GNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPY-DIALEPLSG 84 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~-~iavD~~~~ 84 (713)
..++.+..+|....+ +..-.-.+.+.+.+.+...-+.-.- ...++.+++++|..+..+. ..|+ .|-+.|..+
T Consensus 251 q~Lyll~t~g~s~~V~L~k~GPVhdv~W~~s~~EF~VvyGf--MPAkvtifnlr~~~v~df~----egpRN~~~fnp~g~ 324 (566)
T KOG2315|consen 251 QTLYLLATQGESVSVPLLKEGPVHDVTWSPSGREFAVVYGF--MPAKVTIFNLRGKPVFDFP----EGPRNTAFFNPHGN 324 (566)
T ss_pred ceEEEEEecCceEEEecCCCCCceEEEECCCCCEEEEEEec--ccceEEEEcCCCCEeEeCC----CCCccceEECCCCC
Confidence 445566666543333 2222223556666554444333331 4567888999987654433 2343 477787777
Q ss_pred cEEEEccCCC-CeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc
Q psy6570 85 RMFWTELGIK-PRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 85 ~ly~td~~~~-~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d 134 (713)
.|.++..++- +.|++.+... ++.+..-.....+-..+.|++.+++.+.
T Consensus 325 ii~lAGFGNL~G~mEvwDv~n--~K~i~~~~a~~tt~~eW~PdGe~flTAT 373 (566)
T KOG2315|consen 325 IILLAGFGNLPGDMEVWDVPN--RKLIAKFKAANTTVFEWSPDGEYFLTAT 373 (566)
T ss_pred EEEEeecCCCCCceEEEeccc--hhhccccccCCceEEEEcCCCcEEEEEe
Confidence 7778877742 5677766543 4445444455566778888777777665
No 347
>KOG0308|consensus
Probab=60.67 E-value=79 Score=35.29 Aligned_cols=111 Identities=13% Similarity=0.133 Sum_probs=61.6
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceE-EEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKK-RTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~-~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
.+.+.|.++..+.++. +-+ +.+.|.+-+|.-+.- .++... -...+++..++.-..+|..+... .|++.++..
T Consensus 214 dNVr~ll~~dDGt~~l-s~s---SDgtIrlWdLgqQrCl~T~~vH-~e~VWaL~~~~sf~~vYsG~rd~--~i~~Tdl~n 286 (735)
T KOG0308|consen 214 DNVRVLLVNDDGTRLL-SAS---SDGTIRLWDLGQQRCLATYIVH-KEGVWALQSSPSFTHVYSGGRDG--NIYRTDLRN 286 (735)
T ss_pred cceEEEEEcCCCCeEe-ecC---CCceEEeeeccccceeeeEEec-cCceEEEeeCCCcceEEecCCCC--cEEecccCC
Confidence 3345555554444443 333 356666666654332 122211 23478888888888888877654 499999988
Q ss_pred CCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEe
Q psy6570 105 KNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESIN 144 (713)
Q Consensus 105 ~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~ 144 (713)
....+++-..-+.-..|.++..+..+ |+-.....|.+-.
T Consensus 287 ~~~~tlick~daPv~~l~~~~~~~~~-WvtTtds~I~rW~ 325 (735)
T KOG0308|consen 287 PAKSTLICKEDAPVLKLHLHEHDNSV-WVTTTDSSIKRWK 325 (735)
T ss_pred chhheEeecCCCchhhhhhccccCCc-eeeeccccceecC
Confidence 65555553333333445565334444 5555555555543
No 348
>KOG2321|consensus
Probab=60.43 E-value=93 Score=34.28 Aligned_cols=116 Identities=13% Similarity=0.127 Sum_probs=64.4
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecC-CceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE-GRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNK 107 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~-G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~ 107 (713)
+.|.++..+--||++-. ...|+|++|+ |.....+-. .....+.+.|.+.+++|-+-. . .+.++-.++.-..+
T Consensus 137 RDm~y~~~scDly~~gs----g~evYRlNLEqGrfL~P~~~-~~~~lN~v~in~~hgLla~Gt-~-~g~VEfwDpR~ksr 209 (703)
T KOG2321|consen 137 RDMKYHKPSCDLYLVGS----GSEVYRLNLEQGRFLNPFET-DSGELNVVSINEEHGLLACGT-E-DGVVEFWDPRDKSR 209 (703)
T ss_pred ccccccCCCccEEEeec----CcceEEEEcccccccccccc-ccccceeeeecCccceEEecc-c-CceEEEecchhhhh
Confidence 56777766667887765 4679999986 555544432 235566788887777664421 1 11444444433322
Q ss_pred EEEE-----------eCCCCCCeeEEEeCCCCeEEEE-cCCCCcEEEEeCCCCceeEE
Q psy6570 108 FNLV-----------DNNIQWPTGITIDYPSQRLYWA-DPKARTIESINLNGKDRFVV 153 (713)
Q Consensus 108 ~~l~-----------~~~~~~p~glavd~~~~~LY~~-d~~~~~I~~~~~~g~~~~~~ 153 (713)
...+ ......++.|.|+ ++-|-++ -..++.|+..|+-.+....+
T Consensus 210 v~~l~~~~~v~s~pg~~~~~svTal~F~--d~gL~~aVGts~G~v~iyDLRa~~pl~~ 265 (703)
T KOG2321|consen 210 VGTLDAASSVNSHPGGDAAPSVTALKFR--DDGLHVAVGTSTGSVLIYDLRASKPLLV 265 (703)
T ss_pred heeeecccccCCCccccccCcceEEEec--CCceeEEeeccCCcEEEEEcccCCceee
Confidence 2222 1223456777777 2233333 34566777777766554444
No 349
>KOG1036|consensus
Probab=60.41 E-value=2e+02 Score=29.22 Aligned_cols=148 Identities=14% Similarity=0.125 Sum_probs=79.7
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN 106 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~ 106 (713)
....|.|++.++.|.++-| .+.+..++......+..+..+ .-....++-. ...+|..+.. + .|.++++++..
T Consensus 15 ~IS~v~f~~~~~~LLvssW----DgslrlYdv~~~~l~~~~~~~-~plL~c~F~d-~~~~~~G~~d-g-~vr~~Dln~~~ 86 (323)
T KOG1036|consen 15 GISSVKFSPSSSDLLVSSW----DGSLRLYDVPANSLKLKFKHG-APLLDCAFAD-ESTIVTGGLD-G-QVRRYDLNTGN 86 (323)
T ss_pred ceeeEEEcCcCCcEEEEec----cCcEEEEeccchhhhhheecC-CceeeeeccC-CceEEEeccC-c-eEEEEEecCCc
Confidence 3467888888888998888 466666666665444333322 1122444432 2344444432 3 68888888877
Q ss_pred cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEE
Q psy6570 107 KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKI 186 (713)
Q Consensus 107 ~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~ 186 (713)
...+.. .-.....|...+..+ ..++-...++|...|.-. ...+.... ....-+.+++-++.|.+.. ....|..+
T Consensus 87 ~~~igt-h~~~i~ci~~~~~~~-~vIsgsWD~~ik~wD~R~--~~~~~~~d-~~kkVy~~~v~g~~LvVg~-~~r~v~iy 160 (323)
T KOG1036|consen 87 EDQIGT-HDEGIRCIEYSYEVG-CVISGSWDKTIKFWDPRN--KVVVGTFD-QGKKVYCMDVSGNRLVVGT-SDRKVLIY 160 (323)
T ss_pred ceeecc-CCCceEEEEeeccCC-eEEEcccCccEEEEeccc--cccccccc-cCceEEEEeccCCEEEEee-cCceEEEE
Confidence 666543 333345666664444 445555666777766543 11111111 1224456666677666632 22334444
Q ss_pred cc
Q psy6570 187 NK 188 (713)
Q Consensus 187 ~~ 188 (713)
+.
T Consensus 161 DL 162 (323)
T KOG1036|consen 161 DL 162 (323)
T ss_pred Ec
Confidence 43
No 350
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=59.87 E-value=2.8e+02 Score=30.79 Aligned_cols=32 Identities=13% Similarity=0.120 Sum_probs=21.1
Q ss_pred eEEEeCCCCeEEEEcCC-----------------CCcEEEEeCCCCcee
Q psy6570 120 GITIDYPSQRLYWADPK-----------------ARTIESINLNGKDRF 151 (713)
Q Consensus 120 glavd~~~~~LY~~d~~-----------------~~~I~~~~~~g~~~~ 151 (713)
..++|+..++||+.... .++|+.++++...+.
T Consensus 221 ~pa~d~~~g~V~vg~~~g~~~~~~~~~~~~~~~~~~~l~Ald~~tG~~~ 269 (488)
T cd00216 221 SPTYDPKTNLVYVGTGNGSPWNWGGRRTPGDNLYTDSIVALDADTGKVK 269 (488)
T ss_pred CeeEeCCCCEEEEECCCCCCCccCCccCCCCCCceeeEEEEcCCCCCEE
Confidence 46788778889988532 136888887654443
No 351
>KOG0283|consensus
Probab=59.14 E-value=2.9e+02 Score=31.86 Aligned_cols=111 Identities=14% Similarity=0.100 Sum_probs=63.7
Q ss_pred CCcceEEEcCCCCcEEEEccCCC-CeEEEEecCCCCcEEEEeCCC-CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIK-PRISGASIDGKNKFNLVDNNI-QWPTGITIDYPSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~-~~I~~~~~dG~~~~~l~~~~~-~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
.....|++.|.+.+-|.+..-.. -|||-+. ..+++.-.++ ...+++++-|+ |...+.-..++..+.++..|..
T Consensus 410 dfVTcVaFnPvDDryFiSGSLD~KvRiWsI~----d~~Vv~W~Dl~~lITAvcy~Pd-Gk~avIGt~~G~C~fY~t~~lk 484 (712)
T KOG0283|consen 410 DFVTCVAFNPVDDRYFISGSLDGKVRLWSIS----DKKVVDWNDLRDLITAVCYSPD-GKGAVIGTFNGYCRFYDTEGLK 484 (712)
T ss_pred CeeEEEEecccCCCcEeecccccceEEeecC----cCeeEeehhhhhhheeEEeccC-CceEEEEEeccEEEEEEccCCe
Confidence 56789999999999999875432 2444442 2333332232 35678888876 5555555666777666666643
Q ss_pred eeE----EEecC--CCCccceeeeee---CCeEEEEeCCCCcEEEEcc
Q psy6570 150 RFV----VYHTE--DNGYKPYKLEVF---EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 150 ~~~----~~~~~--~~~~~p~~i~~~---~~~ly~td~~~~~i~~~~~ 188 (713)
... -.... .....-.||.++ -+.|.||. ...+|+.++.
T Consensus 485 ~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTS-nDSrIRI~d~ 531 (712)
T KOG0283|consen 485 LVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVLVTS-NDSRIRIYDG 531 (712)
T ss_pred EEEeeeEeeccCccccCceeeeeEecCCCCCeEEEec-CCCceEEEec
Confidence 221 11111 112234666665 23577774 4456777765
No 352
>KOG4649|consensus
Probab=58.76 E-value=2e+02 Score=28.64 Aligned_cols=34 Identities=9% Similarity=0.029 Sum_probs=21.7
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEE
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTL 66 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l 66 (713)
.+.++++..+.||++-. .+++.+.+.+.....++
T Consensus 139 ~sP~i~~g~~sly~a~t----~G~vlavt~~~~~~~~~ 172 (354)
T KOG4649|consen 139 VSPVIAPGDGSLYAAIT----AGAVLAVTKNPYSSTEF 172 (354)
T ss_pred ccceecCCCceEEEEec----cceEEEEccCCCCccee
Confidence 34567777788888875 56677766655443444
No 353
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=58.47 E-value=7.6 Score=24.93 Aligned_cols=25 Identities=24% Similarity=0.623 Sum_probs=16.0
Q ss_pred CCCCCCCCCeeeccCCCceeeeCCCCcc
Q psy6570 224 DDKPCHQSALCINLPSSHTCLCPDHLTE 251 (713)
Q Consensus 224 ~~~~C~~~~~C~~~~g~~~C~C~~G~~~ 251 (713)
....| .+.|-... ...|.||+||..
T Consensus 4 n~t~C--pA~CDpn~-~~~C~CPeGyIl 28 (34)
T PF09064_consen 4 NQTEC--PADCDPNS-PGQCFCPEGYIL 28 (34)
T ss_pred ccccC--CCccCCCC-CCceeCCCceEe
Confidence 33445 45664432 358999999984
No 354
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=57.88 E-value=1.9e+02 Score=34.20 Aligned_cols=100 Identities=10% Similarity=0.112 Sum_probs=52.8
Q ss_pred eEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCC-CCcEEEEeCCCCceeEEE
Q psy6570 76 DIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPK-ARTIESINLNGKDRFVVY 154 (713)
Q Consensus 76 ~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~-~~~I~~~~~~g~~~~~~~ 154 (713)
.+++|+..+.+||--.+..+.++ |..|+.. .++..-.=+|||..++++-|.-.. .+-++ |+|.....+|.
T Consensus 379 ~~s~D~~~glvy~ptGn~~pd~~-----g~~r~~~--~n~y~~slvALD~~TGk~~W~~Q~~~hD~W--D~D~~~~p~L~ 449 (764)
T TIGR03074 379 VASYDEKLGLVYLPMGNQTPDQW-----GGDRTPA--DEKYSSSLVALDATTGKERWVFQTVHHDLW--DMDVPAQPSLV 449 (764)
T ss_pred ceEEcCCCCeEEEeCCCcccccc-----CCccccC--cccccceEEEEeCCCCceEEEecccCCccc--cccccCCceEE
Confidence 58999999999995533221122 3233211 133344568999999999997643 44455 34444443443
Q ss_pred ecCC--CCccceeeee-eCCeEEEEeCCCCcEE
Q psy6570 155 HTED--NGYKPYKLEV-FEDNLYFSTYRTNNIL 184 (713)
Q Consensus 155 ~~~~--~~~~p~~i~~-~~~~ly~td~~~~~i~ 184 (713)
.... ....|.-+.. ..+++|+.|..+++..
T Consensus 450 d~~~~~G~~~~~v~~~~K~G~~~vlDr~tG~~l 482 (764)
T TIGR03074 450 DLPDADGTTVPALVAPTKQGQIYVLDRRTGEPI 482 (764)
T ss_pred eeecCCCcEeeEEEEECCCCEEEEEECCCCCEE
Confidence 2211 1112221222 2557777777665543
No 355
>KOG1273|consensus
Probab=57.74 E-value=2.3e+02 Score=29.01 Aligned_cols=69 Identities=7% Similarity=0.031 Sum_probs=34.9
Q ss_pred cceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEe-CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 74 PYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVD-NNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 74 p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~-~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
+.-..+|+...+||.-. +. +.|.+.+.+......-+. +.......|.|.. .++-+......+.|+.++.
T Consensus 156 as~~~fdr~g~yIitGt-sK-Gkllv~~a~t~e~vas~rits~~~IK~I~~s~-~g~~liiNtsDRvIR~ye~ 225 (405)
T KOG1273|consen 156 ASHGVFDRRGKYIITGT-SK-GKLLVYDAETLECVASFRITSVQAIKQIIVSR-KGRFLIINTSDRVIRTYEI 225 (405)
T ss_pred cccccccCCCCEEEEec-Cc-ceEEEEecchheeeeeeeechheeeeEEEEec-cCcEEEEecCCceEEEEeh
Confidence 33446776555555433 32 367777766543222111 1123445677773 4555555555555666553
No 356
>KOG3545|consensus
Probab=57.61 E-value=2e+02 Score=28.38 Aligned_cols=103 Identities=20% Similarity=0.252 Sum_probs=55.2
Q ss_pred CCeEEEeecCCCCCCeEEEEecCCceEE--EEEc-CCCCCc----------ceEEEcCCCCc--EEEEccCCCCeEEEEe
Q psy6570 37 GKNLYWTDAGGRSSNNIMVSTLEGRKKR--TLLN-TGLNEP----------YDIALEPLSGR--MFWTELGIKPRISGAS 101 (713)
Q Consensus 37 ~~~ly~td~~~~~~~~I~~~~~~G~~~~--~l~~-~~~~~p----------~~iavD~~~~~--ly~td~~~~~~I~~~~ 101 (713)
++.+|.-.. ....|.++++....+. .++. .+...+ .++|+|. +|+ ||-|. ++.+.|..+.
T Consensus 77 nGs~yynk~---~t~~ivky~l~~~~~~~~~~lp~a~y~~~~~y~~~g~sdiD~avDE-~GLWviYat~-~~~g~iv~sk 151 (249)
T KOG3545|consen 77 NGSLYYNKA---GTRNIIKYDLETRTVAGSAALPYAGYHNPSPYYWGGHSDIDLAVDE-NGLWVIYATP-ENAGTIVLSK 151 (249)
T ss_pred cceEEeecc---CCcceEEEEeecceeeeeeeccccccCCCcccccCCCccccceecc-cceeEEeccc-ccCCcEEeec
Confidence 467777665 5677889998873322 2221 223334 5899994 554 23333 3333666677
Q ss_pred cCCCCcEEEE--eCCCC---CCeeEEEeCCCCeEEEEcCCCC---cE-EEEeCCC
Q psy6570 102 IDGKNKFNLV--DNNIQ---WPTGITIDYPSQRLYWADPKAR---TI-ESINLNG 147 (713)
Q Consensus 102 ~dG~~~~~l~--~~~~~---~p~glavd~~~~~LY~~d~~~~---~I-~~~~~~g 147 (713)
+|-...++.. .+.+. ..+++.|= +.||+.++.+. .| +.++...
T Consensus 152 Ldp~tl~~e~tW~T~~~k~~~~~aF~iC---GvLY~v~S~~~~~~~i~yaydt~~ 203 (249)
T KOG3545|consen 152 LDPETLEVERTWNTTLPKRSAGNAFMIC---GVLYVVHSYNCTHTQISYAYDTTT 203 (249)
T ss_pred cCHHHhheeeeeccccCCCCcCceEEEe---eeeEEEeccccCCceEEEEEEcCC
Confidence 7753322222 12222 23455554 78999886432 34 4566553
No 357
>KOG0306|consensus
Probab=57.45 E-value=3.2e+02 Score=31.48 Aligned_cols=109 Identities=11% Similarity=0.157 Sum_probs=57.2
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCC-eeEEEeCCCCeEEEEcCCCC--cEEEEeCCCC
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWP-TGITIDYPSQRLYWADPKAR--TIESINLNGK 148 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p-~glavd~~~~~LY~~d~~~~--~I~~~~~~g~ 148 (713)
.....+++.|..++|-++-- .+ .+.++.+|.-....-+. +-..| ..|-|.++. .|.++-+... +||-.++-.=
T Consensus 509 ddvL~v~~Spdgk~LaVsLL-dn-TVkVyflDtlKFflsLY-GHkLPV~smDIS~DS-klivTgSADKnVKiWGLdFGDC 584 (888)
T KOG0306|consen 509 DDVLCVSVSPDGKLLAVSLL-DN-TVKVYFLDTLKFFLSLY-GHKLPVLSMDISPDS-KLIVTGSADKNVKIWGLDFGDC 584 (888)
T ss_pred ccEEEEEEcCCCcEEEEEec-cC-eEEEEEecceeeeeeec-ccccceeEEeccCCc-CeEEeccCCCceEEeccccchh
Confidence 34567888876665555543 23 57777777543332222 11222 345555444 4555554333 4565554332
Q ss_pred ceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcc
Q psy6570 149 DRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 149 ~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~ 188 (713)
.+..++.. ..-+.+.+- +.++||+-...+.|.+.+.
T Consensus 585 HKS~fAHd----DSvm~V~F~P~~~~FFt~gKD~kvKqWDg 621 (888)
T KOG0306|consen 585 HKSFFAHD----DSVMSVQFLPKTHLFFTCGKDGKVKQWDG 621 (888)
T ss_pred hhhhhccc----CceeEEEEcccceeEEEecCcceEEeech
Confidence 33333211 122344433 6789999888888877753
No 358
>KOG0918|consensus
Probab=57.15 E-value=2e+02 Score=30.35 Aligned_cols=30 Identities=20% Similarity=0.211 Sum_probs=24.2
Q ss_pred CeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 118 PTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 118 p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
.+.|-|+.+..+||++.+-.+-|+.+++..
T Consensus 314 ITDilISmDDRFLYvs~WLHGDirQYdIsD 343 (476)
T KOG0918|consen 314 ITDILISLDDRFLYVSNWLHGDIRQYDISD 343 (476)
T ss_pred hheeEEeecCcEEEEEeeeecceeeeccCC
Confidence 356777777999999999999998888543
No 359
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=57.14 E-value=1.9 Score=36.06 Aligned_cols=25 Identities=28% Similarity=0.328 Sum_probs=1.6
Q ss_pred hHHHHHHHHHHHHHHhheeeEEEEe
Q psy6570 686 ISSILILILLLITVGGIGYYIFRIK 710 (713)
Q Consensus 686 ~~~~~~~~~~~~~~~~~~~~~~~~~ 710 (713)
.++|.+++|||.+|+++..|..|||
T Consensus 26 AaGIGiL~VILgiLLliGCWYckRR 50 (118)
T PF14991_consen 26 AAGIGILIVILGILLLIGCWYCKRR 50 (118)
T ss_dssp --SSS--------------------
T ss_pred hccceeHHHHHHHHHHHhheeeeec
Confidence 4455555555544444444544443
No 360
>KOG0292|consensus
Probab=56.72 E-value=2.6e+02 Score=32.83 Aligned_cols=154 Identities=10% Similarity=0.017 Sum_probs=78.9
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC-CceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEec
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE-GRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASI 102 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~-G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~ 102 (713)
.-.+..||+|.|..-.|..+-. .+.|+.-|.. |.....+. +--.-.+||++.| .+-||++....- +|.+-+.
T Consensus 8 kSsRvKglsFHP~rPwILtslH----sG~IQlWDYRM~tli~rFd-eHdGpVRgv~FH~-~qplFVSGGDDy-kIkVWnY 80 (1202)
T KOG0292|consen 8 KSSRVKGLSFHPKRPWILTSLH----SGVIQLWDYRMGTLIDRFD-EHDGPVRGVDFHP-TQPLFVSGGDDY-KIKVWNY 80 (1202)
T ss_pred ccccccceecCCCCCEEEEeec----CceeeeehhhhhhHHhhhh-ccCCccceeeecC-CCCeEEecCCcc-EEEEEec
Confidence 3457789999998777776654 5777776643 22222111 1224567999986 678999864333 6666555
Q ss_pred CCCCcEEEE--eCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeC-CeEEEEeCC
Q psy6570 103 DGKNKFNLV--DNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFE-DNLYFSTYR 179 (713)
Q Consensus 103 dG~~~~~l~--~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~-~~ly~td~~ 179 (713)
+.. +.++ ...+.+..-+.|.+.--+|.-+ +....|+.-+....+-..++.. .-+..+.-.++. +.+.++.+-
T Consensus 81 k~r--rclftL~GHlDYVRt~~FHheyPWIlSA-SDDQTIrIWNwqsr~~iavltG--HnHYVMcAqFhptEDlIVSaSL 155 (1202)
T KOG0292|consen 81 KTR--RCLFTLLGHLDYVRTVFFHHEYPWILSA-SDDQTIRIWNWQSRKCIAVLTG--HNHYVMCAQFHPTEDLIVSASL 155 (1202)
T ss_pred ccc--eehhhhccccceeEEeeccCCCceEEEc-cCCCeEEEEeccCCceEEEEec--CceEEEeeccCCccceEEEecc
Confidence 432 2222 2234455555666544444433 2233444444333222222222 122333333332 345566555
Q ss_pred CCcEEEEccc
Q psy6570 180 TNNILKINKF 189 (713)
Q Consensus 180 ~~~i~~~~~~ 189 (713)
..+|+..+..
T Consensus 156 DQTVRVWDis 165 (1202)
T KOG0292|consen 156 DQTVRVWDIS 165 (1202)
T ss_pred cceEEEEeec
Confidence 5556555543
No 361
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=56.71 E-value=9.3 Score=39.52 Aligned_cols=10 Identities=20% Similarity=-0.107 Sum_probs=4.6
Q ss_pred CCCcEEEEcc
Q psy6570 82 LSGRMFWTEL 91 (713)
Q Consensus 82 ~~~~ly~td~ 91 (713)
..|...|.+.
T Consensus 107 ~~g~y~V~~~ 116 (306)
T PF01299_consen 107 SVGTYSVTNG 116 (306)
T ss_pred ccceEEEECC
Confidence 3444445553
No 362
>PF15416 DUF4623: Domain of unknown function (DUF4623)
Probab=56.65 E-value=1.3e+02 Score=30.83 Aligned_cols=158 Identities=13% Similarity=0.153 Sum_probs=0.0
Q ss_pred EEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEE
Q psy6570 32 AVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLV 111 (713)
Q Consensus 32 a~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~ 111 (713)
.+|..++.+| .+. ....-.-.+.||+.+.++.+.. ..|+=|-|+ |.... .|..+.+|-+..
T Consensus 119 vY~Fsgd~iY-~~f---~~~lTR~a~fDGe~VLvvsR~~-~~pHLLkvs---------dLK~g-~inpI~LdlTgV---- 179 (442)
T PF15416_consen 119 VYDFSGDNIY-DDF---AGLLTRCASFDGEHVLVVSRGT-TKPHLLKVS---------DLKAG-EINPIPLDLTGV---- 179 (442)
T ss_pred eEeccCCccc-hhh---hhhhhcccCCCCcEEEEEecCC-CCceeeehh---------HhhcC-Cccceeeecccc----
Q ss_pred eCCCCCCeeEEEeCCCCeEEEEcCCCC-----cEEEEeCCCCceeEEEecCC--------CCccceeeeee---CCeEEE
Q psy6570 112 DNNIQWPTGITIDYPSQRLYWADPKAR-----TIESINLNGKDRFVVYHTED--------NGYKPYKLEVF---EDNLYF 175 (713)
Q Consensus 112 ~~~~~~p~glavd~~~~~LY~~d~~~~-----~I~~~~~~g~~~~~~~~~~~--------~~~~p~~i~~~---~~~ly~ 175 (713)
+.-..|..++-- .++.+|++....+ +|+--....+..+++..-.. +...-+++.++ ++++|+
T Consensus 180 -tgGTf~yNmgAl-~nGH~Y~asLSG~~~SPLKiY~w~tPts~PevIa~inV~~I~gAg~RhGDn~S~nlD~nGnGyiFF 257 (442)
T PF15416_consen 180 -TGGTFSYNMGAL-VNGHSYLASLSGGKASPLKIYYWETPTSAPEVIADINVGDIPGAGNRHGDNFSLNLDENGNGYIFF 257 (442)
T ss_pred -cCcccccchhhh-cCCeEEEEeccCCCCCceEEEEecCCCCCceEEEeeeeccCcccccccCcceeEEeccCCceEEEe
Q ss_pred EeCCCCcEEEEcccCCCcceeeeccccccccEEEE
Q psy6570 176 STYRTNNILKINKFGNSDFNVLANNLNRASDVLIL 210 (713)
Q Consensus 176 td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i~v~ 210 (713)
.|.....|.|+...+...+.....-++...+...+
T Consensus 258 gdnaat~ilR~~vsn~k~v~~~~~vip~~~~~~~~ 292 (442)
T PF15416_consen 258 GDNAATNILRFTVSNYKTVSTEPKVIPSKADATMW 292 (442)
T ss_pred cCCccceEEEEEccCcccccCcceEeecCCCccee
No 363
>cd01475 vWA_Matrilin VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=56.63 E-value=8.9 Score=37.57 Aligned_cols=18 Identities=28% Similarity=0.759 Sum_probs=10.6
Q ss_pred EEeecCCCceeeCCCCCc
Q psy6570 565 TCVLIEGKPSCKCLPPYS 582 (713)
Q Consensus 565 ~C~~~~g~~~C~C~~G~~ 582 (713)
.|.++.|+|.|.|++||+
T Consensus 200 ~C~~~~g~~~c~c~~g~~ 217 (224)
T cd01475 200 VCISTPGSYLCACTEGYA 217 (224)
T ss_pred eEEcCCCCEEeECCCCcc
Confidence 455555666666666654
No 364
>KOG0313|consensus
Probab=56.62 E-value=2.6e+02 Score=29.33 Aligned_cols=140 Identities=16% Similarity=0.136 Sum_probs=77.5
Q ss_pred ecCCCCC-ceEEEeccCCeEEEeecCCCCCCeEEEEecCCc-eEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEE
Q psy6570 22 LSNLHDP-RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR-KKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISG 99 (713)
Q Consensus 22 ~~~~~~p-~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~-~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~ 99 (713)
+.+-..| ..+.++. ...+|=.-+ .+.|.+-|+... ....+... ..-..|++.+...+|.-..... .|..
T Consensus 256 l~GHt~~Vs~V~w~d-~~v~yS~Sw----DHTIk~WDletg~~~~~~~~~--ksl~~i~~~~~~~Ll~~gssdr--~irl 326 (423)
T KOG0313|consen 256 LEGHTEPVSSVVWSD-ATVIYSVSW----DHTIKVWDLETGGLKSTLTTN--KSLNCISYSPLSKLLASGSSDR--HIRL 326 (423)
T ss_pred ecccccceeeEEEcC-CCceEeecc----cceEEEEEeecccceeeeecC--cceeEeecccccceeeecCCCC--ceee
Confidence 3344444 4455554 456665544 688888887643 33333322 3455677776544443333322 3443
Q ss_pred Eec---CCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEE
Q psy6570 100 ASI---DGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFS 176 (713)
Q Consensus 100 ~~~---dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~t 176 (713)
.++ +|+..+.-+...-++..++...|.+.++|++-+..+.+..-|+....- .-+.|+-++++|+-+
T Consensus 327 ~DPR~~~gs~v~~s~~gH~nwVssvkwsp~~~~~~~S~S~D~t~klWDvRS~k~-----------plydI~~h~DKvl~v 395 (423)
T KOG0313|consen 327 WDPRTGDGSVVSQSLIGHKNWVSSVKWSPTNEFQLVSGSYDNTVKLWDVRSTKA-----------PLYDIAGHNDKVLSV 395 (423)
T ss_pred cCCCCCCCceeEEeeecchhhhhheecCCCCceEEEEEecCCeEEEEEeccCCC-----------cceeeccCCceEEEE
Confidence 333 344443333445567888999999999998887766665555433211 113344446677777
Q ss_pred eCCCC
Q psy6570 177 TYRTN 181 (713)
Q Consensus 177 d~~~~ 181 (713)
||..+
T Consensus 396 dW~~~ 400 (423)
T KOG0313|consen 396 DWNEG 400 (423)
T ss_pred eccCC
Confidence 76654
No 365
>KOG3516|consensus
Probab=55.95 E-value=8.9 Score=45.42 Aligned_cols=44 Identities=30% Similarity=0.719 Sum_probs=36.8
Q ss_pred CCCCCCCCCCCCcEEeecCCCceeeCC-CCCcCCCCCcCCCCCCC
Q psy6570 552 ANKCTPNYCSNNGTCVLIEGKPSCKCL-PPYSGKQCTEREDSPSC 595 (713)
Q Consensus 552 ~~~C~~~~C~~~~~C~~~~g~~~C~C~-~G~~G~~C~~~~~~~~C 595 (713)
.+.|.+++|.++|.|......|.|.|. .||.|..|........|
T Consensus 545 ~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~GatCHtsi~e~SC 589 (1306)
T KOG3516|consen 545 SDRCLPNPCEHGGKCSQSWDDFECNCELTGYKGATCHTSIYELSC 589 (1306)
T ss_pred ccccCCccccCCCcccccccceeEeccccccccccccCCCcchhh
Confidence 578999999999999998889999998 99999999865333344
No 366
>PF14517 Tachylectin: Tachylectin; PDB: 1TL2_A.
Probab=55.78 E-value=2.1e+02 Score=28.02 Aligned_cols=157 Identities=12% Similarity=0.077 Sum_probs=74.7
Q ss_pred EEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEe--cCCc-----eEEEEEcCCCCCcceEEEcCCCCcEEEEccC
Q psy6570 20 TVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVST--LEGR-----KKRTLLNTGLNEPYDIALEPLSGRMFWTELG 92 (713)
Q Consensus 20 ~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~--~~G~-----~~~~l~~~~~~~p~~iavD~~~~~ly~td~~ 92 (713)
++-.+...-.-|++-+ +++||.... +.+++.. ..+. ..+.|...+..+=..|++|+ +|.||-.+..
T Consensus 28 ~iG~gw~~~~~i~~~P-~g~lY~I~~-----~~lY~~~~~~~~~~~~~~~~~~Ig~g~W~~F~~i~~d~-~G~LYaV~~~ 100 (229)
T PF14517_consen 28 TIGSGWNNFRDIAAGP-NGRLYAIRN-----DGLYRGSPSSSGGNTWDSGSKQIGDGGWNSFKFIFFDP-TGVLYAVTPD 100 (229)
T ss_dssp EEESS-TT-SEEEE-T-TS-EEEEET-----TEEEEES---STT--HHHH-EEEE-S-GGG-SEEEE-T-TS-EEEEETT
T ss_pred hcCccccccceEEEcC-CceEEEEEC-----CceEEecCCccCcccccccCcccccCcccceeEEEecC-CccEEEeccc
Confidence 3333455566677766 578887775 2666652 1111 13444444355666899996 8999976642
Q ss_pred CCCeEEEEecCCCC--------cEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEE-eCCCCc-----eeEEEecCC
Q psy6570 93 IKPRISGASIDGKN--------KFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESI-NLNGKD-----RFVVYHTED 158 (713)
Q Consensus 93 ~~~~I~~~~~dG~~--------~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~-~~~g~~-----~~~~~~~~~ 158 (713)
++|+|....... -+.+....-+....|-.+ .++.||..+... ++++. ..++.. ...++...
T Consensus 101 --G~lyR~~~~~~~~~~W~~~~~~~iG~~GW~~f~~vfa~-~~GvLY~i~~dg-~~~~~~~p~~~~~~W~~~s~~v~~~- 175 (229)
T PF14517_consen 101 --GKLYRHPRPTNGSDNWIGGSGKKIGGTGWNDFDAVFAG-PNGVLYAITPDG-RLYRRYRPDGGSDRWLSGSGLVGGG- 175 (229)
T ss_dssp ---EEEEES---STT--HHH-HSEEEE-SSGGGEEEEEE--TTS-EEEEETTE--EEEE---SSTT--HHHH-EEEESS-
T ss_pred --cceeeccCCCccCcchhhccceecccCCCccceEEEeC-CCccEEEEcCCC-ceEEeCCCCCCCCccccccceeccC-
Confidence 267776543221 233334555556777777 578899988544 66665 444422 22333222
Q ss_pred CCccceeeeee-CCeEEEEeCCCCcEEEEccc
Q psy6570 159 NGYKPYKLEVF-EDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 159 ~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~ 189 (713)
.......|... ++.||++ ...+.++|....
T Consensus 176 gw~~~~~i~~~~~g~L~~V-~~~G~lyr~~~p 206 (229)
T PF14517_consen 176 GWDSFHFIFFSPDGNLWAV-KSNGKLYRGRPP 206 (229)
T ss_dssp SGGGEEEEEE-TTS-EEEE--ETTEEEEES--
T ss_pred CcccceEEeeCCCCcEEEE-ecCCEEeccCCc
Confidence 23444555544 5678877 456677766543
No 367
>KOG0278|consensus
Probab=55.41 E-value=2.2e+02 Score=28.12 Aligned_cols=127 Identities=10% Similarity=0.020 Sum_probs=69.0
Q ss_pred ccCCceeEEEccCcccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCC-CCcceEEEcC
Q psy6570 4 ISSGNVTRVKREMNLKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGL-NEPYDIALEP 81 (713)
Q Consensus 4 ~~~~~I~~~~~~~~~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~-~~p~~iavD~ 81 (713)
...+.|+..|..+..++- +..-..+.++.+...++.|-+++. ..|.-.+.+. ..+|-...+ -....-.+.|
T Consensus 162 add~tVRLWD~rTgt~v~sL~~~s~VtSlEvs~dG~ilTia~g-----ssV~Fwdaks--f~~lKs~k~P~nV~SASL~P 234 (334)
T KOG0278|consen 162 ADDKTVRLWDHRTGTEVQSLEFNSPVTSLEVSQDGRILTIAYG-----SSVKFWDAKS--FGLLKSYKMPCNVESASLHP 234 (334)
T ss_pred ccCCceEEEEeccCcEEEEEecCCCCcceeeccCCCEEEEecC-----ceeEEecccc--ccceeeccCccccccccccC
Confidence 345555555655544332 344445677888766655555554 2333333322 222221111 1233445666
Q ss_pred CCCcEEEEccCCCCeEEEEecCCCCcEEEE-eCCCCCCeeEEEeCCCCeEEEEcCCCCcE
Q psy6570 82 LSGRMFWTELGIKPRISGASIDGKNKFNLV-DNNIQWPTGITIDYPSQRLYWADPKARTI 140 (713)
Q Consensus 82 ~~~~ly~td~~~~~~I~~~~~dG~~~~~l~-~~~~~~p~glavd~~~~~LY~~d~~~~~I 140 (713)
.. .+|++..... .+++++.+........ .........+.+.| ++.+|-+-+..+.|
T Consensus 235 ~k-~~fVaGged~-~~~kfDy~TgeEi~~~nkgh~gpVhcVrFSP-dGE~yAsGSEDGTi 291 (334)
T KOG0278|consen 235 KK-EFFVAGGEDF-KVYKFDYNTGEEIGSYNKGHFGPVHCVRFSP-DGELYASGSEDGTI 291 (334)
T ss_pred CC-ceEEecCcce-EEEEEeccCCceeeecccCCCCceEEEEECC-CCceeeccCCCceE
Confidence 44 7888875444 8889988866655554 33344445677774 67888877655544
No 368
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=55.34 E-value=2.5e+02 Score=28.86 Aligned_cols=142 Identities=13% Similarity=0.105 Sum_probs=56.5
Q ss_pred CCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEe-CCCCCCeeEEEeCCCC
Q psy6570 50 SNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVD-NNIQWPTGITIDYPSQ 128 (713)
Q Consensus 50 ~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~-~~~~~p~glavd~~~~ 128 (713)
.+.|++..-.|+.=+.+.........+++..+...+|.++..+ .+++..-.|...-.... ....+-..|.+++. +
T Consensus 123 ~G~iy~T~DgG~tW~~~~~~~~gs~~~~~r~~dG~~vavs~~G---~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~-~ 198 (302)
T PF14870_consen 123 RGAIYRTTDGGKTWQAVVSETSGSINDITRSSDGRYVAVSSRG---NFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPD-G 198 (302)
T ss_dssp T--EEEESSTTSSEEEEE-S----EEEEEE-TTS-EEEEETTS---SEEEEE-TT-SS-EEEE--SSS-EEEEEE-TT-S
T ss_pred CCcEEEeCCCCCCeeEcccCCcceeEeEEECCCCcEEEEECcc---cEEEEecCCCccceEEccCccceehhceecCC-C
Confidence 4567776666655444444444455555555433333343322 45655555544333322 23445567778754 5
Q ss_pred eEEEEcCCCCcEEEEeCCCCceeEEEec--C--CCCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcceeee
Q psy6570 129 RLYWADPKARTIESINLNGKDRFVVYHT--E--DNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDFNVLA 198 (713)
Q Consensus 129 ~LY~~d~~~~~I~~~~~~g~~~~~~~~~--~--~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~~~ 198 (713)
.|+.+. ..+.|+.-+ +.......... . .....-+.|++..+...|+-.+++.+++ ..+++..-+...
T Consensus 199 ~lw~~~-~Gg~~~~s~-~~~~~~~w~~~~~~~~~~~~~~ld~a~~~~~~~wa~gg~G~l~~-S~DgGktW~~~~ 269 (302)
T PF14870_consen 199 NLWMLA-RGGQIQFSD-DPDDGETWSEPIIPIKTNGYGILDLAYRPPNEIWAVGGSGTLLV-STDGGKTWQKDR 269 (302)
T ss_dssp -EEEEE-TTTEEEEEE--TTEEEEE---B-TTSS--S-EEEEEESSSS-EEEEESTT-EEE-ESSTTSS-EE-G
T ss_pred CEEEEe-CCcEEEEcc-CCCCccccccccCCcccCceeeEEEEecCCCCEEEEeCCccEEE-eCCCCccceECc
Confidence 665555 445555555 12122222111 0 1111124455555455555555565554 345666555443
No 369
>KOG0640|consensus
Probab=55.20 E-value=1.8e+02 Score=29.51 Aligned_cols=111 Identities=9% Similarity=0.089 Sum_probs=62.9
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEc--CCCCCcceEEEcCCCCcEEEEccCCCCeEEEEec
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLN--TGLNEPYDIALEPLSGRMFWTELGIKPRISGASI 102 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~--~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~ 102 (713)
......|+|.|. ..|.++-+ ..+.|..++..-...+..+. ......+.|.+.|.+.+|.+..... .+...+.
T Consensus 172 ~devn~l~FHPr-e~ILiS~s---rD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHPsGefllvgTdHp--~~rlYdv 245 (430)
T KOG0640|consen 172 VDEVNDLDFHPR-ETILISGS---RDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHPSGEFLLVGTDHP--TLRLYDV 245 (430)
T ss_pred cCcccceeecch-hheEEecc---CCCeEEEEecccHHHHHHHHHhhccceeeeEeecCCCceEEEecCCC--ceeEEec
Confidence 344566777765 45555655 56667666665433221111 2345677899999888877754332 3444444
Q ss_pred CCCCcEEEEeCC-----CCCCeeEEEeCCCCeEEEEcCCCCcEEEEe
Q psy6570 103 DGKNKFNLVDNN-----IQWPTGITIDYPSQRLYWADPKARTIESIN 144 (713)
Q Consensus 103 dG~~~~~l~~~~-----~~~p~glavd~~~~~LY~~d~~~~~I~~~~ 144 (713)
+.. +-++..+ ....+.+-.. .+++||++-+..+.|...|
T Consensus 246 ~T~--QcfvsanPd~qht~ai~~V~Ys-~t~~lYvTaSkDG~IklwD 289 (430)
T KOG0640|consen 246 NTY--QCFVSANPDDQHTGAITQVRYS-STGSLYVTASKDGAIKLWD 289 (430)
T ss_pred cce--eEeeecCcccccccceeEEEec-CCccEEEEeccCCcEEeec
Confidence 432 2222211 1122344555 5789999998888887654
No 370
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=55.14 E-value=2.1e+02 Score=29.62 Aligned_cols=129 Identities=10% Similarity=0.086 Sum_probs=63.0
Q ss_pred CceeEEEccCc------ccEE-ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce-EEEEE-cCCCCCcceE
Q psy6570 7 GNVTRVKREMN------LKTV-LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK-KRTLL-NTGLNEPYDI 77 (713)
Q Consensus 7 ~~I~~~~~~~~------~~~~-~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~-~~~l~-~~~~~~p~~i 77 (713)
|+|..+..... .+.+ ...+..| -.++...+++|.++-. ++|.+++++.+. ..... ......+..|
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~-V~ai~~~~~~lv~~~g-----~~l~v~~l~~~~~l~~~~~~~~~~~i~sl 135 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVKGP-VTAICSFNGRLVVAVG-----NKLYVYDLDNSKTLLKKAFYDSPFYITSL 135 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEESS--EEEEEEETTEEEEEET-----TEEEEEEEETTSSEEEEEEE-BSSSEEEE
T ss_pred cEEEEEEEEcccccceEEEEEEEEeecCc-ceEhhhhCCEEEEeec-----CEEEEEEccCcccchhhheecceEEEEEE
Confidence 78887777662 2323 2333333 2344445777665543 677787777655 33222 1212234444
Q ss_pred EEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC-CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEe
Q psy6570 78 ALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN-NIQWPTGITIDYPSQRLYWADPKARTIESIN 144 (713)
Q Consensus 78 avD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~-~~~~p~glavd~~~~~LY~~d~~~~~I~~~~ 144 (713)
.+ .+++|++.|....-.+.+.+.++.....+... ...+...+.+=.+++.+..+|.. +.|+.+.
T Consensus 136 ~~--~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~-gnl~~l~ 200 (321)
T PF03178_consen 136 SV--FKNYILVGDAMKSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKD-GNLFVLR 200 (321)
T ss_dssp EE--ETTEEEEEESSSSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETT-SEEEEEE
T ss_pred ec--cccEEEEEEcccCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCC-CeEEEEE
Confidence 44 57899999976542344445434333333321 12223333332123477777754 3344333
No 371
>KOG3914|consensus
Probab=54.26 E-value=1.4e+02 Score=31.26 Aligned_cols=115 Identities=11% Similarity=0.162 Sum_probs=65.8
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecC-CceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEec
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLE-GRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASI 102 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~-G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~ 102 (713)
.-..|..|.++.....+-++|... ....+.+.+.+ |..+.++ ..+.+..+++|.|...+|..+|... +|++...
T Consensus 106 v~~~~~ai~~~~~~~sv~v~dkag-D~~~~di~s~~~~~~~~~l--GhvSml~dVavS~D~~~IitaDRDE--kIRvs~y 180 (390)
T KOG3914|consen 106 VPKRPTAISFIREDTSVLVADKAG-DVYSFDILSADSGRCEPIL--GHVSMLLDVAVSPDDQFIITADRDE--KIRVSRY 180 (390)
T ss_pred cccCcceeeeeeccceEEEEeecC-CceeeeeecccccCcchhh--hhhhhhheeeecCCCCEEEEecCCc--eEEEEec
Confidence 445567777766666666666521 11223333322 3333322 2356788999998888888888654 5777777
Q ss_pred CCCCcEE-EEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 103 DGKNKFN-LVDNNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 103 dG~~~~~-l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
.+..... +....-..-..|++- .+++.|+-.+.+.|+.-++
T Consensus 181 pa~f~IesfclGH~eFVS~isl~--~~~~LlS~sGD~tlr~Wd~ 222 (390)
T KOG3914|consen 181 PATFVIESFCLGHKEFVSTISLT--DNYLLLSGSGDKTLRLWDI 222 (390)
T ss_pred CcccchhhhccccHhheeeeeec--cCceeeecCCCCcEEEEec
Confidence 7654321 111122234566665 4566777777777766665
No 372
>KOG0647|consensus
Probab=54.23 E-value=2.5e+02 Score=28.50 Aligned_cols=84 Identities=8% Similarity=-0.005 Sum_probs=47.4
Q ss_pred CCCCCcceEEEcCCCCcEEEE-ccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 69 TGLNEPYDIALEPLSGRMFWT-ELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 69 ~~~~~p~~iavD~~~~~ly~t-d~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
..-.....|++.|....|+.+ .|...-|||.+.-+|...-......-..+-.+++..++.++|.+.. .+.+...++..
T Consensus 25 pP~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~WsddgskVf~g~~-Dk~~k~wDL~S 103 (347)
T KOG0647|consen 25 PPEDSISALAFSPQADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWSDDGSKVFSGGC-DKQAKLWDLAS 103 (347)
T ss_pred CcccchheeEeccccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEEEccCCceEEeecc-CCceEEEEccC
Confidence 345667789999966666644 4554457787777665443332222233456777766666776653 23444555544
Q ss_pred CceeEE
Q psy6570 148 KDRFVV 153 (713)
Q Consensus 148 ~~~~~~ 153 (713)
.....+
T Consensus 104 ~Q~~~v 109 (347)
T KOG0647|consen 104 GQVSQV 109 (347)
T ss_pred CCeeee
Confidence 433333
No 373
>KOG3514|consensus
Probab=52.51 E-value=78 Score=37.51 Aligned_cols=30 Identities=20% Similarity=0.595 Sum_probs=26.3
Q ss_pred CCCCCCCCCCCeeeccCCCceeeeCC-CCcc
Q psy6570 222 HCDDKPCHQSALCINLPSSHTCLCPD-HLTE 251 (713)
Q Consensus 222 ~C~~~~C~~~~~C~~~~g~~~C~C~~-G~~~ 251 (713)
.|..+||.++++|...-..|.|.|.. ||.|
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G 655 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEG 655 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccC
Confidence 69999999999999888899999965 6665
No 374
>KOG3514|consensus
Probab=51.73 E-value=9.5 Score=44.46 Aligned_cols=35 Identities=34% Similarity=0.817 Sum_probs=31.3
Q ss_pred cCCCCCCCCCcEeccCCCCccccCC-CCCcCCCCcc
Q psy6570 417 QCLNLKCQNGGVCVNKTTGLECDCP-KFYYGKNCQY 451 (713)
Q Consensus 417 ~C~~~~C~~~~~C~~~~~~~~C~C~-~G~~g~~C~~ 451 (713)
.|.++||.|+|.|....+.|.|.|. .||.|+.|+.
T Consensus 625 ~C~~nPC~N~g~C~egwNrfiCDCs~T~~~G~~Cer 660 (1591)
T KOG3514|consen 625 ICESNPCQNGGKCSEGWNRFICDCSGTGFEGRTCER 660 (1591)
T ss_pred ccCCCcccCCCCccccccccccccccCcccCccccc
Confidence 6889999999999999999999995 7899999974
No 375
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=51.61 E-value=4.9 Score=36.24 Aligned_cols=26 Identities=12% Similarity=-0.008 Sum_probs=0.0
Q ss_pred ccchhHHHHHHHHHHHHHHhheeeEE
Q psy6570 682 VNSHISSILILILLLITVGGIGYYIF 707 (713)
Q Consensus 682 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 707 (713)
.+++++++++++.+|..|++++++|.
T Consensus 131 LVGIIVGVLlaIG~igGIIivvvRKm 156 (162)
T PF05808_consen 131 LVGIIVGVLLAIGFIGGIIIVVVRKM 156 (162)
T ss_dssp --------------------------
T ss_pred eeeehhhHHHHHHHHhheeeEEeehh
Confidence 33444444444444444444444443
No 376
>KOG0646|consensus
Probab=51.60 E-value=2.5e+02 Score=30.17 Aligned_cols=31 Identities=13% Similarity=0.089 Sum_probs=22.1
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
..+.++++||...++|+-... +.|+...+.+
T Consensus 218 ~si~av~lDpae~~~yiGt~~--G~I~~~~~~~ 248 (476)
T KOG0646|consen 218 SSIKAVALDPAERVVYIGTEE--GKIFQNLLFK 248 (476)
T ss_pred CcceeEEEcccccEEEecCCc--ceEEeeehhc
Confidence 568899999999999985432 2566655543
No 377
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=50.54 E-value=17 Score=30.95 Aligned_cols=30 Identities=33% Similarity=1.016 Sum_probs=17.0
Q ss_pred CCCC-CCCCCCCcEEeecCCCceeeCCCCCcC
Q psy6570 553 NKCT-PNYCSNNGTCVLIEGKPSCKCLPPYSG 583 (713)
Q Consensus 553 ~~C~-~~~C~~~~~C~~~~g~~~C~C~~G~~G 583 (713)
+.|. ...|+.+|.|.. .....|.|.+||.-
T Consensus 78 d~Cd~y~~CG~~g~C~~-~~~~~C~Cl~GF~P 108 (110)
T PF00954_consen 78 DQCDVYGFCGPNGICNS-NNSPKCSCLPGFEP 108 (110)
T ss_pred cCCCCccccCCccEeCC-CCCCceECCCCcCC
Confidence 3454 345666777743 23445777777653
No 378
>KOG2055|consensus
Probab=50.03 E-value=3.6e+02 Score=29.07 Aligned_cols=37 Identities=8% Similarity=0.055 Sum_probs=25.1
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEE
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRT 65 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~ 65 (713)
...+..|.|.+....|.++.. .+.+..+..||+....
T Consensus 213 ~~~I~sv~FHp~~plllvaG~----d~~lrifqvDGk~N~~ 249 (514)
T KOG2055|consen 213 HGGITSVQFHPTAPLLLVAGL----DGTLRIFQVDGKVNPK 249 (514)
T ss_pred cCCceEEEecCCCceEEEecC----CCcEEEEEecCccChh
Confidence 344677888887777776665 5667777777866443
No 379
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=49.97 E-value=5.4 Score=35.99 Aligned_cols=35 Identities=26% Similarity=0.226 Sum_probs=0.0
Q ss_pred ccccccchhHHHHHHHHHHHHHHhheeeEEEEecC
Q psy6570 678 KQSYVNSHISSILILILLLITVGGIGYYIFRIKMS 712 (713)
Q Consensus 678 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 712 (713)
+.......+++|++++||.|.++..+++...|||+
T Consensus 123 k~GL~T~tLVGIIVGVLlaIG~igGIIivvvRKmS 157 (162)
T PF05808_consen 123 KDGLSTVTLVGIIVGVLLAIGFIGGIIIVVVRKMS 157 (162)
T ss_dssp -----------------------------------
T ss_pred cCCcceeeeeeehhhHHHHHHHHhheeeEEeehhc
Confidence 34455668888999988888888877777778774
No 380
>KOG1897|consensus
Probab=49.56 E-value=3.9e+02 Score=31.87 Aligned_cols=140 Identities=14% Similarity=0.169 Sum_probs=69.6
Q ss_pred EeecCCCCCCeEEEEecCC-ceEEEEEcCC-CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCC-CCC
Q psy6570 42 WTDAGGRSSNNIMVSTLEG-RKKRTLLNTG-LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNI-QWP 118 (713)
Q Consensus 42 ~td~~~~~~~~I~~~~~~G-~~~~~l~~~~-~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~-~~p 118 (713)
|=|..+...++|.++...- +..+.+.... -..|.+|.+ .+|+|..+= +. .|..+.+.-. ++.-+..+. ...
T Consensus 798 ~Pde~ep~~GRIivfe~~e~~~L~~v~e~~v~Gav~aL~~--fngkllA~I-n~--~vrLye~t~~-~eLr~e~~~~~~~ 871 (1096)
T KOG1897|consen 798 YPDENEPVNGRIIVFEFEELNSLELVAETVVKGAVYALVE--FNGKLLAGI-NQ--SVRLYEWTTE-RELRIECNISNPI 871 (1096)
T ss_pred ccCCCCcccceEEEEEEecCCceeeeeeeeeccceeehhh--hCCeEEEec-Cc--EEEEEEcccc-ceehhhhcccCCe
Confidence 3344333567877776654 3333332211 234555554 566665432 22 3444443322 111112222 233
Q ss_pred eeEEEeCCCCeEEEEcCCCCcEEEEeCCC--CceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEccc
Q psy6570 119 TGITIDYPSQRLYWADPKARTIESINLNG--KDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 119 ~glavd~~~~~LY~~d~~~~~I~~~~~~g--~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~ 189 (713)
..|-+...+++|+++|.. ++|..+.+.+ .+...++..-. ..+-.++.+.++..|......+.++.+.+.
T Consensus 872 ~aL~l~v~gdeI~VgDlm-~Sitll~y~~~eg~f~evArD~~-p~Wmtaveil~~d~ylgae~~gNlf~v~~d 942 (1096)
T KOG1897|consen 872 IALDLQVKGDEIAVGDLM-RSITLLQYKGDEGNFEEVARDYN-PNWMTAVEILDDDTYLGAENSGNLFTVRKD 942 (1096)
T ss_pred EEEEEEecCcEEEEeecc-ceEEEEEEeccCCceEEeehhhC-ccceeeEEEecCceEEeecccccEEEEEec
Confidence 456666678999999953 3344444443 33555554322 334466666666566655555666666654
No 381
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=48.81 E-value=9.2 Score=36.02 Aligned_cols=18 Identities=6% Similarity=0.194 Sum_probs=7.5
Q ss_pred cchhHHHHHHHHHHHHHH
Q psy6570 683 NSHISSILILILLLITVG 700 (713)
Q Consensus 683 ~~~~~~~~~~~~~~~~~~ 700 (713)
+++++++++++++||++|
T Consensus 78 ~~iivgvi~~Vi~Iv~~I 95 (179)
T PF13908_consen 78 TGIIVGVICGVIAIVVLI 95 (179)
T ss_pred eeeeeehhhHHHHHHHhH
Confidence 334444444444443333
No 382
>KOG0772|consensus
Probab=48.42 E-value=3e+02 Score=30.17 Aligned_cols=117 Identities=12% Similarity=-0.026 Sum_probs=64.8
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-----C-------CCcceEEEcCCCCcEEEEcc
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-----L-------NEPYDIALEPLSGRMFWTEL 91 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-----~-------~~p~~iavD~~~~~ly~td~ 91 (713)
.-...+.|++.+.++.|.++-. ...+.+++.||.........+ | ...+.-...|.+...|.|-.
T Consensus 213 E~h~i~sl~ys~Tg~~iLvvsg----~aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s 288 (641)
T KOG0772|consen 213 ETHQINSLQYSVTGDQILVVSG----SAQAKLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCS 288 (641)
T ss_pred cccccceeeecCCCCeEEEEec----CcceeEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEec
Confidence 3445678888877677766553 566778888887655544322 1 12223345666666777654
Q ss_pred CCC-CeEEEEecCCCCcEEEEeCC----CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 92 GIK-PRISGASIDGKNKFNLVDNN----IQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 92 ~~~-~~I~~~~~dG~~~~~l~~~~----~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
... .|||-++-.-+.++++.... --.|..-+++++.. ++-+--..+.|+.-++
T Consensus 289 ~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~-~iAagc~DGSIQ~W~~ 346 (641)
T KOG0772|consen 289 YDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGK-LIAAGCLDGSIQIWDK 346 (641)
T ss_pred CCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCCCcc-hhhhcccCCceeeeec
Confidence 321 26665544444444444321 11356678886544 4444445566666655
No 383
>PRK13684 Ycf48-like protein; Provisional
Probab=47.95 E-value=3.5e+02 Score=28.29 Aligned_cols=149 Identities=13% Similarity=0.023 Sum_probs=67.0
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCC-ceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCC
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEG-RKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G-~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~ 105 (713)
...++++++. + .||.-. ..+.|++...+| +.-+.+.........+|++.+ ++.+++.... +.+.+.+.|+.
T Consensus 174 ~~~~i~~~~~-g-~~v~~g---~~G~i~~s~~~gg~tW~~~~~~~~~~l~~i~~~~-~g~~~~vg~~--G~~~~~s~d~G 245 (334)
T PRK13684 174 VVRNLRRSPD-G-KYVAVS---SRGNFYSTWEPGQTAWTPHQRNSSRRLQSMGFQP-DGNLWMLARG--GQIRFNDPDDL 245 (334)
T ss_pred eEEEEEECCC-C-eEEEEe---CCceEEEEcCCCCCeEEEeeCCCcccceeeeEcC-CCCEEEEecC--CEEEEccCCCC
Confidence 3456666653 3 333333 356677654344 322333333345677888876 4556555433 23443334443
Q ss_pred Cc-EEEEeC---CCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCc-cceeeeee-CCeEEEEeCC
Q psy6570 106 NK-FNLVDN---NIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGY-KPYKLEVF-EDNLYFSTYR 179 (713)
Q Consensus 106 ~~-~~l~~~---~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~-~p~~i~~~-~~~ly~td~~ 179 (713)
.. +.+... ......+|++.+ ++.+|++- ..+.|++..-.|..-+.+.... .++ .-+.+.+. ++.+ |+-..
T Consensus 246 ~sW~~~~~~~~~~~~~l~~v~~~~-~~~~~~~G-~~G~v~~S~d~G~tW~~~~~~~-~~~~~~~~~~~~~~~~~-~~~G~ 321 (334)
T PRK13684 246 ESWSKPIIPEITNGYGYLDLAYRT-PGEIWAGG-GNGTLLVSKDGGKTWEKDPVGE-EVPSNFYKIVFLDPEKG-FVLGQ 321 (334)
T ss_pred CccccccCCccccccceeeEEEcC-CCCEEEEc-CCCeEEEeCCCCCCCeECCcCC-CCCcceEEEEEeCCCce-EEECC
Confidence 22 222111 112235667765 45555443 4455655443344433332111 121 12334433 4444 44445
Q ss_pred CCcEEEEc
Q psy6570 180 TNNILKIN 187 (713)
Q Consensus 180 ~~~i~~~~ 187 (713)
.+.|.+.+
T Consensus 322 ~G~il~~~ 329 (334)
T PRK13684 322 RGVLLRYV 329 (334)
T ss_pred CceEEEec
Confidence 56665554
No 384
>PRK13684 Ycf48-like protein; Provisional
Probab=47.94 E-value=3.5e+02 Score=28.29 Aligned_cols=140 Identities=12% Similarity=0.110 Sum_probs=64.6
Q ss_pred CCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE-EEEeCCCCCCeeEEEeCCCC
Q psy6570 50 SNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKF-NLVDNNIQWPTGITIDYPSQ 128 (713)
Q Consensus 50 ~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~-~l~~~~~~~p~glavd~~~~ 128 (713)
.+.|++..-.|+.=+.+.........+|++++. +.++.+.. .+.|++...+|...- .+........++|++.+ ++
T Consensus 151 ~G~i~~S~DgG~tW~~~~~~~~g~~~~i~~~~~-g~~v~~g~--~G~i~~s~~~gg~tW~~~~~~~~~~l~~i~~~~-~g 226 (334)
T PRK13684 151 VGAIYRTTDGGKNWEALVEDAAGVVRNLRRSPD-GKYVAVSS--RGNFYSTWEPGQTAWTPHQRNSSRRLQSMGFQP-DG 226 (334)
T ss_pred cceEEEECCCCCCceeCcCCCcceEEEEEECCC-CeEEEEeC--CceEEEEcCCCCCeEEEeeCCCcccceeeeEcC-CC
Confidence 567777776665533333233345678888864 33333321 226776644554322 23222344567788875 45
Q ss_pred eEEEEcCCCCcEEEEeCCC-CceeEEEec-CCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCccee
Q psy6570 129 RLYWADPKARTIESINLNG-KDRFVVYHT-EDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNV 196 (713)
Q Consensus 129 ~LY~~d~~~~~I~~~~~~g-~~~~~~~~~-~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~ 196 (713)
++|++-. .+.+.....|+ ..-+.+... .......+++.+. ++.+|++ ...+.|++ ..+++.....
T Consensus 227 ~~~~vg~-~G~~~~~s~d~G~sW~~~~~~~~~~~~~l~~v~~~~~~~~~~~-G~~G~v~~-S~d~G~tW~~ 294 (334)
T PRK13684 227 NLWMLAR-GGQIRFNDPDDLESWSKPIIPEITNGYGYLDLAYRTPGEIWAG-GGNGTLLV-SKDGGKTWEK 294 (334)
T ss_pred CEEEEec-CCEEEEccCCCCCccccccCCccccccceeeEEEcCCCCEEEE-cCCCeEEE-eCCCCCCCeE
Confidence 5655542 34443323333 332222111 0001123445554 4455444 34444543 3344444443
No 385
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=46.85 E-value=8.1 Score=42.76 Aligned_cols=16 Identities=25% Similarity=0.353 Sum_probs=7.1
Q ss_pred CcceEEEcCCCCcEEE
Q psy6570 73 EPYDIALEPLSGRMFW 88 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~ 88 (713)
.|..|++--.++..|.
T Consensus 75 ~~V~i~~aVr~~~~~L 90 (684)
T PF12877_consen 75 GPVSITYAVRNGSGFL 90 (684)
T ss_pred CCeEEEEEEecCceee
Confidence 3444444434444444
No 386
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=46.68 E-value=2.2e+02 Score=31.68 Aligned_cols=70 Identities=11% Similarity=0.084 Sum_probs=39.9
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCC-CCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGL-NEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~-~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
......|.+|. .+.||+.-. +-+++++........+....+ ..++.|+.| ..+.|++..+. -|++....
T Consensus 164 d~~V~aLv~D~-~g~lWvgT~-----dGL~~fd~~~gkalql~s~~~dk~I~al~~d-~qg~LWVGTdq---Gv~~~e~~ 233 (671)
T COG3292 164 DTPVVALVFDA-NGRLWVGTP-----DGLSYFDAGRGKALQLASPPLDKAINALIAD-VQGRLWVGTDQ---GVYLQEAE 233 (671)
T ss_pred Cccceeeeeec-cCcEEEecC-----CcceEEccccceEEEcCCCcchhhHHHHHHH-hcCcEEEEecc---ceEEEchh
Confidence 44456788884 456655433 446666643322223333334 556778888 56777764433 47777777
Q ss_pred C
Q psy6570 104 G 104 (713)
Q Consensus 104 G 104 (713)
|
T Consensus 234 G 234 (671)
T COG3292 234 G 234 (671)
T ss_pred h
Confidence 6
No 387
>KOG0918|consensus
Probab=45.82 E-value=77 Score=33.30 Aligned_cols=28 Identities=11% Similarity=0.160 Sum_probs=19.9
Q ss_pred eEEEeccCCeEEEeecCCCCCCeEEEEecCC
Q psy6570 30 GVAVDWVGKNLYWTDAGGRSSNNIMVSTLEG 60 (713)
Q Consensus 30 gla~D~~~~~ly~td~~~~~~~~I~~~~~~G 60 (713)
.|-+....+.||++.+ ..+-|..++...
T Consensus 316 DilISmDDRFLYvs~W---LHGDirQYdIsD 343 (476)
T KOG0918|consen 316 DILISLDDRFLYVSNW---LHGDIRQYDISD 343 (476)
T ss_pred eeEEeecCcEEEEEee---eecceeeeccCC
Confidence 3445455788999999 777787777554
No 388
>KOG0299|consensus
Probab=45.07 E-value=4.3e+02 Score=28.49 Aligned_cols=111 Identities=11% Similarity=0.012 Sum_probs=61.9
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcC---------CCC---CcceEEEcCCCCcEEEEcc
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNT---------GLN---EPYDIALEPLSGRMFWTEL 91 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~---------~~~---~p~~iavD~~~~~ly~td~ 91 (713)
.-..|..+++ ++..-|++-+ .++.|...++-.+....+... .++ +.++||+-+. -.|+-+.+
T Consensus 326 ~~~sidcv~~--In~~HfvsGS---dnG~IaLWs~~KKkplf~~~~AHgv~~~~~~~~~~~Witsla~i~~-sdL~asGS 399 (479)
T KOG0299|consen 326 GEGSIDCVAF--INDEHFVSGS---DNGSIALWSLLKKKPLFTSRLAHGVIPELDPVNGNFWITSLAVIPG-SDLLASGS 399 (479)
T ss_pred CCCCeeeEEE--ecccceeecc---CCceEEEeeecccCceeEeeccccccCCccccccccceeeeEeccc-CceEEecC
Confidence 3346777777 4566667766 677887776654332222111 112 4567777754 34555443
Q ss_pred C-CCCeEEEEecCCCCcEEEEeCC-CCCCeeEEEeCCCCeEEEEcCCCCcE
Q psy6570 92 G-IKPRISGASIDGKNKFNLVDNN-IQWPTGITIDYPSQRLYWADPKARTI 140 (713)
Q Consensus 92 ~-~~~~I~~~~~dG~~~~~l~~~~-~~~p~glavd~~~~~LY~~d~~~~~I 140 (713)
. ..-++|.+...-.....+..-. ....+.|+|...+++|++.-...+|+
T Consensus 400 ~~G~vrLW~i~~g~r~i~~l~~ls~~GfVNsl~f~~sgk~ivagiGkEhRl 450 (479)
T KOG0299|consen 400 WSGCVRLWKIEDGLRAINLLYSLSLVGFVNSLAFSNSGKRIVAGIGKEHRL 450 (479)
T ss_pred CCCceEEEEecCCccccceeeecccccEEEEEEEccCCCEEEEeccccccc
Confidence 3 2336676654333333333222 34568899887788888876555543
No 389
>KOG1645|consensus
Probab=45.02 E-value=1e+02 Score=32.38 Aligned_cols=73 Identities=11% Similarity=0.101 Sum_probs=46.5
Q ss_pred CCcceEEEcCCCCc-EEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 72 NEPYDIALEPLSGR-MFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 72 ~~p~~iavD~~~~~-ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
..+++|++.|.+.- |-++..+ + .|..+++.......-.. .-+.++..++|.++....++-..++.|+.+|+.-
T Consensus 194 ~~IrdlafSp~~~GLl~~asl~-n-kiki~dlet~~~vssy~-a~~~~wSC~wDlde~h~IYaGl~nG~VlvyD~R~ 267 (463)
T KOG1645|consen 194 SFIRDLAFSPFNEGLLGLASLG-N-KIKIMDLETSCVVSSYI-AYNQIWSCCWDLDERHVIYAGLQNGMVLVYDMRQ 267 (463)
T ss_pred hhhhhhccCccccceeeeeccC-c-eEEEEecccceeeehee-ccCCceeeeeccCCcceeEEeccCceEEEEEccC
Confidence 56788999887763 4444433 4 67777776543322222 2367788999977665555655677888888643
No 390
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=44.93 E-value=3.8e+02 Score=27.92 Aligned_cols=38 Identities=16% Similarity=0.163 Sum_probs=18.6
Q ss_pred CceeEEEccCcc-cEEecCCCCCc-eEE-EeccCCeEEEee
Q psy6570 7 GNVTRVKREMNL-KTVLSNLHDPR-GVA-VDWVGKNLYWTD 44 (713)
Q Consensus 7 ~~I~~~~~~~~~-~~~~~~~~~p~-gla-~D~~~~~ly~td 44 (713)
..+.++++..+. +.+...+..++ +.+ +-..+++||+.-
T Consensus 85 ~~v~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviG 125 (346)
T TIGR03547 85 DDVYRYDPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFTG 125 (346)
T ss_pred ccEEEEECCCCEEecCCCCCCCcccceeEEEEeCCEEEEEc
Confidence 456777777654 22222222222 221 112468999874
No 391
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=44.86 E-value=7.8 Score=36.72 Aligned_cols=22 Identities=18% Similarity=0.326 Sum_probs=10.6
Q ss_pred cccchhHHHHHHHHHHHHHHhh
Q psy6570 681 YVNSHISSILILILLLITVGGI 702 (713)
Q Consensus 681 ~~~~~~~~~~~~~~~~~~~~~~ 702 (713)
.+.+++++++.++|||++++++
T Consensus 39 I~iaiVAG~~tVILVI~i~v~v 60 (221)
T PF08374_consen 39 IMIAIVAGIMTVILVIFIVVLV 60 (221)
T ss_pred eeeeeecchhhhHHHHHHHHHH
Confidence 3455555555554444433333
No 392
>KOG4283|consensus
Probab=44.78 E-value=3.5e+02 Score=27.41 Aligned_cols=53 Identities=9% Similarity=0.104 Sum_probs=40.3
Q ss_pred CceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce
Q psy6570 7 GNVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK 62 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~ 62 (713)
.+|.++.++....++...-.....|.||...+++.++-. ..+.|.++|+....
T Consensus 25 rRil~L~Ln~d~d~~r~HgGsvNsL~id~tegrymlSGg---adgsi~v~Dl~n~t 77 (397)
T KOG4283|consen 25 RRILSLQLNNDKDFVRPHGGSVNSLQIDLTEGRYMLSGG---ADGSIAVFDLQNAT 77 (397)
T ss_pred hhhheeeccCCcceeccCCCccceeeeccccceEEeecC---CCccEEEEEecccc
Confidence 567777777766666555678889999998888877766 78899999987543
No 393
>KOG1408|consensus
Probab=44.62 E-value=3e+02 Score=31.44 Aligned_cols=101 Identities=14% Similarity=0.150 Sum_probs=56.3
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEc-cCCCC--eEEEEe
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTE-LGIKP--RISGAS 101 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td-~~~~~--~I~~~~ 101 (713)
....+|||.++..+.+-+-. .-.|..++.+.....-|+...-.....+|+.+ +|+...|. .+..+ ++|...
T Consensus 37 v~~~~gLa~~p~Sgl~aYpA-----GCvVVlfn~~~~tQ~hlvnssRk~~t~vAfS~-~GryvatGEcG~~pa~kVw~la 110 (1080)
T KOG1408|consen 37 VKNANGLASVPCSGLCAYPA-----GCVVVLFNVDSCTQSHLVNSSRKPLTCVAFSQ-NGRYVATGECGRTPASKVWSLA 110 (1080)
T ss_pred eecCCcccccccccceeecc-----CcEEEEEcccccchhheecccCcceeEEEEcC-CCcEEEecccCCCccceeeeec
Confidence 45678899998887764332 34566777776554445544334566899986 45444443 33333 455444
Q ss_pred cCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEE
Q psy6570 102 IDGKNKFNLVDNNIQWPTGITIDYPSQRLYWA 133 (713)
Q Consensus 102 ~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~ 133 (713)
..|.. ..++. .-..-+-+||.|.+++|.-.
T Consensus 111 ~h~vV-AEfvd-HKY~vtcvaFsp~~kyvvSV 140 (1080)
T KOG1408|consen 111 FHGVV-AEFVD-HKYNVTCVAFSPGNKYVVSV 140 (1080)
T ss_pred cccch-hhhhh-ccccceeeeecCCCcEEEee
Confidence 44421 12222 22334678888766665533
No 394
>KOG1539|consensus
Probab=44.38 E-value=5.8e+02 Score=29.83 Aligned_cols=118 Identities=8% Similarity=0.056 Sum_probs=51.4
Q ss_pred CCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEe-CCCCCCeeEEEeCCCC
Q psy6570 50 SNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVD-NNIQWPTGITIDYPSQ 128 (713)
Q Consensus 50 ~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~-~~~~~p~glavd~~~~ 128 (713)
+++|.++++.-.....-+..+-.+...|++--.+.-+..+. ...+.+..-+++++.+..++. .....+.++.+-+..-
T Consensus 223 ~G~ViifNlK~dkil~sFk~d~g~VtslSFrtDG~p~las~-~~~G~m~~wDLe~kkl~~v~~nah~~sv~~~~fl~~ep 301 (910)
T KOG1539|consen 223 NGTVIIFNLKFDKILMSFKQDWGRVTSLSFRTDGNPLLASG-RSNGDMAFWDLEKKKLINVTRNAHYGSVTGATFLPGEP 301 (910)
T ss_pred CceEEEEEcccCcEEEEEEccccceeEEEeccCCCeeEEec-cCCceEEEEEcCCCeeeeeeeccccCCcccceecCCCc
Confidence 45555555543322222222223444455532222222222 222255556666655555443 2334566666664333
Q ss_pred eEEEEcC--CCCcEEEEeC-CCCceeEEEecCCCCccceeeeeeC
Q psy6570 129 RLYWADP--KARTIESINL-NGKDRFVVYHTEDNGYKPYKLEVFE 170 (713)
Q Consensus 129 ~LY~~d~--~~~~I~~~~~-~g~~~~~~~~~~~~~~~p~~i~~~~ 170 (713)
|.++.. +.-+++.+|. ||. .+.+......-.-|.-|.+++
T Consensus 302 -Vl~ta~~DnSlk~~vfD~~dg~-pR~LR~R~GHs~Pp~~irfy~ 344 (910)
T KOG1539|consen 302 -VLVTAGADNSLKVWVFDSGDGV-PRLLRSRGGHSAPPSCIRFYG 344 (910)
T ss_pred -eEeeccCCCceeEEEeeCCCCc-chheeeccCCCCCchheeeec
Confidence 333332 3345677773 343 333332221223456666663
No 395
>PHA03265 envelope glycoprotein D; Provisional
Probab=44.00 E-value=6.5 Score=39.92 Aligned_cols=35 Identities=20% Similarity=-0.063 Sum_probs=23.4
Q ss_pred cccccccccchhHHHHHHHHHHHHHHhheeeEEEE
Q psy6570 675 ISKKQSYVNSHISSILILILLLITVGGIGYYIFRI 709 (713)
Q Consensus 675 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 709 (713)
.+.+.+...+++++..++.||++.+|+.+.||+|+
T Consensus 342 ~s~~~~~~~g~~ig~~i~glv~vg~il~~~~rr~k 376 (402)
T PHA03265 342 TSKSNSTFVGISVGLGIAGLVLVGVILYVCLRRKK 376 (402)
T ss_pred CCCCCCcccceEEccchhhhhhhhHHHHHHhhhhh
Confidence 34556666777777777777777777766666554
No 396
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=43.88 E-value=4.5e+02 Score=28.42 Aligned_cols=107 Identities=12% Similarity=0.080 Sum_probs=64.0
Q ss_pred ecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEe
Q psy6570 22 LSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGAS 101 (713)
Q Consensus 22 ~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~ 101 (713)
+.....|-+|..|.. +.+-+... ....|.+++..|+....+.... ..+.+|..+....+|.+++.+ .+.+.+
T Consensus 36 ~a~~gGpIAi~~d~~-k~~~~~~~---~p~~I~iys~sG~ll~~i~w~~-~~iv~~~wt~~e~LvvV~~dG---~v~vy~ 107 (410)
T PF04841_consen 36 VAPYGGPIAIIRDES-KLVPVGSA---KPNSIQIYSSSGKLLSSIPWDS-GRIVGMGWTDDEELVVVQSDG---TVRVYD 107 (410)
T ss_pred EcCCCceEEEEecCc-ccccccCC---CCcEEEEECCCCCEeEEEEECC-CCEEEEEECCCCeEEEEEcCC---EEEEEe
Confidence 556666777766642 11111222 2226999999999877766555 778888888766666666544 666777
Q ss_pred cCCCCcEEEE----------eCC----CCCCeeEEEeCCCCeEEEEcCCC
Q psy6570 102 IDGKNKFNLV----------DNN----IQWPTGITIDYPSQRLYWADPKA 137 (713)
Q Consensus 102 ~dG~~~~~l~----------~~~----~~~p~glavd~~~~~LY~~d~~~ 137 (713)
+-|.. +-.+ ... ..+-+|+++-..+.++|......
T Consensus 108 ~~G~~-~fsl~~~i~~~~v~e~~i~~~~~~~~GivvLt~~~~~~~v~n~~ 156 (410)
T PF04841_consen 108 LFGEF-QFSLGEEIEEEKVLECRIFAIWFYKNGIVVLTGNNRFYVVNNID 156 (410)
T ss_pred CCCce-eechhhhccccCcccccccccccCCCCEEEECCCCeEEEEeCcc
Confidence 76655 1100 010 11237888776777777775433
No 397
>KOG0282|consensus
Probab=43.66 E-value=2.9e+02 Score=29.84 Aligned_cols=87 Identities=13% Similarity=0.110 Sum_probs=50.4
Q ss_pred CeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeE
Q psy6570 51 NNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRL 130 (713)
Q Consensus 51 ~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~L 130 (713)
..|-..|........-+..+ ..|.-+-+.|.+..+|++..... +|.-.++.......-....+.....|++-+ +++=
T Consensus 280 ~~lKlwDtETG~~~~~f~~~-~~~~cvkf~pd~~n~fl~G~sd~-ki~~wDiRs~kvvqeYd~hLg~i~~i~F~~-~g~r 356 (503)
T KOG0282|consen 280 RFLKLWDTETGQVLSRFHLD-KVPTCVKFHPDNQNIFLVGGSDK-KIRQWDIRSGKVVQEYDRHLGAILDITFVD-EGRR 356 (503)
T ss_pred eeeeeeccccceEEEEEecC-CCceeeecCCCCCcEEEEecCCC-cEEEEeccchHHHHHHHhhhhheeeeEEcc-CCce
Confidence 34444554443333333333 56888888888889999886655 888777764432222233566667777763 4555
Q ss_pred EEEcCCCCcE
Q psy6570 131 YWADPKARTI 140 (713)
Q Consensus 131 Y~~d~~~~~I 140 (713)
|++.+....|
T Consensus 357 FissSDdks~ 366 (503)
T KOG0282|consen 357 FISSSDDKSV 366 (503)
T ss_pred EeeeccCccE
Confidence 5554444433
No 398
>KOG1334|consensus
Probab=43.58 E-value=2.5e+02 Score=30.47 Aligned_cols=65 Identities=12% Similarity=0.112 Sum_probs=40.6
Q ss_pred cCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEE-EEEcCCCC-----CcceEEEcCCCCcEEEEc
Q psy6570 23 SNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKR-TLLNTGLN-----EPYDIALEPLSGRMFWTE 90 (713)
Q Consensus 23 ~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~-~l~~~~~~-----~p~~iavD~~~~~ly~td 90 (713)
..-....-||+++.+.+-|.+.. ....++-+++...... +++..... .-..||+||.+-+.|-+.
T Consensus 230 ~h~g~vhklav~p~sp~~f~S~g---eD~~v~~~Dlr~~~pa~~~~cr~~~~~~~v~L~~Ia~~P~nt~~faVg 300 (559)
T KOG1334|consen 230 PHEGPVHKLAVEPDSPKPFLSCG---EDAVVFHIDLRQDVPAEKFVCREADEKERVGLYTIAVDPRNTNEFAVG 300 (559)
T ss_pred cccCccceeeecCCCCCcccccc---cccceeeeeeccCCccceeeeeccCCccceeeeeEecCCCCccccccC
Confidence 33445678999999888888877 5566666666543322 22222112 346899999877666553
No 399
>COG5167 VID27 Protein involved in vacuole import and degradation [Intracellular trafficking and secretion]
Probab=43.01 E-value=3.2e+02 Score=29.94 Aligned_cols=71 Identities=15% Similarity=0.222 Sum_probs=37.3
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEec--CCceEEEEEcCCCCCcceEEEcCC-------CCcEEEEccCCCCeEE
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTL--EGRKKRTLLNTGLNEPYDIALEPL-------SGRMFWTELGIKPRIS 98 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~--~G~~~~~l~~~~~~~p~~iavD~~-------~~~ly~td~~~~~~I~ 98 (713)
-..|+|.+.+++-|++.. ++|-++.- ++...-+..-..+..|.|-.|||. ...|.++|.+..-+++
T Consensus 419 Ns~L~Vgfrn~rsyVtR~-----n~IGVFk~~de~~LeF~aaiknvs~~~GKSidp~K~mlh~~dssli~~dg~~~~kLy 493 (776)
T COG5167 419 NSHLVVGFRNERSYVTRG-----NSIGVFKNTDEGSLEFKAAIKNVSDDGGKSIDPEKIMLHDNDSSLIYLDGGERDKLY 493 (776)
T ss_pred CceEEEEEcccceeEeeC-----CeeeeEeccCCcceehhhhhhhccCCCCCcCChhhceeecCCcceEEecCCCcccce
Confidence 345777778888888865 45555443 332211111123455555555553 3444555544333677
Q ss_pred EEecC
Q psy6570 99 GASID 103 (713)
Q Consensus 99 ~~~~d 103 (713)
.++..
T Consensus 494 kmDIE 498 (776)
T COG5167 494 KMDIE 498 (776)
T ss_pred eeecc
Confidence 76664
No 400
>PF15102 TMEM154: TMEM154 protein family
Probab=42.28 E-value=11 Score=33.45 Aligned_cols=16 Identities=38% Similarity=0.461 Sum_probs=6.3
Q ss_pred hHHHHHHHHHH-HHHHh
Q psy6570 686 ISSILILILLL-ITVGG 701 (713)
Q Consensus 686 ~~~~~~~~~~~-~~~~~ 701 (713)
++.++|-++|| ++|++
T Consensus 58 iLmIlIP~VLLvlLLl~ 74 (146)
T PF15102_consen 58 ILMILIPLVLLVLLLLS 74 (146)
T ss_pred EEEEeHHHHHHHHHHHH
Confidence 33444443333 44443
No 401
>KOG3509|consensus
Probab=42.10 E-value=1.2e+02 Score=36.12 Aligned_cols=36 Identities=25% Similarity=0.654 Sum_probs=30.1
Q ss_pred CccCCCCCCCCCCCCeeeccCCCceeeeCCCCcccc
Q psy6570 218 NVTNHCDDKPCHQSALCINLPSSHTCLCPDHLTEEL 253 (713)
Q Consensus 218 ~~~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~~~~ 253 (713)
...+.|...||.....|..++-...|.|+.||+|.+
T Consensus 404 c~g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~ 439 (964)
T KOG3509|consen 404 CLGDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDS 439 (964)
T ss_pred cCCCccccccCCCCccccccccccceeccccccCch
Confidence 345678888998888999999899999999999754
No 402
>PF11134 Phage_stabilise: Phage stabilisation protein; InterPro: IPR021098 This entry represents the Bacteriophage P22, Gp10, DNA-stabilising protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Members of this family are phage proteins involved with stabilising the head assembly unit and condensed DNA within the capsid [].
Probab=41.82 E-value=4.8e+02 Score=28.15 Aligned_cols=70 Identities=17% Similarity=0.244 Sum_probs=38.7
Q ss_pred EEeCCCCeEEEEcCCCCcEEEEeCCCCce-e---EEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCc
Q psy6570 122 TIDYPSQRLYWADPKARTIESINLNGKDR-F---VVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSD 193 (713)
Q Consensus 122 avd~~~~~LY~~d~~~~~I~~~~~~g~~~-~---~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~ 193 (713)
.+....++-.|...++..+...++..... . .+.+...+...-.+|+++.+.||.-- ..+|..+...|...
T Consensus 140 Dv~~~dGryVw~~pgt~~f~vSdL~D~T~~d~~~~~ytAEsqPD~Ivgl~~~r~~I~~fG--~~TiEvf~nTGasd 213 (469)
T PF11134_consen 140 DVTRLDGRYVWVKPGTGYFFVSDLEDETKPDRYLDFYTAESQPDNIVGLAVWRREIWCFG--ASTIEVFYNTGASD 213 (469)
T ss_pred EEEeccceEEEEeCCCceEEEeecccccCcchhhhhhhhccCCCceEEEEEeeeeEEEEe--cccEEEEEccCCcc
Confidence 33334677777777777888877765322 2 22233333334467778877776653 33444444444433
No 403
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=41.52 E-value=8.7 Score=37.82 Aligned_cols=16 Identities=25% Similarity=0.297 Sum_probs=0.0
Q ss_pred HHHHHHHHhheeeEEE
Q psy6570 693 ILLLITVGGIGYYIFR 708 (713)
Q Consensus 693 ~~~~~~~~~~~~~~~~ 708 (713)
+++||++++++++|+|
T Consensus 60 vvliiIIiIImlF~Rr 75 (381)
T PF05297_consen 60 VVLIIIIIIIMLFKRR 75 (381)
T ss_dssp ----------------
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3333444444445544
No 404
>PF04863 EGF_alliinase: Alliinase EGF-like domain; InterPro: IPR006947 Allicin is a thiosulphinate that gives rise to dithiines, allyl sulphides and ajoenes, the three groups of active compounds in Allium species. Allicin is synthesised from sulphoxide cysteine derivatives by alliinase, whose C-S lyase activity cleaves C(beta)-S(gamma) bonds. It is thought that this enzyme forms part of a primitive plant defence system [].; GO: 0016846 carbon-sulfur lyase activity; PDB: 1LK9_B 2HOX_C 2HOR_A.
Probab=41.30 E-value=11 Score=27.19 Aligned_cols=22 Identities=41% Similarity=0.928 Sum_probs=14.0
Q ss_pred CCCceeeCCCCcccCCCCcccc
Q psy6570 652 DLKPICICPRGYAGVRCQTLVH 673 (713)
Q Consensus 652 ~~~~~C~C~~Gy~G~~C~~~~~ 673 (713)
++...|+|..-|.|.+|++.+.
T Consensus 33 dG~p~CECn~Cy~GpdCS~~~~ 54 (56)
T PF04863_consen 33 DGSPVCECNSCYGGPDCSTLIP 54 (56)
T ss_dssp TTEE--EE-TTEESTTS-EE-T
T ss_pred cCCccccccCCcCCCCcccCCC
Confidence 3447899999999999988764
No 405
>KOG0645|consensus
Probab=41.04 E-value=3.9e+02 Score=26.83 Aligned_cols=124 Identities=12% Similarity=0.026 Sum_probs=63.9
Q ss_pred CCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCC---cEEEEe-CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC
Q psy6570 71 LNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKN---KFNLVD-NNIQWPTGITIDYPSQRLYWADPKARTIESINLN 146 (713)
Q Consensus 71 ~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~---~~~l~~-~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~ 146 (713)
..+...+|..|..|.|+.+-...+ .|..-+..+.. .++++. ..-+....+|..|.+..|-.+....-.+..-+.+
T Consensus 14 ~~r~W~~awhp~~g~ilAscg~Dk-~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~t~~Iw~k~~ 92 (312)
T KOG0645|consen 14 KDRVWSVAWHPGKGVILASCGTDK-AVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDATVVIWKKED 92 (312)
T ss_pred CCcEEEEEeccCCceEEEeecCCc-eEEEEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeeccceEEEeecCC
Confidence 356778899887677777765544 55555555321 222222 1234567899997766444443332222222234
Q ss_pred CCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCccee
Q psy6570 147 GKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSDFNV 196 (713)
Q Consensus 147 g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~~~~ 196 (713)
+.... +......-.....++.. .++++-+-.....|+......+.....
T Consensus 93 ~efec-v~~lEGHEnEVK~Vaws~sG~~LATCSRDKSVWiWe~deddEfec 142 (312)
T KOG0645|consen 93 GEFEC-VATLEGHENEVKCVAWSASGNYLATCSRDKSVWIWEIDEDDEFEC 142 (312)
T ss_pred CceeE-EeeeeccccceeEEEEcCCCCEEEEeeCCCeEEEEEecCCCcEEE
Confidence 43322 22111112233455554 344555556667787777665555443
No 406
>KOG0277|consensus
Probab=39.93 E-value=3.9e+02 Score=26.56 Aligned_cols=37 Identities=22% Similarity=0.178 Sum_probs=20.6
Q ss_pred CCcceEEEcCCCCcEEEEccCCC-CeEEEEecCCCCcE
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIK-PRISGASIDGKNKF 108 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~-~~I~~~~~dG~~~~ 108 (713)
....+.+..|....||-+-++.. -+|+-++..|+...
T Consensus 148 ~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~gk~~~ 185 (311)
T KOG0277|consen 148 SCIYQAAFSPHIPNLFASASGDGTLRLWDVRSPGKFMS 185 (311)
T ss_pred cEEEEEecCCCCCCeEEEccCCceEEEEEecCCCceeE
Confidence 34555666666677766655432 35555555555443
No 407
>KOG0276|consensus
Probab=39.83 E-value=6e+02 Score=28.70 Aligned_cols=154 Identities=7% Similarity=-0.003 Sum_probs=80.8
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcc-eEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPY-DIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~-~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
.+..+|.+.|..-.+.-+- -++.+.+.+.+......-+... ..|. .-.+-+ +...+++.+... +|.+++.+.
T Consensus 14 dRVKsVd~HPtePw~la~L----ynG~V~IWnyetqtmVksfeV~-~~PvRa~kfia-RknWiv~GsDD~-~IrVfnynt 86 (794)
T KOG0276|consen 14 DRVKSVDFHPTEPWILAAL----YNGDVQIWNYETQTMVKSFEVS-EVPVRAAKFIA-RKNWIVTGSDDM-QIRVFNYNT 86 (794)
T ss_pred CceeeeecCCCCceEEEee----ecCeeEEEecccceeeeeeeec-ccchhhheeee-ccceEEEecCCc-eEEEEeccc
Confidence 3444455554443333222 2556666666554322212111 2232 233332 334445554444 899999887
Q ss_pred CCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeE-EEecCCCCccceeeeee--CCeEEEEeCCCC
Q psy6570 105 KNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFV-VYHTEDNGYKPYKLEVF--EDNLYFSTYRTN 181 (713)
Q Consensus 105 ~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~-~~~~~~~~~~p~~i~~~--~~~ly~td~~~~ 181 (713)
-.+...+.....+...|++.|..-.+ .+.+..-.|...+.++.-... .+.. .-+..+.|++. +..-|.+.....
T Consensus 87 ~ekV~~FeAH~DyIR~iavHPt~P~v-LtsSDDm~iKlW~we~~wa~~qtfeG--H~HyVMqv~fnPkD~ntFaS~sLDr 163 (794)
T KOG0276|consen 87 GEKVKTFEAHSDYIRSIAVHPTLPYV-LTSSDDMTIKLWDWENEWACEQTFEG--HEHYVMQVAFNPKDPNTFASASLDR 163 (794)
T ss_pred ceeeEEeeccccceeeeeecCCCCeE-EecCCccEEEEeeccCceeeeeEEcC--cceEEEEEEecCCCccceeeeeccc
Confidence 77777777777778899999765443 344455567677776653321 1222 12334444443 445666666666
Q ss_pred cEEEEccc
Q psy6570 182 NILKINKF 189 (713)
Q Consensus 182 ~i~~~~~~ 189 (713)
+|...+..
T Consensus 164 TVKVWslg 171 (794)
T KOG0276|consen 164 TVKVWSLG 171 (794)
T ss_pred cEEEEEcC
Confidence 66555543
No 408
>KOG2096|consensus
Probab=39.40 E-value=4.4e+02 Score=27.05 Aligned_cols=96 Identities=15% Similarity=0.135 Sum_probs=55.0
Q ss_pred EeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEE--EecCCCCcEEE
Q psy6570 33 VDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISG--ASIDGKNKFNL 110 (713)
Q Consensus 33 ~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~--~~~dG~~~~~l 110 (713)
+.-.++..|+...+ ....|...++.|+....+-... ..-..-||.|..++|-.+..-..-.+|. +..||+...+.
T Consensus 193 iGiA~~~k~imsas--~dt~i~lw~lkGq~L~~idtnq-~~n~~aavSP~GRFia~~gFTpDVkVwE~~f~kdG~fqev~ 269 (420)
T KOG2096|consen 193 IGIAGNAKYIMSAS--LDTKICLWDLKGQLLQSIDTNQ-SSNYDAAVSPDGRFIAVSGFTPDVKVWEPIFTKDGTFQEVK 269 (420)
T ss_pred EeecCCceEEEEec--CCCcEEEEecCCceeeeecccc-ccccceeeCCCCcEEEEecCCCCceEEEEEeccCcchhhhh
Confidence 33345555554432 4677889999988776665433 4456678888777776665443334553 35577654332
Q ss_pred --Ee--CCCCCCeeEEEeCCCCeEE
Q psy6570 111 --VD--NNIQWPTGITIDYPSQRLY 131 (713)
Q Consensus 111 --~~--~~~~~p~glavd~~~~~LY 131 (713)
+. .....-..++|++...++.
T Consensus 270 rvf~LkGH~saV~~~aFsn~S~r~v 294 (420)
T KOG2096|consen 270 RVFSLKGHQSAVLAAAFSNSSTRAV 294 (420)
T ss_pred hhheeccchhheeeeeeCCCcceeE
Confidence 11 1122345677776555544
No 409
>KOG0650|consensus
Probab=39.30 E-value=2.5e+02 Score=31.33 Aligned_cols=70 Identities=11% Similarity=0.128 Sum_probs=36.8
Q ss_pred CCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc-eeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEccc
Q psy6570 116 QWPTGITIDYPSQRLYWADPKARTIESINLNGKD-RFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKF 189 (713)
Q Consensus 116 ~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~-~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~ 189 (713)
..|..+.|.+..-+|||+....-|| +++.-.. ++.+... ..+..-++|...+++|++... .+++.+++.+
T Consensus 567 G~vq~v~FHPs~p~lfVaTq~~vRi--YdL~kqelvKkL~tg-~kwiS~msihp~GDnli~gs~-d~k~~WfDld 637 (733)
T KOG0650|consen 567 GLVQRVKFHPSKPYLFVATQRSVRI--YDLSKQELVKKLLTG-SKWISSMSIHPNGDNLILGSY-DKKMCWFDLD 637 (733)
T ss_pred CceeEEEecCCCceEEEEeccceEE--EehhHHHHHHHHhcC-CeeeeeeeecCCCCeEEEecC-CCeeEEEEcc
Confidence 3567788999999999998655444 4443221 1122221 112222333333567766643 3455556543
No 410
>KOG1273|consensus
Probab=38.81 E-value=4.5e+02 Score=26.97 Aligned_cols=153 Identities=10% Similarity=0.058 Sum_probs=79.2
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEec-CCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASI-DGKN 106 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~-dG~~ 106 (713)
...+.|..-+..|-+.- .+++|.+.+++......++..-..-...|+..+ .|++..|.+..+ .|..-++ +|+.
T Consensus 26 a~~~~Fs~~G~~lAvGc----~nG~vvI~D~~T~~iar~lsaH~~pi~sl~WS~-dgr~LltsS~D~-si~lwDl~~gs~ 99 (405)
T KOG1273|consen 26 AECCQFSRWGDYLAVGC----ANGRVVIYDFDTFRIARMLSAHVRPITSLCWSR-DGRKLLTSSRDW-SIKLWDLLKGSP 99 (405)
T ss_pred cceEEeccCcceeeeec----cCCcEEEEEccccchhhhhhccccceeEEEecC-CCCEeeeecCCc-eeEEEeccCCCc
Confidence 45566665544444433 478899999987664444433333455688875 566666665555 4555554 4554
Q ss_pred cEEEEeCCCCCC-eeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCC-CCcccee---eeeeCCeEEEEeCCCC
Q psy6570 107 KFNLVDNNIQWP-TGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTED-NGYKPYK---LEVFEDNLYFSTYRTN 181 (713)
Q Consensus 107 ~~~l~~~~~~~p-~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~-~~~~p~~---i~~~~~~ly~td~~~~ 181 (713)
.+.+ .+..| .+..+.|...+.+++-.-...-+.++++....++|..... .+..... ++-.+.+|| +-..++
T Consensus 100 l~ri---rf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~fdr~g~yIi-tGtsKG 175 (405)
T KOG1273|consen 100 LKRI---RFDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGVFDRRGKYII-TGTSKG 175 (405)
T ss_pred eeEE---EccCccceeeeccccCCeEEEEEecCCcEEEEecCCceeeccCCCccccccccccccccCCCCEEE-EecCcc
Confidence 3333 23333 4667777777777776433334444444444444433322 1111111 111233444 444566
Q ss_pred cEEEEcccC
Q psy6570 182 NILKINKFG 190 (713)
Q Consensus 182 ~i~~~~~~~ 190 (713)
.+..++..+
T Consensus 176 kllv~~a~t 184 (405)
T KOG1273|consen 176 KLLVYDAET 184 (405)
T ss_pred eEEEEecch
Confidence 666666543
No 411
>PHA02790 Kelch-like protein; Provisional
Probab=38.03 E-value=6e+02 Score=28.14 Aligned_cols=116 Identities=14% Similarity=0.105 Sum_probs=59.8
Q ss_pred ceeEEEccCcccEEecCCCCCc-eEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcc---eEEEcCCC
Q psy6570 8 NVTRVKREMNLKTVLSNLHDPR-GVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPY---DIALEPLS 83 (713)
Q Consensus 8 ~I~~~~~~~~~~~~~~~~~~p~-gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~---~iavD~~~ 83 (713)
.+.++++..+.-..++.+..|+ +.+.-..+++||+.-........+.++++....=+.+ ..+..|+ ++++ .+
T Consensus 332 sve~ydp~~n~W~~~~~l~~~r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~~W~~~--~~m~~~r~~~~~~~--~~ 407 (480)
T PHA02790 332 SVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETDTTTEYLLPNHDQWQFG--PSTYYPHYKSCALV--FG 407 (480)
T ss_pred ceEEEECCCCeEEECCCCCCCCcccEEEEECCEEEEecCcCCCCccEEEEeCCCCEEEeC--CCCCCccccceEEE--EC
Confidence 4556666554333445565554 2233335789998744211234577777765332221 2344443 2333 57
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCC---CeeEEEeCCCCeEEEEcC
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFNLVDNNIQW---PTGITIDYPSQRLYWADP 135 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~---p~glavd~~~~~LY~~d~ 135 (713)
+.||+... ..++.+++. ..-+.+. .+.. -.++++- +++||+.-.
T Consensus 408 ~~IYv~GG----~~e~ydp~~-~~W~~~~-~m~~~r~~~~~~v~--~~~IYviGG 454 (480)
T PHA02790 408 RRLFLVGR----NAEFYCESS-NTWTLID-DPIYPRDNPELIIV--DNKLLLIGG 454 (480)
T ss_pred CEEEEECC----ceEEecCCC-CcEeEcC-CCCCCccccEEEEE--CCEEEEECC
Confidence 89999862 356666653 3333332 2222 2344444 678998764
No 412
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=37.78 E-value=31 Score=36.05 Aligned_cols=13 Identities=8% Similarity=-0.130 Sum_probs=6.0
Q ss_pred HHhheeeEEEEec
Q psy6570 699 VGGIGYYIFRIKM 711 (713)
Q Consensus 699 ~~~~~~~~~~~~~ 711 (713)
.++++++.+|||.
T Consensus 403 ~~~~~~~v~rrr~ 415 (436)
T PTZ00208 403 AVAFFIMVKRRRN 415 (436)
T ss_pred HHHhheeeeeccC
Confidence 3334455555543
No 413
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=37.72 E-value=1.2e+02 Score=20.80 Aligned_cols=29 Identities=7% Similarity=0.109 Sum_probs=20.6
Q ss_pred cceeeeeeCCeEEEEeCCCCcEEEEcccCC
Q psy6570 162 KPYKLEVFEDNLYFSTYRTNNILKINKFGN 191 (713)
Q Consensus 162 ~p~~i~~~~~~ly~td~~~~~i~~~~~~~~ 191 (713)
...+|.+.++++|++++.. .+..++....
T Consensus 3 ~a~~v~v~g~yaYva~~~~-Gl~IvDISnP 31 (42)
T PF08309_consen 3 DARDVAVSGNYAYVADGNN-GLVIVDISNP 31 (42)
T ss_pred eEEEEEEECCEEEEEeCCC-CEEEEECCCC
Confidence 3567888999999997664 4666765433
No 414
>KOG1445|consensus
Probab=37.21 E-value=2.7e+02 Score=31.24 Aligned_cols=27 Identities=11% Similarity=0.244 Sum_probs=21.9
Q ss_pred EeCCCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 123 IDYPSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 123 vd~~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
+|++.+.||.+-.+.++|+.+.+....
T Consensus 270 ~DpDt~llfLaGKG~~~l~~lE~~d~q 296 (1012)
T KOG1445|consen 270 YDPDTRLLFLAGKGTNKLFMLEMQDRQ 296 (1012)
T ss_pred ecCCCceEEEecCCcceEEEEEecCCC
Confidence 588899999999999998888765543
No 415
>KOG3621|consensus
Probab=37.10 E-value=2.1e+02 Score=32.46 Aligned_cols=59 Identities=15% Similarity=0.212 Sum_probs=36.0
Q ss_pred CCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC-----ceeEEEecCCCCccceeeeeeCCeEEEEeCC
Q psy6570 117 WPTGITIDYPSQRLYWADPKARTIESINLNGK-----DRFVVYHTEDNGYKPYKLEVFEDNLYFSTYR 179 (713)
Q Consensus 117 ~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~-----~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~ 179 (713)
..+.|++++...+||..|. .++|.-..++-. ..+.+.... ..-.-|++.+.+|.++...
T Consensus 126 rVTal~Ws~~~~k~ysGD~-~Gkv~~~~L~s~~~~~~~~q~il~~d---s~IVQlD~~q~~LLVStl~ 189 (726)
T KOG3621|consen 126 RVTALEWSKNGMKLYSGDS-QGKVVLTELDSRQAFLSKSQEILSED---SEIVQLDYLQSYLLVSTLT 189 (726)
T ss_pred eEEEEEecccccEEeecCC-CceEEEEEechhhhhccccceeeccC---cceEEeecccceehHhhhh
Confidence 4578999999999999985 455555555541 122222221 1224567778888777544
No 416
>KOG3881|consensus
Probab=36.88 E-value=5.3e+02 Score=27.23 Aligned_cols=74 Identities=14% Similarity=0.113 Sum_probs=46.0
Q ss_pred CCeeEEEeCC-CCeEEEEcCCCCcEEEEeCCCCceeEEEecCC--CCccceeeeeeCCeEEEEeCCCCcEEEEcccCCC
Q psy6570 117 WPTGITIDYP-SQRLYWADPKARTIESINLNGKDRFVVYHTED--NGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNS 192 (713)
Q Consensus 117 ~p~glavd~~-~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~--~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~ 192 (713)
|+++|.|-+. -.+-|.+-+..+.++.+|..- .|+.+..... ....-+++...++.||.++.. +.+..++..++.
T Consensus 204 W~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~~-qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~-g~l~~FD~r~~k 280 (412)
T KOG3881|consen 204 WITDIRFLEGSPNYKFATITRYHQVRLYDTRH-QRRPVAQFDFLENPISSTGLTPSGNFIYTGNTK-GQLAKFDLRGGK 280 (412)
T ss_pred eeccceecCCCCCceEEEEecceeEEEecCcc-cCcceeEeccccCcceeeeecCCCcEEEEeccc-chhheecccCce
Confidence 4566666532 257777888888888888763 3444444322 223345666778889998754 456677655443
No 417
>PF14283 DUF4366: Domain of unknown function (DUF4366)
Probab=36.70 E-value=41 Score=32.62 Aligned_cols=14 Identities=29% Similarity=0.365 Sum_probs=5.6
Q ss_pred HHHHhheeeEEEEe
Q psy6570 697 ITVGGIGYYIFRIK 710 (713)
Q Consensus 697 ~~~~~~~~~~~~~~ 710 (713)
++..+.++++++++
T Consensus 172 ~gGGa~yYfK~~K~ 185 (218)
T PF14283_consen 172 IGGGAYYYFKFYKP 185 (218)
T ss_pred hhcceEEEEEEecc
Confidence 33333444444443
No 418
>KOG0196|consensus
Probab=36.33 E-value=46 Score=38.31 Aligned_cols=12 Identities=25% Similarity=0.625 Sum_probs=7.8
Q ss_pred CceeeCCCCccc
Q psy6570 368 TYKCHCAPSYTG 379 (713)
Q Consensus 368 ~~~C~C~~G~~g 379 (713)
...|.|.+||.-
T Consensus 258 iG~C~C~aGye~ 269 (996)
T KOG0196|consen 258 IGGCVCKAGYEE 269 (996)
T ss_pred cCceeecCCCCc
Confidence 346777777753
No 419
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=36.11 E-value=4.3e+02 Score=25.99 Aligned_cols=74 Identities=11% Similarity=0.083 Sum_probs=49.9
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeC----C-CCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDN----N-IQWPTGITIDYPSQRLYWADPKARTIESINLN 146 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~----~-~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~ 146 (713)
.+..||.+-|.+|.||-... .++|+.++......+.+-.. . ...+.++-|+|.-+||.+.-. .+.-+|++++
T Consensus 27 e~l~GID~Rpa~G~LYgl~~--~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs~-~GqNlR~npd 103 (236)
T PF14339_consen 27 ESLVGIDFRPANGQLYGLGS--TGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVSN-TGQNLRLNPD 103 (236)
T ss_pred CeEEEEEeecCCCCEEEEeC--CCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEcc-CCcEEEECCC
Confidence 45678999999999998732 23899998876554444111 1 123677888888889988743 4555677766
Q ss_pred CC
Q psy6570 147 GK 148 (713)
Q Consensus 147 g~ 148 (713)
-.
T Consensus 104 tG 105 (236)
T PF14339_consen 104 TG 105 (236)
T ss_pred CC
Confidence 33
No 420
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=35.81 E-value=4.5e+02 Score=26.13 Aligned_cols=146 Identities=14% Similarity=0.079 Sum_probs=69.6
Q ss_pred ccCCeEEEeecCCCCCCeEEEEec-CCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEE---
Q psy6570 35 WVGKNLYWTDAGGRSSNNIMVSTL-EGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNL--- 110 (713)
Q Consensus 35 ~~~~~ly~td~~~~~~~~I~~~~~-~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l--- 110 (713)
..+++||+.-. .+ |++++. +....+.+.. ......|++-+.-+.|++-..+ .|+..+++.-.....
T Consensus 5 ~~~~~L~vGt~----~G-l~~~~~~~~~~~~~i~~--~~~I~ql~vl~~~~~llvLsd~---~l~~~~L~~l~~~~~~~~ 74 (275)
T PF00780_consen 5 SWGDRLLVGTE----DG-LYVYDLSDPSKPTRILK--LSSITQLSVLPELNLLLVLSDG---QLYVYDLDSLEPVSTSAP 74 (275)
T ss_pred cCCCEEEEEEC----CC-EEEEEecCCccceeEee--cceEEEEEEecccCEEEEEcCC---ccEEEEchhhcccccccc
Confidence 34577887732 33 777776 3333333321 2337777777665655543222 566655542211110
Q ss_pred -----------EeCCCCCCeeEE---EeCCCCeEEEEcCCCCcEEEEeCCCC--ce-eEEEecCCCCccceeeeeeCCeE
Q psy6570 111 -----------VDNNIQWPTGIT---IDYPSQRLYWADPKARTIESINLNGK--DR-FVVYHTEDNGYKPYKLEVFEDNL 173 (713)
Q Consensus 111 -----------~~~~~~~p~gla---vd~~~~~LY~~d~~~~~I~~~~~~g~--~~-~~~~~~~~~~~~p~~i~~~~~~l 173 (713)
.....+...-++ ......+|.++... +|..+.+... .. ..+.+.. ....|..|++.++.|
T Consensus 75 ~~~~~~~~~~~~~~~~~~v~~f~~~~~~~~~~~L~va~kk--~i~i~~~~~~~~~f~~~~ke~~-lp~~~~~i~~~~~~i 151 (275)
T PF00780_consen 75 LAFPKSRSLPTKLPETKGVSFFAVNGGHEGSRRLCVAVKK--KILIYEWNDPRNSFSKLLKEIS-LPDPPSSIAFLGNKI 151 (275)
T ss_pred ccccccccccccccccCCeeEEeeccccccceEEEEEECC--EEEEEEEECCcccccceeEEEE-cCCCcEEEEEeCCEE
Confidence 001122222233 12122345555433 5555555442 22 2222222 135788899888888
Q ss_pred EEEeCCCCcEEEEcccCCCcce
Q psy6570 174 YFSTYRTNNILKINKFGNSDFN 195 (713)
Q Consensus 174 y~td~~~~~i~~~~~~~~~~~~ 195 (713)
.+.. .+....++...+....
T Consensus 152 ~v~~--~~~f~~idl~~~~~~~ 171 (275)
T PF00780_consen 152 CVGT--SKGFYLIDLNTGSPSE 171 (275)
T ss_pred EEEe--CCceEEEecCCCCceE
Confidence 8875 3345556655443333
No 421
>KOG0639|consensus
Probab=35.76 E-value=3.8e+02 Score=29.28 Aligned_cols=73 Identities=7% Similarity=0.039 Sum_probs=42.9
Q ss_pred CeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCccceeeeee-CCeEEEEeCCCCcEEEEcccCCCc
Q psy6570 118 PTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVF-EDNLYFSTYRTNNILKINKFGNSD 193 (713)
Q Consensus 118 p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~-~~~ly~td~~~~~i~~~~~~~~~~ 193 (713)
-.+||+.++.+ |-|+--..+.|...|+.. ..++...........-|++. ++.-.||-...+.|+..+...+..
T Consensus 512 CyALa~spDak-vcFsccsdGnI~vwDLhn--q~~VrqfqGhtDGascIdis~dGtklWTGGlDntvRcWDlregrq 585 (705)
T KOG0639|consen 512 CYALAISPDAK-VCFSCCSDGNIAVWDLHN--QTLVRQFQGHTDGASCIDISKDGTKLWTGGLDNTVRCWDLREGRQ 585 (705)
T ss_pred hhhhhcCCccc-eeeeeccCCcEEEEEccc--ceeeecccCCCCCceeEEecCCCceeecCCCccceeehhhhhhhh
Confidence 45788887654 445554556677666643 33443331112223445554 567789988888888888765543
No 422
>PHA03098 kelch-like protein; Provisional
Probab=35.35 E-value=6.8e+02 Score=28.01 Aligned_cols=80 Identities=15% Similarity=0.003 Sum_probs=39.3
Q ss_pred CceeEEEccCcccEEecCCCCCc-eEEEeccCCeEEEeecCC---CCCCeEEEEecCCceEEEEEcCCCCCcc---eEEE
Q psy6570 7 GNVTRVKREMNLKTVLSNLHDPR-GVAVDWVGKNLYWTDAGG---RSSNNIMVSTLEGRKKRTLLNTGLNEPY---DIAL 79 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~~~p~-gla~D~~~~~ly~td~~~---~~~~~I~~~~~~G~~~~~l~~~~~~~p~---~iav 79 (713)
..+.++++.++.-.....+..|+ +.+.-..+++||+.-... ...+.+.++++....=+.+. .+..|. .+++
T Consensus 358 ~~v~~yd~~~~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~--~~p~~r~~~~~~~ 435 (534)
T PHA03098 358 NTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGS--PLPISHYGGCAIY 435 (534)
T ss_pred ceEEEEcCCCCceeeCCCcCcCCccceEEEECCEEEEECCcCCCCcccceEEEEeCCCCeeeecC--CCCccccCceEEE
Confidence 34556666654433334444443 222223568999864311 11356888887654322221 222222 2333
Q ss_pred cCCCCcEEEEc
Q psy6570 80 EPLSGRMFWTE 90 (713)
Q Consensus 80 D~~~~~ly~td 90 (713)
.++.||+..
T Consensus 436 --~~~~iyv~G 444 (534)
T PHA03098 436 --HDGKIYVIG 444 (534)
T ss_pred --ECCEEEEEC
Confidence 467899875
No 423
>KOG0275|consensus
Probab=35.35 E-value=5e+02 Score=26.51 Aligned_cols=88 Identities=8% Similarity=0.030 Sum_probs=48.1
Q ss_pred eeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCCCCcc--ceeeeeeCCeEEEEeCCCCcEEEEcccCCCccee
Q psy6570 119 TGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTEDNGYK--PYKLEVFEDNLYFSTYRTNNILKINKFGNSDFNV 196 (713)
Q Consensus 119 ~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~~~~~--p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~ 196 (713)
+.+.+-|.+-.-|+.-..++.|+.+++.|.-++.+.+....... .-.+...++++|-+. ..+.++-+....+....+
T Consensus 396 nsv~~~PKnpeh~iVCNrsntv~imn~qGQvVrsfsSGkREgGdFi~~~lSpkGewiYcig-ED~vlYCF~~~sG~LE~t 474 (508)
T KOG0275|consen 396 NSVILLPKNPEHFIVCNRSNTVYIMNMQGQVVRSFSSGKREGGDFINAILSPKGEWIYCIG-EDGVLYCFSVLSGKLERT 474 (508)
T ss_pred eeEEEcCCCCceEEEEcCCCeEEEEeccceEEeeeccCCccCCceEEEEecCCCcEEEEEc-cCcEEEEEEeecCceeee
Confidence 33444455544455555678899999988766665543221111 122334477888764 345566676655555455
Q ss_pred eeccccccccE
Q psy6570 197 LANNLNRASDV 207 (713)
Q Consensus 197 ~~~~~~~~~~i 207 (713)
+...-..+.+|
T Consensus 475 l~VhEkdvIGl 485 (508)
T KOG0275|consen 475 LPVHEKDVIGL 485 (508)
T ss_pred eeccccccccc
Confidence 54443444444
No 424
>KOG0321|consensus
Probab=35.07 E-value=5.2e+02 Score=29.16 Aligned_cols=201 Identities=8% Similarity=0.027 Sum_probs=0.0
Q ss_pred CCcccCCceeEEEccCcc----cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce--------------
Q psy6570 1 MASISSGNVTRVKREMNL----KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK-------------- 62 (713)
Q Consensus 1 vad~~~~~I~~~~~~~~~----~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~-------------- 62 (713)
|+...+.+|.-.+..+.. .+.+..-.....+++-+.+..+|++-. ..+.|.+.++.-+.
T Consensus 116 VsasGDsT~r~Wdvk~s~l~G~~~~~GH~~SvkS~cf~~~n~~vF~tGg---RDg~illWD~R~n~~d~~e~~~~~~~~~ 192 (720)
T KOG0321|consen 116 VSASGDSTIRPWDVKTSRLVGGRLNLGHTGSVKSECFMPTNPAVFCTGG---RDGEILLWDCRCNGVDALEEFDNRIYGR 192 (720)
T ss_pred EEccCCceeeeeeeccceeecceeecccccccchhhhccCCCcceeecc---CCCcEEEEEEeccchhhHHHHhhhhhcc
Q ss_pred ------EEEEEcCCCCCcceEEEcCCCC---cEEEEccCCCCeEEEEec-CCCCcEEEEe--------------------
Q psy6570 63 ------KRTLLNTGLNEPYDIALEPLSG---RMFWTELGIKPRISGASI-DGKNKFNLVD-------------------- 112 (713)
Q Consensus 63 ------~~~l~~~~~~~p~~iavD~~~~---~ly~td~~~~~~I~~~~~-dG~~~~~l~~-------------------- 112 (713)
...........+.+-+.+ ..+ .+||-|.. .|..+.. |+..+.=-+.
T Consensus 193 ~n~~ptpskp~~kr~~k~kA~s~t-i~ssvTvv~fkDe~---tlaSaga~D~~iKVWDLRk~~~~~r~ep~~~~~~~t~s 268 (720)
T KOG0321|consen 193 HNTAPTPSKPLKKRIRKWKAASNT-IFSSVTVVLFKDES---TLASAGAADSTIKVWDLRKNYTAYRQEPRGSDKYPTHS 268 (720)
T ss_pred ccCCCCCCchhhccccccccccCc-eeeeeEEEEEeccc---eeeeccCCCcceEEEeecccccccccCCCcccCccCcc
Q ss_pred CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEEecCC----CCccceeeeeeCCeEEEEeCCCCcEEEEcc
Q psy6570 113 NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVYHTED----NGYKPYKLEVFEDNLYFSTYRTNNILKINK 188 (713)
Q Consensus 113 ~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~~~~~----~~~~p~~i~~~~~~ly~td~~~~~i~~~~~ 188 (713)
.....-..|.+|..+.+||..=. .++|+.+++.+-....+..... .+..-.-+..++.+|.-+.+......++-.
T Consensus 269 krs~G~~nL~lDssGt~L~AsCt-D~sIy~ynm~s~s~sP~~~~sg~~~~sf~vks~lSpd~~~l~SgSsd~~ayiw~vs 347 (720)
T KOG0321|consen 269 KRSVGQVNLILDSSGTYLFASCT-DNSIYFYNMRSLSISPVAEFSGKLNSSFYVKSELSPDDCSLLSGSSDEQAYIWVVS 347 (720)
T ss_pred cceeeeEEEEecCCCCeEEEEec-CCcEEEEeccccCcCchhhccCcccceeeeeeecCCCCceEeccCCCcceeeeeec
Q ss_pred cCCCcceeeeccccccccEEE
Q psy6570 189 FGNSDFNVLANNLNRASDVLI 209 (713)
Q Consensus 189 ~~~~~~~~~~~~~~~~~~i~v 209 (713)
.-.....++......++.+..
T Consensus 348 ~~e~~~~~l~Ght~eVt~V~w 368 (720)
T KOG0321|consen 348 SPEAPPALLLGHTREVTTVRW 368 (720)
T ss_pred CccCChhhhhCcceEEEEEee
No 425
>PF04639 Baculo_E56: Baculoviral E56 protein, specific to ODV envelope; InterPro: IPR006733 This family represents the E56 protein, which is localized to the occlusion derived virus (ODV) envelope, but not to the budded virus (BV) envelope []. Signals necessary for transport and/or retention into this structure are believed to be found within the C-terminal portion of ODV-E56.; GO: 0019031 viral envelope
Probab=34.03 E-value=52 Score=32.75 Aligned_cols=18 Identities=22% Similarity=0.109 Sum_probs=7.8
Q ss_pred HHHHHHHHhheeeEEEEe
Q psy6570 693 ILLLITVGGIGYYIFRIK 710 (713)
Q Consensus 693 ~~~~~~~~~~~~~~~~~~ 710 (713)
.+|+|+++.++++|+..+
T Consensus 286 ~vl~i~~Ig~~ifK~~~~ 303 (305)
T PF04639_consen 286 GVLLIVFIGYFIFKRLMN 303 (305)
T ss_pred HHHHHHHhhheeeEeecc
Confidence 333344444444444433
No 426
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=33.87 E-value=48 Score=26.11 Aligned_cols=15 Identities=13% Similarity=0.124 Sum_probs=7.4
Q ss_pred ccchhHHHHHHHHHH
Q psy6570 682 VNSHISSILILILLL 696 (713)
Q Consensus 682 ~~~~~~~~~~~~~~~ 696 (713)
+.+.+++++++=+++
T Consensus 32 s~g~LaGiV~~D~vl 46 (79)
T PF07213_consen 32 SPGLLAGIVAADAVL 46 (79)
T ss_pred CHHHHHHHHHHHHHH
Confidence 345556655553333
No 427
>PHA03240 envelope glycoprotein M; Provisional
Probab=33.66 E-value=15 Score=34.56 Aligned_cols=26 Identities=23% Similarity=0.354 Sum_probs=10.7
Q ss_pred cchhHHHHHHHHHHHHHHhheeeEEEE
Q psy6570 683 NSHISSILILILLLITVGGIGYYIFRI 709 (713)
Q Consensus 683 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 709 (713)
++..+.+++++||+ +|+++++++.-|
T Consensus 210 aaH~~WIiilIIiI-iIIIL~cfKiPQ 235 (258)
T PHA03240 210 AAHIAWIFIAIIII-IVIILFFFKIPQ 235 (258)
T ss_pred cchHhHHHHHHHHH-HHHHHHHHhccH
Confidence 34444444444443 333344444433
No 428
>KOG1524|consensus
Probab=33.22 E-value=4.5e+02 Score=28.99 Aligned_cols=41 Identities=17% Similarity=0.118 Sum_probs=25.0
Q ss_pred cCCceeEEEccCcc-cEEecCCCCCceEEEeccCCeEEEeec
Q psy6570 5 SSGNVTRVKREMNL-KTVLSNLHDPRGVAVDWVGKNLYWTDA 45 (713)
Q Consensus 5 ~~~~I~~~~~~~~~-~~~~~~~~~p~gla~D~~~~~ly~td~ 45 (713)
.+|.|.-....|.. .+++..-.....+++++..+.+.+...
T Consensus 124 EDG~iKiWSrsGMLRStl~Q~~~~v~c~~W~p~S~~vl~c~g 165 (737)
T KOG1524|consen 124 EDGVIKIWSRSGMLRSTVVQNEESIRCARWAPNSNSIVFCQG 165 (737)
T ss_pred CCceEEEEeccchHHHHHhhcCceeEEEEECCCCCceEEecC
Confidence 34444444555533 233445556678888888888777765
No 429
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=32.99 E-value=13 Score=30.06 Aligned_cols=18 Identities=28% Similarity=0.307 Sum_probs=7.7
Q ss_pred HHHHHHHHhheeeEEEEe
Q psy6570 693 ILLLITVGGIGYYIFRIK 710 (713)
Q Consensus 693 ~~~~~~~~~~~~~~~~~~ 710 (713)
++|||+|++++..+.|+|
T Consensus 52 iLilIii~Lv~CC~~K~K 69 (98)
T PF07204_consen 52 ILILIIIALVCCCRAKHK 69 (98)
T ss_pred hhHHHHHHHHHHhhhhhh
Confidence 333343444444444443
No 430
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=32.99 E-value=6.5e+02 Score=27.09 Aligned_cols=98 Identities=9% Similarity=0.043 Sum_probs=46.5
Q ss_pred CeEEEEecCCceE-EEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE--EEEeCCC----CCCeeEEE
Q psy6570 51 NNIMVSTLEGRKK-RTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKF--NLVDNNI----QWPTGITI 123 (713)
Q Consensus 51 ~~I~~~~~~G~~~-~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~--~l~~~~~----~~p~glav 123 (713)
+.+++...+|... +.+.........+++.++ ++.|+.+... +.|.+...+|..-+ .+..... ..+.++++
T Consensus 259 G~~~~s~d~G~~~W~~~~~~~~~~l~~v~~~~-dg~l~l~g~~--G~l~~S~d~G~~~~~~~f~~~~~~~~~~~l~~v~~ 335 (398)
T PLN00033 259 GNFYLTWEPGQPYWQPHNRASARRIQNMGWRA-DGGLWLLTRG--GGLYVSKGTGLTEEDFDFEEADIKSRGFGILDVGY 335 (398)
T ss_pred ccEEEecCCCCcceEEecCCCccceeeeeEcC-CCCEEEEeCC--ceEEEecCCCCcccccceeecccCCCCcceEEEEE
Confidence 4455555555432 222222234456777774 4555555432 25777776665321 1221111 12345555
Q ss_pred eCCCCeEEEEcCCCCcEEEEeCCCCceeEE
Q psy6570 124 DYPSQRLYWADPKARTIESINLNGKDRFVV 153 (713)
Q Consensus 124 d~~~~~LY~~d~~~~~I~~~~~~g~~~~~~ 153 (713)
. .++.+|.+ ...+.|++..-.|..-+..
T Consensus 336 ~-~d~~~~a~-G~~G~v~~s~D~G~tW~~~ 363 (398)
T PLN00033 336 R-SKKEAWAA-GGSGILLRSTDGGKSWKRD 363 (398)
T ss_pred c-CCCcEEEE-ECCCcEEEeCCCCcceeEc
Confidence 5 24455444 3455666665555543443
No 431
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=32.53 E-value=5.7e+02 Score=26.31 Aligned_cols=96 Identities=10% Similarity=0.079 Sum_probs=48.9
Q ss_pred CCCcEEEEccCC----CCeEEEEecCCCCcEE--EEeCCCCCCe-eEEEeCCCCeEEEEcC-----CCCcEEEEeCCCCc
Q psy6570 82 LSGRMFWTELGI----KPRISGASIDGKNKFN--LVDNNIQWPT-GITIDYPSQRLYWADP-----KARTIESINLNGKD 149 (713)
Q Consensus 82 ~~~~ly~td~~~----~~~I~~~~~dG~~~~~--l~~~~~~~p~-glavd~~~~~LY~~d~-----~~~~I~~~~~~g~~ 149 (713)
.++.||+.--.. ...+++.+++...-.. .....+..|. ..+.-..+++||+.-. ..+.++++++....
T Consensus 71 ~~~~lyviGG~~~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~~ 150 (323)
T TIGR03548 71 VENGIYYIGGSNSSERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRNGKPSNKSYLFNLETQE 150 (323)
T ss_pred ECCEEEEEcCCCCCCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCCCccCceEEEEcCCCCC
Confidence 367888874221 1367888876544211 1112232221 1122223689999743 23568888876554
Q ss_pred eeEEEecCCCCccceeeeeeCCeEEEEe
Q psy6570 150 RFVVYHTEDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 150 ~~~~~~~~~~~~~p~~i~~~~~~ly~td 177 (713)
-+.+............+.+.+++||+.-
T Consensus 151 W~~~~~~p~~~r~~~~~~~~~~~iYv~G 178 (323)
T TIGR03548 151 WFELPDFPGEPRVQPVCVKLQNELYVFG 178 (323)
T ss_pred eeECCCCCCCCCCcceEEEECCEEEEEc
Confidence 4444322111122234456688899874
No 432
>KOG0274|consensus
Probab=32.52 E-value=7.7e+02 Score=27.79 Aligned_cols=153 Identities=12% Similarity=0.021 Sum_probs=77.8
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
......+|++.. .+.++++-+ ...+|.+-+.....-..++.. ..-....++ ..+.+.++.+... .|.+-+..
T Consensus 248 H~g~V~~l~~~~-~~~~lvsgS---~D~t~rvWd~~sg~C~~~l~g--h~stv~~~~-~~~~~~~sgs~D~-tVkVW~v~ 319 (537)
T KOG0274|consen 248 HFGGVWGLAFPS-GGDKLVSGS---TDKTERVWDCSTGECTHSLQG--HTSSVRCLT-IDPFLLVSGSRDN-TVKVWDVT 319 (537)
T ss_pred CCCCceeEEEec-CCCEEEEEe---cCCcEEeEecCCCcEEEEecC--CCceEEEEE-ccCceEeeccCCc-eEEEEecc
Confidence 345578899886 355555665 566677777554443333321 122222333 3455666643434 67776776
Q ss_pred CCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCC-CCceeEEEecCCCCccceeeeeeC-CeEEEEeCCCC
Q psy6570 104 GKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLN-GKDRFVVYHTEDNGYKPYKLEVFE-DNLYFSTYRTN 181 (713)
Q Consensus 104 G~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~-g~~~~~~~~~~~~~~~p~~i~~~~-~~ly~td~~~~ 181 (713)
......++.......+.+.++ ++.|+..- ..+.|.+-++. +.-.+.+.. ....-.++.++. +++|=. ....
T Consensus 320 n~~~l~l~~~h~~~V~~v~~~--~~~lvsgs-~d~~v~VW~~~~~~cl~sl~g---H~~~V~sl~~~~~~~~~Sg-s~D~ 392 (537)
T KOG0274|consen 320 NGACLNLLRGHTGPVNCVQLD--EPLLVSGS-YDGTVKVWDPRTGKCLKSLSG---HTGRVYSLIVDSENRLLSG-SLDT 392 (537)
T ss_pred CcceEEEeccccccEEEEEec--CCEEEEEe-cCceEEEEEhhhceeeeeecC---CcceEEEEEecCcceEEee-eecc
Confidence 444444444333444566666 55555443 34465555554 222222221 233445666655 444443 3346
Q ss_pred cEEEEcccCC
Q psy6570 182 NILKINKFGN 191 (713)
Q Consensus 182 ~i~~~~~~~~ 191 (713)
.|...+..+.
T Consensus 393 ~IkvWdl~~~ 402 (537)
T KOG0274|consen 393 TIKVWDLRTK 402 (537)
T ss_pred ceEeecCCch
Confidence 6777776655
No 433
>KOG0305|consensus
Probab=32.34 E-value=7.3e+02 Score=27.45 Aligned_cols=117 Identities=11% Similarity=0.040 Sum_probs=68.5
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEE-ecCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGA-SIDG 104 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~-~~dG 104 (713)
...-||++.+.. .|.+-.+ ..+++.+.+.........+..-.....+|+..|-...|.-+..+...++.++ +...
T Consensus 302 qeVCgLkws~d~--~~lASGg--nDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~~lLAsGGGs~D~~i~fwn~~~ 377 (484)
T KOG0305|consen 302 QEVCGLKWSPDG--NQLASGG--NDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQSGLLATGGGSADRCIKFWNTNT 377 (484)
T ss_pred ceeeeeEECCCC--CeeccCC--CccceEeccCCCccccEEEeccceeeeEeeeCCCccCceEEcCCCcccEEEEEEcCC
Confidence 346778876543 3334332 4678888887555544444444577889999998888888876654344443 3333
Q ss_pred CCcEEEEeCCCCCCeeEEEeCCCCeEEEEcC-CCCcEEEEeCCC
Q psy6570 105 KNKFNLVDNNIQWPTGITIDYPSQRLYWADP-KARTIESINLNG 147 (713)
Q Consensus 105 ~~~~~l~~~~~~~p~glavd~~~~~LY~~d~-~~~~I~~~~~~g 147 (713)
..+...+. .-.+...|++.+..+.|..+-. ..+.|....+.-
T Consensus 378 g~~i~~vd-tgsQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~ps 420 (484)
T KOG0305|consen 378 GARIDSVD-TGSQVCSLIWSKKYKELLSTHGYSENQITLWKYPS 420 (484)
T ss_pred CcEecccc-cCCceeeEEEcCCCCEEEEecCCCCCcEEEEeccc
Confidence 22222222 2334577888877777777753 334444444433
No 434
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=32.11 E-value=5.8e+02 Score=26.26 Aligned_cols=55 Identities=16% Similarity=0.226 Sum_probs=20.7
Q ss_pred ceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEE
Q psy6570 29 RGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWT 89 (713)
Q Consensus 29 ~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~t 89 (713)
..+.+. ..+.||.|.-+. .+.+..+....|....... . ..=.-|+|. ..|.+|.+
T Consensus 116 ~~~l~~-~~G~iy~T~DgG-~tW~~~~~~~~gs~~~~~r-~--~dG~~vavs-~~G~~~~s 170 (302)
T PF14870_consen 116 SAELAG-DRGAIYRTTDGG-KTWQAVVSETSGSINDITR-S--SDGRYVAVS-SRGNFYSS 170 (302)
T ss_dssp EEEEEE-TT--EEEESSTT-SSEEEEE-S----EEEEEE----TTS-EEEEE-TTSSEEEE
T ss_pred cEEEEc-CCCcEEEeCCCC-CCeeEcccCCcceeEeEEE-C--CCCcEEEEE-CcccEEEE
Confidence 334343 346777776532 2233334444444333221 1 112245555 46666665
No 435
>KOG1310|consensus
Probab=31.82 E-value=2.8e+02 Score=30.56 Aligned_cols=113 Identities=13% Similarity=0.165 Sum_probs=62.7
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCC-CCCcceEEEcCCC-CcEEEEccCCCCeEEEEecCCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTG-LNEPYDIALEPLS-GRMFWTELGIKPRISGASIDGK 105 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~-~~~p~~iavD~~~-~~ly~td~~~~~~I~~~~~dG~ 105 (713)
...|++.. ++.|..+-+ ...+|.+-+.--.....++..+ ...+..+-+-|.. ++|.++..+.. .|..++++..
T Consensus 53 VN~LeWn~-dG~lL~SGS---DD~r~ivWd~~~~KllhsI~TgHtaNIFsvKFvP~tnnriv~sgAgDk-~i~lfdl~~~ 127 (758)
T KOG1310|consen 53 VNCLEWNA-DGELLASGS---DDTRLIVWDPFEYKLLHSISTGHTANIFSVKFVPYTNNRIVLSGAGDK-LIKLFDLDSS 127 (758)
T ss_pred ecceeecC-CCCEEeecC---CcceEEeecchhcceeeeeecccccceeEEeeeccCCCeEEEeccCcc-eEEEEecccc
Confidence 45555553 345555554 5667777776543333333333 4455565555554 45666666655 7888888742
Q ss_pred CcEE----------EEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeC
Q psy6570 106 NKFN----------LVDNNIQWPTGITIDYPSQRLYWADPKARTIESINL 145 (713)
Q Consensus 106 ~~~~----------l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~ 145 (713)
...- ...-.+....-||.-+..-..||+-+..+.|+..|+
T Consensus 128 ~~~~~d~~~~~~~~~~~cht~rVKria~~p~~PhtfwsasEDGtirQyDi 177 (758)
T KOG1310|consen 128 KEGGMDHGMEETTRCWSCHTDRVKRIATAPNGPHTFWSASEDGTIRQYDI 177 (758)
T ss_pred cccccccCccchhhhhhhhhhhhhheecCCCCCceEEEecCCcceeeecc
Confidence 2111 111123344567777666677777777777766654
No 436
>KOG0196|consensus
Probab=31.40 E-value=52 Score=37.90 Aligned_cols=9 Identities=33% Similarity=0.674 Sum_probs=6.8
Q ss_pred cccCCCCCc
Q psy6570 437 ECDCPKFYY 445 (713)
Q Consensus 437 ~C~C~~G~~ 445 (713)
.|.|.+||.
T Consensus 260 ~C~C~aGye 268 (996)
T KOG0196|consen 260 GCVCKAGYE 268 (996)
T ss_pred ceeecCCCC
Confidence 677877775
No 437
>KOG1445|consensus
Probab=30.84 E-value=6.2e+02 Score=28.62 Aligned_cols=74 Identities=12% Similarity=0.051 Sum_probs=37.0
Q ss_pred CCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce-EEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEec
Q psy6570 24 NLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK-KRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASI 102 (713)
Q Consensus 24 ~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~-~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~ 102 (713)
.+..+..|.|.+..-.+..+.+ ..-.|...|+.... ...+. .-..++.+||..|..++| -|- ...++|.+...
T Consensus 676 h~eKI~slRfHPLAadvLa~as---yd~Ti~lWDl~~~~~~~~l~-gHtdqIf~~AWSpdGr~~-AtV-cKDg~~rVy~P 749 (1012)
T KOG1445|consen 676 HGEKITSLRFHPLAADVLAVAS---YDSTIELWDLANAKLYSRLV-GHTDQIFGIAWSPDGRRI-ATV-CKDGTLRVYEP 749 (1012)
T ss_pred ccceEEEEEecchhhhHhhhhh---ccceeeeeehhhhhhhheec-cCcCceeEEEECCCCcce-eee-ecCceEEEeCC
Confidence 3444555555554433333333 34567776765432 32333 334678889988754443 222 12225665554
Q ss_pred C
Q psy6570 103 D 103 (713)
Q Consensus 103 d 103 (713)
.
T Consensus 750 r 750 (1012)
T KOG1445|consen 750 R 750 (1012)
T ss_pred C
Confidence 3
No 438
>KOG0973|consensus
Probab=30.76 E-value=1e+03 Score=28.63 Aligned_cols=66 Identities=9% Similarity=0.104 Sum_probs=42.0
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcE
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTI 140 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I 140 (713)
....+++.+| ++.++++-...+ +|...+...-.+..++......+.|+++||.++++ -+.+..+.|
T Consensus 130 ~DV~Dv~Wsp-~~~~lvS~s~Dn-sViiwn~~tF~~~~vl~~H~s~VKGvs~DP~Gky~-ASqsdDrti 195 (942)
T KOG0973|consen 130 SDVLDVNWSP-DDSLLVSVSLDN-SVIIWNAKTFELLKVLRGHQSLVKGVSWDPIGKYF-ASQSDDRTL 195 (942)
T ss_pred CccceeccCC-CccEEEEecccc-eEEEEccccceeeeeeecccccccceEECCccCee-eeecCCceE
Confidence 4566778887 666777665545 66666655445555666667789999999876544 333334433
No 439
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=30.64 E-value=22 Score=32.81 Aligned_cols=33 Identities=12% Similarity=0.189 Sum_probs=20.0
Q ss_pred ccccccchhHHHHHHHHHHHHHHhheeeEEEEe
Q psy6570 678 KQSYVNSHISSILILILLLITVGGIGYYIFRIK 710 (713)
Q Consensus 678 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 710 (713)
.......+|++||.+++++|+-++.-++.+.+|
T Consensus 109 ~~~~~~g~IaGIvsav~valvGAvsSyiaYqkK 141 (169)
T PF12301_consen 109 DGEAEAGTIAGIVSAVVVALVGAVSSYIAYQKK 141 (169)
T ss_pred ccCcccchhhhHHHHHHHHHHHHHHHHHHHHhh
Confidence 344556778888777777766565555444433
No 440
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=30.47 E-value=16 Score=32.75 Aligned_cols=27 Identities=11% Similarity=0.028 Sum_probs=14.3
Q ss_pred hhHHHHHHHHH-HHHHHhheeeEEEEec
Q psy6570 685 HISSILILILL-LITVGGIGYYIFRIKM 711 (713)
Q Consensus 685 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 711 (713)
+++++++++-+ ||+++++++|.+.+|.
T Consensus 50 IVIGvVVGVGg~ill~il~lvf~~c~r~ 77 (154)
T PF04478_consen 50 IVIGVVVGVGGPILLGILALVFIFCIRR 77 (154)
T ss_pred EEEEEEecccHHHHHHHHHhheeEEEec
Confidence 45565555444 4555566665554433
No 441
>KOG0284|consensus
Probab=30.08 E-value=4.2e+02 Score=28.17 Aligned_cols=91 Identities=8% Similarity=-0.060 Sum_probs=50.4
Q ss_pred CeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeE
Q psy6570 51 NNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRL 130 (713)
Q Consensus 51 ~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~L 130 (713)
+-|-+.+++=..++.+-...-...++||+.| +...|.|-+..+ +|..-+.--.....++...-.-+..+...|..+.|
T Consensus 160 G~iKyWqpnmnnVk~~~ahh~eaIRdlafSp-nDskF~t~SdDg-~ikiWdf~~~kee~vL~GHgwdVksvdWHP~kgLi 237 (464)
T KOG0284|consen 160 GMIKYWQPNMNNVKIIQAHHAEAIRDLAFSP-NDSKFLTCSDDG-TIKIWDFRMPKEERVLRGHGWDVKSVDWHPTKGLI 237 (464)
T ss_pred ceEEecccchhhhHHhhHhhhhhhheeccCC-CCceeEEecCCC-eEEEEeccCCchhheeccCCCCcceeccCCcccee
Confidence 3344444443333333222235788999998 778888766544 55544443333333334445567888888877766
Q ss_pred EEEcCCCCcEEEEe
Q psy6570 131 YWADPKARTIESIN 144 (713)
Q Consensus 131 Y~~d~~~~~I~~~~ 144 (713)
+..-..+ -|...|
T Consensus 238 asgskDn-lVKlWD 250 (464)
T KOG0284|consen 238 ASGSKDN-LVKLWD 250 (464)
T ss_pred EEccCCc-eeEeec
Confidence 6554333 444434
No 442
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=29.54 E-value=3.2e+02 Score=30.62 Aligned_cols=59 Identities=14% Similarity=0.134 Sum_probs=44.8
Q ss_pred cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCce---EEEEEcCCCCCcceEEEc
Q psy6570 19 KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRK---KRTLLNTGLNEPYDIALE 80 (713)
Q Consensus 19 ~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~---~~~l~~~~~~~p~~iavD 80 (713)
++-+.++--|.=||||+..+.+-|+.. .-+.|+++.+.-.. ...|--+.-.+|.||.+-
T Consensus 332 KV~IPGILvPDliAfn~kaq~VAVASN---Tcn~ilVYSv~~s~mPniQqIqLe~~ERPKGiCFl 393 (671)
T PF15390_consen 332 KVSIPGILVPDLIAFNPKAQVVAVASN---TCNIILVYSVTPSSMPNIQQIQLESNERPKGICFL 393 (671)
T ss_pred eeccccccccceeeeCCcCCEEEEEec---CCcEEEEEEeccccCCCeeEEEcccCCCCceeeEc
Confidence 334788999999999999999988877 67889988876433 333333456899999985
No 443
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=29.47 E-value=18 Score=39.31 Aligned_cols=9 Identities=11% Similarity=-0.241 Sum_probs=2.9
Q ss_pred ccccEEEEe
Q psy6570 203 RASDVLILQ 211 (713)
Q Consensus 203 ~~~~i~v~~ 211 (713)
.++.+.+.+
T Consensus 179 ~~f~~~i~W 187 (439)
T PF02480_consen 179 PPFSLEIDW 187 (439)
T ss_dssp --EEEEEEE
T ss_pred CCeeEEEEE
Confidence 344444433
No 444
>KOG0307|consensus
Probab=29.40 E-value=2e+02 Score=34.41 Aligned_cols=166 Identities=10% Similarity=0.021 Sum_probs=76.3
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCC---cceEEEcCCCCcEEEEccCCCCeEEEEec
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNE---PYDIALEPLSGRMFWTELGIKPRISGASI 102 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~---p~~iavD~~~~~ly~td~~~~~~I~~~~~ 102 (713)
....||.|.+..++|.-+.. ..+.|++-|++-... ......... +.-|+......+|+-+-..+. ++..-++
T Consensus 117 G~V~gLDfN~~q~nlLASGa---~~geI~iWDlnn~~t-P~~~~~~~~~~eI~~lsWNrkvqhILAS~s~sg-~~~iWDl 191 (1049)
T KOG0307|consen 117 GPVLGLDFNPFQGNLLASGA---DDGEILIWDLNKPET-PFTPGSQAPPSEIKCLSWNRKVSHILASGSPSG-RAVIWDL 191 (1049)
T ss_pred CceeeeeccccCCceeeccC---CCCcEEEeccCCcCC-CCCCCCCCCcccceEeccchhhhHHhhccCCCC-Cceeccc
Confidence 44678888888777765555 678899988875221 111111222 223333322223333332222 3333344
Q ss_pred CCCCcEEEEeC--CCCCCeeEEEeCCCCeEEEEcCCCC---cEEEEeC--CCCceeEEEecCCCCccceeeeee--CCeE
Q psy6570 103 DGKNKFNLVDN--NIQWPTGITIDYPSQRLYWADPKAR---TIESINL--NGKDRFVVYHTEDNGYKPYKLEVF--EDNL 173 (713)
Q Consensus 103 dG~~~~~l~~~--~~~~p~glavd~~~~~LY~~d~~~~---~I~~~~~--~g~~~~~~~~~~~~~~~p~~i~~~--~~~l 173 (713)
..+...+-+.. .-..-.+|+++|....-.|+-++.. .|..-|+ ..+-.+++ .. .....++|++- +.+|
T Consensus 192 r~~~pii~ls~~~~~~~~S~l~WhP~~aTql~~As~dd~~PviqlWDlR~assP~k~~-~~--H~~GilslsWc~~D~~l 268 (1049)
T KOG0307|consen 192 RKKKPIIKLSDTPGRMHCSVLAWHPDHATQLLVASGDDSAPVIQLWDLRFASSPLKIL-EG--HQRGILSLSWCPQDPRL 268 (1049)
T ss_pred cCCCcccccccCCCccceeeeeeCCCCceeeeeecCCCCCceeEeecccccCCchhhh-cc--cccceeeeccCCCCchh
Confidence 33311111111 1122457888887654333332222 3333332 22222333 11 12233455542 3366
Q ss_pred EEEeCCCCcEEEEcccCCCcceeeec
Q psy6570 174 YFSTYRTNNILKINKFGNSDFNVLAN 199 (713)
Q Consensus 174 y~td~~~~~i~~~~~~~~~~~~~~~~ 199 (713)
.++....++|+..+..++..+..+..
T Consensus 269 llSsgkD~~ii~wN~~tgEvl~~~p~ 294 (1049)
T KOG0307|consen 269 LLSSGKDNRIICWNPNTGEVLGELPA 294 (1049)
T ss_pred hhcccCCCCeeEecCCCceEeeecCC
Confidence 66666677777776655544444433
No 445
>PLN02153 epithiospecifier protein
Probab=29.35 E-value=6.7e+02 Score=26.08 Aligned_cols=166 Identities=13% Similarity=0.053 Sum_probs=78.5
Q ss_pred CceeEEEccCcccEEecCC-CCCc----eEEEeccCCeEEEeecCC--CCCCeEEEEecCCceEEEEEcC-C--CCCcc-
Q psy6570 7 GNVTRVKREMNLKTVLSNL-HDPR----GVAVDWVGKNLYWTDAGG--RSSNNIMVSTLEGRKKRTLLNT-G--LNEPY- 75 (713)
Q Consensus 7 ~~I~~~~~~~~~~~~~~~~-~~p~----gla~D~~~~~ly~td~~~--~~~~~I~~~~~~G~~~~~l~~~-~--~~~p~- 75 (713)
..++++++..+.-..+..+ ..|+ +.++-..+++||+.-... .....++++++....=+.+... . ...|+
T Consensus 50 ~~~~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~ 129 (341)
T PLN02153 50 KDLYVFDFNTHTWSIAPANGDVPRISCLGVRMVAVGTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEART 129 (341)
T ss_pred CcEEEEECCCCEEEEcCccCCCCCCccCceEEEEECCEEEEECCCCCCCccCcEEEEECCCCEEEEeccCCCCCCCCCce
Confidence 4677888776543222222 1222 444444578899873311 1234688888765432222110 0 01222
Q ss_pred --eEEEcCCCCcEEEEccCCC----------CeEEEEecCCCCcEEEEeCC-CCCC---eeEEEeCCCCeEEEEcC----
Q psy6570 76 --DIALEPLSGRMFWTELGIK----------PRISGASIDGKNKFNLVDNN-IQWP---TGITIDYPSQRLYWADP---- 135 (713)
Q Consensus 76 --~iavD~~~~~ly~td~~~~----------~~I~~~~~dG~~~~~l~~~~-~~~p---~glavd~~~~~LY~~d~---- 135 (713)
.+++ .+++||+.--... ..|++.++....=+.+.... ...| .++++ .+++||+.-.
T Consensus 130 ~~~~~~--~~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~~~W~~l~~~~~~~~~r~~~~~~~--~~~~iyv~GG~~~~ 205 (341)
T PLN02153 130 FHSMAS--DENHVYVFGGVSKGGLMKTPERFRTIEAYNIADGKWVQLPDPGENFEKRGGAGFAV--VQGKIWVVYGFATS 205 (341)
T ss_pred eeEEEE--ECCEEEEECCccCCCccCCCcccceEEEEECCCCeEeeCCCCCCCCCCCCcceEEE--ECCeEEEEeccccc
Confidence 2344 4678888642210 13555665432211111111 0011 12333 3678888521
Q ss_pred ---------CCCcEEEEeCCCCceeEEEecCCCCcc---ceeeeeeCCeEEEEe
Q psy6570 136 ---------KARTIESINLNGKDRFVVYHTEDNGYK---PYKLEVFEDNLYFST 177 (713)
Q Consensus 136 ---------~~~~I~~~~~~g~~~~~~~~~~~~~~~---p~~i~~~~~~ly~td 177 (713)
..+.|+++++....-+.+..... .+. ..+..+.+++||+.-
T Consensus 206 ~~~gG~~~~~~~~v~~yd~~~~~W~~~~~~g~-~P~~r~~~~~~~~~~~iyv~G 258 (341)
T PLN02153 206 ILPGGKSDYESNAVQFFDPASGKWTEVETTGA-KPSARSVFAHAVVGKYIIIFG 258 (341)
T ss_pred cccCCccceecCceEEEEcCCCcEEeccccCC-CCCCcceeeeEEECCEEEEEC
Confidence 13568888877655444432211 122 245556688888863
No 446
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=29.20 E-value=1e+03 Score=28.20 Aligned_cols=15 Identities=13% Similarity=0.080 Sum_probs=12.7
Q ss_pred eeEEEeCCCCeEEEE
Q psy6570 119 TGITIDYPSQRLYWA 133 (713)
Q Consensus 119 ~glavd~~~~~LY~~ 133 (713)
..+++|+..+.|||.
T Consensus 378 ~~~s~D~~~glvy~p 392 (764)
T TIGR03074 378 SVASYDEKLGLVYLP 392 (764)
T ss_pred CceEEcCCCCeEEEe
Confidence 467999999999995
No 447
>KOG3653|consensus
Probab=29.16 E-value=58 Score=35.13 Aligned_cols=6 Identities=33% Similarity=1.121 Sum_probs=3.2
Q ss_pred ceeeCC
Q psy6570 655 PICICP 660 (713)
Q Consensus 655 ~~C~C~ 660 (713)
|.|-|.
T Consensus 116 ~~CcCs 121 (534)
T KOG3653|consen 116 YFCCCS 121 (534)
T ss_pred EEEecC
Confidence 556553
No 448
>PF13131 DUF3951: Protein of unknown function (DUF3951)
Probab=29.16 E-value=42 Score=23.74 Aligned_cols=8 Identities=13% Similarity=0.517 Sum_probs=3.0
Q ss_pred HHHHHHHH
Q psy6570 687 SSILILIL 694 (713)
Q Consensus 687 ~~~~~~~~ 694 (713)
++++++.+
T Consensus 6 iG~~~~~~ 13 (53)
T PF13131_consen 6 IGIILFTI 13 (53)
T ss_pred HHHHHHHH
Confidence 34333333
No 449
>KOG0641|consensus
Probab=28.91 E-value=5.4e+02 Score=24.87 Aligned_cols=74 Identities=15% Similarity=0.119 Sum_probs=41.9
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK 148 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~ 148 (713)
.....++||| .|+|..+..... .-...++.|.....-+........-+-|.|...+| .+-+...+|...++.|.
T Consensus 232 savaav~vdp-sgrll~sg~~ds-sc~lydirg~r~iq~f~phsadir~vrfsp~a~yl-lt~syd~~ikltdlqgd 305 (350)
T KOG0641|consen 232 SAVAAVAVDP-SGRLLASGHADS-SCMLYDIRGGRMIQRFHPHSADIRCVRFSPGAHYL-LTCSYDMKIKLTDLQGD 305 (350)
T ss_pred ceeEEEEECC-CcceeeeccCCC-ceEEEEeeCCceeeeeCCCccceeEEEeCCCceEE-EEecccceEEEeecccc
Confidence 3456789997 567777764433 55666666655433333233334456666543333 34455667777777664
No 450
>KOG0269|consensus
Probab=28.16 E-value=5.2e+02 Score=29.90 Aligned_cols=52 Identities=10% Similarity=0.134 Sum_probs=33.8
Q ss_pred eEEEEecCCCCcEEE---EeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 96 RISGASIDGKNKFNL---VDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 96 ~I~~~~~dG~~~~~l---~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
.|..-++.-..+..+ +...-+..+-|.|.+.+-.|.++-+..+.|-.+|+.-
T Consensus 111 ~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSGSQDg~vK~~DlR~ 165 (839)
T KOG0269|consen 111 VISVWDLNKSIRNKLLTVFNEHERSANKLDFHSTEPNILISGSQDGTVKCWDLRS 165 (839)
T ss_pred cEEEEecCccccchhhhHhhhhccceeeeeeccCCccEEEecCCCceEEEEeeec
Confidence 455555554333332 2334455677888888888888888778777777644
No 451
>COG5437 Predicted secreted protein [Function unknown]
Probab=27.85 E-value=1.4e+02 Score=25.66 Aligned_cols=59 Identities=17% Similarity=0.204 Sum_probs=34.3
Q ss_pred cCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcE-EEEeCCCCCCeeEE
Q psy6570 58 LEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKF-NLVDNNIQWPTGIT 122 (713)
Q Consensus 58 ~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~-~l~~~~~~~p~gla 122 (713)
..|+.++.|.. ...+.+++| |.+||.|...+..|.++..|++.+- .++...+....|++
T Consensus 46 t~GrWr~llaG---~gv~si~~~---gsg~f~Da~Sda~lr~~Ffd~s~~~wkvv~pdfg~~~G~f 105 (138)
T COG5437 46 TAGRWRELLAG---MGVWSIAND---GSGYFADAASDALLRKAFFDDSIVCWKVVNPDFGLFGGPF 105 (138)
T ss_pred ccccHHHHhcc---cceEEEeec---CcEEEecccchHHHHHHhccCCceEEEEEccCcccccCcE
Confidence 34555655432 335566664 6799999877756777777877653 23333344333333
No 452
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=27.36 E-value=6.9e+02 Score=25.63 Aligned_cols=36 Identities=14% Similarity=0.162 Sum_probs=22.3
Q ss_pred CcceEEEcCCCCcEEEEccCCCCeEEEEe-cCCCCcEEE
Q psy6570 73 EPYDIALEPLSGRMFWTELGIKPRISGAS-IDGKNKFNL 110 (713)
Q Consensus 73 ~p~~iavD~~~~~ly~td~~~~~~I~~~~-~dG~~~~~l 110 (713)
..++|..+. .|.+.++-.... .|++++ .+|+.+-.+
T Consensus 145 HiNsV~~~~-~G~yLiS~R~~~-~i~~I~~~tG~I~W~l 181 (299)
T PF14269_consen 145 HINSVDKDD-DGDYLISSRNTS-TIYKIDPSTGKIIWRL 181 (299)
T ss_pred EeeeeeecC-CccEEEEecccC-EEEEEECCCCcEEEEe
Confidence 455777774 556667776666 777777 445444444
No 453
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=27.18 E-value=21 Score=35.35 Aligned_cols=19 Identities=42% Similarity=0.756 Sum_probs=0.0
Q ss_pred HHHHHHHHhheeeEEEEec
Q psy6570 693 ILLLITVGGIGYYIFRIKM 711 (713)
Q Consensus 693 ~~~~~~~~~~~~~~~~~~~ 711 (713)
|+||+.|+.++|||+|+|.
T Consensus 236 ILVLLaVGGLLfYr~rrRs 254 (285)
T PF05337_consen 236 ILVLLAVGGLLFYRRRRRS 254 (285)
T ss_dssp -------------------
T ss_pred hhhhhhccceeeecccccc
Confidence 3334455557777777653
No 454
>KOG1517|consensus
Probab=26.35 E-value=1.2e+03 Score=28.34 Aligned_cols=123 Identities=12% Similarity=0.104 Sum_probs=69.7
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEE---cCCCCC--cceEEEcCCCCcE-EEEccCCCCeEE
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLL---NTGLNE--PYDIALEPLSGRM-FWTELGIKPRIS 98 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~---~~~~~~--p~~iavD~~~~~l-y~td~~~~~~I~ 98 (713)
-.-+..|.-|-.++.++++-. ..+.|.++|..-...+.++ +..... ...+.+-+ .|+- .|+.. .++.|+
T Consensus 1208 ~t~vTaLS~~~~~gn~i~AGf---aDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~-~G~~elvSgs-~~G~I~ 1282 (1387)
T KOG1517|consen 1208 STLVTALSADLVHGNIIAAGF---ADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQR-QGLGELVSGS-QDGDIQ 1282 (1387)
T ss_pred CccceeecccccCCceEEEee---cCCceEEeecccCCccccceeecccCCcccceeEEeec-CCCcceeeec-cCCeEE
Confidence 344777887877778888877 6778888776544332222 111112 34555543 2332 33333 334788
Q ss_pred EEecCCCCcEEEEeCCCCC-----CeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCceeEEE
Q psy6570 99 GASIDGKNKFNLVDNNIQW-----PTGITIDYPSQRLYWADPKARTIESINLNGKDRFVVY 154 (713)
Q Consensus 99 ~~~~dG~~~~~l~~~~~~~-----p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~~~~ 154 (713)
..++....+.+.+.....+ -+.|++.. ...|+-+-.. ..|..++++|.....+.
T Consensus 1283 ~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~-hapiiAsGs~-q~ikIy~~~G~~l~~~k 1341 (1387)
T KOG1517|consen 1283 LLDLRMSSKETFLTIVAHWEYGSALTALTVHE-HAPIIASGSA-QLIKIYSLSGEQLNIIK 1341 (1387)
T ss_pred EEecccCcccccceeeeccccCccceeeeecc-CCCeeeecCc-ceEEEEecChhhhcccc
Confidence 8887775444443322211 35677774 3455544444 77888888887666554
No 455
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=26.16 E-value=21 Score=36.46 Aligned_cols=25 Identities=16% Similarity=0.252 Sum_probs=13.7
Q ss_pred hhHHHHHHHHHHHHHHhheeeEEEE
Q psy6570 685 HISSILILILLLITVGGIGYYIFRI 709 (713)
Q Consensus 685 ~~~~~~~~~~~~~~~~~~~~~~~~~ 709 (713)
+++.+++++++||+.+++.++|+++
T Consensus 262 iiaIliIVLIMvIIYLILRYRRKKK 286 (299)
T PF02009_consen 262 IIAILIIVLIMVIIYLILRYRRKKK 286 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 4445555555555555555666443
No 456
>PF06084 Cytomega_TRL10: Cytomegalovirus TRL10 protein; InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=25.88 E-value=59 Score=27.20 Aligned_cols=7 Identities=29% Similarity=0.819 Sum_probs=3.4
Q ss_pred eeecCCC
Q psy6570 614 VCTCVNG 620 (713)
Q Consensus 614 ~C~C~~G 620 (713)
.|.|.+.
T Consensus 21 ~ckc~~~ 27 (150)
T PF06084_consen 21 TCKCSPW 27 (150)
T ss_pred EEecCCC
Confidence 4555443
No 457
>KOG0645|consensus
Probab=25.82 E-value=6.9e+02 Score=25.12 Aligned_cols=160 Identities=10% Similarity=0.030 Sum_probs=80.7
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCC-ce--EEEEEcCC-CCCcceEEEcCCCCcEEEEccCCCCeEEEE
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEG-RK--KRTLLNTG-LNEPYDIALEPLSGRMFWTELGIKPRISGA 100 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G-~~--~~~l~~~~-~~~p~~iavD~~~~~ly~td~~~~~~I~~~ 100 (713)
..+...+|+.+..+.|+-+-. ....|.+.++.+ .. .+.++..+ -...+.+|..|.+++|-.+.......|+.
T Consensus 14 ~~r~W~~awhp~~g~ilAscg---~Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~t~~Iw~- 89 (312)
T KOG0645|consen 14 KDRVWSVAWHPGKGVILASCG---TDKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDATVVIWK- 89 (312)
T ss_pred CCcEEEEEeccCCceEEEeec---CCceEEEEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeeccceEEEee-
Confidence 335778899887677776766 567777777764 22 22333222 35678899998776555544433312332
Q ss_pred ecCCCCcEE-EEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCC-ceeEEEecCCCCccceeeeee-CCeEEEEe
Q psy6570 101 SIDGKNKFN-LVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGK-DRFVVYHTEDNGYKPYKLEVF-EDNLYFST 177 (713)
Q Consensus 101 ~~dG~~~~~-l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~-~~~~~~~~~~~~~~p~~i~~~-~~~ly~td 177 (713)
+.|+....+ .+...-+--..+|+...+..| -+-+....|+.-..+.. ....+.......+....+..+ -..|+++-
T Consensus 90 k~~~efecv~~lEGHEnEVK~Vaws~sG~~L-ATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~ 168 (312)
T KOG0645|consen 90 KEDGEFECVATLEGHENEVKCVAWSASGNYL-ATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSC 168 (312)
T ss_pred cCCCceeEEeeeeccccceeEEEEcCCCCEE-EEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCCcceeEEe
Confidence 223333322 222333455788998554444 34445555555444422 222221111112222233333 12355555
Q ss_pred CCCCcEEEEccc
Q psy6570 178 YRTNNILKINKF 189 (713)
Q Consensus 178 ~~~~~i~~~~~~ 189 (713)
.+.+.|..+..+
T Consensus 169 SYDnTIk~~~~~ 180 (312)
T KOG0645|consen 169 SYDNTIKVYRDE 180 (312)
T ss_pred ccCCeEEEEeec
Confidence 566666655544
No 458
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=25.65 E-value=41 Score=26.88 Aligned_cols=23 Identities=13% Similarity=0.088 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHhheeeEEEEec
Q psy6570 689 ILILILLLITVGGIGYYIFRIKM 711 (713)
Q Consensus 689 ~~~~~~~~~~~~~~~~~~~~~~~ 711 (713)
++.++.|++++.-+..+++|+|-
T Consensus 39 vI~~iFil~VilwfvCC~kRkrs 61 (94)
T PF05393_consen 39 VICGIFILLVILWFVCCKKRKRS 61 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhc
Confidence 33333333433434445555543
No 459
>PHA03164 hypothetical protein; Provisional
Probab=25.30 E-value=78 Score=24.43 Aligned_cols=11 Identities=27% Similarity=0.591 Sum_probs=4.1
Q ss_pred HHHHHHHHHHH
Q psy6570 688 SILILILLLIT 698 (713)
Q Consensus 688 ~~~~~~~~~~~ 698 (713)
+++++++|.|+
T Consensus 65 gLaIamILfii 75 (88)
T PHA03164 65 GLAIAMILFII 75 (88)
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 460
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=24.74 E-value=5.2e+02 Score=23.33 Aligned_cols=72 Identities=18% Similarity=0.175 Sum_probs=40.9
Q ss_pred CCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEE-EEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecCC
Q psy6570 26 HDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKR-TLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASIDG 104 (713)
Q Consensus 26 ~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~-~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG 104 (713)
..|.+|++-. -+++..-. ..-+.+..++++.+. ..+.....+..||+.|+..+ -||.=+.+ .|+++.+..
T Consensus 34 ~~p~si~lT~--~H~llL~~----~~l~~vn~L~~~vV~e~~~~~~~~~~~gl~~D~~~~-t~W~ys~~--~I~ei~i~~ 104 (147)
T PF05131_consen 34 SPPLSIALTE--FHLLLLYS----DRLIAVNRLNNKVVFEESLLETGGKILGLCRDPSSN-TFWLYSSN--SIFEIVINN 104 (147)
T ss_pred CCcceEEeec--eeeeEEeC----CEEEEEEecCCcEEEEEEeccCCcceeeEEEcCCCC-eEEEEeCC--eeEEEEcCc
Confidence 3488998852 24443332 233444456776532 22234567899999998655 55654332 477766654
Q ss_pred CC
Q psy6570 105 KN 106 (713)
Q Consensus 105 ~~ 106 (713)
..
T Consensus 105 E~ 106 (147)
T PF05131_consen 105 ED 106 (147)
T ss_pred ch
Confidence 43
No 461
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=24.47 E-value=87 Score=25.92 Aligned_cols=17 Identities=6% Similarity=0.137 Sum_probs=11.1
Q ss_pred ccccchhHHHHHHHHHH
Q psy6570 680 SYVNSHISSILILILLL 696 (713)
Q Consensus 680 ~~~~~~~~~~~~~~~~~ 696 (713)
..+|.+++++++++|++
T Consensus 14 g~sW~~LVGVv~~al~~ 30 (102)
T PF15176_consen 14 GRSWPFLVGVVVTALVT 30 (102)
T ss_pred CcccHhHHHHHHHHHHH
Confidence 55666777766666655
No 462
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=24.17 E-value=2.9e+02 Score=24.04 Aligned_cols=55 Identities=18% Similarity=0.243 Sum_probs=33.5
Q ss_pred eEEEeCCCCeEEEEcC----CCCcEEEEeCCCCceeEEEec--CCCCccceeeeeeCCeEEEEe
Q psy6570 120 GITIDYPSQRLYWADP----KARTIESINLNGKDRFVVYHT--EDNGYKPYKLEVFEDNLYFST 177 (713)
Q Consensus 120 glavd~~~~~LY~~d~----~~~~I~~~~~~g~~~~~~~~~--~~~~~~p~~i~~~~~~ly~td 177 (713)
||.+| |-|||.-. ....|.++|+.....+.+... .........|...+++|-...
T Consensus 1 gicin---Gvly~~a~~~~~~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~ 61 (129)
T PF08268_consen 1 GICIN---GVLYWLAWSEDSDNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVS 61 (129)
T ss_pred CEEEC---cEEEeEEEECCCCCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEE
Confidence 34554 78888743 467799999988766555442 122334455666677766554
No 463
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=24.08 E-value=85 Score=22.28 Aligned_cols=23 Identities=13% Similarity=0.238 Sum_probs=19.2
Q ss_pred CCCCCeeEEEeCCCCeEEEEcCC
Q psy6570 114 NIQWPTGITIDYPSQRLYWADPK 136 (713)
Q Consensus 114 ~~~~p~glavd~~~~~LY~~d~~ 136 (713)
....|.||.||+..+.|.|+-..
T Consensus 9 ~~~LP~gLs~d~~tG~isGtp~~ 31 (49)
T PF05345_consen 9 GGGLPSGLSLDPSTGTISGTPTS 31 (49)
T ss_pred CCCCCCcEEEeCCCCEEEeecCC
Confidence 45679999999999999998543
No 464
>KOG4532|consensus
Probab=23.68 E-value=7.6e+02 Score=24.84 Aligned_cols=32 Identities=9% Similarity=-0.045 Sum_probs=19.6
Q ss_pred cceEEEcCCCCcEEEEccCCCCeEEEEecCCCCc
Q psy6570 74 PYDIALEPLSGRMFWTELGIKPRISGASIDGKNK 107 (713)
Q Consensus 74 p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~ 107 (713)
-+.+++++...++-... ..++|.+..+|....
T Consensus 161 ~ns~~~snd~~~~~~Vg--ds~~Vf~y~id~~se 192 (344)
T KOG4532|consen 161 QNSLHYSNDPSWGSSVG--DSRRVFRYAIDDESE 192 (344)
T ss_pred eeeeEEcCCCceEEEec--CCCcceEEEeCCccc
Confidence 56788887655554433 223788887775543
No 465
>KOG0274|consensus
Probab=23.48 E-value=1.1e+03 Score=26.59 Aligned_cols=141 Identities=15% Similarity=0.051 Sum_probs=70.7
Q ss_pred CCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC-CCCcEEEEeCCCCCCeeEEEeCCC
Q psy6570 49 SSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID-GKNKFNLVDNNIQWPTGITIDYPS 127 (713)
Q Consensus 49 ~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d-G~~~~~l~~~~~~~p~glavd~~~ 127 (713)
....|.+.+.+......+...-....+.|.++ +..||... ... .|.+-+.. ++-..++ .....+-..|+++..
T Consensus 309 ~D~tVkVW~v~n~~~l~l~~~h~~~V~~v~~~--~~~lvsgs-~d~-~v~VW~~~~~~cl~sl-~gH~~~V~sl~~~~~- 382 (537)
T KOG0274|consen 309 RDNTVKVWDVTNGACLNLLRGHTGPVNCVQLD--EPLLVSGS-YDG-TVKVWDPRTGKCLKSL-SGHTGRVYSLIVDSE- 382 (537)
T ss_pred CCceEEEEeccCcceEEEeccccccEEEEEec--CCEEEEEe-cCc-eEEEEEhhhceeeeee-cCCcceEEEEEecCc-
Confidence 45667777776444333333334566777776 34444332 222 34443333 2222222 334556778888843
Q ss_pred CeEEEEcCCCCcEEEEeCCCC-ceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcceeeec
Q psy6570 128 QRLYWADPKARTIESINLNGK-DRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDFNVLAN 199 (713)
Q Consensus 128 ~~LY~~d~~~~~I~~~~~~g~-~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~~~~ 199 (713)
.++|-... ...|..-++... .....+.... .-..++... +.++++....+.|...+...+..+.++..
T Consensus 383 ~~~~Sgs~-D~~IkvWdl~~~~~c~~tl~~h~--~~v~~l~~~-~~~Lvs~~aD~~Ik~WD~~~~~~~~~~~~ 451 (537)
T KOG0274|consen 383 NRLLSGSL-DTTIKVWDLRTKRKCIHTLQGHT--SLVSSLLLR-DNFLVSSSADGTIKLWDAEEGECLRTLEG 451 (537)
T ss_pred ceEEeeee-ccceEeecCCchhhhhhhhcCCc--ccccccccc-cceeEeccccccEEEeecccCceeeeecc
Confidence 55554443 366777777766 3222222211 111333333 34555555566677666655555555444
No 466
>CHL00114 psbX photosystem II protein X; Reviewed
Probab=22.85 E-value=1.3e+02 Score=20.17 Aligned_cols=12 Identities=25% Similarity=0.523 Sum_probs=4.9
Q ss_pred hhHHHHHHHHHH
Q psy6570 685 HISSILILILLL 696 (713)
Q Consensus 685 ~~~~~~~~~~~~ 696 (713)
++..++.+.+++
T Consensus 8 F~~SL~~Ga~iv 19 (39)
T CHL00114 8 FINSLLLGAIIV 19 (39)
T ss_pred HHHHHHHHHHHh
Confidence 333444444443
No 467
>KOG1272|consensus
Probab=22.79 E-value=3.6e+02 Score=29.17 Aligned_cols=30 Identities=17% Similarity=0.169 Sum_probs=17.4
Q ss_pred CCeeEEEeCCCCeEEEEcCCCCcEEEEeCCC
Q psy6570 117 WPTGITIDYPSQRLYWADPKARTIESINLNG 147 (713)
Q Consensus 117 ~p~glavd~~~~~LY~~d~~~~~I~~~~~~g 147 (713)
.-.+||||+ +++...+....++|...|+-.
T Consensus 295 ~V~siAv~~-~G~YMaTtG~Dr~~kIWDlR~ 324 (545)
T KOG1272|consen 295 PVSSIAVDR-GGRYMATTGLDRKVKIWDLRN 324 (545)
T ss_pred CcceEEECC-CCcEEeecccccceeEeeecc
Confidence 447899995 455444544445555555544
No 468
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=22.33 E-value=60 Score=28.70 Aligned_cols=24 Identities=4% Similarity=0.122 Sum_probs=15.0
Q ss_pred ccccccchhHHHHHHHHHHHHHHh
Q psy6570 678 KQSYVNSHISSILILILLLITVGG 701 (713)
Q Consensus 678 ~~~~~~~~~~~~~~~~~~~~~~~~ 701 (713)
+...++++++++|+++.+++++++
T Consensus 59 sgtAIaGIVfgiVfimgvva~i~i 82 (155)
T PF10873_consen 59 SGTAIAGIVFGIVFIMGVVAGIAI 82 (155)
T ss_pred ccceeeeeehhhHHHHHHHHHHHH
Confidence 345566777777777666644444
No 469
>PF02404 SCF: Stem cell factor; InterPro: IPR003452 Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. SCF stimulates the proliferation of mast cells and is able to augment the proliferation of both myeloid and lymphoid hematopoietic progenitors in bone marrow culture. It also mediates cell-cell adhesion and acts synergistically with other cytokines. SCF is a type I membrane protein, but is also found in a secretable, soluble form. The crystal structure of human SCF has been resolved and a potential receptor-binding site identified [].; GO: 0005173 stem cell factor receptor binding, 0007155 cell adhesion, 0016020 membrane; PDB: 1EXZ_A 1SCF_D 2E9W_C 2O26_A 2O27_A.
Probab=22.13 E-value=30 Score=33.90 Aligned_cols=30 Identities=17% Similarity=0.070 Sum_probs=0.0
Q ss_pred cchhHHHHHHHHHHHHHHhheeeEEEEecC
Q psy6570 683 NSHISSILILILLLITVGGIGYYIFRIKMS 712 (713)
Q Consensus 683 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 712 (713)
+..++.+.++.||+-+++.+++|+++++.+
T Consensus 215 ~~~iAL~sl~SLVIGFvlG~l~WKkkq~~~ 244 (273)
T PF02404_consen 215 WPAIALPSLFSLVIGFVLGALYWKKKQRSL 244 (273)
T ss_dssp ------------------------------
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcccc
Confidence 334444444445554445567788877643
No 470
>KOG3621|consensus
Probab=22.12 E-value=2e+02 Score=32.65 Aligned_cols=115 Identities=16% Similarity=0.192 Sum_probs=57.5
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEc--CCCCCcce-EEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLN--TGLNEPYD-IALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~--~~~~~p~~-iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
+...|+++....+||..|. .++|....|+... ..+.. .-+..+.- +-+|-...+|.++..- +=...+++
T Consensus 126 rVTal~Ws~~~~k~ysGD~----~Gkv~~~~L~s~~-~~~~~~q~il~~ds~IVQlD~~q~~LLVStl~---r~~Lc~tE 197 (726)
T KOG3621|consen 126 RVTALEWSKNGMKLYSGDS----QGKVVLTELDSRQ-AFLSKSQEILSEDSEIVQLDYLQSYLLVSTLT---RCILCQTE 197 (726)
T ss_pred eEEEEEecccccEEeecCC----CceEEEEEechhh-hhccccceeeccCcceEEeecccceehHhhhh---hhheeecc
Confidence 4567888888888998886 6788887777621 11110 01222222 4456566666665432 11223333
Q ss_pred CCCcEEEEeCCCCC--CeeEEEeCC-----CCeEEEEcCCCCcEEEEeCCCCce
Q psy6570 104 GKNKFNLVDNNIQW--PTGITIDYP-----SQRLYWADPKARTIESINLNGKDR 150 (713)
Q Consensus 104 G~~~~~l~~~~~~~--p~glavd~~-----~~~LY~~d~~~~~I~~~~~~g~~~ 150 (713)
-...+.+....-.. +.|..+=+- .-+||-+-+ ..|+|.+|++|...
T Consensus 198 ~eti~QIG~k~R~~~~~~GACF~~g~~~~q~~~IycaRP-G~RlWead~~G~V~ 250 (726)
T KOG3621|consen 198 AETITQIGKKPRKSLIDFGACFFPGQCKAQKPQIYCARP-GLRLWEADFAGEVI 250 (726)
T ss_pred hhHHHHhcCCCcCCccccceEEeeccccCCCceEEEecC-CCceEEeecceeEE
Confidence 22222222211111 333333322 235555443 45788888888543
No 471
>PF04901 RAMP: Receptor activity modifying family ; InterPro: IPR006985 The calcitonin-receptor-like receptor can function as either a calcitonin-gene-related peptide or an adrenomedullin receptor. The receptors function is modified by receptor activity modifying protein or RAMP. RAMPs are single-transmembrane-domain proteins [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport, 0008277 regulation of G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2YX8_A 3N7R_C 3N7P_D 3N7S_C 2XVT_A 3AQF_A 3AQE_B.
Probab=21.94 E-value=30 Score=29.58 Aligned_cols=23 Identities=17% Similarity=0.241 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHhheeeEEEEec
Q psy6570 689 ILILILLLITVGGIGYYIFRIKM 711 (713)
Q Consensus 689 ~~~~~~~~~~~~~~~~~~~~~~~ 711 (713)
|++-++|+++++++++||.++.+
T Consensus 89 I~~Pi~lt~~m~~LVVw~sK~~e 111 (113)
T PF04901_consen 89 IIVPILLTLLMTALVVWRSKRSE 111 (113)
T ss_dssp -----------------------
T ss_pred HHHHHHHHHHHHHheeeeccCcC
Confidence 34444444555667777776643
No 472
>KOG4532|consensus
Probab=21.85 E-value=8.3e+02 Score=24.59 Aligned_cols=29 Identities=3% Similarity=-0.111 Sum_probs=17.3
Q ss_pred CceEEEeccCCeEEEeecCCCCCCeEEEEecCC
Q psy6570 28 PRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEG 60 (713)
Q Consensus 28 p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G 60 (713)
-..++++.....+-... ...+|+++.+|.
T Consensus 161 ~ns~~~snd~~~~~~Vg----ds~~Vf~y~id~ 189 (344)
T KOG4532|consen 161 QNSLHYSNDPSWGSSVG----DSRRVFRYAIDD 189 (344)
T ss_pred eeeeEEcCCCceEEEec----CCCcceEEEeCC
Confidence 56778876654443332 356777776654
No 473
>KOG2280|consensus
Probab=21.69 E-value=5.7e+02 Score=29.62 Aligned_cols=61 Identities=13% Similarity=0.051 Sum_probs=33.8
Q ss_pred CCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEE--EEcCC-C-------CCcceEEEcCCCCcEEEEccC
Q psy6570 27 DPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRT--LLNTG-L-------NEPYDIALEPLSGRMFWTELG 92 (713)
Q Consensus 27 ~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~--l~~~~-~-------~~p~~iavD~~~~~ly~td~~ 92 (713)
.+-++.++.. ..|.+... .+++.++++-|..++. +..+. . ..-+||++-...|.+|.....
T Consensus 85 ~lI~mgWs~~-eeLI~v~k----~g~v~Vy~~~ge~ie~~svg~e~~~~~I~ec~~f~~GVavlt~~g~v~~i~~~ 155 (829)
T KOG2280|consen 85 ELIGMGWSDD-EELICVQK----DGTVHVYGLLGEFIESNSVGFESQMSDIVECRFFHNGVAVLTVSGQVILINGV 155 (829)
T ss_pred CeeeecccCC-ceEEEEec----cceEEEeecchhhhcccccccccccCceeEEEEecCceEEEecCCcEEEEcCC
Confidence 5555555533 33433332 5788888888876654 21111 0 012577776677777776533
No 474
>KOG4818|consensus
Probab=21.29 E-value=94 Score=32.21 Aligned_cols=29 Identities=14% Similarity=-0.038 Sum_probs=15.2
Q ss_pred ccccchhHHHHHHHHHHHHHHhheeeEEE
Q psy6570 680 SYVNSHISSILILILLLITVGGIGYYIFR 708 (713)
Q Consensus 680 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 708 (713)
+.....+++++++.++++++++.++-++|
T Consensus 326 siv~PivVg~~l~gl~~~vliaylIgrr~ 354 (362)
T KOG4818|consen 326 NIVLPIAVGAILAGLVLVVLIAYLIGRRR 354 (362)
T ss_pred ceecchHHHHHHHHHHHHHHHHhheehee
Confidence 33445566666665555555555554333
No 475
>KOG4714|consensus
Probab=21.28 E-value=5.3e+02 Score=25.76 Aligned_cols=76 Identities=9% Similarity=0.036 Sum_probs=33.1
Q ss_pred CCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCc-eEEEEEcCCCCCcceEEEcCCCCcEEEEccCCCCeEEEEecC
Q psy6570 25 LHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGR-KKRTLLNTGLNEPYDIALEPLSGRMFWTELGIKPRISGASID 103 (713)
Q Consensus 25 ~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~-~~~~l~~~~~~~p~~iavD~~~~~ly~td~~~~~~I~~~~~d 103 (713)
+.....++.++...+|...-. ..+.|.+.+.... +...++...-...+.+-+.|.+..=.|+-+... .+|+.+..
T Consensus 179 ~~~v~~l~~hp~qq~~v~cgt---~dg~~~l~d~rn~~~p~S~l~ahk~~i~eV~FHpk~p~~Lft~sedG-slw~wdas 254 (319)
T KOG4714|consen 179 LDAVTALCSHPAQQHLVCCGT---DDGIVGLWDARNVAMPVSLLKAHKAEIWEVHFHPKNPEHLFTCSEDG-SLWHWDAS 254 (319)
T ss_pred cccchhhhCCcccccEEEEec---CCCeEEEEEcccccchHHHHHHhhhhhhheeccCCCchheeEecCCC-cEEEEcCC
Confidence 333455555555555555444 3444444443321 111111111123445666666554333332322 57777666
Q ss_pred C
Q psy6570 104 G 104 (713)
Q Consensus 104 G 104 (713)
+
T Consensus 255 ~ 255 (319)
T KOG4714|consen 255 T 255 (319)
T ss_pred C
Confidence 5
No 476
>KOG3509|consensus
Probab=21.21 E-value=1.7e+02 Score=35.02 Aligned_cols=72 Identities=25% Similarity=0.519 Sum_probs=48.7
Q ss_pred CCCCCCCCCCCCcEEeecCCCceeeCCCCCcCCCCCcCCCCCCC-CCCCCCCCeEecCCCCcceeecCCCcccCCC
Q psy6570 552 ANKCTPNYCSNNGTCVLIEGKPSCKCLPPYSGKQCTEREDSPSC-HNYCDNAGLCSYSKQGKPVCTCVNGWSGITC 626 (713)
Q Consensus 552 ~~~C~~~~C~~~~~C~~~~g~~~C~C~~G~~G~~C~~~~~~~~C-~~~C~~~g~C~~~~~g~~~C~C~~G~~G~~C 626 (713)
.+.|...+|...+.|....-..+|.|++||+|..|.+....... .+.+ ..++|.... +...+.|.+| .|..+
T Consensus 406 g~~c~~~p~~~~g~c~p~~~~~~c~c~~g~~G~~c~d~~~~~~~~~~g~-y~~t~~~~~-~~~~~~c~pg-~g~~~ 478 (964)
T KOG3509|consen 406 GDVCWRIPCQHDGPCLQTLEGKQCLCPPGYTGDSCEDCMNGCDRSPNGS-YLGTCVPIQ-GKRCEYCGPG-AGAPT 478 (964)
T ss_pred CCccccccCCCCccccccccccceeccccccCchhhccCccccccCCcc-ccceEeccC-CCcceeecCC-CCCcc
Confidence 35666777888888888877889999999999999875322222 2222 235665443 4566889999 66555
No 477
>KOG1538|consensus
Probab=21.19 E-value=1.2e+03 Score=26.71 Aligned_cols=114 Identities=12% Similarity=0.053 Sum_probs=62.2
Q ss_pred CCceeEEEccCcccEEecC----CCCCceEEEeccC-----CeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcce
Q psy6570 6 SGNVTRVKREMNLKTVLSN----LHDPRGVAVDWVG-----KNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYD 76 (713)
Q Consensus 6 ~~~I~~~~~~~~~~~~~~~----~~~p~gla~D~~~-----~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~ 76 (713)
+|.|..-+..+..+..+.. .....+|++.+.+ +.|-++|+ ...+.-..++|+....--.-+ -.|.-
T Consensus 153 nGTIsiRNk~gEek~~I~Rpgg~Nspiwsi~~~p~sg~G~~di~aV~DW----~qTLSFy~LsG~~Igk~r~L~-FdP~C 227 (1081)
T KOG1538|consen 153 NGTISIRNKNGEEKVKIERPGGSNSPIWSICWNPSSGEGRNDILAVADW----GQTLSFYQLSGKQIGKDRALN-FDPCC 227 (1081)
T ss_pred CceEEeecCCCCcceEEeCCCCCCCCceEEEecCCCCCCccceEEEEec----cceeEEEEecceeecccccCC-CCchh
Confidence 4555555555554444332 2224677777643 35778888 467888888886543211111 35777
Q ss_pred EEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCC
Q psy6570 77 IALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPS 127 (713)
Q Consensus 77 iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~ 127 (713)
|..=+.+.++.+-... . .+..+..+|-..-++.. --.|...+++.|.+
T Consensus 228 isYf~NGEy~LiGGsd-k-~L~~fTR~GvrLGTvg~-~D~WIWtV~~~PNs 275 (1081)
T KOG1538|consen 228 ISYFTNGEYILLGGSD-K-QLSLFTRDGVRLGTVGE-QDSWIWTVQAKPNS 275 (1081)
T ss_pred heeccCCcEEEEccCC-C-ceEEEeecCeEEeeccc-cceeEEEEEEccCC
Confidence 7776544445443321 2 45555556644444433 33466777777543
No 478
>KOG0305|consensus
Probab=20.92 E-value=1.2e+03 Score=25.93 Aligned_cols=127 Identities=8% Similarity=-0.102 Sum_probs=60.9
Q ss_pred CCceeEEEccCcc--cEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCcceEEEcCCC
Q psy6570 6 SGNVTRVKREMNL--KTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEPYDIALEPLS 83 (713)
Q Consensus 6 ~~~I~~~~~~~~~--~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p~~iavD~~~ 83 (713)
++++...+..... ..+........+||+.|-...|.-+-.+ .....|...+.........+.. -.+...|+..+..
T Consensus 322 DN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~~lLAsGGG-s~D~~i~fwn~~~g~~i~~vdt-gsQVcsL~Wsk~~ 399 (484)
T KOG0305|consen 322 DNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQSGLLATGGG-SADRCIKFWNTNTGARIDSVDT-GSQVCSLIWSKKY 399 (484)
T ss_pred ccceEeccCCCccccEEEeccceeeeEeeeCCCccCceEEcCC-CcccEEEEEEcCCCcEeccccc-CCceeeEEEcCCC
Confidence 3444444443322 2334455567778888766666544432 1233444444443332222222 2567777777666
Q ss_pred CcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEc
Q psy6570 84 GRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWAD 134 (713)
Q Consensus 84 ~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d 134 (713)
+.|..|--.....|..-+.....+...+.....+.-.|++.|++..|..+.
T Consensus 400 kEi~sthG~s~n~i~lw~~ps~~~~~~l~gH~~RVl~la~SPdg~~i~t~a 450 (484)
T KOG0305|consen 400 KELLSTHGYSENQITLWKYPSMKLVAELLGHTSRVLYLALSPDGETIVTGA 450 (484)
T ss_pred CEEEEecCCCCCcEEEEeccccceeeeecCCcceeEEEEECCCCCEEEEec
Confidence 666555322222333333333333333333333445566666655555544
No 479
>COG3361 Uncharacterized conserved protein [Function unknown]
Probab=20.52 E-value=2.5e+02 Score=26.28 Aligned_cols=74 Identities=14% Similarity=0.073 Sum_probs=49.6
Q ss_pred CCcceEEEcCCCCcEEEEccCCCCeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCcee
Q psy6570 72 NEPYDIALEPLSGRMFWTELGIKPRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKDRF 151 (713)
Q Consensus 72 ~~p~~iavD~~~~~ly~td~~~~~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~~~ 151 (713)
..|.++.+|-.+|.-|++-. -+.|...... ..+..|.+-++-..+-++|++..+..-|+-+++|...+.
T Consensus 41 ~~P~g~~~Dt~dG~ayvs~v-------PF~m~n~r~~----~~lpip~~rtFpElNlRtYVt~~GrpGIyFfsLda~~~l 109 (240)
T COG3361 41 SYPPGTRPDTADGMAYVSLV-------PFRMSNTRLG----TALPIPYVRTFPELNLRTYVTNAGRPGIYFFSLDAARLL 109 (240)
T ss_pred cCCCCCCccccCCeeEEEEe-------eeeeeccccc----ccCCCcccccccccceEEEEeeCCCceeEEEeccchhhh
Confidence 45888888888888887542 1222221111 123456666777677899999988889999999988776
Q ss_pred EEEec
Q psy6570 152 VVYHT 156 (713)
Q Consensus 152 ~~~~~ 156 (713)
.+...
T Consensus 110 ~V~~a 114 (240)
T COG3361 110 VVPLA 114 (240)
T ss_pred eeeee
Confidence 65543
No 480
>PF09472 MtrF: Tetrahydromethanopterin S-methyltransferase, F subunit (MtrF); InterPro: IPR013347 Many archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This domain is mostly found in MtrF, where it covers the entire length of the protein. This polypeptide is one of eight subunits of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase complex found in methanogenic archaea. This is a membrane-associated enzyme complex that uses methyl-transfer reactions to drive a sodium-ion pump []. MtrF itself is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. Subsequently, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase. In some organisms this domain is found at the C-terminal region of what appears to be a fusion of the MtrA and MtrF proteins [, ]. The function of these proteins is unknown, though it is likely that they are involved in C1 metabolism.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016020 membrane
Probab=20.50 E-value=74 Score=24.08 Aligned_cols=20 Identities=10% Similarity=0.021 Sum_probs=9.8
Q ss_pred cchhHHHHHHHHHHHHHHhh
Q psy6570 683 NSHISSILILILLLITVGGI 702 (713)
Q Consensus 683 ~~~~~~~~~~~~~~~~~~~~ 702 (713)
.+++++++++++++++.+++
T Consensus 43 ~GfaiG~~~AlvLv~ip~~l 62 (64)
T PF09472_consen 43 KGFAIGFLFALVLVGIPILL 62 (64)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44555555555555444433
No 481
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=20.09 E-value=52 Score=26.19 Aligned_cols=7 Identities=43% Similarity=0.619 Sum_probs=2.3
Q ss_pred HHHHHHH
Q psy6570 687 SSILILI 693 (713)
Q Consensus 687 ~~~~~~~ 693 (713)
+++++++
T Consensus 7 ~~iiali 13 (81)
T PF00558_consen 7 LAIIALI 13 (81)
T ss_dssp -HHHHHH
T ss_pred HHHHHHH
Confidence 3333333
No 482
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ].
Probab=20.07 E-value=1.5e+03 Score=26.91 Aligned_cols=170 Identities=9% Similarity=-0.020 Sum_probs=0.0
Q ss_pred CCcccCCceeEEEccCcccEEecCCCCCceEEEeccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCC-cceEEE
Q psy6570 1 MASISSGNVTRVKREMNLKTVLSNLHDPRGVAVDWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNE-PYDIAL 79 (713)
Q Consensus 1 vad~~~~~I~~~~~~~~~~~~~~~~~~p~gla~D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~-p~~iav 79 (713)
|.....+.+.-...-....+.....-.|..+.+......|.+.+.. ..+.|+++|+.-..+..-+...-.. ...++-
T Consensus 456 Fk~~~~~~l~f~t~i~~i~~~~g~~~~P~k~mL~~~d~~mil~~~~--~~~~ly~mDLe~GKVV~eW~~~~~~~v~~~~p 533 (794)
T PF08553_consen 456 FKNTDDDGLEFSTAISNISTPKGKNFTPKKAMLHDQDRNMILLDPN--NPNKLYKMDLERGKVVEEWKVHDDIPVVDIAP 533 (794)
T ss_pred EECCCCCceeeeEEecccccCCCcccCcchhhhhccccceEeecCC--CCCceEEEecCCCcEEEEeecCCCcceeEecc
Q ss_pred cCC-----CCcEEEEccCCCCeEEEEecCCCCcEEEEe-----CCCCCCeeEEEeCCCCeEEEEcCCCCcEEEEeCCCCc
Q psy6570 80 EPL-----SGRMFWTELGIKPRISGASIDGKNKFNLVD-----NNIQWPTGITIDYPSQRLYWADPKARTIESINLNGKD 149 (713)
Q Consensus 80 D~~-----~~~ly~td~~~~~~I~~~~~dG~~~~~l~~-----~~~~~p~glavd~~~~~LY~~d~~~~~I~~~~~~g~~ 149 (713)
+.. ....|+.-..+. |.|+++.=...+++.. ..-..-..+|-+ ..|+|-++. ..+.|+.++..|..
T Consensus 534 ~~K~aqlt~e~tflGls~n~--lfriDpR~~~~k~v~~~~k~Y~~~~~Fs~~aTt-~~G~iavgs-~~G~IRLyd~~g~~ 609 (794)
T PF08553_consen 534 DSKFAQLTNEQTFLGLSDNS--LFRIDPRLSGNKLVDSQSKQYSSKNNFSCFATT-EDGYIAVGS-NKGDIRLYDRLGKR 609 (794)
T ss_pred cccccccCCCceEEEECCCc--eEEeccCCCCCceeeccccccccCCCceEEEec-CCceEEEEe-CCCcEEeecccchh
Q ss_pred eeEEEecCCCCccceeeeeeCCeEEEE
Q psy6570 150 RFVVYHTEDNGYKPYKLEVFEDNLYFS 176 (713)
Q Consensus 150 ~~~~~~~~~~~~~p~~i~~~~~~ly~t 176 (713)
-++++.........+.++.++.+|..|
T Consensus 610 AKT~lp~lG~pI~~iDvt~DGkwilaT 636 (794)
T PF08553_consen 610 AKTALPGLGDPIIGIDVTADGKWILAT 636 (794)
T ss_pred hhhcCCCCCCCeeEEEecCCCcEEEEe
No 483
>PF05262 Borrelia_P83: Borrelia P83/100 protein; InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=20.06 E-value=8.6e+02 Score=26.94 Aligned_cols=64 Identities=16% Similarity=0.077 Sum_probs=40.0
Q ss_pred eEEEE-cCCCCcEEEEeCCCCceeEEEecCCCCccceeeeeeCCeEEEEeCCCCcEEEEcccCCC
Q psy6570 129 RLYWA-DPKARTIESINLNGKDRFVVYHTEDNGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNS 192 (713)
Q Consensus 129 ~LY~~-d~~~~~I~~~~~~g~~~~~~~~~~~~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~ 192 (713)
.|-++ +.+++.|..+.+|-.+..++......+..-.-|.++++.+|..-...+..|.+-+|+..
T Consensus 408 ~vaI~g~~G~~~ikLvlid~~tLev~kes~~~i~~~S~l~~~~~~iyaVv~~~~g~~~L~rF~~~ 472 (489)
T PF05262_consen 408 LVAIAGCSGNAAIKLVLIDPETLEVKKESEDEISWQSSLIVDGQMIYAVVKKDNGKWYLGRFDSN 472 (489)
T ss_pred EEEEeccCCchheEEEecCcccceeeeeccccccccCceEEcCCeEEEEEEcCCCeEEEeecCcc
Confidence 33333 35667788888888877777766554555555667788888554234455666666544
No 484
>PF09826 Beta_propel: Beta propeller domain; InterPro: IPR019198 This entry consists of predicted secreted proteins containing a C-terminal beta-propeller domain distantly related to WD-40 repeats.
Probab=20.04 E-value=1.3e+03 Score=26.00 Aligned_cols=168 Identities=13% Similarity=0.079 Sum_probs=0.0
Q ss_pred eccCCeEEEeecCCCCCCeEEEEecCCceEEEEEcCCCCCc------------------ceEEEcCCCCcEEEEccCCC-
Q psy6570 34 DWVGKNLYWTDAGGRSSNNIMVSTLEGRKKRTLLNTGLNEP------------------YDIALEPLSGRMFWTELGIK- 94 (713)
Q Consensus 34 D~~~~~ly~td~~~~~~~~I~~~~~~G~~~~~l~~~~~~~p------------------~~iavD~~~~~ly~td~~~~- 94 (713)
|+.+-++-+.-. -.+...-+.+-|..+.++.......+ .....-.....+|+-.....
T Consensus 107 D~~~P~~~~~~~---~~G~yvsSR~ig~~vy~Vt~~~~~~~~~~~~~~~~~~P~~~~~~~~~~~~~~~~~~y~p~~~~~~ 183 (521)
T PF09826_consen 107 DPSNPKLLREIE---IEGSYVSSRKIGDYVYLVTNSYPNYYAIEEADLEDILPSYRDNGEEATVIVPCDIIYFPGGPSGS 183 (521)
T ss_pred CCCCceEEEEEE---eeeEEEeEEEECCEEEEEEecCCccchhhcccccccCceEEecCcceeeecccceEEecCCCCCC
Q ss_pred --CeEEEEecCCCCcEEEEeCCCCCCeeEEEeCCCCeEEEEcCCC----------------------CcEEEEeCCCCce
Q psy6570 95 --PRISGASIDGKNKFNLVDNNIQWPTGITIDYPSQRLYWADPKA----------------------RTIESINLNGKDR 150 (713)
Q Consensus 95 --~~I~~~~~dG~~~~~l~~~~~~~p~glavd~~~~~LY~~d~~~----------------------~~I~~~~~~g~~~ 150 (713)
-.|..+++ ......-...-+.....|-+. .+.||++.... -.|+++++++...
T Consensus 184 ~~~~i~s~dl-~~~~~~~~~~~~g~~~~vY~S--~~~LYia~~~~~~~~~~~~~~~~~~~~~~~~~~T~I~kf~~~~~~~ 260 (521)
T PF09826_consen 184 NYTTITSIDL-DPDKASDSTSVLGSGGNVYMS--ENNLYIASNRYYYEPYAMMRFEASAEPEESNESTTIYKFALDGGKI 260 (521)
T ss_pred cEEEEEEEeC-CCCCccceeEEEecCCEEEEe--CCcEEEEEecccccccccchhccccccccCCCceEEEEEEccCCcE
Q ss_pred eEEEecCC--CCccceeeeeeCCeEEEEeCCCCcEEEEcccCCCcceeeeccccccccE
Q psy6570 151 FVVYHTED--NGYKPYKLEVFEDNLYFSTYRTNNILKINKFGNSDFNVLANNLNRASDV 207 (713)
Q Consensus 151 ~~~~~~~~--~~~~p~~i~~~~~~ly~td~~~~~i~~~~~~~~~~~~~~~~~~~~~~~i 207 (713)
+-+.+... .+.+-++|+-++++|-++.......+.-.......+.++-..+.....|
T Consensus 261 ~y~~sg~V~G~llnqFsmdE~~G~LRvaTT~~~~~~~~~~~s~N~lyVLD~~L~~vG~l 319 (521)
T PF09826_consen 261 EYVGSGSVPGYLLNQFSMDEYDGYLRVATTSGNWWWDSEDTSSNNLYVLDEDLKIVGSL 319 (521)
T ss_pred EEEEEEEECcEEcccccEeccCCEEEEEEecCcccccCCCCceEEEEEECCCCcEeEEc
Done!