Query psy6572
Match_columns 1416
No_of_seqs 734 out of 4278
Neff 7.3
Searched_HMMs 46136
Date Fri Aug 16 21:34:47 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6572.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6572hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1215|consensus 100.0 6.1E-57 1.3E-61 595.7 47.9 586 126-845 110-710 (877)
2 KOG1214|consensus 100.0 2.6E-54 5.6E-59 505.7 33.9 353 467-836 800-1287(1289)
3 KOG1215|consensus 100.0 1.3E-34 2.7E-39 383.6 38.5 562 28-725 28-638 (877)
4 KOG1214|consensus 100.0 1E-30 2.2E-35 308.1 25.4 197 592-797 1000-1208(1289)
5 PF08450 SGL: SMP-30/Gluconola 99.6 2.4E-14 5.1E-19 162.7 24.4 213 569-794 2-232 (246)
6 PLN02919 haloacid dehalogenase 99.6 6E-14 1.3E-18 187.3 31.9 234 540-777 580-892 (1057)
7 PLN02919 haloacid dehalogenase 99.6 4E-14 8.8E-19 188.9 28.2 215 561-778 562-838 (1057)
8 PF08450 SGL: SMP-30/Gluconola 99.5 3.7E-12 8E-17 144.8 24.3 207 539-762 11-245 (246)
9 KOG4659|consensus 99.3 2.5E-11 5.3E-16 151.3 18.3 225 538-771 374-689 (1899)
10 KOG4659|consensus 99.3 8.4E-11 1.8E-15 146.7 19.5 199 566-777 364-626 (1899)
11 PF10282 Lactonase: Lactonase, 99.1 3.8E-08 8.3E-13 117.7 30.1 206 565-777 85-328 (345)
12 COG3386 Gluconolactonase [Carb 99.1 1.2E-08 2.7E-13 118.5 23.3 207 563-781 21-250 (307)
13 PF10282 Lactonase: Lactonase, 99.1 7.2E-08 1.6E-12 115.4 30.2 246 549-795 14-300 (345)
14 PRK11028 6-phosphogluconolacto 99.1 8.4E-08 1.8E-12 114.0 30.7 220 567-794 80-327 (330)
15 PRK11028 6-phosphogluconolacto 99.0 7.9E-08 1.7E-12 114.2 29.3 229 541-772 3-257 (330)
16 COG3386 Gluconolactonase [Carb 98.9 2E-07 4.3E-12 108.6 25.3 210 539-765 36-278 (307)
17 COG3391 Uncharacterized conser 98.8 4.4E-07 9.5E-12 109.9 25.8 221 566-794 30-260 (381)
18 PF00058 Ldl_recept_b: Low-den 98.8 5E-09 1.1E-13 84.7 5.8 42 664-705 1-42 (42)
19 TIGR02604 Piru_Ver_Nterm putat 98.8 6.9E-07 1.5E-11 107.8 24.4 195 560-768 65-345 (367)
20 PF00057 Ldl_recept_a: Low-den 98.8 4.2E-09 9.1E-14 82.3 3.4 36 30-65 2-37 (37)
21 PF00057 Ldl_recept_a: Low-den 98.8 4.4E-09 9.5E-14 82.2 2.8 37 66-102 1-37 (37)
22 COG3391 Uncharacterized conser 98.7 8.8E-07 1.9E-11 107.3 23.9 203 567-779 74-289 (381)
23 cd00112 LDLa Low Density Lipop 98.7 4.9E-09 1.1E-13 81.1 2.6 35 31-65 1-35 (35)
24 COG2706 3-carboxymuconate cycl 98.7 1.2E-05 2.7E-10 91.8 28.0 226 549-776 17-279 (346)
25 PF00058 Ldl_recept_b: Low-den 98.7 3.7E-08 8.1E-13 79.7 5.7 41 621-661 1-42 (42)
26 cd00112 LDLa Low Density Lipop 98.6 1.3E-08 2.8E-13 78.7 2.1 35 160-199 1-35 (35)
27 PF06977 SdiA-regulated: SdiA- 98.6 3.3E-06 7.2E-11 95.3 21.6 193 566-762 21-240 (248)
28 TIGR03866 PQQ_ABC_repeats PQQ- 98.6 5.9E-05 1.3E-09 87.1 31.4 234 548-795 53-299 (300)
29 TIGR02604 Piru_Ver_Nterm putat 98.6 1.5E-06 3.2E-11 105.0 18.5 150 608-771 13-211 (367)
30 PF14670 FXa_inhibition: Coagu 98.6 2.6E-08 5.6E-13 77.0 2.0 35 808-843 1-36 (36)
31 COG2706 3-carboxymuconate cycl 98.6 4E-05 8.7E-10 87.7 28.2 220 568-794 90-344 (346)
32 PF07995 GSDH: Glucose / Sorbo 98.5 5E-06 1.1E-10 98.8 19.7 224 566-795 1-314 (331)
33 KOG1520|consensus 98.5 7.8E-06 1.7E-10 94.9 20.1 140 567-714 115-281 (376)
34 smart00192 LDLa Low-density li 98.4 1.4E-07 3.1E-12 72.1 3.1 32 1012-1043 2-33 (33)
35 smart00192 LDLa Low-density li 98.4 1.6E-07 3.4E-12 71.8 2.9 32 293-324 2-33 (33)
36 TIGR03866 PQQ_ABC_repeats PQQ- 98.4 0.00038 8.2E-09 80.4 30.8 226 548-778 11-242 (300)
37 PF06977 SdiA-regulated: SdiA- 98.3 4.5E-05 9.7E-10 86.2 20.3 162 549-715 45-240 (248)
38 TIGR02658 TTQ_MADH_Hv methylam 98.3 0.00033 7.2E-09 83.0 27.6 218 549-781 28-338 (352)
39 PF03022 MRJP: Major royal jel 98.2 7.4E-05 1.6E-09 86.9 20.9 151 610-766 62-258 (287)
40 PF14670 FXa_inhibition: Coagu 98.2 1.1E-06 2.4E-11 68.1 2.8 35 456-493 2-36 (36)
41 TIGR03606 non_repeat_PQQ dehyd 98.2 0.00038 8.2E-09 85.0 26.0 153 560-717 23-250 (454)
42 KOG1520|consensus 98.1 2.4E-05 5.2E-10 90.9 13.9 147 608-762 114-282 (376)
43 PF12999 PRKCSH-like: Glucosid 98.1 2E-06 4.2E-11 90.3 4.5 67 975-1044 35-111 (176)
44 KOG4499|consensus 98.1 0.0002 4.2E-09 77.4 19.1 198 570-782 18-250 (310)
45 PF07995 GSDH: Glucose / Sorbo 98.1 5.1E-05 1.1E-09 90.3 15.8 150 608-767 1-205 (331)
46 PF12999 PRKCSH-like: Glucosid 98.1 3.2E-06 6.9E-11 88.7 4.7 74 210-288 36-116 (176)
47 smart00135 LY Low-density lipo 98.1 6.5E-06 1.4E-10 66.6 5.3 42 645-687 2-43 (43)
48 smart00135 LY Low-density lipo 98.1 7.5E-06 1.6E-10 66.2 5.6 42 689-730 2-43 (43)
49 COG3204 Uncharacterized protei 98.0 0.00038 8.2E-09 78.2 20.1 192 566-761 85-301 (316)
50 PRK04792 tolB translocation pr 98.0 0.00079 1.7E-08 83.6 25.3 224 549-781 199-432 (448)
51 PRK05137 tolB translocation pr 98.0 0.0015 3.2E-08 81.0 27.6 215 548-772 182-411 (435)
52 PRK04922 tolB translocation pr 97.9 0.0013 2.9E-08 81.3 24.9 221 548-777 184-414 (433)
53 PRK03629 tolB translocation pr 97.9 0.0035 7.6E-08 77.5 28.0 185 548-734 179-371 (429)
54 TIGR02800 propeller_TolB tol-p 97.9 0.0025 5.4E-08 78.3 26.7 220 549-776 171-399 (417)
55 COG4257 Vgb Streptogramin lyas 97.9 0.001 2.2E-08 73.6 19.8 222 561-796 56-285 (353)
56 TIGR03606 non_repeat_PQQ dehyd 97.9 0.00069 1.5E-08 82.7 20.0 155 603-765 23-251 (454)
57 TIGR03118 PEPCTERM_chp_1 conse 97.8 0.0018 4E-08 73.2 21.3 209 561-782 17-288 (336)
58 PRK02889 tolB translocation pr 97.8 0.0052 1.1E-07 76.0 26.7 186 548-735 176-369 (427)
59 PF07645 EGF_CA: Calcium-bindi 97.8 2.3E-05 4.9E-10 63.6 3.8 38 495-532 1-42 (42)
60 TIGR02658 TTQ_MADH_Hv methylam 97.8 0.0033 7.1E-08 74.7 23.4 212 577-795 11-307 (352)
61 PRK00178 tolB translocation pr 97.8 0.0048 1E-07 76.3 26.2 221 548-777 179-409 (430)
62 PRK04043 tolB translocation pr 97.8 0.0069 1.5E-07 74.5 27.0 223 547-781 168-407 (419)
63 PF03022 MRJP: Major royal jel 97.7 0.00076 1.6E-08 78.5 16.2 172 539-718 33-257 (287)
64 cd00200 WD40 WD40 domain, foun 97.7 0.044 9.5E-07 61.2 30.1 225 540-777 22-253 (289)
65 COG4257 Vgb Streptogramin lyas 97.7 0.003 6.6E-08 70.0 19.2 213 548-775 83-307 (353)
66 PF12662 cEGF: Complement Clr- 97.6 2.9E-05 6.2E-10 54.3 2.0 24 474-498 1-24 (24)
67 PF03088 Str_synth: Strictosid 97.6 0.00015 3.3E-09 68.5 7.5 72 612-685 1-89 (89)
68 PF02239 Cytochrom_D1: Cytochr 97.6 0.005 1.1E-07 74.4 21.1 191 541-735 7-211 (369)
69 COG3204 Uncharacterized protei 97.5 0.0038 8.3E-08 70.4 18.0 160 549-714 109-301 (316)
70 PRK01742 tolB translocation pr 97.5 0.019 4.1E-07 71.1 26.4 179 548-733 184-368 (429)
71 COG2133 Glucose/sorbosone dehy 97.5 0.006 1.3E-07 73.1 20.5 156 609-772 177-396 (399)
72 PRK01029 tolB translocation pr 97.5 0.022 4.8E-07 70.4 26.5 221 548-776 165-406 (428)
73 PF03088 Str_synth: Strictosid 97.5 0.00029 6.4E-09 66.6 7.0 72 655-727 1-88 (89)
74 PF07645 EGF_CA: Calcium-bindi 97.5 0.00012 2.6E-09 59.3 3.8 39 451-492 1-41 (42)
75 PF02239 Cytochrom_D1: Cytochr 97.4 0.033 7.2E-07 67.4 24.7 142 539-683 48-202 (369)
76 TIGR03032 conserved hypothetic 97.3 0.055 1.2E-06 62.2 23.9 227 556-800 38-318 (335)
77 KOG4499|consensus 97.3 0.0055 1.2E-07 66.6 15.0 87 550-639 141-241 (310)
78 PRK04792 tolB translocation pr 97.3 0.024 5.2E-07 70.5 23.0 185 548-735 242-434 (448)
79 PRK04922 tolB translocation pr 97.3 0.027 5.8E-07 69.8 23.1 185 548-735 228-420 (433)
80 PRK04043 tolB translocation pr 97.2 0.031 6.7E-07 68.8 22.9 183 548-735 213-409 (419)
81 PRK05137 tolB translocation pr 97.2 0.036 7.8E-07 68.8 23.7 184 548-734 226-420 (435)
82 TIGR02800 propeller_TolB tol-p 97.1 0.069 1.5E-06 65.6 23.8 183 548-733 214-404 (417)
83 TIGR03118 PEPCTERM_chp_1 conse 97.0 0.033 7.2E-07 63.4 18.1 183 608-795 22-252 (336)
84 KOG1219|consensus 97.0 0.00093 2E-08 89.3 6.7 97 422-535 3871-3979(4289)
85 cd00200 WD40 WD40 domain, foun 97.0 0.18 4E-06 56.2 24.8 174 547-727 72-250 (289)
86 PRK02889 tolB translocation pr 97.0 0.1 2.3E-06 64.5 24.2 184 548-734 220-411 (427)
87 PRK00178 tolB translocation pr 96.9 0.1 2.3E-06 64.5 23.7 184 548-734 223-414 (430)
88 PRK03629 tolB translocation pr 96.8 0.18 3.9E-06 62.5 24.0 185 548-735 223-415 (429)
89 PF12662 cEGF: Complement Clr- 96.7 0.00087 1.9E-08 47.0 1.5 23 514-536 1-23 (24)
90 PRK01029 tolB translocation pr 96.6 0.28 6E-06 60.8 24.3 186 548-735 211-412 (428)
91 PF02333 Phytase: Phytase; In 96.4 0.25 5.4E-06 59.2 20.6 118 608-727 155-291 (381)
92 COG2133 Glucose/sorbosone dehy 96.2 0.14 3.1E-06 61.6 17.4 183 541-727 142-398 (399)
93 COG5276 Uncharacterized conser 96.2 0.84 1.8E-05 51.9 21.8 217 539-770 96-326 (370)
94 COG5276 Uncharacterized conser 96.1 1.2 2.7E-05 50.6 22.4 179 572-766 90-279 (370)
95 KOG3509|consensus 96.1 0.0046 1E-07 80.3 4.1 105 125-241 2-109 (964)
96 COG4247 Phy 3-phytase (myo-ino 96.0 1.1 2.3E-05 49.9 21.1 169 608-798 152-335 (364)
97 PRK01742 tolB translocation pr 96.0 0.66 1.4E-05 57.5 22.8 178 548-734 228-412 (429)
98 KOG3509|consensus 96.0 0.0052 1.1E-07 79.8 3.9 104 219-326 2-109 (964)
99 PF06433 Me-amine-dh_H: Methyl 95.7 5.6 0.00012 47.0 26.6 198 571-782 99-329 (342)
100 PF05096 Glu_cyclase_2: Glutam 95.6 1.3 2.9E-05 50.5 20.7 174 541-725 57-260 (264)
101 KOG4260|consensus 95.6 0.0075 1.6E-07 66.3 2.7 73 446-527 229-307 (350)
102 COG4946 Uncharacterized protei 95.4 5.1 0.00011 48.2 24.9 126 627-762 378-508 (668)
103 cd01475 vWA_Matrilin VWA_Matri 95.4 0.0097 2.1E-07 66.9 3.0 43 489-531 180-224 (224)
104 smart00179 EGF_CA Calcium-bind 95.4 0.013 2.8E-07 46.1 2.9 35 495-532 1-38 (39)
105 COG0823 TolB Periplasmic compo 95.2 1.5 3.3E-05 54.1 21.2 117 576-693 202-324 (425)
106 KOG1446|consensus 95.2 8 0.00017 44.6 26.7 181 592-782 81-271 (311)
107 PF01436 NHL: NHL repeat; Int 95.2 0.031 6.7E-07 41.1 4.0 27 695-722 1-27 (28)
108 KOG2397|consensus 95.1 0.018 4E-07 68.8 3.9 66 976-1044 43-114 (480)
109 PF01436 NHL: NHL repeat; Int 95.0 0.037 8E-07 40.7 4.1 28 608-636 1-28 (28)
110 PRK02888 nitrous-oxide reducta 94.9 0.47 1E-05 59.8 15.7 182 576-774 139-352 (635)
111 PF02333 Phytase: Phytase; In 94.9 5.3 0.00011 48.2 23.7 121 649-772 153-289 (381)
112 PF05096 Glu_cyclase_2: Glutam 94.8 3.1 6.7E-05 47.6 20.3 161 568-781 46-211 (264)
113 PF13449 Phytase-like: Esteras 94.7 2.5 5.4E-05 50.4 20.8 116 610-727 86-252 (326)
114 KOG0315|consensus 94.7 4.7 0.0001 45.0 20.5 178 539-725 9-196 (311)
115 KOG4260|consensus 94.5 0.036 7.8E-07 61.1 4.2 48 478-528 221-272 (350)
116 KOG2397|consensus 94.5 0.022 4.8E-07 68.2 2.8 67 1175-1245 44-115 (480)
117 PF13360 PQQ_2: PQQ-like domai 94.5 7.9 0.00017 43.2 23.5 61 707-777 173-234 (238)
118 KOG0273|consensus 94.1 7.6 0.00016 46.9 22.0 136 547-685 256-393 (524)
119 KOG1219|consensus 93.9 0.062 1.4E-06 73.1 5.2 63 464-535 3872-3940(4289)
120 COG4946 Uncharacterized protei 93.9 5.7 0.00012 47.8 20.4 122 592-715 383-508 (668)
121 PRK02888 nitrous-oxide reducta 93.8 6.2 0.00014 50.2 22.1 209 548-771 152-402 (635)
122 KOG0285|consensus 93.3 7.8 0.00017 45.3 19.7 225 531-780 155-396 (460)
123 PF06433 Me-amine-dh_H: Methyl 92.9 11 0.00023 44.8 20.8 55 615-670 190-256 (342)
124 KOG4289|consensus 92.4 1.4 3E-05 58.7 13.5 83 440-534 1225-1317(2531)
125 PF13449 Phytase-like: Esteras 92.4 4.2 9E-05 48.5 17.3 61 653-714 86-165 (326)
126 PF12947 EGF_3: EGF domain; I 92.4 0.091 2E-06 41.1 2.1 24 467-493 13-36 (36)
127 TIGR03032 conserved hypothetic 92.1 5.1 0.00011 46.6 16.3 50 744-795 202-251 (335)
128 KOG0291|consensus 91.8 39 0.00085 43.4 24.3 155 522-680 341-507 (893)
129 PF14583 Pectate_lyase22: Olig 91.8 17 0.00037 43.9 20.8 141 547-692 59-233 (386)
130 PF02897 Peptidase_S9_N: Proly 91.4 43 0.00093 41.2 26.5 179 548-731 150-363 (414)
131 KOG0315|consensus 91.1 30 0.00066 38.9 24.1 128 539-670 52-186 (311)
132 smart00179 EGF_CA Calcium-bind 91.0 0.19 4.2E-06 39.3 2.8 23 1294-1321 9-33 (39)
133 PF00930 DPPIV_N: Dipeptidyl p 91.0 15 0.00032 44.3 20.2 83 654-736 237-326 (353)
134 KOG0285|consensus 91.0 5.8 0.00013 46.3 15.2 123 608-736 151-275 (460)
135 smart00181 EGF Epidermal growt 91.0 0.18 4E-06 38.7 2.5 25 502-526 6-31 (35)
136 KOG4289|consensus 90.8 0.19 4.2E-06 66.0 3.7 50 470-524 1217-1269(2531)
137 cd01475 vWA_Matrilin VWA_Matri 90.3 0.21 4.6E-06 56.1 3.2 38 451-491 186-223 (224)
138 KOG0291|consensus 90.1 69 0.0015 41.3 24.6 146 608-773 350-508 (893)
139 KOG1446|consensus 90.1 43 0.00093 38.9 23.4 171 549-725 81-261 (311)
140 PF12947 EGF_3: EGF domain; I 90.0 0.21 4.5E-06 39.1 2.0 29 502-532 6-36 (36)
141 PTZ00421 coronin; Provisional 89.8 67 0.0014 40.7 26.7 115 565-683 74-198 (493)
142 cd00054 EGF_CA Calcium-binding 89.8 0.29 6.3E-06 37.8 2.8 29 496-524 2-33 (38)
143 KOG2106|consensus 89.7 54 0.0012 40.3 22.1 156 608-780 329-496 (626)
144 PLN00181 protein SPA1-RELATED; 89.4 57 0.0012 44.0 25.8 156 567-727 484-649 (793)
145 KOG0268|consensus 89.3 5.6 0.00012 46.5 13.4 216 539-771 120-345 (433)
146 PF05787 DUF839: Bacterial pro 89.2 4.7 0.0001 51.1 14.2 23 649-671 347-369 (524)
147 COG3823 Glutamine cyclotransfe 89.2 18 0.00039 39.7 16.2 66 661-727 183-260 (262)
148 PF00930 DPPIV_N: Dipeptidyl p 88.6 12 0.00027 45.0 16.9 97 608-705 234-337 (353)
149 PRK13616 lipoprotein LpqB; Pro 88.5 47 0.001 43.0 22.7 185 567-764 350-560 (591)
150 smart00181 EGF Epidermal growt 88.2 0.39 8.6E-06 36.8 2.4 25 1294-1323 6-31 (35)
151 PF01731 Arylesterase: Arylest 88.0 1.1 2.5E-05 42.3 5.8 36 690-725 48-83 (86)
152 PF14583 Pectate_lyase22: Olig 87.9 20 0.00043 43.3 17.3 151 580-735 50-233 (386)
153 COG4247 Phy 3-phytase (myo-ino 86.5 44 0.00095 37.7 17.5 106 627-733 120-244 (364)
154 PF05787 DUF839: Bacterial pro 86.2 4.2 9E-05 51.6 11.2 62 608-670 349-453 (524)
155 smart00284 OLF Olfactomedin-li 86.1 19 0.00041 41.2 15.2 135 578-715 83-243 (255)
156 TIGR03300 assembly_YfgL outer 86.0 89 0.0019 37.8 23.2 103 663-779 241-344 (377)
157 COG0823 TolB Periplasmic compo 85.9 33 0.00072 42.5 18.6 121 549-670 219-344 (425)
158 PF02897 Peptidase_S9_N: Proly 85.3 60 0.0013 39.9 20.8 133 549-683 203-357 (414)
159 KOG0268|consensus 85.2 12 0.00026 43.9 13.0 252 528-797 67-326 (433)
160 KOG4649|consensus 85.2 31 0.00068 39.0 15.6 61 611-684 33-93 (354)
161 KOG1407|consensus 85.0 76 0.0016 36.1 20.9 119 612-735 151-271 (313)
162 PF00008 EGF: EGF-like domain 84.7 0.79 1.7E-05 34.9 2.4 23 502-524 4-29 (32)
163 PTZ00421 coronin; Provisional 84.6 1.3E+02 0.0027 38.3 26.7 160 566-728 125-292 (493)
164 KOG0318|consensus 83.9 1.2E+02 0.0027 37.6 23.9 103 609-719 406-511 (603)
165 KOG2048|consensus 83.7 54 0.0012 41.6 18.5 192 530-727 385-602 (691)
166 PF06247 Plasmod_Pvs28: Plasmo 83.7 1.2 2.6E-05 47.4 4.1 62 470-535 15-88 (197)
167 KOG4328|consensus 83.0 84 0.0018 38.3 18.9 110 567-680 235-352 (498)
168 cd00053 EGF Epidermal growth f 82.5 1.1 2.3E-05 33.9 2.4 25 502-526 6-32 (36)
169 KOG0266|consensus 82.1 1.5E+02 0.0032 37.2 24.0 153 566-726 203-364 (456)
170 cd00053 EGF Epidermal growth f 81.6 1.2 2.6E-05 33.7 2.4 26 1293-1323 5-32 (36)
171 KOG4328|consensus 81.6 34 0.00073 41.5 15.0 147 565-715 185-342 (498)
172 PF01731 Arylesterase: Arylest 81.3 3 6.5E-05 39.5 5.3 36 604-639 48-84 (86)
173 KOG0650|consensus 81.1 25 0.00054 43.8 14.1 71 652-726 567-637 (733)
174 TIGR03075 PQQ_enz_alc_DH PQQ-d 81.0 48 0.001 42.4 17.8 100 656-769 238-339 (527)
175 PF13360 PQQ_2: PQQ-like domai 80.9 35 0.00075 38.0 15.0 73 706-781 75-148 (238)
176 TIGR02276 beta_rpt_yvtn 40-res 80.8 5 0.00011 31.8 5.9 41 619-660 2-42 (42)
177 cd00216 PQQ_DH Dehydrogenases 80.6 1.2E+02 0.0025 38.5 21.0 30 749-779 401-430 (488)
178 COG3823 Glutamine cyclotransfe 80.0 16 0.00035 40.0 10.8 69 615-684 180-260 (262)
179 KOG0318|consensus 79.6 1.7E+02 0.0037 36.4 24.8 125 544-673 380-509 (603)
180 TIGR03075 PQQ_enz_alc_DH PQQ-d 79.6 53 0.0011 42.0 17.5 51 571-628 238-288 (527)
181 cd00054 EGF_CA Calcium-binding 79.5 1.6 3.5E-05 33.5 2.5 24 1294-1322 9-34 (38)
182 PRK11138 outer membrane biogen 79.2 70 0.0015 39.1 18.0 59 707-776 256-315 (394)
183 PTZ00420 coronin; Provisional 78.2 2.2E+02 0.0047 36.8 24.8 159 566-728 125-295 (568)
184 PF09064 Tme5_EGF_like: Thromb 78.2 1.8 3.9E-05 33.1 2.2 27 812-840 6-32 (34)
185 KOG0292|consensus 78.0 1.7E+02 0.0037 38.7 20.3 169 570-760 210-383 (1202)
186 KOG4378|consensus 77.6 1.1E+02 0.0023 37.8 17.4 182 578-773 89-280 (673)
187 PHA02713 hypothetical protein; 77.5 1.2E+02 0.0027 39.0 20.1 179 577-765 302-523 (557)
188 KOG1274|consensus 77.3 1.9E+02 0.0042 38.3 20.7 147 569-726 16-168 (933)
189 KOG0310|consensus 76.1 2E+02 0.0044 35.4 19.5 171 569-754 113-289 (487)
190 PF00008 EGF: EGF-like domain 76.0 2.7 5.9E-05 32.0 2.7 21 464-484 6-29 (32)
191 TIGR03300 assembly_YfgL outer 75.9 1.9E+02 0.0041 34.9 21.2 59 707-776 241-300 (377)
192 KOG4441|consensus 75.6 60 0.0013 42.0 16.3 178 577-762 331-530 (571)
193 KOG0272|consensus 75.4 69 0.0015 38.6 15.0 117 563-684 258-378 (459)
194 KOG0293|consensus 75.4 2E+02 0.0043 34.9 18.6 99 624-726 327-425 (519)
195 KOG4441|consensus 75.0 1.4E+02 0.0029 38.8 19.3 138 619-764 331-485 (571)
196 PRK13616 lipoprotein LpqB; Pro 74.9 1.6E+02 0.0034 38.3 19.8 160 549-716 380-559 (591)
197 KOG0294|consensus 74.6 1.8E+02 0.0039 34.1 18.9 135 567-712 44-185 (362)
198 PF02191 OLF: Olfactomedin-lik 73.5 63 0.0014 37.1 14.2 140 578-719 78-242 (250)
199 KOG0266|consensus 73.0 1.3E+02 0.0027 37.8 18.1 154 614-777 165-322 (456)
200 KOG0276|consensus 72.9 2E+02 0.0043 36.7 18.5 90 620-711 66-156 (794)
201 TIGR02276 beta_rpt_yvtn 40-res 72.7 11 0.00023 29.8 5.7 41 661-704 1-42 (42)
202 KOG0270|consensus 72.2 1.3E+02 0.0029 36.6 16.4 153 570-736 247-414 (463)
203 KOG2055|consensus 71.5 2.2E+02 0.0048 35.0 18.0 59 613-674 317-376 (514)
204 PF14339 DUF4394: Domain of un 70.5 2E+02 0.0042 32.7 17.3 57 567-626 27-91 (236)
205 PRK11138 outer membrane biogen 70.4 1.7E+02 0.0038 35.6 18.3 102 663-777 256-357 (394)
206 KOG0772|consensus 69.9 1.1E+02 0.0023 38.0 15.1 154 572-734 274-464 (641)
207 PF09910 DUF2139: Uncharacteri 68.9 64 0.0014 37.5 12.4 106 614-736 40-149 (339)
208 KOG4378|consensus 67.5 93 0.002 38.3 13.9 156 565-728 120-282 (673)
209 KOG1217|consensus 66.7 4.8 0.0001 50.0 3.6 60 466-527 243-305 (487)
210 COG1520 FOG: WD40-like repeat 66.6 2.6E+02 0.0057 33.7 18.6 108 663-778 111-222 (370)
211 KOG0279|consensus 65.6 2.6E+02 0.0057 32.3 21.9 160 610-778 65-227 (315)
212 PF08662 eIF2A: Eukaryotic tra 65.2 2.2E+02 0.0047 31.3 16.2 117 548-669 39-161 (194)
213 PF07433 DUF1513: Protein of u 65.0 2.9E+02 0.0063 32.7 25.8 221 541-771 19-283 (305)
214 KOG0263|consensus 64.7 2.7E+02 0.0058 36.3 17.9 192 562-769 447-645 (707)
215 KOG4649|consensus 64.2 2.7E+02 0.0058 32.0 18.5 56 656-713 242-297 (354)
216 PLN00181 protein SPA1-RELATED; 63.8 5E+02 0.011 35.0 29.7 112 567-683 533-648 (793)
217 KOG0319|consensus 63.8 2.7E+02 0.0059 36.1 17.5 183 536-725 28-221 (775)
218 KOG0289|consensus 63.5 3.5E+02 0.0076 33.1 20.5 200 560-770 297-502 (506)
219 PF09064 Tme5_EGF_like: Thromb 62.9 6.2 0.00014 30.3 2.1 26 1294-1325 6-31 (34)
220 TIGR03074 PQQ_membr_DH membran 62.8 2.2E+02 0.0047 38.2 17.7 54 655-715 378-431 (764)
221 KOG2055|consensus 62.2 3.8E+02 0.0082 33.1 17.5 59 656-727 317-375 (514)
222 KOG3914|consensus 62.1 1.1E+02 0.0024 36.8 13.1 154 618-781 72-231 (390)
223 KOG0646|consensus 61.6 3.9E+02 0.0084 33.0 18.8 70 611-685 177-249 (476)
224 PF08662 eIF2A: Eukaryotic tra 60.2 2.6E+02 0.0057 30.6 16.9 90 611-705 62-153 (194)
225 KOG0289|consensus 59.6 4.1E+02 0.0089 32.6 21.2 62 608-670 303-365 (506)
226 KOG0308|consensus 58.9 2.5E+02 0.0054 35.9 15.8 160 568-735 119-295 (735)
227 KOG0274|consensus 58.8 5E+02 0.011 33.4 20.8 202 565-782 248-450 (537)
228 KOG0270|consensus 57.7 4.4E+02 0.0095 32.4 18.6 158 550-711 268-434 (463)
229 KOG2139|consensus 56.9 2.4E+02 0.0052 33.7 14.4 155 572-729 156-314 (445)
230 PF10647 Gmad1: Lipoprotein Lp 56.9 3.5E+02 0.0076 31.0 21.2 200 569-781 26-243 (253)
231 PF09910 DUF2139: Uncharacteri 56.6 1.2E+02 0.0027 35.2 11.9 81 630-717 19-99 (339)
232 PTZ00420 coronin; Provisional 56.5 5.6E+02 0.012 33.2 28.0 114 566-684 74-198 (568)
233 PF05694 SBP56: 56kDa selenium 56.3 2E+02 0.0043 35.6 14.2 61 654-715 314-393 (461)
234 KOG1036|consensus 55.2 3.8E+02 0.0082 31.5 15.4 206 547-764 74-296 (323)
235 KOG0277|consensus 54.8 3.8E+02 0.0082 30.7 18.1 59 612-671 106-167 (311)
236 KOG1225|consensus 54.6 50 0.0011 41.7 9.2 102 373-524 258-362 (525)
237 KOG0281|consensus 54.6 88 0.0019 36.8 10.3 150 561-726 232-388 (499)
238 KOG2048|consensus 54.3 5.4E+02 0.012 33.3 17.7 160 561-725 377-547 (691)
239 PF14339 DUF4394: Domain of un 54.2 3.8E+02 0.0082 30.5 17.7 111 609-724 27-161 (236)
240 KOG0279|consensus 53.9 4.1E+02 0.0089 30.9 23.7 134 592-727 86-223 (315)
241 KOG0650|consensus 52.8 51 0.0011 41.3 8.6 96 536-635 575-676 (733)
242 PRK10115 protease 2; Provision 51.2 4.6E+02 0.01 34.8 18.0 111 577-692 278-403 (686)
243 KOG0772|consensus 50.9 4.7E+02 0.01 32.7 16.0 161 562-727 163-348 (641)
244 KOG0281|consensus 50.3 2.6E+02 0.0056 33.1 13.1 121 549-682 258-387 (499)
245 PF05694 SBP56: 56kDa selenium 49.8 2.3E+02 0.0049 35.1 13.3 66 697-762 313-393 (461)
246 KOG1217|consensus 49.7 27 0.00059 43.2 6.2 61 467-527 184-264 (487)
247 KOG0286|consensus 49.3 4.9E+02 0.011 30.4 27.4 154 621-781 156-311 (343)
248 cd00216 PQQ_DH Dehydrogenases 48.7 6.6E+02 0.014 31.7 19.1 13 657-669 178-190 (488)
249 PF02191 OLF: Olfactomedin-lik 48.4 4.8E+02 0.01 30.0 16.5 133 577-717 29-190 (250)
250 TIGR02171 Fb_sc_TIGR02171 Fibr 48.3 3.7E+02 0.0081 36.2 15.9 52 548-599 329-385 (912)
251 KOG2139|consensus 47.9 5.7E+02 0.012 30.8 15.5 161 562-726 234-431 (445)
252 KOG0299|consensus 47.8 6.3E+02 0.014 31.2 16.6 68 653-720 382-450 (479)
253 KOG0272|consensus 46.8 6.3E+02 0.014 30.9 18.3 62 608-669 217-279 (459)
254 COG3211 PhoX Predicted phospha 46.4 1.2E+02 0.0026 38.3 10.5 22 608-629 416-437 (616)
255 PHA02790 Kelch-like protein; P 45.7 7.3E+02 0.016 31.3 18.6 130 619-762 317-454 (480)
256 KOG0263|consensus 44.5 7.8E+02 0.017 32.3 17.3 177 539-724 463-647 (707)
257 KOG0310|consensus 44.5 7.2E+02 0.016 30.9 20.2 155 569-729 71-228 (487)
258 PF06247 Plasmod_Pvs28: Plasmo 43.0 48 0.001 35.8 5.7 58 472-532 107-168 (197)
259 KOG1517|consensus 41.8 1.1E+03 0.024 32.4 19.3 71 610-682 1165-1238(1387)
260 KOG0308|consensus 41.8 5.9E+02 0.013 32.8 15.3 148 568-723 173-324 (735)
261 COG3211 PhoX Predicted phospha 41.8 1.2E+02 0.0025 38.4 9.4 105 566-670 416-572 (616)
262 KOG3914|consensus 41.5 4.6E+02 0.01 31.8 13.8 120 608-733 107-231 (390)
263 KOG0319|consensus 41.2 9.6E+02 0.021 31.5 20.8 148 532-681 329-493 (775)
264 KOG0269|consensus 39.9 2.5E+02 0.0054 36.7 12.0 123 562-687 129-254 (839)
265 smart00284 OLF Olfactomedin-li 39.2 6.6E+02 0.014 29.0 19.8 92 620-715 83-193 (255)
266 KOG0277|consensus 39.0 4.6E+02 0.0099 30.1 12.5 78 566-645 104-186 (311)
267 PF05935 Arylsulfotrans: Aryls 38.7 5.8E+02 0.013 32.2 15.5 159 619-782 112-310 (477)
268 KOG0282|consensus 38.5 5.9E+02 0.013 31.7 14.3 110 609-724 300-413 (503)
269 COG4880 Secreted protein conta 38.3 6E+02 0.013 31.2 14.0 30 616-645 155-184 (603)
270 KOG1517|consensus 37.1 5E+02 0.011 35.5 14.2 178 570-754 1167-1361(1387)
271 TIGR03074 PQQ_membr_DH membran 36.8 1.2E+03 0.026 31.4 20.7 19 610-628 269-287 (764)
272 PF03178 CPSF_A: CPSF A subuni 36.1 4.3E+02 0.0094 31.1 13.3 104 659-771 94-202 (321)
273 KOG0640|consensus 36.0 5.8E+02 0.013 30.0 13.0 106 563-673 169-282 (430)
274 KOG0269|consensus 35.6 6.3E+02 0.014 33.3 14.5 109 621-730 100-211 (839)
275 KOG0649|consensus 35.6 7.2E+02 0.016 28.4 18.7 151 565-728 113-276 (325)
276 KOG1407|consensus 33.7 8E+02 0.017 28.4 20.6 207 568-799 66-284 (313)
277 PHA02713 hypothetical protein; 33.4 1.2E+03 0.025 30.2 19.3 133 619-762 302-471 (557)
278 KOG2110|consensus 33.2 9.5E+02 0.021 29.0 18.7 198 523-726 40-248 (391)
279 KOG3567|consensus 33.0 24 0.00051 43.0 1.7 53 675-728 445-498 (501)
280 KOG0641|consensus 32.8 7.4E+02 0.016 27.7 16.7 29 608-637 32-60 (350)
281 KOG2106|consensus 32.7 1.1E+03 0.024 29.7 17.3 99 541-643 342-450 (626)
282 KOG0303|consensus 32.3 8.6E+02 0.019 29.6 14.0 88 568-659 133-225 (472)
283 PF15416 DUF4623: Domain of un 31.7 9.6E+02 0.021 28.6 14.1 30 699-728 243-273 (442)
284 KOG0645|consensus 31.6 8.8E+02 0.019 28.2 16.6 182 532-735 110-311 (312)
285 PF06079 Apyrase: Apyrase; In 31.6 2.1E+02 0.0045 33.4 8.8 54 706-762 62-118 (291)
286 KOG4283|consensus 31.3 9.2E+02 0.02 28.3 14.0 172 591-768 25-214 (397)
287 KOG3621|consensus 30.3 1.2E+03 0.026 30.6 15.6 263 521-794 29-333 (726)
288 KOG0640|consensus 30.1 9.8E+02 0.021 28.3 14.4 181 547-735 193-392 (430)
289 PF00780 CNH: CNH domain; Int 29.8 8.7E+02 0.019 27.6 24.4 113 652-782 139-264 (275)
290 KOG1225|consensus 29.2 78 0.0017 40.0 5.4 76 420-524 259-336 (525)
291 KOG0286|consensus 28.2 1E+03 0.022 27.9 18.8 113 608-724 186-301 (343)
292 KOG2111|consensus 27.8 1.1E+03 0.024 28.1 21.0 201 576-796 73-289 (346)
293 KOG0276|consensus 26.9 1.5E+03 0.032 29.4 20.6 139 527-670 13-159 (794)
294 PTZ00486 apyrase Superfamily; 26.7 3.6E+02 0.0077 32.4 9.8 56 706-761 123-180 (352)
295 PF04885 Stig1: Stigma-specifi 25.9 1.4E+02 0.003 31.0 5.6 55 92-172 76-133 (136)
296 KOG0292|consensus 25.6 1.8E+03 0.039 29.9 17.9 114 549-669 231-383 (1202)
297 TIGR02171 Fb_sc_TIGR02171 Fibr 25.5 3.5E+02 0.0076 36.5 10.4 99 619-719 318-424 (912)
298 PF06739 SBBP: Beta-propeller 25.3 76 0.0016 25.2 2.8 20 651-671 12-31 (38)
299 KOG0284|consensus 25.0 2.4E+02 0.0053 34.1 8.0 98 568-670 140-241 (464)
300 KOG1645|consensus 24.8 1.4E+03 0.029 28.2 16.0 75 562-639 189-266 (463)
301 PF12946 EGF_MSP1_1: MSP1 EGF 24.1 87 0.0019 24.9 2.8 22 507-528 12-34 (37)
302 PRK10115 protease 2; Provision 23.9 1.8E+03 0.039 29.3 24.2 115 615-734 274-402 (686)
303 COG4222 Uncharacterized protei 23.9 1.4E+03 0.031 28.1 14.4 64 651-714 137-218 (391)
304 PF10647 Gmad1: Lipoprotein Lp 23.8 1.1E+03 0.024 26.8 18.1 111 610-724 25-142 (253)
305 KOG0646|consensus 23.7 1.5E+03 0.032 28.2 17.9 31 696-727 218-248 (476)
306 KOG4532|consensus 22.5 1.1E+03 0.025 27.3 12.2 140 632-782 139-291 (344)
307 PF05935 Arylsulfotrans: Aryls 22.4 1.4E+03 0.031 28.8 15.0 41 696-736 271-311 (477)
308 KOG3881|consensus 22.2 1.5E+03 0.032 27.7 17.6 81 685-771 235-317 (412)
309 KOG3516|consensus 21.9 1.3E+03 0.029 32.1 14.5 272 492-797 541-864 (1306)
310 PHA02790 Kelch-like protein; P 21.8 1.7E+03 0.036 28.1 18.4 129 577-715 317-454 (480)
311 KOG0274|consensus 21.4 1.8E+03 0.039 28.4 22.7 172 592-780 312-489 (537)
312 KOG3545|consensus 21.0 6.2E+02 0.013 29.0 9.9 132 578-714 77-236 (249)
313 PF02425 GBP_PSP: Paralytic/GB 21.0 70 0.0015 22.2 1.5 17 1315-1335 6-22 (23)
314 KOG1445|consensus 20.9 1.5E+03 0.032 29.3 13.7 228 550-791 607-861 (1012)
315 KOG0296|consensus 20.5 1.6E+03 0.034 27.3 18.1 162 608-781 64-228 (399)
316 TIGR03547 muta_rot_YjhT mutatr 20.2 1.5E+03 0.032 26.9 18.5 91 578-670 17-125 (346)
No 1
>KOG1215|consensus
Probab=100.00 E-value=6.1e-57 Score=595.66 Aligned_cols=586 Identities=39% Similarity=0.766 Sum_probs=480.4
Q ss_pred eecCCcccCCCCCCCCCCCcc-ccccccCCCCCcCCCCCCcccC--CCceecCceeecCCCCCCCCCCCCCCCCCCCCCC
Q psy6572 126 CIEESYICDGQNDCFDMSDEQ-NCDQIKDVSPKMNCSGDKFLCR--NGNCILSRWRCDGDNDCNDGNDGLSSDEMNCDTE 202 (1416)
Q Consensus 126 CI~~~~~CDg~~DC~D~sDE~-~C~~~~~~~~~~~C~~~~f~C~--~g~CI~~~~~CDg~~DC~Dg~d~~~sDE~~C~~~ 202 (1416)
.+...|...+...+.+.+++. +++ ...|...+|+|. +++|||..|+|||..||.|| +||.+|..
T Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~c~~~~~~Cip~~~~cd~~~~C~dg-----~de~~~~~- 176 (877)
T KOG1215|consen 110 LDIHAYHPSSQPLAPDPCAESGNGP-------CSHCCLDKFSCRTGSCKCIPGDWLCDGEADCPDG-----SDELNCAV- 176 (877)
T ss_pred cceeEEecCCCCCCCCcccccCCCC-------CccccCCCCCCcCccccCCCCceeCCCCCccccc-----hhhhcccc-
Confidence 344899999999999999986 222 245778999999 89999999999999999999 99999962
Q ss_pred CccCCCCCceecCCCCceecCcccccCCCCCCCCCCCCCCcccCCCCC---CCceeeCCCCCeEcCcccccCCCCCCCCC
Q psy6572 203 STCKANNNVFQCDNNKTCISKSWVCDGTYDCTDRSDENSTYCAHSECN---LFEFRCNSTGQCIPITWVCDGVTDCIDKS 279 (1416)
Q Consensus 203 ~~C~~~~~~F~C~~~~~CI~~~w~CDg~~DC~D~sDE~~~~c~~~~C~---~~~F~C~~~~~CI~~~w~CDG~~DC~Dgs 279 (1416)
..+......|+| |...|+||+..+|++++||.+. ....+. ...|+|.+..+||...|+|||..||.+++
T Consensus 177 ~~~~~~~~~~~~------~~~~~~~d~~~~~~~~~d~~~~--~~~~~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~~~~ 248 (877)
T KOG1215|consen 177 RRCEPRGASLDC------IVAIKVCDIQHDCADDYDESEG--RIYWTDDSRIEVTRCDGSSRCILISEVCDGPRDCVDGP 248 (877)
T ss_pred cccCcccccccc------ceeeeecCcccccccccccccC--cccccCCcceeEEEecCCCcEEeehhccCCCcccccCC
Confidence 334332244554 9999999999999999999873 222333 57899965579999999999999999999
Q ss_pred CCccccCCcccccccCCeeecCCCceeccccccCCCCCCCCCCCcccccccccceeeCCCccccccccccCCCCCCCCCC
Q psy6572 280 DEHHSQDCLNVETCMEGYFKCLNGRCLLENYYCDGENDCGDNSDEPIVSMWKLVWKCLNGRCLLENYYCDGENDCGDNSD 359 (1416)
Q Consensus 280 DE~~~~~C~~~~~C~~~~f~C~~g~CI~~~~~CDg~~DC~DgSDE~~c~~~~~~f~C~~g~Ci~~~~~Cdg~~dC~d~~~ 359 (1416)
||.- ..|. ..+|...+|.|+++.|++..+.|||..||+||+||..
T Consensus 249 de~~-~~~~-~~~~~~~e~~~~~~~~~~~~~~~~g~~d~pdg~de~~--------------------------------- 293 (877)
T KOG1215|consen 249 DEGV-MNCS-DATCEAPEIECADGDCSDRQKLCDGDLDCPDGLDEDY--------------------------------- 293 (877)
T ss_pred cCce-eEee-ccccCCcceeecCCCCccceEEecCccCCCCcccccc---------------------------------
Confidence 9931 2343 3577778999999999999999999999999999962
Q ss_pred CCCCCCCCCCCCCCceeecCCcEeCCceeeCCcCCCCCCCCccccccccCCCCCCCCCCCceeeCCCeeecCCcccCCCC
Q psy6572 360 EPPSCPKTDCDNSTHFECQNGNCIPSVLLCNGVNDCDDNSDEDMNHAECRSLKDLCKHPSHFLCSNGLCINETLTCNDIN 439 (1416)
Q Consensus 360 e~~~C~~~~c~~~~~f~C~~g~CI~~~~~CDg~~DC~DgSDE~~~~~~C~~~~~~C~~~~~f~C~~g~Ci~~~~~Cdg~~ 439 (1416)
|+...+- +..|.|.+++ |....+|++
T Consensus 294 ----~~~~~~~-~~~~d~~~~~-i~~~~~~~~------------------------------------------------ 319 (877)
T KOG1215|consen 294 ----CKKKLYW-SMNVDGSGRR-ILLSKLCHG------------------------------------------------ 319 (877)
T ss_pred ----cccceee-eeecccCCce-eeecccCcc------------------------------------------------
Confidence 2100000 2334444444 544444333
Q ss_pred CCCCCCCCCCcccccccCCCCCCcccccceecCCceEEeeCCCceecCCCCCccccCCcCC--CCCccceee-ecCCeee
Q psy6572 440 DCGDNSDEFSCFVNECNVSHGGQLCAHECIDLKIGYKCACRKGYQVHPEDKHLCVDTNECL--DRPCSHYCR-NTLGSYS 516 (1416)
Q Consensus 440 dC~dgsDe~~C~i~eC~~~~~~~~Cs~~C~nt~~gy~C~C~~Gy~L~p~d~~tC~didEC~--~~~Csq~C~-nt~gsy~ 516 (1416)
. ..+.++++...... +++.+.+++.+.+|.|..++.+. ..+.+ +.+.|. ++.|+|+|+ +.+++|+
T Consensus 320 ---~----~~~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~~~~~~-~~~~~--~~~~~~~~~g~Csq~C~~~~p~~~~ 387 (877)
T KOG1215|consen 320 ---Y----WTDGLNECAERVLK--CSHKCPDVSVGPRCDCMGAKVLP-LGART--DSNPCESDNGGCSQLCVPNSPGTFK 387 (877)
T ss_pred ---c----cccccccchhhccc--ccCCCCccccCCcccCCccceec-ccccc--cCCcccccCCccceeccCCCCCcee
Confidence 0 11122333222333 77888889999999999999986 44443 333443 799999999 5688999
Q ss_pred ecCCCCcEEecCCCceEecCCCCCeEEEEecEEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEE
Q psy6572 517 CSCAPGYALLSDKHGCKATSDVPPNLLFTNKYYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRR 595 (1416)
Q Consensus 517 C~C~~Gy~L~~dg~sC~a~~~~~~~li~s~~~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r 595 (1416)
|.|..||.|..++ |.+....+++|++++++.|+++.++... ..++.++.+++++++|..++.+||+|.... .|.+
T Consensus 388 c~c~~g~~~~~~~--c~~~~~~~~~l~~s~~~~ir~~~~~~~~~~~p~~~~~~~~~~d~d~~~~~i~~~d~~~~--~i~~ 463 (877)
T KOG1215|consen 388 CACSPGYELRLDK--CEASDQPEAFLLFSNRHDIRRISLDCSDVSRPLEGIKNAVALDFDVLNNRIYWADLSDE--KICR 463 (877)
T ss_pred EecCCCcEeccCC--ceecCCCCcEEEEecCccceecccCCCcceEEccCCccceEEEEEecCCEEEEEeccCC--eEee
Confidence 9999999999887 8887777999999999999999998875 566666689999999999999999999965 8999
Q ss_pred EecC-CCC-eEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCC
Q psy6572 596 SCNN-SQP-ELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWG 672 (1416)
Q Consensus 596 ~~l~-s~~-~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g 672 (1416)
.... ... .++.. +-.|.+||+||+.+++||+|.+...|++..++|..+++|+...+..|++|+|+|.+|+|||+||+
T Consensus 464 ~~~~~~~~~~~~~~g~~~~~~lavD~~~~~~y~tDe~~~~i~v~~~~g~~~~vl~~~~l~~~r~~~v~p~~g~~~wtd~~ 543 (877)
T KOG1215|consen 464 ASQDGSSECELCGDGLCIPEGLAVDWIGDNIYWTDEGNCLIEVADLDGSSRKVLVSKDLDLPRSIAVDPEKGLMFWTDWG 543 (877)
T ss_pred eccCCCccceEeccCccccCcEEEEeccCCceecccCCceeEEEEccCCceeEEEecCCCCccceeeccccCeeEEecCC
Confidence 8888 333 32333 88999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCC-eEEEEeCCCCceEEEEeccCCCCcccccceeEE
Q psy6572 673 QNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHED-YIAVSDLNGENIKIIVSRRMDPTINLHHVFALA 751 (1416)
Q Consensus 673 ~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~-~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~la 751 (1416)
..++|+|+.|||+.+.+++..++.||+||++|+..++|||+|..+. .|++++++|++|+++.... ++||++|+
T Consensus 544 ~~~~i~ra~~dg~~~~~l~~~~~~~p~glt~d~~~~~~yw~d~~~~~~i~~~~~~g~~r~~~~~~~------~~~p~~~~ 617 (877)
T KOG1215|consen 544 QPPRIERASLDGSERAVLVTNGILWPNGLTIDYETDRLYWADAKLDYTIESANMDGQNRRVVDSED------LPHPFGLS 617 (877)
T ss_pred CCchhhhhcCCCCCceEEEeCCccCCCcceEEeecceeEEEcccCCcceeeeecCCCceEEecccc------CCCceEEE
Confidence 8779999999999999999999999999999999999999999998 8999999999999444433 99999999
Q ss_pred EecCcEEEeecCCCeeEEecccCCCceEEEEeCCCCCCeeeeeecccCCCCCCCCCCC-CCCCCccceeecCCCCccccC
Q psy6572 752 VFEDHLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVYHPYRQTPLKDNPCE-NNGGCQGLCLLKPNGHRQCAC 830 (1416)
Q Consensus 752 v~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~h~~~q~p~~~npC~-~NggCshlCl~~p~~~~~C~C 830 (1416)
+++++|||++|....+.++++..+.. .+.+......|+.++++|...+.|.+.|+|. +|++|+||||+.|.+. +|+|
T Consensus 618 ~~~~~iyw~d~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~n~C~~~n~~c~~KOG~~p~~~-~c~c 695 (877)
T KOG1215|consen 618 VFEDYIYWTDWSNRAISRAEKHKGSD-SRTSRSNLAQPLDIILVHHSSSRPTGVNPCESSNGGCSQLCLPRPQGS-TCAC 695 (877)
T ss_pred EecceeEEeeccccceEeeecccCCc-ceeeecccCcccceEEEeccccCCCCCCcccccCCCCCeeeecCCCCC-eeeC
Confidence 99999999999999999999998877 3333367778888888844433388999999 7899999999999877 9999
Q ss_pred CCCeeecCCCceecc
Q psy6572 831 PDNFILESDGKTCRH 845 (1416)
Q Consensus 831 p~g~~L~~d~~tC~~ 845 (1416)
|.|+.|..++++|.+
T Consensus 696 ~~~~~l~~~~~~C~~ 710 (877)
T KOG1215|consen 696 PEGYRLSPDGKSCSS 710 (877)
T ss_pred CCCCeecCCCCeecC
Confidence 999999999999986
No 2
>KOG1214|consensus
Probab=100.00 E-value=2.6e-54 Score=505.68 Aligned_cols=353 Identities=35% Similarity=0.712 Sum_probs=307.8
Q ss_pred cceecC-CceEEeeCCCceecCCCCCccccCCcCCCCCccc--eeeecCCeeeecCCCCcEEecCCCceEec--------
Q psy6572 467 ECIDLK-IGYKCACRKGYQVHPEDKHLCVDTNECLDRPCSH--YCRNTLGSYSCSCAPGYALLSDKHGCKAT-------- 535 (1416)
Q Consensus 467 ~C~nt~-~gy~C~C~~Gy~L~p~d~~tC~didEC~~~~Csq--~C~nt~gsy~C~C~~Gy~L~~dg~sC~a~-------- 535 (1416)
.|+-+. +.|.|.|.|||. .|++.|.++|||.+..|-+ .|.|++|+|.|.|.+||. .||..|+..
T Consensus 800 ~c~~hGgs~y~C~CLPGfs---GDG~~c~dvDeC~psrChp~A~CyntpgsfsC~C~pGy~--GDGf~CVP~~~~~T~C~ 874 (1289)
T KOG1214|consen 800 RCVHHGGSTYSCACLPGFS---GDGHQCTDVDECSPSRCHPAATCYNTPGSFSCRCQPGYY--GDGFQCVPDTSSLTPCE 874 (1289)
T ss_pred EEEecCCceEEEeecCCcc---CCccccccccccCccccCCCceEecCCCcceeecccCcc--CCCceecCCCccCCccc
Confidence 354444 559999999995 7899999999999888876 899999999999999986 333333211
Q ss_pred --------------------------------------------------------------------------C-----
Q psy6572 536 --------------------------------------------------------------------------S----- 536 (1416)
Q Consensus 536 --------------------------------------------------------------------------~----- 536 (1416)
+
T Consensus 875 ~er~hpl~chg~t~~~~~~Dp~~~e~p~~~~ppG~~~~~c~~~~~~~vp~Cd~hgh~ap~qchG~~~~CwCvd~dGrev~ 954 (1289)
T KOG1214|consen 875 QERFHPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPEQYVPQCDDHGHFAPLQCHGKSDFCWCVDKDGREVQ 954 (1289)
T ss_pred cccccceeeccccceeEeeCCCcccCCCCCCCCCCCCCCCCCcccccCCCccccccccccccCCCcceeEEecCCCcCcc
Confidence 0
Q ss_pred ---------------------------------CCCCeEEEEecEEEEEEecCCcc-------eEEecccccceeeeeec
Q psy6572 537 ---------------------------------DVPPNLLFTNKYYIREVTQAGVM-------TIRIHNQTNAVGLDFDW 576 (1416)
Q Consensus 537 ---------------------------------~~~~~li~s~~~~I~~i~l~g~~-------~~~~~~l~~~~~l~~D~ 576 (1416)
...-+|||+....|.++.++|.. +++.-...-++||+||-
T Consensus 955 gtr~~pg~tp~CiptvApp~v~np~~~~~v~p~~~gt~LL~aqg~~I~~lplng~~~~K~~ak~~l~~p~~IiVGidfDC 1034 (1289)
T KOG1214|consen 955 GTRSQPGTTPACIPTVAPPMVRNPTPRPDVTPPSVGTFLLYAQGQQIGYLPLNGTRLQKDAAKTLLSLPGSIIVGIDFDC 1034 (1289)
T ss_pred ccccCCCCCCCccCCCCCCcccCCCCCCCCcCCCCcceEEEeccceEEEeecCcchhchhhhhceEecccceeeeeeccc
Confidence 00116888888899998888754 34555566789999999
Q ss_pred CCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCC
Q psy6572 577 VDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQE 653 (1416)
Q Consensus 577 ~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~ 653 (1416)
++++|||+|+. +.+|.|+.|+ +..++++. |..|+||||||+.++|||||+...+|+|+.|||+.|++|+..+|.+
T Consensus 1035 ~e~mvyWtDv~--g~SI~rasL~G~Ep~ti~n~~L~SPEGiAVDh~~Rn~ywtDS~lD~IevA~LdG~~rkvLf~tdLVN 1112 (1289)
T KOG1214|consen 1035 RERMVYWTDVA--GRSISRASLEGAEPETIVNSGLISPEGIAVDHIRRNMYWTDSVLDKIEVALLDGSERKVLFYTDLVN 1112 (1289)
T ss_pred ccceEEEeecC--CCccccccccCCCCceeecccCCCccceeeeeccceeeeeccccchhheeecCCceeeEEEeecccC
Confidence 99999999999 4599999999 77787777 9999999999999999999999999999999999999999999999
Q ss_pred cceeeecCCcceEEEeeCCC-CceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEE
Q psy6572 654 PRGIALNPAYGYMYWTDWGQ-NAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKI 732 (1416)
Q Consensus 654 P~gIavDp~~g~LYWtD~g~-~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~ 732 (1416)
||+|+||+.+|.||||||.+ +|+|+++.|||++|++|+.+.|..||||+||++.+.|-|+|+++++++-+.++|..|++
T Consensus 1113 PR~iv~D~~rgnLYwtDWnRenPkIets~mDG~NrRilin~DigLPNGLtfdpfs~~LCWvDAGt~rleC~~p~g~gRR~ 1192 (1289)
T KOG1214|consen 1113 PRAIVVDPIRGNLYWTDWNRENPKIETSSMDGENRRILINTDIGLPNGLTFDPFSKLLCWVDAGTKRLECTLPDGTGRRV 1192 (1289)
T ss_pred cceEEeecccCceeeccccccCCcceeeccCCccceEEeecccCCCCCceeCcccceeeEEecCCcceeEecCCCCcchh
Confidence 99999999999999999975 69999999999999999999999999999999999999999999999999999999999
Q ss_pred EEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEecccCCCceEEEEeCCCCCCeeeeeecccCCCCCCCCCCC-CC
Q psy6572 733 IVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVYHPYRQTPLKDNPCE-NN 811 (1416)
Q Consensus 733 v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~h~~~q~p~~~npC~-~N 811 (1416)
|+++ |+.||+|+-+++.+|||||+.++|..++|+.++.....+.....+.|||..+-++- |.+.+||+ +|
T Consensus 1193 i~~~-------LqYPF~itsy~~~fY~TDWk~n~vvsv~~~~~~~td~~~p~~~s~lyGItav~~~C--p~gstpCSedN 1263 (1289)
T KOG1214|consen 1193 IQNN-------LQYPFSITSYADHFYHTDWKRNGVVSVNKHSGQFTDEYLPEQRSHLYGITAVYPYC--PTGSTPCSEDN 1263 (1289)
T ss_pred hhhc-------ccCceeeeeccccceeeccccCceEEeeccccccccccccccccceEEEEeccccC--CCCCCcccccC
Confidence 9885 99999999999999999999999999999999877766655666789998876543 67899999 99
Q ss_pred CCCccceeecCCCCccccCCCCeee
Q psy6572 812 GGCQGLCLLKPNGHRQCACPDNFIL 836 (1416)
Q Consensus 812 ggCshlCl~~p~~~~~C~Cp~g~~L 836 (1416)
|||.||||+.- .+..|.||++.+.
T Consensus 1264 GGCqHLCLpgq-ngavcecpdnvkv 1287 (1289)
T KOG1214|consen 1264 GGCQHLCLPGQ-NGAVCECPDNVKV 1287 (1289)
T ss_pred CcceeecccCc-CCccccCCcccee
Confidence 99999999763 4788999988754
No 3
>KOG1215|consensus
Probab=100.00 E-value=1.3e-34 Score=383.55 Aligned_cols=562 Identities=26% Similarity=0.442 Sum_probs=402.3
Q ss_pred CCccCCCcEEecCCeEecccccccCCCCCCCCCCCCCC--C--------------------------CCCCCceeecCCc
Q psy6572 28 MRDCRPGYFKCDNNKCILSSHTCNNINDCGDGSDEADC--S--------------------------TCGNDTFHCDMGM 79 (1416)
Q Consensus 28 ~~~c~~~~f~C~n~~ci~~~~~Cdg~~dC~d~sDE~~C--~--------------------------~C~~~~f~C~~g~ 79 (1416)
.....+..|.+.||++|...|.+....|+...++|..+ . +.....+...+..
T Consensus 28 ~~~~~~~~~~~~ng~~id~~~~~~y~~d~~~~~i~~~~~dg~~r~~l~~~~~~~y~~d~~v~~~~~~sg~~~~~~~~~~~ 107 (877)
T KOG1215|consen 28 RKILEKEEFEWPNGLTIDLAWQRIYWADAKNDLIESANYDGSGRRALTLFEDGLYWTDKSVSAANKKTGKDVTRLSQDSH 107 (877)
T ss_pred eEEeeccceeCCCcceecchhheeeeccccCCceEEeccCCccceeeeeeccceeeccchhhhhccCCCCcceeehhcCC
Confidence 34667889999999999999999999999988888755 1 1222233333332
Q ss_pred --eeCCCcccCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCceecCC-CCceecCCcccCCCCCCCCCCCccccccccCCC
Q psy6572 80 --CIHKALRCDVDPDCPDASDEM-HCPMTNCTEKYPLMTNPIHCNF-TSACIEESYICDGQNDCFDMSDEQNCDQIKDVS 155 (1416)
Q Consensus 80 --Ci~~~~~Cdg~~dC~d~sDE~-~C~~~~C~~~~~~~~~~f~C~~-~~~CI~~~~~CDg~~DC~D~sDE~~C~~~~~~~ 155 (1416)
.+...|...+.+.+.+++++. .++...|.... |+|.. ..+|||..|+|||+.||.||+||.+|...
T Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~c~~~~~~Cip~~~~cd~~~~C~dg~de~~~~~~---- 177 (877)
T KOG1215|consen 108 FPLDIHAYHPSSQPLAPDPCAESGNGPCSHCCLDK------FSCRTGSCKCIPGDWLCDGEADCPDGSDELNCAVR---- 177 (877)
T ss_pred CCcceeEEecCCCCCCCCcccccCCCCCccccCCC------CCCcCccccCCCCceeCCCCCccccchhhhccccc----
Confidence 344889999999999999985 44445566655 88874 36999999999999999999999998621
Q ss_pred CCcCCCCCCcccCCCceecCceeecCCCCCCCCCCCCCCCCCCCCCCCccCC-CCCceecCCCCceecCcccccCCCCCC
Q psy6572 156 PKMNCSGDKFLCRNGNCILSRWRCDGDNDCNDGNDGLSSDEMNCDTESTCKA-NNNVFQCDNNKTCISKSWVCDGTYDCT 234 (1416)
Q Consensus 156 ~~~~C~~~~f~C~~g~CI~~~~~CDg~~DC~Dg~d~~~sDE~~C~~~~~C~~-~~~~F~C~~~~~CI~~~w~CDg~~DC~ 234 (1416)
...+....|+| |...|+||++.+|.++ +||..+.. ..+.. ....|+|...++||..+|+|||..||.
T Consensus 178 -~~~~~~~~~~~-----~~~~~~~d~~~~~~~~-----~d~~~~~~-~~~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~ 245 (877)
T KOG1215|consen 178 -RCEPRGASLDC-----IVAIKVCDIQHDCADD-----YDESEGRI-YWTDDSRIEVTRCDGSSRCILISEVCDGPRDCV 245 (877)
T ss_pred -ccCcccccccc-----ceeeeecCcccccccc-----cccccCcc-cccCCcceeEEEecCCCcEEeehhccCCCcccc
Confidence 12344557888 9999999999999999 78877652 23331 027899944469999999999999999
Q ss_pred CCCCCCCCcccCCCCCCCceeeCCCCCeEcCcccccCCCCCCCCCCCccccCCcccccccCCeeecCCCceeccccccCC
Q psy6572 235 DRSDENSTYCAHSECNLFEFRCNSTGQCIPITWVCDGVTDCIDKSDEHHSQDCLNVETCMEGYFKCLNGRCLLENYYCDG 314 (1416)
Q Consensus 235 D~sDE~~~~c~~~~C~~~~F~C~~~~~CI~~~w~CDG~~DC~DgsDE~~~~~C~~~~~C~~~~f~C~~g~CI~~~~~CDg 314 (1416)
+++||....+...+|...+|.| .++.|++..++|||..||++|+|| ..|.....+ ...|.|.+++ |+..++|++
T Consensus 246 ~~~de~~~~~~~~~~~~~e~~~-~~~~~~~~~~~~~g~~d~pdg~de---~~~~~~~~~-~~~~d~~~~~-i~~~~~~~~ 319 (877)
T KOG1215|consen 246 DGPDEGVMNCSDATCEAPEIEC-ADGDCSDRQKLCDGDLDCPDGLDE---DYCKKKLYW-SMNVDGSGRR-ILLSKLCHG 319 (877)
T ss_pred cCCcCceeEeeccccCCcceee-cCCCCccceEEecCccCCCCcccc---cccccceee-eeecccCCce-eeecccCcc
Confidence 9999963347778899999999 789999999999999999999999 467643333 6789999999 999999988
Q ss_pred CCCCCCCCCcccccccccceeeCCCccccccccccCCCCCCCCCCCCCCCCCCCCCCCCceeecCCcEeCCceeeCCc-C
Q psy6572 315 ENDCGDNSDEPIVSMWKLVWKCLNGRCLLENYYCDGENDCGDNSDEPPSCPKTDCDNSTHFECQNGNCIPSVLLCNGV-N 393 (1416)
Q Consensus 315 ~~DC~DgSDE~~c~~~~~~f~C~~g~Ci~~~~~Cdg~~dC~d~~~e~~~C~~~~c~~~~~f~C~~g~CI~~~~~CDg~-~ 393 (1416)
. ..|.. +.++... ..+.+. .
T Consensus 320 ~-----~~~~~-------------~~~~~~~-----------------------------------------~~~~~~~~ 340 (877)
T KOG1215|consen 320 Y-----WTDGL-------------NECAERV-----------------------------------------LKCSHKCP 340 (877)
T ss_pred c-----ccccc-------------ccchhhc-----------------------------------------ccccCCCC
Confidence 1 11111 0111111 000000 0
Q ss_pred CCCCCCCccccccccCCCCCCCCCCCceeeCCCeeecCCcccCCCCCCCCCCCCCCcccccccCCCCCCcccccce-ecC
Q psy6572 394 DCDDNSDEDMNHAECRSLKDLCKHPSHFLCSNGLCINETLTCNDINDCGDNSDEFSCFVNECNVSHGGQLCAHECI-DLK 472 (1416)
Q Consensus 394 DC~DgSDE~~~~~~C~~~~~~C~~~~~f~C~~g~Ci~~~~~Cdg~~dC~dgsDe~~C~i~eC~~~~~~~~Cs~~C~-nt~ 472 (1416)
+..-+ ..| . |.....++.+... ..+.|...+++ |+|.|. +.|
T Consensus 341 ~~~v~-------~~~-------~-----------~~~~~~~~~~~~~----------~~~~~~~~~g~--Csq~C~~~~p 383 (877)
T KOG1215|consen 341 DVSVG-------PRC-------D-----------CMGAKVLPLGART----------DSNPCESDNGG--CSQLCVPNSP 383 (877)
T ss_pred ccccC-------Ccc-------c-----------CCccceecccccc----------cCCcccccCCc--cceeccCCCC
Confidence 00000 000 0 1111111111111 11233345566 999999 569
Q ss_pred CceEEeeCCCceecCCCCCccccCCcCCCCCccceeeecCC----eeeecCCCCcEEecCCCceEecC--CCCCeEEEEe
Q psy6572 473 IGYKCACRKGYQVHPEDKHLCVDTNECLDRPCSHYCRNTLG----SYSCSCAPGYALLSDKHGCKATS--DVPPNLLFTN 546 (1416)
Q Consensus 473 ~gy~C~C~~Gy~L~p~d~~tC~didEC~~~~Csq~C~nt~g----sy~C~C~~Gy~L~~dg~sC~a~~--~~~~~li~s~ 546 (1416)
+.|+|.|..||.+. .++ |....-.. ... +..+..+ +..+. +...+....+.-.+.+ .....++++.
T Consensus 384 ~~~~c~c~~g~~~~-~~~--c~~~~~~~--~~l-~~s~~~~ir~~~~~~~--~~~~p~~~~~~~~~~d~d~~~~~i~~~d 455 (877)
T KOG1215|consen 384 GTFKCACSPGYELR-LDK--CEASDQPE--AFL-LFSNRHDIRRISLDCS--DVSRPLEGIKNAVALDFDVLNNRIYWAD 455 (877)
T ss_pred CceeEecCCCcEec-cCC--ceecCCCC--cEE-EEecCccceecccCCC--cceEEccCCccceEEEEEecCCEEEEEe
Confidence 99999999999997 343 54332111 110 1111111 11111 1111111111222221 2234667765
Q ss_pred c--EEEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEcc
Q psy6572 547 K--YYIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWV 619 (1416)
Q Consensus 547 ~--~~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~ 619 (1416)
. ..|.+..+.+.. .+...++-.+.+|++||..+.+||+|.... .|.+..++ +.+.+|+. +..|+.++||+.
T Consensus 456 ~~~~~i~~~~~~~~~~~~~~~~g~~~~~~lavD~~~~~~y~tDe~~~--~i~v~~~~g~~~~vl~~~~l~~~r~~~v~p~ 533 (877)
T KOG1215|consen 456 LSDEKICRASQDGSSECELCGDGLCIPEGLAVDWIGDNIYWTDEGNC--LIEVADLDGSSRKVLVSKDLDLPRSIAVDPE 533 (877)
T ss_pred ccCCeEeeeccCCCccceEeccCccccCcEEEEeccCCceecccCCc--eeEEEEccCCceeEEEecCCCCccceeeccc
Confidence 3 346677777766 456678888999999999999999999965 78888888 55666666 789999999999
Q ss_pred CCcEEEeeCC-CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCC
Q psy6572 620 GRNLYWCDKG-LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWP 698 (1416)
Q Consensus 620 ~~~LYwtD~~-~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P 698 (1416)
.+.+||+|++ ..+|.++.|+|..+++|+..++.+|+||++|.....+||+|......|++++|+|..|+++....+.+|
T Consensus 534 ~g~~~wtd~~~~~~i~ra~~dg~~~~~l~~~~~~~p~glt~d~~~~~~yw~d~~~~~~i~~~~~~g~~r~~~~~~~~~~p 613 (877)
T KOG1215|consen 534 KGLMFWTDWGQPPRIERASLDGSERAVLVTNGILWPNGLTIDYETDRLYWADAKLDYTIESANMDGQNRRVVDSEDLPHP 613 (877)
T ss_pred cCeeEEecCCCCchhhhhcCCCCCceEEEeCCccCCCcceEEeecceeEEEcccCCcceeeeecCCCceEEeccccCCCc
Confidence 9999999998 458999999999999999999999999999999999999997766579999999999996666789999
Q ss_pred eeEEeecCCCeEEEecCCCCeEEEEeC
Q psy6572 699 NALTISYETNELFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 699 ~gLaiD~~~~rLYWtD~~~~~I~~~~l 725 (1416)
.+|++ ..+++||+|+....+.+...
T Consensus 614 ~~~~~--~~~~iyw~d~~~~~~~~~~~ 638 (877)
T KOG1215|consen 614 FGLSV--FEDYIYWTDWSNRAISRAEK 638 (877)
T ss_pred eEEEE--ecceeEEeeccccceEeeec
Confidence 99998 68999999999988888774
No 4
>KOG1214|consensus
Probab=99.97 E-value=1e-30 Score=308.08 Aligned_cols=197 Identities=28% Similarity=0.439 Sum_probs=176.2
Q ss_pred cEEEEecC-C---C--CeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCc
Q psy6572 592 SIRRSCNN-S---Q--PELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAY 663 (1416)
Q Consensus 592 ~I~r~~l~-s---~--~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~ 663 (1416)
.|.+..++ . + .++|+. ...|.||++|-+.++|||||..-..|.+++|+|...++++..+|..|.|||||+..
T Consensus 1000 ~I~~lplng~~~~K~~ak~~l~~p~~IiVGidfDC~e~mvyWtDv~g~SI~rasL~G~Ep~ti~n~~L~SPEGiAVDh~~ 1079 (1289)
T KOG1214|consen 1000 QIGYLPLNGTRLQKDAAKTLLSLPGSIIVGIDFDCRERMVYWTDVAGRSISRASLEGAEPETIVNSGLISPEGIAVDHIR 1079 (1289)
T ss_pred eEEEeecCcchhchhhhhceEecccceeeeeecccccceEEEeecCCCccccccccCCCCceeecccCCCccceeeeecc
Confidence 78888887 2 1 233333 66789999999999999999999999999999999999999999999999999999
Q ss_pred ceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCC--CeEEEEeCCCCceEEEEeccCCCC
Q psy6572 664 GYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHE--DYIAVSDLNGENIKIIVSRRMDPT 741 (1416)
Q Consensus 664 g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~~~~~p~ 741 (1416)
+.|||||.. ..+|++|.|||+.|++|+.++|..|++|++|+..+.|||+||.+ .+|+++.|||++|++|+...
T Consensus 1080 Rn~ywtDS~-lD~IevA~LdG~~rkvLf~tdLVNPR~iv~D~~rgnLYwtDWnRenPkIets~mDG~NrRilin~D---- 1154 (1289)
T KOG1214|consen 1080 RNMYWTDSV-LDKIEVALLDGSERKVLFYTDLVNPRAIVVDPIRGNLYWTDWNRENPKIETSSMDGENRRILINTD---- 1154 (1289)
T ss_pred ceeeeeccc-cchhheeecCCceeeEEEeecccCcceEEeecccCceeeccccccCCcceeeccCCccceEEeecc----
Confidence 999999954 56999999999999999999999999999999999999999975 59999999999999999986
Q ss_pred cccccceeEEE--ecCcEEEeecCCCeeEEecccCCCceEEEEeCCCCCCeeeeeecc
Q psy6572 742 INLHHVFALAV--FEDHLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVYHP 797 (1416)
Q Consensus 742 ~~l~~P~~lav--~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~h~ 797 (1416)
+.-|.||++ |...|-|.|.+++++.-++ .+|..++++. +.+..||+|.-|..
T Consensus 1155 --igLPNGLtfdpfs~~LCWvDAGt~rleC~~-p~g~gRR~i~-~~LqYPF~itsy~~ 1208 (1289)
T KOG1214|consen 1155 --IGLPNGLTFDPFSKLLCWVDAGTKRLECTL-PDGTGRRVIQ-NNLQYPFSITSYAD 1208 (1289)
T ss_pred --cCCCCCceeCcccceeeEEecCCcceeEec-CCCCcchhhh-hcccCceeeeeccc
Confidence 777888887 7788999999999998885 6788888887 89999999987754
No 5
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.64 E-value=2.4e-14 Score=162.72 Aligned_cols=213 Identities=25% Similarity=0.357 Sum_probs=159.3
Q ss_pred ceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEc
Q psy6572 569 AVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLIN 648 (1416)
Q Consensus 569 ~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~ 648 (1416)
+.|+.||..++.|||+|... +.|+++.+.+....++.+..|.|++++...+.||+++... +.+.++.....++++.
T Consensus 2 ~Egp~~d~~~g~l~~~D~~~--~~i~~~~~~~~~~~~~~~~~~~G~~~~~~~g~l~v~~~~~--~~~~d~~~g~~~~~~~ 77 (246)
T PF08450_consen 2 GEGPVWDPRDGRLYWVDIPG--GRIYRVDPDTGEVEVIDLPGPNGMAFDRPDGRLYVADSGG--IAVVDPDTGKVTVLAD 77 (246)
T ss_dssp EEEEEEETTTTEEEEEETTT--TEEEEEETTTTEEEEEESSSEEEEEEECTTSEEEEEETTC--EEEEETTTTEEEEEEE
T ss_pred CcceEEECCCCEEEEEEcCC--CEEEEEECCCCeEEEEecCCCceEEEEccCCEEEEEEcCc--eEEEecCCCcEEEEee
Confidence 46889999999999999985 4999999986655556666699999997789999999754 4444777666565555
Q ss_pred C-----CCCCcceeeecCCcceEEEeeCCCC-------ceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCC
Q psy6572 649 K-----GLQEPRGIALNPAYGYMYWTDWGQN-------AHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAH 716 (1416)
Q Consensus 649 ~-----~l~~P~gIavDp~~g~LYWtD~g~~-------~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~ 716 (1416)
. .+..|+.+++|+. |.||+|+.+.. .+|+|...+|+ ..++...+..||||++++.++.||++|..
T Consensus 78 ~~~~~~~~~~~ND~~vd~~-G~ly~t~~~~~~~~~~~~g~v~~~~~~~~--~~~~~~~~~~pNGi~~s~dg~~lyv~ds~ 154 (246)
T PF08450_consen 78 LPDGGVPFNRPNDVAVDPD-GNLYVTDSGGGGASGIDPGSVYRIDPDGK--VTVVADGLGFPNGIAFSPDGKTLYVADSF 154 (246)
T ss_dssp EETTCSCTEEEEEEEE-TT-S-EEEEEECCBCTTCGGSEEEEEEETTSE--EEEEEEEESSEEEEEEETTSSEEEEEETT
T ss_pred ccCCCcccCCCceEEEcCC-CCEEEEecCCCccccccccceEEECCCCe--EEEEecCcccccceEECCcchheeecccc
Confidence 3 5778999999997 77999986432 46999999844 33344568999999999988899999999
Q ss_pred CCeEEEEeCCC--C---ceEEEEeccCCCCcccccceeEEEe-cCcEEEeecCCCeeEEecccCCCceEEEEeCCCCCCe
Q psy6572 717 EDYIAVSDLNG--E---NIKIIVSRRMDPTINLHHVFALAVF-EDHLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPM 790 (1416)
Q Consensus 717 ~~~I~~~~ldG--~---~r~~v~~~~~~p~~~l~~P~~lav~-~d~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~ 790 (1416)
.++|++++++. . ++++++.... ....|.+|++. +++||++.+..++|.+++.. |+....+. ....+|.
T Consensus 155 ~~~i~~~~~~~~~~~~~~~~~~~~~~~----~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~-G~~~~~i~-~p~~~~t 228 (246)
T PF08450_consen 155 NGRIWRFDLDADGGELSNRRVFIDFPG----GPGYPDGLAVDSDGNLWVADWGGGRIVVFDPD-GKLLREIE-LPVPRPT 228 (246)
T ss_dssp TTEEEEEEEETTTCCEEEEEEEEE-SS----SSCEEEEEEEBTTS-EEEEEETTTEEEEEETT-SCEEEEEE--SSSSEE
T ss_pred cceeEEEeccccccceeeeeeEEEcCC----CCcCCCcceEcCCCCEEEEEcCCCEEEEECCC-ccEEEEEc-CCCCCEE
Confidence 99999999863 3 3455544321 12369999995 68999999999999999876 76555554 2334666
Q ss_pred eeee
Q psy6572 791 DLRV 794 (1416)
Q Consensus 791 ~I~v 794 (1416)
.++.
T Consensus 229 ~~~f 232 (246)
T PF08450_consen 229 NCAF 232 (246)
T ss_dssp EEEE
T ss_pred EEEE
Confidence 6555
No 6
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=99.64 E-value=6e-14 Score=187.27 Aligned_cols=234 Identities=18% Similarity=0.210 Sum_probs=175.4
Q ss_pred CeEEEEe--cEEEEEEecCCcce-EEec--------------ccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CC
Q psy6572 540 PNLLFTN--KYYIREVTQAGVMT-IRIH--------------NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQ 601 (1416)
Q Consensus 540 ~~li~s~--~~~I~~i~l~g~~~-~~~~--------------~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~ 601 (1416)
..|++++ .+.|++++++|... .+.. .+..|.||+||..++.|||+|..++ .|+++++. +.
T Consensus 580 g~lyVaDs~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt~n~--~Ir~id~~~~~ 657 (1057)
T PLN02919 580 NRLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADTENH--ALREIDFVNET 657 (1057)
T ss_pred CeEEEEECCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeCCCc--eEEEEecCCCE
Confidence 4566665 46788888877652 1111 2456899999999999999999864 88888876 33
Q ss_pred CeEEee------------------cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEc--------------C
Q psy6572 602 PELLFP------------------ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLIN--------------K 649 (1416)
Q Consensus 602 ~~~l~~------------------l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~--------------~ 649 (1416)
+++|.. +..|.+|+||+.+++||+++.+.++|.+.++.+....++.. .
T Consensus 658 V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~~I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~ 737 (1057)
T PLN02919 658 VRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQHQIWEYNISDGVTRVFSGDGYERNLNGSSGTST 737 (1057)
T ss_pred EEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCCeEEEEECCCCeEEEEecCCccccCCCCccccc
Confidence 333321 56899999999999999999999999999886554433321 2
Q ss_pred CCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee---------------------cCCCCCeeEEeecCCC
Q psy6572 650 GLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS---------------------KNLSWPNALTISYETN 708 (1416)
Q Consensus 650 ~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~---------------------~~l~~P~gLaiD~~~~ 708 (1416)
.+..|.|||++|..++||++|.+.+ +|.+.++++...++++. ..+.+|.||++|. ++
T Consensus 738 ~~~~P~GIavspdG~~LYVADs~n~-~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~-dG 815 (1057)
T PLN02919 738 SFAQPSGISLSPDLKELYIADSESS-SIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAK-DG 815 (1057)
T ss_pred cccCccEEEEeCCCCEEEEEECCCC-eEEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeC-CC
Confidence 3578999999999889999997755 89999987654433321 1256899999995 67
Q ss_pred eEEEecCCCCeEEEEeCCCCceEEEEeccCC-------CCcccccceeEEEe-cCcEEEeecCCCeeEEecccCCCc
Q psy6572 709 ELFWGDAHEDYIAVSDLNGENIKIIVSRRMD-------PTINLHHVFALAVF-EDHLFWTDWEMKSIERCDKYTGKN 777 (1416)
Q Consensus 709 rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~-------p~~~l~~P~~lav~-~d~LYwtD~~~~~I~~~nk~tG~~ 777 (1416)
.||++|+.+++|.+++.++....++...... ....+.+|.+|++. +++||++|..+++|..++..++..
T Consensus 816 ~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~~~~~ 892 (1057)
T PLN02919 816 QIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNKGEA 892 (1057)
T ss_pred cEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCCEEEEEECCCCcc
Confidence 8999999999999999987766665532100 01236789999995 679999999999999998877654
No 7
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=99.62 E-value=4e-14 Score=188.92 Aligned_cols=215 Identities=17% Similarity=0.234 Sum_probs=162.5
Q ss_pred EEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee----------------cCCCceEEEEccCCcEE
Q psy6572 561 IRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP----------------ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 561 ~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~----------------l~~p~gLAvD~~~~~LY 624 (1416)
++...+..|.+|++|..++.||++|..++ +|+++.+++.....+. +..|.|||||..++.||
T Consensus 562 ~~~s~l~~P~gvavd~~~g~lyVaDs~n~--rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LY 639 (1057)
T PLN02919 562 LLTSPLKFPGKLAIDLLNNRLFISDSNHN--RIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLY 639 (1057)
T ss_pred cccccCCCCceEEEECCCCeEEEEECCCC--eEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEE
Confidence 33456778889999999999999999865 8999887743221111 45799999999888999
Q ss_pred EeeCCCCeEEEeecCCCceEEEEcC----------------CCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCE
Q psy6572 625 WCDKGLDTIEVAKLDGRFRKVLINK----------------GLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPK 688 (1416)
Q Consensus 625 wtD~~~~~I~v~~ldG~~~~vLi~~----------------~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~ 688 (1416)
|+|...++|.++++.+...++|... .+..|.+|+++|..++||++|++.+ +|.+.+..+....
T Consensus 640 VaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~-~I~v~d~~~g~v~ 718 (1057)
T PLN02919 640 VADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQH-QIWEYNISDGVTR 718 (1057)
T ss_pred EEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCC-eEEEEECCCCeEE
Confidence 9999999999999887766666431 1568999999999999999998765 7888776543332
Q ss_pred EEe--------------ecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccC---------------C
Q psy6572 689 VII--------------SKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRM---------------D 739 (1416)
Q Consensus 689 vlv--------------~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~---------------~ 739 (1416)
++. ...+..|+||++++..++||++|..+++|.++++++....++..+.. .
T Consensus 719 ~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g 798 (1057)
T PLN02919 719 VFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVG 798 (1057)
T ss_pred EEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccCCCCch
Confidence 221 12367899999998778899999999999999987655544432110 0
Q ss_pred CCcccccceeEEEe-cCcEEEeecCCCeeEEecccCCCce
Q psy6572 740 PTINLHHVFALAVF-EDHLFWTDWEMKSIERCDKYTGKNC 778 (1416)
Q Consensus 740 p~~~l~~P~~lav~-~d~LYwtD~~~~~I~~~nk~tG~~~ 778 (1416)
....+.+|.+|++. ++.||++|+.+++|.+++..++...
T Consensus 799 ~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~ 838 (1057)
T PLN02919 799 SEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVT 838 (1057)
T ss_pred hhhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEE
Confidence 01236789999996 4689999999999999987655443
No 8
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.48 E-value=3.7e-12 Score=144.79 Aligned_cols=207 Identities=24% Similarity=0.269 Sum_probs=152.8
Q ss_pred CCeEEEEe--cEEEEEEecCCcceEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-------c
Q psy6572 539 PPNLLFTN--KYYIREVTQAGVMTIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-------A 608 (1416)
Q Consensus 539 ~~~li~s~--~~~I~~i~l~g~~~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-------l 608 (1416)
...|+|++ ...|+++++.+....+.. +..+.|++++...++||+++.. .+.++.+. ...+++.. +
T Consensus 11 ~g~l~~~D~~~~~i~~~~~~~~~~~~~~-~~~~~G~~~~~~~g~l~v~~~~----~~~~~d~~~g~~~~~~~~~~~~~~~ 85 (246)
T PF08450_consen 11 DGRLYWVDIPGGRIYRVDPDTGEVEVID-LPGPNGMAFDRPDGRLYVADSG----GIAVVDPDTGKVTVLADLPDGGVPF 85 (246)
T ss_dssp TTEEEEEETTTTEEEEEETTTTEEEEEE-SSSEEEEEEECTTSEEEEEETT----CEEEEETTTTEEEEEEEEETTCSCT
T ss_pred CCEEEEEEcCCCEEEEEECCCCeEEEEe-cCCCceEEEEccCCEEEEEEcC----ceEEEecCCCcEEEEeeccCCCccc
Confidence 45677776 567999998887632222 2339999999778999999876 45555666 44454444 4
Q ss_pred CCCceEEEEccCCcEEEeeCCC--------CeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEE
Q psy6572 609 TSPDGLTVDWVGRNLYWCDKGL--------DTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKA 680 (1416)
Q Consensus 609 ~~p~gLAvD~~~~~LYwtD~~~--------~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra 680 (1416)
..|++++||+. |+||+|+... ++|++.+.+|+.. ++. ..+..|+||+++|..+.||++|.... +|++.
T Consensus 86 ~~~ND~~vd~~-G~ly~t~~~~~~~~~~~~g~v~~~~~~~~~~-~~~-~~~~~pNGi~~s~dg~~lyv~ds~~~-~i~~~ 161 (246)
T PF08450_consen 86 NRPNDVAVDPD-GNLYVTDSGGGGASGIDPGSVYRIDPDGKVT-VVA-DGLGFPNGIAFSPDGKTLYVADSFNG-RIWRF 161 (246)
T ss_dssp EEEEEEEE-TT-S-EEEEEECCBCTTCGGSEEEEEEETTSEEE-EEE-EEESSEEEEEEETTSSEEEEEETTTT-EEEEE
T ss_pred CCCceEEEcCC-CCEEEEecCCCccccccccceEEECCCCeEE-EEe-cCcccccceEECCcchheeecccccc-eeEEE
Confidence 56889999974 6699999764 5699999995433 333 56889999999999999999997654 89999
Q ss_pred ecCCCC-----CEEEeecC--CCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEe
Q psy6572 681 KMDGSN-----PKVIISKN--LSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVF 753 (1416)
Q Consensus 681 ~mDGs~-----r~vlv~~~--l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~ 753 (1416)
.++... +++++... ...|.||+||. +++||++.+..+.|.+++.+|.-..+|... ..+|..+++-
T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~-~G~l~va~~~~~~I~~~~p~G~~~~~i~~p-------~~~~t~~~fg 233 (246)
T PF08450_consen 162 DLDADGGELSNRRVFIDFPGGPGYPDGLAVDS-DGNLWVADWGGGRIVVFDPDGKLLREIELP-------VPRPTNCAFG 233 (246)
T ss_dssp EEETTTCCEEEEEEEEE-SSSSCEEEEEEEBT-TS-EEEEEETTTEEEEEETTSCEEEEEE-S-------SSSEEEEEEE
T ss_pred eccccccceeeeeeEEEcCCCCcCCCcceEcC-CCCEEEEEcCCCEEEEECCCccEEEEEcCC-------CCCEEEEEEE
Confidence 997543 34454432 24699999996 889999999999999999999877666542 5689999984
Q ss_pred ---cCcEEEeec
Q psy6572 754 ---EDHLFWTDW 762 (1416)
Q Consensus 754 ---~d~LYwtD~ 762 (1416)
.+.||+|..
T Consensus 234 g~~~~~L~vTta 245 (246)
T PF08450_consen 234 GPDGKTLYVTTA 245 (246)
T ss_dssp STTSSEEEEEEB
T ss_pred CCCCCEEEEEeC
Confidence 467999864
No 9
>KOG4659|consensus
Probab=99.33 E-value=2.5e-11 Score=151.35 Aligned_cols=225 Identities=19% Similarity=0.239 Sum_probs=162.7
Q ss_pred CCCeEEEEecEEEEEEecCCcc-eEEe---cccccceeeeeecCCCeEEEeeccCCCccEEEEec-C-----CCCeEEee
Q psy6572 538 VPPNLLFTNKYYIREVTQAGVM-TIRI---HNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCN-N-----SQPELLFP 607 (1416)
Q Consensus 538 ~~~~li~s~~~~I~~i~l~g~~-~~~~---~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l-~-----s~~~~l~~ 607 (1416)
+...||+.+-++||||..+|.. +++. ....+-.-||+++..+.||++|.... +|+|+.- + .+.++++.
T Consensus 374 ~DGSl~VGDfNyIRRI~~dg~v~tIl~L~~t~~sh~Yy~AvsPvdgtlyvSdp~s~--qv~rv~sl~~~d~~~N~evvaG 451 (1899)
T KOG4659|consen 374 PDGSLIVGDFNYIRRISQDGQVSTILTLGLTDTSHSYYIAVSPVDGTLYVSDPLSK--QVWRVSSLEPQDSRNNYEVVAG 451 (1899)
T ss_pred CCCcEEEccchheeeecCCCceEEEEEecCCCccceeEEEecCcCceEEecCCCcc--eEEEeccCCccccccCeeEEec
Confidence 3557888889999999999988 4443 33445678999999999999999964 7887753 2 34455541
Q ss_pred ----------------------cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEc-----------------
Q psy6572 608 ----------------------ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLIN----------------- 648 (1416)
Q Consensus 608 ----------------------l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~----------------- 648 (1416)
|..|.||||| ..|+||++|.. +|.+++.+|-.++++-+
T Consensus 452 ~Ge~Clp~desCGDGalA~dA~L~~PkGIa~d-k~g~lYfaD~t--~IR~iD~~giIstlig~~~~~~~p~~C~~~~kl~ 528 (1899)
T KOG4659|consen 452 DGEVCLPADESCGDGALAQDAQLIFPKGIAFD-KMGNLYFADGT--RIRVIDTTGIISTLIGTTPDQHPPRTCAQITKLV 528 (1899)
T ss_pred cCcCccccccccCcchhcccceeccCCceeEc-cCCcEEEeccc--EEEEeccCceEEEeccCCCCccCccccccccchh
Confidence 5689999999 68999999975 68888877765554433
Q ss_pred -CCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee-------------------cCCCCCeeEEeecCCC
Q psy6572 649 -KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS-------------------KNLSWPNALTISYETN 708 (1416)
Q Consensus 649 -~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~-------------------~~l~~P~gLaiD~~~~ 708 (1416)
-.|.||..|||+|..+-||+.|- ..|.++..++.-+.++-. ..+.-|.+|||- ..+
T Consensus 529 ~~~leWPT~LaV~Pmdnsl~Vld~---nvvlrit~~~rV~Ii~GrP~hC~~a~~t~~~skla~H~tl~~~r~Iavg-~~G 604 (1899)
T KOG4659|consen 529 DLQLEWPTSLAVDPMDNSLLVLDT---NVVLRITVVHRVRIILGRPTHCDLANATSSASKLADHRTLLIQRDIAVG-TDG 604 (1899)
T ss_pred heeeecccceeecCCCCeEEEeec---ceEEEEccCccEEEEcCCccccccCCCchhhhhhhhhhhhhhhhceeec-CCc
Confidence 14789999999999999999993 377887777654422111 124457899997 489
Q ss_pred eEEEecCCCCeEEEEeCCCCceE-EEEeccCC--------------------CCcccccceeEEE-ecCcEEEeecCCCe
Q psy6572 709 ELFWGDAHEDYIAVSDLNGENIK-IIVSRRMD--------------------PTINLHHVFALAV-FEDHLFWTDWEMKS 766 (1416)
Q Consensus 709 rLYWtD~~~~~I~~~~ldG~~r~-~v~~~~~~--------------------p~~~l~~P~~lav-~~d~LYwtD~~~~~ 766 (1416)
.||+++....+|-++..-+++.+ .++.+... ....+..|++|+| ..+.||++|.++-+
T Consensus 605 ~lyvaEsD~rriNrvr~~~tdg~i~ilaGa~S~C~C~~~~~cdcfs~~~~~At~A~lnsp~alaVsPdg~v~IAD~gN~r 684 (1899)
T KOG4659|consen 605 ALYVAESDGRRINRVRKLSTDGTISILAGAKSPCSCDVAACCDCFSLRDVAATQAKLNSPYALAVSPDGDVIIADSGNSR 684 (1899)
T ss_pred eEEEEeccchhhhheEEeccCceEEEecCCCCCCCcccccCCccccccchhhhccccCCcceEEECCCCcEEEecCCchh
Confidence 99999987766666553333332 33333211 1235789999999 57899999999988
Q ss_pred eEEec
Q psy6572 767 IERCD 771 (1416)
Q Consensus 767 I~~~n 771 (1416)
|..+.
T Consensus 685 Ir~Vs 689 (1899)
T KOG4659|consen 685 IRKVS 689 (1899)
T ss_pred hhhhh
Confidence 87653
No 10
>KOG4659|consensus
Probab=99.28 E-value=8.4e-11 Score=146.74 Aligned_cols=199 Identities=22% Similarity=0.264 Sum_probs=149.9
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee-----cCCCceEEEEccCCcEEEeeCCCCeEEEee-cC
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP-----ATSPDGLTVDWVGRNLYWCDKGLDTIEVAK-LD 639 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~-----l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~-ld 639 (1416)
+-.|++|++-+ .+.||+-|.+ .|+|+..++++.+|++ ...-.-|||+|+.+.||++|...++|++++ |.
T Consensus 364 L~aPvala~a~-DGSl~VGDfN----yIRRI~~dg~v~tIl~L~~t~~sh~Yy~AvsPvdgtlyvSdp~s~qv~rv~sl~ 438 (1899)
T KOG4659|consen 364 LFAPVALAYAP-DGSLIVGDFN----YIRRISQDGQVSTILTLGLTDTSHSYYIAVSPVDGTLYVSDPLSKQVWRVSSLE 438 (1899)
T ss_pred eeceeeEEEcC-CCcEEEccch----heeeecCCCceEEEEEecCCCccceeEEEecCcCceEEecCCCcceEEEeccCC
Confidence 45688999865 6789999988 7999998877776666 344568999999999999999999888765 54
Q ss_pred CCc----eEEEEcC--------------------CCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee---
Q psy6572 640 GRF----RKVLINK--------------------GLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS--- 692 (1416)
Q Consensus 640 G~~----~~vLi~~--------------------~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~--- 692 (1416)
++. -+|+... .|..|+|||||.. |.||++|. .+|.+++-+|-.++++-+
T Consensus 439 ~~d~~~N~evvaG~Ge~Clp~desCGDGalA~dA~L~~PkGIa~dk~-g~lYfaD~---t~IR~iD~~giIstlig~~~~ 514 (1899)
T KOG4659|consen 439 PQDSRNNYEVVAGDGEVCLPADESCGDGALAQDAQLIFPKGIAFDKM-GNLYFADG---TRIRVIDTTGIISTLIGTTPD 514 (1899)
T ss_pred ccccccCeeEEeccCcCccccccccCcchhcccceeccCCceeEccC-CcEEEecc---cEEEEeccCceEEEeccCCCC
Confidence 432 2344321 4778999999965 99999993 368888777755444322
Q ss_pred ---------------cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCC----CC-----------c
Q psy6572 693 ---------------KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMD----PT-----------I 742 (1416)
Q Consensus 693 ---------------~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~----p~-----------~ 742 (1416)
-.|.||..|||++..+.||+.| ++.|.++..++..+ |+.+... ++ .
T Consensus 515 ~~~p~~C~~~~kl~~~~leWPT~LaV~Pmdnsl~Vld--~nvvlrit~~~rV~--Ii~GrP~hC~~a~~t~~~skla~H~ 590 (1899)
T KOG4659|consen 515 QHPPRTCAQITKLVDLQLEWPTSLAVDPMDNSLLVLD--TNVVLRITVVHRVR--IILGRPTHCDLANATSSASKLADHR 590 (1899)
T ss_pred ccCccccccccchhheeeecccceeecCCCCeEEEee--cceEEEEccCccEE--EEcCCccccccCCCchhhhhhhhhh
Confidence 2489999999999999999999 68999999888766 3333210 00 1
Q ss_pred ccccceeEEE-ecCcEEEeecCCCeeEEecccCCCc
Q psy6572 743 NLHHVFALAV-FEDHLFWTDWEMKSIERCDKYTGKN 777 (1416)
Q Consensus 743 ~l~~P~~lav-~~d~LYwtD~~~~~I~~~nk~tG~~ 777 (1416)
.+..|.+|+| ..+.||+++...++|.|+.+.+...
T Consensus 591 tl~~~r~Iavg~~G~lyvaEsD~rriNrvr~~~tdg 626 (1899)
T KOG4659|consen 591 TLLIQRDIAVGTDGALYVAESDGRRINRVRKLSTDG 626 (1899)
T ss_pred hhhhhhceeecCCceEEEEeccchhhhheEEeccCc
Confidence 2345788999 6789999999999999998775544
No 11
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL; EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.11 E-value=3.8e-08 Score=117.72 Aligned_cols=206 Identities=17% Similarity=0.153 Sum_probs=144.4
Q ss_pred ccccceeeeeecCCCeEEEeeccCCCccEEEEecCC--CCeE---Ee-----------e-cCCCceEEEEccCCcEEEee
Q psy6572 565 NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS--QPEL---LF-----------P-ATSPDGLTVDWVGRNLYWCD 627 (1416)
Q Consensus 565 ~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s--~~~~---l~-----------~-l~~p~gLAvD~~~~~LYwtD 627 (1416)
....|..|++++.++.||++.... +.|..+.++. .... +. . ..+|..+.+++.++.||++|
T Consensus 85 ~g~~p~~i~~~~~g~~l~vany~~--g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~d 162 (345)
T PF10282_consen 85 GGSSPCHIAVDPDGRFLYVANYGG--GSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPD 162 (345)
T ss_dssp SSSCEEEEEECTTSSEEEEEETTT--TEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEE
T ss_pred CCCCcEEEEEecCCCEEEEEEccC--CeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEe
Confidence 456778899999999999999885 4777777762 2111 11 0 35678999999999999999
Q ss_pred CCCCeEEEeecCCCc--e---EEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCC-CCCEEEee---c---C-
Q psy6572 628 KGLDTIEVAKLDGRF--R---KVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDG-SNPKVIIS---K---N- 694 (1416)
Q Consensus 628 ~~~~~I~v~~ldG~~--~---~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDG-s~r~vlv~---~---~- 694 (1416)
.+..+|.+.+++... . ..+.......||.|+++|...+||++.... ..|.++.++. .....++. . .
T Consensus 163 lG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s-~~v~v~~~~~~~g~~~~~~~~~~~~~~~ 241 (345)
T PF10282_consen 163 LGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELS-NTVSVFDYDPSDGSLTEIQTISTLPEGF 241 (345)
T ss_dssp TTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTT-TEEEEEEEETTTTEEEEEEEEESCETTS
T ss_pred cCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCC-CcEEEEeecccCCceeEEEEeeeccccc
Confidence 999999999997654 2 223335567799999999999999998543 4788888872 22222221 1 1
Q ss_pred --CCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCC--ceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCCCeeE
Q psy6572 695 --LSWPNALTISYETNELFWGDAHEDYIAVSDLNGE--NIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEMKSIE 768 (1416)
Q Consensus 695 --l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~--~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~I~ 768 (1416)
..+|.+|+|.+..++||++....+.|..++++.. ..+.+.... ..-.+|.+|++ .+.+||++...++.|.
T Consensus 242 ~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~----~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~ 317 (345)
T PF10282_consen 242 TGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVP----TGGKFPRHFAFSPDGRYLYVANQDSNTVS 317 (345)
T ss_dssp CSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEE----ESSSSEEEEEE-TTSSEEEEEETTTTEEE
T ss_pred cccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEe----CCCCCccEEEEeCCCCEEEEEecCCCeEE
Confidence 2378999999999999999999999998888543 333332211 11456999999 6789999999988766
Q ss_pred Ee--cccCCCc
Q psy6572 769 RC--DKYTGKN 777 (1416)
Q Consensus 769 ~~--nk~tG~~ 777 (1416)
.. +..+|.-
T Consensus 318 vf~~d~~tG~l 328 (345)
T PF10282_consen 318 VFDIDPDTGKL 328 (345)
T ss_dssp EEEEETTTTEE
T ss_pred EEEEeCCCCcE
Confidence 44 5455553
No 12
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=99.08 E-value=1.2e-08 Score=118.50 Aligned_cols=207 Identities=21% Similarity=0.309 Sum_probs=142.0
Q ss_pred ecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCC
Q psy6572 563 IHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDG 640 (1416)
Q Consensus 563 ~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG 640 (1416)
.....-..|..|+...+.|||+|... ++|.|+... +..+++.. -..+.++.+| ..++|..++.+...+.+ -.|
T Consensus 21 ~~~~~~gEgP~w~~~~~~L~w~DI~~--~~i~r~~~~~g~~~~~~~p~~~~~~~~~d-~~g~Lv~~~~g~~~~~~--~~~ 95 (307)
T COG3386 21 DKGATLGEGPVWDPDRGALLWVDILG--GRIHRLDPETGKKRVFPSPGGFSSGALID-AGGRLIACEHGVRLLDP--DTG 95 (307)
T ss_pred ecccccccCccCcCCCCEEEEEeCCC--CeEEEecCCcCceEEEECCCCcccceeec-CCCeEEEEccccEEEec--cCC
Confidence 33445567788999999999999995 599999987 54555554 4446666676 67888877776544444 234
Q ss_pred CceEEEEc----CCCCCcceeeecCCcceEEEeeCC-----CC-----ceEEEEecCCCCCEEEeecCCCCCeeEEeecC
Q psy6572 641 RFRKVLIN----KGLQEPRGIALNPAYGYMYWTDWG-----QN-----AHIGKAKMDGSNPKVIISKNLSWPNALTISYE 706 (1416)
Q Consensus 641 ~~~~vLi~----~~l~~P~gIavDp~~g~LYWtD~g-----~~-----~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~ 706 (1416)
..++.+.. ..+..|+.+.|+|. |.+|+++.+ .. ..|+|++.+|. ...++...+..||||++++.
T Consensus 96 ~~~t~~~~~~~~~~~~r~ND~~v~pd-G~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~-~~~l~~~~~~~~NGla~SpD 173 (307)
T COG3386 96 GKITLLAEPEDGLPLNRPNDGVVDPD-GRIWFGDMGYFDLGKSEERPTGSLYRVDPDGG-VVRLLDDDLTIPNGLAFSPD 173 (307)
T ss_pred ceeEEeccccCCCCcCCCCceeEcCC-CCEEEeCCCccccCccccCCcceEEEEcCCCC-EEEeecCcEEecCceEECCC
Confidence 44344433 34678999999998 999999977 21 46888887543 44455556999999999999
Q ss_pred CCeEEEecCCCCeEEEEeCC---C--CceEEEEeccCCCCcccccceeEEEecC-cEE-EeecCCCeeEEecccCCCceE
Q psy6572 707 TNELFWGDAHEDYIAVSDLN---G--ENIKIIVSRRMDPTINLHHVFALAVFED-HLF-WTDWEMKSIERCDKYTGKNCT 779 (1416)
Q Consensus 707 ~~rLYWtD~~~~~I~~~~ld---G--~~r~~v~~~~~~p~~~l~~P~~lav~~d-~LY-wtD~~~~~I~~~nk~tG~~~~ 779 (1416)
+..||++|+..++|+++.++ | .+++..+.... .-..|.++++..+ .|| ++-|....|.+.+.. |....
T Consensus 174 g~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~----~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l~~ 248 (307)
T COG3386 174 GKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDE----EPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKLLG 248 (307)
T ss_pred CCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccC----CCCCCCceEEeCCCCEEEecccCCceEEEECCC-CcEEE
Confidence 99999999999999999887 2 12222222110 1357999999755 555 445555578887765 55444
Q ss_pred EE
Q psy6572 780 SV 781 (1416)
Q Consensus 780 ~l 781 (1416)
.+
T Consensus 249 ~i 250 (307)
T COG3386 249 EI 250 (307)
T ss_pred EE
Confidence 33
No 13
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL; EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.07 E-value=7.2e-08 Score=115.40 Aligned_cols=246 Identities=19% Similarity=0.171 Sum_probs=161.6
Q ss_pred EEEEEecCCcc---eE--EecccccceeeeeecCCCeEEEeeccC-CCccEEEEecCC---CCeEEee----cCCCceEE
Q psy6572 549 YIREVTQAGVM---TI--RIHNQTNAVGLDFDWVDNCLYWSDVTM-HGSSIRRSCNNS---QPELLFP----ATSPDGLT 615 (1416)
Q Consensus 549 ~I~~i~l~g~~---~~--~~~~l~~~~~l~~D~~~~~LYwtD~~~-~~~~I~r~~l~s---~~~~l~~----l~~p~gLA 615 (1416)
.|+.+.++... ++ .+....+|..|++++..+.||.+.... ....|..+.++. ..+.+.. ...|-.|+
T Consensus 14 gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~ 93 (345)
T PF10282_consen 14 GIYVFRFDEETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIA 93 (345)
T ss_dssp EEEEEEEETTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEESSSCEEEEE
T ss_pred cEEEEEEcCCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeeccCCCCcEEEE
Confidence 46555553332 21 234667889999999999999998853 235677666662 2333332 56788999
Q ss_pred EEccCCcEEEeeCCCCeEEEeecC--CCceEE--EEc----------CCCCCcceeeecCCcceEEEeeCCCCceEEEEe
Q psy6572 616 VDWVGRNLYWCDKGLDTIEVAKLD--GRFRKV--LIN----------KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAK 681 (1416)
Q Consensus 616 vD~~~~~LYwtD~~~~~I~v~~ld--G~~~~v--Li~----------~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~ 681 (1416)
+|+.++.||+++++.+.|.++.++ |..... ++. .....|..++++|..++||++|.|.. +|.+..
T Consensus 94 ~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~D-~v~~~~ 172 (345)
T PF10282_consen 94 VDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGAD-RVYVYD 172 (345)
T ss_dssp ECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTTT-EEEEEE
T ss_pred EecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCCC-EEEEEE
Confidence 999999999999999999998886 444332 221 23567889999999999999998865 899999
Q ss_pred cCCCCC-E----EEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCC--CCceEEEEeccCCCC--cccccceeEEE
Q psy6572 682 MDGSNP-K----VIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLN--GENIKIIVSRRMDPT--INLHHVFALAV 752 (1416)
Q Consensus 682 mDGs~r-~----vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ld--G~~r~~v~~~~~~p~--~~l~~P~~lav 752 (1416)
++.... . .+....-..|+.|++++...+||++....+.|..++++ ....+.+......|. .....|.+|++
T Consensus 173 ~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~i 252 (345)
T PF10282_consen 173 IDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAI 252 (345)
T ss_dssp E-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE
T ss_pred EeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEE
Confidence 987652 1 12223456799999999889999999999999999887 222222222111110 11236888888
Q ss_pred e--cCcEEEeecCCCeeEEecc--cCCCceEEE-EeCCCCCCeeeeee
Q psy6572 753 F--EDHLFWTDWEMKSIERCDK--YTGKNCTSV-VKNLVHKPMDLRVY 795 (1416)
Q Consensus 753 ~--~d~LYwtD~~~~~I~~~nk--~tG~~~~~l-~~~~~~~p~~I~v~ 795 (1416)
. +.+||++....+.|..... .+|.-..+- +......|.+|.+-
T Consensus 253 spdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s 300 (345)
T PF10282_consen 253 SPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFS 300 (345)
T ss_dssp -TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-
T ss_pred ecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEe
Confidence 6 6789999999988665543 444432221 11223457777763
No 14
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.07 E-value=8.4e-08 Score=114.01 Aligned_cols=220 Identities=14% Similarity=0.117 Sum_probs=144.9
Q ss_pred ccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCC-eEEe--e-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCC
Q psy6572 567 TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQP-ELLF--P-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDG 640 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~-~~l~--~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG 640 (1416)
..|..|++++.++.||.+.... +.|..+.++ +.. +.+. . ...|.++++++.++.||+++.+.++|.+.+++.
T Consensus 80 ~~p~~i~~~~~g~~l~v~~~~~--~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~ 157 (330)
T PRK11028 80 GSPTHISTDHQGRFLFSASYNA--NCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSD 157 (330)
T ss_pred CCceEEEECCCCCEEEEEEcCC--CeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCEEEEEEECC
Confidence 4688999999999999998764 367777665 211 2222 1 467899999999999999999999999999864
Q ss_pred Cc--e----EEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCC-CCCEEEee------c---CCCCCeeEEee
Q psy6572 641 RF--R----KVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDG-SNPKVIIS------K---NLSWPNALTIS 704 (1416)
Q Consensus 641 ~~--~----~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDG-s~r~vlv~------~---~l~~P~gLaiD 704 (1416)
.. . ..+.......|++|+++|..++||+++.+.+ .|...+++. +....++. . ...||.+|+++
T Consensus 158 ~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~-~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~ 236 (330)
T PRK11028 158 DGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNS-SVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHIT 236 (330)
T ss_pred CCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCC-EEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEEC
Confidence 21 1 1111123467999999999999999986544 777777763 12211211 1 12366689999
Q ss_pred cCCCeEEEecCCCCeEEEEeC--CCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCCCeeEE--ecccCCCce
Q psy6572 705 YETNELFWGDAHEDYIAVSDL--NGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEMKSIER--CDKYTGKNC 778 (1416)
Q Consensus 705 ~~~~rLYWtD~~~~~I~~~~l--dG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~I~~--~nk~tG~~~ 778 (1416)
+.+++||.++...+.|..+++ ++...+++.... ....|.+|++ .+.+||.+....+.|.. ++..+|.-.
T Consensus 237 pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~~~-----~~~~p~~~~~~~dg~~l~va~~~~~~v~v~~~~~~~g~l~ 311 (330)
T PRK11028 237 PDGRHLYACDRTASLISVFSVSEDGSVLSFEGHQP-----TETQPRGFNIDHSGKYLIAAGQKSHHISVYEIDGETGLLT 311 (330)
T ss_pred CCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEEEe-----ccccCCceEECCCCCEEEEEEccCCcEEEEEEcCCCCcEE
Confidence 888899999888888888776 343333332211 1235667776 47799999987776554 444444421
Q ss_pred EEEEeCCCCCCeeeee
Q psy6572 779 TSVVKNLVHKPMDLRV 794 (1416)
Q Consensus 779 ~~l~~~~~~~p~~I~v 794 (1416)
.+-.-.....|+.|+|
T Consensus 312 ~~~~~~~g~~P~~~~~ 327 (330)
T PRK11028 312 ELGRYAVGQGPMWVSV 327 (330)
T ss_pred EccccccCCCceEEEE
Confidence 1111023456777776
No 15
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.05 E-value=7.9e-08 Score=114.22 Aligned_cols=229 Identities=13% Similarity=0.121 Sum_probs=148.0
Q ss_pred eEEEEe--cEEEEEEecC--CcceE--EecccccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEee---cC
Q psy6572 541 NLLFTN--KYYIREVTQA--GVMTI--RIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFP---AT 609 (1416)
Q Consensus 541 ~li~s~--~~~I~~i~l~--g~~~~--~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~---l~ 609 (1416)
++|+++ ...|+.+++. |..++ .+....++..|++++..+.||++.... ..|..+.++ +..+.+.. ..
T Consensus 3 ~~y~~~~~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~--~~i~~~~~~~~g~l~~~~~~~~~~ 80 (330)
T PRK11028 3 IVYIASPESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPE--FRVLSYRIADDGALTFAAESPLPG 80 (330)
T ss_pred EEEEEcCCCCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCC--CcEEEEEECCCCceEEeeeecCCC
Confidence 345543 2345555553 32221 122345788899999999999987654 367666555 22333322 45
Q ss_pred CCceEEEEccCCcEEEeeCCCCeEEEeecC--CCceEEEE-cCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCC
Q psy6572 610 SPDGLTVDWVGRNLYWCDKGLDTIEVAKLD--GRFRKVLI-NKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSN 686 (1416)
Q Consensus 610 ~p~gLAvD~~~~~LYwtD~~~~~I~v~~ld--G~~~~vLi-~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~ 686 (1416)
.|.+|++++.++.||.+....++|.+.+++ |...+.+. ......|.+++++|..++||+++.+.. +|...+++...
T Consensus 81 ~p~~i~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~-~v~v~d~~~~g 159 (330)
T PRK11028 81 SPTHISTDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKED-RIRLFTLSDDG 159 (330)
T ss_pred CceEEEECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCC-EEEEEEECCCC
Confidence 799999999999999999888999998875 43322221 134567999999999999999998755 78888876422
Q ss_pred CE------EEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCC--CceEEEEeccCCCC--cccccceeEEEe--c
Q psy6572 687 PK------VIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNG--ENIKIIVSRRMDPT--INLHHVFALAVF--E 754 (1416)
Q Consensus 687 r~------vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG--~~r~~v~~~~~~p~--~~l~~P~~lav~--~ 754 (1416)
.. .+....-..|+++++++..++||+++...+.|...+++. ...+.+..-...|. ....+|.+|++. +
T Consensus 160 ~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg 239 (330)
T PRK11028 160 HLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDG 239 (330)
T ss_pred cccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCC
Confidence 11 111112356999999998899999999889998888762 22233322111010 012245556653 5
Q ss_pred CcEEEeecCCCeeEEecc
Q psy6572 755 DHLFWTDWEMKSIERCDK 772 (1416)
Q Consensus 755 d~LYwtD~~~~~I~~~nk 772 (1416)
.+||.++...+.|..++.
T Consensus 240 ~~lyv~~~~~~~I~v~~i 257 (330)
T PRK11028 240 RHLYACDRTASLISVFSV 257 (330)
T ss_pred CEEEEecCCCCeEEEEEE
Confidence 589999887777766543
No 16
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.92 E-value=2e-07 Score=108.57 Aligned_cols=210 Identities=17% Similarity=0.200 Sum_probs=141.4
Q ss_pred CCeEEEEe--cEEEEEEecC-CcceEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCC-eEEee------
Q psy6572 539 PPNLLFTN--KYYIREVTQA-GVMTIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQP-ELLFP------ 607 (1416)
Q Consensus 539 ~~~li~s~--~~~I~~i~l~-g~~~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~-~~l~~------ 607 (1416)
...|+|++ ...|++++.. |...+.........++.+| ..++|..++.. ++++.++ ... +++..
T Consensus 36 ~~~L~w~DI~~~~i~r~~~~~g~~~~~~~p~~~~~~~~~d-~~g~Lv~~~~g-----~~~~~~~~~~~~t~~~~~~~~~~ 109 (307)
T COG3386 36 RGALLWVDILGGRIHRLDPETGKKRVFPSPGGFSSGALID-AGGRLIACEHG-----VRLLDPDTGGKITLLAEPEDGLP 109 (307)
T ss_pred CCEEEEEeCCCCeEEEecCCcCceEEEECCCCcccceeec-CCCeEEEEccc-----cEEEeccCCceeEEeccccCCCC
Confidence 44577766 5678888876 4333333322334445554 35556655443 3333334 333 34433
Q ss_pred cCCCceEEEEccCCcEEEeeCC-----C------CeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCce
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKG-----L------DTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAH 676 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~-----~------~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~ 676 (1416)
+..|+.+.||+. +.+|+++.. . +.|++++..|...+.+ ...+..|+|||++|....||++|...+ +
T Consensus 110 ~~r~ND~~v~pd-G~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~~~l~-~~~~~~~NGla~SpDg~tly~aDT~~~-~ 186 (307)
T COG3386 110 LNRPNDGVVDPD-GRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVVRLL-DDDLTIPNGLAFSPDGKTLYVADTPAN-R 186 (307)
T ss_pred cCCCCceeEcCC-CCEEEeCCCccccCccccCCcceEEEEcCCCCEEEee-cCcEEecCceEECCCCCEEEEEeCCCC-e
Confidence 678999999975 889999988 2 3688888765555444 356899999999999999999997654 8
Q ss_pred EEEEecC---CC--CCEEEee--cCCCCCeeEEeecCCCeEE-EecCCCCeEEEEeCCCCceEEEEeccCCCCcccccce
Q psy6572 677 IGKAKMD---GS--NPKVIIS--KNLSWPNALTISYETNELF-WGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVF 748 (1416)
Q Consensus 677 I~ra~mD---Gs--~r~vlv~--~~l~~P~gLaiD~~~~rLY-WtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~ 748 (1416)
|+|..++ |. +++..+. ..-..|-|+++|. ++.|| ++-+....|.+.+.+|.....+..- ...|.
T Consensus 187 i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDa-dG~lw~~a~~~g~~v~~~~pdG~l~~~i~lP-------~~~~t 258 (307)
T COG3386 187 IHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDA-DGNLWVAAVWGGGRVVRFNPDGKLLGEIKLP-------VKRPT 258 (307)
T ss_pred EEEEecCcccCccCCcceEEEccCCCCCCCceEEeC-CCCEEEecccCCceEEEECCCCcEEEEEECC-------CCCCc
Confidence 9999888 32 2332333 3347899999994 77777 5555556999999998877766441 35677
Q ss_pred eEEEec---CcEEEeecCCC
Q psy6572 749 ALAVFE---DHLFWTDWEMK 765 (1416)
Q Consensus 749 ~lav~~---d~LYwtD~~~~ 765 (1416)
.+++-+ +.||+|....+
T Consensus 259 ~~~FgG~~~~~L~iTs~~~~ 278 (307)
T COG3386 259 NPAFGGPDLNTLYITSARSG 278 (307)
T ss_pred cceEeCCCcCEEEEEecCCC
Confidence 777766 88999987663
No 17
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=98.85 E-value=4.4e-07 Score=109.94 Aligned_cols=221 Identities=15% Similarity=0.186 Sum_probs=152.8
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCc
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF 642 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~ 642 (1416)
.+.+.++++.+....+||+..... .|..+... ...+.... ...|.+++|...+.++|.+....+.|.+.++....
T Consensus 30 ~~~~~~v~~~~~g~~~~v~~~~~~--~~~~~~~~~n~~~~~~~~g~~~p~~i~v~~~~~~vyv~~~~~~~v~vid~~~~~ 107 (381)
T COG3391 30 GRGPGGVAVNPDGTQVYVANSGSN--DVSVIDATSNTVTQSLSVGGVYPAGVAVNPAGNKVYVTTGDSNTVSVIDTATNT 107 (381)
T ss_pred CCCCceeEEcCccCEEEEEeecCc--eeeecccccceeeeeccCCCccccceeeCCCCCeEEEecCCCCeEEEEcCcccc
Confidence 347889999999999999988753 34433332 11112122 37799999999999999999999999999965544
Q ss_pred eEEEEcCCCCCcceeeecCCcceEEEeeCCC-CceEEEEecCCCCCEEEeecC-CCCCeeEEeecCCCeEEEecCCCCeE
Q psy6572 643 RKVLINKGLQEPRGIALNPAYGYMYWTDWGQ-NAHIGKAKMDGSNPKVIISKN-LSWPNALTISYETNELFWGDAHEDYI 720 (1416)
Q Consensus 643 ~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~-~~~I~ra~mDGs~r~vlv~~~-l~~P~gLaiD~~~~rLYWtD~~~~~I 720 (1416)
....+..+. .|.+|+++|..++||+++.+. +..|.+++-.... ++.... -..|.++++++...+||.++...+.|
T Consensus 108 ~~~~~~vG~-~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~--~~~~~~vG~~P~~~a~~p~g~~vyv~~~~~~~v 184 (381)
T COG3391 108 VLGSIPVGL-GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNK--VTATIPVGNTPTGVAVDPDGNKVYVTNSDDNTV 184 (381)
T ss_pred eeeEeeecc-CCceEEECCCCCEEEEEecccCCceEEEEeCCCCe--EEEEEecCCCcceEEECCCCCeEEEEecCCCeE
Confidence 332222222 899999999999999999852 4567766554333 333222 23689999999999999999999999
Q ss_pred EEEeCCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCC--CeeEEecccCCCceEE-EEeCCCCCCeeeee
Q psy6572 721 AVSDLNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEM--KSIERCDKYTGKNCTS-VVKNLVHKPMDLRV 794 (1416)
Q Consensus 721 ~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~--~~I~~~nk~tG~~~~~-l~~~~~~~p~~I~v 794 (1416)
..++..+..... -.. .........|.++++ .+.++|+++..+ +.|.+++..++..... +....+ .|+++.+
T Consensus 185 ~vi~~~~~~v~~-~~~-~~~~~~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~~~v~~~~~~~~~~-~~~~v~~ 260 (381)
T COG3391 185 SVIDTSGNSVVR-GSV-GSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKIDTATGNVTATDLPVGSG-APRGVAV 260 (381)
T ss_pred EEEeCCCcceec-ccc-ccccccCCCCceEEECCCCCEEEEEeccCCCceEEEEeCCCceEEEeccccccC-CCCceeE
Confidence 999977765553 111 001112567888888 567899999998 6888888766655444 221333 5555554
No 18
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the receptor for LDL, the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR from its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and are predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. 
LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor 2; these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class B (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 2-4 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 2-4 from the first sequence repeat. The repeat is found in a variety of proteins that include the vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=98.84 E-value=5e-09 Score=84.74 Aligned_cols=42 Identities=40% Similarity=0.970 Sum_probs=39.8
Q ss_pred ceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeec
Q psy6572 664 GYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISY 705 (1416)
Q Consensus 664 g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~ 705 (1416)
++|||||++..++|++++|||+++++|+...|.+|+|||||+
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 589999999878999999999999999999999999999985
No 19
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=98.78 E-value=6.9e-07 Score=107.84 Aligned_cols=195 Identities=19% Similarity=0.272 Sum_probs=130.3
Q ss_pred eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-C------CCeEEee-c--------CCCceEEEEccCCcE
Q psy6572 560 TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-S------QPELLFP-A--------TSPDGLTVDWVGRNL 623 (1416)
Q Consensus 560 ~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s------~~~~l~~-l--------~~p~gLAvD~~~~~L 623 (1416)
++++.++..+.||++... + ||+++.. .|.++... . ..++|+. + ..+.+|++++ .++|
T Consensus 65 ~vfa~~l~~p~Gi~~~~~-G-lyV~~~~----~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gp-DG~L 137 (367)
T TIGR02604 65 NVFAEELSMVTGLAVAVG-G-VYVATPP----DILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGP-DGWL 137 (367)
T ss_pred EEeecCCCCccceeEecC-C-EEEeCCC----eEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECC-CCCE
Confidence 577788999999999654 4 9998654 67777432 1 4556655 2 2277899997 6799
Q ss_pred EEeeCCC-------------------CeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEec--
Q psy6572 624 YWCDKGL-------------------DTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM-- 682 (1416)
Q Consensus 624 YwtD~~~-------------------~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m-- 682 (1416)
|++.... +.|.+++.+|+..+++. .++..|.||++|+ .|.||++|-+..+ ..++..
T Consensus 138 Yv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a-~G~rnp~Gl~~d~-~G~l~~tdn~~~~-~~~i~~~~ 214 (367)
T TIGR02604 138 YFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVA-HGFQNPYGHSVDS-WGDVFFCDNDDPP-LCRVTPVA 214 (367)
T ss_pred EEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEEe-cCcCCCccceECC-CCCEEEEccCCCc-eeEEcccc
Confidence 9987721 46899999998777654 7799999999998 5899999854332 222211
Q ss_pred ----------CCC---------CCEE-----------Ee-----ecCCCCCeeEEee-------cCCCeEEEecCCCCeE
Q psy6572 683 ----------DGS---------NPKV-----------II-----SKNLSWPNALTIS-------YETNELFWGDAHEDYI 720 (1416)
Q Consensus 683 ----------DGs---------~r~v-----------lv-----~~~l~~P~gLaiD-------~~~~rLYWtD~~~~~I 720 (1416)
.|. ...+ ++ ......|.|+++- ...+.||++++..+.|
T Consensus 215 ~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ap~G~~~y~g~~fp~~~~g~~fv~~~~~~~v 294 (367)
T TIGR02604 215 EGGRNGYQSFNGRRYDHADRGADHEVPTGEWRQDDRGVETVGDVAGGGTAPCGIAFYRGDALPEEYRGLLLVGDAHGQLI 294 (367)
T ss_pred cccccCCCCCCCcccccccccccccccccccccccccccccccccCCCccccEEEEeCCCcCCHHHCCCEEeeeccCCEE
Confidence 010 0000 00 0011368899996 3458999999999999
Q ss_pred EEEeCC--CCceE----EEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeE
Q psy6572 721 AVSDLN--GENIK----IIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIE 768 (1416)
Q Consensus 721 ~~~~ld--G~~r~----~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~ 768 (1416)
.++.++ |...+ .++... ..+..|..|++ ..+.||++||....|.
T Consensus 295 ~~~~l~~~g~~~~~~~~~~l~~~----~~~~rp~dv~~~pDG~Lyv~d~~~~~i~ 345 (367)
T TIGR02604 295 VRYSLEPKGAGFKGERPEFLRSN----DTWFRPVNVTVGPDGALYVSDWYDRGIE 345 (367)
T ss_pred EEEEeecCCCccEeecCceEecC----CCcccccceeECCCCCEEEEEeccCccc
Confidence 999875 43222 122221 12467888888 4789999998765554
No 20
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) regulates cholesterol homeostasis in mammalian cells; its ligand, LDL, is the major cholesterol-carrying lipoprotein of plasma. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface. The internal dissociation of the LDLR from its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein with five regions. The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation. Similar domains have been found in other extracellular and membrane proteins. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and are predicted to form a beta-propeller structure. This region is critical for ligand release and recycling of the receptor. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.
LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor 2; these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins. This entry represents the LDLR class A (cysteine-rich) repeat, which contains six disulphide-bonded cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350-residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholesterolemia mutations of the LDL receptor alter the calcium-coordinating residue of LDL-A domains or other crucial scaffolding residues.; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.78 E-value=4.2e-09 Score=82.30 Aligned_cols=36 Identities=53% Similarity=1.210 Sum_probs=24.1
Q ss_pred ccCCCcEEecCCeEecccccccCCCCCCCCCCCCCC
Q psy6572 30 DCRPGYFKCDNNKCILSSHTCNNINDCGDGSDEADC 65 (1416)
Q Consensus 30 ~c~~~~f~C~n~~ci~~~~~Cdg~~dC~d~sDE~~C 65 (1416)
+|++++|+|.+++||+..|+|||+.||.|||||.+|
T Consensus 2 ~C~~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C 37 (37)
T PF00057_consen 2 TCPPGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC 37 (37)
T ss_dssp SSSTTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred cCcCCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence 466666666666666666666666666666666654
No 21
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) regulates cholesterol homeostasis in mammalian cells; its ligand, LDL, is the major cholesterol-carrying lipoprotein of plasma. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface. The internal dissociation of the LDLR from its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein with five regions. The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation. Similar domains have been found in other extracellular and membrane proteins. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and are predicted to form a beta-propeller structure. This region is critical for ligand release and recycling of the receptor. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.
LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor 2; these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins. This entry represents the LDLR class A (cysteine-rich) repeat, which contains six disulphide-bonded cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350-residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholesterolemia mutations of the LDL receptor alter the calcium-coordinating residue of LDL-A domains or other crucial scaffolding residues.; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.76 E-value=4.4e-09 Score=82.21 Aligned_cols=37 Identities=51% Similarity=1.066 Sum_probs=34.8
Q ss_pred CCCCCCceeecCCceeCCCcccCCCCCCCCCCCCCCC
Q psy6572 66 STCGNDTFHCDMGMCIHKALRCDVDPDCPDASDEMHC 102 (1416)
Q Consensus 66 ~~C~~~~f~C~~g~Ci~~~~~Cdg~~dC~d~sDE~~C 102 (1416)
+.|...+|+|.+|+||+..|+|||..||.|+|||.+|
T Consensus 1 ~~C~~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C 37 (37)
T PF00057_consen 1 PTCPPGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC 37 (37)
T ss_dssp SSSSTTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred CcCcCCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence 4688999999999999999999999999999999876
No 22
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=98.75 E-value=8.8e-07 Score=107.33 Aligned_cols=203 Identities=14% Similarity=0.154 Sum_probs=149.0
Q ss_pred ccceeeeeecCCCeEEEeeccCCCccEEEEecCC-C-CeEEeecCCCceEEEEccCCcEEEeeCC--CCeEEEeecCCCc
Q psy6572 567 TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS-Q-PELLFPATSPDGLTVDWVGRNLYWCDKG--LDTIEVAKLDGRF 642 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s-~-~~~l~~l~~p~gLAvD~~~~~LYwtD~~--~~~I~v~~ldG~~ 642 (1416)
..+.++++...+.++|.+....+ .|..+.... . ...+.....|.+||+++.+++||+++.+ .++|.+++.....
T Consensus 74 ~~p~~i~v~~~~~~vyv~~~~~~--~v~vid~~~~~~~~~~~vG~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~ 151 (381)
T COG3391 74 VYPAGVAVNPAGNKVYVTTGDSN--TVSVIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNK 151 (381)
T ss_pred ccccceeeCCCCCeEEEecCCCC--eEEEEcCcccceeeEeeeccCCceEEECCCCCEEEEEecccCCceEEEEeCCCCe
Confidence 56789999999999999998754 788887662 2 2222224599999999999999999995 6889988877554
Q ss_pred eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEE-E---eecCCCCCeeEEeecCCCeEEEecCCC-
Q psy6572 643 RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKV-I---ISKNLSWPNALTISYETNELFWGDAHE- 717 (1416)
Q Consensus 643 ~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~v-l---v~~~l~~P~gLaiD~~~~rLYWtD~~~- 717 (1416)
....+ .....|.++|++|...++|.++... ..|...++.+..... - .......|.+++|++...++|+++..+
T Consensus 152 ~~~~~-~vG~~P~~~a~~p~g~~vyv~~~~~-~~v~vi~~~~~~v~~~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~~~ 229 (381)
T COG3391 152 VTATI-PVGNTPTGVAVDPDGNKVYVTNSDD-NTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSG 229 (381)
T ss_pred EEEEE-ecCCCcceEEECCCCCeEEEEecCC-CeEEEEeCCCcceeccccccccccCCCCceEEECCCCCEEEEEeccCC
Confidence 33333 2233689999999999999999554 478888866654442 0 123467899999999999999999887
Q ss_pred -CeEEEEeCCCCceEEE-EeccCCCCcccccceeEEE--ecCcEEEeecCCCeeEEecccCCCceE
Q psy6572 718 -DYIAVSDLNGENIKII-VSRRMDPTINLHHVFALAV--FEDHLFWTDWEMKSIERCDKYTGKNCT 779 (1416)
Q Consensus 718 -~~I~~~~ldG~~r~~v-~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~I~~~nk~tG~~~~ 779 (1416)
+.|..++......... +... .+ .|+++++ .+.++|+++...+.|..++..+.....
T Consensus 230 ~~~v~~id~~~~~v~~~~~~~~-----~~-~~~~v~~~p~g~~~yv~~~~~~~V~vid~~~~~v~~ 289 (381)
T COG3391 230 SNNVLKIDTATGNVTATDLPVG-----SG-APRGVAVDPAGKAAYVANSQGGTVSVIDGATDRVVK 289 (381)
T ss_pred CceEEEEeCCCceEEEeccccc-----cC-CCCceeECCCCCEEEEEecCCCeEEEEeCCCCceee
Confidence 5888888765544433 2211 25 7888887 467888888888888888765554433
No 23
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.74 E-value=4.9e-09 Score=81.07 Aligned_cols=35 Identities=54% Similarity=1.259 Sum_probs=21.6
Q ss_pred cCCCcEEecCCeEecccccccCCCCCCCCCCCCCC
Q psy6572 31 CRPGYFKCDNNKCILSSHTCNNINDCGDGSDEADC 65 (1416)
Q Consensus 31 c~~~~f~C~n~~ci~~~~~Cdg~~dC~d~sDE~~C 65 (1416)
|++++|+|.++.||+..|+|||..||+|||||.+|
T Consensus 1 C~~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~C 35 (35)
T cd00112 1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35 (35)
T ss_pred CCCCeEEcCCCCeeCHHHcCCCccCCCCCcccccC
Confidence 44556666666666666666666666666666544
No 24
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=98.67 E-value=1.2e-05 Score=91.77 Aligned_cols=226 Identities=15% Similarity=0.126 Sum_probs=152.4
Q ss_pred EEEEEecCCcc-----eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC---CCCeEEee----cCCCceEEE
Q psy6572 549 YIREVTQAGVM-----TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN---SQPELLFP----ATSPDGLTV 616 (1416)
Q Consensus 549 ~I~~i~l~g~~-----~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~---s~~~~l~~----l~~p~gLAv 616 (1416)
.|+++.+++.. ..++..+.+|.-|++++..++||........+.|..+.++ +..+.|.. ...|.-|+|
T Consensus 17 gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsv 96 (346)
T COG2706 17 GIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSV 96 (346)
T ss_pred ceEEEEEeCcccccchhhhccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEE
Confidence 46666666433 3456678899999999999999999776333466666665 23333333 456699999
Q ss_pred EccCCcEEEeeCCCCeEEEeec--CCCceEEE---EcC-C-------CCCcceeeecCCcceEEEeeCCCCceEEEEecC
Q psy6572 617 DWVGRNLYWCDKGLDTIEVAKL--DGRFRKVL---INK-G-------LQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMD 683 (1416)
Q Consensus 617 D~~~~~LYwtD~~~~~I~v~~l--dG~~~~vL---i~~-~-------l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mD 683 (1416)
|..++.||.+.+..+.|.|..+ +|....++ ... . -..+....++|..++|+..|.|.. +|....++
T Consensus 97 d~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~D-ri~~y~~~ 175 (346)
T COG2706 97 DEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGTD-RIFLYDLD 175 (346)
T ss_pred CCCCCEEEEEEccCceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCCc-eEEEEEcc
Confidence 9999999999999999988876 46654432 111 1 112567889999999999998865 77776665
Q ss_pred -CCCCE---EEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCC--ceEEEEeccCCCCccc--ccceeEEE--e
Q psy6572 684 -GSNPK---VIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGE--NIKIIVSRRMDPTINL--HHVFALAV--F 753 (1416)
Q Consensus 684 -Gs~r~---vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~--~r~~v~~~~~~p~~~l--~~P~~lav--~ 753 (1416)
|.... ..+ ..-..|+-|++.+..+..|.+---+++|....+++. ..+.|..-...|.... ....+|.+ .
T Consensus 176 dg~L~~~~~~~v-~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~d 254 (346)
T COG2706 176 DGKLTPADPAEV-KPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPD 254 (346)
T ss_pred cCcccccccccc-CCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCC
Confidence 33211 111 345679999999988899999988899999988773 3333332222222211 22334444 4
Q ss_pred cCcEEEeecCCCee--EEecccCCC
Q psy6572 754 EDHLFWTDWEMKSI--ERCDKYTGK 776 (1416)
Q Consensus 754 ~d~LYwtD~~~~~I--~~~nk~tG~ 776 (1416)
+.+||.++.+..+| ++++..+|.
T Consensus 255 GrFLYasNRg~dsI~~f~V~~~~g~ 279 (346)
T COG2706 255 GRFLYASNRGHDSIAVFSVDPDGGK 279 (346)
T ss_pred CCEEEEecCCCCeEEEEEEcCCCCE
Confidence 77999999998875 455555454
No 25
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) regulates cholesterol homeostasis in mammalian cells; its ligand, LDL, is the major cholesterol-carrying lipoprotein of plasma. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface. The internal dissociation of the LDLR from its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein with five regions. The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation. Similar domains have been found in other extracellular and membrane proteins. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and are predicted to form a beta-propeller structure. This region is critical for ligand release and recycling of the receptor. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.
LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor 2; these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins. This entry represents the LDLR class B (YWTD) repeat, the structure of which has been solved. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 2-4 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 2-4 from the first sequence repeat. The repeat is found in a variety of proteins, including the vitellogenin receptor from Drosophila melanogaster, the low-density lipoprotein (LDL) receptor, preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=98.67 E-value=3.7e-08 Score=79.67 Aligned_cols=41 Identities=44% Similarity=0.755 Sum_probs=39.5
Q ss_pred CcEEEeeCCCC-eEEEeecCCCceEEEEcCCCCCcceeeecC
Q psy6572 621 RNLYWCDKGLD-TIEVAKLDGRFRKVLINKGLQEPRGIALNP 661 (1416)
Q Consensus 621 ~~LYwtD~~~~-~I~v~~ldG~~~~vLi~~~l~~P~gIavDp 661 (1416)
++|||||.+.+ +|++++|+|+.+++|+...+.+|+||||||
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 58999999999 999999999999999999999999999996
No 26
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.64 E-value=1.3e-08 Score=78.73 Aligned_cols=35 Identities=60% Similarity=1.397 Sum_probs=32.4
Q ss_pred CCCCCcccCCCceecCceeecCCCCCCCCCCCCCCCCCCC
Q psy6572 160 CSGDKFLCRNGNCILSRWRCDGDNDCNDGNDGLSSDEMNC 199 (1416)
Q Consensus 160 C~~~~f~C~~g~CI~~~~~CDg~~DC~Dg~d~~~sDE~~C 199 (1416)
|++.+|+|.+|+|||..|+|||..||.|| |||.+|
T Consensus 1 C~~~~f~C~~~~Ci~~~~~CDg~~DC~dg-----sDE~~C 35 (35)
T cd00112 1 CPPNEFRCANGRCIPSSWVCDGEDDCGDG-----SDEENC 35 (35)
T ss_pred CCCCeEEcCCCCeeCHHHcCCCccCCCCC-----cccccC
Confidence 56789999999999999999999999999 888876
No 27
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins also contain the repeat described in InterPro entry IPR001258.; PDB: 3QQZ_A.
Probab=98.62 E-value=3.3e-06 Score=95.30 Aligned_cols=193 Identities=13% Similarity=0.101 Sum_probs=114.4
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee---cCCCceEEEEccCCcEEE-eeCCCCeEEEeecC--
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP---ATSPDGLTVDWVGRNLYW-CDKGLDTIEVAKLD-- 639 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~---l~~p~gLAvD~~~~~LYw-tD~~~~~I~v~~ld-- 639 (1416)
..++.||+|++.+++||-+.... ..|+.+.+++.+.--+. ...++||++ +++..|+ ++...++|.++.++
T Consensus 21 ~~e~SGLTy~pd~~tLfaV~d~~--~~i~els~~G~vlr~i~l~g~~D~EgI~y--~g~~~~vl~~Er~~~L~~~~~~~~ 96 (248)
T PF06977_consen 21 LDELSGLTYNPDTGTLFAVQDEP--GEIYELSLDGKVLRRIPLDGFGDYEGITY--LGNGRYVLSEERDQRLYIFTIDDD 96 (248)
T ss_dssp -S-EEEEEEETTTTEEEEEETTT--TEEEEEETT--EEEEEE-SS-SSEEEEEE---STTEEEEEETTTTEEEEEEE---
T ss_pred cCCccccEEcCCCCeEEEEECCC--CEEEEEcCCCCEEEEEeCCCCCCceeEEE--ECCCEEEEEEcCCCcEEEEEEecc
Confidence 34689999999999998876653 47888877643211111 678999998 4544554 56667888888873
Q ss_pred CCc--eEEEE--c---C--CCCCcceeeecCCcceEEEeeCCCCceEEEEec--CCCCCEEEee-------cCCCCCeeE
Q psy6572 640 GRF--RKVLI--N---K--GLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM--DGSNPKVIIS-------KNLSWPNAL 701 (1416)
Q Consensus 640 G~~--~~vLi--~---~--~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m--DGs~r~vlv~-------~~l~~P~gL 701 (1416)
+.. +..+. . . .-..-.|||.|+.+++||++--....+|+.+.. .+....+... ..+..|.+|
T Consensus 97 ~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l 176 (248)
T PF06977_consen 97 TTSLDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGL 176 (248)
T ss_dssp -TT--EEEEEEEE---S---SS--EEEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EE
T ss_pred ccccchhhceEEecccccCCCcceEEEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccce
Confidence 222 11111 1 1 122358999999999999986433335777766 3333333222 136679999
Q ss_pred EeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCC--CcccccceeEEEe-cCcEEEeec
Q psy6572 702 TISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDP--TINLHHVFALAVF-EDHLFWTDW 762 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p--~~~l~~P~~lav~-~d~LYwtD~ 762 (1416)
++|+.++.||+.......|..++.+|.-+..+.-..+.. ...+++|-|||+. ++.||++.-
T Consensus 177 ~~~p~t~~lliLS~es~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE 240 (248)
T PF06977_consen 177 SYDPRTGHLLILSDESRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE 240 (248)
T ss_dssp EEETTTTEEEEEETTTTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET
T ss_pred EEcCCCCeEEEEECCCCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC
Confidence 999999999999999999999999998666543322110 1236789999995 679999875
No 28
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=98.57 E-value=5.9e-05 Score=87.12 Aligned_cols=234 Identities=13% Similarity=0.034 Sum_probs=145.4
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee---cCCCceEEEEccCCcE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP---ATSPDGLTVDWVGRNL 623 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~---l~~p~gLAvD~~~~~L 623 (1416)
..|+.+++.+.. ...+.....+..+++++.++.||.+.... +.|+.+++.+. +.+.. ...|.++++++.+..|
T Consensus 53 ~~v~~~d~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~--~~l~~~d~~~~-~~~~~~~~~~~~~~~~~~~dg~~l 129 (300)
T TIGR03866 53 DTIQVIDLATGEVIGTLPSGPDPELFALHPNGKILYIANEDD--NLVTVIDIETR-KVLAEIPVGVEPEGMAVSPDGKIV 129 (300)
T ss_pred CeEEEEECCCCcEEEeccCCCCccEEEECCCCCEEEEEcCCC--CeEEEEECCCC-eEEeEeeCCCCcceEEECCCCCEE
Confidence 456666665433 22222233466788888888888876543 37887777632 22222 3458899999877777
Q ss_pred EEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC-EEEee------cCCC
Q psy6572 624 YWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP-KVIIS------KNLS 696 (1416)
Q Consensus 624 YwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r-~vlv~------~~l~ 696 (1416)
+.+......+.+.++........+ .....|+++++.|...+||++... ...|...++..... ..+.. ....
T Consensus 130 ~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~~s~dg~~l~~~~~~-~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~ 207 (300)
T TIGR03866 130 VNTSETTNMAHFIDTKTYEIVDNV-LVDQRPRFAEFTADGKELWVSSEI-GGTVSVIDVATRKVIKKITFEIPGVHPEAV 207 (300)
T ss_pred EEEecCCCeEEEEeCCCCeEEEEE-EcCCCccEEEECCCCCEEEEEcCC-CCEEEEEEcCcceeeeeeeecccccccccC
Confidence 766655455666666543322111 122468999999998888776422 23677777754322 21111 1123
Q ss_pred CCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCCCeeEEecccC
Q psy6572 697 WPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEMKSIERCDKYT 774 (1416)
Q Consensus 697 ~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~I~~~nk~t 774 (1416)
+|.+|++++...++|++....+.|..+++........+.. -..|++|++ .+.+||.+....+.|...+..+
T Consensus 208 ~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~~-------~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~ 280 (300)
T TIGR03866 208 QPVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLLV-------GQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAA 280 (300)
T ss_pred CccceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEe-------CCCcceEEECCCCCEEEEEcCCCCeEEEEECCC
Confidence 5788999987888899877777888888754333222221 124667776 4567888877788899998888
Q ss_pred CCceEEEEeCCCCCCeeeeee
Q psy6572 775 GKNCTSVVKNLVHKPMDLRVY 795 (1416)
Q Consensus 775 G~~~~~l~~~~~~~p~~I~v~ 795 (1416)
+..+..+. ....|++|++-
T Consensus 281 ~~~~~~~~--~~~~~~~~~~~ 299 (300)
T TIGR03866 281 LKVIKSIK--VGRLPWGVVVR 299 (300)
T ss_pred CcEEEEEE--cccccceeEeC
Confidence 77666664 34678888764
No 29
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=98.57 E-value=1.5e-06 Score=105.01 Aligned_cols=150 Identities=17% Similarity=0.258 Sum_probs=109.3
Q ss_pred cCCCceEEEEccCCcEEEeeCC-----------C-CeEEEeec---CCCc-eEEEEcCCCCCcceeeecCCcceEEEeeC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKG-----------L-DTIEVAKL---DGRF-RKVLINKGLQEPRGIALNPAYGYMYWTDW 671 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~-----------~-~~I~v~~l---dG~~-~~vLi~~~l~~P~gIavDp~~g~LYWtD~ 671 (1416)
+..|.+||+|. .++||+++.. . .+|.++.- ||.. ...++..++..|+||++.+. | ||+++.
T Consensus 13 ~~~P~~ia~d~-~G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~~-G-lyV~~~ 89 (367)
T TIGR02604 13 LRNPIAVCFDE-RGRLWVAEGITYSRPAGRQGPLGDRILILEDADGDGKYDKSNVFAEELSMVTGLAVAVG-G-VYVATP 89 (367)
T ss_pred cCCCceeeECC-CCCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCCCCCcceeEEeecCCCCccceeEecC-C-EEEeCC
Confidence 78999999996 5779999842 2 37877764 4655 33455578999999999875 5 999973
Q ss_pred CCCceEEEE-ecCCC-----CCEEEeecC-------CCCCeeEEeecCCCeEEEecCC-------------------CCe
Q psy6572 672 GQNAHIGKA-KMDGS-----NPKVIISKN-------LSWPNALTISYETNELFWGDAH-------------------EDY 719 (1416)
Q Consensus 672 g~~~~I~ra-~mDGs-----~r~vlv~~~-------l~~P~gLaiD~~~~rLYWtD~~-------------------~~~ 719 (1416)
+.|.+. ..+|. .+++|++.- ...|++|++++ .++||+++.. .+.
T Consensus 90 ---~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gp-DG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~ 165 (367)
T TIGR02604 90 ---PDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGP-DGWLYFNHGNTLASKVTRPGTSDESRQGLGGG 165 (367)
T ss_pred ---CeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECC-CCCEEEecccCCCceeccCCCccCcccccCce
Confidence 467776 44442 345555421 23388999986 7899998762 157
Q ss_pred EEEEeCCCCceEEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeEEec
Q psy6572 720 IAVSDLNGENIKIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIERCD 771 (1416)
Q Consensus 720 I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~~n 771 (1416)
|++++.+|+..+++..+ +.+|++|++ ..+.||++|.......+++
T Consensus 166 i~r~~pdg~~~e~~a~G-------~rnp~Gl~~d~~G~l~~tdn~~~~~~~i~ 211 (367)
T TIGR02604 166 LFRYNPDGGKLRVVAHG-------FQNPYGHSVDSWGDVFFCDNDDPPLCRVT 211 (367)
T ss_pred EEEEecCCCeEEEEecC-------cCCCccceECCCCCEEEEccCCCceeEEc
Confidence 99999999888766543 899999999 4788999998666555554
No 30
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.56 E-value=2.6e-08 Score=77.03 Aligned_cols=35 Identities=46% Similarity=1.237 Sum_probs=29.9
Q ss_pred CC-CCCCCccceeecCCCCccccCCCCeeecCCCcee
Q psy6572 808 CE-NNGGCQGLCLLKPNGHRQCACPDNFILESDGKTC 843 (1416)
Q Consensus 808 C~-~NggCshlCl~~p~~~~~C~Cp~g~~L~~d~~tC 843 (1416)
|+ +||||+|+|+..|+ +|+|+||.||+|++|+++|
T Consensus 1 C~~~NGgC~h~C~~~~g-~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 1 CSVNNGGCSHICVNTPG-SYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp CTTGGGGSSSEEEEETT-SEEEE-STTEEE-TTSSSE
T ss_pred CCCCCCCcCCCCccCCC-ceEeECCCCCEECcCCCCC
Confidence 44 68999999999975 7999999999999999987
No 31
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=98.56 E-value=4e-05 Score=87.67 Aligned_cols=220 Identities=17% Similarity=0.157 Sum_probs=146.6
Q ss_pred cceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCe----EEeecC----------CCceEEEEccCCcEEEeeCCCC
Q psy6572 568 NAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPE----LLFPAT----------SPDGLTVDWVGRNLYWCDKGLD 631 (1416)
Q Consensus 568 ~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~----~l~~l~----------~p~gLAvD~~~~~LYwtD~~~~ 631 (1416)
.|.-|++|..++.||.+.... +.|.+..++ +... ++.... .+...-+++.++.|+.+|-+..
T Consensus 90 ~p~yvsvd~~g~~vf~AnY~~--g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~D 167 (346)
T COG2706 90 PPCYVSVDEDGRFVFVANYHS--GSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGTD 167 (346)
T ss_pred CCeEEEECCCCCEEEEEEccC--ceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCCc
Confidence 457899999999999999885 477777775 2221 111122 2567889999999999999999
Q ss_pred eEEEeecC-CCce--EEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCC-CEEEee---------cCCCCC
Q psy6572 632 TIEVAKLD-GRFR--KVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSN-PKVIIS---------KNLSWP 698 (1416)
Q Consensus 632 ~I~v~~ld-G~~~--~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~-r~vlv~---------~~l~~P 698 (1416)
+|.+.+++ |... .........-||-|+++|...+.|.+.- .+..|.+..+++.. +...+. .+-.|-
T Consensus 168 ri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~E-L~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~ 246 (346)
T COG2706 168 RIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNE-LNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWA 246 (346)
T ss_pred eEEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEec-cCCEEEEEEEcCCCceEEEeeeeccCccccCCCCce
Confidence 99999986 3321 1111145567999999999999999873 34588888888752 211111 124555
Q ss_pred eeEEeecCCCeEEEecCCCCeEEEEe--CCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCCCe--eEEecc
Q psy6572 699 NALTISYETNELFWGDAHEDYIAVSD--LNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEMKS--IERCDK 772 (1416)
Q Consensus 699 ~gLaiD~~~~rLYWtD~~~~~I~~~~--ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~--I~~~nk 772 (1416)
.+|.|.+..+.||.++.+.+.|..+. .+|.....+.... +....|.++.+ .+++|+.+..++.. |+++++
T Consensus 247 aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~----teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~d~ 322 (346)
T COG2706 247 AAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLELVGITP----TEGQFPRDFNINPSGRFLIAANQKSDNITVFERDK 322 (346)
T ss_pred eEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEEEEEEec----cCCcCCccceeCCCCCEEEEEccCCCcEEEEEEcC
Confidence 68999999999999999988776655 4555544443322 11334655554 67889998777664 677788
Q ss_pred cCCCceEEEEeCCCCCCeeeee
Q psy6572 773 YTGKNCTSVVKNLVHKPMDLRV 794 (1416)
Q Consensus 773 ~tG~~~~~l~~~~~~~p~~I~v 794 (1416)
.+|.-..+........|+-|++
T Consensus 323 ~TG~L~~~~~~~~~p~Pvcv~f 344 (346)
T COG2706 323 ETGRLTLLGRYAVVPEPVCVKF 344 (346)
T ss_pred CCceEEecccccCCCCcEEEEE
Confidence 8776443332223345555443
No 32
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=98.49 E-value=5e-06 Score=98.84 Aligned_cols=224 Identities=18% Similarity=0.159 Sum_probs=135.7
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecCCCC-eEEee--------cCCCceEEEEc---cCCcEEEeeCCC---
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQP-ELLFP--------ATSPDGLTVDW---VGRNLYWCDKGL--- 630 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~-~~l~~--------l~~p~gLAvD~---~~~~LYwtD~~~--- 630 (1416)
|.+|.+|+|.+. ++||+++.. ++|+++..++.. ..+.. ...+-||||++ .++.||++-...
T Consensus 1 L~~P~~~a~~pd-G~l~v~e~~---G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t~~~~~ 76 (331)
T PF07995_consen 1 LNNPRSMAFLPD-GRLLVAERS---GRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYTNADED 76 (331)
T ss_dssp ESSEEEEEEETT-SCEEEEETT---TEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEEEE-TS
T ss_pred CCCceEEEEeCC-CcEEEEeCC---ceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEEcccCC
Confidence 467899999986 788998874 488888855222 22222 34567899998 467888876532
Q ss_pred -----CeEEEeecCCC-----ceEEEEc------CCCCCcceeeecCCcceEEEeeC--C----------CCceEEEEec
Q psy6572 631 -----DTIEVAKLDGR-----FRKVLIN------KGLQEPRGIALNPAYGYMYWTDW--G----------QNAHIGKAKM 682 (1416)
Q Consensus 631 -----~~I~v~~ldG~-----~~~vLi~------~~l~~P~gIavDp~~g~LYWtD~--g----------~~~~I~ra~m 682 (1416)
.+|.+..++.. ..++|+. .....-.+|+++|. |+|||+-- + ...+|.|++.
T Consensus 77 ~~~~~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgpD-G~LYvs~G~~~~~~~~~~~~~~~G~ilri~~ 155 (331)
T PF07995_consen 77 GGDNDNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGPD-GKLYVSVGDGGNDDNAQDPNSLRGKILRIDP 155 (331)
T ss_dssp SSSEEEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-TT-SEEEEEEB-TTTGGGGCSTTSSTTEEEEEET
T ss_pred CCCcceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCCC-CcEEEEeCCCCCcccccccccccceEEEecc
Confidence 47888887654 2344443 13455678999996 69999841 1 1258999999
Q ss_pred CCCC------------CEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC--CCCceE-------------EEEe
Q psy6572 683 DGSN------------PKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL--NGENIK-------------IIVS 735 (1416)
Q Consensus 683 DGs~------------r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l--dG~~r~-------------~v~~ 735 (1416)
||+. ...|+..+|..|.+|++|+.+++||.+|.+....+.+++ .|.+-- .+..
T Consensus 156 dG~~p~dnP~~~~~~~~~~i~A~GlRN~~~~~~d~~tg~l~~~d~G~~~~dein~i~~G~nYGWP~~~~~~~~~~~~~~~ 235 (331)
T PF07995_consen 156 DGSIPADNPFVGDDGADSEIYAYGLRNPFGLAFDPNTGRLWAADNGPDGWDEINRIEPGGNYGWPYCEGGPKYSGPPIGD 235 (331)
T ss_dssp TSSB-TTSTTTTSTTSTTTEEEE--SEEEEEEEETTTTEEEEEEE-SSSSEEEEEE-TT-B--TTTBSSSCSTTSS-ECT
T ss_pred cCcCCCCCccccCCCceEEEEEeCCCccccEEEECCCCcEEEEccCCCCCcEEEEeccCCcCCCCCCcCCCCCCCCcccc
Confidence 9982 345667799999999999988999999976554443331 222100 0000
Q ss_pred cc-----CCCCcc---cccceeEEEe--------cCcEEEeecCCCeeEEecccCCCc---eEEEEeCCCC-CCeeeeee
Q psy6572 736 RR-----MDPTIN---LHHVFALAVF--------EDHLFWTDWEMKSIERCDKYTGKN---CTSVVKNLVH-KPMDLRVY 795 (1416)
Q Consensus 736 ~~-----~~p~~~---l~~P~~lav~--------~d~LYwtD~~~~~I~~~nk~tG~~---~~~l~~~~~~-~p~~I~v~ 795 (1416)
.. ..|... -..|.+|+++ .+.+|++++..++|.++....+.. ...++ ..+. +|.+|++-
T Consensus 236 ~~~~~~~~~P~~~~~~~~ap~G~~~y~g~~fp~~~g~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~r~~~v~~~ 314 (331)
T PF07995_consen 236 APSCPGFVPPVFAYPPHSAPTGIIFYRGSAFPEYRGDLFVADYGGGRIWRLDLDEDGSVTEEEEFL-GGFGGRPRDVAQG 314 (331)
T ss_dssp GSS-TTS---SEEETTT--EEEEEEE-SSSSGGGTTEEEEEETTTTEEEEEEEETTEEEEEEEEEC-TTSSS-EEEEEEE
T ss_pred ccCCCCcCccceeecCccccCceEEECCccCccccCcEEEecCCCCEEEEEeeecCCCccceEEcc-ccCCCCceEEEEc
Confidence 00 001111 1357888887 567999999999999998654432 22222 2233 56666653
No 33
>KOG1520|consensus
Probab=98.48 E-value=7.8e-06 Score=94.86 Aligned_cols=140 Identities=24% Similarity=0.400 Sum_probs=104.2
Q ss_pred ccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee------cCCCceEEEEccCCcEEEeeCCC----CeEEE
Q psy6572 567 TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP------ATSPDGLTVDWVGRNLYWCDKGL----DTIEV 635 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~------l~~p~gLAvD~~~~~LYwtD~~~----~~I~v 635 (1416)
..|.||+|+..++.||++|... -++.+... ...+.+.. +.-..+|.||. ++.|||||+.. ..+..
T Consensus 115 GRPLGl~f~~~ggdL~VaDAYl---GL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~ 190 (376)
T KOG1520|consen 115 GRPLGIRFDKKGGDLYVADAYL---GLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVF 190 (376)
T ss_pred CCcceEEeccCCCeEEEEecce---eeEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccccccchhheEE
Confidence 6799999999999999999985 47777777 33333333 55678999999 99999999875 22333
Q ss_pred eecCC-------------CceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC---EEEeecCCCCCe
Q psy6572 636 AKLDG-------------RFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP---KVIISKNLSWPN 699 (1416)
Q Consensus 636 ~~ldG-------------~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r---~vlv~~~l~~P~ 699 (1416)
+-|.| +..+||+ .+|.-|+||||.|...++.+++... .+|.|..+.|... .+++..--..|.
T Consensus 191 a~l~g~~~GRl~~YD~~tK~~~VLl-d~L~F~NGlaLS~d~sfvl~~Et~~-~ri~rywi~g~k~gt~EvFa~~LPG~PD 268 (376)
T KOG1520|consen 191 AALEGDPTGRLFRYDPSTKVTKVLL-DGLYFPNGLALSPDGSFVLVAETTT-ARIKRYWIKGPKAGTSEVFAEGLPGYPD 268 (376)
T ss_pred eeecCCCccceEEecCcccchhhhh-hcccccccccCCCCCCEEEEEeecc-ceeeeeEecCCccCchhhHhhcCCCCCc
Confidence 33333 2233443 7899999999999999999998654 4899999999876 666665567899
Q ss_pred eEEeecCCCeEEEec
Q psy6572 700 ALTISYETNELFWGD 714 (1416)
Q Consensus 700 gLaiD~~~~rLYWtD 714 (1416)
-|..+ .++. ||+-
T Consensus 269 NIR~~-~~G~-fWVa 281 (376)
T KOG1520|consen 269 NIRRD-STGH-FWVA 281 (376)
T ss_pred ceeEC-CCCC-EEEE
Confidence 99997 3444 5553
No 34
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.44 E-value=1.4e-07 Score=72.06 Aligned_cols=32 Identities=53% Similarity=1.289 Sum_probs=28.3
Q ss_pred cCCCceecCCCceecCCCCCCCCCCCCCCCCC
Q psy6572 1012 CGPDYIRCDTGRCIPKTWQCDGDVDCPNREDE 1043 (1416)
Q Consensus 1012 C~~~~f~C~~g~Ci~~~~~CDg~~DC~dgsDE 1043 (1416)
|...+|+|.++.|||..|+|||.+||+|||||
T Consensus 2 C~~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CPPGEFQCDNGRCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CCCCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence 55668999999999999999999999999998
No 35
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.42 E-value=1.6e-07 Score=71.79 Aligned_cols=32 Identities=53% Similarity=1.165 Sum_probs=29.9
Q ss_pred ccCCeeecCCCceeccccccCCCCCCCCCCCc
Q psy6572 293 CMEGYFKCLNGRCLLENYYCDGENDCGDNSDE 324 (1416)
Q Consensus 293 C~~~~f~C~~g~CI~~~~~CDg~~DC~DgSDE 324 (1416)
|...+|+|.++.||+..|+|||..||.|||||
T Consensus 2 C~~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CPPGEFQCDNGRCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CCCCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence 55679999999999999999999999999998
No 36
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=98.36 E-value=0.00038 Score=80.37 Aligned_cols=226 Identities=14% Similarity=0.055 Sum_probs=133.7
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCC-CCe-EEeecCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS-QPE-LLFPATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s-~~~-~l~~l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.... ...+.....+.++++++.++.||.+.... ..|+.+++.+ ... .+.....+..+++++.++.||
T Consensus 11 ~~v~~~d~~t~~~~~~~~~~~~~~~l~~~~dg~~l~~~~~~~--~~v~~~d~~~~~~~~~~~~~~~~~~~~~~~~g~~l~ 88 (300)
T TIGR03866 11 NTISVIDTATLEVTRTFPVGQRPRGITLSKDGKLLYVCASDS--DTIQVIDLATGEVIGTLPSGPDPELFALHPNGKILY 88 (300)
T ss_pred CEEEEEECCCCceEEEEECCCCCCceEECCCCCEEEEEECCC--CeEEEEECCCCcEEEeccCCCCccEEEECCCCCEEE
Confidence 345666654433 22223334567899998888888886653 3777777763 222 122245578899998888899
Q ss_pred EeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEee
Q psy6572 625 WCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTIS 704 (1416)
Q Consensus 625 wtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD 704 (1416)
.+....++|.+.++........+ .....|.+|+++|...+|+.+... ...+...++........+. .-..|..++++
T Consensus 89 ~~~~~~~~l~~~d~~~~~~~~~~-~~~~~~~~~~~~~dg~~l~~~~~~-~~~~~~~d~~~~~~~~~~~-~~~~~~~~~~s 165 (300)
T TIGR03866 89 IANEDDNLVTVIDIETRKVLAEI-PVGVEPEGMAVSPDGKIVVNTSET-TNMAHFIDTKTYEIVDNVL-VDQRPRFAEFT 165 (300)
T ss_pred EEcCCCCeEEEEECCCCeEEeEe-eCCCCcceEEECCCCCEEEEEecC-CCeEEEEeCCCCeEEEEEE-cCCCccEEEEC
Confidence 98877788999998764322112 223458999999987777666533 2234444444322111111 22467889998
Q ss_pred cCCCeEEEecCCCCeEEEEeCCCCce-EEEEeccCCCCcccccceeEEEe--cCcEEEeecCCCeeEEecccCCCce
Q psy6572 705 YETNELFWGDAHEDYIAVSDLNGENI-KIIVSRRMDPTINLHHVFALAVF--EDHLFWTDWEMKSIERCDKYTGKNC 778 (1416)
Q Consensus 705 ~~~~rLYWtD~~~~~I~~~~ldG~~r-~~v~~~~~~p~~~l~~P~~lav~--~d~LYwtD~~~~~I~~~nk~tG~~~ 778 (1416)
+...+||++-...+.|...++..... +.+.............|.++++. +.++|++....+.|...+..++...
T Consensus 166 ~dg~~l~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~ 242 (300)
T TIGR03866 166 ADGKELWVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVL 242 (300)
T ss_pred CCCCEEEEEcCCCCEEEEEEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcCCCCeEEEEECCCCcEE
Confidence 77777777755567888888754332 22211100000012246667663 4567887776777877776555443
No 37
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=98.30 E-value=4.5e-05 Score=86.24 Aligned_cols=162 Identities=11% Similarity=0.086 Sum_probs=90.7
Q ss_pred EEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC--CC---CeEEe--e-------cCCCc
Q psy6572 549 YIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQ---PELLF--P-------ATSPD 612 (1416)
Q Consensus 549 ~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~---~~~l~--~-------l~~p~ 612 (1416)
.|..++++|.. .+.+.+...+.||+|- .++++.+++...+ .|+.+.++ +. ...+. . -...+
T Consensus 45 ~i~els~~G~vlr~i~l~g~~D~EgI~y~-g~~~~vl~~Er~~--~L~~~~~~~~~~~~~~~~~~~~~l~~~~~~N~G~E 121 (248)
T PF06977_consen 45 EIYELSLDGKVLRRIPLDGFGDYEGITYL-GNGRYVLSEERDQ--RLYIFTIDDDTTSLDRADVQKISLGFPNKGNKGFE 121 (248)
T ss_dssp EEEEEETT--EEEEEE-SS-SSEEEEEE--STTEEEEEETTTT--EEEEEEE----TT--EEEEEEEE---S---SS--E
T ss_pred EEEEEcCCCCEEEEEeCCCCCCceeEEEE-CCCEEEEEEcCCC--cEEEEEEeccccccchhhceEEecccccCCCcceE
Confidence 34555555544 4455667788999983 4455555554433 67777774 11 11111 1 23469
Q ss_pred eEEEEccCCcEEEeeCCCC-eEEEeec--CCCceEEEEc-------CCCCCcceeeecCCcceEEEeeCCCCceEEEEec
Q psy6572 613 GLTVDWVGRNLYWCDKGLD-TIEVAKL--DGRFRKVLIN-------KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM 682 (1416)
Q Consensus 613 gLAvD~~~~~LYwtD~~~~-~I~v~~l--dG~~~~vLi~-------~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m 682 (1416)
|||+|+.++.||++-.... .|+..+. .+....+... ..+..|.+|++||.+|.||+... ...+|...+.
T Consensus 122 Gla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~-es~~l~~~d~ 200 (248)
T PF06977_consen 122 GLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSD-ESRLLLELDR 200 (248)
T ss_dssp EEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEEEEEET-TTTEEEEE-T
T ss_pred EEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeEEEEEC-CCCeEEEECC
Confidence 9999999999999866544 5666665 2222222221 24677999999999999999974 4568999999
Q ss_pred CCCCCEEEeec--------CCCCCeeEEeecCCCeEEEecC
Q psy6572 683 DGSNPKVIISK--------NLSWPNALTISYETNELFWGDA 715 (1416)
Q Consensus 683 DGs~r~vlv~~--------~l~~P~gLaiD~~~~rLYWtD~ 715 (1416)
+|..+..+.-. .+..|-|||+|. +++||++.-
T Consensus 201 ~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~-~G~LYIvsE 240 (248)
T PF06977_consen 201 QGRVVSSLSLDRGFHGLSKDIPQPEGIAFDP-DGNLYIVSE 240 (248)
T ss_dssp T--EEEEEE-STTGGG-SS---SEEEEEE-T-T--EEEEET
T ss_pred CCCEEEEEEeCCcccCcccccCCccEEEECC-CCCEEEEcC
Confidence 99865544322 367899999995 889999863
No 38
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=98.27 E-value=0.00033 Score=83.03 Aligned_cols=218 Identities=13% Similarity=0.101 Sum_probs=131.2
Q ss_pred EEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeec---------cCCCccEEEEecCCCCe--EEee--------c
Q psy6572 549 YIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDV---------TMHGSSIRRSCNNSQPE--LLFP--------A 608 (1416)
Q Consensus 549 ~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~---------~~~~~~I~r~~l~s~~~--~l~~--------l 608 (1416)
.|..|+.+... .-.+.....|.++ +.+.++.||++.. ..+ .|..+++.+... .|.. .
T Consensus 28 ~v~ViD~~~~~v~g~i~~G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d--~V~v~D~~t~~~~~~i~~p~~p~~~~~ 104 (352)
T TIGR02658 28 QVYTIDGEAGRVLGMTDGGFLPNPV-VASDGSFFAHASTVYSRIARGKRTD--YVEVIDPQTHLPIADIELPEGPRFLVG 104 (352)
T ss_pred eEEEEECCCCEEEEEEEccCCCcee-ECCCCCEEEEEeccccccccCCCCC--EEEEEECccCcEEeEEccCCCchhhcc
Confidence 57777766554 1122233456666 8888999999988 433 677776662211 1211 3
Q ss_pred CCCceEEEEccCCcEEEeeCC-CCeEEEeec------------------------------CCCceEEEE---------c
Q psy6572 609 TSPDGLTVDWVGRNLYWCDKG-LDTIEVAKL------------------------------DGRFRKVLI---------N 648 (1416)
Q Consensus 609 ~~p~gLAvD~~~~~LYwtD~~-~~~I~v~~l------------------------------dG~~~~vLi---------~ 648 (1416)
..|..++|.+-++.||+++.. .+.|.++++ ||+..++-+ +
T Consensus 105 ~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp~~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~~~~~~ 184 (352)
T TIGR02658 105 TYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVPDCYHIFPTANDTFFMHCRDGSLAKVGYGTKGNPKIKP 184 (352)
T ss_pred CccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEEEEeCCCCcEEEEecCCccEEEeecCceEEEEecCCCceEEee
Confidence 345599999999999998865 455555444 333333111 0
Q ss_pred CC---------CCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee-----cC----CCCCee---EEeecCC
Q psy6572 649 KG---------LQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS-----KN----LSWPNA---LTISYET 707 (1416)
Q Consensus 649 ~~---------l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~-----~~----l~~P~g---LaiD~~~ 707 (1416)
.. +.+| ++-+..|.++|+... ..|..+++.|.....+.. .. --.|-| ++++...
T Consensus 185 ~~vf~~~~~~v~~rP---~~~~~dg~~~~vs~e--G~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg 259 (352)
T TIGR02658 185 TEVFHPEDEYLINHP---AYSNKSGRLVWPTYT--GKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRAR 259 (352)
T ss_pred eeeecCCccccccCC---ceEcCCCcEEEEecC--CeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCC
Confidence 00 1233 223334556665543 579999987765433322 11 113555 9999999
Q ss_pred CeEEEec-CC--------CCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEe--cC-cEEEeecCCCeeEEecccCC
Q psy6572 708 NELFWGD-AH--------EDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVF--ED-HLFWTDWEMKSIERCDKYTG 775 (1416)
Q Consensus 708 ~rLYWtD-~~--------~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~--~d-~LYwtD~~~~~I~~~nk~tG 775 (1416)
++||++- .. .+.|..++.....+...+.. -..|.+|++. +. +||.+++.++.|..++..++
T Consensus 260 ~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~v-------G~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~ 332 (352)
T TIGR02658 260 DRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIEL-------GHEIDSINVSQDAKPLLYALSTGDKTLYIFDAETG 332 (352)
T ss_pred CEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEeC-------CCceeeEEECCCCCeEEEEeCCCCCcEEEEECcCC
Confidence 9999953 22 25788888644333332221 2357777774 56 89999999999999998777
Q ss_pred CceEEE
Q psy6572 776 KNCTSV 781 (1416)
Q Consensus 776 ~~~~~l 781 (1416)
+....+
T Consensus 333 k~i~~i 338 (352)
T TIGR02658 333 KELSSV 338 (352)
T ss_pred eEEeee
Confidence 765554
No 39
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variability. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=98.23 E-value=7.4e-05 Score=86.88 Aligned_cols=151 Identities=21% Similarity=0.234 Sum_probs=98.7
Q ss_pred CCceEEEEccC-----CcEEEeeCCCCeEEEeecC-CCceEEEEcCCCC------------------CcceeeecC---C
Q psy6572 610 SPDGLTVDWVG-----RNLYWCDKGLDTIEVAKLD-GRFRKVLINKGLQ------------------EPRGIALNP---A 662 (1416)
Q Consensus 610 ~p~gLAvD~~~-----~~LYwtD~~~~~I~v~~ld-G~~~~vLi~~~l~------------------~P~gIavDp---~ 662 (1416)
....|+||... +.+|+||.+...|.|.++. |+.++++...-.. ...|||+.| .
T Consensus 62 ~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~~~~s~Rv~~~~~~~~p~~~~~~i~g~~~~~~dg~~gial~~~~~d 141 (287)
T PF03022_consen 62 FLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLATGKSWRVLHNSFSPDPDAGPFTIGGESFQWPDGIFGIALSPISPD 141 (287)
T ss_dssp GEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETTTTEEEEEETCGCTTS-SSEEEEETTEEEEETTSEEEEEE-TTSTT
T ss_pred ccceEEEEccCCCCcceEEEEeCCCcCcEEEEEccCCcEEEEecCCcceeccccceeccCceEecCCCccccccCCCCCC
Confidence 44679999855 5899999999999999986 4556665432111 135678876 4
Q ss_pred cceEEEeeCCCCceEEEEecC----CCC--------CEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCC---
Q psy6572 663 YGYMYWTDWGQNAHIGKAKMD----GSN--------PKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNG--- 727 (1416)
Q Consensus 663 ~g~LYWtD~g~~~~I~ra~mD----Gs~--------r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG--- 727 (1416)
.++|||.-.... +++++... .+. ....+........|+++|. ++.||+++...+.|.+.+.++
T Consensus 142 ~r~LYf~~lss~-~ly~v~T~~L~~~~~~~~~~~~~~v~~lG~k~~~s~g~~~D~-~G~ly~~~~~~~aI~~w~~~~~~~ 219 (287)
T PF03022_consen 142 GRWLYFHPLSSR-KLYRVPTSVLRDPSLSDAQALASQVQDLGDKGSQSDGMAIDP-NGNLYFTDVEQNAIGCWDPDGPYT 219 (287)
T ss_dssp S-EEEEEETT-S-EEEEEEHHHHCSTT--HHH-HHHT-EEEEE---SECEEEEET-TTEEEEEECCCTEEEEEETTTSB-
T ss_pred ccEEEEEeCCCC-cEEEEEHHHhhCccccccccccccceeccccCCCCceEEECC-CCcEEEecCCCCeEEEEeCCCCcC
Confidence 468999875433 56666442 211 1122222234457999997 999999999999999999988
Q ss_pred -CceEEEEeccCCCCcccccceeEEEec---CcEEEeecCCCe
Q psy6572 728 -ENIKIIVSRRMDPTINLHHVFALAVFE---DHLFWTDWEMKS 766 (1416)
Q Consensus 728 -~~r~~v~~~~~~p~~~l~~P~~lav~~---d~LYwtD~~~~~ 766 (1416)
.+.++|+... ..+..|.+|++.. ++||++..+..+
T Consensus 220 ~~~~~~l~~d~----~~l~~pd~~~i~~~~~g~L~v~snrl~~ 258 (287)
T PF03022_consen 220 PENFEILAQDP----RTLQWPDGLKIDPEGDGYLWVLSNRLQR 258 (287)
T ss_dssp GCCEEEEEE-C----C-GSSEEEEEE-T--TS-EEEEE-S--S
T ss_pred ccchheeEEcC----ceeeccceeeeccccCceEEEEECcchH
Confidence 5667777654 2488999999977 899998754443
No 40
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=98.18 E-value=1.1e-06 Score=68.05 Aligned_cols=35 Identities=40% Similarity=1.087 Sum_probs=25.5
Q ss_pred cCCCCCCcccccceecCCceEEeeCCCceecCCCCCcc
Q psy6572 456 NVSHGGQLCAHECIDLKIGYKCACRKGYQVHPEDKHLC 493 (1416)
Q Consensus 456 ~~~~~~~~Cs~~C~nt~~gy~C~C~~Gy~L~p~d~~tC 493 (1416)
+.++++ |+|+|++++++|+|+|++||+|. .|+++|
T Consensus 2 ~~~NGg--C~h~C~~~~g~~~C~C~~Gy~L~-~D~~tC 36 (36)
T PF14670_consen 2 SVNNGG--CSHICVNTPGSYRCSCPPGYKLA-EDGRTC 36 (36)
T ss_dssp TTGGGG--SSSEEEEETTSEEEE-STTEEE--TTSSSE
T ss_pred CCCCCC--cCCCCccCCCceEeECCCCCEEC-cCCCCC
Confidence 344566 88888888888888888888888 677765
No 41
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family is unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=98.18 E-value=0.00038 Score=84.95 Aligned_cols=153 Identities=16% Similarity=0.196 Sum_probs=106.7
Q ss_pred eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee---------cCCCceEEEEcc------CCcE
Q psy6572 560 TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP---------ATSPDGLTVDWV------GRNL 623 (1416)
Q Consensus 560 ~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~---------l~~p~gLAvD~~------~~~L 623 (1416)
++++.+|..|.+|+|.+ .++||++.... ++|+++... ...+++.. ...+-|||+++. ++.|
T Consensus 23 ~~va~GL~~Pw~maflP-DG~llVtER~~--G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~~l 99 (454)
T TIGR03606 23 KVLLSGLNKPWALLWGP-DNQLWVTERAT--GKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNPYV 99 (454)
T ss_pred EEEECCCCCceEEEEcC-CCeEEEEEecC--CEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCcEE
Confidence 45677899999999987 46889888742 378888655 22222211 355789999854 4679
Q ss_pred EEeeC---------CCCeEEEeecCCC-----ceEEEEcC----CCCCcceeeecCCcceEEEee--CCC----------
Q psy6572 624 YWCDK---------GLDTIEVAKLDGR-----FRKVLINK----GLQEPRGIALNPAYGYMYWTD--WGQ---------- 673 (1416)
Q Consensus 624 YwtD~---------~~~~I~v~~ldG~-----~~~vLi~~----~l~~P~gIavDp~~g~LYWtD--~g~---------- 673 (1416)
|++-. ...+|.|+.++.. ..++|+.. ....-..|+++|. |+||++- .+.
T Consensus 100 Yvsyt~~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgPD-G~LYVs~GD~g~~~~~n~~~~~ 178 (454)
T TIGR03606 100 YISYTYKNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGPD-GKIYYTIGEQGRNQGANFFLPN 178 (454)
T ss_pred EEEEeccCCCCCccCCcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECCC-CcEEEEECCCCCCCcccccCcc
Confidence 98742 2457988888632 13444431 1233567999986 7899973 210
Q ss_pred -------------------CceEEEEecCCCC----------CEEEeecCCCCCeeEEeecCCCeEEEecCCC
Q psy6572 674 -------------------NAHIGKAKMDGSN----------PKVIISKNLSWPNALTISYETNELFWGDAHE 717 (1416)
Q Consensus 674 -------------------~~~I~ra~mDGs~----------r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~ 717 (1416)
..+|.|++.||+. +..|...+++.|.||++|+ +++||.+|.+.
T Consensus 179 ~aQ~~~~~~~~~~~d~~~~~GkILRin~DGsiP~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~Gp 250 (454)
T TIGR03606 179 QAQHTPTQQELNGKDYHAYMGKVLRLNLDGSIPKDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQGP 250 (454)
T ss_pred hhccccccccccccCcccCceEEEEEcCCCCCCCCCCccCCCcceEEEEeccccceeEECC-CCCEEEEecCC
Confidence 1379999999973 3467788999999999998 89999998654
No 42
>KOG1520|consensus
Probab=98.14 E-value=2.4e-05 Score=90.91 Aligned_cols=147 Identities=18% Similarity=0.279 Sum_probs=104.0
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcC----CCCCcceeeecCCcceEEEeeCCCC---ceEEEE
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINK----GLQEPRGIALNPAYGYMYWTDWGQN---AHIGKA 680 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~----~l~~P~gIavDp~~g~LYWtD~g~~---~~I~ra 680 (1416)
-..|-|||++..++.||++|...+ +.+++..|..-+.+... .+.-.+++.|++ +|.|||||.... ..+.-+
T Consensus 114 CGRPLGl~f~~~ggdL~VaDAYlG-L~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~a 191 (376)
T KOG1520|consen 114 CGRPLGIRFDKKGGDLYVADAYLG-LLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVFA 191 (376)
T ss_pred cCCcceEEeccCCCeEEEEeccee-eEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEeccccccchhheEEe
Confidence 478999999999999999998764 55666666654444432 345578999999 899999996542 123334
Q ss_pred ecCCC-----------CCE-EEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCce---EEEEeccCCCCcccc
Q psy6572 681 KMDGS-----------NPK-VIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENI---KIIVSRRMDPTINLH 745 (1416)
Q Consensus 681 ~mDGs-----------~r~-vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r---~~v~~~~~~p~~~l~ 745 (1416)
-|.|. .+. .++..+|..||||++.+....|.+++....+|.+.-+.|... .+++.+- -.
T Consensus 192 ~l~g~~~GRl~~YD~~tK~~~VLld~L~F~NGlaLS~d~sfvl~~Et~~~ri~rywi~g~k~gt~EvFa~~L------PG 265 (376)
T KOG1520|consen 192 ALEGDPTGRLFRYDPSTKVTKVLLDGLYFPNGLALSPDGSFVLVAETTTARIKRYWIKGPKAGTSEVFAEGL------PG 265 (376)
T ss_pred eecCCCccceEEecCcccchhhhhhcccccccccCCCCCCEEEEEeeccceeeeeEecCCccCchhhHhhcC------CC
Confidence 44441 111 133457999999999999999999999999999999888766 5555532 34
Q ss_pred cceeEEEecCcEEEeec
Q psy6572 746 HVFALAVFEDHLFWTDW 762 (1416)
Q Consensus 746 ~P~~lav~~d~LYwtD~ 762 (1416)
.|.-|...+.==||.-.
T Consensus 266 ~PDNIR~~~~G~fWVal 282 (376)
T KOG1520|consen 266 YPDNIRRDSTGHFWVAL 282 (376)
T ss_pred CCcceeECCCCCEEEEE
Confidence 57777775544455443
No 43
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=98.13 E-value=2e-06 Score=90.25 Aligned_cols=67 Identities=36% Similarity=0.682 Sum_probs=58.4
Q ss_pred CCceeec--ceE-eccceecCCcCCCCCCCCCCCCCCCCccCCCceecCC-C---ceecCCCCCCCCCC---CCCCCCCC
Q psy6572 975 ANEFQCD--VKC-ISLALVCDKVFDCLDRSDEPADCTSQTCGPDYIRCDT-G---RCIPKTWQCDGDVD---CPNREDEP 1044 (1416)
Q Consensus 975 ~~~f~C~--~~C-i~~~~~CDg~~dC~d~sDE~~~C~~~~C~~~~f~C~~-g---~Ci~~~~~CDg~~D---C~dgsDE~ 1044 (1416)
.+.|+|- .+= |+.+.+.|++-||+|||||+ ....|+...|+|.| | +-||.++|-||+=| |=|||||.
T Consensus 35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEP---GTsAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CCDGSDE~ 111 (176)
T PF12999_consen 35 NGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEP---GTSACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICCDGSDES 111 (176)
T ss_pred CCceEecCCCCceecHHHccCcceeCCCCCCcc---ccccCcCceEeeccCCCCCceeehhhhcCCcCcccccCCCCCCC
Confidence 4579997 334 99999999999999999996 35568888999987 3 78999999999999 99999995
No 44
>KOG4499|consensus
Probab=98.12 E-value=0.0002 Score=77.38 Aligned_cols=198 Identities=15% Similarity=0.224 Sum_probs=118.8
Q ss_pred eeeeeecCCCeEEEeeccCCCccEEEEecCC------------CCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEe
Q psy6572 570 VGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS------------QPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVA 636 (1416)
Q Consensus 570 ~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s------------~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~ 636 (1416)
.|..++..++.|||+|.... .|.|..... ....++. ...|...||- -+ .+....
T Consensus 18 Egp~w~~~~~sLl~VDi~ag--~v~r~D~~qn~v~ra~ie~p~~ag~ilpv~~~~q~~~v~----------~G-~kf~i~ 84 (310)
T KOG4499|consen 18 EGPHWDVERQSLLYVDIEAG--EVHRYDIEQNKVYRAKIEGPPSAGFILPVEGGPQEFAVG----------CG-SKFVIV 84 (310)
T ss_pred CCCceEEecceEEEEEeccC--ceehhhhhhhheEEEEEecCcceeEEEEecCCCceEEEe----------ec-ceEEEE
Confidence 56778888999999998753 555543331 1112222 3444444432 11 234445
Q ss_pred ecCCCceEEEEcCC---------CCCcceeeecCCcceEEEeeCCCC-------ceEEEEecCCCCCEEEeecCCCCCee
Q psy6572 637 KLDGRFRKVLINKG---------LQEPRGIALNPAYGYMYWTDWGQN-------AHIGKAKMDGSNPKVIISKNLSWPNA 700 (1416)
Q Consensus 637 ~ldG~~~~vLi~~~---------l~~P~gIavDp~~g~LYWtD~g~~-------~~I~ra~mDGs~r~vlv~~~l~~P~g 700 (1416)
+++|....+++... -.+-+.=-|||..++ |.-..... ....+..+-|..... +-..+.-|||
T Consensus 85 nwd~~~~~a~v~~t~~ev~~d~kknR~NDgkvdP~Gry-y~GtMad~~~~le~~~g~Ly~~~~~h~v~~-i~~~v~IsNg 162 (310)
T KOG4499|consen 85 NWDGVSESAKVYRTLFEVQPDRKKNRLNDGKVDPDGRY-YGGTMADFGDDLEPIGGELYSWLAGHQVEL-IWNCVGISNG 162 (310)
T ss_pred EcccccceeeeeeeccccCchHHhcccccCccCCCCce-eeeeeccccccccccccEEEEeccCCCcee-eehhccCCcc
Confidence 56665433322211 112344567777444 54321111 122233444433333 3346677899
Q ss_pred EEeecCCCeEEEecCCCCeEEEEeCC-----CCceEEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeEEecccC
Q psy6572 701 LTISYETNELFWGDAHEDYIAVSDLN-----GENIKIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIERCDKYT 774 (1416)
Q Consensus 701 LaiD~~~~rLYWtD~~~~~I~~~~ld-----G~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~~nk~t 774 (1416)
|+-|.....+|++|+.+-.|...++| -++|++|+.-.......-.-|.|+++ .++.||++-|+.++|++++..+
T Consensus 163 l~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~~V~~~dp~t 242 (310)
T KOG4499|consen 163 LAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGGTVQKVDPTT 242 (310)
T ss_pred ccccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCcEEEEECCCC
Confidence 99999999999999999899666643 26778776532100111345789998 4799999999999999999999
Q ss_pred CCceEEEE
Q psy6572 775 GKNCTSVV 782 (1416)
Q Consensus 775 G~~~~~l~ 782 (1416)
|+....+.
T Consensus 243 GK~L~eik 250 (310)
T KOG4499|consen 243 GKILLEIK 250 (310)
T ss_pred CcEEEEEE
Confidence 98766554
No 45
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=98.08 E-value=5.1e-05 Score=90.30 Aligned_cols=150 Identities=18% Similarity=0.239 Sum_probs=101.4
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEc------CCCCCcceeeecC---CcceEEEeeCC------
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLIN------KGLQEPRGIALNP---AYGYMYWTDWG------ 672 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~------~~l~~P~gIavDp---~~g~LYWtD~g------ 672 (1416)
|..|.+||+.+. +.||+++. .++|.++..+|.....+.. .....+.|||++| .+++||++-..
T Consensus 1 L~~P~~~a~~pd-G~l~v~e~-~G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t~~~~~~~ 78 (331)
T PF07995_consen 1 LNNPRSMAFLPD-GRLLVAER-SGRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYTNADEDGG 78 (331)
T ss_dssp ESSEEEEEEETT-SCEEEEET-TTEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEEEE-TSSS
T ss_pred CCCceEEEEeCC-CcEEEEeC-CceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEEcccCCCC
Confidence 568999999986 79999998 8999999988876332221 3445678999999 36777776431
Q ss_pred -CCceEEEEecCCC-----CCEEEeec------CCCCCeeEEeecCCCeEEEec-------------CCCCeEEEEeCCC
Q psy6572 673 -QNAHIGKAKMDGS-----NPKVIISK------NLSWPNALTISYETNELFWGD-------------AHEDYIAVSDLNG 727 (1416)
Q Consensus 673 -~~~~I~ra~mDGs-----~r~vlv~~------~l~~P~gLaiD~~~~rLYWtD-------------~~~~~I~~~~ldG 727 (1416)
...+|.|..++.. .+++|+.. ...+..+|++++ .++|||+- ...++|.+++.+|
T Consensus 79 ~~~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~ilri~~dG 157 (331)
T PF07995_consen 79 DNDNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKILRIDPDG 157 (331)
T ss_dssp SEEEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEEEEEETTS
T ss_pred CcceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCC-CCcEEEEeCCCCCcccccccccccceEEEecccC
Confidence 1247899888765 23444432 233446799986 67999982 1246899999999
Q ss_pred C-------------ceEEEEeccCCCCcccccceeEEEec--CcEEEeecCCCee
Q psy6572 728 E-------------NIKIIVSRRMDPTINLHHVFALAVFE--DHLFWTDWEMKSI 767 (1416)
Q Consensus 728 ~-------------~r~~v~~~~~~p~~~l~~P~~lav~~--d~LYwtD~~~~~I 767 (1416)
+ ..+++.. . +++|++|++.. +.||.+|.+....
T Consensus 158 ~~p~dnP~~~~~~~~~~i~A~-G------lRN~~~~~~d~~tg~l~~~d~G~~~~ 205 (331)
T PF07995_consen 158 SIPADNPFVGDDGADSEIYAY-G------LRNPFGLAFDPNTGRLWAADNGPDGW 205 (331)
T ss_dssp SB-TTSTTTTSTTSTTTEEEE---------SEEEEEEEETTTTEEEEEEE-SSSS
T ss_pred cCCCCCccccCCCceEEEEEe-C------CCccccEEEECCCCcEEEEccCCCCC
Confidence 7 3444443 3 89999999964 7899998665443
No 46
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=98.08 E-value=3.2e-06 Score=88.69 Aligned_cols=74 Identities=38% Similarity=0.650 Sum_probs=62.5
Q ss_pred CceecCCCCce-ecCcccccCCCCCCCCCCCCCCcccCCCCCCCceeeCCCC---CeEcCcccccCCCC---CCCCCCCc
Q psy6572 210 NVFQCDNNKTC-ISKSWVCDGTYDCTDRSDENSTYCAHSECNLFEFRCNSTG---QCIPITWVCDGVTD---CIDKSDEH 282 (1416)
Q Consensus 210 ~~F~C~~~~~C-I~~~w~CDg~~DC~D~sDE~~~~c~~~~C~~~~F~C~~~~---~CI~~~w~CDG~~D---C~DgsDE~ 282 (1416)
+.|+|-++.+= ||.+.+.|+.-||.|||||.. ...|+.+.|.|.+.| +-||.++|=||.=| |=|||||.
T Consensus 36 ~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPG----TsAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CCDGSDE~ 111 (176)
T PF12999_consen 36 GKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPG----TSACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICCDGSDES 111 (176)
T ss_pred CceEecCCCCceecHHHccCcceeCCCCCCccc----cccCcCceEeeccCCCCCceeehhhhcCCcCcccccCCCCCCC
Confidence 67999655555 999999999999999999976 457888899996554 79999999999999 99999994
Q ss_pred cccCCc
Q psy6572 283 HSQDCL 288 (1416)
Q Consensus 283 ~~~~C~ 288 (1416)
. ..|+
T Consensus 112 ~-~~C~ 116 (176)
T PF12999_consen 112 G-GKCP 116 (176)
T ss_pred C-CCCc
Confidence 3 4464
No 47
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.06 E-value=6.5e-06 Score=66.59 Aligned_cols=42 Identities=43% Similarity=0.881 Sum_probs=37.2
Q ss_pred EEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC
Q psy6572 645 VLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 645 vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r 687 (1416)
+|+...+..|+|||+||..++|||+|+.. ..|++++|+|+++
T Consensus 2 ~~~~~~~~~~~~la~d~~~~~lYw~D~~~-~~I~~~~~~g~~~ 43 (43)
T smart00135 2 TLLSEGLGHPNGLAVDWIEGRLYWTDWGL-DVIEVANLDGTNR 43 (43)
T ss_pred EEEECCCCCcCEEEEeecCCEEEEEeCCC-CEEEEEeCCCCCC
Confidence 45557899999999999999999999987 5999999999864
No 48
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.06 E-value=7.5e-06 Score=66.19 Aligned_cols=42 Identities=33% Similarity=0.657 Sum_probs=38.1
Q ss_pred EEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCce
Q psy6572 689 VIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENI 730 (1416)
Q Consensus 689 vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r 730 (1416)
+++...+..|+|||+|+.+++|||+|+....|++++++|+++
T Consensus 2 ~~~~~~~~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~g~~~ 43 (43)
T smart00135 2 TLLSEGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43 (43)
T ss_pred EEEECCCCCcCEEEEeecCCEEEEEeCCCCEEEEEeCCCCCC
Confidence 456668999999999999999999999999999999999763
No 49
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.03 E-value=0.00038 Score=78.18 Aligned_cols=192 Identities=14% Similarity=0.145 Sum_probs=128.0
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecCCCC-eEEee--cCCCceEEEEccCCcEEE-eeCCCCeEEEeecCCC
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQP-ELLFP--ATSPDGLTVDWVGRNLYW-CDKGLDTIEVAKLDGR 641 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~-~~l~~--l~~p~gLAvD~~~~~LYw-tD~~~~~I~v~~ldG~ 641 (1416)
..++.+|+|++.++.||-+-.. ...|..+...++. .++.- +..|++|++ ++++.|. +|....++++..++-.
T Consensus 85 ~~nvS~LTynp~~rtLFav~n~--p~~iVElt~~GdlirtiPL~g~~DpE~Iey--ig~n~fvi~dER~~~l~~~~vd~~ 160 (316)
T COG3204 85 TANVSSLTYNPDTRTLFAVTNK--PAAIVELTKEGDLIRTIPLTGFSDPETIEY--IGGNQFVIVDERDRALYLFTVDAD 160 (316)
T ss_pred cccccceeeCCCcceEEEecCC--CceEEEEecCCceEEEecccccCChhHeEE--ecCCEEEEEehhcceEEEEEEcCC
Confidence 4568899999999999987544 3467777666432 22221 788998875 7888887 4556678888877644
Q ss_pred ceEEEEc-------CC---CCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeec----C----CCCCeeEEe
Q psy6572 642 FRKVLIN-------KG---LQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK----N----LSWPNALTI 703 (1416)
Q Consensus 642 ~~~vLi~-------~~---l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~----~----l~~P~gLai 703 (1416)
.....+. .. -.--.|||-||..++||++--...-+|+...+.-+...+-+.. . +..-.||.+
T Consensus 161 t~~~~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr~P~~I~~~~~~~~~l~~~~~~~~~~~~~~f~~DvSgl~~ 240 (316)
T COG3204 161 TTVISAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKERNPIGIFEVTQSPSSLSVHASLDPTADRDLFVLDVSGLEF 240 (316)
T ss_pred ccEEeccceEEeccccCCCCcCceeeecCCCCceEEEEEccCCcEEEEEecCCcccccccccCcccccceEeecccccee
Confidence 3221111 11 1224699999999999998754444677766544222211111 1 334569999
Q ss_pred ecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCC--CcccccceeEEEe-cCcEEEee
Q psy6572 704 SYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDP--TINLHHVFALAVF-EDHLFWTD 761 (1416)
Q Consensus 704 D~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p--~~~l~~P~~lav~-~d~LYwtD 761 (1416)
|..++.|++.-...+.|..++.+|.-+..+.-..+.. ...+++|-|||+. ++.||++.
T Consensus 241 ~~~~~~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g~lYIvS 301 (316)
T COG3204 241 NAITNSLLVLSDESRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMDDDGNLYIVS 301 (316)
T ss_pred cCCCCcEEEEecCCceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEECCCCCEEEEe
Confidence 9999999999989999999999998766554322211 2347889999995 67888764
No 50
>PRK04792 tolB translocation protein TolB; Provisional
Probab=98.02 E-value=0.00079 Score=83.62 Aligned_cols=224 Identities=9% Similarity=0.012 Sum_probs=139.0
Q ss_pred EEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeecC-CCceEEEEccCCcEEE
Q psy6572 549 YIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPAT-SPDGLTVDWVGRNLYW 625 (1416)
Q Consensus 549 ~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l~-~p~gLAvD~~~~~LYw 625 (1416)
.|..++.+|.. ..+...........+.+.+++|+|+........|+++.+. +..+.+..+. ....+++.+.++.|++
T Consensus 199 ~l~i~d~dG~~~~~l~~~~~~~~~p~wSPDG~~La~~s~~~g~~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~ 278 (448)
T PRK04792 199 QLMIADYDGYNEQMLLRSPEPLMSPAWSPDGRKLAYVSFENRKAEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLAL 278 (448)
T ss_pred EEEEEeCCCCCceEeecCCCcccCceECCCCCEEEEEEecCCCcEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEE
Confidence 56666777765 4444444455678888999999887654433578888887 4444454422 2346788888888988
Q ss_pred eeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEee-CCCCceEEEEecCCCCCEEEeecCCCCCeeEE
Q psy6572 626 CDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTD-WGQNAHIGKAKMDGSNPKVIISKNLSWPNALT 702 (1416)
Q Consensus 626 tD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD-~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLa 702 (1416)
+... ...|++.++++...+.|.. ........++.|...+|+++- .+..+.|+++++++...+.|.. ...+..+.+
T Consensus 279 ~~~~~g~~~Iy~~dl~tg~~~~lt~-~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~-~g~~~~~~~ 356 (448)
T PRK04792 279 VLSKDGQPEIYVVDIATKALTRITR-HRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTF-EGEQNLGGS 356 (448)
T ss_pred EEeCCCCeEEEEEECCCCCeEECcc-CCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEec-CCCCCcCee
Confidence 6443 3469999998776555543 223455677888888887764 3345689999998765544432 222334567
Q ss_pred eecCCCeEEEecCCC--CeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCC--eeEEecccCCCce
Q psy6572 703 ISYETNELFWGDAHE--DYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMK--SIERCDKYTGKNC 778 (1416)
Q Consensus 703 iD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~--~I~~~nk~tG~~~ 778 (1416)
+.+..+.||++.... ..|+.+++++...+.|.... ....| +++-.+.+|+++....+ .|+.++ .+|...
T Consensus 357 ~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~~~lt~~~-----~d~~p-s~spdG~~I~~~~~~~g~~~l~~~~-~~G~~~ 429 (448)
T PRK04792 357 ITPDGRSMIMVNRTNGKFNIARQDLETGAMQVLTSTR-----LDESP-SVAPNGTMVIYSTTYQGKQVLAAVS-IDGRFK 429 (448)
T ss_pred ECCCCCEEEEEEecCCceEEEEEECCCCCeEEccCCC-----CCCCc-eECCCCCEEEEEEecCCceEEEEEE-CCCCce
Confidence 877778888876543 37888998887766554321 01223 34445667777654333 255444 355544
Q ss_pred EEE
Q psy6572 779 TSV 781 (1416)
Q Consensus 779 ~~l 781 (1416)
+.+
T Consensus 430 ~~l 432 (448)
T PRK04792 430 ARL 432 (448)
T ss_pred EEC
Confidence 433
No 51
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.02 E-value=0.0015 Score=80.98 Aligned_cols=215 Identities=13% Similarity=0.009 Sum_probs=135.7
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeec-CCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPA-TSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l-~~p~gLAvD~~~~~LY 624 (1416)
..|+.++.+|.. ..+......+..+.+.+.+++|+++........|+++.+. +..+.|... ....+.++.+.++.|+
T Consensus 182 ~~l~~~d~dg~~~~~lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la 261 (435)
T PRK05137 182 KRLAIMDQDGANVRYLTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVGNFPGMTFAPRFSPDGRKVV 261 (435)
T ss_pred eEEEEECCCCCCcEEEecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEeecCCCcccCcEECCCCCEEE
Confidence 467777777776 4444445567778888888888887654334579988888 444445442 2345667777788887
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe-eCCCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT-DWGQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt-D~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++-.. ...|++.++.+...+.|.. ........++.|...+|+++ +....+.|++++++|...+.|.... ..-..+
T Consensus 262 ~~~~~~g~~~Iy~~d~~~~~~~~Lt~-~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~-~~~~~~ 339 (435)
T PRK05137 262 MSLSQGGNTDIYTMDLRSGTTTRLTD-SPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGG-GRYSTP 339 (435)
T ss_pred EEEecCCCceEEEEECCCCceEEccC-CCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCC-CcccCe
Confidence 76543 3579999998876655543 22234556778877766655 4444568999999987766655422 122345
Q ss_pred EeecCCCeEEEecCCC--CeEEEEeCCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCC-----CeeEEecc
Q psy6572 702 TISYETNELFWGDAHE--DYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEM-----KSIERCDK 772 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~-----~~I~~~nk 772 (1416)
++.+..++|+++.... ..|..++++|...+.+.... ....+++ .+..||++-... ..|+.++.
T Consensus 340 ~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~~~lt~~~--------~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl 411 (435)
T PRK05137 340 VWSPRGDLIAFTKQGGGQFSIGVMKPDGSGERILTSGF--------LVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDL 411 (435)
T ss_pred EECCCCCEEEEEEcCCCceEEEEEECCCCceEeccCCC--------CCCCCeECCCCCEEEEEEccCCCCCcceEEEEEC
Confidence 6777778887775433 47888898887766554321 1222333 355676654322 35677764
No 52
>PRK04922 tolB translocation protein TolB; Provisional
Probab=97.93 E-value=0.0013 Score=81.34 Aligned_cols=221 Identities=15% Similarity=0.121 Sum_probs=135.5
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeec-CCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPA-TSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l-~~p~gLAvD~~~~~LY 624 (1416)
+.|+.++.+|.. ..+..+-.....+++.+.+++|+++........|+++.+. +..+.+..+ .....+++.+.+++|+
T Consensus 184 ~~l~i~D~~g~~~~~lt~~~~~v~~p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~~g~~~~~~~SpDG~~l~ 263 (433)
T PRK04922 184 YALQVADSDGYNPQTILRSAEPILSPAWSPDGKKLAYVSFERGRSAIYVQDLATGQRELVASFRGINGAPSFSPDGRRLA 263 (433)
T ss_pred EEEEEECCCCCCceEeecCCCccccccCCCCCCEEEEEecCCCCcEEEEEECCCCCEEEeccCCCCccCceECCCCCEEE
Confidence 357777777765 3444444456678888889899888655434578888888 444444442 2334677888888898
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe-eCCCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT-DWGQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt-D~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++-.. ...|++.++.+...+.|.. .......+++.|...+|+++ +....+.|+++++++...+.|.... .....+
T Consensus 264 ~~~s~~g~~~Iy~~d~~~g~~~~lt~-~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~ 341 (433)
T PRK04922 264 LTLSRDGNPEIYVMDLGSRQLTRLTN-HFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQG-NYNARA 341 (433)
T ss_pred EEEeCCCCceEEEEECCCCCeEECcc-CCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCC-CCccCE
Confidence 76433 3479999998776555442 22234567888887777665 4444468999999876655544322 334467
Q ss_pred EeecCCCeEEEecCCC--CeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecC--CCeeEEecccCCCc
Q psy6572 702 TISYETNELFWGDAHE--DYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWE--MKSIERCDKYTGKN 777 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~--~~~I~~~nk~tG~~ 777 (1416)
++.+..+.|+++.... ..|+.+++++...+.|.... ....| +++-.+.+|+++... ...|+.++. +|..
T Consensus 342 ~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~-----~~~~p-~~spdG~~i~~~s~~~g~~~L~~~~~-~g~~ 414 (433)
T PRK04922 342 SVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGS-----LDESP-SFAPNGSMVLYATREGGRGVLAAVST-DGRV 414 (433)
T ss_pred EECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCC-----CCCCc-eECCCCCEEEEEEecCCceEEEEEEC-CCCc
Confidence 8888788898875432 36888998776666543321 01222 233345566665442 234555543 3433
No 53
>PRK03629 tolB translocation protein TolB; Provisional
Probab=97.91 E-value=0.0035 Score=77.50 Aligned_cols=185 Identities=11% Similarity=0.076 Sum_probs=123.3
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.++.+|.. ..+..+......+.+.+.+++|.|+........|+.+.+. +..+.|.. -.....+++.+-+..|+
T Consensus 179 ~~l~~~d~dg~~~~~lt~~~~~~~~p~wSPDG~~la~~s~~~g~~~i~i~dl~~G~~~~l~~~~~~~~~~~~SPDG~~La 258 (429)
T PRK03629 179 YELRVSDYDGYNQFVVHRSPQPLMSPAWSPDGSKLAYVTFESGRSALVIQTLANGAVRQVASFPRHNGAPAFSPDGSKLA 258 (429)
T ss_pred eeEEEEcCCCCCCEEeecCCCceeeeEEcCCCCEEEEEEecCCCcEEEEEECCCCCeEEccCCCCCcCCeEECCCCCEEE
Confidence 368888888876 4444444456788999998888776544333578888887 44455544 23345678888899999
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEE-eeCCCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYW-TDWGQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYW-tD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++... ...|++.++++...+.|... .......+..|...+|++ ++.+..+.|++++++|...+.|.. .......+
T Consensus 259 ~~~~~~g~~~I~~~d~~tg~~~~lt~~-~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~-~~~~~~~~ 336 (429)
T PRK03629 259 FALSKTGSLNLYVMDLASGQIRQVTDG-RSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITW-EGSQNQDA 336 (429)
T ss_pred EEEcCCCCcEEEEEECCCCCEEEccCC-CCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeec-CCCCccCE
Confidence 97543 34799999987766555433 234567788888777755 454445689999999876655533 22233456
Q ss_pred EeecCCCeEEEecCC--CCeEEEEeCCCCceEEEE
Q psy6572 702 TISYETNELFWGDAH--EDYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~--~~~I~~~~ldG~~r~~v~ 734 (1416)
++.+..++|+++... ...|+.+++++...+.|.
T Consensus 337 ~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt 371 (429)
T PRK03629 337 DVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLT 371 (429)
T ss_pred EECCCCCEEEEEEccCCCceEEEEECCCCCeEEeC
Confidence 777767778776543 346888888877666554
No 54
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.91 E-value=0.0025 Score=78.28 Aligned_cols=220 Identities=13% Similarity=0.050 Sum_probs=134.2
Q ss_pred EEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEEE
Q psy6572 549 YIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLYW 625 (1416)
Q Consensus 549 ~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LYw 625 (1416)
.|+.++.+|.. ..+...-......++.+.+++|+|+........|+++.+. +..+.+.. -....++++.+.++.||+
T Consensus 171 ~l~~~d~~g~~~~~l~~~~~~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~~~~~~~~spDg~~l~~ 250 (417)
T TIGR02800 171 ELQVADYDGANPQTITRSREPILSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKVASFPGMNGAPAFSPDGSKLAV 250 (417)
T ss_pred eEEEEcCCCCCCEEeecCCCceecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEE
Confidence 47777777665 4444333345677888889999998765433478888887 33444443 233456778777788888
Q ss_pred eeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEee-CCCCceEEEEecCCCCCEEEeecCCCCCeeEE
Q psy6572 626 CDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTD-WGQNAHIGKAKMDGSNPKVIISKNLSWPNALT 702 (1416)
Q Consensus 626 tD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD-~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLa 702 (1416)
+... ...|++.++.+...+.|... .......++.|...+|+|+. .+..+.|+++++++...+.|.. .......++
T Consensus 251 ~~~~~~~~~i~~~d~~~~~~~~l~~~-~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~-~~~~~~~~~ 328 (417)
T TIGR02800 251 SLSKDGNPDIYVMDLDGKQLTRLTNG-PGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTF-RGGYNASPS 328 (417)
T ss_pred EECCCCCccEEEEECCCCCEEECCCC-CCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeec-CCCCccCeE
Confidence 7543 34799999887665554422 12233456777777777654 3445689999998776554443 334455677
Q ss_pred eecCCCeEEEecCCC--CeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCC-eeEEecccCCC
Q psy6572 703 ISYETNELFWGDAHE--DYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMK-SIERCDKYTGK 776 (1416)
Q Consensus 703 iD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~-~I~~~nk~tG~ 776 (1416)
+.+..++|+++.... ..|..+++++...+.+.... ....| +++..+.+|+++....+ .+..+...+|.
T Consensus 329 ~spdg~~i~~~~~~~~~~~i~~~d~~~~~~~~l~~~~-----~~~~p-~~spdg~~l~~~~~~~~~~~l~~~~~~g~ 399 (417)
T TIGR02800 329 WSPDGDLIAFVHREGGGFNIAVMDLDGGGERVLTDTG-----LDESP-SFAPNGRMILYATTRGGRGVLGLVSTDGR 399 (417)
T ss_pred ECCCCCEEEEEEccCCceEEEEEeCCCCCeEEccCCC-----CCCCc-eECCCCCEEEEEEeCCCcEEEEEEECCCc
Confidence 877778888886543 37888898876655554321 12222 33445667777665432 23333334444
No 55
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=97.89 E-value=0.001 Score=73.65 Aligned_cols=222 Identities=14% Similarity=0.132 Sum_probs=136.5
Q ss_pred EEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEee
Q psy6572 561 IRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAK 637 (1416)
Q Consensus 561 ~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ 637 (1416)
..+..-..+..|+.+. .+.|++++... +.|-+++.. ++.+++.- ...|.+|.+++ .+.++++|.+. .|.|++
T Consensus 56 fpvp~G~ap~dvapap-dG~VWft~qg~--gaiGhLdP~tGev~~ypLg~Ga~Phgiv~gp-dg~~Witd~~~-aI~R~d 130 (353)
T COG4257 56 FPVPNGSAPFDVAPAP-DGAVWFTAQGT--GAIGHLDPATGEVETYPLGSGASPHGIVVGP-DGSAWITDTGL-AIGRLD 130 (353)
T ss_pred eccCCCCCccccccCC-CCceEEecCcc--ccceecCCCCCceEEEecCCCCCCceEEECC-CCCeeEecCcc-eeEEec
Confidence 3344445566666654 56788887775 378887776 44443322 78899999996 67788999987 788777
Q ss_pred cCCCceEEEEc---CCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEec
Q psy6572 638 LDGRFRKVLIN---KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGD 714 (1416)
Q Consensus 638 ldG~~~~vLi~---~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD 714 (1416)
......+.+.. ..-..-...++|+. |+|+||... ..-.|.+-.-...+++-...-..|+||.+. .++.||++.
T Consensus 131 pkt~evt~f~lp~~~a~~nlet~vfD~~-G~lWFt~q~--G~yGrLdPa~~~i~vfpaPqG~gpyGi~at-pdGsvwyas 206 (353)
T COG4257 131 PKTLEVTRFPLPLEHADANLETAVFDPW-GNLWFTGQI--GAYGRLDPARNVISVFPAPQGGGPYGICAT-PDGSVWYAS 206 (353)
T ss_pred CcccceEEeecccccCCCcccceeeCCC-ccEEEeecc--ccceecCcccCceeeeccCCCCCCcceEEC-CCCcEEEEe
Confidence 64333222211 12223455778876 889888731 112233222222333333445679999997 489999999
Q ss_pred CCCCeEEEEe-CCCCceEEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeEEecccCCCceEEEEeCCCCCCeee
Q psy6572 715 AHEDYIAVSD-LNGENIKIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDL 792 (1416)
Q Consensus 715 ~~~~~I~~~~-ldG~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I 792 (1416)
...+.|.+++ ++| ..++|..-. .+-.....|-. .-++++.|+|.++++++++..+..-...-+...-.+|+++
T Consensus 207 lagnaiaridp~~~-~aev~p~P~----~~~~gsRriwsdpig~~wittwg~g~l~rfdPs~~sW~eypLPgs~arpys~ 281 (353)
T COG4257 207 LAGNAIARIDPFAG-HAEVVPQPN----ALKAGSRRIWSDPIGRAWITTWGTGSLHRFDPSVTSWIEYPLPGSKARPYSM 281 (353)
T ss_pred ccccceEEcccccC-CcceecCCC----cccccccccccCccCcEEEeccCCceeeEeCcccccceeeeCCCCCCCccee
Confidence 8889999998 566 333332211 11111223333 3578999999999999988665553333333455577777
Q ss_pred eeec
Q psy6572 793 RVYH 796 (1416)
Q Consensus 793 ~v~h 796 (1416)
.|..
T Consensus 282 rVD~ 285 (353)
T COG4257 282 RVDR 285 (353)
T ss_pred eecc
Confidence 7753
No 56
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family is unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=97.86 E-value=0.00069 Score=82.71 Aligned_cols=155 Identities=16% Similarity=0.186 Sum_probs=105.9
Q ss_pred eEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEE------c-CCCCCcceeeecCC------cceEEE
Q psy6572 603 ELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLI------N-KGLQEPRGIALNPA------YGYMYW 668 (1416)
Q Consensus 603 ~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi------~-~~l~~P~gIavDp~------~g~LYW 668 (1416)
++|++ |..|.+|++.+ .+.||+|....++|.+++.++...+++. . .+..-+.|||++|. +++||+
T Consensus 23 ~~va~GL~~Pw~maflP-DG~llVtER~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~~lYv 101 (454)
T TIGR03606 23 KVLLSGLNKPWALLWGP-DNQLWVTERATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNPYVYI 101 (454)
T ss_pred EEEECCCCCceEEEEcC-CCeEEEEEecCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCcEEEE
Confidence 45565 99999999987 4689999987789998876654333221 1 23556899999976 468888
Q ss_pred eeC-----C---CCceEEEEecCCC-----CCEEEeecC----CCCCeeEEeecCCCeEEEe--cC--------------
Q psy6572 669 TDW-----G---QNAHIGKAKMDGS-----NPKVIISKN----LSWPNALTISYETNELFWG--DA-------------- 715 (1416)
Q Consensus 669 tD~-----g---~~~~I~ra~mDGs-----~r~vlv~~~----l~~P~gLaiD~~~~rLYWt--D~-------------- 715 (1416)
+-. + ...+|.|+.++.. ..++|+... ...-..|++++ .++||++ |.
T Consensus 102 syt~~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgP-DG~LYVs~GD~g~~~~~n~~~~~~a 180 (454)
T TIGR03606 102 SYTYKNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGP-DGKIYYTIGEQGRNQGANFFLPNQA 180 (454)
T ss_pred EEeccCCCCCccCCcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECC-CCcEEEEECCCCCCCcccccCcchh
Confidence 731 1 2358999988732 234555421 12345788886 6789996 22
Q ss_pred ----------------CCCeEEEEeCCCCc----------eEEEEeccCCCCcccccceeEEEe-cCcEEEeecCCC
Q psy6572 716 ----------------HEDYIAVSDLNGEN----------IKIIVSRRMDPTINLHHVFALAVF-EDHLFWTDWEMK 765 (1416)
Q Consensus 716 ----------------~~~~I~~~~ldG~~----------r~~v~~~~~~p~~~l~~P~~lav~-~d~LYwtD~~~~ 765 (1416)
..++|.|++.||+- +..|.+.. +++|++|++. .+.||.+|.+..
T Consensus 181 Q~~~~~~~~~~~d~~~~~GkILRin~DGsiP~dNPf~~g~~~eIyA~G------~RNp~Gla~dp~G~Lw~~e~Gp~ 251 (454)
T TIGR03606 181 QHTPTQQELNGKDYHAYMGKVLRLNLDGSIPKDNPSINGVVSHIFTYG------HRNPQGLAFTPDGTLYASEQGPN 251 (454)
T ss_pred ccccccccccccCcccCceEEEEEcCCCCCCCCCCccCCCcceEEEEe------ccccceeEECCCCCEEEEecCCC
Confidence 13479999999862 23344443 8999999995 679999987654
No 57
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes an uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=97.84 E-value=0.0018 Score=73.22 Aligned_cols=209 Identities=17% Similarity=0.218 Sum_probs=131.0
Q ss_pred EEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC---C-CCeEEee---------cCCCceEEEEccCCc-----
Q psy6572 561 IRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN---S-QPELLFP---------ATSPDGLTVDWVGRN----- 622 (1416)
Q Consensus 561 ~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~---s-~~~~l~~---------l~~p~gLAvD~~~~~----- 622 (1416)
.+-.+|.++-||++.+ .+.++++|....-..+|-...+ . ...+++. ...|.||++.--+..
T Consensus 17 ~tDp~L~N~WGia~~p-~~~~WVadngT~~~TlYdg~~~~~~g~~~~L~vtiP~~~~~~~~~~PTGiVfN~~~~F~vt~~ 95 (336)
T TIGR03118 17 IVDPGLRNAWGLSYRP-GGPFWVANTGTGTATLYVGNPDTQPLVQDPLVVVIPAPPPLAAEGTPTGQVFNGSDTFVVSGE 95 (336)
T ss_pred ccCccccccceeEecC-CCCEEEecCCcceEEeecCCcccccCCccceEEEecCCCCCCCCCCccEEEEeCCCceEEcCC
Confidence 3445788999999977 4456666666542233333211 1 1122222 247889988743332
Q ss_pred --------EEEeeCCCCeEEEeec--CCC--ceE-EEEcC--CCCCcceeeecCC--cceEEEeeCCCCceEEEEecCCC
Q psy6572 623 --------LYWCDKGLDTIEVAKL--DGR--FRK-VLINK--GLQEPRGIALNPA--YGYMYWTDWGQNAHIGKAKMDGS 685 (1416)
Q Consensus 623 --------LYwtD~~~~~I~v~~l--dG~--~~~-vLi~~--~l~~P~gIavDp~--~g~LYWtD~g~~~~I~ra~mDGs 685 (1416)
||.|+. ++|.--+. +-. ... +++.. ....=+||||-.. ..+||-+|.. +.+|.+. |++
T Consensus 96 g~~~~a~Fif~tEd--GTisaW~p~v~~t~~~~~~~~~d~s~~gavYkGLAi~~~~~~~~LYaadF~-~g~IDVF--d~~ 170 (336)
T TIGR03118 96 GITGPSRFLFVTED--GTLSGWAPALGTTRMTRAEIVVDASQQGNVYKGLAVGPTGGGDYLYAANFR-QGRIDVF--KGS 170 (336)
T ss_pred CcccceeEEEEeCC--ceEEeecCcCCcccccccEEEEccCCCcceeeeeEEeecCCCceEEEeccC-CCceEEe--cCc
Confidence 333333 23333221 111 012 22321 2233467777644 6799999986 4588876 566
Q ss_pred CCEEEeecC--------CCCCeeEEeecCCCeEEEec-------------CCCCeEEEEeCCCCceEEEEeccCCCCccc
Q psy6572 686 NPKVIISKN--------LSWPNALTISYETNELFWGD-------------AHEDYIAVSDLNGENIKIIVSRRMDPTINL 744 (1416)
Q Consensus 686 ~r~vlv~~~--------l~~P~gLaiD~~~~rLYWtD-------------~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l 744 (1416)
.+++.+... -..|.+|.. ..++||++= ++++.|...+++|.-.+.+.++. .|
T Consensus 171 f~~~~~~g~F~DP~iPagyAPFnIqn--ig~~lyVtYA~qd~~~~d~v~G~G~G~VdvFd~~G~l~~r~as~g-----~L 243 (336)
T TIGR03118 171 FRPPPLPGSFIDPALPAGYAPFNVQN--LGGTLYVTYAQQDADRNDEVAGAGLGYVNVFTLNGQLLRRVASSG-----RL 243 (336)
T ss_pred cccccCCCCccCCCCCCCCCCcceEE--ECCeEEEEEEecCCcccccccCCCcceEEEEcCCCcEEEEeccCC-----cc
Confidence 655443322 234667765 589999983 23468999999999999998876 49
Q ss_pred ccceeEEE-------ecCcEEEeecCCCeeEEecccCCCceEEEE
Q psy6572 745 HHVFALAV-------FEDHLFWTDWEMKSIERCDKYTGKNCTSVV 782 (1416)
Q Consensus 745 ~~P~~lav-------~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~ 782 (1416)
..|.+|++ +.+.|.+-+.+.++|...+..+|..+-.|.
T Consensus 244 NaPWG~a~APa~FG~~sg~lLVGNFGDG~InaFD~~sG~~~g~L~ 288 (336)
T TIGR03118 244 NAPWGLAIAPESFGSLSGALLVGNFGDGTINAYDPQSGAQLGQLL 288 (336)
T ss_pred cCCceeeeChhhhCCCCCCeEEeecCCceeEEecCCCCceeeeec
Confidence 99999998 478899999999999999888888666655
No 58
>PRK02889 tolB translocation protein TolB; Provisional
Probab=97.79 E-value=0.0052 Score=75.96 Aligned_cols=186 Identities=14% Similarity=0.096 Sum_probs=118.8
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.++.+|.. ..+.........+++.+.+++|+++........|+...+. +..+.+.. -.....+++.+.++.|+
T Consensus 176 ~~L~~~D~dG~~~~~l~~~~~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la 255 (427)
T PRK02889 176 YQLQISDADGQNAQSALSSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVVANFKGSNSAPAWSPDGRTLA 255 (427)
T ss_pred cEEEEECCCCCCceEeccCCCCcccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEE
Confidence 467777777766 3333334455678888888888887654433568888888 44444443 22345677888888888
Q ss_pred EeeC--CCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe-eCCCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDK--GLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT-DWGQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~--~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt-D~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++-. +...|+++++++...+.|.. ........+..|...+|+++ +.+..+.|++++++|...+.+.... ......
T Consensus 256 ~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~g-~~~~~~ 333 (427)
T PRK02889 256 VALSRDGNSQIYTVNADGSGLRRLTQ-SSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFTG-SYNTSP 333 (427)
T ss_pred EEEccCCCceEEEEECCCCCcEECCC-CCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecCC-CCcCce
Confidence 7643 34578898888776555432 22233456788887777665 4444578999998876654444322 222346
Q ss_pred EeecCCCeEEEecCCC--CeEEEEeCCCCceEEEEe
Q psy6572 702 TISYETNELFWGDAHE--DYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~ 735 (1416)
++.+..+.|+++.... ..|+.+++++...+.+..
T Consensus 334 ~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~ 369 (427)
T PRK02889 334 RISPDGKLLAYISRVGGAFKLYVQDLATGQVTALTD 369 (427)
T ss_pred EECCCCCEEEEEEccCCcEEEEEEECCCCCeEEccC
Confidence 7777778888775433 368888887766665544
No 59
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN]; 'b': possibly beta-hydroxylated residue [DN]; 'a': aromatic amino acid; 'C': cysteine, involved in disulphide bond; 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.78 E-value=2.3e-05 Score=63.58 Aligned_cols=38 Identities=53% Similarity=1.072 Sum_probs=32.7
Q ss_pred cCCcCC--CCCcc--ceeeecCCeeeecCCCCcEEecCCCce
Q psy6572 495 DTNECL--DRPCS--HYCRNTLGSYSCSCAPGYALLSDKHGC 532 (1416)
Q Consensus 495 didEC~--~~~Cs--q~C~nt~gsy~C~C~~Gy~L~~dg~sC 532 (1416)
|||||+ ...|. +.|+|+.|+|+|.|++||++..++++|
T Consensus 1 DidEC~~~~~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 1 DIDECAEGPHNCPENGTCVNTEGSYSCSCPPGYELNDDGTTC 42 (42)
T ss_dssp ESSTTTTTSSSSSTTSEEEEETTEEEEEESTTEEECTTSSEE
T ss_pred CccccCCCCCcCCCCCEEEcCCCCEEeeCCCCcEECCCCCcC
Confidence 689998 46787 699999999999999999987777665
No 60
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=97.78 E-value=0.0033 Score=74.72 Aligned_cols=212 Identities=16% Similarity=0.136 Sum_probs=121.1
Q ss_pred CCCeEEEeeccC-C-CccEEEEecCC-CC-eEEeecCCCceEEEEccCCcEEEeeC---------CCCeEEEeecCCCc-
Q psy6572 577 VDNCLYWSDVTM-H-GSSIRRSCNNS-QP-ELLFPATSPDGLTVDWVGRNLYWCDK---------GLDTIEVAKLDGRF- 642 (1416)
Q Consensus 577 ~~~~LYwtD~~~-~-~~~I~r~~l~s-~~-~~l~~l~~p~gLAvD~~~~~LYwtD~---------~~~~I~v~~ldG~~- 642 (1416)
...++|++|... + .++|++++..+ +. .+|.....|+++ +.+-++.||++.. ..+.|.++++....
T Consensus 11 ~~~~v~V~d~~~~~~~~~v~ViD~~~~~v~g~i~~G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~ 89 (352)
T TIGR02658 11 DARRVYVLDPGHFAATTQVYTIDGEAGRVLGMTDGGFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLP 89 (352)
T ss_pred CCCEEEEECCcccccCceEEEEECCCCEEEEEEEccCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcE
Confidence 345677777652 0 03677766662 21 222226789997 9999999999999 77899999876543
Q ss_pred eEEEEc-C-----CCCCcceeeecCCcceEEEeeCCCCceEEEE------------------------------ecCCCC
Q psy6572 643 RKVLIN-K-----GLQEPRGIALNPAYGYMYWTDWGQNAHIGKA------------------------------KMDGSN 686 (1416)
Q Consensus 643 ~~vLi~-~-----~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra------------------------------~mDGs~ 686 (1416)
...|.. . ....|..++|.|..++||++++.....+.++ -.||+.
T Consensus 90 ~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp~~~~vy~t~e~~~~~~~~Dg~~ 169 (352)
T TIGR02658 90 IADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVPDCYHIFPTANDTFFMHCRDGSL 169 (352)
T ss_pred EeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEEEEeCCCCcEEEEecCCccEEEeecCce
Confidence 222321 1 2445669999999999999886522222222 234443
Q ss_pred CE---------EEeecCC---------CCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccC----CC--Cc
Q psy6572 687 PK---------VIISKNL---------SWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRM----DP--TI 742 (1416)
Q Consensus 687 r~---------vlv~~~l---------~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~----~p--~~ 742 (1416)
.+ ...+..+ .+| .+....+++||+... +.|+.+++.+........-.. .+ +-
T Consensus 170 ~~v~~d~~g~~~~~~~~vf~~~~~~v~~rP---~~~~~dg~~~~vs~e-G~V~~id~~~~~~~~~~~~~~~~~~~~~~~w 245 (352)
T TIGR02658 170 AKVGYGTKGNPKIKPTEVFHPEDEYLINHP---AYSNKSGRLVWPTYT-GKIFQIDLSSGDAKFLPAIEAFTEAEKADGW 245 (352)
T ss_pred EEEEecCCCceEEeeeeeecCCccccccCC---ceEcCCCcEEEEecC-CeEEEEecCCCcceecceeeecccccccccc
Confidence 33 1111111 333 222336788888776 999999987654443321110 00 00
Q ss_pred --ccccceeEEEecCcEEEeec-C--------CCeeEEecccCCCceEEEEeCCCCCCeeeeee
Q psy6572 743 --NLHHVFALAVFEDHLFWTDW-E--------MKSIERCDKYTGKNCTSVVKNLVHKPMDLRVY 795 (1416)
Q Consensus 743 --~l~~P~~lav~~d~LYwtD~-~--------~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~ 795 (1416)
.-.+|++++-.+++||++.. . .+.|..++..+++....+. .-..|.+|++-
T Consensus 246 rP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~--vG~~~~~iavS 307 (352)
T TIGR02658 246 RPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIE--LGHEIDSINVS 307 (352)
T ss_pred CCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEe--CCCceeeEEEC
Confidence 01123333345789999532 2 2478888877666555553 33456677664
No 61
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.77 E-value=0.0048 Score=76.30 Aligned_cols=221 Identities=12% Similarity=0.085 Sum_probs=134.3
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeec-CCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPA-TSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l-~~p~gLAvD~~~~~LY 624 (1416)
+.|..++.+|.. ..+...........+.+.+++|+++........|+++.+. +..+.|... .....+++.+.++.|+
T Consensus 179 ~~l~~~d~~g~~~~~l~~~~~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la 258 (430)
T PRK00178 179 YTLQRSDYDGARAVTLLQSREPILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLA 258 (430)
T ss_pred eEEEEECCCCCCceEEecCCCceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEE
Confidence 346677777766 4444444455777888888898777554333578888888 444444432 2233567777788888
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEee-CCCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTD-WGQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD-~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++-.. ...|++.++++...+.|.. ........+..|...+||++. ....+.|+++++++...+.+.... ......
T Consensus 259 ~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~-~~~~~~ 336 (430)
T PRK00178 259 FVLSKDGNPEIYVMDLASRQLSRVTN-HPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVG-NYNARP 336 (430)
T ss_pred EEEccCCCceEEEEECCCCCeEEccc-CCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCC-CCccce
Confidence 76543 3479999998876555542 222344566777777776654 334568999998876655444322 222345
Q ss_pred EeecCCCeEEEecCCC--CeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCC--CeeEEecccCCCc
Q psy6572 702 TISYETNELFWGDAHE--DYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEM--KSIERCDKYTGKN 777 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~--~~I~~~nk~tG~~ 777 (1416)
++.+..+.||++.... ..|..+++.+...+.|.... ....| .++-.+..|+++.... ..|+.++. +|..
T Consensus 337 ~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~~~-----~~~~p-~~spdg~~i~~~~~~~g~~~l~~~~~-~g~~ 409 (430)
T PRK00178 337 RLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTDTS-----LDESP-SVAPNGTMLIYATRQQGRGVLMLVSI-NGRV 409 (430)
T ss_pred EECCCCCEEEEEEccCCceEEEEEECCCCCEEEccCCC-----CCCCc-eECCCCCEEEEEEecCCceEEEEEEC-CCCc
Confidence 6777788888886533 36888898877766664422 01123 3333456777765433 23555543 3443
No 62
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.77 E-value=0.0069 Score=74.48 Aligned_cols=223 Identities=7% Similarity=-0.012 Sum_probs=136.2
Q ss_pred cEEEEEEecCCcc-eEEecccccceeeeeecCCCe-EEEeeccCCCccEEEEecC-CCCeEEeecCC-CceEEEEccCCc
Q psy6572 547 KYYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNC-LYWSDVTMHGSSIRRSCNN-SQPELLFPATS-PDGLTVDWVGRN 622 (1416)
Q Consensus 547 ~~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~-LYwtD~~~~~~~I~r~~l~-s~~~~l~~l~~-p~gLAvD~~~~~ 622 (1416)
.+.|..++.+|.. ..+..+ .......+.+..++ +|++........|+++++. +..+.|..... ....++.+-++.
T Consensus 168 ~~~l~~~d~dg~~~~~~~~~-~~~~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl~tg~~~~lt~~~g~~~~~~~SPDG~~ 246 (419)
T PRK04043 168 KSNIVLADYTLTYQKVIVKG-GLNIFPKWANKEQTAFYYTSYGERKPTLYKYNLYTGKKEKIASSQGMLVVSDVSKDGSK 246 (419)
T ss_pred cceEEEECCCCCceeEEccC-CCeEeEEECCCCCcEEEEEEccCCCCEEEEEECCCCcEEEEecCCCcEEeeEECCCCCE
Confidence 3578888888887 434433 24556777787775 7776655323589999988 55555554222 223445556778
Q ss_pred EEEeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEee-CCCCceEEEEecCCCCCEEEeecCCCCCe
Q psy6572 623 LYWCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTD-WGQNAHIGKAKMDGSNPKVIISKNLSWPN 699 (1416)
Q Consensus 623 LYwtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD-~g~~~~I~ra~mDGs~r~vlv~~~l~~P~ 699 (1416)
|+++... ...|+++++++...+.|..... .-......|...+|||+. .+..+.|++++++|...+.|.......
T Consensus 247 la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~-~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~~~-- 323 (419)
T PRK04043 247 LLLTMAPKGQPDIYLYDTNTKTLTQITNYPG-IDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGKNN-- 323 (419)
T ss_pred EEEEEccCCCcEEEEEECCCCcEEEcccCCC-ccCccEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCCcC--
Confidence 8776543 4589999998876655543221 112335778777777765 444579999999988775555433322
Q ss_pred eEEeecCCCeEEEecCCC--------CeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCC-C-eeEE
Q psy6572 700 ALTISYETNELFWGDAHE--------DYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEM-K-SIER 769 (1416)
Q Consensus 700 gLaiD~~~~rLYWtD~~~--------~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~-~-~I~~ 769 (1416)
.++.+..++|.++-... ..|+.++++|...+.|.... .....+++-.+..|+++.... . .|..
T Consensus 324 -~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~LT~~~------~~~~p~~SPDG~~I~f~~~~~~~~~L~~ 396 (419)
T PRK04043 324 -SSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRLTANG------VNQFPRFSSDGGSIMFIKYLGNQSALGI 396 (419)
T ss_pred -ceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEECCCCC------CcCCeEECCCCCEEEEEEccCCcEEEEE
Confidence 36777778787775432 47999999888777665532 222223444566677665432 2 3555
Q ss_pred ecccCCCceEEE
Q psy6572 770 CDKYTGKNCTSV 781 (1416)
Q Consensus 770 ~nk~tG~~~~~l 781 (1416)
++ .+|.....|
T Consensus 397 ~~-l~g~~~~~l 407 (419)
T PRK04043 397 IR-LNYNKSFLF 407 (419)
T ss_pred Ee-cCCCeeEEe
Confidence 54 355444443
No 63
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content, of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees and is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from high genetic variability. This polymorphism may be useful for genotyping individual bees.; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=97.69 E-value=0.00076 Score=78.55 Aligned_cols=172 Identities=21% Similarity=0.244 Sum_probs=98.3
Q ss_pred CCeEEEEe---cEEEEEEecCCcceEEecccccceeeeeecCC-----CeEEEeeccCCCccEEEEecCC--CCeEEee-
Q psy6572 539 PPNLLFTN---KYYIREVTQAGVMTIRIHNQTNAVGLDFDWVD-----NCLYWSDVTMHGSSIRRSCNNS--QPELLFP- 607 (1416)
Q Consensus 539 ~~~li~s~---~~~I~~i~l~g~~~~~~~~l~~~~~l~~D~~~-----~~LYwtD~~~~~~~I~r~~l~s--~~~~l~~- 607 (1416)
+|.|+.-+ ...|+++.+.... +........|.+|... ..+|++|.... .|.++++.+ .++++..
T Consensus 33 ~pKLv~~Dl~t~~li~~~~~p~~~---~~~~s~lndl~VD~~~~~~~~~~aYItD~~~~--glIV~dl~~~~s~Rv~~~~ 107 (287)
T PF03022_consen 33 PPKLVAFDLKTNQLIRRYPFPPDI---APPDSFLNDLVVDVRDGNCDDGFAYITDSGGP--GLIVYDLATGKSWRVLHNS 107 (287)
T ss_dssp --EEEEEETTTTCEEEEEE--CCC---S-TCGGEEEEEEECTTTTS-SEEEEEEETTTC--EEEEEETTTTEEEEEETCG
T ss_pred CcEEEEEECCCCcEEEEEECChHH---cccccccceEEEEccCCCCcceEEEEeCCCcC--cEEEEEccCCcEEEEecCC
Confidence 34554432 3345666554432 1123344556666644 47899988753 555555441 1111111
Q ss_pred -------------------cCCCceEEEEccC---CcEEEeeCCCCeEEEeecC----CCc---------eEEEEcCCCC
Q psy6572 608 -------------------ATSPDGLTVDWVG---RNLYWCDKGLDTIEVAKLD----GRF---------RKVLINKGLQ 652 (1416)
Q Consensus 608 -------------------l~~p~gLAvD~~~---~~LYwtD~~~~~I~v~~ld----G~~---------~~vLi~~~l~ 652 (1416)
.....|||+.++. +.|||.-....+++++... .+. .+.|- ....
T Consensus 108 ~~~~p~~~~~~i~g~~~~~~dg~~gial~~~~~d~r~LYf~~lss~~ly~v~T~~L~~~~~~~~~~~~~~v~~lG-~k~~ 186 (287)
T PF03022_consen 108 FSPDPDAGPFTIGGESFQWPDGIFGIALSPISPDGRWLYFHPLSSRKLYRVPTSVLRDPSLSDAQALASQVQDLG-DKGS 186 (287)
T ss_dssp CTTS-SSEEEEETTEEEEETTSEEEEEE-TTSTTS-EEEEEETT-SEEEEEEHHHHCSTT--HHH-HHHT-EEEE-E---
T ss_pred cceeccccceeccCceEecCCCccccccCCCCCCccEEEEEeCCCCcEEEEEHHHhhCccccccccccccceecc-ccCC
Confidence 3346788987744 5799999888888887642 211 12232 2224
Q ss_pred CcceeeecCCcceEEEeeCCCCceEEEEecCC----CCCEEEee-cC-CCCCeeEEeec-CCCeEEEecCCCC
Q psy6572 653 EPRGIALNPAYGYMYWTDWGQNAHIGKAKMDG----SNPKVIIS-KN-LSWPNALTISY-ETNELFWGDAHED 718 (1416)
Q Consensus 653 ~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDG----s~r~vlv~-~~-l~~P~gLaiD~-~~~rLYWtD~~~~ 718 (1416)
...|+++|+ +|.||+++...+ .|.+.+.++ .+..+|+. .. |.||.+|+|+. ..+.||+.-....
T Consensus 187 ~s~g~~~D~-~G~ly~~~~~~~-aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snrl~ 257 (287)
T PF03022_consen 187 QSDGMAIDP-NGNLYFTDVEQN-AIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNRLQ 257 (287)
T ss_dssp SECEEEEET-TTEEEEEECCCT-EEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S--
T ss_pred CCceEEECC-CCcEEEecCCCC-eEEEEeCCCCcCccchheeEEcCceeeccceeeeccccCceEEEEECcch
Confidence 568999999 799999998765 899999998 34445554 44 89999999984 3688998764443
No 64
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.68 E-value=0.044 Score=61.24 Aligned_cols=225 Identities=12% Similarity=-0.005 Sum_probs=123.2
Q ss_pred CeEEEEe-cEEEEEEecCCcc-eEEecc-cccceeeeeecCCCeEEEeeccCCCccEEEEecCC-C-CeEEee-cCCCce
Q psy6572 540 PNLLFTN-KYYIREVTQAGVM-TIRIHN-QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS-Q-PELLFP-ATSPDG 613 (1416)
Q Consensus 540 ~~li~s~-~~~I~~i~l~g~~-~~~~~~-l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s-~-~~~l~~-l~~p~g 613 (1416)
.+|+++. ...|+..++.... ...... ...+..+.+.+.++.|+.+... +.|+...+.+ . ...+.. ...+..
T Consensus 22 ~~l~~~~~~g~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~---~~i~i~~~~~~~~~~~~~~~~~~i~~ 98 (289)
T cd00200 22 KLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSD---KTIRLWDLETGECVRTLTGHTSYVSS 98 (289)
T ss_pred CEEEEeecCcEEEEEEeeCCCcEEEEecCCcceeEEEECCCCCEEEEEcCC---CeEEEEEcCcccceEEEeccCCcEEE
Confidence 4455444 4445555554433 112222 2233477777766666665543 3677777762 2 222222 335667
Q ss_pred EEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeec
Q psy6572 614 LTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK 693 (1416)
Q Consensus 614 LAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~ 693 (1416)
|++.+. ++++++....+.|.+.++........+......+..|+++|...+|+.... ...|...++........+..
T Consensus 99 ~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~--~~~i~i~d~~~~~~~~~~~~ 175 (289)
T cd00200 99 VAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQ--DGTIKLWDLRTGKCVATLTG 175 (289)
T ss_pred EEEcCC-CCEEEEecCCCeEEEEECCCcEEEEEeccCCCcEEEEEEcCcCCEEEEEcC--CCcEEEEEccccccceeEec
Confidence 777655 456666666788999888733322233233446889999998666666542 23566666663333333333
Q ss_pred CCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEec-CcEEEeecCCCeeEEecc
Q psy6572 694 NLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFE-DHLFWTDWEMKSIERCDK 772 (1416)
Q Consensus 694 ~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~-d~LYwtD~~~~~I~~~nk 772 (1416)
.-.....|++.+....|+.+.. .+.|...++........+... ...+.+|++.. +.++.+....+.|...+.
T Consensus 176 ~~~~i~~~~~~~~~~~l~~~~~-~~~i~i~d~~~~~~~~~~~~~------~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~ 248 (289)
T cd00200 176 HTGEVNSVAFSPDGEKLLSSSS-DGTIKLWDLSTGKCLGTLRGH------ENGVNSVAFSPDGYLLASGSEDGTIRVWDL 248 (289)
T ss_pred CccccceEEECCCcCEEEEecC-CCcEEEEECCCCceecchhhc------CCceEEEEEcCCCcEEEEEcCCCcEEEEEc
Confidence 3345678888765556666654 677888887643322222111 22345666654 455555554666666655
Q ss_pred cCCCc
Q psy6572 773 YTGKN 777 (1416)
Q Consensus 773 ~tG~~ 777 (1416)
.++..
T Consensus 249 ~~~~~ 253 (289)
T cd00200 249 RTGEC 253 (289)
T ss_pred CCcee
Confidence 44443
No 65
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=97.67 E-value=0.003 Score=70.02 Aligned_cols=213 Identities=13% Similarity=0.092 Sum_probs=136.0
Q ss_pred EEEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEe---e--cCCCceEEEEcc
Q psy6572 548 YYIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLF---P--ATSPDGLTVDWV 619 (1416)
Q Consensus 548 ~~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~---~--l~~p~gLAvD~~ 619 (1416)
..|-+++..... ++.+..-..|.+|.+++ .+..+++|..+ .|.|+... ...+.+. + -.+.+...+|.
T Consensus 83 gaiGhLdP~tGev~~ypLg~Ga~Phgiv~gp-dg~~Witd~~~---aI~R~dpkt~evt~f~lp~~~a~~nlet~vfD~- 157 (353)
T COG4257 83 GAIGHLDPATGEVETYPLGSGASPHGIVVGP-DGSAWITDTGL---AIGRLDPKTLEVTRFPLPLEHADANLETAVFDP- 157 (353)
T ss_pred ccceecCCCCCceEEEecCCCCCCceEEECC-CCCeeEecCcc---eeEEecCcccceEEeecccccCCCcccceeeCC-
Confidence 346666654443 55566667899999987 45578888774 68888775 2222221 1 34556678885
Q ss_pred CCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee-cC-CCC
Q psy6572 620 GRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS-KN-LSW 697 (1416)
Q Consensus 620 ~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~-~~-l~~ 697 (1416)
.++|+||... +.-.+.+.-....+|.-......|.||.+.|. |-+|++....+ .|.|++.-.....+|.. +. -..
T Consensus 158 ~G~lWFt~q~-G~yGrLdPa~~~i~vfpaPqG~gpyGi~atpd-Gsvwyaslagn-aiaridp~~~~aev~p~P~~~~~g 234 (353)
T COG4257 158 WGNLWFTGQI-GAYGRLDPARNVISVFPAPQGGGPYGICATPD-GSVWYASLAGN-AIARIDPFAGHAEVVPQPNALKAG 234 (353)
T ss_pred CccEEEeecc-ccceecCcccCceeeeccCCCCCCcceEECCC-CcEEEEecccc-ceEEcccccCCcceecCCCccccc
Confidence 6778887652 22223333333344554455667999999987 88888875544 67776543334444443 22 222
Q ss_pred CeeEEeecCCCeEEEecCCCCeEEEEeCCCCc-eEEEEeccCCCCcccccceeEEEe-cCcEEEeecCCCeeEEecccCC
Q psy6572 698 PNALTISYETNELFWGDAHEDYIAVSDLNGEN-IKIIVSRRMDPTINLHHVFALAVF-EDHLFWTDWEMKSIERCDKYTG 775 (1416)
Q Consensus 698 P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~-r~~v~~~~~~p~~~l~~P~~lav~-~d~LYwtD~~~~~I~~~nk~tG 775 (1416)
-+.|-.|+ -++++.+++.++.+.+++..-+. +..-+-+. -.+|++|-|. .++|+.+|+..+.|.|.+..+.
T Consensus 235 sRriwsdp-ig~~wittwg~g~l~rfdPs~~sW~eypLPgs------~arpys~rVD~~grVW~sea~agai~rfdpeta 307 (353)
T COG4257 235 SRRIWSDP-IGRAWITTWGTGSLHRFDPSVTSWIEYPLPGS------KARPYSMRVDRHGRVWLSEADAGAIGRFDPETA 307 (353)
T ss_pred ccccccCc-cCcEEEeccCCceeeEeCcccccceeeeCCCC------CCCcceeeeccCCcEEeeccccCceeecCcccc
Confidence 34566774 68999999999999999864332 22222221 4689999995 5777778999999999876543
No 66
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=97.64 E-value=2.9e-05 Score=54.29 Aligned_cols=24 Identities=42% Similarity=0.981 Sum_probs=21.0
Q ss_pred ceEEeeCCCceecCCCCCccccCCc
Q psy6572 474 GYKCACRKGYQVHPEDKHLCVDTNE 498 (1416)
Q Consensus 474 gy~C~C~~Gy~L~p~d~~tC~didE 498 (1416)
||+|+|++||+|. .++++|+||||
T Consensus 1 sy~C~C~~Gy~l~-~d~~~C~DIdE 24 (24)
T PF12662_consen 1 SYTCSCPPGYQLS-PDGRSCEDIDE 24 (24)
T ss_pred CEEeeCCCCCcCC-CCCCccccCCC
Confidence 5889999999998 68889999987
No 67
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (EC 4.3.3.2), a key enzyme in alkaloid biosynthesis. It catalyses the stereospecific Pictet-Spengler condensation of tryptamine with secologanin to form strictosidine. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom.; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=97.64 E-value=0.00015 Score=68.51 Aligned_cols=72 Identities=24% Similarity=0.362 Sum_probs=57.2
Q ss_pred ceEEEEccCCcEEEeeCCC-----------------CeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCC
Q psy6572 612 DGLTVDWVGRNLYWCDKGL-----------------DTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQN 674 (1416)
Q Consensus 612 ~gLAvD~~~~~LYwtD~~~-----------------~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~ 674 (1416)
.+|+|+..++.|||||+.. +++.+.++..+..+||+ .+|..|+||||.+...+|++++...
T Consensus 1 ndldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~-~~L~fpNGVals~d~~~vlv~Et~~- 78 (89)
T PF03088_consen 1 NDLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLL-DGLYFPNGVALSPDESFVLVAETGR- 78 (89)
T ss_dssp -EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEE-EEESSEEEEEE-TTSSEEEEEEGGG-
T ss_pred CceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEeh-hCCCccCeEEEcCCCCEEEEEeccC-
Confidence 4789998889999999743 58999999888777777 6799999999999999999999765
Q ss_pred ceEEEEecCCC
Q psy6572 675 AHIGKAKMDGS 685 (1416)
Q Consensus 675 ~~I~ra~mDGs 685 (1416)
.+|.|..+.|.
T Consensus 79 ~Ri~rywl~Gp 89 (89)
T PF03088_consen 79 YRILRYWLKGP 89 (89)
T ss_dssp TEEEEEESSST
T ss_pred ceEEEEEEeCC
Confidence 49999999873
No 68
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.56 E-value=0.005 Score=74.38 Aligned_cols=191 Identities=14% Similarity=0.142 Sum_probs=115.7
Q ss_pred eEEEEe--cEEEEEEecCCcc-eEEeccccc-ceeeeeecCCCeEEEeeccCCCccEEEEecCCCC--eEEeecCCCceE
Q psy6572 541 NLLFTN--KYYIREVTQAGVM-TIRIHNQTN-AVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQP--ELLFPATSPDGL 614 (1416)
Q Consensus 541 ~li~s~--~~~I~~i~l~g~~-~~~~~~l~~-~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~--~~l~~l~~p~gL 614 (1416)
+++++. ...+..++..... .-.+..... ..++.|.+.++.+|++... +.|.++++.+.. ..+.....|.+|
T Consensus 7 l~~V~~~~~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rd---g~vsviD~~~~~~v~~i~~G~~~~~i 83 (369)
T PF02239_consen 7 LFYVVERGSGSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRD---GTVSVIDLATGKVVATIKVGGNPRGI 83 (369)
T ss_dssp EEEEEEGGGTEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETT---SEEEEEETTSSSEEEEEE-SSEEEEE
T ss_pred EEEEEecCCCEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCC---CeEEEEECCcccEEEEEecCCCcceE
Confidence 444444 4567777766554 112222233 3557788888899998643 378888888332 222227789999
Q ss_pred EEEccCCcEEEeeCCCCeEEEeecCCCc-eEEEEcCCC----CC--cceeeecCCcceEEEeeCCCCceEEEEecCCCCC
Q psy6572 615 TVDWVGRNLYWCDKGLDTIEVAKLDGRF-RKVLINKGL----QE--PRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 615 AvD~~~~~LYwtD~~~~~I~v~~ldG~~-~~vLi~~~l----~~--P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r 687 (1416)
|+.+-++.||.+.+..+.|.+++..... .+.|-...+ .. +.+|.-.|.+ ..|+....+.+.|+.+++.....
T Consensus 84 ~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~-~~fVv~lkd~~~I~vVdy~d~~~ 162 (369)
T PF02239_consen 84 AVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGR-PEFVVNLKDTGEIWVVDYSDPKN 162 (369)
T ss_dssp EE--TTTEEEEEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSS-SEEEEEETTTTEEEEEETTTSSC
T ss_pred EEcCCCCEEEEEecCCCceeEeccccccceeecccccccccccCCCceeEEecCCC-CEEEEEEccCCeEEEEEeccccc
Confidence 9999999999999999999998865433 333332221 22 3466655554 44555545567899998766543
Q ss_pred EEEee-cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEe
Q psy6572 688 KVIIS-KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 688 ~vlv~-~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~ 735 (1416)
..+.. ..-..|.++.+|+..+++|.+-...+.|..++........++.
T Consensus 163 ~~~~~i~~g~~~~D~~~dpdgry~~va~~~sn~i~viD~~~~k~v~~i~ 211 (369)
T PF02239_consen 163 LKVTTIKVGRFPHDGGFDPDGRYFLVAANGSNKIAVIDTKTGKLVALID 211 (369)
T ss_dssp EEEEEEE--TTEEEEEE-TTSSEEEEEEGGGTEEEEEETTTTEEEEEEE
T ss_pred cceeeecccccccccccCcccceeeecccccceeEEEeeccceEEEEee
Confidence 33222 2346789999998777777877778899999977665555544
No 69
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.54 E-value=0.0038 Score=70.35 Aligned_cols=160 Identities=11% Similarity=0.167 Sum_probs=103.4
Q ss_pred EEEEEecCCcc--eEEecccccceeeeeecCCCeEEE-eeccCCCccEEEEecCCCCeEE------ee-------cCCCc
Q psy6572 549 YIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYW-SDVTMHGSSIRRSCNNSQPELL------FP-------ATSPD 612 (1416)
Q Consensus 549 ~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYw-tD~~~~~~~I~r~~l~s~~~~l------~~-------l~~p~ 612 (1416)
.|..+++.|.. ++.+.++..+.+|+| .++..|. +|... ..++.+.++...+++ ++ -..-+
T Consensus 109 ~iVElt~~GdlirtiPL~g~~DpE~Iey--ig~n~fvi~dER~--~~l~~~~vd~~t~~~~~~~~~i~L~~~~k~N~GfE 184 (316)
T COG3204 109 AIVELTKEGDLIRTIPLTGFSDPETIEY--IGGNQFVIVDERD--RALYLFTVDADTTVISAKVQKIPLGTTNKKNKGFE 184 (316)
T ss_pred eEEEEecCCceEEEecccccCChhHeEE--ecCCEEEEEehhc--ceEEEEEEcCCccEEeccceEEeccccCCCCcCce
Confidence 46667777766 666777888888887 3444444 44443 366666666211111 11 23468
Q ss_pred eEEEEccCCcEEEeeCCCC-eEEEeecCCCce--EEEEcC------CCCCcceeeecCCcceEEEeeCCCCceEEEEecC
Q psy6572 613 GLTVDWVGRNLYWCDKGLD-TIEVAKLDGRFR--KVLINK------GLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMD 683 (1416)
Q Consensus 613 gLAvD~~~~~LYwtD~~~~-~I~v~~ldG~~~--~vLi~~------~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mD 683 (1416)
|||.|+..+.||++-..+. +|+.....-+.. .+.... -+....|+.+|+.++.|++..- +..++...+++
T Consensus 185 GlA~d~~~~~l~~aKEr~P~~I~~~~~~~~~l~~~~~~~~~~~~~~f~~DvSgl~~~~~~~~LLVLS~-ESr~l~Evd~~ 263 (316)
T COG3204 185 GLAWDPVDHRLFVAKERNPIGIFEVTQSPSSLSVHASLDPTADRDLFVLDVSGLEFNAITNSLLVLSD-ESRRLLEVDLS 263 (316)
T ss_pred eeecCCCCceEEEEEccCCcEEEEEecCCcccccccccCcccccceEeeccccceecCCCCcEEEEec-CCceEEEEecC
Confidence 9999999999998876553 566665432221 111111 1445789999999999998763 34578888899
Q ss_pred CCCCEEEee--------cCCCCCeeEEeecCCCeEEEec
Q psy6572 684 GSNPKVIIS--------KNLSWPNALTISYETNELFWGD 714 (1416)
Q Consensus 684 Gs~r~vlv~--------~~l~~P~gLaiD~~~~rLYWtD 714 (1416)
|.-+..+.. .++..|-|||+|. .+.||++-
T Consensus 264 G~~~~~lsL~~g~~gL~~dipqaEGiamDd-~g~lYIvS 301 (316)
T COG3204 264 GEVIELLSLTKGNHGLSSDIPQAEGIAMDD-DGNLYIVS 301 (316)
T ss_pred CCeeeeEEeccCCCCCcccCCCcceeEECC-CCCEEEEe
Confidence 886544432 2477889999994 78888874
No 70
>PRK01742 tolB translocation protein TolB; Provisional
Probab=97.53 E-value=0.019 Score=71.07 Aligned_cols=179 Identities=13% Similarity=0.128 Sum_probs=115.1
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeec-CCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPA-TSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l-~~p~gLAvD~~~~~LY 624 (1416)
..|+..+.+|.. ..+......+..+.+.+.+++|+++........|+.+.+. +..+.+..+ .....+++.+.++.|+
T Consensus 184 ~~i~i~d~dg~~~~~lt~~~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~~~~g~~~~~~wSPDG~~La 263 (429)
T PRK01742 184 YEVRVADYDGFNQFIVNRSSQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKVVASFRGHNGAPAFSPDGSRLA 263 (429)
T ss_pred EEEEEECCCCCCceEeccCCCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEEEecCCCccCceeECCCCCEEE
Confidence 467777777776 3333334456788999999999887654333478888887 444445442 2334677888788888
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe-eCCCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT-DWGQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt-D~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++-.. ...|++.++++...+.|.. .......++..|...+|+++ +....+.|++++++|...+.+ ... . ..+
T Consensus 264 ~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~~~-~--~~~ 338 (429)
T PRK01742 264 FASSKDGVLNIYVMGANGGTPSQLTS-GAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGASLV-GGR-G--YSA 338 (429)
T ss_pred EEEecCCcEEEEEEECCCCCeEeecc-CCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCeEEe-cCC-C--CCc
Confidence 87533 3368888888766555543 33345678888887777765 334457899998887765544 221 1 235
Q ss_pred EeecCCCeEEEecCCCCeEEEEeCCCCceEEE
Q psy6572 702 TISYETNELFWGDAHEDYIAVSDLNGENIKII 733 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v 733 (1416)
++.+..+.|+++.. ..|..+++.+...+.+
T Consensus 339 ~~SpDG~~ia~~~~--~~i~~~Dl~~g~~~~l 368 (429)
T PRK01742 339 QISADGKTLVMING--DNVVKQDLTSGSTEVL 368 (429)
T ss_pred cCCCCCCEEEEEcC--CCEEEEECCCCCeEEe
Confidence 66666777877653 5677788765544443
No 71
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=97.52 E-value=0.006 Score=73.06 Aligned_cols=156 Identities=18% Similarity=0.147 Sum_probs=99.6
Q ss_pred CCCceEEEEccCCcEEEeeCCC-------------CeEEEeecCC--------CceEEEEcCCCCCcceeeecCCcceEE
Q psy6572 609 TSPDGLTVDWVGRNLYWCDKGL-------------DTIEVAKLDG--------RFRKVLINKGLQEPRGIALNPAYGYMY 667 (1416)
Q Consensus 609 ~~p~gLAvD~~~~~LYwtD~~~-------------~~I~v~~ldG--------~~~~vLi~~~l~~P~gIavDp~~g~LY 667 (1416)
..-..|++++.+ +||++-... ++|.+.+.+| .. .-+.+.++.+|.||+++|.+|.||
T Consensus 177 H~g~~l~f~pDG-~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a~~~~~d~p~~~-~~i~s~G~RN~qGl~w~P~tg~Lw 254 (399)
T COG2133 177 HFGGRLVFGPDG-KLYVTTGSNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPN-SEIWSYGHRNPQGLAWHPVTGALW 254 (399)
T ss_pred cCcccEEECCCC-cEEEEeCCCCCcccccCccccccceeeeccCcccccCCCCCC-cceEEeccCCccceeecCCCCcEE
Confidence 345679999877 999985443 3444444433 33 234557899999999999999999
Q ss_pred EeeCCC---C----------------ceEE-------EEecCCCCCEEEeecCC-----CCCeeEEeecCC-------Ce
Q psy6572 668 WTDWGQ---N----------------AHIG-------KAKMDGSNPKVIISKNL-----SWPNALTISYET-------NE 709 (1416)
Q Consensus 668 WtD~g~---~----------------~~I~-------ra~mDGs~r~vlv~~~l-----~~P~gLaiD~~~-------~r 709 (1416)
.++-+. . |.++ ++.+++.....++.... ..|.||++- .. +.
T Consensus 255 ~~e~g~d~~~~~Deln~i~~G~nYGWP~~~~G~~~~g~~~~~~~~~~~~~~p~~~~~~h~ApsGmaFy-~G~~fP~~r~~ 333 (399)
T COG2133 255 TTEHGPDALRGPDELNSIRPGKNYGWPYAYFGQNYDGRAIPDGTVVAGAIQPVYTWAPHIAPSGMAFY-TGDLFPAYRGD 333 (399)
T ss_pred EEecCCCcccCcccccccccCCccCCceeccCcccCccccCCCcccccccCCceeeccccccceeEEe-cCCcCccccCc
Confidence 999765 1 1111 12222222222222222 235788883 22 68
Q ss_pred EEEecCCCCeEEEEeCCCCceEE---EEeccCCCCcccccceeEEE-ecCcEEEeecC-CCeeEEecc
Q psy6572 710 LFWGDAHEDYIAVSDLNGENIKI---IVSRRMDPTINLHHVFALAV-FEDHLFWTDWE-MKSIERCDK 772 (1416)
Q Consensus 710 LYWtD~~~~~I~~~~ldG~~r~~---v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~-~~~I~~~nk 772 (1416)
||++-.+...|.+.+.+|..+.+ ++... . -..|.+|++ ..+.||+++-. ++.|+|+..
T Consensus 334 lfV~~hgsw~~~~~~~~g~~~~~~~~fl~~d----~-~gR~~dV~v~~DGallv~~D~~~g~i~Rv~~ 396 (399)
T COG2133 334 LFVGAHGSWPVLRLRPDGNYKVVLTGFLSGD----L-GGRPRDVAVAPDGALLVLTDQGDGRILRVSY 396 (399)
T ss_pred EEEEeecceeEEEeccCCCcceEEEEEEecC----C-CCcccceEECCCCeEEEeecCCCCeEEEecC
Confidence 88888777778888888873332 23321 1 258999998 56789988776 669999864
No 72
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.52 E-value=0.022 Score=70.41 Aligned_cols=221 Identities=10% Similarity=0.018 Sum_probs=126.1
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCe--E-EEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCC
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNC--L-YWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGR 621 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~--L-YwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~ 621 (1416)
..|+.++.+|.. ..+.........-.+.+.++. + |++... ....|+++.++ +..+.|.. -......++.+-++
T Consensus 165 ~~l~~~d~dG~~~~~lt~~~~~~~sP~wSPDG~~~~~~y~S~~~-g~~~I~~~~l~~g~~~~lt~~~g~~~~p~wSPDG~ 243 (428)
T PRK01029 165 GELWSVDYDGQNLRPLTQEHSLSITPTWMHIGSGFPYLYVSYKL-GVPKIFLGSLENPAGKKILALQGNQLMPTFSPRKK 243 (428)
T ss_pred ceEEEEcCCCCCceEcccCCCCcccceEccCCCceEEEEEEccC-CCceEEEEECCCCCceEeecCCCCccceEECCCCC
Confidence 367788888876 333322223344566666654 3 344433 34579999998 55555554 23344567777788
Q ss_pred cEEEeeCC--CCeEEEe--ecCC---CceEEEEcCCCCCcceeeecCCcceEEEee-CCCCceEEEEecCCCC-CEEEee
Q psy6572 622 NLYWCDKG--LDTIEVA--KLDG---RFRKVLINKGLQEPRGIALNPAYGYMYWTD-WGQNAHIGKAKMDGSN-PKVIIS 692 (1416)
Q Consensus 622 ~LYwtD~~--~~~I~v~--~ldG---~~~~vLi~~~l~~P~gIavDp~~g~LYWtD-~g~~~~I~ra~mDGs~-r~vlv~ 692 (1416)
.|.|+-.. ...|++. ++.+ ...+.|...........++.|...+|+|+. .+..+.|+++++++.. ....++
T Consensus 244 ~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSPDG~~Laf~s~~~g~~~ly~~~~~~~g~~~~~lt 323 (428)
T PRK01029 244 LLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFSPDGTRLVFVSNKDGRPRIYIMQIDPEGQSPRLLT 323 (428)
T ss_pred EEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECcccccceEEec
Confidence 88887643 3356654 3332 222334333223345678889877777664 4445689999887533 233333
Q ss_pred cCCCCCeeEEeecCCCeEEEecCC--CCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeec--CCCe
Q psy6572 693 KNLSWPNALTISYETNELFWGDAH--EDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDW--EMKS 766 (1416)
Q Consensus 693 ~~l~~P~gLaiD~~~~rLYWtD~~--~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~--~~~~ 766 (1416)
........+++.+..++|+++... ...|+.+++++...+.|.... .....+++ .+.+||++.. ....
T Consensus 324 ~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~~-------~~~~~p~wSpDG~~L~f~~~~~g~~~ 396 (428)
T PRK01029 324 KKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTTSP-------ENKESPSWAIDSLHLVYSAGNSNESE 396 (428)
T ss_pred cCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccCCC-------CCccceEECCCCCEEEEEECCCCCce
Confidence 222333456777777888877543 347899999887777665431 11223333 3456666533 2345
Q ss_pred eEEecccCCC
Q psy6572 767 IERCDKYTGK 776 (1416)
Q Consensus 767 I~~~nk~tG~ 776 (1416)
|+.++..++.
T Consensus 397 L~~vdl~~g~ 406 (428)
T PRK01029 397 LYLISLITKK 406 (428)
T ss_pred EEEEECCCCC
Confidence 7777655444
No 73
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (EC 4.3.3.2), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom.; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=97.47 E-value=0.00029 Score=66.59 Aligned_cols=72 Identities=17% Similarity=0.288 Sum_probs=55.3
Q ss_pred ceeeecCCcceEEEeeCCC----------------CceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCC
Q psy6572 655 RGIALNPAYGYMYWTDWGQ----------------NAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHED 718 (1416)
Q Consensus 655 ~gIavDp~~g~LYWtD~g~----------------~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~ 718 (1416)
.+|+|++..|.|||||... ..+|.+.++.....++|+. +|..||||+|...+..|++++....
T Consensus 1 ndldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~-~L~fpNGVals~d~~~vlv~Et~~~ 79 (89)
T PF03088_consen 1 NDLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLD-GLYFPNGVALSPDESFVLVAETGRY 79 (89)
T ss_dssp -EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEE-EESSEEEEEE-TTSSEEEEEEGGGT
T ss_pred CceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehh-CCCccCeEEEcCCCCEEEEEeccCc
Confidence 4789999999999999642 2578888887666556655 7999999999999999999999999
Q ss_pred eEEEEeCCC
Q psy6572 719 YIAVSDLNG 727 (1416)
Q Consensus 719 ~I~~~~ldG 727 (1416)
+|.+.-+.|
T Consensus 80 Ri~rywl~G 88 (89)
T PF03088_consen 80 RILRYWLKG 88 (89)
T ss_dssp EEEEEESSS
T ss_pred eEEEEEEeC
Confidence 999998877
No 74
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site). A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes. Consensus pattern: nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx, where 'n': negatively charged or polar residue [DEQN], 'b': possibly beta-hydroxylated residue [DN], 'a': aromatic amino acid, 'C': cysteine, involved in disulphide bond, 'x': any amino acid.; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=97.46 E-value=0.00012 Score=59.32 Aligned_cols=39 Identities=26% Similarity=0.762 Sum_probs=30.3
Q ss_pred ccccccCCCCCCccc--ccceecCCceEEeeCCCceecCCCCCc
Q psy6572 451 FVNECNVSHGGQLCA--HECIDLKIGYKCACRKGYQVHPEDKHL 492 (1416)
Q Consensus 451 ~i~eC~~~~~~~~Cs--~~C~nt~~gy~C~C~~Gy~L~p~d~~t 492 (1416)
+|+||...... |. +.|+|++|+|+|.|++||++. .++++
T Consensus 1 DidEC~~~~~~--C~~~~~C~N~~Gsy~C~C~~Gy~~~-~~~~~ 41 (42)
T PF07645_consen 1 DIDECAEGPHN--CPENGTCVNTEGSYSCSCPPGYELN-DDGTT 41 (42)
T ss_dssp ESSTTTTTSSS--SSTTSEEEEETTEEEEEESTTEEEC-TTSSE
T ss_pred CccccCCCCCc--CCCCCEEEcCCCCEEeeCCCCcEEC-CCCCc
Confidence 47888665544 76 699999999999999999976 44433
No 75
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.36 E-value=0.033 Score=67.38 Aligned_cols=142 Identities=13% Similarity=0.033 Sum_probs=82.0
Q ss_pred CCeEEEEec-EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCC--CCeEEee--c----
Q psy6572 539 PPNLLFTNK-YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS--QPELLFP--A---- 608 (1416)
Q Consensus 539 ~~~li~s~~-~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s--~~~~l~~--l---- 608 (1416)
..++|++++ ..|..+++.... .-.+.....+.++++.+.++.||.+....+ .+..++..+ ..+.|.. +
T Consensus 48 gr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~~~i~~s~DG~~~~v~n~~~~--~v~v~D~~tle~v~~I~~~~~~~~~ 125 (369)
T PF02239_consen 48 GRYLYVANRDGTVSVIDLATGKVVATIKVGGNPRGIAVSPDGKYVYVANYEPG--TVSVIDAETLEPVKTIPTGGMPVDG 125 (369)
T ss_dssp SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEEEEEEE--TTTEEEEEEEETT--EEEEEETTT--EEEEEE--EE-TTT
T ss_pred CCEEEEEcCCCeEEEEECCcccEEEEEecCCCcceEEEcCCCCEEEEEecCCC--ceeEeccccccceeecccccccccc
Confidence 456777764 468888887665 222334457899999999999999988754 677666552 1222221 1
Q ss_pred C--CCceEEEEccCCcEEEeeCCCCeEEEeecCCC-ceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecC
Q psy6572 609 T--SPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGR-FRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMD 683 (1416)
Q Consensus 609 ~--~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~-~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mD 683 (1416)
. .+.+|...+.....+++-...++|.+++.... ...+-.-.....|.+..+||..+|+|.+..+.+ +|..+++.
T Consensus 126 ~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~d~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~sn-~i~viD~~ 202 (369)
T PF02239_consen 126 PESRVAAIVASPGRPEFVVNLKDTGEIWVVDYSDPKNLKVTTIKVGRFPHDGGFDPDGRYFLVAANGSN-KIAVIDTK 202 (369)
T ss_dssp S---EEEEEE-SSSSEEEEEETTTTEEEEEETTTSSCEEEEEEE--TTEEEEEE-TTSSEEEEEEGGGT-EEEEEETT
T ss_pred cCCCceeEEecCCCCEEEEEEccCCeEEEEEeccccccceeeecccccccccccCcccceeeecccccc-eeEEEeec
Confidence 1 23355444444445556677889999986543 222211134467999999999888888644322 44444433
No 76
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=97.32 E-value=0.055 Score=62.18 Aligned_cols=227 Identities=15% Similarity=0.140 Sum_probs=138.1
Q ss_pred CCcceEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC----------CCC-eEEee-------cCCCceEEEE
Q psy6572 556 AGVMTIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN----------SQP-ELLFP-------ATSPDGLTVD 617 (1416)
Q Consensus 556 ~g~~~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~----------s~~-~~l~~-------l~~p~gLAvD 617 (1416)
+|...+....+..+.||+.. .++||.+-.. .|.++... ... ...+. --....||+
T Consensus 38 ~g~l~~~~r~F~r~MGl~~~--~~~l~~~t~~----qiw~f~~~~n~l~~~~~~~~~D~~yvPr~~~~TGdidiHdia~- 110 (335)
T TIGR03032 38 NGELDVFERTFPRPMGLAVS--PQSLTLGTRY----QLWRFANVDNLLPAGQTHPGYDRLYVPRASYVTGDIDAHDLAL- 110 (335)
T ss_pred CCcEEEEeeccCccceeeee--CCeEEEEEcc----eeEEcccccccccccccCCCCCeEEeeeeeeeccCcchhheee-
Confidence 34435555667788888774 5778887655 56665211 011 11111 223455666
Q ss_pred ccCCcEEEeeCCCCeEEEeecCCCceE----EEEc----CCCCCcceeeecCCcceEEEeeCCCC--ceEEEEecCCCCC
Q psy6572 618 WVGRNLYWCDKGLDTIEVAKLDGRFRK----VLIN----KGLQEPRGIALNPAYGYMYWTDWGQN--AHIGKAKMDGSNP 687 (1416)
Q Consensus 618 ~~~~~LYwtD~~~~~I~v~~ldG~~~~----vLi~----~~l~~P~gIavDp~~g~LYWtD~g~~--~~I~ra~mDGs~r 687 (1416)
..+.|++++..-.-+-..+..-++.. .+|+ ..-=+-+|||+.- ..--|+|-.+.. +.-+|-.......
T Consensus 111 -~~~~l~fVNT~fSCLatl~~~~SF~P~WkPpFIs~la~eDRCHLNGlA~~~-g~p~yVTa~~~sD~~~gWR~~~~~gG~ 188 (335)
T TIGR03032 111 -GAGRLLFVNTLFSCLATVSPDYSFVPLWKPPFISKLAPEDRCHLNGMALDD-GEPRYVTALSQSDVADGWREGRRDGGC 188 (335)
T ss_pred -cCCcEEEEECcceeEEEECCCCccccccCCccccccCccCceeecceeeeC-CeEEEEEEeeccCCcccccccccCCeE
Confidence 67788888877666666655554422 2222 2223468899964 245666754432 2223332221111
Q ss_pred ------EEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEee
Q psy6572 688 ------KVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTD 761 (1416)
Q Consensus 688 ------~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD 761 (1416)
..++.++|..|.+--. .+++||+.|++++.|.+++++....++|..- -..|.||++.++++|+.-
T Consensus 189 vidv~s~evl~~GLsmPhSPRW--hdgrLwvldsgtGev~~vD~~~G~~e~Va~v-------pG~~rGL~f~G~llvVgm 259 (335)
T TIGR03032 189 VIDIPSGEVVASGLSMPHSPRW--YQGKLWLLNSGRGELGYVDPQAGKFQPVAFL-------PGFTRGLAFAGDFAFVGL 259 (335)
T ss_pred EEEeCCCCEEEcCccCCcCCcE--eCCeEEEEECCCCEEEEEcCCCCcEEEEEEC-------CCCCcccceeCCEEEEEe
Confidence 1233457788887777 4899999999999999999984444455442 346899999988888876
Q ss_pred cCCC-------------------eeEEecccCCCceEEEEe-CCCCCCeeeeeecccCC
Q psy6572 762 WEMK-------------------SIERCDKYTGKNCTSVVK-NLVHKPMDLRVYHPYRQ 800 (1416)
Q Consensus 762 ~~~~-------------------~I~~~nk~tG~~~~~l~~-~~~~~p~~I~v~h~~~q 800 (1416)
.+.+ .|..+|..+|..+..|.- ..+...|+++|....++
T Consensus 260 Sk~R~~~~f~glpl~~~l~~~~CGv~vidl~tG~vv~~l~feg~v~EifdV~vLPg~r~ 318 (335)
T TIGR03032 260 SKLRESRVFGGLPIEERLDALGCGVAVIDLNSGDVVHWLRFEGVIEEIYDVAVLPGVRR 318 (335)
T ss_pred ccccCCCCcCCCchhhhhhhhcccEEEEECCCCCEEEEEEeCCceeEEEEEEEecCCCC
Confidence 5433 366778888886666542 34567788888866665
No 77
>KOG4499|consensus
Probab=97.31 E-value=0.0055 Score=66.59 Aligned_cols=87 Identities=18% Similarity=0.220 Sum_probs=64.9
Q ss_pred EEEEecCCcceEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC------CCCeEEee--------cCCCceEE
Q psy6572 550 IREVTQAGVMTIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN------SQPELLFP--------ATSPDGLT 615 (1416)
Q Consensus 550 I~~i~l~g~~~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~------s~~~~l~~--------l~~p~gLA 615 (1416)
++..-+.+...++...+.-+.||++|...++.|++|..+. .|..+..+ +++.+|+. -..|.|++
T Consensus 141 Ly~~~~~h~v~~i~~~v~IsNgl~Wd~d~K~fY~iDsln~--~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ 218 (310)
T KOG4499|consen 141 LYSWLAGHQVELIWNCVGISNGLAWDSDAKKFYYIDSLNY--EVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMT 218 (310)
T ss_pred EEEeccCCCceeeehhccCCccccccccCcEEEEEccCce--EEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcce
Confidence 3333333333677777778889999999999999999865 67555533 45666665 23589999
Q ss_pred EEccCCcEEEeeCCCCeEEEeecC
Q psy6572 616 VDWVGRNLYWCDKGLDTIEVAKLD 639 (1416)
Q Consensus 616 vD~~~~~LYwtD~~~~~I~v~~ld 639 (1416)
||- .|+||++-+..++|+++++.
T Consensus 219 ID~-eG~L~Va~~ng~~V~~~dp~ 241 (310)
T KOG4499|consen 219 IDT-EGNLYVATFNGGTVQKVDPT 241 (310)
T ss_pred Ecc-CCcEEEEEecCcEEEEECCC
Confidence 995 89999999998999888865
No 78
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.28 E-value=0.024 Score=70.55 Aligned_cols=185 Identities=10% Similarity=-0.018 Sum_probs=115.7
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. +.+........++++.+.++.|+++........|+++.+. +..+.|.. .......++.+.++.||
T Consensus 242 ~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~~lt~~~~~~~~p~wSpDG~~I~ 321 (448)
T PRK04792 242 AEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALTRITRHRAIDTEPSWHPDGKSLI 321 (448)
T ss_pred cEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeEECccCCCCccceEECCCCCEEE
Confidence 358888877655 3332222233467888888889886443333468888887 44444433 23345567777788888
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCC-CCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWG-QNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g-~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++... ...|+++++++...+.|... .....+.++.|...+||++... ...+|++.++++...+.|..... ....
T Consensus 322 f~s~~~g~~~Iy~~dl~~g~~~~Lt~~-g~~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~~~lt~~~~--d~~p 398 (448)
T PRK04792 322 FTSERGGKPQIYRVNLASGKVSRLTFE-GEQNLGGSITPDGRSMIMVNRTNGKFNIARQDLETGAMQVLTSTRL--DESP 398 (448)
T ss_pred EEECCCCCceEEEEECCCCCEEEEecC-CCCCcCeeECCCCCEEEEEEecCCceEEEEEECCCCCeEEccCCCC--CCCc
Confidence 87543 35799999876654444322 2234456888988899887643 23578999998876655543322 1223
Q ss_pred EeecCCCeEEEecCCC--CeEEEEeCCCCceEEEEe
Q psy6572 702 TISYETNELFWGDAHE--DYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~ 735 (1416)
++.+..++|+++.... ..|+.++++|..++.+..
T Consensus 399 s~spdG~~I~~~~~~~g~~~l~~~~~~G~~~~~l~~ 434 (448)
T PRK04792 399 SVAPNGTMVIYSTTYQGKQVLAAVSIDGRFKARLPA 434 (448)
T ss_pred eECCCCCEEEEEEecCCceEEEEEECCCCceEECcC
Confidence 5666677788765433 357888888887776643
No 79
>PRK04922 tolB translocation protein TolB; Provisional
Probab=97.26 E-value=0.027 Score=69.84 Aligned_cols=185 Identities=10% Similarity=-0.021 Sum_probs=115.8
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........+++|.+.+++|+++-.......|+++++. +..+.+.. ......+++.+.++.|+
T Consensus 228 ~~l~~~dl~~g~~~~l~~~~g~~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~~~~lt~~~~~~~~~~~spDG~~l~ 307 (433)
T PRK04922 228 SAIYVQDLATGQRELVASFRGINGAPSFSPDGRRLALTLSRDGNPEIYVMDLGSRQLTRLTNHFGIDTEPTWAPDGKSIY 307 (433)
T ss_pred cEEEEEECCCCCEEEeccCCCCccCceECCCCCEEEEEEeCCCCceEEEEECCCCCeEECccCCCCccceEECCCCCEEE
Confidence 357777876655 3332222233467888888888876443333479998887 44344333 22334567777778887
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCC-CCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWG-QNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g-~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++... ...|+++++++...+.|... ......+++.|...+|+++... ....|...++++...+.|.... +...+
T Consensus 308 f~sd~~g~~~iy~~dl~~g~~~~lt~~-g~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~--~~~~p 384 (433)
T PRK04922 308 FTSDRGGRPQIYRVAASGGSAERLTFQ-GNYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGS--LDESP 384 (433)
T ss_pred EEECCCCCceEEEEECCCCCeEEeecC-CCCccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCC--CCCCc
Confidence 76533 34799999877655444422 2345578999998899887532 2347889898776655443321 22345
Q ss_pred EeecCCCeEEEecCC--CCeEEEEeCCCCceEEEEe
Q psy6572 702 TISYETNELFWGDAH--EDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~--~~~I~~~~ldG~~r~~v~~ 735 (1416)
++.+....||++... ...|+.++++|..++.|..
T Consensus 385 ~~spdG~~i~~~s~~~g~~~L~~~~~~g~~~~~l~~ 420 (433)
T PRK04922 385 SFAPNGSMVLYATREGGRGVLAAVSTDGRVRQRLVS 420 (433)
T ss_pred eECCCCCEEEEEEecCCceEEEEEECCCCceEEccc
Confidence 676666677776542 4478888888877666643
No 80
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.24 E-value=0.031 Score=68.83 Aligned_cols=183 Identities=8% Similarity=-0.015 Sum_probs=118.6
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeecC-CCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPAT-SPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l~-~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........+.++.+.+++|+++........|+.+.+. +..+.|.... .-....+.+-++.||
T Consensus 213 ~~Iyv~dl~tg~~~~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~~d~~p~~SPDG~~I~ 292 (419)
T PRK04043 213 PTLYKYNLYTGKKEKIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPGIDVNGNFVEDDKRIV 292 (419)
T ss_pred CEEEEEECCCCcEEEEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccCCCccCccEECCCCCEEE
Confidence 458888887655 4444333333456677888888877654434579998887 4444444321 122345677788898
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCC------C-ceEEEEecCCCCCEEEeecCC
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQ------N-AHIGKAKMDGSNPKVIISKNL 695 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~------~-~~I~ra~mDGs~r~vlv~~~l 695 (1416)
|+... ...|++++++|...+.|...+... .++.|..++|.++-... . ..|..++++|...+.|.....
T Consensus 293 F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~~~---~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~LT~~~~ 369 (419)
T PRK04043 293 FVSDRLGYPNIFMKKLNSGSVEQVVFHGKNN---SSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRLTANGV 369 (419)
T ss_pred EEECCCCCceEEEEECCCCCeEeCccCCCcC---ceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEECCCCCC
Confidence 88643 348999999987665554333222 47889988887775332 1 489999999887666655432
Q ss_pred CCCeeEEeecCCCeEEEecCC--CCeEEEEeCCCCceEEEEe
Q psy6572 696 SWPNALTISYETNELFWGDAH--EDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 696 ~~P~gLaiD~~~~rLYWtD~~--~~~I~~~~ldG~~r~~v~~ 735 (1416)
. ...++.+..++||++... ...|..++++|.....|..
T Consensus 370 ~--~~p~~SPDG~~I~f~~~~~~~~~L~~~~l~g~~~~~l~~ 409 (419)
T PRK04043 370 N--QFPRFSSDGGSIMFIKYLGNQSALGIIRLNYNKSFLFPL 409 (419)
T ss_pred c--CCeEECCCCCEEEEEEccCCcEEEEEEecCCCeeEEeec
Confidence 1 235676667778887543 3469999999987777654
No 81
>PRK05137 tolB translocation protein TolB; Provisional
Probab=97.22 E-value=0.036 Score=68.77 Aligned_cols=184 Identities=13% Similarity=0.031 Sum_probs=116.0
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........+.+|.+.+++|+++-.......|+++.+. +....|.. .......++.+-++.|+
T Consensus 226 ~~i~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~~~~~~~~~~~spDG~~i~ 305 (435)
T PRK05137 226 PRVYLLDLETGQRELVGNFPGMTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRSGTTTRLTDSPAIDTSPSYSPDGSQIV 305 (435)
T ss_pred CEEEEEECCCCcEEEeecCCCcccCcEECCCCCEEEEEEecCCCceEEEEECCCCceEEccCCCCccCceeEcCCCCEEE
Confidence 467777776655 3333222344577888888888776443323468888887 44444433 22334566777777787
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCC-CCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWG-QNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g-~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++... ...|++++++|...+.|.... ..-...++.|..++|+++... ...+|...+++|...+.+.... ....+
T Consensus 306 f~s~~~g~~~Iy~~d~~g~~~~~lt~~~-~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~~~lt~~~--~~~~p 382 (435)
T PRK05137 306 FESDRSGSPQLYVMNADGSNPRRISFGG-GRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGSGERILTSGF--LVEGP 382 (435)
T ss_pred EEECCCCCCeEEEEECCCCCeEEeecCC-CcccCeEECCCCCEEEEEEcCCCceEEEEEECCCCceEeccCCC--CCCCC
Confidence 76533 458999999987665554322 223456788888888877532 2357899899887765554321 34556
Q ss_pred EeecCCCeEEEecCCC-----CeEEEEeCCCCceEEEE
Q psy6572 702 TISYETNELFWGDAHE-----DYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~-----~~I~~~~ldG~~r~~v~ 734 (1416)
++.+..+.||++-... ..|+.++++|...+.|.
T Consensus 383 ~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~~~~l~ 420 (435)
T PRK05137 383 TWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRNEREVP 420 (435)
T ss_pred eECCCCCEEEEEEccCCCCCcceEEEEECCCCceEEcc
Confidence 7777677777764322 46888999887776553
No 82
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) remain unclear.
Probab=97.06 E-value=0.069 Score=65.60 Aligned_cols=183 Identities=10% Similarity=-0.016 Sum_probs=112.0
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........++++.+.++.||++........|+.+.+. ...+.+.. .......++.+.++.|+
T Consensus 214 ~~i~v~d~~~g~~~~~~~~~~~~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~~~~~~~l~~~~~~~~~~~~s~dg~~l~ 293 (417)
T TIGR02800 214 PEIYVQDLATGQREKVASFPGMNGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDGKQLTRLTNGPGIDTEPSWSPDGKSIA 293 (417)
T ss_pred cEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEECCCCCccEEEEECCCCCEEECCCCCCCCCCEEECCCCCEEE
Confidence 457777776554 3333222334567888888888877544333478888887 33333333 22223456666677888
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCC-CceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQ-NAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~-~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++... ...|+++++++...+.|. ........+++.|...+|+++.... ..+|..+++++...+++.... .....
T Consensus 294 ~~s~~~g~~~iy~~d~~~~~~~~l~-~~~~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~~~l~~~~--~~~~p 370 (417)
T TIGR02800 294 FTSDRGGSPQIYMMDADGGEVRRLT-FRGGYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGERVLTDTG--LDESP 370 (417)
T ss_pred EEECCCCCceEEEEECCCCCEEEee-cCCCCccCeEECCCCCEEEEEEccCCceEEEEEeCCCCCeEEccCCC--CCCCc
Confidence 76543 347999999877655444 2334566788999888999987543 347888888875554444322 12334
Q ss_pred EeecCCCeEEEecCCC--CeEEEEeCCCCceEEE
Q psy6572 702 TISYETNELFWGDAHE--DYIAVSDLNGENIKII 733 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v 733 (1416)
++.+..+.|+++.... ..|+.++.+|..++.|
T Consensus 371 ~~spdg~~l~~~~~~~~~~~l~~~~~~g~~~~~~ 404 (417)
T TIGR02800 371 SFAPNGRMILYATTRGGRGVLGLVSTDGRFRARL 404 (417)
T ss_pred eECCCCCEEEEEEeCCCcEEEEEEECCCceeeEC
Confidence 5655677788876543 3566666777655444
No 83
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes an uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=97.01 E-value=0.033 Score=63.42 Aligned_cols=183 Identities=13% Similarity=0.148 Sum_probs=110.0
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecC-----CCceEEEEc----C---CCCCcceeeecCCcce----------
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLD-----GRFRKVLIN----K---GLQEPRGIALNPAYGY---------- 665 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ld-----G~~~~vLi~----~---~l~~P~gIavDp~~g~---------- 665 (1416)
|.+|.|||+-+ ++.++++|.+++...+.+.+ |....+++. . ....|.||++....++
T Consensus 22 L~N~WGia~~p-~~~~WVadngT~~~TlYdg~~~~~~g~~~~L~vtiP~~~~~~~~~~PTGiVfN~~~~F~vt~~g~~~~ 100 (336)
T TIGR03118 22 LRNAWGLSYRP-GGPFWVANTGTGTATLYVGNPDTQPLVQDPLVVVIPAPPPLAAEGTPTGQVFNGSDTFVVSGEGITGP 100 (336)
T ss_pred ccccceeEecC-CCCEEEecCCcceEEeecCCcccccCCccceEEEecCCCCCCCCCCccEEEEeCCCceEEcCCCcccc
Confidence 88999999987 55788888889888888877 544333333 1 2357999999865443
Q ss_pred ---EEEeeCCCCceEEEEecCCC---CCEEEeecC--CCCCeeEEeecC--CCeEEEecCCCCeEEEEeCCCCceEEEEe
Q psy6572 666 ---MYWTDWGQNAHIGKAKMDGS---NPKVIISKN--LSWPNALTISYE--TNELFWGDAHEDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 666 ---LYWtD~g~~~~I~ra~mDGs---~r~vlv~~~--l~~P~gLaiD~~--~~rLYWtD~~~~~I~~~~ldG~~r~~v~~ 735 (1416)
||.|+-|.- .-++-.++-+ ...+++... ..--.||||-.. ..+||-+|.+.++|... |++.+++.+.
T Consensus 101 a~Fif~tEdGTi-saW~p~v~~t~~~~~~~~~d~s~~gavYkGLAi~~~~~~~~LYaadF~~g~IDVF--d~~f~~~~~~ 177 (336)
T TIGR03118 101 SRFLFVTEDGTL-SGWAPALGTTRMTRAEIVVDASQQGNVYKGLAVGPTGGGDYLYAANFRQGRIDVF--KGSFRPPPLP 177 (336)
T ss_pred eeEEEEeCCceE-EeecCcCCcccccccEEEEccCCCcceeeeeEEeecCCCceEEEeccCCCceEEe--cCccccccCC
Confidence 666653321 1112112222 122334322 222368888533 68999999999999887 4444443322
Q ss_pred cc-CCCCc-ccccceeEEEecCcEEEeecCCC-------------eeEEecccCCCceEEEE-eCCCCCCeeeeee
Q psy6572 736 RR-MDPTI-NLHHVFALAVFEDHLFWTDWEMK-------------SIERCDKYTGKNCTSVV-KNLVHKPMDLRVY 795 (1416)
Q Consensus 736 ~~-~~p~~-~l~~P~~lav~~d~LYwtD~~~~-------------~I~~~nk~tG~~~~~l~-~~~~~~p~~I~v~ 795 (1416)
+. ..|.+ .-..||.|...+++||+|=.+.. .|-..+ .+|.-++.+. ...++.|.+|++-
T Consensus 178 g~F~DP~iPagyAPFnIqnig~~lyVtYA~qd~~~~d~v~G~G~G~VdvFd-~~G~l~~r~as~g~LNaPWG~a~A 252 (336)
T TIGR03118 178 GSFIDPALPAGYAPFNVQNLGGTLYVTYAQQDADRNDEVAGAGLGYVNVFT-LNGQLLRRVASSGRLNAPWGLAIA 252 (336)
T ss_pred CCccCCCCCCCCCCcceEEECCeEEEEEEecCCcccccccCCCcceEEEEc-CCCcEEEEeccCCcccCCceeeeC
Confidence 11 11211 12369999999999999865332 333332 3455555443 2467889998874
No 84
>KOG1219|consensus
Probab=97.01 E-value=0.00093 Score=89.27 Aligned_cols=97 Identities=31% Similarity=0.810 Sum_probs=70.0
Q ss_pred eeCCC-eeecC---CcccCCCCCCCCCCCCCCc--ccccccCCCCCCccc--ccceecCCceEEeeCCCceecCCCCCcc
Q psy6572 422 LCSNG-LCINE---TLTCNDINDCGDNSDEFSC--FVNECNVSHGGQLCA--HECIDLKIGYKCACRKGYQVHPEDKHLC 493 (1416)
Q Consensus 422 ~C~~g-~Ci~~---~~~Cdg~~dC~dgsDe~~C--~i~eC~~~~~~~~Cs--~~C~nt~~gy~C~C~~Gy~L~p~d~~tC 493 (1416)
.|.+| .|+.. +..| .|+..--...| ++..|..++ |- ..|+..+++|.|.|+.|| .|.+|
T Consensus 3871 pCqhgG~C~~~~~ggy~C----kCpsqysG~~CEi~~epC~snP----C~~GgtCip~~n~f~CnC~~gy-----TG~~C 3937 (4289)
T KOG1219|consen 3871 PCQHGGTCISQPKGGYKC----KCPSQYSGNHCEIDLEPCASNP----CLTGGTCIPFYNGFLCNCPNGY-----TGKRC 3937 (4289)
T ss_pred cccCCCEecCCCCCceEE----eCcccccCcccccccccccCCC----CCCCCEEEecCCCeeEeCCCCc-----cCcee
Confidence 35443 56553 3344 45554445566 445675444 43 379999999999999999 45578
Q ss_pred cc--CCcCCCCCccc--eeeecCCeeeecCCCCcEEecCCCceEec
Q psy6572 494 VD--TNECLDRPCSH--YCRNTLGSYSCSCAPGYALLSDKHGCKAT 535 (1416)
Q Consensus 494 ~d--idEC~~~~Csq--~C~nt~gsy~C~C~~Gy~L~~dg~sC~a~ 535 (1416)
+. ++||+..+|.+ +|+|++|+|.|.|.+||. |++|-+.
T Consensus 3938 e~~Gi~eCs~n~C~~gg~C~n~~gsf~CncT~g~~----gr~c~~~ 3979 (4289)
T KOG1219|consen 3938 EARGISECSKNVCGTGGQCINIPGSFHCNCTPGIL----GRTCCAE 3979 (4289)
T ss_pred ecccccccccccccCCceeeccCCceEeccChhHh----cccCccc
Confidence 63 88999888977 899999999999999997 6777543
No 85
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.01 E-value=0.18 Score=56.18 Aligned_cols=174 Identities=11% Similarity=-0.013 Sum_probs=97.8
Q ss_pred cEEEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCC
Q psy6572 547 KYYIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGR 621 (1416)
Q Consensus 547 ~~~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~ 621 (1416)
...|+..++.... ..+......+..+.+.... .++++.... +.|..+.+. ........ ...+..|++.+.+.
T Consensus 72 ~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~~--~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 148 (289)
T cd00200 72 DKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG-RILSSSSRD--KTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGT 148 (289)
T ss_pred CCeEEEEEcCcccceEEEeccCCcEEEEEEcCCC-CEEEEecCC--CeEEEEECCCcEEEEEeccCCCcEEEEEEcCcCC
Confidence 4455555555432 2222223356778887664 444444322 367777666 22222222 34567888887644
Q ss_pred cEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 622 NLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 622 ~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++++....+.|.+.++........+......+..|++.|...+|+.+.. ...|...++........+......+..|
T Consensus 149 -~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~--~~~i~i~d~~~~~~~~~~~~~~~~i~~~ 225 (289)
T cd00200 149 -FVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSS--DGTIKLWDLSTGKCLGTLRGHENGVNSV 225 (289)
T ss_pred -EEEEEcCCCcEEEEEccccccceeEecCccccceEEECCCcCEEEEecC--CCcEEEEECCCCceecchhhcCCceEEE
Confidence 4444444678888888643322222233346789999998777777764 3456666665433222222223356788
Q ss_pred EeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 702 TISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
++++. +.++.+-...+.|...++..
T Consensus 226 ~~~~~-~~~~~~~~~~~~i~i~~~~~ 250 (289)
T cd00200 226 AFSPD-GYLLASGSEDGTIRVWDLRT 250 (289)
T ss_pred EEcCC-CcEEEEEcCCCcEEEEEcCC
Confidence 88764 66666655577888888763
No 86
>PRK02889 tolB translocation protein TolB; Provisional
Probab=96.96 E-value=0.1 Score=64.52 Aligned_cols=184 Identities=9% Similarity=0.030 Sum_probs=113.0
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........+++|.+.+++|+++-.......|++++++ +..+.|.. .......++.+-++.|+
T Consensus 220 ~~I~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~~~~~~wSpDG~~l~ 299 (427)
T PRK02889 220 PVVYVHDLATGRRRVVANFKGSNSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLRRLTQSSGIDTEPFFSPDGRSIY 299 (427)
T ss_pred cEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEEccCCCceEEEEECCCCCcEECCCCCCCCcCeEEcCCCCEEE
Confidence 357777776555 3332222344577888888888876433323478888877 44444433 22234566777778888
Q ss_pred EeeC--CCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeC-CCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDK--GLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDW-GQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~--~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~-g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++.. +...|+++++++...+.+...+ ......++.|...+|+++.. +....|.+.++++...+.|.... +...+
T Consensus 300 f~s~~~g~~~Iy~~~~~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~~~--~~~~p 376 (427)
T PRK02889 300 FTSDRGGAPQIYRMPASGGAAQRVTFTG-SYNTSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALTDTT--RDESP 376 (427)
T ss_pred EEecCCCCcEEEEEECCCCceEEEecCC-CCcCceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEccCCC--CccCc
Confidence 7643 3457888888776544443222 22345688888888887753 22347888898877655554332 23455
Q ss_pred EeecCCCeEEEecCC--CCeEEEEeCCCCceEEEE
Q psy6572 702 TISYETNELFWGDAH--EDYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~--~~~I~~~~ldG~~r~~v~ 734 (1416)
++.+....||++-.. ...|+.++++|..++.+.
T Consensus 377 ~~spdg~~l~~~~~~~g~~~l~~~~~~g~~~~~l~ 411 (427)
T PRK02889 377 SFAPNGRYILYATQQGGRSVLAAVSSDGRIKQRLS 411 (427)
T ss_pred eECCCCCEEEEEEecCCCEEEEEEECCCCceEEee
Confidence 777767777776432 345777888887766664
No 87
>PRK00178 tolB translocation protein TolB; Provisional
Probab=96.92 E-value=0.1 Score=64.51 Aligned_cols=184 Identities=9% Similarity=-0.008 Sum_probs=112.5
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........+++|.+.+++|+++-.......|+++.+. +..+.|.. .......++.+.++.||
T Consensus 223 ~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~~~~~~~spDg~~i~ 302 (430)
T PRK00178 223 PRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSRVTNHPAIDTEPFWGKDGRTLY 302 (430)
T ss_pred CEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEEEEEccCCCceEEEEECCCCCeEEcccCCCCcCCeEECCCCCEEE
Confidence 357777776554 3332222233457788888888876544333478888887 43343433 22334456666778888
Q ss_pred EeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCC-CCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWG-QNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g-~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++... ...|++.++++...+.|.... ......++.|..++|+++... ....|.+.++++...+.|.... .....
T Consensus 303 f~s~~~g~~~iy~~d~~~g~~~~lt~~~-~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~~~--~~~~p 379 (430)
T PRK00178 303 FTSDRGGKPQIYKVNVNGGRAERVTFVG-NYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTDTS--LDESP 379 (430)
T ss_pred EEECCCCCceEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEccCCC--CCCCc
Confidence 77543 357999998766554443222 223346788888899888642 2346888999887665554332 22234
Q ss_pred EeecCCCeEEEecCC--CCeEEEEeCCCCceEEEE
Q psy6572 702 TISYETNELFWGDAH--EDYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~--~~~I~~~~ldG~~r~~v~ 734 (1416)
++.+..+.|+++... ...|+.++++|...+.+.
T Consensus 380 ~~spdg~~i~~~~~~~g~~~l~~~~~~g~~~~~l~ 414 (430)
T PRK00178 380 SVAPNGTMLIYATRQQGRGVLMLVSINGRVRLPLP 414 (430)
T ss_pred eECCCCCEEEEEEecCCceEEEEEECCCCceEECc
Confidence 676667788887643 346888888887665543
No 88
>PRK03629 tolB translocation protein TolB; Provisional
Probab=96.77 E-value=0.18 Score=62.50 Aligned_cols=185 Identities=10% Similarity=0.027 Sum_probs=114.7
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........++++.+.+++|+++........|+.+.++ +..+.+.. -.....+++.+.++.|+
T Consensus 223 ~~i~i~dl~~G~~~~l~~~~~~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~~~~~~~~~wSPDG~~I~ 302 (429)
T PRK03629 223 SALVIQTLANGAVRQVASFPRHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDGRSNNTEPTWFPDSQNLA 302 (429)
T ss_pred cEEEEEECCCCCeEEccCCCCCcCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCCCCCcCceEECCCCCEEE
Confidence 356666766554 3333222334567888989999987443322368888887 44444433 23445667777777786
Q ss_pred EeeC--CCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeC-CCCceEEEEecCCCCCEEEeecCCCCCeeE
Q psy6572 625 WCDK--GLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDW-GQNAHIGKAKMDGSNPKVIISKNLSWPNAL 701 (1416)
Q Consensus 625 wtD~--~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~-g~~~~I~ra~mDGs~r~vlv~~~l~~P~gL 701 (1416)
++-. +..+|+++++++...+.|.. .......+++.|...+|+++.. +....|+..++++...+.|..... -...
T Consensus 303 f~s~~~g~~~Iy~~d~~~g~~~~lt~-~~~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~~~~--~~~p 379 (429)
T PRK03629 303 YTSDQAGRPQVYKVNINGGAPQRITW-EGSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLTDTFL--DETP 379 (429)
T ss_pred EEeCCCCCceEEEEECCCCCeEEeec-CCCCccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEeCCCCC--CCCc
Confidence 6543 24589999998876555543 2233456788888888877653 223578888888776555543211 1234
Q ss_pred EeecCCCeEEEecCCC--CeEEEEeCCCCceEEEEe
Q psy6572 702 TISYETNELFWGDAHE--DYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~~--~~I~~~~ldG~~r~~v~~ 735 (1416)
++.+....|+++.... ..|+.++++|...+.|..
T Consensus 380 ~~SpDG~~i~~~s~~~~~~~l~~~~~~G~~~~~l~~ 415 (429)
T PRK03629 380 SIAPNGTMVIYSSSQGMGSVLNLVSTDGRFKARLPA 415 (429)
T ss_pred eECCCCCEEEEEEcCCCceEEEEEECCCCCeEECcc
Confidence 5666667777765432 357788889888776643
No 89
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=96.68 E-value=0.00087 Score=47.01 Aligned_cols=23 Identities=48% Similarity=1.129 Sum_probs=21.4
Q ss_pred eeeecCCCCcEEecCCCceEecC
Q psy6572 514 SYSCSCAPGYALLSDKHGCKATS 536 (1416)
Q Consensus 514 sy~C~C~~Gy~L~~dg~sC~a~~ 536 (1416)
||+|.|++||+|.+++++|..++
T Consensus 1 sy~C~C~~Gy~l~~d~~~C~DId 23 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPDGRSCEDID 23 (24)
T ss_pred CEEeeCCCCCcCCCCCCccccCC
Confidence 69999999999999999999875
No 90
>PRK01029 tolB translocation protein TolB; Provisional
Probab=96.64 E-value=0.28 Score=60.81 Aligned_cols=186 Identities=12% Similarity=0.064 Sum_probs=113.3
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC--C----CCeEEee--cCCCceEEEEc
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--S----QPELLFP--ATSPDGLTVDW 618 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s----~~~~l~~--l~~p~gLAvD~ 618 (1416)
..|+.+++++.. ..+........+.++.+.+++|.|+-.......|+.+.++ + ..+.|.. .......++.+
T Consensus 211 ~~I~~~~l~~g~~~~lt~~~g~~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSP 290 (428)
T PRK01029 211 PKIFLGSLENPAGKKILALQGNQLMPTFSPRKKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFSP 290 (428)
T ss_pred ceEEEEECCCCCceEeecCCCCccceEECCCCCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEECC
Confidence 468888888766 3333333334567788888888887543222356554332 1 2233333 22334567888
Q ss_pred cCCcEEEeeC--CCCeEEEeecCCC--ceEEEEcCCCCCcceeeecCCcceEEEeeC-CCCceEEEEecCCCCCEEEeec
Q psy6572 619 VGRNLYWCDK--GLDTIEVAKLDGR--FRKVLINKGLQEPRGIALNPAYGYMYWTDW-GQNAHIGKAKMDGSNPKVIISK 693 (1416)
Q Consensus 619 ~~~~LYwtD~--~~~~I~v~~ldG~--~~~vLi~~~l~~P~gIavDp~~g~LYWtD~-g~~~~I~ra~mDGs~r~vlv~~ 693 (1416)
.++.|+|+.. +...|+++.+++. ..+.| +.........++.|...+|+++.. .....|.+.++++...+.|...
T Consensus 291 DG~~Laf~s~~~g~~~ly~~~~~~~g~~~~~l-t~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~ 369 (428)
T PRK01029 291 DGTRLVFVSNKDGRPRIYIMQIDPEGQSPRLL-TKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTTS 369 (428)
T ss_pred CCCEEEEEECCCCCceEEEEECcccccceEEe-ccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccCC
Confidence 8888887754 3347888888643 23333 222234456788898888887753 2235799999988877666543
Q ss_pred CCCCCeeEEeecCCCeEEEecC--CCCeEEEEeCCCCceEEEEe
Q psy6572 694 NLSWPNALTISYETNELFWGDA--HEDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 694 ~l~~P~gLaiD~~~~rLYWtD~--~~~~I~~~~ldG~~r~~v~~ 735 (1416)
......+++.+..+.||++-. ....|+.++++|...+.|..
T Consensus 370 -~~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~~~Lt~ 412 (428)
T PRK01029 370 -PENKESPSWAIDSLHLVYSAGNSNESELYLISLITKKTRKIVI 412 (428)
T ss_pred -CCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeec
Confidence 223345667666667777643 34578888988877666654
No 91
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture.; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=96.41 E-value=0.25 Score=59.20 Aligned_cols=118 Identities=22% Similarity=0.296 Sum_probs=65.8
Q ss_pred cCCCceEEEE--ccCCcEEEeeC-CCCeEEEeec--C--CCceEEEEc--CCCCCcceeeecCCcceEEEeeCCCCceEE
Q psy6572 608 ATSPDGLTVD--WVGRNLYWCDK-GLDTIEVAKL--D--GRFRKVLIN--KGLQEPRGIALNPAYGYMYWTDWGQNAHIG 678 (1416)
Q Consensus 608 l~~p~gLAvD--~~~~~LYwtD~-~~~~I~v~~l--d--G~~~~vLi~--~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ 678 (1416)
+..|.||++- +.++.+|..-. ..+.++...| + |.....++. .....|.|+++|..+|+||+.+-. ..|+
T Consensus 155 ~~e~yGlcly~~~~~g~~ya~v~~k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~--~GIW 232 (381)
T PF02333_consen 155 LSEPYGLCLYRSPSTGALYAFVNGKDGRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEED--VGIW 232 (381)
T ss_dssp SSSEEEEEEEE-TTT--EEEEEEETTSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETT--TEEE
T ss_pred cccceeeEEeecCCCCcEEEEEecCCceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCc--cEEE
Confidence 5668899984 45677776433 2355665555 2 333333332 234579999999999999999943 4799
Q ss_pred EEecC---CCCCEEEeec---CC-CCCeeEEeecC---CCeEEEecCCCCeEEEEeCCC
Q psy6572 679 KAKMD---GSNPKVIISK---NL-SWPNALTISYE---TNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 679 ra~mD---Gs~r~vlv~~---~l-~~P~gLaiD~~---~~rLYWtD~~~~~I~~~~ldG 727 (1416)
+...+ +..++.|... .| .-.-||+|=+. .++|..++.+.+.....+..|
T Consensus 233 ~y~Aep~~~~~~~~v~~~~g~~l~aDvEGlaly~~~~g~gYLivSsQG~~sf~Vy~r~~ 291 (381)
T PF02333_consen 233 RYDAEPEGGNDRTLVASADGDGLVADVEGLALYYGSDGKGYLIVSSQGDNSFAVYDREG 291 (381)
T ss_dssp EEESSCCC-S--EEEEEBSSSSB-S-EEEEEEEE-CCC-EEEEEEEGGGTEEEEEESST
T ss_pred EEecCCCCCCcceeeecccccccccCccceEEEecCCCCeEEEEEcCCCCeEEEEecCC
Confidence 99887 3334444221 23 34568888332 245666665555444444443
No 92
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=96.23 E-value=0.14 Score=61.59 Aligned_cols=183 Identities=19% Similarity=0.220 Sum_probs=108.0
Q ss_pred eEEEEecEEEEEEecCCcc-----eEEeccc-----ccceeeeeecCCCeEEEeeccC-----------CCccEEEEecC
Q psy6572 541 NLLFTNKYYIREVTQAGVM-----TIRIHNQ-----TNAVGLDFDWVDNCLYWSDVTM-----------HGSSIRRSCNN 599 (1416)
Q Consensus 541 ~li~s~~~~I~~i~l~g~~-----~~~~~~l-----~~~~~l~~D~~~~~LYwtD~~~-----------~~~~I~r~~l~ 599 (1416)
.++++++..+.++. .+.. .+++..+ .....|.|++.+ +||++-... ..++|.|+...
T Consensus 142 ~~~~~n~~~~~~~~-~g~~~l~~~~~i~~~lP~~~~H~g~~l~f~pDG-~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a 219 (399)
T COG2133 142 GLYVANRVAIGRLP-GGDTKLSEPKVIFRGIPKGGHHFGGRLVFGPDG-KLYVTTGSNGDPALAQDNVSLAGKVLRIDRA 219 (399)
T ss_pred CceEEEEEEEEEcC-CCccccccccEEeecCCCCCCcCcccEEECCCC-cEEEEeCCCCCcccccCccccccceeeeccC
Confidence 34555666666666 2311 2333322 345678999877 999975443 12345554432
Q ss_pred ---------CCCeEEee-cCCCceEEEEccCCcEEEeeCCCCeE------EEee---------------cCCC------c
Q psy6572 600 ---------SQPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTI------EVAK---------------LDGR------F 642 (1416)
Q Consensus 600 ---------s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I------~v~~---------------ldG~------~ 642 (1416)
...++... +.+|.||++++.++.||.++.+...+ .+.. .+|. .
T Consensus 220 ~~~~~d~p~~~~~i~s~G~RN~qGl~w~P~tg~Lw~~e~g~d~~~~~Deln~i~~G~nYGWP~~~~G~~~~g~~~~~~~~ 299 (399)
T COG2133 220 GIIPADNPFPNSEIWSYGHRNPQGLAWHPVTGALWTTEHGPDALRGPDELNSIRPGKNYGWPYAYFGQNYDGRAIPDGTV 299 (399)
T ss_pred cccccCCCCCCcceEEeccCCccceeecCCCCcEEEEecCCCcccCcccccccccCCccCCceeccCcccCccccCCCcc
Confidence 22334444 89999999999999999999887222 1110 1111 1
Q ss_pred eEEEEc-----CCCCCcceeeecCC------cceEEEeeCCCCceEEEEecCCCCCEE---EeecC-CCCCeeEEeecCC
Q psy6572 643 RKVLIN-----KGLQEPRGIALNPA------YGYMYWTDWGQNAHIGKAKMDGSNPKV---IISKN-LSWPNALTISYET 707 (1416)
Q Consensus 643 ~~vLi~-----~~l~~P~gIavDp~------~g~LYWtD~g~~~~I~ra~mDGs~r~v---lv~~~-l~~P~gLaiD~~~ 707 (1416)
...++. ....-|.||++-.- ++.||++--+.. .+.+...+|..+.+ ++... -..|.++++.+ .
T Consensus 300 ~~~~~~p~~~~~~h~ApsGmaFy~G~~fP~~r~~lfV~~hgsw-~~~~~~~~g~~~~~~~~fl~~d~~gR~~dV~v~~-D 377 (399)
T COG2133 300 VAGAIQPVYTWAPHIAPSGMAFYTGDLFPAYRGDLFVGAHGSW-PVLRLRPDGNYKVVLTGFLSGDLGGRPRDVAVAP-D 377 (399)
T ss_pred cccccCCceeeccccccceeEEecCCcCccccCcEEEEeecce-eEEEeccCCCcceEEEEEEecCCCCcccceEECC-C
Confidence 111111 11223577777532 257787765543 56678888884433 33322 26899999985 7
Q ss_pred CeEEEecCC-CCeEEEEeCCC
Q psy6572 708 NELFWGDAH-EDYIAVSDLNG 727 (1416)
Q Consensus 708 ~rLYWtD~~-~~~I~~~~ldG 727 (1416)
+.||+++-. .++|+|+.+.+
T Consensus 378 Gallv~~D~~~g~i~Rv~~~~ 398 (399)
T COG2133 378 GALLVLTDQGDGRILRVSYAG 398 (399)
T ss_pred CeEEEeecCCCCeEEEecCCC
Confidence 777777655 66999998765
No 93
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=96.20 E-value=0.84 Score=51.85 Aligned_cols=217 Identities=13% Similarity=0.110 Sum_probs=113.5
Q ss_pred CCeEEEEec-EEEEEEecCCcc-eEEecccccceeee--eecCCCeEEEeeccCCCccEEEEecC--CCCeEEeecCCC-
Q psy6572 539 PPNLLFTNK-YYIREVTQAGVM-TIRIHNQTNAVGLD--FDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFPATSP- 611 (1416)
Q Consensus 539 ~~~li~s~~-~~I~~i~l~g~~-~~~~~~l~~~~~l~--~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~l~~p- 611 (1416)
+.++++++. ..|+.+++.... ..+...+ +..|.+ ++..++.+|++|... -+..+++. +.+++......|
T Consensus 96 e~yvyvad~ssGL~IvDIS~P~sP~~~~~l-nt~gyaygv~vsGn~aYVadldd---gfLivdvsdpssP~lagrya~~~ 171 (370)
T COG5276 96 EEYVYVADWSSGLRIVDISTPDSPTLIGFL-NTDGYAYGVYVSGNYAYVADLDD---GFLIVDVSDPSSPQLAGRYALPG 171 (370)
T ss_pred ccEEEEEcCCCceEEEeccCCCCcceeccc-cCCceEEEEEecCCEEEEeeccC---cEEEEECCCCCCceeeeeeccCC
Confidence 456666653 346666665444 1122212 222332 344688899999864 34445555 344433333333
Q ss_pred ---ceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC-
Q psy6572 612 ---DGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP- 687 (1416)
Q Consensus 612 ---~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r- 687 (1416)
+.+|| .++.-|.+.+.. .+.+.+......-+|+..--..|..-.|-+...+.|.++.++ .+.-.+.+|...
T Consensus 172 ~d~~~v~I--SGn~AYvA~~d~-GL~ivDVSnp~sPvli~~~n~g~g~~sv~vsdnr~y~vvy~e--gvlivd~s~~ssp 246 (370)
T COG5276 172 GDTHDVAI--SGNYAYVAWRDG-GLTIVDVSNPHSPVLIGSYNTGPGTYSVSVSDNRAYLVVYDE--GVLIVDVSGPSSP 246 (370)
T ss_pred CCceeEEE--ecCeEEEEEeCC-CeEEEEccCCCCCeEEEEEecCCceEEEEecCCeeEEEEccc--ceEEEecCCCCCc
Confidence 34555 366667665543 334444444444455543323344444444556778887654 355555555543
Q ss_pred EEEeecCCCCCeeE-EeecCCCeEEEecCCCC--eEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCC
Q psy6572 688 KVIISKNLSWPNAL-TISYETNELFWGDAHED--YIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEM 764 (1416)
Q Consensus 688 ~vlv~~~l~~P~gL-aiD~~~~rLYWtD~~~~--~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~ 764 (1416)
+++-+-+-..|.++ ++-..++++|++|...+ .|-..+..+....--+.. .-.+..+|.++++++|++|..+
T Consensus 247 ~~~gsyet~~p~~~s~v~Vs~~~~Yvadga~gl~~idisnp~spfl~ss~~t------~g~~a~gi~ay~~y~yiadkn~ 320 (370)
T COG5276 247 TVFGSYETSNPVSISTVPVSGEYAYVADGAKGLPIIDISNPPSPFLSSSLDT------AGYQAAGIRAYGNYNYIADKNT 320 (370)
T ss_pred eEeeccccCCcccccceecccceeeeeccccCceeEeccCCCCCchhccccC------CCccccceEEecCeeEeccCCc
Confidence 33333345556555 22236899999986432 333333333221111110 0225678999999999999987
Q ss_pred CeeEEe
Q psy6572 765 KSIERC 770 (1416)
Q Consensus 765 ~~I~~~ 770 (1416)
+.|.-+
T Consensus 321 g~vV~~ 326 (370)
T COG5276 321 GAVVDA 326 (370)
T ss_pred eEEEeC
Confidence 776543
No 94
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=96.08 E-value=1.2 Score=50.57 Aligned_cols=179 Identities=15% Similarity=0.179 Sum_probs=96.8
Q ss_pred eeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEeecCCCceEE--EEccCCcEEEeeCCCCeEEEeecCCCceEEEE
Q psy6572 572 LDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFPATSPDGLT--VDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLI 647 (1416)
Q Consensus 572 l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~l~~p~gLA--vD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi 647 (1416)
.++-..++.+|++|... -++.+.+. .+++.+..+.. .|+| |+-.++.+|++|+..+ ..+.++.....-+|.
T Consensus 90 ~Dv~vse~yvyvad~ss---GL~IvDIS~P~sP~~~~~lnt-~gyaygv~vsGn~aYVadlddg-fLivdvsdpssP~la 164 (370)
T COG5276 90 ADVRVSEEYVYVADWSS---GLRIVDISTPDSPTLIGFLNT-DGYAYGVYVSGNYAYVADLDDG-FLIVDVSDPSSPQLA 164 (370)
T ss_pred heeEecccEEEEEcCCC---ceEEEeccCCCCcceeccccC-CceEEEEEecCCEEEEeeccCc-EEEEECCCCCCceee
Confidence 45556788999999875 35555555 33333333221 1333 3336888999998544 333444433333333
Q ss_pred cCCCCCcc----eeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEE
Q psy6572 648 NKGLQEPR----GIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVS 723 (1416)
Q Consensus 648 ~~~l~~P~----gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~ 723 (1416)
.....|. .++|. ..+-|++.|+.. +...+......-+|+..--..|.--++-...+|.|.++...+ +..+
T Consensus 165 -grya~~~~d~~~v~IS--Gn~AYvA~~d~G--L~ivDVSnp~sPvli~~~n~g~g~~sv~vsdnr~y~vvy~eg-vliv 238 (370)
T COG5276 165 -GRYALPGGDTHDVAIS--GNYAYVAWRDGG--LTIVDVSNPHSPVLIGSYNTGPGTYSVSVSDNRAYLVVYDEG-VLIV 238 (370)
T ss_pred -eeeccCCCCceeEEEe--cCeEEEEEeCCC--eEEEEccCCCCCeEEEEEecCCceEEEEecCCeeEEEEcccc-eEEE
Confidence 2222233 35553 346677766543 444455444444555432222333344446788888886543 4555
Q ss_pred eCCCCceEEEEeccCCCCcccccceeE---EEecCcEEEeecCCCe
Q psy6572 724 DLNGENIKIIVSRRMDPTINLHHVFAL---AVFEDHLFWTDWEMKS 766 (1416)
Q Consensus 724 ~ldG~~r~~v~~~~~~p~~~l~~P~~l---av~~d~LYwtD~~~~~ 766 (1416)
+.+|...-+++..- ....|.++ .|.+.+.|++|...+-
T Consensus 239 d~s~~ssp~~~gsy-----et~~p~~~s~v~Vs~~~~Yvadga~gl 279 (370)
T COG5276 239 DVSGPSSPTVFGSY-----ETSNPVSISTVPVSGEYAYVADGAKGL 279 (370)
T ss_pred ecCCCCCceEeecc-----ccCCcccccceecccceeeeeccccCc
Confidence 66664433333321 14556665 7889999999976553
No 95
>KOG3509|consensus
Probab=96.05 E-value=0.0046 Score=80.30 Aligned_cols=105 Identities=37% Similarity=0.731 Sum_probs=91.6
Q ss_pred ceecCCcccCCCCCCCCCCCccccccccCCCCCcCCCCCCcccCCCceecCceeecCCCCCCCCCCCCCCCCCCCC---C
Q psy6572 125 ACIEESYICDGQNDCFDMSDEQNCDQIKDVSPKMNCSGDKFLCRNGNCILSRWRCDGDNDCNDGNDGLSSDEMNCD---T 201 (1416)
Q Consensus 125 ~CI~~~~~CDg~~DC~D~sDE~~C~~~~~~~~~~~C~~~~f~C~~g~CI~~~~~CDg~~DC~Dg~d~~~sDE~~C~---~ 201 (1416)
.|....+.|++..|+.+.+|+.+++. ....+.+++|.|.++++....|.||.+.+++.+ +.+.+|. +
T Consensus 2 ~c~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 71 (964)
T KOG3509|consen 2 ECVKNRYACDRQPDCRDRSDVANDPA-----IGSACSPNEFKCNNPRCVQPEALLDADSTCGPN-----STPSGCNAKPS 71 (964)
T ss_pred chhhhhhhhccchhhHhhcccCCCcc-----ccccCCcchhccCCccccCchhhhccccccCCC-----CCcCCcccccc
Confidence 47778899999999999999998863 235688899999999999999999999999999 7777775 2
Q ss_pred CCccCCCCCceecCCCCceecCcccccCCCCCCCCCCCCC
Q psy6572 202 ESTCKANNNVFQCDNNKTCISKSWVCDGTYDCTDRSDENS 241 (1416)
Q Consensus 202 ~~~C~~~~~~F~C~~~~~CI~~~w~CDg~~DC~D~sDE~~ 241 (1416)
...|.+ .+++|.+..++-+.+..|||.+||.|+++|..
T Consensus 72 ~s~~~~--~~~~c~~~~~~~~~~~~~~g~~~~~~~~~~~~ 109 (964)
T KOG3509|consen 72 ASDCKP--TETQCRDRLRCNPQSFQCDGTNDCKDGSDEVG 109 (964)
T ss_pred ccccCC--cccccccchhcCCccccccCCCCCCccchhcc
Confidence 356777 89999666799999999999999999999976
No 96
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=96.03 E-value=1.1 Score=49.91 Aligned_cols=169 Identities=18% Similarity=0.252 Sum_probs=87.4
Q ss_pred cCCCceEEE--EccCCcEEEeeCC-CCeEEEeec----CCCceEEEEc--CCCCCcceeeecCCcceEEEeeCCCCceEE
Q psy6572 608 ATSPDGLTV--DWVGRNLYWCDKG-LDTIEVAKL----DGRFRKVLIN--KGLQEPRGIALNPAYGYMYWTDWGQNAHIG 678 (1416)
Q Consensus 608 l~~p~gLAv--D~~~~~LYwtD~~-~~~I~v~~l----dG~~~~vLi~--~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ 678 (1416)
+..|.||++ +++++-+|+--.+ .+-|....| +|..+..++. .--.+-.|+++|-.+|.||++.- .-.||
T Consensus 152 ~s~~YGl~lyrs~ktgd~yvfV~~~qG~~~Qy~l~d~gnGkv~~k~vR~fk~~tQTEG~VaDdEtG~LYIaeE--dvaiW 229 (364)
T COG4247 152 SSSAYGLALYRSPKTGDYYVFVNRRQGDIAQYKLIDQGNGKVGTKLVRQFKIPTQTEGMVADDETGFLYIAEE--DVAIW 229 (364)
T ss_pred cccceeeEEEecCCcCcEEEEEecCCCceeEEEEEecCCceEcceeeEeeecCCcccceeeccccceEEEeec--cceee
Confidence 678899988 4566666654333 355655554 3444433442 11235679999999999999973 34688
Q ss_pred EEecC---CCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecC
Q psy6572 679 KAKMD---GSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFED 755 (1416)
Q Consensus 679 ra~mD---Gs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d 755 (1416)
+...+ |..+++|-. +..-..|+-|...=.||+..-+.+.+.. .-.|.+.-.++... ++
T Consensus 230 K~~Aep~~G~~g~~idr--~~d~~~LtdDvEGltiYy~pnGkGYL~a-SSQGnNtya~y~Re----------------G~ 290 (364)
T COG4247 230 KYEAEPNRGNTGRLIDR--IKDLSYLTDDVEGLTIYYGPNGKGYLLA-SSQGNNTYAAYTRE----------------GN 290 (364)
T ss_pred ecccCCCCCCccchhhh--hcCchhhcccccccEEEEcCCCcEEEEE-ecCCCceEEEEEee----------------CC
Confidence 76554 333443322 1111245555544456665544333221 12333333333322 12
Q ss_pred cEEEeec---CCCeeEEecccCCCceEEEEeCCCCCCeeeeeeccc
Q psy6572 756 HLFWTDW---EMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVYHPY 798 (1416)
Q Consensus 756 ~LYwtD~---~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~h~~ 798 (1416)
.-|+... .+..|-.+...+|..+..+. .....|||+.|-+.-
T Consensus 291 N~YVgsF~vt~n~~iDg~setDG~DV~~~~-LGa~~p~G~FVaQDG 335 (364)
T COG4247 291 NDYVGSFGVTNNGAIDGVSETDGADVVNVP-LGANFPFGLFVAQDG 335 (364)
T ss_pred CceEEEEeeccCCccccccccCCcceeccc-cCCCCcceeEEeccC
Confidence 2232221 12344444445555544333 456689998886543
No 97
>PRK01742 tolB translocation protein TolB; Provisional
Probab=96.02 E-value=0.66 Score=57.51 Aligned_cols=178 Identities=9% Similarity=0.026 Sum_probs=105.9
Q ss_pred EEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 548 YYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
..|+.+++.+.. ..+........+++|.+.+++|+++-.......|+.+.++ +....+.. ......+++.+.++.|+
T Consensus 228 ~~i~i~dl~tg~~~~l~~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~~~~~~wSpDG~~i~ 307 (429)
T PRK01742 228 SQLVVHDLRSGARKVVASFRGHNGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTSGAGNNTEPSWSPDGQSIL 307 (429)
T ss_pred cEEEEEeCCCCceEEEecCCCccCceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeeccCCCCcCCEEECCCCCEEE
Confidence 357777776554 3333222334567888888888886433222357877776 44344433 33455677887778888
Q ss_pred EeeC--CCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEE
Q psy6572 625 WCDK--GLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALT 702 (1416)
Q Consensus 625 wtD~--~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLa 702 (1416)
++.. +..+|+++++++...+.+ .... ..+++.|..++|+++.. ..|.+.++.+...+.+... . .-..++
T Consensus 308 f~s~~~g~~~I~~~~~~~~~~~~l-~~~~---~~~~~SpDG~~ia~~~~---~~i~~~Dl~~g~~~~lt~~-~-~~~~~~ 378 (429)
T PRK01742 308 FTSDRSGSPQVYRMSASGGGASLV-GGRG---YSAQISADGKTLVMING---DNVVKQDLTSGSTEVLSST-F-LDESPS 378 (429)
T ss_pred EEECCCCCceEEEEECCCCCeEEe-cCCC---CCccCCCCCCEEEEEcC---CCEEEEECCCCCeEEecCC-C-CCCCce
Confidence 7754 345788888877665544 2221 34678888888887752 3577777765544433322 2 123456
Q ss_pred eecCCCeEEEecCC--CCeEEEEeCCCCceEEEE
Q psy6572 703 ISYETNELFWGDAH--EDYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 703 iD~~~~rLYWtD~~--~~~I~~~~ldG~~r~~v~ 734 (1416)
+.+....|+++... ...++.++++|...+.|.
T Consensus 379 ~sPdG~~i~~~s~~g~~~~l~~~~~~G~~~~~l~ 412 (429)
T PRK01742 379 ISPNGIMIIYSSTQGLGKVLQLVSADGRFKARLP 412 (429)
T ss_pred ECCCCCEEEEEEcCCCceEEEEEECCCCceEEcc
Confidence 77666677776542 223455567887766664
No 98
>KOG3509|consensus
Probab=95.95 E-value=0.0052 Score=79.82 Aligned_cols=104 Identities=32% Similarity=0.661 Sum_probs=91.9
Q ss_pred ceecCcccccCCCCCCCCCCCCCCcccCCCCCCCceeeCCCCCeEcCcccccCCCCCCCCCCCccccCCcc---cccccC
Q psy6572 219 TCISKSWVCDGTYDCTDRSDENSTYCAHSECNLFEFRCNSTGQCIPITWVCDGVTDCIDKSDEHHSQDCLN---VETCME 295 (1416)
Q Consensus 219 ~CI~~~w~CDg~~DC~D~sDE~~~~c~~~~C~~~~F~C~~~~~CI~~~w~CDG~~DC~DgsDE~~~~~C~~---~~~C~~ 295 (1416)
.|..+...|++..|+.+.||+.+..+..+.+++++|+| .++++.-..|.||.+..+..++++ .+|.. ...|.+
T Consensus 2 ~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~ 77 (964)
T KOG3509|consen 2 ECVKNRYACDRQPDCRDRSDVANDPAIGSACSPNEFKC-NNPRCVQPEALLDADSTCGPNSTP---SGCNAKPSASDCKP 77 (964)
T ss_pred chhhhhhhhccchhhHhhcccCCCccccccCCcchhcc-CCccccCchhhhccccccCCCCCc---CCccccccccccCC
Confidence 46677889999999999999998777778999999999 789999999999999999999976 45642 357889
Q ss_pred CeeecCCC-ceeccccccCCCCCCCCCCCccc
Q psy6572 296 GYFKCLNG-RCLLENYYCDGENDCGDNSDEPI 326 (1416)
Q Consensus 296 ~~f~C~~g-~CI~~~~~CDg~~DC~DgSDE~~ 326 (1416)
.+++|.+- ++-..+..|+|.+||.|+++|..
T Consensus 78 ~~~~c~~~~~~~~~~~~~~g~~~~~~~~~~~~ 109 (964)
T KOG3509|consen 78 TETQCRDRLRCNPQSFQCDGTNDCKDGSDEVG 109 (964)
T ss_pred cccccccchhcCCccccccCCCCCCccchhcc
Confidence 99999886 79999999999999999999973
No 99
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). The reaction is RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer comprising two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=95.67 E-value=5.6 Score=47.04 Aligned_cols=198 Identities=18% Similarity=0.206 Sum_probs=114.3
Q ss_pred eeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEEeeCCCCeEEEeecC--CCceEEE--
Q psy6572 571 GLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLD--GRFRKVL-- 646 (1416)
Q Consensus 571 ~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ld--G~~~~vL-- 646 (1416)
.+++...++.||+.+... ..+|.++++.. .+++..+..|--.-|-|....=|.+-=+.+++..+.|+ |+..+..
T Consensus 99 ~~~ls~dgk~~~V~N~TP-a~SVtVVDl~~-~kvv~ei~~PGC~~iyP~~~~~F~~lC~DGsl~~v~Ld~~Gk~~~~~t~ 176 (342)
T PF06433_consen 99 MFALSADGKFLYVQNFTP-ATSVTVVDLAA-KKVVGEIDTPGCWLIYPSGNRGFSMLCGDGSLLTVTLDADGKEAQKSTK 176 (342)
T ss_dssp GEEE-TTSSEEEEEEESS-SEEEEEEETTT-TEEEEEEEGTSEEEEEEEETTEEEEEETTSCEEEEEETSTSSEEEEEEE
T ss_pred ceEEccCCcEEEEEccCC-CCeEEEEECCC-CceeeeecCCCEEEEEecCCCceEEEecCCceEEEEECCCCCEeEeecc
Confidence 345556788888888775 35777777762 23333344454433334443445565667888888877 4432111
Q ss_pred EcCCCCCcceeeecC----CcceEEEeeCCCCceEEEEecCCCCCEEEeecCC--------CC-Ce---eEEeecCCCeE
Q psy6572 647 INKGLQEPRGIALNP----AYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNL--------SW-PN---ALTISYETNEL 710 (1416)
Q Consensus 647 i~~~l~~P~gIavDp----~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l--------~~-P~---gLaiD~~~~rL 710 (1416)
+-.....| |..+| ..+++||+.+. ..|+.+++.|...+....-.+ .| |- -+|++...++|
T Consensus 177 ~F~~~~dp--~f~~~~~~~~~~~~~F~Sy~--G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rl 252 (342)
T PF06433_consen 177 VFDPDDDP--LFEHPAYSRDGGRLYFVSYE--GNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRL 252 (342)
T ss_dssp ESSTTTS---B-S--EEETTTTEEEEEBTT--SEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEE
T ss_pred ccCCCCcc--cccccceECCCCeEEEEecC--CEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeE
Confidence 11112222 33333 45678888754 479999999987655433221 22 32 49999999999
Q ss_pred EEecC----CC-----CeEEEEeCCCCceEEEEeccCCCCcccccce-eEEEecC---cEEEeecCCCeeEEecccCCCc
Q psy6572 711 FWGDA----HE-----DYIAVSDLNGENIKIIVSRRMDPTINLHHVF-ALAVFED---HLFWTDWEMKSIERCDKYTGKN 777 (1416)
Q Consensus 711 YWtD~----~~-----~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~-~lav~~d---~LYwtD~~~~~I~~~nk~tG~~ 777 (1416)
|+.-. .+ ..|+.+++....|..-+. |.+|. +|+|..+ .||-++...+.|...+..+|+.
T Consensus 253 yvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~--------l~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tGk~ 324 (342)
T PF06433_consen 253 YVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIP--------LEHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAATGKL 324 (342)
T ss_dssp EEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEE--------EEEEESEEEEESSSS-EEEEEETTTTEEEEEETTT--E
T ss_pred EEEecCCCCCCccCCceEEEEEECCCCeEEEEEe--------CCCccceEEEccCCCcEEEEEcCCCCeEEEEeCcCCcE
Confidence 98731 11 157777765544433332 55565 7888533 5777788888888888888876
Q ss_pred eEEEE
Q psy6572 778 CTSVV 782 (1416)
Q Consensus 778 ~~~l~ 782 (1416)
+..+-
T Consensus 325 ~~~~~ 329 (342)
T PF06433_consen 325 VRSIE 329 (342)
T ss_dssp EEEE-
T ss_pred Eeehh
Confidence 66653
No 100
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes (2.3.2.5 from EC) catalyses the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues, respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=95.61 E-value=1.3 Score=50.46 Aligned_cols=174 Identities=16% Similarity=0.122 Sum_probs=100.4
Q ss_pred eEEEEe----cEEEEEEecCCcc---eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee---cCC
Q psy6572 541 NLLFTN----KYYIREVTQAGVM---TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP---ATS 610 (1416)
Q Consensus 541 ~li~s~----~~~I~~i~l~g~~---~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~---l~~ 610 (1416)
.|+-+. +..|+++++.... ...+...-...||++ .+++||..-...+ ..++++.++ .+.+.. ...
T Consensus 57 ~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~--~~d~l~qLTWk~~--~~f~yd~~t-l~~~~~~~y~~E 131 (264)
T PF05096_consen 57 TLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGITI--LGDKLYQLTWKEG--TGFVYDPNT-LKKIGTFPYPGE 131 (264)
T ss_dssp EEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEEEE--ETTEEEEEESSSS--EEEEEETTT-TEEEEEEE-SSS
T ss_pred EEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeEEE--ECCEEEEEEecCC--eEEEEcccc-ceEEEEEecCCc
Confidence 445544 3468889887655 233333334677776 3778888887754 667776652 233333 557
Q ss_pred CceEEEEccCCcEEEeeCCCCeEEEeecCCCc--eEEEEc-CC--CCCcceeeecCCcceEEEeeCCCCceEEEEecCCC
Q psy6572 611 PDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF--RKVLIN-KG--LQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGS 685 (1416)
Q Consensus 611 p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~--~~vLi~-~~--l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs 685 (1416)
..||+.| +..||.+|. +.+|...+...-. +++-+. .+ +..-+- |...+|+||---|.. ..|.++++.-.
T Consensus 132 GWGLt~d--g~~Li~SDG-S~~L~~~dP~~f~~~~~i~V~~~g~pv~~LNE--LE~i~G~IyANVW~t-d~I~~Idp~tG 205 (264)
T PF05096_consen 132 GWGLTSD--GKRLIMSDG-SSRLYFLDPETFKEVRTIQVTDNGRPVSNLNE--LEYINGKIYANVWQT-DRIVRIDPETG 205 (264)
T ss_dssp --EEEEC--SSCEEEE-S-SSEEEEE-TTT-SEEEEEE-EETTEE---EEE--EEEETTEEEEEETTS-SEEEEEETTT-
T ss_pred ceEEEcC--CCEEEEECC-ccceEEECCcccceEEEEEEEECCEECCCcEe--EEEEcCEEEEEeCCC-CeEEEEeCCCC
Confidence 7899966 677887775 5678877765322 222222 11 122222 333578888777864 58999988755
Q ss_pred CCEEEee-cC--------------CCCCeeEEeecCCCeEEEecCCCCeEEEEeC
Q psy6572 686 NPKVIIS-KN--------------LSWPNALTISYETNELFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 686 ~r~vlv~-~~--------------l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l 725 (1416)
...-++. +. ..--||||.|+..++||++--.=.+++.+.+
T Consensus 206 ~V~~~iDls~L~~~~~~~~~~~~~~dVLNGIAyd~~~~~l~vTGK~Wp~lyeV~l 260 (264)
T PF05096_consen 206 KVVGWIDLSGLRPEVGRDKSRQPDDDVLNGIAYDPETDRLFVTGKLWPKLYEVKL 260 (264)
T ss_dssp BEEEEEE-HHHHHHHTSTTST--TTS-EEEEEEETTTTEEEEEETT-SEEEEEEE
T ss_pred eEEEEEEhhHhhhcccccccccccCCeeEeEeEeCCCCEEEEEeCCCCceEEEEE
Confidence 5444442 11 1235899999999999998765566766654
No 101
>KOG4260|consensus
Probab=95.58 E-value=0.0075 Score=66.29 Aligned_cols=73 Identities=30% Similarity=0.681 Sum_probs=53.5
Q ss_pred CCCCc-ccccccCCCCCCcccccceecCCceEEeeCCCceecCCCCCccccCCcCC--CCCcc---ceeeecCCeeeecC
Q psy6572 446 DEFSC-FVNECNVSHGGQLCAHECIDLKIGYKCACRKGYQVHPEDKHLCVDTNECL--DRPCS---HYCRNTLGSYSCSC 519 (1416)
Q Consensus 446 De~~C-~i~eC~~~~~~~~Cs~~C~nt~~gy~C~C~~Gy~L~p~d~~tC~didEC~--~~~Cs---q~C~nt~gsy~C~C 519 (1416)
||..| +||||...+....-.|.|+|+.|+|+|.+++||+- ++++|+ ...|. ..|.|+.++|+|.|
T Consensus 229 de~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~---------g~d~C~~~~d~~~~kn~~c~ni~~~~r~v~ 299 (350)
T KOG4260|consen 229 DEEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKK---------GVDECQFCADVCASKNRPCMNIDGQYRCVC 299 (350)
T ss_pred cccccccHHHHhcCCCCCChhheeecCCCceEecccccccC---------ChHHhhhhhhhcccCCCCcccCCccEEEEe
Confidence 67778 89999765555112468999999999999999963 245554 12232 25799999999999
Q ss_pred CCCcEEec
Q psy6572 520 APGYALLS 527 (1416)
Q Consensus 520 ~~Gy~L~~ 527 (1416)
..|+....
T Consensus 300 f~~~~~~~ 307 (350)
T KOG4260|consen 300 FSGLIIIE 307 (350)
T ss_pred cccceeee
Confidence 99987543
No 102
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=95.43 E-value=5.1 Score=48.20 Aligned_cols=126 Identities=13% Similarity=0.122 Sum_probs=86.9
Q ss_pred eCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecC
Q psy6572 627 DKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYE 706 (1416)
Q Consensus 627 D~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~ 706 (1416)
......|.+.+.+|...+.+. .++....+++|++...++-+++ .+..|+.++++..+.+++-.+.-....++++.+.
T Consensus 378 t~dgD~l~iyd~~~~e~kr~e-~~lg~I~av~vs~dGK~~vvaN--dr~el~vididngnv~~idkS~~~lItdf~~~~n 454 (668)
T COG4946 378 TNDGDKLGIYDKDGGEVKRIE-KDLGNIEAVKVSPDGKKVVVAN--DRFELWVIDIDNGNVRLIDKSEYGLITDFDWHPN 454 (668)
T ss_pred ccCCceEEEEecCCceEEEee-CCccceEEEEEcCCCcEEEEEc--CceEEEEEEecCCCeeEecccccceeEEEEEcCC
Confidence 333458889999988777665 7888999999999977777775 3457999999998888887766555667777654
Q ss_pred CCeEEEecC---CCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeec
Q psy6572 707 TNELFWGDA---HEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDW 762 (1416)
Q Consensus 707 ~~rLYWtD~---~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~ 762 (1416)
...|-++=. .+..|...+++|...-.+.+. ..+-|+-|+ .+.+||+...
T Consensus 455 sr~iAYafP~gy~tq~Iklydm~~~Kiy~vTT~-------ta~DfsPaFD~d~ryLYfLs~ 508 (668)
T COG4946 455 SRWIAYAFPEGYYTQSIKLYDMDGGKIYDVTTP-------TAYDFSPAFDPDGRYLYFLSA 508 (668)
T ss_pred ceeEEEecCcceeeeeEEEEecCCCeEEEecCC-------cccccCcccCCCCcEEEEEec
Confidence 444433322 245778888888655544432 345555555 4678888654
No 103
>cd01475 vWA_Matrilin VWA_Matrilin: In the cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=95.41 E-value=0.0097 Score=66.91 Aligned_cols=43 Identities=40% Similarity=0.891 Sum_probs=38.1
Q ss_pred CCCccccCCcCC--CCCccceeeecCCeeeecCCCCcEEecCCCc
Q psy6572 489 DKHLCVDTNECL--DRPCSHYCRNTLGSYSCSCAPGYALLSDKHG 531 (1416)
Q Consensus 489 d~~tC~didEC~--~~~Csq~C~nt~gsy~C~C~~Gy~L~~dg~s 531 (1416)
.++.|.+++||. +..|.|.|.|+.|+|.|.|++||.|.+++++
T Consensus 180 ~~~~C~~~~~C~~~~~~c~~~C~~~~g~~~c~c~~g~~~~~~~~~ 224 (224)
T cd01475 180 QGKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224 (224)
T ss_pred ccccCcCchhhcCCCCCccceEEcCCCCEEeECCCCccCCCCCCC
Confidence 356799999997 5789999999999999999999999988764
No 104
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=95.39 E-value=0.013 Score=46.11 Aligned_cols=35 Identities=51% Similarity=1.291 Sum_probs=28.3
Q ss_pred cCCcCCC-CCccc--eeeecCCeeeecCCCCcEEecCCCce
Q psy6572 495 DTNECLD-RPCSH--YCRNTLGSYSCSCAPGYALLSDKHGC 532 (1416)
Q Consensus 495 didEC~~-~~Csq--~C~nt~gsy~C~C~~Gy~L~~dg~sC 532 (1416)
+++||.. .+|.+ +|+++.|+|+|.|++||. ++++|
T Consensus 1 d~~~C~~~~~C~~~~~C~~~~g~~~C~C~~g~~---~g~~C 38 (39)
T smart00179 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNC 38 (39)
T ss_pred CcccCcCCCCcCCCCEeECCCCCeEeECCCCCc---cCCcC
Confidence 4677875 67877 899999999999999997 45555
No 105
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=95.20 E-value=1.5 Score=54.07 Aligned_cols=117 Identities=15% Similarity=0.127 Sum_probs=70.9
Q ss_pred cCCCeEEEeeccCCC-ccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEEEeeCCC--CeEEEeecCCCceEEEEcCC
Q psy6572 576 WVDNCLYWSDVTMHG-SSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLYWCDKGL--DTIEVAKLDGRFRKVLINKG 650 (1416)
Q Consensus 576 ~~~~~LYwtD~~~~~-~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~--~~I~v~~ldG~~~~vLi~~~ 650 (1416)
+...+|.++.....+ .+|++..++ +...++++ ...-...++-+-+++|.++-... ..|++++++|+..+.|...
T Consensus 202 ~~~~~~~y~~f~~~~~~~i~~~~l~~g~~~~i~~~~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~~- 280 (425)
T COG0823 202 PDGKKLAYVSFELGGCPRIYYLDLNTGKRPVILNFNGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTNG- 280 (425)
T ss_pred cCCCceEEEEEecCCCceEEEEeccCCccceeeccCCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceecccC-
Confidence 344444444443333 468888888 55556655 44444556666677887776654 4799999999885554321
Q ss_pred CCCcceeeecCCcceE-EEeeCCCCceEEEEecCCCCCEEEeec
Q psy6572 651 LQEPRGIALNPAYGYM-YWTDWGQNAHIGKAKMDGSNPKVIISK 693 (1416)
Q Consensus 651 l~~P~gIavDp~~g~L-YWtD~g~~~~I~ra~mDGs~r~vlv~~ 693 (1416)
...-..=.+-|...+| |.+|.+..+.|++++++|+..+.+...
T Consensus 281 ~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~riT~~ 324 (425)
T COG0823 281 FGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTRLTFS 324 (425)
T ss_pred CccccCccCCCCCCEEEEEeCCCCCcceEEECCCCCceeEeecc
Confidence 1111122334444555 456666778999999999987655543
No 106
>KOG1446|consensus
Probab=95.19 E-value=8 Score=44.60 Aligned_cols=181 Identities=13% Similarity=0.117 Sum_probs=104.6
Q ss_pred cEEEEecC-CC-CeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEE
Q psy6572 592 SIRRSCNN-SQ-PELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYW 668 (1416)
Q Consensus 592 ~I~r~~l~-s~-~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYW 668 (1416)
.|+.+.+. .+ .+.+.. -..+.+|++-|+. ..|.+-+..++|..-+|.-.....++ .+..+--+|+||. |.||.
T Consensus 81 tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~-d~FlS~S~D~tvrLWDlR~~~cqg~l--~~~~~pi~AfDp~-GLifA 156 (311)
T KOG1446|consen 81 TIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKD-DTFLSSSLDKTVRLWDLRVKKCQGLL--NLSGRPIAAFDPE-GLIFA 156 (311)
T ss_pred ceEEEEeecCceEEEcCCCCceEEEEEecCCC-CeEEecccCCeEEeeEecCCCCceEE--ecCCCcceeECCC-CcEEE
Confidence 56666665 22 233333 5667889999877 78888888889999998866666665 3455677899987 88887
Q ss_pred eeCCCCceEEEEecC--CC--CCEEEee-cCCCCCeeEEeecCCCeEEEecCCCCeEEEEe-CCCCceEEEEeccCCCCc
Q psy6572 669 TDWGQNAHIGKAKMD--GS--NPKVIIS-KNLSWPNALTISYETNELFWGDAHEDYIAVSD-LNGENIKIIVSRRMDPTI 742 (1416)
Q Consensus 669 tD~g~~~~I~ra~mD--Gs--~r~vlv~-~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~-ldG~~r~~v~~~~~~p~~ 742 (1416)
+-.+.. .|...++. +. .+...+. .....-+.|.+.+..+.|.++ ...+.|..++ ++|+-...+.... .
T Consensus 157 ~~~~~~-~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLls-T~~s~~~~lDAf~G~~~~tfs~~~---~- 230 (311)
T KOG1446|consen 157 LANGSE-LIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLS-TNASFIYLLDAFDGTVKSTFSGYP---N- 230 (311)
T ss_pred EecCCC-eEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEE-eCCCcEEEEEccCCcEeeeEeecc---C-
Confidence 765554 55554442 22 2333333 223334566665433333333 3345555555 6777444443221 1
Q ss_pred ccccceeEEE-ecCcEEEeecCCCeeEEecccCCCceEEEE
Q psy6572 743 NLHHVFALAV-FEDHLFWTDWEMKSIERCDKYTGKNCTSVV 782 (1416)
Q Consensus 743 ~l~~P~~lav-~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~ 782 (1416)
...-|.+-++ -++..+.+-...++|..-+..+|..+.++.
T Consensus 231 ~~~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~~v~~~~ 271 (311)
T KOG1446|consen 231 AGNLPLSATFTPDSKFVLSGSDDGTIHVWNLETGKKVAVLR 271 (311)
T ss_pred CCCcceeEEECCCCcEEEEecCCCcEEEEEcCCCcEeeEec
Confidence 1222322222 355555556666777777777777665554
No 107
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C-terminal alpha-amidation of biological peptides. In many it occurs in tandem arrays, for example in the RING finger, B-box, coiled-coil (RBCC) eukaryotic growth regulators. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller. The NHL repeats are also found in serine/threonine protein kinases (STPKs) in a diverse range of pathogenic bacteria. These STPKs are transmembrane receptors with an intracellular N-terminal kinase domain and an extracellular C-terminal sensor domain. In the STPK PknD from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed beta-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=95.18 E-value=0.031 Score=41.10 Aligned_cols=27 Identities=11% Similarity=0.492 Sum_probs=24.1
Q ss_pred CCCCeeEEeecCCCeEEEecCCCCeEEE
Q psy6572 695 LSWPNALTISYETNELFWGDAHEDYIAV 722 (1416)
Q Consensus 695 l~~P~gLaiD~~~~rLYWtD~~~~~I~~ 722 (1416)
|.+|.||+++ .++.||++|.++++|..
T Consensus 1 f~~P~gvav~-~~g~i~VaD~~n~rV~v 27 (28)
T PF01436_consen 1 FNYPHGVAVD-SDGNIYVADSGNHRVQV 27 (28)
T ss_dssp BSSEEEEEEE-TTSEEEEEECCCTEEEE
T ss_pred CcCCcEEEEe-CCCCEEEEECCCCEEEE
Confidence 4689999999 79999999999998875
No 108
>KOG2397|consensus
Probab=95.05 E-value=0.018 Score=68.83 Aligned_cols=66 Identities=36% Similarity=0.618 Sum_probs=57.8
Q ss_pred Cceeec--ceEeccceecCCcCCCCCCCCCCCCCCCCccCCCceecCC----CceecCCCCCCCCCCCCCCCCCC
Q psy6572 976 NEFQCD--VKCISLALVCDKVFDCLDRSDEPADCTSQTCGPDYIRCDT----GRCIPKTWQCDGDVDCPNREDEP 1044 (1416)
Q Consensus 976 ~~f~C~--~~Ci~~~~~CDg~~dC~d~sDE~~~C~~~~C~~~~f~C~~----g~Ci~~~~~CDg~~DC~dgsDE~ 1044 (1416)
..|+|. ..-|+.+.+=|..-||.|||||+ ....|+...|+|.| ..=||.+.+=||+-||-|||||.
T Consensus 43 ~~~~CLdgs~~i~f~qlNDd~CDC~DGsDEP---GtsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~ 114 (480)
T KOG2397|consen 43 SMFKCLDGSKTISFSQLNDDSCDCLDGSDEP---GTSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEY 114 (480)
T ss_pred cceeeccCCcccCHHHhccccccCCCCCCCC---ccccCCCCceeeeecCCCceeeechhccCcccccccCCCCc
Confidence 368887 57899999999999999999996 45568899999986 46888999999999999999995
No 109
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C-terminal alpha-amidation of biological peptides. In many it occurs in tandem arrays, for example in the RING finger, B-box, coiled-coil (RBCC) eukaryotic growth regulators. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller. The NHL repeats are also found in serine/threonine protein kinases (STPKs) in a diverse range of pathogenic bacteria. These STPKs are transmembrane receptors with an intracellular N-terminal kinase domain and an extracellular C-terminal sensor domain. In the STPK PknD from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed beta-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=95.04 E-value=0.037 Score=40.71 Aligned_cols=28 Identities=32% Similarity=0.552 Sum_probs=24.9
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEe
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVA 636 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~ 636 (1416)
+..|.|||+| ..++||++|.+.++|.+.
T Consensus 1 f~~P~gvav~-~~g~i~VaD~~n~rV~vf 28 (28)
T PF01436_consen 1 FNYPHGVAVD-SDGNIYVADSGNHRVQVF 28 (28)
T ss_dssp BSSEEEEEEE-TTSEEEEEECCCTEEEEE
T ss_pred CcCCcEEEEe-CCCCEEEEECCCCEEEEC
Confidence 4579999999 899999999999998763
No 110
>PRK02888 nitrous-oxide reductase; Validated
Probab=94.92 E-value=0.47 Score=59.83 Aligned_cols=182 Identities=10% Similarity=0.002 Sum_probs=93.9
Q ss_pred cCCCeEEEeeccCCCccEEEEecC--CCCeE--EeecCCCceEEEE--ccCCc-----------------EEEeeCCCCe
Q psy6572 576 WVDNCLYWSDVTMHGSSIRRSCNN--SQPEL--LFPATSPDGLTVD--WVGRN-----------------LYWCDKGLDT 632 (1416)
Q Consensus 576 ~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~--l~~l~~p~gLAvD--~~~~~-----------------LYwtD~~~~~ 632 (1416)
+.++.||+-|..+ .+|-|++++ ...++ |.......|+++. +.++. |+.+....+.
T Consensus 139 ydGr~~findk~n--~Rvari~l~~~~~~~i~~iPn~~~~Hg~~~~~~p~t~yv~~~~e~~~PlpnDGk~l~~~~ey~~~ 216 (635)
T PRK02888 139 YDGRYLFINDKAN--TRVARIRLDVMKCDKITELPNVQGIHGLRPQKIPRTGYVFCNGEFRIPLPNDGKDLDDPKKYRSL 216 (635)
T ss_pred cceeEEEEecCCC--cceEEEECccEeeceeEeCCCccCccccCccccCCccEEEeCcccccccCCCCCEeecccceeEE
Confidence 4466788888775 489999988 21121 2125555566555 33333 3333333345
Q ss_pred EEEeecCCCc--eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeE
Q psy6572 633 IEVAKLDGRF--RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNEL 710 (1416)
Q Consensus 633 I~v~~ldG~~--~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rL 710 (1416)
|.+++.+... .++++ -.+|+.++++|..+++|+|..........+.|+-..+..++. +.++...++-...+.+
T Consensus 217 vSvID~etmeV~~qV~V---dgnpd~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d~~vv--fni~~iea~vkdGK~~ 291 (635)
T PRK02888 217 FTAVDAETMEVAWQVMV---DGNLDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERDWVVV--FNIARIEEAVKAGKFK 291 (635)
T ss_pred EEEEECccceEEEEEEe---CCCcccceECCCCCEEEEeccCcccCcceeeeccccCceEEE--EchHHHHHhhhCCCEE
Confidence 5555544321 22233 238999999999999999964321111222222222222222 1111111221123344
Q ss_pred EEecCCCCeEEEEeCCC-----CceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCCCeeEEecccC
Q psy6572 711 FWGDAHEDYIAVSDLNG-----ENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEMKSIERCDKYT 774 (1416)
Q Consensus 711 YWtD~~~~~I~~~~ldG-----~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~I~~~nk~t 774 (1416)
|+. .++|..++... ......+.. -..|.+|++ .+.+||.+...+..|..++..+
T Consensus 292 ~V~---gn~V~VID~~t~~~~~~~v~~yIPV-------GKsPHGV~vSPDGkylyVanklS~tVSVIDv~k 352 (635)
T PRK02888 292 TIG---GSKVPVVDGRKAANAGSALTRYVPV-------PKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRK 352 (635)
T ss_pred EEC---CCEEEEEECCccccCCcceEEEEEC-------CCCccceEECCCCCEEEEeCCCCCcEEEEEChh
Confidence 442 34566665332 112221211 246777777 5789999999988887776433
No 111
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture.; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=94.87 E-value=5.3 Score=48.20 Aligned_cols=121 Identities=22% Similarity=0.326 Sum_probs=70.3
Q ss_pred CCCCCcceeee--cCCcceEEEeeCCCCceEEEEec--CCCCC--EEEeec--CCCCCeeEEeecCCCeEEEecCCCCeE
Q psy6572 649 KGLQEPRGIAL--NPAYGYMYWTDWGQNAHIGKAKM--DGSNP--KVIISK--NLSWPNALTISYETNELFWGDAHEDYI 720 (1416)
Q Consensus 649 ~~l~~P~gIav--Dp~~g~LYWtD~g~~~~I~ra~m--DGs~r--~vlv~~--~l~~P~gLaiD~~~~rLYWtD~~~~~I 720 (1416)
..+..|.||++ ++..|.+|..-.+....++...| +|..+ -.+|.+ .-..|.|+++|-..++||+.+.. .-|
T Consensus 153 ~~~~e~yGlcly~~~~~g~~ya~v~~k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~-~GI 231 (381)
T PF02333_consen 153 TDLSEPYGLCLYRSPSTGALYAFVNGKDGRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEED-VGI 231 (381)
T ss_dssp -SSSSEEEEEEEE-TTT--EEEEEEETTSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETT-TEE
T ss_pred cccccceeeEEeecCCCCcEEEEEecCCceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCc-cEE
Confidence 46677999998 45667666554333344554444 44432 233332 23578999999999999999865 578
Q ss_pred EEEeCC---CCceEEEEeccCCCCcccccceeEEEe-----cCcEEEeecCCCeeEEecc
Q psy6572 721 AVSDLN---GENIKIIVSRRMDPTINLHHVFALAVF-----EDHLFWTDWEMKSIERCDK 772 (1416)
Q Consensus 721 ~~~~ld---G~~r~~v~~~~~~p~~~l~~P~~lav~-----~d~LYwtD~~~~~I~~~nk 772 (1416)
++...+ +..++.|.... ........-||+++ .+||+.++.+.++....+.
T Consensus 232 W~y~Aep~~~~~~~~v~~~~--g~~l~aDvEGlaly~~~~g~gYLivSsQG~~sf~Vy~r 289 (381)
T PF02333_consen 232 WRYDAEPEGGNDRTLVASAD--GDGLVADVEGLALYYGSDGKGYLIVSSQGDNSFAVYDR 289 (381)
T ss_dssp EEEESSCCC-S--EEEEEBS--SSSB-S-EEEEEEEE-CCC-EEEEEEEGGGTEEEEEES
T ss_pred EEEecCCCCCCcceeeeccc--ccccccCccceEEEecCCCCeEEEEEcCCCCeEEEEec
Confidence 888864 44455553321 11124567799986 3589999988877555543
No 112
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes (2.3.2.5 from EC) catalyses the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=94.76 E-value=3.1 Score=47.55 Aligned_cols=161 Identities=19% Similarity=0.168 Sum_probs=88.5
Q ss_pred cceeeeeecCCCeEEEeeccCCCccEEEEecCCC-CeEEee---cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCce
Q psy6572 568 NAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQ-PELLFP---ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFR 643 (1416)
Q Consensus 568 ~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~-~~~l~~---l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~ 643 (1416)
...||.|. ..+.||-+......+.|+++.+.+. ...... ---.+||++ .+++||-.-+..++..+.+.+.-..
T Consensus 46 FTQGL~~~-~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~--~~d~l~qLTWk~~~~f~yd~~tl~~ 122 (264)
T PF05096_consen 46 FTQGLEFL-DDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGITI--LGDKLYQLTWKEGTGFVYDPNTLKK 122 (264)
T ss_dssp EEEEEEEE-ETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEEEE--ETTEEEEEESSSSEEEEEETTTTEE
T ss_pred cCccEEec-CCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeEEE--ECCEEEEEEecCCeEEEEccccceE
Confidence 34678774 3578888877665678888888843 222222 223578887 4788888888888888777764322
Q ss_pred EEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCC-eEEE
Q psy6572 644 KVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHED-YIAV 722 (1416)
Q Consensus 644 ~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~-~I~~ 722 (1416)
..-+. -.....||+-| ...||.+|. +.+||+.|...- .+.+
T Consensus 123 ~~~~~-y~~EGWGLt~d--g~~Li~SDG-----------------------------------S~~L~~~dP~~f~~~~~ 164 (264)
T PF05096_consen 123 IGTFP-YPGEGWGLTSD--GKRLIMSDG-----------------------------------SSRLYFLDPETFKEVRT 164 (264)
T ss_dssp EEEEE--SSS--EEEEC--SSCEEEE-S-----------------------------------SSEEEEE-TTT-SEEEE
T ss_pred EEEEe-cCCcceEEEcC--CCEEEEECC-----------------------------------ccceEEECCcccceEEE
Confidence 11111 11345566644 234555551 123333333211 1111
Q ss_pred EeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEecccCCCceEEE
Q psy6572 723 SDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCTSV 781 (1416)
Q Consensus 723 ~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~~l 781 (1416)
+. |.... ..+..-.-|.+.+++||---|.+..|.+++..+|.....+
T Consensus 165 i~--------V~~~g----~pv~~LNELE~i~G~IyANVW~td~I~~Idp~tG~V~~~i 211 (264)
T PF05096_consen 165 IQ--------VTDNG----RPVSNLNELEYINGKIYANVWQTDRIVRIDPETGKVVGWI 211 (264)
T ss_dssp EE---------EETT----EE---EEEEEEETTEEEEEETTSSEEEEEETTT-BEEEEE
T ss_pred EE--------EEECC----EECCCcEeEEEEcCEEEEEeCCCCeEEEEeCCCCeEEEEE
Confidence 11 11100 1133444577789999999999999999999999866655
No 113
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=94.70 E-value=2.5 Score=50.43 Aligned_cols=116 Identities=16% Similarity=0.165 Sum_probs=73.4
Q ss_pred CCceEEEEccCCcEEEeeCCC------CeEEEeecCCCceEEE-EcCC-------------CCCcceeeecCCcceEEEe
Q psy6572 610 SPDGLTVDWVGRNLYWCDKGL------DTIEVAKLDGRFRKVL-INKG-------------LQEPRGIALNPAYGYMYWT 669 (1416)
Q Consensus 610 ~p~gLAvD~~~~~LYwtD~~~------~~I~v~~ldG~~~~vL-i~~~-------------l~~P~gIavDp~~g~LYWt 669 (1416)
.++||++ ...+.+||++.+. .+|.+++++|...+.+ +... -.-..|||+.|..+.||.+
T Consensus 86 D~Egi~~-~~~g~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~ 164 (326)
T PF13449_consen 86 DPEGIAV-PPDGSFWISSEGGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAA 164 (326)
T ss_pred ChhHeEE-ecCCCEEEEeCCccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEE
Confidence 6789999 7899999999999 9999999999885554 2221 1235789999997778876
Q ss_pred eCCC-------C-------ceEEEEecC--CCC-CEEEee-c------CCCCCeeEEeecCCCeEEEecCC-------CC
Q psy6572 670 DWGQ-------N-------AHIGKAKMD--GSN-PKVIIS-K------NLSWPNALTISYETNELFWGDAH-------ED 718 (1416)
Q Consensus 670 D~g~-------~-------~~I~ra~mD--Gs~-r~vlv~-~------~l~~P~gLaiD~~~~rLYWtD~~-------~~ 718 (1416)
--+. . .+|.+.+.. |.. .+.++. . ....+..|+.- .+++||+.+.. ..
T Consensus 165 ~E~~l~~d~~~~~~~~~~~~ri~~~d~~~~~~~~~~~~y~ld~~~~~~~~~~isd~~al-~d~~lLvLER~~~~~~~~~~ 243 (326)
T PF13449_consen 165 MESPLKQDGPRANPDNGSPLRILRYDPKTPGEPVAEYAYPLDPPPTAPGDNGISDIAAL-PDGRLLVLERDFSPGTGNYK 243 (326)
T ss_pred ECccccCCCcccccccCceEEEEEecCCCCCccceEEEEeCCccccccCCCCceeEEEE-CCCcEEEEEccCCCCccceE
Confidence 4221 1 245555544 211 122222 1 22334444443 46778888765 23
Q ss_pred eEEEEeCCC
Q psy6572 719 YIAVSDLNG 727 (1416)
Q Consensus 719 ~I~~~~ldG 727 (1416)
+|+.+++.+
T Consensus 244 ri~~v~l~~ 252 (326)
T PF13449_consen 244 RIYRVDLSD 252 (326)
T ss_pred EEEEEEccc
Confidence 677777654
No 114
>KOG0315|consensus
Probab=94.70 E-value=4.7 Score=45.02 Aligned_cols=178 Identities=15% Similarity=0.092 Sum_probs=105.6
Q ss_pred CCeEEEEecE--EEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee----cC
Q psy6572 539 PPNLLFTNKY--YIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP----AT 609 (1416)
Q Consensus 539 ~~~li~s~~~--~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~----l~ 609 (1416)
.|.||++.++ .||.-...... ..+.+.-.++.+|.+-+..+.|-.+-. ..|+.++++ ++...+.+ ..
T Consensus 9 ~~viLvsA~YDhTIRfWqa~tG~C~rTiqh~dsqVNrLeiTpdk~~LAaa~~----qhvRlyD~~S~np~Pv~t~e~h~k 84 (311)
T KOG0315|consen 9 DPVILVSAGYDHTIRFWQALTGICSRTIQHPDSQVNRLEITPDKKDLAAAGN----QHVRLYDLNSNNPNPVATFEGHTK 84 (311)
T ss_pred CceEEEeccCcceeeeeehhcCeEEEEEecCccceeeEEEcCCcchhhhccC----CeeEEEEccCCCCCceeEEeccCC
Confidence 4566666543 46655544443 344555566777887776655543322 267777777 33333333 46
Q ss_pred CCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCC-CCE
Q psy6572 610 SPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGS-NPK 688 (1416)
Q Consensus 610 ~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs-~r~ 688 (1416)
++.++.|-..++.|| |.+..+++.+-+|......-+.... ...+.|+++|..+.|+..|... .|..-++--. -..
T Consensus 85 NVtaVgF~~dgrWMy-TgseDgt~kIWdlR~~~~qR~~~~~-spVn~vvlhpnQteLis~dqsg--~irvWDl~~~~c~~ 160 (311)
T KOG0315|consen 85 NVTAVGFQCDGRWMY-TGSEDGTVKIWDLRSLSCQRNYQHN-SPVNTVVLHPNQTELISGDQSG--NIRVWDLGENSCTH 160 (311)
T ss_pred ceEEEEEeecCeEEE-ecCCCceEEEEeccCcccchhccCC-CCcceEEecCCcceEEeecCCC--cEEEEEccCCcccc
Confidence 778888887777777 5566677766666543322222222 3457899999999999999543 4655555433 234
Q ss_pred EEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC
Q psy6572 689 VIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 689 vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l 725 (1416)
.++.+.......|+|++...+|--+ -..+..+.-++
T Consensus 161 ~liPe~~~~i~sl~v~~dgsml~a~-nnkG~cyvW~l 196 (311)
T KOG0315|consen 161 ELIPEDDTSIQSLTVMPDGSMLAAA-NNKGNCYVWRL 196 (311)
T ss_pred ccCCCCCcceeeEEEcCCCcEEEEe-cCCccEEEEEc
Confidence 4454555566789998755444433 33444444443
No 115
>KOG4260|consensus
Probab=94.53 E-value=0.036 Score=61.13 Aligned_cols=48 Identities=42% Similarity=0.940 Sum_probs=40.2
Q ss_pred eeCCCceecCCCCCccccCCcCC--CCCcc--ceeeecCCeeeecCCCCcEEecC
Q psy6572 478 ACRKGYQVHPEDKHLCVDTNECL--DRPCS--HYCRNTLGSYSCSCAPGYALLSD 528 (1416)
Q Consensus 478 ~C~~Gy~L~p~d~~tC~didEC~--~~~Cs--q~C~nt~gsy~C~C~~Gy~L~~d 528 (1416)
.|..||.|. ...|+|||||+ +.+|. |+|+|+.|||+|.+.+||.-..|
T Consensus 221 kCkkGW~ld---e~gCvDvnEC~~ep~~c~~~qfCvNteGSf~C~dk~Gy~~g~d 272 (350)
T KOG4260|consen 221 KCKKGWKLD---EEGCVDVNECQNEPAPCKAHQFCVNTEGSFKCEDKEGYKKGVD 272 (350)
T ss_pred hhcccceec---ccccccHHHHhcCCCCCChhheeecCCCceEecccccccCChH
Confidence 488999986 34699999998 67775 69999999999999999986443
No 116
>KOG2397|consensus
Probab=94.52 E-value=0.022 Score=68.22 Aligned_cols=67 Identities=40% Similarity=0.616 Sum_probs=58.8
Q ss_pred ccccCCC-eeecCCCCCCCCCCCCCCCcccccccCCCccCCcccccCC----CeecccccccCCCCCCCCCCCCCC
Q psy6572 1175 MFTCANH-QCISLNWRCDGEPDCSDNSDEIESICAGLACEPNRFKCKN----NKCIHRYAMCDGIDNCGDNSDESH 1245 (1416)
Q Consensus 1175 ~f~C~~~-~Ci~~~~~CDg~~DC~dgsDE~~~~C~~~~C~~~~f~C~~----~~Ci~~~~~Cdg~~dC~d~sDE~~ 1245 (1416)
.|.|.+| .=|+.+++=|..=||.|||||. ...+|+...|.|.| ..=|+.+.|-||+-||-|||||..
T Consensus 44 ~~~CLdgs~~i~f~qlNDd~CDC~DGsDEP----GtsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~ 115 (480)
T KOG2397|consen 44 MFKCLDGSKTISFSQLNDDSCDCLDGSDEP----GTSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYL 115 (480)
T ss_pred ceeeccCCcccCHHHhccccccCCCCCCCC----ccccCCCCceeeeecCCCceeeechhccCcccccccCCCCcc
Confidence 6888876 5788999999999999999996 24688889999998 478999999999999999999964
No 117
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=94.51 E-value=7.9 Score=43.17 Aligned_cols=61 Identities=16% Similarity=0.187 Sum_probs=40.8
Q ss_pred CCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCccccccee-EEEecCcEEEeecCCCeeEEecccCCCc
Q psy6572 707 TNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFA-LAVFEDHLFWTDWEMKSIERCDKYTGKN 777 (1416)
Q Consensus 707 ~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~-lav~~d~LYwtD~~~~~I~~~nk~tG~~ 777 (1416)
+++||.+..... +..+++....+. .... +..+.+ +.+.++.||.++ ..+.|+.+++.+|+.
T Consensus 173 ~~~v~~~~~~g~-~~~~d~~tg~~~--w~~~------~~~~~~~~~~~~~~l~~~~-~~~~l~~~d~~tG~~ 234 (238)
T PF13360_consen 173 DGRVYVSSGDGR-VVAVDLATGEKL--WSKP------ISGIYSLPSVDGGTLYVTS-SDGRLYALDLKTGKV 234 (238)
T ss_dssp TTEEEEECCTSS-EEEEETTTTEEE--EEEC------SS-ECECEECCCTEEEEEE-TTTEEEEEETTTTEE
T ss_pred CCEEEEEcCCCe-EEEEECCCCCEE--EEec------CCCccCCceeeCCEEEEEe-CCCEEEEEECCCCCE
Confidence 568998876554 455576555433 2322 444555 566788999998 788999999888864
No 118
>KOG0273|consensus
Probab=94.09 E-value=7.6 Score=46.90 Aligned_cols=136 Identities=14% Similarity=0.067 Sum_probs=80.4
Q ss_pred cEEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeecCCCceEEEEccCCcEE
Q psy6572 547 KYYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 547 ~~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l~~p~gLAvD~~~~~LY 624 (1416)
..++|..+.+|.. ..+...-..+.+|-..-.+. |......++ ++..++.- +..+..+.+....+|.|||++..-|
T Consensus 256 ~G~~riw~~~G~l~~tl~~HkgPI~slKWnk~G~--yilS~~vD~-ttilwd~~~g~~~q~f~~~s~~~lDVdW~~~~~F 332 (524)
T KOG0273|consen 256 DGEARIWNKDGNLISTLGQHKGPIFSLKWNKKGT--YILSGGVDG-TTILWDAHTGTVKQQFEFHSAPALDVDWQSNDEF 332 (524)
T ss_pred CcEEEEEecCchhhhhhhccCCceEEEEEcCCCC--EEEeccCCc-cEEEEeccCceEEEeeeeccCCccceEEecCceE
Confidence 4457777777776 22222333344554433333 333333222 33333333 3444445566777999999999999
Q ss_pred EeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCC
Q psy6572 625 WCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGS 685 (1416)
Q Consensus 625 wtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs 685 (1416)
.+-...+.|.|..+++.....-+..--....+|-.+|....|-=+.....-+|+...-++.
T Consensus 333 ~ts~td~~i~V~kv~~~~P~~t~~GH~g~V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~~ 393 (524)
T KOG0273|consen 333 ATSSTDGCIHVCKVGEDRPVKTFIGHHGEVNALKWNPTGSLLASCSDDGTLKIWSMGQSNS 393 (524)
T ss_pred eecCCCceEEEEEecCCCcceeeecccCceEEEEECCCCceEEEecCCCeeEeeecCCCcc
Confidence 9999999999999998764333324555678899998854443332222346666544443
No 119
>KOG1219|consensus
Probab=93.92 E-value=0.062 Score=73.13 Aligned_cols=63 Identities=35% Similarity=0.910 Sum_probs=52.1
Q ss_pred ccc--cceecC-CceEEeeCCCceecCCCCCccc-cCCcCCCCCccc--eeeecCCeeeecCCCCcEEecCCCceEec
Q psy6572 464 CAH--ECIDLK-IGYKCACRKGYQVHPEDKHLCV-DTNECLDRPCSH--YCRNTLGSYSCSCAPGYALLSDKHGCKAT 535 (1416)
Q Consensus 464 Cs~--~C~nt~-~gy~C~C~~Gy~L~p~d~~tC~-didEC~~~~Csq--~C~nt~gsy~C~C~~Gy~L~~dg~sC~a~ 535 (1416)
|+| .|..+| +||+|.|++-| .|+.|+ ++..|...||.. .|+...++|.|.|+.||+ |.+|.+.
T Consensus 3872 CqhgG~C~~~~~ggy~CkCpsqy-----sG~~CEi~~epC~snPC~~GgtCip~~n~f~CnC~~gyT----G~~Ce~~ 3940 (4289)
T KOG1219|consen 3872 CQHGGTCISQPKGGYKCKCPSQY-----SGNHCEIDLEPCASNPCLTGGTCIPFYNGFLCNCPNGYT----GKRCEAR 3940 (4289)
T ss_pred ccCCCEecCCCCCceEEeCcccc-----cCcccccccccccCCCCCCCCEEEecCCCeeEeCCCCcc----Cceeecc
Confidence 665 688777 88999999998 456786 678898888876 899999999999999997 6677664
No 120
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=93.88 E-value=5.7 Score=47.83 Aligned_cols=122 Identities=9% Similarity=0.059 Sum_probs=86.2
Q ss_pred cEEEEecC-CCCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe
Q psy6572 592 SIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT 669 (1416)
Q Consensus 592 ~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt 669 (1416)
.|..+... +..+.+.. +.+.++|+|++.+..+.+++ .+-.|.+.+++....+++-.+...-..+++++|..++|-++
T Consensus 383 ~l~iyd~~~~e~kr~e~~lg~I~av~vs~dGK~~vvaN-dr~el~vididngnv~~idkS~~~lItdf~~~~nsr~iAYa 461 (668)
T COG4946 383 KLGIYDKDGGEVKRIEKDLGNIEAVKVSPDGKKVVVAN-DRFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSRWIAYA 461 (668)
T ss_pred eEEEEecCCceEEEeeCCccceEEEEEcCCCcEEEEEc-CceEEEEEEecCCCeeEecccccceeEEEEEcCCceeEEEe
Confidence 56666666 44455555 89999999998777676654 34589999999888777776666668889999988877555
Q ss_pred eCCC--CceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecC
Q psy6572 670 DWGQ--NAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDA 715 (1416)
Q Consensus 670 D~g~--~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~ 715 (1416)
=... ...|..++|+|....-+ ++...+-.+-|+|+..+.||+...
T Consensus 462 fP~gy~tq~Iklydm~~~Kiy~v-TT~ta~DfsPaFD~d~ryLYfLs~ 508 (668)
T COG4946 462 FPEGYYTQSIKLYDMDGGKIYDV-TTPTAYDFSPAFDPDGRYLYFLSA 508 (668)
T ss_pred cCcceeeeeEEEEecCCCeEEEe-cCCcccccCcccCCCCcEEEEEec
Confidence 3211 24788999998644333 334444556789999999999754
No 121
>PRK02888 nitrous-oxide reductase; Validated
Probab=93.85 E-value=6.2 Score=50.20 Aligned_cols=209 Identities=11% Similarity=0.029 Sum_probs=111.8
Q ss_pred EEEEEEecCCcc---eEEecccccceeeeee--cCCCeEEEeec-----cCCC----------ccEEEEecCCCCeEEee
Q psy6572 548 YYIREVTQAGVM---TIRIHNQTNAVGLDFD--WVDNCLYWSDV-----TMHG----------SSIRRSCNNSQPELLFP 607 (1416)
Q Consensus 548 ~~I~~i~l~g~~---~~~~~~l~~~~~l~~D--~~~~~LYwtD~-----~~~~----------~~I~r~~l~s~~~~l~~ 607 (1416)
..|-||+++... .+.+++...+.|+.+. +.++.||-... ..++ ..+..++.. ..+++..
T Consensus 152 ~Rvari~l~~~~~~~i~~iPn~~~~Hg~~~~~~p~t~yv~~~~e~~~PlpnDGk~l~~~~ey~~~vSvID~e-tmeV~~q 230 (635)
T PRK02888 152 TRVARIRLDVMKCDKITELPNVQGIHGLRPQKIPRTGYVFCNGEFRIPLPNDGKDLDDPKKYRSLFTAVDAE-TMEVAWQ 230 (635)
T ss_pred cceEEEECccEeeceeEeCCCccCccccCccccCCccEEEeCcccccccCCCCCEeecccceeEEEEEEECc-cceEEEE
Confidence 356777776654 2345566666777776 34444443211 0011 122333222 2233332
Q ss_pred ---cCCCceEEEEccCCcEEEeeCC---CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEe
Q psy6572 608 ---ATSPDGLTVDWVGRNLYWCDKG---LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAK 681 (1416)
Q Consensus 608 ---l~~p~gLAvD~~~~~LYwtD~~---~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~ 681 (1416)
-.+|..+++++.++.+|+|.+. ...+..++..-....+++ .+++..++-+..+++|+.+ .+|.++
T Consensus 231 V~Vdgnpd~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d~~vvf----ni~~iea~vkdGK~~~V~g----n~V~VI- 301 (635)
T PRK02888 231 VMVDGNLDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERDWVVVF----NIARIEEAVKAGKFKTIGG----SKVPVV- 301 (635)
T ss_pred EEeCCCcccceECCCCCEEEEeccCcccCcceeeeccccCceEEEE----chHHHHHhhhCCCEEEECC----CEEEEE-
Confidence 5689999999999999998642 234444433222222221 2222223333444555531 234443
Q ss_pred cCCCC-----CEEEee-cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCc---------eEEEEeccCCCCccccc
Q psy6572 682 MDGSN-----PKVIIS-KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGEN---------IKIIVSRRMDPTINLHH 746 (1416)
Q Consensus 682 mDGs~-----r~vlv~-~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~---------r~~v~~~~~~p~~~l~~ 746 (1416)
|+.. +.++.. ..-..|.||++.+...+||.+....+.|..+++.-.. +.+|+..- . .-..
T Consensus 302 -D~~t~~~~~~~v~~yIPVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~vvaev---e-vGlG 376 (635)
T PRK02888 302 -DGRKAANAGSALTRYVPVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAVVAEP---E-LGLG 376 (635)
T ss_pred -ECCccccCCcceEEEEECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceEEEee---c-cCCC
Confidence 3443 333322 2346799999999999999999989999999875422 22233221 0 1235
Q ss_pred ceeEEEec-CcEEEeecCCCeeEEec
Q psy6572 747 VFALAVFE-DHLFWTDWEMKSIERCD 771 (1416)
Q Consensus 747 P~~lav~~-d~LYwtD~~~~~I~~~n 771 (1416)
|.-.++.+ ++.|.|-.-...|.+-|
T Consensus 377 PLHTaFDg~G~aytslf~dsqv~kwn 402 (635)
T PRK02888 377 PLHTAFDGRGNAYTTLFLDSQIVKWN 402 (635)
T ss_pred cceEEECCCCCEEEeEeecceeEEEe
Confidence 66666643 46777766555555544
No 122
>KOG0285|consensus
Probab=93.32 E-value=7.8 Score=45.25 Aligned_cols=225 Identities=16% Similarity=0.093 Sum_probs=126.6
Q ss_pred ceEecCCCCCeEEE-EecEEEEEEecCCcc-eEEec-ccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CC-CeEE
Q psy6572 531 GCKATSDVPPNLLF-TNKYYIREVTQAGVM-TIRIH-NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQ-PELL 605 (1416)
Q Consensus 531 sC~a~~~~~~~li~-s~~~~I~~i~l~g~~-~~~~~-~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~-~~~l 605 (1416)
.|+++++...++.- +....|...++.... .+.+. ....+.+|+|..+.-.||-.-.. +.|.-.+|. .+ ++-.
T Consensus 155 r~vavdP~n~wf~tgs~DrtikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~ged---k~VKCwDLe~nkvIR~Y 231 (460)
T KOG0285|consen 155 RSVAVDPGNEWFATGSADRTIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAGED---KQVKCWDLEYNKVIRHY 231 (460)
T ss_pred EEEeeCCCceeEEecCCCceeEEEEcccCeEEEeecchhheeeeeeecccCceEEEecCC---CeeEEEechhhhhHHHh
Confidence 46777654433333 223456666666555 33333 45567889987766666654333 367766666 21 1222
Q ss_pred ee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCc-ceeeecCCcceEEEeeCCCCceEEEEecC
Q psy6572 606 FP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEP-RGIALNPAYGYMYWTDWGQNAHIGKAKMD 683 (1416)
Q Consensus 606 ~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P-~gIavDp~~g~LYWtD~g~~~~I~ra~mD 683 (1416)
.. +..+..|++.|.-+.| +|-.....|.|-+|..+....++ .+-..| ..+...|.. |.|....||
T Consensus 232 hGHlS~V~~L~lhPTldvl-~t~grDst~RvWDiRtr~~V~~l-~GH~~~V~~V~~~~~d-----------pqvit~S~D 298 (460)
T KOG0285|consen 232 HGHLSGVYCLDLHPTLDVL-VTGGRDSTIRVWDIRTRASVHVL-SGHTNPVASVMCQPTD-----------PQVITGSHD 298 (460)
T ss_pred ccccceeEEEeccccceeE-EecCCcceEEEeeecccceEEEe-cCCCCcceeEEeecCC-----------CceEEecCC
Confidence 22 8888899988755554 46666667888777766543333 222333 233344433 345555555
Q ss_pred CCCCE--------EE-eecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC-CCCceEEEEeccCCCCcccccceeEEEe
Q psy6572 684 GSNPK--------VI-ISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL-NGENIKIIVSRRMDPTINLHHVFALAVF 753 (1416)
Q Consensus 684 Gs~r~--------vl-v~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l-dG~~r~~v~~~~~~p~~~l~~P~~lav~ 753 (1416)
++.+. .+ +...-...++|++.+ ...+| +.+..+.|....+ .|...+. +++. -.....|++.
T Consensus 299 ~tvrlWDl~agkt~~tlt~hkksvral~lhP-~e~~f-ASas~dnik~w~~p~g~f~~n-lsgh------~~iintl~~n 369 (460)
T KOG0285|consen 299 STVRLWDLRAGKTMITLTHHKKSVRALCLHP-KENLF-ASASPDNIKQWKLPEGEFLQN-LSGH------NAIINTLSVN 369 (460)
T ss_pred ceEEEeeeccCceeEeeecccceeeEEecCC-chhhh-hccCCccceeccCCccchhhc-cccc------cceeeeeeec
Confidence 54321 11 111223346788865 23333 4555667777764 5544443 2322 2345678888
Q ss_pred cCcEEEeecCCCeeEEecccCCCceEE
Q psy6572 754 EDHLFWTDWEMKSIERCDKYTGKNCTS 780 (1416)
Q Consensus 754 ~d~LYwtD~~~~~I~~~nk~tG~~~~~ 780 (1416)
++-+|++-..++.|..-+-.+|-+.+.
T Consensus 370 sD~v~~~G~dng~~~fwdwksg~nyQ~ 396 (460)
T KOG0285|consen 370 SDGVLVSGGDNGSIMFWDWKSGHNYQR 396 (460)
T ss_pred cCceEEEcCCceEEEEEecCcCccccc
Confidence 999999998888888777666655443
No 123
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=92.93 E-value=11 Score=44.75 Aligned_cols=55 Identities=25% Similarity=0.238 Sum_probs=36.9
Q ss_pred EEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCC---------CCCcce---eeecCCcceEEEee
Q psy6572 615 TVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKG---------LQEPRG---IALNPAYGYMYWTD 670 (1416)
Q Consensus 615 AvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~---------l~~P~g---IavDp~~g~LYWtD 670 (1416)
++.-.++.+||+.. .+.|+.+++.|...+.+..-. -=+|-| +|+++..++||+.-
T Consensus 190 ~~~~~~~~~~F~Sy-~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvLM 256 (342)
T PF06433_consen 190 AYSRDGGRLYFVSY-EGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVLM 256 (342)
T ss_dssp EEETTTTEEEEEBT-TSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEEE
T ss_pred ceECCCCeEEEEec-CCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEEe
Confidence 34445678888654 589999999988755443211 112433 89999999999864
No 124
>KOG4289|consensus
Probab=92.38 E-value=1.4 Score=58.67 Aligned_cols=83 Identities=27% Similarity=0.727 Sum_probs=57.8
Q ss_pred CCCCCCCCCCc--ccccccCCCCCCcccc--cceecCCceEEeeCCCceecCCCCCcccc---CCcCCCCCccc--eeee
Q psy6572 440 DCGDNSDEFSC--FVNECNVSHGGQLCAH--ECIDLKIGYKCACRKGYQVHPEDKHLCVD---TNECLDRPCSH--YCRN 510 (1416)
Q Consensus 440 dC~dgsDe~~C--~i~eC~~~~~~~~Cs~--~C~nt~~gy~C~C~~Gy~L~p~d~~tC~d---idEC~~~~Csq--~C~n 510 (1416)
.|+.|-....| .|++|-..+ |.. .|....+||+|.|++||. |+.|+- ...|.++.|-. +|+|
T Consensus 1225 rCPpGFTgd~CeTeiDlCYs~p----C~nng~C~srEggYtCeCrpg~t-----GehCEvs~~agrCvpGvC~nggtC~~ 1295 (2531)
T KOG4289|consen 1225 RCPPGFTGDYCETEIDLCYSGP----CGNNGRCRSREGGYTCECRPGFT-----GEHCEVSARAGRCVPGVCKNGGTCVN 1295 (2531)
T ss_pred eCCCCCCcccccchhHhhhcCC----CCCCCceEEecCceeEEecCCcc-----ccceeeecccCccccceecCCCEEee
Confidence 56666555566 578885443 554 688999999999999994 445653 24466777765 8998
Q ss_pred cC-CeeeecCCCCcEEecCCCceEe
Q psy6572 511 TL-GSYSCSCAPGYALLSDKHGCKA 534 (1416)
Q Consensus 511 t~-gsy~C~C~~Gy~L~~dg~sC~a 534 (1416)
.. |+|.|.|+.| . -.+..|..
T Consensus 1296 ~~nggf~c~Cp~g-e--~e~prC~v 1317 (2531)
T KOG4289|consen 1296 LLNGGFCCHCPYG-E--FEDPRCEV 1317 (2531)
T ss_pred cCCCceeccCCCc-c--cCCCceEE
Confidence 65 6999999998 2 23345653
No 125
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=92.38 E-value=4.2 Score=48.54 Aligned_cols=61 Identities=23% Similarity=0.394 Sum_probs=44.0
Q ss_pred CcceeeecCCcceEEEeeCCC-----CceEEEEecCCCCCEEE-eecCC--------CC-----CeeEEeecCCCeEEEe
Q psy6572 653 EPRGIALNPAYGYMYWTDWGQ-----NAHIGKAKMDGSNPKVI-ISKNL--------SW-----PNALTISYETNELFWG 713 (1416)
Q Consensus 653 ~P~gIavDp~~g~LYWtD~g~-----~~~I~ra~mDGs~r~vl-v~~~l--------~~-----P~gLaiD~~~~rLYWt 713 (1416)
.|.||++ +..|.+||++-+. .++|.+++++|...+.+ +...+ .+ .-|||+.+...+||.+
T Consensus 86 D~Egi~~-~~~g~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~ 164 (326)
T PF13449_consen 86 DPEGIAV-PPDGSFWISSEGGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAA 164 (326)
T ss_pred ChhHeEE-ecCCCEEEEeCCccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEE
Confidence 5779999 7889999998654 27999999999886555 32222 11 2389998866668776
Q ss_pred c
Q psy6572 714 D 714 (1416)
Q Consensus 714 D 714 (1416)
-
T Consensus 165 ~ 165 (326)
T PF13449_consen 165 M 165 (326)
T ss_pred E
Confidence 3
No 126
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=92.38 E-value=0.091 Score=41.11 Aligned_cols=24 Identities=33% Similarity=1.005 Sum_probs=16.9
Q ss_pred cceecCCceEEeeCCCceecCCCCCcc
Q psy6572 467 ECIDLKIGYKCACRKGYQVHPEDKHLC 493 (1416)
Q Consensus 467 ~C~nt~~gy~C~C~~Gy~L~p~d~~tC 493 (1416)
.|++++++|+|.|++||. .||..|
T Consensus 13 ~C~~~~~~~~C~C~~Gy~---GdG~~C 36 (36)
T PF12947_consen 13 TCTNTGGSYTCTCKPGYE---GDGFFC 36 (36)
T ss_dssp EEEE-TTSEEEEE-CEEE---CCSTCE
T ss_pred EeecCCCCEEeECCCCCc---cCCcCC
Confidence 688888899999999986 455543
No 127
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=92.06 E-value=5.1 Score=46.62 Aligned_cols=50 Identities=14% Similarity=0.040 Sum_probs=39.6
Q ss_pred cccceeEEEecCcEEEeecCCCeeEEecccCCCceEEEEeCCCCCCeeeeee
Q psy6572 744 LHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVY 795 (1416)
Q Consensus 744 l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~ 795 (1416)
+..|.+.-.++++||+.|+.++.|.+++..+|....+. .....|.||...
T Consensus 202 LsmPhSPRWhdgrLwvldsgtGev~~vD~~~G~~e~Va--~vpG~~rGL~f~ 251 (335)
T TIGR03032 202 LSMPHSPRWYQGKLWLLNSGRGELGYVDPQAGKFQPVA--FLPGFTRGLAFA 251 (335)
T ss_pred ccCCcCCcEeCCeEEEEECCCCEEEEEcCCCCcEEEEE--ECCCCCccccee
Confidence 67788888999999999999999999998778766554 345566666544
No 128
>KOG0291|consensus
Probab=91.76 E-value=39 Score=43.37 Aligned_cols=155 Identities=14% Similarity=0.093 Sum_probs=89.3
Q ss_pred CcEEecCCC----ceEecCCCCCeEEEE--ecEEEEEEecCCcc-eEE-ecccccceeeeeecCCCeEEEeeccCCCccE
Q psy6572 522 GYALLSDKH----GCKATSDVPPNLLFT--NKYYIREVTQAGVM-TIR-IHNQTNAVGLDFDWVDNCLYWSDVTMHGSSI 593 (1416)
Q Consensus 522 Gy~L~~dg~----sC~a~~~~~~~li~s--~~~~I~~i~l~g~~-~~~-~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I 593 (1416)
-|.|.-+++ ++.+..+ ...+|.+ ....|+..+..... .+. ......+.||.|...++.|+-+... +++
T Consensus 341 sYVlKQQgH~~~i~~l~YSp-Dgq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLD---GtV 416 (893)
T KOG0291|consen 341 SYVLKQQGHSDRITSLAYSP-DGQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLD---GTV 416 (893)
T ss_pred ceeeeccccccceeeEEECC-CCcEEEeccCCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEeecC---CeE
Confidence 344544443 3555433 2233332 23345544444433 222 2334456778887666555544333 367
Q ss_pred EEEecC--CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe
Q psy6572 594 RRSCNN--SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT 669 (1416)
Q Consensus 594 ~r~~l~--s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt 669 (1416)
+..++. .+.+++.. -.+-.-||||+.+..+.-.....=.|.|-++......-++++.-.-..+|+++|....|+=.
T Consensus 417 RAwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~ 496 (893)
T KOG0291|consen 417 RAWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASG 496 (893)
T ss_pred EeeeecccceeeeecCCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEec
Confidence 777766 34444443 33445799998777777666555678888876444333333333334679999999999988
Q ss_pred eCCCCceEEEE
Q psy6572 670 DWGQNAHIGKA 680 (1416)
Q Consensus 670 D~g~~~~I~ra 680 (1416)
.|...-+|+-+
T Consensus 497 SWDkTVRiW~i 507 (893)
T KOG0291|consen 497 SWDKTVRIWDI 507 (893)
T ss_pred cccceEEEEEe
Confidence 89876555544
No 129
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=91.76 E-value=17 Score=43.91 Aligned_cols=141 Identities=18% Similarity=0.131 Sum_probs=64.5
Q ss_pred cEEEEEEecCCcc-eEEeccc-ccceeeeeecCCCeEEEeeccCCCccEEEEecCC-CCeEEeecCC---CceEE-EEcc
Q psy6572 547 KYYIREVTQAGVM-TIRIHNQ-TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS-QPELLFPATS---PDGLT-VDWV 619 (1416)
Q Consensus 547 ~~~I~~i~l~g~~-~~~~~~l-~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s-~~~~l~~l~~---p~gLA-vD~~ 619 (1416)
...++.++|.... +.+..+. .+..|.-+-+.++.|||.... ..++++.|.+ +.++|..+.. ..|-. ++.
T Consensus 59 ~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~---~~l~~vdL~T~e~~~vy~~p~~~~g~gt~v~n~- 134 (386)
T PF14583_consen 59 NRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNG---RSLRRVDLDTLEERVVYEVPDDWKGYGTWVANS- 134 (386)
T ss_dssp S-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETT---TEEEEEETTT--EEEEEE--TTEEEEEEEEE-T-
T ss_pred CcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECC---CeEEEEECCcCcEEEEEECCcccccccceeeCC-
Confidence 4568888888776 4333322 334455556778888776533 3788888883 3334433110 01111 111
Q ss_pred CCcEEE------------eeC----------CCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcc--eEEEe--eCCC
Q psy6572 620 GRNLYW------------CDK----------GLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYG--YMYWT--DWGQ 673 (1416)
Q Consensus 620 ~~~LYw------------tD~----------~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g--~LYWt--D~g~ 673 (1416)
.+.+++ +++ -..+|.+++|.+..+++|+... .+-.-+-.-|... .+|=- .|..
T Consensus 135 d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~~~-~wlgH~~fsP~dp~li~fCHEGpw~~ 213 (386)
T PF14583_consen 135 DCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFEDT-DWLGHVQFSPTDPTLIMFCHEGPWDL 213 (386)
T ss_dssp TSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEEES-S-EEEEEEETTEEEEEEEEE-S-TTT
T ss_pred CccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEecC-ccccCcccCCCCCCEEEEeccCCcce
Confidence 111111 111 1247889999888888777532 2222222222222 23322 2444
Q ss_pred C-ceEEEEecCCCCCEEEee
Q psy6572 674 N-AHIGKAKMDGSNPKVIIS 692 (1416)
Q Consensus 674 ~-~~I~ra~mDGs~r~vlv~ 692 (1416)
. .+|+.+++||++.+.|..
T Consensus 214 Vd~RiW~i~~dg~~~~~v~~ 233 (386)
T PF14583_consen 214 VDQRIWTINTDGSNVKKVHR 233 (386)
T ss_dssp SS-SEEEEETTS---EESS-
T ss_pred eceEEEEEEcCCCcceeeec
Confidence 3 499999999999877654
No 130
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=91.44 E-value=43 Score=41.17 Aligned_cols=179 Identities=15% Similarity=0.173 Sum_probs=97.1
Q ss_pred EEEEEEecCCcceEEecccccc--eeeeeecCCCeEEEeeccCC--------CccEEEEecCC---CCeEEee-cCCCc-
Q psy6572 548 YYIREVTQAGVMTIRIHNQTNA--VGLDFDWVDNCLYWSDVTMH--------GSSIRRSCNNS---QPELLFP-ATSPD- 612 (1416)
Q Consensus 548 ~~I~~i~l~g~~~~~~~~l~~~--~~l~~D~~~~~LYwtD~~~~--------~~~I~r~~l~s---~~~~l~~-l~~p~- 612 (1416)
..|+.+++.+.. ++...+..+ .+|.+-..+..+|++..... ...|++..+.+ ..++|.. ...+.
T Consensus 150 ~~l~v~Dl~tg~-~l~d~i~~~~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt~~~~d~lvfe~~~~~~~ 228 (414)
T PF02897_consen 150 YTLRVFDLETGK-FLPDGIENPKFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGTPQSEDELVFEEPDEPFW 228 (414)
T ss_dssp EEEEEEETTTTE-EEEEEEEEEESEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS-GGG-EEEEC-TTCTTS
T ss_pred EEEEEEECCCCc-CcCCcccccccceEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCCChHhCeeEEeecCCCcE
Confidence 457777876654 222222222 23667666678888876542 34678888872 2235655 44444
Q ss_pred --eEEEEccCCcEEEeeCC--C-CeEEEeecCCC-----ceEEEEcCCCCCcceeeecCCcceEEE-eeCC-CCceEEEE
Q psy6572 613 --GLTVDWVGRNLYWCDKG--L-DTIEVAKLDGR-----FRKVLINKGLQEPRGIALNPAYGYMYW-TDWG-QNAHIGKA 680 (1416)
Q Consensus 613 --gLAvD~~~~~LYwtD~~--~-~~I~v~~ldG~-----~~~vLi~~~l~~P~gIavDp~~g~LYW-tD~g-~~~~I~ra 680 (1416)
++.+..-++.|+++-.. . ..|+++++... ..+.|. .....-.. .|+...+.||+ |+.+ .+.+|.++
T Consensus 229 ~~~~~~s~d~~~l~i~~~~~~~~s~v~~~d~~~~~~~~~~~~~l~-~~~~~~~~-~v~~~~~~~yi~Tn~~a~~~~l~~~ 306 (414)
T PF02897_consen 229 FVSVSRSKDGRYLFISSSSGTSESEVYLLDLDDGGSPDAKPKLLS-PREDGVEY-YVDHHGDRLYILTNDDAPNGRLVAV 306 (414)
T ss_dssp EEEEEE-TTSSEEEEEEESSSSEEEEEEEECCCTTTSS-SEEEEE-ESSSS-EE-EEEEETTEEEEEE-TT-TT-EEEEE
T ss_pred EEEEEecCcccEEEEEEEccccCCeEEEEeccccCCCcCCcEEEe-CCCCceEE-EEEccCCEEEEeeCCCCCCcEEEEe
Confidence 66677777777775443 2 46888888763 233333 22222222 23333566665 6543 24689999
Q ss_pred ecCCCCC---E-EEeecC-CCCCeeEEeecCCCeEEEecCCC--CeEEEEeCC-CCceE
Q psy6572 681 KMDGSNP---K-VIISKN-LSWPNALTISYETNELFWGDAHE--DYIAVSDLN-GENIK 731 (1416)
Q Consensus 681 ~mDGs~r---~-vlv~~~-l~~P~gLaiD~~~~rLYWtD~~~--~~I~~~~ld-G~~r~ 731 (1416)
+++.... . +|+... -....++.+ ..++|++..... .+|..++++ |....
T Consensus 307 ~l~~~~~~~~~~~l~~~~~~~~l~~~~~--~~~~Lvl~~~~~~~~~l~v~~~~~~~~~~ 363 (414)
T PF02897_consen 307 DLADPSPAEWWTVLIPEDEDVSLEDVSL--FKDYLVLSYRENGSSRLRVYDLDDGKESR 363 (414)
T ss_dssp ETTSTSGGGEEEEEE--SSSEEEEEEEE--ETTEEEEEEEETTEEEEEEEETT-TEEEE
T ss_pred cccccccccceeEEcCCCCceeEEEEEE--ECCEEEEEEEECCccEEEEEECCCCcEEe
Confidence 9987763 3 444422 123455555 477888776543 477788887 44333
No 131
>KOG0315|consensus
Probab=91.12 E-value=30 Score=38.89 Aligned_cols=128 Identities=11% Similarity=0.073 Sum_probs=84.1
Q ss_pred CCeEEEEecEEEEEEecCCcc---eEEecc-cccceeeeeecCCCeEEEeeccCCCccEEEEecCC-CCeEEee-cCCCc
Q psy6572 539 PPNLLFTNKYYIREVTQAGVM---TIRIHN-QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS-QPELLFP-ATSPD 612 (1416)
Q Consensus 539 ~~~li~s~~~~I~~i~l~g~~---~~~~~~-l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s-~~~~l~~-l~~p~ 612 (1416)
..+|..+....||.+++.... ...+.+ .+++.+|.|...++.+|-.... +.++...+.+ ...-+.. ...+.
T Consensus 52 k~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseD---gt~kIWdlR~~~~qR~~~~~spVn 128 (311)
T KOG0315|consen 52 KKDLAAAGNQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSED---GTVKIWDLRSLSCQRNYQHNSPVN 128 (311)
T ss_pred cchhhhccCCeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCC---ceEEEEeccCcccchhccCCCCcc
Confidence 445555666678888887665 222333 3778889998877777744322 3566555552 1122222 45567
Q ss_pred eEEEEccCCcEEEeeCCCCeEEEeecCCC-ceEEEEcCCCCCcceeeecCCcceEEEee
Q psy6572 613 GLTVDWVGRNLYWCDKGLDTIEVAKLDGR-FRKVLINKGLQEPRGIALNPAYGYMYWTD 670 (1416)
Q Consensus 613 gLAvD~~~~~LYwtD~~~~~I~v~~ldG~-~~~vLi~~~l~~P~gIavDp~~g~LYWtD 670 (1416)
.|.+.+....|+..|. .+.|.+-+|... ....|+.......+.|+|+|...+|--+.
T Consensus 129 ~vvlhpnQteLis~dq-sg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~n 186 (311)
T KOG0315|consen 129 TVVLHPNQTELISGDQ-SGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAAN 186 (311)
T ss_pred eEEecCCcceEEeecC-CCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEec
Confidence 8888998899988775 467888888655 45566667777789999999866655444
No 132
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=91.04 E-value=0.19 Score=39.34 Aligned_cols=23 Identities=26% Similarity=0.996 Sum_probs=18.9
Q ss_pred CCccc--ccccccccCCCCCeEEecCCCce
Q psy6572 1294 RTCSQ--ICIEKKISNTERTFSCHCAEGYH 1321 (1416)
Q Consensus 1294 ~~Csq--~C~n~~~~n~~gs~~C~C~~gy~ 1321 (1416)
.+|.+ +|+|+. |+|+|.|.+||.
T Consensus 9 ~~C~~~~~C~~~~-----g~~~C~C~~g~~ 33 (39)
T smart00179 9 NPCQNGGTCVNTV-----GSYRCECPPGYT 33 (39)
T ss_pred CCcCCCCEeECCC-----CCeEeECCCCCc
Confidence 45665 788666 999999999998
No 133
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. 
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. 
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigen nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=91.03 E-value=15 Score=44.33 Aligned_cols=83 Identities=13% Similarity=0.233 Sum_probs=54.4
Q ss_pred cceeee--cCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCC----CCeEEEEeCC-
Q psy6572 654 PRGIAL--NPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAH----EDYIAVSDLN- 726 (1416)
Q Consensus 654 P~gIav--Dp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~----~~~I~~~~ld- 726 (1416)
...+.+ ....++|++++...-.+|..++++|...+.|......--.-+.+|..+++||++-.. ...|++++++
T Consensus 237 ~~~~~~~~~~~~~~l~~s~~~G~~hly~~~~~~~~~~~lT~G~~~V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~ 316 (353)
T PF00930_consen 237 YDPPHFLGPDGNEFLWISERDGYRHLYLYDLDGGKPRQLTSGDWEVTSILGWDEDNNRIYFTANGDNPGERHLYRVSLDS 316 (353)
T ss_dssp SSEEEE-TTTSSEEEEEEETTSSEEEEEEETTSSEEEESS-SSS-EEEEEEEECTSSEEEEEESSGGTTSBEEEEEETTE
T ss_pred ecccccccCCCCEEEEEEEcCCCcEEEEEcccccceeccccCceeecccceEcCCCCEEEEEecCCCCCceEEEEEEeCC
Confidence 344444 344455666664334799999999988664444332211358889999999998664 4589999999
Q ss_pred CCceEEEEec
Q psy6572 727 GENIKIIVSR 736 (1416)
Q Consensus 727 G~~r~~v~~~ 736 (1416)
|...+.|...
T Consensus 317 ~~~~~~LT~~ 326 (353)
T PF00930_consen 317 GGEPKCLTCE 326 (353)
T ss_dssp TTEEEESSTT
T ss_pred CCCeEeccCC
Confidence 7777666543
No 134
>KOG0285|consensus
Probab=91.03 E-value=5.8 Score=46.28 Aligned_cols=123 Identities=16% Similarity=0.162 Sum_probs=81.7
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r 687 (1416)
+..++.|||||. +..|.|.+..++|-+.++.....+.-+.+-+...|++||.+..-|||-+- ....|.--+|.- +
T Consensus 151 lgWVr~vavdP~-n~wf~tgs~DrtikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~g--edk~VKCwDLe~--n 225 (460)
T KOG0285|consen 151 LGWVRSVAVDPG-NEWFATGSADRTIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAG--EDKQVKCWDLEY--N 225 (460)
T ss_pred cceEEEEeeCCC-ceeEEecCCCceeEEEEcccCeEEEeecchhheeeeeeecccCceEEEec--CCCeeEEEechh--h
Confidence 889999999986 66677878888999999987666665656778899999999988888664 222344434332 2
Q ss_pred EEE--eecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEec
Q psy6572 688 KVI--ISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSR 736 (1416)
Q Consensus 688 ~vl--v~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~ 736 (1416)
++| ....|.....|++.+. -++.++-.....|..-++.......++.+
T Consensus 226 kvIR~YhGHlS~V~~L~lhPT-ldvl~t~grDst~RvWDiRtr~~V~~l~G 275 (460)
T KOG0285|consen 226 KVIRHYHGHLSGVYCLDLHPT-LDVLVTGGRDSTIRVWDIRTRASVHVLSG 275 (460)
T ss_pred hhHHHhccccceeEEEecccc-ceeEEecCCcceEEEeeecccceEEEecC
Confidence 222 2234666677888654 44555555555666666655544445544
No 135
>smart00181 EGF Epidermal growth factor-like domain.
Probab=91.00 E-value=0.18 Score=38.67 Aligned_cols=25 Identities=60% Similarity=1.205 Sum_probs=20.1
Q ss_pred CCccc-eeeecCCeeeecCCCCcEEe
Q psy6572 502 RPCSH-YCRNTLGSYSCSCAPGYALL 526 (1416)
Q Consensus 502 ~~Csq-~C~nt~gsy~C~C~~Gy~L~ 526 (1416)
.+|.+ +|+++.++|+|.|++||.+.
T Consensus 6 ~~C~~~~C~~~~~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNGTCINTPGSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCCEEECCCCCeEeECCCCCccC
Confidence 46767 78888889999999998754
No 136
>KOG4289|consensus
Probab=90.76 E-value=0.19 Score=66.02 Aligned_cols=50 Identities=32% Similarity=0.976 Sum_probs=42.5
Q ss_pred ecCCceEEeeCCCceecCCCCCccc-cCCcCCCCCccc--eeeecCCeeeecCCCCcE
Q psy6572 470 DLKIGYKCACRKGYQVHPEDKHLCV-DTNECLDRPCSH--YCRNTLGSYSCSCAPGYA 524 (1416)
Q Consensus 470 nt~~gy~C~C~~Gy~L~p~d~~tC~-didEC~~~~Csq--~C~nt~gsy~C~C~~Gy~ 524 (1416)
+..+|++|.|++||.- .-|+ .||+|..++|+. .|....|+|+|.|.+||+
T Consensus 1217 ~pvnglrCrCPpGFTg-----d~CeTeiDlCYs~pC~nng~C~srEggYtCeCrpg~t 1269 (2531)
T KOG4289|consen 1217 HPVNGLRCRCPPGFTG-----DYCETEIDLCYSGPCGNNGRCRSREGGYTCECRPGFT 1269 (2531)
T ss_pred cccCceeEeCCCCCCc-----ccccchhHhhhcCCCCCCCceEEecCceeEEecCCcc
Confidence 4558899999999952 2565 489998899987 799999999999999997
No 137
>cd01475 vWA_Matrilin VWA_Matrilin: In the cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions, thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coil domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Probab=90.27 E-value=0.21 Score=56.13 Aligned_cols=38 Identities=32% Similarity=0.688 Sum_probs=29.5
Q ss_pred ccccccCCCCCCcccccceecCCceEEeeCCCceecCCCCC
Q psy6572 451 FVNECNVSHGGQLCAHECIDLKIGYKCACRKGYQVHPEDKH 491 (1416)
Q Consensus 451 ~i~eC~~~~~~~~Cs~~C~nt~~gy~C~C~~Gy~L~p~d~~ 491 (1416)
++++|...+.. |.|.|.+++++|.|.|++||.|. .+++
T Consensus 186 ~~~~C~~~~~~--c~~~C~~~~g~~~c~c~~g~~~~-~~~~ 223 (224)
T cd01475 186 VPDLCATLSHV--CQQVCISTPGSYLCACTEGYALL-EDNK 223 (224)
T ss_pred CchhhcCCCCC--ccceEEcCCCCEEeECCCCccCC-CCCC
Confidence 34566543334 99999999999999999999997 5543
No 138
>KOG0291|consensus
Probab=90.08 E-value=69 Score=41.29 Aligned_cols=146 Identities=14% Similarity=0.077 Sum_probs=81.6
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r 687 (1416)
......||+-+- |.+..|-...++|.+-+....+..+-.+..-....|+.+ +-.+ ..|..+.|||+-|
T Consensus 350 ~~~i~~l~YSpD-gq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f---------~~~g--~~llssSLDGtVR 417 (893)
T KOG0291|consen 350 SDRITSLAYSPD-GQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQF---------TARG--NVLLSSSLDGTVR 417 (893)
T ss_pred ccceeeEEECCC-CcEEEeccCCCcEEEEeccCceEEEEeccCCCceEEEEE---------EecC--CEEEEeecCCeEE
Confidence 445666777653 445567777788888887777666655443333334433 3222 3577788888765
Q ss_pred EE----------EeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccc-eeEEE--ec
Q psy6572 688 KV----------IISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHV-FALAV--FE 754 (1416)
Q Consensus 688 ~v----------lv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P-~~lav--~~ 754 (1416)
.- +....-..-.-||+|+.+..|.-.+...=.|+...+......-+++++ ..| ++|++ .+
T Consensus 418 AwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGH-------EgPVs~l~f~~~~ 490 (893)
T KOG0291|consen 418 AWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGH-------EGPVSGLSFSPDG 490 (893)
T ss_pred eeeecccceeeeecCCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCC-------CCcceeeEEcccc
Confidence 32 222222222479999865555555544557888877655566666664 233 34444 45
Q ss_pred CcEEEeecCCCeeEEeccc
Q psy6572 755 DHLFWTDWEMKSIERCDKY 773 (1416)
Q Consensus 755 d~LYwtD~~~~~I~~~nk~ 773 (1416)
..|+=..|. ++|..=+.+
T Consensus 491 ~~LaS~SWD-kTVRiW~if 508 (893)
T KOG0291|consen 491 SLLASGSWD-KTVRIWDIF 508 (893)
T ss_pred CeEEecccc-ceEEEEEee
Confidence 555555553 344333333
No 139
>KOG1446|consensus
Probab=90.06 E-value=43 Score=38.88 Aligned_cols=171 Identities=13% Similarity=0.072 Sum_probs=97.6
Q ss_pred EEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeecCCCceEEEEccCCcEEE
Q psy6572 549 YIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPATSPDGLTVDWVGRNLYW 625 (1416)
Q Consensus 549 ~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l~~p~gLAvD~~~~~LYw 625 (1416)
.||.+++.... ......-..+..|.+.+.+ -.|.+-... ..|+...+. .+-..++.+..+.-.|+|+ .|.||-
T Consensus 81 tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~-d~FlS~S~D--~tvrLWDlR~~~cqg~l~~~~~pi~AfDp-~GLifA 156 (311)
T KOG1446|consen 81 TIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKD-DTFLSSSLD--KTVRLWDLRVKKCQGLLNLSGRPIAAFDP-EGLIFA 156 (311)
T ss_pred ceEEEEeecCceEEEcCCCCceEEEEEecCCC-CeEEecccC--CeEEeeEecCCCCceEEecCCCcceeECC-CCcEEE
Confidence 46666665554 1222233456778888877 456665554 378888888 5666666677777889996 677777
Q ss_pred eeCCCCeEEEeecC----CCceEEEEc-CCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee--cCCCCC
Q psy6572 626 CDKGLDTIEVAKLD----GRFRKVLIN-KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS--KNLSWP 698 (1416)
Q Consensus 626 tD~~~~~I~v~~ld----G~~~~vLi~-~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~--~~l~~P 698 (1416)
+-.+...|...++. |-+.+..+. ....+-..|-.-|...+|..+..+.. ....-..+|+....+-. ....-|
T Consensus 157 ~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT~~s~-~~~lDAf~G~~~~tfs~~~~~~~~~ 235 (311)
T KOG1446|consen 157 LANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLSTNASF-IYLLDAFDGTVKSTFSGYPNAGNLP 235 (311)
T ss_pred EecCCCeEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEEeCCCc-EEEEEccCCcEeeeEeeccCCCCcc
Confidence 77776678777753 445555544 33445567777777777777663321 22233456663322221 112233
Q ss_pred eeEEeecCCCeEEEecCCCCeEEEEeC
Q psy6572 699 NALTISYETNELFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 699 ~gLaiD~~~~rLYWtD~~~~~I~~~~l 725 (1416)
-+-++-+ .++...+-+..++|...++
T Consensus 236 ~~a~ftP-ds~Fvl~gs~dg~i~vw~~ 261 (311)
T KOG1446|consen 236 LSATFTP-DSKFVLSGSDDGTIHVWNL 261 (311)
T ss_pred eeEEECC-CCcEEEEecCCCcEEEEEc
Confidence 3333333 4444444445566666665
No 140
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the C terminus of malarial parasite merozoite surface protein 1, as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=90.04 E-value=0.21 Score=39.14 Aligned_cols=29 Identities=48% Similarity=1.071 Sum_probs=20.4
Q ss_pred CCccc--eeeecCCeeeecCCCCcEEecCCCce
Q psy6572 502 RPCSH--YCRNTLGSYSCSCAPGYALLSDKHGC 532 (1416)
Q Consensus 502 ~~Csq--~C~nt~gsy~C~C~~Gy~L~~dg~sC 532 (1416)
+.|+. +|+++.++|+|.|.+||. .||..|
T Consensus 6 ~~C~~nA~C~~~~~~~~C~C~~Gy~--GdG~~C 36 (36)
T PF12947_consen 6 GGCHPNATCTNTGGSYTCTCKPGYE--GDGFFC 36 (36)
T ss_dssp GGS-TTCEEEE-TTSEEEEE-CEEE--CCSTCE
T ss_pred CCCCCCcEeecCCCCEEeECCCCCc--cCCcCC
Confidence 44544 899999999999999997 555554
No 141
>PTZ00421 coronin; Provisional
Probab=89.83 E-value=67 Score=40.73 Aligned_cols=115 Identities=8% Similarity=-0.058 Sum_probs=67.3
Q ss_pred ccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CC--------CeEEee-cCCCceEEEEccCCcEEEeeCCCCeEE
Q psy6572 565 NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQ--------PELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIE 634 (1416)
Q Consensus 565 ~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~--------~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~ 634 (1416)
....+.+|+|.+..+.++.+-... +.|+...+. .. ..++.. ...+..|++.+..++++++-...+.|.
T Consensus 74 H~~~V~~v~fsP~d~~~LaSgS~D--gtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVr 151 (493)
T PTZ00421 74 QEGPIIDVAFNPFDPQKLFTASED--GTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVN 151 (493)
T ss_pred CCCCEEEEEEcCCCCCEEEEEeCC--CEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEE
Confidence 345567889988544444443332 367766665 21 111222 445677888877777888877788899
Q ss_pred EeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecC
Q psy6572 635 VAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMD 683 (1416)
Q Consensus 635 v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mD 683 (1416)
+-++........+........+|++.|...+|+-+.. ...|...++.
T Consensus 152 IWDl~tg~~~~~l~~h~~~V~sla~spdG~lLatgs~--Dg~IrIwD~r 198 (493)
T PTZ00421 152 VWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSK--DKKLNIIDPR 198 (493)
T ss_pred EEECCCCeEEEEEcCCCCceEEEEEECCCCEEEEecC--CCEEEEEECC
Confidence 9988754433333333445778898886444443332 2355555554
No 142
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=89.83 E-value=0.29 Score=37.83 Aligned_cols=29 Identities=55% Similarity=1.313 Sum_probs=23.5
Q ss_pred CCcCCC-CCcc--ceeeecCCeeeecCCCCcE
Q psy6572 496 TNECLD-RPCS--HYCRNTLGSYSCSCAPGYA 524 (1416)
Q Consensus 496 idEC~~-~~Cs--q~C~nt~gsy~C~C~~Gy~ 524 (1416)
+++|.. .+|. ++|+++.++|+|.|++||.
T Consensus 2 ~~~C~~~~~C~~~~~C~~~~~~~~C~C~~g~~ 33 (38)
T cd00054 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYT 33 (38)
T ss_pred cccCCCCCCcCCCCEeECCCCCeEeECCCCCc
Confidence 566764 5773 5899999999999999996
No 143
>KOG2106|consensus
Probab=89.74 E-value=54 Score=40.34 Aligned_cols=156 Identities=13% Similarity=0.127 Sum_probs=92.2
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCC-CCceEEEEecCCCC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWG-QNAHIGKAKMDGSN 686 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g-~~~~I~ra~mDGs~ 686 (1416)
...|+.||- ..+-||+-- .++.|..-++.+.+..++.. ..+.-.|||++|... +|.|-.. ...+|++ .+
T Consensus 329 ~G~iRtv~e--~~~di~vGT-trN~iL~Gt~~~~f~~~v~g-h~delwgla~hps~~-q~~T~gqdk~v~lW~-----~~ 398 (626)
T KOG2106|consen 329 FGPIRTVAE--GKGDILVGT-TRNFILQGTLENGFTLTVQG-HGDELWGLATHPSKN-QLLTCGQDKHVRLWN-----DH 398 (626)
T ss_pred cCCeeEEec--CCCcEEEee-ccceEEEeeecCCceEEEEe-cccceeeEEcCCChh-heeeccCcceEEEcc-----CC
Confidence 344444443 233366543 45678888887766544442 234678999999854 4555421 1223333 22
Q ss_pred CEEEeecC--------CCCCee-EEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCc-
Q psy6572 687 PKVIISKN--------LSWPNA-LTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDH- 756 (1416)
Q Consensus 687 r~vlv~~~--------l~~P~g-LaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~- 756 (1416)
+ ++.+.. -.+|.| ||+-..+++.++.|..+..+..+..++....++.... ...-+++.-.++.
T Consensus 399 k-~~wt~~~~d~~~~~~fhpsg~va~Gt~~G~w~V~d~e~~~lv~~~~d~~~ls~v~ysp------~G~~lAvgs~d~~i 471 (626)
T KOG2106|consen 399 K-LEWTKIIEDPAECADFHPSGVVAVGTATGRWFVLDTETQDLVTIHTDNEQLSVVRYSP------DGAFLAVGSHDNHI 471 (626)
T ss_pred c-eeEEEEecCceeEeeccCcceEEEeeccceEEEEecccceeEEEEecCCceEEEEEcC------CCCEEEEecCCCeE
Confidence 2 222111 234554 5666678999999998877777777766555554432 3344556656665
Q ss_pred -EEEeecCCCeeEEecccCCCceEE
Q psy6572 757 -LFWTDWEMKSIERCDKYTGKNCTS 780 (1416)
Q Consensus 757 -LYwtD~~~~~I~~~nk~tG~~~~~ 780 (1416)
||-.+...+++.+++|-+|+..+.
T Consensus 472 yiy~Vs~~g~~y~r~~k~~gs~ith 496 (626)
T KOG2106|consen 472 YIYRVSANGRKYSRVGKCSGSPITH 496 (626)
T ss_pred EEEEECCCCcEEEEeeeecCceeEE
Confidence 555677778888999988854443
No 144
>PLN00181 protein SPA1-RELATED; Provisional
Probab=89.44 E-value=57 Score=43.96 Aligned_cols=156 Identities=8% Similarity=-0.027 Sum_probs=84.2
Q ss_pred ccceeeeeecCCCeEEEeeccCCCccEEEEecCC---C---C-eEEee---cCCCceEEEEccCCcEEEeeCCCCeEEEe
Q psy6572 567 TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS---Q---P-ELLFP---ATSPDGLTVDWVGRNLYWCDKGLDTIEVA 636 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s---~---~-~~l~~---l~~p~gLAvD~~~~~LYwtD~~~~~I~v~ 636 (1416)
..+.+|+|++.++.|...... +.|+...+.. . . ..+.. ...+.+|++.+..+.+..+-...+.|.+.
T Consensus 484 ~~V~~i~fs~dg~~latgg~D---~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~Dg~v~lW 560 (793)
T PLN00181 484 NLVCAIGFDRDGEFFATAGVN---KKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVW 560 (793)
T ss_pred CcEEEEEECCCCCEEEEEeCC---CEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeCCCeEEEE
Confidence 346788999876655544332 3666665431 0 0 01111 23345566655445555666667888888
Q ss_pred ecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCC
Q psy6572 637 KLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAH 716 (1416)
Q Consensus 637 ~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~ 716 (1416)
++........+........+|+++|..+.+++|-.. ...|..-++........+.. ......+.+....+.++.+-..
T Consensus 561 d~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~-Dg~v~iWd~~~~~~~~~~~~-~~~v~~v~~~~~~g~~latgs~ 638 (793)
T PLN00181 561 DVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSD-DGSVKLWSINQGVSIGTIKT-KANICCVQFPSESGRSLAFGSA 638 (793)
T ss_pred ECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcC-CCEEEEEECCCCcEEEEEec-CCCeEEEEEeCCCCCEEEEEeC
Confidence 877543333333334557789999877776666432 23555555543322222221 1233445554445666666556
Q ss_pred CCeEEEEeCCC
Q psy6572 717 EDYIAVSDLNG 727 (1416)
Q Consensus 717 ~~~I~~~~ldG 727 (1416)
.+.|...++..
T Consensus 639 dg~I~iwD~~~ 649 (793)
T PLN00181 639 DHKVYYYDLRN 649 (793)
T ss_pred CCeEEEEECCC
Confidence 67777777643
No 145
>KOG0268|consensus
Probab=89.27 E-value=5.6 Score=46.50 Aligned_cols=216 Identities=13% Similarity=0.141 Sum_probs=124.5
Q ss_pred CCeEEEEecEEEEEEecCCcceEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CC-CeEEee-cCCCceEE
Q psy6572 539 PPNLLFTNKYYIREVTQAGVMTIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQ-PELLFP-ATSPDGLT 615 (1416)
Q Consensus 539 ~~~li~s~~~~I~~i~l~g~~~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~-~~~l~~-l~~p~gLA 615 (1416)
+.++.+++...|++..+++...-.+.+-....||+-.+ .+.+|.|-. ..|...... .. ...+.. ......+.
T Consensus 120 ~~~~tvgdDKtvK~wk~~~~p~~tilg~s~~~gIdh~~-~~~~FaTcG----e~i~IWD~~R~~Pv~smswG~Dti~svk 194 (433)
T KOG0268|consen 120 TSFFTVGDDKTVKQWKIDGPPLHTILGKSVYLGIDHHR-KNSVFATCG----EQIDIWDEQRDNPVSSMSWGADSISSVK 194 (433)
T ss_pred cceEEecCCcceeeeeccCCcceeeecccccccccccc-ccccccccC----ceeeecccccCCccceeecCCCceeEEe
Confidence 45666777777777666664311111222334444433 233443322 234444443 11 122221 44556777
Q ss_pred EEccCCcEEEeeCCCCeEEEeecCCCc--eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeec
Q psy6572 616 VDWVGRNLYWCDKGLDTIEVAKLDGRF--RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK 693 (1416)
Q Consensus 616 vD~~~~~LYwtD~~~~~I~v~~ldG~~--~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~ 693 (1416)
+.++.-.|.-+-...+.|...++--.. ++|++ -.++++|+..| +++.|.+-. +...|+..+|.--.+.+-+..
T Consensus 195 fNpvETsILas~~sDrsIvLyD~R~~~Pl~KVi~---~mRTN~IswnP-eafnF~~a~-ED~nlY~~DmR~l~~p~~v~~ 269 (433)
T KOG0268|consen 195 FNPVETSILASCASDRSIVLYDLRQASPLKKVIL---TMRTNTICWNP-EAFNFVAAN-EDHNLYTYDMRNLSRPLNVHK 269 (433)
T ss_pred cCCCcchheeeeccCCceEEEecccCCccceeee---eccccceecCc-cccceeecc-ccccceehhhhhhcccchhhc
Confidence 777777777777777788888875332 44444 24689999999 899998863 334688888766554443332
Q ss_pred C-CCCCeeEEeec-CCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEe---ecCCCeeE
Q psy6572 694 N-LSWPNALTISY-ETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWT---DWEMKSIE 768 (1416)
Q Consensus 694 ~-l~~P~gLaiD~-~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwt---D~~~~~I~ 768 (1416)
+ ... -|.||+ +.|+=|++-+....|.....+....+.|+... .++|.+++.+..+.-|+. |-.+-+++
T Consensus 270 dhvsA--V~dVdfsptG~EfvsgsyDksIRIf~~~~~~SRdiYhtk-----RMq~V~~Vk~S~Dskyi~SGSdd~nvRlW 342 (433)
T KOG0268|consen 270 DHVSA--VMDVDFSPTGQEFVSGSYDKSIRIFPVNHGHSRDIYHTK-----RMQHVFCVKYSMDSKYIISGSDDGNVRLW 342 (433)
T ss_pred cccee--EEEeccCCCcchhccccccceEEEeecCCCcchhhhhHh-----hhheeeEEEEeccccEEEecCCCcceeee
Confidence 2 111 234443 46777888777788888887655555554433 388999999876665553 33334455
Q ss_pred Eec
Q psy6572 769 RCD 771 (1416)
Q Consensus 769 ~~n 771 (1416)
+++
T Consensus 343 ka~ 345 (433)
T KOG0268|consen 343 KAK 345 (433)
T ss_pred ecc
Confidence 554
No 146
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=89.20 E-value=4.7 Score=51.14 Aligned_cols=23 Identities=30% Similarity=0.596 Sum_probs=20.6
Q ss_pred CCCCCcceeeecCCcceEEEeeC
Q psy6572 649 KGLQEPRGIALNPAYGYMYWTDW 671 (1416)
Q Consensus 649 ~~l~~P~gIavDp~~g~LYWtD~ 671 (1416)
..+.+|.+|+++|.++.||++-.
T Consensus 347 T~f~RpEgi~~~p~~g~vY~a~T 369 (524)
T PF05787_consen 347 TPFDRPEGITVNPDDGEVYFALT 369 (524)
T ss_pred ccccCccCeeEeCCCCEEEEEEe
Confidence 46889999999999999999863
No 147
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=89.18 E-value=18 Score=39.71 Aligned_cols=66 Identities=14% Similarity=0.129 Sum_probs=39.2
Q ss_pred CCcceEEEeeCCCCceEEEEecCCCCCEEEee------------cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 661 PAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS------------KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 661 p~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~------------~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
+..|.||---|-.. +|.|+..+.......+. .+..-+||||.|+..+|+|.+--.-..++-+.+++
T Consensus 183 ~VdG~lyANVw~t~-~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~wp~lfEVk~~~ 260 (262)
T COG3823 183 WVDGELYANVWQTT-RIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKLWPLLFEVKLDE 260 (262)
T ss_pred eeccEEEEeeeeec-ceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCcCceeEEEEecC
Confidence 44566665555433 66666655333222211 22446899999999999999865555555555443
No 148
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity.
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis.
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigen nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=88.62 E-value=12 Score=45.00 Aligned_cols=97 Identities=16% Similarity=0.241 Sum_probs=58.6
Q ss_pred cCCCceEEEE-ccCCcEEEeeCCC--CeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCC---CceEEEEe
Q psy6572 608 ATSPDGLTVD-WVGRNLYWCDKGL--DTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQ---NAHIGKAK 681 (1416)
Q Consensus 608 l~~p~gLAvD-~~~~~LYwtD~~~--~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~---~~~I~ra~ 681 (1416)
+.....+.+- ..+..++|.-... .+|++++++|...+.|..+...--.-+++|+.++.||++-.+. ..+|++++
T Consensus 234 v~~~~~~~~~~~~~~~~l~~s~~~G~~hly~~~~~~~~~~~lT~G~~~V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~v~ 313 (353)
T PF00930_consen 234 VDVYDPPHFLGPDGNEFLWISERDGYRHLYLYDLDGGKPRQLTSGDWEVTSILGWDEDNNRIYFTANGDNPGERHLYRVS 313 (353)
T ss_dssp SSSSSEEEE-TTTSSEEEEEEETTSSEEEEEEETTSSEEEESS-SSS-EEEEEEEECTSSEEEEEESSGGTTSBEEEEEE
T ss_pred eeeecccccccCCCCEEEEEEEcCCCcEEEEEcccccceeccccCceeecccceEcCCCCEEEEEecCCCCCceEEEEEE
Confidence 4444455543 4555566655443 4899999999886655444333224688999999999997542 35899999
Q ss_pred cC-CCCCEEEeecCCCCCeeEEeec
Q psy6572 682 MD-GSNPKVIISKNLSWPNALTISY 705 (1416)
Q Consensus 682 mD-Gs~r~vlv~~~l~~P~gLaiD~ 705 (1416)
++ |...+.|......+ ..+++.+
T Consensus 314 ~~~~~~~~~LT~~~~~~-~~~~~Sp 337 (353)
T PF00930_consen 314 LDSGGEPKCLTCEDGDH-YSASFSP 337 (353)
T ss_dssp TTETTEEEESSTTSSTT-EEEEE-T
T ss_pred eCCCCCeEeccCCCCCc-eEEEECC
Confidence 99 55444443322222 3555543
No 149
>PRK13616 lipoprotein LpqB; Provisional
Probab=88.54 E-value=47 Score=42.98 Aligned_cols=185 Identities=12% Similarity=0.034 Sum_probs=94.7
Q ss_pred ccceeeeeecCCCeEEEeec-----cCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEEeeCC-----------C
Q psy6572 567 TNAVGLDFDWVDNCLYWSDV-----TMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYWCDKG-----------L 630 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~-----~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~-----------~ 630 (1416)
..+..+++.+.++++.++.. ......|+.+...+..+.+..-..-..-.+++.++.|+++..+ .
T Consensus 350 ~~vsspaiSpdG~~vA~v~~~~~~~~d~~s~Lwv~~~gg~~~~lt~g~~~t~PsWspDG~~lw~v~dg~~~~~v~~~~~~ 429 (591)
T PRK13616 350 GNITSAALSRSGRQVAAVVTLGRGAPDPASSLWVGPLGGVAVQVLEGHSLTRPSWSLDADAVWVVVDGNTVVRVIRDPAT 429 (591)
T ss_pred cCcccceECCCCCEEEEEEeecCCCCCcceEEEEEeCCCcceeeecCCCCCCceECCCCCceEEEecCcceEEEeccCCC
Confidence 35566777777777766652 1112356666655333333331112233455555555554322 2
Q ss_pred CeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEE---EecCCCCCEEE-----eecCCCC-CeeE
Q psy6572 631 DTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGK---AKMDGSNPKVI-----ISKNLSW-PNAL 701 (1416)
Q Consensus 631 ~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~r---a~mDGs~r~vl-----v~~~l~~-P~gL 701 (1416)
+.|++..+++...+. .--..+..|++-|...+|.++-.+ +|+. ...++.. ..| +...+.. +..|
T Consensus 430 gql~~~~vd~ge~~~---~~~g~Issl~wSpDG~RiA~i~~g---~v~Va~Vvr~~~G~-~~l~~~~~l~~~l~~~~~~l 502 (591)
T PRK13616 430 GQLARTPVDASAVAS---RVPGPISELQLSRDGVRAAMIIGG---KVYLAVVEQTEDGQ-YALTNPREVGPGLGDTAVSL 502 (591)
T ss_pred ceEEEEeccCchhhh---ccCCCcCeEEECCCCCEEEEEECC---EEEEEEEEeCCCCc-eeecccEEeecccCCccccc
Confidence 345555555432211 011247788888888777776532 4444 2323322 233 2223332 2333
Q ss_pred EeecCCCeEEEecCC-CCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCC
Q psy6572 702 TISYETNELFWGDAH-EDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEM 764 (1416)
Q Consensus 702 aiD~~~~rLYWtD~~-~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~ 764 (1416)
+-- ..+.|++.-.. ...|+++.+||...+.+.... ......+|+-..+.||+++...
T Consensus 503 ~W~-~~~~L~V~~~~~~~~v~~v~vDG~~~~~~~~~n-----~~~~v~~vaa~~~~iyv~~~~g 560 (591)
T PRK13616 503 DWR-TGDSLVVGRSDPEHPVWYVNLDGSNSDALPSRN-----LSAPVVAVAASPSTVYVTDARA 560 (591)
T ss_pred eEe-cCCEEEEEecCCCCceEEEecCCccccccCCCC-----ccCceEEEecCCceEEEEcCCc
Confidence 331 34556655333 346899999998877543332 1233356776777899997543
No 150
>smart00181 EGF Epidermal growth factor-like domain.
Probab=88.19 E-value=0.39 Score=36.81 Aligned_cols=25 Identities=32% Similarity=0.882 Sum_probs=20.8
Q ss_pred CCccc-ccccccccCCCCCeEEecCCCceec
Q psy6572 1294 RTCSQ-ICIEKKISNTERTFSCHCAEGYHMV 1323 (1416)
Q Consensus 1294 ~~Csq-~C~n~~~~n~~gs~~C~C~~gy~~~ 1323 (1416)
..|++ +|+++. |+|+|+|.+||.+.
T Consensus 6 ~~C~~~~C~~~~-----~~~~C~C~~g~~g~ 31 (35)
T smart00181 6 GPCSNGTCINTP-----GSYTCSCPPGYTGD 31 (35)
T ss_pred CCCCCCEEECCC-----CCeEeECCCCCccC
Confidence 56777 788766 99999999999874
No 151
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity. Mammals have 3 distinct paraoxonase types, termed PON1-3. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)-associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL). Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo. This family consists of arylesterases (also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood.
They confer resistance to organophosphate toxicity. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation.; GO: 0004064 arylesterase activity
Probab=88.01 E-value=1.1 Score=42.31 Aligned_cols=36 Identities=19% Similarity=0.333 Sum_probs=30.3
Q ss_pred EeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC
Q psy6572 690 IISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 690 lv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l 725 (1416)
.+.+++..|+||++++..++||+++...+.|.....
T Consensus 48 ~va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~ 83 (86)
T PF01731_consen 48 VVASGFSFANGIAISPDKKYLYVASSLAHSIHVYKR 83 (86)
T ss_pred EeeccCCCCceEEEcCCCCEEEEEeccCCeEEEEEe
Confidence 345679999999999999999999998888877653
No 152
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=87.87 E-value=20 Score=43.32 Aligned_cols=151 Identities=16% Similarity=0.143 Sum_probs=72.0
Q ss_pred eEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCC-Ccc
Q psy6572 580 CLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQ-EPR 655 (1416)
Q Consensus 580 ~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~-~P~ 655 (1416)
.||.++.. +...++.++|. ...+.|.. .....|..+-+..+.||+...+ .+|.+++|.....++|..-.-. ...
T Consensus 50 llF~s~~d-g~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~-~~l~~vdL~T~e~~~vy~~p~~~~g~ 127 (386)
T PF14583_consen 50 LLFASDFD-GNRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNG-RSLRRVDLDTLEERVVYEVPDDWKGY 127 (386)
T ss_dssp EEEEE-TT-SS-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETT-TEEEEEETTT--EEEEEE--TTEEEE
T ss_pred EEEEeccC-CCcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECC-CeEEEEECCcCcEEEEEECCcccccc
Confidence 34444543 34678888888 44455554 2334466667888898776533 5788999988776555532111 112
Q ss_pred eee-ecCCcceEEE------------eeCC---------CCceEEEEecCCCCCEEEeecC--CCCCeeEEeecCCCeEE
Q psy6572 656 GIA-LNPAYGYMYW------------TDWG---------QNAHIGKAKMDGSNPKVIISKN--LSWPNALTISYETNELF 711 (1416)
Q Consensus 656 gIa-vDp~~g~LYW------------tD~g---------~~~~I~ra~mDGs~r~vlv~~~--l~~P~gLaiD~~~~rLY 711 (1416)
|.. ++.. +.+++ ++|. ...+|.++++.+..+++|+.+. |.+|..--.|+ ..|-
T Consensus 128 gt~v~n~d-~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~~~~wlgH~~fsP~dp--~li~ 204 (386)
T PF14583_consen 128 GTWVANSD-CTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFEDTDWLGHVQFSPTDP--TLIM 204 (386)
T ss_dssp EEEEE-TT-SSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEEESS-EEEEEEETTEE--EEEE
T ss_pred cceeeCCC-ccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEecCccccCcccCCCCC--CEEE
Confidence 222 2322 22222 1111 1247999999988888888754 33332211221 2233
Q ss_pred Ee---cC--CCCeEEEEeCCCCceEEEEe
Q psy6572 712 WG---DA--HEDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 712 Wt---D~--~~~~I~~~~ldG~~r~~v~~ 735 (1416)
+. .+ -..+|+.++.||++.+.|..
T Consensus 205 fCHEGpw~~Vd~RiW~i~~dg~~~~~v~~ 233 (386)
T PF14583_consen 205 FCHEGPWDLVDQRIWTINTDGSNVKKVHR 233 (386)
T ss_dssp EEE-S-TTTSS-SEEEEETTS---EESS-
T ss_pred EeccCCcceeceEEEEEEcCCCcceeeec
Confidence 32 12 23599999999999988754
No 153
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=86.47 E-value=44 Score=37.72 Aligned_cols=106 Identities=19% Similarity=0.266 Sum_probs=59.9
Q ss_pred eCCCCeEEEeecCCCce--EEE------EcCCCCCcceeeec--CCcceEEEeeCCCCceEEEEec----CCCCCEEEee
Q psy6572 627 DKGLDTIEVAKLDGRFR--KVL------INKGLQEPRGIALN--PAYGYMYWTDWGQNAHIGKAKM----DGSNPKVIIS 692 (1416)
Q Consensus 627 D~~~~~I~v~~ldG~~~--~vL------i~~~l~~P~gIavD--p~~g~LYWtD~g~~~~I~ra~m----DGs~r~vlv~ 692 (1416)
|...++|.+..+++..+ +.+ +++++..|.||++. |.+|-+|+--.+...-|....| +|..+..++.
T Consensus 120 dR~~~~i~~y~Idp~~~~L~sitD~n~p~ss~~s~~YGl~lyrs~ktgd~yvfV~~~qG~~~Qy~l~d~gnGkv~~k~vR 199 (364)
T COG4247 120 DRQNDKIVFYKIDPNPQYLESITDSNAPYSSSSSSAYGLALYRSPKTGDYYVFVNRRQGDIAQYKLIDQGNGKVGTKLVR 199 (364)
T ss_pred cccCCeEEEEEeCCCccceeeccCCCCccccCcccceeeEEEecCCcCcEEEEEecCCCceeEEEEEecCCceEcceeeE
Confidence 34455677766665432 222 23567889999886 4556555443333345555544 2333333443
Q ss_pred c-CC-CCCeeEEeecCCCeEEEecCCCCeEEEEeC---CCCceEEE
Q psy6572 693 K-NL-SWPNALTISYETNELFWGDAHEDYIAVSDL---NGENIKII 733 (1416)
Q Consensus 693 ~-~l-~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l---dG~~r~~v 733 (1416)
. .| ..-.|+..|-..+.||.+.. .-.|++... .|..+++|
T Consensus 200 ~fk~~tQTEG~VaDdEtG~LYIaeE-dvaiWK~~Aep~~G~~g~~i 244 (364)
T COG4247 200 QFKIPTQTEGMVADDETGFLYIAEE-DVAIWKYEAEPNRGNTGRLI 244 (364)
T ss_pred eeecCCcccceeeccccceEEEeec-cceeeecccCCCCCCccchh
Confidence 1 12 23459999999999999874 335777663 24444444
No 154
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=86.17 E-value=4.2 Score=51.60 Aligned_cols=62 Identities=23% Similarity=0.459 Sum_probs=45.4
Q ss_pred cCCCceEEEEccCCcEEEeeCCC-------------------CeEEEeecCCC-------ceEE-EEc------------
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGL-------------------DTIEVAKLDGR-------FRKV-LIN------------ 648 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~-------------------~~I~v~~ldG~-------~~~v-Li~------------ 648 (1416)
+..|++|+|++..+.||++-... ++|++....+. ...+ ++.
T Consensus 349 f~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~~ 428 (524)
T PF05787_consen 349 FDRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGSN 428 (524)
T ss_pred ccCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCcccccccccC
Confidence 78999999999999999986433 37888887765 2222 222
Q ss_pred ----CCCCCcceeeecCCcceEEEee
Q psy6572 649 ----KGLQEPRGIALNPAYGYMYWTD 670 (1416)
Q Consensus 649 ----~~l~~P~gIavDp~~g~LYWtD 670 (1416)
..+..|-.|++|+. |.||+..
T Consensus 429 ~~~~~~f~sPDNL~~d~~-G~LwI~e 453 (524)
T PF05787_consen 429 KCDDNGFASPDNLAFDPD-GNLWIQE 453 (524)
T ss_pred cccCCCcCCCCceEECCC-CCEEEEe
Confidence 23778999999996 6666653
No 155
>smart00284 OLF Olfactomedin-like domains.
Probab=86.13 E-value=19 Score=41.25 Aligned_cols=135 Identities=15% Similarity=0.116 Sum_probs=75.4
Q ss_pred CCeEEEeeccCCCccEEEEecCCCCe---EEee-------------cCCCceEEEEccCCc-EEEeeCCCCeEEEeecCC
Q psy6572 578 DNCLYWSDVTMHGSSIRRSCNNSQPE---LLFP-------------ATSPDGLTVDWVGRN-LYWCDKGLDTIEVAKLDG 640 (1416)
Q Consensus 578 ~~~LYwtD~~~~~~~I~r~~l~s~~~---~l~~-------------l~~p~gLAvD~~~~~-LYwtD~~~~~I~v~~ldG 640 (1416)
++.||+.-... ..|.|++|.+... .+++ ...-..||||..+=. ||-|....+.|.+++||-
T Consensus 83 ngslYY~~~~s--~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g~ivvSkLnp 160 (255)
T smart00284 83 NGSLYFNKFNS--HDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAGKIVISKLNP 160 (255)
T ss_pred CceEEEEecCC--ccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCCCEEEEeeCc
Confidence 57888866654 3799999983322 1111 011247899976554 444555668899999987
Q ss_pred CceEEEEc--CCCCCcceeeecCCcceEEEeeCCC--CceEEEE-ecCCCCCEEEee----cCCCCCeeEEeecCCCeEE
Q psy6572 641 RFRKVLIN--KGLQEPRGIALNPAYGYMYWTDWGQ--NAHIGKA-KMDGSNPKVIIS----KNLSWPNALTISYETNELF 711 (1416)
Q Consensus 641 ~~~~vLi~--~~l~~P~gIavDp~~g~LYWtD~g~--~~~I~ra-~mDGs~r~vlv~----~~l~~P~gLaiD~~~~rLY 711 (1416)
....+... ..+.++.+=.-=-.=|.||.++... ..+|.-+ +...+. ...+. .....-..|...+.+++||
T Consensus 161 ~tL~ve~tW~T~~~k~sa~naFmvCGvLY~~~s~~~~~~~I~yayDt~t~~-~~~~~i~f~n~y~~~s~l~YNP~d~~LY 239 (255)
T smart00284 161 ATLTIENTWITTYNKRSASNAFMICGILYVTRSLGSKGEKVFYAYDTNTGK-EGHLDIPFENMYEYISMLDYNPNDRKLY 239 (255)
T ss_pred ccceEEEEEEcCCCcccccccEEEeeEEEEEccCCCCCcEEEEEEECCCCc-cceeeeeeccccccceeceeCCCCCeEE
Confidence 65554432 3444333211111238999998422 2355433 333322 21121 2223334588888889999
Q ss_pred EecC
Q psy6572 712 WGDA 715 (1416)
Q Consensus 712 WtD~ 715 (1416)
.-|-
T Consensus 240 ~wdn 243 (255)
T smart00284 240 AWNN 243 (255)
T ss_pred EEeC
Confidence 8773
No 156
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=86.03 E-value=89 Score=37.80 Aligned_cols=103 Identities=15% Similarity=0.171 Sum_probs=54.5
Q ss_pred cceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC-CCCceEEEEeccCCCC
Q psy6572 663 YGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL-NGENIKIIVSRRMDPT 741 (1416)
Q Consensus 663 ~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l-dG~~r~~v~~~~~~p~ 741 (1416)
.++||.+.+. ..|...++... +++-...+..+..++++ +++||+.+ ..+.|..+++ +|..+-......
T Consensus 241 ~~~vy~~~~~--g~l~a~d~~tG--~~~W~~~~~~~~~p~~~--~~~vyv~~-~~G~l~~~d~~tG~~~W~~~~~~---- 309 (377)
T TIGR03300 241 GGQVYAVSYQ--GRVAALDLRSG--RVLWKRDASSYQGPAVD--DNRLYVTD-ADGVVVALDRRSGSELWKNDELK---- 309 (377)
T ss_pred CCEEEEEEcC--CEEEEEECCCC--cEEEeeccCCccCceEe--CCEEEEEC-CCCeEEEEECCCCcEEEcccccc----
Confidence 4677777643 24555555311 22222223334455553 68899886 4567888887 343221110000
Q ss_pred cccccceeEEEecCcEEEeecCCCeeEEecccCCCceE
Q psy6572 742 INLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCT 779 (1416)
Q Consensus 742 ~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~ 779 (1416)
-....+..+.+++||..+ ..+.|+.++..+|+...
T Consensus 310 --~~~~ssp~i~g~~l~~~~-~~G~l~~~d~~tG~~~~ 344 (377)
T TIGR03300 310 --YRQLTAPAVVGGYLVVGD-FEGYLHWLSREDGSFVA 344 (377)
T ss_pred --CCccccCEEECCEEEEEe-CCCEEEEEECCCCCEEE
Confidence 001122345678888876 34678777777776543
No 157
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=85.93 E-value=33 Score=42.51 Aligned_cols=121 Identities=11% Similarity=-0.045 Sum_probs=72.6
Q ss_pred EEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-cCCCceEEEEccCCcEEE
Q psy6572 549 YIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP-ATSPDGLTVDWVGRNLYW 625 (1416)
Q Consensus 549 ~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD~~~~~LYw 625 (1416)
.|+.++++... ..++.-..+..+.+|-+.+++|.++-.......|+.+++. +....|.. ...-..=.+-+.+.+||+
T Consensus 219 ~i~~~~l~~g~~~~i~~~~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~~~gi~~~Ps~spdG~~ivf 298 (425)
T COG0823 219 RIYYLDLNTGKRPVILNFNGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTNGFGINTSPSWSPDGSKIVF 298 (425)
T ss_pred eEEEEeccCCccceeeccCCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceecccCCccccCccCCCCCCEEEE
Confidence 57777776665 4444445556677888888888888766655678988888 33222332 111112234456788888
Q ss_pred eeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEee
Q psy6572 626 CDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTD 670 (1416)
Q Consensus 626 tD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD 670 (1416)
+... ...|++++++|+..+.+... ......-.+.|...+|-+..
T Consensus 299 ~Sdr~G~p~I~~~~~~g~~~~riT~~-~~~~~~p~~SpdG~~i~~~~ 344 (425)
T COG0823 299 TSDRGGRPQIYLYDLEGSQVTRLTFS-GGGNSNPVWSPDGDKIVFES 344 (425)
T ss_pred EeCCCCCcceEEECCCCCceeEeecc-CCCCcCccCCCCCCEEEEEe
Confidence 7554 45899999999986544432 22222344555544544443
No 158
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminus of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=85.30 E-value=60 Score=39.89 Aligned_cols=133 Identities=15% Similarity=0.146 Sum_probs=69.9
Q ss_pred EEEEEecCCcc---eEEecccccc---eeeeeecCCCeEEEeeccCCC-ccEEEEecCC------CCeEEeecCCCceEE
Q psy6572 549 YIREVTQAGVM---TIRIHNQTNA---VGLDFDWVDNCLYWSDVTMHG-SSIRRSCNNS------QPELLFPATSPDGLT 615 (1416)
Q Consensus 549 ~I~~i~l~g~~---~~~~~~l~~~---~~l~~D~~~~~LYwtD~~~~~-~~I~r~~l~s------~~~~l~~l~~p~gLA 615 (1416)
.|++..+.... .+++...... +++.....++.|++.-..... ..|+.+.+.. ..+.|..-.....-.
T Consensus 203 ~v~~~~~gt~~~~d~lvfe~~~~~~~~~~~~~s~d~~~l~i~~~~~~~~s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~ 282 (414)
T PF02897_consen 203 QVYRHKLGTPQSEDELVFEEPDEPFWFVSVSRSKDGRYLFISSSSGTSESEVYLLDLDDGGSPDAKPKLLSPREDGVEYY 282 (414)
T ss_dssp EEEEEETTS-GGG-EEEEC-TTCTTSEEEEEE-TTSSEEEEEEESSSSEEEEEEEECCCTTTSS-SEEEEEESSSS-EEE
T ss_pred EEEEEECCCChHhCeeEEeecCCCcEEEEEEecCcccEEEEEEEccccCCeEEEEeccccCCCcCCcEEEeCCCCceEEE
Confidence 45555554433 3565544433 466666667777765444322 4577777763 233343312222234
Q ss_pred EEccCCcEEE-eeCC--CCeEEEeecCCCc----eEEEEcCCC-CCcceeeecCCcceEEEeeC-CCCceEEEEecC
Q psy6572 616 VDWVGRNLYW-CDKG--LDTIEVAKLDGRF----RKVLINKGL-QEPRGIALNPAYGYMYWTDW-GQNAHIGKAKMD 683 (1416)
Q Consensus 616 vD~~~~~LYw-tD~~--~~~I~v~~ldG~~----~~vLi~~~l-~~P~gIavDp~~g~LYWtD~-g~~~~I~ra~mD 683 (1416)
+++.++.||+ |+.. ..+|.+++++... ..+|+...- ....++.+ ..++|++... +..++|.+.+++
T Consensus 283 v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~~~~l~~~~~--~~~~Lvl~~~~~~~~~l~v~~~~ 357 (414)
T PF02897_consen 283 VDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDEDVSLEDVSL--FKDYLVLSYRENGSSRLRVYDLD 357 (414)
T ss_dssp EEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SSSEEEEEEEE--ETTEEEEEEEETTEEEEEEEETT
T ss_pred EEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCCceeEEEEEE--ECCEEEEEEEECCccEEEEEECC
Confidence 5556887776 5543 4589888887654 335554322 23455555 4567777653 334678888888
No 159
>KOG0268|consensus
Probab=85.18 E-value=12 Score=43.90 Aligned_cols=252 Identities=10% Similarity=0.027 Sum_probs=107.3
Q ss_pred CCCceEecCCCCCeEEEEe--cEEEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCC
Q psy6572 528 DKHGCKATSDVPPNLLFTN--KYYIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQP 602 (1416)
Q Consensus 528 dg~sC~a~~~~~~~li~s~--~~~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~ 602 (1416)
||-.|.|-.+..+..+++. ...++..++.... ..+....+-+.||.++. +..++..|.. +|.+..++ ...
T Consensus 67 dGV~~lakhp~~ls~~aSGs~DG~VkiWnlsqR~~~~~f~AH~G~V~Gi~v~~-~~~~tvgdDK----tvK~wk~~~~p~ 141 (433)
T KOG0268|consen 67 DGVSCLAKHPNKLSTVASGSCDGEVKIWNLSQRECIRTFKAHEGLVRGICVTQ-TSFFTVGDDK----TVKQWKIDGPPL 141 (433)
T ss_pred cccchhhcCcchhhhhhccccCceEEEEehhhhhhhheeecccCceeeEEecc-cceEEecCCc----ceeeeeccCCcc
Confidence 4555666544443334332 2334444444433 12222334567787776 5556655443 56666555 222
Q ss_pred eEEeecCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEec
Q psy6572 603 ELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM 682 (1416)
Q Consensus 603 ~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m 682 (1416)
.++..-....||.--| .+++|-|-.. .|.+-+..-.....-++-+......|...|..-.|.-+-. ....|...+|
T Consensus 142 ~tilg~s~~~gIdh~~-~~~~FaTcGe--~i~IWD~~R~~Pv~smswG~Dti~svkfNpvETsILas~~-sDrsIvLyD~ 217 (433)
T KOG0268|consen 142 HTILGKSVYLGIDHHR-KNSVFATCGE--QIDIWDEQRDNPVSSMSWGADSISSVKFNPVETSILASCA-SDRSIVLYDL 217 (433)
T ss_pred eeeecccccccccccc-ccccccccCc--eeeecccccCCccceeecCCCceeEEecCCCcchheeeec-cCCceEEEec
Confidence 2332212222222221 2334433221 2333332211111111112222334444444333333221 1223444444
Q ss_pred CCCC--CEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE-ecCcEEE
Q psy6572 683 DGSN--PKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV-FEDHLFW 759 (1416)
Q Consensus 683 DGs~--r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYw 759 (1416)
.-.. +++++. ..+|+|...+ +...|++-.....|+..++.--.+.+-+..+ .+...+.+++ ..|.=|+
T Consensus 218 R~~~Pl~KVi~~---mRTN~IswnP-eafnF~~a~ED~nlY~~DmR~l~~p~~v~~d-----hvsAV~dVdfsptG~Efv 288 (433)
T KOG0268|consen 218 RQASPLKKVILT---MRTNTICWNP-EAFNFVAANEDHNLYTYDMRNLSRPLNVHKD-----HVSAVMDVDFSPTGQEFV 288 (433)
T ss_pred ccCCccceeeee---ccccceecCc-cccceeeccccccceehhhhhhcccchhhcc-----cceeEEEeccCCCcchhc
Confidence 3222 222222 3577888877 7777777666666776665432222211111 0122223333 2455566
Q ss_pred eecCCCeeEEecccCCCceEEEEeCCCCCCeeeeeecc
Q psy6572 760 TDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVYHP 797 (1416)
Q Consensus 760 tD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~h~ 797 (1416)
+-...++|.-.....|..+.+.....+.+.+.++-.+.
T Consensus 289 sgsyDksIRIf~~~~~~SRdiYhtkRMq~V~~Vk~S~D 326 (433)
T KOG0268|consen 289 SGSYDKSIRIFPVNHGHSRDIYHTKRMQHVFCVKYSMD 326 (433)
T ss_pred cccccceEEEeecCCCcchhhhhHhhhheeeEEEEecc
Confidence 66665555444434444444443344444555544433
No 160
>KOG4649|consensus
Probab=85.16 E-value=31 Score=39.02 Aligned_cols=61 Identities=18% Similarity=0.127 Sum_probs=40.2
Q ss_pred CceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCC
Q psy6572 611 PDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDG 684 (1416)
Q Consensus 611 p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDG 684 (1416)
-.-+|||+.+|+|||-..-..+|+-..+ ++ .++ |+|--.+|.||+.+......|+..-.-+
T Consensus 33 ~~~~avd~~sG~~~We~ilg~RiE~sa~-------vv-gdf-----VV~GCy~g~lYfl~~~tGs~~w~f~~~~ 93 (354)
T KOG4649|consen 33 GIVIAVDPQSGNLIWEAILGVRIECSAI-------VV-GDF-----VVLGCYSGGLYFLCVKTGSQIWNFVILE 93 (354)
T ss_pred ceEEEecCCCCcEEeehhhCceeeeeeE-------EE-CCE-----EEEEEccCcEEEEEecchhheeeeeehh
Confidence 3457999999999998776667764332 11 121 5556677889998876555666554433
No 161
>KOG1407|consensus
Probab=85.05 E-value=76 Score=36.15 Aligned_cols=119 Identities=15% Similarity=0.267 Sum_probs=63.6
Q ss_pred ceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEe
Q psy6572 612 DGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVII 691 (1416)
Q Consensus 612 ~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv 691 (1416)
.-|+.. ..++||+...+.+.|+++..-...+..-+...-..-..|.+||..+|+-.- +..+.+..-+++--.-.. .
T Consensus 151 ne~~w~-~~nd~Fflt~GlG~v~ILsypsLkpv~si~AH~snCicI~f~p~GryfA~G--sADAlvSLWD~~ELiC~R-~ 226 (313)
T KOG1407|consen 151 NEISWN-NSNDLFFLTNGLGCVEILSYPSLKPVQSIKAHPSNCICIEFDPDGRYFATG--SADALVSLWDVDELICER-C 226 (313)
T ss_pred eeeeec-CCCCEEEEecCCceEEEEeccccccccccccCCcceEEEEECCCCceEeec--cccceeeccChhHhhhhe-e
Confidence 344554 788999999999999998876433322222222345579999986654321 111111111111111001 1
Q ss_pred ecCCCCC-eeEEeecCCCeEEEecCCCCeEEEEeC-CCCceEEEEe
Q psy6572 692 SKNLSWP-NALTISYETNELFWGDAHEDYIAVSDL-NGENIKIIVS 735 (1416)
Q Consensus 692 ~~~l~~P-~gLaiD~~~~rLYWtD~~~~~I~~~~l-dG~~r~~v~~ 735 (1416)
-+.|.|| +.|.+.+ .+++.-+-+....|-.+.. +|.....|..
T Consensus 227 isRldwpVRTlSFS~-dg~~lASaSEDh~IDIA~vetGd~~~eI~~ 271 (313)
T KOG1407|consen 227 ISRLDWPVRTLSFSH-DGRMLASASEDHFIDIAEVETGDRVWEIPC 271 (313)
T ss_pred eccccCceEEEEecc-CcceeeccCccceEEeEecccCCeEEEeec
Confidence 1357888 5677765 4454444444556767764 5555554443
No 162
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues, found in the sequence of epidermal growth factor (EGF), has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=84.70 E-value=0.79 Score=34.86 Aligned_cols=23 Identities=43% Similarity=1.288 Sum_probs=18.2
Q ss_pred CCccc--eeeecC-CeeeecCCCCcE
Q psy6572 502 RPCSH--YCRNTL-GSYSCSCAPGYA 524 (1416)
Q Consensus 502 ~~Csq--~C~nt~-gsy~C~C~~Gy~ 524 (1416)
.+|.+ .|++.. ++|+|.|++||.
T Consensus 4 ~~C~n~g~C~~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 4 NPCQNGGTCIDLPGGGYTCECPPGYT 29 (32)
T ss_dssp TSSTTTEEEEEESTSEEEEEEBTTEE
T ss_pred CcCCCCeEEEeCCCCCEEeECCCCCc
Confidence 45655 788888 899999999986
No 163
>PTZ00421 coronin; Provisional
Probab=84.60 E-value=1.3e+02 Score=38.30 Aligned_cols=160 Identities=10% Similarity=-0.025 Sum_probs=80.6
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCe-EEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCC-C
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPE-LLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDG-R 641 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~-~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG-~ 641 (1416)
...+..|+|.+..+.++++-... +.|+..++. .... .+.. ...+.+|++.+ .+.++++-...++|.+.++.. .
T Consensus 125 ~~~V~~l~f~P~~~~iLaSgs~D--gtVrIWDl~tg~~~~~l~~h~~~V~sla~sp-dG~lLatgs~Dg~IrIwD~rsg~ 201 (493)
T PTZ00421 125 TKKVGIVSFHPSAMNVLASAGAD--MVVNVWDVERGKAVEVIKCHSDQITSLEWNL-DGSLLCTTSKDKKLNIIDPRDGT 201 (493)
T ss_pred CCcEEEEEeCcCCCCEEEEEeCC--CEEEEEECCCCeEEEEEcCCCCceEEEEEEC-CCCEEEEecCCCEEEEEECCCCc
Confidence 34567788988765555554443 367777777 3222 2222 34567788776 455666777778888888753 3
Q ss_pred ceEEEEcCCCCCcceeeecCCcceEEEeeCC--CCceEEEEecCCCC-CEEEeecCC-CCCeeEEeecCCCeEEEecCCC
Q psy6572 642 FRKVLINKGLQEPRGIALNPAYGYMYWTDWG--QNAHIGKAKMDGSN-PKVIISKNL-SWPNALTISYETNELFWGDAHE 717 (1416)
Q Consensus 642 ~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g--~~~~I~ra~mDGs~-r~vlv~~~l-~~P~gLaiD~~~~rLYWtD~~~ 717 (1416)
....+..........++..+..++|+.+-+. ....|..-++.... ...++.... ....-..+|+....||.+-.+.
T Consensus 202 ~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~~d~~~~~~~~~~d~d~~~L~lggkgD 281 (493)
T PTZ00421 202 IVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDTNLLYIGSKGE 281 (493)
T ss_pred EEEEEecCCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeCCCCCCceeEeccCCCCceEEEEEcCCCCEEEEEEeCC
Confidence 3333322211122234444554555543221 11234444443222 111111111 1111234565566667665456
Q ss_pred CeEEEEeCCCC
Q psy6572 718 DYIAVSDLNGE 728 (1416)
Q Consensus 718 ~~I~~~~ldG~ 728 (1416)
+.|...++...
T Consensus 282 g~Iriwdl~~~ 292 (493)
T PTZ00421 282 GNIRCFELMNE 292 (493)
T ss_pred CeEEEEEeeCC
Confidence 77877777543
No 164
>KOG0318|consensus
Probab=83.85 E-value=1.2e+02 Score=37.60 Aligned_cols=103 Identities=17% Similarity=0.081 Sum_probs=57.7
Q ss_pred CCCceEEEEccCCcEEEeeCCCCeEEEee-cCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC
Q psy6572 609 TSPDGLTVDWVGRNLYWCDKGLDTIEVAK-LDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 609 ~~p~gLAvD~~~~~LYwtD~~~~~I~v~~-ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r 687 (1416)
.+|.+||+..-++.+.++-.. .|.+.+ +.+- ....-...|.++||.|....+-+ -|+..+|....|.|...
T Consensus 406 ~QP~~lav~~d~~~avv~~~~--~iv~l~~~~~~----~~~~~~y~~s~vAv~~~~~~vaV--GG~Dgkvhvysl~g~~l 477 (603)
T KOG0318|consen 406 SQPKGLAVLSDGGTAVVACIS--DIVLLQDQTKV----SSIPIGYESSAVAVSPDGSEVAV--GGQDGKVHVYSLSGDEL 477 (603)
T ss_pred CCceeEEEcCCCCEEEEEecC--cEEEEecCCcc----eeeccccccceEEEcCCCCEEEE--ecccceEEEEEecCCcc
Confidence 456777776554444444322 222222 1111 11123456888999888654433 34455777778887553
Q ss_pred E-E-EeecCCCCCeeEEeecCCCeEEEecCCCCe
Q psy6572 688 K-V-IISKNLSWPNALTISYETNELFWGDAHEDY 719 (1416)
Q Consensus 688 ~-v-lv~~~l~~P~gLaiD~~~~rLYWtD~~~~~ 719 (1416)
. . +.......+..|++.+...+|-..|+....
T Consensus 478 ~ee~~~~~h~a~iT~vaySpd~~yla~~Da~rkv 511 (603)
T KOG0318|consen 478 KEEAKLLEHRAAITDVAYSPDGAYLAAGDASRKV 511 (603)
T ss_pred cceeeeecccCCceEEEECCCCcEEEEeccCCcE
Confidence 2 2 233445667888988777777777765443
No 165
>KOG2048|consensus
Probab=83.73 E-value=54 Score=41.63 Aligned_cols=192 Identities=12% Similarity=0.050 Sum_probs=109.1
Q ss_pred CceEecCCCCCeEEEEe--cEEEEEEecCCcc-eEEec----ccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CC
Q psy6572 530 HGCKATSDVPPNLLFTN--KYYIREVTQAGVM-TIRIH----NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQ 601 (1416)
Q Consensus 530 ~sC~a~~~~~~~li~s~--~~~I~~i~l~g~~-~~~~~----~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~ 601 (1416)
-+|-++.+....+.++. +..|+++..++.. ...+. .+..+..+.|-...++|++.-... ..+..+.+. ..
T Consensus 385 Is~~aiSPdg~~Ia~st~~~~~iy~L~~~~~vk~~~v~~~~~~~~~a~~i~ftid~~k~~~~s~~~--~~le~~el~~ps 462 (691)
T KOG2048|consen 385 ISCAAISPDGNLIAISTVSRTKIYRLQPDPNVKVINVDDVPLALLDASAISFTIDKNKLFLVSKNI--FSLEEFELETPS 462 (691)
T ss_pred eeeeccCCCCCEEEEeeccceEEEEeccCcceeEEEeccchhhhccceeeEEEecCceEEEEeccc--ceeEEEEecCcc
Confidence 35666655555555554 5567888777643 22222 223445566666666776665332 245555555 22
Q ss_pred CeEEee------cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeec-CCcceEEEeeCCCC
Q psy6572 602 PELLFP------ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALN-PAYGYMYWTDWGQN 674 (1416)
Q Consensus 602 ~~~l~~------l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavD-p~~g~LYWtD~g~~ 674 (1416)
.+.+.. .....-|++-+.+++|-..+ ..+.|.+.+|.+...+-|+..--...++.++. +.++.|-+++.++
T Consensus 463 ~kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~-t~g~I~v~nl~~~~~~~l~~rln~~vTa~~~~~~~~~~lvvats~n- 540 (691)
T KOG2048|consen 463 FKELKSIQSQAKCPSISRLVVSSDGNYIAAIS-TRGQIFVYNLETLESHLLKVRLNIDVTAAAFSPFVRNRLVVATSNN- 540 (691)
T ss_pred hhhhhccccccCCCcceeEEEcCCCCEEEEEe-ccceEEEEEcccceeecchhccCcceeeeeccccccCcEEEEecCC-
Confidence 222221 44566788888888887776 67899999999887666652211345566666 5566777776443
Q ss_pred ceEEEEecCCCCC--------EEEee---cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 675 AHIGKAKMDGSNP--------KVIIS---KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 675 ~~I~ra~mDGs~r--------~vlv~---~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
.|...+|...+. +.+.. +.+..-.||.+|+.+...+|.= ..+-+..+++++
T Consensus 541 -Qv~efdi~~~~l~~ws~~nt~nlpk~~~~l~~~~~gisfd~~n~s~~~~~-~a~w~~~id~~~ 602 (691)
T KOG2048|consen 541 -QVFEFDIEARNLTRWSKNNTRNLPKEPKTLIPGIPGISFDPKNSSRFIVY-DAHWSCLIDFSL 602 (691)
T ss_pred -eEEEEecchhhhhhhhhccccccccChhhcCCCCceEEeCCCCccEEEEE-cCcEEEEEecCC
Confidence 566666633321 11111 1233346899998777777663 233445555443
No 166
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=83.72 E-value=1.2 Score=47.44 Aligned_cols=62 Identities=27% Similarity=0.830 Sum_probs=42.6
Q ss_pred ecCCceEEeeCCCceecCCCCCccccCCcCC-----CCCccc--eeeecC-----CeeeecCCCCcEEecCCCceEec
Q psy6572 470 DLKIGYKCACRKGYQVHPEDKHLCVDTNECL-----DRPCSH--YCRNTL-----GSYSCSCAPGYALLSDKHGCKAT 535 (1416)
Q Consensus 470 nt~~gy~C~C~~Gy~L~p~d~~tC~didEC~-----~~~Csq--~C~nt~-----gsy~C~C~~Gy~L~~dg~sC~a~ 535 (1416)
+....|+|.|.+||.|. ...+|+...+|. ..+|.. .|++.. ..|+|.|.+||.|..+ .|+..
T Consensus 15 QMSNHfEC~Cnegfvl~--~EntCE~kv~C~~~e~~~K~Cgdya~C~~~~~~~~~~~~~C~C~~gY~~~~~--vCvp~ 88 (197)
T PF06247_consen 15 QMSNHFECKCNEGFVLK--NENTCEEKVECDKLENVNKPCGDYAKCINQANKGEERAYKCDCINGYILKQG--VCVPN 88 (197)
T ss_dssp EESSEEEEEESTTEEEE--ETTEEEE----SG-GGTTSEEETTEEEEE-SSTTSSTSEEEEE-TTEEESSS--SEEEG
T ss_pred EccCceEEEcCCCcEEc--cccccccceecCcccccCccccchhhhhcCCCcccceeEEEecccCceeeCC--eEchh
Confidence 34466999999999997 556899888886 356754 788876 4899999999998765 56543
No 167
>KOG4328|consensus
Probab=82.99 E-value=84 Score=38.30 Aligned_cols=110 Identities=19% Similarity=0.128 Sum_probs=59.6
Q ss_pred ccceeeeeecCCC-eEEEeeccCCCccEEEEecC-CCCeEEeec----CCCceEEEEccCCcEEEeeCCC-CeEEEeecC
Q psy6572 567 TNAVGLDFDWVDN-CLYWSDVTMHGSSIRRSCNN-SQPELLFPA----TSPDGLTVDWVGRNLYWCDKGL-DTIEVAKLD 639 (1416)
Q Consensus 567 ~~~~~l~~D~~~~-~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l----~~p~gLAvD~~~~~LYwtD~~~-~~I~v~~ld 639 (1416)
..+.+|.|.+.+- +||-+.+. +.|+...++ ...++|.++ ..-.++.+-...+.+|+.+... -.+.-.+++
T Consensus 235 ~~Vs~l~F~P~n~s~i~ssSyD---GtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G~f~~iD~R~~ 311 (498)
T KOG4328|consen 235 GPVSGLKFSPANTSQIYSSSYD---GTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVGNFNVIDLRTD 311 (498)
T ss_pred ccccceEecCCChhheeeeccC---ceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeecccceEEEEeecC
Confidence 3466788877543 45544443 377777777 333333332 1223444555566666655432 133334455
Q ss_pred CCceEEEEcCCCCCcceeeecCCcceEEEeeCCC-CceEEEE
Q psy6572 640 GRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQ-NAHIGKA 680 (1416)
Q Consensus 640 G~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~-~~~I~ra 680 (1416)
|+....+.... .+.++|+++|...+++.|-.-. ..+|+=+
T Consensus 312 ~s~~~~~~lh~-kKI~sv~~NP~~p~~laT~s~D~T~kIWD~ 352 (498)
T KOG4328|consen 312 GSEYENLRLHK-KKITSVALNPVCPWFLATASLDQTAKIWDL 352 (498)
T ss_pred Cccchhhhhhh-cccceeecCCCCchheeecccCcceeeeeh
Confidence 65333222222 2789999999999888775322 2456644
No 168
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=82.54 E-value=1.1 Score=33.95 Aligned_cols=25 Identities=56% Similarity=1.128 Sum_probs=18.8
Q ss_pred CCcc--ceeeecCCeeeecCCCCcEEe
Q psy6572 502 RPCS--HYCRNTLGSYSCSCAPGYALL 526 (1416)
Q Consensus 502 ~~Cs--q~C~nt~gsy~C~C~~Gy~L~ 526 (1416)
.+|. .+|+++.++|+|.|+.||...
T Consensus 6 ~~C~~~~~C~~~~~~~~C~C~~g~~g~ 32 (36)
T cd00053 6 NPCSNGGTCVNTPGSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCEEecCCCCeEeECCCCCccc
Confidence 4454 478888888999999888643
No 169
>KOG0266|consensus
Probab=82.07 E-value=1.5e+02 Score=37.23 Aligned_cols=153 Identities=12% Similarity=0.109 Sum_probs=89.1
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecC---CCCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCC
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN---SQPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGR 641 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~---s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~ 641 (1416)
...+..++|-+... |.+-...+ .+|+...+. ...++|.. ...+.+++|.+.+ +|+.+-...+.|.+-++.+.
T Consensus 203 ~~~v~~~~fs~d~~--~l~s~s~D-~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g-~~i~Sgs~D~tvriWd~~~~ 278 (456)
T KOG0266|consen 203 TRGVSDVAFSPDGS--YLLSGSDD-KTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDG-NLLVSGSDDGTVRIWDVRTG 278 (456)
T ss_pred ccceeeeEECCCCc--EEEEecCC-ceEEEeeccCCCeEEEEecCCCCceEEEEecCCC-CEEEEecCCCcEEEEeccCC
Confidence 34456677766555 33322221 356666663 23344544 6677899999988 88888888899999988864
Q ss_pred -ceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCE--EEeecCCCC-Ce-eEEeecCCCeEEEecCC
Q psy6572 642 -FRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPK--VIISKNLSW-PN-ALTISYETNELFWGDAH 716 (1416)
Q Consensus 642 -~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~--vlv~~~l~~-P~-gLaiD~~~~rLYWtD~~ 716 (1416)
..++|. ..-....+|++.+...+|.-+.+ ...|..-++.+...+ .++...... |. -+.+. .+++..|+-..
T Consensus 279 ~~~~~l~-~hs~~is~~~f~~d~~~l~s~s~--d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fs-p~~~~ll~~~~ 354 (456)
T KOG0266|consen 279 ECVRKLK-GHSDGISGLAFSPDGNLLVSASY--DGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFS-PNGKYLLSASL 354 (456)
T ss_pred eEEEeee-ccCCceEEEEECCCCCEEEEcCC--CccEEEEECCCCceeeeecccCCCCCCceeEEEEC-CCCcEEEEecC
Confidence 344443 44446778888887555544432 335666666555422 222222111 43 34444 45555566555
Q ss_pred CCeEEEEeCC
Q psy6572 717 EDYIAVSDLN 726 (1416)
Q Consensus 717 ~~~I~~~~ld 726 (1416)
.+.|...++.
T Consensus 355 d~~~~~w~l~ 364 (456)
T KOG0266|consen 355 DRTLKLWDLR 364 (456)
T ss_pred CCeEEEEEcc
Confidence 5567666665
No 170
>cd00053 EGF Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.
Probab=81.63 E-value=1.2 Score=33.67 Aligned_cols=26 Identities=27% Similarity=0.751 Sum_probs=20.2
Q ss_pred CCCcc--cccccccccCCCCCeEEecCCCceec
Q psy6572 1293 FRTCS--QICIEKKISNTERTFSCHCAEGYHMV 1323 (1416)
Q Consensus 1293 ~~~Cs--q~C~n~~~~n~~gs~~C~C~~gy~~~ 1323 (1416)
...|+ .+|+++. ++|+|.|..||...
T Consensus 5 ~~~C~~~~~C~~~~-----~~~~C~C~~g~~g~ 32 (36)
T cd00053 5 SNPCSNGGTCVNTP-----GSYRCVCPPGYTGD 32 (36)
T ss_pred CCCCCCCCEEecCC-----CCeEeECCCCCccc
Confidence 34566 5787666 99999999999754
No 171
>KOG4328|consensus
Probab=81.56 E-value=34 Score=41.54 Aligned_cols=147 Identities=12% Similarity=0.104 Sum_probs=76.8
Q ss_pred ccccceeeeeecCCC-eEEEeeccCCCccEEEEecCC---CC-eE-Eee--cCCCceEEEEccCCcEEEeeCCCCeEEEe
Q psy6572 565 NQTNAVGLDFDWVDN-CLYWSDVTMHGSSIRRSCNNS---QP-EL-LFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVA 636 (1416)
Q Consensus 565 ~l~~~~~l~~D~~~~-~LYwtD~~~~~~~I~r~~l~s---~~-~~-l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~ 636 (1416)
..+.+..|+|++..+ +|..+-.. .+.|...++.+ .. .+ +.. ...+.+|.+-+.+-..+++-+..++|...
T Consensus 185 ~~~Rit~l~fHPt~~~~lva~GdK--~G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~F~P~n~s~i~ssSyDGtiR~~ 262 (498)
T KOG4328|consen 185 TDRRITSLAFHPTENRKLVAVGDK--GGQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLKFSPANTSQIYSSSYDGTIRLQ 262 (498)
T ss_pred cccceEEEEecccCcceEEEEccC--CCcEEEEecCCCCCccCceEEeccCCccccceEecCCChhheeeeccCceeeee
Confidence 345677889998777 45444333 35787777741 11 11 111 23345555555554444444445555555
Q ss_pred ecCCCceEEEEcC--CCCCcceeeecCCcceEEEe-eCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEe
Q psy6572 637 KLDGRFRKVLINK--GLQEPRGIALNPAYGYMYWT-DWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWG 713 (1416)
Q Consensus 637 ~ldG~~~~vLi~~--~l~~P~gIavDp~~g~LYWt-D~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWt 713 (1416)
++.+...++++.. .-.+-.++-+....+.+|+. +|| .-.+.-..++|+....+..... ..++|++.+....++.+
T Consensus 263 D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G-~f~~iD~R~~~s~~~~~~lh~k-KI~sv~~NP~~p~~laT 340 (498)
T KOG4328|consen 263 DFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVG-NFNVIDLRTDGSEYENLRLHKK-KITSVALNPVCPWFLAT 340 (498)
T ss_pred eecchhhHHHhhcCccceeeeeccccCCCccEEEeeccc-ceEEEEeecCCccchhhhhhhc-ccceeecCCCCchheee
Confidence 5555443333332 12233445555555655554 466 3355556677774433322121 56788888866665555
Q ss_pred cC
Q psy6572 714 DA 715 (1416)
Q Consensus 714 D~ 715 (1416)
-.
T Consensus 341 ~s 342 (498)
T KOG4328|consen 341 AS 342 (498)
T ss_pred cc
Confidence 43
No 172
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity. Mammals have 3 distinct paraoxonase types, termed PON1-3. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum; the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)-associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL). Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1, 2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo. This family consists of arylesterases (also known as serum paraoxonases; EC 3.1.1.2). These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood.
They confer resistance to organophosphate toxicity. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation; GO: 0004064 arylesterase activity
Probab=81.27 E-value=3 Score=39.53 Aligned_cols=36 Identities=17% Similarity=0.188 Sum_probs=30.9
Q ss_pred EEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecC
Q psy6572 604 LLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLD 639 (1416)
Q Consensus 604 ~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ld 639 (1416)
++++ +..|.||++++.++.||+++...+.|.+....
T Consensus 48 ~va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~~ 84 (86)
T PF01731_consen 48 VVASGFSFANGIAISPDKKYLYVASSLAHSIHVYKRH 84 (86)
T ss_pred EeeccCCCCceEEEcCCCCEEEEEeccCCeEEEEEec
Confidence 3444 88999999999999999999999999887653
No 173
>KOG0650|consensus
Probab=81.15 E-value=25 Score=43.85 Aligned_cols=71 Identities=11% Similarity=0.231 Sum_probs=42.2
Q ss_pred CCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCC
Q psy6572 652 QEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLN 726 (1416)
Q Consensus 652 ~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ld 726 (1416)
..|..++++|..-+||++... .|...+|--.....-+.++..|-.+|+|++..+.|+... ..++|-.++++
T Consensus 567 G~vq~v~FHPs~p~lfVaTq~---~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs-~d~k~~WfDld 637 (733)
T KOG0650|consen 567 GLVQRVKFHPSKPYLFVATQR---SVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGS-YDKKMCWFDLD 637 (733)
T ss_pred CceeEEEecCCCceEEEEecc---ceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEec-CCCeeEEEEcc
Confidence 457788999999999999743 333334332111111124567788888887666665543 34455555554
No 174
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=81.00 E-value=48 Score=42.35 Aligned_cols=100 Identities=12% Similarity=0.117 Sum_probs=52.4
Q ss_pred eeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCC-eEEEEeCCCCceEEEE
Q psy6572 656 GIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHED-YIAVSDLNGENIKIIV 734 (1416)
Q Consensus 656 gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~-~I~~~~ldG~~r~~v~ 734 (1416)
.+++||.+++|||.--...+ ..|..| ...++..-.=||||..+++|-|.=.... -+ -++|.....+|+
T Consensus 238 ~~s~D~~~~lvy~~tGnp~p------~~~~~r---~gdnl~~~s~vAld~~TG~~~W~~Q~~~~D~--wD~d~~~~p~l~ 306 (527)
T TIGR03075 238 TGSYDPETNLIYFGTGNPSP------WNSHLR---PGDNLYTSSIVARDPDTGKIKWHYQTTPHDE--WDYDGVNEMILF 306 (527)
T ss_pred ceeEcCCCCeEEEeCCCCCC------CCCCCC---CCCCccceeEEEEccccCCEEEeeeCCCCCC--ccccCCCCcEEE
Confidence 46999999999998732222 334443 1223333445889999999999743221 11 134443333333
Q ss_pred eccCCCCcccccceeEEE-ecCcEEEeecCCCeeEE
Q psy6572 735 SRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIER 769 (1416)
Q Consensus 735 ~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~ 769 (1416)
..... .-..|.-+.. -.+++|+.|..++++.+
T Consensus 307 d~~~~---G~~~~~v~~~~K~G~~~vlDr~tG~~i~ 339 (527)
T TIGR03075 307 DLKKD---GKPRKLLAHADRNGFFYVLDRTNGKLLS 339 (527)
T ss_pred EeccC---CcEEEEEEEeCCCceEEEEECCCCceec
Confidence 21000 0011111111 35778888887776543
No 175
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=80.91 E-value=35 Score=37.97 Aligned_cols=73 Identities=18% Similarity=0.143 Sum_probs=45.0
Q ss_pred CCCeEEEecCCCCeEEEEe-CCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEecccCCCceEEE
Q psy6572 706 ETNELFWGDAHEDYIAVSD-LNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCTSV 781 (1416)
Q Consensus 706 ~~~rLYWtD~~~~~I~~~~-ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~~l 781 (1416)
..++||+.... +.|..++ .+|...-.+..... +...+..+..+++.++.||.... .+.|..++..+|+.+-..
T Consensus 75 ~~~~v~v~~~~-~~l~~~d~~tG~~~W~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~g~l~~~d~~tG~~~w~~ 148 (238)
T PF13360_consen 75 DGGRVYVGTSD-GSLYALDAKTGKVLWSIYLTSS-PPAGVRSSSSPAVDGDRLYVGTS-SGKLVALDPKTGKLLWKY 148 (238)
T ss_dssp ETTEEEEEETT-SEEEEEETTTSCEEEEEEE-SS-CTCSTB--SEEEEETTEEEEEET-CSEEEEEETTTTEEEEEE
T ss_pred cccccccccce-eeeEecccCCcceeeeeccccc-cccccccccCceEecCEEEEEec-cCcEEEEecCCCcEEEEe
Confidence 36788887733 3788888 57776665433221 22224456667777888877775 677888888888654433
No 176
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. The archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=80.76 E-value=5 Score=31.80 Aligned_cols=41 Identities=24% Similarity=0.265 Sum_probs=29.0
Q ss_pred cCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeec
Q psy6572 619 VGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALN 660 (1416)
Q Consensus 619 ~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavD 660 (1416)
.++.||+++...++|.++++.......-+.. ...|++|+++
T Consensus 2 d~~~lyv~~~~~~~v~~id~~~~~~~~~i~v-g~~P~~i~~~ 42 (42)
T TIGR02276 2 DGTKLYVTNSGSNTVSVIDTATNKVIATIPV-GGYPFGVAVS 42 (42)
T ss_pred CCCEEEEEeCCCCEEEEEECCCCeEEEEEEC-CCCCceEEeC
Confidence 4678999999999999988754332222222 4779999874
No 177
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=80.56 E-value=1.2e+02 Score=38.48 Aligned_cols=30 Identities=23% Similarity=0.148 Sum_probs=21.2
Q ss_pred eEEEecCcEEEeecCCCeeEEecccCCCceE
Q psy6572 749 ALAVFEDHLFWTDWEMKSIERCDKYTGKNCT 779 (1416)
Q Consensus 749 ~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~ 779 (1416)
.+++.++.||+.+ ..+.|+.+++.+|+.+-
T Consensus 401 ~~~~~g~~v~~g~-~dG~l~ald~~tG~~lW 430 (488)
T cd00216 401 SLATAGNLVFAGA-ADGYFRAFDATTGKELW 430 (488)
T ss_pred ceEecCCeEEEEC-CCCeEEEEECCCCceee
Confidence 3556778888876 46778888888886543
No 178
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=80.01 E-value=16 Score=40.02 Aligned_cols=69 Identities=19% Similarity=0.110 Sum_probs=46.6
Q ss_pred EEEccCCcEEEeeCCCCeEEEeecCCCc-eEEEEc-----------CCCCCcceeeecCCcceEEEeeCCCCceEEEEec
Q psy6572 615 TVDWVGRNLYWCDKGLDTIEVAKLDGRF-RKVLIN-----------KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM 682 (1416)
Q Consensus 615 AvD~~~~~LYwtD~~~~~I~v~~ldG~~-~~vLi~-----------~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m 682 (1416)
.+.|+.|.||---+.+.+|.|...+... ...+-. ....-++|||.+|..+++|.|-. .-|.+.-+.+
T Consensus 180 ELE~VdG~lyANVw~t~~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK-~wp~lfEVk~ 258 (262)
T COG3823 180 ELEWVDGELYANVWQTTRIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGK-LWPLLFEVKL 258 (262)
T ss_pred ceeeeccEEEEeeeeecceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecC-cCceeEEEEe
Confidence 4568889888777777888888876443 332221 23456899999999999999952 2245555544
Q ss_pred CC
Q psy6572 683 DG 684 (1416)
Q Consensus 683 DG 684 (1416)
++
T Consensus 259 ~~ 260 (262)
T COG3823 259 DE 260 (262)
T ss_pred cC
Confidence 43
No 179
>KOG0318|consensus
Probab=79.61 E-value=1.7e+02 Score=36.43 Aligned_cols=125 Identities=10% Similarity=0.004 Sum_probs=74.1
Q ss_pred EEecEEEEEEecCCcc---eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccC
Q psy6572 544 FTNKYYIREVTQAGVM---TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVG 620 (1416)
Q Consensus 544 ~s~~~~I~~i~l~g~~---~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~ 620 (1416)
.+-...|+++++.+.. ...+.--.+|.+|++...+..+.++-.. .|..+.-.+....+.-.-.|.++||.+.+
T Consensus 380 ~g~Dd~l~~~~~~~~~~t~~~~~~lg~QP~~lav~~d~~~avv~~~~----~iv~l~~~~~~~~~~~~y~~s~vAv~~~~ 455 (603)
T KOG0318|consen 380 IGWDDTLRVISLKDNGYTKSEVVKLGSQPKGLAVLSDGGTAVVACIS----DIVLLQDQTKVSSIPIGYESSAVAVSPDG 455 (603)
T ss_pred EecCCeEEEEecccCcccccceeecCCCceeEEEcCCCCEEEEEecC----cEEEEecCCcceeeccccccceEEEcCCC
Confidence 3445567777775543 2223444567889888766666665443 22222211111111114568899998755
Q ss_pred CcEEEeeCCCCeEEEeecCCCce-EE-EEcCCCCCcceeeecCCcceEEEeeCCC
Q psy6572 621 RNLYWCDKGLDTIEVAKLDGRFR-KV-LINKGLQEPRGIALNPAYGYMYWTDWGQ 673 (1416)
Q Consensus 621 ~~LYwtD~~~~~I~v~~ldG~~~-~v-Li~~~l~~P~gIavDp~~g~LYWtD~g~ 673 (1416)
..+ ..-...++|.+..|.|..+ .. +.......+..|++.|...||-.+|...
T Consensus 456 ~~v-aVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~r 509 (603)
T KOG0318|consen 456 SEV-AVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDASR 509 (603)
T ss_pred CEE-EEecccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEeccCC
Confidence 443 3334456799999998552 22 3334456688999999988888888654
No 180
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=79.61 E-value=53 Score=42.00 Aligned_cols=51 Identities=12% Similarity=0.051 Sum_probs=29.1
Q ss_pred eeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEEeeC
Q psy6572 571 GLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYWCDK 628 (1416)
Q Consensus 571 ~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYwtD~ 628 (1416)
.++||.+.+.|||--.+.. . +.......- ++-.-.=||||..+|+|-|.-.
T Consensus 238 ~~s~D~~~~lvy~~tGnp~--p-~~~~~r~gd----nl~~~s~vAld~~TG~~~W~~Q 288 (527)
T TIGR03075 238 TGSYDPETNLIYFGTGNPS--P-WNSHLRPGD----NLYTSSIVARDPDTGKIKWHYQ 288 (527)
T ss_pred ceeEcCCCCeEEEeCCCCC--C-CCCCCCCCC----CccceeEEEEccccCCEEEeee
Confidence 4689999999998765421 1 000000000 0222334789999999999744
No 181
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=79.48 E-value=1.6 Score=33.54 Aligned_cols=24 Identities=25% Similarity=0.877 Sum_probs=18.8
Q ss_pred CCc--ccccccccccCCCCCeEEecCCCcee
Q psy6572 1294 RTC--SQICIEKKISNTERTFSCHCAEGYHM 1322 (1416)
Q Consensus 1294 ~~C--sq~C~n~~~~n~~gs~~C~C~~gy~~ 1322 (1416)
.+| .++|+++. |+|+|.|.+||..
T Consensus 9 ~~C~~~~~C~~~~-----~~~~C~C~~g~~g 34 (38)
T cd00054 9 NPCQNGGTCVNTV-----GSYRCSCPPGYTG 34 (38)
T ss_pred CCcCCCCEeECCC-----CCeEeECCCCCcC
Confidence 356 45787666 9999999999974
No 182
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=79.22 E-value=70 Score=39.10 Aligned_cols=59 Identities=17% Similarity=0.166 Sum_probs=37.5
Q ss_pred CCeEEEecCCCCeEEEEeC-CCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEecccCCC
Q psy6572 707 TNELFWGDAHEDYIAVSDL-NGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGK 776 (1416)
Q Consensus 707 ~~rLYWtD~~~~~I~~~~l-dG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~ 776 (1416)
.++||++.. .+.+..+++ +|.. +-... +..+..+++.+++||+... .+.|+.++..+|+
T Consensus 256 ~~~vy~~~~-~g~l~ald~~tG~~---~W~~~------~~~~~~~~~~~~~vy~~~~-~g~l~ald~~tG~ 315 (394)
T PRK11138 256 GGVVYALAY-NGNLVALDLRSGQI---VWKRE------YGSVNDFAVDGGRIYLVDQ-NDRVYALDTRGGV 315 (394)
T ss_pred CCEEEEEEc-CCeEEEEECCCCCE---EEeec------CCCccCcEEECCEEEEEcC-CCeEEEEECCCCc
Confidence 578887764 456777775 3432 22211 3334456778999999874 4678888877775
No 183
>PTZ00420 coronin; Provisional
Probab=78.19 E-value=2.2e+02 Score=36.82 Aligned_cols=159 Identities=8% Similarity=-0.003 Sum_probs=76.2
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEE-ee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCc-
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELL-FP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF- 642 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l-~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~- 642 (1416)
...+..|+|++....|+.+-... +.|...++.+...+. +. ...+.+|++++.+. ++.+-...++|.+.++....
T Consensus 125 ~~~V~sVaf~P~g~~iLaSgS~D--gtIrIWDl~tg~~~~~i~~~~~V~SlswspdG~-lLat~s~D~~IrIwD~Rsg~~ 201 (568)
T PTZ00420 125 KKKISIIDWNPMNYYIMCSSGFD--SFVNIWDIENEKRAFQINMPKKLSSLKWNIKGN-LLSGTCVGKHMHIIDPRKQEI 201 (568)
T ss_pred CCcEEEEEECCCCCeEEEEEeCC--CeEEEEECCCCcEEEEEecCCcEEEEEECCCCC-EEEEEecCCEEEEEECCCCcE
Confidence 34567889988777666554432 367777776322211 11 34466778776544 55555556788888876443
Q ss_pred eEEEEcCCCC-Ccceeee---cCCcceEEEeeCCCC--ceEEEEecCCCCC-EEEeecCCCCCee--EEeecCCCeEEEe
Q psy6572 643 RKVLINKGLQ-EPRGIAL---NPAYGYMYWTDWGQN--AHIGKAKMDGSNP-KVIISKNLSWPNA--LTISYETNELFWG 713 (1416)
Q Consensus 643 ~~vLi~~~l~-~P~gIav---Dp~~g~LYWtD~g~~--~~I~ra~mDGs~r-~vlv~~~l~~P~g--LaiD~~~~rLYWt 713 (1416)
...+...... .-+.+.+ .+..++|.-+-.... ..|..-++..... ...+... ..+.. ...|...+.||.+
T Consensus 202 i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~~~~ld-~~~~~L~p~~D~~tg~l~ls 280 (568)
T PTZ00420 202 ASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTTSALVTMSID-NASAPLIPHYDESTGLIYLI 280 (568)
T ss_pred EEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCCCceEEEEec-CCccceEEeeeCCCCCEEEE
Confidence 2222211110 0011111 123334433332211 1233333332111 1111100 11222 2456667888887
Q ss_pred cCCCCeEEEEeCCCC
Q psy6572 714 DAHEDYIAVSDLNGE 728 (1416)
Q Consensus 714 D~~~~~I~~~~ldG~ 728 (1416)
-.+.+.|...++...
T Consensus 281 GkGD~tIr~~e~~~~ 295 (568)
T PTZ00420 281 GKGDGNCRYYQHSLG 295 (568)
T ss_pred EECCCeEEEEEccCC
Confidence 777778888877543
No 184
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=78.15 E-value=1.8 Score=33.12 Aligned_cols=27 Identities=30% Similarity=0.751 Sum_probs=20.2
Q ss_pred CCCccceeecCCCCccccCCCCeeecCCC
Q psy6572 812 GGCQGLCLLKPNGHRQCACPDNFILESDG 840 (1416)
Q Consensus 812 ggCshlCl~~p~~~~~C~Cp~g~~L~~d~ 840 (1416)
..|...|-+. ....|.||+||+|+.+.
T Consensus 6 t~CpA~CDpn--~~~~C~CPeGyIlde~~ 32 (34)
T PF09064_consen 6 TECPADCDPN--SPGQCFCPEGYILDEGS 32 (34)
T ss_pred ccCCCccCCC--CCCceeCCCceEecCCc
Confidence 4577777764 35699999999998653
No 185
>KOG0292|consensus
Probab=78.04 E-value=1.7e+02 Score=38.68 Aligned_cols=169 Identities=14% Similarity=0.077 Sum_probs=89.9
Q ss_pred eeeeeecCCCeEEEeeccCCCccEEEEecCCCCeE--Eee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEE
Q psy6572 570 VGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPEL--LFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVL 646 (1416)
Q Consensus 570 ~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~--l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vL 646 (1416)
...+|++ +--|+++-.....-++.|++-.+..++ ... ..++.++-+++ ..+|..+.+..+.|.|-+|+-+.....
T Consensus 210 NwaAfhp-TlpliVSG~DDRqVKlWrmnetKaWEvDtcrgH~nnVssvlfhp-~q~lIlSnsEDksirVwDm~kRt~v~t 287 (1202)
T KOG0292|consen 210 NWAAFHP-TLPLIVSGADDRQVKLWRMNETKAWEVDTCRGHYNNVSSVLFHP-HQDLILSNSEDKSIRVWDMTKRTSVQT 287 (1202)
T ss_pred ceEEecC-CcceEEecCCcceeeEEEeccccceeehhhhcccCCcceEEecC-ccceeEecCCCccEEEEecccccceee
Confidence 3445554 223455443322123444433333332 223 77888888886 556777888999999999986543222
Q ss_pred EcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCC
Q psy6572 647 INKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLN 726 (1416)
Q Consensus 647 i~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ld 726 (1416)
....-.+-.-||++|. ..||-+-- ...+++..++- .+-+.+|. .+.||++. ...|.+.++.
T Consensus 288 frrendRFW~laahP~-lNLfAAgH--DsGm~VFkleR------------Erpa~~v~--~n~LfYvk--d~~i~~~d~~ 348 (1202)
T KOG0292|consen 288 FRRENDRFWILAAHPE-LNLFAAGH--DSGMIVFKLER------------ERPAYAVN--GNGLFYVK--DRFIRSYDLR 348 (1202)
T ss_pred eeccCCeEEEEEecCC-cceeeeec--CCceEEEEEcc------------cCceEEEc--CCEEEEEc--cceEEeeecc
Confidence 2233344455677665 45554421 11232222221 22345553 45566554 4678888876
Q ss_pred CCceEEEEeccCCCCcccccceeEEE--ecCcEEEe
Q psy6572 727 GENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWT 760 (1416)
Q Consensus 727 G~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwt 760 (1416)
...-..+.+-. .++..+..|++|.+ .++.+.++
T Consensus 349 t~~d~~v~~lr-~~g~~~~~~~smsYNpae~~vlic 383 (1202)
T KOG0292|consen 349 TQKDTAVASLR-RPGTLWQPPRSLSYNPAENAVLIC 383 (1202)
T ss_pred ccccceeEecc-CCCcccCCcceeeeccccCeEEEE
Confidence 64444444432 23334556777777 45666665
No 186
>KOG4378|consensus
Probab=77.59 E-value=1.1e+02 Score=37.83 Aligned_cols=182 Identities=11% Similarity=0.086 Sum_probs=95.4
Q ss_pred CCeEEEeeccCCCccEEEEecCCCCeEEee-----cCCCceEEEEccCCcEEEeeCC-CCeEEEeecCCCceEEEE-cCC
Q psy6572 578 DNCLYWSDVTMHGSSIRRSCNNSQPELLFP-----ATSPDGLTVDWVGRNLYWCDKG-LDTIEVAKLDGRFRKVLI-NKG 650 (1416)
Q Consensus 578 ~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~-----l~~p~gLAvD~~~~~LYwtD~~-~~~I~v~~ldG~~~~vLi-~~~ 650 (1416)
...+|....+.. ..|...++. .++++. ...+.++.+.|... |++... .+-|.+..+....+.+-+ ...
T Consensus 89 s~S~y~~sgG~~-~~Vkiwdl~--~kl~hr~lkdh~stvt~v~YN~~De--yiAsvs~gGdiiih~~~t~~~tt~f~~~s 163 (673)
T KOG4378|consen 89 SQSLYEISGGQS-GCVKIWDLR--AKLIHRFLKDHQSTVTYVDYNNTDE--YIASVSDGGDIIIHGTKTKQKTTTFTIDS 163 (673)
T ss_pred hcceeeeccCcC-ceeeehhhH--HHHHhhhccCCcceeEEEEecCCcc--eeEEeccCCcEEEEecccCccccceecCC
Confidence 444777666542 233333333 111111 33445555555443 444333 245666665544443322 233
Q ss_pred CCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCC-eeEEeecCCCeEEEecCCCCeEEEEeCCCCc
Q psy6572 651 LQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWP-NALTISYETNELFWGDAHEDYIAVSDLNGEN 729 (1416)
Q Consensus 651 l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P-~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~ 729 (1416)
.+..|-|-..|.+++|..+- +....+..-+..|.....-..+.-..| .||.+.+.+..|+++-....+|...+.....
T Consensus 164 gqsvRll~ys~skr~lL~~a-sd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~ 242 (673)
T KOG4378|consen 164 GQSVRLLRYSPSKRFLLSIA-SDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQA 242 (673)
T ss_pred CCeEEEeecccccceeeEee-ccCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEecccceEEEeeccccc
Confidence 44446666777777776654 233345555555554433333333344 6899999889999887777788777754221
Q ss_pred eEEEEeccCCCCccccccee-EEEe-cCcEEEeecCCCeeEEeccc
Q psy6572 730 IKIIVSRRMDPTINLHHVFA-LAVF-EDHLFWTDWEMKSIERCDKY 773 (1416)
Q Consensus 730 r~~v~~~~~~p~~~l~~P~~-lav~-~d~LYwtD~~~~~I~~~nk~ 773 (1416)
...-+ ...||++ |++. .+++..+-...++|+..+..
T Consensus 243 s~~~l--------~y~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R 280 (673)
T KOG4378|consen 243 STDRL--------TYSHPLSTVAFSECGTYLCAGNSKGELIAYDMR 280 (673)
T ss_pred cccee--------eecCCcceeeecCCceEEEeecCCceEEEEecc
Confidence 11111 1456764 5553 35555666666677766543
No 187
>PHA02713 hypothetical protein; Provisional
Probab=77.49 E-value=1.2e+02 Score=38.97 Aligned_cols=179 Identities=9% Similarity=0.023 Sum_probs=87.8
Q ss_pred CCCeEEEeeccC-C---CccEEEEecC-CCCeEEeecCCCc-eEEEEccCCcEEEeeCC-----CCeEEEeecCCCceEE
Q psy6572 577 VDNCLYWSDVTM-H---GSSIRRSCNN-SQPELLFPATSPD-GLTVDWVGRNLYWCDKG-----LDTIEVAKLDGRFRKV 645 (1416)
Q Consensus 577 ~~~~LYwtD~~~-~---~~~I~r~~l~-s~~~~l~~l~~p~-gLAvD~~~~~LYwtD~~-----~~~I~v~~ldG~~~~v 645 (1416)
.++.||++-... . ...+.++++. .....+.++..|+ +.++-...++||..-.. ...+++.++....-+.
T Consensus 302 l~~~IYviGG~~~~~~~~~~v~~Yd~~~n~W~~~~~m~~~R~~~~~~~~~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~ 381 (557)
T PHA02713 302 VDNEIIIAGGYNFNNPSLNKVYKINIENKIHVELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKM 381 (557)
T ss_pred ECCEEEEEcCCCCCCCccceEEEEECCCCeEeeCCCCcchhhceeEEEECCEEEEECCcCCCCCCceEEEEECCCCeEEE
Confidence 367888875431 1 1245666655 3344444433333 22333357999997653 2457888776443333
Q ss_pred EEcCCCCCcce-eeecCCcceEEEeeCCC----------------------CceEEEEecCCCCCEEEeecCCCCC---e
Q psy6572 646 LINKGLQEPRG-IALNPAYGYMYWTDWGQ----------------------NAHIGKAKMDGSNPKVIISKNLSWP---N 699 (1416)
Q Consensus 646 Li~~~l~~P~g-IavDp~~g~LYWtD~g~----------------------~~~I~ra~mDGs~r~vlv~~~l~~P---~ 699 (1416)
+ ..+..|+. .++-...|+||+.--.. ...+++.+..-..=. .+. .+..| .
T Consensus 382 ~--~~mp~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~-~v~-~m~~~r~~~ 457 (557)
T PHA02713 382 L--PDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWE-TLP-NFWTGTIRP 457 (557)
T ss_pred C--CCCCcccccccEEEECCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCCeEe-ecC-CCCcccccC
Confidence 2 23333331 22222468999874111 123444443322111 111 12222 2
Q ss_pred eEEeecCCCeEEEecCCC------CeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCC
Q psy6572 700 ALTISYETNELFWGDAHE------DYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMK 765 (1416)
Q Consensus 700 gLaiD~~~~rLYWtD~~~------~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~ 765 (1416)
++++ .+++||++-... ..|++.+....+.-+.+. .+ | .-..-.++++.+++||++-...+
T Consensus 458 ~~~~--~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~-~m-~--~~r~~~~~~~~~~~iyv~Gg~~~ 523 (557)
T PHA02713 458 GVVS--HKDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELIT-TT-E--SRLSALHTILHDNTIMMLHCYES 523 (557)
T ss_pred cEEE--ECCEEEEEeCCCCCCccceeEEEecCCCCCCeeEcc-cc-C--cccccceeEEECCEEEEEeeecc
Confidence 4444 479999985321 246677766422222222 11 0 01223688899999999765443
No 188
>KOG1274|consensus
Probab=77.34 E-value=1.9e+02 Score=38.28 Aligned_cols=147 Identities=14% Similarity=0.111 Sum_probs=86.0
Q ss_pred ceeeeeecCCCeEEEeeccCCCccEEEEecC---CCCeEEe-ecCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceE
Q psy6572 569 AVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN---SQPELLF-PATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRK 644 (1416)
Q Consensus 569 ~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~---s~~~~l~-~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~ 644 (1416)
...|.||+.+..||..+... .|+.+.-+ ..+++|- ....+.+||.+ .+.|.+-+..+.|.|+.++-..-.
T Consensus 16 ~t~i~~d~~gefi~tcgsdg---~ir~~~~~sd~e~P~ti~~~g~~v~~ia~~---s~~f~~~s~~~tv~~y~fps~~~~ 89 (933)
T KOG1274|consen 16 LTLICYDPDGEFICTCGSDG---DIRKWKTNSDEEEPETIDISGELVSSIACY---SNHFLTGSEQNTVLRYKFPSGEED 89 (933)
T ss_pred eEEEEEcCCCCEEEEecCCC---ceEEeecCCcccCCchhhccCceeEEEeec---ccceEEeeccceEEEeeCCCCCcc
Confidence 35678999999888887763 46665544 2344444 24445666664 347778888899999998765555
Q ss_pred EEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecC-CCCCEEEeecCCCCC-eeEEeecCCCeEEEecCCCCeEEE
Q psy6572 645 VLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMD-GSNPKVIISKNLSWP-NALTISYETNELFWGDAHEDYIAV 722 (1416)
Q Consensus 645 vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mD-Gs~r~vlv~~~l~~P-~gLaiD~~~~rLYWtD~~~~~I~~ 722 (1416)
.|+..---..+.+||+-...++-.. +....|...+|+ ++..+++.. +..| .+|.+|+....|-++ ...+.|..
T Consensus 90 ~iL~Rftlp~r~~~v~g~g~~iaag--sdD~~vK~~~~~D~s~~~~lrg--h~apVl~l~~~p~~~fLAvs-s~dG~v~i 164 (933)
T KOG1274|consen 90 TILARFTLPIRDLAVSGSGKMIAAG--SDDTAVKLLNLDDSSQEKVLRG--HDAPVLQLSYDPKGNFLAVS-SCDGKVQI 164 (933)
T ss_pred ceeeeeeccceEEEEecCCcEEEee--cCceeEEEEeccccchheeecc--cCCceeeeeEcCCCCEEEEE-ecCceEEE
Confidence 4543333335778887664433322 223356666654 444444432 3233 467777644444333 34566666
Q ss_pred EeCC
Q psy6572 723 SDLN 726 (1416)
Q Consensus 723 ~~ld 726 (1416)
.+++
T Consensus 165 w~~~ 168 (933)
T KOG1274|consen 165 WDLQ 168 (933)
T ss_pred EEcc
Confidence 6654
No 189
>KOG0310|consensus
Probab=76.10 E-value=2e+02 Score=35.42 Aligned_cols=171 Identities=11% Similarity=0.019 Sum_probs=96.0
Q ss_pred ceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee----cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceE
Q psy6572 569 AVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP----ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRK 644 (1416)
Q Consensus 569 ~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~----l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~ 644 (1416)
+.-+-|.+..++++.+-... ..+....+++.. +... -..++.+++-+.+++|++|-+..++|...++.-.. .
T Consensus 113 v~~~~f~~~d~t~l~s~sDd--~v~k~~d~s~a~-v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~-~ 188 (487)
T KOG0310|consen 113 VHVTKFSPQDNTMLVSGSDD--KVVKYWDLSTAY-VQAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLT-S 188 (487)
T ss_pred eeEEEecccCCeEEEecCCC--ceEEEEEcCCcE-EEEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccCC-c
Confidence 34456677777777765542 345555555222 2222 56788899999999999999999999888776553 2
Q ss_pred EEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee-cCCCCCeeEEeecCCCeEEEecCCCCeEEEE
Q psy6572 645 VLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS-KNLSWPNALTISYETNELFWGDAHEDYIAVS 723 (1416)
Q Consensus 645 vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~-~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~ 723 (1416)
.++.-+-..|..-++-...|-++.+-.|. .|.+-+|-+..+.+-.. ......+.|.+-..+.|||-+-.. +++-..
T Consensus 189 ~v~elnhg~pVe~vl~lpsgs~iasAgGn--~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s~~~rLlS~sLD-~~VKVf 265 (487)
T KOG0310|consen 189 RVVELNHGCPVESVLALPSGSLIASAGGN--SVKVWDLTTGGQLLTSMFNHNKTVTCLRLASDSTRLLSGSLD-RHVKVF 265 (487)
T ss_pred eeEEecCCCceeeEEEcCCCCEEEEcCCC--eEEEEEecCCceehhhhhcccceEEEEEeecCCceEeecccc-cceEEE
Confidence 33333444566555555667777776554 46666666443322221 223334566665444566543221 222222
Q ss_pred eCCCCceEEEEeccCCCCcccccc-eeEEEec
Q psy6572 724 DLNGENIKIIVSRRMDPTINLHHV-FALAVFE 754 (1416)
Q Consensus 724 ~ldG~~r~~v~~~~~~p~~~l~~P-~~lav~~ 754 (1416)
+ -++.+++..-. .+.| .+|+++.
T Consensus 266 d--~t~~Kvv~s~~------~~~pvLsiavs~ 289 (487)
T KOG0310|consen 266 D--TTNYKVVHSWK------YPGPVLSIAVSP 289 (487)
T ss_pred E--ccceEEEEeee------cccceeeEEecC
Confidence 2 23444444422 4444 5777754
No 190
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=75.99 E-value=2.7 Score=31.96 Aligned_cols=21 Identities=52% Similarity=1.298 Sum_probs=18.6
Q ss_pred ccc--cceecC-CceEEeeCCCce
Q psy6572 464 CAH--ECIDLK-IGYKCACRKGYQ 484 (1416)
Q Consensus 464 Cs~--~C~nt~-~gy~C~C~~Gy~ 484 (1416)
|.+ +|+++. .+|+|.|++||.
T Consensus 6 C~n~g~C~~~~~~~y~C~C~~G~~ 29 (32)
T PF00008_consen 6 CQNGGTCIDLPGGGYTCECPPGYT 29 (32)
T ss_dssp STTTEEEEEESTSEEEEEEBTTEE
T ss_pred CCCCeEEEeCCCCCEEeECCCCCc
Confidence 665 799999 999999999995
No 191
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=75.86 E-value=1.9e+02 Score=34.93 Aligned_cols=59 Identities=20% Similarity=0.245 Sum_probs=37.2
Q ss_pred CCeEEEecCCCCeEEEEeC-CCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEecccCCC
Q psy6572 707 TNELFWGDAHEDYIAVSDL-NGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGK 776 (1416)
Q Consensus 707 ~~rLYWtD~~~~~I~~~~l-dG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~ 776 (1416)
+++||.+.. .+.|..+++ +|.. +-... +..+.++++.+++||+.. ..+.|+.+++.+|+
T Consensus 241 ~~~vy~~~~-~g~l~a~d~~tG~~---~W~~~------~~~~~~p~~~~~~vyv~~-~~G~l~~~d~~tG~ 300 (377)
T TIGR03300 241 GGQVYAVSY-QGRVAALDLRSGRV---LWKRD------ASSYQGPAVDDNRLYVTD-ADGVVVALDRRSGS 300 (377)
T ss_pred CCEEEEEEc-CCEEEEEECCCCcE---EEeec------cCCccCceEeCCEEEEEC-CCCeEEEEECCCCc
Confidence 578888764 456777776 3432 22211 223445667889999886 45778888887775
No 192
>KOG4441|consensus
Probab=75.56 E-value=60 Score=41.96 Aligned_cols=178 Identities=11% Similarity=0.038 Sum_probs=104.1
Q ss_pred CCCeEEEeeccC-C---CccEEEEecC-CCCeEEeecCCC-ceEEEEccCCcEEEeeCCC-----CeEEEeecCCCceEE
Q psy6572 577 VDNCLYWSDVTM-H---GSSIRRSCNN-SQPELLFPATSP-DGLTVDWVGRNLYWCDKGL-----DTIEVAKLDGRFRKV 645 (1416)
Q Consensus 577 ~~~~LYwtD~~~-~---~~~I~r~~l~-s~~~~l~~l~~p-~gLAvD~~~~~LYwtD~~~-----~~I~v~~ldG~~~~v 645 (1416)
.++.||.+-... + -..+.+++.. ..+..+.++..+ .++++-...|.||..-... ..|++.+.....-++
T Consensus 331 ~~~~lYv~GG~~~~~~~l~~ve~YD~~~~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg~~~l~svE~YDp~~~~W~~ 410 (571)
T KOG4441|consen 331 LNGKLYVVGGYDSGSDRLSSVERYDPRTNQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDGEKSLNSVECYDPVTNKWTP 410 (571)
T ss_pred ECCEEEEEccccCCCcccceEEEecCCCCceeccCCccCccccceeEEECCEEEEEeccccccccccEEEecCCCCcccc
Confidence 466888875544 1 1345666666 445555554433 3456666899999986543 468888877665444
Q ss_pred EEcCCCCCcceeeecCCcceEEEeeCCC-----CceEEEEecCCCCCEEEeecC-CCCCeeEEeecCCCeEEEecCC---
Q psy6572 646 LINKGLQEPRGIALNPAYGYMYWTDWGQ-----NAHIGKAKMDGSNPKVIISKN-LSWPNALTISYETNELFWGDAH--- 716 (1416)
Q Consensus 646 Li~~~l~~P~gIavDp~~g~LYWtD~g~-----~~~I~ra~mDGs~r~vlv~~~-l~~P~gLaiD~~~~rLYWtD~~--- 716 (1416)
+..-. ..-.+.++-..+|+||.+--.. ...+++.+.....=+.+..-. -..-.|+++ .+++||.+-..
T Consensus 411 va~m~-~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g~a~--~~~~iYvvGG~~~~ 487 (571)
T KOG4441|consen 411 VAPML-TRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFGVAV--LNGKIYVVGGFDGT 487 (571)
T ss_pred cCCCC-cceeeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCceeecCCcccccccceEEE--ECCEEEEECCccCC
Confidence 43211 1223556666789999985211 134555555443322222211 112234555 68999998542
Q ss_pred --CCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeec
Q psy6572 717 --EDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDW 762 (1416)
Q Consensus 717 --~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~ 762 (1416)
...|++.+.....-..+..-. .-....++++.++.||..-.
T Consensus 488 ~~~~~VE~ydp~~~~W~~v~~m~-----~~rs~~g~~~~~~~ly~vGG 530 (571)
T KOG4441|consen 488 SALSSVERYDPETNQWTMVAPMT-----SPRSAVGVVVLGGKLYAVGG 530 (571)
T ss_pred CccceEEEEcCCCCceeEcccCc-----cccccccEEEECCEEEEEec
Confidence 345788887766666553211 13345688999999998764
No 193
>KOG0272|consensus
Probab=75.41 E-value=69 Score=38.64 Aligned_cols=117 Identities=15% Similarity=0.062 Sum_probs=64.5
Q ss_pred ecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee---cCCCceEEEEccCCcEEEeeCCCCeEEEeecC
Q psy6572 563 IHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP---ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLD 639 (1416)
Q Consensus 563 ~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~---l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ld 639 (1416)
...+..+..++|++.++.|--+-... +=+..++.+..+++.. -..+.+||+.+ .|.|--|-.....=.|-+|-
T Consensus 258 ~gH~~RVs~VafHPsG~~L~TasfD~---tWRlWD~~tk~ElL~QEGHs~~v~~iaf~~-DGSL~~tGGlD~~~RvWDlR 333 (459)
T KOG0272|consen 258 EGHLARVSRVAFHPSGKFLGTASFDS---TWRLWDLETKSELLLQEGHSKGVFSIAFQP-DGSLAATGGLDSLGRVWDLR 333 (459)
T ss_pred hcchhhheeeeecCCCceeeeccccc---chhhcccccchhhHhhcccccccceeEecC-CCceeeccCccchhheeecc
Confidence 34456788899999887765443332 2222333355555544 55677888875 44444443322222223333
Q ss_pred CCceEEEEcCCCCCcceeeecCCcceEEEeeCCCC-ceEEEEecCC
Q psy6572 640 GRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQN-AHIGKAKMDG 684 (1416)
Q Consensus 640 G~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~-~~I~ra~mDG 684 (1416)
......++..-++...+++.+|. ||..-|-.+.+ -+|+...|.-
T Consensus 334 tgr~im~L~gH~k~I~~V~fsPN-Gy~lATgs~Dnt~kVWDLR~r~ 378 (459)
T KOG0272|consen 334 TGRCIMFLAGHIKEILSVAFSPN-GYHLATGSSDNTCKVWDLRMRS 378 (459)
T ss_pred cCcEEEEecccccceeeEeECCC-ceEEeecCCCCcEEEeeecccc
Confidence 22223333456677788999986 88777764443 4566655543
No 194
>KOG0293|consensus
Probab=75.40 E-value=2e+02 Score=34.90 Aligned_cols=99 Identities=16% Similarity=0.122 Sum_probs=62.1
Q ss_pred EEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEe
Q psy6572 624 YWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTI 703 (1416)
Q Consensus 624 YwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLai 703 (1416)
+++-+..++|...++||....---.........|||-+...+||.... .++|.-.+.....-+-++++. .....+.|
T Consensus 327 ~V~Gs~dr~i~~wdlDgn~~~~W~gvr~~~v~dlait~Dgk~vl~v~~--d~~i~l~~~e~~~dr~lise~-~~its~~i 403 (519)
T KOG0293|consen 327 FVTGSPDRTIIMWDLDGNILGNWEGVRDPKVHDLAITYDGKYVLLVTV--DKKIRLYNREARVDRGLISEE-QPITSFSI 403 (519)
T ss_pred eEecCCCCcEEEecCCcchhhcccccccceeEEEEEcCCCcEEEEEec--ccceeeechhhhhhhcccccc-CceeEEEE
Confidence 677777789999999997533222223344568888888888888763 345665555444333233321 12246677
Q ss_pred ecCCCeEEEecCCCCeEEEEeCC
Q psy6572 704 SYETNELFWGDAHEDYIAVSDLN 726 (1416)
Q Consensus 704 D~~~~rLYWtD~~~~~I~~~~ld 726 (1416)
. .++++..+......|..-++.
T Consensus 404 S-~d~k~~LvnL~~qei~LWDl~ 425 (519)
T KOG0293|consen 404 S-KDGKLALVNLQDQEIHLWDLE 425 (519)
T ss_pred c-CCCcEEEEEcccCeeEEeecc
Confidence 4 467777777777777777665
No 195
>KOG4441|consensus
Probab=75.02 E-value=1.4e+02 Score=38.76 Aligned_cols=138 Identities=9% Similarity=0.010 Sum_probs=82.6
Q ss_pred cCCcEEEeeCCC------CeEEEeecCCCceEEEEcCCCCCcc-eeeecCCcceEEEeeC----CCCceEEEEecCCCCC
Q psy6572 619 VGRNLYWCDKGL------DTIEVAKLDGRFRKVLINKGLQEPR-GIALNPAYGYMYWTDW----GQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 619 ~~~~LYwtD~~~------~~I~v~~ldG~~~~vLi~~~l~~P~-gIavDp~~g~LYWtD~----g~~~~I~ra~mDGs~r 687 (1416)
+++.||.+-... ..+++.+.....-+.+ ..+..+| +++|-...|.||..-- .....|++.+.....=
T Consensus 331 ~~~~lYv~GG~~~~~~~l~~ve~YD~~~~~W~~~--a~M~~~R~~~~v~~l~g~iYavGG~dg~~~l~svE~YDp~~~~W 408 (571)
T KOG4441|consen 331 LNGKLYVVGGYDSGSDRLSSVERYDPRTNQWTPV--APMNTKRSDFGVAVLDGKLYAVGGFDGEKSLNSVECYDPVTNKW 408 (571)
T ss_pred ECCEEEEEccccCCCcccceEEEecCCCCceecc--CCccCccccceeEEECCEEEEEeccccccccccEEEecCCCCcc
Confidence 578999986544 4677777766553332 3455555 4666668899999841 1124577776665443
Q ss_pred EEEeecCCCCCeeEEeecCCCeEEEecC------CCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEee
Q psy6572 688 KVIISKNLSWPNALTISYETNELFWGDA------HEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTD 761 (1416)
Q Consensus 688 ~vlv~~~l~~P~gLaiD~~~~rLYWtD~------~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD 761 (1416)
..+..-.. .-.+.++-..+++||.+-. .+..+++.+.....-+.+..-. .-..-+++++.+++||..-
T Consensus 409 ~~va~m~~-~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M~-----~~R~~~g~a~~~~~iYvvG 482 (571)
T KOG4441|consen 409 TPVAPMLT-RRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPMN-----TRRSGFGVAVLNGKIYVVG 482 (571)
T ss_pred cccCCCCc-ceeeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCceeecCCcc-----cccccceEEEECCEEEEEC
Confidence 33332111 1123333346899999865 3356777776555444443321 2344578999999999976
Q ss_pred cCC
Q psy6572 762 WEM 764 (1416)
Q Consensus 762 ~~~ 764 (1416)
...
T Consensus 483 G~~ 485 (571)
T KOG4441|consen 483 GFD 485 (571)
T ss_pred Ccc
Confidence 543
No 196
>PRK13616 lipoprotein LpqB; Provisional
Probab=74.88 E-value=1.6e+02 Score=38.31 Aligned_cols=160 Identities=12% Similarity=0.099 Sum_probs=80.1
Q ss_pred EEEEEecCCcceEEecccccceeeeeecCCCeEEEeeccC---------CCccEEEEecC-CCCeEEee-cCCCceEEEE
Q psy6572 549 YIREVTQAGVMTIRIHNQTNAVGLDFDWVDNCLYWSDVTM---------HGSSIRRSCNN-SQPELLFP-ATSPDGLTVD 617 (1416)
Q Consensus 549 ~I~~i~l~g~~~~~~~~l~~~~~l~~D~~~~~LYwtD~~~---------~~~~I~r~~l~-s~~~~l~~-l~~p~gLAvD 617 (1416)
.|+.+...+....+..+. ......+++.++.|+++.... ....|+++.++ +.... . -..+..|++-
T Consensus 380 ~Lwv~~~gg~~~~lt~g~-~~t~PsWspDG~~lw~v~dg~~~~~v~~~~~~gql~~~~vd~ge~~~--~~~g~Issl~wS 456 (591)
T PRK13616 380 SLWVGPLGGVAVQVLEGH-SLTRPSWSLDADAVWVVVDGNTVVRVIRDPATGQLARTPVDASAVAS--RVPGPISELQLS 456 (591)
T ss_pred EEEEEeCCCcceeeecCC-CCCCceECCCCCceEEEecCcceEEEeccCCCceEEEEeccCchhhh--ccCCCcCeEEEC
Confidence 455555544332222222 244556777666666553221 11233333333 11111 1 2347778887
Q ss_pred ccCCcEEEeeCCCCeEEE---eecCCCceEE----EEcCCCCC-cceeeecCCc-ceEEEeeCCCCceEEEEecCCCCCE
Q psy6572 618 WVGRNLYWCDKGLDTIEV---AKLDGRFRKV----LINKGLQE-PRGIALNPAY-GYMYWTDWGQNAHIGKAKMDGSNPK 688 (1416)
Q Consensus 618 ~~~~~LYwtD~~~~~I~v---~~ldG~~~~v----Li~~~l~~-P~gIavDp~~-g~LYWtD~g~~~~I~ra~mDGs~r~ 688 (1416)
+-+.+|.++-. ++|++ ...++..+++ .+...+.. +..+ ++.. +.|++.-.+....++++.+||...+
T Consensus 457 pDG~RiA~i~~--g~v~Va~Vvr~~~G~~~l~~~~~l~~~l~~~~~~l--~W~~~~~L~V~~~~~~~~v~~v~vDG~~~~ 532 (591)
T PRK13616 457 RDGVRAAMIIG--GKVYLAVVEQTEDGQYALTNPREVGPGLGDTAVSL--DWRTGDSLVVGRSDPEHPVWYVNLDGSNSD 532 (591)
T ss_pred CCCCEEEEEEC--CEEEEEEEEeCCCCceeecccEEeecccCCccccc--eEecCCEEEEEecCCCCceEEEecCCcccc
Confidence 77777777653 46666 3433333222 01123332 3333 3332 2355554344456899999999877
Q ss_pred EEeecCCCCCeeEEeecCCCeEEEecCC
Q psy6572 689 VIISKNLSWPNALTISYETNELFWGDAH 716 (1416)
Q Consensus 689 vlv~~~l~~P~gLaiD~~~~rLYWtD~~ 716 (1416)
.+...++..| ..+|-...+.||.+|..
T Consensus 533 ~~~~~n~~~~-v~~vaa~~~~iyv~~~~ 559 (591)
T PRK13616 533 ALPSRNLSAP-VVAVAASPSTVYVTDAR 559 (591)
T ss_pred ccCCCCccCc-eEEEecCCceEEEEcCC
Confidence 6555455444 23443334678888753
No 197
>KOG0294|consensus
Probab=74.63 E-value=1.8e+02 Score=34.11 Aligned_cols=135 Identities=14% Similarity=0.071 Sum_probs=67.8
Q ss_pred ccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEee-cCCCceEEEEccCC--cEEEeeCCCCeEEEeecCCC
Q psy6572 567 TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFP-ATSPDGLTVDWVGR--NLYWCDKGLDTIEVAKLDGR 641 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~-l~~p~gLAvD~~~~--~LYwtD~~~~~I~v~~ldG~ 641 (1416)
..+.+||+++ .|++....+ .+|+.+++. .+...|.. ...+.+|.+++..- +| .+-+..+.|.+.+..--
T Consensus 44 ~sitavAVs~----~~~aSGssD-etI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shL-lS~sdDG~i~iw~~~~W 117 (362)
T KOG0294|consen 44 GSITALAVSG----PYVASGSSD-ETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHL-LSGSDDGHIIIWRVGSW 117 (362)
T ss_pred cceeEEEecc----eeEeccCCC-CcEEEEeccchhhhcceeccccceEEEEecCCcchhhe-eeecCCCcEEEEEcCCe
Confidence 3456777765 466555443 578888877 33334444 66667777765432 22 23344456655544321
Q ss_pred c-eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEec-CCCCCEEEeecCCCCCeeEEeecCCCeEEE
Q psy6572 642 F-RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM-DGSNPKVIISKNLSWPNALTISYETNELFW 712 (1416)
Q Consensus 642 ~-~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m-DGs~r~vlv~~~l~~P~gLaiD~~~~rLYW 712 (1416)
. ...|-.. -.+.++|+|+|. |+|-.+-.+.. .|..-+| .|....++-.. ..|.-+..++...+.|+
T Consensus 118 ~~~~slK~H-~~~Vt~lsiHPS-~KLALsVg~D~-~lr~WNLV~Gr~a~v~~L~--~~at~v~w~~~Gd~F~v 185 (362)
T KOG0294|consen 118 ELLKSLKAH-KGQVTDLSIHPS-GKLALSVGGDQ-VLRTWNLVRGRVAFVLNLK--NKATLVSWSPQGDHFVV 185 (362)
T ss_pred EEeeeeccc-ccccceeEecCC-CceEEEEcCCc-eeeeehhhcCccceeeccC--CcceeeEEcCCCCEEEE
Confidence 1 1122222 234899999998 56666654433 3433333 34443333221 22333555544444444
No 198
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins.; GO: 0005515 protein binding
Probab=73.52 E-value=63 Score=37.11 Aligned_cols=140 Identities=16% Similarity=0.109 Sum_probs=77.9
Q ss_pred CCeEEEeeccCCCccEEEEecCCCCeE----Eee------------cCCCceEEEEccCCcE-EEeeCCCCeEEEeecCC
Q psy6572 578 DNCLYWSDVTMHGSSIRRSCNNSQPEL----LFP------------ATSPDGLTVDWVGRNL-YWCDKGLDTIEVAKLDG 640 (1416)
Q Consensus 578 ~~~LYwtD~~~~~~~I~r~~l~s~~~~----l~~------------l~~p~gLAvD~~~~~L-YwtD~~~~~I~v~~ldG 640 (1416)
++.||+--... ..|.|++|.+.... |.. ...-..||||..+=.+ |-+....+.|.++.||-
T Consensus 78 ngslYY~~~~s--~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~ 155 (250)
T PF02191_consen 78 NGSLYYNKYNS--RNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDP 155 (250)
T ss_pred CCcEEEEecCC--ceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCc
Confidence 66788876654 38999988833222 211 1122478999654443 43444456799999986
Q ss_pred CceEEEEc--CCCCCcceeeecCCcceEEEeeCCCC--ceEE-EEecC-CCCCE--EEeecCCCCCeeEEeecCCCeEEE
Q psy6572 641 RFRKVLIN--KGLQEPRGIALNPAYGYMYWTDWGQN--AHIG-KAKMD-GSNPK--VIISKNLSWPNALTISYETNELFW 712 (1416)
Q Consensus 641 ~~~~vLi~--~~l~~P~gIavDp~~g~LYWtD~g~~--~~I~-ra~mD-Gs~r~--vlv~~~l~~P~gLaiD~~~~rLYW 712 (1416)
....+..+ ..+.++..-..=-.=|.||.++.... .+|. ..++. |+... +.+.........|..++.+++||.
T Consensus 156 ~tL~v~~tw~T~~~k~~~~naFmvCGvLY~~~s~~~~~~~I~yafDt~t~~~~~~~i~f~~~~~~~~~l~YNP~dk~LY~ 235 (250)
T PF02191_consen 156 ETLSVEQTWNTSYPKRSAGNAFMVCGVLYATDSYDTRDTEIFYAFDTYTGKEEDVSIPFPNPYGNISMLSYNPRDKKLYA 235 (250)
T ss_pred ccCceEEEEEeccCchhhcceeeEeeEEEEEEECCCCCcEEEEEEECCCCceeceeeeeccccCceEeeeECCCCCeEEE
Confidence 55444322 34444432222223489999985432 3443 23333 21111 112233455668999999999998
Q ss_pred ecCCCCe
Q psy6572 713 GDAHEDY 719 (1416)
Q Consensus 713 tD~~~~~ 719 (1416)
-|-+.-.
T Consensus 236 wd~G~~v 242 (250)
T PF02191_consen 236 WDNGYQV 242 (250)
T ss_pred EECCeEE
Confidence 8754333
No 199
>KOG0266|consensus
Probab=73.04 E-value=1.3e+02 Score=37.82 Aligned_cols=154 Identities=17% Similarity=0.154 Sum_probs=85.7
Q ss_pred EEEEccCCcEEEeeCCCCeEEEeecCCCc---eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEE
Q psy6572 614 LTVDWVGRNLYWCDKGLDTIEVAKLDGRF---RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVI 690 (1416)
Q Consensus 614 LAvD~~~~~LYwtD~~~~~I~v~~ldG~~---~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vl 690 (1416)
++|-+.++. .++-...+.|.+..+.+.. .+.+ .......++|++-|...+|-=+.....-+|+.+.-+|...++|
T Consensus 165 ~~fs~~g~~-l~~~~~~~~i~~~~~~~~~~~~~~~l-~~h~~~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~~~~~~~l 242 (456)
T KOG0266|consen 165 VDFSPDGRA-LAAASSDGLIRIWKLEGIKSNLLREL-SGHTRGVSDVAFSPDGSYLLSGSDDKTLRIWDLKDDGRNLKTL 242 (456)
T ss_pred EEEcCCCCe-EEEccCCCcEEEeecccccchhhccc-cccccceeeeEECCCCcEEEEecCCceEEEeeccCCCeEEEEe
Confidence 444443343 4444444555555553332 1112 2334456778887775533222222223555554444555555
Q ss_pred eecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeEE
Q psy6572 691 ISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIER 769 (1416)
Q Consensus 691 v~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~ 769 (1416)
. ....+.+.+++.+.. +++.+-...+.|...++.+......+... -....++++ ..+.++++-...+.|..
T Consensus 243 ~-gH~~~v~~~~f~p~g-~~i~Sgs~D~tvriWd~~~~~~~~~l~~h------s~~is~~~f~~d~~~l~s~s~d~~i~v 314 (456)
T KOG0266|consen 243 K-GHSTYVTSVAFSPDG-NLLVSGSDDGTVRIWDVRTGECVRKLKGH------SDGISGLAFSPDGNLLVSASYDGTIRV 314 (456)
T ss_pred c-CCCCceEEEEecCCC-CEEEEecCCCcEEEEeccCCeEEEeeecc------CCceEEEEECCCCCEEEEcCCCccEEE
Confidence 4 445566899998766 77777666778888887654444444432 223345666 36778888777777777
Q ss_pred ecccCCCc
Q psy6572 770 CDKYTGKN 777 (1416)
Q Consensus 770 ~nk~tG~~ 777 (1416)
-+..+|..
T Consensus 315 wd~~~~~~ 322 (456)
T KOG0266|consen 315 WDLETGSK 322 (456)
T ss_pred EECCCCce
Confidence 77777764
No 200
>KOG0276|consensus
Probab=72.95 E-value=2e+02 Score=36.67 Aligned_cols=90 Identities=11% Similarity=0.055 Sum_probs=51.8
Q ss_pred CCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEe-ecCCCCC
Q psy6572 620 GRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVII-SKNLSWP 698 (1416)
Q Consensus 620 ~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv-~~~l~~P 698 (1416)
..+..++-+...+|.|.+++...+...+...-+..|.|||+|..-++.-+ ...-.|..-+-++.-....+ ...-...
T Consensus 66 RknWiv~GsDD~~IrVfnynt~ekV~~FeAH~DyIR~iavHPt~P~vLts--SDDm~iKlW~we~~wa~~qtfeGH~HyV 143 (794)
T KOG0276|consen 66 RKNWIVTGSDDMQIRVFNYNTGEKVKTFEAHSDYIRSIAVHPTLPYVLTS--SDDMTIKLWDWENEWACEQTFEGHEHYV 143 (794)
T ss_pred ccceEEEecCCceEEEEecccceeeEEeeccccceeeeeecCCCCeEEec--CCccEEEEeeccCceeeeeEEcCcceEE
Confidence 34455666667899999998766555555566779999999996554332 22334555555554332222 2223333
Q ss_pred eeEEeecCCCeEE
Q psy6572 699 NALTISYETNELF 711 (1416)
Q Consensus 699 ~gLaiD~~~~rLY 711 (1416)
..|++.+....-|
T Consensus 144 Mqv~fnPkD~ntF 156 (794)
T KOG0276|consen 144 MQVAFNPKDPNTF 156 (794)
T ss_pred EEEEecCCCccce
Confidence 4566655443333
No 201
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=72.72 E-value=11 Score=29.85 Aligned_cols=41 Identities=24% Similarity=0.327 Sum_probs=25.8
Q ss_pred CCcceEEEeeCCCCceEEEEecCCCCCEEEeec-CCCCCeeEEee
Q psy6572 661 PAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK-NLSWPNALTIS 704 (1416)
Q Consensus 661 p~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~-~l~~P~gLaiD 704 (1416)
|..++||+++++.. .|..++.... +++..- ...+|.+|+++
T Consensus 1 pd~~~lyv~~~~~~-~v~~id~~~~--~~~~~i~vg~~P~~i~~~ 42 (42)
T TIGR02276 1 PDGTKLYVTNSGSN-TVSVIDTATN--KVIATIPVGGYPFGVAVS 42 (42)
T ss_pred CCCCEEEEEeCCCC-EEEEEECCCC--eEEEEEECCCCCceEEeC
Confidence 45678999998754 6777666322 222221 24679888874
No 202
>KOG0270|consensus
Probab=72.17 E-value=1.3e+02 Score=36.56 Aligned_cols=153 Identities=10% Similarity=0.096 Sum_probs=87.7
Q ss_pred eeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecC-------
Q psy6572 570 VGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLD------- 639 (1416)
Q Consensus 570 ~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ld------- 639 (1416)
.+|++.+....|..+-.... +|....++ ++....+. ...+..|+..+....+..+-+..+++.+.+..
T Consensus 247 l~Ls~n~~~~nVLaSgsaD~--TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~~~s~~ 324 (463)
T KOG0270|consen 247 LALSWNRNFRNVLASGSADK--TVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDPSNSGK 324 (463)
T ss_pred HHHHhccccceeEEecCCCc--eEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccccceEEeeeccCccccCc
Confidence 46667666667776666543 77778887 55544444 56677777777777777777666666665543
Q ss_pred -----CCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEec
Q psy6572 640 -----GRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGD 714 (1416)
Q Consensus 640 -----G~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD 714 (1416)
|..-+++. ....|..+.+....|.|||.|.... |.-..++.. .-....||.+....-.|..+.
T Consensus 325 ~wk~~g~VEkv~w--~~~se~~f~~~tddG~v~~~D~R~~---------~~~vwt~~A-Hd~~ISgl~~n~~~p~~l~t~ 392 (463)
T KOG0270|consen 325 EWKFDGEVEKVAW--DPHSENSFFVSTDDGTVYYFDIRNP---------GKPVWTLKA-HDDEISGLSVNIQTPGLLSTA 392 (463)
T ss_pred eEEeccceEEEEe--cCCCceeEEEecCCceEEeeecCCC---------CCceeEEEe-ccCCcceEEecCCCCcceeec
Confidence 22122222 2234555555566677777774332 111111111 111235777766666666666
Q ss_pred CCCCeEEEEeCCCCceEEEEec
Q psy6572 715 AHEDYIAVSDLNGENIKIIVSR 736 (1416)
Q Consensus 715 ~~~~~I~~~~ldG~~r~~v~~~ 736 (1416)
..-+.|...++++.+.+.+...
T Consensus 393 s~d~~Vklw~~~~~~~~~v~~~ 414 (463)
T KOG0270|consen 393 STDKVVKLWKFDVDSPKSVKEH 414 (463)
T ss_pred cccceEEEEeecCCCCcccccc
Confidence 6666666777777766655543
No 203
>KOG2055|consensus
Probab=71.45 E-value=2.2e+02 Score=34.98 Aligned_cols=59 Identities=12% Similarity=0.105 Sum_probs=33.0
Q ss_pred eEEEEccCCcEEEeeCCCC-eEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCC
Q psy6572 613 GLTVDWVGRNLYWCDKGLD-TIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQN 674 (1416)
Q Consensus 613 gLAvD~~~~~LYwtD~~~~-~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~ 674 (1416)
-||+--..|+||.....++ .|..+.+.|....+.+.+.. +-|.+--..|.+|+.|.+.+
T Consensus 317 fia~~G~~G~I~lLhakT~eli~s~KieG~v~~~~fsSds---k~l~~~~~~GeV~v~nl~~~ 376 (514)
T KOG2055|consen 317 FIAIAGNNGHIHLLHAKTKELITSFKIEGVVSDFTFSSDS---KELLASGGTGEVYVWNLRQN 376 (514)
T ss_pred eEEEcccCceEEeehhhhhhhhheeeeccEEeeEEEecCC---cEEEEEcCCceEEEEecCCc
Confidence 4566556666666655443 46666777776665554222 33333344566666665554
No 204
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=70.50 E-value=2e+02 Score=32.74 Aligned_cols=57 Identities=16% Similarity=0.126 Sum_probs=37.4
Q ss_pred ccceeeeeecCCCeEEEeeccCCCccEEEEecCCC-CeEEe-e------cCCCceEEEEccCCcEEEe
Q psy6572 567 TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQ-PELLF-P------ATSPDGLTVDWVGRNLYWC 626 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~-~~~l~-~------l~~p~gLAvD~~~~~LYwt 626 (1416)
...+||+|-+.+++||-+... ++||.++..+. .+.+- . ...+.++.|+|..++|.+.
T Consensus 27 e~l~GID~Rpa~G~LYgl~~~---g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvv 91 (236)
T PF14339_consen 27 ESLVGIDFRPANGQLYGLGST---GRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVV 91 (236)
T ss_pred CeEEEEEeecCCCCEEEEeCC---CcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEE
Confidence 467899999999999988443 48999988843 33331 1 2335566666655555443
No 205
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=70.38 E-value=1.7e+02 Score=35.62 Aligned_cols=102 Identities=12% Similarity=0.146 Sum_probs=54.6
Q ss_pred cceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCc
Q psy6572 663 YGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTI 742 (1416)
Q Consensus 663 ~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~ 742 (1416)
.+.||++... ..+...++.. -+++-...+..+..++++ .++||++.. .+.|..++...... +-......
T Consensus 256 ~~~vy~~~~~--g~l~ald~~t--G~~~W~~~~~~~~~~~~~--~~~vy~~~~-~g~l~ald~~tG~~--~W~~~~~~-- 324 (394)
T PRK11138 256 GGVVYALAYN--GNLVALDLRS--GQIVWKREYGSVNDFAVD--GGRIYLVDQ-NDRVYALDTRGGVE--LWSQSDLL-- 324 (394)
T ss_pred CCEEEEEEcC--CeEEEEECCC--CCEEEeecCCCccCcEEE--CCEEEEEcC-CCeEEEEECCCCcE--EEcccccC--
Confidence 4677777643 2455554431 123333334444455553 789998874 46788888643221 21111000
Q ss_pred ccccceeEEEecCcEEEeecCCCeeEEecccCCCc
Q psy6572 743 NLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKN 777 (1416)
Q Consensus 743 ~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~ 777 (1416)
-....+.++.+++||..+. .+.|+.++..+|+.
T Consensus 325 -~~~~~sp~v~~g~l~v~~~-~G~l~~ld~~tG~~ 357 (394)
T PRK11138 325 -HRLLTAPVLYNGYLVVGDS-EGYLHWINREDGRF 357 (394)
T ss_pred -CCcccCCEEECCEEEEEeC-CCEEEEEECCCCCE
Confidence 0111234467888988764 46777788777764
No 206
>KOG0772|consensus
Probab=69.94 E-value=1.1e+02 Score=38.02 Aligned_cols=154 Identities=14% Similarity=0.073 Sum_probs=79.0
Q ss_pred eeeecCCCeEEEeeccCCCccEEEEecC---CCCeEEee------cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCc
Q psy6572 572 LDFDWVDNCLYWSDVTMHGSSIRRSCNN---SQPELLFP------ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF 642 (1416)
Q Consensus 572 l~~D~~~~~LYwtD~~~~~~~I~r~~l~---s~~~~l~~------l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~ 642 (1416)
..+++.+...|.+-.... +++...++ ++.+||.. -..|..-|+++ .++++-+--..+.|..-++.+..
T Consensus 274 g~whP~~k~~FlT~s~Dg--tlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nr-dg~~iAagc~DGSIQ~W~~~~~~ 350 (641)
T KOG0772|consen 274 GCWHPDNKEEFLTCSYDG--TLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNR-DGKLIAAGCLDGSIQIWDKGSRT 350 (641)
T ss_pred cccccCcccceEEecCCC--cEEEEecCCchhheeEEeeccCCCcccCceeeecCC-CcchhhhcccCCceeeeecCCcc
Confidence 457888888888866643 56656665 45555554 12344455554 33444444445666666654333
Q ss_pred eEEE--Ec---CCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEE----------------------EeecCC
Q psy6572 643 RKVL--IN---KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKV----------------------IISKNL 695 (1416)
Q Consensus 643 ~~vL--i~---~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~v----------------------lv~~~l 695 (1416)
.++. +. ........|++.+...+|. + .|....+.+-+|.-.+.-+ |+.++.
T Consensus 351 v~p~~~vk~AH~~g~~Itsi~FS~dg~~Ll-S-Rg~D~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FSPd~kli~TGt 428 (641)
T KOG0772|consen 351 VRPVMKVKDAHLPGQDITSISFSYDGNYLL-S-RGFDDTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFSPDDKLILTGT 428 (641)
T ss_pred cccceEeeeccCCCCceeEEEeccccchhh-h-ccCCCceeeeeccccccchhhhcCCCccCCCCccccCCCceEEEecc
Confidence 2221 11 1223456677766644432 1 1111223333333222222 222233
Q ss_pred CCCeeEEeecCCCeEEEecCC-CCeEEEEeCCCCceEEEE
Q psy6572 696 SWPNALTISYETNELFWGDAH-EDYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 696 ~~P~gLaiD~~~~rLYWtD~~-~~~I~~~~ldG~~r~~v~ 734 (1416)
..|++.+ .+.|||.|.. ...|+++.+++..+..++
T Consensus 429 S~~~~~~----~g~L~f~d~~t~d~v~ki~i~~aSvv~~~ 464 (641)
T KOG0772|consen 429 SAPNGMT----AGTLFFFDRMTLDTVYKIDISTASVVRCL 464 (641)
T ss_pred cccCCCC----CceEEEEeccceeeEEEecCCCceEEEEe
Confidence 3445443 4688888874 578899998866554443
No 207
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=68.94 E-value=64 Score=37.49 Aligned_cols=106 Identities=14% Similarity=0.132 Sum_probs=58.9
Q ss_pred EEEEccCCcEEEeeCCC-CeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEee
Q psy6572 614 LTVDWVGRNLYWCDKGL-DTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIIS 692 (1416)
Q Consensus 614 LAvD~~~~~LYwtD~~~-~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~ 692 (1416)
=||+|+.+.||+--|.. ..|++.+-+|+ .++......+....+-++-.+=+|.|.+.-.. + .
T Consensus 40 NAV~~vDd~IyFGGWVHAPa~y~gk~~g~-~~IdF~NKYSHVH~yd~e~~~VrLLWkesih~-------------~---~ 102 (339)
T PF09910_consen 40 NAVEWVDDFIYFGGWVHAPAVYEGKGDGR-ATIDFRNKYSHVHEYDTENDSVRLLWKESIHD-------------K---T 102 (339)
T ss_pred eeeeeecceEEEeeeecCCceeeeccCCc-eEEEEeeccceEEEEEcCCCeEEEEEecccCC-------------c---c
Confidence 58999999999998864 35666665555 33333233322222222222223344331110 0 0
Q ss_pred cCCCCCeeEEeecCCCeEEEecCCCC---eEEEEeCCCCceEEEEec
Q psy6572 693 KNLSWPNALTISYETNELFWGDAHED---YIAVSDLNGENIKIIVSR 736 (1416)
Q Consensus 693 ~~l~~P~gLaiD~~~~rLYWtD~~~~---~I~~~~ldG~~r~~v~~~ 736 (1416)
.-....+.|..|+.+++||++-+..+ -|++++..+...+.|...
T Consensus 103 ~WaGEVSdIlYdP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~~ 149 (339)
T PF09910_consen 103 KWAGEVSDILYDPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSSN 149 (339)
T ss_pred ccccchhheeeCCCcCEEEEEecCCcceeeeEEEcccCCceeeccCC
Confidence 00112346888999999999966433 578888666666666553
No 208
>KOG4378|consensus
Probab=67.50 E-value=93 Score=38.28 Aligned_cols=156 Identities=15% Similarity=0.178 Sum_probs=89.2
Q ss_pred ccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCC-eEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCC
Q psy6572 565 NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQP-ELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDG 640 (1416)
Q Consensus 565 ~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~-~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG 640 (1416)
....+.+|+|.|....|-- ... ++-|....+. ... +.+.. ...++-|.+-+..+.|..+-+..+.|.+.+..|
T Consensus 120 h~stvt~v~YN~~DeyiAs--vs~-gGdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g 196 (673)
T KOG4378|consen 120 HQSTVTYVDYNNTDEYIAS--VSD-GGDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQG 196 (673)
T ss_pred CcceeEEEEecCCcceeEE--ecc-CCcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccCCeEEEEeccC
Confidence 3456778888887665433 332 2344444443 111 12211 333456667777888888888888888888888
Q ss_pred CceEEEEcCCCCCc-ceeeecCCcceEEEeeCCCCceEEEEecCCCCC-EEEeecCCCCC-eeEEeecCCCeEEEecCCC
Q psy6572 641 RFRKVLINKGLQEP-RGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP-KVIISKNLSWP-NALTISYETNELFWGDAHE 717 (1416)
Q Consensus 641 ~~~~vLi~~~l~~P-~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r-~vlv~~~l~~P-~gLaiD~~~~rLYWtD~~~ 717 (1416)
.....-.......| +||.+-|.+-.|+++- |...+|...+...... ..|+ ..+| ..+++- ..+.+..+-...
T Consensus 197 ~sp~~~~~~~HsAP~~gicfspsne~l~vsV-G~Dkki~~yD~~s~~s~~~l~---y~~Plstvaf~-~~G~~L~aG~s~ 271 (673)
T KOG4378|consen 197 MSPIFHASEAHSAPCRGICFSPSNEALLVSV-GYDKKINIYDIRSQASTDRLT---YSHPLSTVAFS-ECGTYLCAGNSK 271 (673)
T ss_pred CCcccchhhhccCCcCcceecCCccceEEEe-cccceEEEeecccccccceee---ecCCcceeeec-CCceEEEeecCC
Confidence 75443333333334 7999999999999886 4444677666542211 1111 1234 244543 345555555556
Q ss_pred CeEEEEeCCCC
Q psy6572 718 DYIAVSDLNGE 728 (1416)
Q Consensus 718 ~~I~~~~ldG~ 728 (1416)
++|...++.+.
T Consensus 272 G~~i~YD~R~~ 282 (673)
T KOG4378|consen 272 GELIAYDMRST 282 (673)
T ss_pred ceEEEEecccC
Confidence 67777776654
No 209
>KOG1217|consensus
Probab=66.67 E-value=4.8 Score=50.00 Aligned_cols=60 Identities=33% Similarity=0.808 Sum_probs=48.7
Q ss_pred ccceecCCceEEeeCCCceecCCCCCccccCCcCCCC-Cccc--eeeecCCeeeecCCCCcEEec
Q psy6572 466 HECIDLKIGYKCACRKGYQVHPEDKHLCVDTNECLDR-PCSH--YCRNTLGSYSCSCAPGYALLS 527 (1416)
Q Consensus 466 ~~C~nt~~gy~C~C~~Gy~L~p~d~~tC~didEC~~~-~Csq--~C~nt~gsy~C~C~~Gy~L~~ 527 (1416)
..|+++.++|+|.|++||.+.. ...|.++++|+.. .|.+ .|++..++|.|.|++||....
T Consensus 243 ~~c~~~~~~~~C~~~~g~~~~~--~~~~~~~~~C~~~~~c~~~~~C~~~~~~~~C~C~~g~~g~~ 305 (487)
T KOG1217|consen 243 GTCVNTVGSYTCRCPEGYTGDA--CVTCVDVDSCALIASCPNGGTCVNVPGSYRCTCPPGFTGRL 305 (487)
T ss_pred CcccccCCceeeeCCCCccccc--cceeeeccccCCCCccCCCCeeecCCCcceeeCCCCCCCCC
Confidence 4688999999999999998751 1468899999843 3765 899999999999999997443
No 210
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=66.61 E-value=2.6e+02 Score=33.74 Aligned_cols=108 Identities=9% Similarity=0.064 Sum_probs=51.3
Q ss_pred cceEEEeeCCCCceEEEEec-CCCCCEEEeecC-CCCCeeEEeecCCCeEEEecCCCCeEEEEeCC-CCceEEEEeccCC
Q psy6572 663 YGYMYWTDWGQNAHIGKAKM-DGSNPKVIISKN-LSWPNALTISYETNELFWGDAHEDYIAVSDLN-GENIKIIVSRRMD 739 (1416)
Q Consensus 663 ~g~LYWtD~g~~~~I~ra~m-DGs~r~vlv~~~-l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ld-G~~r~~v~~~~~~ 739 (1416)
.|+||+.++.. ++...++ +|...-..-... +.+..+.++ ..+.+|+.. ..+++..++.+ |..+-.+-...
T Consensus 111 ~G~i~~g~~~g--~~y~ld~~~G~~~W~~~~~~~~~~~~~~v~--~~~~v~~~s-~~g~~~al~~~tG~~~W~~~~~~-- 183 (370)
T COG1520 111 DGKIYVGSWDG--KLYALDASTGTLVWSRNVGGSPYYASPPVV--GDGTVYVGT-DDGHLYALNADTGTLKWTYETPA-- 183 (370)
T ss_pred CCeEEEecccc--eEEEEECCCCcEEEEEecCCCeEEecCcEE--cCcEEEEec-CCCeEEEEEccCCcEEEEEecCC--
Confidence 45566666432 4555555 444332222222 111222222 356666653 34666676655 55443322211
Q ss_pred CCcccccceeEEEecCcEEEeecC-CCeeEEecccCCCce
Q psy6572 740 PTINLHHVFALAVFEDHLFWTDWE-MKSIERCDKYTGKNC 778 (1416)
Q Consensus 740 p~~~l~~P~~lav~~d~LYwtD~~-~~~I~~~nk~tG~~~ 778 (1416)
+ ..+.-..+.++..+.||++... ...+..++..+|...
T Consensus 184 ~-~~~~~~~~~~~~~~~vy~~~~~~~~~~~a~~~~~G~~~ 222 (370)
T COG1520 184 P-LSLSIYGSPAIASGTVYVGSDGYDGILYALNAEDGTLK 222 (370)
T ss_pred c-cccccccCceeecceEEEecCCCcceEEEEEccCCcEe
Confidence 0 1122223333777888887653 345666776666543
No 211
>KOG0279|consensus
Probab=65.61 E-value=2.6e+02 Score=32.34 Aligned_cols=160 Identities=13% Similarity=0.025 Sum_probs=97.6
Q ss_pred CCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEE
Q psy6572 610 SPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKV 689 (1416)
Q Consensus 610 ~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~v 689 (1416)
.+.++++- ..++.+++-+..+.+..-++.+...+..+........++|+.+.+..|- +-.. ...|..-+.-|.-..+
T Consensus 65 ~v~dv~~s-~dg~~alS~swD~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qiv-SGSr-DkTiklwnt~g~ck~t 141 (315)
T KOG0279|consen 65 FVSDVVLS-SDGNFALSASWDGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQIV-SGSR-DKTIKLWNTLGVCKYT 141 (315)
T ss_pred EecceEEc-cCCceEEeccccceEEEEEecCCcEEEEEEecCCceEEEEecCCCceee-cCCC-cceeeeeeecccEEEE
Confidence 35566665 3566677777778888889887544444434555678999999876653 3222 2356666666666556
Q ss_pred EeecC-CCCCeeEEeecCCCeEEEecCCCC-eEEEEeCCCCceEEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCe
Q psy6572 690 IISKN-LSWPNALTISYETNELFWGDAHED-YIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKS 766 (1416)
Q Consensus 690 lv~~~-l~~P~gLaiD~~~~rLYWtD~~~~-~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~ 766 (1416)
+.... -.|.+-+.+.|.+...|.+.+..+ .|..-++++-..+.-+.+. -.....++| ..+.|-.+-.+.+.
T Consensus 142 ~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh------~~~v~t~~vSpDGslcasGgkdg~ 215 (315)
T KOG0279|consen 142 IHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLRTTFIGH------SGYVNTVTVSPDGSLCASGGKDGE 215 (315)
T ss_pred EecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchhhccccc------cccEEEEEECCCCCEEecCCCCce
Confidence 65544 677788888876666676666544 5555577765554433332 234456666 35666666666666
Q ss_pred eEEecccCCCce
Q psy6572 767 IERCDKYTGKNC 778 (1416)
Q Consensus 767 I~~~nk~tG~~~ 778 (1416)
++-.+...++..
T Consensus 216 ~~LwdL~~~k~l 227 (315)
T KOG0279|consen 216 AMLWDLNEGKNL 227 (315)
T ss_pred EEEEEccCCcee
Confidence 666555555543
No 212
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=65.24 E-value=2.2e+02 Score=31.25 Aligned_cols=117 Identities=10% Similarity=0.087 Sum_probs=65.3
Q ss_pred EEEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee--cCCCceEEEEccCCcE
Q psy6572 548 YYIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP--ATSPDGLTVDWVGRNL 623 (1416)
Q Consensus 548 ~~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~--l~~p~gLAvD~~~~~L 623 (1416)
..|.+++..+.. .+.+.....+..+++.+.++++.++-.... ..|....+. .+.+.. -.....|+..|.++.|
T Consensus 39 ~~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~~-~~v~lyd~~--~~~i~~~~~~~~n~i~wsP~G~~l 115 (194)
T PF08662_consen 39 FELFYLNEKNIPVESIELKKEGPIHDVAWSPNGNEFAVIYGSMP-AKVTLYDVK--GKKIFSFGTQPRNTISWSPDGRFL 115 (194)
T ss_pred EEEEEEecCCCccceeeccCCCceEEEEECcCCCEEEEEEccCC-cccEEEcCc--ccEeEeecCCCceEEEECCCCCEE
Confidence 345566555444 233333345788999998888776643322 245444444 222333 3345578888888888
Q ss_pred EEeeCC--CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe
Q psy6572 624 YWCDKG--LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT 669 (1416)
Q Consensus 624 YwtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt 669 (1416)
..+..+ .+.|.+.++. ..+.|..........++-+|..++|..+
T Consensus 116 ~~~g~~n~~G~l~~wd~~--~~~~i~~~~~~~~t~~~WsPdGr~~~ta 161 (194)
T PF08662_consen 116 VLAGFGNLNGDLEFWDVR--KKKKISTFEHSDATDVEWSPDGRYLATA 161 (194)
T ss_pred EEEEccCCCcEEEEEECC--CCEEeeccccCcEEEEEEcCCCCEEEEE
Confidence 777654 3567877776 3333333333345566666665544433
No 213
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=65.03 E-value=2.9e+02 Score=32.66 Aligned_cols=221 Identities=11% Similarity=0.076 Sum_probs=120.2
Q ss_pred eEEEEecE--EEEEEecCCcc---eEEeccccccee-eeeecCCCeEEEeeccC--CCccEEEEecCCCCeEEee----c
Q psy6572 541 NLLFTNKY--YIREVTQAGVM---TIRIHNQTNAVG-LDFDWVDNCLYWSDVTM--HGSSIRRSCNNSQPELLFP----A 608 (1416)
Q Consensus 541 ~li~s~~~--~I~~i~l~g~~---~~~~~~l~~~~~-l~~D~~~~~LYwtD~~~--~~~~I~r~~l~s~~~~l~~----l 608 (1416)
.++|+.+- .+..++..+.. .+....-++..| -.|....++||-+.... ..+.|.+.......+.+.+ .
T Consensus 19 avafaRRPG~~~~v~D~~~g~~~~~~~a~~gRHFyGHg~fs~dG~~LytTEnd~~~g~G~IgVyd~~~~~~ri~E~~s~G 98 (305)
T PF07433_consen 19 AVAFARRPGTFALVFDCRTGQLLQRLWAPPGRHFYGHGVFSPDGRLLYTTENDYETGRGVIGVYDAARGYRRIGEFPSHG 98 (305)
T ss_pred EEEEEeCCCcEEEEEEcCCCceeeEEcCCCCCEEecCEEEcCCCCEEEEeccccCCCcEEEEEEECcCCcEEEeEecCCC
Confidence 45566543 34555554433 122222233333 24677788888885533 2346666666633333333 5
Q ss_pred CCCceEEEEccCCcEEEeeCCCC-----------------eEEEe-ecCCCceEEE-EcC--CCCCcceeeecCCcceEE
Q psy6572 609 TSPDGLTVDWVGRNLYWCDKGLD-----------------TIEVA-KLDGRFRKVL-INK--GLQEPRGIALNPAYGYMY 667 (1416)
Q Consensus 609 ~~p~gLAvD~~~~~LYwtD~~~~-----------------~I~v~-~ldG~~~~vL-i~~--~l~~P~gIavDp~~g~LY 667 (1416)
..|.-|.+.+-+..|.+++.+-. .|..+ ..+|...... +.. ...+.|-||++.. |.+.
T Consensus 99 IGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~~lSiRHLa~~~~-G~V~ 177 (305)
T PF07433_consen 99 IGPHELLLMPDGETLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPPDLHQLSIRHLAVDGD-GTVA 177 (305)
T ss_pred cChhhEEEcCCCCEEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeeeeecCccccccceeeEEecCC-CcEE
Confidence 57888999888888888876532 12222 2233332221 111 2235788999988 5555
Q ss_pred Eee-CC-----CCceEEEEecCCCCCEEEeec----CC-CCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEec
Q psy6572 668 WTD-WG-----QNAHIGKAKMDGSNPKVIISK----NL-SWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSR 736 (1416)
Q Consensus 668 WtD-~g-----~~~~I~ra~mDGs~r~vlv~~----~l-~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~ 736 (1416)
+.- |. ..|-|.+...++..+..-... .| .+--+|+++...+.|..+-.+.+.+...+... .+.+...
T Consensus 178 ~a~Q~qg~~~~~~PLva~~~~g~~~~~~~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~t--g~~~~~~ 255 (305)
T PF07433_consen 178 FAMQYQGDPGDAPPLVALHRRGGALRLLPAPEEQWRRLNGYIGSIAADRDGRLIAVTSPRGGRVAVWDAAT--GRLLGSV 255 (305)
T ss_pred EEEecCCCCCccCCeEEEEcCCCcceeccCChHHHHhhCCceEEEEEeCCCCEEEEECCCCCEEEEEECCC--CCEeecc
Confidence 543 32 124455555444422221111 12 34557999877777888888888888875432 2223222
Q ss_pred cCCCCcccccceeEEEecCcEEEeecCCCeeEEec
Q psy6572 737 RMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCD 771 (1416)
Q Consensus 737 ~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~n 771 (1416)
.+...-+|+...+-+.|| .+.+.|.++.
T Consensus 256 ------~l~D~cGva~~~~~f~~s-sG~G~~~~~~ 283 (305)
T PF07433_consen 256 ------PLPDACGVAPTDDGFLVS-SGQGQLIRLS 283 (305)
T ss_pred ------ccCceeeeeecCCceEEe-CCCccEEEcc
Confidence 277788999887774444 4556666554
No 214
>KOG0263|consensus
Probab=64.73 E-value=2.7e+02 Score=36.31 Aligned_cols=192 Identities=12% Similarity=0.103 Sum_probs=100.8
Q ss_pred EecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEE-ee--cCCCceEEEEccCCcEEEeeCCCC-eEEEee
Q psy6572 562 RIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELL-FP--ATSPDGLTVDWVGRNLYWCDKGLD-TIEVAK 637 (1416)
Q Consensus 562 ~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l-~~--l~~p~gLAvD~~~~~LYwtD~~~~-~I~v~~ 637 (1416)
+....+.+++..|-+..+.|.=.... .+|+...+.+...++ .+ +.-+..+.|-|. -.||+..+.. +-.+-.
T Consensus 447 L~GH~GPVyg~sFsPd~rfLlScSED---~svRLWsl~t~s~~V~y~GH~~PVwdV~F~P~--GyYFatas~D~tArLWs 521 (707)
T KOG0263|consen 447 LYGHSGPVYGCSFSPDRRFLLSCSED---SSVRLWSLDTWSCLVIYKGHLAPVWDVQFAPR--GYYFATASHDQTARLWS 521 (707)
T ss_pred eecCCCceeeeeecccccceeeccCC---cceeeeecccceeEEEecCCCcceeeEEecCC--ceEEEecCCCceeeeee
Confidence 44455567788887766544322221 356666666333222 22 333344555542 3455544433 222222
Q ss_pred cCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEec--CCCCCEEEeecCCCCCeeEEeecCCCeEEEecC
Q psy6572 638 LDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM--DGSNPKVIISKNLSWPNALTISYETNELFWGDA 715 (1416)
Q Consensus 638 ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m--DGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~ 715 (1416)
.+-....-++...+....-++++|...||+-- ...+-.|++. .|..++++ +..-....+|++.+ .||-..+-.
T Consensus 522 ~d~~~PlRifaghlsDV~cv~FHPNs~Y~aTG---SsD~tVRlWDv~~G~~VRiF-~GH~~~V~al~~Sp-~Gr~LaSg~ 596 (707)
T KOG0263|consen 522 TDHNKPLRIFAGHLSDVDCVSFHPNSNYVATG---SSDRTVRLWDVSTGNSVRIF-TGHKGPVTALAFSP-CGRYLASGD 596 (707)
T ss_pred cccCCchhhhcccccccceEEECCcccccccC---CCCceEEEEEcCCCcEEEEe-cCCCCceEEEEEcC-CCceEeecc
Confidence 33333333334667778889999998776532 2223334333 34433333 33344446777765 444333333
Q ss_pred CCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeEE
Q psy6572 716 HEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIER 769 (1416)
Q Consensus 716 ~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~ 769 (1416)
..+.|...++-+..+...+.++ ....++|++ .++.|+++.....+|..
T Consensus 597 ed~~I~iWDl~~~~~v~~l~~H------t~ti~SlsFS~dg~vLasgg~DnsV~l 645 (707)
T KOG0263|consen 597 EDGLIKIWDLANGSLVKQLKGH------TGTIYSLSFSRDGNVLASGGADNSVRL 645 (707)
T ss_pred cCCcEEEEEcCCCcchhhhhcc------cCceeEEEEecCCCEEEecCCCCeEEE
Confidence 4567777777654443333332 345677777 57788888877776654
No 215
>KOG4649|consensus
Probab=64.19 E-value=2.7e+02 Score=31.97 Aligned_cols=56 Identities=16% Similarity=0.116 Sum_probs=35.9
Q ss_pred eeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEe
Q psy6572 656 GIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWG 713 (1416)
Q Consensus 656 gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWt 713 (1416)
-++.-|..|.|.|.+.. ....++.++-.-|..+.+-++.+|.=|+.-...+++.+.
T Consensus 242 f~~~~p~~ghL~w~~~~--g~t~~vy~~p~l~F~~h~~~~S~~~ll~~~s~dgkv~il 297 (354)
T KOG4649|consen 242 FCAPLPIAGHLLWATQS--GTTLHVYLSPKLRFDLHSPGISYPKLLRRSSGDGKVMIL 297 (354)
T ss_pred EEEeccccceEEEEecC--CcEEEEEeCcccceeccCCCCcchhhhhhhcCCCcEEEE
Confidence 36677888999998832 245566666665555555556666666666556666554
No 216
>PLN00181 protein SPA1-RELATED; Provisional
Probab=63.83 E-value=5e+02 Score=35.01 Aligned_cols=112 Identities=4% Similarity=-0.100 Sum_probs=61.9
Q ss_pred ccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCe--EEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCce
Q psy6572 567 TNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPE--LLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFR 643 (1416)
Q Consensus 567 ~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~--~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~ 643 (1416)
..+.++++.+..+.+..+-... +.|...++.+... .+.. ...+.+|++.+..+.++++-...+.|.+.++.....
T Consensus 533 ~~v~~l~~~~~~~~~las~~~D--g~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~ 610 (793)
T PLN00181 533 SKLSGICWNSYIKSQVASSNFE--GVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVS 610 (793)
T ss_pred CceeeEEeccCCCCEEEEEeCC--CeEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcE
Confidence 3455666665433333332222 3677666663211 1211 445667888777788888888888999998864432
Q ss_pred -EEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecC
Q psy6572 644 -KVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMD 683 (1416)
Q Consensus 644 -~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mD 683 (1416)
.++. . ......+++.+..|.++.+-.. ...|...++.
T Consensus 611 ~~~~~-~-~~~v~~v~~~~~~g~~latgs~-dg~I~iwD~~ 648 (793)
T PLN00181 611 IGTIK-T-KANICCVQFPSESGRSLAFGSA-DHKVYYYDLR 648 (793)
T ss_pred EEEEe-c-CCCeEEEEEeCCCCCEEEEEeC-CCeEEEEECC
Confidence 2332 1 2344566665555665555422 2356666654
No 217
>KOG0319|consensus
Probab=63.79 E-value=2.7e+02 Score=36.11 Aligned_cols=183 Identities=10% Similarity=0.016 Sum_probs=97.0
Q ss_pred CCCCCeEEEEecEEEEEEecCCcce-EE---ecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCC-eEEee--
Q psy6572 536 SDVPPNLLFTNKYYIREVTQAGVMT-IR---IHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQP-ELLFP-- 607 (1416)
Q Consensus 536 ~~~~~~li~s~~~~I~~i~l~g~~~-~~---~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~-~~l~~-- 607 (1416)
+....+|+-+....|..+++..... +. ......+.++++.+.++.||.+-... .|..+.+. +.. +....
T Consensus 28 s~nG~~L~t~~~d~Vi~idv~t~~~~l~s~~~ed~d~ita~~l~~d~~~L~~a~rs~---llrv~~L~tgk~irswKa~H 104 (775)
T KOG0319|consen 28 SSNGQHLYTACGDRVIIIDVATGSIALPSGSNEDEDEITALALTPDEEVLVTASRSQ---LLRVWSLPTGKLIRSWKAIH 104 (775)
T ss_pred CCCCCEEEEecCceEEEEEccCCceecccCCccchhhhheeeecCCccEEEEeeccc---eEEEEEcccchHhHhHhhcc
Confidence 3334556666666676666654432 11 11223467888888888888765542 45566666 311 11111
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcce--EEEeeCCCCceEEEEecCCC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGY--MYWTDWGQNAHIGKAKMDGS 685 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~--LYWtD~g~~~~I~ra~mDGs 685 (1416)
-..+..+|+|+-+ .|.-|-...++|.|-++.+.+.+.-+.+-..-..+|...|.-.+ ||.. +....+..-++.-.
T Consensus 105 e~Pvi~ma~~~~g-~LlAtggaD~~v~VWdi~~~~~th~fkG~gGvVssl~F~~~~~~~lL~sg--~~D~~v~vwnl~~~ 181 (775)
T KOG0319|consen 105 EAPVITMAFDPTG-TLLATGGADGRVKVWDIKNGYCTHSFKGHGGVVSSLLFHPHWNRWLLASG--ATDGTVRVWNLNDK 181 (775)
T ss_pred CCCeEEEEEcCCC-ceEEeccccceEEEEEeeCCEEEEEecCCCceEEEEEeCCccchhheeec--CCCceEEEEEcccC
Confidence 2334578999866 55555556688999999988877666433333556666665333 2211 12234555454422
Q ss_pred CC-EEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC
Q psy6572 686 NP-KVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 686 ~r-~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l 725 (1416)
.. ..+.......-.+|++-.. ++-.++-.+...|...++
T Consensus 182 ~tcl~~~~~H~S~vtsL~~~~d-~~~~ls~~RDkvi~vwd~ 221 (775)
T KOG0319|consen 182 RTCLHTMILHKSAVTSLAFSED-SLELLSVGRDKVIIVWDL 221 (775)
T ss_pred chHHHHHHhhhhheeeeeeccC-CceEEEeccCcEEEEeeh
Confidence 21 1122234555677777543 333333333344555555
No 218
>KOG0289|consensus
Probab=63.46 E-value=3.5e+02 Score=33.10 Aligned_cols=200 Identities=11% Similarity=-0.003 Sum_probs=95.9
Q ss_pred eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee----cCCCceEEEEccCCcEEEeeCCCCeEE
Q psy6572 560 TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP----ATSPDGLTVDWVGRNLYWCDKGLDTIE 634 (1416)
Q Consensus 560 ~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~----l~~p~gLAvD~~~~~LYwtD~~~~~I~ 634 (1416)
+++...-..+.++..++.+..|.|++... .+--..+. +..-+++. -.....++|.+ .|+||-+-...+.|.
T Consensus 297 ~~~~~h~~~V~~ls~h~tgeYllsAs~d~---~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHp-DgLifgtgt~d~~vk 372 (506)
T KOG0289|consen 297 TSSRPHEEPVTGLSLHPTGEYLLSASNDG---TWAFSDISSGSQLTVVSDETSDVEYTSAAFHP-DGLIFGTGTPDGVVK 372 (506)
T ss_pred cccccccccceeeeeccCCcEEEEecCCc---eEEEEEccCCcEEEEEeeccccceeEEeeEcC-CceEEeccCCCceEE
Confidence 44444455677888888888888887662 22212222 11111221 12245566654 678888888777787
Q ss_pred EeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCC-CCCEEEeecCCCCCeeEEeecCCCeEEEe
Q psy6572 635 VAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDG-SNPKVIISKNLSWPNALTISYETNELFWG 713 (1416)
Q Consensus 635 v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDG-s~r~vlv~~~l~~P~gLaiD~~~~rLYWt 713 (1416)
+.++.......-+...-...+.|++- .+||-..+..... .|.--+|.- .+.+++........+.+.+|.....|-.+
T Consensus 373 iwdlks~~~~a~Fpght~~vk~i~Fs-ENGY~Lat~add~-~V~lwDLRKl~n~kt~~l~~~~~v~s~~fD~SGt~L~~~ 450 (506)
T KOG0289|consen 373 IWDLKSQTNVAKFPGHTGPVKAISFS-ENGYWLATAADDG-SVKLWDLRKLKNFKTIQLDEKKEVNSLSFDQSGTYLGIA 450 (506)
T ss_pred EEEcCCccccccCCCCCCceeEEEec-cCceEEEEEecCC-eEEEEEehhhcccceeeccccccceeEEEcCCCCeEEee
Confidence 77776544222221223335677775 4465444432222 244333332 13333333333345778888755555443
Q ss_pred cCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEe
Q psy6572 714 DAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERC 770 (1416)
Q Consensus 714 D~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~ 770 (1416)
...=.|+...-...+-+.+..... ......++.+-+...|..+....+++++
T Consensus 451 -g~~l~Vy~~~k~~k~W~~~~~~~~----~sg~st~v~Fg~~aq~l~s~smd~~l~~ 502 (506)
T KOG0289|consen 451 -GSDLQVYICKKKTKSWTEIKELAD----HSGLSTGVRFGEHAQYLASTSMDAILRL 502 (506)
T ss_pred -cceeEEEEEecccccceeeehhhh----cccccceeeecccceEEeeccchhheEE
Confidence 222234443322222222222110 0111233334455566666666555544
No 219
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=62.93 E-value=6.2 Score=30.32 Aligned_cols=26 Identities=31% Similarity=0.727 Sum_probs=19.0
Q ss_pred CCcccccccccccCCCCCeEEecCCCceeccC
Q psy6572 1294 RTCSQICIEKKISNTERTFSCHCAEGYHMVHG 1325 (1416)
Q Consensus 1294 ~~Csq~C~n~~~~n~~gs~~C~C~~gy~~~~~ 1325 (1416)
..|-..|.+.. .+.|.|.+||+|+.+
T Consensus 6 t~CpA~CDpn~------~~~C~CPeGyIlde~ 31 (34)
T PF09064_consen 6 TECPADCDPNS------PGQCFCPEGYILDEG 31 (34)
T ss_pred ccCCCccCCCC------CCceeCCCceEecCC
Confidence 45666676542 359999999999865
No 220
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=62.79 E-value=2.2e+02 Score=38.24 Aligned_cols=54 Identities=11% Similarity=0.052 Sum_probs=33.1
Q ss_pred ceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecC
Q psy6572 655 RGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDA 715 (1416)
Q Consensus 655 ~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~ 715 (1416)
..+++|+..|.|||---...+ ++.|..|+... ++..-.=+|||..++++-|.-.
T Consensus 378 ~~~s~D~~~glvy~ptGn~~p-----d~~g~~r~~~~--n~y~~slvALD~~TGk~~W~~Q 431 (764)
T TIGR03074 378 SVASYDEKLGLVYLPMGNQTP-----DQWGGDRTPAD--EKYSSSLVALDATTGKERWVFQ 431 (764)
T ss_pred CceEEcCCCCeEEEeCCCccc-----cccCCccccCc--ccccceEEEEeCCCCceEEEec
Confidence 457999999999996522211 12244443221 2323344889999999999753
No 221
>KOG2055|consensus
Probab=62.16 E-value=3.8e+02 Score=33.10 Aligned_cols=59 Identities=17% Similarity=0.119 Sum_probs=30.8
Q ss_pred eeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 656 GIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 656 gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
-||+--.+|+|+..-....-.|..+.|.|.-+ +++++. .++..|+-...+.|+..++.-
T Consensus 317 fia~~G~~G~I~lLhakT~eli~s~KieG~v~------------~~~fsS-dsk~l~~~~~~GeV~v~nl~~ 375 (514)
T KOG2055|consen 317 FIAIAGNNGHIHLLHAKTKELITSFKIEGVVS------------DFTFSS-DSKELLASGGTGEVYVWNLRQ 375 (514)
T ss_pred eEEEcccCceEEeehhhhhhhhheeeeccEEe------------eEEEec-CCcEEEEEcCCceEEEEecCC
Confidence 45555566666665544333455555555433 445543 334444444455666666544
No 222
>KOG3914|consensus
Probab=62.06 E-value=1.1e+02 Score=36.81 Aligned_cols=154 Identities=14% Similarity=0.094 Sum_probs=88.6
Q ss_pred ccCCcEEEeeCCCCeEEEeecCCCce--EEE-EcCCCCCcceeeecCCcceEEEeeCCC-CceEEEEecC-CCCCEEEee
Q psy6572 618 WVGRNLYWCDKGLDTIEVAKLDGRFR--KVL-INKGLQEPRGIALNPAYGYMYWTDWGQ-NAHIGKAKMD-GSNPKVIIS 692 (1416)
Q Consensus 618 ~~~~~LYwtD~~~~~I~v~~ldG~~~--~vL-i~~~l~~P~gIavDp~~g~LYWtD~g~-~~~I~ra~mD-Gs~r~vlv~ 692 (1416)
+.++.||.++... ++.++.+.++.+ +.+ ....-..|++|......-.+-++|... ...+.....+ |..+.++-
T Consensus 72 ~~~~llAv~~~~K-~~~~f~~~~~~~~~kl~~~~~v~~~~~ai~~~~~~~sv~v~dkagD~~~~di~s~~~~~~~~~lG- 149 (390)
T KOG3914|consen 72 DSGRLVAVATSSK-QRAVFDYRENPKGAKLLDVSCVPKRPTAISFIREDTSVLVADKAGDVYSFDILSADSGRCEPILG- 149 (390)
T ss_pred CCceEEEEEeCCC-ceEEEEEecCCCcceeeeEeecccCcceeeeeeccceEEEEeecCCceeeeeecccccCcchhhh-
Confidence 3445555555543 333344433332 111 112345688888887777777777432 2233333333 44443332
Q ss_pred cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceE-EEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEec
Q psy6572 693 KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIK-IIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCD 771 (1416)
Q Consensus 693 ~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~-~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~n 771 (1416)
.+..-..++|-+...+|.-+|. -.+|....+.+.... .+..++ -.....|++..+++.|+-.+.+.|+.=+
T Consensus 150 -hvSml~dVavS~D~~~IitaDR-DEkIRvs~ypa~f~IesfclGH------~eFVS~isl~~~~~LlS~sGD~tlr~Wd 221 (390)
T KOG3914|consen 150 -HVSMLLDVAVSPDDQFIITADR-DEKIRVSRYPATFVIESFCLGH------KEFVSTISLTDNYLLLSGSGDKTLRLWD 221 (390)
T ss_pred -hhhhhheeeecCCCCEEEEecC-CceEEEEecCcccchhhhcccc------HhheeeeeeccCceeeecCCCCcEEEEe
Confidence 3444457788766667776663 457777777665432 122211 2345689999999999999999988877
Q ss_pred ccCCCceEEE
Q psy6572 772 KYTGKNCTSV 781 (1416)
Q Consensus 772 k~tG~~~~~l 781 (1416)
-.+|+....+
T Consensus 222 ~~sgk~L~t~ 231 (390)
T KOG3914|consen 222 ITSGKLLDTC 231 (390)
T ss_pred cccCCccccc
Confidence 7778765443
No 223
>KOG0646|consensus
Probab=61.60 E-value=3.9e+02 Score=32.98 Aligned_cols=70 Identities=19% Similarity=0.157 Sum_probs=39.9
Q ss_pred CceEEEEcc--CCcEEEeeCCCCeEEEeecCCCceEEEEc-CCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCC
Q psy6572 611 PDGLTVDWV--GRNLYWCDKGLDTIEVAKLDGRFRKVLIN-KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGS 685 (1416)
Q Consensus 611 p~gLAvD~~--~~~LYwtD~~~~~I~v~~ldG~~~~vLi~-~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs 685 (1416)
+..|-+..- ..+|| |-+..++|.+.++.+.. +|.+ .--..+.++||||...++|.-. +...|+...|-+.
T Consensus 177 ITDl~ig~Gg~~~rl~-TaS~D~t~k~wdlS~g~--LLlti~fp~si~av~lDpae~~~yiGt--~~G~I~~~~~~~~ 249 (476)
T KOG0646|consen 177 ITDLQIGSGGTNARLY-TASEDRTIKLWDLSLGV--LLLTITFPSSIKAVALDPAERVVYIGT--EEGKIFQNLLFKL 249 (476)
T ss_pred eEEEEecCCCccceEE-EecCCceEEEEEeccce--eeEEEecCCcceeEEEcccccEEEecC--CcceEEeeehhcC
Confidence 344444432 23444 33445566666655432 1111 1234578999999999999865 2346777766554
No 224
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=60.23 E-value=2.6e+02 Score=30.57 Aligned_cols=90 Identities=11% Similarity=0.018 Sum_probs=51.5
Q ss_pred CceEEEEccCCcEEEeeC-CCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCC-ceEEEEecCCCCCE
Q psy6572 611 PDGLTVDWVGRNLYWCDK-GLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQN-AHIGKAKMDGSNPK 688 (1416)
Q Consensus 611 p~gLAvD~~~~~LYwtD~-~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~-~~I~ra~mDGs~r~ 688 (1416)
+..++..|.+..+.+... ...+|...++.+.....+. -.....|...|...+|..+..+.. ..|...+++ ..+
T Consensus 62 I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~~i~~~~---~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~--~~~ 136 (194)
T PF08662_consen 62 IHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGKKIFSFG---TQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVR--KKK 136 (194)
T ss_pred eEEEEECcCCCEEEEEEccCCcccEEEcCcccEeEeec---CCCceEEEECCCCCEEEEEEccCCCcEEEEEECC--CCE
Confidence 567777777777666543 2347788887754443332 234567888888777777765432 345555554 444
Q ss_pred EEeecCCCCCeeEEeec
Q psy6572 689 VIISKNLSWPNALTISY 705 (1416)
Q Consensus 689 vlv~~~l~~P~gLaiD~ 705 (1416)
.|..........++-++
T Consensus 137 ~i~~~~~~~~t~~~WsP 153 (194)
T PF08662_consen 137 KISTFEHSDATDVEWSP 153 (194)
T ss_pred EeeccccCcEEEEEEcC
Confidence 44443333344455544
No 225
>KOG0289|consensus
Probab=59.63 E-value=4.1e+02 Score=32.58 Aligned_cols=62 Identities=18% Similarity=0.149 Sum_probs=42.7
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcC-CCCCcceeeecCCcceEEEee
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINK-GLQEPRGIALNPAYGYMYWTD 670 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~-~l~~P~gIavDp~~g~LYWtD 670 (1416)
-..+.+|.+.+.+..|.|++.....++.---+|+...++... .--.-..++++|. |.||-|-
T Consensus 303 ~~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpD-gLifgtg 365 (506)
T KOG0289|consen 303 EEPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPD-GLIFGTG 365 (506)
T ss_pred cccceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcCC-ceEEecc
Confidence 455689999999999988876554444444566665555542 3334678889887 8888875
No 226
>KOG0308|consensus
Probab=58.91 E-value=2.5e+02 Score=35.95 Aligned_cols=160 Identities=14% Similarity=0.067 Sum_probs=80.6
Q ss_pred cceeeeeecCCCeEEEeeccCCCccEEEEecCCCC-eEEee-------------cCCCceEEEEccCCcEEEeeCCCCeE
Q psy6572 568 NAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQP-ELLFP-------------ATSPDGLTVDWVGRNLYWCDKGLDTI 633 (1416)
Q Consensus 568 ~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~-~~l~~-------------l~~p~gLAvD~~~~~LYwtD~~~~~I 633 (1416)
.+..|+|-...+.|+.+-.-. ++|+...+++.. +++.+ .....+||... ++.|+++-...+-|
T Consensus 119 YVkcla~~ak~~~lvaSgGLD--~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~-t~t~ivsGgtek~l 195 (735)
T KOG0308|consen 119 YVKCLAYIAKNNELVASGGLD--RKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQ-TGTIIVSGGTEKDL 195 (735)
T ss_pred hheeeeecccCceeEEecCCC--ccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCC-cceEEEecCcccce
Confidence 345566644555555554432 467777766221 11111 23445666653 33566555444555
Q ss_pred EEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCC--CEEEeecCCCCCeeEEeecCCCeEE
Q psy6572 634 EVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSN--PKVIISKNLSWPNALTISYETNELF 711 (1416)
Q Consensus 634 ~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~--r~vlv~~~l~~P~gLaiD~~~~rLY 711 (1416)
.+.+.....+.+-+.+.....|.|.|+....+|.=+. ..+.|..-+|.-.. .+.++.+...| +|..++.-..||
T Consensus 196 r~wDprt~~kimkLrGHTdNVr~ll~~dDGt~~ls~s--SDgtIrlWdLgqQrCl~T~~vH~e~VW--aL~~~~sf~~vY 271 (735)
T KOG0308|consen 196 RLWDPRTCKKIMKLRGHTDNVRVLLVNDDGTRLLSAS--SDGTIRLWDLGQQRCLATYIVHKEGVW--ALQSSPSFTHVY 271 (735)
T ss_pred EEeccccccceeeeeccccceEEEEEcCCCCeEeecC--CCceEEeeeccccceeeeEEeccCceE--EEeeCCCcceEE
Confidence 5555444333333334556677777776644443332 12234333333221 12233334433 566666667788
Q ss_pred EecCCCCeEEEEeCCC-CceEEEEe
Q psy6572 712 WGDAHEDYIAVSDLNG-ENIKIIVS 735 (1416)
Q Consensus 712 WtD~~~~~I~~~~ldG-~~r~~v~~ 735 (1416)
..+. .+.|++.++.. .....|..
T Consensus 272 sG~r-d~~i~~Tdl~n~~~~tlick 295 (735)
T KOG0308|consen 272 SGGR-DGNIYRTDLRNPAKSTLICK 295 (735)
T ss_pred ecCC-CCcEEecccCCchhheEeec
Confidence 7764 56788888766 44444444
No 227
>KOG0274|consensus
Probab=58.82 E-value=5e+02 Score=33.37 Aligned_cols=202 Identities=14% Similarity=0.049 Sum_probs=101.4
Q ss_pred ccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceE
Q psy6572 565 NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRK 644 (1416)
Q Consensus 565 ~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~ 644 (1416)
....+.+|+|.... .++++-... .+++..+..+..-+..-......+...-+.+.+..+-+...+|.|-++......
T Consensus 248 H~g~V~~l~~~~~~-~~lvsgS~D--~t~rvWd~~sg~C~~~l~gh~stv~~~~~~~~~~~sgs~D~tVkVW~v~n~~~l 324 (537)
T KOG0274|consen 248 HFGGVWGLAFPSGG-DKLVSGSTD--KTERVWDCSTGECTHSLQGHTSSVRCLTIDPFLLVSGSRDNTVKVWDVTNGACL 324 (537)
T ss_pred CCCCceeEEEecCC-CEEEEEecC--CcEEeEecCCCcEEEEecCCCceEEEEEccCceEeeccCCceEEEEeccCcceE
Confidence 45567788887644 444444443 267777766332222112333434333355666666566678888888744443
Q ss_pred EEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEe
Q psy6572 645 VLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSD 724 (1416)
Q Consensus 645 vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ 724 (1416)
.++.......+.|.++ ..+||-.-....-+|+.+. -|.-.+. +.....+-.+|+++.. .++|=.-. ...|..-+
T Consensus 325 ~l~~~h~~~V~~v~~~--~~~lvsgs~d~~v~VW~~~-~~~cl~s-l~gH~~~V~sl~~~~~-~~~~Sgs~-D~~IkvWd 398 (537)
T KOG0274|consen 325 NLLRGHTGPVNCVQLD--EPLLVSGSYDGTVKVWDPR-TGKCLKS-LSGHTGRVYSLIVDSE-NRLLSGSL-DTTIKVWD 398 (537)
T ss_pred EEeccccccEEEEEec--CCEEEEEecCceEEEEEhh-hceeeee-ecCCcceEEEEEecCc-ceEEeeee-ccceEeec
Confidence 3333345556777777 4566655433222344433 1111111 2224455667777643 55554432 35677777
Q ss_pred CCCC-ceEEEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEecccCCCceEEEE
Q psy6572 725 LNGE-NIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCTSVV 782 (1416)
Q Consensus 725 ldG~-~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~ 782 (1416)
+.+. .....+.+. ..-..+|.+... ++.+....+.|..=+..+++.+.++-
T Consensus 399 l~~~~~c~~tl~~h------~~~v~~l~~~~~-~Lvs~~aD~~Ik~WD~~~~~~~~~~~ 450 (537)
T KOG0274|consen 399 LRTKRKCIHTLQGH------TSLVSSLLLRDN-FLVSSSADGTIKLWDAEEGECLRTLE 450 (537)
T ss_pred CCchhhhhhhhcCC------cccccccccccc-eeEeccccccEEEeecccCceeeeec
Confidence 7766 333222221 111133333344 44444444455555555666555553
No 228
>KOG0270|consensus
Probab=57.75 E-value=4.4e+02 Score=32.37 Aligned_cols=158 Identities=11% Similarity=0.060 Sum_probs=85.5
Q ss_pred EEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEee-cCCCceEEEEccCCcEE
Q psy6572 550 IREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFP-ATSPDGLTVDWVGRNLY 624 (1416)
Q Consensus 550 I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~-l~~p~gLAvD~~~~~LY 624 (1416)
|..-+++... ..+......+..|++++.+..+..+-.... ++...... +....-.+ ...++-|+.++..-+.|
T Consensus 268 V~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~--~V~l~D~R~~~~s~~~wk~~g~VEkv~w~~~se~~f 345 (463)
T KOG0270|consen 268 VKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDG--TVALKDCRDPSNSGKEWKFDGEVEKVAWDPHSENSF 345 (463)
T ss_pred EEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccccc--eEEeeeccCccccCceEEeccceEEEEecCCCceeE
Confidence 3333444333 344445567788888887777666655432 44444333 11111111 45566777777777777
Q ss_pred EeeCCCCeEEEeecC--CCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeec--CCCCCee
Q psy6572 625 WCDKGLDTIEVAKLD--GRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK--NLSWPNA 700 (1416)
Q Consensus 625 wtD~~~~~I~v~~ld--G~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~--~l~~P~g 700 (1416)
+.....++++-+++. |.-..+|.... ....||.+....-.|..|.... ..+..-++++.+.+.+... .+..-..
T Consensus 346 ~~~tddG~v~~~D~R~~~~~vwt~~AHd-~~ISgl~~n~~~p~~l~t~s~d-~~Vklw~~~~~~~~~v~~~~~~~~rl~c 423 (463)
T KOG0270|consen 346 FVSTDDGTVYYFDIRNPGKPVWTLKAHD-DEISGLSVNIQTPGLLSTASTD-KVVKLWKFDVDSPKSVKEHSFKLGRLHC 423 (463)
T ss_pred EEecCCceEEeeecCCCCCceeEEEecc-CCcceEEecCCCCcceeecccc-ceEEEEeecCCCCcccccccccccceee
Confidence 776666666666653 33333343322 2567888888888887775333 3455556666655444332 2333344
Q ss_pred EEeecCCCeEE
Q psy6572 701 LTISYETNELF 711 (1416)
Q Consensus 701 LaiD~~~~rLY 711 (1416)
+++++..-.+|
T Consensus 424 ~~~~~~~a~~l 434 (463)
T KOG0270|consen 424 FALDPDVAFTL 434 (463)
T ss_pred cccCCCcceEE
Confidence 45544333333
No 229
>KOG2139|consensus
Probab=56.95 E-value=2.4e+02 Score=33.69 Aligned_cols=155 Identities=14% Similarity=0.067 Sum_probs=73.5
Q ss_pred eeeecCCCeEEEeeccCCC-ccEEEEecCCCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEc
Q psy6572 572 LDFDWVDNCLYWSDVTMHG-SSIRRSCNNSQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLIN 648 (1416)
Q Consensus 572 l~~D~~~~~LYwtD~~~~~-~~I~r~~l~s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~ 648 (1416)
|++--+.+..+|++..... .+..|+.-+...+++.. -..+..|+-...+..|--+..+...|.+-+.+......|+.
T Consensus 156 lavgCr~gIciW~~s~tln~~r~~~~~s~~~~qvl~~pgh~pVtsmqwn~dgt~l~tAS~gsssi~iWdpdtg~~~pL~~ 235 (445)
T KOG2139|consen 156 LAVGCRAGICIWSDSRTLNANRNIRMMSTHHLQVLQDPGHNPVTSMQWNEDGTILVTASFGSSSIMIWDPDTGQKIPLIP 235 (445)
T ss_pred eeeeecceeEEEEcCcccccccccccccccchhheeCCCCceeeEEEEcCCCCEEeecccCcceEEEEcCCCCCcccccc
Confidence 4444566778888765321 11112111111122221 11123333332223333233345567777777665555554
Q ss_pred CCCCCcceeeecCCcceEEEeeCCCCceEE-EEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 649 KGLQEPRGIALNPAYGYMYWTDWGQNAHIG-KAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 649 ~~l~~P~gIavDp~~g~LYWtD~g~~~~I~-ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
.++....-|--.|...+||-+......+++ -..+--+.|.++... .-.+-.-++...+|.++-.+..+|++..++|
T Consensus 236 ~glgg~slLkwSPdgd~lfaAt~davfrlw~e~q~wt~erw~lgsg---rvqtacWspcGsfLLf~~sgsp~lysl~f~~ 312 (445)
T KOG2139|consen 236 KGLGGFSLLKWSPDGDVLFAATCDAVFRLWQENQSWTKERWILGSG---RVQTACWSPCGSFLLFACSGSPRLYSLTFDG 312 (445)
T ss_pred cCCCceeeEEEcCCCCEEEEecccceeeeehhcccceecceeccCC---ceeeeeecCCCCEEEEEEcCCceEEEEeecC
Confidence 443333334455666666655433222333 222222333333322 2233445566778888888888888888776
Q ss_pred Cc
Q psy6572 728 EN 729 (1416)
Q Consensus 728 ~~ 729 (1416)
..
T Consensus 313 ~~ 314 (445)
T KOG2139|consen 313 ED 314 (445)
T ss_pred CC
Confidence 54
No 230
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=56.91 E-value=3.5e+02 Score=30.98 Aligned_cols=200 Identities=15% Similarity=0.079 Sum_probs=96.1
Q ss_pred ceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEEeeCCCCeEEEe--ecCCCceEEE
Q psy6572 569 AVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVA--KLDGRFRKVL 646 (1416)
Q Consensus 569 ~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~--~ldG~~~~vL 646 (1416)
+..+++.+..+.+.++.....+..++....+.....+..........+|.. +.|+..+.+.....+. ..+|....+.
T Consensus 26 ~~s~AvS~dg~~~A~v~~~~~~~~L~~~~~~~~~~~~~~g~~l~~PS~d~~-g~~W~v~~~~~~~~~~~~~~~g~~~~~~ 104 (253)
T PF10647_consen 26 VTSPAVSPDGSRVAAVSEGDGGRSLYVGPAGGPVRPVLTGGSLTRPSWDPD-GWVWTVDDGSGGVRVVRDSASGTGEPVE 104 (253)
T ss_pred ccceEECCCCCeEEEEEEcCCCCEEEEEcCCCcceeeccCCccccccccCC-CCEEEEEcCCCceEEEEecCCCcceeEE
Confidence 445555555666655552222346666666544444434223344467776 7777776655544333 2445554444
Q ss_pred Ec-CCCC-CcceeeecCCcceE-EEeeCCCCceEEEEec--CCCC-CEEEee------cCCCCCeeEEeecCCCeEEEe-
Q psy6572 647 IN-KGLQ-EPRGIALNPAYGYM-YWTDWGQNAHIGKAKM--DGSN-PKVIIS------KNLSWPNALTISYETNELFWG- 713 (1416)
Q Consensus 647 i~-~~l~-~P~gIavDp~~g~L-YWtD~g~~~~I~ra~m--DGs~-r~vlv~------~~l~~P~gLaiD~~~~rLYWt- 713 (1416)
+. ..+. ...+|+|.|..-+| ++...+...+|+.+.+ ++.. ...+.. ..+....+++--. ...|.+.
T Consensus 105 v~~~~~~~~I~~l~vSpDG~RvA~v~~~~~~~~v~va~V~r~~~g~~~~l~~~~~~~~~~~~~v~~v~W~~-~~~L~V~~ 183 (253)
T PF10647_consen 105 VDWPGLRGRITALRVSPDGTRVAVVVEDGGGGRVYVAGVVRDGDGVPRRLTGPRRVAPPLLSDVTDVAWSD-DSTLVVLG 183 (253)
T ss_pred ecccccCCceEEEEECCCCcEEEEEEecCCCCeEEEEEEEeCCCCCcceeccceEecccccCcceeeeecC-CCEEEEEe
Confidence 43 2333 67889999886654 4443333345555543 3333 222211 1122233444432 3344443
Q ss_pred cCCCCeEEE-EeCCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCCCeeEEecccCCCceEEE
Q psy6572 714 DAHEDYIAV-SDLNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEMKSIERCDKYTGKNCTSV 781 (1416)
Q Consensus 714 D~~~~~I~~-~~ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~I~~~nk~tG~~~~~l 781 (1416)
......+.. +..+|.....+... ...+-.+++ ....+|.++. +.|++ ...+..-+.+
T Consensus 184 ~~~~~~~~~~v~~dG~~~~~l~~~-------~~~~~v~a~~~~~~~~~~t~~--~~~~~--~~~~~~W~~v 243 (253)
T PF10647_consen 184 RSAGGPVVRLVSVDGGPSTPLPSV-------NLGVPVVAVAASPSTVYVTDD--GGVLQ--SRSGASWREV 243 (253)
T ss_pred CCCCCceeEEEEccCCcccccCCC-------CCCcceEEeeCCCcEEEEECC--CcEEE--CCCCCcceEc
Confidence 333333444 66777666555222 122223333 4556777763 44554 3445544444
No 231
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=56.63 E-value=1.2e+02 Score=35.23 Aligned_cols=81 Identities=16% Similarity=0.245 Sum_probs=49.9
Q ss_pred CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCe
Q psy6572 630 LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNE 709 (1416)
Q Consensus 630 ~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~r 709 (1416)
..+|+...+=|...+. +.+ .-=||+.....||+--|-..|.|++-.-+|+ +++....+-.+....-++-.+=+
T Consensus 19 ~~~iY~felvG~~P~S----GGD--TYNAV~~vDd~IyFGGWVHAPa~y~gk~~g~-~~IdF~NKYSHVH~yd~e~~~Vr 91 (339)
T PF09910_consen 19 SEKIYRFELVGPPPTS----GGD--TYNAVEWVDDFIYFGGWVHAPAVYEGKGDGR-ATIDFRNKYSHVHEYDTENDSVR 91 (339)
T ss_pred ceEEEEeeeccCCCCC----CCc--cceeeeeecceEEEeeeecCCceeeeccCCc-eEEEEeeccceEEEEEcCCCeEE
Confidence 3467776665543221 111 2246777889999999987788888877777 45555556555555544433446
Q ss_pred EEEecCCC
Q psy6572 710 LFWGDAHE 717 (1416)
Q Consensus 710 LYWtD~~~ 717 (1416)
|.|.+.-.
T Consensus 92 LLWkesih 99 (339)
T PF09910_consen 92 LLWKESIH 99 (339)
T ss_pred EEEecccC
Confidence 78877543
No 232
>PTZ00420 coronin; Provisional
Probab=56.54 E-value=5.6e+02 Score=33.19 Aligned_cols=114 Identities=10% Similarity=-0.026 Sum_probs=66.6
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecC-CC--Ce-------EEee-cCCCceEEEEccCCcEEEeeCCCCeEE
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQ--PE-------LLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIE 634 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~--~~-------~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~ 634 (1416)
...+..|+|.+..+.++.+-... +.|+...+. .. .. ++.. ...+..|++.+.+.+|+.+-...+.|.
T Consensus 74 ~~~V~~lafsP~~~~lLASgS~D--gtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIr 151 (568)
T PTZ00420 74 TSSILDLQFNPCFSEILASGSED--LTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVN 151 (568)
T ss_pred CCCEEEEEEcCCCCCEEEEEeCC--CeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEE
Confidence 34567888887655455544433 367666665 21 11 1222 445778888888777777777778898
Q ss_pred EeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCC
Q psy6572 635 VAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDG 684 (1416)
Q Consensus 635 v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDG 684 (1416)
+.++........+. ......+|+++|...+| .+-. ....|...++..
T Consensus 152 IWDl~tg~~~~~i~-~~~~V~SlswspdG~lL-at~s-~D~~IrIwD~Rs 198 (568)
T PTZ00420 152 IWDIENEKRAFQIN-MPKKLSSLKWNIKGNLL-SGTC-VGKHMHIIDPRK 198 (568)
T ss_pred EEECCCCcEEEEEe-cCCcEEEEEECCCCCEE-EEEe-cCCEEEEEECCC
Confidence 88886544322222 22457789998875444 4332 123455555553
No 233
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport.; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=56.29 E-value=2e+02 Score=35.57 Aligned_cols=61 Identities=10% Similarity=0.293 Sum_probs=31.6
Q ss_pred cceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeec------------------C-CCCCeeEEeecCCCeEEEec
Q psy6572 654 PRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK------------------N-LSWPNALTISYETNELFWGD 714 (1416)
Q Consensus 654 P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~------------------~-l~~P~gLaiD~~~~rLYWtD 714 (1416)
|..|.|....++||++.|... .|...++.-...-.++.. . ...|+=|.|....+||||+.
T Consensus 314 itDI~iSlDDrfLYvs~W~~G-dvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlYvTn 392 (461)
T PF05694_consen 314 ITDILISLDDRFLYVSNWLHG-DVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLYVTN 392 (461)
T ss_dssp ---EEE-TTS-EEEEEETTTT-EEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEEEE-
T ss_pred eEeEEEccCCCEEEEEcccCC-cEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEEEEe
Confidence 577778888899999999854 566666554433332221 1 23467778877899999987
Q ss_pred C
Q psy6572 715 A 715 (1416)
Q Consensus 715 ~ 715 (1416)
+
T Consensus 393 S 393 (461)
T PF05694_consen 393 S 393 (461)
T ss_dssp -
T ss_pred e
Confidence 4
No 234
>KOG1036|consensus
Probab=55.23 E-value=3.8e+02 Score=31.48 Aligned_cols=206 Identities=11% Similarity=0.000 Sum_probs=0.0
Q ss_pred cEEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEE
Q psy6572 547 KYYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYW 625 (1416)
Q Consensus 547 ~~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYw 625 (1416)
.+.|+++++.+.. ..+..+...+..|.|-+..+.|.-..+.. +|..++......+...-..-.--++|-.+++|.+
T Consensus 74 dg~vr~~Dln~~~~~~igth~~~i~ci~~~~~~~~vIsgsWD~---~ik~wD~R~~~~~~~~d~~kkVy~~~v~g~~LvV 150 (323)
T KOG1036|consen 74 DGQVRRYDLNTGNEDQIGTHDEGIRCIEYSYEVGCVISGSWDK---TIKFWDPRNKVVVGTFDQGKKVYCMDVSGNRLVV 150 (323)
T ss_pred CceEEEEEecCCcceeeccCCCceEEEEeeccCCeEEEcccCc---cEEEEeccccccccccccCceEEEEeccCCEEEE
Q ss_pred -eeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCC------------CCEEEee
Q psy6572 626 -CDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGS------------NPKVIIS 692 (1416)
Q Consensus 626 -tD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs------------~r~vlv~ 692 (1416)
+....-.|+-.+.-......-.+.--...|.|++-| ++-=|+.-.-+. +|.+-.+|-+ +|...-.
T Consensus 151 g~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~p-n~eGy~~sSieG-RVavE~~d~s~~~~skkyaFkCHr~~~~~ 228 (323)
T KOG1036|consen 151 GTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVP-NGEGYVVSSIEG-RVAVEYFDDSEEAQSKKYAFKCHRLSEKD 228 (323)
T ss_pred eecCceEEEEEcccccchhhhccccceeEEEEEEEec-CCCceEEEeecc-eEEEEccCCchHHhhhceeEEeeecccCC
Q ss_pred cCCCCC-eeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE--ecCcEEEeecCC
Q psy6572 693 KNLSWP-NALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV--FEDHLFWTDWEM 764 (1416)
Q Consensus 693 ~~l~~P-~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~ 764 (1416)
..+.+| |+|++.+. ..-|.+-...+.|..-++--..|-..+..- -....+|++ .+..|-++....
T Consensus 229 ~~~~yPVNai~Fhp~-~~tfaTgGsDG~V~~Wd~~~rKrl~q~~~~------~~SI~slsfs~dG~~LAia~sy~ 296 (323)
T KOG1036|consen 229 TEIIYPVNAIAFHPI-HGTFATGGSDGIVNIWDLFNRKRLKQLAKY------ETSISSLSFSMDGSLLAIASSYQ 296 (323)
T ss_pred ceEEEEeceeEeccc-cceEEecCCCceEEEccCcchhhhhhccCC------CCceEEEEeccCCCeEEEEechh
No 235
>KOG0277|consensus
Probab=54.77 E-value=3.8e+02 Score=30.73 Aligned_cols=59 Identities=17% Similarity=0.187 Sum_probs=26.6
Q ss_pred ceEEEEc--cCCcEEEeeCCCCeEEEeecCCCc-eEEEEcCCCCCcceeeecCCcceEEEeeC
Q psy6572 612 DGLTVDW--VGRNLYWCDKGLDTIEVAKLDGRF-RKVLINKGLQEPRGIALNPAYGYMYWTDW 671 (1416)
Q Consensus 612 ~gLAvD~--~~~~LYwtD~~~~~I~v~~ldG~~-~~vLi~~~l~~P~gIavDp~~g~LYWtD~ 671 (1416)
+-.+||| +.+.++.+.+-.++|..-.++-.. ..++ .+.-....+.+..|....||-+-.
T Consensus 106 EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~Sv~Tf-~gh~~~Iy~a~~sp~~~nlfas~S 167 (311)
T KOG0277|consen 106 EVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNSVQTF-NGHNSCIYQAAFSPHIPNLFASAS 167 (311)
T ss_pred heEEeccccccceeEEeeccCCceEeecCCCCcceEee-cCCccEEEEEecCCCCCCeEEEcc
Confidence 3455665 445666666555555544433221 1121 122222344445555555554443
No 236
>KOG1225|consensus
Probab=54.61 E-value=50 Score=41.70 Aligned_cols=102 Identities=32% Similarity=0.855 Sum_probs=54.8
Q ss_pred CceeecCCcEe-CCceeeCCcCCCCCCCCccccccccCCCCCCCCCCCceeeCCCeeecCCcccCCCCCCCCCCCCCCcc
Q psy6572 373 THFECQNGNCI-PSVLLCNGVNDCDDNSDEDMNHAECRSLKDLCKHPSHFLCSNGLCINETLTCNDINDCGDNSDEFSCF 451 (1416)
Q Consensus 373 ~~f~C~~g~CI-~~~~~CDg~~DC~DgSDE~~~~~~C~~~~~~C~~~~~f~C~~g~Ci~~~~~Cdg~~dC~dgsDe~~C~ 451 (1416)
+.++|..|+|| +..|. -.|| || ..|... | ...+.|.+|+|| |.++.-...|.
T Consensus 258 ~~g~c~~G~CIC~~Gf~---G~dC----~e----~~Cp~~---c--s~~g~~~~g~Ci-----------C~~g~~G~dCs 310 (525)
T KOG1225|consen 258 GRGQCVEGRCICPPGFT---GDDC----DE----LVCPVD---C--SGGGVCVDGECI-----------CNPGYSGKDCS 310 (525)
T ss_pred ccceEeCCeEeCCCCCc---CCCC----Cc----ccCCcc---c--CCCceecCCEee-----------cCCCccccccc
Confidence 44889999998 33332 1122 12 234331 2 344556666654 44444444555
Q ss_pred cccccCCCCCCcccc--cceecCCceEEeeCCCceecCCCCCccccCCcCCCCCccceeeecCCeeeecCCCCcE
Q psy6572 452 VNECNVSHGGQLCAH--ECIDLKIGYKCACRKGYQVHPEDKHLCVDTNECLDRPCSHYCRNTLGSYSCSCAPGYA 524 (1416)
Q Consensus 452 i~eC~~~~~~~~Cs~--~C~nt~~gy~C~C~~Gy~L~p~d~~tC~didEC~~~~Csq~C~nt~gsy~C~C~~Gy~ 524 (1416)
+-.|.. . |+. .|+ .-+|.|.+||. +.+|.... |..+. .|+| + |.|..||+
T Consensus 311 ~~~cpa---d--C~g~G~Ci----~G~C~C~~Gy~-----G~~C~~~~-C~~~g---~cv~---g--C~C~~Gw~ 362 (525)
T KOG1225|consen 311 IRRCPA---D--CSGHGKCI----DGECLCDEGYT-----GELCIQRA-CSGGG---QCVN---G--CKCKKGWR 362 (525)
T ss_pred cccCCc---c--CCCCCccc----CCceEeCCCCc-----CCcccccc-cCCCc---eecc---C--ceeccCcc
Confidence 544421 1 332 455 35799999994 44666553 54221 3444 2 88999997
No 237
>KOG0281|consensus
Probab=54.60 E-value=88 Score=36.77 Aligned_cols=150 Identities=13% Similarity=0.051 Sum_probs=87.0
Q ss_pred EEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-C-CCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEee
Q psy6572 561 IRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-S-QPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAK 637 (1416)
Q Consensus 561 ~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s-~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ 637 (1416)
++....+.+.-|.||. +|.++.... ++|.....+ + ...+++. -..+-+|.| ++.+.+|-+...+|.|-+
T Consensus 232 ~L~GHtGSVLCLqyd~---rviisGSSD--sTvrvWDv~tge~l~tlihHceaVLhlrf---~ng~mvtcSkDrsiaVWd 303 (499)
T KOG0281|consen 232 ILTGHTGSVLCLQYDE---RVIVSGSSD--STVRVWDVNTGEPLNTLIHHCEAVLHLRF---SNGYMVTCSKDRSIAVWD 303 (499)
T ss_pred hhhcCCCcEEeeeccc---eEEEecCCC--ceEEEEeccCCchhhHHhhhcceeEEEEE---eCCEEEEecCCceeEEEe
Confidence 3344455667788864 366665553 477777777 2 2233333 444555555 456667778888899988
Q ss_pred cCCCc----eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEe
Q psy6572 638 LDGRF----RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWG 713 (1416)
Q Consensus 638 ldG~~----~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWt 713 (1416)
|+... ++||+ +.....+. ||.... +.++..|.. .|.+-+++.-. ++..--++-+|||--...+||.++
T Consensus 304 m~sps~it~rrVLv-GHrAaVNv--Vdfd~k-yIVsASgDR-TikvW~~st~e---fvRtl~gHkRGIAClQYr~rlvVS 375 (499)
T KOG0281|consen 304 MASPTDITLRRVLV-GHRAAVNV--VDFDDK-YIVSASGDR-TIKVWSTSTCE---FVRTLNGHKRGIACLQYRDRLVVS 375 (499)
T ss_pred ccCchHHHHHHHHh-hhhhheee--eccccc-eEEEecCCc-eEEEEecccee---eehhhhcccccceehhccCeEEEe
Confidence 87654 44554 22222333 333323 333333433 55555554322 222223466889887789999998
Q ss_pred cCCCCeEEEEeCC
Q psy6572 714 DAHEDYIAVSDLN 726 (1416)
Q Consensus 714 D~~~~~I~~~~ld 726 (1416)
-+..++|...++.
T Consensus 376 GSSDntIRlwdi~ 388 (499)
T KOG0281|consen 376 GSSDNTIRLWDIE 388 (499)
T ss_pred cCCCceEEEEecc
Confidence 8877888877764
No 238
>KOG2048|consensus
Probab=54.25 E-value=5.4e+02 Score=33.29 Aligned_cols=160 Identities=12% Similarity=0.090 Sum_probs=88.6
Q ss_pred EEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEE-ee-----cCCCceEEEEccCCcEEEeeCCCCeEE
Q psy6572 561 IRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELL-FP-----ATSPDGLTVDWVGRNLYWCDKGLDTIE 634 (1416)
Q Consensus 561 ~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l-~~-----l~~p~gLAvD~~~~~LYwtD~~~~~I~ 634 (1416)
+......++.--+.-+..+.|-+.-... .+|+|+..+..+++. +. +.....|.+-..+.+||........++
T Consensus 377 l~~k~~~nIs~~aiSPdg~~Ia~st~~~--~~iy~L~~~~~vk~~~v~~~~~~~~~a~~i~ftid~~k~~~~s~~~~~le 454 (691)
T KOG2048|consen 377 LFTKEKENISCAAISPDGNLIAISTVSR--TKIYRLQPDPNVKVINVDDVPLALLDASAISFTIDKNKLFLVSKNIFSLE 454 (691)
T ss_pred eecCCccceeeeccCCCCCEEEEeeccc--eEEEEeccCcceeEEEeccchhhhccceeeEEEecCceEEEEecccceeE
Confidence 4445556666667777777777665553 377777765322211 11 233444444444666666665556777
Q ss_pred EeecCCCceEEEEc---C-CCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEee-cCCCe
Q psy6572 635 VAKLDGRFRKVLIN---K-GLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTIS-YETNE 709 (1416)
Q Consensus 635 v~~ldG~~~~vLi~---~-~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD-~~~~r 709 (1416)
...+.+...+-|.. . ......-|++-|...||-.++ ....|...++.+...+.++..--..-++.++. ...++
T Consensus 455 ~~el~~ps~kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~--t~g~I~v~nl~~~~~~~l~~rln~~vTa~~~~~~~~~~ 532 (691)
T KOG2048|consen 455 EFELETPSFKELKSIQSQAKCPSISRLVVSSDGNYIAAIS--TRGQIFVYNLETLESHLLKVRLNIDVTAAAFSPFVRNR 532 (691)
T ss_pred EEEecCcchhhhhccccccCCCcceeEEEcCCCCEEEEEe--ccceEEEEEcccceeecchhccCcceeeeeccccccCc
Confidence 77777655444432 2 233445788888888877776 33578899988876655552110112233333 23455
Q ss_pred EEEecCCCCeEEEEeC
Q psy6572 710 LFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 710 LYWtD~~~~~I~~~~l 725 (1416)
|.++++. +.|+-.++
T Consensus 533 lvvats~-nQv~efdi 547 (691)
T KOG2048|consen 533 LVVATSN-NQVFEFDI 547 (691)
T ss_pred EEEEecC-CeEEEEec
Confidence 5555543 33333343
No 239
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=54.17 E-value=3.8e+02 Score=30.52 Aligned_cols=111 Identities=19% Similarity=0.184 Sum_probs=63.6
Q ss_pred CCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEE----cCCC-CCcceeeecCCcceEEEe-eCCCCceEEEEec
Q psy6572 609 TSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLI----NKGL-QEPRGIALNPAYGYMYWT-DWGQNAHIGKAKM 682 (1416)
Q Consensus 609 ~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi----~~~l-~~P~gIavDp~~g~LYWt-D~g~~~~I~ra~m 682 (1416)
....||.+-+.++.||=. ...++|+.++......+.+- ...+ ..+.++.++|.-.+|.+. +.+++ .|++.
T Consensus 27 e~l~GID~Rpa~G~LYgl-~~~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs~~GqN---lR~np 102 (236)
T PF14339_consen 27 ESLVGIDFRPANGQLYGL-GSTGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVSNTGQN---LRLNP 102 (236)
T ss_pred CeEEEEEeecCCCCEEEE-eCCCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEccCCcE---EEECC
Confidence 445688899999999987 44578988886644433331 1111 226777888888888776 33333 34444
Q ss_pred CCCCCEEEeecCCC----------CCeeEEeecC--------CCeEEEecCCCCeEEEEe
Q psy6572 683 DGSNPKVIISKNLS----------WPNALTISYE--------TNELFWGDAHEDYIAVSD 724 (1416)
Q Consensus 683 DGs~r~vlv~~~l~----------~P~gLaiD~~--------~~rLYWtD~~~~~I~~~~ 724 (1416)
|-.. .+++...|. .|.=.+.-|. .-.||-+|..++.+....
T Consensus 103 dtGa-v~~~Dg~L~y~~gd~~~G~~p~v~aaAYTNs~~g~~t~TtLy~ID~~~~~Lv~Q~ 161 (236)
T PF14339_consen 103 DTGA-VTIVDGNLAYAAGDMNAGTTPGVTAAAYTNSFAGATTSTTLYDIDTTLDALVTQN 161 (236)
T ss_pred CCCC-ceeccCccccCCCccccCCCCceEEEEEecccCCCccceEEEEEecCCCeEEEec
Confidence 4111 122222222 2322222222 347898998888777764
No 240
>KOG0279|consensus
Probab=53.94 E-value=4.1e+02 Score=30.85 Aligned_cols=134 Identities=10% Similarity=0.038 Sum_probs=90.5
Q ss_pred cEEEEecC-CCC-eEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCC-CCCcceeeecCCcceEE
Q psy6572 592 SIRRSCNN-SQP-ELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKG-LQEPRGIALNPAYGYMY 667 (1416)
Q Consensus 592 ~I~r~~l~-s~~-~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~-l~~P~gIavDp~~g~LY 667 (1416)
.++..++. +.. ..+.. ...+-++||.+.++.| ++-+..++|...+.-|.-..++.... -.+..-+.+.|.+...|
T Consensus 86 ~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qi-vSGSrDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~~~~p~ 164 (315)
T KOG0279|consen 86 TLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQI-VSGSRDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPNESNPI 164 (315)
T ss_pred eEEEEEecCCcEEEEEEecCCceEEEEecCCCcee-ecCCCcceeeeeeecccEEEEEecCCCcCcEEEEEEcCCCCCcE
Confidence 55555666 322 22333 6667889998766665 57777788988888887776666554 67888899999876666
Q ss_pred EeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 668 WTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 668 WtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
+...+....+..-++++-..+.-....-..-+.++|.+ .+.|--.-.+.+.++..+++-
T Consensus 165 Ivs~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSp-DGslcasGgkdg~~~LwdL~~ 223 (315)
T KOG0279|consen 165 IVSASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSP-DGSLCASGGKDGEAMLWDLNE 223 (315)
T ss_pred EEEccCCceEEEEccCCcchhhccccccccEEEEEECC-CCCEEecCCCCceEEEEEccC
Confidence 66655554566667777655444444455667888875 667766666677888888753
No 241
>KOG0650|consensus
Probab=52.84 E-value=51 Score=41.32 Aligned_cols=96 Identities=9% Similarity=-0.020 Sum_probs=61.6
Q ss_pred CCCCCeEEEEecEEEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-C--CCeEEee-cC
Q psy6572 536 SDVPPNLLFTNKYYIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-S--QPELLFP-AT 609 (1416)
Q Consensus 536 ~~~~~~li~s~~~~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s--~~~~l~~-l~ 609 (1416)
.+..|+|+++.+.+||..+|.... ..+..+...+..|++++.+.+|+...... +|..+.++ + ..+++.- -.
T Consensus 575 HPs~p~lfVaTq~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d~---k~~WfDldlsskPyk~lr~H~~ 651 (733)
T KOG0650|consen 575 HPSKPYLFVATQRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYDK---KMCWFDLDLSSKPYKTLRLHEK 651 (733)
T ss_pred cCCCceEEEEeccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCCC---eeEEEEcccCcchhHHhhhhhh
Confidence 345789999999999998886644 44566778888999999999988876663 67777777 2 2222222 33
Q ss_pred CCceEEEEccCCcEEEeeCCCCeEEE
Q psy6572 610 SPDGLTVDWVGRNLYWCDKGLDTIEV 635 (1416)
Q Consensus 610 ~p~gLAvD~~~~~LYwtD~~~~~I~v 635 (1416)
..+.+|+.. .=.||-+-+..+.+.|
T Consensus 652 avr~Va~H~-ryPLfas~sdDgtv~V 676 (733)
T KOG0650|consen 652 AVRSVAFHK-RYPLFASGSDDGTVIV 676 (733)
T ss_pred hhhhhhhcc-ccceeeeecCCCcEEE
Confidence 344555542 2334444444444443
No 242
>PRK10115 protease 2; Provisional
Probab=51.22 E-value=4.6e+02 Score=34.80 Aligned_cols=111 Identities=14% Similarity=0.158 Sum_probs=60.5
Q ss_pred CCCeEEEeec-cCCCccEEEEecC--CCCeEEeec---CCCceEEEEccCCcEEEeeCC--CCeEEEeecCCCceEEEEc
Q psy6572 577 VDNCLYWSDV-TMHGSSIRRSCNN--SQPELLFPA---TSPDGLTVDWVGRNLYWCDKG--LDTIEVAKLDGRFRKVLIN 648 (1416)
Q Consensus 577 ~~~~LYwtD~-~~~~~~I~r~~l~--s~~~~l~~l---~~p~gLAvD~~~~~LYwtD~~--~~~I~v~~ldG~~~~vLi~ 648 (1416)
..+.+|+... .....+|.++.+. ...++|+.- ..+.++++ .++.|+++-.. ..+|.++++++.....|.
T Consensus 278 ~~~~ly~~tn~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~i~~~~~--~~~~l~~~~~~~g~~~l~~~~~~~~~~~~l~- 354 (686)
T PRK10115 278 YQHRFYLRSNRHGKNFGLYRTRVRDEQQWEELIPPRENIMLEGFTL--FTDWLVVEERQRGLTSLRQINRKTREVIGIA- 354 (686)
T ss_pred CCCEEEEEEcCCCCCceEEEecCCCcccCeEEECCCCCCEEEEEEE--ECCEEEEEEEeCCEEEEEEEcCCCCceEEec-
Confidence 3466776543 2233466666665 344555542 23444444 47778776553 456777777665433332
Q ss_pred CCCCCccee-----eecCCcceEEEe--eCCCCceEEEEecCCCCCEEEee
Q psy6572 649 KGLQEPRGI-----ALNPAYGYMYWT--DWGQNAHIGKAKMDGSNPKVIIS 692 (1416)
Q Consensus 649 ~~l~~P~gI-----avDp~~g~LYWt--D~g~~~~I~ra~mDGs~r~vlv~ 692 (1416)
+..|..+ ..++..+.|+++ .+...+.|++.++++...++|..
T Consensus 355 --~~~~~~~~~~~~~~~~~~~~~~~~~ss~~~P~~~y~~d~~~~~~~~l~~ 403 (686)
T PRK10115 355 --FDDPAYVTWIAYNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVLKQ 403 (686)
T ss_pred --CCCCceEeeecccCCCCCceEEEEEecCCCCCEEEEEECCCCcEEEEEe
Confidence 1223332 233455656644 44455789999988765455443
No 243
>KOG0772|consensus
Probab=50.94 E-value=4.7e+02 Score=32.74 Aligned_cols=161 Identities=12% Similarity=-0.004 Sum_probs=85.9
Q ss_pred EecccccceeeeeecCCCeEEEeeccCCCccEEEEecC---CC---CeEEee--cCCCceEEEEccCCcEEEeeCCCCeE
Q psy6572 562 RIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN---SQ---PELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTI 633 (1416)
Q Consensus 562 ~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~---s~---~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I 633 (1416)
+.++.+.+.+|++|+.+-++|--.... .|.-+.+. .. .+.|.. ....+.|++-+ ++.++.+-++..++
T Consensus 163 l~hgtk~Vsal~~Dp~GaR~~sGs~Dy---~v~~wDf~gMdas~~~fr~l~P~E~h~i~sl~ys~-Tg~~iLvvsg~aqa 238 (641)
T KOG0772|consen 163 LKHGTKIVSALAVDPSGARFVSGSLDY---TVKFWDFQGMDASMRSFRQLQPCETHQINSLQYSV-TGDQILVVSGSAQA 238 (641)
T ss_pred ccCCceEEEEeeecCCCceeeeccccc---eEEEEecccccccchhhhccCcccccccceeeecC-CCCeEEEEecCcce
Confidence 344556677889998877766433321 35545544 11 122222 45567777765 55555666677788
Q ss_pred EEeecCCCceEEEEcCC-----CCC-------cceeeecCCcceEEEeeCCC-CceEEEEecCCCCCEEEeecCC----C
Q psy6572 634 EVAKLDGRFRKVLINKG-----LQE-------PRGIALNPAYGYMYWTDWGQ-NAHIGKAKMDGSNPKVIISKNL----S 696 (1416)
Q Consensus 634 ~v~~ldG~~~~vLi~~~-----l~~-------P~gIavDp~~g~LYWtD~g~-~~~I~ra~mDGs~r~vlv~~~l----~ 696 (1416)
.+++.+|....-.+.+. +.. ...-..+|.+...|.|-... ..+|+-++---+.++||..... .
T Consensus 239 kl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv 318 (641)
T KOG0772|consen 239 KLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRV 318 (641)
T ss_pred eEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEecCCCcEEEEecCCchhheeEEeeccCCCccc
Confidence 88888887654443221 111 12223456666667664322 2356555444455566655321 2
Q ss_pred CCeeEEeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 697 WPNALTISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 697 ~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
.|..-|.++ .+.++-+--..+.|..-++-+
T Consensus 319 ~~tsC~~nr-dg~~iAagc~DGSIQ~W~~~~ 348 (641)
T KOG0772|consen 319 PVTSCAWNR-DGKLIAAGCLDGSIQIWDKGS 348 (641)
T ss_pred CceeeecCC-CcchhhhcccCCceeeeecCC
Confidence 244555554 445543434456676666533
No 244
>KOG0281|consensus
Probab=50.31 E-value=2.6e+02 Score=33.14 Aligned_cols=121 Identities=17% Similarity=0.090 Sum_probs=68.6
Q ss_pred EEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCC----eEEee-cCCCceEEEEccC
Q psy6572 549 YIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQP----ELLFP-ATSPDGLTVDWVG 620 (1416)
Q Consensus 549 ~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~----~~l~~-l~~p~gLAvD~~~ 620 (1416)
.|+.-+.+... ..+++.-.++.+|.|. ++ +.++-... .+|.+..+. ... .+|+. ...++.+.+|
T Consensus 258 TvrvWDv~tge~l~tlihHceaVLhlrf~--ng-~mvtcSkD--rsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd--- 329 (499)
T KOG0281|consen 258 TVRVWDVNTGEPLNTLIHHCEAVLHLRFS--NG-YMVTCSKD--RSIAVWDMASPTDITLRRVLVGHRAAVNVVDFD--- 329 (499)
T ss_pred eEEEEeccCCchhhHHhhhcceeEEEEEe--CC-EEEEecCC--ceeEEEeccCchHHHHHHHHhhhhhheeeeccc---
Confidence 34444443332 3344445556666662 33 33343332 367777777 222 23333 4445544444
Q ss_pred CcEEEeeCCCCeEEEeecCC-CceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEec
Q psy6572 621 RNLYWCDKGLDTIEVAKLDG-RFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM 682 (1416)
Q Consensus 621 ~~LYwtD~~~~~I~v~~ldG-~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m 682 (1416)
.++.++.++..+|.+-+++. .+.++|. ..-||||--..+|+|.++-...+ .|...+.
T Consensus 330 ~kyIVsASgDRTikvW~~st~efvRtl~----gHkRGIAClQYr~rlvVSGSSDn-tIRlwdi 387 (499)
T KOG0281|consen 330 DKYIVSASGDRTIKVWSTSTCEFVRTLN----GHKRGIACLQYRDRLVVSGSSDN-TIRLWDI 387 (499)
T ss_pred cceEEEecCCceEEEEeccceeeehhhh----cccccceehhccCeEEEecCCCc-eEEEEec
Confidence 34666777888999988764 4455543 35689999999999988865443 4444433
No 245
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport.; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=49.78 E-value=2.3e+02 Score=35.11 Aligned_cols=66 Identities=17% Similarity=0.150 Sum_probs=34.0
Q ss_pred CCeeEEeecCCCeEEEecCCCCeEEEEeCCC-CceEEEEeccCC--------CC---ccc-ccc--eeEEEecCcEEEee
Q psy6572 697 WPNALTISYETNELFWGDAHEDYIAVSDLNG-ENIKIIVSRRMD--------PT---INL-HHV--FALAVFEDHLFWTD 761 (1416)
Q Consensus 697 ~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG-~~r~~v~~~~~~--------p~---~~l-~~P--~~lav~~d~LYwtD 761 (1416)
.|..|.|....+.||++.|..+.|...++.- .+.+.+.+-... +. ..+ ..| ..|++.+.+||||.
T Consensus 313 LitDI~iSlDDrfLYvs~W~~GdvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlYvTn 392 (461)
T PF05694_consen 313 LITDILISLDDRFLYVSNWLHGDVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLYVTN 392 (461)
T ss_dssp ----EEE-TTS-EEEEEETTTTEEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEEEE-
T ss_pred ceEeEEEccCCCEEEEEcccCCcEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEEEEe
Confidence 3567777777899999999999999999754 444444321100 00 001 123 44555788999998
Q ss_pred c
Q psy6572 762 W 762 (1416)
Q Consensus 762 ~ 762 (1416)
.
T Consensus 393 S 393 (461)
T PF05694_consen 393 S 393 (461)
T ss_dssp -
T ss_pred e
Confidence 5
No 246
>KOG1217|consensus
Probab=49.73 E-value=27 Score=43.23 Aligned_cols=61 Identities=36% Similarity=0.809 Sum_probs=40.4
Q ss_pred cceecCCceEEeeCCCceecCCCC----CccccC-----------CcCC--CCCcc---ceeeecCCeeeecCCCCcEEe
Q psy6572 467 ECIDLKIGYKCACRKGYQVHPEDK----HLCVDT-----------NECL--DRPCS---HYCRNTLGSYSCSCAPGYALL 526 (1416)
Q Consensus 467 ~C~nt~~gy~C~C~~Gy~L~p~d~----~tC~di-----------dEC~--~~~Cs---q~C~nt~gsy~C~C~~Gy~L~ 526 (1416)
.|.++.++|.|.|++||.+..... ..|.+. ..|. ...|. ..|+++.++|.|.|++||.+.
T Consensus 184 ~C~~~~~~~~C~c~~~~~~~~~~~~~~~~~c~~~~~~~~~~g~~~~~c~~~~~~~~~~~~~c~~~~~~~~C~~~~g~~~~ 263 (487)
T KOG1217|consen 184 TCVNTGGSYLCSCPPGYTGSTCETTGNGGTCVDSVACSCPPGARGPECEVSIVECASGDGTCVNTVGSYTCRCPEGYTGD 263 (487)
T ss_pred ccccCCCCeeEeCCCCccCCcCcCCCCCceEecceeccCCCCCCCCCcccccccccCCCCcccccCCceeeeCCCCcccc
Confidence 689999999999999997651111 123321 2222 11222 568888889999999998755
Q ss_pred c
Q psy6572 527 S 527 (1416)
Q Consensus 527 ~ 527 (1416)
.
T Consensus 264 ~ 264 (487)
T KOG1217|consen 264 A 264 (487)
T ss_pred c
Confidence 4
No 247
>KOG0286|consensus
Probab=49.33 E-value=4.9e+02 Score=30.43 Aligned_cols=154 Identities=12% Similarity=0.136 Sum_probs=88.4
Q ss_pred CcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCC-CceEEEEecCCCCCEEEeecCCCCCe
Q psy6572 621 RNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQ-NAHIGKAKMDGSNPKVIISKNLSWPN 699 (1416)
Q Consensus 621 ~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~-~~~I~ra~mDGs~r~vlv~~~l~~P~ 699 (1416)
.+..+|-++..+...-++....+...+........+|+|-|..++.|++-.-. .++|+-+. .|.-++.+. ..-...|
T Consensus 156 D~~ilT~SGD~TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R-~~~c~qtF~-ghesDIN 233 (343)
T KOG0286|consen 156 DNHILTGSGDMTCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVR-SGQCVQTFE-GHESDIN 233 (343)
T ss_pred CCceEecCCCceEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccccceeeeecc-CcceeEeec-ccccccc
Confidence 45556777777777777765555555556666788999999899999986322 23444322 233333333 2333456
Q ss_pred eEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeEEecccCCCce
Q psy6572 700 ALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIERCDKYTGKNC 778 (1416)
Q Consensus 700 gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~~nk~tG~~~ 778 (1416)
.+.+. +++.-|.+-+........++.--....+++.. ..+....++++ ..++|.++-+....+..=+...|..+
T Consensus 234 sv~ff-P~G~afatGSDD~tcRlyDlRaD~~~a~ys~~----~~~~gitSv~FS~SGRlLfagy~d~~c~vWDtlk~e~v 308 (343)
T KOG0286|consen 234 SVRFF-PSGDAFATGSDDATCRLYDLRADQELAVYSHD----SIICGITSVAFSKSGRLLFAGYDDFTCNVWDTLKGERV 308 (343)
T ss_pred eEEEc-cCCCeeeecCCCceeEEEeecCCcEEeeeccC----cccCCceeEEEcccccEEEeeecCCceeEeeccccceE
Confidence 67764 46677777666666666676543333333322 11223345665 46777777666655555444444443
Q ss_pred EEE
Q psy6572 779 TSV 781 (1416)
Q Consensus 779 ~~l 781 (1416)
.+|
T Consensus 309 g~L 311 (343)
T KOG0286|consen 309 GVL 311 (343)
T ss_pred EEe
Confidence 333
No 248
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=48.72 E-value=6.6e+02 Score=31.75 Aligned_cols=13 Identities=23% Similarity=0.509 Sum_probs=8.5
Q ss_pred eeecCCcceEEEe
Q psy6572 657 IALNPAYGYMYWT 669 (1416)
Q Consensus 657 IavDp~~g~LYWt 669 (1416)
+|||..+|.+-|.
T Consensus 178 ~alD~~TG~~~W~ 190 (488)
T cd00216 178 RAYDVETGKLLWR 190 (488)
T ss_pred EEEECCCCceeeE
Confidence 5666666666665
No 249
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins.; GO: 0005515 protein binding
Probab=48.40 E-value=4.8e+02 Score=30.01 Aligned_cols=133 Identities=18% Similarity=0.189 Sum_probs=75.0
Q ss_pred CCCeEEEeeccCCCccEEEEecC------C--CCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceE-E-
Q psy6572 577 VDNCLYWSDVTMHGSSIRRSCNN------S--QPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRK-V- 645 (1416)
Q Consensus 577 ~~~~LYwtD~~~~~~~I~r~~l~------s--~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~-v- 645 (1416)
..++|||++.... ..|+.+..- . ..+..+. ...-.|.+| -++.||+--.+...|.+.+|...... .
T Consensus 29 ~~~~iy~~~~~~~-~~v~ey~~~~~f~~~~~~~~~~~Lp~~~~GtG~vV--YngslYY~~~~s~~IvkydL~t~~v~~~~ 105 (250)
T PF02191_consen 29 DSEKIYVTSGFSG-NTVYEYRNYEDFLRNGRSSRTYKLPYPWQGTGHVV--YNGSLYYNKYNSRNIVKYDLTTRSVVARR 105 (250)
T ss_pred CCCCEEEECccCC-CEEEEEcCHhHHhhcCCCceEEEEeceeccCCeEE--ECCcEEEEecCCceEEEEECcCCcEEEEE
Confidence 4667888877653 244333211 1 1111122 222334444 47999999999999999999876644 2
Q ss_pred EEc-CCC----------CCcceeeecCCcce--EEEeeCCCCceEEEEecCCCCCEEEee--cCCCCC---eeEEeecCC
Q psy6572 646 LIN-KGL----------QEPRGIALNPAYGY--MYWTDWGQNAHIGKAKMDGSNPKVIIS--KNLSWP---NALTISYET 707 (1416)
Q Consensus 646 Li~-~~l----------~~P~gIavDp~~g~--LYWtD~g~~~~I~ra~mDGs~r~vlv~--~~l~~P---~gLaiD~~~ 707 (1416)
.+. ... ..=-.+|||.. |. ||-+. .....|..+.||-....+.-+ +.+..+ +++.| =
T Consensus 106 ~L~~A~~~n~~~y~~~~~t~iD~AvDE~-GLWvIYat~-~~~g~ivvskld~~tL~v~~tw~T~~~k~~~~naFmv---C 180 (250)
T PF02191_consen 106 ELPGAGYNNRFPYYWSGYTDIDFAVDEN-GLWVIYATE-DNNGNIVVSKLDPETLSVEQTWNTSYPKRSAGNAFMV---C 180 (250)
T ss_pred ECCccccccccceecCCCceEEEEEcCC-CEEEEEecC-CCCCcEEEEeeCcccCceEEEEEeccCchhhcceeeE---e
Confidence 221 111 11246899944 53 33333 334468888888765544433 344443 34444 5
Q ss_pred CeEEEecCCC
Q psy6572 708 NELFWGDAHE 717 (1416)
Q Consensus 708 ~rLYWtD~~~ 717 (1416)
|.||.++...
T Consensus 181 GvLY~~~s~~ 190 (250)
T PF02191_consen 181 GVLYATDSYD 190 (250)
T ss_pred eEEEEEEECC
Confidence 8999998765
No 250
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=48.35 E-value=3.7e+02 Score=36.23 Aligned_cols=52 Identities=10% Similarity=-0.056 Sum_probs=34.2
Q ss_pred EEEEEEecCCcc-eEE-ecccccceeeeeecCCCeEEE-eeccC--CCccEEEEecC
Q psy6572 548 YYIREVTQAGVM-TIR-IHNQTNAVGLDFDWVDNCLYW-SDVTM--HGSSIRRSCNN 599 (1416)
Q Consensus 548 ~~I~~i~l~g~~-~~~-~~~l~~~~~l~~D~~~~~LYw-tD~~~--~~~~I~r~~l~ 599 (1416)
..|..++.+|.. .++ +.....+...++.+.+++|-+ +.... ....|++..|.
T Consensus 329 ~~L~~~D~dG~n~~~ve~~~~~~i~sP~~SPDG~~vAY~ts~e~~~g~s~vYv~~L~ 385 (912)
T TIGR02171 329 GNLAYIDYTKGASRAVEIEDTISVYHPDISPDGKKVAFCTGIEGLPGKSSVYVRNLN 385 (912)
T ss_pred CeEEEEecCCCCceEEEecCCCceecCcCCCCCCEEEEEEeecCCCCCceEEEEehh
Confidence 378888888876 444 455555566777788888766 54433 34568888877
No 251
>KOG2139|consensus
Probab=47.92 E-value=5.7e+02 Score=30.76 Aligned_cols=161 Identities=17% Similarity=0.155 Sum_probs=92.6
Q ss_pred EecccccceeeeeecCCCeEEEeeccCCCccEEEEe-cC---CCCeEEeecCCCceEEEEccCCcEEEeeCCCCeEEEee
Q psy6572 562 RIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSC-NN---SQPELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVAK 637 (1416)
Q Consensus 562 ~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~-l~---s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ 637 (1416)
+..++....-|-+.+.+..||-+... .++|+- .+ +...-+..-..+.+-.-++-++.|.++-++..+|+...
T Consensus 234 ~~~glgg~slLkwSPdgd~lfaAt~d----avfrlw~e~q~wt~erw~lgsgrvqtacWspcGsfLLf~~sgsp~lysl~ 309 (445)
T KOG2139|consen 234 IPKGLGGFSLLKWSPDGDVLFAATCD----AVFRLWQENQSWTKERWILGSGRVQTACWSPCGSFLLFACSGSPRLYSLT 309 (445)
T ss_pred cccCCCceeeEEEcCCCCEEEEeccc----ceeeeehhcccceecceeccCCceeeeeecCCCCEEEEEEcCCceEEEEe
Confidence 33455555566777777777766544 344443 33 11111111224444455777888999999988999888
Q ss_pred cCCCc---------eEE-EEc--------CC----CCCcceeeecCCcceEEEeeCCCC------ceEEEEecCCCCCEE
Q psy6572 638 LDGRF---------RKV-LIN--------KG----LQEPRGIALNPAYGYMYWTDWGQN------AHIGKAKMDGSNPKV 689 (1416)
Q Consensus 638 ldG~~---------~~v-Li~--------~~----l~~P~gIavDp~~g~LYWtD~g~~------~~I~ra~mDGs~r~v 689 (1416)
.++.. .++ |+. .+ -..+..||-||...||-++-.+.. ..|.+.+..-+-...
T Consensus 310 f~~~~~~~~~~~~~k~~lliaDL~e~ti~ag~~l~cgeaq~lawDpsGeyLav~fKg~~~v~~~k~~i~~fdtr~sp~ve 389 (445)
T KOG2139|consen 310 FDGEDSVFLRPQSIKRVLLIADLQEVTICAGQRLCCGEAQCLAWDPSGEYLAVIFKGQSFVLLCKLHISRFDTRKSPPVE 389 (445)
T ss_pred ecCCCccccCcccceeeeeeccchhhhhhcCcccccCccceeeECCCCCEEEEEEcCCchhhhhhhhhhhhcccccCceE
Confidence 76532 122 222 11 235788999999889888865543 234443333222222
Q ss_pred Eee---cCCCCCeeEEeec--CCCeEEEecCCCCeEEEEeCC
Q psy6572 690 IIS---KNLSWPNALTISY--ETNELFWGDAHEDYIAVSDLN 726 (1416)
Q Consensus 690 lv~---~~l~~P~gLaiD~--~~~rLYWtD~~~~~I~~~~ld 726 (1416)
+.. ..-.+|..|++.+ .+++|.-+-|.+++|.++.+.
T Consensus 390 ls~cg~i~ge~P~~IsF~pl~n~g~lLsiaWsTGriq~ypl~ 431 (445)
T KOG2139|consen 390 LSYCGMIGGEYPAYISFGPLKNEGRLLSIAWSTGRIQRYPLT 431 (445)
T ss_pred EEecccccCCCCceEEeeecccCCcEEEEEeccCceEeeeeE
Confidence 221 1234577776654 456777777778888777654
No 252
>KOG0299|consensus
Probab=47.81 E-value=6.3e+02 Score=31.24 Aligned_cols=68 Identities=15% Similarity=0.152 Sum_probs=32.5
Q ss_pred CcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeec-CCCCCeeEEeecCCCeEEEecCCCCeE
Q psy6572 653 EPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK-NLSWPNALTISYETNELFWGDAHEDYI 720 (1416)
Q Consensus 653 ~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~-~l~~P~gLaiD~~~~rLYWtD~~~~~I 720 (1416)
|..+||+-|....+--..|...-+++.+.-.-.....|..- -..+.|+|++-....+|+..-...+++
T Consensus 382 Witsla~i~~sdL~asGS~~G~vrLW~i~~g~r~i~~l~~ls~~GfVNsl~f~~sgk~ivagiGkEhRl 450 (479)
T KOG0299|consen 382 WITSLAVIPGSDLLASGSWSGCVRLWKIEDGLRAINLLYSLSLVGFVNSLAFSNSGKRIVAGIGKEHRL 450 (479)
T ss_pred ceeeeEecccCceEEecCCCCceEEEEecCCccccceeeecccccEEEEEEEccCCCEEEEeccccccc
Confidence 55666666654433322333233444443222222222221 235567888766555566665444444
No 253
>KOG0272|consensus
Probab=46.80 E-value=6.3e+02 Score=30.94 Aligned_cols=62 Identities=11% Similarity=0.011 Sum_probs=35.5
Q ss_pred cCCCceEEEEccC-CcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEe
Q psy6572 608 ATSPDGLTVDWVG-RNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWT 669 (1416)
Q Consensus 608 l~~p~gLAvD~~~-~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWt 669 (1416)
...+.++++.|+. ++-..|-+..++|-..++++...-.-+..-+.+...+|++|...+|--+
T Consensus 217 ~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~~e~~l~~l~gH~~RVs~VafHPsG~~L~Ta 279 (459)
T KOG0272|consen 217 TSRVGAAVFHPVDSDLNLATASADGTVKLWKLSQETPLQDLEGHLARVSRVAFHPSGKFLGTA 279 (459)
T ss_pred ccceeeEEEccCCCccceeeeccCCceeeeccCCCcchhhhhcchhhheeeeecCCCceeeec
Confidence 6667778888773 4444455555555555555542211122345667778888886665433
No 254
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=46.45 E-value=1.2e+02 Score=38.33 Aligned_cols=22 Identities=14% Similarity=0.215 Sum_probs=19.3
Q ss_pred cCCCceEEEEccCCcEEEeeCC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKG 629 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~ 629 (1416)
+..|++|++.+.++.+|++...
T Consensus 416 mdRpE~i~~~p~~g~Vy~~lTN 437 (616)
T COG3211 416 MDRPEWIAVNPGTGEVYFTLTN 437 (616)
T ss_pred ccCccceeecCCcceEEEEeCC
Confidence 6789999999999999998653
No 255
>PHA02790 Kelch-like protein; Provisional
Probab=45.68 E-value=7.3e+02 Score=31.33 Aligned_cols=130 Identities=10% Similarity=0.017 Sum_probs=65.7
Q ss_pred cCCcEEEeeCC--CCeEEEeecCCCceEEEEcCCCCCcc-eeeecCCcceEEEeeCCC--CceEEEEecCCCCCEEEeec
Q psy6572 619 VGRNLYWCDKG--LDTIEVAKLDGRFRKVLINKGLQEPR-GIALNPAYGYMYWTDWGQ--NAHIGKAKMDGSNPKVIISK 693 (1416)
Q Consensus 619 ~~~~LYwtD~~--~~~I~v~~ldG~~~~vLi~~~l~~P~-gIavDp~~g~LYWtD~g~--~~~I~ra~mDGs~r~vlv~~ 693 (1416)
.++.||..-.. ...+++.+.....-..+ ..+..|+ +.++-...|+||+.--.. ...+++.+... ++-..+ .
T Consensus 317 ~~~~iYviGG~~~~~sve~ydp~~n~W~~~--~~l~~~r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~-~~W~~~-~ 392 (480)
T PHA02790 317 ANNKLYVVGGLPNPTSVERWFHGDAAWVNM--PSLLKPRCNPAVASINNVIYVIGGHSETDTTTEYLLPNH-DQWQFG-P 392 (480)
T ss_pred ECCEEEEECCcCCCCceEEEECCCCeEEEC--CCCCCCCcccEEEEECCEEEEecCcCCCCccEEEEeCCC-CEEEeC-C
Confidence 57899988653 24567766432211111 3454555 333334568999874211 12345544332 222222 1
Q ss_pred CCCCCe---eEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeec
Q psy6572 694 NLSWPN---ALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDW 762 (1416)
Q Consensus 694 ~l~~P~---gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~ 762 (1416)
.+..|+ ++++ .+++||++- +.+++.++... .-..+.. + + .-..-.++++.+++||+.-.
T Consensus 393 ~m~~~r~~~~~~~--~~~~IYv~G---G~~e~ydp~~~-~W~~~~~-m-~--~~r~~~~~~v~~~~IYviGG 454 (480)
T PHA02790 393 STYYPHYKSCALV--FGRRLFLVG---RNAEFYCESSN-TWTLIDD-P-I--YPRDNPELIIVDNKLLLIGG 454 (480)
T ss_pred CCCCccccceEEE--ECCEEEEEC---CceEEecCCCC-cEeEcCC-C-C--CCccccEEEEECCEEEEECC
Confidence 233332 2322 478999985 34666666432 2222221 1 0 01233578899999999764
No 256
>KOG0263|consensus
Probab=44.52 E-value=7.8e+02 Score=32.34 Aligned_cols=177 Identities=10% Similarity=0.076 Sum_probs=93.0
Q ss_pred CCeEEEEe-cEEEEEEecCCcc-eEEeccccccee-eeeecCCCeEEEeeccCCCccEEEEec-C-CC-CeEEee-cCCC
Q psy6572 539 PPNLLFTN-KYYIREVTQAGVM-TIRIHNQTNAVG-LDFDWVDNCLYWSDVTMHGSSIRRSCN-N-SQ-PELLFP-ATSP 611 (1416)
Q Consensus 539 ~~~li~s~-~~~I~~i~l~g~~-~~~~~~l~~~~~-l~~D~~~~~LYwtD~~~~~~~I~r~~l-~-s~-~~~l~~-l~~p 611 (1416)
..|||-+. ...+|.-.++... .++..+...|+. +.|-+. -+|++....+ +.-|+-. + .. .++++. +..+
T Consensus 463 ~rfLlScSED~svRLWsl~t~s~~V~y~GH~~PVwdV~F~P~--GyYFatas~D--~tArLWs~d~~~PlRifaghlsDV 538 (707)
T KOG0263|consen 463 RRFLLSCSEDSSVRLWSLDTWSCLVIYKGHLAPVWDVQFAPR--GYYFATASHD--QTARLWSTDHNKPLRIFAGHLSDV 538 (707)
T ss_pred ccceeeccCCcceeeeecccceeEEEecCCCcceeeEEecCC--ceEEEecCCC--ceeeeeecccCCchhhhccccccc
Confidence 33444433 3456666666655 455555444443 556544 3666655533 3333222 2 22 344444 8888
Q ss_pred ceEEEEccCCcEEEeeCCCCeEEEeec-CCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEE
Q psy6572 612 DGLTVDWVGRNLYWCDKGLDTIEVAKL-DGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVI 690 (1416)
Q Consensus 612 ~gLAvD~~~~~LYwtD~~~~~I~v~~l-dG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vl 690 (1416)
.-++|.|....|. |.+...++..-+. .|..++++ .+......+|++-|...+|--. +....|..-++.+..+...
T Consensus 539 ~cv~FHPNs~Y~a-TGSsD~tVRlWDv~~G~~VRiF-~GH~~~V~al~~Sp~Gr~LaSg--~ed~~I~iWDl~~~~~v~~ 614 (707)
T KOG0263|consen 539 DCVSFHPNSNYVA-TGSSDRTVRLWDVSTGNSVRIF-TGHKGPVTALAFSPCGRYLASG--DEDGLIKIWDLANGSLVKQ 614 (707)
T ss_pred ceEEECCcccccc-cCCCCceEEEEEcCCCcEEEEe-cCCCCceEEEEEcCCCceEeec--ccCCcEEEEEcCCCcchhh
Confidence 8899998666553 4455566666664 45545444 4445556789999975554332 2333455555554433222
Q ss_pred eecCCCCCeeEEeecCCCeEEEecCCCCeEEEEe
Q psy6572 691 ISKNLSWPNALTISYETNELFWGDAHEDYIAVSD 724 (1416)
Q Consensus 691 v~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ 724 (1416)
+...-.....|++.+ .+.|..+....+.|..-+
T Consensus 615 l~~Ht~ti~SlsFS~-dg~vLasgg~DnsV~lWD 647 (707)
T KOG0263|consen 615 LKGHTGTIYSLSFSR-DGNVLASGGADNSVRLWD 647 (707)
T ss_pred hhcccCceeEEEEec-CCCEEEecCCCCeEEEEE
Confidence 222233334566643 455555555545444443
No 257
>KOG0310|consensus
Probab=44.47 E-value=7.2e+02 Score=30.95 Aligned_cols=155 Identities=12% Similarity=0.064 Sum_probs=85.1
Q ss_pred ceeeeeecCCCeEEEeeccCCCccEEEEecCCCC--eEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEE
Q psy6572 569 AVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQP--ELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKV 645 (1416)
Q Consensus 569 ~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~--~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~v 645 (1416)
+.+++|-. .++|+-+--.. +.|..+.+++.. .++.. ...+.-+-+-+..+.++.+-.....+..-++++.+...
T Consensus 71 v~s~~fR~-DG~LlaaGD~s--G~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d~t~l~s~sDd~v~k~~d~s~a~v~~ 147 (487)
T KOG0310|consen 71 VYSVDFRS-DGRLLAAGDES--GHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQDNTMLVSGSDDKVVKYWDLSTAYVQA 147 (487)
T ss_pred eeEEEeec-CCeEEEccCCc--CcEEEeccccHHHHHHHhhccCceeEEEecccCCeEEEecCCCceEEEEEcCCcEEEE
Confidence 45566643 34444443332 256666544211 11111 23334455666777777777766777777788777543
Q ss_pred EEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC
Q psy6572 646 LINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL 725 (1416)
Q Consensus 646 Li~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l 725 (1416)
-+...-+..|.+++.|.++.|++|-... ..|..-+..-.. ..+++-+-..|.--.+-.+.+.++.+ ++.+.|...++
T Consensus 148 ~l~~htDYVR~g~~~~~~~hivvtGsYD-g~vrl~DtR~~~-~~v~elnhg~pVe~vl~lpsgs~ias-AgGn~vkVWDl 224 (487)
T KOG0310|consen 148 ELSGHTDYVRCGDISPANDHIVVTGSYD-GKVRLWDTRSLT-SRVVELNHGCPVESVLALPSGSLIAS-AGGNSVKVWDL 224 (487)
T ss_pred EecCCcceeEeeccccCCCeEEEecCCC-ceEEEEEeccCC-ceeEEecCCCceeeEEEcCCCCEEEE-cCCCeEEEEEe
Confidence 4446677899999999999999996332 244444443332 23333333445443333355555544 34456666676
Q ss_pred CCCc
Q psy6572 726 NGEN 729 (1416)
Q Consensus 726 dG~~ 729 (1416)
.+..
T Consensus 225 ~~G~ 228 (487)
T KOG0310|consen 225 TTGG 228 (487)
T ss_pred cCCc
Confidence 5433
No 258
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=42.96 E-value=48 Score=35.83 Aligned_cols=58 Identities=29% Similarity=0.710 Sum_probs=35.0
Q ss_pred CCceEEeeCCCceecCCCCCccccCCc--CCCCCcc--ceeeecCCeeeecCCCCcEEecCCCce
Q psy6572 472 KIGYKCACRKGYQVHPEDKHLCVDTNE--CLDRPCS--HYCRNTLGSYSCSCAPGYALLSDKHGC 532 (1416)
Q Consensus 472 ~~gy~C~C~~Gy~L~p~d~~tC~didE--C~~~~Cs--q~C~nt~gsy~C~C~~Gy~L~~dg~sC 532 (1416)
+....|+|.-|+.+ .+...|.-..+ |+ -.|. +.|..+.+.|+|.|.+||.+...+..+
T Consensus 107 ~~~~~CSC~IGkV~--~dn~kCtk~G~T~C~-LKCk~nE~CK~~~~~Y~C~~~~~~~~~~~~~~~ 168 (197)
T PF06247_consen 107 PNNPTCSCNIGKVP--DDNKKCTKTGETKCS-LKCKENEECKLVDGYYKCVCKEGFPGDGEGEGC 168 (197)
T ss_dssp GSEEEEEE-TEEET--TTTTESEEEE---------TTTEEEEEETTEEEEEE-TT-EEETTT---
T ss_pred CCCceeEeeeceEe--ccCCcccCCCcccee-eecCCCcceeeeCcEEEeecCCCCCCCCCcccc
Confidence 34569999999983 56666765443 21 2221 378888899999999999988766433
No 259
>KOG1517|consensus
Probab=41.85 E-value=1.1e+03 Score=32.42 Aligned_cols=71 Identities=11% Similarity=0.143 Sum_probs=41.4
Q ss_pred CCceEEEEcc--CCcEEEeeCCCCeEEEeecCCCceEE-EEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEec
Q psy6572 610 SPDGLTVDWV--GRNLYWCDKGLDTIEVAKLDGRFRKV-LINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKM 682 (1416)
Q Consensus 610 ~p~gLAvD~~--~~~LYwtD~~~~~I~v~~ldG~~~~v-Li~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~m 682 (1416)
.-.|+.+||. +|+||.+-. ...|.+-+.+-..... +..+.-.-+++|.-|..+|.|+.+-.+.. .|...++
T Consensus 1165 r~~~~v~dWqQ~~G~Ll~tGd-~r~IRIWDa~~E~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDG-svRvyD~ 1238 (1387)
T KOG1517|consen 1165 RGTGLVVDWQQQSGHLLVTGD-VRSIRIWDAHKEQVVADIPYGSSTLVTALSADLVHGNIIAAGFADG-SVRVYDR 1238 (1387)
T ss_pred CCCCeeeehhhhCCeEEecCC-eeEEEEEecccceeEeecccCCCccceeecccccCCceEEEeecCC-ceEEeec
Confidence 3456899995 567776643 3445555544332221 11234455888888888888888865543 3444433
No 260
>KOG0308|consensus
Probab=41.81 E-value=5.9e+02 Score=32.85 Aligned_cols=148 Identities=15% Similarity=0.110 Sum_probs=82.8
Q ss_pred cceeeeeecCCCeEEEeeccCCCccEEEEecCC--CCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCc-e
Q psy6572 568 NAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS--QPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF-R 643 (1416)
Q Consensus 568 ~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s--~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~-~ 643 (1416)
.+++||... ++.++++-... +.|+.....+ ++.-|.. -.+++.|.|+.-+..| .+-+..++|.+-+|.-.. .
T Consensus 173 siYSLA~N~-t~t~ivsGgte--k~lr~wDprt~~kimkLrGHTdNVr~ll~~dDGt~~-ls~sSDgtIrlWdLgqQrCl 248 (735)
T KOG0308|consen 173 SIYSLAMNQ-TGTIIVSGGTE--KDLRLWDPRTCKKIMKLRGHTDNVRVLLVNDDGTRL-LSASSDGTIRLWDLGQQRCL 248 (735)
T ss_pred ceeeeecCC-cceEEEecCcc--cceEEeccccccceeeeeccccceEEEEEcCCCCeE-eecCCCceEEeeecccccee
Confidence 355666543 33455554432 2344333332 2222333 6678888888655555 455667888888887654 2
Q ss_pred EEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEE
Q psy6572 644 KVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVS 723 (1416)
Q Consensus 644 ~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~ 723 (1416)
.+++... ....+|.++|.-.++|..+. ...|+|++|......+++-+.-..-.-|.+.....-+ |+-.....|.+.
T Consensus 249 ~T~~vH~-e~VWaL~~~~sf~~vYsG~r--d~~i~~Tdl~n~~~~tlick~daPv~~l~~~~~~~~~-WvtTtds~I~rW 324 (735)
T KOG0308|consen 249 ATYIVHK-EGVWALQSSPSFTHVYSGGR--DGNIYRTDLRNPAKSTLICKEDAPVLKLHLHEHDNSV-WVTTTDSSIKRW 324 (735)
T ss_pred eeEEecc-CceEEEeeCCCcceEEecCC--CCcEEecccCCchhheEeecCCCchhhhhhccccCCc-eeeeccccceec
Confidence 2333222 22778899999889998874 3479999998854444443322222345554333444 665544455444
No 261
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=41.75 E-value=1.2e+02 Score=38.44 Aligned_cols=105 Identities=17% Similarity=0.086 Sum_probs=62.3
Q ss_pred cccceeeeeecCCCeEEEeeccCC--------------CccEEEEecC-C-------CCeEEee----------------
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMH--------------GSSIRRSCNN-S-------QPELLFP---------------- 607 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~--------------~~~I~r~~l~-s-------~~~~l~~---------------- 607 (1416)
+..+..|++.+..+.||++.++.. .+.|+|+... . ..++++.
T Consensus 416 mdRpE~i~~~p~~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~~ 495 (616)
T COG3211 416 MDRPEWIAVNPGTGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANIN 495 (616)
T ss_pred ccCccceeecCCcceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceeeeeeecCCccccccccccCcc
Confidence 456889999999999999877643 1357777654 1 2334332
Q ss_pred ---cCCCceEEEEccCCcEEEeeCCCC----e-EEEee---cCCCc---eEEEEcCCCCCcceeeecCCcceEEEee
Q psy6572 608 ---ATSPDGLTVDWVGRNLYWCDKGLD----T-IEVAK---LDGRF---RKVLINKGLQEPRGIALNPAYGYMYWTD 670 (1416)
Q Consensus 608 ---l~~p~gLAvD~~~~~LYwtD~~~~----~-I~v~~---ldG~~---~~vLi~~~l~~P~gIavDp~~g~LYWtD 670 (1416)
+..|.+|+||+.++...-||.... + +.+.. -++.. ++-|....-..-.|++..|..+.||+.-
T Consensus 496 ~~~f~~PDnl~fD~~GrLWi~TDg~~s~~~~~~~G~~~m~~~~p~~g~~~rf~t~P~g~E~tG~~FspD~~TlFV~v 572 (616)
T COG3211 496 ANWFNSPDNLAFDPWGRLWIQTDGSGSTLRNRFRGVTQMLTPDPKTGTIKRFLTGPIGCEFTGPCFSPDGKTLFVNV 572 (616)
T ss_pred cccccCCCceEECCCCCEEEEecCCCCccCcccccccccccCCCccceeeeeccCCCcceeecceeCCCCceEEEEe
Confidence 334999999987776666776432 1 11111 12211 2222212223456788888877777764
No 262
>KOG3914|consensus
Probab=41.54 E-value=4.6e+02 Score=31.78 Aligned_cols=120 Identities=13% Similarity=0.235 Sum_probs=74.2
Q ss_pred cCCCceEEEEccCCcEEEeeCCCC--eEEEeecC-CCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLD--TIEVAKLD-GRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDG 684 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~--~I~v~~ld-G~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDG 684 (1416)
-..|++|.+.-....+-++|.... .+.+.+.+ |..+.+| ..+.....|||-|...+|.-+|..+ +|....+.+
T Consensus 107 ~~~~~ai~~~~~~~sv~v~dkagD~~~~di~s~~~~~~~~~l--GhvSml~dVavS~D~~~IitaDRDE--kIRvs~ypa 182 (390)
T KOG3914|consen 107 PKRPTAISFIREDTSVLVADKAGDVYSFDILSADSGRCEPIL--GHVSMLLDVAVSPDDQFIITADRDE--KIRVSRYPA 182 (390)
T ss_pred ccCcceeeeeeccceEEEEeecCCceeeeeecccccCcchhh--hhhhhhheeeecCCCCEEEEecCCc--eEEEEecCc
Confidence 456777777666666666665432 34444433 4433333 4566678899999989999898543 688777777
Q ss_pred CCCEE-EeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC-CCCceEEE
Q psy6572 685 SNPKV-IISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL-NGENIKII 733 (1416)
Q Consensus 685 s~r~v-lv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l-dG~~r~~v 733 (1416)
.+... +....-.....|+| ..+++.|+-.+-++|+.-++ .|+...++
T Consensus 183 ~f~IesfclGH~eFVS~isl--~~~~~LlS~sGD~tlr~Wd~~sgk~L~t~ 231 (390)
T KOG3914|consen 183 TFVIESFCLGHKEFVSTISL--TDNYLLLSGSGDKTLRLWDITSGKLLDTC 231 (390)
T ss_pred ccchhhhccccHhheeeeee--ccCceeeecCCCCcEEEEecccCCccccc
Confidence 65211 11111222346666 46777888888888888885 56655443
No 263
>KOG0319|consensus
Probab=41.19 E-value=9.6e+02 Score=31.48 Aligned_cols=148 Identities=17% Similarity=0.099 Sum_probs=85.7
Q ss_pred eEecCCCCCeEEEE-ecEEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee-
Q psy6572 532 CKATSDVPPNLLFT-NKYYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP- 607 (1416)
Q Consensus 532 C~a~~~~~~~li~s-~~~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~- 607 (1416)
++-..+++.+|.++ |...+|.+++.+.. .++...-..+.+|++ +..+.|+.+-.......+.|++-+ +....+..
T Consensus 329 m~~lG~e~~~laVATNs~~lr~y~~~~~~c~ii~GH~e~vlSL~~-~~~g~llat~sKD~svilWr~~~~~~~~~~~a~~ 407 (775)
T KOG0319|consen 329 MKFLGPEESHLAVATNSPELRLYTLPTSYCQIIPGHTEAVLSLDV-WSSGDLLATGSKDKSVILWRLNNNCSKSLCVAQA 407 (775)
T ss_pred eeecCCccceEEEEeCCCceEEEecCCCceEEEeCchhheeeeee-cccCcEEEEecCCceEEEEEecCCcchhhhhhhh
Confidence 44444556666664 56678888888887 655444444556662 566766666555432233444223 22222211
Q ss_pred ---cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCc----eEEEEc-----CCCCCcceeeecCCcceEEEee-CCCC
Q psy6572 608 ---ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF----RKVLIN-----KGLQEPRGIALNPAYGYMYWTD-WGQN 674 (1416)
Q Consensus 608 ---l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~----~~vLi~-----~~l~~P~gIavDp~~g~LYWtD-~g~~ 674 (1416)
...+.+||+-..+-..|.+-+...+|.+-.+.++. .-++.. ..-+..+++||.|... |+-|- ....
T Consensus 408 ~gH~~svgava~~~~~asffvsvS~D~tlK~W~l~~s~~~~~~~~~~~~~t~~aHdKdIN~Vaia~ndk-LiAT~SqDkt 486 (775)
T KOG0319|consen 408 NGHTNSVGAVAGSKLGASFFVSVSQDCTLKLWDLPKSKETAFPIVLTCRYTERAHDKDINCVAIAPNDK-LIATGSQDKT 486 (775)
T ss_pred cccccccceeeecccCccEEEEecCCceEEEecCCCcccccccceehhhHHHHhhcccccceEecCCCc-eEEecccccc
Confidence 45677888865566677777777788887777622 112211 1235688999998854 44443 3345
Q ss_pred ceEEEEe
Q psy6572 675 AHIGKAK 681 (1416)
Q Consensus 675 ~~I~ra~ 681 (1416)
.+|+.+.
T Consensus 487 aKiW~le 493 (775)
T KOG0319|consen 487 AKIWDLE 493 (775)
T ss_pred eeeeccc
Confidence 7788776
No 264
>KOG0269|consensus
Probab=39.88 E-value=2.5e+02 Score=36.66 Aligned_cols=123 Identities=11% Similarity=-0.031 Sum_probs=75.4
Q ss_pred EecccccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeec
Q psy6572 562 RIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKL 638 (1416)
Q Consensus 562 ~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~l 638 (1416)
.-..-+.+.-|+|+..+-.|.++-... +.|.-+++. ....+... -..++.+++-+..++.|.+-...+.+..-+|
T Consensus 129 f~EH~Rs~~~ldfh~tep~iliSGSQD--g~vK~~DlR~~~S~~t~~~nSESiRDV~fsp~~~~~F~s~~dsG~lqlWDl 206 (839)
T KOG0269|consen 129 FNEHERSANKLDFHSTEPNILISGSQD--GTVKCWDLRSKKSKSTFRSNSESIRDVKFSPGYGNKFASIHDSGYLQLWDL 206 (839)
T ss_pred hhhhccceeeeeeccCCccEEEecCCC--ceEEEEeeecccccccccccchhhhceeeccCCCceEEEecCCceEEEeec
Confidence 334556678899998888887776654 477777776 22333333 6778899999999999999888888888777
Q ss_pred CCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC
Q psy6572 639 DGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 639 dG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r 687 (1416)
--..+-.+....-. --.+.+++.-.+-|.+.-|+...|..-+|.+...
T Consensus 207 Rqp~r~~~k~~AH~-GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~~~~ 254 (839)
T KOG0269|consen 207 RQPDRCEKKLTAHN-GPVLCLNWHPNREWLATGGRDKMVKIWDMTDSRA 254 (839)
T ss_pred cCchhHHHHhhccc-CceEEEeecCCCceeeecCCCccEEEEeccCCCc
Confidence 54332222111111 1123343333455666656555566666765443
No 265
>smart00284 OLF Olfactomedin-like domains.
Probab=39.17 E-value=6.6e+02 Score=29.01 Aligned_cols=92 Identities=17% Similarity=0.168 Sum_probs=58.5
Q ss_pred CCcEEEeeCCCCeEEEeecCCCceE--EEEcC-C----------CCCcceeeecCCcce-EEEeeCCCCceEEEEecCCC
Q psy6572 620 GRNLYWCDKGLDTIEVAKLDGRFRK--VLINK-G----------LQEPRGIALNPAYGY-MYWTDWGQNAHIGKAKMDGS 685 (1416)
Q Consensus 620 ~~~LYwtD~~~~~I~v~~ldG~~~~--vLi~~-~----------l~~P~gIavDp~~g~-LYWtD~g~~~~I~ra~mDGs 685 (1416)
+|.||+--.....|.+.+|...... .++.. . ...=-.||||...=+ ||-|. .....|..+.||-.
T Consensus 83 ngslYY~~~~s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~-~~~g~ivvSkLnp~ 161 (255)
T smart00284 83 NGSLYFNKFNSHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATE-QNAGKIVISKLNPA 161 (255)
T ss_pred CceEEEEecCCccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEcCCceEEEEecc-CCCCCEEEEeeCcc
Confidence 5999998888889999999876643 22221 1 112246899966322 44443 33457888899876
Q ss_pred CCEEEee--cCCCCC---eeEEeecCCCeEEEecC
Q psy6572 686 NPKVIIS--KNLSWP---NALTISYETNELFWGDA 715 (1416)
Q Consensus 686 ~r~vlv~--~~l~~P---~gLaiD~~~~rLYWtD~ 715 (1416)
...++-+ +.+.++ +++.| =|.||.++.
T Consensus 162 tL~ve~tW~T~~~k~sa~naFmv---CGvLY~~~s 193 (255)
T smart00284 162 TLTIENTWITTYNKRSASNAFMI---CGILYVTRS 193 (255)
T ss_pred cceEEEEEEcCCCcccccccEEE---eeEEEEEcc
Confidence 6655443 344443 34555 589999985
No 266
>KOG0277|consensus
Probab=39.05 E-value=4.6e+02 Score=30.13 Aligned_cols=78 Identities=13% Similarity=0.122 Sum_probs=51.6
Q ss_pred cccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEE--EeecCC
Q psy6572 566 QTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIE--VAKLDG 640 (1416)
Q Consensus 566 l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~--v~~ldG 640 (1416)
.+.+.++++....+.++.+..=. .+|+....+ ....++.. -.-+.+.++.|...+||-+-++.+... =++..|
T Consensus 104 ~~EV~Svdwn~~~r~~~ltsSWD--~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~g 181 (311)
T KOG0277|consen 104 KREVYSVDWNTVRRRIFLTSSWD--GTIKLWDPNRPNSVQTFNGHNSCIYQAAFSPHIPNLFASASGDGTLRLWDVRSPG 181 (311)
T ss_pred hhheEEeccccccceeEEeeccC--CceEeecCCCCcceEeecCCccEEEEEecCCCCCCeEEEccCCceEEEEEecCCC
Confidence 34577888888888888877543 367666666 33444444 455667788888999998888877544 444556
Q ss_pred CceEE
Q psy6572 641 RFRKV 645 (1416)
Q Consensus 641 ~~~~v 645 (1416)
+...+
T Consensus 182 k~~~i 186 (311)
T KOG0277|consen 182 KFMSI 186 (311)
T ss_pred ceeEE
Confidence 65543
No 267
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate.; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=38.71 E-value=5.8e+02 Score=32.22 Aligned_cols=159 Identities=13% Similarity=0.035 Sum_probs=68.6
Q ss_pred cCCcEEEeeC----CCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecC
Q psy6572 619 VGRNLYWCDK----GLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKN 694 (1416)
Q Consensus 619 ~~~~LYwtD~----~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~ 694 (1416)
....||+... ....++.++.+|..|-.+........+ +-+ -.+|.|++... .+|..+++.|......-...
T Consensus 112 ~~~gl~~~~~~~~~~~~~~~~iD~~G~Vrw~~~~~~~~~~~-~~~-l~nG~ll~~~~---~~~~e~D~~G~v~~~~~l~~ 186 (477)
T PF05935_consen 112 MEDGLYFVNGNDWDSSSYTYLIDNNGDVRWYLPLDSGSDNS-FKQ-LPNGNLLIGSG---NRLYEIDLLGKVIWEYDLPG 186 (477)
T ss_dssp -TT-EEEEEETT--BEEEEEEEETTS-EEEEE-GGGT--SS-EEE--TTS-EEEEEB---TEEEEE-TT--EEEEEE--T
T ss_pred cCCcEEEEeCCCCCCCceEEEECCCccEEEEEccCccccce-eeE-cCCCCEEEecC---CceEEEcCCCCEEEeeecCC
Confidence 4455666655 345778888888876655432211111 223 24566666552 46777777776322211111
Q ss_pred -C-CCCeeEEeecCCCeEEEec------------CCCCeEEEEeCCCCceEEEEeccC--------------------CC
Q psy6572 695 -L-SWPNALTISYETNELFWGD------------AHEDYIAVSDLNGENIKIIVSRRM--------------------DP 740 (1416)
Q Consensus 695 -l-~~P~gLaiD~~~~rLYWtD------------~~~~~I~~~~ldG~~r~~v~~~~~--------------------~p 740 (1416)
. ..=..+...+..+.|+.+. ...+.|..++.+|.-+...-.... ..
T Consensus 187 ~~~~~HHD~~~l~nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd~tG~vv~~wd~~d~ld~~~~~~~~~~~~~~~~~~~~ 266 (477)
T PF05935_consen 187 GYYDFHHDIDELPNGNLLILASETKYVDEDKDVDTVEDVIVEVDPTGEVVWEWDFFDHLDPYRDTVLKPYPYGDISGSGG 266 (477)
T ss_dssp TEE-B-S-EEE-TTS-EEEEEEETTEE-TS-EE---S-EEEEE-TTS-EEEEEEGGGTS-TT--TTGGT--SSSSS-SST
T ss_pred cccccccccEECCCCCEEEEEeecccccCCCCccEecCEEEEECCCCCEEEEEehHHhCCcccccccccccccccccCCC
Confidence 0 0001222222111222111 012345555544544333311110 01
Q ss_pred CcccccceeEEEec--CcEEEeecCCCeeEEecccCCCceEEEE
Q psy6572 741 TINLHHVFALAVFE--DHLFWTDWEMKSIERCDKYTGKNCTSVV 782 (1416)
Q Consensus 741 ~~~l~~P~~lav~~--d~LYwtD~~~~~I~~~nk~tG~~~~~l~ 782 (1416)
...+.|.-+|.+.+ +.|+++-.....|++++..+++..-++-
T Consensus 267 ~~DW~H~Nsi~yd~~dd~iivSsR~~s~V~~Id~~t~~i~Wilg 310 (477)
T PF05935_consen 267 GRDWLHINSIDYDPSDDSIIVSSRHQSAVIKIDYRTGKIKWILG 310 (477)
T ss_dssp TSBS--EEEEEEETTTTEEEEEETTT-EEEEEE-TTS-EEEEES
T ss_pred CCCccccCccEEeCCCCeEEEEcCcceEEEEEECCCCcEEEEeC
Confidence 22356888898865 8899999999999999977776544443
No 268
>KOG0282|consensus
Probab=38.55 E-value=5.9e+02 Score=31.65 Aligned_cols=110 Identities=15% Similarity=0.175 Sum_probs=55.3
Q ss_pred CCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEc--CCCCCcceeeecCCcceEEEeeCCCC-ceEEEEecCCC
Q psy6572 609 TSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLIN--KGLQEPRGIALNPAYGYMYWTDWGQN-AHIGKAKMDGS 685 (1416)
Q Consensus 609 ~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~--~~l~~P~gIavDp~~g~LYWtD~g~~-~~I~ra~mDGs 685 (1416)
..|.-+.+.+...++|++-...++|...++.... ++.. ..|.....|.+-+.+ .-|++..... -+|+-... +.
T Consensus 300 ~~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~k--vvqeYd~hLg~i~~i~F~~~g-~rFissSDdks~riWe~~~-~v 375 (503)
T KOG0282|consen 300 KVPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGK--VVQEYDRHLGAILDITFVDEG-RRFISSSDDKSVRIWENRI-PV 375 (503)
T ss_pred CCceeeecCCCCCcEEEEecCCCcEEEEeccchH--HHHHHHhhhhheeeeEEccCC-ceEeeeccCccEEEEEcCC-Cc
Confidence 4456666777778999998888888887765332 1111 334445556555543 3344332221 12222111 11
Q ss_pred CCEEEeecC-CCCCeeEEeecCCCeEEEecCCCCeEEEEe
Q psy6572 686 NPKVIISKN-LSWPNALTISYETNELFWGDAHEDYIAVSD 724 (1416)
Q Consensus 686 ~r~vlv~~~-l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ 724 (1416)
..+.++... ..-| .|++. +++..+.+....+.|....
T Consensus 376 ~ik~i~~~~~hsmP-~~~~~-P~~~~~~aQs~dN~i~ifs 413 (503)
T KOG0282|consen 376 PIKNIADPEMHTMP-CLTLH-PNGKWFAAQSMDNYIAIFS 413 (503)
T ss_pred cchhhcchhhccCc-ceecC-CCCCeehhhccCceEEEEe
Confidence 111122212 2222 56664 4667777776666665554
No 269
>COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only]
Probab=38.32 E-value=6e+02 Score=31.22 Aligned_cols=30 Identities=17% Similarity=0.194 Sum_probs=20.6
Q ss_pred EEccCCcEEEeeCCCCeEEEeecCCCceEE
Q psy6572 616 VDWVGRNLYWCDKGLDTIEVAKLDGRFRKV 645 (1416)
Q Consensus 616 vD~~~~~LYwtD~~~~~I~v~~ldG~~~~v 645 (1416)
+.|-+-.+|-+-...+.|.+++++|+....
T Consensus 155 ~~~~git~yn~~e~~k~vw~~~fnGsyvda 184 (603)
T COG4880 155 GEVGGITLYNLYESSKKVWVYNFNGSYVDA 184 (603)
T ss_pred EEeCCEEEEEeccccceeEEEecCCceeee
Confidence 355555666665566788899999987543
No 270
>KOG1517|consensus
Probab=37.11 E-value=5e+02 Score=35.47 Aligned_cols=178 Identities=19% Similarity=0.167 Sum_probs=91.2
Q ss_pred eeeeeecC--CCeEEEeeccCCCccEEEEecCCCCeEEee-----cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCc
Q psy6572 570 VGLDFDWV--DNCLYWSDVTMHGSSIRRSCNNSQPELLFP-----ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF 642 (1416)
Q Consensus 570 ~~l~~D~~--~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~-----l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~ 642 (1416)
.++.+||. .++||.+-.. +.|+..+... ..++.. -..+.+|.-|-.+++|+.+-...++|.++++--..
T Consensus 1167 ~~~v~dWqQ~~G~Ll~tGd~---r~IRIWDa~~-E~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDGsvRvyD~R~a~ 1242 (1387)
T KOG1517|consen 1167 TGLVVDWQQQSGHLLVTGDV---RSIRIWDAHK-EQVVADIPYGSSTLVTALSADLVHGNIIAAGFADGSVRVYDRRMAP 1242 (1387)
T ss_pred CCeeeehhhhCCeEEecCCe---eEEEEEeccc-ceeEeecccCCCccceeecccccCCceEEEeecCCceEEeecccCC
Confidence 35677875 4567765432 1344444431 122222 45578888888899999999888888888764433
Q ss_pred eEEEE--cCCC-CC--cceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCC-----CeeEEeecCCCeEEE
Q psy6572 643 RKVLI--NKGL-QE--PRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSW-----PNALTISYETNELFW 712 (1416)
Q Consensus 643 ~~vLi--~~~l-~~--P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~-----P~gLaiD~~~~rLYW 712 (1416)
+..++ .... .+ ..++.+-+. |+--.........|...+|.++.+...++-...| -++|+|.. ...|+-
T Consensus 1243 ~ds~v~~~R~h~~~~~Iv~~slq~~-G~~elvSgs~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~-hapiiA 1320 (1387)
T KOG1517|consen 1243 PDSLVCVYREHNDVEPIVHLSLQRQ-GLGELVSGSQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTALTVHE-HAPIIA 1320 (1387)
T ss_pred ccccceeecccCCcccceeEEeecC-CCcceeeeccCCeEEEEecccCcccccceeeeccccCccceeeeecc-CCCeee
Confidence 32222 1111 11 223333321 2211111122346777777765443333322222 35677764 445554
Q ss_pred ecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEec
Q psy6572 713 GDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFE 754 (1416)
Q Consensus 713 tD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~ 754 (1416)
+-.. ..|-..++.|....++....+--+..+.++..|+++-
T Consensus 1321 sGs~-q~ikIy~~~G~~l~~~k~n~~F~~q~~gs~scL~FHP 1361 (1387)
T KOG1517|consen 1321 SGSA-QLIKIYSLSGEQLNIIKYNPGFMGQRIGSVSCLAFHP 1361 (1387)
T ss_pred ecCc-ceEEEEecChhhhcccccCcccccCcCCCcceeeecc
Confidence 4333 6777777888777666543211112234444555543
No 271
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=36.76 E-value=1.2e+03 Score=31.37 Aligned_cols=19 Identities=11% Similarity=-0.197 Sum_probs=13.5
Q ss_pred CCceEEEEccCCcEEEeeC
Q psy6572 610 SPDGLTVDWVGRNLYWCDK 628 (1416)
Q Consensus 610 ~p~gLAvD~~~~~LYwtD~ 628 (1416)
..+=+|||..+|++-|.=.
T Consensus 269 Dg~LiALDA~TGk~~W~fg 287 (764)
T TIGR03074 269 DARLIALDADTGKLCEDFG 287 (764)
T ss_pred CCeEEEEECCCCCEEEEec
Confidence 3455788888888888643
No 272
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=36.09 E-value=4.3e+02 Score=31.14 Aligned_cols=104 Identities=15% Similarity=0.097 Sum_probs=52.9
Q ss_pred ecCCcceEEEeeCCCCceEEEEecCCCCC-EEEeecCCCCCeeEEeecCCCeEEEecCCCC-eEEEEeCCCCceEEEEec
Q psy6572 659 LNPAYGYMYWTDWGQNAHIGKAKMDGSNP-KVIISKNLSWPNALTISYETNELFWGDAHED-YIAVSDLNGENIKIIVSR 736 (1416)
Q Consensus 659 vDp~~g~LYWtD~g~~~~I~ra~mDGs~r-~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~-~I~~~~ldG~~r~~v~~~ 736 (1416)
|.+.+|+|..+- | .+|....++...+ ........ ....+.|....++|++.|...+ .+.+.+.++.....+...
T Consensus 94 i~~~~~~lv~~~-g--~~l~v~~l~~~~~l~~~~~~~~-~~~i~sl~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d 169 (321)
T PF03178_consen 94 ICSFNGRLVVAV-G--NKLYVYDLDNSKTLLKKAFYDS-PFYITSLSVFKNYILVGDAMKSVSLLRYDEENNKLILVARD 169 (321)
T ss_dssp EEEETTEEEEEE-T--TEEEEEEEETTSSEEEEEEE-B-SSSEEEEEEETTEEEEEESSSSEEEEEEETTTE-EEEEEEE
T ss_pred hhhhCCEEEEee-c--CEEEEEEccCcccchhhheecc-eEEEEEEeccccEEEEEEcccCEEEEEEEccCCEEEEEEec
Confidence 333477766655 3 4677777777762 22222111 1134455556889999998654 334444434334434332
Q ss_pred cCCCCcccccceeEEEe--cCcEEEeecCCCe-eEEec
Q psy6572 737 RMDPTINLHHVFALAVF--EDHLFWTDWEMKS-IERCD 771 (1416)
Q Consensus 737 ~~~p~~~l~~P~~lav~--~d~LYwtD~~~~~-I~~~n 771 (1416)
. ...+..++.+. ++.+..+|...+- +++.+
T Consensus 170 ~-----~~~~v~~~~~l~d~~~~i~~D~~gnl~~l~~~ 202 (321)
T PF03178_consen 170 Y-----QPRWVTAAEFLVDEDTIIVGDKDGNLFVLRYN 202 (321)
T ss_dssp S-----S-BEEEEEEEE-SSSEEEEEETTSEEEEEEE-
T ss_pred C-----CCccEEEEEEecCCcEEEEEcCCCeEEEEEEC
Confidence 1 13345566554 3467777765432 34443
No 273
>KOG0640|consensus
Probab=36.01 E-value=5.8e+02 Score=29.97 Aligned_cols=106 Identities=13% Similarity=0.057 Sum_probs=55.1
Q ss_pred ecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CC----CeEEeecCCCceEEEEccCCcEEEeeCCCCeEEEee
Q psy6572 563 IHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQ----PELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVAK 637 (1416)
Q Consensus 563 ~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~----~~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ 637 (1416)
+.....+..|+|+++...| .+-... ..|..+.+. +. .+++.....++.|.+.|.+..|.+.- ....+.+.+
T Consensus 169 YDH~devn~l~FHPre~IL-iS~srD--~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHPsGefllvgT-dHp~~rlYd 244 (430)
T KOG0640|consen 169 YDHVDEVNDLDFHPRETIL-ISGSRD--NTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHPSGEFLLVGT-DHPTLRLYD 244 (430)
T ss_pred hhccCcccceeecchhheE-EeccCC--CeEEEEecccHHHHHHHHHhhccceeeeEeecCCCceEEEec-CCCceeEEe
Confidence 3444556778898866544 444443 266666665 21 23333366778899998887776532 223444444
Q ss_pred cCCCceEEEEcCCCCC---cceeeecCCcceEEEeeCCC
Q psy6572 638 LDGRFRKVLINKGLQE---PRGIALNPAYGYMYWTDWGQ 673 (1416)
Q Consensus 638 ldG~~~~vLi~~~l~~---P~gIavDp~~g~LYWtD~g~ 673 (1416)
.+....-+-....-+. ...+-. ..+|.||+|-...
T Consensus 245 v~T~QcfvsanPd~qht~ai~~V~Y-s~t~~lYvTaSkD 282 (430)
T KOG0640|consen 245 VNTYQCFVSANPDDQHTGAITQVRY-SSTGSLYVTASKD 282 (430)
T ss_pred ccceeEeeecCcccccccceeEEEe-cCCccEEEEeccC
Confidence 4433222211111111 112222 3458999997543
No 274
>KOG0269|consensus
Probab=35.58 E-value=6.3e+02 Score=33.26 Aligned_cols=109 Identities=15% Similarity=0.127 Sum_probs=63.1
Q ss_pred CcEEEeeCCCCeEEEeecCCCceEEEE---cCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCC
Q psy6572 621 RNLYWCDKGLDTIEVAKLDGRFRKVLI---NKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSW 697 (1416)
Q Consensus 621 ~~LYwtD~~~~~I~v~~ldG~~~~vLi---~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~ 697 (1416)
.||..|-...+.|.+-+|+-..|..++ ..--....-+.+++..-+|.++- ++...|.-.+|.-...+.....+-..
T Consensus 100 ~NlIAT~s~nG~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSG-SQDg~vK~~DlR~~~S~~t~~~nSES 178 (839)
T KOG0269|consen 100 SNLIATCSTNGVISVWDLNKSIRNKLLTVFNEHERSANKLDFHSTEPNILISG-SQDGTVKCWDLRSKKSKSTFRSNSES 178 (839)
T ss_pred hhhheeecCCCcEEEEecCccccchhhhHhhhhccceeeeeeccCCccEEEec-CCCceEEEEeeecccccccccccchh
Confidence 455566667778888888765444332 22333455667777776766664 22335555555544443333334444
Q ss_pred CeeEEeecCCCeEEEecCCCCeEEEEeCCCCce
Q psy6572 698 PNALTISYETNELFWGDAHEDYIAVSDLNGENI 730 (1416)
Q Consensus 698 P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r 730 (1416)
.+.+++-+..+..|.+-...+.+...++.-..+
T Consensus 179 iRDV~fsp~~~~~F~s~~dsG~lqlWDlRqp~r 211 (839)
T KOG0269|consen 179 IRDVKFSPGYGNKFASIHDSGYLQLWDLRQPDR 211 (839)
T ss_pred hhceeeccCCCceEEEecCCceEEEeeccCchh
Confidence 456666666677777766677777777654433
No 275
>KOG0649|consensus
Probab=35.57 E-value=7.2e+02 Score=28.39 Aligned_cols=151 Identities=9% Similarity=0.071 Sum_probs=77.8
Q ss_pred ccccceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCC
Q psy6572 565 NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGR 641 (1416)
Q Consensus 565 ~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~ 641 (1416)
.+..+.+|-+|+.++.|+++-.. ..|+.+.+. +.++.... ......++.-..++.|+ +-...+++.+-++...
T Consensus 113 evPeINam~ldP~enSi~~AgGD---~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~qil-sG~EDGtvRvWd~kt~ 188 (325)
T KOG0649|consen 113 EVPEINAMWLDPSENSILFAGGD---GVIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQIL-SGAEDGTVRVWDTKTQ 188 (325)
T ss_pred cCCccceeEeccCCCcEEEecCC---eEEEEEEecCCEEEEEEcCCcceeeeeeecccCccee-ecCCCccEEEEecccc
Confidence 34567788999999988887633 367777777 44433332 34444455433444443 4444556666555433
Q ss_pred c-eEEEEc---C------CCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEE
Q psy6572 642 F-RKVLIN---K------GLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELF 711 (1416)
Q Consensus 642 ~-~~vLi~---~------~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLY 711 (1416)
. ..+|-. . ...|.-+||++.. |.--|..+.+..-+|..+..+.++. +..|.-+ +++..+ +.
T Consensus 189 k~v~~ie~yk~~~~lRp~~g~wigala~~ed-----WlvCGgGp~lslwhLrsse~t~vfp--ipa~v~~-v~F~~d-~v 259 (325)
T KOG0649|consen 189 KHVSMIEPYKNPNLLRPDWGKWIGALAVNED-----WLVCGGGPKLSLWHLRSSESTCVFP--IPARVHL-VDFVDD-CV 259 (325)
T ss_pred ceeEEeccccChhhcCcccCceeEEEeccCc-----eEEecCCCceeEEeccCCCceEEEe--cccceeE-eeeecc-eE
Confidence 3 333322 1 1233455666544 4334555666666676666555543 2222222 222223 33
Q ss_pred EecCCCCeEEEEeCCCC
Q psy6572 712 WGDAHEDYIAVSDLNGE 728 (1416)
Q Consensus 712 WtD~~~~~I~~~~ldG~ 728 (1416)
.+-...+.|.+..+.|.
T Consensus 260 l~~G~g~~v~~~~l~Gv 276 (325)
T KOG0649|consen 260 LIGGEGNHVQSYTLNGV 276 (325)
T ss_pred EEeccccceeeeeeccE
Confidence 33333445666666653
No 276
>KOG1407|consensus
Probab=33.70 E-value=8e+02 Score=28.37 Aligned_cols=207 Identities=15% Similarity=0.115 Sum_probs=95.6
Q ss_pred cceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEeecCCCceEEEEccCCcEEEeeCCC-CeEEEeecC------
Q psy6572 568 NAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPATSPDGLTVDWVGRNLYWCDKGL-DTIEVAKLD------ 639 (1416)
Q Consensus 568 ~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~~-~~I~v~~ld------ 639 (1416)
.+.-|..++....++.+-... ..|.+.... ++... ++.-..+|+|.+=+.. +.|-+.+-+
T Consensus 66 svdql~w~~~~~d~~atas~d--k~ir~wd~r~~k~~~----------~i~~~~eni~i~wsp~g~~~~~~~kdD~it~i 133 (313)
T KOG1407|consen 66 SVDQLCWDPKHPDLFATASGD--KTIRIWDIRSGKCTA----------RIETKGENINITWSPDGEYIAVGNKDDRITFI 133 (313)
T ss_pred chhhheeCCCCCcceEEecCC--ceEEEEEeccCcEEE----------EeeccCcceEEEEcCCCCEEEEecCcccEEEE
Confidence 345566777777777766653 366666555 22211 1112333343332221 122222222
Q ss_pred -CCceEEEEcCC-CCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCCeeEEeecCCCeEEEecCCC
Q psy6572 640 -GRFRKVLINKG-LQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWPNALTISYETNELFWGDAHE 717 (1416)
Q Consensus 640 -G~~~~vLi~~~-l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~ 717 (1416)
-+..+++.... .....-|+-. ..+.||+...|.. .|+....-.-.+..-+...-..=..|.+|+.+ |-|-+-+..
T Consensus 134 d~r~~~~~~~~~~~~e~ne~~w~-~~nd~Fflt~GlG-~v~ILsypsLkpv~si~AH~snCicI~f~p~G-ryfA~GsAD 210 (313)
T KOG1407|consen 134 DARTYKIVNEEQFKFEVNEISWN-NSNDLFFLTNGLG-CVEILSYPSLKPVQSIKAHPSNCICIEFDPDG-RYFATGSAD 210 (313)
T ss_pred Eecccceeehhcccceeeeeeec-CCCCEEEEecCCc-eEEEEeccccccccccccCCcceEEEEECCCC-ceEeecccc
Confidence 12222222211 1224456666 4566777665533 55554443222211111111222357777643 322222212
Q ss_pred CeEEEEeCCCCceEEEEeccCCCCcccccc-eeEEE-ecCcEEEeecCCCeeEEecccCCCceEEEEeCCCCCCeeeeee
Q psy6572 718 DYIAVSDLNGENIKIIVSRRMDPTINLHHV-FALAV-FEDHLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVY 795 (1416)
Q Consensus 718 ~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P-~~lav-~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~ 795 (1416)
..+..-+++----...++ .+..| ..|.+ +.+++.-+-...+.|-.+...+|.....+. ...|+--.+.
T Consensus 211 AlvSLWD~~ELiC~R~is-------RldwpVRTlSFS~dg~~lASaSEDh~IDIA~vetGd~~~eI~---~~~~t~tVAW 280 (313)
T KOG1407|consen 211 ALVSLWDVDELICERCIS-------RLDWPVRTLSFSHDGRMLASASEDHFIDIAEVETGDRVWEIP---CEGPTFTVAW 280 (313)
T ss_pred ceeeccChhHhhhheeec-------cccCceEEEEeccCcceeeccCccceEEeEecccCCeEEEee---ccCCceeEEe
Confidence 222222333221122222 25666 45666 467777777777888888888888766554 2345444556
Q ss_pred cccC
Q psy6572 796 HPYR 799 (1416)
Q Consensus 796 h~~~ 799 (1416)
||.+
T Consensus 281 HPk~ 284 (313)
T KOG1407|consen 281 HPKR 284 (313)
T ss_pred cCCC
Confidence 6644
No 277
>PHA02713 hypothetical protein; Provisional
Probab=33.35 E-value=1.2e+03 Score=30.17 Aligned_cols=133 Identities=8% Similarity=0.041 Sum_probs=66.3
Q ss_pred cCCcEEEeeCC------CCeEEEeecCCCceEEEEcCCCCCcc-eeeecCCcceEEEeeCCC----CceEEEEecCCCCC
Q psy6572 619 VGRNLYWCDKG------LDTIEVAKLDGRFRKVLINKGLQEPR-GIALNPAYGYMYWTDWGQ----NAHIGKAKMDGSNP 687 (1416)
Q Consensus 619 ~~~~LYwtD~~------~~~I~v~~ldG~~~~vLi~~~l~~P~-gIavDp~~g~LYWtD~g~----~~~I~ra~mDGs~r 687 (1416)
.++.||+.-.. ...+++.++....-..+ ..+..|| +.++-...|+||..--.. ...+++.+..-..
T Consensus 302 l~~~IYviGG~~~~~~~~~~v~~Yd~~~n~W~~~--~~m~~~R~~~~~~~~~g~IYviGG~~~~~~~~sve~Ydp~~~~- 378 (557)
T PHA02713 302 VDNEIIIAGGYNFNNPSLNKVYKINIENKIHVEL--PPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDK- 378 (557)
T ss_pred ECCEEEEEcCCCCCCCccceEEEEECCCCeEeeC--CCCcchhhceeEEEECCEEEEECCcCCCCCCceEEEEECCCCe-
Confidence 57899998542 23567777654433222 2344444 223333468999875211 1246666554321
Q ss_pred EEEeecCCCCCe---eEEeecCCCeEEEecCC-----------------------CCeEEEEeCCCCceEEEEeccCCCC
Q psy6572 688 KVIISKNLSWPN---ALTISYETNELFWGDAH-----------------------EDYIAVSDLNGENIKIIVSRRMDPT 741 (1416)
Q Consensus 688 ~vlv~~~l~~P~---gLaiD~~~~rLYWtD~~-----------------------~~~I~~~~ldG~~r~~v~~~~~~p~ 741 (1416)
-..+. .+..|. ++++ .+++||++-.. .+.|++.+.....-..+..-.
T Consensus 379 W~~~~-~mp~~r~~~~~~~--~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~~v~~m~---- 451 (557)
T PHA02713 379 WKMLP-DMPIALSSYGMCV--LDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNFW---- 451 (557)
T ss_pred EEECC-CCCcccccccEEE--ECCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCCeEeecCCCC----
Confidence 11111 222222 2332 47999997432 234555554432222221110
Q ss_pred cccccceeEEEecCcEEEeec
Q psy6572 742 INLHHVFALAVFEDHLFWTDW 762 (1416)
Q Consensus 742 ~~l~~P~~lav~~d~LYwtD~ 762 (1416)
......++++.+++||+.-.
T Consensus 452 -~~r~~~~~~~~~~~IYv~GG 471 (557)
T PHA02713 452 -TGTIRPGVVSHKDDIYVVCD 471 (557)
T ss_pred -cccccCcEEEECCEEEEEeC
Confidence 11233578899999999854
No 278
>KOG2110|consensus
Probab=33.19 E-value=9.5e+02 Score=29.04 Aligned_cols=198 Identities=10% Similarity=-0.038 Sum_probs=0.0
Q ss_pred cEEecCCCceEecCCCCCeEEEEecEEEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC-
Q psy6572 523 YALLSDKHGCKATSDVPPNLLFTNKYYIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN- 599 (1416)
Q Consensus 523 y~L~~dg~sC~a~~~~~~~li~s~~~~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~- 599 (1416)
|.....+..-+..-....++.+.+...-|++.+-... +.+-.-.-...-|++-...++|.+.-.. .|+..+++
T Consensus 40 ~~~~~~~~~IvEmLFSSSLvaiV~~~qpr~Lkv~~~Kk~~~ICe~~fpt~IL~VrmNr~RLvV~Lee----~IyIydI~~ 115 (391)
T KOG2110|consen 40 FSKDTEGVSIVEMLFSSSLVAIVSIKQPRKLKVVHFKKKTTICEIFFPTSILAVRMNRKRLVVCLEE----SIYIYDIKD 115 (391)
T ss_pred hcccCCCeEEEEeecccceeEEEecCCCceEEEEEcccCceEEEEecCCceEEEEEccceEEEEEcc----cEEEEeccc
Q ss_pred -CCCeEEee-cCCCc-eEEEEccCCcEEEeeCC---CCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCC
Q psy6572 600 -SQPELLFP-ATSPD-GLTVDWVGRNLYWCDKG---LDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQ 673 (1416)
Q Consensus 600 -s~~~~l~~-l~~p~-gLAvD~~~~~LYwtD~~---~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~ 673 (1416)
.-..+|.. -.+|. -+|+-+...+-|.+-.+ .+.|.++++........+.-.-....+||+++. |.|.-|-...
T Consensus 116 MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~~lAalafs~~-G~llATASeK 194 (391)
T KOG2110|consen 116 MKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKGPLAALAFSPD-GTLLATASEK 194 (391)
T ss_pred ceeehhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecCCceeEEEECCC-CCEEEEeccC
Q ss_pred CceEEEEecCCCCCEEEeecC--CCCCeeEEeecCCCeEEEecCCCCeEEEEeCC
Q psy6572 674 NAHIGKAKMDGSNPKVIISKN--LSWPNALTISYETNELFWGDAHEDYIAVSDLN 726 (1416)
Q Consensus 674 ~~~I~ra~mDGs~r~vlv~~~--l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ld 726 (1416)
...|.+...--..+..-+..+ ......|++++ ...+.-+-..+++|....++
T Consensus 195 GTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~-ds~~L~~sS~TeTVHiFKL~ 248 (391)
T KOG2110|consen 195 GTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSP-DSQFLAASSNTETVHIFKLE 248 (391)
T ss_pred ceEEEEEEcCCccEeeeeeCCceeeEEEEEEECC-CCCeEEEecCCCeEEEEEec
No 279
>KOG3567|consensus
Probab=32.95 E-value=24 Score=43.02 Aligned_cols=53 Identities=11% Similarity=0.028 Sum_probs=35.5
Q ss_pred ceEEEEecCCCCCEE-EeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCC
Q psy6572 675 AHIGKAKMDGSNPKV-IISKNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGE 728 (1416)
Q Consensus 675 ~~I~ra~mDGs~r~v-lv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~ 728 (1416)
++|.|..+....++. .-...+..|.||.+| .++..|.+|..++.+...+..++
T Consensus 445 ~~ilvi~~~n~~~l~~~g~~~fylphgl~~d-kdgf~~~tdvash~v~k~k~~~~ 498 (501)
T KOG3567|consen 445 DTILVIDPNNAAVLQSSGKNLFYLPHGLSID-KDGFYWVTDVASHQVFKLKPNNK 498 (501)
T ss_pred ceEEEEcCcchhhhhhccCCceecCCcceec-CCCcEEeecccchhhhhcccccc
Confidence 467777777333222 112236789999999 48888888888887777665543
No 280
>KOG0641|consensus
Probab=32.84 E-value=7.4e+02 Score=27.71 Aligned_cols=29 Identities=10% Similarity=0.139 Sum_probs=20.6
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEee
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAK 637 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ 637 (1416)
...++++|+.+ .+.||-+-+..+++.+..
T Consensus 32 sqairav~fhp-~g~lyavgsnskt~ric~ 60 (350)
T KOG0641|consen 32 SQAIRAVAFHP-AGGLYAVGSNSKTFRICA 60 (350)
T ss_pred hhheeeEEecC-CCceEEeccCCceEEEEc
Confidence 45567888886 566888877777666654
No 281
>KOG2106|consensus
Probab=32.73 E-value=1.1e+03 Score=29.67 Aligned_cols=99 Identities=13% Similarity=0.070 Sum_probs=47.2
Q ss_pred eEEEEecEEEEEEecCCcceEEec-ccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCC--eEEee------cCCC
Q psy6572 541 NLLFTNKYYIREVTQAGVMTIRIH-NQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQP--ELLFP------ATSP 611 (1416)
Q Consensus 541 ~li~s~~~~I~~i~l~g~~~~~~~-~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~--~~l~~------l~~p 611 (1416)
+++-+.++.|.+-++....++++. .....-||+.++..+ +|.+-.... .+...+ +.+. ..++. -..|
T Consensus 342 i~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~hps~~-q~~T~gqdk--~v~lW~-~~k~~wt~~~~d~~~~~~fhp 417 (626)
T KOG2106|consen 342 ILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATHPSKN-QLLTCGQDK--HVRLWN-DHKLEWTKIIEDPAECADFHP 417 (626)
T ss_pred EEEeeccceEEEeeecCCceEEEEecccceeeEEcCCChh-heeeccCcc--eEEEcc-CCceeEEEEecCceeEeeccC
Confidence 334456777777777666533332 233567788776544 444433321 222222 2111 11111 1122
Q ss_pred c-eEEEEccCCcEEEeeCCCCeEEEeecCCCce
Q psy6572 612 D-GLTVDWVGRNLYWCDKGLDTIEVAKLDGRFR 643 (1416)
Q Consensus 612 ~-gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~ 643 (1416)
. -||+-..+|++++.|..+..+.....++...
T Consensus 418 sg~va~Gt~~G~w~V~d~e~~~lv~~~~d~~~l 450 (626)
T KOG2106|consen 418 SGVVAVGTATGRWFVLDTETQDLVTIHTDNEQL 450 (626)
T ss_pred cceEEEeeccceEEEEecccceeEEEEecCCce
Confidence 2 3566666777777776665444444444333
No 282
>KOG0303|consensus
Probab=32.27 E-value=8.6e+02 Score=29.63 Aligned_cols=88 Identities=10% Similarity=0.029 Sum_probs=43.0
Q ss_pred cceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEeecCC---CceEEEEccCCcEEEeeCCCCeEEEeecC-CCce
Q psy6572 568 NAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATS---PDGLTVDWVGRNLYWCDKGLDTIEVAKLD-GRFR 643 (1416)
Q Consensus 568 ~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~---p~gLAvD~~~~~LYwtD~~~~~I~v~~ld-G~~~ 643 (1416)
.+--|++++....|..+-...+ .|...++.+... ++.+.+ +.++.+.+ .|.++.|-...++|.+.+.. |...
T Consensus 133 rVg~V~wHPtA~NVLlsag~Dn--~v~iWnv~tgea-li~l~hpd~i~S~sfn~-dGs~l~TtckDKkvRv~dpr~~~~v 208 (472)
T KOG0303|consen 133 RVGLVQWHPTAPNVLLSAGSDN--TVSIWNVGTGEA-LITLDHPDMVYSMSFNR-DGSLLCTTCKDKKVRVIDPRRGTVV 208 (472)
T ss_pred eEEEEeecccchhhHhhccCCc--eEEEEeccCCce-eeecCCCCeEEEEEecc-CCceeeeecccceeEEEcCCCCcEe
Confidence 3444566666666555544432 555555543222 222333 34455554 45566666666778777653 2222
Q ss_pred EEE-EcCCCCCcceeee
Q psy6572 644 KVL-INKGLQEPRGIAL 659 (1416)
Q Consensus 644 ~vL-i~~~l~~P~gIav 659 (1416)
... ...+...+|+|-|
T Consensus 209 ~e~~~heG~k~~Raifl 225 (472)
T KOG0303|consen 209 SEGVAHEGAKPARAIFL 225 (472)
T ss_pred eecccccCCCcceeEEe
Confidence 222 1134444555554
No 283
>PF15416 DUF4623: Domain of unknown function (DUF4623)
Probab=31.69 E-value=9.6e+02 Score=28.65 Aligned_cols=30 Identities=17% Similarity=0.255 Sum_probs=21.9
Q ss_pred eeEEeecC-CCeEEEecCCCCeEEEEeCCCC
Q psy6572 699 NALTISYE-TNELFWGDAHEDYIAVSDLNGE 728 (1416)
Q Consensus 699 ~gLaiD~~-~~rLYWtD~~~~~I~~~~ldG~ 728 (1416)
..+.||.. ++.+|+.|.....|.|+.+.+-
T Consensus 243 ~S~nlD~nGnGyiFFgdnaat~ilR~~vsn~ 273 (442)
T PF15416_consen 243 FSLNLDENGNGYIFFGDNAATNILRFTVSNY 273 (442)
T ss_pred eeEEeccCCceEEEecCCccceEEEEEccCc
Confidence 35677743 5789999988888888876553
No 284
>KOG0645|consensus
Probab=31.61 E-value=8.8e+02 Score=28.19 Aligned_cols=182 Identities=15% Similarity=0.140 Sum_probs=0.0
Q ss_pred eEecCCCCCeEEEEecEE---EEEEecCCcc---eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEE
Q psy6572 532 CKATSDVPPNLLFTNKYY---IREVTQAGVM---TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELL 605 (1416)
Q Consensus 532 C~a~~~~~~~li~s~~~~---I~~i~l~g~~---~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l 605 (1416)
|++-.....+|....|.. |..+.-++.. .++-...+.+..+.+++..+.|+-..+.+. .++++...+...+.+
T Consensus 110 ~Vaws~sG~~LATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~SYDnT-Ik~~~~~~dddW~c~ 188 (312)
T KOG0645|consen 110 CVAWSASGNYLATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSCSYDNT-IKVYRDEDDDDWECV 188 (312)
T ss_pred EEEEcCCCCEEEEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCCcceeEEeccCCe-EEEEeecCCCCeeEE
Q ss_pred ee----cCCCceEEEEccCCcEEEeeCCCC-eEEE--eecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEE
Q psy6572 606 FP----ATSPDGLTVDWVGRNLYWCDKGLD-TIEV--AKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIG 678 (1416)
Q Consensus 606 ~~----l~~p~gLAvD~~~~~LYwtD~~~~-~I~v--~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ 678 (1416)
.. -..+.+|+||+.+.+|--++.... +|.+ ..+.+. ..+.||-.-|. +..|-
T Consensus 189 ~tl~g~~~TVW~~~F~~~G~rl~s~sdD~tv~Iw~~~~~~~~~--------------------~sr~~Y~v~W~-~~~Ia 247 (312)
T KOG0645|consen 189 QTLDGHENTVWSLAFDNIGSRLVSCSDDGTVSIWRLYTDLSGM--------------------HSRALYDVPWD-NGVIA 247 (312)
T ss_pred EEecCccceEEEEEecCCCceEEEecCCcceEeeeeccCcchh--------------------cccceEeeeec-ccceE
Q ss_pred EEecCCCCCEEEeecCCCCCe-------eEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEe
Q psy6572 679 KAKMDGSNPKVIISKNLSWPN-------ALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 679 ra~mDGs~r~vlv~~~l~~P~-------gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~ 735 (1416)
.+.-|+..|....+.....|+ ..+-...-+.|-|.....++|.+..-||.-+..-+.
T Consensus 248 S~ggD~~i~lf~~s~~~d~p~~~l~~~~~~aHe~dVNsV~w~p~~~~~L~s~~DDG~v~~W~l~ 311 (312)
T KOG0645|consen 248 SGGGDDAIRLFKESDSPDEPSWNLLAKKEGAHEVDVNSVQWNPKVSNRLASGGDDGIVNFWELE 311 (312)
T ss_pred eccCCCEEEEEEecCCCCCchHHHHHhhhcccccccceEEEcCCCCCceeecCCCceEEEEEec
No 285
>PF06079 Apyrase: Apyrase; InterPro: IPR009283 This family consists of several eukaryotic apyrase (or adenosine diphosphatase) proteins (3.6.1.5 from EC), and related nucleoside diphosphatases (3.6.1.6 from EC). The salivary apyrases of blood-feeding arthropods are nucleotide hydrolysing enzymes implicated in the inhibition of host platelet aggregation through the hydrolysis of extracellular adenosine diphosphate.; GO: 0005509 calcium ion binding, 0016462 pyrophosphatase activity; PDB: 2H2N_A 1S18_A 2H2U_A 1S1D_B.
Probab=31.57 E-value=2.1e+02 Score=33.43 Aligned_cols=54 Identities=15% Similarity=0.018 Sum_probs=33.8
Q ss_pred CCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccc---eeEEEecCcEEEeec
Q psy6572 706 ETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHV---FALAVFEDHLFWTDW 762 (1416)
Q Consensus 706 ~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P---~~lav~~d~LYwtD~ 762 (1416)
++++||-.|-+++.|+.+..+-.-..+|+..+ +.. ...+ --++|-++.||+-..
T Consensus 62 FngkLys~DDrTGiVyeI~~~~~vPwviL~dG-dG~--~~kGfK~EWaTVKd~~LyvGs~ 118 (291)
T PF06079_consen 62 FNGKLYSFDDRTGIVYEIKGDKAVPWVILSDG-DGN--TSKGFKAEWATVKDDKLYVGSI 118 (291)
T ss_dssp ETTEEEEEETTT-EEEEEETTEEEEEEE-BST-TTT--ESSB----EEEEETTEEEEE--
T ss_pred ECCEEeeeeCCCceEEEEeCCceeceEEEeCC-CCC--ccccccceeeEEeCCeeeeccC
Confidence 68999999999999999987744445555532 111 1222 346888999996543
No 286
>KOG4283|consensus
Probab=31.32 E-value=9.2e+02 Score=28.31 Aligned_cols=172 Identities=13% Similarity=0.076 Sum_probs=0.0
Q ss_pred ccEEEEecCCCCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEE--
Q psy6572 591 SSIRRSCNNSQPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMY-- 667 (1416)
Q Consensus 591 ~~I~r~~l~s~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LY-- 667 (1416)
.+|.++.|+....++.. ...++.|.||...+.+..+-...+.|.|.+|.......--.-..+.-.-|+.+..++.-|
T Consensus 25 rRil~L~Ln~d~d~~r~HgGsvNsL~id~tegrymlSGgadgsi~v~Dl~n~t~~e~s~li~k~~c~v~~~h~~~Hky~i 104 (397)
T KOG4283|consen 25 RRILSLQLNNDKDFVRPHGGSVNSLQIDLTEGRYMLSGGADGSIAVFDLQNATDYEASGLIAKHKCIVAKQHENGHKYAI 104 (397)
T ss_pred hhhheeeccCCcceeccCCCccceeeeccccceEEeecCCCccEEEEEeccccchhhccceeheeeeccccCCccceeee
Q ss_pred ------------EeeCCCCceEEEEecCCCCCEEEee-cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEE
Q psy6572 668 ------------WTDWGQNAHIGKAKMDGSNPKVIIS-KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 668 ------------WtD~g~~~~I~ra~mDGs~r~vlv~-~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~ 734 (1416)
||...-...+.+-+.+-....+.+. ++..+-.++.==....-|.-+-...-.|...++....-..++
T Consensus 105 ss~~WyP~DtGmFtssSFDhtlKVWDtnTlQ~a~~F~me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SGs~sH~L 184 (397)
T KOG4283|consen 105 SSAIWYPIDTGMFTSSSFDHTLKVWDTNTLQEAVDFKMEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASGSFSHTL 184 (397)
T ss_pred eeeEEeeecCceeecccccceEEEeecccceeeEEeecCceeehhhcChhhhcceEEEEecCCCcEEEEeccCCcceeee
Q ss_pred eccCCCCcccccceeEEE--ecCcEEEeecCCCeeE
Q psy6572 735 SRRMDPTINLHHVFALAV--FEDHLFWTDWEMKSIE 768 (1416)
Q Consensus 735 ~~~~~p~~~l~~P~~lav--~~d~LYwtD~~~~~I~ 768 (1416)
++. -....++.. ..++|..|-...++|.
T Consensus 185 sGH------r~~vlaV~Wsp~~e~vLatgsaDg~ir 214 (397)
T KOG4283|consen 185 SGH------RDGVLAVEWSPSSEWVLATGSADGAIR 214 (397)
T ss_pred ccc------cCceEEEEeccCceeEEEecCCCceEE
No 287
>KOG3621|consensus
Probab=30.30 E-value=1.2e+03 Score=30.61 Aligned_cols=263 Identities=14% Similarity=0.048 Sum_probs=0.0
Q ss_pred CCcEEecCCCceEecCCCCCeEEEEecEEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC
Q psy6572 521 PGYALLSDKHGCKATSDVPPNLLFTNKYYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN 599 (1416)
Q Consensus 521 ~Gy~L~~dg~sC~a~~~~~~~li~s~~~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~ 599 (1416)
+||.-..-..+|.+. ...+|.+.....+..+.-.+.. .....+-.....+.+-.....=|++-.+..+++|.++.++
T Consensus 29 ~~~~~~~v~lTc~ds--t~~~l~~GsS~G~lyl~~R~~~~~~~~~~~~~~~~~~~~~vs~~e~lvAagt~~g~V~v~ql~ 106 (726)
T KOG3621|consen 29 PGFFPARVKLTCVDA--TEEYLAMGSSAGSVYLYNRHTGEMRKLKNEGATGITCVRSVSSVEYLVAAGTASGRVSVFQLN 106 (726)
T ss_pred cccCcceEEEEEeec--CCceEEEecccceEEEEecCchhhhcccccCccceEEEEEecchhHhhhhhcCCceEEeehhh
Q ss_pred ---CCCeEEee------cCCCceEEEEccCCcEEEeeCCCCeEEEeecCC-CceEEEEcCCCCCcceee-ecCCcceEEE
Q psy6572 600 ---SQPELLFP------ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDG-RFRKVLINKGLQEPRGIA-LNPAYGYMYW 668 (1416)
Q Consensus 600 ---s~~~~l~~------l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG-~~~~vLi~~~l~~P~gIa-vDp~~g~LYW 668 (1416)
.....++. -..+.+|+-+...-+||..|.. ++|....|+- .....-...-+..+.-|+ ||...++|.+
T Consensus 107 ~~~p~~~~~~t~~d~~~~~rVTal~Ws~~~~k~ysGD~~-Gkv~~~~L~s~~~~~~~~q~il~~ds~IVQlD~~q~~LLV 185 (726)
T KOG3621|consen 107 KELPRDLDYVTPCDKSHKCRVTALEWSKNGMKLYSGDSQ-GKVVLTELDSRQAFLSKSQEILSEDSEIVQLDYLQSYLLV 185 (726)
T ss_pred ccCCCcceeeccccccCCceEEEEEecccccEEeecCCC-ceEEEEEechhhhhccccceeeccCcceEEeecccceehH
Q ss_pred eeCCCCceEEEEecCCCCCEEEeecCCCC--CeeEEeecC----CCeEEEecCCCCeEEEEeCCCCceEEEEeccC----
Q psy6572 669 TDWGQNAHIGKAKMDGSNPKVIISKNLSW--PNALTISYE----TNELFWGDAHEDYIAVSDLNGENIKIIVSRRM---- 738 (1416)
Q Consensus 669 tD~g~~~~I~ra~mDGs~r~vlv~~~l~~--P~gLaiD~~----~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~---- 738 (1416)
+. ..+-....++-...+.|.+..-.. +.|--+-+. ..-..++-.-..+++.+|++|...++.+-...
T Consensus 186 St---l~r~~Lc~tE~eti~QIG~k~R~~~~~~GACF~~g~~~~q~~~IycaRPG~RlWead~~G~V~~Thqfk~ala~~ 262 (726)
T KOG3621|consen 186 ST---LTRCILCQTEAETITQIGKKPRKSLIDFGACFFPGQCKAQKPQIYCARPGLRLWEADFAGEVIKTHQFKDALARP 262 (726)
T ss_pred hh---hhhhheeecchhHHHHhcCCCcCCccccceEEeeccccCCCceEEEecCCCceEEeecceeEEEeeehhhhhccC
Q ss_pred -------------------CCCcccccceeEEEecC-cEEEeecCCCeeEEecccCCCceEEEEeCCCCCCeeeee
Q psy6572 739 -------------------DPTINLHHVFALAVFED-HLFWTDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRV 794 (1416)
Q Consensus 739 -------------------~p~~~l~~P~~lav~~d-~LYwtD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v 794 (1416)
.+...+.-+....+.++ -|-|++.+ |+.+. .-....++..+..+...++..
T Consensus 263 p~p~i~~~s~esp~~~~~~~~~q~ls~~k~~~l~~~~vLa~te~G---iyv~d--~~~~~v~l~se~~~DI~dVs~ 333 (726)
T KOG3621|consen 263 PAPEIPIRSLESPNQRSLPSGTQHLSLSKSSTLHSDRVLAWTEVG---IYVFD--SNNSQVYLWSEGGHDILDVSH 333 (726)
T ss_pred CCCcccCCCcCCccccCCCCCccccccceeEEeecceEEEeecce---EEEEE--eccceEEEeecCCCceeEEee
No 288
>KOG0640|consensus
Probab=30.12 E-value=9.8e+02 Score=28.25 Aligned_cols=181 Identities=10% Similarity=0.058 Sum_probs=82.7
Q ss_pred cEEEEEEecCCcc----eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCCCCeEEee-------cCCCceEE
Q psy6572 547 KYYIREVTQAGVM----TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNSQPELLFP-------ATSPDGLT 615 (1416)
Q Consensus 547 ~~~I~~i~l~g~~----~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~-------l~~p~gLA 615 (1416)
.+.|..++..... ..++.....+.+|.|++.+..|.+.... ..++.+++++. +-+++ ...+..+.
T Consensus 193 D~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHPsGefllvgTdH---p~~rlYdv~T~-QcfvsanPd~qht~ai~~V~ 268 (430)
T KOG0640|consen 193 DNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHPSGEFLLVGTDH---PTLRLYDVNTY-QCFVSANPDDQHTGAITQVR 268 (430)
T ss_pred CCeEEEEecccHHHHHHHHHhhccceeeeEeecCCCceEEEecCC---CceeEEeccce-eEeeecCcccccccceeEEE
Confidence 3446666654433 2344555667888999888776654222 24444444422 11221 11223333
Q ss_pred EEccCCcEEEeeCCCCeEEEee-cCCCceEEEEc-CCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeec
Q psy6572 616 VDWVGRNLYWCDKGLDTIEVAK-LDGRFRKVLIN-KGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISK 693 (1416)
Q Consensus 616 vD~~~~~LYwtD~~~~~I~v~~-ldG~~~~vLi~-~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~ 693 (1416)
+. .+++||+|-+..+.|...+ ..++-.+++.. .+....-+..+- .+|+ |+...|.. .+.+.+-=+++|.++.-.
T Consensus 269 Ys-~t~~lYvTaSkDG~IklwDGVS~rCv~t~~~AH~gsevcSa~Ft-kn~k-yiLsSG~D-S~vkLWEi~t~R~l~~Yt 344 (430)
T KOG0640|consen 269 YS-STGSLYVTASKDGAIKLWDGVSNRCVRTIGNAHGGSEVCSAVFT-KNGK-YILSSGKD-STVKLWEISTGRMLKEYT 344 (430)
T ss_pred ec-CCccEEEEeccCCcEEeeccccHHHHHHHHhhcCCceeeeEEEc-cCCe-EEeecCCc-ceeeeeeecCCceEEEEe
Confidence 33 5789999998887765543 11111111111 111112222222 2222 22222322 333444444555544432
Q ss_pred CC------CCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEe
Q psy6572 694 NL------SWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVS 735 (1416)
Q Consensus 694 ~l------~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~ 735 (1416)
+. .+-.--.+...++.|.+.|..++.+-+.+.....|..++.
T Consensus 345 GAg~tgrq~~rtqAvFNhtEdyVl~pDEas~slcsWdaRtadr~~l~s 392 (430)
T KOG0640|consen 345 GAGTTGRQKHRTQAVFNHTEDYVLFPDEASNSLCSWDARTADRVALLS 392 (430)
T ss_pred cCCcccchhhhhhhhhcCccceEEccccccCceeeccccchhhhhhcc
Confidence 21 1111122344566777777666666555554445544444
No 289
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=29.85 E-value=8.7e+02 Score=27.57 Aligned_cols=113 Identities=16% Similarity=0.122 Sum_probs=60.3
Q ss_pred CCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecC-----------CCCCee-EEeecCCCeEEEecCCCCe
Q psy6572 652 QEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKN-----------LSWPNA-LTISYETNELFWGDAHEDY 719 (1416)
Q Consensus 652 ~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~-----------l~~P~g-LaiD~~~~rLYWtD~~~~~ 719 (1416)
..|..|++- ...|.+.- ......++++......|+... -..|.+ +.+ .++.+..+ ..+.
T Consensus 139 ~~~~~i~~~--~~~i~v~~---~~~f~~idl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~--~~~e~Ll~--~~~~ 209 (275)
T PF00780_consen 139 DPPSSIAFL--GNKICVGT---SKGFYLIDLNTGSPSELLDPSDSSSSFKSRNSSSKPLGIFQL--SDNEFLLC--YDNI 209 (275)
T ss_pred CCcEEEEEe--CCEEEEEe---CCceEEEecCCCCceEEeCccCCcchhhhcccCCCceEEEEe--CCceEEEE--ecce
Confidence 557777776 55676664 235677777744444443211 112332 333 22333232 2344
Q ss_pred EEEEeCCCCceE-EEEeccCCCCcccccceeEEEecCcEEEeecCCCeeEEecccCCCceEEEE
Q psy6572 720 IAVSDLNGENIK-IIVSRRMDPTINLHHVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCTSVV 782 (1416)
Q Consensus 720 I~~~~ldG~~r~-~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~~l~ 782 (1416)
-..++..|...+ ..+. --..|.++++...+|+.... +.|+.-+..+|.-++++.
T Consensus 210 g~fv~~~G~~~r~~~i~-------W~~~p~~~~~~~pyli~~~~--~~iEV~~~~~~~lvQ~i~ 264 (275)
T PF00780_consen 210 GVFVNKNGEPSRKSTIQ-------WSSAPQSVAYSSPYLIAFSS--NSIEVRSLETGELVQTIP 264 (275)
T ss_pred EEEEcCCCCcCcccEEE-------cCCchhEEEEECCEEEEECC--CEEEEEECcCCcEEEEEE
Confidence 556667775443 1111 02368888888888887655 446666666776555553
No 290
>KOG1225|consensus
Probab=29.23 E-value=78 Score=40.01 Aligned_cols=76 Identities=29% Similarity=0.642 Sum_probs=42.2
Q ss_pred ceeeCCCeeecCCcccCCCCCCCCCCCCCCcccccccCCCCCCcccccceecCCceEEeeCCCceecCCCCCccccCCcC
Q psy6572 420 HFLCSNGLCINETLTCNDINDCGDNSDEFSCFVNECNVSHGGQLCAHECIDLKIGYKCACRKGYQVHPEDKHLCVDTNEC 499 (1416)
Q Consensus 420 ~f~C~~g~Ci~~~~~Cdg~~dC~dgsDe~~C~i~eC~~~~~~~~Cs~~C~nt~~gy~C~C~~Gy~L~p~d~~tC~didEC 499 (1416)
.+.|.+|+|| |.+|-....|++..|... |++.=.+..+ +|.|++||.- +.|..-. |
T Consensus 259 ~g~c~~G~CI-----------C~~Gf~G~dC~e~~Cp~~-----cs~~g~~~~g--~CiC~~g~~G-----~dCs~~~-c 314 (525)
T KOG1225|consen 259 RGQCVEGRCI-----------CPPGFTGDDCDELVCPVD-----CSGGGVCVDG--ECICNPGYSG-----KDCSIRR-C 314 (525)
T ss_pred cceEeCCeEe-----------CCCCCcCCCCCcccCCcc-----cCCCceecCC--EeecCCCccc-----ccccccc-C
Confidence 3667777776 555655556665555321 4443334444 8999999953 3343211 1
Q ss_pred CCCCccc--eeeecCCeeeecCCCCcE
Q psy6572 500 LDRPCSH--YCRNTLGSYSCSCAPGYA 524 (1416)
Q Consensus 500 ~~~~Csq--~C~nt~gsy~C~C~~Gy~ 524 (1416)
+..|+. .|+ .-+|.|.+||+
T Consensus 315 -padC~g~G~Ci----~G~C~C~~Gy~ 336 (525)
T KOG1225|consen 315 -PADCSGHGKCI----DGECLCDEGYT 336 (525)
T ss_pred -CccCCCCCccc----CCceEeCCCCc
Confidence 122221 233 23789999997
No 291
>KOG0286|consensus
Probab=28.18 E-value=1e+03 Score=27.93 Aligned_cols=113 Identities=10% Similarity=0.075 Sum_probs=70.8
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCc-eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCC
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRF-RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSN 686 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~-~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~ 686 (1416)
...+-+|+|-+.+.|.|++-.-.+...+-++.... +.++. ......++|.+.|. |+-|.|-. ....+...+|..-.
T Consensus 186 ~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c~qtF~-ghesDINsv~ffP~-G~afatGS-DD~tcRlyDlRaD~ 262 (343)
T KOG0286|consen 186 TGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQCVQTFE-GHESDINSVRFFPS-GDAFATGS-DDATCRLYDLRADQ 262 (343)
T ss_pred cccEEEEecCCCCCCeEEecccccceeeeeccCcceeEeec-ccccccceEEEccC-CCeeeecC-CCceeEEEeecCCc
Confidence 55677888888899999998776666666654333 44443 45556788888876 77777753 23355556665544
Q ss_pred CEEEeec-C-CCCCeeEEeecCCCeEEEecCCCCeEEEEe
Q psy6572 687 PKVIISK-N-LSWPNALTISYETNELFWGDAHEDYIAVSD 724 (1416)
Q Consensus 687 r~vlv~~-~-l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ 724 (1416)
...+.+. . +...+++++. ..+||.++-.....+..-+
T Consensus 263 ~~a~ys~~~~~~gitSv~FS-~SGRlLfagy~d~~c~vWD 301 (343)
T KOG0286|consen 263 ELAVYSHDSIICGITSVAFS-KSGRLLFAGYDDFTCNVWD 301 (343)
T ss_pred EEeeeccCcccCCceeEEEc-ccccEEEeeecCCceeEee
Confidence 4334432 2 3444678886 4777777765555555555
No 292
>KOG2111|consensus
Probab=27.82 E-value=1.1e+03 Score=28.05 Aligned_cols=201 Identities=11% Similarity=0.140 Sum_probs=0.0
Q ss_pred cCCCeEEEeeccCCCccEEEEecCCCCeEEeecCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcc
Q psy6572 576 WVDNCLYWSDVTMHGSSIRRSCNNSQPELLFPATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPR 655 (1416)
Q Consensus 576 ~~~~~LYwtD~~~~~~~I~r~~l~s~~~~l~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~ 655 (1416)
+.++.|.|-|.. ...|.-+.+.+.++.|.-...-.-+++. ++|+|.......+..-.-....+|+
T Consensus 73 ~pNkviIWDD~k--~~~i~el~f~~~I~~V~l~r~riVvvl~-------------~~I~VytF~~n~k~l~~~et~~NPk 137 (346)
T KOG2111|consen 73 PPNKVIIWDDLK--ERCIIELSFNSEIKAVKLRRDRIVVVLE-------------NKIYVYTFPDNPKLLHVIETRSNPK 137 (346)
T ss_pred CCceEEEEeccc--CcEEEEEEeccceeeEEEcCCeEEEEec-------------CeEEEEEcCCChhheeeeecccCCC
Q ss_pred ee-eecCCcceEEEeeCCCC-ceEEEEecCCCCC--EEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEe-CCCCce
Q psy6572 656 GI-ALNPAYGYMYWTDWGQN-AHIGKAKMDGSNP--KVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSD-LNGENI 730 (1416)
Q Consensus 656 gI-avDp~~g~LYWtD~g~~-~~I~ra~mDGs~r--~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~-ldG~~r 730 (1416)
|| ++.|.....+.+=.|.+ ..|..++|.-... ..++........-++|.....+|--+-.+.-.|...+ .+|+.+
T Consensus 138 GlC~~~~~~~k~~LafPg~k~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l 217 (346)
T KOG2111|consen 138 GLCSLCPTSNKSLLAFPGFKTGQVQIVDLASTKPNAPSIINAHDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLL 217 (346)
T ss_pred ceEeecCCCCceEEEcCCCccceEEEEEhhhcCcCCceEEEcccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEe
Q ss_pred EEEEeccCCCCcccccceeEEE-ecCcEEEeecCCCeeEEecccCCCceEE----------EEeCCCCCCeeeeeec
Q psy6572 731 KIIVSRRMDPTINLHHVFALAV-FEDHLFWTDWEMKSIERCDKYTGKNCTS----------VVKNLVHKPMDLRVYH 796 (1416)
Q Consensus 731 ~~v~~~~~~p~~~l~~P~~lav-~~d~LYwtD~~~~~I~~~nk~tG~~~~~----------l~~~~~~~p~~I~v~h 796 (1416)
..+..+. .-.+.+.|++ ....+.......++|............. ++...+..-.+++-++
T Consensus 218 ~E~RRG~-----d~A~iy~iaFSp~~s~LavsSdKgTlHiF~l~~~~~~~~~~SSl~~~~~~lpky~~S~wS~~~f~ 289 (346)
T KOG2111|consen 218 QELRRGV-----DRADIYCIAFSPNSSWLAVSSDKGTLHIFSLRDTENTEDESSSLSFKRLVLPKYFSSEWSFAKFQ 289 (346)
T ss_pred eeeecCC-----chheEEEEEeCCCccEEEEEcCCCeEEEEEeecCCCCccccccccccccccchhcccceeEEEEE
No 293
>KOG0276|consensus
Probab=26.93 E-value=1.5e+03 Score=29.37 Aligned_cols=139 Identities=12% Similarity=0.053 Sum_probs=74.0
Q ss_pred cCCCceEecCCCCCeEEEEe---cEEEEEEecCCcc-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecCC--
Q psy6572 527 SDKHGCKATSDVPPNLLFTN---KYYIREVTQAGVM-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNNS-- 600 (1416)
Q Consensus 527 ~dg~sC~a~~~~~~~li~s~---~~~I~~i~l~g~~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~s-- 600 (1416)
.++..++...+.+|.+|.+- +..|+........ .+-+..++ +.+-.|-.+.+ |+-.+.+...|++++.++
T Consensus 13 SdRVKsVd~HPtePw~la~LynG~V~IWnyetqtmVksfeV~~~P-vRa~kfiaRkn---Wiv~GsDD~~IrVfnynt~e 88 (794)
T KOG0276|consen 13 SDRVKSVDFHPTEPWILAALYNGDVQIWNYETQTMVKSFEVSEVP-VRAAKFIARKN---WIVTGSDDMQIRVFNYNTGE 88 (794)
T ss_pred CCceeeeecCCCCceEEEeeecCeeEEEecccceeeeeeeecccc-hhhheeeeccc---eEEEecCCceEEEEecccce
Confidence 34444666666788877653 3334433322211 22222211 12222322222 222232235788888883
Q ss_pred CCeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceE-EEEcCCCCCcceeeecCCcceEEEee
Q psy6572 601 QPELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRK-VLINKGLQEPRGIALNPAYGYMYWTD 670 (1416)
Q Consensus 601 ~~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~-vLi~~~l~~P~gIavDp~~g~LYWtD 670 (1416)
++.++.. -...+.|||.|..- .+.|.+....|-.-+-++...- ....+.-.....||+.|..-..|.+-
T Consensus 89 kV~~FeAH~DyIR~iavHPt~P-~vLtsSDDm~iKlW~we~~wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~ 159 (794)
T KOG0276|consen 89 KVKTFEAHSDYIRSIAVHPTLP-YVLTSSDDMTIKLWDWENEWACEQTFEGHEHYVMQVAFNPKDPNTFASA 159 (794)
T ss_pred eeEEeeccccceeeeeecCCCC-eEEecCCccEEEEeeccCceeeeeEEcCcceEEEEEEecCCCccceeee
Confidence 3333333 67788999987433 3445555567777777776533 33334445577899999877666654
No 294
>PTZ00486 apyrase Superfamily; Provisional
Probab=26.68 E-value=3.6e+02 Score=32.40 Aligned_cols=56 Identities=14% Similarity=0.124 Sum_probs=36.2
Q ss_pred CCCeEEEecCCCCeEEEEeCCCC--ceEEEEeccCCCCcccccceeEEEecCcEEEee
Q psy6572 706 ETNELFWGDAHEDYIAVSDLNGE--NIKIIVSRRMDPTINLHHVFALAVFEDHLFWTD 761 (1416)
Q Consensus 706 ~~~rLYWtD~~~~~I~~~~ldG~--~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD 761 (1416)
++++||-.|-+++.|+.+..++. -..+|+..+......--..--.+|.++.||+-.
T Consensus 123 FngkLys~DDrTGiVy~i~~~~~~~~PwvIL~dGdG~~~kGfK~EWaTVKd~~LyVGs 180 (352)
T PTZ00486 123 FNGKLYGFDDRTGIVYEIDIDKKKAYPRHILSDGNGNSDKGMKIEWATVYDDKLYVGS 180 (352)
T ss_pred eCCEEEEEeCCceEEEEEEcCCCcEeeEEEEecCCCCCCCCcceeeEEEECCEEEEec
Confidence 68999999999999999987663 334455432111111112345678889999843
No 295
>PF04885 Stig1: Stigma-specific protein, Stig1; InterPro: IPR006969 This family represents the Stig1 cysteine-rich plant protein. The tobacco stigma-specific gene, STIG1, is developmentally regulated and expressed specifically in the stigmatic secretory zone. Pistils of transgenic STIG1-barnase tobacco plants undergo normal development, but lack the stigmatic secretory zone and are female sterile. Pollen grains are unable to penetrate the surface of the ablated pistils. Application of stigmatic exudate from wild-type pistils to the ablated surface increases the efficiency of pollen tube germination and growth and restores the capacity of pollen tubes to penetrate the style. The function of STIG1 is unknown.
Probab=25.94 E-value=1.4e+02 Score=30.99 Aligned_cols=55 Identities=27% Similarity=0.659 Sum_probs=28.7
Q ss_pred CCCC-CCCCCCCCC--CCCCCCCCCCCCceecCCCCceecCCcccCCCCCCCCCCCccccccccCCCCCcCCCCCCcccC
Q psy6572 92 DCPD-ASDEMHCPM--TNCTEKYPLMTNPIHCNFTSACIEESYICDGQNDCFDMSDEQNCDQIKDVSPKMNCSGDKFLCR 168 (1416)
Q Consensus 92 dC~d-~sDE~~C~~--~~C~~~~~~~~~~f~C~~~~~CI~~~~~CDg~~DC~D~sDE~~C~~~~~~~~~~~C~~~~f~C~ 168 (1416)
.|.| .+|..+|.. ..|+.++ .-| +++|+.+ .+|..+|.. =...|+.++ .|.
T Consensus 76 ~Cvdv~~d~~nCG~Cg~~C~~g~------~cC--~G~Cvd~------------~~d~~~CG~-----Cg~~C~~G~-~C~ 129 (136)
T PF04885_consen 76 KCVDVSSDRNNCGACGNKCPYGQ------TCC--GGQCVDL------------NSDPRHCGA-----CGNKCPPGQ-KCV 129 (136)
T ss_pred cCCccCCCccccHhhcCCCCCCc------eec--CCEeECC------------CCCccccCC-----CCCcCCCcC-CcC
Confidence 3544 356777753 5565554 333 2356554 456777751 123465443 566
Q ss_pred CCce
Q psy6572 169 NGNC 172 (1416)
Q Consensus 169 ~g~C 172 (1416)
.|.|
T Consensus 130 ~G~C 133 (136)
T PF04885_consen 130 YGMC 133 (136)
T ss_pred CeEC
Confidence 6665
No 296
>KOG0292|consensus
Probab=25.56 E-value=1.8e+03 Score=29.92 Aligned_cols=114 Identities=15% Similarity=0.151 Sum_probs=59.1
Q ss_pred EEEEEecCCcc--eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEee-----------------
Q psy6572 549 YIREVTQAGVM--TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFP----------------- 607 (1416)
Q Consensus 549 ~I~~i~l~g~~--~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~----------------- 607 (1416)
.+++++-.... ........++.++-|++..+.| .+.... ++|++.+++ +..+++..
T Consensus 231 KlWrmnetKaWEvDtcrgH~nnVssvlfhp~q~lI-lSnsED--ksirVwDm~kRt~v~tfrrendRFW~laahP~lNLf 307 (1202)
T KOG0292|consen 231 KLWRMNETKAWEVDTCRGHYNNVSSVLFHPHQDLI-LSNSED--KSIRVWDMTKRTSVQTFRRENDRFWILAAHPELNLF 307 (1202)
T ss_pred eEEEeccccceeehhhhcccCCcceEEecCcccee-EecCCC--ccEEEEecccccceeeeeccCCeEEEEEecCCccee
Confidence 35555433222 2334455667778887755543 344432 366666665 22222211
Q ss_pred ------------c-CCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEc-CC----CCCcceeeecCCcceEEEe
Q psy6572 608 ------------A-TSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLIN-KG----LQEPRGIALNPAYGYMYWT 669 (1416)
Q Consensus 608 ------------l-~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~-~~----l~~P~gIavDp~~g~LYWt 669 (1416)
+ ..+.+.||. .+.||++. ...|...++....-.++.. .. ...|+.|...|..+-+..+
T Consensus 308 AAgHDsGm~VFkleRErpa~~v~--~n~LfYvk--d~~i~~~d~~t~~d~~v~~lr~~g~~~~~~~smsYNpae~~vlic 383 (1202)
T KOG0292|consen 308 AAGHDSGMIVFKLERERPAYAVN--GNGLFYVK--DRFIRSYDLRTQKDTAVASLRRPGTLWQPPRSLSYNPAENAVLIC 383 (1202)
T ss_pred eeecCCceEEEEEcccCceEEEc--CCEEEEEc--cceEEeeeccccccceeEeccCCCcccCCcceeeeccccCeEEEE
Confidence 1 122333442 34445444 3466666665533333322 11 3568999999999887777
No 297
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=25.54 E-value=3.5e+02 Score=36.50 Aligned_cols=99 Identities=10% Similarity=0.074 Sum_probs=57.7
Q ss_pred cCCcEEEeeCCCCeEEEeecCCCceEEE-EcCCCCCcceeeecCCcceEEE-eeCC---CCceEEEEecCCCC--CEEEe
Q psy6572 619 VGRNLYWCDKGLDTIEVAKLDGRFRKVL-INKGLQEPRGIALNPAYGYMYW-TDWG---QNAHIGKAKMDGSN--PKVII 691 (1416)
Q Consensus 619 ~~~~LYwtD~~~~~I~v~~ldG~~~~vL-i~~~l~~P~gIavDp~~g~LYW-tD~g---~~~~I~ra~mDGs~--r~vlv 691 (1416)
.++.+|+++. +++|.+++.+|...++| +... .....-++-|..++|=+ |-.. ..+.|++.+|+.+. ...|-
T Consensus 318 ~tkiAfv~~~-~~~L~~~D~dG~n~~~ve~~~~-~~i~sP~~SPDG~~vAY~ts~e~~~g~s~vYv~~L~t~~~~~vkl~ 395 (912)
T TIGR02171 318 KAKLAFRNDV-TGNLAYIDYTKGASRAVEIEDT-ISVYHPDISPDGKKVAFCTGIEGLPGKSSVYVRNLNASGSGLVKLP 395 (912)
T ss_pred eeeEEEEEcC-CCeEEEEecCCCCceEEEecCC-CceecCcCCCCCCEEEEEEeecCCCCCceEEEEehhccCCCceEee
Confidence 4566777773 34899999999887776 4322 11223356666666655 4432 35779998887544 33333
Q ss_pred ecCCCCCeeEEee-cCCCeEEEecCCCCe
Q psy6572 692 SKNLSWPNALTIS-YETNELFWGDAHEDY 719 (1416)
Q Consensus 692 ~~~l~~P~gLaiD-~~~~rLYWtD~~~~~ 719 (1416)
-++..-|+==.+. -.+-.+|++|+++++
T Consensus 396 ve~aaiprwrv~e~gdt~ivyv~~a~nn~ 424 (912)
T TIGR02171 396 VENAAIPRWRVLENGDTVIVYVSDASNNK 424 (912)
T ss_pred cccccccceEecCCCCeEEEEEcCCCCCc
Confidence 3343344432332 112368999987764
No 298
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=25.30 E-value=76 Score=25.23 Aligned_cols=20 Identities=30% Similarity=0.534 Sum_probs=16.2
Q ss_pred CCCcceeeecCCcceEEEeeC
Q psy6572 651 LQEPRGIALNPAYGYMYWTDW 671 (1416)
Q Consensus 651 l~~P~gIavDp~~g~LYWtD~ 671 (1416)
...|.+||||+. |.||++-+
T Consensus 12 ~~~~~~IavD~~-GNiYv~G~ 31 (38)
T PF06739_consen 12 QDYGNGIAVDSN-GNIYVTGY 31 (38)
T ss_pred ceeEEEEEECCC-CCEEEEEe
Confidence 356999999976 88999854
No 299
>KOG0284|consensus
Probab=25.03 E-value=2.4e+02 Score=34.14 Aligned_cols=98 Identities=12% Similarity=0.115 Sum_probs=65.6
Q ss_pred cceeeeeecCCCeEEEeeccCCCccEEEEecC-CCCeEEee--cCCCceEEEEccCCcEEEeeCCCCeEEEeecCC-Cce
Q psy6572 568 NAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN-SQPELLFP--ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDG-RFR 643 (1416)
Q Consensus 568 ~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~--l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG-~~~ 643 (1416)
.+.+|.+-..+..+.-.|. ++.|....++ .++..+.. -..+++||+-+ +...|.|-+..++|.+-+.-- +.-
T Consensus 140 ~Vr~m~ws~~g~wmiSgD~---gG~iKyWqpnmnnVk~~~ahh~eaIRdlafSp-nDskF~t~SdDg~ikiWdf~~~kee 215 (464)
T KOG0284|consen 140 PVRTMKWSHNGTWMISGDK---GGMIKYWQPNMNNVKIIQAHHAEAIRDLAFSP-NDSKFLTCSDDGTIKIWDFRMPKEE 215 (464)
T ss_pred cceeEEEccCCCEEEEcCC---CceEEecccchhhhHHhhHhhhhhhheeccCC-CCceeEEecCCCeEEEEeccCCchh
Confidence 3455665544433332222 3567777766 44444443 46789999998 888999999999998877543 333
Q ss_pred EEEEcCCCCCcceeeecCCcceEEEee
Q psy6572 644 KVLINKGLQEPRGIALNPAYGYMYWTD 670 (1416)
Q Consensus 644 ~vLi~~~l~~P~gIavDp~~g~LYWtD 670 (1416)
++| ....-.|+.++-+|.+|.|+..-
T Consensus 216 ~vL-~GHgwdVksvdWHP~kgLiasgs 241 (464)
T KOG0284|consen 216 RVL-RGHGWDVKSVDWHPTKGLIASGS 241 (464)
T ss_pred hee-ccCCCCcceeccCCccceeEEcc
Confidence 444 45556699999999999887654
No 300
>KOG1645|consensus
Probab=24.79 E-value=1.4e+03 Score=28.16 Aligned_cols=75 Identities=9% Similarity=-0.035 Sum_probs=44.9
Q ss_pred EecccccceeeeeecCCC-eEEEeeccCCCccEEEEecCCCCeEE--eecCCCceEEEEccCCcEEEeeCCCCeEEEeec
Q psy6572 562 RIHNQTNAVGLDFDWVDN-CLYWSDVTMHGSSIRRSCNNSQPELL--FPATSPDGLTVDWVGRNLYWCDKGLDTIEVAKL 638 (1416)
Q Consensus 562 ~~~~l~~~~~l~~D~~~~-~LYwtD~~~~~~~I~r~~l~s~~~~l--~~l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~l 638 (1416)
+......+.+|+|.+..+ .|-..... .+|..+.+.+...+. ..-..+.+.+.|-...+..++--.++.|.+.+|
T Consensus 189 lp~~g~~IrdlafSp~~~GLl~~asl~---nkiki~dlet~~~vssy~a~~~~wSC~wDlde~h~IYaGl~nG~VlvyD~ 265 (463)
T KOG1645|consen 189 LPGEGSFIRDLAFSPFNEGLLGLASLG---NKIKIMDLETSCVVSSYIAYNQIWSCCWDLDERHVIYAGLQNGMVLVYDM 265 (463)
T ss_pred ccccchhhhhhccCccccceeeeeccC---ceEEEEecccceeeeheeccCCceeeeeccCCcceeEEeccCceEEEEEc
Confidence 333444567778766555 33333333 367777776322211 114667888888776666666666788888887
Q ss_pred C
Q psy6572 639 D 639 (1416)
Q Consensus 639 d 639 (1416)
.
T Consensus 266 R 266 (463)
T KOG1645|consen 266 R 266 (463)
T ss_pred c
Confidence 5
No 301
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the rest of the protein and is left attached to the surface of the invading parasite.; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=24.10 E-value=87 Score=24.94 Aligned_cols=22 Identities=32% Similarity=0.584 Sum_probs=16.4
Q ss_pred eeeecC-CeeeecCCCCcEEecC
Q psy6572 507 YCRNTL-GSYSCSCAPGYALLSD 528 (1416)
Q Consensus 507 ~C~nt~-gsy~C~C~~Gy~L~~d 528 (1416)
.|++.. |++.|.|..||.+..+
T Consensus 12 ~C~~~~dG~eecrCllgyk~~~~ 34 (37)
T PF12946_consen 12 GCFRYDDGSEECRCLLGYKKVGG 34 (37)
T ss_dssp EEEEETTSEEEEEE-TTEEEETT
T ss_pred ccEEcCCCCEEEEeeCCccccCC
Confidence 567766 8999999999987543
No 302
>PRK10115 protease 2; Provisional
Probab=23.91 E-value=1.8e+03 Score=29.32 Aligned_cols=115 Identities=12% Similarity=0.095 Sum_probs=60.9
Q ss_pred EEEccCCcEEEeeC-C--CCeEEEeecCCC-ceEEEEcCC-CCCcceeeecCCcceEEEeeC-CCCceEEEEecCCCCCE
Q psy6572 615 TVDWVGRNLYWCDK-G--LDTIEVAKLDGR-FRKVLINKG-LQEPRGIALNPAYGYMYWTDW-GQNAHIGKAKMDGSNPK 688 (1416)
Q Consensus 615 AvD~~~~~LYwtD~-~--~~~I~v~~ldG~-~~~vLi~~~-l~~P~gIavDp~~g~LYWtD~-g~~~~I~ra~mDGs~r~ 688 (1416)
.+...++.||+... + +.+|.++.+.+. ..++|+... -....++++. .++|+++-. +...+|..+++++....
T Consensus 274 ~~~~~~~~ly~~tn~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~i~~~~~~--~~~l~~~~~~~g~~~l~~~~~~~~~~~ 351 (686)
T PRK10115 274 SLDHYQHRFYLRSNRHGKNFGLYRTRVRDEQQWEELIPPRENIMLEGFTLF--TDWLVVEERQRGLTSLRQINRKTREVI 351 (686)
T ss_pred EEEeCCCEEEEEEcCCCCCceEEEecCCCcccCeEEECCCCCCEEEEEEEE--CCEEEEEEEeCCEEEEEEEcCCCCceE
Confidence 33344567777543 2 346777777632 234555442 2345667775 456666542 44467888887765433
Q ss_pred EEeecCCCCCeeE-----EeecCCCeEEEecC---CCCeEEEEeCCCCceEEEE
Q psy6572 689 VIISKNLSWPNAL-----TISYETNELFWGDA---HEDYIAVSDLNGENIKIIV 734 (1416)
Q Consensus 689 vlv~~~l~~P~gL-----aiD~~~~rLYWtD~---~~~~I~~~~ldG~~r~~v~ 734 (1416)
.|. +..|.++ +.++.+++|+++-. .-..|+..++.+...+++.
T Consensus 352 ~l~---~~~~~~~~~~~~~~~~~~~~~~~~~ss~~~P~~~y~~d~~~~~~~~l~ 402 (686)
T PRK10115 352 GIA---FDDPAYVTWIAYNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVLK 402 (686)
T ss_pred Eec---CCCCceEeeecccCCCCCceEEEEEecCCCCCEEEEEECCCCcEEEEE
Confidence 332 1123222 22233455654432 2357888888765555444
No 303
>COG4222 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.87 E-value=1.4e+03 Score=28.09 Aligned_cols=64 Identities=19% Similarity=0.090 Sum_probs=41.6
Q ss_pred CCCcceeeecCCcceEEEeeC-----CCCceEEEEecCCCCCEEEeecC--C--C---------CCeeEEeecCCCeEEE
Q psy6572 651 LQEPRGIALNPAYGYMYWTDW-----GQNAHIGKAKMDGSNPKVIISKN--L--S---------WPNALTISYETNELFW 712 (1416)
Q Consensus 651 l~~P~gIavDp~~g~LYWtD~-----g~~~~I~ra~mDGs~r~vlv~~~--l--~---------~P~gLaiD~~~~rLYW 712 (1416)
...|.++++.+....++|+.- .-.|.|.+.+++|+...++.... + . .-.||||.+...+||-
T Consensus 137 ~~~~~~ralt~~d~~~~s~~~~~igdefgP~l~~f~~~Gk~~~~~~~~~~~~~~~~p~g~~~n~gfEglait~d~~~L~~ 216 (391)
T COG4222 137 GEDPEGRALTPADFDVESSQGAWIGDEFGPYLLEFDANGKLVRVLEVPVRFLPPDNPKGLRNNLGFEGLAITPDGKKLYA 216 (391)
T ss_pred ccCchhhcccCCCcceeeccccccccccCcceEEECCCCccccccccccccCcCCCccccccccceeeEEecCCCceEEE
Confidence 344677777776666665542 23489999999998876654321 1 1 1237888887788886
Q ss_pred ec
Q psy6572 713 GD 714 (1416)
Q Consensus 713 tD 714 (1416)
+-
T Consensus 217 ~l 218 (391)
T COG4222 217 LL 218 (391)
T ss_pred EE
Confidence 63
No 304
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=23.83 E-value=1.1e+03 Score=26.85 Aligned_cols=111 Identities=19% Similarity=0.227 Sum_probs=70.1
Q ss_pred CCceEEEEccCCcEEEee--CCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCc-eEEEEecCCCC
Q psy6572 610 SPDGLTVDWVGRNLYWCD--KGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNA-HIGKAKMDGSN 686 (1416)
Q Consensus 610 ~p~gLAvD~~~~~LYwtD--~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~-~I~ra~mDGs~ 686 (1416)
.+...|+-+.++.+.++. .....+++...++....++....|..| .+|+. |+|+..+.+... ++.+...+|+.
T Consensus 25 ~~~s~AvS~dg~~~A~v~~~~~~~~L~~~~~~~~~~~~~~g~~l~~P---S~d~~-g~~W~v~~~~~~~~~~~~~~~g~~ 100 (253)
T PF10647_consen 25 DVTSPAVSPDGSRVAAVSEGDGGRSLYVGPAGGPVRPVLTGGSLTRP---SWDPD-GWVWTVDDGSGGVRVVRDSASGTG 100 (253)
T ss_pred cccceEECCCCCeEEEEEEcCCCCEEEEEcCCCcceeeccCCccccc---cccCC-CCEEEEEcCCCceEEEEecCCCcc
Confidence 567778887787777766 566788888888877776643445444 78888 888777755432 22222344665
Q ss_pred CEEEeecC-CC-CCeeEEeecCCCeEEEec--CCCCeEEEEe
Q psy6572 687 PKVIISKN-LS-WPNALTISYETNELFWGD--AHEDYIAVSD 724 (1416)
Q Consensus 687 r~vlv~~~-l~-~P~gLaiD~~~~rLYWtD--~~~~~I~~~~ 724 (1416)
..+.+... +. ....|.|.+...||-++- ...++|+.+.
T Consensus 101 ~~~~v~~~~~~~~I~~l~vSpDG~RvA~v~~~~~~~~v~va~ 142 (253)
T PF10647_consen 101 EPVEVDWPGLRGRITALRVSPDGTRVAVVVEDGGGGRVYVAG 142 (253)
T ss_pred eeEEecccccCCceEEEEECCCCcEEEEEEecCCCCeEEEEE
Confidence 55555432 33 457899988877775543 2335555554
No 305
>KOG0646|consensus
Probab=23.68 E-value=1.5e+03 Score=28.23 Aligned_cols=31 Identities=19% Similarity=0.120 Sum_probs=22.1
Q ss_pred CCCeeEEeecCCCeEEEecCCCCeEEEEeCCC
Q psy6572 696 SWPNALTISYETNELFWGDAHEDYIAVSDLNG 727 (1416)
Q Consensus 696 ~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG 727 (1416)
..+.+++||+.+.++|..- ..+.|+..++.+
T Consensus 218 ~si~av~lDpae~~~yiGt-~~G~I~~~~~~~ 248 (476)
T KOG0646|consen 218 SSIKAVALDPAERVVYIGT-EEGKIFQNLLFK 248 (476)
T ss_pred CcceeEEEcccccEEEecC-CcceEEeeehhc
Confidence 3468999999887777654 456777777543
No 306
>KOG4532|consensus
Probab=22.53 E-value=1.1e+03 Score=27.29 Aligned_cols=140 Identities=15% Similarity=0.119 Sum_probs=64.9
Q ss_pred eEEEeecCCCc-eEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEe-ecCCCCCeeEEeecCCCe
Q psy6572 632 TIEVAKLDGRF-RKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVII-SKNLSWPNALTISYETNE 709 (1416)
Q Consensus 632 ~I~v~~ldG~~-~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv-~~~l~~P~gLaiD~~~~r 709 (1416)
+|.++.|+|.. +..+....+. -..+++++...+|-.. |..++|.+..+|-....++- ...-..-.|+.....+..
T Consensus 139 t~k~~~~~~~s~~~~~h~~~~~-~ns~~~snd~~~~~~V--gds~~Vf~y~id~~sey~~~~~~a~t~D~gF~~S~s~~~ 215 (344)
T KOG4532|consen 139 TGKTMVVSGDSNKFAVHNQNLT-QNSLHYSNDPSWGSSV--GDSRRVFRYAIDDESEYIENIYEAPTSDHGFYNSFSEND 215 (344)
T ss_pred ceeEEEEecCcccceeeccccc-eeeeEEcCCCceEEEe--cCCCcceEEEeCCccceeeeeEecccCCCceeeeeccCc
Confidence 44444444433 2333323333 5677888776665443 45567888888766544332 111111123333332322
Q ss_pred EEEe-cCCCC--eEEEEeCCCCceEEEEeccCCCCccccccee-EE--Ee-----cCcEEEeecCCCeeEEecccCCCce
Q psy6572 710 LFWG-DAHED--YIAVSDLNGENIKIIVSRRMDPTINLHHVFA-LA--VF-----EDHLFWTDWEMKSIERCDKYTGKNC 778 (1416)
Q Consensus 710 LYWt-D~~~~--~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~-la--v~-----~d~LYwtD~~~~~I~~~nk~tG~~~ 778 (1416)
+-+| -...+ .|+-+...++-...+-+.. ++|.| +. -| -+.||+++.. ..+..++..++.+.
T Consensus 216 ~~FAv~~Qdg~~~I~DVR~~~tpm~~~sstr-------p~hnGa~R~c~Fsl~g~lDLLf~sEhf-s~~hv~D~R~~~~~ 287 (344)
T KOG4532|consen 216 LQFAVVFQDGTCAIYDVRNMATPMAEISSTR-------PHHNGAFRVCRFSLYGLLDLLFISEHF-SRVHVVDTRNYVNH 287 (344)
T ss_pred ceEEEEecCCcEEEEEecccccchhhhcccC-------CCCCCceEEEEecCCCcceEEEEecCc-ceEEEEEcccCcee
Confidence 2222 22222 3444444554444333321 12211 11 11 2456666643 45667777778777
Q ss_pred EEEE
Q psy6572 779 TSVV 782 (1416)
Q Consensus 779 ~~l~ 782 (1416)
++++
T Consensus 288 q~I~ 291 (344)
T KOG4532|consen 288 QVIV 291 (344)
T ss_pred eEEe
Confidence 7765
No 307
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate.; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=22.43 E-value=1.4e+03 Score=28.79 Aligned_cols=41 Identities=15% Similarity=0.184 Sum_probs=28.5
Q ss_pred CCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEec
Q psy6572 696 SWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSR 736 (1416)
Q Consensus 696 ~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~ 736 (1416)
.+.|+|.+|...+.|+++-...+.|..++.......-++..
T Consensus 271 ~H~Nsi~yd~~dd~iivSsR~~s~V~~Id~~t~~i~Wilg~ 311 (477)
T PF05935_consen 271 LHINSIDYDPSDDSIIVSSRHQSAVIKIDYRTGKIKWILGP 311 (477)
T ss_dssp --EEEEEEETTTTEEEEEETTT-EEEEEE-TTS-EEEEES-
T ss_pred cccCccEEeCCCCeEEEEcCcceEEEEEECCCCcEEEEeCC
Confidence 34578999988899999988888999999766666655543
No 308
>KOG3881|consensus
Probab=22.16 E-value=1.5e+03 Score=27.67 Aligned_cols=81 Identities=14% Similarity=0.162 Sum_probs=49.1
Q ss_pred CCCEEEeecC-CCCC-eeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeec
Q psy6572 685 SNPKVIISKN-LSWP-NALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDW 762 (1416)
Q Consensus 685 s~r~vlv~~~-l~~P-~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~ 762 (1416)
..|+.+..-. +.+| ..+++++..+.||+++.. +.|..+++.+ .+++...- .+ ....+.+|..+...-|.+..
T Consensus 235 ~qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~-g~l~~FD~r~--~kl~g~~~--kg-~tGsirsih~hp~~~~las~ 308 (412)
T KOG3881|consen 235 HQRRPVAQFDFLENPISSTGLTPSGNFIYTGNTK-GQLAKFDLRG--GKLLGCGL--KG-ITGSIRSIHCHPTHPVLASC 308 (412)
T ss_pred ccCcceeEeccccCcceeeeecCCCcEEEEeccc-chhheecccC--ceeecccc--CC-ccCCcceEEEcCCCceEEee
Confidence 4455444432 3444 578999999999999864 3455555444 33333210 00 14567788888777777777
Q ss_pred CCCeeEEec
Q psy6572 763 EMKSIERCD 771 (1416)
Q Consensus 763 ~~~~I~~~n 771 (1416)
+..+..|++
T Consensus 309 GLDRyvRIh 317 (412)
T KOG3881|consen 309 GLDRYVRIH 317 (412)
T ss_pred ccceeEEEe
Confidence 777777764
No 309
>KOG3516|consensus
Probab=21.88 E-value=1.3e+03 Score=32.10 Aligned_cols=272 Identities=15% Similarity=0.223 Sum_probs=0.0
Q ss_pred ccccCCcCCCCCccc--eeeecCCeeeecCC-CCcEEecCCCceEecCCC---CCeEEEEecEEEEEEecCCc-------
Q psy6572 492 LCVDTNECLDRPCSH--YCRNTLGSYSCSCA-PGYALLSDKHGCKATSDV---PPNLLFTNKYYIREVTQAGV------- 558 (1416)
Q Consensus 492 tC~didEC~~~~Csq--~C~nt~gsy~C~C~-~Gy~L~~dg~sC~a~~~~---~~~li~s~~~~I~~i~l~g~------- 558 (1416)
.|.-++.|.+++|.| .|..+-..|.|.|. .||. |.+|.....+ +.+-.......-..++.+|+
T Consensus 541 ~C~i~drClPN~CehgG~C~Qs~~~f~C~C~~TGY~----GatCHtsi~e~SCeay~~~~~t~~~~~iD~DGsGpl~Pl~ 616 (1306)
T KOG3516|consen 541 MCGISDRCLPNPCEHGGKCSQSWDDFECNCELTGYK----GATCHTSIYELSCEAYKNIGQTSGNFLIDSDGSGPLEPLQ 616 (1306)
T ss_pred ccccccccCCccccCCCcccccccceeEeccccccc----cccccCCCcchhhHHhhhhccccceEEEccCCCCcccceE
Q ss_pred ---------c-eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC--CCCeEEee--------cCCCceEEEEc
Q psy6572 559 ---------M-TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN--SQPELLFP--------ATSPDGLTVDW 618 (1416)
Q Consensus 559 ---------~-~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~--s~~~~l~~--------l~~p~gLAvD~ 618 (1416)
. +++.++...+.-|......+.+-.+-...-...-..+.++ ..-+..+. +..|.+.-+-|
T Consensus 617 v~C~~~ed~awTvv~H~~~~~t~V~g~n~~g~~~~s~~y~as~eQ~~al~n~se~CeQ~i~y~C~~sRllnt~~g~P~Sw 696 (1306)
T KOG3516|consen 617 VYCNITEDRAWTVVQHDNLGTTRVRGSNPEGPVAISLFYAASMEQLQALLNRSEHCEQEIEYSCRESRLLNTPDGTPFSW 696 (1306)
T ss_pred EEEecccCceEEEEEeCCccceEEeccCCCCceeEeeehhccHHHHHHHhhhhhhhheeeeeeeccceeeeCCCCCeeEE
Q ss_pred -----cCCcEEEeeCCCC------eEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCC
Q psy6572 619 -----VGRNLYWCDKGLD------TIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNP 687 (1416)
Q Consensus 619 -----~~~~LYwtD~~~~------~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r 687 (1416)
..+.+||..+..+ .|...-++-...-..-+....|-..--+-+...+|=+|- -.|.-..+..+..
T Consensus 697 wigr~ne~~~yWGGs~Pg~qkC~Cgi~~nC~d~~~~CNCDa~~~ewt~Dtg~l~~k~hLPVt~----vv~gdTg~~~sea 772 (1306)
T KOG3516|consen 697 WIGRSNEGHVYWGGSGPGLQKCECGLLGNCLDPQLYCNCDADEKEWTTDTGCLAYKDHLPVTQ----VVIGDTGRSQSEA 772 (1306)
T ss_pred EecccCCccceecCCCCccceeeccccccccCcceeeeccCCCccccccccccchhhcCCeeE----EEEccCCCccccc
Q ss_pred EEEee-----cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEE---ecCcEEE
Q psy6572 688 KVIIS-----KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAV---FEDHLFW 759 (1416)
Q Consensus 688 ~vlv~-----~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav---~~d~LYw 759 (1416)
...+. .....-+.+++.-.+.+|-+.+.+.. +..+..+.+ ...-||.
T Consensus 773 ~~~lgPLrC~gDr~~wnsvSF~~~~syL~fp~f~~~-------------------------~saDIsf~FrTt~~~gvfl 827 (1306)
T KOG3516|consen 773 PYVLGPLRCEGDRNFWNSVSFHTGASYLHFPPFHNE-------------------------LSADISFFFRTTASSGVFL 827 (1306)
T ss_pred ceeecceEeecccccccceEeecCcceeecCcccCc-------------------------ccccEEEEEEecCCceEee
Q ss_pred eecCCCeeEEecccCCCceEEEEeCCCCCCeeeeeecc
Q psy6572 760 TDWEMKSIERCDKYTGKNCTSVVKNLVHKPMDLRVYHP 797 (1416)
Q Consensus 760 tD~~~~~I~~~nk~tG~~~~~l~~~~~~~p~~I~v~h~ 797 (1416)
-..+..-..++...++..++.-. .....|..+.|-.+
T Consensus 828 en~g~~dfir~eL~~~~~vtf~~-dvgnGp~~~~V~s~ 864 (1306)
T KOG3516|consen 828 ENHGINDFIRLELSSPVEVTFAF-DVGNGPSQLTVRSP 864 (1306)
T ss_pred eccCCCceEEEEEcCCCceEEEE-EcCCCceeEEEcCC
No 310
>PHA02790 Kelch-like protein; Provisional
Probab=21.83 E-value=1.7e+03 Score=28.11 Aligned_cols=129 Identities=15% Similarity=0.034 Sum_probs=66.8
Q ss_pred CCCeEEEeeccCCCccEEEEecC-CCCeEEeecCCCc-eEEEEccCCcEEEeeCCC---CeEEEeecCCCceEEEEcCCC
Q psy6572 577 VDNCLYWSDVTMHGSSIRRSCNN-SQPELLFPATSPD-GLTVDWVGRNLYWCDKGL---DTIEVAKLDGRFRKVLINKGL 651 (1416)
Q Consensus 577 ~~~~LYwtD~~~~~~~I~r~~l~-s~~~~l~~l~~p~-gLAvD~~~~~LYwtD~~~---~~I~v~~ldG~~~~vLi~~~l 651 (1416)
.+++||.+-.......+.+.... .....+.++..|+ +.++=..+++||+.-... ..+++.+.....-+.+ ..+
T Consensus 317 ~~~~iYviGG~~~~~sve~ydp~~n~W~~~~~l~~~r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~~W~~~--~~m 394 (480)
T PHA02790 317 ANNKLYVVGGLPNPTSVERWFHGDAAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETDTTTEYLLPNHDQWQFG--PST 394 (480)
T ss_pred ECCEEEEECCcCCCCceEEEECCCCeEEECCCCCCCCcccEEEEECCEEEEecCcCCCCccEEEEeCCCCEEEeC--CCC
Confidence 47889988654322356666655 3344444443333 333334679999975432 3566666654332222 234
Q ss_pred CCcce-eeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCCC---eeEEeecCCCeEEEecC
Q psy6572 652 QEPRG-IALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSWP---NALTISYETNELFWGDA 715 (1416)
Q Consensus 652 ~~P~g-IavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~P---~gLaiD~~~~rLYWtD~ 715 (1416)
..|+. .++-...|+||+.- | ..++.+.. +++-..+. .+..| .|+++ .+++||++-.
T Consensus 395 ~~~r~~~~~~~~~~~IYv~G-G---~~e~ydp~-~~~W~~~~-~m~~~r~~~~~~v--~~~~IYviGG 454 (480)
T PHA02790 395 YYPHYKSCALVFGRRLFLVG-R---NAEFYCES-SNTWTLID-DPIYPRDNPELII--VDNKLLLIGG 454 (480)
T ss_pred CCccccceEEEECCEEEEEC-C---ceEEecCC-CCcEeEcC-CCCCCccccEEEE--ECCEEEEECC
Confidence 44431 11113458999874 2 34555543 23333332 23333 24555 4788998753
No 311
>KOG0274|consensus
Probab=21.40 E-value=1.8e+03 Score=28.40 Aligned_cols=172 Identities=14% Similarity=0.109 Sum_probs=79.4
Q ss_pred cEEEEecC-CC-CeEEee-cCCCceEEEEccCCcEEEeeCCCCeEEEeecC-CCceEEEEcCCCCCcceeeecCCcceEE
Q psy6572 592 SIRRSCNN-SQ-PELLFP-ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLD-GRFRKVLINKGLQEPRGIALNPAYGYMY 667 (1416)
Q Consensus 592 ~I~r~~l~-s~-~~~l~~-l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ld-G~~~~vLi~~~l~~P~gIavDp~~g~LY 667 (1416)
.|++..+. +. ..++.. ...+..|.++ +.+.++-...++|.|-+.. ++..++|- +...+..+|+++.. .++|
T Consensus 312 tVkVW~v~n~~~l~l~~~h~~~V~~v~~~---~~~lvsgs~d~~v~VW~~~~~~cl~sl~-gH~~~V~sl~~~~~-~~~~ 386 (537)
T KOG0274|consen 312 TVKVWDVTNGACLNLLRGHTGPVNCVQLD---EPLLVSGSYDGTVKVWDPRTGKCLKSLS-GHTGRVYSLIVDSE-NRLL 386 (537)
T ss_pred eEEEEeccCcceEEEeccccccEEEEEec---CCEEEEEecCceEEEEEhhhceeeeeec-CCcceEEEEEecCc-ceEE
Confidence 55555555 22 223321 4445556655 5555555555677766654 33334443 34445677788775 4444
Q ss_pred EeeCCCCceEEEEecCCC-CCEEEeecCCCCCeeEEeecCCCeEEEecCCCCeEEEEeC-CCCceEEEEeccCCCCcccc
Q psy6572 668 WTDWGQNAHIGKAKMDGS-NPKVIISKNLSWPNALTISYETNELFWGDAHEDYIAVSDL-NGENIKIIVSRRMDPTINLH 745 (1416)
Q Consensus 668 WtD~g~~~~I~ra~mDGs-~r~vlv~~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~l-dG~~r~~v~~~~~~p~~~l~ 745 (1416)
=... ...|..-+|.+. .....+........+|.+ .+.++......+.|..-+. +|..++++.... ..
T Consensus 387 Sgs~--D~~IkvWdl~~~~~c~~tl~~h~~~v~~l~~---~~~~Lvs~~aD~~Ik~WD~~~~~~~~~~~~~~------~~ 455 (537)
T KOG0274|consen 387 SGSL--DTTIKVWDLRTKRKCIHTLQGHTSLVSSLLL---RDNFLVSSSADGTIKLWDAEEGECLRTLEGRH------VG 455 (537)
T ss_pred eeee--ccceEeecCCchhhhhhhhcCCccccccccc---ccceeEeccccccEEEeecccCceeeeeccCC------cc
Confidence 3321 135666666665 222222212222234444 2344444444556666664 444444443311 22
Q ss_pred cceeEEEecCcEEEeecCCCeeEEecccCCCceEE
Q psy6572 746 HVFALAVFEDHLFWTDWEMKSIERCDKYTGKNCTS 780 (1416)
Q Consensus 746 ~P~~lav~~d~LYwtD~~~~~I~~~nk~tG~~~~~ 780 (1416)
+..++++. ...+.+....++|..-+..+|+....
T Consensus 456 ~v~~l~~~-~~~il~s~~~~~~~l~dl~~~~~~~~ 489 (537)
T KOG0274|consen 456 GVSALALG-KEEILCSSDDGSVKLWDLRSGTLIRT 489 (537)
T ss_pred cEEEeecC-cceEEEEecCCeeEEEecccCchhhh
Confidence 22222222 23444444444444445555554443
No 312
>KOG3545|consensus
Probab=21.05 E-value=6.2e+02 Score=29.02 Aligned_cols=132 Identities=18% Similarity=0.217 Sum_probs=68.3
Q ss_pred CCeEEEeeccCCCccEEEEecCCCC----eEEee--cCCC----------ceEEEEccCCc-EEEeeCCCCeEEEeecCC
Q psy6572 578 DNCLYWSDVTMHGSSIRRSCNNSQP----ELLFP--ATSP----------DGLTVDWVGRN-LYWCDKGLDTIEVAKLDG 640 (1416)
Q Consensus 578 ~~~LYwtD~~~~~~~I~r~~l~s~~----~~l~~--l~~p----------~gLAvD~~~~~-LYwtD~~~~~I~v~~ldG 640 (1416)
++.+|+--... ..|.+..|.+.. ..|.. ...+ ..+|||..+=. ||-|....+.|.+++|+-
T Consensus 77 nGs~yynk~~t--~~ivky~l~~~~~~~~~~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWviYat~~~~g~iv~skLdp 154 (249)
T KOG3545|consen 77 NGSLYYNKAGT--RNIIKYDLETRTVAGSAALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVIYATPENAGTIVLSKLDP 154 (249)
T ss_pred cceEEeeccCC--cceEEEEeecceeeeeeeccccccCCCcccccCCCccccceecccceeEEecccccCCcEEeeccCH
Confidence 55666655443 367777777321 11211 2333 57899976653 455666678898899987
Q ss_pred CceEEEEc--CCCCCc---ceeeecCCcceEEEeeCCCC--ceEE-EEecC-CCCCEEE--eecCCCCCeeEEeecCCCe
Q psy6572 641 RFRKVLIN--KGLQEP---RGIALNPAYGYMYWTDWGQN--AHIG-KAKMD-GSNPKVI--ISKNLSWPNALTISYETNE 709 (1416)
Q Consensus 641 ~~~~vLi~--~~l~~P---~gIavDp~~g~LYWtD~g~~--~~I~-ra~mD-Gs~r~vl--v~~~l~~P~gLaiD~~~~r 709 (1416)
...++... ..+.++ .++.| =|.||.++.... ..|. ..+.. |+.+.+. +.....+-..|-..+.+.+
T Consensus 155 ~tl~~e~tW~T~~~k~~~~~aF~i---CGvLY~v~S~~~~~~~i~yaydt~~~~~~~~~ipf~N~y~~~~~idYNP~D~~ 231 (249)
T KOG3545|consen 155 ETLEVERTWNTTLPKRSAGNAFMI---CGVLYVVHSYNCTHTQISYAYDTTTGTQERIDLPFPNPYSYATMIDYNPRDRR 231 (249)
T ss_pred HHhheeeeeccccCCCCcCceEEE---eeeeEEEeccccCCceEEEEEEcCCCceecccccccchhhhhhccCCCcccce
Confidence 33222211 222222 23333 378898885432 3452 33333 2222121 1122333345666667888
Q ss_pred EEEec
Q psy6572 710 LFWGD 714 (1416)
Q Consensus 710 LYWtD 714 (1416)
||.-|
T Consensus 232 LY~wd 236 (249)
T KOG3545|consen 232 LYAWD 236 (249)
T ss_pred eeEec
Confidence 88766
No 313
>PF02425 GBP_PSP: Paralytic/GBP/PSP peptide; InterPro: IPR003463 This family includes insect peptides that are short (23 amino acids) and contain 1 disulphide bridge. The family includes growth-blocking peptide (GBP) of Pseudaletia separata (Oriental armyworm) and the paralytic peptides from Manduca sexta (Tobacco hawkmoth), Heliothis virescens (Noctuid moth), and Spodoptera exigua (Beet armyworm), as well as plasmatocyte-spreading peptide (PSP1). These peptides function to halt metamorphosis from larvae to pupae.; PDB: 1V28_A 2DJC_A 2EQQ_A 2EQH_A 2EQT_A 1BQF_A 2DJ9_A 1HRL_A 1IRR_A 1B5N_A ....
Probab=20.99 E-value=70 Score=22.16 Aligned_cols=17 Identities=29% Similarity=0.636 Sum_probs=13.9
Q ss_pred ecCCCceeccCCCCccceeec
Q psy6572 1315 HCAEGYHMVHGKNKTSSCVAN 1335 (1416)
Q Consensus 1315 ~C~~gy~~~~~~~~~~~Cka~ 1335 (1416)
.|..||++.+|+ .||.+
T Consensus 6 gc~~gy~rtadg----rckpt 22 (23)
T PF02425_consen 6 GCATGYMRTADG----RCKPT 22 (23)
T ss_dssp SSSTTEEEETTT----EEEET
T ss_pred cccccceEcCCc----cccCC
Confidence 589999999995 58863
No 314
>KOG1445|consensus
Probab=20.85 E-value=1.5e+03 Score=29.31 Aligned_cols=228 Identities=15% Similarity=0.089 Sum_probs=0.0
Q ss_pred EEEEecCCcc----eEEecccccceeeeeecCCCeEEEeeccCCCccEEEEecC------CCCeEEee--cCCCceEEEE
Q psy6572 550 IREVTQAGVM----TIRIHNQTNAVGLDFDWVDNCLYWSDVTMHGSSIRRSCNN------SQPELLFP--ATSPDGLTVD 617 (1416)
Q Consensus 550 I~~i~l~g~~----~~~~~~l~~~~~l~~D~~~~~LYwtD~~~~~~~I~r~~l~------s~~~~l~~--l~~p~gLAvD 617 (1416)
|+.++-.|.. ...+.+...+..|.+|+....-.-+-......+|+|+..+ ...+.++. +..+..|.+.
T Consensus 607 i~el~~PGrLPDgv~p~l~Ngt~vtDl~WdPFD~~rLAVa~ddg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfH 686 (1012)
T KOG1445|consen 607 IYELNEPGRLPDGVMPGLFNGTLVTDLHWDPFDDERLAVATDDGQINLWRLTANGLPENEMTPEKILTIHGEKITSLRFH 686 (1012)
T ss_pred EEEcCCCCCCCcccccccccCceeeecccCCCChHHeeecccCceEEEEEeccCCCCcccCCcceeeecccceEEEEEec
Q ss_pred ccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCCCEEEeecCCCC
Q psy6572 618 WVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSNPKVIISKNLSW 697 (1416)
Q Consensus 618 ~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~r~vlv~~~l~~ 697 (1416)
++.-.+.-+.+...+|+.-+|.....+.-+........+||-.|. |++.-|- ....+|.+.........+--..+...
T Consensus 687 PLAadvLa~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpd-Gr~~AtV-cKDg~~rVy~Prs~e~pv~Eg~gpvg 764 (1012)
T KOG1445|consen 687 PLAADVLAVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPD-GRRIATV-CKDGTLRVYEPRSREQPVYEGKGPVG 764 (1012)
T ss_pred chhhhHhhhhhccceeeeeehhhhhhhheeccCcCceeEEEECCC-Ccceeee-ecCceEEEeCCCCCCCccccCCCCcc
Q ss_pred CeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccc-cceeEEEe--------------cCcEEEeec
Q psy6572 698 PNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLH-HVFALAVF--------------EDHLFWTDW 762 (1416)
Q Consensus 698 P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~-~P~~lav~--------------~d~LYwtD~ 762 (1416)
- ..-||.|+=-+.-.|-.---.-+.|++.+... ..+. .|...++. .+.||.|-.
T Consensus 765 t-------RgARi~wacdgr~viv~Gfdk~SeRQv~~Y~A----q~l~~~pl~t~~lDvaps~LvP~YD~Ds~~lfltGK 833 (1012)
T KOG1445|consen 765 T-------RGARILWACDGRIVIVVGFDKSSERQVQMYDA----QTLDLRPLYTQVLDVAPSPLVPHYDYDSNVLFLTGK 833 (1012)
T ss_pred C-------cceeEEEEecCcEEEEecccccchhhhhhhhh----hhccCCcceeeeecccCccccccccCCCceEEEecC
Q ss_pred CCCeeEEecccCCCceEEEEeCCCCCCee
Q psy6572 763 EMKSIERCDKYTGKNCTSVVKNLVHKPMD 791 (1416)
Q Consensus 763 ~~~~I~~~nk~tG~~~~~l~~~~~~~p~~ 791 (1416)
+...|+......-.....-+ ..+..|.+
T Consensus 834 GD~~v~~yEv~~esPy~lpl-~~f~sp~~ 861 (1012)
T KOG1445|consen 834 GDRFVNMYEVIYESPYLLPL-APFMSPVG 861 (1012)
T ss_pred CCceEEEEEecCCCceeeec-ccccCCCc
No 315
>KOG0296|consensus
Probab=20.45 E-value=1.6e+03 Score=27.27 Aligned_cols=162 Identities=9% Similarity=0.027 Sum_probs=74.9
Q ss_pred cCCCceEEEEccCCcEEEeeCCCCeEEEeecCCCceEEEEcCCCCCcceeeecCCcceEEEeeCCCCceEEEEecCCCC-
Q psy6572 608 ATSPDGLTVDWVGRNLYWCDKGLDTIEVAKLDGRFRKVLINKGLQEPRGIALNPAYGYMYWTDWGQNAHIGKAKMDGSN- 686 (1416)
Q Consensus 608 l~~p~gLAvD~~~~~LYwtD~~~~~I~v~~ldG~~~~vLi~~~l~~P~gIavDp~~g~LYWtD~g~~~~I~ra~mDGs~- 686 (1416)
-....+++++| +.+|-.|-.+..+-++-++........+..--.....+.+.....+|--.| ...+|....+.-..
T Consensus 64 ~~svFavsl~P-~~~l~aTGGgDD~AflW~~~~ge~~~eltgHKDSVt~~~FshdgtlLATGd--msG~v~v~~~stg~~ 140 (399)
T KOG0296|consen 64 TDSVFAVSLHP-NNNLVATGGGDDLAFLWDISTGEFAGELTGHKDSVTCCSFSHDGTLLATGD--MSGKVLVFKVSTGGE 140 (399)
T ss_pred CCceEEEEeCC-CCceEEecCCCceEEEEEccCCcceeEecCCCCceEEEEEccCceEEEecC--CCccEEEEEcccCce
Confidence 44566788888 778877877777766666543333333333333344444444433222222 12233333332221
Q ss_pred CEEEee--cCCCCCeeEEeecCCCeEEEecCCCCeEEEEeCCCCceEEEEeccCCCCcccccceeEEEecCcEEEeecCC
Q psy6572 687 PKVIIS--KNLSWPNALTISYETNELFWGDAHEDYIAVSDLNGENIKIIVSRRMDPTINLHHVFALAVFEDHLFWTDWEM 764 (1416)
Q Consensus 687 r~vlv~--~~l~~P~gLaiD~~~~rLYWtD~~~~~I~~~~ldG~~r~~v~~~~~~p~~~l~~P~~lav~~d~LYwtD~~~ 764 (1416)
+..|.. ..|.| |.-.+ ..+++.+-...+.|+...+.-...-.++.++. .+--.|=-+..++...+-...
T Consensus 141 ~~~~~~e~~dieW---l~WHp-~a~illAG~~DGsvWmw~ip~~~~~kv~~Gh~-----~~ct~G~f~pdGKr~~tgy~d 211 (399)
T KOG0296|consen 141 QWKLDQEVEDIEW---LKWHP-RAHILLAGSTDGSVWMWQIPSQALCKVMSGHN-----SPCTCGEFIPDGKRILTGYDD 211 (399)
T ss_pred EEEeecccCceEE---EEecc-cccEEEeecCCCcEEEEECCCcceeeEecCCC-----CCcccccccCCCceEEEEecC
Confidence 122211 11221 11221 33444444455566666665444444444431 111111112235555555556
Q ss_pred CeeEEecccCCCceEEE
Q psy6572 765 KSIERCDKYTGKNCTSV 781 (1416)
Q Consensus 765 ~~I~~~nk~tG~~~~~l 781 (1416)
++|..-|..+|+....+
T Consensus 212 gti~~Wn~ktg~p~~~~ 228 (399)
T KOG0296|consen 212 GTIIVWNPKTGQPLHKI 228 (399)
T ss_pred ceEEEEecCCCceeEEe
Confidence 77777777777655544
No 316
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=20.24 E-value=1.5e+03 Score=26.87 Aligned_cols=91 Identities=13% Similarity=0.009 Sum_probs=42.0
Q ss_pred CCeEEEeeccCCCccEEEEecC---CCCeEEeecC-CCc-eEEEEccCCcEEEeeCC-----------CCeEEEeecCCC
Q psy6572 578 DNCLYWSDVTMHGSSIRRSCNN---SQPELLFPAT-SPD-GLTVDWVGRNLYWCDKG-----------LDTIEVAKLDGR 641 (1416)
Q Consensus 578 ~~~LYwtD~~~~~~~I~r~~l~---s~~~~l~~l~-~p~-gLAvD~~~~~LYwtD~~-----------~~~I~v~~ldG~ 641 (1416)
+++||++-... ...++++.++ .....+..+. .++ ..++=.+.++||+.-.. ...+++.++...
T Consensus 17 ~~~vyv~GG~~-~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~ 95 (346)
T TIGR03547 17 GDKVYVGLGSA-GTSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKN 95 (346)
T ss_pred CCEEEEEcccc-CCeeEEEECCCCCCCceECCCCCCCCcccceEEEECCEEEEEeCCCCCCCCCcceecccEEEEECCCC
Confidence 45666653322 1345666553 2233333332 222 22333357899987643 135667776543
Q ss_pred ceEEEEcCCCCCcc-eeeec-CCcceEEEee
Q psy6572 642 FRKVLINKGLQEPR-GIALN-PAYGYMYWTD 670 (1416)
Q Consensus 642 ~~~vLi~~~l~~P~-gIavD-p~~g~LYWtD 670 (1416)
.-+.+. ..+..++ +.+.- ...++||+.-
T Consensus 96 ~W~~~~-~~~p~~~~~~~~~~~~~g~IYviG 125 (346)
T TIGR03547 96 SWQKLD-TRSPVGLLGASGFSLHNGQAYFTG 125 (346)
T ss_pred EEecCC-CCCCCcccceeEEEEeCCEEEEEc
Confidence 322222 1122222 22111 2468899874
Done!