Query psy7767 Match_columns 200 No_of_seqs 205 out of 1150 Neff 5.5 Searched_HMMs 46136 Date Fri Aug 16 18:18:42 2013 Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7767.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7767hhsearch_cdd -cpu 12 -v 0 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PF00930 DPPIV_N: Dipeptidyl p 99.9 3E-25 6.4E-30 198.4 9.6 137 63-200 4-151 (353) 2 KOG2100|consensus 99.5 4.9E-14 1.1E-18 138.8 9.3 128 62-190 107-242 (755) 3 PRK05137 tolB translocation pr 99.0 7.4E-09 1.6E-13 95.0 14.1 110 59-188 161-279 (435) 4 PRK03629 tolB translocation pr 98.9 9.2E-09 2E-13 94.8 12.3 112 56-187 156-275 (429) 5 COG0823 TolB Periplasmic compo 98.9 5E-09 1.1E-13 97.5 10.3 92 64-175 250-346 (425) 6 PRK02889 tolB translocation pr 98.9 1.8E-08 4E-13 92.5 13.3 111 58-188 158-273 (427) 7 PRK01029 tolB translocation pr 98.9 1.3E-08 2.8E-13 94.1 12.3 87 77-183 166-259 (428) 8 PRK04043 tolB translocation pr 98.9 2.5E-08 5.3E-13 92.4 13.3 112 58-189 149-267 (419) 9 KOG2281|consensus 98.9 9.7E-10 2.1E-14 106.2 3.7 105 95-200 199-316 (867) 10 PRK05137 tolB translocation pr 98.9 3.5E-08 7.6E-13 90.5 13.1 79 88-185 281-364 (435) 11 PRK04043 tolB translocation pr 98.8 4.5E-08 9.8E-13 90.6 12.7 82 89-189 225-311 (419) 12 PRK04792 tolB translocation pr 98.8 4.2E-08 9.1E-13 91.1 12.5 113 56-188 175-295 (448) 13 PRK01742 tolB translocation pr 98.8 8.5E-08 1.8E-12 88.1 14.0 109 59-187 164-280 (429) 14 PRK00178 tolB translocation pr 98.8 1E-07 2.2E-12 86.8 13.9 112 57-188 156-276 (430) 15 PRK02889 tolB translocation pr 98.8 9.8E-08 2.1E-12 87.7 13.1 79 88-185 275-358 (427) 16 PRK03629 tolB translocation pr 98.8 1.1E-07 2.3E-12 87.7 13.2 79 88-185 278-361 (429) 17 PRK04792 tolB translocation pr 98.7 2.5E-07 5.3E-12 86.0 13.7 81 88-187 297-382 (448) 18 COG0823 TolB Periplasmic compo 98.7 6.6E-08 1.4E-12 90.1 9.8 76 95-189 237-316 (425) 19 PRK01029 tolB translocation pr 98.7 1.8E-07 4E-12 86.5 12.6 82 89-189 319-405 (428) 20 PRK04922 tolB translocation pr 98.7 2.7E-07 5.9E-12 84.8 13.3 108 61-188 165-281 (433) 21 PRK04922 tolB translocation pr 98.7 3.1E-07 6.7E-12 84.4 13.3 79 88-185 283-366 (433) 22 PRK00178 tolB translocation pr 98.7 3.5E-07 7.5E-12 83.3 13.2 80 88-186 278-362 (430) 23 PRK01742 tolB translocation pr 98.6 9.5E-07 2E-11 81.2 12.9 72 97-187 249-324 (429) 24 TIGR02800 propeller_TolB tol-p 98.5 2.8E-06 6.1E-11 76.1 13.2 92 76-187 170-266 (417) 25 TIGR02800 propeller_TolB tol-p 98.3 9.6E-06 2.1E-10 72.7 13.2 79 89-186 270-353 (417) 26 PF08662 eIF2A: Eukaryotic tra 98.1 0.00016 3.5E-09 60.2 13.9 60 95-174 100-163 (194) 27 PRK13616 lipoprotein LpqB; Pro 98.0 2.6E-05 5.6E-10 75.7 9.2 85 76-173 379-466 (591) 28 PF08662 eIF2A: Eukaryotic tra 97.9 0.00046 1E-08 57.5 13.7 60 95-175 59-121 (194) 29 PF07676 PD40: WD40-like Beta 97.6 0.00017 3.7E-09 44.7 4.7 21 156-176 10-30 (39) 30 COG4946 Uncharacterized protei 97.4 0.00041 8.9E-09 66.0 7.5 67 88-172 70-145 (668) 31 PF07676 PD40: WD40-like Beta 97.4 0.0004 8.7E-09 43.1 4.5 27 90-116 2-29 (39) 32 KOG2394|consensus 97.4 0.00066 1.4E-08 65.1 7.9 80 88-186 282-363 (636) 33 COG4946 Uncharacterized protei 97.3 0.0031 6.6E-08 60.2 12.1 62 95-175 443-509 (668) 34 PF14583 Pectate_lyase22: Olig 97.2 0.0013 2.8E-08 61.1 8.2 75 83-175 17-101 (386) 35 PRK13616 lipoprotein LpqB; Pro 97.1 0.0021 4.6E-08 62.5 9.1 57 96-173 350-415 (591) 36 PF00930 DPPIV_N: Dipeptidyl p 97.1 0.023 5E-07 51.0 14.3 94 61-174 245-347 (353) 37 KOG0318|consensus 96.9 0.0092 2E-07 57.3 10.5 74 95-184 190-264 (603) 38 PF02897 Peptidase_S9_N: Proly 96.8 0.0052 1.1E-07 55.6 7.9 98 55-177 83-192 (414) 39 TIGR02171 Fb_sc_TIGR02171 Fibr 96.7 0.0051 1.1E-07 62.3 7.4 63 107-188 318-386 (912) 40 PRK10115 protease 2; Provision 96.3 0.011 2.3E-07 58.4 7.4 57 98-176 129-193 (686) 41 PF10647 Gmad1: Lipoprotein Lp 96.2 0.068 1.5E-06 46.3 11.1 73 97-181 113-192 (253) 42 TIGR03866 PQQ_ABC_repeats PQQ- 96.0 0.039 8.5E-07 45.6 8.2 66 99-183 210-277 (300) 43 KOG1539|consensus 96.0 0.041 9E-07 55.3 9.5 67 95-181 576-644 (910) 44 TIGR03866 PQQ_ABC_repeats PQQ- 96.0 0.17 3.6E-06 41.8 11.9 67 97-182 32-100 (300) 45 KOG2139|consensus 95.5 0.13 2.8E-06 47.9 10.0 63 95-177 238-303 (445) 46 cd00200 WD40 WD40 domain, foun 95.4 0.15 3.2E-06 40.3 8.9 71 95-184 9-80 (289) 47 PF10647 Gmad1: Lipoprotein Lp 95.2 0.23 5E-06 42.9 10.4 72 94-180 64-137 (253) 48 TIGR02171 Fb_sc_TIGR02171 Fibr 95.2 0.051 1.1E-06 55.3 6.9 65 60-127 315-388 (912) 49 cd00200 WD40 WD40 domain, foun 95.2 0.17 3.7E-06 39.8 8.6 71 95-184 135-206 (289) 50 PF02239 Cytochrom_D1: Cytochr 94.8 0.28 6.1E-06 44.9 10.1 70 88-177 28-100 (369) 51 KOG4497|consensus 94.8 0.12 2.7E-06 47.7 7.6 86 100-185 13-122 (447) 52 TIGR02658 TTQ_MADH_Hv methylam 94.8 0.65 1.4E-05 42.7 12.4 59 96-174 248-318 (352) 53 PF14583 Pectate_lyase22: Olig 94.7 0.16 3.5E-06 47.3 8.3 75 61-136 45-122 (386) 54 KOG0318|consensus 94.7 0.15 3.3E-06 49.2 8.3 97 72-188 463-563 (603) 55 PF15492 Nbas_N: Neuroblastoma 94.6 0.15 3.3E-06 45.6 7.5 63 101-182 3-70 (282) 56 KOG1446|consensus 94.4 0.31 6.6E-06 44.2 9.1 74 95-184 187-261 (311) 57 KOG2139|consensus 94.4 0.19 4.2E-06 46.8 8.0 69 89-175 189-259 (445) 58 KOG0973|consensus 94.4 0.47 1E-05 48.7 11.3 75 95-188 129-204 (942) 59 PF10282 Lactonase: Lactonase, 94.3 0.23 5.1E-06 44.4 8.3 75 97-187 246-324 (345) 60 PRK11028 6-phosphogluconolacto 93.9 1.5 3.3E-05 38.2 12.4 80 88-184 27-109 (330) 61 KOG0271|consensus 93.6 0.64 1.4E-05 43.7 9.8 39 88-126 359-399 (480) 62 KOG0271|consensus 93.4 0.25 5.5E-06 46.3 6.8 72 86-175 101-178 (480) 63 PF08450 SGL: SMP-30/Gluconola 93.3 0.42 9.1E-06 40.1 7.6 61 97-178 87-157 (246) 64 KOG1524|consensus 93.3 0.35 7.5E-06 47.2 7.7 31 95-125 145-175 (737) 65 KOG1274|consensus 92.9 1.5 3.3E-05 44.8 11.8 70 97-184 190-261 (933) 66 COG2706 3-carboxymuconate cycl 92.6 1.1 2.4E-05 41.2 9.7 78 95-189 243-325 (346) 67 PRK11028 6-phosphogluconolacto 92.5 4.4 9.6E-05 35.3 13.3 76 97-185 127-205 (330) 68 KOG0293|consensus 92.4 0.66 1.4E-05 44.0 8.1 60 95-171 269-329 (519) 69 KOG0266|consensus 91.9 2.4 5.3E-05 39.6 11.4 70 95-183 246-316 (456) 70 KOG0291|consensus 91.8 1.9 4.1E-05 43.6 10.9 71 94-187 54-127 (893) 71 COG1506 DAP2 Dipeptidyl aminop 91.7 0.41 8.8E-06 46.7 6.2 61 89-172 53-118 (620) 72 KOG1407|consensus 91.4 1.7 3.8E-05 39.1 9.2 62 95-176 64-128 (313) 73 PTZ00420 coronin; Provisional 91.2 1.7 3.6E-05 42.5 9.8 68 95-182 125-194 (568) 74 PF07433 DUF1513: Protein of u 90.5 2.2 4.8E-05 38.7 9.3 62 98-176 7-71 (305) 75 KOG0645|consensus 90.2 3.4 7.4E-05 37.3 10.1 64 95-175 61-126 (312) 76 KOG4497|consensus 90.2 1.3 2.9E-05 41.1 7.7 60 94-172 90-151 (447) 77 PF10282 Lactonase: Lactonase, 89.9 3 6.5E-05 37.2 9.6 73 98-185 146-222 (345) 78 KOG0315|consensus 89.9 3.1 6.8E-05 37.3 9.5 99 67-184 182-287 (311) 79 KOG0288|consensus 89.8 0.98 2.1E-05 42.7 6.6 59 95-170 387-447 (459) 80 KOG0306|consensus 89.8 5.9 0.00013 40.2 12.2 135 8-184 394-537 (888) 81 KOG2315|consensus 89.7 4.6 9.9E-05 39.4 11.1 58 95-172 311-372 (566) 82 KOG0645|consensus 89.7 14 0.00029 33.6 13.4 88 95-183 105-223 (312) 83 KOG0278|consensus 89.5 8.7 0.00019 34.7 12.0 75 88-183 217-295 (334) 84 PF03088 Str_synth: Strictosid 89.3 1.9 4.2E-05 32.1 6.7 66 100-184 2-86 (89) 85 COG1770 PtrB Protease II [Amin 89.3 1.2 2.6E-05 44.3 7.0 61 98-178 131-197 (682) 86 COG3386 Gluconolactonase [Carb 89.3 1.3 2.9E-05 39.8 6.9 30 156-185 164-193 (307) 87 COG1506 DAP2 Dipeptidyl aminop 89.1 1 2.2E-05 44.0 6.4 67 96-184 13-89 (620) 88 PTZ00421 coronin; Provisional 89.1 2.5 5.4E-05 40.4 9.0 70 95-183 125-196 (493) 89 KOG2110|consensus 89.1 5 0.00011 37.5 10.5 78 95-189 173-252 (391) 90 KOG2096|consensus 88.4 1.4 3.1E-05 40.7 6.5 61 95-172 331-391 (420) 91 KOG0305|consensus 88.3 2 4.3E-05 41.4 7.7 66 98-183 180-245 (484) 92 KOG0273|consensus 88.2 1.2 2.6E-05 42.7 6.1 18 158-175 456-473 (524) 93 KOG1445|consensus 88.1 1.1 2.3E-05 44.8 5.8 66 95-176 720-786 (1012) 94 PF02897 Peptidase_S9_N: Proly 88.1 6.8 0.00015 35.4 10.8 73 100-188 174-261 (414) 95 PF12566 DUF3748: Protein of u 88.1 3.2 7E-05 32.8 7.4 22 155-176 68-89 (122) 96 KOG0315|consensus 87.6 4.9 0.00011 36.1 9.1 76 95-187 167-247 (311) 97 KOG2315|consensus 87.4 6.7 0.00015 38.3 10.6 75 92-187 267-346 (566) 98 KOG0266|consensus 87.3 3.2 7E-05 38.8 8.4 73 95-186 203-277 (456) 99 KOG0279|consensus 87.2 7.6 0.00017 35.2 10.2 80 94-184 232-312 (315) 100 KOG2110|consensus 86.5 8.6 0.00019 36.0 10.4 73 95-185 129-204 (391) 101 KOG0771|consensus 86.3 2.6 5.6E-05 39.5 7.0 78 95-189 186-268 (398) 102 KOG1538|consensus 86.3 1.4 3.1E-05 44.2 5.5 34 96-129 13-46 (1081) 103 KOG0273|consensus 86.3 2.5 5.4E-05 40.6 7.0 63 95-176 452-515 (524) 104 KOG0305|consensus 86.2 10 0.00022 36.6 11.1 70 89-177 381-454 (484) 105 KOG2314|consensus 86.2 6.4 0.00014 38.8 9.7 62 95-174 445-512 (698) 106 KOG0263|consensus 86.1 2.6 5.7E-05 42.1 7.3 71 95-185 577-649 (707) 107 KOG0973|consensus 85.8 2 4.3E-05 44.3 6.4 24 95-118 69-92 (942) 108 TIGR02658 TTQ_MADH_Hv methylam 85.5 8.1 0.00018 35.6 9.8 71 92-175 43-125 (352) 109 KOG0279|consensus 85.4 7.8 0.00017 35.2 9.3 83 62-171 37-122 (315) 110 KOG0293|consensus 85.3 2.7 5.9E-05 40.0 6.6 65 95-176 224-291 (519) 111 PRK13613 lipoprotein LpqB; Pro 85.0 13 0.00028 36.7 11.5 80 95-189 454-541 (599) 112 KOG2394|consensus 84.7 5.9 0.00013 38.8 8.7 87 95-185 219-320 (636) 113 PTZ00420 coronin; Provisional 84.5 40 0.00086 33.1 14.5 35 95-129 167-202 (568) 114 PTZ00421 coronin; Provisional 84.2 7.2 0.00016 37.3 9.2 78 95-184 75-155 (493) 115 KOG2055|consensus 83.8 6 0.00013 38.0 8.3 118 39-172 201-321 (514) 116 KOG0291|consensus 83.7 4.6 0.0001 40.9 7.8 61 98-178 17-77 (893) 117 KOG1273|consensus 83.4 1.8 3.9E-05 40.0 4.5 61 97-175 25-86 (405) 118 KOG1007|consensus 83.3 3 6.6E-05 38.1 5.9 63 96-172 124-190 (370) 119 KOG1274|consensus 83.3 6.2 0.00013 40.6 8.6 77 95-183 138-216 (933) 120 KOG2106|consensus 83.1 7.2 0.00016 38.0 8.6 70 97-187 409-479 (626) 121 KOG2055|consensus 83.0 5.5 0.00012 38.2 7.7 73 95-186 344-418 (514) 122 KOG1445|consensus 82.6 1.9 4.2E-05 43.1 4.7 33 94-126 626-662 (1012) 123 PF04762 IKI3: IKI3 family; I 82.6 3.9 8.5E-05 42.2 7.1 63 95-176 75-142 (928) 124 KOG0319|consensus 81.9 5.3 0.00011 40.2 7.4 79 88-186 15-93 (775) 125 KOG1063|consensus 81.4 3.3 7.2E-05 41.4 5.8 74 97-187 574-650 (764) 126 KOG2106|consensus 81.3 10 0.00023 37.0 8.9 31 94-124 446-477 (626) 127 KOG0268|consensus 81.0 5.5 0.00012 37.3 6.8 112 54-183 214-343 (433) 128 COG3386 Gluconolactonase [Carb 80.2 11 0.00024 34.0 8.4 26 99-124 166-193 (307) 129 KOG2096|consensus 79.7 10 0.00023 35.2 8.0 76 95-187 86-165 (420) 130 KOG0771|consensus 79.5 4.2 9.1E-05 38.2 5.5 65 95-176 281-346 (398) 131 KOG0263|consensus 79.2 5.8 0.00013 39.8 6.7 72 96-186 536-608 (707) 132 KOG1407|consensus 78.9 9.4 0.0002 34.5 7.3 72 94-185 105-177 (313) 133 KOG0286|consensus 78.4 9.8 0.00021 34.8 7.4 59 95-171 55-114 (343) 134 PRK13615 lipoprotein LpqB; Pro 77.8 34 0.00074 33.6 11.5 99 50-173 382-487 (557) 135 KOG0288|consensus 77.3 17 0.00037 34.6 8.8 72 89-175 335-408 (459) 136 PF02239 Cytochrom_D1: Cytochr 76.8 28 0.00061 31.8 10.2 90 86-189 68-159 (369) 137 COG2706 3-carboxymuconate cycl 76.6 65 0.0014 29.9 12.4 83 93-189 188-278 (346) 138 KOG0772|consensus 76.6 3.5 7.7E-05 40.2 4.3 75 98-189 320-398 (641) 139 PF12894 Apc4_WD40: Anaphase-p 74.9 14 0.0003 24.2 5.6 29 95-123 11-40 (47) 140 KOG0296|consensus 74.8 49 0.0011 31.1 11.0 67 95-180 148-215 (399) 141 KOG0265|consensus 74.7 15 0.00032 33.7 7.5 68 95-180 47-115 (338) 142 KOG2919|consensus 74.7 10 0.00022 35.2 6.5 28 156-185 160-187 (406) 143 PF12566 DUF3748: Protein of u 74.1 2.6 5.6E-05 33.4 2.2 23 96-118 68-90 (122) 144 PF04762 IKI3: IKI3 family; I 74.0 48 0.001 34.4 11.9 73 98-187 259-335 (928) 145 PF07433 DUF1513: Protein of u 73.7 12 0.00027 34.0 6.8 26 98-124 53-85 (305) 146 PF14870 PSII_BNR: Photosynthe 73.7 29 0.00064 31.3 9.2 91 32-126 81-175 (302) 147 KOG1063|consensus 72.8 6.5 0.00014 39.4 5.1 70 95-184 525-601 (764) 148 PF11768 DUF3312: Protein of u 72.3 29 0.00064 34.0 9.3 80 88-188 252-332 (545) 149 PRK10115 protease 2; Provision 72.2 94 0.002 31.0 13.2 35 99-133 175-218 (686) 150 PLN02919 haloacid dehalogenase 72.0 26 0.00056 36.7 9.6 81 99-183 686-768 (1057) 151 PRK13614 lipoprotein LpqB; Pro 71.7 36 0.00078 33.5 9.9 77 95-186 433-517 (573) 152 TIGR02604 Piru_Ver_Nterm putat 70.8 22 0.00047 32.2 7.8 13 156-168 185-197 (367) 153 KOG2111|consensus 70.1 32 0.0007 31.7 8.6 77 96-189 182-260 (346) 154 KOG2048|consensus 69.3 1.1E+02 0.0024 30.9 12.6 76 95-186 382-460 (691) 155 COG2319 FOG: WD40 repeat [Gene 67.8 50 0.0011 26.8 8.6 68 95-180 155-224 (466) 156 KOG1523|consensus 67.7 21 0.00046 33.0 6.9 75 95-176 146-228 (361) 157 PF06977 SdiA-regulated: SdiA- 67.5 47 0.001 29.0 8.9 40 96-135 22-63 (248) 158 KOG1524|consensus 67.4 6.4 0.00014 38.7 3.7 88 65-176 76-167 (737) 159 KOG2314|consensus 67.2 12 0.00025 37.1 5.4 60 95-172 492-555 (698) 160 KOG1539|consensus 66.9 55 0.0012 33.8 10.2 31 156-187 578-608 (910) 161 PF15492 Nbas_N: Neuroblastoma 65.2 26 0.00056 31.6 6.9 34 95-128 43-77 (282) 162 COG3490 Uncharacterized protei 63.3 45 0.00097 30.8 8.1 63 95-175 67-133 (366) 163 PLN02919 haloacid dehalogenase 62.7 38 0.00083 35.5 8.6 18 154-173 858-875 (1057) 164 KOG1446|consensus 61.1 99 0.0022 28.3 9.9 37 98-134 235-272 (311) 165 KOG1009|consensus 61.1 8.6 0.00019 36.3 3.2 64 95-176 123-187 (434) 166 PF04841 Vps16_N: Vps16, N-ter 60.3 63 0.0014 30.0 8.9 31 96-126 217-248 (410) 167 PRK02888 nitrous-oxide reducta 60.3 28 0.00061 34.8 6.7 51 102-172 283-338 (635) 168 KOG0286|consensus 59.0 74 0.0016 29.3 8.7 56 95-170 186-245 (343) 169 PF00400 WD40: WD domain, G-be 58.6 29 0.00063 20.1 4.4 27 95-121 11-38 (39) 170 KOG1920|consensus 57.4 36 0.00078 36.4 7.2 62 96-176 69-131 (1265) 171 KOG1920|consensus 55.9 63 0.0014 34.6 8.7 27 95-121 109-136 (1265) 172 COG5037 TOS9 Gluconate transpo 55.2 12 0.00026 33.0 3.0 39 111-174 42-83 (248) 173 KOG4476|consensus 55.2 12 0.00026 33.0 3.0 39 111-174 42-83 (248) 174 KOG0772|consensus 55.2 23 0.00051 34.7 5.1 85 95-187 214-301 (641) 175 COG3391 Uncharacterized conser 54.6 1.4E+02 0.003 27.3 10.0 59 95-172 115-177 (381) 176 KOG1523|consensus 53.9 34 0.00073 31.7 5.7 32 96-127 11-43 (361) 177 COG5354 Uncharacterized protei 52.9 2.1E+02 0.0046 28.1 11.1 21 95-115 274-294 (561) 178 KOG0641|consensus 52.8 49 0.0011 29.6 6.4 29 89-117 26-54 (350) 179 PF04053 Coatomer_WDAD: Coatom 52.2 16 0.00035 34.6 3.5 80 96-175 69-165 (443) 180 PF04053 Coatomer_WDAD: Coatom 51.4 36 0.00077 32.3 5.7 27 98-124 147-173 (443) 181 COG3391 Uncharacterized conser 50.9 2E+02 0.0042 26.2 11.6 65 98-177 162-228 (381) 182 PF05787 DUF839: Bacterial pro 50.4 37 0.00081 32.9 5.8 18 156-174 503-520 (524) 183 KOG2048|consensus 50.0 1.7E+02 0.0037 29.6 10.1 38 95-132 475-513 (691) 184 PRK13615 lipoprotein LpqB; Pro 49.7 91 0.002 30.6 8.3 22 156-177 418-439 (557) 185 KOG0646|consensus 47.5 1E+02 0.0022 29.8 8.0 34 95-128 81-115 (476) 186 COG3211 PhoX Predicted phospha 46.8 40 0.00086 33.4 5.3 19 157-176 556-574 (616) 187 COG5354 Uncharacterized protei 46.5 3E+02 0.0064 27.2 11.0 37 89-125 309-349 (561) 188 PF10584 Proteasome_A_N: Prote 44.9 8.3 0.00018 22.0 0.3 7 161-167 7-13 (23) 189 KOG2041|consensus 44.5 21 0.00046 36.6 3.1 58 96-172 116-174 (1189) 190 KOG0289|consensus 44.1 2.1E+02 0.0046 27.7 9.5 70 96-184 348-418 (506) 191 KOG0294|consensus 43.7 1.8E+02 0.0039 27.1 8.7 71 96-187 128-199 (362) 192 KOG1963|consensus 43.6 1.2E+02 0.0025 31.3 8.1 76 35-124 240-322 (792) 193 KOG0265|consensus 42.6 2.8E+02 0.006 25.7 9.8 35 95-129 90-125 (338) 194 COG3204 Uncharacterized protei 42.4 2.8E+02 0.006 25.5 9.7 40 96-135 86-127 (316) 195 PF09729 Gti1_Pac2: Gti1/Pac2 42.4 44 0.00094 27.9 4.3 40 112-176 39-81 (168) 196 KOG0639|consensus 42.0 70 0.0015 31.6 6.1 72 94-187 464-541 (705) 197 KOG0299|consensus 41.7 75 0.0016 30.6 6.2 39 95-133 202-241 (479) 198 KOG0639|consensus 41.3 33 0.00071 33.8 3.8 40 95-134 592-632 (705) 199 KOG1007|consensus 41.0 40 0.00087 31.0 4.1 73 97-186 172-246 (370) 200 KOG1963|consensus 40.4 1.6E+02 0.0035 30.3 8.5 62 95-176 251-314 (792) 201 KOG0268|consensus 40.4 44 0.00096 31.5 4.4 79 87-186 221-303 (433) 202 KOG0272|consensus 40.0 46 0.00099 31.8 4.4 64 95-177 217-284 (459) 203 PLN00033 photosystem II stabil 39.6 3.2E+02 0.007 25.5 10.6 91 34-127 201-311 (398) 204 KOG0284|consensus 39.4 51 0.0011 31.5 4.6 20 95-114 96-115 (464) 205 PLN00033 photosystem II stabil 39.2 1.2E+02 0.0027 28.3 7.2 29 99-127 242-270 (398) 206 PF03022 MRJP: Major royal jel 39.2 2.7E+02 0.0059 24.5 12.7 40 95-134 60-106 (287) 207 KOG2919|consensus 39.2 2.8E+02 0.0061 26.0 9.3 27 97-123 160-186 (406) 208 KOG0272|consensus 38.9 1E+02 0.0023 29.5 6.6 33 95-127 345-378 (459) 209 PF12657 TFIIIC_delta: Transcr 38.8 31 0.00068 27.9 2.9 22 99-121 8-29 (173) 210 KOG0306|consensus 37.9 4.9E+02 0.011 27.1 11.9 86 94-182 507-619 (888) 211 KOG4283|consensus 37.3 43 0.00094 31.0 3.7 75 95-176 188-268 (397) 212 PLN00181 protein SPA1-RELATED; 36.9 2.5E+02 0.0054 28.0 9.4 34 95-128 532-567 (793) 213 KOG2100|consensus 36.6 1.8E+02 0.0039 29.6 8.3 19 158-176 151-169 (755) 214 PF05694 SBP56: 56kDa selenium 35.0 4.3E+02 0.0093 25.6 13.6 146 12-172 227-391 (461) 215 KOG0647|consensus 34.9 1.3E+02 0.0027 27.9 6.3 72 94-186 26-103 (347) 216 KOG4640|consensus 34.3 1.6E+02 0.0036 29.5 7.4 61 95-175 20-83 (665) 217 KOG0643|consensus 34.2 2.5E+02 0.0054 25.7 8.0 80 94-185 92-177 (327) 218 KOG1009|consensus 34.1 1E+02 0.0022 29.4 5.7 21 155-175 343-363 (434) 219 KOG1520|consensus 33.5 2.3E+02 0.005 26.7 7.9 33 156-188 220-252 (376) 220 KOG0643|consensus 31.8 4.1E+02 0.0088 24.4 10.0 65 94-175 146-211 (327) 221 COG5169 HSF1 Heat shock transc 31.6 35 0.00077 30.7 2.2 17 156-172 26-42 (282) 222 KOG1587|consensus 31.4 2.9E+02 0.0064 27.2 8.6 74 97-187 443-518 (555) 223 TIGR03606 non_repeat_PQQ dehyd 31.4 2.7E+02 0.0058 26.7 8.2 65 101-174 347-424 (454) 224 KOG0276|consensus 30.7 2.2E+02 0.0048 28.9 7.6 20 98-117 354-373 (794) 225 KOG0277|consensus 30.7 2.8E+02 0.006 25.3 7.6 77 98-190 11-96 (311) 226 PF12234 Rav1p_C: RAVE protein 30.1 1.6E+02 0.0035 29.5 6.7 79 36-122 72-154 (631) 227 KOG4640|consensus 30.0 1.1E+02 0.0025 30.6 5.5 34 94-127 60-95 (665) 228 KOG0282|consensus 29.7 2.8E+02 0.0061 27.0 8.0 35 95-129 258-293 (503) 229 COG4831 Roadblock/LC7 domain [ 29.4 34 0.00073 26.4 1.5 13 160-172 18-30 (109) 230 KOG0283|consensus 29.1 52 0.0011 33.3 3.1 29 157-186 270-298 (712) 231 PF10313 DUF2415: Uncharacteri 29.1 1.7E+02 0.0036 19.1 5.0 29 97-125 2-34 (43) 232 smart00135 LY Low-density lipo 28.6 1E+02 0.0022 17.9 3.4 19 155-173 9-27 (43) 233 PLN00181 protein SPA1-RELATED; 28.5 4E+02 0.0087 26.6 9.3 34 95-128 575-610 (793) 234 PF08452 DNAP_B_exo_N: DNA pol 28.1 25 0.00053 19.8 0.4 11 38-48 2-12 (22) 235 PF13449 Phytase-like: Esteras 27.4 4.4E+02 0.0095 23.4 12.0 92 92-187 143-251 (326) 236 KOG0650|consensus 27.1 3.2E+02 0.0069 27.7 8.0 97 87-186 433-553 (733) 237 KOG4499|consensus 26.5 1.8E+02 0.0038 26.3 5.7 74 95-188 108-191 (310) 238 KOG0269|consensus 25.9 2.6E+02 0.0056 28.9 7.3 111 38-184 135-249 (839) 239 PF11768 DUF3312: Protein of u 25.6 6.7E+02 0.014 24.9 14.7 86 98-187 302-402 (545) 240 COG3490 Uncharacterized protei 25.0 1.6E+02 0.0035 27.3 5.2 16 157-172 164-179 (366) 241 KOG0283|consensus 24.9 2.3E+02 0.0051 28.8 6.8 30 95-124 546-576 (712) 242 PF01731 Arylesterase: Arylest 24.4 1.5E+02 0.0032 21.9 4.1 30 155-184 54-83 (86) 243 PRK13613 lipoprotein LpqB; Pro 23.9 3.3E+02 0.0072 27.0 7.6 21 156-176 456-476 (599) 244 KOG1273|consensus 23.7 5.3E+02 0.011 24.2 8.3 39 96-134 66-106 (405) 245 PF13449 Phytase-like: Esteras 23.5 5.2E+02 0.011 22.9 10.1 16 157-172 149-164 (326) 246 KOG4532|consensus 23.5 5.2E+02 0.011 23.8 8.1 73 98-188 161-236 (344) 247 KOG0284|consensus 21.6 1.9E+02 0.0041 27.7 5.2 66 96-181 181-248 (464) 248 PF05787 DUF839: Bacterial pro 21.6 1.4E+02 0.0031 28.9 4.5 61 51-113 446-518 (524) 249 KOG3621|consensus 21.3 1.2E+02 0.0027 30.7 4.1 63 98-177 79-145 (726) 250 PRK13614 lipoprotein LpqB; Pro 20.9 4.2E+02 0.0091 26.2 7.6 35 97-132 344-380 (573) 251 KOG0264|consensus 20.8 4E+02 0.0087 25.5 7.1 78 95-189 272-351 (422) 252 KOG0275|consensus 20.5 4.2E+02 0.009 25.1 7.0 28 95-122 348-376 (508) 253 PF01436 NHL: NHL repeat; Int 20.2 1.8E+02 0.0038 16.5 3.1 16 110-125 7-22 (28) 254 smart00415 HSF heat shock fact 20.1 76 0.0017 23.8 1.9 13 158-170 20-32 (105) No 1 >PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction:Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A .... Probab=99.92 E-value=3e-25 Score=198.38 Aligned_cols=137 Identities=35% Similarity=0.589 Sum_probs=108.7 Q ss_pred ceeEEEEEeeeee-eeeeecCCCCc----cceeecCC-CCCceeeEECcCCCEEEEEEcCcEEEEECCCCCeEEEeeCCC Q psy7767 63 ALKLSIRCLTWKQ-SHSLSGRPDTL----TDVSDLDG-SNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSINALTRDGI 136 (200) Q Consensus 63 ~~kv~i~~n~~~~-R~s~~a~~~~~----~~v~~lt~-~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rLT~dg~ 136 (200) .+.|++..|..++ ||++.+.++.. ..++.++. ....+.+.|||||+.||||+++|||+.++.++...|||.+|. T Consensus 4 ~~~~l~~~~~~~~~r~s~~~~y~i~d~~~~~~~~l~~~~~~~~~~~~sP~g~~~~~v~~~nly~~~~~~~~~~~lT~dg~ 83 (353) T PF00930_consen 4 GKFVLFATNYTKQWRHSFKGDYYIYDIETGEITPLTPPPPKLQDAKWSPDGKYIAFVRDNNLYLRDLATGQETQLTTDGE 83 (353) T ss_dssp SSEEEEEEEEEEESSSEEEEEEEEEETTTTEEEESS-EETTBSEEEE-SSSTEEEEEETTEEEEESSTTSEEEESES--T T ss_pred CCeEEEEECcEEeeeeccceeEEEEecCCCceEECcCCccccccceeecCCCeeEEEecCceEEEECCCCCeEEeccccc Confidence 4678999999999 99999998621 23445554 367899999999999999999999999999999999999994 Q ss_pred cceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeCCCCC-----CCCCCCCC Q psy7767 137 KGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYGEPGS-----KTYLYPDV 200 (200) Q Consensus 137 ~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~~~~~-----~~~~YP~v 200 (200) ..++||.|||+||||||++..++||||||++|||++.|++.|+.+.++.+..++. ...+||++ T Consensus 84 -~~i~nG~~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~~~~~~~~~~yp~~~~~~YPk~ 151 (353) T PF00930_consen 84 -PGIYNGVPDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPLPDYSPPDSQYPEVESIRYPKA 151 (353) T ss_dssp -TTEEESB--HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEEEEESSSTESS-EEEEEE--BT T ss_pred -eeEEcCccceeccccccccccceEECCCCCEEEEEEECCcCCceEEeeccCCccccCCcccccccCCC Confidence 4579999999999999999999999999999999999999999999999988532 25677763 No 2 >KOG2100|consensus Probab=99.50 E-value=4.9e-14 Score=138.76 Aligned_cols=128 Identities=34% Similarity=0.598 Sum_probs=108.1 Q ss_pred cceeEEEEEeeeee-eeeeecCCCCccceee----cC---CCCCceeeEECcCCCEEEEEEcCcEEEEECCCCCeEEEee Q psy7767 62 IALKLSIRCLTWKQ-SHSLSGRPDTLTDVSD----LD---GSNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSINALTR 133 (200) Q Consensus 62 ~~~kv~i~~n~~~~-R~s~~a~~~~~~~v~~----lt---~~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rLT~ 133 (200) ....++...|..++ |||+.+.+. +.++.. .. .+..++.+.|||.|+.++|+.++++|..........+++. T Consensus 107 d~~~~~~~~~~~~~~rhs~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~wsp~~~~l~yv~~~niy~~~~~~~~~~~~~~ 185 (755) T KOG2100|consen 107 DRKYILLGRNYKKRFRHSYTAKYH-LYDLNTGEKLHPPEYEGSKIQYASWSPLGNDLAYVLHNNIYYQSSEEDEDVRIVS 185 (755) T ss_pred hhhhheeccCcccccceeeEEEEE-EEEcCCCCcccCcccCCCeeEEEEEcCCCCEEEEEEecccccccCcCCCceEEEe Confidence 44566777777777 999999985 322211 11 1345689999999999999999999998877788888999 Q ss_pred CCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeCCC Q psy7767 134 DGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYGEP 190 (200) Q Consensus 134 dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~~~ 190 (200) ++..++++||.|||+||||+|....+.||||||.+|+|...+++.|+.+.++.+.+. T Consensus 186 ~~~~~~i~ng~~Dw~yeeEv~~~~~a~wwsp~g~~la~~~~~dt~V~~~~~~~~~~~ 242 (755) T KOG2100|consen 186 NGGEDVIFNGKPDWIYEEEVLSSDSAIWWSPDGDRLAYASFNDTKVETINLPGYTDD 242 (755) T ss_pred cCCCceEEcCCCCceeehhhcccCccceeCCCCceeEEEEecccccceEEecccccc Confidence 888888899999999999999999999999999999999999999999999999885 No 3 >PRK05137 tolB translocation protein TolB; Provisional Probab=99.01 E-value=7.4e-09 Score=94.99 Aligned_cols=110 Identities=12% Similarity=0.030 Sum_probs=80.7 Q ss_pred ccCcceeEEEEEeeee----eeeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCCeE Q psy7767 59 STDIALKLSIRCLTWK----QSHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSIN 129 (200) Q Consensus 59 ~~~~~~kv~i~~n~~~----~R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~ 129 (200) .|-+..++++..+..+ ....+..+.. ..+.+.++. ...+..|+|||||++|+|+.. .+||+.++++|+.+ T Consensus 161 ~g~f~~~iafv~~~~~~~~~~~~l~~~d~d-g~~~~~lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~ 239 (435) T PRK05137 161 KGYFDTRIVYVAESGPKNKRIKRLAIMDQD-GANVRYLTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRE 239 (435) T ss_pred CCcCCCeEEEEEeeCCCCCcceEEEEECCC-CCCcEEEecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEE Confidence 4667778888877555 3456666653 445566665 456799999999999999973 47999999999888 Q ss_pred EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeC Q psy7767 130 ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 130 rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) +++.... ...++.|||||++|+|....++...+|.+...+ T Consensus 240 ~l~~~~g-------------------~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~ 279 (435) T PRK05137 240 LVGNFPG-------------------MTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRS 279 (435) T ss_pred EeecCCC-------------------cccCcEECCCCCEEEEEEecCCCceEEEEECCC Confidence 8885332 234689999999999987766666666665443 No 4 >PRK03629 tolB translocation protein TolB; Provisional Probab=98.95 E-value=9.2e-09 Score=94.76 Aligned_cols=112 Identities=17% Similarity=0.062 Sum_probs=77.8 Q ss_pred EEeccCcceeEEEEEeeee---eeeeeecCCCCccceeecCCC-CCceeeEECcCCCEEEEEEc----CcEEEEECCCCC Q psy7767 56 LQRSTDIALKLSIRCLTWK---QSHSLSGRPDTLTDVSDLDGS-NFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRS 127 (200) Q Consensus 56 ~~~~~~~~~kv~i~~n~~~---~R~s~~a~~~~~~~v~~lt~~-~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~ 127 (200) |-..|.+..+++++..... ....+.++.. ....+.++.+ ..+..|+|||||++|||+.. .+||+.++++|+ T Consensus 156 tg~~g~f~~riayv~~~~~~~~~~~l~~~d~d-g~~~~~lt~~~~~~~~p~wSPDG~~la~~s~~~g~~~i~i~dl~~G~ 234 (429) T PRK03629 156 TGIKGAFRTRIAYVVQTNGGQFPYELRVSDYD-GYNQFVVHRSPQPLMSPAWSPDGSKLAYVTFESGRSALVIQTLANGA 234 (429) T ss_pred cCCCCccCCeEEEEEeeCCCCcceeEEEEcCC-CCCCEEeecCCCceeeeEEcCCCCEEEEEEecCCCcEEEEEECCCCC Confidence 4445678888888876432 1456666653 3344555543 46799999999999999962 379999999998 Q ss_pred eEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEee Q psy7767 128 INALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHY 187 (200) Q Consensus 128 ~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~ 187 (200) .++|+.... ...++.|||||++|+|+....+...++.+... T Consensus 235 ~~~l~~~~~-------------------~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~ 275 (429) T PRK03629 235 VRQVASFPR-------------------HNGAPAFSPDGSKLAFALSKTGSLNLYVMDLA 275 (429) T ss_pred eEEccCCCC-------------------CcCCeEECCCCCEEEEEEcCCCCcEEEEEECC Confidence 888885322 23468999999999998655444455554443 No 5 >COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion] Probab=98.94 E-value=5e-09 Score=97.53 Aligned_cols=92 Identities=20% Similarity=0.243 Sum_probs=74.9 Q ss_pred eeEEEEEeeeeeeeeeecCCCCccceeecCCCCCc-eeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcc Q psy7767 64 LKLSIRCLTWKQSHSLSGRPDTLTDVSDLDGSNFL-QLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKG 138 (200) Q Consensus 64 ~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~~~~i-s~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~ 138 (200) ++|++..+.+++.+.|..+-. ..++.+++.+.++ ..|.|||||++|+|+.+. +||+++.+++...|+|..+. T Consensus 250 ~~l~f~~~rdg~~~iy~~dl~-~~~~~~Lt~~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~riT~~~~-- 326 (425) T COG0823 250 SKLAFSSSRDGSPDIYLMDLD-GKNLPRLTNGFGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTRLTFSGG-- 326 (425) T ss_pred CEEEEEECCCCCccEEEEcCC-CCcceecccCCccccCccCCCCCCEEEEEeCCCCCcceEEECCCCCceeEeeccCC-- Confidence 788888888888777777775 4456777764443 699999999999999864 89999999999999998764 Q ss_pred eEecccccceeeeeeeCCCcceEEccCCCEEEEEEEc Q psy7767 139 VLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFN 175 (200) Q Consensus 139 ~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D 175 (200) ....+.|||||++|+|.... T Consensus 327 -----------------~~~~p~~SpdG~~i~~~~~~ 346 (425) T COG0823 327 -----------------GNSNPVWSPDGDKIVFESSS 346 (425) T ss_pred -----------------CCcCccCCCCCCEEEEEecc Confidence 34478999999999999744 No 6 >PRK02889 tolB translocation protein TolB; Provisional Probab=98.92 E-value=1.8e-08 Score=92.50 Aligned_cols=111 Identities=16% Similarity=0.091 Sum_probs=78.6 Q ss_pred eccCcceeEEEEEeeeeeeeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEEe Q psy7767 58 RSTDIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINALT 132 (200) Q Consensus 58 ~~~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rLT 132 (200) ..|.+..+++......+....+.++.. ....+.++. ...+..|+|||||++|||+.. .+||+.++.+|+..+++ T Consensus 158 ~~g~f~~~iayv~~~~~~~~L~~~D~d-G~~~~~l~~~~~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l~ 236 (427) T PRK02889 158 ERGVFSTRIAYVIKTGNRYQLQISDAD-GQNAQSALSSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVVA 236 (427) T ss_pred CCcccccEEEEEEccCCccEEEEECCC-CCCceEeccCCCCcccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEee Confidence 355667777777654444456666553 334444443 456789999999999999973 36999999999888887 Q ss_pred eCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeC Q psy7767 133 RDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 133 ~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) .... ....+.|||||++|+|....++...+|.+...+ T Consensus 237 ~~~g-------------------~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~ 273 (427) T PRK02889 237 NFKG-------------------SNSAPAWSPDGRTLAVALSRDGNSQIYTVNADG 273 (427) T ss_pred cCCC-------------------CccceEECCCCCEEEEEEccCCCceEEEEECCC Confidence 5321 234689999999999987766666777765543 No 7 >PRK01029 tolB translocation protein TolB; Provisional Probab=98.92 E-value=1.3e-08 Score=94.12 Aligned_cols=87 Identities=13% Similarity=0.105 Sum_probs=60.9 Q ss_pred eeeecCCCCccceeecCC-CCCceeeEECcCCCE--EEEEEc----CcEEEEECCCCCeEEEeeCCCcceEeccccccee Q psy7767 77 HSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHD--LIFVKD----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVY 149 (200) Q Consensus 77 ~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~--IAFV~~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~ 149 (200) ..|..+.. ..+.+++|. +.....|+|||||+. ++|++. .+||+.++++|+.++|+.... T Consensus 166 ~l~~~d~d-G~~~~~lt~~~~~~~sP~wSPDG~~~~~~y~S~~~g~~~I~~~~l~~g~~~~lt~~~g------------- 231 (428) T PRK01029 166 ELWSVDYD-GQNLRPLTQEHSLSITPTWMHIGSGFPYLYVSYKLGVPKIFLGSLENPAGKKILALQG------------- 231 (428) T ss_pred eEEEEcCC-CCCceEcccCCCCcccceEccCCCceEEEEEEccCCCceEEEEECCCCCceEeecCCC------------- Confidence 44444443 335566665 345689999999998 555753 379999999999999997432 Q ss_pred eeeeeCCCcceEEccCCCEEEEEEEcCCcceEEE Q psy7767 150 EEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMS 183 (200) Q Consensus 150 eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~ 183 (200) ....+.|||||++|||+....+....+. T Consensus 232 ------~~~~p~wSPDG~~Laf~s~~~g~~di~~ 259 (428) T PRK01029 232 ------NQLMPTFSPRKKLLAFISDRYGNPDLFI 259 (428) T ss_pred ------CccceEECCCCCEEEEEECCCCCcceeE Confidence 2346899999999999975444333433 No 8 >PRK04043 tolB translocation protein TolB; Provisional Probab=98.90 E-value=2.5e-08 Score=92.35 Aligned_cols=112 Identities=13% Similarity=0.011 Sum_probs=79.3 Q ss_pred eccCcceeEEEEEeeee--eeeeeecCCCCccceeecCCCCCceeeEECcCCCE-EEEEE----cCcEEEEECCCCCeEE Q psy7767 58 RSTDIALKLSIRCLTWK--QSHSLSGRPDTLTDVSDLDGSNFLQLVQWSPVGHD-LIFVK----DNNLYQAYDDFRSINA 130 (200) Q Consensus 58 ~~~~~~~kv~i~~n~~~--~R~s~~a~~~~~~~v~~lt~~~~is~~~wSPDGk~-IAFV~----~gdLyv~~~~~g~~~r 130 (200) +.+.+..+++.+....+ ....+.+++. ..+.+.++.+.....|+|||||++ ++|+. +.+||+.++.+|+.++ T Consensus 149 ~~~~f~~r~~~v~~~~~~~~~~l~~~d~d-g~~~~~~~~~~~~~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl~tg~~~~ 227 (419) T PRK04043 149 PSIDWMKRKVVFSKYTGPKKSNIVLADYT-LTYQKVIVKGGLNIFPKWANKEQTAFYYTSYGERKPTLYKYNLYTGKKEK 227 (419) T ss_pred CCcCceeeEEEEEEccCCCcceEEEECCC-CCceeEEccCCCeEeEEECCCCCcEEEEEEccCCCCEEEEEECCCCcEEE Confidence 34467778887776444 3567777774 445555655556789999999997 66664 2479999999999999 Q ss_pred EeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeCC Q psy7767 131 LTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 131 LT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~~ 189 (200) |+..+. ....+.|||||++|+|.........++.+...++ T Consensus 228 lt~~~g-------------------~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g 267 (419) T PRK04043 228 IASSQG-------------------MLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTK 267 (419) T ss_pred EecCCC-------------------cEEeeEECCCCCEEEEEEccCCCcEEEEEECCCC Confidence 996332 2335789999999999976655556666654443 No 9 >KOG2281|consensus Probab=98.89 E-value=9.7e-10 Score=106.22 Aligned_cols=105 Identities=20% Similarity=0.267 Sum_probs=88.5 Q ss_pred CCCceeeEECcC-CCEEEEEEcCcEEEEECCCCCeEEEeeC--CC----cceEecccccceeeeeeeCCCcceEEccCC- Q psy7767 95 SNFLQLVQWSPV-GHDLIFVKDNNLYQAYDDFRSINALTRD--GI----KGVLFNGVADWVYEEEVFSSTKTLWFSPDG- 166 (200) Q Consensus 95 ~~~is~~~wSPD-Gk~IAFV~~gdLyv~~~~~g~~~rLT~d--g~----~~~~~nG~pdwv~eEEi~~~~~~~~WSPDG- 166 (200) .....+|+.||. +..|||++..++||.++..+.++|+|.. |. .+.+.+|+|.++.+|| |.+..++||||-+ T Consensus 199 t~~~~dP~lcP~~~~fia~i~~~dl~V~n~~~~~ekrlt~~h~g~sn~~dd~~saGVasyv~QEE-fdR~~GyWW~p~s~ 277 (867) T KOG2281|consen 199 TSTRMDPKLCPADPDFIAYIKVCDLWVLNILTGEEKRLTYIHNGSSNSKDDAISAGVASYVVQEE-FDRFEGYWWSPPSG 277 (867) T ss_pred CCCccCcccCCCCccceeeeehhhhhhhhhhhchhhceeeeeccccccccchhhcCcchHHHHHH-HHhhcceeecCCCC Confidence 455689999995 9999999999999999999999999852 21 2346899999999999 7899999999844 Q ss_pred ---CEEEEEEEcCCcceEEEEEeeCC--CCCCCCCCCCC Q psy7767 167 ---SILAYMKFNDARVHNMSYIHYGE--PGSKTYLYPDV 200 (200) Q Consensus 167 ---k~Laf~r~D~s~v~~~~i~~~~~--~~~~~~~YP~v 200 (200) .+|+|...|++.|.+++++..+. ++.++++||++ T Consensus 278 ~~~~rIlYeeVdesev~V~h~~s~~~~~~~~d~~rYPrt 316 (867) T KOG2281|consen 278 LKILRILYEEVDESEVEVIHVPSPVLGDRRVDSYRYPRT 316 (867) T ss_pred CceEEEEeeecChhheEEEeccCcccCCCccccccCCcC Confidence 47999999999999998887754 55678999985 No 10 >PRK05137 tolB translocation protein TolB; Provisional Probab=98.87 E-value=3.5e-08 Score=90.54 Aligned_cols=79 Identities=22% Similarity=0.277 Sum_probs=58.6 Q ss_pred ceeecCCCC-CceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEE Q psy7767 88 DVSDLDGSN-FLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWF 162 (200) Q Consensus 88 ~v~~lt~~~-~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~W 162 (200) .+++++... ....+.|||||++|+|+.+. +||+++++++..++||..+. ....+.| T Consensus 281 ~~~~Lt~~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~~-------------------~~~~~~~ 341 (435) T PRK05137 281 TTTRLTDSPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGGG-------------------RYSTPVW 341 (435) T ss_pred ceEEccCCCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCCC-------------------cccCeEE Confidence 456666543 34789999999999999853 79999999999999987432 2345789 Q ss_pred ccCCCEEEEEEEcCCcceEEEEE Q psy7767 163 SPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 163 SPDGk~Laf~r~D~s~v~~~~i~ 185 (200) ||||++|+|++.+.+...++.+. T Consensus 342 SpdG~~ia~~~~~~~~~~i~~~d 364 (435) T PRK05137 342 SPRGDLIAFTKQGGGQFSIGVMK 364 (435) T ss_pred CCCCCEEEEEEcCCCceEEEEEE Confidence 99999999997655433444333 No 11 >PRK04043 tolB translocation protein TolB; Provisional Probab=98.84 E-value=4.5e-08 Score=90.60 Aligned_cols=82 Identities=10% Similarity=0.058 Sum_probs=62.5 Q ss_pred eeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEc Q psy7767 89 VSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFS 163 (200) Q Consensus 89 v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WS 163 (200) .+.++. .+....+.|||||++|+|+.. .+||+.+++++..++||.... ....+.|| T Consensus 225 ~~~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~-------------------~d~~p~~S 285 (419) T PRK04043 225 KEKIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPG-------------------IDVNGNFV 285 (419) T ss_pred EEEEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccCCC-------------------ccCccEEC Confidence 344443 333567999999999999974 489999999999999997542 23457899 Q ss_pred cCCCEEEEEEEcCCcceEEEEEeeCC Q psy7767 164 PDGSILAYMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 164 PDGk~Laf~r~D~s~v~~~~i~~~~~ 189 (200) |||++|+|+....+...+|.++..++ T Consensus 286 PDG~~I~F~Sdr~g~~~Iy~~dl~~g 311 (419) T PRK04043 286 EDDKRIVFVSDRLGYPNIFMKKLNSG 311 (419) T ss_pred CCCCEEEEEECCCCCceEEEEECCCC Confidence 99999999987666667777776554 No 12 >PRK04792 tolB translocation protein TolB; Provisional Probab=98.83 E-value=4.2e-08 Score=91.06 Aligned_cols=113 Identities=17% Similarity=0.077 Sum_probs=77.7 Q ss_pred EEeccCcceeEEEEEeeeee---eeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCC Q psy7767 56 LQRSTDIALKLSIRCLTWKQ---SHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRS 127 (200) Q Consensus 56 ~~~~~~~~~kv~i~~n~~~~---R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~ 127 (200) |=..|-+..+++......+. ...+.++.. ..+.+.++. ...+..|+|||||++|+|+.. .+||+.++++++ T Consensus 175 tG~~g~f~~riayv~~~~~~~~~~~l~i~d~d-G~~~~~l~~~~~~~~~p~wSPDG~~La~~s~~~g~~~L~~~dl~tg~ 253 (448) T PRK04792 175 TGERGAFLTRIAYVVVNDKDKYPYQLMIADYD-GYNEQMLLRSPEPLMSPAWSPDGRKLAYVSFENRKAEIFVQDIYTQV 253 (448) T ss_pred cCCCccccCEEEEEEeeCCCCCceEEEEEeCC-CCCceEeecCCCcccCceECCCCCEEEEEEecCCCcEEEEEECCCCC Confidence 33456778888887765443 233334432 223445554 456789999999999999963 269999999998 Q ss_pred eEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeC Q psy7767 128 INALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 128 ~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) .++++.... ....+.|||||++|+|....++...++.+...+ T Consensus 254 ~~~lt~~~g-------------------~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~t 295 (448) T PRK04792 254 REKVTSFPG-------------------INGAPRFSPDGKKLALVLSKDGQPEIYVVDIAT 295 (448) T ss_pred eEEecCCCC-------------------CcCCeeECCCCCEEEEEEeCCCCeEEEEEECCC Confidence 888886321 234689999999999987666655666665443 No 13 >PRK01742 tolB translocation protein TolB; Provisional Probab=98.82 E-value=8.5e-08 Score=88.08 Aligned_cols=109 Identities=17% Similarity=0.073 Sum_probs=73.4 Q ss_pred ccCcceeEEEEEeeeee---eeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEE Q psy7767 59 STDIALKLSIRCLTWKQ---SHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINA 130 (200) Q Consensus 59 ~~~~~~kv~i~~n~~~~---R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~r 130 (200) +|.+-.+++++....+. ...+..+.. ......++. ...+..|+|||||++|||+..+ +||+.++.+++.++ T Consensus 164 ~g~f~~ria~v~~~~~~~~~~~i~i~d~d-g~~~~~lt~~~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~ 242 (429) T PRK01742 164 RGAFRTRIAYVVQKNGGSQPYEVRVADYD-GFNQFIVNRSSQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKV 242 (429) T ss_pred CCccCCEEEEEEEEcCCCceEEEEEECCC-CCCceEeccCCCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEE Confidence 45566777776654321 334444442 223445554 3457999999999999999742 69999999888777 Q ss_pred EeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEee Q psy7767 131 LTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHY 187 (200) Q Consensus 131 LT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~ 187 (200) ++.... ....+.|||||++|+|....++...+|.+... T Consensus 243 l~~~~g-------------------~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~ 280 (429) T PRK01742 243 VASFRG-------------------HNGAPAFSPDGSRLAFASSKDGVLNIYVMGAN 280 (429) T ss_pred EecCCC-------------------ccCceeECCCCCEEEEEEecCCcEEEEEEECC Confidence 775321 23468999999999998765665666655543 No 14 >PRK00178 tolB translocation protein TolB; Provisional Probab=98.81 E-value=1e-07 Score=86.78 Aligned_cols=112 Identities=15% Similarity=0.076 Sum_probs=74.0 Q ss_pred EeccCcceeEEEEEeeee----eeeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCC Q psy7767 57 QRSTDIALKLSIRCLTWK----QSHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRS 127 (200) Q Consensus 57 ~~~~~~~~kv~i~~n~~~----~R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~ 127 (200) -.+|.+-.+++.+..... ....+..+.. ..+.+.++. +..+..|+|||||++|||+.. .+||+.++++|+ T Consensus 156 g~~g~f~~~ia~v~~~~~~~~~~~~l~~~d~~-g~~~~~l~~~~~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~ 234 (430) T PRK00178 156 GIKGAFSTRILYVTAERFSVNTRYTLQRSDYD-GARAVTLLQSREPILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGR 234 (430) T ss_pred CCCccceeeEEEEEeeCCCCCcceEEEEECCC-CCCceEEecCCCceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCC Confidence 335566667776654321 1123333432 234445543 456799999999999999973 379999999999 Q ss_pred eEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeC Q psy7767 128 INALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 128 ~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) .++++.... ....+.|||||++|+|....++...++.+...+ T Consensus 235 ~~~l~~~~g-------------------~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~ 276 (430) T PRK00178 235 REQITNFEG-------------------LNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLAS 276 (430) T ss_pred EEEccCCCC-------------------CcCCeEECCCCCEEEEEEccCCCceEEEEECCC Confidence 888885332 234689999999999987655555555554443 No 15 >PRK02889 tolB translocation protein TolB; Provisional Probab=98.78 E-value=9.8e-08 Score=87.72 Aligned_cols=79 Identities=22% Similarity=0.266 Sum_probs=57.2 Q ss_pred ceeecCCCC-CceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEE Q psy7767 88 DVSDLDGSN-FLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWF 162 (200) Q Consensus 88 ~v~~lt~~~-~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~W 162 (200) .+++++... ....+.|||||++|+|+.+. +||..+++++..++++..+. ....+.| T Consensus 275 ~~~~lt~~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~g~-------------------~~~~~~~ 335 (427) T PRK02889 275 GLRRLTQSSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFTGS-------------------YNTSPRI 335 (427) T ss_pred CcEECCCCCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecCCC-------------------CcCceEE Confidence 455666433 34789999999999999753 79999998888888886542 2235789 Q ss_pred ccCCCEEEEEEEcCCcceEEEEE Q psy7767 163 SPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 163 SPDGk~Laf~r~D~s~v~~~~i~ 185 (200) ||||++|+|...+.+...++.+. T Consensus 336 SpDG~~Ia~~s~~~g~~~I~v~d 358 (427) T PRK02889 336 SPDGKLLAYISRVGGAFKLYVQD 358 (427) T ss_pred CCCCCEEEEEEccCCcEEEEEEE Confidence 99999999987654433344433 No 16 >PRK03629 tolB translocation protein TolB; Provisional Probab=98.78 E-value=1.1e-07 Score=87.73 Aligned_cols=79 Identities=15% Similarity=0.244 Sum_probs=59.5 Q ss_pred ceeecCCC-CCceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEE Q psy7767 88 DVSDLDGS-NFLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWF 162 (200) Q Consensus 88 ~v~~lt~~-~~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~W 162 (200) .+++++.+ .....+.|||||++|+|+.+. +||+.+++++..++++..+. ....++| T Consensus 278 ~~~~lt~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~~~-------------------~~~~~~~ 338 (429) T PRK03629 278 QIRQVTDGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITWEGS-------------------QNQDADV 338 (429) T ss_pred CEEEccCCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeecCCC-------------------CccCEEE Confidence 45566643 356899999999999999853 79999999998889986542 2346899 Q ss_pred ccCCCEEEEEEEcCCcceEEEEE Q psy7767 163 SPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 163 SPDGk~Laf~r~D~s~v~~~~i~ 185 (200) ||||++|+|+..+.+...++.+. T Consensus 339 SpDG~~Ia~~~~~~g~~~I~~~d 361 (429) T PRK03629 339 SSDGKFMVMVSSNGGQQHIAKQD 361 (429) T ss_pred CCCCCEEEEEEccCCCceEEEEE Confidence 99999999987665444444444 No 17 >PRK04792 tolB translocation protein TolB; Provisional Probab=98.72 E-value=2.5e-07 Score=85.96 Aligned_cols=81 Identities=16% Similarity=0.255 Sum_probs=59.8 Q ss_pred ceeecCCC-CCceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEE Q psy7767 88 DVSDLDGS-NFLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWF 162 (200) Q Consensus 88 ~v~~lt~~-~~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~W 162 (200) ++++++.. .....++|||||++|+|+.+. +||+.++++++.++||.++. ...++.| T Consensus 297 ~~~~lt~~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g~-------------------~~~~~~~ 357 (448) T PRK04792 297 ALTRITRHRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFEGE-------------------QNLGGSI 357 (448) T ss_pred CeEECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecCCC-------------------CCcCeeE Confidence 45556543 345789999999999999753 79999999999999986543 2235789 Q ss_pred ccCCCEEEEEEEcCCcceEEEEEee Q psy7767 163 SPDGSILAYMKFNDARVHNMSYIHY 187 (200) Q Consensus 163 SPDGk~Laf~r~D~s~v~~~~i~~~ 187 (200) ||||++|+|...+.....++.+... T Consensus 358 SpDG~~l~~~~~~~g~~~I~~~dl~ 382 (448) T PRK04792 358 TPDGRSMIMVNRTNGKFNIARQDLE 382 (448) T ss_pred CCCCCEEEEEEecCCceEEEEEECC Confidence 9999999998766554555555433 No 18 >COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion] Probab=98.72 E-value=6.6e-08 Score=90.07 Aligned_cols=76 Identities=22% Similarity=0.253 Sum_probs=60.3 Q ss_pred CCCceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) .+....|+|||||++|||+..+ +||++++.++..++||+.. +....+.|||||++|+ T Consensus 237 ~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~~~-------------------gi~~~Ps~spdG~~iv 297 (425) T COG0823 237 NGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTNGF-------------------GINTSPSWSPDGSKIV 297 (425) T ss_pred CCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceecccCC-------------------ccccCccCCCCCCEEE Confidence 4455789999999999999754 7999999999988888743 3455889999999999 Q ss_pred EEEEcCCcceEEEEEeeCC Q psy7767 171 YMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 171 f~r~D~s~v~~~~i~~~~~ 189 (200) |+....+..++|.+...++ T Consensus 298 f~Sdr~G~p~I~~~~~~g~ 316 (425) T COG0823 298 FTSDRGGRPQIYLYDLEGS 316 (425) T ss_pred EEeCCCCCcceEEECCCCC Confidence 9977666666665555443 No 19 >PRK01029 tolB translocation protein TolB; Provisional Probab=98.71 E-value=1.8e-07 Score=86.52 Aligned_cols=82 Identities=15% Similarity=0.033 Sum_probs=61.0 Q ss_pred eeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEc Q psy7767 89 VSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFS 163 (200) Q Consensus 89 v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WS 163 (200) .++++. ......|.|||||++|+|+.. .+||+.++++++.++||.... ....+.|| T Consensus 319 ~~~lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~~~-------------------~~~~p~wS 379 (428) T PRK01029 319 PRLLTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTTSPE-------------------NKESPSWA 379 (428) T ss_pred eEEeccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccCCCC-------------------CccceEEC Confidence 444554 245678999999999999974 379999999999999986431 23468999 Q ss_pred cCCCEEEEEEEcCCcceEEEEEeeCC Q psy7767 164 PDGSILAYMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 164 PDGk~Laf~r~D~s~v~~~~i~~~~~ 189 (200) |||+.|+|...+.+...++.+...++ T Consensus 380 pDG~~L~f~~~~~g~~~L~~vdl~~g 405 (428) T PRK01029 380 IDSLHLVYSAGNSNESELYLISLITK 405 (428) T ss_pred CCCCEEEEEECCCCCceEEEEECCCC Confidence 99999999877655555555554443 No 20 >PRK04922 tolB translocation protein TolB; Provisional Probab=98.70 E-value=2.7e-07 Score=84.76 Aligned_cols=108 Identities=15% Similarity=0.063 Sum_probs=71.3 Q ss_pred CcceeEEEEEeeee----eeeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEE Q psy7767 61 DIALKLSIRCLTWK----QSHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINAL 131 (200) Q Consensus 61 ~~~~kv~i~~n~~~----~R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rL 131 (200) .+..+++.+....+ ....+..+.. ......++. +..+..|+|||||+.|+|+.. .+||+.++++++.+++ T Consensus 165 ~f~~~ia~v~~~~~~~~~~~~l~i~D~~-g~~~~~lt~~~~~v~~p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~~~~l 243 (433) T PRK04922 165 AFWTRIAYVTVSGAGGAMRYALQVADSD-GYNPQTILRSAEPILSPAWSPDGKKLAYVSFERGRSAIYVQDLATGQRELV 243 (433) T ss_pred cccceEEEEEEeCCCCCceEEEEEECCC-CCCceEeecCCCccccccCCCCCCEEEEEecCCCCcEEEEEECCCCCEEEe Confidence 44555555544221 1234444442 334555554 345789999999999999973 3699999998888888 Q ss_pred eeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeC Q psy7767 132 TRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 132 T~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) +.... ....+.|||||++|+|....++...++.+...+ T Consensus 244 ~~~~g-------------------~~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~ 281 (433) T PRK04922 244 ASFRG-------------------INGAPSFSPDGRRLALTLSRDGNPEIYVMDLGS 281 (433) T ss_pred ccCCC-------------------CccCceECCCCCEEEEEEeCCCCceEEEEECCC Confidence 75321 234689999999999987665555555554443 No 21 >PRK04922 tolB translocation protein TolB; Provisional Probab=98.69 E-value=3.1e-07 Score=84.38 Aligned_cols=79 Identities=20% Similarity=0.265 Sum_probs=57.0 Q ss_pred ceeecCCCC-CceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEE Q psy7767 88 DVSDLDGSN-FLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWF 162 (200) Q Consensus 88 ~v~~lt~~~-~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~W 162 (200) .+++++... ....++|||||++|+|+.+. +||+.+++++..+++|..+. ....+.| T Consensus 283 ~~~~lt~~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~-------------------~~~~~~~ 343 (433) T PRK04922 283 QLTRLTNHFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQGN-------------------YNARASV 343 (433) T ss_pred CeEECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCCC-------------------CccCEEE Confidence 445555432 34689999999999999753 69999999899999986543 2335899 Q ss_pred ccCCCEEEEEEEcCCcceEEEEE Q psy7767 163 SPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 163 SPDGk~Laf~r~D~s~v~~~~i~ 185 (200) ||||++|+|...+.+...++.+. T Consensus 344 SpDG~~Ia~~~~~~~~~~I~v~d 366 (433) T PRK04922 344 SPDGKKIAMVHGSGGQYRIAVMD 366 (433) T ss_pred CCCCCEEEEEECCCCceeEEEEE Confidence 99999999986554333333333 No 22 >PRK00178 tolB translocation protein TolB; Provisional Probab=98.68 E-value=3.5e-07 Score=83.28 Aligned_cols=80 Identities=19% Similarity=0.215 Sum_probs=58.0 Q ss_pred ceeecCCC-CCceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEE Q psy7767 88 DVSDLDGS-NFLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWF 162 (200) Q Consensus 88 ~v~~lt~~-~~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~W 162 (200) .+++++.. .....+.|||||++|+|+.+. +||+.++++++.++++..+. ....+.| T Consensus 278 ~~~~lt~~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~~-------------------~~~~~~~ 338 (430) T PRK00178 278 QLSRVTNHPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVGN-------------------YNARPRL 338 (430) T ss_pred CeEEcccCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCCC-------------------CccceEE Confidence 44556543 335789999999999999753 79999999999999986442 2335789 Q ss_pred ccCCCEEEEEEEcCCcceEEEEEe Q psy7767 163 SPDGSILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 163 SPDGk~Laf~r~D~s~v~~~~i~~ 186 (200) ||||++|+|...+.+...++.+.. T Consensus 339 Spdg~~i~~~~~~~~~~~l~~~dl 362 (430) T PRK00178 339 SADGKTLVMVHRQDGNFHVAAQDL 362 (430) T ss_pred CCCCCEEEEEEccCCceEEEEEEC Confidence 999999999876554334444443 No 23 >PRK01742 tolB translocation protein TolB; Provisional Probab=98.57 E-value=9.5e-07 Score=81.19 Aligned_cols=72 Identities=19% Similarity=0.208 Sum_probs=56.1 Q ss_pred CceeeEECcCCCEEEEEE--cC--cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 97 FLQLVQWSPVGHDLIFVK--DN--NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~--~g--dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ....++|||||++|+|+. ++ +||+.+++++..+++|.... ...++.|||||++|+|. T Consensus 249 ~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~-------------------~~~~~~wSpDG~~i~f~ 309 (429) T PRK01742 249 HNGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTSGAG-------------------NNTEPSWSPDGQSILFT 309 (429) T ss_pred ccCceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeeccCCC-------------------CcCCEEECCCCCEEEEE Confidence 346799999999999986 33 69999999888888886332 34578999999999998 Q ss_pred EEcCCcceEEEEEee Q psy7767 173 KFNDARVHNMSYIHY 187 (200) Q Consensus 173 r~D~s~v~~~~i~~~ 187 (200) ....+...++.+... T Consensus 310 s~~~g~~~I~~~~~~ 324 (429) T PRK01742 310 SDRSGSPQVYRMSAS 324 (429) T ss_pred ECCCCCceEEEEECC Confidence 766666677766543 No 24 >TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear. Probab=98.48 E-value=2.8e-06 Score=76.13 Aligned_cols=92 Identities=18% Similarity=0.095 Sum_probs=62.3 Q ss_pred eeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEEeeCCCcceEecccccceee Q psy7767 76 SHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYE 150 (200) Q Consensus 76 R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~e 150 (200) ...+..+.- ..+.+.++. +..+..|+|||||++|||+.. .+||+.+++++...+++.... T Consensus 170 ~~l~~~d~~-g~~~~~l~~~~~~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~-------------- 234 (417) T TIGR02800 170 YELQVADYD-GANPQTITRSREPILSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKVASFPG-------------- 234 (417) T ss_pred ceEEEEcCC-CCCCEEeecCCCceecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCC-------------- Confidence 334444442 334555554 345789999999999999963 379999999887777765321 Q ss_pred eeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEee Q psy7767 151 EEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHY 187 (200) Q Consensus 151 EEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~ 187 (200) ...++.|||||+.|+|.....+...++.+... T Consensus 235 -----~~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~ 266 (417) T TIGR02800 235 -----MNGAPAFSPDGSKLAVSLSKDGNPDIYVMDLD 266 (417) T ss_pred -----CccceEECCCCCEEEEEECCCCCccEEEEECC Confidence 23457899999999998655544455554433 No 25 >TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear. Probab=98.35 E-value=9.6e-06 Score=72.70 Aligned_cols=79 Identities=22% Similarity=0.337 Sum_probs=57.0 Q ss_pred eeecCCC-CCceeeEECcCCCEEEEEEcC----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEc Q psy7767 89 VSDLDGS-NFLQLVQWSPVGHDLIFVKDN----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFS 163 (200) Q Consensus 89 v~~lt~~-~~is~~~wSPDGk~IAFV~~g----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WS 163 (200) .++++.. .....+.|+|||++|+|+.+. +||+.++++++.++++..+. ....+.|| T Consensus 270 ~~~l~~~~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~~~-------------------~~~~~~~s 330 (417) T TIGR02800 270 LTRLTNGPGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFRGG-------------------YNASPSWS 330 (417) T ss_pred EEECCCCCCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCCC-------------------CccCeEEC Confidence 4444433 234688999999999999743 79999999888888886543 23467899 Q ss_pred cCCCEEEEEEEcCCcceEEEEEe Q psy7767 164 PDGSILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 164 PDGk~Laf~r~D~s~v~~~~i~~ 186 (200) |||+.|+|...+.....++.+.. T Consensus 331 pdg~~i~~~~~~~~~~~i~~~d~ 353 (417) T TIGR02800 331 PDGDLIAFVHREGGGFNIAVMDL 353 (417) T ss_pred CCCCEEEEEEccCCceEEEEEeC Confidence 99999999876654444444443 No 26 >PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. Probab=98.07 E-value=0.00016 Score=60.22 Aligned_cols=60 Identities=22% Similarity=0.262 Sum_probs=41.4 Q ss_pred CCCceeeEECcCCCEEEEEE----cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK----DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~----~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) ......+.|||+|+.|+... .|+|.+.+++..+ .|..... .....+.|||||++|+ T Consensus 100 ~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~~~--~i~~~~~------------------~~~t~~~WsPdGr~~~ 159 (194) T PF08662_consen 100 TQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVRKKK--KISTFEH------------------SDATDVEWSPDGRYLA 159 (194) T ss_pred CCCceEEEECCCCCEEEEEEccCCCcEEEEEECCCCE--Eeecccc------------------CcEEEEEEcCCCCEEE Confidence 34556789999999999984 3578888887333 3332211 1234678999999998 Q ss_pred EEEE Q psy7767 171 YMKF 174 (200) Q Consensus 171 f~r~ 174 (200) .... T Consensus 160 ta~t 163 (194) T PF08662_consen 160 TATT 163 (194) T ss_pred EEEe Confidence 7744 No 27 >PRK13616 lipoprotein LpqB; Provisional Probab=98.01 E-value=2.6e-05 Score=75.66 Aligned_cols=85 Identities=19% Similarity=0.108 Sum_probs=47.2 Q ss_pred eeeeecCCCCccceeecCCCCCceeeEECcCCCEEEEEEcC-c-EEEEECCC-CCeEEEeeCCCcceEecccccceeeee Q psy7767 76 SHSLSGRPDTLTDVSDLDGSNFLQLVQWSPVGHDLIFVKDN-N-LYQAYDDF-RSINALTRDGIKGVLFNGVADWVYEEE 152 (200) Q Consensus 76 R~s~~a~~~~~~~v~~lt~~~~is~~~wSPDGk~IAFV~~g-d-Lyv~~~~~-g~~~rLT~dg~~~~~~nG~pdwv~eEE 152 (200) ++.|.+... ...+.++.+.....|+|||||++|+|+.++ . +++....+ ++...+.-++.+ ... + T Consensus 379 s~Lwv~~~g--g~~~~lt~g~~~t~PsWspDG~~lw~v~dg~~~~~v~~~~~~gql~~~~vd~ge----------~~~-~ 445 (591) T PRK13616 379 SSLWVGPLG--GVAVQVLEGHSLTRPSWSLDADAVWVVVDGNTVVRVIRDPATGQLARTPVDASA----------VAS-R 445 (591) T ss_pred eEEEEEeCC--CcceeeecCCCCCCceECCCCCceEEEecCcceEEEeccCCCceEEEEeccCch----------hhh-c Confidence 344554431 123455555568999999999999999754 2 22222111 111111111110 000 0 Q ss_pred eeCCCcceEEccCCCEEEEEE Q psy7767 153 VFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 153 i~~~~~~~~WSPDGk~Laf~r 173 (200) +-+...++.|||||++|||+. T Consensus 446 ~~g~Issl~wSpDG~RiA~i~ 466 (591) T PRK13616 446 VPGPISELQLSRDGVRAAMII 466 (591) T ss_pred cCCCcCeEEECCCCCEEEEEE Confidence 012467899999999999986 No 28 >PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. Probab=97.90 E-value=0.00046 Score=57.46 Aligned_cols=60 Identities=23% Similarity=0.402 Sum_probs=39.9 Q ss_pred CCCceeeEECcCCCEEEEEEc---CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD---NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~---gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) .+.+.+++|||+|+++|.+.+ ..+-+.++.......+ +. .....+.|||+|++|+. T Consensus 59 ~~~I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~~i~~~---~~------------------~~~n~i~wsP~G~~l~~ 117 (194) T PF08662_consen 59 EGPIHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGKKIFSF---GT------------------QPRNTISWSPDGRFLVL 117 (194) T ss_pred CCceEEEEECcCCCEEEEEEccCCcccEEEcCcccEeEee---cC------------------CCceEEEECCCCCEEEE Confidence 345899999999999998864 2566666642111111 11 12346899999999998 Q ss_pred EEEc Q psy7767 172 MKFN 175 (200) Q Consensus 172 ~r~D 175 (200) .-.+ T Consensus 118 ~g~~ 121 (194) T PF08662_consen 118 AGFG 121 (194) T ss_pred EEcc Confidence 7544 No 29 >PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A .... Probab=97.57 E-value=0.00017 Score=44.74 Aligned_cols=21 Identities=19% Similarity=0.377 Sum_probs=16.8 Q ss_pred CCcceEEccCCCEEEEEEEcC Q psy7767 156 STKTLWFSPDGSILAYMKFND 176 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~ 176 (200) .+..+.|||||++|+|++..+ T Consensus 10 ~~~~p~~SpDGk~i~f~s~~~ 30 (39) T PF07676_consen 10 DDGSPAWSPDGKYIYFTSNRN 30 (39) T ss_dssp SEEEEEE-TTSSEEEEEEECT T ss_pred cccCEEEecCCCEEEEEecCC Confidence 345789999999999998776 No 30 >COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown] Probab=97.44 E-value=0.00041 Score=66.00 Aligned_cols=67 Identities=21% Similarity=0.360 Sum_probs=50.4 Q ss_pred ceeecCCC-CCceeeEECcCCCEEEEEE--------cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCc Q psy7767 88 DVSDLDGS-NFLQLVQWSPVGHDLIFVK--------DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTK 158 (200) Q Consensus 88 ~v~~lt~~-~~is~~~wSPDGk~IAFV~--------~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~ 158 (200) ...+++.+ +-+..|++||||++|||.+ ..+||+.+.++|+.+|||.-|.+ | .. T Consensus 70 ~~~ritS~lGVvnn~kf~pdGrkvaf~rv~~~ss~~taDly~v~~e~Ge~kRiTyfGr~----------------f--T~ 131 (668) T COG4946 70 KPLRITSGLGVVNNPKFSPDGRKVAFSRVMLGSSLQTADLYVVPSEDGEAKRITYFGRR----------------F--TR 131 (668) T ss_pred CeeEEecccceeccccCCCCCcEEEEEEEEecCCCccccEEEEeCCCCcEEEEEEeccc----------------c--ce Confidence 33566654 4458999999999999954 24899999999999999986532 1 12 Q ss_pred ceEEccCCCEEEEE Q psy7767 159 TLWFSPDGSILAYM 172 (200) Q Consensus 159 ~~~WSPDGk~Laf~ 172 (200) -.-|+|||+-|++. T Consensus 132 VaG~~~dg~iiV~T 145 (668) T COG4946 132 VAGWIPDGEIIVST 145 (668) T ss_pred eeccCCCCCEEEEe Confidence 24599999987765 No 31 >PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A .... Probab=97.37 E-value=0.0004 Score=43.05 Aligned_cols=27 Identities=22% Similarity=0.369 Sum_probs=19.5 Q ss_pred eecCC-CCCceeeEECcCCCEEEEEEcC Q psy7767 90 SDLDG-SNFLQLVQWSPVGHDLIFVKDN 116 (200) Q Consensus 90 ~~lt~-~~~is~~~wSPDGk~IAFV~~g 116 (200) ++++. ......|.|||||++|+|+++. T Consensus 2 ~~~t~~~~~~~~p~~SpDGk~i~f~s~~ 29 (39) T PF07676_consen 2 KQLTNSPGDDGSPAWSPDGKYIYFTSNR 29 (39) T ss_dssp EEES-SSSSEEEEEE-TTSSEEEEEEEC T ss_pred cCcccCCccccCEEEecCCCEEEEEecC Confidence 44554 3456999999999999999754 No 32 >KOG2394|consensus Probab=97.36 E-value=0.00066 Score=65.13 Aligned_cols=80 Identities=21% Similarity=0.224 Sum_probs=47.1 Q ss_pred ceeecCC-CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccC Q psy7767 88 DVSDLDG-SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPD 165 (200) Q Consensus 88 ~v~~lt~-~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPD 165 (200) .+.+... ...+....+||||+.||-|. ++-|-+-+.+.-+..-+-. .+ |+..--+.|||| T Consensus 282 Pv~~w~~~~g~in~f~FS~DG~~LA~VSqDGfLRvF~fdt~eLlg~mk------SY------------FGGLLCvcWSPD 343 (636) T KOG2394|consen 282 PVARWHIGEGSINEFAFSPDGKYLATVSQDGFLRIFDFDTQELLGVMK------SY------------FGGLLCVCWSPD 343 (636) T ss_pred ccceeEeccccccceeEcCCCceEEEEecCceEEEeeccHHHHHHHHH------hh------------ccceEEEEEcCC Confidence 3444433 34678999999999999996 3444444443211111110 01 344456799999 Q ss_pred CCEEEEEEEcCCcceEEEEEe Q psy7767 166 GSILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 166 Gk~Laf~r~D~s~v~~~~i~~ 186 (200) ||+|+ .-..+..|.++.+.. T Consensus 344 GKyIv-tGGEDDLVtVwSf~e 363 (636) T KOG2394|consen 344 GKYIV-TGGEDDLVTVWSFEE 363 (636) T ss_pred ccEEE-ecCCcceEEEEEecc Confidence 99986 334445666665543 No 33 >COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown] Probab=97.35 E-value=0.0031 Score=60.25 Aligned_cols=62 Identities=16% Similarity=0.207 Sum_probs=50.8 Q ss_pred CCCceeeEECcCCCEEEEEE-----cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-----DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSIL 169 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-----~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~L 169 (200) .+-+.+..|||++++|||.. ...|-+.++++++...+|+.-. .+.+++|-|||++| T Consensus 443 ~~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~~Kiy~vTT~ta-------------------~DfsPaFD~d~ryL 503 (668) T COG4946 443 YGLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDGGKIYDVTTPTA-------------------YDFSPAFDPDGRYL 503 (668) T ss_pred cceeEEEEEcCCceeEEEecCcceeeeeEEEEecCCCeEEEecCCcc-------------------cccCcccCCCCcEE Confidence 35679999999999999995 2467778889999999987432 56689999999999 Q ss_pred EEEEEc Q psy7767 170 AYMKFN 175 (200) Q Consensus 170 af~r~D 175 (200) .|++.+ T Consensus 504 YfLs~R 509 (668) T COG4946 504 YFLSAR 509 (668) T ss_pred EEEecc Confidence 999643 No 34 >PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A. Probab=97.24 E-value=0.0013 Score=61.11 Aligned_cols=75 Identities=29% Similarity=0.357 Sum_probs=44.2 Q ss_pred CCCccceeecCCCC------CceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEEeeCCCcceEecccccceeeee Q psy7767 83 PDTLTDVSDLDGSN------FLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEE 152 (200) Q Consensus 83 ~~~~~~v~~lt~~~------~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEE 152 (200) +-+...|++||+.. ......|.+||+++.|..+ .++|.+++++++..|||..... T Consensus 17 ~~TG~~VtrLT~~~~~~h~~YF~~~~ft~dG~kllF~s~~dg~~nly~lDL~t~~i~QLTdg~g~--------------- 81 (386) T PF14583_consen 17 PDTGHRVTRLTPPDGHSHRLYFYQNCFTDDGRKLLFASDFDGNRNLYLLDLATGEITQLTDGPGD--------------- 81 (386) T ss_dssp TTT--EEEE-S-TTS-EE---TTS--B-TTS-EEEEEE-TTSS-EEEEEETTT-EEEE---SS-B--------------- T ss_pred CCCCceEEEecCCCCcccceeecCCCcCCCCCEEEEEeccCCCcceEEEEcccCEEEECccCCCC--------------- Confidence 54566888998633 3467899999999999874 4899999999999999985421 Q ss_pred eeCCCcceEEccCCCEEEEEEEc Q psy7767 153 VFSSTKTLWFSPDGSILAYMKFN 175 (200) Q Consensus 153 i~~~~~~~~WSPDGk~Laf~r~D 175 (200) ...+..+||+++.|.|++.. T Consensus 82 ---~~~g~~~s~~~~~~~Yv~~~ 101 (386) T PF14583_consen 82 ---NTFGGFLSPDDRALYYVKNG 101 (386) T ss_dssp ----TTT-EE-TTSSEEEEEETT T ss_pred ---CccceEEecCCCeEEEEECC Confidence 12245789999999998643 No 35 >PRK13616 lipoprotein LpqB; Provisional Probab=97.15 E-value=0.0021 Score=62.47 Aligned_cols=57 Identities=14% Similarity=0.022 Sum_probs=44.6 Q ss_pred CCceeeEECcCCCEEEEEEc---------CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCC Q psy7767 96 NFLQLVQWSPVGHDLIFVKD---------NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDG 166 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~---------gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDG 166 (200) ..+.++++||||+.+||+.. ..||+.+. ++..+++|. |. ....++||||| T Consensus 350 ~~vsspaiSpdG~~vA~v~~~~~~~~d~~s~Lwv~~~-gg~~~~lt~-g~-------------------~~t~PsWspDG 408 (591) T PRK13616 350 GNITSAALSRSGRQVAAVVTLGRGAPDPASSLWVGPL-GGVAVQVLE-GH-------------------SLTRPSWSLDA 408 (591) T ss_pred cCcccceECCCCCEEEEEEeecCCCCCcceEEEEEeC-CCcceeeec-CC-------------------CCCCceECCCC Confidence 45789999999999999982 37999886 455577775 32 35678999999 Q ss_pred CEEEEEE Q psy7767 167 SILAYMK 173 (200) Q Consensus 167 k~Laf~r 173 (200) +.|+|+. T Consensus 409 ~~lw~v~ 415 (591) T PRK13616 409 DAVWVVV 415 (591) T ss_pred CceEEEe Confidence 9888885 No 36 >PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction:Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A .... Probab=97.06 E-value=0.023 Score=50.96 Aligned_cols=94 Identities=16% Similarity=0.103 Sum_probs=66.8 Q ss_pred CcceeEEEEEeeeeeeeeeecCCCCccceeecCCCC-Cc-eeeEECcCCCEEEEEEcC------cEEEEECC-CCCeEEE Q psy7767 61 DIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLDGSN-FL-QLVQWSPVGHDLIFVKDN------NLYQAYDD-FRSINAL 131 (200) Q Consensus 61 ~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~~~-~i-s~~~wSPDGk~IAFV~~g------dLyv~~~~-~g~~~rL 131 (200) .....++|....+|-+|.|.-+.- ....+.+|.+. .+ .-+.|+++++.|.|+... +||..+++ ++..++| T Consensus 245 ~~~~~~l~~s~~~G~~hly~~~~~-~~~~~~lT~G~~~V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~~~~~~~L 323 (353) T PF00930_consen 245 PDGNEFLWISERDGYRHLYLYDLD-GGKPRQLTSGDWEVTSILGWDEDNNRIYFTANGDNPGERHLYRVSLDSGGEPKCL 323 (353) T ss_dssp TTSSEEEEEEETTSSEEEEEEETT-SSEEEESS-SSS-EEEEEEEECTSSEEEEEESSGGTTSBEEEEEETTETTEEEES T ss_pred CCCCEEEEEEEcCCCcEEEEEccc-ccceeccccCceeecccceEcCCCCEEEEEecCCCCCceEEEEEEeCCCCCeEec Confidence 334445566655666888777774 44567788653 44 468999999999999864 89999999 8999999 Q ss_pred eeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 132 TRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 132 T~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) |..... ...+.+||||++++-... T Consensus 324 T~~~~~-------------------~~~~~~Spdg~y~v~~~s 347 (353) T PF00930_consen 324 TCEDGD-------------------HYSASFSPDGKYYVDTYS 347 (353) T ss_dssp STTSST-------------------TEEEEE-TTSSEEEEEEE T ss_pred cCCCCC-------------------ceEEEECCCCCEEEEEEc Confidence 986531 136899999999876543 No 37 >KOG0318|consensus Probab=96.88 E-value=0.0092 Score=57.32 Aligned_cols=74 Identities=19% Similarity=0.214 Sum_probs=50.3 Q ss_pred CCCceeeEECcCCCEEEEE-EcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ..++...++||||+..|=+ .++.+|+.+-.+|+..-...+.... -|...+++||||+++|+-+. T Consensus 190 skFV~~VRysPDG~~Fat~gsDgki~iyDGktge~vg~l~~~~aH---------------kGsIfalsWsPDs~~~~T~S 254 (603) T KOG0318|consen 190 SKFVNCVRYSPDGSRFATAGSDGKIYIYDGKTGEKVGELEDSDAH---------------KGSIFALSWSPDSTQFLTVS 254 (603) T ss_pred ccceeeEEECCCCCeEEEecCCccEEEEcCCCccEEEEecCCCCc---------------cccEEEEEECCCCceEEEec Confidence 4567899999999998888 4788999976655544332222111 13456899999999998776 Q ss_pred EcCCcceEEEE Q psy7767 174 FNDARVHNMSY 184 (200) Q Consensus 174 ~D~s~v~~~~i 184 (200) .|.+ .+++.+ T Consensus 255 aDkt-~KIWdV 264 (603) T KOG0318|consen 255 ADKT-IKIWDV 264 (603) T ss_pred CCce-EEEEEe Confidence 5543 444444 No 38 >PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A .... Probab=96.78 E-value=0.0052 Score=55.61 Aligned_cols=98 Identities=20% Similarity=0.230 Sum_probs=56.7 Q ss_pred EEEeccCcceeEEEEEeeee---eeeeeecCCCCccceeecCCCC---CceeeEECcCCCEEEEEE--cC----cEEEEE Q psy7767 55 YLQRSTDIALKLSIRCLTWK---QSHSLSGRPDTLTDVSDLDGSN---FLQLVQWSPVGHDLIFVK--DN----NLYQAY 122 (200) Q Consensus 55 y~~~~~~~~~kv~i~~n~~~---~R~s~~a~~~~~~~v~~lt~~~---~is~~~wSPDGk~IAFV~--~g----dLyv~~ 122 (200) |....++..+.+.++..... ..... +-++..+..+. .+...++||||+.|||.. +| .|++.+ T Consensus 83 ~~~~~~~~~~~~~~r~~~~~~~~~~~ev------llD~n~l~~~~~~~~~~~~~~Spdg~~la~~~s~~G~e~~~l~v~D 156 (414) T PF02897_consen 83 YSRNQGGKNYPVLYRRKTDEEDGPEEEV------LLDPNELAKDGGYVSLGGFSVSPDGKRLAYSLSDGGSEWYTLRVFD 156 (414) T ss_dssp EEEE-SS-SS-EEEEEETTS-TS-C-EE------EEEGGGGSTTSS-EEEEEEEETTTSSEEEEEEEETTSSEEEEEEEE T ss_pred EEEEcCCCceEEEEEEecccCCCCceEE------EEcchHhhccCceEEeeeeeECCCCCEEEEEecCCCCceEEEEEEE Confidence 34446677777777766541 11011 11222232222 224679999999999995 33 589999 Q ss_pred CCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCC Q psy7767 123 DDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDA 177 (200) Q Consensus 123 ~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s 177 (200) +++|+.. .++-.. ....++.|++||+.|.|.+.++. T Consensus 157 l~tg~~l---~d~i~~----------------~~~~~~~W~~d~~~~~y~~~~~~ 192 (414) T PF02897_consen 157 LETGKFL---PDGIEN----------------PKFSSVSWSDDGKGFFYTRFDED 192 (414) T ss_dssp TTTTEEE---EEEEEE----------------EESEEEEECTTSSEEEEEECSTT T ss_pred CCCCcCc---CCcccc----------------cccceEEEeCCCCEEEEEEeCcc Confidence 9888432 211100 01223899999999999998875 No 39 >TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown. Probab=96.67 E-value=0.0051 Score=62.32 Aligned_cols=63 Identities=22% Similarity=0.162 Sum_probs=46.4 Q ss_pred CCEEEEEEc--CcEEEEECCCCCeEEE-eeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE-EEEcC--Ccce Q psy7767 107 GHDLIFVKD--NNLYQAYDDFRSINAL-TRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY-MKFND--ARVH 180 (200) Q Consensus 107 Gk~IAFV~~--gdLyv~~~~~g~~~rL-T~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf-~r~D~--s~v~ 180 (200) ..+||||.+ ++|.+.|.++...++| +.+.. ....|.|||||++||| +.... +... T Consensus 318 ~tkiAfv~~~~~~L~~~D~dG~n~~~ve~~~~~-------------------~i~sP~~SPDG~~vAY~ts~e~~~g~s~ 378 (912) T TIGR02171 318 KAKLAFRNDVTGNLAYIDYTKGASRAVEIEDTI-------------------SVYHPDISPDGKKVAFCTGIEGLPGKSS 378 (912) T ss_pred eeeEEEEEcCCCeEEEEecCCCCceEEEecCCC-------------------ceecCcCCCCCCEEEEEEeecCCCCCce Confidence 368999985 5999999998888888 66543 2336899999999999 66665 4555 Q ss_pred EEEEEeeC Q psy7767 181 NMSYIHYG 188 (200) Q Consensus 181 ~~~i~~~~ 188 (200) +|...+.. T Consensus 379 vYv~~L~t 386 (912) T TIGR02171 379 VYVRNLNA 386 (912) T ss_pred EEEEehhc Confidence 55544443 No 40 >PRK10115 protease 2; Provisional Probab=96.35 E-value=0.011 Score=58.43 Aligned_cols=57 Identities=11% Similarity=0.093 Sum_probs=42.8 Q ss_pred ceeeEECcCCCEEEEEEcC------cEEEEECCCCC--eEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEE Q psy7767 98 LQLVQWSPVGHDLIFVKDN------NLYQAYDDFRS--INALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSIL 169 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~g------dLyv~~~~~g~--~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~L 169 (200) +....|||||+.|||..+. +|++.++++|. ...+.. ....+.|+|||+.| T Consensus 129 l~~~~~Spdg~~la~~~d~~G~E~~~l~v~d~~tg~~l~~~i~~----------------------~~~~~~w~~D~~~~ 186 (686) T PRK10115 129 LGGMAITPDNTIMALAEDFLSRRQYGIRFRNLETGNWYPELLDN----------------------VEPSFVWANDSWTF 186 (686) T ss_pred EeEEEECCCCCEEEEEecCCCcEEEEEEEEECCCCCCCCccccC----------------------cceEEEEeeCCCEE Confidence 5778999999999999742 69999988775 222221 11347999999999 Q ss_pred EEEEEcC Q psy7767 170 AYMKFND 176 (200) Q Consensus 170 af~r~D~ 176 (200) +|.+.++ T Consensus 187 ~y~~~~~ 193 (686) T PRK10115 187 YYVRKHP 193 (686) T ss_pred EEEEecC Confidence 9998864 No 41 >PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues. Probab=96.25 E-value=0.068 Score=46.28 Aligned_cols=73 Identities=18% Similarity=0.098 Sum_probs=44.2 Q ss_pred CceeeEECcCCCEEEEEE----cCcEEEEECC---CCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEE Q psy7767 97 FLQLVQWSPVGHDLIFVK----DNNLYQAYDD---FRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSIL 169 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~----~gdLyv~~~~---~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~L 169 (200) .+..+++||||.++|++. ++.||+..+. +|....++..-. + ... + .....++.|++|++-+ T Consensus 113 ~I~~l~vSpDG~RvA~v~~~~~~~~v~va~V~r~~~g~~~~l~~~~~--~-~~~--------~-~~~v~~v~W~~~~~L~ 180 (253) T PF10647_consen 113 RITALRVSPDGTRVAVVVEDGGGGRVYVAGVVRDGDGVPRRLTGPRR--V-APP--------L-LSDVTDVAWSDDSTLV 180 (253) T ss_pred ceEEEEECCCCcEEEEEEecCCCCeEEEEEEEeCCCCCcceeccceE--e-ccc--------c-cCcceeeeecCCCEEE Confidence 789999999999999999 7899997554 232333332110 0 000 0 1245689999999644 Q ss_pred EEEEEcCCcceE Q psy7767 170 AYMKFNDARVHN 181 (200) Q Consensus 170 af~r~D~s~v~~ 181 (200) +..+.....+.. T Consensus 181 V~~~~~~~~~~~ 192 (253) T PF10647_consen 181 VLGRSAGGPVVR 192 (253) T ss_pred EEeCCCCCceeE Confidence 433444343333 No 42 >TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol. Probab=96.02 E-value=0.039 Score=45.64 Aligned_cols=66 Identities=14% Similarity=0.045 Sum_probs=41.7 Q ss_pred eeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcC Q psy7767 99 QLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFND 176 (200) Q Consensus 99 s~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~ 176 (200) ..+.+||||+.+.... ++.|.+.++++++..+....+. ....+.|+|||++|+....++ T Consensus 210 ~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~~~~-------------------~~~~~~~~~~g~~l~~~~~~~ 270 (300) T TIGR03866 210 VGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLLVGQ-------------------RVWQLAFTPDEKYLLTTNGVS 270 (300) T ss_pred cceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEeCC-------------------CcceEEECCCCCEEEEEcCCC Confidence 4578999999864433 3568888887766543322221 234688999999886544444 Q ss_pred CcceEEE Q psy7767 177 ARVHNMS 183 (200) Q Consensus 177 s~v~~~~ 183 (200) +.+..+. T Consensus 271 ~~i~v~d 277 (300) T TIGR03866 271 NDVSVID 277 (300) T ss_pred CeEEEEE Confidence 5455444 No 43 >KOG1539|consensus Probab=96.02 E-value=0.041 Score=55.29 Aligned_cols=67 Identities=22% Similarity=0.241 Sum_probs=51.1 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ++.+.++.+||||+||+-... +-|.+.|+.++... -+.-+. -..++++||+|..||-+ T Consensus 576 ~nritd~~FS~DgrWlisasmD~tIr~wDlpt~~lID~~~vd~--------------------~~~sls~SPngD~LAT~ 635 (910) T KOG1539|consen 576 GNRITDMTFSPDGRWLISASMDSTIRTWDLPTGTLIDGLLVDS--------------------PCTSLSFSPNGDFLATV 635 (910) T ss_pred ccceeeeEeCCCCcEEEEeecCCcEEEEeccCcceeeeEecCC--------------------cceeeEECCCCCEEEEE Confidence 677899999999999998864 78999999877653 222111 23478999999999999 Q ss_pred EEcCCcceE Q psy7767 173 KFNDARVHN 181 (200) Q Consensus 173 r~D~s~v~~ 181 (200) ..|+.++-. T Consensus 636 Hvd~~gIyl 644 (910) T KOG1539|consen 636 HVDQNGIYL 644 (910) T ss_pred EecCceEEE Confidence 888765443 No 44 >TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol. Probab=96.00 E-value=0.17 Score=41.84 Aligned_cols=67 Identities=15% Similarity=0.096 Sum_probs=41.2 Q ss_pred CceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 97 FLQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) ....+.|+|||+.++.+. ++.|++.+..+++..+....+. ....+.|+|||+.|+.... T Consensus 32 ~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~-------------------~~~~~~~~~~g~~l~~~~~ 92 (300) T TIGR03866 32 RPRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGP-------------------DPELFALHPNGKILYIANE 92 (300) T ss_pred CCCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCC-------------------CccEEEECCCCCEEEEEcC Confidence 356689999999875554 3578888888776543222121 1235689999987754433 Q ss_pred cCCcceEE Q psy7767 175 NDARVHNM 182 (200) Q Consensus 175 D~s~v~~~ 182 (200) +++.+..+ T Consensus 93 ~~~~l~~~ 100 (300) T TIGR03866 93 DDNLVTVI 100 (300) T ss_pred CCCeEEEE Confidence 34434333 No 45 >KOG2139|consensus Probab=95.50 E-value=0.13 Score=47.94 Aligned_cols=63 Identities=21% Similarity=0.357 Sum_probs=38.5 Q ss_pred CCCceeeEECcCCCEEEEEEcCcEEEEE-CC-CC-CeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNNLYQAY-DD-FR-SINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gdLyv~~-~~-~g-~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) .++++.++|||||.++--..-..+|..- .. .- ..+.++-.| +..+..|||+|++|.| T Consensus 238 lgg~slLkwSPdgd~lfaAt~davfrlw~e~q~wt~erw~lgsg--------------------rvqtacWspcGsfLLf 297 (445) T KOG2139|consen 238 LGGFSLLKWSPDGDVLFAATCDAVFRLWQENQSWTKERWILGSG--------------------RVQTACWSPCGSFLLF 297 (445) T ss_pred CCceeeEEEcCCCCEEEEecccceeeeehhcccceecceeccCC--------------------ceeeeeecCCCCEEEE Confidence 5778999999999997444333333321 11 11 122333222 3446689999999999 Q ss_pred EEEcCC Q psy7767 172 MKFNDA 177 (200) Q Consensus 172 ~r~D~s 177 (200) +...+. T Consensus 298 ~~sgsp 303 (445) T KOG2139|consen 298 ACSGSP 303 (445) T ss_pred EEcCCc Confidence 866554 No 46 >cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto Probab=95.40 E-value=0.15 Score=40.25 Aligned_cols=71 Identities=17% Similarity=0.112 Sum_probs=46.9 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ...+..+.|+|+|+.|++.. ++.|.+.+++++.......... .....+.|+|+++.|+... T Consensus 9 ~~~i~~~~~~~~~~~l~~~~~~g~i~i~~~~~~~~~~~~~~~~------------------~~i~~~~~~~~~~~l~~~~ 70 (289) T cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHT------------------GPVRDVAASADGTYLASGS 70 (289) T ss_pred CCCEEEEEEcCCCCEEEEeecCcEEEEEEeeCCCcEEEEecCC------------------cceeEEEECCCCCEEEEEc Confidence 35689999999999999987 7789998887665332222111 0223678999998776553 Q ss_pred EcCCcceEEEE Q psy7767 174 FNDARVHNMSY 184 (200) Q Consensus 174 ~D~s~v~~~~i 184 (200) .++.+..+.+ T Consensus 71 -~~~~i~i~~~ 80 (289) T cd00200 71 -SDKTIRLWDL 80 (289) T ss_pred -CCCeEEEEEc Confidence 3444544443 No 47 >PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues. Probab=95.23 E-value=0.23 Score=42.95 Aligned_cols=72 Identities=15% Similarity=0.104 Sum_probs=40.7 Q ss_pred CCCCceeeEECcCCCEEEEEEcCcEEEE--ECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKDNNLYQA--YDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~gdLyv~--~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) .+..+..|+|+|+|...++..+...-.. +...+...++.-+.. +.-+...++.+||||.++|+ T Consensus 64 ~g~~l~~PS~d~~g~~W~v~~~~~~~~~~~~~~~g~~~~~~v~~~---------------~~~~~I~~l~vSpDG~RvA~ 128 (253) T PF10647_consen 64 TGGSLTRPSWDPDGWVWTVDDGSGGVRVVRDSASGTGEPVEVDWP---------------GLRGRITALRVSPDGTRVAV 128 (253) T ss_pred cCCccccccccCCCCEEEEEcCCCceEEEEecCCCcceeEEeccc---------------ccCCceEEEEECCCCcEEEE Confidence 4567899999999776666554332221 112222222221110 00013568999999999999 Q ss_pred EEEcCCcce Q psy7767 172 MKFNDARVH 180 (200) Q Consensus 172 ~r~D~s~v~ 180 (200) +..+....+ T Consensus 129 v~~~~~~~~ 137 (253) T PF10647_consen 129 VVEDGGGGR 137 (253) T ss_pred EEecCCCCe Confidence 976544433 No 48 >TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown. Probab=95.19 E-value=0.051 Score=55.31 Aligned_cols=65 Identities=12% Similarity=0.033 Sum_probs=42.1 Q ss_pred cCcceeEEEEEeeeeeeeeeecCCCCccceee--cCCCCCceeeEECcCCCEEEE-EE--c----CcEEEEECCCCC Q psy7767 60 TDIALKLSIRCLTWKQSHSLSGRPDTLTDVSD--LDGSNFLQLVQWSPVGHDLIF-VK--D----NNLYQAYDDFRS 127 (200) Q Consensus 60 ~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~--lt~~~~is~~~wSPDGk~IAF-V~--~----gdLyv~~~~~g~ 127 (200) +.+-.|++.+....+ ....+++. ....+. +..+..+..|+|||||++||| |. + ..||+.++++.. T Consensus 315 ~~f~tkiAfv~~~~~--~L~~~D~d-G~n~~~ve~~~~~~i~sP~~SPDG~~vAY~ts~e~~~g~s~vYv~~L~t~~ 388 (912) T TIGR02171 315 GTYKAKLAFRNDVTG--NLAYIDYT-KGASRAVEIEDTISVYHPDISPDGKKVAFCTGIEGLPGKSSVYVRNLNASG 388 (912) T ss_pred ccceeeEEEEEcCCC--eEEEEecC-CCCceEEEecCCCceecCcCCCCCCEEEEEEeecCCCCCceEEEEehhccC Confidence 345667777765333 44445553 222222 345677899999999999999 63 2 269999988643 No 49 >cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto Probab=95.16 E-value=0.17 Score=39.83 Aligned_cols=71 Identities=18% Similarity=0.167 Sum_probs=48.6 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ...+..+.|+|+++.++... ++.|++.++..++..+...... +....+.|+|||+.|++.. T Consensus 135 ~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~------------------~~i~~~~~~~~~~~l~~~~ 196 (289) T cd00200 135 TDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHT------------------GEVNSVAFSPDGEKLLSSS 196 (289) T ss_pred CCcEEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeEecCc------------------cccceEEECCCcCEEEEec Confidence 34689999999998888877 8899999887655433332111 1345789999998887664 Q ss_pred EcCCcceEEEE Q psy7767 174 FNDARVHNMSY 184 (200) Q Consensus 174 ~D~s~v~~~~i 184 (200) . ++.+..+.+ T Consensus 197 ~-~~~i~i~d~ 206 (289) T cd00200 197 S-DGTIKLWDL 206 (289) T ss_pred C-CCcEEEEEC Confidence 4 455555543 No 50 >PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A .... Probab=94.80 E-value=0.28 Score=44.95 Aligned_cols=70 Identities=19% Similarity=0.229 Sum_probs=40.8 Q ss_pred ceeecCCCCC-ceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEcc Q psy7767 88 DVSDLDGSNF-LQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP 164 (200) Q Consensus 88 ~v~~lt~~~~-is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP 164 (200) .+.++..+.. ...+.+||||+++ ||. ++.|-+.|+.+++..+-...|. ...++..|| T Consensus 28 ~~~~i~~~~~~h~~~~~s~Dgr~~-yv~~rdg~vsviD~~~~~~v~~i~~G~-------------------~~~~i~~s~ 87 (369) T PF02239_consen 28 VVARIPTGGAPHAGLKFSPDGRYL-YVANRDGTVSVIDLATGKVVATIKVGG-------------------NPRGIAVSP 87 (369) T ss_dssp EEEEEE-STTEEEEEE-TT-SSEE-EEEETTSEEEEEETTSSSEEEEEE-SS-------------------EEEEEEE-- T ss_pred EEEEEcCCCCceeEEEecCCCCEE-EEEcCCCeEEEEECCcccEEEEEecCC-------------------CcceEEEcC Confidence 3445544433 3567899999984 554 4578888998877653334342 234688999 Q ss_pred CCCEEEEEEEcCC Q psy7767 165 DGSILAYMKFNDA 177 (200) Q Consensus 165 DGk~Laf~r~D~s 177 (200) ||++|+-...... T Consensus 88 DG~~~~v~n~~~~ 100 (369) T PF02239_consen 88 DGKYVYVANYEPG 100 (369) T ss_dssp TTTEEEEEEEETT T ss_pred CCCEEEEEecCCC Confidence 9998865544433 No 51 >KOG4497|consensus Probab=94.79 E-value=0.12 Score=47.70 Aligned_cols=86 Identities=19% Similarity=0.213 Sum_probs=46.3 Q ss_pred eeEECcCCCEEEEEEcCcEEEEECCCCCeEEE------------e-----------eCCCcceEecccccceeee-eeeC Q psy7767 100 LVQWSPVGHDLIFVKDNNLYQAYDDFRSINAL------------T-----------RDGIKGVLFNGVADWVYEE-EVFS 155 (200) Q Consensus 100 ~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rL------------T-----------~dg~~~~~~nG~pdwv~eE-Ei~~ 155 (200) ...+||+|++||-..+..|-+.+.++-+..|+ . .+|.-.+..--.|+|-..- |--. T Consensus 13 ~c~fSp~g~yiAs~~~yrlviRd~~tlq~~qlf~cldki~yieW~ads~~ilC~~yk~~~vqvwsl~Qpew~ckIdeg~a 92 (447) T KOG4497|consen 13 FCSFSPCGNYIASLSRYRLVIRDSETLQLHQLFLCLDKIVYIEWKADSCHILCVAYKDPKVQVWSLVQPEWYCKIDEGQA 92 (447) T ss_pred ceeECCCCCeeeeeeeeEEEEeccchhhHHHHHHHHHHhhheeeeccceeeeeeeeccceEEEEEeecceeEEEeccCCC Confidence 56899999999998877777776554332110 0 1111011112234444311 1112 Q ss_pred CCcceEEccCCCEEEEEEEcCCcceEEEEE Q psy7767 156 STKTLWFSPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~ 185 (200) .-++.+|||||+.|.-...=+..+.++.+. T Consensus 93 gls~~~WSPdgrhiL~tseF~lriTVWSL~ 122 (447) T KOG4497|consen 93 GLSSISWSPDGRHILLTSEFDLRITVWSLN 122 (447) T ss_pred cceeeeECCCcceEeeeecceeEEEEEEec Confidence 346789999999887664433444444443 No 52 >TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome. Probab=94.77 E-value=0.65 Score=42.73 Aligned_cols=59 Identities=14% Similarity=0.060 Sum_probs=40.1 Q ss_pred CCceeeEECcCCCEEEEEEc------------CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEc Q psy7767 96 NFLQLVQWSPVGHDLIFVKD------------NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFS 163 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~------------gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WS 163 (200) .+.+-++++|||+++ ||.. +.|++.|..+++...-...|. ...++.+| T Consensus 248 ~g~q~ia~~~dg~~l-yV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~vG~-------------------~~~~iavS 307 (352) T TIGR02658 248 GGWQQVAYHRARDRI-YLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIELGH-------------------EIDSINVS 307 (352) T ss_pred CcceeEEEcCCCCEE-EEEecCCccccccCCCCEEEEEECCCCeEEEEEeCCC-------------------ceeeEEEC Confidence 344558999999996 5532 579999887776553333332 34478999 Q ss_pred cCCCEEEEEEE Q psy7767 164 PDGSILAYMKF 174 (200) Q Consensus 164 PDGk~Laf~r~ 174 (200) |||+-++|+.. T Consensus 308 ~Dgkp~lyvtn 318 (352) T TIGR02658 308 QDAKPLLYALS 318 (352) T ss_pred CCCCeEEEEeC Confidence 99996667644 No 53 >PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A. Probab=94.69 E-value=0.16 Score=47.32 Aligned_cols=75 Identities=15% Similarity=-0.055 Sum_probs=49.6 Q ss_pred CcceeEEEEEeeeeeeeeeecCCCCccceeecCCCC--CceeeEECcCCCEEEEEEcC-cEEEEECCCCCeEEEeeCCC Q psy7767 61 DIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLDGSN--FLQLVQWSPVGHDLIFVKDN-NLYQAYDDFRSINALTRDGI 136 (200) Q Consensus 61 ~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~~~--~is~~~wSPDGk~IAFV~~g-dLyv~~~~~g~~~rLT~dg~ 136 (200) ..-.|+++..+.++.|+.|.-+-. ..++++||++. ......+||+++.|.|+.++ +|+..++++++.+.|-.-++ T Consensus 45 ~dG~kllF~s~~dg~~nly~lDL~-t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~~~l~~vdL~T~e~~~vy~~p~ 122 (386) T PF14583_consen 45 DDGRKLLFASDFDGNRNLYLLDLA-TGEITQLTDGPGDNTFGGFLSPDDRALYYVKNGRSLRRVDLDTLEERVVYEVPD 122 (386) T ss_dssp TTS-EEEEEE-TTSS-EEEEEETT-T-EEEE---SS-B-TTT-EE-TTSSEEEEEETTTEEEEEETTT--EEEEEE--T T ss_pred CCCCEEEEEeccCCCcceEEEEcc-cCEEEECccCCCCCccceEEecCCCeEEEEECCCeEEEEECCcCcEEEEEECCc Confidence 557889999998888888888775 55778888643 33468999999999999864 89999999998887776553 No 54 >KOG0318|consensus Probab=94.68 E-value=0.15 Score=49.19 Aligned_cols=97 Identities=16% Similarity=0.116 Sum_probs=59.8 Q ss_pred eeeeeeeeecCCCCccceee-cCCCCCceeeEECcCCCEEEEE-EcCcEEEEECCCCCeE--EEeeCCCcceEecccccc Q psy7767 72 TWKQSHSLSGRPDTLTDVSD-LDGSNFLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSIN--ALTRDGIKGVLFNGVADW 147 (200) Q Consensus 72 ~~~~R~s~~a~~~~~~~v~~-lt~~~~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~~--rLT~dg~~~~~~nG~pdw 147 (200) ++++-|.|..+...+..... +.-.+.++..++||||+.+|-. ..+.+-+.++.+.+.. +.+..- T Consensus 463 ~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~rkvv~yd~~s~~~~~~~w~FHt------------ 530 (603) T KOG0318|consen 463 QDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDASRKVVLYDVASREVKTNRWAFHT------------ 530 (603) T ss_pred ccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEeccCCcEEEEEcccCceecceeeeee------------ Confidence 34446666666642322222 2236788999999999999877 4567777887766551 222110 Q ss_pred eeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEEEeeC Q psy7767 148 VYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 148 v~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) .+...++||||.+.||--..| +.+-+|.+..++ T Consensus 531 -------akI~~~aWsP~n~~vATGSlD-t~Viiysv~kP~ 563 (603) T KOG0318|consen 531 -------AKINCVAWSPNNKLVATGSLD-TNVIIYSVKKPA 563 (603) T ss_pred -------eeEEEEEeCCCceEEEecccc-ceEEEEEccChh Confidence 245678999999988754444 345555544443 No 55 >PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal Probab=94.57 E-value=0.15 Score=45.60 Aligned_cols=63 Identities=21% Similarity=0.248 Sum_probs=43.5 Q ss_pred eEECcCCCEEEEEEcCcEEEEECCCCC--e---EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEc Q psy7767 101 VQWSPVGHDLIFVKDNNLYQAYDDFRS--I---NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFN 175 (200) Q Consensus 101 ~~wSPDGk~IAFV~~gdLyv~~~~~g~--~---~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D 175 (200) ++.++||+.||-+.+.-|-+.+..+.- . -++-.|.. | +-+-+.||||+..|||.+.. T Consensus 3 ~~~~~~Gk~lAi~qd~~iEiRsa~Ddf~si~~kcqVpkD~~--------P----------QWRkl~WSpD~tlLa~a~S~ 64 (282) T PF15492_consen 3 LALSSDGKLLAILQDQCIEIRSAKDDFSSIIGKCQVPKDPN--------P----------QWRKLAWSPDCTLLAYAEST 64 (282) T ss_pred eeecCCCcEEEEEeccEEEEEeccCCchheeEEEecCCCCC--------c----------hheEEEECCCCcEEEEEcCC Confidence 578999999999999988888766532 1 24444432 3 34568999999999997543 Q ss_pred CCcceEE Q psy7767 176 DARVHNM 182 (200) Q Consensus 176 ~s~v~~~ 182 (200) +.+..| T Consensus 65 -G~i~vf 70 (282) T PF15492_consen 65 -GTIRVF 70 (282) T ss_pred -CeEEEE Confidence 334444 No 56 >KOG1446|consensus Probab=94.41 E-value=0.31 Score=44.21 Aligned_cols=74 Identities=16% Similarity=0.242 Sum_probs=48.8 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ..+...+.+|||||.|....+ +.+|+.+.-+|+... |..+.+ |+. ...-+..|+|||+.| +.- T Consensus 187 ~~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~-tfs~~~----~~~----------~~~~~a~ftPds~Fv-l~g 250 (311) T KOG1446|consen 187 EAEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKS-TFSGYP----NAG----------NLPLSATFTPDSKFV-LSG 250 (311) T ss_pred ccceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEee-eEeecc----CCC----------CcceeEEECCCCcEE-EEe Confidence 456789999999999998865 468999888887432 211211 000 122467899999866 455 Q ss_pred EcCCcceEEEE Q psy7767 174 FNDARVHNMSY 184 (200) Q Consensus 174 ~D~s~v~~~~i 184 (200) .+++.+..+.+ T Consensus 251 s~dg~i~vw~~ 261 (311) T KOG1446|consen 251 SDDGTIHVWNL 261 (311) T ss_pred cCCCcEEEEEc Confidence 66666666655 No 57 >KOG2139|consensus Probab=94.41 E-value=0.19 Score=46.78 Aligned_cols=69 Identities=23% Similarity=0.261 Sum_probs=48.4 Q ss_pred eeecCCCCCceeeEECcCCCEEEEEEcC--cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCC Q psy7767 89 VSDLDGSNFLQLVQWSPVGHDLIFVKDN--NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDG 166 (200) Q Consensus 89 v~~lt~~~~is~~~wSPDGk~IAFV~~g--dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDG 166 (200) +-+......+..++|.|||..++=.+-+ .+-+.+.+.|.-.+|-..|- |..+-+.||||| T Consensus 189 vl~~pgh~pVtsmqwn~dgt~l~tAS~gsssi~iWdpdtg~~~pL~~~gl------------------gg~slLkwSPdg 250 (445) T KOG2139|consen 189 VLQDPGHNPVTSMQWNEDGTILVTASFGSSSIMIWDPDTGQKIPLIPKGL------------------GGFSLLKWSPDG 250 (445) T ss_pred heeCCCCceeeEEEEcCCCCEEeecccCcceEEEEcCCCCCcccccccCC------------------CceeeEEEcCCC Confidence 3333334567999999999998877533 57777888887776664331 345668999999 Q ss_pred CEEEEEEEc Q psy7767 167 SILAYMKFN 175 (200) Q Consensus 167 k~Laf~r~D 175 (200) .+|.-.+.| T Consensus 251 d~lfaAt~d 259 (445) T KOG2139|consen 251 DVLFAATCD 259 (445) T ss_pred CEEEEeccc Confidence 988545555 No 58 >KOG0973|consensus Probab=94.38 E-value=0.47 Score=48.66 Aligned_cols=75 Identities=21% Similarity=0.235 Sum_probs=54.9 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ..++++..||||+..+|-+. ++-|-+.+..+.+...+..+- + +...++.|-|-|++|| .. T Consensus 129 ~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF~~~~vl~~H---------~---------s~VKGvs~DP~Gky~A-Sq 189 (942) T KOG0973|consen 129 DSDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTFELLKVLRGH---------Q---------SLVKGVSWDPIGKYFA-SQ 189 (942) T ss_pred CCccceeccCCCccEEEEecccceEEEEccccceeeeeeecc---------c---------ccccceEECCccCeee-ee Confidence 67899999999999999885 567888876655333333322 1 3466899999999998 55 Q ss_pred EcCCcceEEEEEeeC Q psy7767 174 FNDARVHNMSYIHYG 188 (200) Q Consensus 174 ~D~s~v~~~~i~~~~ 188 (200) .|++-+.++....++ T Consensus 190 sdDrtikvwrt~dw~ 204 (942) T KOG0973|consen 190 SDDRTLKVWRTSDWG 204 (942) T ss_pred cCCceEEEEEcccce Confidence 666778888866543 No 59 >PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H. Probab=94.34 E-value=0.23 Score=44.36 Aligned_cols=75 Identities=16% Similarity=0.092 Sum_probs=45.5 Q ss_pred CceeeEECcCCCEEEEEEcC--cEEEEEC--CCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 97 FLQLVQWSPVGHDLIFVKDN--NLYQAYD--DFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~~g--dLyv~~~--~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ....+++||||+.|....++ .|-+.++ ++|....+..-. ..| ...+.+.+||||++|+-. T Consensus 246 ~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~-----~~G-----------~~Pr~~~~s~~g~~l~Va 309 (345) T PF10282_consen 246 APAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVP-----TGG-----------KFPRHFAFSPDGRYLYVA 309 (345) T ss_dssp SEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEE-----ESS-----------SSEEEEEE-TTSSEEEEE T ss_pred CceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEe-----CCC-----------CCccEEEEeCCCCEEEEE Confidence 46889999999986444333 4555555 445554443211 111 135688999999988765 Q ss_pred EEcCCcceEEEEEee Q psy7767 173 KFNDARVHNMSYIHY 187 (200) Q Consensus 173 r~D~s~v~~~~i~~~ 187 (200) -.+...+..|.+... T Consensus 310 ~~~s~~v~vf~~d~~ 324 (345) T PF10282_consen 310 NQDSNTVSVFDIDPD 324 (345) T ss_dssp ETTTTEEEEEEEETT T ss_pred ecCCCeEEEEEEeCC Confidence 566677888877533 No 60 >PRK11028 6-phosphogluconolactonase; Provisional Probab=93.89 E-value=1.5 Score=38.21 Aligned_cols=80 Identities=13% Similarity=0.012 Sum_probs=44.4 Q ss_pred ceeecCCCCCceeeEECcCCCEEEEEE--cCcEEEEECC-CCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEcc Q psy7767 88 DVSDLDGSNFLQLVQWSPVGHDLIFVK--DNNLYQAYDD-FRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP 164 (200) Q Consensus 88 ~v~~lt~~~~is~~~wSPDGk~IAFV~--~gdLyv~~~~-~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP 164 (200) .+..+..++....+++||||+.|+-.. .+.|.+.+++ +|....+.... . .+....+.++| T Consensus 27 ~~~~~~~~~~~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~-----~------------~~~p~~i~~~~ 89 (330) T PRK11028 27 LLQVVDVPGQVQPMVISPDKRHLYVGVRPEFRVLSYRIADDGALTFAAESP-----L------------PGSPTHISTDH 89 (330) T ss_pred eeeEEecCCCCccEEECCCCCEEEEEECCCCcEEEEEECCCCceEEeeeec-----C------------CCCceEEEECC Confidence 334443345677889999999874443 3466655554 23322222110 0 12345688999 Q ss_pred CCCEEEEEEEcCCcceEEEE Q psy7767 165 DGSILAYMKFNDARVHNMSY 184 (200) Q Consensus 165 DGk~Laf~r~D~s~v~~~~i 184 (200) ||++|+-.....+.+..+.+ T Consensus 90 ~g~~l~v~~~~~~~v~v~~~ 109 (330) T PRK11028 90 QGRFLFSASYNANCVSVSPL 109 (330) T ss_pred CCCEEEEEEcCCCeEEEEEE Confidence 99977544444444555544 No 61 >KOG0271|consensus Probab=93.60 E-value=0.64 Score=43.71 Aligned_cols=39 Identities=15% Similarity=0.242 Sum_probs=26.2 Q ss_pred ceeecCC-CCCceeeEECcCCCEEEEEE-cCcEEEEECCCC Q psy7767 88 DVSDLDG-SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFR 126 (200) Q Consensus 88 ~v~~lt~-~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g 126 (200) .+++++. .+.+....+||||++||-.+ +..|-+.+..+| T Consensus 359 pi~rmtgHq~lVn~V~fSPd~r~IASaSFDkSVkLW~g~tG 399 (480) T KOG0271|consen 359 PITRMTGHQALVNHVSFSPDGRYIASASFDKSVKLWDGRTG 399 (480) T ss_pred chhhhhchhhheeeEEECCCccEEEEeecccceeeeeCCCc Confidence 4556664 45668999999999999885 344444444443 No 62 >KOG0271|consensus Probab=93.37 E-value=0.25 Score=46.34 Aligned_cols=72 Identities=24% Similarity=0.308 Sum_probs=44.8 Q ss_pred ccceeecCC-----CCCceeeEECcCCCEEEEEEcC-cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcc Q psy7767 86 LTDVSDLDG-----SNFLQLVQWSPVGHDLIFVKDN-NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKT 159 (200) Q Consensus 86 ~~~v~~lt~-----~~~is~~~wSPDGk~IAFV~~g-dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~ 159 (200) .+.+++-+. +..+...+|||||+.||--.+. -+-+.|+++.+ +..|=.| .-.|+ .. T Consensus 101 vrpvtrCssS~~GH~e~Vl~~~fsp~g~~l~tGsGD~TvR~WD~~TeT-p~~t~Kg--------H~~WV---------lc 162 (480) T KOG0271|consen 101 VRPVTRCSSSIAGHGEAVLSVQFSPTGSRLVTGSGDTTVRLWDLDTET-PLFTCKG--------HKNWV---------LC 162 (480) T ss_pred ccccceeccccCCCCCcEEEEEecCCCceEEecCCCceEEeeccCCCC-cceeecC--------CccEE---------EE Confidence 455665442 4567899999999999876543 35555665432 2233222 22343 25 Q ss_pred eEEccCCCEEEEEEEc Q psy7767 160 LWFSPDGSILAYMKFN 175 (200) Q Consensus 160 ~~WSPDGk~Laf~r~D 175 (200) +.|||||++||=-..| T Consensus 163 vawsPDgk~iASG~~d 178 (480) T KOG0271|consen 163 VAWSPDGKKIASGSKD 178 (480) T ss_pred EEECCCcchhhccccC Confidence 7899999999865444 No 63 >PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A .... Probab=93.29 E-value=0.42 Score=40.10 Aligned_cols=61 Identities=21% Similarity=0.303 Sum_probs=36.0 Q ss_pred CceeeEECcCCCEEEEEEc----------CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCC Q psy7767 97 FLQLVQWSPVGHDLIFVKD----------NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDG 166 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~~----------gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDG 166 (200) ...+.++.|||+ |.|... +.||..+.+ ++...+..+ +....++.||||| T Consensus 87 ~~ND~~vd~~G~-ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~-------------------~~~pNGi~~s~dg 145 (246) T PF08450_consen 87 RPNDVAVDPDGN-LYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADG-------------------LGFPNGIAFSPDG 145 (246) T ss_dssp EEEEEEE-TTS--EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEE-------------------ESSEEEEEEETTS T ss_pred CCceEEEcCCCC-EEEEecCCCccccccccceEEECCC-CeEEEEecC-------------------cccccceEECCcc Confidence 346788999988 444432 347777665 444433321 2356788999999 Q ss_pred CEEEEEEEcCCc Q psy7767 167 SILAYMKFNDAR 178 (200) Q Consensus 167 k~Laf~r~D~s~ 178 (200) +.|.+....... T Consensus 146 ~~lyv~ds~~~~ 157 (246) T PF08450_consen 146 KTLYVADSFNGR 157 (246) T ss_dssp SEEEEEETTTTE T ss_pred hheeecccccce Confidence 988665444333 No 64 >KOG1524|consensus Probab=93.26 E-value=0.35 Score=47.22 Aligned_cols=31 Identities=10% Similarity=0.329 Sum_probs=26.8 Q ss_pred CCCceeeEECcCCCEEEEEEcCcEEEEECCC Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNNLYQAYDDF 125 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~ 125 (200) +..+..++|.||+..|+|..++.+++.++.- T Consensus 145 ~~~v~c~~W~p~S~~vl~c~g~h~~IKpL~~ 175 (737) T KOG1524|consen 145 EESIRCARWAPNSNSIVFCQGGHISIKPLAA 175 (737) T ss_pred CceeEEEEECCCCCceEEecCCeEEEeeccc Confidence 3457899999999999999999999988753 No 65 >KOG1274|consensus Probab=92.85 E-value=1.5 Score=44.76 Aligned_cols=70 Identities=19% Similarity=0.241 Sum_probs=43.6 Q ss_pred CceeeEECcCCCEEEEE-EcCcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 97 FLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) -..-++|+|+|.++|+. .++.|-+.+.++.... .|..+-.. ....-+.|||.|++||-.-. T Consensus 190 i~~~~aW~Pk~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~~s-----------------s~~~~~~wsPnG~YiAAs~~ 252 (933) T KOG1274|consen 190 ICTRLAWHPKGGTLAVPPVDNTVKVYSRKGWELQFKLRDKLSS-----------------SKFSDLQWSPNGKYIAASTL 252 (933) T ss_pred eeeeeeecCCCCeEEeeccCCeEEEEccCCceeheeecccccc-----------------cceEEEEEcCCCcEEeeecc Confidence 34789999996666666 5778888777665543 22222110 12456789999999986543 Q ss_pred cCCcceEEEE Q psy7767 175 NDARVHNMSY 184 (200) Q Consensus 175 D~s~v~~~~i 184 (200) | +.+-++.+ T Consensus 253 ~-g~I~vWnv 261 (933) T KOG1274|consen 253 D-GQILVWNV 261 (933) T ss_pred C-CcEEEEec Confidence 3 44544443 No 66 >COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism] Probab=92.57 E-value=1.1 Score=41.25 Aligned_cols=78 Identities=13% Similarity=0.182 Sum_probs=51.3 Q ss_pred CCCceeeEECcCCCEEEEEEcC-----cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDN-----NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSIL 169 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~g-----dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~L 169 (200) ......+.+||||+.| |++++ -+|..+-++|....+..... |. ...+.+.++|+|+.| T Consensus 243 ~~~~aaIhis~dGrFL-YasNRg~dsI~~f~V~~~~g~L~~~~~~~t--------------eg--~~PR~F~i~~~g~~L 305 (346) T COG2706 243 TNWAAAIHISPDGRFL-YASNRGHDSIAVFSVDPDGGKLELVGITPT--------------EG--QFPRDFNINPSGRFL 305 (346) T ss_pred CCceeEEEECCCCCEE-EEecCCCCeEEEEEEcCCCCEEEEEEEecc--------------CC--cCCccceeCCCCCEE Confidence 4556889999999986 66543 24545555555443332211 11 136788999999998 Q ss_pred EEEEEcCCcceEEEEEeeCC Q psy7767 170 AYMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 170 af~r~D~s~v~~~~i~~~~~ 189 (200) +-+..+...+..|.+.-..+ T Consensus 306 iaa~q~sd~i~vf~~d~~TG 325 (346) T COG2706 306 IAANQKSDNITVFERDKETG 325 (346) T ss_pred EEEccCCCcEEEEEEcCCCc Confidence 77766667788888776665 No 67 >PRK11028 6-phosphogluconolactonase; Provisional Probab=92.55 E-value=4.4 Score=35.26 Aligned_cols=76 Identities=9% Similarity=0.032 Sum_probs=41.0 Q ss_pred CceeeEECcCCCEEEEEE--cCcEEEEECCC-CCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 97 FLQLVQWSPVGHDLIFVK--DNNLYQAYDDF-RSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~--~gdLyv~~~~~-g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) .....+++|||+.+..+. .+.|.+.+++. +.......... . +.-| .....+.|+|||++|+-+. T Consensus 127 ~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~-~-~~~g-----------~~p~~~~~~pdg~~lyv~~ 193 (330) T PRK11028 127 GCHSANIDPDNRTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEV-T-TVEG-----------AGPRHMVFHPNQQYAYCVN 193 (330) T ss_pred cccEeEeCCCCCEEEEeeCCCCEEEEEEECCCCcccccCCCce-e-cCCC-----------CCCceEEECCCCCEEEEEe Confidence 345668999999875443 35688888764 22110000000 0 0001 1345689999999775443 Q ss_pred EcCCcceEEEEE Q psy7767 174 FNDARVHNMSYI 185 (200) Q Consensus 174 ~D~s~v~~~~i~ 185 (200) .....+..+.+. T Consensus 194 ~~~~~v~v~~~~ 205 (330) T PRK11028 194 ELNSSVDVWQLK 205 (330) T ss_pred cCCCEEEEEEEe Confidence 334555555554 No 68 >KOG0293|consensus Probab=92.39 E-value=0.66 Score=44.01 Aligned_cols=60 Identities=25% Similarity=0.280 Sum_probs=42.3 Q ss_pred CCCceeeEECcCCCEEEE-EEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIF-VKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAF-V~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) ...+..+.||||.++++= -.+..+.+.++++|+....-..+- | .+..+-.|-|||.+++- T Consensus 269 ~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~tgd~~~~y~~~~------~-----------~S~~sc~W~pDg~~~V~ 329 (519) T KOG0293|consen 269 SQPVSYIMWSPDDRYLLACGFDEVLSLWDVDTGDLRHLYPSGL------G-----------FSVSSCAWCPDGFRFVT 329 (519) T ss_pred cCceEEEEECCCCCeEEecCchHheeeccCCcchhhhhcccCc------C-----------CCcceeEEccCCceeEe Confidence 567899999999887643 234568888998888765554431 0 15567789999988653 No 69 >KOG0266|consensus Probab=91.92 E-value=2.4 Score=39.60 Aligned_cols=70 Identities=21% Similarity=0.213 Sum_probs=50.0 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ...+....|+|+|+.|+... ++.|.+.++.+++..+...... +...+++|+|||+.|+-. T Consensus 246 ~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs------------------~~is~~~f~~d~~~l~s~- 306 (456) T KOG0266|consen 246 STYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHS------------------DGISGLAFSPDGNLLVSA- 306 (456) T ss_pred CCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccC------------------CceEEEEECCCCCEEEEc- Confidence 56789999999998887775 5679999999877665554322 134578899999988755 Q ss_pred EcCCcceEEE Q psy7767 174 FNDARVHNMS 183 (200) Q Consensus 174 ~D~s~v~~~~ 183 (200) ..++.+.++. T Consensus 307 s~d~~i~vwd 316 (456) T KOG0266|consen 307 SYDGTIRVWD 316 (456) T ss_pred CCCccEEEEE Confidence 3355555554 No 70 >KOG0291|consensus Probab=91.82 E-value=1.9 Score=43.58 Aligned_cols=71 Identities=18% Similarity=0.111 Sum_probs=43.7 Q ss_pred CCCCceeeEECcCCCEEEEEEc-Cc-EEEEECCCC-CeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKD-NN-LYQAYDDFR-SINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~-gd-Lyv~~~~~g-~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) ....+..++.||||..+.-|-. |. +++ ++... ...+.+... ...++.|||||+.+| T Consensus 54 ~~~NI~~ialSp~g~lllavdE~g~~~lv-s~~~r~Vlh~f~fk~--------------------~v~~i~fSPng~~fa 112 (893) T KOG0291|consen 54 TRYNITRIALSPDGTLLLAVDERGRALLV-SLLSRSVLHRFNFKR--------------------GVGAIKFSPNGKFFA 112 (893) T ss_pred cCCceEEEEeCCCceEEEEEcCCCcEEEE-ecccceeeEEEeecC--------------------ccceEEECCCCcEEE Confidence 3567899999999998877742 33 444 33321 112233221 345789999999888 Q ss_pred EEEEcCCcceEEEEEee Q psy7767 171 YMKFNDARVHNMSYIHY 187 (200) Q Consensus 171 f~r~D~s~v~~~~i~~~ 187 (200) ..+.+ .++++..|.. T Consensus 113 v~~gn--~lqiw~~P~~ 127 (893) T KOG0291|consen 113 VGCGN--LLQIWHAPGE 127 (893) T ss_pred EEecc--eeEEEecCcc Confidence 76544 4566665443 No 71 >COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism] Probab=91.68 E-value=0.41 Score=46.66 Aligned_cols=61 Identities=30% Similarity=0.445 Sum_probs=42.5 Q ss_pred eeecCCCCCceeeEECcCCCEEEEEE-c----CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEc Q psy7767 89 VSDLDGSNFLQLVQWSPVGHDLIFVK-D----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFS 163 (200) Q Consensus 89 v~~lt~~~~is~~~wSPDGk~IAFV~-~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WS 163 (200) +...+.+.....+.|||||+.++|+. . .++|+.+.. | .++... .......|| T Consensus 53 ~~~~~~~~~~~~~~~spdg~~~~~~~~~~~~~~~l~l~~~~-g---~~~~~~-------------------~~v~~~~~~ 109 (620) T COG1506 53 VRLLTFGGGVSELRWSPDGSVLAFVSTDGGRVAQLYLVDVG-G---LITKTA-------------------FGVSDARWS 109 (620) T ss_pred ccccccCCcccccccCCCCCEEEEEeccCCCcceEEEEecC-C---ceeeee-------------------cccccceeC Confidence 34455678889999999999999998 2 368888766 4 222211 123456788 Q ss_pred cCCCEEEEE Q psy7767 164 PDGSILAYM 172 (200) Q Consensus 164 PDGk~Laf~ 172 (200) |+|+.+++. T Consensus 110 ~~g~~~~~~ 118 (620) T COG1506 110 PDGDRIAFL 118 (620) T ss_pred CCCCeEEEE Confidence 888888883 No 72 >KOG1407|consensus Probab=91.36 E-value=1.7 Score=39.06 Aligned_cols=62 Identities=19% Similarity=0.235 Sum_probs=40.6 Q ss_pred CCCceeeEECcCCCEEEEEE-c-CcEEEEECCCCCeEEEe-eCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-D-NNLYQAYDDFRSINALT-RDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~-gdLyv~~~~~g~~~rLT-~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) +..+-.+.|+|...-+.++. + ..|-+.++..++..+-+ +.++ .....|||||+++++ T Consensus 64 ~~svdql~w~~~~~d~~atas~dk~ir~wd~r~~k~~~~i~~~~e--------------------ni~i~wsp~g~~~~~ 123 (313) T KOG1407|consen 64 TDSVDQLCWDPKHPDLFATASGDKTIRIWDIRSGKCTARIETKGE--------------------NINITWSPDGEYIAV 123 (313) T ss_pred CcchhhheeCCCCCcceEEecCCceEEEEEeccCcEEEEeeccCc--------------------ceEEEEcCCCCEEEE Confidence 34567889999755555554 3 35777787777665444 4332 235789999999998 Q ss_pred EEEcC Q psy7767 172 MKFND 176 (200) Q Consensus 172 ~r~D~ 176 (200) .-.|+ T Consensus 124 ~~kdD 128 (313) T KOG1407|consen 124 GNKDD 128 (313) T ss_pred ecCcc Confidence 75443 No 73 >PTZ00420 coronin; Provisional Probab=91.18 E-value=1.7 Score=42.53 Aligned_cols=68 Identities=10% Similarity=0.123 Sum_probs=44.1 Q ss_pred CCCceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ...+..++|+|+|..++.+. ++.|-+.++.++........+. ...++.|+|||+.|+-. T Consensus 125 ~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i~~~~-------------------~V~SlswspdG~lLat~ 185 (568) T PTZ00420 125 KKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINMPK-------------------KLSSLKWNIKGNLLSGT 185 (568) T ss_pred CCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCcEEEEEecCC-------------------cEEEEEECCCCCEEEEE Confidence 45688999999998876543 5779899998776432111111 23478999999988644 Q ss_pred EEcCCcceEE Q psy7767 173 KFNDARVHNM 182 (200) Q Consensus 173 r~D~s~v~~~ 182 (200) . .++.+.++ T Consensus 186 s-~D~~IrIw 194 (568) T PTZ00420 186 C-VGKHMHII 194 (568) T ss_pred e-cCCEEEEE Confidence 3 33334444 No 74 >PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. Probab=90.49 E-value=2.2 Score=38.71 Aligned_cols=62 Identities=24% Similarity=0.174 Sum_probs=39.8 Q ss_pred ceeeEECc-CCCEEEEEEc--CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 98 LQLVQWSP-VGHDLIFVKD--NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 98 is~~~wSP-DGk~IAFV~~--gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) .-.+..+| ++.-+||.++ .-+++.+..+|+..+....+. +.-+||- -.|||||++| |+.. T Consensus 7 gH~~a~~p~~~~avafaRRPG~~~~v~D~~~g~~~~~~~a~~-gRHFyGH---------------g~fs~dG~~L-ytTE 69 (305) T PF07433_consen 7 GHGVAAHPTRPEAVAFARRPGTFALVFDCRTGQLLQRLWAPP-GRHFYGH---------------GVFSPDGRLL-YTTE 69 (305) T ss_pred ccceeeCCCCCeEEEEEeCCCcEEEEEEcCCCceeeEEcCCC-CCEEecC---------------EEEcCCCCEE-EEec Confidence 34578899 5677777764 367778888888764443332 2225554 3699999877 5544 Q ss_pred cC Q psy7767 175 ND 176 (200) Q Consensus 175 D~ 176 (200) ++ T Consensus 70 nd 71 (305) T PF07433_consen 70 ND 71 (305) T ss_pred cc Confidence 33 No 75 >KOG0645|consensus Probab=90.25 E-value=3.4 Score=37.32 Aligned_cols=64 Identities=20% Similarity=0.257 Sum_probs=37.8 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEEECCCCCeEEEee-CCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSINALTR-DGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~~rLT~-dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ...+...+|||.|+.||-.+- .-+-+..-.+++-.-++. .|- |. ....++||++|.+||-. T Consensus 61 krsVRsvAwsp~g~~La~aSFD~t~~Iw~k~~~efecv~~lEGH-------------En----EVK~Vaws~sG~~LATC 123 (312) T KOG0645|consen 61 KRSVRSVAWSPHGRYLASASFDATVVIWKKEDGEFECVATLEGH-------------EN----EVKCVAWSASGNYLATC 123 (312) T ss_pred hheeeeeeecCCCcEEEEeeccceEEEeecCCCceeEEeeeecc-------------cc----ceeEEEEcCCCCEEEEe Confidence 456899999999999887752 222222222333332222 111 11 23578999999999866 Q ss_pred EEc Q psy7767 173 KFN 175 (200) Q Consensus 173 r~D 175 (200) ..| T Consensus 124 SRD 126 (312) T KOG0645|consen 124 SRD 126 (312) T ss_pred eCC Confidence 444 No 76 >KOG4497|consensus Probab=90.25 E-value=1.3 Score=41.09 Aligned_cols=60 Identities=22% Similarity=0.341 Sum_probs=39.4 Q ss_pred CCCCceeeEECcCCCEEEEEEcCc--EEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKDNN--LYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~gd--Lyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) +..+++...|||||+.|.-...-+ |=+.++.+....-|-. +. ....+++|.|||++-|. T Consensus 90 g~agls~~~WSPdgrhiL~tseF~lriTVWSL~t~~~~~~~~-pK------------------~~~kg~~f~~dg~f~ai 150 (447) T KOG4497|consen 90 GQAGLSSISWSPDGRHILLTSEFDLRITVWSLNTQKGYLLPH-PK------------------TNVKGYAFHPDGQFCAI 150 (447) T ss_pred CCCcceeeeECCCcceEeeeecceeEEEEEEeccceeEEecc-cc------------------cCceeEEECCCCceeee Confidence 356889999999999998887654 3334554433332222 11 12356899999998776 Q ss_pred E Q psy7767 172 M 172 (200) Q Consensus 172 ~ 172 (200) . T Consensus 151 ~ 151 (447) T KOG4497|consen 151 L 151 (447) T ss_pred e Confidence 6 No 77 >PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H. Probab=89.87 E-value=3 Score=37.23 Aligned_cols=73 Identities=22% Similarity=0.215 Sum_probs=37.6 Q ss_pred ceeeEECcCCCEEEEEEc--CcEEEEECCCCC--eEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 98 LQLVQWSPVGHDLIFVKD--NNLYQAYDDFRS--INALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~--gdLyv~~~~~g~--~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) .-...++|||+.+.-+-. ..|++.+++... ......-. +..| ...+.+.|+|||+++..+. T Consensus 146 ~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~----~~~G-----------~GPRh~~f~pdg~~~Yv~~ 210 (345) T PF10282_consen 146 PHQVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIK----VPPG-----------SGPRHLAFSPDGKYAYVVN 210 (345) T ss_dssp EEEEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEE----CSTT-----------SSEEEEEE-TTSSEEEEEE T ss_pred ceeEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccc----cccC-----------CCCcEEEEcCCcCEEEEec Confidence 457789999999844332 357777665443 22111000 0000 2456789999998764333 Q ss_pred EcCCcceEEEEE Q psy7767 174 FNDARVHNMSYI 185 (200) Q Consensus 174 ~D~s~v~~~~i~ 185 (200) ...+.+..+.+. T Consensus 211 e~s~~v~v~~~~ 222 (345) T PF10282_consen 211 ELSNTVSVFDYD 222 (345) T ss_dssp TTTTEEEEEEEE T ss_pred CCCCcEEEEeec Confidence 333455555555 No 78 >KOG0315|consensus Probab=89.86 E-value=3.1 Score=37.32 Aligned_cols=99 Identities=14% Similarity=0.091 Sum_probs=59.0 Q ss_pred EEEEeeeeeeeeeecCCC-CccceeecC---C-CCCceeeEECcCCCEEEEEE-cCcEEEEECCCC-CeEEEeeCCCcce Q psy7767 67 SIRCLTWKQSHSLSGRPD-TLTDVSDLD---G-SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFR-SINALTRDGIKGV 139 (200) Q Consensus 67 ~i~~n~~~~R~s~~a~~~-~~~~v~~lt---~-~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g-~~~rLT~dg~~~~ 139 (200) +...|..|+=+.|..-.- ..+.+++++ . ...+..-.+|||+|.||=.. +..+++.+.++- .....+. T Consensus 182 l~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~~kle~~l~------ 255 (311) T KOG0315|consen 182 LAAANNKGNCYVWRLLNHQTASELEPVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDFFKLELVLT------ 255 (311) T ss_pred EEEecCCccEEEEEccCCCccccceEhhheecccceEEEEEECCCCcEEEeecCCceEEEEecCCceeeEEEee------ Confidence 344555666555544331 133444443 2 56678889999999998774 557888887764 2222222 Q ss_pred EecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEE Q psy7767 140 LFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSY 184 (200) Q Consensus 140 ~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i 184 (200) |.+.|+. .-.||.||++|+-...| ..++.+.+ T Consensus 256 ---gh~rWvW---------dc~FS~dg~YlvTassd-~~~rlW~~ 287 (311) T KOG0315|consen 256 ---GHQRWVW---------DCAFSADGEYLVTASSD-HTARLWDL 287 (311) T ss_pred ---cCCceEE---------eeeeccCccEEEecCCC-Cceeeccc Confidence 3334543 45699999999654444 44555543 No 79 >KOG0288|consensus Probab=89.83 E-value=0.98 Score=42.66 Aligned_cols=59 Identities=20% Similarity=0.185 Sum_probs=43.7 Q ss_pred CCCceeeEECcCCCEEEEE-EcCcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) +.+.+-..|||||.++|=- .+|.||++++.+|+.. ++...+.. .....+.|+|-|+.|+ T Consensus 387 asDwtrvvfSpd~~YvaAGS~dgsv~iW~v~tgKlE~~l~~s~s~-----------------~aI~s~~W~~sG~~Ll 447 (459) T KOG0288|consen 387 ASDWTRVVFSPDGSYVAAGSADGSVYIWSVFTGKLEKVLSLSTSN-----------------AAITSLSWNPSGSGLL 447 (459) T ss_pred ccccceeEECCCCceeeeccCCCcEEEEEccCceEEEEeccCCCC-----------------cceEEEEEcCCCchhh Confidence 4567788999999998766 4789999999988764 45443321 0234689999999885 No 80 >KOG0306|consensus Probab=89.77 E-value=5.9 Score=40.18 Aligned_cols=135 Identities=19% Similarity=0.189 Sum_probs=68.2 Q ss_pred ccccccccc-ceEEEEEEeceeeeceeeeeeeeeeeeeeeeeeecceEEEEeccCcceeEEEEEeeeeeeeeeecCCCCc Q psy7767 8 ESHQFSHVD-HKIIHIIECGYIRSTDIALKLSIRCLTWKQSEYWDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTL 86 (200) Q Consensus 8 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~ 86 (200) ||-.+=-+| ++-|.-|+||||-. - -|=.|..|++-.+. .-++.+.--+- . T Consensus 394 ~SikiWn~~t~kciRTi~~~y~l~--------~--------~Fvpgd~~Iv~G~k-~Gel~vfdlaS------------~ 444 (888) T KOG0306|consen 394 ESIKIWNRDTLKCIRTITCGYILA--------S--------KFVPGDRYIVLGTK-NGELQVFDLAS------------A 444 (888) T ss_pred CcEEEEEccCcceeEEeccccEEE--------E--------EecCCCceEEEecc-CCceEEEEeeh------------h Confidence 444444455 88899999998632 1 22233334433222 22233322211 2 Q ss_pred cceeecC-CCCCceeeEECcCCCEEEEEEcC-------cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCc Q psy7767 87 TDVSDLD-GSNFLQLVQWSPVGHDLIFVKDN-------NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTK 158 (200) Q Consensus 87 ~~v~~lt-~~~~is~~~wSPDGk~IAFV~~g-------dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~ 158 (200) .-++... -.+.+-.++.+|||+..+-.... ..++.+..+.+.+.|+=...+.+ |+-.... T Consensus 445 ~l~Eti~AHdgaIWsi~~~pD~~g~vT~saDktVkfWdf~l~~~~~gt~~k~lsl~~~rtL------------el~ddvL 512 (888) T KOG0306|consen 445 SLVETIRAHDGAIWSISLSPDNKGFVTGSADKTVKFWDFKLVVSVPGTQKKVLSLKHTRTL------------ELEDDVL 512 (888) T ss_pred hhhhhhhccccceeeeeecCCCCceEEecCCcEEEEEeEEEEeccCcccceeeeeccceEE------------eccccEE Confidence 2333332 24667889999999986655432 12333322223233332111111 1001334 Q ss_pred ceEEccCCCEEEEEEEcCCcceEEEE Q psy7767 159 TLWFSPDGSILAYMKFNDARVHNMSY 184 (200) Q Consensus 159 ~~~WSPDGk~Laf~r~D~s~v~~~~i 184 (200) .+..||||+.||..--|. .|.+|.+ T Consensus 513 ~v~~Spdgk~LaVsLLdn-TVkVyfl 537 (888) T KOG0306|consen 513 CVSVSPDGKLLAVSLLDN-TVKVYFL 537 (888) T ss_pred EEEEcCCCcEEEEEeccC-eEEEEEe Confidence 789999999998765554 3665543 No 81 >KOG2315|consensus Probab=89.73 E-value=4.6 Score=39.40 Aligned_cols=58 Identities=12% Similarity=0.115 Sum_probs=38.7 Q ss_pred CCCceeeEECcCCCEEEEEE----cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK----DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~----~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) .+.--.+-|||.|+.|++.- .|++=+.|+.+ -+.|+.-... ...-+.|||||++++ T Consensus 311 egpRN~~~fnp~g~ii~lAGFGNL~G~mEvwDv~n--~K~i~~~~a~------------------~tt~~eW~PdGe~fl 370 (566) T KOG2315|consen 311 EGPRNTAFFNPHGNIILLAGFGNLPGDMEVWDVPN--RKLIAKFKAA------------------NTTVFEWSPDGEYFL 370 (566) T ss_pred CCCccceEECCCCCEEEEeecCCCCCceEEEeccc--hhhccccccC------------------CceEEEEcCCCcEEE Confidence 34446778999999999884 45777777765 4445542211 223467999999986 Q ss_pred EE Q psy7767 171 YM 172 (200) Q Consensus 171 f~ 172 (200) -. T Consensus 371 TA 372 (566) T KOG2315|consen 371 TA 372 (566) T ss_pred EE Confidence 44 No 82 >KOG0645|consensus Probab=89.70 E-value=14 Score=33.59 Aligned_cols=88 Identities=20% Similarity=0.202 Sum_probs=52.2 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEEECCCCCe--------------EEEeeCCCcceEecccc-------------c Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSI--------------NALTRDGIKGVLFNGVA-------------D 146 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~--------------~rLT~dg~~~~~~nG~p-------------d 146 (200) ..++...+||++|..||--.+ ..||+..++.+.. +++.-.+.+.+++-+.- | T Consensus 105 EnEVK~Vaws~sG~~LATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~SYDnTIk~~~~~~ddd 184 (312) T KOG0645|consen 105 ENEVKCVAWSASGNYLATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSCSYDNTIKVYRDEDDDD 184 (312) T ss_pred ccceeEEEEcCCCCEEEEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCCcceeEEeccCCeEEEEeecCCCC Confidence 577899999999999998854 5788887774432 33333332222221111 2 Q ss_pred ceeeeeeeCC---CcceEEccCCCEEEEEEEcCCcceEEE Q psy7767 147 WVYEEEVFSS---TKTLWFSPDGSILAYMKFNDARVHNMS 183 (200) Q Consensus 147 wv~eEEi~~~---~~~~~WSPDGk~Laf~r~D~s~v~~~~ 183 (200) |.--..+-++ ...+.|.|+|.+|+ ...|++.+.++. T Consensus 185 W~c~~tl~g~~~TVW~~~F~~~G~rl~-s~sdD~tv~Iw~ 223 (312) T KOG0645|consen 185 WECVQTLDGHENTVWSLAFDNIGSRLV-SCSDDGTVSIWR 223 (312) T ss_pred eeEEEEecCccceEEEEEecCCCceEE-EecCCcceEeee Confidence 2221111123 34556999999887 456666676665 No 83 >KOG0278|consensus Probab=89.51 E-value=8.7 Score=34.67 Aligned_cols=75 Identities=17% Similarity=0.228 Sum_probs=44.7 Q ss_pred ceeecCCCCCceeeEECcCCCEEEEEEcCc---EEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEcc Q psy7767 88 DVSDLDGSNFLQLVQWSPVGHDLIFVKDNN---LYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP 164 (200) Q Consensus 88 ~v~~lt~~~~is~~~wSPDGk~IAFV~~gd---Lyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP 164 (200) .+..+.....+..+.++|+-. .||.+|+ +|..+-.+|...-.-+.|. ++-.-.++||| T Consensus 217 ~lKs~k~P~nV~SASL~P~k~--~fVaGged~~~~kfDy~TgeEi~~~nkgh-----------------~gpVhcVrFSP 277 (334) T KOG0278|consen 217 LLKSYKMPCNVESASLHPKKE--FFVAGGEDFKVYKFDYNTGEEIGSYNKGH-----------------FGPVHCVRFSP 277 (334) T ss_pred ceeeccCccccccccccCCCc--eEEecCcceEEEEEeccCCceeeecccCC-----------------CCceEEEEECC Confidence 344443456778889999873 4888764 5555666666654422232 34455789999 Q ss_pred CCCEEEEEEE-cCCcceEEE Q psy7767 165 DGSILAYMKF-NDARVHNMS 183 (200) Q Consensus 165 DGk~Laf~r~-D~s~v~~~~ 183 (200) ||. .|... .++.+.+++ T Consensus 278 dGE--~yAsGSEDGTirlWQ 295 (334) T KOG0278|consen 278 DGE--LYASGSEDGTIRLWQ 295 (334) T ss_pred CCc--eeeccCCCceEEEEE Confidence 996 45543 334444444 No 84 >PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B. Probab=89.31 E-value=1.9 Score=32.14 Aligned_cols=66 Identities=12% Similarity=0.085 Sum_probs=41.4 Q ss_pred eeEECcCCCEEEEEE-------------------cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcce Q psy7767 100 LVQWSPVGHDLIFVK-------------------DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTL 160 (200) Q Consensus 100 ~~~wSPDGk~IAFV~-------------------~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~ 160 (200) ++.+++|+..|.|+- .|.|+..+..++++..|-.+ +.-.+++ T Consensus 2 dldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~~-------------------L~fpNGV 62 (89) T PF03088_consen 2 DLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLDG-------------------LYFPNGV 62 (89) T ss_dssp EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEEE-------------------ESSEEEE T ss_pred ceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehhC-------------------CCccCeE Confidence 456777877777774 24688788777776655542 1234567 Q ss_pred EEccCCCEEEEEEEcCCcceEEEE Q psy7767 161 WFSPDGSILAYMKFNDARVHNMSY 184 (200) Q Consensus 161 ~WSPDGk~Laf~r~D~s~v~~~~i 184 (200) ..|||++.|++...-...+..+++ T Consensus 63 als~d~~~vlv~Et~~~Ri~rywl 86 (89) T PF03088_consen 63 ALSPDESFVLVAETGRYRILRYWL 86 (89) T ss_dssp EE-TTSSEEEEEEGGGTEEEEEES T ss_pred EEcCCCCEEEEEeccCceEEEEEE Confidence 899999999988766665655553 No 85 >COG1770 PtrB Protease II [Amino acid transport and metabolism] Probab=89.29 E-value=1.2 Score=44.29 Aligned_cols=61 Identities=18% Similarity=0.243 Sum_probs=42.1 Q ss_pred ceeeEECcCCCEEEEEEc---C---cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 98 LQLVQWSPVGHDLIFVKD---N---NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~---g---dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) +..++.|||++.+||..+ + .|-+.++.+|...- |++-+...++.|.+||+.|.| T Consensus 131 Lg~~~~s~D~~~la~s~D~~G~e~y~lr~kdL~tg~~~~--------------------d~i~~~~~~~~Wa~d~~~lfY 190 (682) T COG1770 131 LGAASISPDHNLLAYSVDVLGDEQYTLRFKDLATGEELP--------------------DEITNTSGSFAWAADGKTLFY 190 (682) T ss_pred eeeeeeCCCCceEEEEEecccccEEEEEEEecccccccc--------------------hhhcccccceEEecCCCeEEE Confidence 467899999999999964 2 35555665554321 111112456799999999999 Q ss_pred EEEcCCc Q psy7767 172 MKFNDAR 178 (200) Q Consensus 172 ~r~D~s~ 178 (200) ++.|+.. T Consensus 191 t~~d~~~ 197 (682) T COG1770 191 TRLDENH 197 (682) T ss_pred EEEcCCC Confidence 9988863 No 86 >COG3386 Gluconolactonase [Carbohydrate transport and metabolism] Probab=89.27 E-value=1.3 Score=39.80 Aligned_cols=30 Identities=30% Similarity=0.372 Sum_probs=20.3 Q ss_pred CCcceEEccCCCEEEEEEEcCCcceEEEEE Q psy7767 156 STKTLWFSPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~ 185 (200) ..+++.|||||+.|.+.-...+.+..+.+. T Consensus 164 ~~NGla~SpDg~tly~aDT~~~~i~r~~~d 193 (307) T COG3386 164 IPNGLAFSPDGKTLYVADTPANRIHRYDLD 193 (307) T ss_pred ecCceEECCCCCEEEEEeCCCCeEEEEecC Confidence 567889999999887664444444444444 No 87 >COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism] Probab=89.14 E-value=1 Score=43.96 Aligned_cols=67 Identities=25% Similarity=0.399 Sum_probs=43.6 Q ss_pred CCceeeEECcCCCEEEEEEcC----------cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccC Q psy7767 96 NFLQLVQWSPVGHDLIFVKDN----------NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPD 165 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~g----------dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPD 165 (200) ..+.++..+|+|+.++|+... .+|+.+... ...++.. +....++|||| T Consensus 13 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~d~~~--~~~~~~~--------------------~~~~~~~~spd 70 (620) T COG1506 13 ARVSDPRVSPPGGRLAYILTGLDFLKPLYKSSLWVSDGKT--VRLLTFG--------------------GGVSELRWSPD 70 (620) T ss_pred hcccCcccCCCCceeEEeeccccccccccccceEEEeccc--ccccccC--------------------CcccccccCCC Confidence 357889999999999999753 455543221 2222221 24567899999 Q ss_pred CCEEEEEEEcCCcceEEEE Q psy7767 166 GSILAYMKFNDARVHNMSY 184 (200) Q Consensus 166 Gk~Laf~r~D~s~v~~~~i 184 (200) |+.++|...+......+++ T Consensus 71 g~~~~~~~~~~~~~~~l~l 89 (620) T COG1506 71 GSVLAFVSTDGGRVAQLYL 89 (620) T ss_pred CCEEEEEeccCCCcceEEE Confidence 9999999855444333333 No 88 >PTZ00421 coronin; Provisional Probab=89.13 E-value=2.5 Score=40.37 Aligned_cols=70 Identities=13% Similarity=0.152 Sum_probs=43.3 Q ss_pred CCCceeeEECcCCCEEEEE--EcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFV--KDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV--~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ...+..++|+|++..+..+ .++.|.+.++..+.......... .....+.|+|||+.|+-. T Consensus 125 ~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h~------------------~~V~sla~spdG~lLatg 186 (493) T PTZ00421 125 TKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHS------------------DQITSLEWNLDGSLLCTT 186 (493) T ss_pred CCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEcCCC------------------CceEEEEEECCCCEEEEe Confidence 4567889999997544333 35679999998765443222111 123468899999877654 Q ss_pred EEcCCcceEEE Q psy7767 173 KFNDARVHNMS 183 (200) Q Consensus 173 r~D~s~v~~~~ 183 (200) ..++.+.++. T Consensus 187 -s~Dg~IrIwD 196 (493) T PTZ00421 187 -SKDKKLNIID 196 (493) T ss_pred -cCCCEEEEEE Confidence 3344454443 No 89 >KOG2110|consensus Probab=89.11 E-value=5 Score=37.49 Aligned_cols=78 Identities=22% Similarity=0.137 Sum_probs=48.5 Q ss_pred CCCceeeEECcCCCEEEEEEcC--cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDN--NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~g--dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) .+.+.-+++||||+.||=.++. =|-|..+.+|+.. .+ +-.|.- .-+-.+++||||++.|+- T Consensus 173 ~~~lAalafs~~G~llATASeKGTVIRVf~v~~G~kl--~e------FRRG~~--------~~~IySL~Fs~ds~~L~~- 235 (391) T KOG2110|consen 173 KGPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQKL--YE------FRRGTY--------PVSIYSLSFSPDSQFLAA- 235 (391) T ss_pred CCceeEEEECCCCCEEEEeccCceEEEEEEcCCccEe--ee------eeCCce--------eeEEEEEEECCCCCeEEE- Confidence 5678999999999999987643 1222233333321 11 011110 013457899999998874 Q ss_pred EEcCCcceEEEEEeeCC Q psy7767 173 KFNDARVHNMSYIHYGE 189 (200) Q Consensus 173 r~D~s~v~~~~i~~~~~ 189 (200) ..+..-|.+|.+..... T Consensus 236 sS~TeTVHiFKL~~~~~ 252 (391) T KOG2110|consen 236 SSNTETVHIFKLEKVSN 252 (391) T ss_pred ecCCCeEEEEEeccccc Confidence 45566789998877764 No 90 >KOG2096|consensus Probab=88.41 E-value=1.4 Score=40.67 Aligned_cols=61 Identities=16% Similarity=0.113 Sum_probs=41.0 Q ss_pred CCCceeeEECcCCCEEEEEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) +...--+..||+|+-+|-..+.+|-+...++|+...-..+- -+ +....++|||||+++|-. T Consensus 331 g~~p~RL~lsP~g~~lA~s~gs~l~~~~se~g~~~~~~e~~-------------h~----~~Is~is~~~~g~~~atc 391 (420) T KOG2096|consen 331 GSEPVRLELSPSGDSLAVSFGSDLKVFASEDGKDYPELEDI-------------HS----TTISSISYSSDGKYIATC 391 (420) T ss_pred CCCceEEEeCCCCcEEEeecCCceEEEEcccCccchhHHHh-------------hc----CceeeEEecCCCcEEeee Confidence 33445789999999999999988888766655432111100 00 134678999999999754 No 91 >KOG0305|consensus Probab=88.34 E-value=2 Score=41.35 Aligned_cols=66 Identities=27% Similarity=0.325 Sum_probs=49.7 Q ss_pred ceeeEECcCCCEEEEEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCC Q psy7767 98 LQLVQWSPVGHDLIFVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDA 177 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s 177 (200) ..-+.|| ..+.||-..+..||+.+...+.+.+|...++ .....+.|+|||+.||--.. .+ T Consensus 180 ~nlldWs-s~n~laValg~~vylW~~~s~~v~~l~~~~~------------------~~vtSv~ws~~G~~LavG~~-~g 239 (484) T KOG0305|consen 180 LNLLDWS-SANVLAVALGQSVYLWSASSGSVTELCSFGE------------------ELVTSVKWSPDGSHLAVGTS-DG 239 (484) T ss_pred hhHhhcc-cCCeEEEEecceEEEEecCCCceEEeEecCC------------------CceEEEEECCCCCEEEEeec-CC Confidence 3556899 6667787788899999999999888887642 14568899999999986543 35 Q ss_pred cceEEE Q psy7767 178 RVHNMS 183 (200) Q Consensus 178 ~v~~~~ 183 (200) .++++. T Consensus 240 ~v~iwD 245 (484) T KOG0305|consen 240 TVQIWD 245 (484) T ss_pred eEEEEe Confidence 566554 No 92 >KOG0273|consensus Probab=88.25 E-value=1.2 Score=42.67 Aligned_cols=18 Identities=33% Similarity=0.512 Sum_probs=14.8 Q ss_pred cceEEccCCCEEEEEEEc Q psy7767 158 KTLWFSPDGSILAYMKFN 175 (200) Q Consensus 158 ~~~~WSPDGk~Laf~r~D 175 (200) .++.|||||++||+=..| T Consensus 456 ysvafS~~g~ylAsGs~d 473 (524) T KOG0273|consen 456 YSVAFSPNGRYLASGSLD 473 (524) T ss_pred EEEEecCCCcEEEecCCC Confidence 478999999999986554 No 93 >KOG1445|consensus Probab=88.13 E-value=1.1 Score=44.84 Aligned_cols=66 Identities=20% Similarity=0.280 Sum_probs=44.4 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ...|.+++|||||+++|-|. ++.|++..-..++. +|-...... | ++-..+.|.=||+.|..+- T Consensus 720 tdqIf~~AWSpdGr~~AtVcKDg~~rVy~Prs~e~-pv~Eg~gpv----g-----------tRgARi~wacdgr~viv~G 783 (1012) T KOG1445|consen 720 TDQIFGIAWSPDGRRIATVCKDGTLRVYEPRSREQ-PVYEGKGPV----G-----------TRGARILWACDGRIVIVVG 783 (1012) T ss_pred cCceeEEEECCCCcceeeeecCceEEEeCCCCCCC-ccccCCCCc----c-----------CcceeEEEEecCcEEEEec Confidence 46789999999999999995 77888876543332 233211110 0 2344577999999888776 Q ss_pred EcC Q psy7767 174 FND 176 (200) Q Consensus 174 ~D~ 176 (200) +|. T Consensus 784 fdk 786 (1012) T KOG1445|consen 784 FDK 786 (1012) T ss_pred ccc Confidence 654 No 94 >PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A .... Probab=88.07 E-value=6.8 Score=35.37 Aligned_cols=73 Identities=14% Similarity=0.095 Sum_probs=41.9 Q ss_pred eeEECcCCCEEEEEEcC------------cEEEEECCCCCeE--EEeeCCCcceEecccccceeeeeeeCCCcceEEccC Q psy7767 100 LVQWSPVGHDLIFVKDN------------NLYQAYDDFRSIN--ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPD 165 (200) Q Consensus 100 ~~~wSPDGk~IAFV~~g------------dLyv~~~~~g~~~--rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPD 165 (200) .+.|++||+.+.|+.-+ +||...+.++... .|-...... + ....+..|+| T Consensus 174 ~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt~~~~d~lvfe~~~~~---------------~-~~~~~~~s~d 237 (414) T PF02897_consen 174 SVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGTPQSEDELVFEEPDEP---------------F-WFVSVSRSKD 237 (414) T ss_dssp EEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS-GGG-EEEEC-TTCT---------------T-SEEEEEE-TT T ss_pred eEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCCChHhCeeEEeecCCC---------------c-EEEEEEecCc Confidence 49999999999999622 3777766554332 222211100 0 1235789999 Q ss_pred CCEEEEEEEcCCc-ceEEEEEeeC Q psy7767 166 GSILAYMKFNDAR-VHNMSYIHYG 188 (200) Q Consensus 166 Gk~Laf~r~D~s~-v~~~~i~~~~ 188 (200) |++|......... ...+.++... T Consensus 238 ~~~l~i~~~~~~~~s~v~~~d~~~ 261 (414) T PF02897_consen 238 GRYLFISSSSGTSESEVYLLDLDD 261 (414) T ss_dssp SSEEEEEEESSSSEEEEEEEECCC T ss_pred ccEEEEEEEccccCCeEEEEeccc Confidence 9988766665555 4455555443 No 95 >PF12566 DUF3748: Protein of unknown function (DUF3748); InterPro: IPR022223 This domain family is found in bacteria and eukaryotes, and is approximately 120 amino acids in length. Probab=88.06 E-value=3.2 Score=32.85 Aligned_cols=22 Identities=36% Similarity=0.530 Sum_probs=16.3 Q ss_pred CCCcceEEccCCCEEEEEEEcC Q psy7767 155 SSTKTLWFSPDGSILAYMKFND 176 (200) Q Consensus 155 ~~~~~~~WSPDGk~Laf~r~D~ 176 (200) |..-...|||||++|.|.-.|+ T Consensus 68 GGtHvHvfSpDG~~lSFTYNDh 89 (122) T PF12566_consen 68 GGTHVHVFSPDGSWLSFTYNDH 89 (122) T ss_pred CCccceEECCCCCEEEEEecch Confidence 3344567999999999985544 No 96 >KOG0315|consensus Probab=87.63 E-value=4.9 Score=36.14 Aligned_cols=76 Identities=17% Similarity=0.076 Sum_probs=48.0 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeE----EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSIN----ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSIL 169 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~----rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~L 169 (200) .-.++.++..|||+.++=+. .|+.|+.++-++... .++.-... + +....-.+|||+++| T Consensus 167 ~~~i~sl~v~~dgsml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah----~------------~~il~C~lSPd~k~l 230 (311) T KOG0315|consen 167 DTSIQSLTVMPDGSMLAAANNKGNCYVWRLLNHQTASELEPVHKFQAH----N------------GHILRCLLSPDVKYL 230 (311) T ss_pred CcceeeEEEcCCCcEEEEecCCccEEEEEccCCCccccceEhhheecc----c------------ceEEEEEECCCCcEE Confidence 45679999999999988775 579999987664321 22210000 0 122345799999999 Q ss_pred EEEEEcCCcceEEEEEee Q psy7767 170 AYMKFNDARVHNMSYIHY 187 (200) Q Consensus 170 af~r~D~s~v~~~~i~~~ 187 (200) |-...|. .+.++..... T Consensus 231 at~ssdk-tv~iwn~~~~ 247 (311) T KOG0315|consen 231 ATCSSDK-TVKIWNTDDF 247 (311) T ss_pred EeecCCc-eEEEEecCCc Confidence 8665553 4666554443 No 97 >KOG2315|consensus Probab=87.40 E-value=6.7 Score=38.29 Aligned_cols=75 Identities=19% Similarity=0.240 Sum_probs=48.2 Q ss_pred cCCCCCceeeEECcCCCEEEEEEcC---cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCE Q psy7767 92 LDGSNFLQLVQWSPVGHDLIFVKDN---NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSI 168 (200) Q Consensus 92 lt~~~~is~~~wSPDGk~IAFV~~g---dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~ 168 (200) |...+.+-+..|||+|+..+-|.+- -+=+-++. +.+.-....| -...+.|||-|+. T Consensus 267 L~k~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr-~~~v~df~eg--------------------pRN~~~fnp~g~i 325 (566) T KOG2315|consen 267 LLKEGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLR-GKPVFDFPEG--------------------PRNTAFFNPHGNI 325 (566) T ss_pred cCCCCCceEEEECCCCCEEEEEEecccceEEEEcCC-CCEeEeCCCC--------------------CccceEECCCCCE Confidence 4446789999999999999888753 23333433 2332222222 3456789999999 Q ss_pred EEEEEEcC--CcceEEEEEee Q psy7767 169 LAYMKFND--ARVHNMSYIHY 187 (200) Q Consensus 169 Laf~r~D~--s~v~~~~i~~~ 187 (200) |++.-+++ +.+..+.+... T Consensus 326 i~lAGFGNL~G~mEvwDv~n~ 346 (566) T KOG2315|consen 326 ILLAGFGNLPGDMEVWDVPNR 346 (566) T ss_pred EEEeecCCCCCceEEEeccch Confidence 99887665 34555555553 No 98 >KOG0266|consensus Probab=87.34 E-value=3.2 Score=38.78 Aligned_cols=73 Identities=23% Similarity=0.184 Sum_probs=47.2 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEEC-CCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYD-DFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~-~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ...+.+.+|||||+.|+=.. +..|.+.++ +.+...+.-. | ... ....+.|+|+|+.|+ . T Consensus 203 ~~~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~~~~~~~l~-g--------H~~---------~v~~~~f~p~g~~i~-S 263 (456) T KOG0266|consen 203 TRGVSDVAFSPDGSYLLSGSDDKTLRIWDLKDDGRNLKTLK-G--------HST---------YVTSVAFSPDGNLLV-S 263 (456) T ss_pred ccceeeeEECCCCcEEEEecCCceEEEeeccCCCeEEEEec-C--------CCC---------ceEEEEecCCCCEEE-E Confidence 46789999999999665554 457888888 3333333332 2 222 234789999995554 5 Q ss_pred EEcCCcceEEEEEe Q psy7767 173 KFNDARVHNMSYIH 186 (200) Q Consensus 173 r~D~s~v~~~~i~~ 186 (200) -.+++.+.++.+.. T Consensus 264 gs~D~tvriWd~~~ 277 (456) T KOG0266|consen 264 GSDDGTVRIWDVRT 277 (456) T ss_pred ecCCCcEEEEeccC Confidence 55556687777665 No 99 >KOG0279|consensus Probab=87.24 E-value=7.6 Score=35.23 Aligned_cols=80 Identities=18% Similarity=0.167 Sum_probs=50.1 Q ss_pred CCCCceeeEECcCCCEEEEEEcCcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ....+..+.|||+-=+|+-..+..|-+.+++++... -+..++.-.-...+.| +--.+.||+||..| |. T Consensus 232 a~~~v~sl~fspnrywL~~at~~sIkIwdl~~~~~v~~l~~d~~g~s~~~~~~----------~clslaws~dG~tL-f~ 300 (315) T KOG0279|consen 232 AFDIVNSLCFSPNRYWLCAATATSIKIWDLESKAVVEELKLDGIGPSSKAGDP----------ICLSLAWSADGQTL-FA 300 (315) T ss_pred CCCeEeeEEecCCceeEeeccCCceEEEeccchhhhhhccccccccccccCCc----------EEEEEEEcCCCcEE-Ee Confidence 345678899999999998888888888888876432 2333221000001111 34578999999988 55 Q ss_pred EEcCCcceEEEE Q psy7767 173 KFNDARVHNMSY 184 (200) Q Consensus 173 r~D~s~v~~~~i 184 (200) -.-+..+..+.+ T Consensus 301 g~td~~irv~qv 312 (315) T KOG0279|consen 301 GYTDNVIRVWQV 312 (315) T ss_pred eecCCcEEEEEe Confidence 444455555554 No 100 >KOG2110|consensus Probab=86.47 E-value=8.6 Score=35.95 Aligned_cols=73 Identities=15% Similarity=0.032 Sum_probs=48.4 Q ss_pred CCCceeeEECcCCCEEEEEE---cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK---DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~---~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) ..+...++.++++..+||=. .|+|++.++.+-++....+--+ +....+.|||||..||- T Consensus 129 ~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~------------------~~lAalafs~~G~llAT 190 (391) T KOG2110|consen 129 PKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHK------------------GPLAALAFSPDGTLLAT 190 (391) T ss_pred ccceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecC------------------CceeEEEECCCCCEEEE Confidence 34566666777788999974 4789999877544433332110 23467899999999987 Q ss_pred EEEcCCcceEEEEE Q psy7767 172 MKFNDARVHNMSYI 185 (200) Q Consensus 172 ~r~D~s~v~~~~i~ 185 (200) .....+-+++|.++ T Consensus 191 ASeKGTVIRVf~v~ 204 (391) T KOG2110|consen 191 ASEKGTVIRVFSVP 204 (391) T ss_pred eccCceEEEEEEcC Confidence 66655566666653 No 101 >KOG0771|consensus Probab=86.31 E-value=2.6 Score=39.54 Aligned_cols=78 Identities=17% Similarity=0.081 Sum_probs=51.0 Q ss_pred CCCceeeEECcCCCEEEEEEcCcEEEEECCCCC-eEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCC----EE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRS-INALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS----IL 169 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~-~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk----~L 169 (200) ..++.++.|||||+.||++......|.++++|. ..+.|+.+.. | ....-+||-|+. +| T Consensus 186 ~~eV~DL~FS~dgk~lasig~d~~~VW~~~~g~~~a~~t~~~k~--------------~---~~~~cRF~~d~~~~~l~l 248 (398) T KOG0771|consen 186 HAEVKDLDFSPDGKFLASIGADSARVWSVNTGAALARKTPFSKD--------------E---MFSSCRFSVDNAQETLRL 248 (398) T ss_pred cCccccceeCCCCcEEEEecCCceEEEEeccCchhhhcCCcccc--------------h---hhhhceecccCCCceEEE Confidence 678999999999999999988888888888774 3455532211 1 234567888773 23 Q ss_pred EEEEEcCCcceEEEEEeeCC Q psy7767 170 AYMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 170 af~r~D~s~v~~~~i~~~~~ 189 (200) |-.+.+...+....+..... T Consensus 249 aa~~~~~~~v~~~~~~~w~~ 268 (398) T KOG0771|consen 249 AASQFPGGGVRLCDISLWSG 268 (398) T ss_pred EEecCCCCceeEEEeeeecc Confidence 33355555666655554443 No 102 >KOG1538|consensus Probab=86.29 E-value=1.4 Score=44.24 Aligned_cols=34 Identities=15% Similarity=0.229 Sum_probs=29.5 Q ss_pred CCceeeEECcCCCEEEEEEcCcEEEEECCCCCeE Q psy7767 96 NFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSIN 129 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~ 129 (200) .-+.++++.|||.++....++.|++.+..+|..- T Consensus 13 hci~d~afkPDGsqL~lAAg~rlliyD~ndG~ll 46 (1081) T KOG1538|consen 13 HCINDIAFKPDGTQLILAAGSRLLVYDTSDGTLL 46 (1081) T ss_pred cchheeEECCCCceEEEecCCEEEEEeCCCcccc Confidence 4578999999999999999999999998776543 No 103 >KOG0273|consensus Probab=86.28 E-value=2.5 Score=40.59 Aligned_cols=63 Identities=14% Similarity=0.033 Sum_probs=46.3 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ...+..+++||||+++|+-. ++-|.+.+...++..+=..+.. ....++|+-+|.+|.-.. T Consensus 452 ~~pVysvafS~~g~ylAsGs~dg~V~iws~~~~~l~~s~~~~~-------------------~Ifel~Wn~~G~kl~~~~ 512 (524) T KOG0273|consen 452 QEPVYSVAFSPNGRYLASGSLDGCVHIWSTKTGKLVKSYQGTG-------------------GIFELCWNAAGDKLGACA 512 (524) T ss_pred CCceEEEEecCCCcEEEecCCCCeeEeccccchheeEeecCCC-------------------eEEEEEEcCCCCEEEEEe Confidence 45679999999999999984 6778888877777654443321 123689999999998665 Q ss_pred EcC Q psy7767 174 FND 176 (200) Q Consensus 174 ~D~ 176 (200) .|. T Consensus 513 sd~ 515 (524) T KOG0273|consen 513 SDG 515 (524) T ss_pred cCC Confidence 553 No 104 >KOG0305|consensus Probab=86.21 E-value=10 Score=36.57 Aligned_cols=70 Identities=19% Similarity=0.209 Sum_probs=45.6 Q ss_pred eeecCCCCCceeeEECcCCCEEEEEEc---CcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEEcc Q psy7767 89 VSDLDGSNFLQLVQWSPVGHDLIFVKD---NNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP 164 (200) Q Consensus 89 v~~lt~~~~is~~~wSPDGk~IAFV~~---gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP 164 (200) +..+-.+..+..+.|||..+.|+...+ ++|-+.....-... -++ | -. .+.-.+.||| T Consensus 381 i~~vdtgsQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~ps~~~~~~l~--g--------H~---------~RVl~la~SP 441 (484) T KOG0305|consen 381 IDSVDTGSQVCSLIWSKKYKELLSTHGYSENQITLWKYPSMKLVAELL--G--------HT---------SRVLYLALSP 441 (484) T ss_pred ecccccCCceeeEEEcCCCCEEEEecCCCCCcEEEEeccccceeeeec--C--------Cc---------ceeEEEEECC Confidence 333335788999999999999999875 45555444432111 111 1 10 1344689999 Q ss_pred CCCEEEEEEEcCC Q psy7767 165 DGSILAYMKFNDA 177 (200) Q Consensus 165 DGk~Laf~r~D~s 177 (200) ||++|+-...|++ T Consensus 442 dg~~i~t~a~DET 454 (484) T KOG0305|consen 442 DGETIVTGAADET 454 (484) T ss_pred CCCEEEEecccCc Confidence 9999988777775 No 105 >KOG2314|consensus Probab=86.20 E-value=6.4 Score=38.83 Aligned_cols=62 Identities=15% Similarity=0.373 Sum_probs=38.9 Q ss_pred CCCceeeEECcCCCEEEEEEcCc------EEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNN------LYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSI 168 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gd------Lyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~ 168 (200) ...+...+|-|.|++.+-+.++. +|-.....++...|..-.. .....+.|||.|+. T Consensus 445 ke~vi~FaWEP~gdkF~vi~g~~~k~tvsfY~~e~~~~~~~lVk~~dk------------------~~~N~vfwsPkG~f 506 (698) T KOG2314|consen 445 KESVIAFAWEPHGDKFAVISGNTVKNTVSFYAVETNIKKPSLVKELDK------------------KFANTVFWSPKGRF 506 (698) T ss_pred chheeeeeeccCCCeEEEEEccccccceeEEEeecCCCchhhhhhhcc------------------cccceEEEcCCCcE Confidence 45678889999999999998763 3433322222222221110 24567899999999 Q ss_pred EEEEEE Q psy7767 169 LAYMKF 174 (200) Q Consensus 169 Laf~r~ 174 (200) ++...- T Consensus 507 vvva~l 512 (698) T KOG2314|consen 507 VVVAAL 512 (698) T ss_pred EEEEEe Confidence 876643 No 106 >KOG0263|consensus Probab=86.09 E-value=2.6 Score=42.13 Aligned_cols=71 Identities=21% Similarity=0.143 Sum_probs=48.4 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) .+.+..+++||+|++||--. ++.|-++++.+|... ++-.. . +...+++||.||..||-- T Consensus 577 ~~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~H-t------------------~ti~SlsFS~dg~vLasg 637 (707) T KOG0263|consen 577 KGPVTALAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGH-T------------------GTIYSLSFSRDGNVLASG 637 (707) T ss_pred CCceEEEEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcc-c------------------CceeEEEEecCCCEEEec Confidence 46689999999999999875 567888898876542 11111 0 234578999999988754 Q ss_pred EEcCCcceEEEEE Q psy7767 173 KFNDARVHNMSYI 185 (200) Q Consensus 173 r~D~s~v~~~~i~ 185 (200) ..| ..|..+.+. T Consensus 638 g~D-nsV~lWD~~ 649 (707) T KOG0263|consen 638 GAD-NSVRLWDLT 649 (707) T ss_pred CCC-CeEEEEEch Confidence 444 446665443 No 107 >KOG0973|consensus Probab=85.83 E-value=2 Score=44.31 Aligned_cols=24 Identities=29% Similarity=0.369 Sum_probs=19.9 Q ss_pred CCCceeeEECcCCCEEEEEEcCcE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNNL 118 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gdL 118 (200) .+.+...+|||||+++|+-.+..+ T Consensus 69 ~~sv~CVR~S~dG~~lAsGSDD~~ 92 (942) T KOG0973|consen 69 DGSVNCVRFSPDGSYLASGSDDRL 92 (942) T ss_pred cCceeEEEECCCCCeEeeccCcce Confidence 356788999999999999987654 No 108 >TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome. Probab=85.47 E-value=8.1 Score=35.61 Aligned_cols=71 Identities=11% Similarity=0.030 Sum_probs=37.9 Q ss_pred cCCCCCceeeEECcCCCEEEEEEc-----------CcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcc Q psy7767 92 LDGSNFLQLVQWSPVGHDLIFVKD-----------NNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKT 159 (200) Q Consensus 92 lt~~~~is~~~wSPDGk~IAFV~~-----------gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~ 159 (200) +..+...... +||||+.|.-... +.|=+.|+.+.+.. .|--.+.+. +. . ...... T Consensus 43 i~~G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p~-~~-~----------~~~~~~ 109 (352) T TIGR02658 43 TDGGFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEGPR-FL-V----------GTYPWM 109 (352) T ss_pred EEccCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCCch-hh-c----------cCccce Confidence 3334444444 9999999754433 24666676665543 222211100 00 0 013446 Q ss_pred eEEccCCCEEEEEEEc Q psy7767 160 LWFSPDGSILAYMKFN 175 (200) Q Consensus 160 ~~WSPDGk~Laf~r~D 175 (200) +..||||++|+..-.+ T Consensus 110 ~~ls~dgk~l~V~n~~ 125 (352) T TIGR02658 110 TSLTPDNKTLLFYQFS 125 (352) T ss_pred EEECCCCCEEEEecCC Confidence 8999999988544333 No 109 >KOG0279|consensus Probab=85.40 E-value=7.8 Score=35.15 Aligned_cols=83 Identities=17% Similarity=-0.009 Sum_probs=55.1 Q ss_pred cceeEEEEEeeeeeeeeeecCCCC-ccceeecCC-CCCceeeEECcCCCEEEEE-EcCcEEEEECCCCCeEEEeeCCCcc Q psy7767 62 IALKLSIRCLTWKQSHSLSGRPDT-LTDVSDLDG-SNFLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSINALTRDGIKG 138 (200) Q Consensus 62 ~~~kv~i~~n~~~~R~s~~a~~~~-~~~v~~lt~-~~~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~~rLT~dg~~~ 138 (200) .-..+.|..+.+ .+. ...+++++. +..+++...||||....-. .++-+.+.|+.+|++.|.--....+ T Consensus 37 Dk~ii~W~L~~d---------d~~~G~~~r~~~GHsH~v~dv~~s~dg~~alS~swD~~lrlWDl~~g~~t~~f~GH~~d 107 (315) T KOG0279|consen 37 DKTIIVWKLTSD---------DIKYGVPVRRLTGHSHFVSDVVLSSDGNFALSASWDGTLRLWDLATGESTRRFVGHTKD 107 (315) T ss_pred ceEEEEEEeccC---------ccccCceeeeeeccceEecceEEccCCceEEeccccceEEEEEecCCcEEEEEEecCCc Confidence 334556666655 211 345667765 6788999999999984433 3678999999998776555433222 Q ss_pred eEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 139 VLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 139 ~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) ..++.+|||.++|+= T Consensus 108 ------------------Vlsva~s~dn~qivS 122 (315) T KOG0279|consen 108 ------------------VLSVAFSTDNRQIVS 122 (315) T ss_pred ------------------eEEEEecCCCceeec Confidence 236789999888753 No 110 >KOG0293|consensus Probab=85.27 E-value=2.7 Score=40.00 Aligned_cols=65 Identities=18% Similarity=0.298 Sum_probs=38.0 Q ss_pred CCCceeeEECcCCCEEEEEEcC---cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDN---NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~g---dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) ..++=...+||+||++|-.... -+|....+ +..+ |-.+=. |.. .....+.||||.++|+- T Consensus 224 tdEVWfl~FS~nGkyLAsaSkD~Taiiw~v~~d-~~~k-l~~tlv------gh~---------~~V~yi~wSPDdryLla 286 (519) T KOG0293|consen 224 TDEVWFLQFSHNGKYLASASKDSTAIIWIVVYD-VHFK-LKKTLV------GHS---------QPVSYIMWSPDDRYLLA 286 (519) T ss_pred CCcEEEEEEcCCCeeEeeccCCceEEEEEEecC-ccee-eeeeee------ccc---------CceEEEEECCCCCeEEe Confidence 5678899999999999988643 24444332 3311 111000 000 12346789999999975 Q ss_pred EEEcC Q psy7767 172 MKFND 176 (200) Q Consensus 172 ~r~D~ 176 (200) .-.++ T Consensus 287 Cg~~e 291 (519) T KOG0293|consen 287 CGFDE 291 (519) T ss_pred cCchH Confidence 54443 No 111 >PRK13613 lipoprotein LpqB; Provisional Probab=84.97 E-value=13 Score=36.67 Aligned_cols=80 Identities=9% Similarity=0.014 Sum_probs=48.7 Q ss_pred CCCceeeEECcCCCEEEEEEc----CcEEEEECC---CCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCC Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDD---FRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS 167 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~---~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk 167 (200) +..+..+++||||-++|.+.+ +.|++--+. +|. .+|+.--. + .. + ++...++.|..|++ T Consensus 454 g~~I~~lrvSrDG~RvAvv~~~~g~~~v~va~V~R~~~G~-~~l~~~~~--l-~~---------~-l~~v~~~~W~~~~s 519 (599) T PRK13613 454 GHRVVAVRVARDGVRVALIVEKDGRRSLQIGRIVRDAKAV-VSVEEFRS--L-AP---------E-LEDVTDMSWAGDSQ 519 (599) T ss_pred CCEeEEEEECCCccEEEEEEecCCCcEEEEEEEEeCCCCc-EEeeccEE--e-cc---------C-CCccceeEEcCCCE Confidence 346999999999999999986 678875433 233 55543100 0 00 1 12456889999986 Q ss_pred EEEEE-EEcCCcceEEEEEeeCC Q psy7767 168 ILAYM-KFNDARVHNMSYIHYGE 189 (200) Q Consensus 168 ~Laf~-r~D~s~v~~~~i~~~~~ 189 (200) |+-+ +...+..+.+.+...+. T Consensus 520 -L~Vlg~~~~~~~~v~~v~vdG~ 541 (599) T PRK13613 520 -LVVLGREEGGVQQARYVQVDGS 541 (599) T ss_pred -EEEEeccCCCCcceEEEecCCc Confidence 5544 44443455555555544 No 112 >KOG2394|consensus Probab=84.70 E-value=5.9 Score=38.81 Aligned_cols=87 Identities=25% Similarity=0.391 Sum_probs=52.0 Q ss_pred CCCceeeEECcCCCEEEEE--EcCcEEEEECCC--CC---eEEEeeCCCc-------ceEe-cccccceeeeeeeCCCcc Q psy7767 95 SNFLQLVQWSPVGHDLIFV--KDNNLYQAYDDF--RS---INALTRDGIK-------GVLF-NGVADWVYEEEVFSSTKT 159 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV--~~gdLyv~~~~~--g~---~~rLT~dg~~-------~~~~-nG~pdwv~eEEi~~~~~~ 159 (200) ...+...+|-|.++.+..+ +.|++|+.+.+. +. .-+.-.+++. .... |-+.-|...|. ...+ T Consensus 219 ktsvT~ikWvpg~~~~Fl~a~~sGnlyly~~~~~~~~t~p~~~~~k~~~~f~i~t~ksk~~rNPv~~w~~~~g---~in~ 295 (636) T KOG2394|consen 219 KSSVTCIKWVPGSDSLFLVAHASGNLYLYDKEIVCGATAPSYQALKDGDQFAILTSKSKKTRNPVARWHIGEG---SINE 295 (636) T ss_pred ccceEEEEEEeCCCceEEEEEecCceEEeeccccccCCCCcccccCCCCeeEEeeeeccccCCccceeEeccc---cccc Confidence 4567889999976665444 478999885532 11 1122222211 0011 33445655443 5678 Q ss_pred eEEccCCCEEEEEEEcCCcceEEEEE Q psy7767 160 LWFSPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 160 ~~WSPDGk~Laf~r~D~s~v~~~~i~ 185 (200) +.|||||++||-+..| +..++|.+. T Consensus 296 f~FS~DG~~LA~VSqD-GfLRvF~fd 320 (636) T KOG2394|consen 296 FAFSPDGKYLATVSQD-GFLRIFDFD 320 (636) T ss_pred eeEcCCCceEEEEecC-ceEEEeecc Confidence 8999999999988655 566776654 No 113 >PTZ00420 coronin; Provisional Probab=84.47 E-value=40 Score=33.10 Aligned_cols=35 Identities=14% Similarity=0.101 Sum_probs=28.5 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSIN 129 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~ 129 (200) ...+..++|+|||+.|+-.. ++.|.+.++.++... T Consensus 167 ~~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i 202 (568) T PTZ00420 167 PKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIA 202 (568) T ss_pred CCcEEEEEECCCCCEEEEEecCCEEEEEECCCCcEE Confidence 45688999999999998775 668999999877654 No 114 >PTZ00421 coronin; Provisional Probab=84.23 E-value=7.2 Score=37.27 Aligned_cols=78 Identities=13% Similarity=0.060 Sum_probs=44.4 Q ss_pred CCCceeeEECc-CCCEEEEEE-cCcEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSP-VGHDLIFVK-DNNLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSP-DGk~IAFV~-~gdLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) .+.+..++|+| |++.||-.. ++.|.+.++..+... ..+. . ...+.|.. .....+.|+|++..++. T Consensus 75 ~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~-~--l~~L~gH~---------~~V~~l~f~P~~~~iLa 142 (493) T PTZ00421 75 EGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISD-P--IVHLQGHT---------KKVGIVSFHPSAMNVLA 142 (493) T ss_pred CCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCc-c--eEEecCCC---------CcEEEEEeCcCCCCEEE Confidence 46789999999 888877664 567888887654211 0000 0 00011110 12346789998754444 Q ss_pred EEEcCCcceEEEE Q psy7767 172 MKFNDARVHNMSY 184 (200) Q Consensus 172 ~r~D~s~v~~~~i 184 (200) ....++.+.++.+ T Consensus 143 Sgs~DgtVrIWDl 155 (493) T PTZ00421 143 SAGADMVVNVWDV 155 (493) T ss_pred EEeCCCEEEEEEC Confidence 5555555655543 No 115 >KOG2055|consensus Probab=83.78 E-value=6 Score=37.99 Aligned_cols=118 Identities=16% Similarity=0.105 Sum_probs=72.3 Q ss_pred eeeeeeeeeeeecceEEEEeccCcceeEEEEEeeeeeeeeeecCCCCccceeecCC-CCCceeeEECcCCCEEEEEEcC- Q psy7767 39 IRCLTWKQSEYWDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLDG-SNFLQLVQWSPVGHDLIFVKDN- 116 (200) Q Consensus 39 ~~~~~~~~~~~~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~-~~~is~~~wSPDGk~IAFV~~g- 116 (200) +||-.-.-..+..+.|=-++=+... .++++.-.+|.--.|--+.-+-..++.+.. ...++.+.++|+|+..+|+.+. T Consensus 201 krlkDaNa~~ps~~~I~sv~FHp~~-plllvaG~d~~lrifqvDGk~N~~lqS~~l~~fPi~~a~f~p~G~~~i~~s~rr 279 (514) T KOG2055|consen 201 KRLKDANAAHPSHGGITSVQFHPTA-PLLLVAGLDGTLRIFQVDGKVNPKLQSIHLEKFPIQKAEFAPNGHSVIFTSGRR 279 (514) T ss_pred EeecccccCCcCcCCceEEEecCCC-ceEEEecCCCcEEEEEecCccChhheeeeeccCccceeeecCCCceEEEecccc Confidence 4555556666777777666654443 344444445442233333322234444432 5678999999999988888765 Q ss_pred -cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 117 -NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 117 -dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) -+|..++..++..+|..- +|.. |. ....+.-||||+.||+. T Consensus 280 ky~ysyDle~ak~~k~~~~-------~g~e-----~~---~~e~FeVShd~~fia~~ 321 (514) T KOG2055|consen 280 KYLYSYDLETAKVTKLKPP-------YGVE-----EK---SMERFEVSHDSNFIAIA 321 (514) T ss_pred eEEEEeeccccccccccCC-------CCcc-----cc---hhheeEecCCCCeEEEc Confidence 477788887777777653 2331 11 33456779999988764 No 116 >KOG0291|consensus Probab=83.74 E-value=4.6 Score=40.94 Aligned_cols=61 Identities=15% Similarity=0.159 Sum_probs=45.1 Q ss_pred ceeeEECcCCCEEEEEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCC Q psy7767 98 LQLVQWSPVGHDLIFVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDA 177 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s 177 (200) ...+.++|||+.|.+-.+|.|.+.++...+...+.-+-.. ....+.-||||..| +..|++ T Consensus 17 ~Gnl~ft~dG~sviSPvGNrvsv~dLknN~S~Tl~~e~~~------------------NI~~ialSp~g~ll--lavdE~ 76 (893) T KOG0291|consen 17 AGNLVFTKDGNSVISPVGNRVSVFDLKNNKSYTLPLETRY------------------NITRIALSPDGTLL--LAVDER 76 (893) T ss_pred cCcEEECCCCCEEEeccCCEEEEEEccCCcceeEEeecCC------------------ceEEEEeCCCceEE--EEEcCC Confidence 3568999999999999999999999987776655443321 34568899999755 445555 Q ss_pred c Q psy7767 178 R 178 (200) Q Consensus 178 ~ 178 (200) + T Consensus 77 g 77 (893) T KOG0291|consen 77 G 77 (893) T ss_pred C Confidence 3 No 117 >KOG1273|consensus Probab=83.37 E-value=1.8 Score=39.96 Aligned_cols=61 Identities=18% Similarity=0.118 Sum_probs=41.5 Q ss_pred CceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEc Q psy7767 97 FLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFN 175 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D 175 (200) .....+|||.|..+|--. +|.|-+.+..+-...++...-. .-..+++|||||+.|+-...| T Consensus 25 ~a~~~~Fs~~G~~lAvGc~nG~vvI~D~~T~~iar~lsaH~------------------~pi~sl~WS~dgr~LltsS~D 86 (405) T KOG1273|consen 25 LAECCQFSRWGDYLAVGCANGRVVIYDFDTFRIARMLSAHV------------------RPITSLCWSRDGRKLLTSSRD 86 (405) T ss_pred ccceEEeccCcceeeeeccCCcEEEEEccccchhhhhhccc------------------cceeEEEecCCCCEeeeecCC Confidence 377889999999999885 6789888887543322221000 123578999999988755443 No 118 >KOG1007|consensus Probab=83.35 E-value=3 Score=38.12 Aligned_cols=63 Identities=25% Similarity=0.241 Sum_probs=43.2 Q ss_pred CCceeeEECcCCCEEEEEEcCcEEEEECCCCCeE-E-EeeCCCcceEecccccceeeeeeeCCCcceEEcc--CCCEEEE Q psy7767 96 NFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSIN-A-LTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP--DGSILAY 171 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~-r-LT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP--DGk~Laf 171 (200) +.+....|.||++++|-+.+++|-+.+++.+..+ + |-..+. ++ +-....+-.||| ||..++- T Consensus 124 g~i~cvew~Pns~klasm~dn~i~l~~l~ess~~vaev~ss~s-----~e---------~~~~ftsg~WspHHdgnqv~t 189 (370) T KOG1007|consen 124 GKINCVEWEPNSDKLASMDDNNIVLWSLDESSKIVAEVLSSES-----AE---------MRHSFTSGAWSPHHDGNQVAT 189 (370) T ss_pred CceeeEEEcCCCCeeEEeccCceEEEEcccCcchheeeccccc-----cc---------ccceecccccCCCCccceEEE Confidence 3678899999999999999999999999876652 1 211111 11 112345567999 8887764 Q ss_pred E Q psy7767 172 M 172 (200) Q Consensus 172 ~ 172 (200) . T Consensus 190 t 190 (370) T KOG1007|consen 190 T 190 (370) T ss_pred e Confidence 4 No 119 >KOG1274|consensus Probab=83.26 E-value=6.2 Score=40.55 Aligned_cols=77 Identities=17% Similarity=0.138 Sum_probs=50.5 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCe-EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSI-NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~-~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) .+.+..+.+.|+|+-||-+. +|.|++.++.++.. ..++.-...+ + . +.-.....++|+|||..||+. T Consensus 138 ~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n----~---~----~~s~i~~~~aW~Pk~g~la~~ 206 (933) T KOG1274|consen 138 DAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDN----E---F----ILSRICTRLAWHPKGGTLAVP 206 (933) T ss_pred CCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCCccc----c---c----cccceeeeeeecCCCCeEEee Confidence 46688999999999999985 78999999987653 2333211100 0 0 000134568999998888887 Q ss_pred EEcCCcceEEE Q psy7767 173 KFNDARVHNMS 183 (200) Q Consensus 173 r~D~s~v~~~~ 183 (200) ..|+ .|..|. T Consensus 207 ~~d~-~Vkvy~ 216 (933) T KOG1274|consen 207 PVDN-TVKVYS 216 (933) T ss_pred ccCC-eEEEEc Confidence 6654 355444 No 120 >KOG2106|consensus Probab=83.15 E-value=7.2 Score=38.04 Aligned_cols=70 Identities=13% Similarity=0.207 Sum_probs=43.9 Q ss_pred CceeeEECcCCCEEEE-EEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEc Q psy7767 97 FLQLVQWSPVGHDLIF-VKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFN 175 (200) Q Consensus 97 ~is~~~wSPDGk~IAF-V~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D 175 (200) ....+.+.|.| .||- ...|..++.+.++....++-.+++ ..+.++|||||..||.-..| T Consensus 409 ~~~~~~fhpsg-~va~Gt~~G~w~V~d~e~~~lv~~~~d~~-------------------~ls~v~ysp~G~~lAvgs~d 468 (626) T KOG2106|consen 409 PAECADFHPSG-VVAVGTATGRWFVLDTETQDLVTIHTDNE-------------------QLSVVRYSPDGAFLAVGSHD 468 (626) T ss_pred ceeEeeccCcc-eEEEeeccceEEEEecccceeEEEEecCC-------------------ceEEEEEcCCCCEEEEecCC Confidence 44566677777 3333 345677777777666666665532 34678999999999865444 Q ss_pred CCcceEEEEEee Q psy7767 176 DARVHNMSYIHY 187 (200) Q Consensus 176 ~s~v~~~~i~~~ 187 (200) ..+-+|.+... T Consensus 469 -~~iyiy~Vs~~ 479 (626) T KOG2106|consen 469 -NHIYIYRVSAN 479 (626) T ss_pred -CeEEEEEECCC Confidence 34555555443 No 121 >KOG2055|consensus Probab=83.01 E-value=5.5 Score=38.23 Aligned_cols=73 Identities=18% Similarity=0.145 Sum_probs=50.7 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCe-EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSI-NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~-~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) .+.+++..||-||+.|.-+. .|+||+.++..... .+...+|. . .-..++-|++|++|| + T Consensus 344 eG~v~~~~fsSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D~G~----v--------------~gts~~~S~ng~ylA-~ 404 (514) T KOG2055|consen 344 EGVVSDFTFSSDSKELLASGGTGEVYVWNLRQNSCLHRFVDDGS----V--------------HGTSLCISLNGSYLA-T 404 (514) T ss_pred ccEEeeEEEecCCcEEEEEcCCceEEEEecCCcceEEEEeecCc----c--------------ceeeeeecCCCceEE-e Confidence 56689999999998876553 57999999986543 56677665 1 234678899999776 3 Q ss_pred EEcCCcceEEEEEe Q psy7767 173 KFNDARVHNMSYIH 186 (200) Q Consensus 173 r~D~s~v~~~~i~~ 186 (200) -.+.+-|.+|.... T Consensus 405 GS~~GiVNIYd~~s 418 (514) T KOG2055|consen 405 GSDSGIVNIYDGNS 418 (514) T ss_pred ccCcceEEEeccch Confidence 35555566665333 No 122 >KOG1445|consensus Probab=82.59 E-value=1.9 Score=43.07 Aligned_cols=33 Identities=18% Similarity=0.376 Sum_probs=24.2 Q ss_pred CCCCceeeEECc-CCCEEEEEEc-C--cEEEEECCCC Q psy7767 94 GSNFLQLVQWSP-VGHDLIFVKD-N--NLYQAYDDFR 126 (200) Q Consensus 94 ~~~~is~~~wSP-DGk~IAFV~~-g--dLyv~~~~~g 126 (200) ++..+.++.|.| |-.++|...+ + +||-....+. T Consensus 626 Ngt~vtDl~WdPFD~~rLAVa~ddg~i~lWr~~a~gl 662 (1012) T KOG1445|consen 626 NGTLVTDLHWDPFDDERLAVATDDGQINLWRLTANGL 662 (1012) T ss_pred cCceeeecccCCCChHHeeecccCceEEEEEeccCCC Confidence 466789999999 8899988764 3 5666655544 No 123 >PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation []. Probab=82.57 E-value=3.9 Score=42.18 Aligned_cols=63 Identities=16% Similarity=0.122 Sum_probs=41.7 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEE----ECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQA----YDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSIL 169 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~----~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~L 169 (200) +..+..+.+-+|...|.++. +|+|.+. +.+.....-+..- - +.-.+.+||||+..| T Consensus 75 ~~~ivs~~yl~d~~~l~~~~~~Gdi~~~~~~~~~~~~~~E~VG~v------d-------------~GI~a~~WSPD~Ell 135 (928) T PF04762_consen 75 NDKIVSFQYLADSESLCIALASGDIILVREDPDPDEDEIEIVGSV------D-------------SGILAASWSPDEELL 135 (928) T ss_pred CCcEEEEEeccCCCcEEEEECCceEEEEEccCCCCCceeEEEEEE------c-------------CcEEEEEECCCcCEE Confidence 46678888999988877775 7788888 3332232222210 0 123578999999999 Q ss_pred EEEEEcC Q psy7767 170 AYMKFND 176 (200) Q Consensus 170 af~r~D~ 176 (200) |++..+. T Consensus 136 a~vT~~~ 142 (928) T PF04762_consen 136 ALVTGEG 142 (928) T ss_pred EEEeCCC Confidence 9886543 No 124 >KOG0319|consensus Probab=81.94 E-value=5.3 Score=40.22 Aligned_cols=79 Identities=19% Similarity=0.180 Sum_probs=51.6 Q ss_pred ceeecCCCCCceeeEECcCCCEEEEEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCC Q psy7767 88 DVSDLDGSNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS 167 (200) Q Consensus 88 ~v~~lt~~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk 167 (200) .++++-.++. ++||++|+.|+-..++.|-..++.++... +. .+. ++. +....++.-+||++ T Consensus 15 s~epiYtGG~---~~~s~nG~~L~t~~~d~Vi~idv~t~~~~-l~-s~~-----~ed---------~d~ita~~l~~d~~ 75 (775) T KOG0319|consen 15 SLEPIYTGGP---VAWSSNGQHLYTACGDRVIIIDVATGSIA-LP-SGS-----NED---------EDEITALALTPDEE 75 (775) T ss_pred cccceecCCc---eeECCCCCEEEEecCceEEEEEccCCcee-cc-cCC-----ccc---------hhhhheeeecCCcc Confidence 4444333333 89999999998888888999998877764 22 221 111 12456788999988 Q ss_pred EEEEEEEcCCcceEEEEEe Q psy7767 168 ILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 168 ~Laf~r~D~s~v~~~~i~~ 186 (200) .|. ...+.....++.++. T Consensus 76 ~L~-~a~rs~llrv~~L~t 93 (775) T KOG0319|consen 76 VLV-TASRSQLLRVWSLPT 93 (775) T ss_pred EEE-EeeccceEEEEEccc Confidence 775 455556666666553 No 125 >KOG1063|consensus Probab=81.36 E-value=3.3 Score=41.40 Aligned_cols=74 Identities=16% Similarity=0.037 Sum_probs=42.0 Q ss_pred CceeeEECcCCCEEEEEEcCcEEEE-ECCCC--CeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 97 FLQLVQWSPVGHDLIFVKDNNLYQA-YDDFR--SINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~~gdLyv~-~~~~g--~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) .+..++|||||+.|+-|.+...+-. +...+ ...+. .++.+. -....+-.||||++++| .. T Consensus 574 TVT~l~FSpdg~~LLsvsRDRt~sl~~~~~~~~~e~~f---------a~~k~H-------tRIIWdcsW~pde~~Fa-Ta 636 (764) T KOG1063|consen 574 TVTRLAFSPDGRYLLSVSRDRTVSLYEVQEDIKDEFRF---------ACLKAH-------TRIIWDCSWSPDEKYFA-TA 636 (764) T ss_pred EEEEEEECCCCcEEEEeecCceEEeeeeecccchhhhh---------cccccc-------ceEEEEcccCcccceeE-Ee Confidence 4688999999999999976643322 11111 11110 111110 00113446999999865 45 Q ss_pred EcCCcceEEEEEee Q psy7767 174 FNDARVHNMSYIHY 187 (200) Q Consensus 174 ~D~s~v~~~~i~~~ 187 (200) .++..|..+..+.. T Consensus 637 SRDK~VkVW~~~~~ 650 (764) T KOG1063|consen 637 SRDKKVKVWEEPDL 650 (764) T ss_pred cCCceEEEEeccCc Confidence 66667877776655 No 126 >KOG2106|consensus Probab=81.33 E-value=10 Score=36.98 Aligned_cols=31 Identities=29% Similarity=0.373 Sum_probs=24.7 Q ss_pred CCCCceeeEECcCCCEEEEEE-cCcEEEEECC Q psy7767 94 GSNFLQLVQWSPVGHDLIFVK-DNNLYQAYDD 124 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~ 124 (200) ++..++-.++||||..+|.-. ++.||+.-++ T Consensus 446 d~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs 477 (626) T KOG2106|consen 446 DNEQLSVVRYSPDGAFLAVGSHDNHIYIYRVS 477 (626) T ss_pred cCCceEEEEEcCCCCEEEEecCCCeEEEEEEC Confidence 467889999999999999986 5567776544 No 127 >KOG0268|consensus Probab=81.05 E-value=5.5 Score=37.34 Aligned_cols=112 Identities=14% Similarity=0.097 Sum_probs=63.4 Q ss_pred EEEEeccCcceeEEEEEeeeeeeeeeecCCCCc---------cceeecC-------C-CCCceeeEECcCCCEEEEEE-c Q psy7767 54 LYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTL---------TDVSDLD-------G-SNFLQLVQWSPVGHDLIFVK-D 115 (200) Q Consensus 54 ~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~---------~~v~~lt-------~-~~~is~~~wSPDGk~IAFV~-~ 115 (200) ||-+|.+.-.+||.+...+.+-.+.-.|..|+. .+++.|. + ...+.+..+||.|+.++-.+ + T Consensus 214 LyD~R~~~Pl~KVi~~mRTN~IswnPeafnF~~a~ED~nlY~~DmR~l~~p~~v~~dhvsAV~dVdfsptG~EfvsgsyD 293 (433) T KOG0268|consen 214 LYDLRQASPLKKVILTMRTNTICWNPEAFNFVAANEDHNLYTYDMRNLSRPLNVHKDHVSAVMDVDFSPTGQEFVSGSYD 293 (433) T ss_pred EEecccCCccceeeeeccccceecCccccceeeccccccceehhhhhhcccchhhcccceeEEEeccCCCcchhcccccc Confidence 566777777777777666655433332222211 1333332 1 34567888888888844331 2 Q ss_pred CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEE Q psy7767 116 NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMS 183 (200) Q Consensus 116 gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~ 183 (200) ..|-+..+..+..+-+-.+.. +.+...+-||-|+++| +.-.|+.++..+. T Consensus 294 ksIRIf~~~~~~SRdiYhtkR-----------------Mq~V~~Vk~S~Dskyi-~SGSdd~nvRlWk 343 (433) T KOG0268|consen 294 KSIRIFPVNHGHSRDIYHTKR-----------------MQHVFCVKYSMDSKYI-ISGSDDGNVRLWK 343 (433) T ss_pred ceEEEeecCCCcchhhhhHhh-----------------hheeeEEEEeccccEE-EecCCCcceeeee Confidence 244444445444443332221 1345578899999877 5677888888765 No 128 >COG3386 Gluconolactonase [Carbohydrate transport and metabolism] Probab=80.21 E-value=11 Score=33.96 Aligned_cols=26 Identities=23% Similarity=0.273 Sum_probs=20.0 Q ss_pred eeeEECcCCCEEEEEE--cCcEEEEECC Q psy7767 99 QLVQWSPVGHDLIFVK--DNNLYQAYDD 124 (200) Q Consensus 99 s~~~wSPDGk~IAFV~--~gdLyv~~~~ 124 (200) -.++|||||+.+.|+- .+.|+..+.+ T Consensus 166 NGla~SpDg~tly~aDT~~~~i~r~~~d 193 (307) T COG3386 166 NGLAFSPDGKTLYVADTPANRIHRYDLD 193 (307) T ss_pred CceEECCCCCEEEEEeCCCCeEEEEecC Confidence 5679999999776663 3678888776 No 129 >KOG2096|consensus Probab=79.68 E-value=10 Score=35.18 Aligned_cols=76 Identities=12% Similarity=0.113 Sum_probs=49.0 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEEECCCCC---eEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQAYDDFRS---INALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~---~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) +..++++.||-|||.+|-..+ +-|-+.++++=+ -+-+-. ++ | +++...+.|+||-+.++ T Consensus 86 ~~~vt~~~FsSdGK~lat~~~Dr~Ir~w~~~DF~~~eHr~~R~----nv------------e-~dhpT~V~FapDc~s~v 148 (420) T KOG2096|consen 86 KKEVTDVAFSSDGKKLATISGDRSIRLWDVRDFENKEHRCIRQ----NV------------E-YDHPTRVVFAPDCKSVV 148 (420) T ss_pred CCceeeeEEcCCCceeEEEeCCceEEEEecchhhhhhhhHhhc----cc------------c-CCCceEEEECCCcceEE Confidence 577899999999999998875 445555555311 110000 00 1 24677889999999998 Q ss_pred EEEEcCCcceEEEEEee Q psy7767 171 YMKFNDARVHNMSYIHY 187 (200) Q Consensus 171 f~r~D~s~v~~~~i~~~ 187 (200) +.-.+.+..-.|.+.-- T Consensus 149 v~~~~g~~l~vyk~~K~ 165 (420) T KOG2096|consen 149 VSVKRGNKLCVYKLVKK 165 (420) T ss_pred EEEccCCEEEEEEeeec Confidence 87665555555554433 No 130 >KOG0771|consensus Probab=79.48 E-value=4.2 Score=38.18 Aligned_cols=65 Identities=17% Similarity=0.101 Sum_probs=46.4 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ...++.+++|+||+.+|.-. +|+|-+.++..-+..++.... + .+...++.||||.++++=+. T Consensus 281 ~~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq~~~~vk~a--H---------------~~~VT~ltF~Pdsr~~~svS 343 (398) T KOG0771|consen 281 FKSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQRLQYVKEA--H---------------LGFVTGLTFSPDSRYLASVS 343 (398) T ss_pred cCcceeEEEcCCCcEEEEeccCCcEEEEEeceeeeeEeehhh--h---------------eeeeeeEEEcCCcCcccccc Confidence 34789999999999999874 678887766544444555422 1 12456789999999998776 Q ss_pred EcC Q psy7767 174 FND 176 (200) Q Consensus 174 ~D~ 176 (200) .+. T Consensus 344 s~~ 346 (398) T KOG0771|consen 344 SDN 346 (398) T ss_pred cCC Confidence 654 No 131 >KOG0263|consensus Probab=79.18 E-value=5.8 Score=39.79 Aligned_cols=72 Identities=15% Similarity=0.058 Sum_probs=48.1 Q ss_pred CCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 96 NFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) .++...++.|++.++|=-. +.-+-++++.+|..+||-..-. +...++.+||+|++||-. . T Consensus 536 sDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~~VRiF~GH~------------------~~V~al~~Sp~Gr~LaSg-~ 596 (707) T KOG0263|consen 536 SDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGNSVRIFTGHK------------------GPVTALAFSPCGRYLASG-D 596 (707) T ss_pred cccceEEECCcccccccCCCCceEEEEEcCCCcEEEEecCCC------------------CceEEEEEcCCCceEeec-c Confidence 4566688999998775442 2345556777888887775321 345689999999999743 4 Q ss_pred cCCcceEEEEEe Q psy7767 175 NDARVHNMSYIH 186 (200) Q Consensus 175 D~s~v~~~~i~~ 186 (200) .++.+..+.+.. T Consensus 597 ed~~I~iWDl~~ 608 (707) T KOG0263|consen 597 EDGLIKIWDLAN 608 (707) T ss_pred cCCcEEEEEcCC Confidence 445566665544 No 132 >KOG1407|consensus Probab=78.86 E-value=9.4 Score=34.49 Aligned_cols=72 Identities=13% Similarity=0.145 Sum_probs=41.0 Q ss_pred CCCCceeeEECcCCCEEEEEEcCc-EEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKDNN-LYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~gd-Lyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ...+.-...|||+|+++++..+.+ |-..+.. ..++... ||++-...+..|+-++ -+.|+ T Consensus 105 ~~~eni~i~wsp~g~~~~~~~kdD~it~id~r---~~~~~~~----------------~~~~~e~ne~~w~~~n-d~Ffl 164 (313) T KOG1407|consen 105 TKGENINITWSPDGEYIAVGNKDDRITFIDAR---TYKIVNE----------------EQFKFEVNEISWNNSN-DLFFL 164 (313) T ss_pred ccCcceEEEEcCCCCEEEEecCcccEEEEEec---ccceeeh----------------hcccceeeeeeecCCC-CEEEE Confidence 456778899999999999996554 3333332 2222221 1222223456777444 46666 Q ss_pred EEcCCcceEEEEE Q psy7767 173 KFNDARVHNMSYI 185 (200) Q Consensus 173 r~D~s~v~~~~i~ 185 (200) ....+-+.++.+| T Consensus 165 t~GlG~v~ILsyp 177 (313) T KOG1407|consen 165 TNGLGCVEILSYP 177 (313) T ss_pred ecCCceEEEEecc Confidence 5554556655544 No 133 >KOG0286|consensus Probab=78.40 E-value=9.8 Score=34.82 Aligned_cols=59 Identities=20% Similarity=0.238 Sum_probs=41.4 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) .+.+..+.||||++.|+-.+ +|.|.+.+.-++......+-+. .|+. .-.|||.|+.+|- T Consensus 55 ~~Ki~~~~ws~Dsr~ivSaSqDGklIvWDs~TtnK~haipl~s---------~WVM---------tCA~sPSg~~VAc 114 (343) T KOG0286|consen 55 LNKIYAMDWSTDSRRIVSASQDGKLIVWDSFTTNKVHAIPLPS---------SWVM---------TCAYSPSGNFVAC 114 (343) T ss_pred ccceeeeEecCCcCeEEeeccCCeEEEEEcccccceeEEecCc---------eeEE---------EEEECCCCCeEEe Confidence 46789999999999998875 6788888776554433333221 2443 4679999998874 No 134 >PRK13615 lipoprotein LpqB; Provisional Probab=77.83 E-value=34 Score=33.55 Aligned_cols=99 Identities=8% Similarity=-0.049 Sum_probs=57.1 Q ss_pred ecceEEEEeccCcceeEEEEEeeeeeeeeeecCCCCccceeecCCCCCceeeEECcCCCEEEEEEc----CcEEEEEC-- Q psy7767 50 WDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLDGSNFLQLVQWSPVGHDLIFVKD----NNLYQAYD-- 123 (200) Q Consensus 50 ~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~~~~is~~~wSPDGk~IAFV~~----gdLyv~~~-- 123 (200) .++.+|+...+.. ..++....+|.- .-....| . .+..|..+++|+||-++|.+.+ +.|++.-+ T Consensus 382 ~~g~vWtv~~g~~--~~l~~~~~~G~~-~~v~v~~-~-------~~~~I~~lrvSrDG~R~Avi~~~~g~~~V~va~V~R 450 (557) T PRK13615 382 AQGYVWSTPASDP--RGLVAWGPDGVG-HPVAVSW-T-------ATGRVVSLEVARDGARVLVQLETGAGPQLLVASIVR 450 (557) T ss_pred CCCCEEEEeCCCc--eEEEEecCCCce-EEeeccc-c-------CCCeeEEEEeCCCccEEEEEEecCCCCEEEEEEEEe Confidence 4568888876663 333333333221 0111112 0 3467999999999999999986 67888533 Q ss_pred CCCCeEEE-eeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 124 DFRSINAL-TRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 124 ~~g~~~rL-T~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) +++.+.+| +. ..+ + .. + ++...++.|..|++ |+-+. T Consensus 451 ~~~~P~~L~~~-p~~-l-~~---------~-l~~v~sl~W~~~~~-laVl~ 487 (557) T PRK13615 451 DGGVPTSLTTT-PLE-L-LA---------S-PGTPLDATWVDELD-VATLT 487 (557) T ss_pred CCCcceEeeec-cEE-c-cc---------C-cCcceeeEEcCCCE-EEEEe Confidence 33445567 33 110 0 00 0 23567889999986 66564 No 135 >KOG0288|consensus Probab=77.31 E-value=17 Score=34.58 Aligned_cols=72 Identities=22% Similarity=0.256 Sum_probs=46.2 Q ss_pred eeecCCCCCceeeEECcCCCEE-EEEEcCcEEEEECCCCCeEEEee-CCCcceEecccccceeeeeeeCCCcceEEccCC Q psy7767 89 VSDLDGSNFLQLVQWSPVGHDL-IFVKDNNLYQAYDDFRSINALTR-DGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDG 166 (200) Q Consensus 89 v~~lt~~~~is~~~wSPDGk~I-AFV~~gdLyv~~~~~g~~~rLT~-dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDG 166 (200) +...+.++.+..+..|+||..| ++++++.+=+.++.+.+.++.-. .|- . -.-|| +...||||| T Consensus 335 ~~sv~~gg~vtSl~ls~~g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~----k-~asDw----------trvvfSpd~ 399 (459) T KOG0288|consen 335 TRSVPLGGRVTSLDLSMDGLELLSSSRDDTLKVIDLRTKEIRQTFSAEGF----K-CASDW----------TRVVFSPDG 399 (459) T ss_pred eeEeecCcceeeEeeccCCeEEeeecCCCceeeeecccccEEEEeecccc----c-ccccc----------ceeEECCCC Confidence 3334456788999999999775 56667778888887666654443 221 1 11122 456899999 Q ss_pred CEEEEEEEc Q psy7767 167 SILAYMKFN 175 (200) Q Consensus 167 k~Laf~r~D 175 (200) +++|--..| T Consensus 400 ~YvaAGS~d 408 (459) T KOG0288|consen 400 SYVAAGSAD 408 (459) T ss_pred ceeeeccCC Confidence 988644333 No 136 >PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A .... Probab=76.84 E-value=28 Score=31.84 Aligned_cols=90 Identities=11% Similarity=0.003 Sum_probs=46.9 Q ss_pred ccceeecCCCCCceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEc Q psy7767 86 LTDVSDLDGSNFLQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFS 163 (200) Q Consensus 86 ~~~v~~lt~~~~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WS 163 (200) ..-+.++..+......++||||++|+-.. .+++-+.|.++.+..+....+.. .+.. .+ ++..++.-| T Consensus 68 ~~~v~~i~~G~~~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~----~~~~-----~~--~Rv~aIv~s 136 (369) T PF02239_consen 68 GKVVATIKVGGNPRGIAVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIPTGGM----PVDG-----PE--SRVAAIVAS 136 (369) T ss_dssp SSEEEEEE-SSEEEEEEE--TTTEEEEEEEETTEEEEEETTT--EEEEEE--EE-----TTT-----S-----EEEEEE- T ss_pred ccEEEEEecCCCcceEEEcCCCCEEEEEecCCCceeEeccccccceeecccccc----cccc-----cC--CCceeEEec Confidence 33444555677789999999999985442 47899999887776544443320 0000 01 244566778 Q ss_pred cCCCEEEEEEEcCCcceEEEEEeeCC Q psy7767 164 PDGSILAYMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 164 PDGk~Laf~r~D~s~v~~~~i~~~~~ 189 (200) |++..+++.-.|. ++..+.++.+ T Consensus 137 ~~~~~fVv~lkd~---~~I~vVdy~d 159 (369) T PF02239_consen 137 PGRPEFVVNLKDT---GEIWVVDYSD 159 (369) T ss_dssp SSSSEEEEEETTT---TEEEEEETTT T ss_pred CCCCEEEEEEccC---CeEEEEEecc Confidence 8887665554443 3334445444 No 137 >COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism] Probab=76.63 E-value=65 Score=29.93 Aligned_cols=83 Identities=13% Similarity=0.152 Sum_probs=45.9 Q ss_pred CCCCCceeeEECcCCCEEEEEE---cCcEEE--EECCCCCeE---EEeeCCCcceEecccccceeeeeeeCCCcceEEcc Q psy7767 93 DGSNFLQLVQWSPVGHDLIFVK---DNNLYQ--AYDDFRSIN---ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP 164 (200) Q Consensus 93 t~~~~is~~~wSPDGk~IAFV~---~gdLyv--~~~~~g~~~---rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP 164 (200) .++.+.+.+.|.|+||. ||+. ++.|-+ .+-..|+.. .+..-++ + +.|. ....++.-|| T Consensus 188 ~~G~GPRHi~FHpn~k~-aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~-d--F~g~----------~~~aaIhis~ 253 (346) T COG2706 188 KPGAGPRHIVFHPNGKY-AYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPE-D--FTGT----------NWAAAIHISP 253 (346) T ss_pred CCCCCcceEEEcCCCcE-EEEEeccCCEEEEEEEcCCCceEEEeeeeccCcc-c--cCCC----------CceeEEEECC Confidence 46888999999999986 5554 344444 433334432 2222221 1 1111 1345778999 Q ss_pred CCCEEEEEEEcCCcceEEEEEeeCC Q psy7767 165 DGSILAYMKFNDARVHNMSYIHYGE 189 (200) Q Consensus 165 DGk~Laf~r~D~s~v~~~~i~~~~~ 189 (200) ||++|.-.-.-+..+..|.+...++ T Consensus 254 dGrFLYasNRg~dsI~~f~V~~~~g 278 (346) T COG2706 254 DGRFLYASNRGHDSIAVFSVDPDGG 278 (346) T ss_pred CCCEEEEecCCCCeEEEEEEcCCCC Confidence 9987743311223466666666654 No 138 >KOG0772|consensus Probab=76.56 E-value=3.5 Score=40.18 Aligned_cols=75 Identities=13% Similarity=0.073 Sum_probs=45.1 Q ss_pred ceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEe---eCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 98 LQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALT---RDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 98 is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT---~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) .+.-+|+|||++||=-. +|.|=+++. ++...+.. .+.- +++ ...+.+.||+||++|+ .| T Consensus 320 ~tsC~~nrdg~~iAagc~DGSIQ~W~~-~~~~v~p~~~vk~AH-----~~g----------~~Itsi~FS~dg~~Ll-SR 382 (641) T KOG0772|consen 320 VTSCAWNRDGKLIAAGCLDGSIQIWDK-GSRTVRPVMKVKDAH-----LPG----------QDITSISFSYDGNYLL-SR 382 (641) T ss_pred ceeeecCCCcchhhhcccCCceeeeec-CCcccccceEeeecc-----CCC----------CceeEEEeccccchhh-hc Confidence 46779999999987654 455554443 22221111 1110 000 1346789999999886 67 Q ss_pred EcCCcceEEEEEeeCC Q psy7767 174 FNDARVHNMSYIHYGE 189 (200) Q Consensus 174 ~D~s~v~~~~i~~~~~ 189 (200) ..+...+.+.+..+-. T Consensus 383 g~D~tLKvWDLrq~kk 398 (641) T KOG0772|consen 383 GFDDTLKVWDLRQFKK 398 (641) T ss_pred cCCCceeeeecccccc Confidence 7666677777766644 No 139 >PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain Probab=74.91 E-value=14 Score=24.21 Aligned_cols=29 Identities=10% Similarity=0.166 Sum_probs=24.3 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEEC Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYD 123 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~ 123 (200) ...+..+.|||+.+.||... +|+|++..+ T Consensus 11 ~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl 40 (47) T PF12894_consen 11 PSRVSCMSWCPTMDLIALGTEDGEVLVYRL 40 (47) T ss_pred CCcEEEEEECCCCCEEEEEECCCeEEEEEC Confidence 45688999999999999996 567888766 No 140 >KOG0296|consensus Probab=74.76 E-value=49 Score=31.08 Aligned_cols=67 Identities=16% Similarity=0.244 Sum_probs=44.1 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ..++..+.|.|-+.-+++-. +|.||+..+..+..-++-.... .....=.|.||||+|+-. T Consensus 148 ~~dieWl~WHp~a~illAG~~DGsvWmw~ip~~~~~kv~~Gh~------------------~~ct~G~f~pdGKr~~tg- 208 (399) T KOG0296|consen 148 VEDIEWLKWHPRAHILLAGSTDGSVWMWQIPSQALCKVMSGHN------------------SPCTCGEFIPDGKRILTG- 208 (399) T ss_pred cCceEEEEecccccEEEeecCCCcEEEEECCCcceeeEecCCC------------------CCcccccccCCCceEEEE- Confidence 46788999999888887774 6899999998754444433211 012233689999988644 Q ss_pred EcCCcce Q psy7767 174 FNDARVH 180 (200) Q Consensus 174 ~D~s~v~ 180 (200) .+++-+. T Consensus 209 y~dgti~ 215 (399) T KOG0296|consen 209 YDDGTII 215 (399) T ss_pred ecCceEE Confidence 4444333 No 141 >KOG0265|consensus Probab=74.72 E-value=15 Score=33.70 Aligned_cols=68 Identities=16% Similarity=0.027 Sum_probs=43.1 Q ss_pred CCCceeeEECcCCCEEEEE-EcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) .+.+....|+|||..+|-- .+.+|++.++.+....-.+-.|-. +...++.|.+||+.|+ .. T Consensus 47 ~geI~~~~F~P~gs~~aSgG~Dr~I~LWnv~gdceN~~~lkgHs-----------------gAVM~l~~~~d~s~i~-S~ 108 (338) T KOG0265|consen 47 KGEIYTIKFHPDGSCFASGGSDRAIVLWNVYGDCENFWVLKGHS-----------------GAVMELHGMRDGSHIL-SC 108 (338) T ss_pred cceEEEEEECCCCCeEeecCCcceEEEEeccccccceeeecccc-----------------ceeEeeeeccCCCEEE-Ee Confidence 5778999999999996654 355899988765443322222321 1334678999998774 44 Q ss_pred EcCCcce Q psy7767 174 FNDARVH 180 (200) Q Consensus 174 ~D~s~v~ 180 (200) ..+..+. T Consensus 109 gtDk~v~ 115 (338) T KOG0265|consen 109 GTDKTVR 115 (338) T ss_pred cCCceEE Confidence 4444333 No 142 >KOG2919|consensus Probab=74.66 E-value=10 Score=35.25 Aligned_cols=28 Identities=25% Similarity=0.369 Sum_probs=19.3 Q ss_pred CCcceEEccCCCEEEEEEEcCCcceEEEEE Q psy7767 156 STKTLWFSPDGSILAYMKFNDARVHNMSYI 185 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~ 185 (200) ..-++.|||||.+| |. ...+-+++|... T Consensus 160 aAhsL~Fs~DGeql-fa-GykrcirvFdt~ 187 (406) T KOG2919|consen 160 AAHSLQFSPDGEQL-FA-GYKRCIRVFDTS 187 (406) T ss_pred hheeEEecCCCCeE-ee-cccceEEEeecc Confidence 34568899999998 33 555566666553 No 143 >PF12566 DUF3748: Protein of unknown function (DUF3748); InterPro: IPR022223 This domain family is found in bacteria and eukaryotes, and is approximately 120 amino acids in length. Probab=74.08 E-value=2.6 Score=33.36 Aligned_cols=23 Identities=22% Similarity=0.286 Sum_probs=18.1 Q ss_pred CCceeeEECcCCCEEEEEEcCcE Q psy7767 96 NFLQLVQWSPVGHDLIFVKDNNL 118 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~gdL 118 (200) ++----.|||||++|.|+.+..| T Consensus 68 GGtHvHvfSpDG~~lSFTYNDhV 90 (122) T PF12566_consen 68 GGTHVHVFSPDGSWLSFTYNDHV 90 (122) T ss_pred CCccceEECCCCCEEEEEecchh Confidence 33445589999999999987765 No 144 >PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation []. Probab=73.96 E-value=48 Score=34.42 Aligned_cols=73 Identities=19% Similarity=0.194 Sum_probs=41.1 Q ss_pred ceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 98 LQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) -..++|.|.|..||-+.+ .+|-...-.|-..-..+-. +.-++ .....+.|++||.-||..- T Consensus 259 e~~l~WrPsG~lIA~~q~~~~~~~VvFfErNGLrhgeF~l~------------~~~~~---~~v~~l~Wn~ds~iLAv~~ 323 (928) T PF04762_consen 259 EGALSWRPSGNLIASSQRLPDRHDVVFFERNGLRHGEFTLR------------FDPEE---EKVIELAWNSDSEILAVWL 323 (928) T ss_pred cCCccCCCCCCEEEEEEEcCCCcEEEEEecCCcEeeeEecC------------CCCCC---ceeeEEEECCCCCEEEEEe Confidence 367799999999999864 1222222111111111100 00001 1346789999999998755 Q ss_pred EcCCcceEEEEEee Q psy7767 174 FNDARVHNMSYIHY 187 (200) Q Consensus 174 ~D~s~v~~~~i~~~ 187 (200) .| .++.++..+| T Consensus 324 ~~--~vqLWt~~NY 335 (928) T PF04762_consen 324 ED--RVQLWTRSNY 335 (928) T ss_pred cC--CceEEEeeCC Confidence 33 3888877766 No 145 >PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. Probab=73.69 E-value=12 Score=33.97 Aligned_cols=26 Identities=19% Similarity=0.146 Sum_probs=18.3 Q ss_pred ceeeEECcCCCEEEEEEcC-------cEEEEECC Q psy7767 98 LQLVQWSPVGHDLIFVKDN-------NLYQAYDD 124 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~g-------dLyv~~~~ 124 (200) .-...|||||++| |+.++ -|-|.++. T Consensus 53 yGHg~fs~dG~~L-ytTEnd~~~g~G~IgVyd~~ 85 (305) T PF07433_consen 53 YGHGVFSPDGRLL-YTTENDYETGRGVIGVYDAA 85 (305) T ss_pred ecCEEEcCCCCEE-EEeccccCCCcEEEEEEECc Confidence 4577999999986 66544 46666665 No 146 >PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A. Probab=73.65 E-value=29 Score=31.27 Aligned_cols=91 Identities=10% Similarity=0.030 Sum_probs=44.8 Q ss_pred eeeeeeeeeeeeeeee---eeecceEEEEeccCcceeEEEEEeeeeeeeeeecCC-CCccceeecCCCCCceeeEECcCC Q psy7767 32 DIALKLSIRCLTWKQS---EYWDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRP-DTLTDVSDLDGSNFLQLVQWSPVG 107 (200) Q Consensus 32 ~~~~~~~~~~~~~~~~---~~~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~-~~~~~v~~lt~~~~is~~~wSPDG 107 (200) -+.||-+=.+-+|..- +-+.+..+.+...+.. .+.+..+ .+. .|.... ..-.+.......+.+.+..-+||| T Consensus 81 g~ll~T~DgG~tW~~v~l~~~lpgs~~~i~~l~~~-~~~l~~~-~G~--iy~T~DgG~tW~~~~~~~~gs~~~~~r~~dG 156 (302) T PF14870_consen 81 GLLLHTTDGGKTWERVPLSSKLPGSPFGITALGDG-SAELAGD-RGA--IYRTTDGGKTWQAVVSETSGSINDITRSSDG 156 (302) T ss_dssp TEEEEESSTTSS-EE----TT-SS-EEEEEEEETT-EEEEEET-T----EEEESSTTSSEEEEE-S----EEEEEE-TTS T ss_pred ceEEEecCCCCCcEEeecCCCCCCCeeEEEEcCCC-cEEEEcC-CCc--EEEeCCCCCCeeEcccCCcceeEeEEECCCC Confidence 4667777788999873 3466776665543332 3333333 222 222211 111122222234567888899999 Q ss_pred CEEEEEEcCcEEEEECCCC Q psy7767 108 HDLIFVKDNNLYQAYDDFR 126 (200) Q Consensus 108 k~IAFV~~gdLyv~~~~~g 126 (200) +.||...+|++|+.--.+. T Consensus 157 ~~vavs~~G~~~~s~~~G~ 175 (302) T PF14870_consen 157 RYVAVSSRGNFYSSWDPGQ 175 (302) T ss_dssp -EEEEETTSSEEEEE-TT- T ss_pred cEEEEECcccEEEEecCCC Confidence 9999999999998754443 No 147 >KOG1063|consensus Probab=72.76 E-value=6.5 Score=39.43 Aligned_cols=70 Identities=17% Similarity=0.062 Sum_probs=42.5 Q ss_pred CCCceeeEECcCCCEEEEEEcC------cEEEEECCCCCeEE-EeeCCCcceEecccccceeeeeeeCCCcceEEccCCC Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDN------NLYQAYDDFRSINA-LTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS 167 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~g------dLyv~~~~~g~~~r-LT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk 167 (200) +-++..++.||+|+.||-.... -|++.+..+-...+ |-. -. -....+.|||||+ T Consensus 525 GyEv~~l~~s~~gnliASaCKS~~~ehAvI~lw~t~~W~~~~~L~~-Hs------------------LTVT~l~FSpdg~ 585 (764) T KOG1063|consen 525 GYEVYALAISPTGNLIASACKSSLKEHAVIRLWNTANWLQVQELEG-HS------------------LTVTRLAFSPDGR 585 (764) T ss_pred ceeEEEEEecCCCCEEeehhhhCCccceEEEEEeccchhhhheecc-cc------------------eEEEEEEECCCCc Confidence 5668999999999999887532 35555443221111 111 00 0234678999999 Q ss_pred EEEEEEEcCCcceEEEE Q psy7767 168 ILAYMKFNDARVHNMSY 184 (200) Q Consensus 168 ~Laf~r~D~s~v~~~~i 184 (200) +|+-+..| +.+..|.. T Consensus 586 ~LLsvsRD-Rt~sl~~~ 601 (764) T KOG1063|consen 586 YLLSVSRD-RTVSLYEV 601 (764) T ss_pred EEEEeecC-ceEEeeee Confidence 99766444 44555554 No 148 >PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats. Probab=72.30 E-value=29 Score=33.99 Aligned_cols=80 Identities=13% Similarity=0.145 Sum_probs=56.0 Q ss_pred ceeecCCCCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCC Q psy7767 88 DVSDLDGSNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDG 166 (200) Q Consensus 88 ~v~~lt~~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDG 166 (200) .+++++-+..+...++||+.++++.-. +|.|-+.+...+.+ +.+..+- ...-++|.||| T Consensus 252 svtsipL~s~v~~ca~sp~E~kLvlGC~DgSiiLyD~~~~~t-~~~ka~~-------------------~P~~iaWHp~g 311 (545) T PF11768_consen 252 SVTSIPLPSQVICCARSPSEDKLVLGCEDGSIILYDTTRGVT-LLAKAEF-------------------IPTLIAWHPDG 311 (545) T ss_pred EEEEEecCCcceEEecCcccceEEEEecCCeEEEEEcCCCee-eeeeecc-------------------cceEEEEcCCC Confidence 444554577889999999999988885 67888887654433 2332211 35568999999 Q ss_pred CEEEEEEEcCCcceEEEEEeeC Q psy7767 167 SILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 167 k~Laf~r~D~s~v~~~~i~~~~ 188 (200) ..+ .+..+.+.++.|.+.... T Consensus 312 ai~-~V~s~qGelQ~FD~ALsp 332 (545) T PF11768_consen 312 AIF-VVGSEQGELQCFDMALSP 332 (545) T ss_pred cEE-EEEcCCceEEEEEeecCc Confidence 755 466777888888877664 No 149 >PRK10115 protease 2; Provisional Probab=72.18 E-value=94 Score=30.95 Aligned_cols=35 Identities=6% Similarity=0.135 Sum_probs=26.4 Q ss_pred eeeEECcCCCEEEEEEc-------CcEEEEECCCC--CeEEEee Q psy7767 99 QLVQWSPVGHDLIFVKD-------NNLYQAYDDFR--SINALTR 133 (200) Q Consensus 99 s~~~wSPDGk~IAFV~~-------gdLyv~~~~~g--~~~rLT~ 133 (200) ..+.|+|||+.+.|++. .+||..++.++ +.+.|.. T Consensus 175 ~~~~w~~D~~~~~y~~~~~~~~~~~~v~~h~lgt~~~~d~lv~~ 218 (686) T PRK10115 175 PSFVWANDSWTFYYVRKHPVTLLPYQVWRHTIGTPASQDELVYE 218 (686) T ss_pred eEEEEeeCCCEEEEEEecCCCCCCCEEEEEECCCChhHCeEEEe Confidence 46999999999999963 37888888777 3444544 No 150 >PLN02919 haloacid dehalogenase-like hydrolase family protein Probab=72.02 E-value=26 Score=36.74 Aligned_cols=81 Identities=15% Similarity=0.212 Sum_probs=46.0 Q ss_pred eeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcC Q psy7767 99 QLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFND 176 (200) Q Consensus 99 s~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~ 176 (200) .+++++|++..|.... ++.|++.+..++....+..+|... ..+|...- ...|....++.++|||++|+.+-... T Consensus 686 ~gVa~dp~~g~LyVad~~~~~I~v~d~~~g~v~~~~G~G~~~-~~~g~~~~---~~~~~~P~GIavspdG~~LYVADs~n 761 (1057) T PLN02919 686 WDVCFEPVNEKVYIAMAGQHQIWEYNISDGVTRVFSGDGYER-NLNGSSGT---STSFAQPSGISLSPDLKELYIADSES 761 (1057) T ss_pred eEEEEecCCCeEEEEECCCCeEEEEECCCCeEEEEecCCccc-cCCCCccc---cccccCccEEEEeCCCCEEEEEECCC Confidence 4678888766554332 357888888777665555443211 01221100 01245678899999999886554444 Q ss_pred CcceEEE Q psy7767 177 ARVHNMS 183 (200) Q Consensus 177 s~v~~~~ 183 (200) ..+..+. T Consensus 762 ~~Irv~D 768 (1057) T PLN02919 762 SSIRALD 768 (1057) T ss_pred CeEEEEE Confidence 4455443 No 151 >PRK13614 lipoprotein LpqB; Provisional Probab=71.67 E-value=36 Score=33.49 Aligned_cols=77 Identities=13% Similarity=0.003 Sum_probs=46.9 Q ss_pred CCCceeeEECcCCCEEEEEE--cCc--EEEEEC---CCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCC Q psy7767 95 SNFLQLVQWSPVGHDLIFVK--DNN--LYQAYD---DFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS 167 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~--~gd--Lyv~~~---~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk 167 (200) +..|..+++|+||-++|.+. ++. |++.-+ .+|.+++|+.--. + +. .+...++.|..|++ T Consensus 433 g~~I~~lrvSrDG~R~Avi~~~~g~~~V~va~V~R~~~G~P~~L~~~~~--~---~~---------~~~~~sl~W~~~~s 498 (573) T PRK13614 433 GRTVKELRVSREGVRALVISEQNGKSRVQVAGIVRNEDGTPRELTAPIT--L---AA---------DSDADTGAWVGDST 498 (573) T ss_pred CCeeEEEEECCCccEEEEEEEeCCccEEEEEEEEeCCCCCeEEccCcee--c---cc---------CCCcceeEEcCCCE Confidence 34599999999999999998 554 887533 2455566654210 0 00 02456789999986 Q ss_pred EEEEEEE-cCCcceEEEEEe Q psy7767 168 ILAYMKF-NDARVHNMSYIH 186 (200) Q Consensus 168 ~Laf~r~-D~s~v~~~~i~~ 186 (200) |+-++. ..+....+.+.. T Consensus 499 -l~V~~~~~~~~~~~~~v~v 517 (573) T PRK13614 499 -VVVTKASATSNVVPELLSV 517 (573) T ss_pred -EEEEeccCCCcceEEEEEe Confidence 555543 333344444444 No 152 >TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs. Probab=70.78 E-value=22 Score=32.18 Aligned_cols=13 Identities=8% Similarity=-0.033 Sum_probs=10.4 Q ss_pred CCcceEEccCCCE Q psy7767 156 STKTLWFSPDGSI 168 (200) Q Consensus 156 ~~~~~~WSPDGk~ 168 (200) ...++.|+|+|+. T Consensus 185 np~Gl~~d~~G~l 197 (367) T TIGR02604 185 NPYGHSVDSWGDV 197 (367) T ss_pred CCccceECCCCCE Confidence 4568999999974 No 153 >KOG2111|consensus Probab=70.14 E-value=32 Score=31.75 Aligned_cols=77 Identities=16% Similarity=0.126 Sum_probs=45.0 Q ss_pred CCceeeEECcCCCEEEEEEcC--cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 96 NFLQLVQWSPVGHDLIFVKDN--NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~g--dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) .++..++++-+|..||=.+.. =|-+.+..+|+..+=-.-|... .....+.||||+.+|| +. T Consensus 182 s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~----------------A~iy~iaFSp~~s~La-vs 244 (346) T KOG2111|consen 182 SDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDR----------------ADIYCIAFSPNSSWLA-VS 244 (346) T ss_pred CceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCch----------------heEEEEEeCCCccEEE-EE Confidence 456666777777777665421 2333344444432222212100 1224678999999998 56 Q ss_pred EcCCcceEEEEEeeCC Q psy7767 174 FNDARVHNMSYIHYGE 189 (200) Q Consensus 174 ~D~s~v~~~~i~~~~~ 189 (200) .|.+-+.+|.+-.... T Consensus 245 SdKgTlHiF~l~~~~~ 260 (346) T KOG2111|consen 245 SDKGTLHIFSLRDTEN 260 (346) T ss_pred cCCCeEEEEEeecCCC Confidence 7888899988877553 No 154 >KOG2048|consensus Probab=69.32 E-value=1.1e+02 Score=30.86 Aligned_cols=76 Identities=9% Similarity=0.026 Sum_probs=46.3 Q ss_pred CCCceeeEECcCCCEEEEEEcCcEEEEECCCC-CeE--EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFR-SIN--ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g-~~~--rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) ...++..+.||||++||+-.=.++.+..+... ..+ .+..... + .-....+.|+-|+..|.+ T Consensus 382 ~~nIs~~aiSPdg~~Ia~st~~~~~iy~L~~~~~vk~~~v~~~~~---------------~-~~~a~~i~ftid~~k~~~ 445 (691) T KOG2048|consen 382 KENISCAAISPDGNLIAISTVSRTKIYRLQPDPNVKVINVDDVPL---------------A-LLDASAISFTIDKNKLFL 445 (691) T ss_pred ccceeeeccCCCCCEEEEeeccceEEEEeccCcceeEEEeccchh---------------h-hccceeeEEEecCceEEE Confidence 46689999999999999997554444433321 211 1111111 1 113456789999998888 Q ss_pred EEEcCCcceEEEEEe Q psy7767 172 MKFNDARVHNMSYIH 186 (200) Q Consensus 172 ~r~D~s~v~~~~i~~ 186 (200) +..+......+.... T Consensus 446 ~s~~~~~le~~el~~ 460 (691) T KOG2048|consen 446 VSKNIFSLEEFELET 460 (691) T ss_pred EecccceeEEEEecC Confidence 876666555555433 No 155 >COG2319 FOG: WD40 repeat [General function prediction only] Probab=67.79 E-value=50 Score=26.82 Aligned_cols=68 Identities=21% Similarity=0.165 Sum_probs=41.0 Q ss_pred CCCceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ...+..+.++|+|+.++... ++.+.+.++..+.......... .....+.|+|+|+.++.. T Consensus 155 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~v~~~~~~~~~~~~~~~ 216 (466) T COG2319 155 SESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHT------------------DPVSSLAFSPDGGLLIAS 216 (466) T ss_pred cccEEEEEECCCCCEEEecCCCCCceEEEEcCCCceEEeeccCC------------------CceEEEEEcCCcceEEEE Confidence 45667899999999766664 5677877776544433333211 123467888999833333 Q ss_pred EEcCCcce Q psy7767 173 KFNDARVH 180 (200) Q Consensus 173 r~D~s~v~ 180 (200) ...++.+. T Consensus 217 ~~~d~~i~ 224 (466) T COG2319 217 GSSDGTIR 224 (466) T ss_pred ecCCCcEE Confidence 24444444 No 156 >KOG1523|consensus Probab=67.68 E-value=21 Score=32.97 Aligned_cols=75 Identities=21% Similarity=0.211 Sum_probs=40.1 Q ss_pred CCCceeeEECcCCCEEEEEE-cC-----cEEEEECCCCCeEEEeeCCCcceEecccc--cceeeeeeeCCCcceEEccCC Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DN-----NLYQAYDDFRSINALTRDGIKGVLFNGVA--DWVYEEEVFSSTKTLWFSPDG 166 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~g-----dLyv~~~~~g~~~rLT~dg~~~~~~nG~p--dwv~eEEi~~~~~~~~WSPDG 166 (200) ...+..+.|.|++-.+|--. ++ ..|+..++..... +.=|.+ +..|.. +|..+ -|-..++.|||+| T Consensus 146 rStv~sldWhpnnVLlaaGs~D~k~rVfSayIK~Vdekpap--~pWgsk--~PFG~lm~E~~~~---ggwvh~v~fs~sG 218 (361) T KOG1523|consen 146 RSTVTSLDWHPNNVLLAAGSTDGKCRVFSAYIKGVDEKPAP--TPWGSK--MPFGQLMSEASSS---GGWVHGVLFSPSG 218 (361) T ss_pred ccceeeeeccCCcceecccccCcceeEEEEeeeccccCCCC--CCCccC--CcHHHHHHhhccC---CCceeeeEeCCCC Confidence 34567788888877666543 22 4777777754421 111111 011110 00000 0123467899999 Q ss_pred CEEEEEEEcC Q psy7767 167 SILAYMKFND 176 (200) Q Consensus 167 k~Laf~r~D~ 176 (200) .+|||+..|. T Consensus 219 ~~lawv~Hds 228 (361) T KOG1523|consen 219 NRLAWVGHDS 228 (361) T ss_pred CEeeEecCCC Confidence 9999997664 No 157 >PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A. Probab=67.46 E-value=47 Score=29.04 Aligned_cols=40 Identities=23% Similarity=0.352 Sum_probs=28.2 Q ss_pred CCceeeEECcCCCEEEEEEcC--cEEEEECCCCCeEEEeeCC Q psy7767 96 NFLQLVQWSPVGHDLIFVKDN--NLYQAYDDFRSINALTRDG 135 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~g--dLyv~~~~~g~~~rLT~dg 135 (200) .+++.++|.||.+.|.-|.+. .|+..+.++.-.+++.-.| T Consensus 22 ~e~SGLTy~pd~~tLfaV~d~~~~i~els~~G~vlr~i~l~g 63 (248) T PF06977_consen 22 DELSGLTYNPDTGTLFAVQDEPGEIYELSLDGKVLRRIPLDG 63 (248) T ss_dssp S-EEEEEEETTTTEEEEEETTTTEEEEEETT--EEEEEE-SS T ss_pred CCccccEEcCCCCeEEEEECCCCEEEEEcCCCCEEEEEeCCC Confidence 468999999999999888765 6888887654455555444 No 158 >KOG1524|consensus Probab=67.38 E-value=6.4 Score=38.74 Aligned_cols=88 Identities=17% Similarity=0.233 Sum_probs=47.9 Q ss_pred eEEEEEeeeeeeeeeecCCCCccceeecC--CCCCceeeEECcCCCEEEEEEcCcEEEEECCCCCeEEEeeCCC--cceE Q psy7767 65 KLSIRCLTWKQSHSLSGRPDTLTDVSDLD--GSNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRSINALTRDGI--KGVL 140 (200) Q Consensus 65 kv~i~~n~~~~R~s~~a~~~~~~~v~~lt--~~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~~~rLT~dg~--~~~~ 140 (200) -+...+..||.-+...-+.. +++.- -.+.+...+|+|||.-+.-+- ++|..+-.+..|- ..+. T Consensus 76 d~~~i~s~DGkf~il~k~~r----VE~sv~AH~~A~~~gRW~~dGtgLlt~G---------EDG~iKiWSrsGMLRStl~ 142 (737) T KOG1524|consen 76 DTLLICSNDGRFVILNKSAR----VERSISAHAAAISSGRWSPDGAGLLTAG---------EDGVIKIWSRSGMLRSTVV 142 (737) T ss_pred ceEEEEcCCceEEEecccch----hhhhhhhhhhhhhhcccCCCCceeeeec---------CCceEEEEeccchHHHHHh Confidence 34444444666555544442 23221 135568889999998875442 2333333443331 0111 Q ss_pred ecccccceeeeeeeCCCcceEEccCCCEEEEEEEcC Q psy7767 141 FNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFND 176 (200) Q Consensus 141 ~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~ 176 (200) .++. .....+|.||+..++|...++ T Consensus 143 Q~~~-----------~v~c~~W~p~S~~vl~c~g~h 167 (737) T KOG1524|consen 143 QNEE-----------SIRCARWAPNSNSIVFCQGGH 167 (737) T ss_pred hcCc-----------eeEEEEECCCCCceEEecCCe Confidence 2222 234678999999999987665 No 159 >KOG2314|consensus Probab=67.19 E-value=12 Score=37.10 Aligned_cols=60 Identities=17% Similarity=0.284 Sum_probs=40.8 Q ss_pred CCCceeeEECcCCCEEEEEE----cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK----DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~----~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) ......+.|||.|+.|+-.. +|+|+..+.+....+++ ..++ . ...+.+.|-|.|++++ T Consensus 492 k~~~N~vfwsPkG~fvvva~l~s~~g~l~F~D~~~a~~k~~-~~~e-h----------------~~at~veWDPtGRYvv 553 (698) T KOG2314|consen 492 KKFANTVFWSPKGRFVVVAALVSRRGDLEFYDTDYADLKDT-ASPE-H----------------FAATEVEWDPTGRYVV 553 (698) T ss_pred ccccceEEEcCCCcEEEEEEecccccceEEEecchhhhhhc-cCcc-c----------------cccccceECCCCCEEE Confidence 34567889999999998773 67899888765333322 2121 0 1345678999999886 Q ss_pred EE Q psy7767 171 YM 172 (200) Q Consensus 171 f~ 172 (200) -. T Consensus 554 T~ 555 (698) T KOG2314|consen 554 TS 555 (698) T ss_pred Ee Confidence 54 No 160 >KOG1539|consensus Probab=66.90 E-value=55 Score=33.79 Aligned_cols=31 Identities=19% Similarity=0.291 Sum_probs=23.2 Q ss_pred CCcceEEccCCCEEEEEEEcCCcceEEEEEee Q psy7767 156 STKTLWFSPDGSILAYMKFNDARVHNMSYIHY 187 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~ 187 (200) +..++.||||||+|+-+..| +.+..+.++.. T Consensus 578 ritd~~FS~DgrWlisasmD-~tIr~wDlpt~ 608 (910) T KOG1539|consen 578 RITDMTFSPDGRWLISASMD-STIRTWDLPTG 608 (910) T ss_pred ceeeeEeCCCCcEEEEeecC-CcEEEEeccCc Confidence 56789999999999876665 45777766544 No 161 >PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal Probab=65.21 E-value=26 Score=31.62 Aligned_cols=34 Identities=12% Similarity=0.081 Sum_probs=26.2 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCe Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSI 128 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~ 128 (200) ....+-++||||+..+||.. .|.|.+.++-+... T Consensus 43 ~PQWRkl~WSpD~tlLa~a~S~G~i~vfdl~g~~l 77 (282) T PF15492_consen 43 NPQWRKLAWSPDCTLLAYAESTGTIRVFDLMGSEL 77 (282) T ss_pred CchheEEEECCCCcEEEEEcCCCeEEEEeccccee Confidence 45678999999999999996 45677777765443 No 162 >COG3490 Uncharacterized protein conserved in bacteria [Function unknown] Probab=63.26 E-value=45 Score=30.77 Aligned_cols=63 Identities=19% Similarity=0.162 Sum_probs=37.9 Q ss_pred CCCceeeEECcC-CCEEEEEEcC--cEEEEECCCCCeEEEee-CCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPV-GHDLIFVKDN--NLYQAYDDFRSINALTR-DGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPD-Gk~IAFV~~g--dLyv~~~~~g~~~rLT~-dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) ....-+++++|- ...++|.++- ..++.+..+++..++-. ...+. ++|- =.|||||+.| T Consensus 67 paR~Hgi~~~p~~~ravafARrPGtf~~vfD~~~~~~pv~~~s~~~RH--fyGH---------------Gvfs~dG~~L- 128 (366) T COG3490 67 PARGHGIAFHPALPRAVAFARRPGTFAMVFDPNGAQEPVTLVSQEGRH--FYGH---------------GVFSPDGRLL- 128 (366) T ss_pred ccccCCeecCCCCcceEEEEecCCceEEEECCCCCcCcEEEecccCce--eecc---------------cccCCCCcEE- Confidence 345567789996 6778888753 45566666666544433 22333 3332 2589999866 Q ss_pred EEEEc Q psy7767 171 YMKFN 175 (200) Q Consensus 171 f~r~D 175 (200) |...+ T Consensus 129 YATEn 133 (366) T COG3490 129 YATEN 133 (366) T ss_pred EeecC Confidence 55433 No 163 >PLN02919 haloacid dehalogenase-like hydrolase family protein Probab=62.66 E-value=38 Score=35.54 Aligned_cols=18 Identities=22% Similarity=0.449 Sum_probs=13.7 Q ss_pred eCCCcceEEccCCCEEEEEE Q psy7767 154 FSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 154 ~~~~~~~~WSPDGk~Laf~r 173 (200) |....++.++|||+ | |+. T Consensus 858 l~~P~GIavd~dG~-l-yVa 875 (1057) T PLN02919 858 LSEPAGLALGENGR-L-FVA 875 (1057) T ss_pred cCCceEEEEeCCCC-E-EEE Confidence 55778899999996 4 553 No 164 >KOG1446|consensus Probab=61.10 E-value=99 Score=28.32 Aligned_cols=37 Identities=11% Similarity=-0.018 Sum_probs=30.1 Q ss_pred ceeeEECcCCCEEEEEEc-CcEEEEECCCCCeEEEeeC Q psy7767 98 LQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSINALTRD 134 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~~rLT~d 134 (200) ..++.++|||+.|.--.+ |.|.+.++++|.....+.. T Consensus 235 ~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~~v~~~~~ 272 (311) T KOG1446|consen 235 PLSATFTPDSKFVLSGSDDGTIHVWNLETGKKVAVLRG 272 (311) T ss_pred ceeEEECCCCcEEEEecCCCcEEEEEcCCCcEeeEecC Confidence 378999999999877765 7899999998887766653 No 165 >KOG1009|consensus Probab=61.06 E-value=8.6 Score=36.34 Aligned_cols=64 Identities=16% Similarity=0.094 Sum_probs=45.2 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ..++.++.||||+..+++-+ ++-+++.++..|....+-.+.+ . -..++.|-|=+++++-.. T Consensus 123 ~~diydL~Ws~d~~~l~s~s~dns~~l~Dv~~G~l~~~~~dh~--~----------------yvqgvawDpl~qyv~s~s 184 (434) T KOG1009|consen 123 RDDIYDLAWSPDSNFLVSGSVDNSVRLWDVHAGQLLAILDDHE--H----------------YVQGVAWDPLNQYVASKS 184 (434) T ss_pred ccchhhhhccCCCceeeeeeccceEEEEEeccceeEeeccccc--c----------------ccceeecchhhhhhhhhc Confidence 47899999999999999986 5668888888887766655432 1 223456777777666554 Q ss_pred EcC Q psy7767 174 FND 176 (200) Q Consensus 174 ~D~ 176 (200) .|. T Consensus 185 ~dr 187 (434) T KOG1009|consen 185 SDR 187 (434) T ss_pred cCc Confidence 443 No 166 >PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm Probab=60.32 E-value=63 Score=29.96 Aligned_cols=31 Identities=26% Similarity=0.387 Sum_probs=25.3 Q ss_pred CCceeeEECcCCCEEEEEEc-CcEEEEECCCC Q psy7767 96 NFLQLVQWSPVGHDLIFVKD-NNLYQAYDDFR 126 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~-gdLyv~~~~~g 126 (200) +.+..++.||+|+.||+..+ |.+|+.+.+-. T Consensus 217 ~~i~~iavSpng~~iAl~t~~g~l~v~ssDf~ 248 (410) T PF04841_consen 217 GPIIKIAVSPNGKFIALFTDSGNLWVVSSDFS 248 (410) T ss_pred CCeEEEEECCCCCEEEEEECCCCEEEEECccc Confidence 45788999999999999975 78888876544 No 167 >PRK02888 nitrous-oxide reductase; Validated Probab=60.32 E-value=28 Score=34.77 Aligned_cols=51 Identities=10% Similarity=0.010 Sum_probs=30.0 Q ss_pred EECcCCCEEEEEEcCcEEEEECCC----C-CeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 102 QWSPVGHDLIFVKDNNLYQAYDDF----R-SINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 102 ~wSPDGk~IAFV~~gdLyv~~~~~----g-~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ++.+|||. +|+.++.|-+.|... + ........| .+..++..||||++|+-. T Consensus 283 a~vkdGK~-~~V~gn~V~VID~~t~~~~~~~v~~yIPVG-------------------KsPHGV~vSPDGkylyVa 338 (635) T PRK02888 283 EAVKAGKF-KTIGGSKVPVVDGRKAANAGSALTRYVPVP-------------------KNPHGVNTSPDGKYFIAN 338 (635) T ss_pred HhhhCCCE-EEECCCEEEEEECCccccCCcceEEEEECC-------------------CCccceEECCCCCEEEEe Confidence 45667774 455555677776554 1 122222222 256788999999988543 No 168 >KOG0286|consensus Probab=59.00 E-value=74 Score=29.29 Aligned_cols=56 Identities=18% Similarity=0.138 Sum_probs=38.1 Q ss_pred CCCceeeEECc-CCCEEEEEEcC---cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSP-VGHDLIFVKDN---NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSP-DGk~IAFV~~g---dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) .+++..+.++| |++. |+.++ --++.++..+.-+|--+..+ +-.+++.|-|+|..++ T Consensus 186 ~gDV~slsl~p~~~nt--FvSg~cD~~aklWD~R~~~c~qtF~ghe------------------sDINsv~ffP~G~afa 245 (343) T KOG0286|consen 186 TGDVMSLSLSPSDGNT--FVSGGCDKSAKLWDVRSGQCVQTFEGHE------------------SDINSVRFFPSGDAFA 245 (343) T ss_pred cccEEEEecCCCCCCe--EEecccccceeeeeccCcceeEeecccc------------------cccceEEEccCCCeee Confidence 57889999999 8887 77654 56667777776555443222 1346788889987554 No 169 >PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g .... Probab=58.64 E-value=29 Score=20.12 Aligned_cols=27 Identities=22% Similarity=0.292 Sum_probs=20.8 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQA 121 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~ 121 (200) ...+..++|+|+++.|+=... +.|.+. T Consensus 11 ~~~i~~i~~~~~~~~~~s~~~D~~i~vw 38 (39) T PF00400_consen 11 SSSINSIAWSPDGNFLASGSSDGTIRVW 38 (39) T ss_dssp SSSEEEEEEETTSSEEEEEETTSEEEEE T ss_pred CCcEEEEEEecccccceeeCCCCEEEEE Confidence 467899999999999887764 555554 No 170 >KOG1920|consensus Probab=57.42 E-value=36 Score=36.39 Aligned_cols=62 Identities=15% Similarity=0.189 Sum_probs=39.7 Q ss_pred CCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 96 NFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) ..+..+.+--|+..|.++. .|+|.+.+.++....-+.. +-+| ....+||||++.|+++.. T Consensus 69 ~~i~s~~fl~d~~~i~v~~~~G~iilvd~et~~~eivg~------vd~G-------------I~aaswS~Dee~l~liT~ 129 (1265) T KOG1920|consen 69 DEIVSVQFLADTNSICVITALGDIILVDPETLELEIVGN------VDNG-------------ISAASWSPDEELLALITG 129 (1265) T ss_pred cceEEEEEecccceEEEEecCCcEEEEcccccceeeeee------ccCc-------------eEEEeecCCCcEEEEEeC Confidence 3567777777777777774 4677777554444333322 1222 346799999999998865 Q ss_pred cC Q psy7767 175 ND 176 (200) Q Consensus 175 D~ 176 (200) ++ T Consensus 130 ~~ 131 (1265) T KOG1920|consen 130 RQ 131 (1265) T ss_pred Cc Confidence 54 No 171 >KOG1920|consensus Probab=55.93 E-value=63 Score=34.63 Aligned_cols=27 Identities=22% Similarity=0.351 Sum_probs=23.1 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQA 121 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~ 121 (200) ..++..+.||||++.+|++.+ +.|+++ T Consensus 109 d~GI~aaswS~Dee~l~liT~~~tll~m 136 (1265) T KOG1920|consen 109 DNGISAASWSPDEELLALITGRQTLLFM 136 (1265) T ss_pred cCceEEEeecCCCcEEEEEeCCcEEEEE Confidence 578999999999999999987 456665 No 172 >COG5037 TOS9 Gluconate transport-inducing protein [Signal transduction mechanisms / Carbohydrate transport and metabolism] Probab=55.18 E-value=12 Score=33.03 Aligned_cols=39 Identities=18% Similarity=0.378 Sum_probs=27.7 Q ss_pred EEEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEcc---CCCEEEEEEE Q psy7767 111 IFVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP---DGSILAYMKF 174 (200) Q Consensus 111 AFV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP---DGk~Laf~r~ 174 (200) --++.|+|||.+.+....+|+|+ +.+||| -|+.|+|-.. T Consensus 42 ~lIrsGsIFVf~Ee~sGiKRWTD-------------------------G~SWSPSRIsG~FLvYrEl 83 (248) T COG5037 42 ELIRSGSIFVFDEEESGIKRWTD-------------------------GKSWSPSRISGKFLVYREL 83 (248) T ss_pred eeeecCCEEEEeeccccceeccC-------------------------CCccCccceeeeeEeeeeh Confidence 34667888888777677788886 347888 4777877543 No 173 >KOG4476|consensus Probab=55.18 E-value=12 Score=33.03 Aligned_cols=39 Identities=18% Similarity=0.378 Sum_probs=27.7 Q ss_pred EEEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEcc---CCCEEEEEEE Q psy7767 111 IFVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP---DGSILAYMKF 174 (200) Q Consensus 111 AFV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP---DGk~Laf~r~ 174 (200) --++.|+|||.+.+....+|+|+ +.+||| -|+.|+|-.. T Consensus 42 ~lIrsGsIFVf~Ee~sGiKRWTD-------------------------G~SWSPSRIsG~FLvYrEl 83 (248) T KOG4476|consen 42 ELIRSGSIFVFDEEESGIKRWTD-------------------------GKSWSPSRISGKFLVYREL 83 (248) T ss_pred eeeecCCEEEEeeccccceeccC-------------------------CCccCccceeeeeEeeeeh Confidence 34667888888777677788886 347888 4777877543 No 174 >KOG0772|consensus Probab=55.16 E-value=23 Score=34.73 Aligned_cols=85 Identities=13% Similarity=0.136 Sum_probs=47.8 Q ss_pred CCCceeeEECcCCCEEEEEEcC-cEEEEECCCCCeEEEeeCCCcce--EecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDN-NLYQAYDDFRSINALTRDGIKGV--LFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~g-dLyv~~~~~g~~~rLT~dg~~~~--~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) +..+..++|||.|..|.-+.+. +.-+.+-++.+...... |+.-+ +.|-.. .+ .....-.|-|+.+-... T Consensus 214 ~h~i~sl~ys~Tg~~iLvvsg~aqakl~DRdG~~~~e~~K-GDQYI~Dm~nTKG-Hi------a~lt~g~whP~~k~~Fl 285 (641) T KOG0772|consen 214 THQINSLQYSVTGDQILVVSGSAQAKLLDRDGFEIVEFSK-GDQYIRDMYNTKG-HI------AELTCGCWHPDNKEEFL 285 (641) T ss_pred ccccceeeecCCCCeEEEEecCcceeEEccCCceeeeeec-cchhhhhhhccCC-ce------eeeeccccccCcccceE Confidence 5678899999999999999865 33334444333322222 21100 000000 00 01223469999887766 Q ss_pred EEEcCCcceEEEEEee Q psy7767 172 MKFNDARVHNMSYIHY 187 (200) Q Consensus 172 ~r~D~s~v~~~~i~~~ 187 (200) ...+++.++++.+... T Consensus 286 T~s~DgtlRiWdv~~~ 301 (641) T KOG0772|consen 286 TCSYDGTLRIWDVNNT 301 (641) T ss_pred EecCCCcEEEEecCCc Confidence 6777777777776655 No 175 >COG3391 Uncharacterized conserved protein [Function unknown] Probab=54.57 E-value=1.4e+02 Score=27.26 Aligned_cols=59 Identities=14% Similarity=0.060 Sum_probs=34.9 Q ss_pred CCCceeeEECcCCCEEEEEEc----CcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD----NNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~----gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) +.....++++|+|+.+..... +.+.+.+..++...+....|. ...++..+|||+++. T Consensus 115 G~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~vG~-------------------~P~~~a~~p~g~~vy 175 (381) T COG3391 115 GLGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGN-------------------TPTGVAVDPDGNKVY 175 (381) T ss_pred ccCCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEEEecCC-------------------CcceEEECCCCCeEE Confidence 556688899999988644433 345555544443333332221 234678999999774 Q ss_pred EE Q psy7767 171 YM 172 (200) Q Consensus 171 f~ 172 (200) -. T Consensus 176 v~ 177 (381) T COG3391 176 VT 177 (381) T ss_pred EE Confidence 33 No 176 >KOG1523|consensus Probab=53.90 E-value=34 Score=31.72 Aligned_cols=32 Identities=6% Similarity=-0.028 Sum_probs=25.4 Q ss_pred CCceeeEECcCCCEEEEEEcC-cEEEEECCCCC Q psy7767 96 NFLQLVQWSPVGHDLIFVKDN-NLYQAYDDFRS 127 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~g-dLyv~~~~~g~ 127 (200) ..+...+|++|+..||...++ +|.+....+.+ T Consensus 11 ~pitchAwn~drt~iAv~~~~~evhiy~~~~~~ 43 (361) T KOG1523|consen 11 EPITCHAWNSDRTQIAVSPNNHEVHIYSMLGAD 43 (361) T ss_pred CceeeeeecCCCceEEeccCCceEEEEEecCCC Confidence 467888999999999999876 67776666655 No 177 >COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only] Probab=52.94 E-value=2.1e+02 Score=28.12 Aligned_cols=21 Identities=10% Similarity=0.412 Sum_probs=18.0 Q ss_pred CCCceeeEECcCCCEEEEEEc Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD 115 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~ 115 (200) .+.+-+..|+|++++.+-+.+ T Consensus 274 ~~pVhdf~W~p~S~~F~vi~g 294 (561) T COG5354 274 KDPVHDFTWEPLSSRFAVISG 294 (561) T ss_pred cccceeeeecccCCceeEEec Confidence 456789999999999998874 No 178 >KOG0641|consensus Probab=52.85 E-value=49 Score=29.62 Aligned_cols=29 Identities=17% Similarity=0.303 Sum_probs=20.7 Q ss_pred eeecCCCCCceeeEECcCCCEEEEEEcCc Q psy7767 89 VSDLDGSNFLQLVQWSPVGHDLIFVKDNN 117 (200) Q Consensus 89 v~~lt~~~~is~~~wSPDGk~IAFV~~gd 117 (200) +..+.+.+.+...+|.|.|+.-|.-++.. T Consensus 26 i~~l~dsqairav~fhp~g~lyavgsnsk 54 (350) T KOG0641|consen 26 INILEDSQAIRAVAFHPAGGLYAVGSNSK 54 (350) T ss_pred EEEecchhheeeEEecCCCceEEeccCCc Confidence 34455677888999999999866655543 No 179 >PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B. Probab=52.18 E-value=16 Score=34.62 Aligned_cols=80 Identities=15% Similarity=0.177 Sum_probs=32.8 Q ss_pred CCceeeEECcCCCEEEEEEcCcEEE-EECCCCCeEEEeeCCCcceEeccc------------ccceeeeeee----CCCc Q psy7767 96 NFLQLVQWSPVGHDLIFVKDNNLYQ-AYDDFRSINALTRDGIKGVLFNGV------------ADWVYEEEVF----SSTK 158 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~gdLyv-~~~~~g~~~rLT~dg~~~~~~nG~------------pdwv~eEEi~----~~~~ 158 (200) +...++.|+++|+..+.-.++.|-+ .+.+......+..+....-++.|. .||.....+. .... T Consensus 69 G~g~~~vw~~~n~yAv~~~~~~I~I~kn~~~~~~k~i~~~~~~~~If~G~LL~~~~~~~i~~yDw~~~~~i~~i~v~~vk 148 (443) T PF04053_consen 69 GSGLSFVWSSRNRYAVLESSSTIKIYKNFKNEVVKSIKLPFSVEKIFGGNLLGVKSSDFICFYDWETGKLIRRIDVSAVK 148 (443) T ss_dssp EE-SEEEE-TSSEEEEE-TTS-EEEEETTEE-TT-----SS-EEEEE-SSSEEEEETTEEEEE-TTT--EEEEESS-E-E T ss_pred CceeEEEEecCccEEEEECCCeEEEEEcCccccceEEcCCcccceEEcCcEEEEECCCCEEEEEhhHcceeeEEecCCCc Confidence 3456788888666554444566666 455433323333321111112221 1333222211 1136 Q ss_pred ceEEccCCCEEEEEEEc Q psy7767 159 TLWFSPDGSILAYMKFN 175 (200) Q Consensus 159 ~~~WSPDGk~Laf~r~D 175 (200) .+.|||||+.+|++..+ T Consensus 149 ~V~Ws~~g~~val~t~~ 165 (443) T PF04053_consen 149 YVIWSDDGELVALVTKD 165 (443) T ss_dssp EEEE-TTSSEEEEE-S- T ss_pred EEEEECCCCEEEEEeCC Confidence 78999999999988443 No 180 >PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B. Probab=51.40 E-value=36 Score=32.30 Aligned_cols=27 Identities=22% Similarity=0.439 Sum_probs=20.3 Q ss_pred ceeeEECcCCCEEEEEEcCcEEEEECC Q psy7767 98 LQLVQWSPVGHDLIFVKDNNLYQAYDD 124 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~gdLyv~~~~ 124 (200) +....|||||+.||++....+|+.+-+ T Consensus 147 vk~V~Ws~~g~~val~t~~~i~il~~~ 173 (443) T PF04053_consen 147 VKYVIWSDDGELVALVTKDSIYILKYN 173 (443) T ss_dssp -EEEEE-TTSSEEEEE-S-SEEEEEE- T ss_pred CcEEEEECCCCEEEEEeCCeEEEEEec Confidence 688999999999999999999997654 No 181 >COG3391 Uncharacterized conserved protein [Function unknown] Probab=50.94 E-value=2e+02 Score=26.25 Aligned_cols=65 Identities=20% Similarity=0.213 Sum_probs=39.0 Q ss_pred ceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEc Q psy7767 98 LQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFN 175 (200) Q Consensus 98 is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D 175 (200) -...+++|+|+++.-.- ++.|.+.+..+....+ ...+.. .+ .+.....+.++|||+++ |+..+ T Consensus 162 P~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~v~~-~~~~~~----~~---------~~~~P~~i~v~~~g~~~-yV~~~ 226 (381) T COG3391 162 PTGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVR-GSVGSL----VG---------VGTGPAGIAVDPDGNRV-YVAND 226 (381) T ss_pred cceEEECCCCCeEEEEecCCCeEEEEeCCCcceec-cccccc----cc---------cCCCCceEEECCCCCEE-EEEec Confidence 37889999999864444 4567777654443333 221110 00 12356788999999866 56555 Q ss_pred CC Q psy7767 176 DA 177 (200) Q Consensus 176 ~s 177 (200) .+ T Consensus 227 ~~ 228 (381) T COG3391 227 GS 228 (381) T ss_pred cC Confidence 44 No 182 >PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function. Probab=50.42 E-value=37 Score=32.87 Aligned_cols=18 Identities=33% Similarity=0.650 Sum_probs=13.7 Q ss_pred CCcceEEccCCCEEEEEEE Q psy7767 156 STKTLWFSPDGSILAYMKF 174 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~ 174 (200) ...++.|||||+.| |+.. T Consensus 503 E~tG~~fspDg~tl-Fvni 520 (524) T PF05787_consen 503 EITGPCFSPDGRTL-FVNI 520 (524) T ss_pred ccccceECCCCCEE-EEEE Confidence 35789999999986 5543 No 183 >KOG2048|consensus Probab=49.96 E-value=1.7e+02 Score=29.59 Aligned_cols=38 Identities=8% Similarity=0.028 Sum_probs=31.2 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEe Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALT 132 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT 132 (200) -+.+..+..||||++||-+. .++||+.++++++..-+- T Consensus 475 ~~~I~~l~~SsdG~yiaa~~t~g~I~v~nl~~~~~~~l~ 513 (691) T KOG2048|consen 475 CPSISRLVVSSDGNYIAAISTRGQIFVYNLETLESHLLK 513 (691) T ss_pred CCcceeEEEcCCCCEEEEEeccceEEEEEcccceeecch Confidence 35678999999999999885 679999999987765444 No 184 >PRK13615 lipoprotein LpqB; Provisional Probab=49.75 E-value=91 Score=30.64 Aligned_cols=22 Identities=18% Similarity=0.080 Sum_probs=17.4 Q ss_pred CCcceEEccCCCEEEEEEEcCC Q psy7767 156 STKTLWFSPDGSILAYMKFNDA 177 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~s 177 (200) ....++-|+||.++|.+..+.+ T Consensus 418 ~I~~lrvSrDG~R~Avi~~~~g 439 (557) T PRK13615 418 RVVSLEVARDGARVLVQLETGA 439 (557) T ss_pred eeEEEEeCCCccEEEEEEecCC Confidence 4578899999999999855433 No 185 >KOG0646|consensus Probab=47.54 E-value=1e+02 Score=29.80 Aligned_cols=34 Identities=18% Similarity=0.017 Sum_probs=28.2 Q ss_pred CCCceeeEECcCCCEEEEE-EcCcEEEEECCCCCe Q psy7767 95 SNFLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSI 128 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~ 128 (200) .+.+..++-+|+|..|+=- -.++||++.+.+|.. T Consensus 81 Pg~v~al~s~n~G~~l~ag~i~g~lYlWelssG~L 115 (476) T KOG0646|consen 81 PGPVHALASSNLGYFLLAGTISGNLYLWELSSGIL 115 (476) T ss_pred ccceeeeecCCCceEEEeecccCcEEEEEeccccH Confidence 4567888999999998766 678999999988864 No 186 >COG3211 PhoX Predicted phosphatase [General function prediction only] Probab=46.78 E-value=40 Score=33.44 Aligned_cols=19 Identities=32% Similarity=0.593 Sum_probs=14.6 Q ss_pred CcceEEccCCCEEEEEEEcC Q psy7767 157 TKTLWFSPDGSILAYMKFND 176 (200) Q Consensus 157 ~~~~~WSPDGk~Laf~r~D~ 176 (200) ..++.|||||+.| |+...+ T Consensus 556 ~tG~~FspD~~Tl-FV~vQH 574 (616) T COG3211 556 FTGPCFSPDGKTL-FVNVQH 574 (616) T ss_pred eecceeCCCCceE-EEEecC Confidence 4688999999988 665444 No 187 >COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only] Probab=46.47 E-value=3e+02 Score=27.18 Aligned_cols=37 Identities=11% Similarity=-0.013 Sum_probs=23.6 Q ss_pred eeecCCCCCceeeEECcCCCEEEEEE----cCcEEEEECCC Q psy7767 89 VSDLDGSNFLQLVQWSPVGHDLIFVK----DNNLYQAYDDF 125 (200) Q Consensus 89 v~~lt~~~~is~~~wSPDGk~IAFV~----~gdLyv~~~~~ 125 (200) +..+-+.+.--.+.+||.+++|.+.- .+++-+.+..+ T Consensus 309 l~~~~Pe~~rNT~~fsp~~r~il~agF~nl~gni~i~~~~~ 349 (561) T COG5354 309 LRFYFPEQKRNTIFFSPHERYILFAGFDNLQGNIEIFDPAG 349 (561) T ss_pred eEEecCCcccccccccCcccEEEEecCCccccceEEeccCC Confidence 33333445556778999999998863 34566665543 No 188 >PF10584 Proteasome_A_N: Proteasome subunit A N-terminal signature; InterPro: IPR000426 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis on sequence similarities into two groups, alpha (A) and beta (B). This family contains the alpha subunit sequences which range from 210 to 290 amino acids. These sequences are classified as non-peptidase homologues in MEROPS peptidase family T1 (clan PB(T)). ; GO: 0004175 endopeptidase activity, 0006511 ubiquitin-dependent protein catabolic process, 0019773 proteasome core complex, alpha-subunit complex; PDB: 3H4P_M 1IRU_O 3UN4_U 1FNT_A 3OEV_G 3OEU_U 3SDK_U 3DY3_G 3MG7_G 3L5Q_C .... Probab=44.85 E-value=8.3 Score=22.02 Aligned_cols=7 Identities=71% Similarity=1.161 Sum_probs=6.1 Q ss_pred EEccCCC Q psy7767 161 WFSPDGS 167 (200) Q Consensus 161 ~WSPDGk 167 (200) .|||||+ T Consensus 7 ~FSp~Gr 13 (23) T PF10584_consen 7 TFSPDGR 13 (23) T ss_dssp SBBTTSS T ss_pred eECCCCe Confidence 4899997 No 189 >KOG2041|consensus Probab=44.50 E-value=21 Score=36.59 Aligned_cols=58 Identities=17% Similarity=0.159 Sum_probs=38.5 Q ss_pred CCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 96 NFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) .-+.++.|.-||.+|..|. +|.|.+-++++...- | .+-+. .....+.||||.+.+.|- T Consensus 116 SvV~SmsWn~dG~kIcIvYeDGavIVGsvdGNRIw-----g-------------KeLkg-~~l~hv~ws~D~~~~Lf~ 174 (1189) T KOG2041|consen 116 SVVVSMSWNLDGTKICIVYEDGAVIVGSVDGNRIW-----G-------------KELKG-QLLAHVLWSEDLEQALFK 174 (1189) T ss_pred cEEEEEEEcCCCcEEEEEEccCCEEEEeeccceec-----c-------------hhcch-heccceeecccHHHHHhh Confidence 3468899999999999996 678888777654321 0 00000 133467899999877665 No 190 >KOG0289|consensus Probab=44.15 E-value=2.1e+02 Score=27.73 Aligned_cols=70 Identities=16% Similarity=-0.000 Sum_probs=41.7 Q ss_pred CCceeeEECcCCCEEE-EEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 96 NFLQLVQWSPVGHDLI-FVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 96 ~~is~~~wSPDGk~IA-FV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) -++..+++.|||--++ +..++.|=+.++.... .++.-+... +-..++.||-+|=+||-... T Consensus 348 v~~ts~~fHpDgLifgtgt~d~~vkiwdlks~~--~~a~Fpght----------------~~vk~i~FsENGY~Lat~ad 409 (506) T KOG0289|consen 348 VEYTSAAFHPDGLIFGTGTPDGVVKIWDLKSQT--NVAKFPGHT----------------GPVKAISFSENGYWLATAAD 409 (506) T ss_pred ceeEEeeEcCCceEEeccCCCceEEEEEcCCcc--ccccCCCCC----------------CceeEEEeccCceEEEEEec Confidence 3468888999986543 2245667777765443 334322110 23567899999988875544 Q ss_pred cCCcceEEEE Q psy7767 175 NDARVHNMSY 184 (200) Q Consensus 175 D~s~v~~~~i 184 (200) ++.|..+.+ T Consensus 410 -d~~V~lwDL 418 (506) T KOG0289|consen 410 -DGSVKLWDL 418 (506) T ss_pred -CCeEEEEEe Confidence 444655543 No 191 >KOG0294|consensus Probab=43.73 E-value=1.8e+02 Score=27.07 Aligned_cols=71 Identities=10% Similarity=0.001 Sum_probs=42.6 Q ss_pred CCceeeEECcCCCEEEEEEcC-cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 96 NFLQLVQWSPVGHDLIFVKDN-NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~g-dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) +.+.++++.|.||.-.-|.+. -+-..++-.|..--+.+-+. ....+.|||.|.+++..- T Consensus 128 ~~Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a~v~~L~~-------------------~at~v~w~~~Gd~F~v~~- 187 (362) T KOG0294|consen 128 GQVTDLSIHPSGKLALSVGGDQVLRTWNLVRGRVAFVLNLKN-------------------KATLVSWSPQGDHFVVSG- 187 (362) T ss_pred cccceeEecCCCceEEEEcCCceeeeehhhcCccceeeccCC-------------------cceeeEEcCCCCEEEEEe- Confidence 448999999999865444332 24445666666555554332 223478999999765432 Q ss_pred cCCcceEEEEEee Q psy7767 175 NDARVHNMSYIHY 187 (200) Q Consensus 175 D~s~v~~~~i~~~ 187 (200) ...+.+|.+... T Consensus 188 -~~~i~i~q~d~A 199 (362) T KOG0294|consen 188 -RNKIDIYQLDNA 199 (362) T ss_pred -ccEEEEEecccH Confidence 234555555443 No 192 >KOG1963|consensus Probab=43.59 E-value=1.2e+02 Score=31.31 Aligned_cols=76 Identities=18% Similarity=0.080 Sum_probs=54.4 Q ss_pred eeeeeeeeeeeeeee------ecceEEEEeccCcceeEEEEEeeeeeeeeeecCCCCccceeecCCCCCceeeEECcCCC Q psy7767 35 LKLSIRCLTWKQSEY------WDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLDGSNFLQLVQWSPVGH 108 (200) Q Consensus 35 ~~~~~~~~~~~~~~~------~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~~~~is~~~wSPDGk 108 (200) -|++.+||.|..-+- =||. |++..|.+-.-|.|...+.++ .+ +.. -+..+....+||||. T Consensus 240 ~~~t~t~lHWH~~~V~~L~fS~~G~-~LlSGG~E~VLv~Wq~~T~~k-------qf-LPR-----Lgs~I~~i~vS~ds~ 305 (792) T KOG1963|consen 240 DSETCTLLHWHHDEVNSLSFSSDGA-YLLSGGREGVLVLWQLETGKK-------QF-LPR-----LGSPILHIVVSPDSD 305 (792) T ss_pred ccccceEEEecccccceeEEecCCc-eEeecccceEEEEEeecCCCc-------cc-ccc-----cCCeeEEEEEcCCCC Confidence 468889999984332 3566 888888888888888876632 22 111 267889999999999 Q ss_pred EEEEEE-cCcEEEEECC Q psy7767 109 DLIFVK-DNNLYQAYDD 124 (200) Q Consensus 109 ~IAFV~-~gdLyv~~~~ 124 (200) ..+-+. +|+|.+.... T Consensus 306 ~~sl~~~DNqI~li~~~ 322 (792) T KOG1963|consen 306 LYSLVLEDNQIHLIKAS 322 (792) T ss_pred eEEEEecCceEEEEecc Confidence 998886 5677776553 No 193 >KOG0265|consensus Probab=42.63 E-value=2.8e+02 Score=25.66 Aligned_cols=35 Identities=9% Similarity=-0.061 Sum_probs=28.2 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSIN 129 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~ 129 (200) .+.++++.|.+||..|.-.. +..|+..|+++|+.. T Consensus 90 sgAVM~l~~~~d~s~i~S~gtDk~v~~wD~~tG~~~ 125 (338) T KOG0265|consen 90 SGAVMELHGMRDGSHILSCGTDKTVRGWDAETGKRI 125 (338) T ss_pred cceeEeeeeccCCCEEEEecCCceEEEEecccceee Confidence 56789999999999986664 457999999888654 No 194 >COG3204 Uncharacterized protein conserved in bacteria [Function unknown] Probab=42.42 E-value=2.8e+02 Score=25.55 Aligned_cols=40 Identities=13% Similarity=0.206 Sum_probs=24.2 Q ss_pred CCceeeEECcCCCEEEEEEcC--cEEEEECCCCCeEEEeeCC Q psy7767 96 NFLQLVQWSPVGHDLIFVKDN--NLYQAYDDFRSINALTRDG 135 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~g--dLyv~~~~~g~~~rLT~dg 135 (200) ..++.++|+||.+.+--|-+. .|--.+.++.-.+++.-+| T Consensus 86 ~nvS~LTynp~~rtLFav~n~p~~iVElt~~GdlirtiPL~g 127 (316) T COG3204 86 ANVSSLTYNPDTRTLFAVTNKPAAIVELTKEGDLIRTIPLTG 127 (316) T ss_pred ccccceeeCCCcceEEEecCCCceEEEEecCCceEEEecccc Confidence 458999999999997555443 3444445443344444333 No 195 >PF09729 Gti1_Pac2: Gti1/Pac2 family; InterPro: IPR018608 In Schizosaccharomyces pombe (Fission yeast) the gti1 protein promotes the onset of gluconate uptake upon glucose starvation []. In S. pombe the Pac2 protein controls the onset of sexual development, by inhibiting the expression of ste11, in a pathway that is independent of the cAMP cascade []. Probab=42.41 E-value=44 Score=27.93 Aligned_cols=40 Identities=20% Similarity=0.429 Sum_probs=28.2 Q ss_pred EEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccC---CCEEEEEEEcC Q psy7767 112 FVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPD---GSILAYMKFND 176 (200) Q Consensus 112 FV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPD---Gk~Laf~r~D~ 176 (200) .+..|+|||.+.+....+|+|+ +..|||. |..|.|-..+. T Consensus 39 ~I~sG~vfV~~e~~sgikRWTD-------------------------G~~WspSri~g~fLiYrEl~~ 81 (168) T PF09729_consen 39 LIRSGNVFVFEEEESGIKRWTD-------------------------GRSWSPSRILGNFLIYRELDK 81 (168) T ss_pred cccCCCEEEEeccccCceeccC-------------------------CcccCcccccCceeEEEEecc Confidence 5567888888776667788886 2468884 77787765444 No 196 >KOG0639|consensus Probab=42.00 E-value=70 Score=31.60 Aligned_cols=72 Identities=18% Similarity=0.193 Sum_probs=48.0 Q ss_pred CCCCceeeEECcCCCEEEEEEcC---cEEEEECCCCCe---EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCC Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKDN---NLYQAYDDFRSI---NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS 167 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~g---dLyv~~~~~g~~---~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk 167 (200) .+..+..-.++|||+.|.- +| .|=++|+..-+. .+||..+- .-.++.-|||.+ T Consensus 464 rdnyiRSckL~pdgrtLiv--GGeastlsiWDLAapTprikaeltssap-------------------aCyALa~spDak 522 (705) T KOG0639|consen 464 RDNYIRSCKLLPDGRTLIV--GGEASTLSIWDLAAPTPRIKAELTSSAP-------------------ACYALAISPDAK 522 (705) T ss_pred cccceeeeEecCCCceEEe--ccccceeeeeeccCCCcchhhhcCCcch-------------------hhhhhhcCCccc Confidence 4678899999999999743 34 455556653222 13333221 235678899985 Q ss_pred EEEEEEEcCCcceEEEEEee Q psy7767 168 ILAYMKFNDARVHNMSYIHY 187 (200) Q Consensus 168 ~Laf~r~D~s~v~~~~i~~~ 187 (200) |+|...-++++.++.+.+- T Consensus 523 -vcFsccsdGnI~vwDLhnq 541 (705) T KOG0639|consen 523 -VCFSCCSDGNIAVWDLHNQ 541 (705) T ss_pred -eeeeeccCCcEEEEEcccc Confidence 8999988888888876654 No 197 >KOG0299|consensus Probab=41.74 E-value=75 Score=30.63 Aligned_cols=39 Identities=10% Similarity=-0.158 Sum_probs=30.1 Q ss_pred CCCceeeEECcCCCEEEEEEcCcEE-EEECCCCCeEEEee Q psy7767 95 SNFLQLVQWSPVGHDLIFVKDNNLY-QAYDDFRSINALTR 133 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~gdLy-v~~~~~g~~~rLT~ 133 (200) ...+..++.||||++||+--.+++. +.+.++.+..+... T Consensus 202 ~keil~~avS~Dgkylatgg~d~~v~Iw~~~t~ehv~~~~ 241 (479) T KOG0299|consen 202 VKEILTLAVSSDGKYLATGGRDRHVQIWDCDTLEHVKVFK 241 (479) T ss_pred cceeEEEEEcCCCcEEEecCCCceEEEecCcccchhhccc Confidence 4567889999999999998655544 78888887766644 No 198 >KOG0639|consensus Probab=41.30 E-value=33 Score=33.81 Aligned_cols=40 Identities=15% Similarity=0.064 Sum_probs=32.0 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeC Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRD 134 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~d 134 (200) ...|..+..||+|.|||.-. ++++++....+.+..||... T Consensus 592 ~SQIfSLg~cP~~dWlavGMens~vevlh~skp~kyqlhlh 632 (705) T KOG0639|consen 592 SSQIFSLGYCPTGDWLAVGMENSNVEVLHTSKPEKYQLHLH 632 (705) T ss_pred hhhheecccCCCccceeeecccCcEEEEecCCccceeeccc Confidence 55678899999999999986 45899988776666777753 No 199 >KOG1007|consensus Probab=40.98 E-value=40 Score=31.05 Aligned_cols=73 Identities=19% Similarity=0.168 Sum_probs=48.4 Q ss_pred CceeeEECc--CCCEEEEEEcCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 97 FLQLVQWSP--VGHDLIFVKDNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 97 ~is~~~wSP--DGk~IAFV~~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) .....+||| ||+.++-+.++.++..++.+-...--.++. .| ...+.+-|.|+-+.+..... T Consensus 172 ~ftsg~WspHHdgnqv~tt~d~tl~~~D~RT~~~~~sI~dA------Hg-----------q~vrdlDfNpnkq~~lvt~g 234 (370) T KOG1007|consen 172 SFTSGAWSPHHDGNQVATTSDSTLQFWDLRTMKKNNSIEDA------HG-----------QRVRDLDFNPNKQHILVTCG 234 (370) T ss_pred eecccccCCCCccceEEEeCCCcEEEEEccchhhhcchhhh------hc-----------ceeeeccCCCCceEEEEEcC Confidence 457779999 899999999999999887632211111111 01 13456778888887777778 Q ss_pred cCCcceEEEEEe Q psy7767 175 NDARVHNMSYIH 186 (200) Q Consensus 175 D~s~v~~~~i~~ 186 (200) |++.++++.... T Consensus 235 DdgyvriWD~R~ 246 (370) T KOG1007|consen 235 DDGYVRIWDTRK 246 (370) T ss_pred CCccEEEEeccC Confidence 888777765433 No 200 >KOG1963|consensus Probab=40.43 E-value=1.6e+02 Score=30.31 Aligned_cols=62 Identities=13% Similarity=0.115 Sum_probs=39.5 Q ss_pred CCCceeeEECcCCCEEEEE-EcCcEEEEECCCCCeEEEee-CCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFV-KDNNLYQAYDDFRSINALTR-DGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV-~~gdLyv~~~~~g~~~rLT~-dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ...+..++||+||..|.-- +.+-+-+..++++. +++-+ -|. ....+.+||||...+.+ T Consensus 251 ~~~V~~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~-kqfLPRLgs-------------------~I~~i~vS~ds~~~sl~ 310 (792) T KOG1963|consen 251 HDEVNSLSFSSDGAYLLSGGREGVLVLWQLETGK-KQFLPRLGS-------------------PILHIVVSPDSDLYSLV 310 (792) T ss_pred ccccceeEEecCCceEeecccceEEEEEeecCCC-cccccccCC-------------------eeEEEEEcCCCCeEEEE Confidence 4578899999999886322 22345555666666 43333 111 23467899999988777 Q ss_pred EEcC Q psy7767 173 KFND 176 (200) Q Consensus 173 r~D~ 176 (200) ..|. T Consensus 311 ~~DN 314 (792) T KOG1963|consen 311 LEDN 314 (792) T ss_pred ecCc Confidence 6654 No 201 >KOG0268|consensus Probab=40.43 E-value=44 Score=31.52 Aligned_cols=79 Identities=20% Similarity=0.285 Sum_probs=45.6 Q ss_pred cceeecCCCCCceeeEECcCCCEEEEEEcC---cEEEEECCCCCeE-EEeeCCCcceEecccccceeeeeeeCCCcceEE Q psy7767 87 TDVSDLDGSNFLQLVQWSPVGHDLIFVKDN---NLYQAYDDFRSIN-ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWF 162 (200) Q Consensus 87 ~~v~~lt~~~~is~~~wSPDGk~IAFV~~g---dLyv~~~~~g~~~-rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~W 162 (200) ..++.+.-+.....+.|+| ....||.++ +||..|+..-..+ .+-.+-. +....+.| T Consensus 221 ~Pl~KVi~~mRTN~IswnP--eafnF~~a~ED~nlY~~DmR~l~~p~~v~~dhv------------------sAV~dVdf 280 (433) T KOG0268|consen 221 SPLKKVILTMRTNTICWNP--EAFNFVAANEDHNLYTYDMRNLSRPLNVHKDHV------------------SAVMDVDF 280 (433) T ss_pred CccceeeeeccccceecCc--cccceeeccccccceehhhhhhcccchhhcccc------------------eeEEEecc Confidence 3444443344556779999 666788754 7998876543322 2222110 12346789 Q ss_pred ccCCCEEEEEEEcCCcceEEEEEe Q psy7767 163 SPDGSILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 163 SPDGk~Laf~r~D~s~v~~~~i~~ 186 (200) ||-|+-++-..+|. .+.+|.+.. T Consensus 281 sptG~EfvsgsyDk-sIRIf~~~~ 303 (433) T KOG0268|consen 281 SPTGQEFVSGSYDK-SIRIFPVNH 303 (433) T ss_pred CCCcchhccccccc-eEEEeecCC Confidence 99999876555554 355555443 No 202 >KOG0272|consensus Probab=40.03 E-value=46 Score=31.83 Aligned_cols=64 Identities=16% Similarity=0.199 Sum_probs=41.2 Q ss_pred CCCceeeEECcC--CCEEEEEE-cCcEEEEECCCCCe-EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEE Q psy7767 95 SNFLQLVQWSPV--GHDLIFVK-DNNLYQAYDDFRSI-NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILA 170 (200) Q Consensus 95 ~~~is~~~wSPD--Gk~IAFV~-~gdLyv~~~~~g~~-~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~La 170 (200) ...+..+.|+|. +..||--. +|-+-+..+++..+ .+|+..+ .+...+.|-|+|++|+ T Consensus 217 ~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~~e~~l~~l~gH~-------------------~RVs~VafHPsG~~L~ 277 (459) T KOG0272|consen 217 TSRVGAAVFHPVDSDLNLATASADGTVKLWKLSQETPLQDLEGHL-------------------ARVSRVAFHPSGKFLG 277 (459) T ss_pred ccceeeEEEccCCCccceeeeccCCceeeeccCCCcchhhhhcch-------------------hhheeeeecCCCceee Confidence 466788999997 55655543 45555555554322 2333222 2556789999999999 Q ss_pred EEEEcCC Q psy7767 171 YMKFNDA 177 (200) Q Consensus 171 f~r~D~s 177 (200) -..+|.+ T Consensus 278 TasfD~t 284 (459) T KOG0272|consen 278 TASFDST 284 (459) T ss_pred ecccccc Confidence 8888764 No 203 >PLN00033 photosystem II stability/assembly factor; Provisional Probab=39.56 E-value=3.2e+02 Score=25.51 Aligned_cols=91 Identities=10% Similarity=0.055 Sum_probs=51.3 Q ss_pred eeeeeeeeeeeeee--------------------eeecceEEEEeccCcceeEEEEEeeeeeeeeeecCCCCccceeecC Q psy7767 34 ALKLSIRCLTWKQS--------------------EYWDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLD 93 (200) Q Consensus 34 ~~~~~~~~~~~~~~--------------------~~~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt 93 (200) .++.+-.+-+|+.- .||++.++.+....... .+.....|+-....-..-+..+..... T Consensus 201 v~~S~D~G~tW~~~~~~t~~~~l~~~~~s~~~g~~~y~Gsf~~v~~~~dG~--~~~vg~~G~~~~s~d~G~~~W~~~~~~ 278 (398) T PLN00033 201 IYVTSNAGRNWKAAVEETVSATLNRTVSSGISGASYYTGTFSTVNRSPDGD--YVAVSSRGNFYLTWEPGQPYWQPHNRA 278 (398) T ss_pred EEEECCCCCCceEcccccccccccccccccccccceeccceeeEEEcCCCC--EEEEECCccEEEecCCCCcceEEecCC Confidence 34445567788764 58888888866433322 222222332111111110011333333 Q ss_pred CCCCceeeEECcCCCEEEEEEcCcEEEEECCCCC Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKDNNLYQAYDDFRS 127 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~gdLyv~~~~~g~ 127 (200) ....++.+.+.+||..+....+|.++... ++|. T Consensus 279 ~~~~l~~v~~~~dg~l~l~g~~G~l~~S~-d~G~ 311 (398) T PLN00033 279 SARRIQNMGWRADGGLWLLTRGGGLYVSK-GTGL 311 (398) T ss_pred CccceeeeeEcCCCCEEEEeCCceEEEec-CCCC Confidence 45667899999999999888888988764 4443 No 204 >KOG0284|consensus Probab=39.44 E-value=51 Score=31.48 Aligned_cols=20 Identities=25% Similarity=0.557 Sum_probs=15.7 Q ss_pred CCCceeeEECcCCCEEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK 114 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~ 114 (200) ...+....|.|+|++|.--. T Consensus 96 kc~V~~v~WtPeGRRLltgs 115 (464) T KOG0284|consen 96 KCPVNVVRWTPEGRRLLTGS 115 (464) T ss_pred ccceeeEEEcCCCceeEeec Confidence 35678889999999986554 No 205 >PLN00033 photosystem II stability/assembly factor; Provisional Probab=39.24 E-value=1.2e+02 Score=28.27 Aligned_cols=29 Identities=21% Similarity=0.326 Sum_probs=22.8 Q ss_pred eeeEECcCCCEEEEEEcCcEEEEECCCCC Q psy7767 99 QLVQWSPVGHDLIFVKDNNLYQAYDDFRS 127 (200) Q Consensus 99 s~~~wSPDGk~IAFV~~gdLyv~~~~~g~ 127 (200) .....+|||+.+++-.+|++|...-+++. T Consensus 242 ~~v~~~~dG~~~~vg~~G~~~~s~d~G~~ 270 (398) T PLN00033 242 STVNRSPDGDYVAVSSRGNFYLTWEPGQP 270 (398) T ss_pred eeEEEcCCCCEEEEECCccEEEecCCCCc Confidence 34578899999999999999987655443 No 206 >PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B. Probab=39.24 E-value=2.7e+02 Score=24.53 Aligned_cols=40 Identities=10% Similarity=0.040 Sum_probs=31.0 Q ss_pred CCCceeeEECcCC----CEEEEEEc---CcEEEEECCCCCeEEEeeC Q psy7767 95 SNFLQLVQWSPVG----HDLIFVKD---NNLYQAYDDFRSINALTRD 134 (200) Q Consensus 95 ~~~is~~~wSPDG----k~IAFV~~---gdLyv~~~~~g~~~rLT~d 134 (200) +..+.++++-... +..||+.+ +.|.|.++.+++..|+... T Consensus 60 ~s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~~~~s~Rv~~~ 106 (287) T PF03022_consen 60 DSFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLATGKSWRVLHN 106 (287) T ss_dssp CGGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETTTTEEEEEETC T ss_pred ccccceEEEEccCCCCcceEEEEeCCCcCcEEEEEccCCcEEEEecC Confidence 4567788776632 37999975 4799999999999999864 No 207 >KOG2919|consensus Probab=39.23 E-value=2.8e+02 Score=26.04 Aligned_cols=27 Identities=19% Similarity=0.116 Sum_probs=18.3 Q ss_pred CceeeEECcCCCEEEEEEcCcEEEEEC Q psy7767 97 FLQLVQWSPVGHDLIFVKDNNLYQAYD 123 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~~gdLyv~~~ 123 (200) ..-.+.|||||.+|.--.++-|-+-++ T Consensus 160 aAhsL~Fs~DGeqlfaGykrcirvFdt 186 (406) T KOG2919|consen 160 AAHSLQFSPDGEQLFAGYKRCIRVFDT 186 (406) T ss_pred hheeEEecCCCCeEeecccceEEEeec Confidence 467889999999985444454444444 No 208 >KOG0272|consensus Probab=38.89 E-value=1e+02 Score=29.50 Aligned_cols=33 Identities=27% Similarity=0.132 Sum_probs=23.7 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCC Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRS 127 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~ 127 (200) ...+....|||+|-.||=-. ++..-|.++.... T Consensus 345 ~k~I~~V~fsPNGy~lATgs~Dnt~kVWDLR~r~ 378 (459) T KOG0272|consen 345 IKEILSVAFSPNGYHLATGSSDNTCKVWDLRMRS 378 (459) T ss_pred ccceeeEeECCCceEEeecCCCCcEEEeeecccc Confidence 46788999999999998775 3445555665433 No 209 >PF12657 TFIIIC_delta: Transcription factor IIIC subunit delta N-term; InterPro: IPR024761 This entry represents a domain found towards the N terminus of the 90 kDa subunit of transcription factor IIIC (also known as subunit 9 in yeast []). The whole subunit is involved in RNA polymerase III-mediated transcription. It is possible that this N-terminal domain interacts with TFIIIC subunit 8 []. Probab=38.77 E-value=31 Score=27.89 Aligned_cols=22 Identities=18% Similarity=0.357 Sum_probs=19.5 Q ss_pred eeeEECcCCCEEEEEEcCcEEEE Q psy7767 99 QLVQWSPVGHDLIFVKDNNLYQA 121 (200) Q Consensus 99 s~~~wSPDGk~IAFV~~gdLyv~ 121 (200) ..++||.|| +||.+.+..|++. T Consensus 8 ~~l~WS~Dg-~laV~t~~~v~IL 29 (173) T PF12657_consen 8 NALAWSEDG-QLAVATGESVHIL 29 (173) T ss_pred cCeeECCCC-CEEEEcCCeEEEE Confidence 457999999 7899999999998 No 210 >KOG0306|consensus Probab=37.91 E-value=4.9e+02 Score=27.08 Aligned_cols=86 Identities=21% Similarity=0.181 Sum_probs=47.5 Q ss_pred CCCCceeeEECcCCCEEEEEE-cCc--EEEEE-------CCCCC----eEEEeeCCCcceEecccccceeee-------- Q psy7767 94 GSNFLQLVQWSPVGHDLIFVK-DNN--LYQAY-------DDFRS----INALTRDGIKGVLFNGVADWVYEE-------- 151 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~-~gd--Lyv~~-------~~~g~----~~rLT~dg~~~~~~nG~pdwv~eE-------- 151 (200) -..++..+++||||+.+|-.. ++- ||..| +-|-. ...|++|+. .+..|.+|-...- T Consensus 507 l~ddvL~v~~Spdgk~LaVsLLdnTVkVyflDtlKFflsLYGHkLPV~smDIS~DSk--livTgSADKnVKiWGLdFGDC 584 (888) T KOG0306|consen 507 LEDDVLCVSVSPDGKLLAVSLLDNTVKVYFLDTLKFFLSLYGHKLPVLSMDISPDSK--LIVTGSADKNVKIWGLDFGDC 584 (888) T ss_pred ccccEEEEEEcCCCcEEEEEeccCeEEEEEecceeeeeeecccccceeEEeccCCcC--eEEeccCCCceEEeccccchh Confidence 457899999999999998874 332 33221 11111 134555443 2334444322210 Q ss_pred --eeeCC---CcceEEccCCCEEEEEEEcCCcceEE Q psy7767 152 --EVFSS---TKTLWFSPDGSILAYMKFNDARVHNM 182 (200) Q Consensus 152 --Ei~~~---~~~~~WSPDGk~Laf~r~D~s~v~~~ 182 (200) -+|++ ...+.|-|+ .++.|....++.++.+ T Consensus 585 HKS~fAHdDSvm~V~F~P~-~~~FFt~gKD~kvKqW 619 (888) T KOG0306|consen 585 HKSFFAHDDSVMSVQFLPK-THLFFTCGKDGKVKQW 619 (888) T ss_pred hhhhhcccCceeEEEEccc-ceeEEEecCcceEEee Confidence 02332 245779995 4677877777766665 No 211 >KOG4283|consensus Probab=37.26 E-value=43 Score=30.97 Aligned_cols=75 Identities=15% Similarity=0.154 Sum_probs=41.5 Q ss_pred CCCceeeEECcCCCEEEEEE--cCcEEEEECCCC--CeEEEeeCCCcceEecccccceee--eeeeCCCcceEEccCCCE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK--DNNLYQAYDDFR--SINALTRDGIKGVLFNGVADWVYE--EEVFSSTKTLWFSPDGSI 168 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g--~~~rLT~dg~~~~~~nG~pdwv~e--EEi~~~~~~~~WSPDGk~ 168 (200) .+++....|||...+|.++. ++.+-+.++... --+.+-... ++-....+ ....+...++.|+.||++ T Consensus 188 r~~vlaV~Wsp~~e~vLatgsaDg~irlWDiRrasgcf~~lD~hn-------~k~~p~~~~n~ah~gkvngla~tSd~~~ 260 (397) T KOG4283|consen 188 RDGVLAVEWSPSSEWVLATGSADGAIRLWDIRRASGCFRVLDQHN-------TKRPPILKTNTAHYGKVNGLAWTSDARY 260 (397) T ss_pred cCceEEEEeccCceeEEEecCCCceEEEEEeecccceeEEeeccc-------CccCccccccccccceeeeeeecccchh Confidence 46788999999999999984 455555544321 111111100 00000000 001245678999999999 Q ss_pred EEEEEEcC Q psy7767 169 LAYMKFND 176 (200) Q Consensus 169 Laf~r~D~ 176 (200) |+-.-.|+ T Consensus 261 l~~~gtd~ 268 (397) T KOG4283|consen 261 LASCGTDD 268 (397) T ss_pred hhhccCcc Confidence 98664443 No 212 >PLN00181 protein SPA1-RELATED; Provisional Probab=36.89 E-value=2.5e+02 Score=28.05 Aligned_cols=34 Identities=6% Similarity=-0.003 Sum_probs=23.9 Q ss_pred CCCceeeEECcC-CCEEEEEE-cCcEEEEECCCCCe Q psy7767 95 SNFLQLVQWSPV-GHDLIFVK-DNNLYQAYDDFRSI 128 (200) Q Consensus 95 ~~~is~~~wSPD-Gk~IAFV~-~gdLyv~~~~~g~~ 128 (200) ...+..+.|+|. ++.||-.. ++.|.+.++.+++. T Consensus 532 ~~~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~~~ 567 (793) T PLN00181 532 RSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQL 567 (793) T ss_pred cCceeeEEeccCCCCEEEEEeCCCeEEEEECCCCeE Confidence 456788899885 66665543 67888888876654 No 213 >KOG2100|consensus Probab=36.56 E-value=1.8e+02 Score=29.55 Aligned_cols=19 Identities=37% Similarity=0.488 Sum_probs=15.8 Q ss_pred cceEEccCCCEEEEEEEcC Q psy7767 158 KTLWFSPDGSILAYMKFND 176 (200) Q Consensus 158 ~~~~WSPDGk~Laf~r~D~ 176 (200) .-+.|||.|..|+|+..++ T Consensus 151 ~~~~wsp~~~~l~yv~~~n 169 (755) T KOG2100|consen 151 QYASWSPLGNDLAYVLHNN 169 (755) T ss_pred EEEEEcCCCCEEEEEEecc Confidence 3578999999999997654 No 214 >PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A. Probab=35.04 E-value=4.3e+02 Score=25.61 Aligned_cols=146 Identities=14% Similarity=0.123 Sum_probs=61.4 Q ss_pred cccccceEEEEEEeceeeeceeeeeeeeeeeeeeeeeeecc-------eEEEE---eccCcceeEEEEEeeeeeeeeeec Q psy7767 12 FSHVDHKIIHIIECGYIRSTDIALKLSIRCLTWKQSEYWDY-------NLYLQ---RSTDIALKLSIRCLTWKQSHSLSG 81 (200) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~y~~---~~~~~~~kv~i~~n~~~~R~s~~a 81 (200) ..--.||++..|..|- +=.+-|-+|+| |.-..=|.+ +|++. ..+.-..+..|.+-..+. T Consensus 227 WD~~~r~~~Q~idLg~----~g~~pLEvRfl-H~P~~~~gFvg~aLss~i~~~~k~~~g~W~a~kVi~ip~~~v------ 295 (461) T PF05694_consen 227 WDWSTRKLLQTIDLGE----EGQMPLEVRFL-HDPDANYGFVGCALSSSIWRFYKDDDGEWAAEKVIDIPAKKV------ 295 (461) T ss_dssp EETTTTEEEEEEES-T----TEEEEEEEEE--SSTT--EEEEEEE--EEEEEEEE-ETTEEEEEEEEEE--EE------- T ss_pred EECCCCcEeeEEecCC----CCCceEEEEec-CCCCccceEEEEeccceEEEEEEcCCCCeeeeEEEECCCccc------ Confidence 3344677888887772 44567889998 765443332 22222 233334344444433311 Q ss_pred CCCCccce-eecC-CCCCceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEeccc-----ccceeeee Q psy7767 82 RPDTLTDV-SDLD-GSNFLQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGV-----ADWVYEEE 152 (200) Q Consensus 82 ~~~~~~~v-~~lt-~~~~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~-----pdwv~eEE 152 (200) +.|.+.++ .... ...-+.++.+|.|.+.|-+.. .|+|...++.+-..++++-. +...|. +--+...+ T Consensus 296 ~~~~lp~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYDISDP~~Pkl~gq----v~lGG~~~~~~~~~v~g~~ 371 (461) T PF05694_consen 296 EGWILPEMLKPFGAVPPLITDILISLDDRFLYVSNWLHGDVRQYDISDPFNPKLVGQ----VFLGGSIRKGDHPVVKGKR 371 (461) T ss_dssp -SS---GGGGGG-EE------EEE-TTS-EEEEEETTTTEEEEEE-SSTTS-EEEEE----EE-BTTTT-B--TTS---- T ss_pred CcccccccccccccCCCceEeEEEccCCCEEEEEcccCCcEEEEecCCCCCCcEEeE----EEECcEeccCCCccccccc Confidence 12312111 1011 134679999999999986664 67888888876544444421 111111 01111112 Q ss_pred eeCCCcceEEccCCCEEEEE Q psy7767 153 VFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 153 i~~~~~~~~WSPDGk~Laf~ 172 (200) +-+...-+.-|-|||+|.+. T Consensus 372 l~GgPqMvqlS~DGkRlYvT 391 (461) T PF05694_consen 372 LRGGPQMVQLSLDGKRLYVT 391 (461) T ss_dssp --S----EEE-TTSSEEEEE T ss_pred cCCCCCeEEEccCCeEEEEE Confidence 33445567889999999766 No 215 >KOG0647|consensus Probab=34.93 E-value=1.3e+02 Score=27.89 Aligned_cols=72 Identities=15% Similarity=0.100 Sum_probs=41.6 Q ss_pred CCCCceeeEECcC-CCEE-EEEEcCcEEEEECCC-CCeE---EEeeCCCcceEecccccceeeeeeeCCCcceEEccCCC Q psy7767 94 GSNFLQLVQWSPV-GHDL-IFVKDNNLYQAYDDF-RSIN---ALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS 167 (200) Q Consensus 94 ~~~~is~~~wSPD-Gk~I-AFV~~gdLyv~~~~~-g~~~---rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk 167 (200) +...++.++|||. ...+ |=..+|.|-+.++.. |... +.+.+| -.-...||-||. T Consensus 26 P~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~--------------------PvL~v~Wsddgs 85 (347) T KOG0647|consen 26 PEDSISALAFSPQADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDG--------------------PVLDVCWSDDGS 85 (347) T ss_pred cccchheeEeccccCceEEecccCCceEEEEEecCCcccchhhhccCC--------------------CeEEEEEccCCc Confidence 3567899999993 3333 333467766666554 2221 111111 123678999996 Q ss_pred EEEEEEEcCCcceEEEEEe Q psy7767 168 ILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 168 ~Laf~r~D~s~v~~~~i~~ 186 (200) + +|+..-+..++.+.+.. T Consensus 86 k-Vf~g~~Dk~~k~wDL~S 103 (347) T KOG0647|consen 86 K-VFSGGCDKQAKLWDLAS 103 (347) T ss_pred e-EEeeccCCceEEEEccC Confidence 5 47766666666666543 No 216 >KOG4640|consensus Probab=34.25 E-value=1.6e+02 Score=29.53 Aligned_cols=61 Identities=18% Similarity=0.189 Sum_probs=40.6 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEee-C-CCcceEecccccceeeeeeeCCCcceEEccCCCEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTR-D-GIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAY 171 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~-d-g~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf 171 (200) +..+-.+.|+|.=..||++. .|+|-+.-+. .+||-. + ++.+ ...+++|-|||+-||+ T Consensus 20 ~~~i~~~ewnP~~dLiA~~t~~gelli~R~n---~qRlwtip~p~~~-----------------v~~sL~W~~DGkllaV 79 (665) T KOG4640|consen 20 PINIKRIEWNPKMDLIATRTEKGELLIHRLN---WQRLWTIPIPGEN-----------------VTASLCWRPDGKLLAV 79 (665) T ss_pred ccceEEEEEcCccchhheeccCCcEEEEEec---cceeEeccCCCCc-----------------cceeeeecCCCCEEEE Confidence 45678889999999999985 4567666544 233322 1 2111 1257999999999987 Q ss_pred EEEc Q psy7767 172 MKFN 175 (200) Q Consensus 172 ~r~D 175 (200) --.| T Consensus 80 g~kd 83 (665) T KOG4640|consen 80 GFKD 83 (665) T ss_pred EecC Confidence 6444 No 217 >KOG0643|consensus Probab=34.24 E-value=2.5e+02 Score=25.73 Aligned_cols=80 Identities=11% Similarity=0.038 Sum_probs=45.3 Q ss_pred CCCCceeeEECcCCCEEEEEEcC------cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCC Q psy7767 94 GSNFLQLVQWSPVGHDLIFVKDN------NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGS 167 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~~g------dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk 167 (200) .+..+....||++|+.++++.+. .|.+.++......+..+++-..+ -.| | ......-|+|-|+ T Consensus 92 ~~~~Vk~~~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI---~t~------~--skit~a~Wg~l~~ 160 (327) T KOG0643|consen 92 TNSPVKRVDFSFGGNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYLKI---PTP------D--SKITSALWGPLGE 160 (327) T ss_pred cCCeeEEEeeccCCcEEEEEehhhcCcceEEEEEEccCChhhhcccCceEEe---cCC------c--cceeeeeecccCC Confidence 35667888999999999999754 46666665333222222110000 000 0 1334567999999 Q ss_pred EEEEEEEcCCcceEEEEE Q psy7767 168 ILAYMKFNDARVHNMSYI 185 (200) Q Consensus 168 ~Laf~r~D~s~v~~~~i~ 185 (200) .|.+- .+++.+.++... T Consensus 161 ~ii~G-he~G~is~~da~ 177 (327) T KOG0643|consen 161 TIIAG-HEDGSISIYDAR 177 (327) T ss_pred EEEEe-cCCCcEEEEEcc Confidence 88654 344556555544 No 218 >KOG1009|consensus Probab=34.11 E-value=1e+02 Score=29.44 Aligned_cols=21 Identities=29% Similarity=0.384 Sum_probs=14.8 Q ss_pred CCCcceEEccCCCEEEEEEEc Q psy7767 155 SSTKTLWFSPDGSILAYMKFN 175 (200) Q Consensus 155 ~~~~~~~WSPDGk~Laf~r~D 175 (200) .....++||+||..|+....| T Consensus 343 ~~iTDiaws~dg~~l~vSS~D 363 (434) T KOG1009|consen 343 SAITDIAWSDDGSVLLVSSTD 363 (434) T ss_pred eeecceeecCCCcEEEEeccC Confidence 355678999999877654433 No 219 >KOG1520|consensus Probab=33.47 E-value=2.3e+02 Score=26.68 Aligned_cols=33 Identities=24% Similarity=0.216 Sum_probs=26.4 Q ss_pred CCcceEEccCCCEEEEEEEcCCcceEEEEEeeC Q psy7767 156 STKTLWFSPDGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) -.++++-|||++.+.|...-...+..+++.-.. T Consensus 220 F~NGlaLS~d~sfvl~~Et~~~ri~rywi~g~k 252 (376) T KOG1520|consen 220 FPNGLALSPDGSFVLVAETTTARIKRYWIKGPK 252 (376) T ss_pred ccccccCCCCCCEEEEEeeccceeeeeEecCCc Confidence 356788999999999998877778888876554 No 220 >KOG0643|consensus Probab=31.77 E-value=4.1e+02 Score=24.38 Aligned_cols=65 Identities=15% Similarity=0.131 Sum_probs=42.5 Q ss_pred CCCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 94 GSNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 94 ~~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) +...+..+-|+|-|+.|..-. +|.|-..++.+|.... ..++.. -.....+.+|||..++.-. T Consensus 146 ~~skit~a~Wg~l~~~ii~Ghe~G~is~~da~~g~~~v--~s~~~h---------------~~~Ind~q~s~d~T~FiT~ 208 (327) T KOG0643|consen 146 PDSKITSALWGPLGETIIAGHEDGSISIYDARTGKELV--DSDEEH---------------SSKINDLQFSRDRTYFITG 208 (327) T ss_pred CccceeeeeecccCCEEEEecCCCcEEEEEcccCceee--echhhh---------------ccccccccccCCcceEEec Confidence 456788999999999998774 5678888887664321 111100 0134678899999876544 Q ss_pred EEc Q psy7767 173 KFN 175 (200) Q Consensus 173 r~D 175 (200) ..| T Consensus 209 s~D 211 (327) T KOG0643|consen 209 SKD 211 (327) T ss_pred ccC Confidence 443 No 221 >COG5169 HSF1 Heat shock transcription factor [Transcription] Probab=31.59 E-value=35 Score=30.68 Aligned_cols=17 Identities=29% Similarity=0.493 Sum_probs=13.5 Q ss_pred CCcceEEccCCCEEEEE Q psy7767 156 STKTLWFSPDGSILAYM 172 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~ 172 (200) ...-++|||||+.+... T Consensus 26 ~~k~I~Ws~~G~sfvI~ 42 (282) T COG5169 26 YYKLIQWSPDGRSFVIL 42 (282) T ss_pred cCCceEECCCCCEEEEe Confidence 45668999999988654 No 222 >KOG1587|consensus Probab=31.43 E-value=2.9e+02 Score=27.18 Aligned_cols=74 Identities=18% Similarity=0.221 Sum_probs=46.5 Q ss_pred CceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 97 FLQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) .+.+.+|||---.|-++. +|+|++.|+.......+....-. . ....-..|||+|+.|+ +-. T Consensus 443 ~v~~vaWSptrpavF~~~d~~G~l~iWDLl~~~~~Pv~s~~~~------~----------~~l~~~~~s~~g~~la-vGd 505 (555) T KOG1587|consen 443 YVTDVAWSPTRPAVFATVDGDGNLDIWDLLQDDEEPVLSQKVC------S----------PALTRVRWSPNGKLLA-VGD 505 (555) T ss_pred eeeeeEEcCcCceEEEEEcCCCceehhhhhccccCCccccccc------c----------cccceeecCCCCcEEE-Eec Confidence 378899999887776665 47899998875543333321100 0 0233468999998775 545 Q ss_pred cCCcceEEEEEee Q psy7767 175 NDARVHNMSYIHY 187 (200) Q Consensus 175 D~s~v~~~~i~~~ 187 (200) ..+.+..+.+... T Consensus 506 ~~G~~~~~~l~~~ 518 (555) T KOG1587|consen 506 ANGTTHILKLSES 518 (555) T ss_pred CCCcEEEEEcCch Confidence 5566666666443 No 223 >TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis. Probab=31.41 E-value=2.7e+02 Score=26.73 Aligned_cols=65 Identities=22% Similarity=0.206 Sum_probs=41.9 Q ss_pred eEECcCCCEEEEEE---------cCcEEEEECCCCCeEEEeeCCCc-ceEecccccceeeeeee---CCCcceEEccCCC Q psy7767 101 VQWSPVGHDLIFVK---------DNNLYQAYDDFRSINALTRDGIK-GVLFNGVADWVYEEEVF---SSTKTLWFSPDGS 167 (200) Q Consensus 101 ~~wSPDGk~IAFV~---------~gdLyv~~~~~g~~~rLT~dg~~-~~~~nG~pdwv~eEEi~---~~~~~~~WSPDGk 167 (200) |+++|- -++|.. .|+|++..+.++...||.-++.. .+ .+ .+|.+| ++.+.+.=+|||. T Consensus 347 psiaps--g~~~y~~~g~~~p~w~g~llv~~L~~~~l~r~~l~~~~~~v--~~-----~~~~~~~~~~RiRdv~~~pDg~ 417 (454) T TIGR03606 347 PTIAPS--SAYYYKGGEKGITGWENSLLIPSLKRGVIYRIKLDPDYSTV--YG-----DAVPMFKTNNRYRDVIASPDGN 417 (454) T ss_pred CCcCCc--eeEEEecCcccCcccCCCEEEEEcCCCeEEEEEecCCccee--cc-----eeEEeecCCCeeEEEEECCCCC Confidence 455565 466776 36999999998888888865532 10 00 122233 6788999999987 Q ss_pred EEEEEEE Q psy7767 168 ILAYMKF 174 (200) Q Consensus 168 ~Laf~r~ 174 (200) .|..+.. T Consensus 418 ~iy~~td 424 (454) T TIGR03606 418 VLYVATD 424 (454) T ss_pred EEEEEEc Confidence 6754443 No 224 >KOG0276|consensus Probab=30.70 E-value=2.2e+02 Score=28.92 Aligned_cols=20 Identities=25% Similarity=0.336 Sum_probs=15.2 Q ss_pred ceeeEECcCCCEEEEEEcCc Q psy7767 98 LQLVQWSPVGHDLIFVKDNN 117 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~gd 117 (200) .+.++-||+|+-++-.-+|+ T Consensus 354 Pq~L~hsPNGrfV~VcgdGE 373 (794) T KOG0276|consen 354 PQTLAHSPNGRFVVVCGDGE 373 (794) T ss_pred hHHhccCCCCcEEEEecCcc Confidence 36778999999887666665 No 225 >KOG0277|consensus Probab=30.66 E-value=2.8e+02 Score=25.29 Aligned_cols=77 Identities=16% Similarity=0.051 Sum_probs=47.9 Q ss_pred ceeeEECc-CCCEEEEEE--------cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCE Q psy7767 98 LQLVQWSP-VGHDLIFVK--------DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSI 168 (200) Q Consensus 98 is~~~wSP-DGk~IAFV~--------~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~ 168 (200) -..+++|| --++||... +|.||+.++.++...+....= || + .....+.||+.-.- T Consensus 11 GysvqfSPf~~nrLavAt~q~yGl~G~G~L~ile~~~~~gi~e~~s~----------d~--~----D~LfdV~Wse~~e~ 74 (311) T KOG0277|consen 11 GYSVQFSPFVENRLAVATAQHYGLAGNGRLFILEVTDPKGIQECQSY----------DT--E----DGLFDVAWSENHEN 74 (311) T ss_pred cceeEecccccchhheeehhhcccccCceEEEEecCCCCCeEEEEee----------ec--c----cceeEeeecCCCcc Confidence 35678888 345555543 468999888643333222210 01 0 12236899999999 Q ss_pred EEEEEEcCCcceEEEEEeeCCC Q psy7767 169 LAYMKFNDARVHNMSYIHYGEP 190 (200) Q Consensus 169 Laf~r~D~s~v~~~~i~~~~~~ 190 (200) ++++...++.+++|.......| T Consensus 75 ~~~~a~GDGSLrl~d~~~~s~P 96 (311) T KOG0277|consen 75 QVIAASGDGSLRLFDLTMPSKP 96 (311) T ss_pred eEEEEecCceEEEeccCCCCcc Confidence 9999888888888885555443 No 226 >PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits. Probab=30.11 E-value=1.6e+02 Score=29.46 Aligned_cols=79 Identities=13% Similarity=0.140 Sum_probs=50.0 Q ss_pred eeeeeeeeeeeeeeecceEEEEeccCcceeEEEEEeeeeeeeeeecCCCCccceeecC----CCCCceeeEECcCCCEEE Q psy7767 36 KLSIRCLTWKQSEYWDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLD----GSNFLQLVQWSPVGHDLI 111 (200) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt----~~~~is~~~wSPDGk~IA 111 (200) .-.|++|.|-+...=...|.. |++.+|.+.+-. |+.|.-..-.-..++.+. .+..+.+..|.+||..| T Consensus 72 ~~~I~dLDWtst~d~qsiLaV----Gf~~~v~l~~Q~---R~dy~~~~p~w~~i~~i~i~~~T~h~Igds~Wl~~G~Lv- 143 (631) T PF12234_consen 72 DDPIRDLDWTSTPDGQSILAV----GFPHHVLLYTQL---RYDYTNKGPSWAPIRKIDISSHTPHPIGDSIWLKDGTLV- 143 (631) T ss_pred CCceeeceeeecCCCCEEEEE----EcCcEEEEEEcc---chhhhcCCcccceeEEEEeecCCCCCccceeEecCCeEE- Confidence 346789999987765555555 889888887642 233332221112333322 24678999999999765 Q ss_pred EEEcCcEEEEE Q psy7767 112 FVKDNNLYQAY 122 (200) Q Consensus 112 FV~~gdLyv~~ 122 (200) -..++++|+.+ T Consensus 144 V~sGNqlfv~d 154 (631) T PF12234_consen 144 VGSGNQLFVFD 154 (631) T ss_pred EEeCCEEEEEC Confidence 44688899874 No 227 >KOG4640|consensus Probab=29.96 E-value=1.1e+02 Score=30.63 Aligned_cols=34 Identities=18% Similarity=0.178 Sum_probs=26.9 Q ss_pred CCCCc-eeeEECcCCCEEEEE-EcCcEEEEECCCCC Q psy7767 94 GSNFL-QLVQWSPVGHDLIFV-KDNNLYQAYDDFRS 127 (200) Q Consensus 94 ~~~~i-s~~~wSPDGk~IAFV-~~gdLyv~~~~~g~ 127 (200) ++..+ .++.|-||||.||+- .+|.|-+.|++.|. T Consensus 60 p~~~v~~sL~W~~DGkllaVg~kdG~I~L~Dve~~~ 95 (665) T KOG4640|consen 60 PGENVTASLCWRPDGKLLAVGFKDGTIRLHDVEKGG 95 (665) T ss_pred CCCccceeeeecCCCCEEEEEecCCeEEEEEccCCC Confidence 34445 499999999999988 47889888888664 No 228 >KOG0282|consensus Probab=29.75 E-value=2.8e+02 Score=27.02 Aligned_cols=35 Identities=9% Similarity=0.046 Sum_probs=26.4 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEECCCCCeE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAYDDFRSIN 129 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~~~~g~~~ 129 (200) ...+.++.||++|..+.-+. +..|-+.|+++|+.. T Consensus 258 ~k~Vrd~~~s~~g~~fLS~sfD~~lKlwDtETG~~~ 293 (503) T KOG0282|consen 258 RKPVRDASFNNCGTSFLSASFDRFLKLWDTETGQVL 293 (503) T ss_pred hhhhhhhhccccCCeeeeeecceeeeeeccccceEE Confidence 56789999999999865553 556667788888764 No 229 >COG4831 Roadblock/LC7 domain [Function unknown] Probab=29.44 E-value=34 Score=26.41 Aligned_cols=13 Identities=46% Similarity=0.782 Sum_probs=10.9 Q ss_pred eEEccCCCEEEEE Q psy7767 160 LWFSPDGSILAYM 172 (200) Q Consensus 160 ~~WSPDGk~Laf~ 172 (200) --|||||+-++|- T Consensus 18 Gefs~DGkLv~Yk 30 (109) T COG4831 18 GEFSPDGKLVEYK 30 (109) T ss_pred ceeCCCCceEEee Confidence 3599999988885 No 230 >KOG0283|consensus Probab=29.10 E-value=52 Score=33.32 Aligned_cols=29 Identities=28% Similarity=0.255 Sum_probs=23.1 Q ss_pred CcceEEccCCCEEEEEEEcCCcceEEEEEe Q psy7767 157 TKTLWFSPDGSILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 157 ~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~ 186 (200) ..++-|||||++|| ....++-++++.+.. T Consensus 270 Iw~mKFS~DGKyLA-saGeD~virVWkVie 298 (712) T KOG0283|consen 270 IWAMKFSHDGKYLA-SAGEDGVIRVWKVIE 298 (712) T ss_pred EEEEEeCCCCceee-ecCCCceEEEEEEec Confidence 34566999999998 556777888888887 No 231 >PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif. Probab=29.08 E-value=1.7e+02 Score=19.07 Aligned_cols=29 Identities=10% Similarity=0.104 Sum_probs=21.1 Q ss_pred CceeeEECcCC---CEEEEEE-cCcEEEEECCC Q psy7767 97 FLQLVQWSPVG---HDLIFVK-DNNLYQAYDDF 125 (200) Q Consensus 97 ~is~~~wSPDG---k~IAFV~-~gdLyv~~~~~ 125 (200) .+...+|||++ +.++|+- .+.+-+.|+.+ T Consensus 2 AvR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~ 34 (43) T PF10313_consen 2 AVRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRS 34 (43) T ss_pred CeEEEEeCCCCCcccEEEEEccCCeEEEEEccc Confidence 45788999864 4777774 46788888874 No 232 >smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin. Probab=28.56 E-value=1e+02 Score=17.89 Aligned_cols=19 Identities=11% Similarity=-0.081 Sum_probs=15.1 Q ss_pred CCCcceEEccCCCEEEEEE Q psy7767 155 SSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 155 ~~~~~~~WSPDGk~Laf~r 173 (200) ....++.|+|++++|.|.- T Consensus 9 ~~~~~la~d~~~~~lYw~D 27 (43) T smart00135 9 GHPNGLAVDWIEGRLYWTD 27 (43) T ss_pred CCcCEEEEeecCCEEEEEe Confidence 3567899999999887763 No 233 >PLN00181 protein SPA1-RELATED; Provisional Probab=28.52 E-value=4e+02 Score=26.57 Aligned_cols=34 Identities=9% Similarity=-0.149 Sum_probs=24.7 Q ss_pred CCCceeeEECc-CCCEEEEEE-cCcEEEEECCCCCe Q psy7767 95 SNFLQLVQWSP-VGHDLIFVK-DNNLYQAYDDFRSI 128 (200) Q Consensus 95 ~~~is~~~wSP-DGk~IAFV~-~gdLyv~~~~~g~~ 128 (200) ...+..++|+| ||..|+-.. ++.|.+.++..+.. T Consensus 575 ~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~ 610 (793) T PLN00181 575 EKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVS 610 (793) T ss_pred CCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcE Confidence 45688999997 677665543 56788888876654 No 234 >PF08452 DNAP_B_exo_N: DNA polymerase family B exonuclease domain, N-terminal; InterPro: IPR013660 This domain is found in viral DNA polymerases to the N terminus of DNA polymerase family B exonuclease domains (IPR006133 from INTERPRO). ; GO: 0003887 DNA-directed DNA polymerase activity Probab=28.08 E-value=25 Score=19.84 Aligned_cols=11 Identities=36% Similarity=1.117 Sum_probs=7.8 Q ss_pred eeeeeeeeeee Q psy7767 38 SIRCLTWKQSE 48 (200) Q Consensus 38 ~~~~~~~~~~~ 48 (200) -||||.|-.|. T Consensus 2 eikCiNWFE~~ 12 (22) T PF08452_consen 2 EIKCINWFESR 12 (22) T ss_pred ccEEeehhhhC Confidence 47899986553 No 235 >PF13449 Phytase-like: Esterase-like activity of phytase Probab=27.39 E-value=4.4e+02 Score=23.37 Aligned_cols=92 Identities=20% Similarity=0.112 Sum_probs=49.6 Q ss_pred cCCCCCceeeEECcCCCEEEEEEcCcEEEEE----CCCCCeEEEee-CCCcceEecc--cccceeeeee------eCCCc Q psy7767 92 LDGSNFLQLVQWSPVGHDLIFVKDNNLYQAY----DDFRSINALTR-DGIKGVLFNG--VADWVYEEEV------FSSTK 158 (200) Q Consensus 92 lt~~~~is~~~wSPDGk~IAFV~~gdLyv~~----~~~g~~~rLT~-dg~~~~~~nG--~pdwv~eEEi------~~~~~ 158 (200) ...+.++-.++++|||+.|..+...-+.--. ...+...||.. +.. ..| ...|.|.-+- -...+ T Consensus 143 ~~~N~G~E~la~~~dG~~l~~~~E~~l~~d~~~~~~~~~~~~ri~~~d~~----~~~~~~~~~~y~ld~~~~~~~~~~is 218 (326) T PF13449_consen 143 RRNNRGFEGLAVSPDGRTLFAAMESPLKQDGPRANPDNGSPLRILRYDPK----TPGEPVAEYAYPLDPPPTAPGDNGIS 218 (326) T ss_pred ccCCCCeEEEEECCCCCEEEEEECccccCCCcccccccCceEEEEEecCC----CCCccceEEEEeCCccccccCCCCce Confidence 3356788899999999987777654433110 01122234443 211 011 2234443210 12456 Q ss_pred ceEEccCCCEEEEEEEc----CCcceEEEEEee Q psy7767 159 TLWFSPDGSILAYMKFN----DARVHNMSYIHY 187 (200) Q Consensus 159 ~~~WSPDGk~Laf~r~D----~s~v~~~~i~~~ 187 (200) .+.+-|||+.|+..|.. ...++++.+... T Consensus 219 d~~al~d~~lLvLER~~~~~~~~~~ri~~v~l~ 251 (326) T PF13449_consen 219 DIAALPDGRLLVLERDFSPGTGNYKRIYRVDLS 251 (326) T ss_pred eEEEECCCcEEEEEccCCCCccceEEEEEEEcc Confidence 78899999977766763 234556666654 No 236 >KOG0650|consensus Probab=27.12 E-value=3.2e+02 Score=27.67 Aligned_cols=97 Identities=14% Similarity=0.146 Sum_probs=56.0 Q ss_pred cceeecCCCCCceeeEECcCCCE--EEEEEcCcEEEEECCCCCe------E-EEeeC---CCcceEecccccceeee--- Q psy7767 87 TDVSDLDGSNFLQLVQWSPVGHD--LIFVKDNNLYQAYDDFRSI------N-ALTRD---GIKGVLFNGVADWVYEE--- 151 (200) Q Consensus 87 ~~v~~lt~~~~is~~~wSPDGk~--IAFV~~gdLyv~~~~~g~~------~-rLT~d---g~~~~~~nG~pdwv~eE--- 151 (200) +-+........+...+|+|.++. ||-..++.+++.+-.-|+. . .++.. ..+. ..+..|.-++ T Consensus 433 Rcvr~~~~d~~I~~vaw~P~~~~~vLAvA~~~~~~ivnp~~G~~~e~~~t~ell~~~~~~~~p~---~~~~~W~~~~~~e 509 (733) T KOG0650|consen 433 RCVRTVQFDSEIRSVAWNPLSDLCVLAVAVGECVLIVNPIFGDRLEVGPTKELLASAPNESEPD---AAVVTWSRASLDE 509 (733) T ss_pred eEEEEEeecceeEEEEecCCCCceeEEEEecCceEEeCccccchhhhcchhhhhhcCCCccCCc---ccceeechhhhhh Confidence 35555555668899999998754 3444456777775544421 1 11111 1110 0123465531 Q ss_pred -e--------eeCCCcceEEccCCCEEEEEEEcCCcceEEEEEe Q psy7767 152 -E--------VFSSTKTLWFSPDGSILAYMKFNDARVHNMSYIH 186 (200) Q Consensus 152 -E--------i~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i~~ 186 (200) + .+.....+.|--+|.+||-+..+.+.-.++...+ T Consensus 510 ~~~~v~~~I~~~k~i~~vtWHrkGDYlatV~~~~~~~~VliHQL 553 (733) T KOG0650|consen 510 LEKGVCIVIKHPKSIRQVTWHRKGDYLATVMPDSGNKSVLIHQL 553 (733) T ss_pred hccceEEEEecCCccceeeeecCCceEEEeccCCCcceEEEEec Confidence 1 1356688999999999999988776555444333 No 237 >KOG4499|consensus Probab=26.55 E-value=1.8e+02 Score=26.32 Aligned_cols=74 Identities=15% Similarity=0.077 Sum_probs=40.5 Q ss_pred CCCceeeEECcCCCEEEEEE----------cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEcc Q psy7767 95 SNFLQLVQWSPVGHDLIFVK----------DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSP 164 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~----------~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSP 164 (200) .....+...+|||+.-+=++ ++.||..- .+++...+-+ - ++...++.|+- T Consensus 108 knR~NDgkvdP~Gryy~GtMad~~~~le~~~g~Ly~~~-~~h~v~~i~~---------~----------v~IsNgl~Wd~ 167 (310) T KOG4499|consen 108 KNRLNDGKVDPDGRYYGGTMADFGDDLEPIGGELYSWL-AGHQVELIWN---------C----------VGISNGLAWDS 167 (310) T ss_pred hcccccCccCCCCceeeeeeccccccccccccEEEEec-cCCCceeeeh---------h----------ccCCccccccc Confidence 44567889999999833332 23455432 2233332222 1 24567789999 Q ss_pred CCCEEEEEEEcCCcceEEEEEeeC Q psy7767 165 DGSILAYMKFNDARVHNMSYIHYG 188 (200) Q Consensus 165 DGk~Laf~r~D~s~v~~~~i~~~~ 188 (200) |-+.+.|.-..+=.|..|.....+ T Consensus 168 d~K~fY~iDsln~~V~a~dyd~~t 191 (310) T KOG4499|consen 168 DAKKFYYIDSLNYEVDAYDYDCPT 191 (310) T ss_pred cCcEEEEEccCceEEeeeecCCCc Confidence 998887763333233334433433 No 238 >KOG0269|consensus Probab=25.85 E-value=2.6e+02 Score=28.89 Aligned_cols=111 Identities=21% Similarity=0.159 Sum_probs=0.0 Q ss_pred eeeeeeeeeeeeecceEEEEeccCcceeEEEEEeeeeeeeeeecCCCCccceeecCCCCCceeeEECc-CCCEEEEEEcC Q psy7767 38 SIRCLTWKQSEYWDYNLYLQRSTDIALKLSIRCLTWKQSHSLSGRPDTLTDVSDLDGSNFLQLVQWSP-VGHDLIFVKDN 116 (200) Q Consensus 38 ~~~~~~~~~~~~~~~~~y~~~~~~~~~kv~i~~n~~~~R~s~~a~~~~~~~v~~lt~~~~is~~~wSP-DGk~IAFV~~g 116 (200) |+-||+|...|= +++-+|++---|-+.--..+..++++-..- ..+.+.+||| ++...+=+.++ T Consensus 135 s~~~ldfh~tep-----~iliSGSQDg~vK~~DlR~~~S~~t~~~nS-----------ESiRDV~fsp~~~~~F~s~~ds 198 (839) T KOG0269|consen 135 SANKLDFHSTEP-----NILISGSQDGTVKCWDLRSKKSKSTFRSNS-----------ESIRDVKFSPGYGNKFASIHDS 198 (839) T ss_pred ceeeeeeccCCc-----cEEEecCCCceEEEEeeecccccccccccc-----------hhhhceeeccCCCceEEEecCC Q ss_pred ---cEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEEcCCcceEEEE Q psy7767 117 ---NLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKFNDARVHNMSY 184 (200) Q Consensus 117 ---dLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i 184 (200) ++|=.--...-..+++..-++.+ -+.|+|++.+|| .-.++..+.++.+ T Consensus 199 G~lqlWDlRqp~r~~~k~~AH~GpV~-------------------c~nwhPnr~~lA-TGGRDK~vkiWd~ 249 (839) T KOG0269|consen 199 GYLQLWDLRQPDRCEKKLTAHNGPVL-------------------CLNWHPNREWLA-TGGRDKMVKIWDM 249 (839) T ss_pred ceEEEeeccCchhHHHHhhcccCceE-------------------EEeecCCCceee-ecCCCccEEEEec No 239 >PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats. Probab=25.60 E-value=6.7e+02 Score=24.85 Aligned_cols=86 Identities=15% Similarity=0.152 Sum_probs=47.5 Q ss_pred ceeeEECcCCCEEEEEEc-CcEEEEECCCCCe-EEEe-eCCCcceEecccccceeeeeeeCCCcceEEc----------- Q psy7767 98 LQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSI-NALT-RDGIKGVLFNGVADWVYEEEVFSSTKTLWFS----------- 163 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~-~rLT-~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WS----------- 163 (200) ...++|.|||.-++...+ |+|-+.|+.-+.. .|+. ++..+.-...=..-+..+ ..-..+.|+ T Consensus 302 P~~iaWHp~gai~~V~s~qGelQ~FD~ALspi~~qLlsEd~~P~~~L~Ls~yf~~~----~~L~~iqW~~~~~~~~~~~~ 377 (545) T PF11768_consen 302 PTLIAWHPDGAIFVVGSEQGELQCFDMALSPIKMQLLSEDATPKSTLQLSKYFRVS----SSLVHIQWAPAPQLSSQGEF 377 (545) T ss_pred ceEEEEcCCCcEEEEEcCCceEEEEEeecCccceeeccccCCCccEEeeehhccCc----chhheeEeccCCCccccCCC Confidence 378899999998877754 7777777654432 2343 322221000000000000 122334454 Q ss_pred -cCCCEEEEEEEcCCcceEEEEEee Q psy7767 164 -PDGSILAYMKFNDARVHNMSYIHY 187 (200) Q Consensus 164 -PDGk~Laf~r~D~s~v~~~~i~~~ 187 (200) +|+..+.+++++.++.....+..- T Consensus 378 ~~~~~~~L~l~f~~GPl~vl~~~~G 402 (545) T PF11768_consen 378 YADTYDLLLLVFERGPLAVLRFKLG 402 (545) T ss_pred cCCccceEEEEEcCCCeEEEEEeec Confidence 455568889999998888776654 No 240 >COG3490 Uncharacterized protein conserved in bacteria [Function unknown] Probab=25.04 E-value=1.6e+02 Score=27.27 Aligned_cols=16 Identities=19% Similarity=0.264 Sum_probs=12.3 Q ss_pred CcceEEccCCCEEEEE Q psy7767 157 TKTLWFSPDGSILAYM 172 (200) Q Consensus 157 ~~~~~WSPDGk~Laf~ 172 (200) .-.+.|+|||+.|+-+ T Consensus 164 pHev~lm~DGrtlvva 179 (366) T COG3490 164 PHEVTLMADGRTLVVA 179 (366) T ss_pred cceeEEecCCcEEEEe Confidence 3467899999988765 No 241 >KOG0283|consensus Probab=24.89 E-value=2.3e+02 Score=28.85 Aligned_cols=30 Identities=13% Similarity=0.195 Sum_probs=24.1 Q ss_pred CCCceeeEECcCCCEEEEEEc-CcEEEEECC Q psy7767 95 SNFLQLVQWSPVGHDLIFVKD-NNLYQAYDD 124 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~~-gdLyv~~~~ 124 (200) ...-..+.+++||+.|+...+ ..||+++.+ T Consensus 546 ~~SQ~~Asfs~Dgk~IVs~seDs~VYiW~~~ 576 (712) T KOG0283|consen 546 TSSQISASFSSDGKHIVSASEDSWVYIWKND 576 (712) T ss_pred CCcceeeeEccCCCEEEEeecCceEEEEeCC Confidence 344567899999999999985 479999874 No 242 >PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity Probab=24.45 E-value=1.5e+02 Score=21.85 Aligned_cols=30 Identities=20% Similarity=0.121 Sum_probs=20.1 Q ss_pred CCCcceEEccCCCEEEEEEEcCCcceEEEE Q psy7767 155 SSTKTLWFSPDGSILAYMKFNDARVHNMSY 184 (200) Q Consensus 155 ~~~~~~~WSPDGk~Laf~r~D~s~v~~~~i 184 (200) ....++..|||+++|.-...-...+..|.. T Consensus 54 ~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~ 83 (86) T PF01731_consen 54 SFANGIAISPDKKYLYVASSLAHSIHVYKR 83 (86) T ss_pred CCCceEEEcCCCCEEEEEeccCCeEEEEEe Confidence 356678999999987554555555665543 No 243 >PRK13613 lipoprotein LpqB; Provisional Probab=23.90 E-value=3.3e+02 Score=26.95 Aligned_cols=21 Identities=19% Similarity=0.219 Sum_probs=17.1 Q ss_pred CCcceEEccCCCEEEEEEEcC Q psy7767 156 STKTLWFSPDGSILAYMKFND 176 (200) Q Consensus 156 ~~~~~~WSPDGk~Laf~r~D~ 176 (200) ...+++-||||.++|.+..+. T Consensus 456 ~I~~lrvSrDG~RvAvv~~~~ 476 (599) T PRK13613 456 RVVAVRVARDGVRVALIVEKD 476 (599) T ss_pred EeEEEEECCCccEEEEEEecC Confidence 456899999999999986543 No 244 >KOG1273|consensus Probab=23.70 E-value=5.3e+02 Score=24.24 Aligned_cols=39 Identities=15% Similarity=0.019 Sum_probs=27.5 Q ss_pred CCceeeEECcCCCEEEEEEc-CcEEEEECCCCCe-EEEeeC Q psy7767 96 NFLQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSI-NALTRD 134 (200) Q Consensus 96 ~~is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~-~rLT~d 134 (200) ..++.+.|||||+.|.=.+. +.+-+.++-.|.. .|+-.+ T Consensus 66 ~pi~sl~WS~dgr~LltsS~D~si~lwDl~~gs~l~rirf~ 106 (405) T KOG1273|consen 66 RPITSLCWSRDGRKLLTSSRDWSIKLWDLLKGSPLKRIRFD 106 (405) T ss_pred cceeEEEecCCCCEeeeecCCceeEEEeccCCCceeEEEcc Confidence 46899999999999877754 4566667766653 455543 No 245 >PF13449 Phytase-like: Esterase-like activity of phytase Probab=23.49 E-value=5.2e+02 Score=22.89 Aligned_cols=16 Identities=38% Similarity=0.366 Sum_probs=12.3 Q ss_pred CcceEEccCCCEEEEE Q psy7767 157 TKTLWFSPDGSILAYM 172 (200) Q Consensus 157 ~~~~~WSPDGk~Laf~ 172 (200) ..++..+|||++|..+ T Consensus 149 ~E~la~~~dG~~l~~~ 164 (326) T PF13449_consen 149 FEGLAVSPDGRTLFAA 164 (326) T ss_pred eEEEEECCCCCEEEEE Confidence 4578999999976544 No 246 >KOG4532|consensus Probab=23.45 E-value=5.2e+02 Score=23.78 Aligned_cols=73 Identities=14% Similarity=0.005 Sum_probs=42.9 Q ss_pred ceeeEECcCCCEEEEEEc-CcEEEEECCCCCe--EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEEE Q psy7767 98 LQLVQWSPVGHDLIFVKD-NNLYQAYDDFRSI--NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMKF 174 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~~--~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r~ 174 (200) .-.+..||||++.+-|-+ ++||...++.... .+++.-...+ .-...+||.....+|.+ . T Consensus 161 ~ns~~~snd~~~~~~Vgds~~Vf~y~id~~sey~~~~~~a~t~D-----------------~gF~~S~s~~~~~FAv~-~ 222 (344) T KOG4532|consen 161 QNSLHYSNDPSWGSSVGDSRRVFRYAIDDESEYIENIYEAPTSD-----------------HGFYNSFSENDLQFAVV-F 222 (344) T ss_pred eeeeEEcCCCceEEEecCCCcceEEEeCCccceeeeeEecccCC-----------------CceeeeeccCcceEEEE-e Confidence 456789999999999954 5788887775433 2322211100 11134677776666544 4 Q ss_pred cCCcceEEEEEeeC Q psy7767 175 NDARVHNMSYIHYG 188 (200) Q Consensus 175 D~s~v~~~~i~~~~ 188 (200) .++-+.+|.+.+-. T Consensus 223 Qdg~~~I~DVR~~~ 236 (344) T KOG4532|consen 223 QDGTCAIYDVRNMA 236 (344) T ss_pred cCCcEEEEEecccc Confidence 44556666665544 No 247 >KOG0284|consensus Probab=21.62 E-value=1.9e+02 Score=27.75 Aligned_cols=66 Identities=17% Similarity=0.094 Sum_probs=40.1 Q ss_pred CCceeeEECcC-CCEEEEEEcCcEEEEECCCCCe-EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 96 NFLQLVQWSPV-GHDLIFVKDNNLYQAYDDFRSI-NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 96 ~~is~~~wSPD-Gk~IAFV~~gdLyv~~~~~g~~-~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) ..+.++++||+ .|-+...-+|-|-+.+....+. .+|+-.|- -..++.|-|. +.|+++- T Consensus 181 eaIRdlafSpnDskF~t~SdDg~ikiWdf~~~kee~vL~GHgw-------------------dVksvdWHP~-kgLiasg 240 (464) T KOG0284|consen 181 EAIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPKEERVLRGHGW-------------------DVKSVDWHPT-KGLIASG 240 (464) T ss_pred hhhheeccCCCCceeEEecCCCeEEEEeccCCchhheeccCCC-------------------CcceeccCCc-cceeEEc Confidence 56899999994 5666666677777776655444 44432221 2356778886 4465665 Q ss_pred EcCCcceE Q psy7767 174 FNDARVHN 181 (200) Q Consensus 174 ~D~s~v~~ 181 (200) .++..|+. T Consensus 241 skDnlVKl 248 (464) T KOG0284|consen 241 SKDNLVKL 248 (464) T ss_pred cCCceeEe Confidence 55554443 No 248 >PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function. Probab=21.57 E-value=1.4e+02 Score=28.94 Aligned_cols=61 Identities=20% Similarity=0.147 Sum_probs=35.2 Q ss_pred cceEEEEeccCcceeEEEEEeeeee-ee--------eeecCCCCccceeecC---CCCCceeeEECcCCCEEEEE Q psy7767 51 DYNLYLQRSTDIALKLSIRCLTWKQ-SH--------SLSGRPDTLTDVSDLD---GSNFLQLVQWSPVGHDLIFV 113 (200) Q Consensus 51 ~~~~y~~~~~~~~~kv~i~~n~~~~-R~--------s~~a~~~~~~~v~~lt---~~~~is~~~wSPDGk~IAFV 113 (200) +|+||++..++.....++....+|. .. ....+.. ..+++++. .+.++..+.|+|||+.+ |+ T Consensus 446 ~G~LwI~eD~~~~~~~l~g~t~~G~~~~~~~~~G~~~~~~~~~-~g~~~rf~~~P~gaE~tG~~fspDg~tl-Fv 518 (524) T PF05787_consen 446 DGNLWIQEDGGGSNNNLPGVTPDGEVYDFARNDGNNVWAYDPD-TGELKRFLVGPNGAEITGPCFSPDGRTL-FV 518 (524) T ss_pred CCCEEEEeCCCCCCcccccccccCceeeeeecccceeeecccc-ccceeeeccCCCCcccccceECCCCCEE-EE Confidence 4789999887776543222222222 10 1222222 23455543 37889999999999984 55 No 249 >KOG3621|consensus Probab=21.26 E-value=1.2e+02 Score=30.68 Aligned_cols=63 Identities=11% Similarity=-0.099 Sum_probs=35.4 Q ss_pred ceeeEECcCCCEEEEEEc-CcEEEEECCCCC-e--EEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEEE Q psy7767 98 LQLVQWSPVGHDLIFVKD-NNLYQAYDDFRS-I--NALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYMK 173 (200) Q Consensus 98 is~~~wSPDGk~IAFV~~-gdLyv~~~~~g~-~--~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~r 173 (200) +....+|++...+|+... +.|-+..+..+. . ..+|+.... .+ -+..++.||+||++|. . T Consensus 79 ~~~~~vs~~e~lvAagt~~g~V~v~ql~~~~p~~~~~~t~~d~~----~~-----------~rVTal~Ws~~~~k~y-s- 141 (726) T KOG3621|consen 79 TCVRSVSSVEYLVAAGTASGRVSVFQLNKELPRDLDYVTPCDKS----HK-----------CRVTALEWSKNGMKLY-S- 141 (726) T ss_pred EEEEEecchhHhhhhhcCCceEEeehhhccCCCcceeecccccc----CC-----------ceEEEEEecccccEEe-e- Confidence 355678888888888854 344433332222 1 234432110 01 1457899999999884 3 Q ss_pred EcCC Q psy7767 174 FNDA 177 (200) Q Consensus 174 ~D~s 177 (200) .|+. T Consensus 142 GD~~ 145 (726) T KOG3621|consen 142 GDSQ 145 (726) T ss_pred cCCC Confidence 4544 No 250 >PRK13614 lipoprotein LpqB; Provisional Probab=20.90 E-value=4.2e+02 Score=26.21 Aligned_cols=35 Identities=26% Similarity=0.242 Sum_probs=24.2 Q ss_pred CceeeEECcCCCEEEEEEcC--cEEEEECCCCCeEEEe Q psy7767 97 FLQLVQWSPVGHDLIFVKDN--NLYQAYDDFRSINALT 132 (200) Q Consensus 97 ~is~~~wSPDGk~IAFV~~g--dLyv~~~~~g~~~rLT 132 (200) .+..++.|+||+.+|++.+. .|++... ++..+.+. T Consensus 344 ~~~s~avS~~g~~~A~~~~~~~~l~~~~~-g~~~~~~~ 380 (573) T PRK13614 344 GPASPAESPVSQTVAFLNGSRTTLYTVSP-GQPARALT 380 (573) T ss_pred cccceeecCCCceEEEecCCCcEEEEecC-CCcceeee Confidence 56788999999999999653 4776544 33333333 No 251 >KOG0264|consensus Probab=20.80 E-value=4e+02 Score=25.51 Aligned_cols=78 Identities=17% Similarity=0.119 Sum_probs=50.0 Q ss_pred CCCceeeEECcCCCEEEEEE--cCcEEEEECCCCCeEEEeeCCCcceEecccccceeeeeeeCCCcceEEccCCCEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK--DNNLYQAYDDFRSINALTRDGIKGVLFNGVADWVYEEEVFSSTKTLWFSPDGSILAYM 172 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~--~gdLyv~~~~~g~~~rLT~dg~~~~~~nG~pdwv~eEEi~~~~~~~~WSPDGk~Laf~ 172 (200) ++++-.++|+|-+..|.=+. ++.|.+.|+.+-..+..+..+-+ .| ...+.|||+-..+.-. T Consensus 272 ~~~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~~~lh~~e~H~-------------de----v~~V~WSPh~etvLAS 334 (422) T KOG0264|consen 272 SAEVNCVAFNPFNEFILATGSADKTVALWDLRNLNKPLHTFEGHE-------------DE----VFQVEWSPHNETVLAS 334 (422) T ss_pred CCceeEEEeCCCCCceEEeccCCCcEEEeechhcccCceeccCCC-------------cc----eEEEEeCCCCCceeEe Confidence 46677888999665554442 56788887765444444433311 12 2367899999888767 Q ss_pred EEcCCcceEEEEEeeCC Q psy7767 173 KFNDARVHNMSYIHYGE 189 (200) Q Consensus 173 r~D~s~v~~~~i~~~~~ 189 (200) ..-++.+.++.+.-.+. T Consensus 335 Sg~D~rl~vWDls~ig~ 351 (422) T KOG0264|consen 335 SGTDRRLNVWDLSRIGE 351 (422) T ss_pred cccCCcEEEEecccccc Confidence 66667777777665554 No 252 >KOG0275|consensus Probab=20.50 E-value=4.2e+02 Score=25.07 Aligned_cols=28 Identities=18% Similarity=0.296 Sum_probs=18.2 Q ss_pred CCCceeeEECcCCCEEEEEE-cCcEEEEE Q psy7767 95 SNFLQLVQWSPVGHDLIFVK-DNNLYQAY 122 (200) Q Consensus 95 ~~~is~~~wSPDGk~IAFV~-~gdLyv~~ 122 (200) +..+.++.+++||..|.-.+ ++.|-+++ T Consensus 348 sSyvn~a~ft~dG~~iisaSsDgtvkvW~ 376 (508) T KOG0275|consen 348 SSYVNEATFTDDGHHIISASSDGTVKVWH 376 (508) T ss_pred cccccceEEcCCCCeEEEecCCccEEEec Confidence 34567888999998886654 34444443 No 253 >PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A. Probab=20.23 E-value=1.8e+02 Score=16.46 Aligned_cols=16 Identities=25% Similarity=0.281 Sum_probs=10.5 Q ss_pred EEEEEcCcEEEEECCC Q psy7767 110 LIFVKDNNLYQAYDDF 125 (200) Q Consensus 110 IAFV~~gdLyv~~~~~ 125 (200) |+...+|+||+.+... T Consensus 7 vav~~~g~i~VaD~~n 22 (28) T PF01436_consen 7 VAVDSDGNIYVADSGN 22 (28) T ss_dssp EEEETTSEEEEEECCC T ss_pred EEEeCCCCEEEEECCC Confidence 4445677888887543 No 254 >smart00415 HSF heat shock factor. Probab=20.08 E-value=76 Score=23.75 Aligned_cols=13 Identities=31% Similarity=0.590 Sum_probs=10.1 Q ss_pred cceEEccCCCEEE Q psy7767 158 KTLWFSPDGSILA 170 (200) Q Consensus 158 ~~~~WSPDGk~La 170 (200) .-++|+|||+.+. T Consensus 20 ~iI~W~~~G~~f~ 32 (105) T smart00415 20 KIISWSPSGKSFV 32 (105) T ss_pred CEEEECCCCCEEE Confidence 3579999998654 Done!