Query gi|254780605|ref|YP_003065018.1| putative aminopeptidase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 609 No_of_seqs 267 out of 2609 Neff 7.1 Searched_HMMs 39220 Date Sun May 29 21:52:00 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780605.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 KOG2413 consensus 100.0 0 0 1349.1 42.7 586 14-609 8-606 (606) 2 cd01085 APP X-Prolyl Aminopept 100.0 0 0 587.9 20.0 220 321-545 1-223 (224) 3 COG0006 PepP Xaa-Pro aminopept 100.0 0 0 542.9 26.7 359 175-564 6-381 (384) 4 PRK09795 aminopeptidase; Provi 100.0 0 0 536.4 27.6 343 181-566 2-357 (361) 5 PRK10879 proline aminopeptidas 100.0 0 0 428.6 27.6 353 177-566 4-429 (436) 6 PRK13607 proline dipeptidase; 100.0 0 0 379.9 23.3 218 306-540 154-431 (442) 7 PRK12897 methionine aminopepti 100.0 0 0 357.4 18.9 219 310-540 1-241 (248) 8 PRK05716 methionine aminopepti 100.0 0 0 350.5 19.2 229 309-561 1-252 (252) 9 PRK12318 methionine aminopepti 100.0 0 0 348.2 20.0 226 312-559 42-290 (291) 10 PRK12896 methionine aminopepti 100.0 0 0 339.7 19.0 230 306-557 2-253 (254) 11 cd01090 Creatinase Creatine am 100.0 4.5E-44 0 327.0 19.4 210 319-539 1-221 (228) 12 cd01092 APP-like Similar to Pr 100.0 3.1E-44 0 328.0 18.4 205 319-540 1-207 (208) 13 cd01086 MetAP1 Methionine Amin 100.0 6.2E-44 0 326.0 18.3 216 319-557 1-238 (238) 14 cd01087 Prolidase Prolidase. E 100.0 1.1E-43 0 324.2 18.1 204 319-539 1-236 (243) 15 KOG2737 consensus 100.0 4.4E-43 0 320.0 18.3 366 167-569 9-478 (492) 16 KOG2414 consensus 100.0 7.7E-40 2E-44 297.5 19.4 354 158-539 42-464 (488) 17 pfam00557 Peptidase_M24 Metall 100.0 2.5E-39 6.3E-44 293.9 18.7 204 320-538 1-207 (207) 18 cd01091 CDC68-like Related to 100.0 1.3E-39 3.2E-44 296.0 16.4 215 319-540 1-236 (243) 19 PRK07281 methionine aminopepti 100.0 4.2E-38 1.1E-42 285.4 18.4 228 310-559 1-281 (286) 20 cd01066 APP_MetAP A family inc 100.0 3.8E-37 9.7E-42 278.7 17.3 204 319-540 1-206 (207) 21 cd01089 PA2G4-like Related to 99.9 7.2E-22 1.8E-26 172.3 17.3 212 319-540 1-222 (228) 22 TIGR00500 met_pdase_I methioni 99.9 3.5E-21 8.9E-26 167.5 18.9 218 312-539 2-257 (265) 23 TIGR02993 ectoine_eutD ectoine 99.9 3E-20 7.6E-25 161.0 19.2 361 171-566 3-389 (391) 24 COG0024 Map Methionine aminope 99.9 1.2E-19 3E-24 156.8 19.4 199 312-521 4-208 (255) 25 PRK08671 methionine aminopepti 99.8 1.8E-17 4.6E-22 141.7 15.4 188 319-526 2-193 (293) 26 KOG1189 consensus 99.8 3.8E-17 9.7E-22 139.4 15.4 269 263-541 82-371 (960) 27 cd01088 MetAP2 Methionine Amin 99.8 6.1E-17 1.6E-21 138.0 15.3 191 319-529 1-195 (291) 28 PRK09795 aminopeptidase; Provi 99.6 2.4E-14 6.2E-19 119.8 11.5 110 16-138 2-113 (361) 29 pfam01321 Creatinase_N Creatin 99.5 8.5E-14 2.2E-18 116.1 11.2 122 17-155 1-124 (127) 30 KOG2738 consensus 99.5 2.7E-12 6.8E-17 105.6 16.0 216 312-539 115-352 (369) 31 PTZ00053 methionine aminopepti 99.4 9.9E-11 2.5E-15 94.7 16.3 193 318-526 121-329 (435) 32 COG5406 Nucleosome binding fac 99.3 5.1E-11 1.3E-15 96.7 11.9 229 300-539 157-410 (1001) 33 TIGR00501 met_pdase_II methion 99.2 3.8E-10 9.7E-15 90.7 11.5 179 342-526 27-224 (327) 34 pfam01321 Creatinase_N Creatin 99.0 1.9E-09 4.7E-14 85.9 6.6 124 182-313 1-127 (127) 35 COG0006 PepP Xaa-Pro aminopept 98.2 1.9E-05 4.8E-10 57.9 9.9 127 14-155 10-151 (384) 36 KOG2775 consensus 97.9 0.00082 2.1E-08 46.6 13.6 173 340-526 102-291 (397) 37 KOG2776 consensus 97.3 0.0043 1.1E-07 41.5 10.5 94 369-464 71-169 (398) 38 PRK12318 methionine aminopepti 96.7 0.011 2.7E-07 38.8 7.4 65 344-414 178-244 (291) 39 KOG2413 consensus 96.6 0.0045 1.1E-07 41.4 5.3 105 182-295 11-130 (606) 40 cd01089 PA2G4-like Related to 96.0 0.064 1.6E-06 33.4 8.3 90 435-527 3-102 (228) 41 TIGR00495 crvDNA_42K DNA-bindi 95.4 0.034 8.6E-07 35.3 5.1 100 369-470 73-184 (407) 42 PRK07281 methionine aminopepti 94.4 0.29 7.4E-06 28.8 7.7 26 388-414 210-235 (286) 43 TIGR00500 met_pdase_I methioni 94.3 0.36 9.1E-06 28.1 7.9 58 343-411 148-218 (265) 44 COG0024 Map Methionine aminope 93.7 0.53 1.4E-05 26.9 7.9 57 343-411 139-204 (255) 45 PRK12897 methionine aminopepti 92.6 0.91 2.3E-05 25.3 8.5 25 389-414 180-204 (248) 46 TIGR01406 dnaQ_proteo DNA poly 89.0 0.67 1.7E-05 26.2 4.2 14 398-411 22-35 (246) 47 TIGR03241 arg_catab_astB succi 88.3 1.5 3.7E-05 23.9 5.5 310 15-366 41-429 (443) 48 TIGR02993 ectoine_eutD ectoine 87.6 0.84 2.1E-05 25.6 4.0 152 13-193 10-180 (391) 49 pfam05195 AMP_N Aminopeptidase 85.0 3.1 7.9E-05 21.6 5.7 16 178-193 5-20 (134) 50 PRK05928 hemD uroporphyrinogen 84.9 3.2 8.1E-05 21.5 9.0 68 15-91 10-77 (252) 51 TIGR00452 TIGR00452 methyltran 82.1 4.1 0.0001 20.8 6.2 24 287-311 134-157 (316) 52 pfam07385 DUF1498 Protein of u 76.1 3 7.7E-05 21.7 3.2 54 498-556 148-203 (225) 53 COG3642 Mn2+-dependent serine/ 73.0 3.2 8.1E-05 21.6 2.6 31 385-416 109-139 (204) 54 PRK10153 DNA-binding transcrip 72.9 6.8 0.00017 19.2 4.3 77 113-217 22-100 (512) 55 PRK07009 consensus 69.5 3.8 9.8E-05 21.0 2.4 34 448-487 121-154 (241) 56 pfam05770 Ins134_P3_kin Inosit 67.5 7.9 0.0002 18.8 3.7 85 113-211 5-93 (307) 57 PRK06859 consensus 66.5 4.7 0.00012 20.4 2.4 34 448-487 129-162 (253) 58 TIGR02669 SpoIID_LytB SpoIID/L 65.6 1.9 4.8E-05 23.1 0.2 11 480-492 284-294 (323) 59 TIGR01283 nifE nitrogenase MoF 63.2 7.5 0.00019 18.9 2.9 57 155-218 78-146 (470) 60 PRK08858 consensus 62.8 6.3 0.00016 19.5 2.5 32 449-486 137-168 (246) 61 PRK09605 O-sialoglycoprotein e 62.6 6.9 0.00018 19.2 2.6 17 376-392 278-294 (536) 62 TIGR01982 UbiB 2-polyprenylphe 61.8 12 0.00031 17.5 4.9 66 387-471 291-371 (452) 63 PRK13281 succinylarginine dihy 59.2 13 0.00034 17.2 5.6 308 14-365 42-427 (443) 64 PRK06022 consensus 58.8 8.1 0.00021 18.7 2.4 35 448-488 119-153 (234) 65 PRK13308 ureC urease subunit a 58.2 8.2 0.00021 18.7 2.4 64 468-532 402-488 (569) 66 PRK07982 consensus 57.0 9 0.00023 18.4 2.4 33 449-487 132-164 (243) 67 pfam08003 Methyltransf_9 Prote 56.2 15 0.00038 16.9 7.7 30 164-193 183-212 (315) 68 PRK08816 consensus 56.2 9.5 0.00024 18.2 2.4 33 449-487 123-155 (244) 69 PRK04183 glutamyl-tRNA(Gln) am 54.7 8 0.0002 18.7 1.9 58 173-243 131-189 (421) 70 PRK09238 bifunctional aconitat 54.5 11 0.00027 17.8 2.5 52 185-239 211-273 (841) 71 PRK08444 hypothetical protein; 53.9 2.7 6.8E-05 22.1 -0.7 39 315-363 187-230 (353) 72 pfam04996 AstB Succinylarginin 53.4 16 0.00042 16.6 5.7 307 15-366 42-423 (439) 73 PRK05927 hypothetical protein; 53.2 1.9 5E-05 23.0 -1.5 15 349-363 212-226 (350) 74 PRK13309 ureC urease subunit a 53.0 12 0.00032 17.4 2.6 62 468-530 405-489 (572) 75 PRK11678 putative chaperone; P 52.3 6 0.00015 19.6 0.9 18 405-426 209-226 (450) 76 PRK05711 DNA polymerase III su 51.6 13 0.00032 17.4 2.5 32 449-486 126-157 (240) 77 PRK06631 consensus 50.1 14 0.00035 17.1 2.4 32 449-486 126-157 (229) 78 PRK00683 murD UDP-N-acetylmura 49.8 6.2 0.00016 19.5 0.7 32 405-437 313-353 (418) 79 PRK13985 UreB urease subunit b 49.8 17 0.00043 16.5 2.9 57 467-525 400-478 (568) 80 pfam00486 Trans_reg_C Transcri 49.6 19 0.00048 16.2 3.8 60 124-197 5-64 (77) 81 TIGR01527 arch_NMN_Atrans nico 48.1 15 0.00037 16.9 2.4 13 178-190 47-59 (171) 82 KOG2551 consensus 47.8 18 0.00046 16.3 2.8 42 163-205 5-46 (230) 83 PRK10955 DNA-binding transcrip 45.9 21 0.00054 15.8 3.2 10 482-491 219-228 (232) 84 TIGR00393 kpsF sugar isomerase 45.5 8.9 0.00023 18.4 0.9 21 405-426 189-212 (272) 85 COG1056 NadR Nicotinamide mono 45.0 11 0.00029 17.7 1.4 27 178-204 51-79 (172) 86 PRK13206 ureC urease subunit a 44.8 17 0.00045 16.4 2.3 58 467-525 405-483 (573) 87 cd00375 Urease_alpha Urease al 44.7 18 0.00045 16.4 2.3 71 467-538 400-495 (567) 88 KOG2670 consensus 42.8 24 0.0006 15.5 3.6 12 314-325 220-231 (433) 89 KOG3303 consensus 40.4 26 0.00065 15.2 2.8 18 325-342 81-98 (192) 90 pfam02016 Peptidase_S66 LD-car 39.7 26 0.00067 15.2 2.6 24 178-201 45-68 (281) 91 smart00870 Asparaginase Aspara 39.1 20 0.0005 16.0 1.9 57 175-243 58-115 (323) 92 TIGR02870 spore_II_D stage II 39.1 9.6 0.00025 18.2 0.3 10 481-492 319-328 (358) 93 pfam04378 DUF519 Protein of un 39.0 26 0.00066 15.2 2.5 38 432-470 133-178 (245) 94 PRK08445 hypothetical protein; 37.4 4.1 0.0001 20.8 -1.8 15 349-363 209-223 (348) 95 PRK04750 ubiB putative ubiquin 37.2 29 0.00073 14.9 7.4 69 384-473 280-365 (540) 96 PRK05014 hscB co-chaperone Hsc 37.1 10 0.00026 18.0 0.2 49 168-216 30-78 (171) 97 PRK09125 DNA ligase; Provision 36.9 29 0.00074 14.9 3.4 13 175-187 38-50 (277) 98 pfam10450 POC1 POC1 chaperone. 36.6 7.4 0.00019 19.0 -0.6 11 430-440 195-205 (223) 99 PRK00294 hscB co-chaperone Hsc 36.6 9.9 0.00025 18.1 0.0 50 168-217 33-82 (173) 100 pfam05422 SIN1 Stress-activate 36.2 25 0.00064 15.3 2.0 10 178-187 98-107 (482) 101 TIGR03290 CoB_CoM_SS_C CoB--Co 35.9 27 0.00068 15.1 2.1 10 479-488 91-100 (144) 102 cd07356 HN_L-whirlin_R1_like F 35.7 30 0.00077 14.7 4.5 33 567-599 13-54 (78) 103 KOG0582 consensus 35.1 31 0.00079 14.7 4.0 17 154-170 98-114 (516) 104 PRK03578 hscB co-chaperone Hsc 35.0 11 0.00027 17.9 -0.0 49 168-216 35-83 (176) 105 PRK13207 ureC urease subunit a 34.3 29 0.00073 14.9 2.1 64 468-532 401-487 (568) 106 COG1049 AcnB Aconitase B [Ener 33.8 32 0.00082 14.5 2.4 77 154-239 187-274 (852) 107 COG0684 MenG Demethylmenaquino 33.8 32 0.00082 14.5 3.4 77 372-480 45-121 (210) 108 cd05297 GH4_alpha_glucosidase_ 33.3 25 0.00065 15.2 1.7 44 380-438 329-372 (423) 109 PRK13191 putative peroxiredoxi 32.8 33 0.00085 14.4 4.5 48 166-213 38-90 (230) 110 PRK04663 murD UDP-N-acetylmura 32.3 34 0.00087 14.4 2.4 32 404-436 338-378 (438) 111 PRK05926 hypothetical protein; 32.3 5.2 0.00013 20.1 -2.0 15 349-363 235-249 (371) 112 KOG0996 consensus 31.6 27 0.00069 15.1 1.6 18 117-134 198-215 (1293) 113 COG3737 Uncharacterized conser 31.2 35 0.0009 14.2 3.4 50 378-445 16-65 (127) 114 PRK06995 flhF flagellar biosyn 31.2 35 0.0009 14.2 2.5 23 13-38 10-32 (404) 115 pfam01307 Plant_vir_prot Plant 31.0 24 0.00062 15.4 1.3 19 412-434 44-62 (98) 116 PRK11835 hypothetical protein; 31.0 17 0.00043 16.5 0.4 12 228-239 74-85 (114) 117 PTZ00075 S-adenosyl-L-homocyst 30.9 35 0.0009 14.3 2.1 46 280-325 260-306 (476) 118 TIGR02405 trehalos_R_Ecol treh 30.6 11 0.00029 17.7 -0.5 17 509-526 271-287 (311) 119 PHA00451 protein kinase 30.6 36 0.00093 14.2 2.4 22 124-145 52-73 (364) 120 TIGR01419 nitro_reg_IIA PTS II 30.2 37 0.00094 14.1 3.2 29 383-413 56-87 (149) 121 cd02166 NMNAT_Archaea This fam 30.1 37 0.00094 14.1 4.7 65 120-203 8-75 (163) 122 TIGR02902 spore_lonB ATP-depen 30.1 36 0.00093 14.2 2.0 21 113-135 25-45 (532) 123 PRK10816 DNA-binding transcrip 29.8 37 0.00095 14.1 2.8 43 450-493 174-222 (223) 124 TIGR01820 TrpE-arch anthranila 29.3 5.2 0.00013 20.1 -2.4 94 379-490 324-449 (508) 125 TIGR01825 gly_Cac_T_rel pyrido 29.2 12 0.00032 17.4 -0.5 34 262-295 81-115 (392) 126 TIGR01170 rplA_mito ribosomal 28.0 40 0.001 13.9 2.6 48 141-195 80-129 (155) 127 pfam12095 DUF3571 Protein of u 28.0 23 0.00057 15.6 0.7 16 515-531 68-83 (83) 128 PRK06849 hypothetical protein; 27.2 41 0.0011 13.8 2.9 74 134-213 21-94 (387) 129 cd07025 Peptidase_S66 LD-Carbo 27.2 41 0.0011 13.8 2.2 28 174-201 42-69 (282) 130 PRK00141 murD UDP-N-acetylmura 26.8 37 0.00095 14.1 1.6 31 406-437 359-398 (476) 131 PRK13928 rod shape-determining 26.6 41 0.001 13.8 1.8 18 404-425 146-163 (325) 132 pfam02388 FemAB FemAB family. 26.4 43 0.0011 13.7 5.1 28 129-156 84-111 (407) 133 COG1355 Predicted dioxygenase 26.3 43 0.0011 13.7 3.3 30 172-205 56-85 (279) 134 COG0312 TldD Predicted Zn-depe 26.3 39 0.00099 14.0 1.6 42 481-527 327-375 (454) 135 cd07062 Peptidase_S66_mccF_lik 26.2 43 0.0011 13.7 2.6 25 177-201 49-73 (308) 136 pfam10565 NMDAR2_C N-methyl D- 25.7 40 0.001 13.9 1.6 27 181-207 246-272 (660) 137 PRK04308 murD UDP-N-acetylmura 25.6 30 0.00078 14.7 1.0 17 18-35 18-34 (445) 138 PRK03803 murD UDP-N-acetylmura 25.6 40 0.001 13.9 1.6 34 404-438 345-387 (448) 139 PRK02006 murD UDP-N-acetylmura 25.6 30 0.00076 14.8 0.9 35 404-439 390-433 (501) 140 cd00465 URO-D_CIMS_like The UR 25.6 28 0.0007 15.0 0.8 44 441-488 244-287 (306) 141 TIGR02480 fliN flagellar motor 25.4 40 0.001 13.9 1.6 26 503-534 49-74 (77) 142 TIGR02154 PhoB phosphate regul 25.3 45 0.0011 13.6 1.9 74 236-312 5-84 (226) 143 cd00383 trans_reg_C Effector d 25.2 45 0.0011 13.5 2.9 70 117-200 14-85 (95) 144 PRK06724 hypothetical protein; 25.2 39 0.001 14.0 1.5 16 517-532 76-94 (102) 145 TIGR01584 citF citrate lyase, 25.2 45 0.0011 13.5 2.1 117 285-410 196-351 (496) 146 COG3536 Uncharacterized protei 25.1 44 0.0011 13.6 1.7 14 513-530 62-75 (120) 147 COG3710 CadC DNA-binding winge 25.1 45 0.0011 13.5 5.1 65 124-203 31-96 (148) 148 TIGR01163 rpe ribulose-phospha 25.0 45 0.0011 13.5 6.6 66 110-204 50-116 (216) 149 PRK06201 hypothetical protein; 24.8 46 0.0012 13.5 2.1 58 401-483 76-136 (221) 150 KOG0622 consensus 24.7 46 0.0012 13.5 6.4 91 313-416 151-256 (448) 151 TIGR01792 urease_alph urease, 24.7 22 0.00055 15.7 0.1 64 468-533 393-498 (605) 152 PRK13731 conjugal transfer sur 24.4 46 0.0012 13.4 2.5 58 147-208 35-92 (243) 153 COG2044 Predicted peroxiredoxi 24.3 46 0.0012 13.4 2.2 37 197-241 6-42 (120) 154 KOG3892 consensus 24.0 19 0.00049 16.1 -0.3 39 168-206 77-118 (407) 155 COG2179 Predicted hydrolase of 23.7 48 0.0012 13.4 5.4 90 105-202 20-118 (175) 156 KOG1684 consensus 23.7 33 0.00085 14.4 0.9 14 59-72 42-55 (401) 157 TIGR03551 F420_cofH 7,8-dideme 23.6 44 0.0011 13.6 1.5 42 315-363 177-220 (343) 158 COG1212 KdsB CMP-2-keto-3-deox 23.5 48 0.0012 13.3 2.4 66 99-193 25-90 (247) 159 TIGR02145 Fib_succ_major Fibro 23.1 43 0.0011 13.7 1.3 20 90-112 7-26 (245) 160 PRK10529 DNA-binding transcrip 22.8 50 0.0013 13.2 3.6 27 463-490 191-222 (225) 161 PRK01773 hscB co-chaperone Hsc 22.7 26 0.00067 15.2 0.2 46 168-213 31-76 (173) 162 cd00717 URO-D Uroporphyrinogen 22.4 38 0.00097 14.0 1.0 44 441-488 273-318 (335) 163 PRK12727 flagellar biosynthesi 22.3 51 0.0013 13.2 2.8 23 13-38 10-32 (557) 164 COG4668 MtlA Mannitol/fructose 22.0 34 0.00088 14.3 0.7 51 384-439 54-108 (142) 165 PRK13299 tRNA CCA-pyrophosphor 21.6 35 0.0009 14.3 0.7 30 576-605 361-393 (394) 166 COG3290 CitA Signal transducti 21.6 26 0.00066 15.2 -0.0 57 461-524 471-531 (537) 167 KOG2679 consensus 21.6 52 0.0013 13.1 3.1 23 198-224 78-100 (336) 168 cd01576 AcnB_Swivel Aconitase 21.6 42 0.0011 13.8 1.0 78 153-240 16-104 (131) 169 TIGR00110 ilvD dihydroxy-acid 21.5 42 0.0011 13.7 1.0 45 558-605 541-586 (601) 170 smart00862 Trans_reg_C Transcr 21.5 52 0.0013 13.1 3.9 58 125-195 6-63 (78) 171 PRK00290 dnaK molecular chaper 21.0 39 0.00098 14.0 0.8 63 370-444 355-425 (631) 172 TIGR01363 strep_his_triad stre 20.8 43 0.0011 13.7 1.0 18 307-324 144-161 (376) 173 cd03575 NTR_WFIKKN NTR domain, 20.6 55 0.0014 12.9 2.0 26 167-192 84-109 (109) 174 TIGR01607 PST-A Plasmodium sub 20.5 39 0.00098 14.0 0.7 25 156-180 66-90 (379) 175 PRK08130 putative aldolase; Va 20.4 55 0.0014 12.9 3.2 17 401-417 50-66 (213) 176 PRK12724 flagellar biosynthesi 20.3 55 0.0014 12.9 2.2 19 20-38 14-32 (432) 177 KOG0821 consensus 20.2 37 0.00095 14.1 0.5 14 80-93 19-32 (326) 178 TIGR02551 SpaO_YscQ type III s 20.1 56 0.0014 12.9 1.8 19 400-418 161-179 (326) No 1 >KOG2413 consensus Probab=100.00 E-value=0 Score=1349.14 Aligned_cols=586 Identities=38% Similarity=0.628 Sum_probs=553.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCCEEEEEECCCCEEEECCHHHHHHHHHHCCCC Q ss_conf 58999999999997499799960888747860667106100441467650077895783679978814899998619671 Q gi|254780605|r 14 TFERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSAGIAIVLRQKSVIFVDGRYTLQVEKEVDTAL 93 (609) Q Consensus 14 ~~erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSaG~aiIt~~~a~L~tDgRY~lQA~~el~~~~ 93 (609) ..-++.++|++|+.++|+||||||+|+|||||++++|+||+||||||||||+||||.++|+||||||||.||++|+|++| T Consensus 8 ~~~~~~~~~~~~~~~~i~aYi~Ps~DaH~sEy~~~~D~R~~flsGFsGsag~Avit~~~a~lwtD~RY~~QA~~qld~~W 87 (606) T KOG2413 8 ELFELMRLRELMKSPPIDAYILPSTDAHQSEYIADRDERRAFLSGFSGSAGTAVITEEEAALWTDGRYFQQAEQQLDSNW 87 (606) T ss_pred HHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHCCHHHHHHHHCCCCCCCCEEEEECCCCEEEECCHHHHHHHHHHCCCC T ss_conf 99999999987357993399846871334334231456666312667876058983474048875689998775315440 Q ss_pred EEEEECCC-CCHHHHHHHHCCCCCEEEECCEECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCH Q ss_conf 79986689-99999888743799889965634139999999998533476199705001112201276332200012012 Q gi|254780605|r 94 FTIKNIAI-EPLHAWISEHGFVGLRLGLDSRLHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDM 172 (609) Q Consensus 94 ~~~~~~~~-~~~~~wl~~~~~~g~~IG~Dp~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~ 172 (609) ..++.... +++.+||.++++.+++||+||+|+|+..|+.+.+.+..++..|+++..||||+||.+||.+|.+++..++. T Consensus 88 ~l~k~~~~~~~v~~wl~~~l~~~~~vG~Dp~Lis~~~~~~~~~~l~s~~~~Lv~i~~nLVD~iW~~rP~~~~~~v~~l~~ 167 (606) T KOG2413 88 TLMKMGEDVPTVEEWLAKVLPEGSRVGIDPTLISFDAWKQLEKSLTSKGLELVPIPGNLVDEIWGDRPERPGNPVIVLDL 167 (606) T ss_pred EEEECCCCCCCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHCCCEEEECCCCCHHHHHCCCCCCCCCCEEEEEC T ss_conf 03001699865999999767786603648643050677768988754787276046630545523698567985699622 Q ss_pred HHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHH Q ss_conf 33587738899999986023343067602842023443201355377870023322014786200035311215678741 Q gi|254780605|r 173 AYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLKALL 252 (609) Q Consensus 173 ~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~~~l 252 (609) +|+|.++.+|++.+|+.|+.++++++++|.+|+||||+||||+||||||||+||++|+.+ ++.||+|..|++....+++ T Consensus 168 ~~~G~~~~~Kv~~LR~~l~~~~~~a~Vvs~LdeIaWllNLRGsDipynPv~~sY~~it~d-ei~lfvd~~k~~~~~~~~~ 246 (606) T KOG2413 168 EFAGLSVDDKVDNLRKKLKEKKCDAFVVTALDEIAWLLNLRGSDIPYNPVFYSYAIITMD-EIFLFVDNSKLSDESKKHL 246 (606) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH-HHHEEECCCCCCCHHHHHH T ss_conf 015863347799999997633885885135788999985326767888203443123022-1000104764671668888 Q ss_pred CCCC-EEECHHHHHHHHHHCCCC--CCEEEECCCCCCHHHHHHHHHCCCEEECCCCCCHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 2563-570744515778630135--7428972531778897531213956751543211566619999998799998443 Q gi|254780605|r 253 SAVA-IVLDMDMMDSRLVCLART--SMPILIDPKWISYRFFKVIAQKNGVMVEGSDPSCLLRATKNKVEIEGMQTAHIQD 329 (609) Q Consensus 253 ~~~~-~~~~~~~~~~~l~~l~~~--~~~i~id~~~~s~~~~~~l~~~~~~~i~~~~pI~~lRaiKn~~EI~~mr~Ahi~D 329 (609) .... .+.+|+.+...+..+... ..++++.+ .+++.+...+.+.+. +...+||..+||+||+.|+++||.||||| T Consensus 247 ~~~~v~i~pY~~i~~~i~~~~~~~~~~~i~ia~-~~~~~i~~~i~~~~~--~~~~Spi~~~kAiKN~~E~~gmr~shirD 323 (606) T KOG2413 247 REDGVEIRPYDQIWSDIKNWASAFADKKIWISP-ETNYGIGELIGEDHS--MIDPSPISRAKAIKNDDELKGMRNSHIRD 323 (606) T ss_pred HHCCEEEEEHHHHHHHHHHHHCCCCCEEEEECC-CCEEEECCCCCCCCC--CCCCCHHHHHHHHCCHHHHHHHHHCCHHH T ss_conf 647426623899988898874146750675043-210000045554521--45567889999855768763556531035 Q ss_pred HHHHHHHHHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCEEECCCC Q ss_conf 899999987533210012--453018899999998754330278555541000012-33543222245543320503898 Q gi|254780605|r 330 GVAMVYFLFWFYSQSLET--ITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAAS-GPHAAIIHYQATVQSNRLLQKDE 406 (609) Q Consensus 330 gvAl~kfl~wL~~~~~~~--iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~-G~N~AipHY~~~~~s~~~i~~~~ 406 (609) |+|+|+||+||+.+...+ +||.++|.+||+||++ ++.|.|+||+||+++ |||||++||+|.+++|++|.+.+ T Consensus 324 ~~Alve~~~wle~~~~~g~~itE~~~A~kle~fR~~-----~~~fmglSFeTIS~s~G~NgAviHYsP~~e~n~~i~~~k 398 (606) T KOG2413 324 GAALVEYFAWLEKELHKGYTITEYDAADKLEEFRSR-----QDHFMGLSFETISSSVGPNGAVIHYSPPAETNRIVSPDK 398 (606) T ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH-----HCCCCCCCCCEEECCCCCCCEEEECCCCCCCCCEECCCE T ss_conf 899999999876653037620434488899999876-----442457671002246788850352079854563206763 Q ss_pred EEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 79983032010561222143210378888843336677888889972178978888999987366774186547764323 Q gi|254780605|r 407 LLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHG 486 (609) Q Consensus 407 ~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHG 486 (609) +|||||||||++||||||||+|||+||+++|++||+||||||++++++||+|+.|..||++||..||+.|+||.|||||| T Consensus 399 iyL~DSGaQY~DGTTDvTRT~HfgePs~eek~~yT~VLkGhi~la~~vFP~~t~g~~lD~laR~~LW~~gLDy~HgTGHG 478 (606) T KOG2413 399 IYLCDSGAQYLDGTTDVTRTVHFGEPTAEEKEAYTLVLKGHIALARAVFPKGTKGSVLDALARSALWKAGLDYGHGTGHG 478 (606) T ss_pred EEEECCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 79980686424686542477866999889999999998743676521168999850069999999986066668888766 Q ss_pred CCCCCCCCCCCCCCCCC---CCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCHH Q ss_conf 21135607788623778---986005698788468603568752150025999727877877664242210116530011 Q gi|254780605|r 487 VGSFLPVHEGPQGISRT---NQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPETINNGECLMLGFNTLTLCPIDRK 563 (609) Q Consensus 487 VG~~l~VHE~P~~is~~---~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~~~~~~~~~fl~Fe~lT~vP~~~~ 563 (609) |||||||||+|++||.+ ++.+|++|||+|+|||||..|.|||||||+++|+++.+.++- +.||+||+||+||||++ T Consensus 479 VG~fLnVhE~P~~is~r~~~~~~~l~ag~~~s~EPGYY~dg~fGIRienv~~vvd~~~~~~~-~~~L~fe~lT~vP~q~k 557 (606) T KOG2413 479 VGSFLNVHEGPIGIGYRPYSSNFPLQAGMVFSIEPGYYKDGEFGIRIENVVEVVDAGTKHNF-RGFLTFEPLTLVPYQTK 557 (606) T ss_pred CCCCEEECCCCCEEEEEECCCCCHHCCCEEECCCCCCCCCCCCEEEEEEEEEEEECCCCCCC-CCEEEECCCEECCEECC T ss_conf 56506741588610012458872003853862688611068642888648999865424344-34022311025532656 Q ss_pred CCCHHHCCHHHHHHHHHHHHHHHHHHHHHCCC---HHHHHHHHHHCCCC Q ss_conf 07867789999999999999999997441180---79999999732579 Q gi|254780605|r 564 LILVELLTNEEKKWCNDYHRRVYTSLAPLIED---QEVLSWLFSVTAPI 609 (609) Q Consensus 564 li~~~ll~~~e~~wln~Yh~~v~~~~~p~l~~---~~~~~wl~~~t~~i 609 (609) |||.+||+++|++|||.||++|+++++|+|+. .++++||.++|+|| T Consensus 558 lid~~LLs~eE~~~LN~Yh~~v~~~i~~~L~~~~~~~~~~WL~~~t~Pi 606 (606) T KOG2413 558 LIDKSLLSEEEINWLNEYHAKVRSKIGPELQKEGRMEEYKWLINATQPI 606 (606) T ss_pred CCCHHHCCHHHHHHHHHHHHHHHHHCCHHHHHCCCHHHHHHHHHHCCCC T ss_conf 4772119999998999988999975209774303116788898625689 No 2 >cd01085 APP X-Prolyl Aminopeptidase 2. E.C. 3.4.11.9. Also known as X-Pro aminopeptidase, proline aminopeptidase, aminopeptidase P, and aminoacylproline aminopeptidase. Catalyses release of any N-terminal amino acid, including proline, that is linked with proline, even from a dipeptide or tripeptide. Probab=100.00 E-value=0 Score=587.91 Aligned_cols=220 Identities=54% Similarity=0.897 Sum_probs=209.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC--CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7999984438999999875332100--12453018899999998754330278555541000012335432222455433 Q gi|254780605|r 321 GMQTAHIQDGVAMVYFLFWFYSQSL--ETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQS 398 (609) Q Consensus 321 ~mr~Ahi~DgvAl~kfl~wL~~~~~--~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s 398 (609) +||+||++||+|||+||+||+++.. +.+||++++++|++||++ +.+|.++||+||||+|+|||+|||+|++.+ T Consensus 1 g~r~ahi~D~~A~~~~l~wl~~~~~~~~~~TE~~~a~~le~~r~~-----~~g~~~~SF~tIva~G~naA~pHy~~~~~~ 75 (224) T cd01085 1 GMRAAHIRDGVALVEFLAWLEQEVPKGETITELSAADKLEEFRRQ-----QKGYVGLSFDTISGFGPNGAIVHYSPTEES 75 (224) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH-----CCCCCCCCHHHHHHCCCCCCCCCCCCCCCC T ss_conf 968999999999999999998711258973799999999999987-----799878783779848865204677888566 Q ss_pred CEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 20503898799830320105612221432103788888433366778888899721789788889999873667741865 Q gi|254780605|r 399 NRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD 478 (609) Q Consensus 399 ~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d 478 (609) |++|++|+++|||+||+|.+|||||||||++|+|+++||++|++||+||+++++++||+|++|.++|.+||++||++|+| T Consensus 76 ~~~i~~gdlvLiD~G~~Y~gy~SDiTRT~~~G~~s~~~~~~Y~~Vl~a~~~~~~~~~p~g~~~~~lD~~aR~~l~~~G~~ 155 (224) T cd01085 76 NRKISPDGLYLIDSGGQYLDGTTDITRTVHLGEPTAEQKRDYTLVLKGHIALARAKFPKGTTGSQLDALARQPLWKAGLD 155 (224) T ss_pred CCCCCCCCEEEECCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC T ss_conf 73336898367727922516446655342069999999999999999999999998727985899999999999974988 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCC-CCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECCCCCCC Q ss_conf 47764323211356077886237-78986005698788468603568752150025999727877877 Q gi|254780605|r 479 FAHGVGHGVGSFLPVHEGPQGIS-RTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPETINNG 545 (609) Q Consensus 479 y~HgtGHGVG~~l~VHE~P~~is-~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~~~~~~ 545 (609) |.||||||||+||+|||+|++|+ +.++.+|+||||+|||||||+||+|||||||+|+||+.++++++ T Consensus 156 y~HgtGHGiG~~L~vHE~P~~~~~~~~~~~L~~Gmv~TiEPGiY~~g~~GiRIEd~v~Vt~~~~~~~~ 223 (224) T cd01085 156 YGHGTGHGVGSFLNVHEGPQSISPAPNNVPLKAGMILSNEPGYYKEGKYGIRIENLVLVVEAETTEFG 223 (224) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCEEEECCCEEEECCCEEEEEECCCCCCC T ss_conf 88799998899877779986667789996628997996589848889737977047999978877888 No 3 >COG0006 PepP Xaa-Pro aminopeptidase [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=542.89 Aligned_cols=359 Identities=29% Similarity=0.394 Sum_probs=290.9 Q ss_pred CCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCHHH-HHHHHCCCCCCEEECCCCCCCHHHHHHHC Q ss_conf 58773889999998602334306760284202344320135537787002-33220147862000353112156787412 Q gi|254780605|r 175 AGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPYPL-SRAILYADGKAEIFFDKQYINEQLKALLS 253 (609) Q Consensus 175 aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv~~-a~aiI~~~~~~~Lfvd~~k~~~~~~~~l~ 253 (609) .......++.+++..|..++.|+++++++.|++||+.++. .+... ..+++..+++..||++............. T Consensus 6 ~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~n~~yltg~~~-----~~~~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~ 80 (384) T COG0006 6 ADEEYRARLARLRELMEEAGLDALLLTSPSNFYYLTGFDA-----FGFERLQALLVPAEGEPVLFVRGRDEEAAKETSWI 80 (384) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEECCCC-----CCCCCCEEEEEECCCCEEEEECCHHHHHHHHHCCC T ss_conf 4455567999999998767987799527764044304676-----66555315897069861999661646677553112 Q ss_pred ---CCCEEECHHH-------HHHHHHHCCCCCCEEEECCCC--CCHHHHHHHHHCCC--EEECCCCCCHHHHHCCCHHHH Q ss_conf ---5635707445-------157786301357428972531--77889753121395--675154321156661999999 Q gi|254780605|r 254 ---AVAIVLDMDM-------MDSRLVCLARTSMPILIDPKW--ISYRFFKVIAQKNG--VMVEGSDPSCLLRATKNKVEI 319 (609) Q Consensus 254 ---~~~~~~~~~~-------~~~~l~~l~~~~~~i~id~~~--~s~~~~~~l~~~~~--~~i~~~~pI~~lRaiKn~~EI 319 (609) ....+..... ....+.........++++... .++..+..+..... +++...+++..+|+|||+.|| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~i~~lR~iKs~~EI 160 (384) T COG0006 81 KLENVEVYEDDEDPAAPLDLLGALLEELGLAGKRIGIESASIFLTLAAFERLQAALPRAELVDASDLVDRLRLIKSPAEI 160 (384) T ss_pred CCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCEEECCHHHHHHHHHCCCHHHH T ss_conf 35674277525554456889999998754467662521002324288899998646776575027789998855999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 87999984438999999875332100124530188999999987543302785555410000123354322224554332 Q gi|254780605|r 320 EGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQSN 399 (609) Q Consensus 320 ~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s~ 399 (609) +.||+|+..+++|+.+++.|+. .++||.+++.+++..+.+.| ..++||+|||++|+|+|+|||+| ++ T Consensus 161 ~~ir~A~~i~~~a~~~~~~~~~----~G~tE~ev~a~l~~~~~~~G------~~~~sf~~iva~G~n~a~pH~~~---~~ 227 (384) T COG0006 161 AKIRKAAEIADAALEAALEAIR----PGMTEAEIAAELEYALRKGG------AEGPSFDTIVASGENAALPHYTP---SD 227 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----CCCCHHHHHHHHHHHHHHCC------CCCCCCCCEEEECCCCCCCCCCC---CC T ss_conf 9999999999999999998610----79979999999999999808------87577475787654335656788---88 Q ss_pred EEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC-- Q ss_conf 050389879983032010561222143210378888843336677888889972178978888999987366774186-- Q gi|254780605|r 400 RLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGA-- 477 (609) Q Consensus 400 ~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~-- 477 (609) +++++|+++|||+||+|.+|||||||||++|+|+++|+++|++|++||.+...+++| |+++.++|.+||++||++|+ T Consensus 228 ~~~~~gd~vliD~G~~~~gY~sDiTRT~~~G~~s~~~~~iy~~V~~aq~aa~~~~~p-G~~~~~vd~~ar~~i~~~g~~~ 306 (384) T COG0006 228 RKLRDGDLVLIDLGGVYNGYCSDITRTFPIGKPSDEQREIYEAVLEAQEAAIAAIRP-GVTGGEVDAAARQVLEKAGYGL 306 (384) T ss_pred CCCCCCCEEEEEECEEECCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHCCCC T ss_conf 766799989999460689811563568976999999999999999999999987279-9959999999999999714676 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCC Q ss_conf 54776432321135607788623778986005698788468603568752150025999727877877664242210116 Q gi|254780605|r 478 DFAHGVGHGVGSFLPVHEGPQGISRTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPETINNGECLMLGFNTLTL 557 (609) Q Consensus 478 dy~HgtGHGVG~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~~~~~~~~~fl~Fe~lT~ 557 (609) +|.||||||||++|+|||+|+.+++++..+|+||||+|+|||+|+||.+||||||+|+||+. +||+|| T Consensus 307 ~~~h~~GHgvG~~l~vhE~p~~~~~~~~~~L~~GMv~t~Epg~y~~g~~GiRIEd~vlVte~-----------G~e~LT- 374 (384) T COG0006 307 YFLHGTGHGVGFVLDVHEHPQYLSPGSDTTLEPGMVFSIEPGIYIPGGGGVRIEDTVLVTED-----------GFEVLT- 374 (384) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEEEECCCCCCCCCCEEEEEEEEEECCC-----------CCEECC- T ss_conf 67787477578877555475435789873304982997265565688637987778998589-----------856777- Q ss_pred CCCCHHC Q ss_conf 5300110 Q gi|254780605|r 558 CPIDRKL 564 (609) Q Consensus 558 vP~~~~l 564 (609) .|....+ T Consensus 375 ~~~~~~~ 381 (384) T COG0006 375 RVPKELL 381 (384) T ss_pred CCHHHHH T ss_conf 7626653 No 4 >PRK09795 aminopeptidase; Provisional Probab=100.00 E-value=0 Score=536.43 Aligned_cols=343 Identities=25% Similarity=0.366 Sum_probs=278.3 Q ss_pred HHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHCCCCE-EE Q ss_conf 89999998602334306760284202344320135537787002332201478620003531121567874125635-70 Q gi|254780605|r 181 EKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLKALLSAVAI-VL 259 (609) Q Consensus 181 eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~~~l~~~~~-~~ 259 (609) +||++||+.|+++++|+++||+++|+.||+|++|++ +|++|+.+ +++||+|.++..+ ++.......+ .. T Consensus 2 sRL~~lr~~l~e~~lDallit~~~Ni~yltgf~~~~--------~~llit~~-~~~l~~d~ry~~~-a~~~~~~~~v~~~ 71 (361) T PRK09795 2 TLLASLRDWLKAQQLDAVLLSSRQNKQPHLGISTGS--------GYVVISRE-SAHILVDSRYYAD-VEARAQGYQLHLL 71 (361) T ss_pred CHHHHHHHHHHHCCCCEEEECCHHHHHHHHCCCCCC--------EEEEEECC-CCEEEECCHHHHH-HHHHCCCCEEEEE T ss_conf 759999999998799999986725765740966688--------59999479-8689987426898-9863879769998 Q ss_pred -CHHHHHHHHHHCC--CCCCEEEECCCCCCHHHHHHHHHC-CCEEECCCCCCHHHHHCCCHHHHHHHHHHH-HHHHHHHH Q ss_conf -7445157786301--357428972531778897531213-956751543211566619999998799998-44389999 Q gi|254780605|r 260 -DMDMMDSRLVCLA--RTSMPILIDPKWISYRFFKVIAQK-NGVMVEGSDPSCLLRATKNKVEIEGMQTAH-IQDGVAMV 334 (609) Q Consensus 260 -~~~~~~~~l~~l~--~~~~~i~id~~~~s~~~~~~l~~~-~~~~i~~~~pI~~lRaiKn~~EI~~mr~Ah-i~DgvAl~ 334 (609) ..+.+...+..+. ...++++++.+.+++..++.+.+. +.+. ....+..||+|||+.||+.||+|. +.| .|+. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~ig~e~~~~~~~~~~~~~~~~~~~~--v~~~~~~lR~IKs~~EI~~ir~A~~iad-~~~~ 148 (361) T PRK09795 72 DATHTLTTIVNQIIADEQLQTLGFEGQQVSWETAHRWQSELNAKL--VSATPDVLRQIKTPEEVEKIRLACGIAD-RGAE 148 (361) T ss_pred ECCCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEE--CCCCHHHHHCCCCHHHHHHHHHHHHHHH-HHHH T ss_conf 255309999999976557857987324134999999985568567--2567888756799999999999999999-9999 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCC Q ss_conf 99875332100124530188999999987543302785555410000123354322224554332050389879983032 Q gi|254780605|r 335 YFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGA 414 (609) Q Consensus 335 kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~ 414 (609) ..+ +.+..|+||.+++..++...++.| ..+.||+||+++|+|+|+||++| +|++|++||++++|+|+ T Consensus 149 ~~~----~~i~~G~tE~eiaa~~~~~~~~~G------a~~~sf~~iv~sG~~~a~pH~~~---s~r~i~~Gd~v~iD~G~ 215 (361) T PRK09795 149 HIR----RFIQAGMSEREIAAELEWFMRQQG------AEKASFDTIVASGWRGALPHGKA---SDKIVAAGEFVTLDFGA 215 (361) T ss_pred HHH----HHCCCCCCHHHHHHHHHHHHHHCC------CCCCCCCEEECCCCCCCCCCCCC---CCCCCCCCCEEEEEEEE T ss_conf 999----876599889999999999999829------98666660303667774677887---77351689989998758 Q ss_pred CEECCCCCCEEEEEECCCCHH-----HHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC--CCCCCCCCC Q ss_conf 010561222143210378888-----8433366778888899721789788889999873667741865--477643232 Q gi|254780605|r 415 QYVNGTTDITRTIAIGDVDYE-----KKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD--FAHGVGHGV 487 (609) Q Consensus 415 qY~~GtTDiTRT~~~G~~~~e-----~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d--y~HgtGHGV 487 (609) .|.+||||+||||++|.+... +.++|++|++|+.+...+..| |+++.++|.+||++|+++||+ |.||||||| T Consensus 216 ~~~GY~sD~tRT~~ig~~~~~~~~~~l~~~y~~v~ea~~~~~~~~kp-G~~~~evd~~~r~~i~~~G~~~~f~h~~GHgv 294 (361) T PRK09795 216 LYQGYCSDMTRTLLVNGEGVSAESHPLFNVYQIVLQAQLAAISAIRP-GVRCQQVDDAARRVITEAGYGDYFGHNTGHAI 294 (361) T ss_pred EECCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 98978567469999889780576647899999999999999998369-98210689999999998078866789840548 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCHHCCC Q ss_conf 1135607788623778986005698788468603568752150025999727877877664242210116530011078 Q gi|254780605|r 488 GSFLPVHEGPQGISRTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPETINNGECLMLGFNTLTLCPIDRKLIL 566 (609) Q Consensus 488 G~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~~~~~~~~~fl~Fe~lT~vP~~~~li~ 566 (609) | |+|||.| .+++.++.+|+||||||+|||||+||++||||||+|+||+. +.|.||-.| ++||. T Consensus 295 G--l~vhE~P-~~~~~~~~~L~~GMv~tvEPgiy~~g~gGvriEd~vlVT~~-----------G~E~Lt~~p--k~l~i 357 (361) T PRK09795 295 G--IEVHEDP-RFSPRDTTTLQPGMLLTVEPGIYLPGQGGVRIEDVVLVTPQ-----------GAEVLYAMP--KTVLL 357 (361) T ss_pred C--CCCCCCC-CCCCCCCCEECCCCEEEECCEEEECCCCEEEEEEEEEECCC-----------CCEECCCCC--CEEEE T ss_conf 8--8878898-32589998778997999899789769848998879999299-----------546888899--46899 No 5 >PRK10879 proline aminopeptidase P II; Provisional Probab=100.00 E-value=0 Score=428.56 Aligned_cols=353 Identities=22% Similarity=0.308 Sum_probs=248.7 Q ss_pred CCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCH--------------HHHHHHHCC-C---CCCEEE Q ss_conf 7738899999986023343067602842023443201355377870--------------023322014-7---862000 Q gi|254780605|r 177 RESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPY--------------PLSRAILYA-D---GKAEIF 238 (609) Q Consensus 177 ~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv--------------~~a~aiI~~-~---~~~~Lf 238 (609) +.+++++++|.+.|..+.+ +++.+.+. -.|.+|++| |+ +.|.+++.+ + .+.+|| T Consensus 4 ~ey~~RR~~L~~~l~~~s~-~il~s~~~------~~r~~D~~y-pFRQds~F~YLTG~~eP~ailvl~~~d~~~~~~~LF 75 (436) T PRK10879 4 QEFQRRRQALLAQMQPGSA-ALIFAAPE------ATRSADSEY-PYRQNSDFWYFTGFNEPEAVLVLIKSDDTHNHSVLF 75 (436) T ss_pred HHHHHHHHHHHHHCCCCCE-EEEECCCC------CCCCCCCCC-CCCCCCCEEEECCCCCCCEEEEEEECCCCCCEEEEE T ss_conf 9999999999975799938-99928984------456899898-763379626522889989799997069998658997 Q ss_pred CCCCC----------CCH-HHHHHHCCCCEEECHHHHHHHHHHCCCCCCEEEECCCCCCH------HHHHHHHHC----- Q ss_conf 35311----------215-67874125635707445157786301357428972531778------897531213----- Q gi|254780605|r 239 FDKQY----------INE-QLKALLSAVAIVLDMDMMDSRLVCLARTSMPILIDPKWISY------RFFKVIAQK----- 296 (609) Q Consensus 239 vd~~k----------~~~-~~~~~l~~~~~~~~~~~~~~~l~~l~~~~~~i~id~~~~s~------~~~~~l~~~----- 296 (609) +..+- +.. ..++..+ ...+.+.+++...+..+......+........+ ...+.+... T Consensus 76 ~~~~d~~~eiW~G~~~~~e~a~~~~g-vD~~~~~~e~~~~l~~l~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (436) T PRK10879 76 NRVRDLTAEIWFGRRLGQDAAPEKLG-VDRALAFSEINQQLYQLLNGLDVVYHAQGEYAYADEIVFSALEKLRKGSRQNL 154 (436) T ss_pred CCCCCHHHCCCCCCCCCHHHHHHHHC-CCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 17998212633797889899998718-65300488999999999717857996477542035788899999987665146 Q ss_pred --CCEEECCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf --956751543211566619999998799998443899999987533210012453018899999998754330278555 Q gi|254780605|r 297 --NGVMVEGSDPSCLLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRD 374 (609) Q Consensus 297 --~~~~i~~~~pI~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~ 374 (609) ...+++....+..||+||++.||+.||+|-..-+.|+.+.+ +....++||.+++..++....+ .+..+ T Consensus 155 ~~~~~i~dl~~li~elR~IKs~~EI~~mr~A~~It~~a~~~~m----~~~~pG~~E~ei~a~~~~~~~~------~G~~~ 224 (436) T PRK10879 155 TAPATMTDWRPWVHEMRLFKSPEEIAVLRRAGEISALAHTRAM----EKCRPGMFEYQLEGEIHHEFNR------HGARY 224 (436) T ss_pred CCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH----HCCCCCCCHHHHHHHHHHHHHH------CCCCC T ss_conf 7862101348899998667999999999999999999999998----5368887389999999999997------68985 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEE-CCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 5410000123354322224554332050389879983032010561222143210-378888843336677888889972 Q gi|254780605|r 375 IAFNTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAI-GDVDYEKKYYFTLVLKGMISVSTA 453 (609) Q Consensus 375 ~SF~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~-G~~~~e~k~~yt~VLk~~ial~~a 453 (609) ++|++|||+|+||+++||.+ +|+++++|+++|+|.|+.|.+|+|||||||++ |++|++||++|++||.+|.+...+ T Consensus 225 ~ay~~Iva~G~na~~lHy~~---~~~~l~~gdlvliD~G~~~~gY~sDitRT~pv~Gkfs~~q~~iY~~vl~~q~~~i~~ 301 (436) T PRK10879 225 PSYNTIVGSGENGCILHYTE---NESEMRDGDLVLIDAGCEYKGYAGDITRTFPVNGKFTPAQREIYDIVLESLETSLRL 301 (436) T ss_pred CCCCCEECCCCCCCCCCCCC---CCCCCCCCCEEEEECCCEECCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 77454423778775667655---544257777799823711687735753898858968899999999999999999998 Q ss_pred CCCCCCCHHHHHHHHHHHHHHC----CC----------------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCE Q ss_conf 1789788889999873667741----86----------------547764323211356077886237789860056987 Q gi|254780605|r 454 RFPQRTRGCDLDSIARIFLWKY----GA----------------DFAHGVGHGVGSFLPVHEGPQGISRTNQEPLLPGMI 513 (609) Q Consensus 454 ~fp~g~~g~~lD~~aR~~l~~~----g~----------------dy~HgtGHGVG~~l~VHE~P~~is~~~~~~l~~Gmv 513 (609) ..| |++..+++..|++.+.+. |+ -|.||+||||| |+|||.|. +++..+.+|+|||| T Consensus 302 ~kp-Gv~~~di~~~~~~~~~~~l~~lg~~~~~~~~~~~~~~~~~~~~Hg~GH~lG--ldvHD~~~-~~~~~~~~L~~GmV 377 (436) T PRK10879 302 YRP-GTSIQEVTGEVVRIMVSGLVKLGILKGDVDELIAENAHRPFFMHGLSHWLG--LDVHDVGV-YGQDRSRILEPGMV 377 (436) T ss_pred HCC-CCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCCCCCCCCC--CCCCCCCC-CCCCCCCCCCCCCE T ss_conf 369-996999999999999999986687678789997623455777888776689--67778865-57888874799858 Q ss_pred EEECCCEEECCC---------CCEEEEEEEEEEECCCCCCCCCCEEEEEECC-CCCCCHHCCC Q ss_conf 884686035687---------5215002599972787787766424221011-6530011078 Q gi|254780605|r 514 LSNEPGYYRCGA---------FGIRIENVLCVSEPETINNGECLMLGFNTLT-LCPIDRKLIL 566 (609) Q Consensus 514 ~s~EPG~Y~~g~---------~GiRIEn~v~v~~~~~~~~~~~~fl~Fe~lT-~vP~~~~li~ 566 (609) ||+|||+|++.. +||||||+|+||+. ++|+|| -+|-+-+=|. T Consensus 378 ~tiEPGiY~~~~~~~~~~~~giGiRIEDdvliT~~-----------G~e~Lt~~~pk~i~eIE 429 (436) T PRK10879 378 LTVEPGLYIAPDADVPEQYRGIGIRIEDDIVITET-----------GNENLTASVVKKPDEIE 429 (436) T ss_pred EEECCEEEECCCCCCCCCCCCEEEEECCEEEECCC-----------CCEECCCCCCCCHHHHH T ss_conf 99899776578778874127348994207999899-----------78323655899999999 No 6 >PRK13607 proline dipeptidase; Provisional Probab=100.00 E-value=0 Score=379.88 Aligned_cols=218 Identities=28% Similarity=0.351 Sum_probs=167.1 Q ss_pred CCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 21156661999999879999844389999998753321001245301889999999875433027855554100001233 Q gi|254780605|r 306 PSCLLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGP 385 (609) Q Consensus 306 pI~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~ 385 (609) .+..+|+||++.||+.||+|...-+.|.... .+....|++|+++....+...+ .+....||+||||+|+ T Consensus 154 ~l~e~R~iKs~~EI~~mr~A~~is~~ah~~~----m~~~kpG~~E~ei~~~~~~~~~-------~~~~~~~Y~~IVasG~ 222 (442) T PRK13607 154 YLHYHRAYKTDYELACMREAQKIAVAGHRAA----KEAFRSGMSEFDINLAYLTATG-------QRDNDVPYGNIVALNE 222 (442) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH----HHHCCCCCCHHHHHHHHHHHHH-------CCCCCCCCCCEEEECC T ss_conf 9988766899999999999999999999999----9977477709999999999997-------6886789873798378 Q ss_pred CCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHH Q ss_conf 54322224554332050389879983032010561222143210378888843336677888889972178978888999 Q gi|254780605|r 386 HAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLD 465 (609) Q Consensus 386 N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD 465 (609) |||++||... +++.++.++++|||.||.|.+|+|||||||+.+. +.++.++|..|++++.++..++.| |++..+++ T Consensus 223 na~~LHY~~~--d~~~~~~~~~vLiDaGae~~gYaSDITRTfp~~~-~~~~~~l~~~v~~aq~~~i~~~kp-G~~~~d~h 298 (442) T PRK13607 223 HAAVLHYTKL--DHQAPAEMRSFLIDAGAEYNGYASDITRTYAAKE-DNDFAALIKAVNKEQLALIATMKP-GVSYVDLH 298 (442) T ss_pred CCCCCCCCCC--CCCCCCCCCEEEEECCCCCCCEEECCEEEEECCC-CHHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHH T ss_conf 7644676545--6655677867999726231747714425630798-878999999999999999998179-99756899 Q ss_pred HHHHHHHHH----CCC----------------CCCCCCCCCCCCCCCCCCCC------CCC--CC-------CCCCEECC Q ss_conf 987366774----186----------------54776432321135607788------623--77-------89860056 Q gi|254780605|r 466 SIARIFLWK----YGA----------------DFAHGVGHGVGSFLPVHEGP------QGI--SR-------TNQEPLLP 510 (609) Q Consensus 466 ~~aR~~l~~----~g~----------------dy~HgtGHGVG~~l~VHE~P------~~i--s~-------~~~~~l~~ 510 (609) ..|+..|.+ .|+ -|+|||||++| |+||+-. .+. ++ +...+|+| T Consensus 299 ~~a~~~i~~~L~~lgl~~~~~~~~~~~~~~~~f~~HglGH~lG--LdVHDvg~~~~~~~g~~~~~~~~~~~lR~~r~L~~ 376 (442) T PRK13607 299 IQTHQRIAKLLRDFGIVTGSEEAMVEQGITSAFFPHGLGHPLG--LQVHDVAGFMQDDRGTHLAAPEKHPYLRCTRVLEP 376 (442) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999999997696678889998666565766888875677--53122554245666776677544654333564899 Q ss_pred CCEEEECCCEEE-C------------------------CCCCEEEEEEEEEEECC Q ss_conf 987884686035-6------------------------87521500259997278 Q gi|254780605|r 511 GMILSNEPGYYR-C------------------------GAFGIRIENVLCVSEPE 540 (609) Q Consensus 511 Gmv~s~EPG~Y~-~------------------------g~~GiRIEn~v~v~~~~ 540 (609) |||+|||||+|+ | +-+||||||+|+||+.. T Consensus 377 GmV~TVEPGiYf~~~ll~~~~~~~~~~~~~~~~i~~~~~~GGIRIEDdvlVT~~G 431 (442) T PRK13607 377 GMVLTIEPGLYFIDSLLAPLREGPFSKHINWQKIDALKPFGGIRIEDNVVVHEDR 431 (442) T ss_pred CCEEEECCEECCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCEEEECCEEEECCCC T ss_conf 9889889920527201220004621111113456541788879736279997997 No 7 >PRK12897 methionine aminopeptidase; Reviewed Probab=100.00 E-value=0 Score=357.38 Aligned_cols=219 Identities=17% Similarity=0.179 Sum_probs=184.8 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 66619999998799998443899999987533210012453018899999998754330278555541000012335432 Q gi|254780605|r 310 LRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAI 389 (609) Q Consensus 310 lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~Ai 389 (609) |+.|||+.||+.||+|-..- .+.+.++.+.+.+|+||.+++..++.+.++.++.... .-..+||+|+++|.|+|+ T Consensus 1 m~~IKs~~EIe~mr~A~~I~----~~~~~~~~~~ikpG~te~el~~~~e~~~~~~Ga~~~~-~g~~~fp~~v~~s~n~~~ 75 (248) T PRK12897 1 MITIKTKNEIDLMHESGKLL----ASCHREIAKIMKPGITTKEINTFVEAYLEKHGATSEQ-KGYNGYPYAICASVNDEM 75 (248) T ss_pred CCCCCCHHHHHHHHHHHHHH----HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCC-CCCCCCCEEEEEECCCCC T ss_conf 92107999999999999999----9999999977769998999999999999985996664-577888806895068606 Q ss_pred CCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 22245543320503898799830320105612221432103788888433366778888899721789788889999873 Q gi|254780605|r 390 IHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIAR 469 (609) Q Consensus 390 pHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR 469 (609) |||.| ++++|++||+++||.|+.|.||+||+||||++|+|+++++++|..|.+|+.+...+.. +|.+..++|..+| T Consensus 76 ~Hg~p---~~~~l~~GD~V~iD~G~~~~GY~sD~tRT~~vG~~~~~~~~l~~~~~~A~~~~i~~~~-~G~~~~di~~a~~ 151 (248) T PRK12897 76 CHAFP---ADVPLTEGDIVTIDMVVNLNGGLSDSAWTYRVGKVSDEAEKLLLVAENALYKGIDQAV-IGNRVGDIGYAIE 151 (248) T ss_pred CCCCC---CCCCCCCCCEEEEECCEEECCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC-CCCCHHHHHHHHH T ss_conf 67899---9851458999999574889999627633556787788999999999999999998625-8978424489999 Q ss_pred HHHHHCCCC-CCCCCCCCCCCCCCCCCCCCCCC----CCCCCEECCCCEEEECCCEEE-----------------CCCCC Q ss_conf 667741865-47764323211356077886237----789860056987884686035-----------------68752 Q gi|254780605|r 470 IFLWKYGAD-FAHGVGHGVGSFLPVHEGPQGIS----RTNQEPLLPGMILSNEPGYYR-----------------CGAFG 527 (609) Q Consensus 470 ~~l~~~g~d-y~HgtGHGVG~~l~VHE~P~~is----~~~~~~l~~Gmv~s~EPG~Y~-----------------~g~~G 527 (609) +++.++|+. ..|.|||||| ++|||.|. |. +.++.+|+||||||||||||+ +|.+| T Consensus 152 ~~~~~~G~~~~~~~~GHGiG--~~vHE~P~-i~~~~~~~~~~~L~~GMv~tIEPgI~~g~~~~~~~~d~W~~~~~~g~~g 228 (248) T PRK12897 152 SYVANEGFSVARDFTGHGIG--KEIHEEPA-IFHFGKQGQGPELQEGMVITIEPIVNVGMRYSKVDLNGWTARTMDGKLS 228 (248) T ss_pred HHHHHCCCEEECCCCCCHHC--CCCCCCCC-CCCCCCCCCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEECCCCCEE T ss_conf 99987697160781502001--48666886-2667759999674689789982546058651465599987983599879 Q ss_pred EEEEEEEEEEECC Q ss_conf 1500259997278 Q gi|254780605|r 528 IRIENVLCVSEPE 540 (609) Q Consensus 528 iRIEn~v~v~~~~ 540 (609) +||||+|+||+.. T Consensus 229 ~r~E~~vlVT~dG 241 (248) T PRK12897 229 AQYEHTIAITKDG 241 (248) T ss_pred EEEEEEEEEECCC T ss_conf 9960599992991 No 8 >PRK05716 methionine aminopeptidase; Validated Probab=100.00 E-value=0 Score=350.51 Aligned_cols=229 Identities=21% Similarity=0.228 Sum_probs=189.8 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCC Q ss_conf 56661999999879999844389999998753321001245301889999999875433027855-55410000123354 Q gi|254780605|r 309 LLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLR-DIAFNTIAASGPHA 387 (609) Q Consensus 309 ~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~-~~SF~TIva~G~N~ 387 (609) .+++||++.||+.||+|...-..|+ ..+.+.+..|+||.+++..++.+..+.++.. .+. ..+|+.+++++.|. T Consensus 1 ~~~sIKs~~EI~~mR~Aa~i~~~~~----~~~~~~i~pG~te~El~~~~~~~~~~~Ga~~--~~~~~~~~p~~i~~s~n~ 74 (252) T PRK05716 1 MAITIKTPEEIEKMRVAGRIAAEVL----DEIEPHVKPGVTTKELDRIAEDYIREQGAIP--APLGYHGFPKSICTSVNE 74 (252) T ss_pred CCEEECCHHHHHHHHHHHHHHHHHH----HHHHHHCCCCCCHHHHHHHHHHHHHHCCCCC--CCCCCCCCCCCEEECCCC T ss_conf 9157568999999999999999999----9999877599979999999999999877931--555667888745951785 Q ss_pred CCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH Q ss_conf 32222455433205038987998303201056122214321037888884333667788888997217897888899998 Q gi|254780605|r 388 AIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSI 467 (609) Q Consensus 388 AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~ 467 (609) +++||.| ++++|++||++++|.|+.|.+|++|+||||++|+|+++|+++|.+|++++-+...+.. +|+++.++|.+ T Consensus 75 ~~~H~~p---~~r~l~~Gd~v~iD~g~~~~GY~sD~tRT~~vG~~s~~~~~~~~~~~~a~~~~i~~ik-PG~~~~dv~~a 150 (252) T PRK05716 75 VVCHGIP---SDKVLKEGDIVNIDVTVIKDGYHGDTSRTFIVGEVSPEDKRLCEVTKEALYLGIKAVK-PGARLGDIGHA 150 (252) T ss_pred CEECCCC---CCCCCCCCCEEEEEEEEEECCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHC-CCCCHHHHHHH T ss_conf 5327999---8835168998999863567777466450354286103679999899999999999756-79808999999 Q ss_pred HHHHHHHCCCC-CCCCCCCCCCCCCCCCCCCCCC----CCCCCCEECCCCEEEECCCEEEC-----------------CC Q ss_conf 73667741865-4776432321135607788623----77898600569878846860356-----------------87 Q gi|254780605|r 468 ARIFLWKYGAD-FAHGVGHGVGSFLPVHEGPQGI----SRTNQEPLLPGMILSNEPGYYRC-----------------GA 525 (609) Q Consensus 468 aR~~l~~~g~d-y~HgtGHGVG~~l~VHE~P~~i----s~~~~~~l~~Gmv~s~EPG~Y~~-----------------g~ 525 (609) +++++.++|+. +.|.+||||| |++||.|. | .+.++++|+||||||+|||||.+ |. T Consensus 151 ~~~~~~~~g~~~~~~~~GHgiG--l~~hE~P~-i~~~~~~~~~~~Le~GMV~tiEP~i~~~~~~~~~~~D~w~~~~~~g~ 227 (252) T PRK05716 151 IQKYAEAEGFSVVREYCGHGIG--RVFHEEPQ-VLHYGAPGDGPVLKEGMVFTIEPMINAGKREVKTLKDGWTVVTKDGS 227 (252) T ss_pred HHHHHHHCCCCCEECCCCCCCC--CCCCCCCE-ECCCCCCCCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEEECCCC T ss_conf 9999997588720021245567--76676970-35655699985356982899816150588743886898768822797 Q ss_pred CCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCC Q ss_conf 521500259997278778776642422101165300 Q gi|254780605|r 526 FGIRIENVLCVSEPETINNGECLMLGFNTLTLCPID 561 (609) Q Consensus 526 ~GiRIEn~v~v~~~~~~~~~~~~fl~Fe~lT~vP~~ 561 (609) +|+||||+|+||+. ++|.||.+|.+ T Consensus 228 ~~~riEd~VlVTe~-----------G~E~LT~~p~e 252 (252) T PRK05716 228 LSAQYEHTVAVTED-----------GPEILTLRPEE 252 (252) T ss_pred EEEEEEEEEEECCC-----------CCEECCCCCCC T ss_conf 07886309999099-----------37789889998 No 9 >PRK12318 methionine aminopeptidase; Provisional Probab=100.00 E-value=0 Score=348.19 Aligned_cols=226 Identities=20% Similarity=0.230 Sum_probs=192.5 Q ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCCCCCCCC Q ss_conf 61999999879999844389999998753321001245301889999999875433027-85555410000123354322 Q gi|254780605|r 312 ATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRN-PLRDIAFNTIAASGPHAAII 390 (609) Q Consensus 312 aiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~-~~~~~SF~TIva~G~N~Aip 390 (609) .|||+.||+.||+|-..- .+.|..+.+.+++|+||.++.+.++++.++.++.... +|.+++||+|+++|.|++++ T Consensus 42 ~IKs~eEIe~mR~A~~I~----a~~~~~~~~~ikpGvTe~Eld~~~~~~~~~~ga~pa~l~y~~~~Fp~~vc~s~N~~v~ 117 (291) T PRK12318 42 IIKTPEQIEKIRKACQIT----ARILDALCEAAKEGVTTNELDQLSRELHKKYDAIPAPLNYGHPPFPKTICTSLNEVIC 117 (291) T ss_pred EECCHHHHHHHHHHHHHH----HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCEEEECCCEEE T ss_conf 768999999999999999----9999999836879998999999999999987898676677887888744872388731 Q ss_pred CCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 22455433205038987998303201056122214321037888884333667788888997217897888899998736 Q gi|254780605|r 391 HYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARI 470 (609) Q Consensus 391 HY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~ 470 (609) ||.| ++++|+.||+++||.|+.|.||++|+||||++|+++++++++|..|++++.+...+.. +|++..+++.++|. T Consensus 118 HgiP---~~~~Lk~GDlV~iD~G~~y~GY~sD~tRT~~vG~~s~~~~~l~~~v~~A~~~aI~~ik-pG~~~~dI~~ai~~ 193 (291) T PRK12318 118 HGIP---NDIPLKNGDIMNIDVSCIVDGYYGDCSRMVMIGEVSEIKKKVCQASLECLNAAIAILK-PNLPLYEIGEVIEN 193 (291) T ss_pred CCCC---CCCCCCCCCEEEEECCEEECCEECCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHH T ss_conf 6899---9861679998999567868989712131453488989999999999999999999807-99647799999999 Q ss_pred HHHHCCCC-CCCCCCCCCCCCCCCCCCCCCCC---CCCCCEECCCCEEEECCCEEECCCC------------------CE Q ss_conf 67741865-47764323211356077886237---7898600569878846860356875------------------21 Q gi|254780605|r 471 FLWKYGAD-FAHGVGHGVGSFLPVHEGPQGIS---RTNQEPLLPGMILSNEPGYYRCGAF------------------GI 528 (609) Q Consensus 471 ~l~~~g~d-y~HgtGHGVG~~l~VHE~P~~is---~~~~~~l~~Gmv~s~EPG~Y~~g~~------------------Gi 528 (609) ++.++||. +.|.+||||| +++||.|. |+ ..++.+|+||||||||||+|+.+.. +. T Consensus 194 ~~~~~G~~~~~~~~GHGIG--~~vHE~P~-I~~~~~~~~~~L~pGMV~TIEPgI~~g~~~~~~~~~d~Wt~~t~d~~~~a 270 (291) T PRK12318 194 CADTYGFSVVDQFVGHGVG--IKFHENPY-VPHHRNRSKIPLAPGMIFTIEPMINVGKKEGVIDPINHWEARTCDNQPSA 270 (291) T ss_pred HHHHCCCCCCCCCCCCCCC--CCCCCCCC-CCCCCCCCCCEECCCCEEEECCCCCCCCCCCEECCCCCCEEEECCCCEEE T ss_conf 9998499117680028727--31034886-56456999878379998987992124788747737898579914998475 Q ss_pred EEEEEEEEEECCCCCCCCCCEEEEEECCCCC Q ss_conf 5002599972787787766424221011653 Q gi|254780605|r 529 RIENVLCVSEPETINNGECLMLGFNTLTLCP 559 (609) Q Consensus 529 RIEn~v~v~~~~~~~~~~~~fl~Fe~lT~vP 559 (609) |+|++|+||+. +.|.||+.+ T Consensus 271 ~~EhtvlIT~~-----------G~eiLT~~d 290 (291) T PRK12318 271 QWEHTILITET-----------GYEILTLLD 290 (291) T ss_pred EECEEEEEECC-----------CCEECCCCC T ss_conf 61558999199-----------267814689 No 10 >PRK12896 methionine aminopeptidase; Reviewed Probab=100.00 E-value=0 Score=339.74 Aligned_cols=230 Identities=22% Similarity=0.201 Sum_probs=190.1 Q ss_pred CCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 21156661999999879999844389999998753321001245301889999999875433027855554100001233 Q gi|254780605|r 306 PSCLLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGP 385 (609) Q Consensus 306 pI~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~ 385 (609) |....|.||++.||+.||+|...-..|+-. +.+.+..|+||.|++..++.+..+.++.... .....|+.+++++. T Consensus 2 ~~~~~~~IKs~~EI~~mR~Aa~i~~~~~~~----~~~~i~pG~te~el~~~~~~~~~~~Ga~~~~-~~~~~f~~~~~~s~ 76 (254) T PRK12896 2 PSPRGMEIKSPRELEKMRKIGRIVATALKE----MGKAVEPGMTTAELDRIAEKRLEEHGAIPAP-EGYYGFPGSTCISV 76 (254) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHH----HHHHCCCCCCHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCEEECC T ss_conf 985774537899999999999999999999----9987769997999999999999986991155-55557887417368 Q ss_pred CCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHH Q ss_conf 54322224554332050389879983032010561222143210378888843336677888889972178978888999 Q gi|254780605|r 386 HAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLD 465 (609) Q Consensus 386 N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD 465 (609) |++++||.| +++.|++||++++|.|+.|.+|+||+||||++|+|+++++++|..+++++.+...+..| |+++.+++ T Consensus 77 n~~~~H~~p---~~r~l~~Gd~v~iD~g~~~~GY~sD~tRT~~vG~~~~~~~~l~~~~~ea~~~~i~~ikP-G~~~~di~ 152 (254) T PRK12896 77 NEEVAHGIP---GPRVIKDGDLVNIDVSAYKDGYHGDTGATFAVGPVSEEAEKLCRVAEEALWAGIKQVKA-GRPLNDIG 152 (254) T ss_pred CCCCCCCCC---CCCCCCCCCEEEEEEEEEECCEECCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHH T ss_conf 876447899---88545699899985339999892556651541752167999999999999999998359-98089999 Q ss_pred HHHHHHHHHCCCC-CCCCCCCCCCCCCCCCCCCCCC----CCCCCCEECCCCEEEECCCEEECC---------------- Q ss_conf 9873667741865-4776432321135607788623----778986005698788468603568---------------- Q gi|254780605|r 466 SIARIFLWKYGAD-FAHGVGHGVGSFLPVHEGPQGI----SRTNQEPLLPGMILSNEPGYYRCG---------------- 524 (609) Q Consensus 466 ~~aR~~l~~~g~d-y~HgtGHGVG~~l~VHE~P~~i----s~~~~~~l~~Gmv~s~EPG~Y~~g---------------- 524 (609) .++++.+.++|+. +.|.|||||| |++||.|.-+ ++..+.+|+||||||+|||||++| T Consensus 153 ~a~~~~~~~~G~~~~~~~~GHGiG--l~~HE~P~~~~~~~~~~~~~~Le~GMV~tiEP~i~~~~~~~~~~~d~w~~~~~~ 230 (254) T PRK12896 153 RAIEDFAKKNGYSVVRNLTGHGVG--RSLHEEPSVINYRTDPSPNRLLRPGMVLAIEPFLNLGAKDTEELDDGWTVVTPD 230 (254) T ss_pred HHHHHHHHHCCCCEEECCCEEECC--CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEECCC T ss_conf 999999998698567423425550--212678754776768988707027968998050404876406658886587148 Q ss_pred -CCCEEEEEEEEEEECCCCCCCCCCEEEEEECCC Q ss_conf -752150025999727877877664242210116 Q gi|254780605|r 525 -AFGIRIENVLCVSEPETINNGECLMLGFNTLTL 557 (609) Q Consensus 525 -~~GiRIEn~v~v~~~~~~~~~~~~fl~Fe~lT~ 557 (609) .+||||||+|+||+. ++|.||+ T Consensus 231 ~~~~vr~Ed~vlVTe~-----------G~e~LT~ 253 (254) T PRK12896 231 KSLSAQFEHTVVVTRD-----------GPEILTL 253 (254) T ss_pred CCEEEEEEEEEEECCC-----------CCEECCC T ss_conf 9628997049999299-----------3376427 No 11 >cd01090 Creatinase Creatine amidinohydrolase. E.C.3.5.3.3. Hydrolyzes creatine to sarcosine and urea. Probab=100.00 E-value=4.5e-44 Score=326.98 Aligned_cols=210 Identities=18% Similarity=0.107 Sum_probs=168.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9879999844389999998753321001245301889999999875433027855-554100001233543222245543 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLR-DIAFNTIAASGPHAAIIHYQATVQ 397 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~-~~SF~TIva~G~N~AipHY~~~~~ 397 (609) |+.||+|-..-..+| ..+.+.+..|+||.|++..+.....+.++.. -.+. ..++.++++||+|+|.||+.+ T Consensus 1 I~liR~Aa~iad~~~----~~~~~~i~~G~tE~eia~~~~~~~~~~~a~~-~~~~~~~~~~~~~~sG~~~~~~H~~~--- 72 (228) T cd01090 1 IALIRHGARIADIGG----AAVVEAIREGVPEYEVALAGTQAMVREIAKT-FPEVELMDTWTWFQSGINTDGAHNPV--- 72 (228) T ss_pred CHHHHHHHHHHHHHH----HHHHHHCCCCCCHHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCCCCCCCCCC--- T ss_conf 989999999999999----9999873499969999999999999850011-68631135666302214776788997--- Q ss_pred CCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 32050389879983032010561222143210378888843336677888889972178978888999987366774186 Q gi|254780605|r 398 SNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGA 477 (609) Q Consensus 398 s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~ 477 (609) ++|++++||++++|.|+.|.+|+||+||||++|+|+++|+++|.+|++++-+...+.. ||+++.++|..+|+++.++|+ T Consensus 73 s~r~l~~Gd~v~iD~g~~~~GY~sd~~RT~~vG~~s~~~~~~~~~~~ea~~~a~~~ik-PG~~~~dv~~~~~~~~~~~g~ 151 (228) T cd01090 73 TNRKVQRGDILSLNCFPMIAGYYTALERTLFLDEVSDAHLKIWEANVAVHERGLELIK-PGARCKDIAAELNEMYREHDL 151 (228) T ss_pred CCCCCCCCCEEEEEEEEEECCEECCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHHHCCC T ss_conf 7767489998999889999999847635897268999999999999999999999816-998499999999999997498 Q ss_pred --CCCCCCCCCCCCCCCCCCC----CCCCCCCCCCEECCCCEEEECCCEEE----CCCCCEEEEEEEEEEEC Q ss_conf --5477643232113560778----86237789860056987884686035----68752150025999727 Q gi|254780605|r 478 --DFAHGVGHGVGSFLPVHEG----PQGISRTNQEPLLPGMILSNEPGYYR----CGAFGIRIENVLCVSEP 539 (609) Q Consensus 478 --dy~HgtGHGVG~~l~VHE~----P~~is~~~~~~l~~Gmv~s~EPG~Y~----~g~~GiRIEn~v~v~~~ 539 (609) .+.||+|||+| +.+|+. |..+...++++|+||||||+|||+|+ ||.+|+||||+|+||+. T Consensus 152 ~~~~~~g~GhgiG--~~~h~~g~~~~~~~~~~~~~vL~pGMV~tvEPgiy~~~~~~G~gG~rieD~vvVTe~ 221 (228) T cd01090 152 LRYRTFGYGHSFG--VLSHYYGREAGLELREDIDTVLEPGMVVSMEPMIMLPEGQPGAGGYREHDILVINEN 221 (228) T ss_pred CCCCCCCCCCCCC--CCCCCCCCCCCCEECCCCCCEECCCCEEEECCEEEECCCCCCCCEEEEEEEEEECCC T ss_conf 6204655777068--647788777775313899978479998999998665776886558997617999999 No 12 >cd01092 APP-like Similar to Prolidase and Aminopeptidase P. The members of this subfamily presumably catalyse hydrolysis of Xaa-Pro dipeptides and/or release of any N-terminal amino acid, including proline, that is linked with proline. Probab=100.00 E-value=3.1e-44 Score=328.05 Aligned_cols=205 Identities=34% Similarity=0.454 Sum_probs=177.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98799998443899999987533210012453018899999998754330278555541000012335432222455433 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQS 398 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s 398 (609) |+.||+|-..-..|+.. +.+.+..|+||.|++..+++...+. +..++||++|+++|+|++.|||.| + T Consensus 1 Ie~mR~A~~i~~~~~~~----~~~~i~pG~te~el~~~~~~~~~~~------G~~~~~~~~~v~sG~~~~~~h~~~---~ 67 (208) T cd01092 1 IELLRKAARIADKAFEE----LLEFIKPGMTEREVAAELEYFMRKL------GAEGPSFDTIVASGPNSALPHGVP---S 67 (208) T ss_pred CHHHHHHHHHHHHHHHH----HHHHCCCCCCHHHHHHHHHHHHHHC------CCCCCCCCCEEEECCCCCCCCCCC---C T ss_conf 98999999999999999----9997748998999999999999986------998689786899862214567899---9 Q ss_pred CEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 20503898799830320105612221432103788888433366778888899721789788889999873667741865 Q gi|254780605|r 399 NRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD 478 (609) Q Consensus 399 ~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d 478 (609) +++|++||++++|.|++|.+|++|+||||++|+|++++++.|+.+++++-++..+.. +|+++.+++.++|+++.+.|+. T Consensus 68 ~~~l~~Gd~v~iD~g~~~~gY~aD~~Rt~~vG~~~~~~~~~~~~~~~~~~~~i~~~k-pG~~~~dv~~~~~~~~~~~g~~ 146 (208) T cd01092 68 DRKIEEGDLVLIDFGAIYDGYCSDITRTVAVGEPSDELKEIYEIVLEAQQAAIKAVK-PGVTAKEVDKAARDVIEEAGYG 146 (208) T ss_pred CCCCCCCCEEEEEEEECCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHHHHCCCC T ss_conf 853569998999830054998630303334898899999999999999999997237-9990999999999999975987 Q ss_pred --CCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECC Q ss_conf --47764323211356077886237789860056987884686035687521500259997278 Q gi|254780605|r 479 --FAHGVGHGVGSFLPVHEGPQGISRTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPE 540 (609) Q Consensus 479 --y~HgtGHGVG~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~ 540 (609) |.|++||||| |++||.| +|++.++.+|+|||||++|||+|.||.+|+||||.++||+.. T Consensus 147 ~~~~~~~GHgiG--l~~he~P-~i~~~~~~~l~~GMv~~iEP~i~~~g~~g~~iedtv~VT~~G 207 (208) T cd01092 147 EYFIHRTGHGVG--LEVHEAP-YISPGSDDVLEEGMVFTIEPGIYIPGKGGVRIEDDVLVTEDG 207 (208) T ss_pred EEECCCCCCCCC--CCCCCCC-CCCCCCCCCCCCCCEEEECCEEEECCCCEEEEEEEEEECCCC T ss_conf 011037755446--6677687-026898972389999998891065895479973169985899 No 13 >cd01086 MetAP1 Methionine Aminopeptidase 1. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and Peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=100.00 E-value=6.2e-44 Score=326.01 Aligned_cols=216 Identities=23% Similarity=0.283 Sum_probs=179.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98799998443899999987533210012453018899999998754330278555541000012335432222455433 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQS 398 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s 398 (609) |+.||+|...-..|+.. +.+.+.+|+||.+++..++.+..+.++.. ..+...+|+.+++.+.|++++||.| + T Consensus 1 Ie~mR~A~~i~~~a~~~----~~~~irpG~tE~ei~~~~~~~~~~~Ga~~-~~~~~~~f~~~~~~s~N~~~~H~~p---~ 72 (238) T cd01086 1 IEGMREAGRIVAEVLDE----LAKAIKPGVTTKELDQIAHEFIEEHGAYP-APLGYYGFPKSICTSVNEVVCHGIP---D 72 (238) T ss_pred CHHHHHHHHHHHHHHHH----HHHHCCCCCCHHHHHHHHHHHHHHCCCEE-CCCCCCCCCCCEEECCCCCCCCCCC---C T ss_conf 98999999999999999----99877689959999999999999873971-5555678886247325745037999---8 Q ss_pred CEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 20503898799830320105612221432103788888433366778888899721789788889999873667741865 Q gi|254780605|r 399 NRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD 478 (609) Q Consensus 399 ~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d 478 (609) +++|++||++++|.|+.|.+|+||+|||+++|+|+++++++|..|+.++-+...+.. +|+++.++|.++|+++.++|+. T Consensus 73 ~~~l~~Gd~v~~D~g~~~~gY~sD~tRT~~vG~~~~~~~~~~~~v~ea~~~~i~~~k-pG~~~~ev~~a~~~~~~~~g~~ 151 (238) T cd01086 73 DRVLKDGDIVNIDVGVELDGYHGDSARTFIVGEVSEEAKKLVEVTEEALYKGIEAVK-PGNRIGDIGHAIEKYAEKNGYS 151 (238) T ss_pred CCCCCCCCEEEEECCEEECCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHHHCCCC T ss_conf 845469998999620668878455157662593689999999999999999998716-9989999999999999986998 Q ss_pred -CCCCCCCCCCCCCCCCCCCCCC----CCCCCCEECCCCEEEECCCEEEC-----------------CCCCEEEEEEEEE Q ss_conf -4776432321135607788623----77898600569878846860356-----------------8752150025999 Q gi|254780605|r 479 -FAHGVGHGVGSFLPVHEGPQGI----SRTNQEPLLPGMILSNEPGYYRC-----------------GAFGIRIENVLCV 536 (609) Q Consensus 479 -y~HgtGHGVG~~l~VHE~P~~i----s~~~~~~l~~Gmv~s~EPG~Y~~-----------------g~~GiRIEn~v~v 536 (609) +.|-+||||| |++||.| +| .+.++.+|+||||||+|||||++ |.+||||||+|+| T Consensus 152 ~~~~~~GHgvG--l~~he~P-~i~~~~~~~~~~~L~~GmV~tiEP~i~~~~~~~~~~~d~w~~~~~~~~~g~riEd~ilI 228 (238) T cd01086 152 VVREFGGHGIG--RKFHEEP-QIPNYGRPGTGPKLKPGMVFTIEPMINLGTYEVVTLPDGWTVVTKDGSLSAQFEHTVLI 228 (238) T ss_pred CCCCEEECCCC--CCCCCCC-CCCCCCCCCCCCCCCCCCEEEEECEEECCCCCEEECCCCCEEECCCCCCEEEEEEEEEE T ss_conf 35463326246--7766698-62644548998405698489992648438874388789876870289717887319999 Q ss_pred EECCCCCCCCCCEEEEEECCC Q ss_conf 727877877664242210116 Q gi|254780605|r 537 SEPETINNGECLMLGFNTLTL 557 (609) Q Consensus 537 ~~~~~~~~~~~~fl~Fe~lT~ 557 (609) |+. ++|.||+ T Consensus 229 Te~-----------G~e~LT~ 238 (238) T cd01086 229 TED-----------GPEILTL 238 (238) T ss_pred CCC-----------CCEECCC T ss_conf 399-----------7766779 No 14 >cd01087 Prolidase Prolidase. E.C. 3.4.13.9. Also known as Xaa-Pro dipeptidase, X-Pro dipeptidase, proline dipeptidase., imidodipeptidase, peptidase D, gamma-peptidase. Catalyses hydrolysis of Xaa-Pro dipeptides; also acts on aminoacyl-hydroxyproline analogs. No action on Pro-Pro. Probab=100.00 E-value=1.1e-43 Score=324.24 Aligned_cols=204 Identities=31% Similarity=0.404 Sum_probs=168.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98799998443899999987533210012453018899999998754330278555541000012335432222455433 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQS 398 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s 398 (609) |+.||+|...-..|+ ..+.+.+..|+||.|++..++..-.+.|+ .++|++++++|+|++.+||.| + T Consensus 1 Ie~mR~A~~i~~~~~----~~~~~~i~pG~tE~el~~~~~~~~~~~G~-------~~~~~~~~~~g~~~~~~H~~~---~ 66 (243) T cd01087 1 IELMRKACDISAEAH----RAAMKASRPGMSEYELEAEFEYEFRSRGA-------RLAYSYIVAAGSNAAILHYVH---N 66 (243) T ss_pred CHHHHHHHHHHHHHH----HHHHHHCCCCCCHHHHHHHHHHHHHHCCC-------CCCCCCEEEECCCCCCCCCCC---C T ss_conf 989999999999999----99998764898799999999999998699-------658898699817865358888---9 Q ss_pred CEEECCCCEEEEECCCCEECCCCCCEEEEEE-CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 2050389879983032010561222143210-378888843336677888889972178978888999987366774186 Q gi|254780605|r 399 NRLLQKDELLLLDSGAQYVNGTTDITRTIAI-GDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGA 477 (609) Q Consensus 399 ~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~-G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~ 477 (609) +++|++||++++|.|+.|.+|+||+||||++ |+|+++|+++|+.|+.++-+...+..| |+++.+++..+++++.+.++ T Consensus 67 ~r~l~~Gd~v~vD~g~~~~GY~sD~~Rt~~v~G~~~~~~~~~~~~~~~a~~~~~~~~kP-G~~~~ei~~~~~~~~~~~~~ 145 (243) T cd01087 67 DQPLKDGDLVLIDAGAEYGGYASDITRTFPVNGKFTDEQRELYEAVLAAQKAAIAACKP-GVSYEDIHLLAHRVLAEGLK 145 (243) T ss_pred CCCCCCCCEEEEEECEECCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHH T ss_conf 84167999999972401297854430177618967999999999999999999998379-98599999999999999988 Q ss_pred --------------------CCCCCCCCCCCCCCCCCCCCCC-CCCCCCCEECCCCEEEECCCEEECCC----------C Q ss_conf --------------------5477643232113560778862-37789860056987884686035687----------5 Q gi|254780605|r 478 --------------------DFAHGVGHGVGSFLPVHEGPQG-ISRTNQEPLLPGMILSNEPGYYRCGA----------F 526 (609) Q Consensus 478 --------------------dy~HgtGHGVG~~l~VHE~P~~-is~~~~~~l~~Gmv~s~EPG~Y~~g~----------~ 526 (609) .|+||+||||| |+|||-|+- .....+.+|+||||||+|||+|+|+. + T Consensus 146 ~~g~~~~~~~~~~~~~~~~~~~~h~~GHgiG--l~~He~~~~~~~~~~~~~L~~GMv~aiEPgiy~~~~~~~~~~~~~~~ 223 (243) T cd01087 146 ELGILKGDVDEIVESGAYAKFFPHGLGHYLG--LDVHDVGGYLRYLRRARPLEPGMVITIEPGIYFIPDLLDVPEYFRGG 223 (243) T ss_pred HCCCCCCCHHHHHHCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCEEEECCEEEECCCCCCCCCCCCCC T ss_conf 6597546588887527864442577357667--64577876456688897857998899879798657557655333500 Q ss_pred CEEEEEEEEEEEC Q ss_conf 2150025999727 Q gi|254780605|r 527 GIRIENVLCVSEP 539 (609) Q Consensus 527 GiRIEn~v~v~~~ 539 (609) |+||||+|+||+. T Consensus 224 gvr~EdtvlVTed 236 (243) T cd01087 224 GIRIEDDVLVTED 236 (243) T ss_pred EEEEEEEEEECCC T ss_conf 8997349999899 No 15 >KOG2737 consensus Probab=100.00 E-value=4.4e-43 Score=320.03 Aligned_cols=366 Identities=22% Similarity=0.313 Sum_probs=236.9 Q ss_pred EEECCHHHCCCCHHHHHHHHHHHHC-----CCCCE-EEEECCCC----------------CHHHHHHCCCCCCCCCCHHH Q ss_conf 0120123358773889999998602-----33430-67602842----------------02344320135537787002 Q gi|254780605|r 167 VAMQDMAYAGRESQEKIRDICKILH-----QKEVG-AVFICDPS----------------SIAWIFNIRGFDIPCSPYPL 224 (609) Q Consensus 167 i~~~~~~~aG~s~~eKi~~ir~~L~-----~~~~d-aliit~~d----------------~IaWLlNiRG~Di~~tPv~~ 224 (609) .+..|.+.--.++..+.+.||+.+. ..+.. +++..--+ =-+||+++| . |- T Consensus 9 ~~~vP~~lf~~nr~rl~~~lr~k~~~~nr~~~~~s~vllqgGeE~nrYctD~~~lFrQesYF~~lfGV~---e---p~-- 80 (492) T KOG2737 9 FWLVPMELFAGNRKRLLEALRKKLLSSNRSLDGGSFVLLQGGEEKNRYCTDTTELFRQESYFAYLFGVR---E---PG-- 80 (492) T ss_pred CCEECHHHHHCCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCHHHCCCCCCHHHHHHHHHHHHHHHCCC---C---CC-- T ss_conf 564157876136477999887413432335567518999654544010353278875534778864678---8---66-- Q ss_pred HHHHH-CCCCCCEEECCCCCCCHHHHHHHCCCCE------------EECHHHHHHHHHHCCCCCCEEEECCCCCCHHHHH Q ss_conf 33220-1478620003531121567874125635------------7074451577863013574289725317788975 Q gi|254780605|r 225 SRAIL-YADGKAEIFFDKQYINEQLKALLSAVAI------------VLDMDMMDSRLVCLARTSMPILIDPKWISYRFFK 291 (609) Q Consensus 225 a~aiI-~~~~~~~Lfvd~~k~~~~~~~~l~~~~~------------~~~~~~~~~~l~~l~~~~~~i~id~~~~s~~~~~ 291 (609) +|..| ...|+.+||+.. ++++-..+++.... +...++....+ ...+.+.+.+-.-.+-..-+ T Consensus 81 ~yg~idv~tgKstLFvPr--lp~~ya~W~G~i~~l~~fke~y~VDev~yvde~~~~~---~~~~~k~l~~l~g~nTDsg~ 155 (492) T KOG2737 81 FYGAIDVGTGKSTLFVPR--LPDSYATWMGEILSLQHFKEKYAVDEVFYVDEIIQVL---KGSKPKLLYLLRGLNTDSGN 155 (492) T ss_pred CEEEEEECCCCEEEEECC--CCHHHCEECCCCCCHHHHHHHHHHHHEEEHHHHHHHH---HCCCCCCEEEEECCCCCCCC T ss_conf 047898168945888468--9725433146527789998875110233307688886---33683011334000157644 Q ss_pred HHHHCC----CEEEC---CCCC-CHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH-HH Q ss_conf 312139----56751---5432-115666199999987999984438999999875332100124530188999999-98 Q gi|254780605|r 292 VIAQKN----GVMVE---GSDP-SCLLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERC-RE 362 (609) Q Consensus 292 ~l~~~~----~~~i~---~~~p-I~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~-R~ 362 (609) .++++. .++.. .--| +...|.||++.||+-||.|....+-|-.+.| +.+..++.|+++....+.. .+ T Consensus 156 v~~e~~f~g~~kf~~D~~~lyp~m~E~RviKs~~EieviRya~kISseaH~~vM----~~~~pg~~Eyq~eslF~hh~y~ 231 (492) T KOG2737 156 VLKEASFAGISKFETDLTLLYPILAECRVIKSSLEIEVIRYANKISSEAHIEVM----RAVRPGMKEYQLESLFLHHSYS 231 (492) T ss_pred CCCCCCCCCHHHCCCCCHHHHHHHHHHEEECCHHHHHHHHHHHHHCCHHHHHHH----HHCCCHHHHHHHHHHHHHHHHC T ss_conf 347644233322366804666877666201779999999998860609899999----7479307677678888876541 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEE-CCCCHHHHHHH Q ss_conf 754330278555541000012335432222-4554332050389879983032010561222143210-37888884333 Q gi|254780605|r 363 EIGCKMRNPLRDIAFNTIAASGPHAAIIHY-QATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAI-GDVDYEKKYYF 440 (609) Q Consensus 363 ~~~~~~~~~~~~~SF~TIva~G~N~AipHY-~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~-G~~~~e~k~~y 440 (609) +.+-..+|+..|+|+|+|+|+.|| +|...+++.|+.|++.|+|+||.|+.|+|||||+|+. |+.|++||.+| T Consensus 232 ------~GGcRh~sYtcIc~sG~ns~vLHYgha~apNd~~iqdgd~cLfDmGaey~~yaSDITcsFP~nGKFTadqk~VY 305 (492) T KOG2737 232 ------YGGCRHLSYTCICASGDNSAVLHYGHAGAPNDRTIQDGDLCLFDMGAEYHFYASDITCSFPVNGKFTADQKLVY 305 (492) T ss_pred ------CCCCCCCCCCEEEECCCCCCEEECCCCCCCCCCCCCCCCEEEEECCCCEEEEECCCCEECCCCCCCCHHHHHHH T ss_conf ------47701156640552388754443145689987636789789984684103330354152468986435689999 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH----CCC----------------CCCCCCCCCCCCCCCCCCC---C Q ss_conf 6677888889972178978888999987366774----186----------------5477643232113560778---8 Q gi|254780605|r 441 TLVLKGMISVSTARFPQRTRGCDLDSIARIFLWK----YGA----------------DFAHGVGHGVGSFLPVHEG---P 497 (609) Q Consensus 441 t~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~----~g~----------------dy~HgtGHGVG~~l~VHE~---P 497 (609) ..||.++.|+..+..| |+...++|++|-.+|.+ .|. -++||+||=+| |+||.- | T Consensus 306 naVLda~navm~a~Kp-Gv~W~Dmh~La~kvlle~laq~gIl~gdvd~m~~ar~~~vF~PHGLGH~lG--lDvHDvGGyp 382 (492) T KOG2737 306 NAVLDASNAVMEAMKP-GVWWVDMHKLAEKVLLEHLAQMGILKGDVDEMVEARLGAVFMPHGLGHFLG--LDVHDVGGYP 382 (492) T ss_pred HHHHHHHHHHHHHCCC-CCCCCCHHHHHHHHHHHHHHHCCCEECCHHHHHHHCCCEEECCCCCCCCCC--CCCCCCCCCC T ss_conf 9999988999985289-986542788878799999986483423489999851470643566331125--4310257998 Q ss_pred C---CCC------CCCCCEECCCCEEEECCCEEE--------------------------CCCCCEEEEEEEEEEECCCC Q ss_conf 6---237------789860056987884686035--------------------------68752150025999727877 Q gi|254780605|r 498 Q---GIS------RTNQEPLLPGMILSNEPGYYR--------------------------CGAFGIRIENVLCVSEPETI 542 (609) Q Consensus 498 ~---~is------~~~~~~l~~Gmv~s~EPG~Y~--------------------------~g~~GiRIEn~v~v~~~~~~ 542 (609) . +.. -+-..+|++|||+|||||.|. -|.+||||||+|+||+. T Consensus 383 ~~~~rp~~P~l~~LR~aR~L~e~MviTvEPGcYFi~~Ll~ealadp~~~~f~n~e~~~rfr~~GGVRIEdDv~vt~~--- 459 (492) T KOG2737 383 EGVERPDEPGLRSLRTARHLKEGMVITVEPGCYFIDFLLDEALADPARAEFLNREVLQRFRGFGGVRIEDDVVVTKS--- 459 (492) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCEEEECCEEEECC--- T ss_conf 88999886046665444453058379834873099999888854817655632999998644686675324799426--- Q ss_pred CCCCCCEEEEEECCCCCCCHHCCCHHH Q ss_conf 877664242210116530011078677 Q gi|254780605|r 543 NNGECLMLGFNTLTLCPIDRKLILVEL 569 (609) Q Consensus 543 ~~~~~~fl~Fe~lT~vP~~~~li~~~l 569 (609) +.|+||-||=+..=|+.-| T Consensus 460 --------G~enlt~vprtveeIEa~m 478 (492) T KOG2737 460 --------GIENLTCVPRTVEEIEACM 478 (492) T ss_pred --------CCCCCCCCCCCHHHHHHHH T ss_conf --------5100037999999999998 No 16 >KOG2414 consensus Probab=100.00 E-value=7.7e-40 Score=297.46 Aligned_cols=354 Identities=24% Similarity=0.302 Sum_probs=233.1 Q ss_pred CCCCCCCCEEEECCHHHC-CC---CHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCC----CHH-H---- Q ss_conf 276332200012012335-87---7388999999860233430676028420234432013553778----700-2---- Q gi|254780605|r 158 DRPQRLYRKVAMQDMAYA-GR---ESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCS----PYP-L---- 224 (609) Q Consensus 158 ~rP~~~~~~i~~~~~~~a-G~---s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~t----Pv~-~---- 224 (609) .+|.-...|....|-+.+ |. .+++++.+|.+.|.++. ++|-....+.+.. .++||+ +-| + T Consensus 42 GQpt~~thPhli~pgEltPgis~~Ey~~RR~rl~~ll~~~a---~~il~sap~~~ms----g~ipY~f~Qd~df~YLtGc 114 (488) T KOG2414 42 GQPTSVTHPHLIQPGELTPGISATEYKERRSRLMSLLPANA---MVILGSAPVKYMS----GAIPYTFRQDNDFYYLTGC 114 (488) T ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCC---EEEECCCCHHHHC----CCCCEEEECCCCEEEEECC T ss_conf 89998888631578876788318899989999997587455---7998158666523----7624046327875888246 Q ss_pred ----HHHHHC-CC-C--CCEEECCCCCCCHHHHHHHC------------CCCEEECHHHHHHHHHHCCCCCCEEEECCCC Q ss_conf ----332201-47-8--62000353112156787412------------5635707445157786301357428972531 Q gi|254780605|r 225 ----SRAILY-AD-G--KAEIFFDKQYINEQLKALLS------------AVAIVLDMDMMDSRLVCLARTSMPILIDPKW 284 (609) Q Consensus 225 ----a~aiI~-~~-~--~~~Lfvd~~k~~~~~~~~l~------------~~~~~~~~~~~~~~l~~l~~~~~~i~id~~~ 284 (609) +-+++. .+ . ...+|+..+ ++..+.+-+ +...-.....+..+|........+++.|..+ T Consensus 115 ~EP~~vl~l~~~d~~s~~~~lf~p~k--dP~~e~WeG~rtG~~~a~~if~v~ea~~~s~l~~~L~k~~~~~~~i~~d~~s 192 (488) T KOG2414 115 LEPDAVLLLLKGDERSVAYDLFMPPK--DPTAELWEGPRTGTDGASEIFGVDEAYPLSGLAVFLPKMSALLYKIWQDKAS 192 (488) T ss_pred CCCCEEEEEEECCCCCCEEEEECCCC--CCCHHHHCCCCCCCHHHHHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 88881688762355650035764899--9507761486655133465626303233466799888877430354541303 Q ss_pred CCH--HH---HHHHHH--CCCEEECCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH Q ss_conf 778--89---753121--39567515432115666199999987999984438999999875332100124530188999 Q gi|254780605|r 285 ISY--RF---FKVIAQ--KNGVMVEGSDPSCLLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKL 357 (609) Q Consensus 285 ~s~--~~---~~~l~~--~~~~~i~~~~pI~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~L 357 (609) +.. +. ...+.. .+.++....+.+..+|.||++.|++.||+|.-..+-++.+-+ -.......|..+...+ T Consensus 193 s~a~s~~~~~~dl~~~~~~~~~~~~~~~li~~lRlIKSpaEl~~Mr~a~~I~sq~~~~~m----~~sr~~~~E~~l~a~~ 268 (488) T KOG2414 193 SKASSALKNMQDLLGFQSKSSTVRPVSNLIERLRLIKSPAELELMREACNIASQTFSETM----FGSRDFHNEAALSALL 268 (488) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH----HHCCCCCCHHHHHHHH T ss_conf 335667888776640201686100189999987702888999999998504258899998----6146875256675641 Q ss_pred H-HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEE-CCCCHH Q ss_conf 9-99987543302785555410000123354322224554332050389879983032010561222143210-378888 Q gi|254780605|r 358 E-RCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAI-GDVDYE 435 (609) Q Consensus 358 e-~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~-G~~~~e 435 (609) + ++|..+. .-++|++.||.|.||-.+||.. +|..+.++++||+|.||.|.||+|||||||++ |+.|+. T Consensus 269 eye~r~rGa-------d~~AYpPVVAgG~na~tIHY~~---Nnq~l~d~emVLvDaGcelgGYvSDITRTWP~sGkFs~~ 338 (488) T KOG2414 269 EYECRRRGA-------DRLAYPPVVAGGKNANTIHYVR---NNQLLKDDEMVLVDAGCELGGYVSDITRTWPISGKFSDA 338 (488) T ss_pred HHHEEECCC-------CCCCCCCEEECCCCCCEEEEEE---CCCCCCCCCEEEEECCCCCCCEECCCEECCCCCCCCCCH T ss_conf 000133575-------3103687441476652688862---455467884799945744586671440016787832707 Q ss_pred HHHHHHHHHHHHHHHHHHCCC-CCCCHHHHHHHH----HHHHHHCCCC-------------CCCCCCCCCCCCCCCCCCC Q ss_conf 843336677888889972178-978888999987----3667741865-------------4776432321135607788 Q gi|254780605|r 436 KKYYFTLVLKGMISVSTARFP-QRTRGCDLDSIA----RIFLWKYGAD-------------FAHGVGHGVGSFLPVHEGP 497 (609) Q Consensus 436 ~k~~yt~VLk~~ial~~a~fp-~g~~g~~lD~~a----R~~l~~~g~d-------------y~HgtGHGVG~~l~VHE~P 497 (609) |++.|.+||.-|-.+-...+| .|++-.+|..-. -+.|.+-|.+ |+|..||-+| ++||.-| T Consensus 339 Qr~LYeavL~vq~ecik~c~~~~g~sL~~l~~~s~~Ll~~~Lk~lGI~kt~~ee~~~~~klcPHhVgHyLG--mDVHD~p 416 (488) T KOG2414 339 QRDLYEAVLQVQEECIKYCKPSNGTSLSQLFERSNELLGQELKELGIRKTDREEMIQAEKLCPHHVGHYLG--MDVHDCP 416 (488) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCHHCC--CCCCCCC T ss_conf 89999999999999998615889953999999999999999998386541078887665308742210027--6435488 Q ss_pred CCCCCCCCCEECCCCEEEECCCEEECCCC---------CEEEEEEEEEEEC Q ss_conf 62377898600569878846860356875---------2150025999727 Q gi|254780605|r 498 QGISRTNQEPLLPGMILSNEPGYYRCGAF---------GIRIENVLCVSEP 539 (609) Q Consensus 498 ~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~---------GiRIEn~v~v~~~ 539 (609) . ++ .+.+|+||||||||||+|+|..+ ||||||+|+|++. T Consensus 417 ~-v~--r~~pL~pg~ViTIEPGvYIP~d~d~P~~FrGIGiRIEDDV~i~ed 464 (488) T KOG2414 417 T-VS--RDIPLQPGMVITIEPGVYIPEDDDPPEEFRGIGIRIEDDVAIGED 464 (488) T ss_pred C-CC--CCCCCCCCCEEEECCCEECCCCCCCCHHHCCCEEEEECCEEECCC T ss_conf 8-88--776589996598568502376578766763744786422675268 No 17 >pfam00557 Peptidase_M24 Metallopeptidase family M24. This family contains metallopeptidases. It also contains non-peptidase homologues such as the N terminal domain of Spt16 which is a histone H3-H4 binding module. Probab=100.00 E-value=2.5e-39 Score=293.92 Aligned_cols=204 Identities=38% Similarity=0.538 Sum_probs=174.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 87999984438999999875332100124530188999999987543302785555410000123354322224554332 Q gi|254780605|r 320 EGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQSN 399 (609) Q Consensus 320 ~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s~ 399 (609) +.||+|......|+...+ ..+..|+||.+++..+++...+ +.+....+|++|+++|+|++.+|+.| ++ T Consensus 1 ~~~r~A~~i~~~~~~~~~----~~i~~G~te~ei~~~~~~~~~~-----~~g~~~~~~~~~v~~g~~~~~~h~~~---~~ 68 (207) T pfam00557 1 ELMRKAARIAAAALEAAL----AAIRPGVTERELAAELEEAFLA-----RGGARGPAFPPIVASGPNAAVPHYIP---SD 68 (207) T ss_pred CHHHHHHHHHHHHHHHHH----HHCCCCCCHHHHHHHHHHHHHH-----HCCCCCCCCCCEEEECCCCCCCCCCC---CC T ss_conf 989999999999999999----9776999899999999999999-----57998889487688873314426899---97 Q ss_pred EEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC- Q ss_conf 0503898799830320105612221432103788888433366778888899721789788889999873667741865- Q gi|254780605|r 400 RLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD- 478 (609) Q Consensus 400 ~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d- 478 (609) ++|++||++++|.|++|.+|++|+|||+++|+|+++++++|+.+++++-++..+..| |+++.+|+.++++.+++.|++ T Consensus 69 ~~l~~Gd~v~~d~g~~~~gY~~d~~Rt~~~G~~~~~~~~~~~~~~~~~~~~~~~~rp-G~~~~ev~~~~~~~~~~~g~~~ 147 (207) T pfam00557 69 RVLKDGDLVLIDVGAEYDGYHSDITRTFVVGKPTPEQRELYEAVLEAQEAAIAAVKP-GVTGGDVDAAAREVLEEGGYGV 147 (207) T ss_pred CCCCCCCEEEEECCCEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHCCCCC T ss_conf 506699989996252079898530433328998999999999999999999986379-9889999999999999859998 Q ss_pred CCCCCCCCCCCCCCCCCC-CCCCCCCCCCEECCCCEEEECCCEE-ECCCCCEEEEEEEEEEE Q ss_conf 477643232113560778-8623778986005698788468603-56875215002599972 Q gi|254780605|r 479 FAHGVGHGVGSFLPVHEG-PQGISRTNQEPLLPGMILSNEPGYY-RCGAFGIRIENVLCVSE 538 (609) Q Consensus 479 y~HgtGHGVG~~l~VHE~-P~~is~~~~~~l~~Gmv~s~EPG~Y-~~g~~GiRIEn~v~v~~ 538 (609) |.|++||||| |++||. |..+.+.++.+|+||||+++|||+| .+|.+|+||||+|+||+ T Consensus 148 ~~~~~GHgiG--l~~~~~~P~~~~~~~~~~l~~GMv~~iep~~~~~~g~~g~~iedtv~VT~ 207 (207) T pfam00557 148 FPHGLGHGIG--LDVHDEGPYIISGGSDRVLEPGMVFTIEPGIYFIPGWGGVRIEDTVLVTE 207 (207) T ss_pred CCCEECCCCC--CCCCCCCCCCCCCCCCCEECCCCEEEECCEEEECCCCEEEEEEEEEEECC T ss_conf 7661025126--67778986405799998979998999989614539966999866899769 No 18 >cd01091 CDC68-like Related to aminopeptidase P and aminopeptidase M, a member of this domain family is present in cell division control protein 68, a transcription factor. Probab=100.00 E-value=1.3e-39 Score=295.97 Aligned_cols=215 Identities=20% Similarity=0.284 Sum_probs=172.9 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHH--CCCCCCCHHHHHHHHHHH---HHHHHHCCCCCCCCCCCCCCCCCC-CCCCC Q ss_conf 98799998443899999987-533210--012453018899999998---754330278555541000012335-43222 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLF-WFYSQS--LETITEIDIIKKLERCRE---EIGCKMRNPLRDIAFNTIAASGPH-AAIIH 391 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~-wL~~~~--~~~iTE~~ia~~Le~~R~---~~~~~~~~~~~~~SF~TIva~G~N-~AipH 391 (609) |.+||+|...-...|.+|+. .++..+ ...+|--.+++++|..-. +......+.-++.+|+||++||.| ++.|| T Consensus 1 l~~ik~as~~s~~~~~~~~~~~~~~iiD~e~k~~h~~l~~~~e~~i~d~kk~~~~~d~~~vd~~Y~PIVqSGg~y~~~~~ 80 (243) T cd01091 1 LNNIKKASDATVDVLKKFFVDEVEEIIDQEKKVTHSKLSDKVEKAIEDKKKYKAKLDPEQLDWCYPPIIQSGGNYDLLKS 80 (243) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHCCHHCCCCCCHHHCCEECCCCEEECCCCCCCCC T ss_conf 93789999999999999999999988401212139999999999862702203789978875324564148876235757 Q ss_pred CCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 24554332050389879983032010561222143210378888843336677888889972178978888999987366 Q gi|254780605|r 392 YQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIF 471 (609) Q Consensus 392 Y~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~ 471 (609) +.+ +|+.+..++++|+|.|+.|.+|||||||||++ .|+++|+++|..||++|-++..+..| |+++.++|..||++ T Consensus 81 ~~~---~d~~~~~~g~Il~d~G~rYk~YcSditRT~~v-~pt~~q~~~Y~~vl~aq~~~i~~~kp-G~~~~~v~~~a~~~ 155 (243) T cd01091 81 SSS---SDKLLYHFGVIICSLGARYKSYCSNIARTFLI-DPTSEQQKNYNFLLALQEEILKELKP-GAKLSDVYQKTLDY 155 (243) T ss_pred CCC---CCCCCCCCCEEEEECCCCCCCEECCCEEEEEE-CCCHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHH T ss_conf 788---75423779989998691357780265658985-79999999999999999999999726-99689999999999 Q ss_pred HHHCC--C--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCCEE----------ECCCCCEEEEEEEEEE Q ss_conf 77418--6--54776432321135607788623778986005698788468603----------5687521500259997 Q gi|254780605|r 472 LWKYG--A--DFAHGVGHGVGSFLPVHEGPQGISRTNQEPLLPGMILSNEPGYY----------RCGAFGIRIENVLCVS 537 (609) Q Consensus 472 l~~~g--~--dy~HgtGHGVG~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y----------~~g~~GiRIEn~v~v~ 537 (609) |.+.+ | .|.|++|||+| |++||.|..+++.++..|++|||||++||+| ....|||||||+|+|+ T Consensus 156 I~~~~~~l~~~f~h~~GhgiG--lE~hE~~~~l~~kn~~~lk~GMvftV~~Gf~nl~~~~~~~~~~k~yal~I~DtV~Vt 233 (243) T cd01091 156 IKKKKPELEPNFTKNLGFGIG--LEFRESSLIINAKNDRKLKKGMVFNLSIGFSNLQNPEPKDKESKTYALLLSDTILVT 233 (243) T ss_pred HHHHHHHHHHHCCCCCCCEEC--EEECCCCCCCCCCCCEECCCCCEEEEECCEEECCCCCCCCCCCCEEEEEEEEEEEEC T ss_conf 998778887257667623225--575246654489974041799789996550555687666766773699996489985 Q ss_pred ECC Q ss_conf 278 Q gi|254780605|r 538 EPE 540 (609) Q Consensus 538 ~~~ 540 (609) +.+ T Consensus 234 e~g 236 (243) T cd01091 234 EDE 236 (243) T ss_pred CCC T ss_conf 899 No 19 >PRK07281 methionine aminopeptidase; Reviewed Probab=100.00 E-value=4.2e-38 Score=285.36 Aligned_cols=228 Identities=19% Similarity=0.206 Sum_probs=182.7 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHH-CCCCCC--CCCCCCCCCCCC Q ss_conf 66619999998799998443899999987533210012453018899999998754330-278555--541000012335 Q gi|254780605|r 310 LRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKM-RNPLRD--IAFNTIAASGPH 386 (609) Q Consensus 310 lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~-~~~~~~--~SF~TIva~G~N 386 (609) |..|||+.||+.||+|-.. ..+.+..+.+.+++|+||.++.+.++++-++.++.. ..++.+ .+||.+++++.| T Consensus 1 mi~IKs~~EIe~mr~A~~i----~a~~~~~~~~~ikpGit~~eld~~~~~~~~e~ga~p~~~g~~g~~~~fp~~vc~svN 76 (286) T PRK07281 1 MITLKSAREIEAMDRAGDF----LASIHIGLRDLIKPGVDMWEVEEYVRRRCKEDNVLPLQIGVDGHMMDYPYATCCGLN 76 (286) T ss_pred CEEECCHHHHHHHHHHHHH----HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCCEEEECC T ss_conf 9070699999999999999----999999888467699989999999999999759724334787765578851364227 Q ss_pred CCCCCCCCCCCCCEEECCCCEEEEECC---------------------------CCEECCCCCCEEEEEECCCCHHHHHH Q ss_conf 432222455433205038987998303---------------------------20105612221432103788888433 Q gi|254780605|r 387 AAIIHYQATVQSNRLLQKDELLLLDSG---------------------------AQYVNGTTDITRTIAIGDVDYEKKYY 439 (609) Q Consensus 387 ~AipHY~~~~~s~~~i~~~~~~LiDsG---------------------------~qY~~GtTDiTRT~~~G~~~~e~k~~ 439 (609) .+++|+.| ++++|++||+++||.| +.|.+|++|.||||.+|++++++++. T Consensus 77 ~~v~HgiP---~~~~l~~GDiV~iD~~~g~~~d~~~~D~~~~~~~~~~~~~~~~~~y~G~~~D~~~t~~vG~~~~~~~~L 153 (286) T PRK07281 77 DEVAHAFP---RHYILKEGDLLKVDMVLSEPLDKSIVDVSKLDFDNVEQMKKYTESYRGGLADSCWAYAVGTPSDEVKNL 153 (286) T ss_pred CCCCCCCC---CCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCHHHHHH T ss_conf 60048998---777468999899963554234543344211123332211132223375350158999887989999999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC-CCCCCCCCCCCCCCCCCCCCCCC----CCCCCEECCCCEE Q ss_conf 366778888899721789788889999873667741865-47764323211356077886237----7898600569878 Q gi|254780605|r 440 FTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD-FAHGVGHGVGSFLPVHEGPQGIS----RTNQEPLLPGMIL 514 (609) Q Consensus 440 yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d-y~HgtGHGVG~~l~VHE~P~~is----~~~~~~l~~Gmv~ 514 (609) |..+.+++.+.-.++. +|.+..+|+.++|+++.++||. +.|.+||||| +++||.|+ |. +.....|+||||| T Consensus 154 ~~v~~eAl~~gI~~~~-pG~~~~dI~~ai~~~~~~~Gygvv~~~~GHGIG--~~~HE~P~-V~~y~~~~~g~~L~~GMv~ 229 (286) T PRK07281 154 MDVTKEAMYRGIEKAV-VGNRIGDIGAAIQEYAESRGYGVVRDLVGHGVG--PTMHEEPM-VPNYGTAGRGLRLREGMVL 229 (286) T ss_pred HHHHHHHHHHHHHHHC-CCCEEEHHHHHHHHHHHHCCCCCCCCCEECCCC--CCCCCCCC-CCCCCCCCCCCCCCCCCEE T ss_conf 9999999999999734-997885688999999998499677785467657--53546983-6776748999686469889 Q ss_pred EECCCEEEC------------------CCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCC Q ss_conf 846860356------------------875215002599972787787766424221011653 Q gi|254780605|r 515 SNEPGYYRC------------------GAFGIRIENVLCVSEPETINNGECLMLGFNTLTLCP 559 (609) Q Consensus 515 s~EPG~Y~~------------------g~~GiRIEn~v~v~~~~~~~~~~~~fl~Fe~lT~vP 559 (609) |||||+|.. |..+.|+|++|+||+.. -|-||.-| T Consensus 230 tIEPmi~~g~~~~~~d~~dgWt~~t~Dg~~sa~~EhtvlIT~~G-----------~eVLT~~~ 281 (286) T PRK07281 230 TIEPMINTGTWEIDTDMKTGWAHKTLDGGLSCQYEHQFVITKDG-----------PVILTSQG 281 (286) T ss_pred EEECEECCCCCCEEEECCCCEEEEECCCCEEEEECEEEEEECCC-----------CEEECCCC T ss_conf 98061716986156457997489948998777305599997991-----------28940886 No 20 >cd01066 APP_MetAP A family including aminopeptidase P, aminopeptidase M, and prolidase. Also known as metallopeptidase family M24. This family of enzymes is able to cleave amido-, imido- and amidino-containing bonds. Members exibit relatively narrow substrate specificity compared to other metallo-aminopeptidases, suggesting they play roles in regulation of biological processes rather than general protein degradation. Probab=100.00 E-value=3.8e-37 Score=278.70 Aligned_cols=204 Identities=31% Similarity=0.455 Sum_probs=176.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98799998443899999987533210012453018899999998754330278555541000012335432222455433 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQS 398 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s 398 (609) |+.||+|...-..|+ ..+.+.+..++||.+++..+...-.+.+ ...+|++|+++|+|++.+||.|+ T Consensus 1 i~~~r~a~~i~~~~~----~~~~~~~~~G~te~ei~~~~~~~~~~~g-------~~~~~~~~~~~g~~~~~~h~~~~--- 66 (207) T cd01066 1 IARLRKAAEIAEAAM----AAAAEAIRPGVTEAEVAAAIEQALRAAG-------GYPAGPTIVGSGARTALPHYRPD--- 66 (207) T ss_pred CHHHHHHHHHHHHHH----HHHHHHCCCCCCHHHHHHHHHHHHHHCC-------CCCCCCEEEEECCCCCCCCCCCC--- T ss_conf 989999999999999----9999977589989999999999999879-------98888728885474455457999--- Q ss_pred CEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC- Q ss_conf 2050389879983032010561222143210378888843336677888889972178978888999987366774186- Q gi|254780605|r 399 NRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGA- 477 (609) Q Consensus 399 ~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~- 477 (609) +++|++|+++++|+|+.|.+|++|++||+++|+|++++++.|..+++++.++..+..| |+++.+++.++++.+.+.|+ T Consensus 67 ~~~l~~gd~v~~D~g~~~~gY~~d~~Rt~~~G~~~~~~~~~~~~~~~~~~~~~~~~~p-G~~~~~i~~~~~~~~~~~g~~ 145 (207) T cd01066 67 DRRLQEGDLVLVDLGGVYDGYHADLTRTFVIGEPSDEQRELYEAVREAQEAALAALRP-GVTAEEVDAAAREVLEEHGLG 145 (207) T ss_pred CCCCCCCCEEEEEEEEEECCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHCCCC T ss_conf 8786899989999768899995203545437986610021789999999999985079-997999999999999982888 Q ss_pred -CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECC Q ss_conf -547764323211356077886237789860056987884686035687521500259997278 Q gi|254780605|r 478 -DFAHGVGHGVGSFLPVHEGPQGISRTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPE 540 (609) Q Consensus 478 -dy~HgtGHGVG~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~ 540 (609) .+.|++||||| +++||.| .+++.++.+|++|||+++||++|.+|.+|+||||+++||+.. T Consensus 146 ~~~~~~~GH~vG--~~~~e~p-~~~~~~~~~l~~gmv~~iep~~~~~~~~g~~~ed~~~Vt~~G 206 (207) T cd01066 146 PNFGHRTGHGIG--LEIHEPP-VLKAGDDTVLEPGMVFAVEPGLYLPGGGGVRIEDTVLVTEDG 206 (207) T ss_pred CCCCCCEECCCC--CCCCCCC-CCCCCCCCEECCCCEEEECCEEEECCCEEEEEEEEEEECCCC T ss_conf 512352243278--6477898-557999988789999999996076893489982179984999 No 21 >cd01089 PA2G4-like Related to aminopepdidase M, this family contains proliferation-associated protein 2G4. Family members have been implicated in cell cycle control. Probab=99.90 E-value=7.2e-22 Score=172.31 Aligned_cols=212 Identities=14% Similarity=0.148 Sum_probs=161.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHH----CCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98799998443899999987533210012453018899999998754330----27855554100001233543222245 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKM----RNPLRDIAFNTIAASGPHAAIIHYQA 394 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~----~~~~~~~SF~TIva~G~N~AipHY~~ 394 (609) |+.||+|- .-..+.|.++...+.+|+|+.++.+..++|-.+.++.. ...+.+.+|++.++ -|-.+.|+.| T Consensus 1 IeKmR~Ag----~i~a~~l~~~~~~i~pG~tt~eld~~~~~~i~~~~~~~~~~~~~~~~G~~~p~CiS--vNe~v~HGiP 74 (228) T cd01089 1 VTKYKTAG----QIANKVLKQVISLCVPGAKVVDLCEKGDKLILEELGKVYKKEKKLEKGIAFPTCIS--VNNCVCHFSP 74 (228) T ss_pred CHHHHHHH----HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEC--CCCEEECCCC T ss_conf 97899999----99999999999578699969999999999999830432567666667778672750--2544456899 Q ss_pred CCC-CCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHH--HHHHHHHHHHHHHHH--CCCCCCCHHHHHHHHH Q ss_conf 543-32050389879983032010561222143210378888843--336677888889972--1789788889999873 Q gi|254780605|r 395 TVQ-SNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKY--YFTLVLKGMISVSTA--RFPQRTRGCDLDSIAR 469 (609) Q Consensus 395 ~~~-s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~--~yt~VLk~~ial~~a--~fp~g~~g~~lD~~aR 469 (609) .+. ++++|++||++.||.|++|.||.+|.+|||.+|+++++... ...++..++-++..+ ...+|.+-.++..+.. T Consensus 75 ~~~~~~~~L~~GDiV~iD~g~~~dGy~~D~a~T~~VG~~~~~~~~~~~~~L~~~t~~al~~gI~~~~pG~~~~dI~~ai~ 154 (228) T cd01089 75 LKSDATYTLKDGDVVKIDLGCHIDGYIAVVAHTIVVGAEAETPVTGKKADVIAAAHYALEAALRLLRPGNQNSDITEAIQ 154 (228) T ss_pred CCCCCCCCCCCCCEEEEEEEEEECCEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 98888735468998999776886877987569999489671023379999999999999999997079989689999999 Q ss_pred HHHHHCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECC Q ss_conf 667741865-47764323211356077886237789860056987884686035687521500259997278 Q gi|254780605|r 470 IFLWKYGAD-FAHGVGHGVGSFLPVHEGPQGISRTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPE 540 (609) Q Consensus 470 ~~l~~~g~d-y~HgtGHGVG~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~ 540 (609) +++.++|+. ...-+|||+| -.+|++|. . +.....++.||++..||.++.+|+...|.|++|+||+.. T Consensus 155 ~~~~~~g~~~v~~~~gH~~~--~~~~~~~~-~-~~~~e~~~~~~~~~~~~~~~~dg~~~A~fEhTvlVT~~G 222 (228) T cd01089 155 KVIVDYGCTPVEGVLSHQLK--RVVSSGEG-K-AKLVECVKHGLLFPYPVLYEKEGEVVAQFKLTVLLTPNG 222 (228) T ss_pred HHHHHCCCEEEECCCCCCCC--CCCCCCCC-C-CCCCCCCCCCEEEEEEEEECCCCCEEEEEEEEEEECCCC T ss_conf 99998499677452056999--81777999-8-885413546718888789857998887833799997998 No 22 >TIGR00500 met_pdase_I methionine aminopeptidase, type I; InterPro: IPR002467 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002468 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=99.89 E-value=3.5e-21 Score=167.53 Aligned_cols=218 Identities=19% Similarity=0.250 Sum_probs=162.9 Q ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH-HHHC-CCCC----CCCCC---CCCC Q ss_conf 619999998799998443899999987533210012453018899999998754-3302-7855----55410---0001 Q gi|254780605|r 312 ATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIG-CKMR-NPLR----DIAFN---TIAA 382 (609) Q Consensus 312 aiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~-~~~~-~~~~----~~SF~---TIva 382 (609) .+|++.||++||+|-.. +.+.+..++.++..|++=.++......|-.+.+ ++.. .+|. +.-|| --+. T Consensus 2 ~l~s~~Ei~kirkag~l----~a~vl~~~~~~v~PGvST~eLD~i~~~~i~~~~hA~pAf~GY~PPfsdvgfPPft~~~C 77 (265) T TIGR00500 2 SLKSKDEIEKIRKAGRL----VAEVLEILEREVKPGVSTKELDRIAKDFIEKRGHAKPAFLGYYPPFSDVGFPPFTGSVC 77 (265) T ss_pred CCCCHHHHHHHHHHHHH----HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEE T ss_conf 55789899999999999----99999999863689887899999999999847972353567877866687885556156 Q ss_pred CCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECC-CCHHHHHHHHHHHHHH-HHHH-HHCCCCCC Q ss_conf 2335432222455433205038987998303201056122214321037-8888843336677888-8899-72178978 Q gi|254780605|r 383 SGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGD-VDYEKKYYFTLVLKGM-ISVS-TARFPQRT 459 (609) Q Consensus 383 ~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~-~~~e~k~~yt~VLk~~-ial~-~a~fp~g~ 459 (609) .==|=-++|.-|+ ..++|++||+|=||.|..|.||.+|..+||.+|+ ++++..+.-..+-.+. .+++ +++ +|. T Consensus 78 ~S~Ne~V~HGiP~--~~~vLKdGDivniD~g~~~~Gy~gD~a~Tf~~G~n~~~~~~KLl~~t~esl~~aI~P~~k--pG~ 153 (265) T TIGR00500 78 ISVNEVVIHGIPD--KKKVLKDGDIVNIDVGVIYDGYYGDTAITFLVGKNVSPEAEKLLEATEESLYKAIEPEIK--PGK 153 (265) T ss_pred EEECCEEEECCCC--CCCEEECCCEEEEEEEEEECCEEECCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCC--CCC T ss_conf 3216788851388--472651367787557788603021132313413877876788999999999852231014--798 Q ss_pred CHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCC-CCCC---CCC----CCEECCCCEEEECCCEEE-------- Q ss_conf 88899998736677418654-776432321135607788-6237---789----860056987884686035-------- Q gi|254780605|r 460 RGCDLDSIARIFLWKYGADF-AHGVGHGVGSFLPVHEGP-QGIS---RTN----QEPLLPGMILSNEPGYYR-------- 522 (609) Q Consensus 460 ~g~~lD~~aR~~l~~~g~dy-~HgtGHGVG~~l~VHE~P-~~is---~~~----~~~l~~Gmv~s~EPG~Y~-------- 522 (609) +-.+|=.+--.+..+.|+-- .-=+||||| -.+||.| +-.+ +.. +..|||||||+|||-+=. T Consensus 154 ~~~~ig~AI~~~~~~~GF~vvr~y~GHGiG--~~~HeePn~ipnY~~~~~K~nsn~~lk~GMV~~iEPMv~~~~~e~~~~ 231 (265) T TIGR00500 154 RIGEIGKAIEKYAEEKGFSVVREYCGHGIG--RKLHEEPNQIPNYYKKFTKKNSNVRLKEGMVFAIEPMVNTGTEEIVIA 231 (265) T ss_pred CHHHHHHHHHHHHHHCCCEEEECCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCEEECCCCEEEEC T ss_conf 877899999999876299786145276437--744557353666667777667887106875788626121788225671 Q ss_pred ---------CCCCCEEEEEEEEEEEC Q ss_conf ---------68752150025999727 Q gi|254780605|r 523 ---------CGAFGIRIENVLCVSEP 539 (609) Q Consensus 523 ---------~g~~GiRIEn~v~v~~~ 539 (609) ++++----|-+|+||+. T Consensus 232 ~DgWTV~T~D~~~~AhfEHT~~iT~~ 257 (265) T TIGR00500 232 ADGWTVKTKDGSLSAHFEHTIVITEN 257 (265) T ss_pred CCCCEEEEECCCEEEEEEEEEEEECC T ss_conf 68876870077421015789999889 No 23 >TIGR02993 ectoine_eutD ectoine utilization protein EutD; InterPro: IPR014335 Members of this entry are putative peptidases, or non-peptidase homologues that belong to the MEROPS peptidase family M24 (clan MG) or hydrolases similar to Xaa-Pro aminopeptidase. They belong to ectoine utilisation operons, as found in Mesorhizobium loti, Silicibacter pomeroyi, Agrobacterium tumefaciens, Pseudomonas putida and Sinorhizobium meliloti 1021, where it is known to be induced by ectoine. The exact function is unknown.. Probab=99.87 E-value=3e-20 Score=161.04 Aligned_cols=361 Identities=19% Similarity=0.275 Sum_probs=258.9 Q ss_pred CHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHHH Q ss_conf 12335877388999999860233430676028420234432013553778700233220147862000353112156787 Q gi|254780605|r 171 DMAYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLKA 250 (609) Q Consensus 171 ~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~~ 250 (609) ...|+-+.++.+|.++|+.|+++++|.+++|+|.|.+||+ |+|-.-.-|..| .++..+|++..|-...-.+...+. T Consensus 3 ~l~f~~~ey~~rl~k~r~~m~~~~~~ll~v~dPsnm~Wlt---GydGWsfyvhq~-v~~~~~GeP~WyGr~qdanGa~~t 78 (391) T TIGR02993 3 NLKFTREEYAARLAKTRAEMEARGIDLLIVTDPSNMAWLT---GYDGWSFYVHQC-VLLALEGEPIWYGRGQDANGAKRT 78 (391) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHC---CCCCEEEEEEEE-EEEECCCCEEEECCCCCCCHHHHH T ss_conf 4301188999999999999985597189984763221101---466403456567-787137852441576441213410 Q ss_pred -HHCCCCE------------EECHHHHHHHHHHCCCCCCEEEECC--CCCCHHHHHHHHH--CCCEEECCCCCCHHHHHC Q ss_conf -4125635------------7074451577863013574289725--3177889753121--395675154321156661 Q gi|254780605|r 251 -LLSAVAI------------VLDMDMMDSRLVCLARTSMPILIDP--KWISYRFFKVIAQ--KNGVMVEGSDPSCLLRAT 313 (609) Q Consensus 251 -~l~~~~~------------~~~~~~~~~~l~~l~~~~~~i~id~--~~~s~~~~~~l~~--~~~~~i~~~~pI~~lRai 313 (609) ++....+ .++.+.+...+..-.-.+..|+++- -+.+.+.+..+++ .+.++.+....|...|++ T Consensus 79 ~~~~~~ni~GyPd~yvqsterhPmd~l~~~~~~rG~~~~~iGve~dny~f~aaaf~~lq~hlP~a~f~~a~~lvnWqrav 158 (391) T TIGR02993 79 AFLDQDNIVGYPDHYVQSTERHPMDYLSEILKDRGLDSLTIGVELDNYYFSAAAFLSLQKHLPNARFKDATALVNWQRAV 158 (391) T ss_pred EEECCCCEECCCCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 01224641217730100245661889999998438664214566302688899999998624310233456767667651 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC---CCCCCCCC Q ss_conf 99999987999984438999999875-332100124530188999999987543302785555410000---12335432 Q gi|254780605|r 314 KNKVEIEGMQTAHIQDGVAMVYFLFW-FYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIA---ASGPHAAI 389 (609) Q Consensus 314 Kn~~EI~~mr~Ahi~DgvAl~kfl~w-L~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIv---a~G~N~Ai 389 (609) |++.||..||+|. -+++-++. +-+.+..|+...++...+..- |.....+|-| .||.|| -+|..++- T Consensus 159 ks~~e~~ymr~a~-----~ive~mh~ri~~~~e~Gmrk~~lva~iy~~----G~~~~~~~GG-dyPaivPllP~G~~a~a 228 (391) T TIGR02993 159 KSETELKYMRKAA-----KIVEKMHQRILERVEPGMRKCDLVADIYDA----GIRGVDGFGG-DYPAIVPLLPSGADASA 228 (391) T ss_pred CCHHHHHHHHHHH-----HHHHHHHHHHHHHHCCCCHHHHHHHHHHHH----CCCCCCCCCC-CCCCCEECCCCCCCCCC T ss_conf 3378899999889-----999999988887630252046789998751----0110123578-66410002567743146 Q ss_pred CCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHH-HHHHCCCCCCCHHHHHHHH Q ss_conf 222455433205038987998303201056122214321037888884333667788888-9972178978888999987 Q gi|254780605|r 390 IHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMIS-VSTARFPQRTRGCDLDSIA 468 (609) Q Consensus 390 pHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ia-l~~a~fp~g~~g~~lD~~a 468 (609) ||-.= .|+.++.|+=-.+...|-|.-|.+-+.||+++|+|+...-+.=..|+.|.-| |+.|+ +|-++.++-..- T Consensus 229 PhltW---dd~P~~~G~GtffeiaG~~~ryh~PlsrtvflGkP~q~fld~eka~~eG~~aGl~~a~--~Gnt~~dia~~f 303 (391) T TIGR02993 229 PHLTW---DDKPLKVGEGTFFEIAGVYKRYHCPLSRTVFLGKPTQAFLDAEKAVLEGMEAGLEAAK--PGNTCEDIAKAF 303 (391) T ss_pred CCCCC---CCCCCCCCCCCEEEHHHHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHC--CCCCHHHHHHHH T ss_conf 53322---6762204785201010112221465200466158603444123677887899898512--787478999999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECCCCCC Q ss_conf 3667741865477643232113560----778862377898600569878846860356875215002599972787787 Q gi|254780605|r 469 RIFLWKYGADFAHGVGHGVGSFLPV----HEGPQGISRTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPETINN 544 (609) Q Consensus 469 R~~l~~~g~dy~HgtGHGVG~~l~V----HE~P~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~~~~~ 544 (609) =.+|.+.|..=..+||.-|| ++- -|.-.++.|.....|++||.|--=+|+..++ .|+-|-.-+.|++. T Consensus 304 f~vl~~y~i~kd~r~GyPiG--~syPPdWGertmslrP~d~~~l~~~mtfhfm~GlW~~d-~G~e~tesi~i~~~----- 375 (391) T TIGR02993 304 FAVLKKYGIEKDSRTGYPIG--LSYPPDWGERTMSLRPGDKTVLKPGMTFHFMTGLWLED-MGLEITESILITET----- 375 (391) T ss_pred HHHHHHHCCCCCCCCCCCEE--CCCCCCCCCCEEEECCCCCCHHCCCCEEEEECCCCHHH-CCCCEEEEEEECCC----- T ss_conf 99998606400365665111--13688746520332577521002465044213420100-36420001343033----- Q ss_pred CCCCEEEEEECCCCCCCHHCCC Q ss_conf 7664242210116530011078 Q gi|254780605|r 545 GECLMLGFNTLTLCPIDRKLIL 566 (609) Q Consensus 545 ~~~~fl~Fe~lT~vP~~~~li~ 566 (609) +-|.|.-+| ++|+. T Consensus 376 ------G~e~l~~~P--r~l~v 389 (391) T TIGR02993 376 ------GVECLSSVP--RKLFV 389 (391) T ss_pred ------CCHHHHCCC--CEEEE T ss_conf ------404454077--20131 No 24 >COG0024 Map Methionine aminopeptidase [Translation, ribosomal structure and biogenesis] Probab=99.86 E-value=1.2e-19 Score=156.85 Aligned_cols=199 Identities=20% Similarity=0.205 Sum_probs=137.8 Q ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH-HCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 61999999879999844389999998753321001245301889999999875433-02785555410000123354322 Q gi|254780605|r 312 ATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCK-MRNPLRDIAFNTIAASGPHAAII 390 (609) Q Consensus 312 aiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~-~~~~~~~~SF~TIva~G~N~Aip 390 (609) .+|++.||+.||+|-. ...+.+..+...+..|+|=.|++...+++-.+.++. ...++.+..|++- .--|--.. T Consensus 4 ~ikt~~eiek~r~Ag~----i~a~~l~~~~~~v~pGvtt~Eld~~~~~~i~~~ga~pa~~gy~g~~~~~c--iSvNe~v~ 77 (255) T COG0024 4 SIKTPEEIEKMREAGK----IAAKALKEVASLVKPGVTTLELDEIAEEFIREKGAYPAFLGYKGFPFPTC--ISVNEVVA 77 (255) T ss_pred CCCCHHHHHHHHHHHH----HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEHHCCCCCCCCCEE--EEHHHEEE T ss_conf 3279999999999999----99999999998717999799999999999997486361106669885638--40315035 Q ss_pred CCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCH-HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 22455433205038987998303201056122214321037888-88433366778888899721789788889999873 Q gi|254780605|r 391 HYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDY-EKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIAR 469 (609) Q Consensus 391 HY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~-e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR 469 (609) ||.|. .++.|++||++=||.|+++.||..|..+||.+|+.++ ..++.-..+..+.-+.-+++ .+|.+-.+|-.+-- T Consensus 78 HgiP~--d~~vlk~GDiv~IDvg~~~dG~~~Dsa~T~~vg~~~~~~~~~L~~~t~eal~~~I~~v-kpG~~l~~Ig~aIq 154 (255) T COG0024 78 HGIPG--DKKVLKEGDIVKIDVGAHIDGYIGDTAITFVVGEVSDEDAKRLLEATKEALYAGIEAV-KPGARLGDIGRAIQ 154 (255) T ss_pred ECCCC--CCCCCCCCCEEEEEEEEEECCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHC-CCCCCHHHHHHHHH T ss_conf 14789--8734589998999889997772465789998788773789999999999999998750-69988899999999 Q ss_pred HHHHHCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCC---CCEECCCCEEEECCCEE Q ss_conf 667741865-47764323211356077886237789---86005698788468603 Q gi|254780605|r 470 IFLWKYGAD-FAHGVGHGVGSFLPVHEGPQGISRTN---QEPLLPGMILSNEPGYY 521 (609) Q Consensus 470 ~~l~~~g~d-y~HgtGHGVG~~l~VHE~P~~is~~~---~~~l~~Gmv~s~EPG~Y 521 (609) .+....|+. -..=|||||| -..||+|+-..... ...|+|||||+|||-+= T Consensus 155 ~~~~~~G~~vVr~~~GHgig--~~~He~p~ip~y~~~~~~~~l~~Gmv~aIEPmi~ 208 (255) T COG0024 155 EYAESRGFSVVRNLTGHGIG--RELHEEPSIPNYGKDGTGVRLKEGMVFAIEPMIN 208 (255) T ss_pred HHHHHCCCEEEECCCCCCCC--CCCCCCCEECCCCCCCCCCCCCCCCEEEEEEEEE T ss_conf 99997698771466477567--5457997025646888872436997999702787 No 25 >PRK08671 methionine aminopeptidase; Provisional Probab=99.78 E-value=1.8e-17 Score=141.67 Aligned_cols=188 Identities=16% Similarity=0.165 Sum_probs=134.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98799998443899999987533210012453018899999998754330278555541000012335432222455433 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQS 398 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s 398 (609) ++.+|+|-. ...+.+.++.....+|+|-.|+++..+++-++.++ ++|||+=+ -.|-.+.||.|.+.. T Consensus 2 ~~kyr~Ag~----I~~~vl~~~~~~~kpG~t~~el~~~~e~~i~~~ga-------~pAFP~~i--SvN~~v~H~~P~~~d 68 (293) T PRK08671 2 LEKYREAGK----IASKVREEAAKLIKPGASLLEVAEFVENRIRELGA-------KPAFPCNI--SINEVAAHYTPSPGD 68 (293) T ss_pred HHHHHHHHH----HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCC-------CCCCCCEE--CCCCEEECCCCCCCC T ss_conf 589999999----99999999997676999099999999999997599-------47777624--334687778999886 Q ss_pred CEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 20503898799830320105612221432103788888433366778888899721789788889999873667741865 Q gi|254780605|r 399 NRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD 478 (609) Q Consensus 399 ~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d 478 (609) ++.|++||++=||.|+|+.|+.+|..+|+.+|...++.-++=.-.|.+ ++..+ ++|++-.++-.+.-+++.+.|+. T Consensus 69 ~~~lk~GDiVkiD~g~~idG~~~d~a~T~~vg~~~~~Li~aa~~al~~--ai~~~--kpG~~~~dIg~~I~~~~~~~g~~ 144 (293) T PRK08671 69 ETVFPEGDVVKLDLGAHVDGYIADTAVTVDLGGKYSDLVEASREALEA--AIEIV--KPGVSVGEIGRAIEETIKSYGFK 144 (293) T ss_pred CCCCCCCCEEEEEEEEEECCEEEEEEEEEEECCCHHHHHHHHHHHHHH--HHHHC--CCCCCHHHHHHHHHHHHHHCCCE T ss_conf 616258998999788898999988889999796168999999999999--99870--69979789999999999986988 Q ss_pred CCC-CCCCCCCCCCCCCCCCCCCC---CCCCCEECCCCEEEECCCEEECCCC Q ss_conf 477-64323211356077886237---7898600569878846860356875 Q gi|254780605|r 479 FAH-GVGHGVGSFLPVHEGPQGIS---RTNQEPLLPGMILSNEPGYYRCGAF 526 (609) Q Consensus 479 y~H-gtGHGVG~~l~VHE~P~~is---~~~~~~l~~Gmv~s~EPG~Y~~g~~ 526 (609) =.. =+|||+|.| .+|++|. |- ..+...|++|||++|||-+ =.|.+ T Consensus 145 ~v~~~~gHgigry-~iH~~~~-Ipn~~~~~~~~l~~g~v~aIEpf~-ttG~g 193 (293) T PRK08671 145 PIRNLTGHGIERY-DLHAGPS-IPNYDEGGGAVLKEGDVYAIEPFA-TDGEG 193 (293) T ss_pred EECCCCCCCCCCC-CCCCCCE-ECCCCCCCCCEECCCCEEEEEEEE-ECCCC T ss_conf 9514444676651-5427987-052069987155078589994476-27887 No 26 >KOG1189 consensus Probab=99.77 E-value=3.8e-17 Score=139.42 Aligned_cols=269 Identities=18% Similarity=0.219 Sum_probs=186.8 Q ss_pred HHHHHHHHCCCCCCEEEEC-CCCCCHHH----HHHHHHCCCEEECCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5157786301357428972-53177889----753121395675154321156661999999879999844389999998 Q gi|254780605|r 263 MMDSRLVCLARTSMPILID-PKWISYRF----FKVIAQKNGVMVEGSDPSCLLRATKNKVEIEGMQTAHIQDGVAMVYFL 337 (609) Q Consensus 263 ~~~~~l~~l~~~~~~i~id-~~~~s~~~----~~~l~~~~~~~i~~~~pI~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl 337 (609) .|+..+..+...++++++= .+...-.+ -..+...+...+..+-.+..+=++|++.||+++|+|...-.+-|.+++ T Consensus 82 ~fdkii~~ik~~gk~vGvf~ke~~~G~F~~~W~~~l~~~~fn~vDis~~ls~l~avKDd~Ei~~irksa~~s~~vm~k~~ 161 (960) T KOG1189 82 LFDKIIKAIKSAGKKVGVFAKEKFQGEFMESWNKRLEAGGFNKVDISLGLSKLFAVKDDEEIANIRKSAAASSAVMNKYL 161 (960) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEHHHHHHHHEEECCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 19999999983397464432545630589999987431367523335554565033778999888887788999999999 Q ss_pred HH-HHHHH--CCCCCCCHHHHHHHHHHHH--HHHHHCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCEEECCCCEEEEE Q ss_conf 75-33210--0124530188999999987--54330278555541000012335432-2224554332050389879983 Q gi|254780605|r 338 FW-FYSQS--LETITEIDIIKKLERCREE--IGCKMRNPLRDIAFNTIAASGPHAAI-IHYQATVQSNRLLQKDELLLLD 411 (609) Q Consensus 338 ~w-L~~~~--~~~iTE~~ia~~Le~~R~~--~~~~~~~~~~~~SF~TIva~G~N~Ai-pHY~~~~~s~~~i~~~~~~LiD 411 (609) .- +...+ ...+|---+++..+..... ......+...+..|++|+-+|.|=.+ |-..+ ++..+ + ++||- T Consensus 162 ~~~~~~aiD~ekkvthskLsD~~e~~I~~~k~s~~l~~~~~d~cY~PIiqSGg~ydlk~sa~s---~~~~L--~-~I~cs 235 (960) T KOG1189 162 VDELVEAIDEEKKVTHSKLSDLMESAIEDKKYSPGLDPDLLDMCYPPIIQSGGKYDLKPSAVS---DDNHL--H-VILCS 235 (960) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCC---CCCCC--E-EEEEE T ss_conf 999999865124010678888898764223247666843266336736633776456523134---76653--0-58855 Q ss_pred CCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC----CCCCCCCCCC Q ss_conf 032010561222143210378888843336677888889972178978888999987366774186----5477643232 Q gi|254780605|r 412 SGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGA----DFAHGVGHGV 487 (609) Q Consensus 412 sG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~----dy~HgtGHGV 487 (609) .|+.|..|||.||||+.+- |+.+|.+.|+..|++|-++-.+..| |+.-.++=..+-.++.+.+. +|.-..|-|+ T Consensus 236 ~G~RynsYCSNv~RT~Lid-pssemq~nY~fLl~aqe~il~~lrp-G~ki~dVY~~~l~~v~k~~Pel~~~~~k~lG~~i 313 (960) T KOG1189 236 LGIRYNSYCSNVSRTYLID-PSSEMQENYEFLLAAQEEILKLLRP-GTKIGDVYEKALDYVEKNKPELVPNFTKNLGFGI 313 (960) T ss_pred ECCHHHHHHHCCEEEEEEC-CHHHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCE T ss_conf 0204655410220037746-7089888899999999999997458-9817889999999998539324566654304213 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCCEEEECCCEE-E-----CCCCCEEEEEEEEEEECCC Q ss_conf 1135607788623778986005698788468603-5-----6875215002599972787 Q gi|254780605|r 488 GSFLPVHEGPQGISRTNQEPLLPGMILSNEPGYY-R-----CGAFGIRIENVLCVSEPET 541 (609) Q Consensus 488 G~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y-~-----~g~~GiRIEn~v~v~~~~~ 541 (609) | |.-.|.-..|+..++..|++||||.+-=|+= + .+.|-+-|-|+|+|++.+. T Consensus 314 G--lEFREssl~inaKnd~~lk~gmvFni~lGf~nl~n~~~~~~yaL~l~DTvlv~e~~p 371 (960) T KOG1189 314 G--LEFRESSLVINAKNDRVLKKGMVFNISLGFSNLTNPESKNSYALLLSDTVLVGEDPP 371 (960) T ss_pred E--EEEECCCCCCCCCCHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHCCCEEEECCCCC T ss_conf 2--376115420166662341578489998666656786556504231022378458996 No 27 >cd01088 MetAP2 Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=99.76 E-value=6.1e-17 Score=137.96 Aligned_cols=191 Identities=18% Similarity=0.167 Sum_probs=136.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98799998443899999987533210012453018899999998754330278555541000012335432222455433 Q gi|254780605|r 319 IEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQS 398 (609) Q Consensus 319 I~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s 398 (609) |+.+|+|- ....+.+.++.....+|+|-.++++..+++-++.++ +++||+-+ -.|-.+.||.|++.. T Consensus 1 l~kyr~Ag----~i~~~vl~~~~~~~kpG~s~~el~~~~e~~i~~~~a-------~~aFP~~i--SvN~~v~H~~P~~~d 67 (291) T cd01088 1 LEKYREAG----EIHRQVRKYAQSLIKPGMTLLEIAEFVENRIRELGA-------GPAFPVNL--SINECAAHYTPNAGD 67 (291) T ss_pred CHHHHHHH----HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCC-------CCCCCCEE--CCCCEEECCCCCCCC T ss_conf 96899999----999999999996676999599999999999998598-------06778622--454677568899887 Q ss_pred CEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 20503898799830320105612221432103788888433366778888899721789788889999873667741865 Q gi|254780605|r 399 NRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD 478 (609) Q Consensus 399 ~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d 478 (609) ++.|++||++=||.|++|-|+.+|..+|+.+|.-.++.-++=...|.+ ++..++ +|++-.++-.+.-+++.++|+. T Consensus 68 ~~~lk~GDiVkiD~G~~idG~~~D~A~Tv~v~~~~~~Li~aa~~al~~--ai~~~k--pG~~~~dIg~~Ie~~~~~~g~~ 143 (291) T cd01088 68 DTVLKEGDVVKLDFGAHVDGYIADSAFTVDFDPKYDDLLEAAKEALNA--AIKEAG--PDVRLGEIGEAIEEVIESYGFK 143 (291) T ss_pred CEEECCCCEEEEEEEEEECCEEEEEEEEEECCCCHHHHHHHHHHHHHH--HHHHHC--CCCCHHHHHHHHHHHHHHCCCE T ss_conf 546337998998665787788868889996383177899999999999--999853--8989899999999999986977 Q ss_pred CC-CCCCCCCCCCCCCCCCCCCCC---CCCCCEECCCCEEEECCCEEECCCCCEE Q ss_conf 47-764323211356077886237---7898600569878846860356875215 Q gi|254780605|r 479 FA-HGVGHGVGSFLPVHEGPQGIS---RTNQEPLLPGMILSNEPGYYRCGAFGIR 529 (609) Q Consensus 479 y~-HgtGHGVG~~l~VHE~P~~is---~~~~~~l~~Gmv~s~EPG~Y~~g~~GiR 529 (609) -. -=+|||+|.| .+|++|. |- ......|++|||++|||-+= .|.+=+| T Consensus 144 ~v~~l~GH~i~ry-~ih~~~~-Ipn~~~~~~~~l~~g~v~aIEp~~t-~G~g~v~ 195 (291) T cd01088 144 PIRNLTGHSIERY-RLHAGKS-IPNVKGGEGTRLEEGDVYAIEPFAT-TGKGYVH 195 (291) T ss_pred EECCCCCCCCCCC-CCCCCCE-ECEECCCCCCEECCCCEEEEEEEEE-CCCCEEE T ss_conf 0036525577772-2169975-3425289983660686899855740-5875263 No 28 >PRK09795 aminopeptidase; Provisional Probab=99.59 E-value=2.4e-14 Score=119.85 Aligned_cols=110 Identities=19% Similarity=0.228 Sum_probs=86.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCCEEEEEECCCCEEEECCHHHHHHHHHHCCCCEE Q ss_conf 99999999999749979996088874786066710610044146765007789578367997881489999861967179 Q gi|254780605|r 16 ERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSAGIAIVLRQKSVIFVDGRYTLQVEKEVDTALFT 95 (609) Q Consensus 16 erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSaG~aiIt~~~a~L~tDgRY~lQA~~el~~~~~~ 95 (609) .||++||+.|+++++||++|.+-+ -+.|+|||+||+|+++||++++.|||||||+.||+.+....... T Consensus 2 sRL~~lr~~l~e~~lDallit~~~------------Ni~yltgf~~~~~~llit~~~~~l~~d~ry~~~a~~~~~~~~v~ 69 (361) T PRK09795 2 TLLASLRDWLKAQQLDAVLLSSRQ------------NKQPHLGISTGSGYVVISRESAHILVDSRYYADVEARAQGYQLH 69 (361) T ss_pred CHHHHHHHHHHHCCCCEEEECCHH------------HHHHHHCCCCCCEEEEEECCCCEEEECCHHHHHHHHHCCCCEEE T ss_conf 759999999998799999986725------------76574096668859999479868998742689898638797699 Q ss_pred EEECCCCCHHHHHHHHC--CCCCEEEECCEECCHHHHHHHHHHHH Q ss_conf 98668999999888743--79988996563413999999999853 Q gi|254780605|r 96 IKNIAIEPLHAWISEHG--FVGLRLGLDSRLHSSFEVDLLQKSLD 138 (609) Q Consensus 96 ~~~~~~~~~~~wl~~~~--~~g~~IG~Dp~l~s~~~~~~l~~~l~ 138 (609) +... .....+.+.+-+ ....+|||+...+|+..++++++.+. T Consensus 70 ~~~~-~~~~~~~~~~~~~~~~~~~ig~e~~~~~~~~~~~~~~~~~ 113 (361) T PRK09795 70 LLDA-THTLTTIVNQIIADEQLQTLGFEGQQVSWETAHRWQSELN 113 (361) T ss_pred EEEC-CCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCC T ss_conf 9825-5309999999976557857987324134999999985568 No 29 >pfam01321 Creatinase_N Creatinase/Prolidase N-terminal domain. This family includes the N-terminal non-catalytic domains from creatinase and prolidase. The exact function of this domain is uncertain. Probab=99.54 E-value=8.5e-14 Score=116.06 Aligned_cols=122 Identities=30% Similarity=0.504 Sum_probs=98.1 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCCEEEE-EECCCCEEEEC-CHHHHHHHHHHCCCCE Q ss_conf 999999999974997999608887478606671061004414676500778-95783679978-8148999986196717 Q gi|254780605|r 17 RVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSAGIAI-VLRQKSVIFVD-GRYTLQVEKEVDTALF 94 (609) Q Consensus 17 rL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSaG~ai-It~~~a~L~tD-gRY~lQA~~el~~~~~ 94 (609) ||++||+.|+++++||++|++. ..+.|+|||+||+|+++ |+.+++.+++| +||..|++++...... T Consensus 1 Rl~kl~~~m~~~~iDa~li~~~------------~n~~YlTGf~~~~~~~l~i~~~~~~~i~~~~~~~~a~~~~~~~~i~ 68 (127) T pfam01321 1 RLERLRELMAEAGLDALLLTSP------------ENIRYLTGFTGSRGLALVVTADGATLLTDALRYTEAAEESLDEEII 68 (127) T ss_pred CHHHHHHHHHHCCCCEEEECCH------------HHHHHHCCCCCCCCEEEEECCCCCEEEEECCHHHHHHHHCCCCCEE T ss_conf 9899999999869999998567------------8712531887654369998399788999283489999848897199 Q ss_pred EEEECCCCCHHHHHHHHCCCCCEEEECCEECCHHHHHHHHHHHHHCCCEEEECCCHHHHHH Q ss_conf 9986689999998887437998899656341399999999985334761997050011122 Q gi|254780605|r 95 TIKNIAIEPLHAWISEHGFVGLRLGLDSRLHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSL 155 (609) Q Consensus 95 ~~~~~~~~~~~~wl~~~~~~g~~IG~Dp~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~i 155 (609) ... ....+.++|.+....+.+||+|+..+|+..+++|++.+.. ..+++.+ ++|+.+ T Consensus 69 ~~~--~~~~~~~~l~~~~~~~~~igve~~~~s~~~~~~l~~~lp~--~~~~d~~-~~i~~l 124 (127) T pfam01321 69 EYD--DAEALADLLKELGLELKRLGFEGDHLTVAEYERLKEALPD--AELVDAS-GLIEEL 124 (127) T ss_pred EEC--CHHHHHHHHHHHCCCCCEEEEECCEECHHHHHHHHHHCCC--CEEEECH-HHHHHH T ss_conf 957--8789999999845457669996564559999999986899--9899878-999972 No 30 >KOG2738 consensus Probab=99.50 E-value=2.7e-12 Score=105.65 Aligned_cols=216 Identities=20% Similarity=0.207 Sum_probs=143.7 Q ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 61999999879999844389999998753321001245301889999999875433027855554100001233543222 Q gi|254780605|r 312 ATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHAAIIH 391 (609) Q Consensus 312 aiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~AipH 391 (609) -|.++.||++||+|..... +-|-.--..+..++|--|+.....+.-.+.+++..+ .-.--||-=+..--|-.|-| T Consensus 115 ~i~~~e~ie~mR~ac~Lar----evLd~Aa~~v~PgvTTdEiD~~VH~a~Ierg~YPSP-LnYy~FPKS~CTSVNEviCH 189 (369) T KOG2738 115 KILDPEGIEGMRKACRLAR----EVLDYAATLVRPGVTTDEIDRAVHNAIIERGAYPSP-LNYYGFPKSVCTSVNEVICH 189 (369) T ss_pred ECCCHHHHHHHHHHHHHHH----HHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCC-CCCCCCCHHHHCCHHHEEEC T ss_conf 5158899999999999999----999987502579763889999999999864885887-54578866662422214654 Q ss_pred CCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 24554332050389879983032010561222143210378888843336677888889972178978888999987366 Q gi|254780605|r 392 YQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIF 471 (609) Q Consensus 392 Y~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~ 471 (609) ..| ..|+|++||++=||.--.|.|+--|..+||++|+.+++.|+.-. |-+-.+..+-++..+|++-.+|-.+.-+. T Consensus 190 GIP---D~RpLedGDIvNiDVtvY~~GyHGDlneTffvG~Vde~~k~LVk-vT~EcL~kaI~~~kpGv~freiG~iI~kh 265 (369) T KOG2738 190 GIP---DSRPLEDGDIVNIDVTVYLNGYHGDLNETFFVGNVDEKAKKLVK-VTRECLEKAIAIVKPGVSFREIGNIIQKH 265 (369) T ss_pred CCC---CCCCCCCCCEEEEEEEEEECCCCCCCCCCEEEECCCHHHHHHHH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 788---86727779878688999942525765442684036988988999-99999999887708960499999999987 Q ss_pred HHHCCCC-CCCCCCCCCCCCCCCCCCCCCCCC-C---CCCEECCCCEEEECCCEEECCCC-----------------CEE Q ss_conf 7741865-477643232113560778862377-8---98600569878846860356875-----------------215 Q gi|254780605|r 472 LWKYGAD-FAHGVGHGVGSFLPVHEGPQGISR-T---NQEPLLPGMILSNEPGYYRCGAF-----------------GIR 529 (609) Q Consensus 472 l~~~g~d-y~HgtGHGVG~~l~VHE~P~~is~-~---~~~~l~~Gmv~s~EPG~Y~~g~~-----------------GiR 529 (609) -.+.|+. -.-=-|||||.++ |-.|. |-. . ....+++||+|||||-+-+-++- --. T Consensus 266 a~~~g~sVVr~ycGHGig~~F--H~~Pn-ipHya~n~a~GvM~~G~tFTIEPmit~G~~~d~tWPD~WT~vTaDG~~sAQ 342 (369) T KOG2738 266 ATKNGYSVVRSYCGHGIGRVF--HCAPN-IPHYAKNKAPGVMKPGQTFTIEPMITIGTWEDITWPDDWTAVTADGKRSAQ 342 (369) T ss_pred HHHCCCEEEHHHHCCCCCCCC--CCCCC-CHHHCCCCCCCEEECCCEEEEEEEECCCCCCCCCCCCCCEEEECCCCEECC T ss_conf 655176001132056655301--46888-600104677634426855886302403665555489986378138851111 Q ss_pred EEEEEEEEEC Q ss_conf 0025999727 Q gi|254780605|r 530 IENVLCVSEP 539 (609) Q Consensus 530 IEn~v~v~~~ 539 (609) -|-.++||+. T Consensus 343 FEhTlLVT~t 352 (369) T KOG2738 343 FEHTLLVTET 352 (369) T ss_pred EEEEEEEECC T ss_conf 3568998236 No 31 >PTZ00053 methionine aminopeptidase II; Provisional Probab=99.37 E-value=9.9e-11 Score=94.71 Aligned_cols=193 Identities=18% Similarity=0.245 Sum_probs=125.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 998799998443899999987533210012453018899999-9987543302785555410000123354322224554 Q gi|254780605|r 318 EIEGMQTAHIQDGVAMVYFLFWFYSQSLETITEIDIIKKLER-CREEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATV 396 (609) Q Consensus 318 EI~~mr~Ahi~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~-~R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~ 396 (609) -+.-+|+|. -+.-+.=.|+...++.|++=.++++.+|. .|.-.+. +..-.|++|||=++. |-...||.|.+ T Consensus 121 ~~~~~r~aa----e~hr~vr~~~q~~ikpg~~~~~i~~~ie~~~r~l~~~--~~l~~GiAFPt~iSi--N~~AAHYTPn~ 192 (435) T PTZ00053 121 QYQDLRKAA----EVHRQVRRYAQSFIKPGISLIDMTDRIEKKTKELIEK--DGLKRGWAFPTGCSL--NHCAAHYTPNT 192 (435) T ss_pred HHHHHHHHH----HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCCC--CCEEEECCCCC T ss_conf 999999999----9999999999986579982999999999999998524--561245004664541--01013158898 Q ss_pred CCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHH-HHHHHCCCCCCCHHHHHHHHHHHHHHC Q ss_conf 3320503898799830320105612221432103788888433366778888-899721789788889999873667741 Q gi|254780605|r 397 QSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMI-SVSTARFPQRTRGCDLDSIARIFLWKY 475 (609) Q Consensus 397 ~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~i-al~~a~fp~g~~g~~lD~~aR~~l~~~ 475 (609) ..++.|+.||++=||.|+|-.|+-.|..-|+.|+.--+... ..|..|-- ++..+. +|+.-.++-...-+++..+ T Consensus 193 gD~~vL~~gDVVKID~GvHVDGyIaDsAfTV~f~~kyd~Ll---~A~keAt~taIk~ag--idvrl~eIG~aIqEvieSy 267 (435) T PTZ00053 193 GDKTVLTYDDVMKVDFGTHVNGRIIDCAFTVAFNEKYDPLL---EATKEATNEGIKQAG--IDARLCDIGEAIQEVIESY 267 (435) T ss_pred CCCEEECCCCEEEEEEEEEECCEEEEEEEEEECCCCHHHHH---HHHHHHHHHHHHHHC--CCCCHHHHHHHHHHHHHHH T ss_conf 98617427998999741113637997899987587537899---999999999999851--6861888999999999752 Q ss_pred CC-----CCC-----CCCCCCCCCCCCCCCCCCCCC--C-CCCC-EECCCCEEEECCCEEECCCC Q ss_conf 86-----547-----764323211356077886237--7-8986-00569878846860356875 Q gi|254780605|r 476 GA-----DFA-----HGVGHGVGSFLPVHEGPQGIS--R-TNQE-PLLPGMILSNEPGYYRCGAF 526 (609) Q Consensus 476 g~-----dy~-----HgtGHGVG~~l~VHE~P~~is--~-~~~~-~l~~Gmv~s~EPG~Y~~g~~ 526 (609) +. .|+ -=+||+++.| .+|-|+ +|- + .... .+++|+|+.||| +=-.|++ T Consensus 268 evei~gk~y~vKPI~NL~GH~I~~Y-~IHggk-sIP~v~~~~~~k~mEEGEVyAIEt-F~STG~G 329 (435) T PTZ00053 268 EVELNGKVYPVKSIRNLTGHNIGPY-IIHSGK-SVPIVRGGEATKIMEEGELFAIET-FASTGRG 329 (435) T ss_pred EEEECCCCCCCCCCCCCCCCCCCCE-EECCCC-EECCCCCCCCCCCCCCCCEEEEEE-EECCCCC T ss_conf 3443363316740467776653023-511685-532116886562236896699966-6227864 No 32 >COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics] Probab=99.31 E-value=5.1e-11 Score=96.74 Aligned_cols=229 Identities=17% Similarity=0.168 Sum_probs=138.1 Q ss_pred EECCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC------CCCCCCHHHHHHHHHHHHHHHHHCC--- Q ss_conf 7515432115666199999987999984438999999875332100------1245301889999999875433027--- Q gi|254780605|r 300 MVEGSDPSCLLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFYSQSL------ETITEIDIIKKLERCREEIGCKMRN--- 370 (609) Q Consensus 300 ~i~~~~pI~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~~~~~------~~iTE~~ia~~Le~~R~~~~~~~~~--- 370 (609) .+..+.-...|=++|+..||+++|.+. +|...+|..+-+.+. ..+|-..+..+++..-.......+. T Consensus 157 ~~DvslgLsk~~~~KD~~E~an~~~ss----~~s~~~M~~~~~em~~~~D~~~kit~~KlsD~mes~iddv~f~q~~s~~ 232 (1001) T COG5406 157 ASDVSLGLSKMFLTKDAEEIANCRASS----AASSVLMRYFVKEMEMLWDGAFKITHGKLSDLMESLIDDVEFFQTKSLK 232 (1001) T ss_pred HHHHHHHHHHHHCCCCHHHHHHCCCCC----HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCHHHHHHHCCCC T ss_conf 323323266776006688875310232----3779999998877999874001212215788765200136666433754 Q ss_pred ------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf ------85555410000123354322224554332050389879983032010561222143210378888843336677 Q gi|254780605|r 371 ------PLRDIAFNTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVL 444 (609) Q Consensus 371 ------~~~~~SF~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VL 444 (609) .-..-.+.||+.+|..=- -+|+.-+.-.=-.|+++|+-.|-.|.+|||.||||+.+ .|+.||.+.|.-.+ T Consensus 233 l~~~~~d~lew~ytpiiqsg~~~D---l~psa~s~~~~l~gd~vl~s~GiRYn~YCSn~~RT~l~-dp~~e~~~Ny~fl~ 308 (1001) T COG5406 233 LGDIDLDQLEWCYTPIIQSGGSID---LTPSAFSFPMELTGDVVLLSIGIRYNGYCSNMSRTILT-DPDSEQQKNYEFLY 308 (1001) T ss_pred CCCCCHHHHHHHCCHHHCCCCEEE---CCCCCCCCCHHHCCCEEEEEEEEEECCCCCCCCCEEEE-CCCHHHHHHHHHHH T ss_conf 244540344210350221586130---56202158544337457999513433500234415870-77367653589999 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCCE Q ss_conf 888889972178978888999987366774186----5477643232113560778862377898600569878846860 Q gi|254780605|r 445 KGMISVSTARFPQRTRGCDLDSIARIFLWKYGA----DFAHGVGHGVGSFLPVHEGPQGISRTNQEPLLPGMILSNEPGY 520 (609) Q Consensus 445 k~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~----dy~HgtGHGVG~~l~VHE~P~~is~~~~~~l~~Gmv~s~EPG~ 520 (609) ..+--+-..+. +|+....+=.-+-.+|...|. ||.--.|-++| +.-.+.---++-.+..+||.||+|-+-=|+ T Consensus 309 ~lQk~i~~~~r-pG~~~g~iY~~~~~yi~~~~pel~pnF~~nvG~~ig--iefR~s~~~~nvkn~r~lq~g~~fnis~gf 385 (1001) T COG5406 309 MLQKYILGLVR-PGTDSGIIYSEAEKYISSNGPELGPNFIYNVGLMIG--IEFRSSQKPFNVKNGRVLQAGCIFNISLGF 385 (1001) T ss_pred HHHHHHHHHCC-CCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCC--CCCCCCCCCEECCCCCEECCCCEEEEEECC T ss_conf 99999985207-998705689999999872398647407643201113--010014662542478431135079996135 Q ss_pred E--E----CCCCCEEEEEEEEEEEC Q ss_conf 3--5----68752150025999727 Q gi|254780605|r 521 Y--R----CGAFGIRIENVLCVSEP 539 (609) Q Consensus 521 Y--~----~g~~GiRIEn~v~v~~~ 539 (609) = + .+.|..-+-|.+-|+-. T Consensus 386 ~nl~~~~~~Nnyal~l~dt~qi~ls 410 (1001) T COG5406 386 GNLINPHPKNNYALLLIDTEQISLS 410 (1001) T ss_pred CCCCCCCCCCCHHHHHCCCEEEECC T ss_conf 6557887665254441242586337 No 33 >TIGR00501 met_pdase_II methionine aminopeptidase, type II; InterPro: IPR002468 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002467 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2 and includes proteins which do not seem to be MAP, but that are clearly evolutionary related such as mouse proliferation-associated protein 1 and fission yeast curved DNA-binding protein. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=99.20 E-value=3.8e-10 Score=90.65 Aligned_cols=179 Identities=20% Similarity=0.316 Sum_probs=128.0 Q ss_pred HHHCCCCCCCHHHHHHHH-HHHHHHHH----HCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCC Q ss_conf 210012453018899999-99875433----0278-55554100001233543222245543320503898799830320 Q gi|254780605|r 342 SQSLETITEIDIIKKLER-CREEIGCK----MRNP-LRDIAFNTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQ 415 (609) Q Consensus 342 ~~~~~~iTE~~ia~~Le~-~R~~~~~~----~~~~-~~~~SF~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~q 415 (609) ..+.+|+-=+|+|+.+|. .|+.-|++ +.++ -+|.+||+=++-+.=+| ||.|.+..++.++.+|++=+|.|+| T Consensus 27 ~~I~PG~klleva~fvEnri~e~tGa~nllameDPksqGiAFP~n~S~N~~AA--HfTp~~gD~~~~k~~DvvKlD~Gah 104 (327) T TIGR00501 27 DRIVPGVKLLEVAEFVENRIRELTGAENLLAMEDPKSQGIAFPCNISINEVAA--HFTPKAGDETVFKDGDVVKLDLGAH 104 (327) T ss_pred HHCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCCCCHHCCCC--CCCCCCCCCCCCCCCCEEEEEECCE T ss_conf 52678861212212121012122064100110386325522556751213010--1686888863127787688752206 Q ss_pred EECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC---CCCC-------CCC Q ss_conf 105612221432103788888433366778888899721789788889999873667741865---4776-------432 Q gi|254780605|r 416 YVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD---FAHG-------VGH 485 (609) Q Consensus 416 Y~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d---y~Hg-------tGH 485 (609) =-|+-.|+.=||.+|..+++.++.-.++-.|.-+.-+. ..+|+.-.++=.+--..|..+-.. =.|| ||| T Consensus 105 vdGyIaD~A~TV~l~~~ddky~~L~~A~~dAl~~~i~~-~~~~v~v~eiG~~I~e~i~syeveinGetYg~kP~~NL~GH 183 (327) T TIGR00501 105 VDGYIADTAITVDLGDQDDKYKELVKAAKDALYTAIKE-IRAGVEVGEIGKAIEEVIESYEVEINGETYGVKPISNLTGH 183 (327) T ss_pred ECCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHH-CCCCCEEECCCHHHHHHHCCEEEEECCCCCCCCCCCCCCCC T ss_conf 65578704788506887015899999999998877750-06981260020776653221357765720143342227356 Q ss_pred CCCCCCCCCCCCCCC---CCCCCCEECCCCEEEECCCEEECCCC Q ss_conf 321135607788623---77898600569878846860356875 Q gi|254780605|r 486 GVGSFLPVHEGPQGI---SRTNQEPLLPGMILSNEPGYYRCGAF 526 (609) Q Consensus 486 GVG~~l~VHE~P~~i---s~~~~~~l~~Gmv~s~EPG~Y~~g~~ 526 (609) ++--|+ +|=|+ || ..+++..|++||++-||| |==.|.| T Consensus 184 s~~rY~-~H~G~-SiPn~~~~~~~~leeG~~~AIEp-FAt~G~G 224 (327) T TIGR00501 184 SMARYL-LHAGL-SIPNVKERDTTKLEEGDVVAIEP-FATDGVG 224 (327) T ss_pred CCCHHH-HCCCC-CCEEEECCCCCEEECCCEEEECC-CCCCCEE T ss_conf 400154-35895-00034279874510257888436-6768745 No 34 >pfam01321 Creatinase_N Creatinase/Prolidase N-terminal domain. This family includes the N-terminal non-catalytic domains from creatinase and prolidase. The exact function of this domain is uncertain. Probab=98.95 E-value=1.9e-09 Score=85.86 Aligned_cols=124 Identities=15% Similarity=0.211 Sum_probs=88.6 Q ss_pred HHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHCCC-CEEEC Q ss_conf 99999986023343067602842023443201355377870023322014786200035311215678741256-35707 Q gi|254780605|r 182 KIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLKALLSAV-AIVLD 260 (609) Q Consensus 182 Ki~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~~~l~~~-~~~~~ 260 (609) +|+++|+.|+++++|+++|++++|+.||+++.|+. | ..++|..++ ..+|++........+...... ....+ T Consensus 1 Rl~kl~~~m~~~~iDa~li~~~~n~~YlTGf~~~~----~---~~l~i~~~~-~~~i~~~~~~~~a~~~~~~~~i~~~~~ 72 (127) T pfam01321 1 RLERLRELMAEAGLDALLLTSPENIRYLTGFTGSR----G---LALVVTADG-ATLLTDALRYTEAAEESLDEEIIEYDD 72 (127) T ss_pred CHHHHHHHHHHCCCCEEEECCHHHHHHHCCCCCCC----C---EEEEECCCC-CEEEEECCHHHHHHHHCCCCCEEEECC T ss_conf 98999999998699999985678712531887654----3---699983997-889992834899998488971999578 Q ss_pred HHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHC--CCEEECCCCCCHHHHHC Q ss_conf 445157786301357428972531778897531213--95675154321156661 Q gi|254780605|r 261 MDMMDSRLVCLARTSMPILIDPKWISYRFFKVIAQK--NGVMVEGSDPSCLLRAT 313 (609) Q Consensus 261 ~~~~~~~l~~l~~~~~~i~id~~~~s~~~~~~l~~~--~~~~i~~~~pI~~lRai 313 (609) .+.+...+.......++|++|++..++..++.+++. +..++.....+..+|+| T Consensus 73 ~~~~~~~l~~~~~~~~~igve~~~~s~~~~~~l~~~lp~~~~~d~~~~i~~lR~I 127 (127) T pfam01321 73 AEALADLLKELGLELKRLGFEGDHLTVAEYERLKEALPDAELVDASGLIEELRMI 127 (127) T ss_pred HHHHHHHHHHHCCCCCEEEEECCEECHHHHHHHHHHCCCCEEEECHHHHHHHCCC T ss_conf 7899999998454576699965645599999999868999899878999972679 No 35 >COG0006 PepP Xaa-Pro aminopeptidase [Amino acid transport and metabolism] Probab=98.21 E-value=1.9e-05 Score=57.95 Aligned_cols=127 Identities=25% Similarity=0.365 Sum_probs=87.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCC--CCCE--EEEEECCC-CEEEECCHHHHHHHHH Q ss_conf 58999999999997499799960888747860667106100441467--6500--77895783-6799788148999986 Q gi|254780605|r 14 TFERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFT--GSAG--IAIVLRQK-SVIFVDGRYTLQVEKE 88 (609) Q Consensus 14 ~~erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFt--GSaG--~aiIt~~~-a~L~tDgRY~lQA~~e 88 (609) ...|+.++|..|.++++|++++.+. .-+.|+|||+ |..+ .+++..+. +.||++++|..+|..+ T Consensus 10 ~~~r~~~~~~~~~~~~~~~~~~~~~------------~n~~yltg~~~~~~~~~~~~i~~~~~~~~l~~~~~~~~~~~~~ 77 (384) T COG0006 10 YRARLARLRELMEEAGLDALLLTSP------------SNFYYLTGFDAFGFERLQALLVPAEGEPVLFVRGRDEEAAKET 77 (384) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCC------------CCCEEEECCCCCCCCCCEEEEEECCCCEEEEECCHHHHHHHHH T ss_conf 5679999999987679877995277------------6404430467666555315897069861999661646677553 Q ss_pred HCCC--CEEEEECC-CC-----CHHHHHHHHCCCCCEEEECCEE--CCHHHHHHHHHHHHHCCCEEEECCCHHHHHH Q ss_conf 1967--17998668-99-----9999888743799889965634--1399999999985334761997050011122 Q gi|254780605|r 89 VDTA--LFTIKNIA-IE-----PLHAWISEHGFVGLRLGLDSRL--HSSFEVDLLQKSLDKIEGVIVDVPYNPIDSL 155 (609) Q Consensus 89 l~~~--~~~~~~~~-~~-----~~~~wl~~~~~~g~~IG~Dp~l--~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~i 155 (609) ..-. .+...... .. .+.+++........++|++... .+...+..+.+.+... +++... ++++.+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~--~~~~~~-~~i~~l 151 (384) T COG0006 78 SWIKLENVEVYEDDEDPAAPLDLLGALLEELGLAGKRIGIESASIFLTLAAFERLQAALPRA--ELVDAS-DLVDRL 151 (384) T ss_pred CCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCC--EEECCH-HHHHHH T ss_conf 11235674277525554456889999998754467662521002324288899998646776--575027-789998 No 36 >KOG2775 consensus Probab=97.92 E-value=0.00082 Score=46.55 Aligned_cols=173 Identities=20% Similarity=0.281 Sum_probs=104.6 Q ss_pred HHHHHCCCCCCCHHHHHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEEC Q ss_conf 332100124530188999999-9875433027855554100001233543222245543320503898799830320105 Q gi|254780605|r 340 FYSQSLETITEIDIIKKLERC-REEIGCKMRNPLRDIAFNTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVN 418 (609) Q Consensus 340 L~~~~~~~iTE~~ia~~Le~~-R~~~~~~~~~~~~~~SF~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~ 418 (609) +...+.+|||=+++++.||.- |+-... +..-.|+.|||=++- |..-.||.|+.....+++.+|+.-||+|.|-.| T Consensus 102 v~s~ikPGmtm~ei~e~iEnttR~li~e--~gl~aGi~FPtG~Sl--N~cAAHyTpNaGd~tVLqydDV~KiDfGthi~G 177 (397) T KOG2775 102 VQSIIKPGMTMIEICETIENTTRKLILE--NGLNAGIGFPTGCSL--NHCAAHYTPNAGDKTVLKYDDVMKIDFGTHIDG 177 (397) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHH--CCCCCCCCCCCCCCC--CCHHHHCCCCCCCCEEEEECCEEEEECCCCCCC T ss_conf 9986167642999999998889999874--551027667776662--103430689999834642065688740211067 Q ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHH-CCCCC--CCHHHHHHHHHHHHHHCCCC-----C-----CCCCCC Q ss_conf 61222143210378888843336677888889972-17897--88889999873667741865-----4-----776432 Q gi|254780605|r 419 GTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTA-RFPQR--TRGCDLDSIARIFLWKYGAD-----F-----AHGVGH 485 (609) Q Consensus 419 GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a-~fp~g--~~g~~lD~~aR~~l~~~g~d-----y-----~HgtGH 485 (609) ---|..=|++|..- |.-.|.|--+.-.+ +.-.| +.-+++-.+.-.++..+-.. | ..=-|| T Consensus 178 rIiDsAFTv~F~p~-------~d~Ll~AvreaT~tGIkeaGiDvRlcdiG~aiqEVmeSyEvEi~Gk~~~VKpIrnLnGH 250 (397) T KOG2775 178 RIIDSAFTVAFNPK-------YDPLLAAVREATNTGIKEAGIDVRLCDIGEAIQEVMESYEVEINGKTYQVKPIRNLNGH 250 (397) T ss_pred EEEEEEEEEEECCC-------CCHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHEEEEECCCEEEEEECCCCCCC T ss_conf 27534568860755-------20899999988750265528402321036889997663078767816200013135787 Q ss_pred CCCCCCCCCCC---CCCCCCCCCCEECCCCEEEECCCEEECCCC Q ss_conf 32113560778---862377898600569878846860356875 Q gi|254780605|r 486 GVGSFLPVHEG---PQGISRTNQEPLLPGMILSNEPGYYRCGAF 526 (609) Q Consensus 486 GVG~~l~VHE~---P~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~ 526 (609) .++.| -+|-| |. +.-+..+.+++|-++.||- +=-.|++ T Consensus 251 SI~~y-rIH~gksVPi-Vkgge~trmee~e~yAIET-FgSTGkG 291 (397) T KOG2775 251 SIAQY-RIHGGKSVPI-VKGGEQTRMEEGEIYAIET-FGSTGKG 291 (397) T ss_pred CCCCE-EEECCCCCCE-ECCCCCEEECCCEEEEEEE-ECCCCCC T ss_conf 43324-7625833222-4477311320770699984-0467762 No 37 >KOG2776 consensus Probab=97.35 E-value=0.0043 Score=41.50 Aligned_cols=94 Identities=13% Similarity=0.197 Sum_probs=64.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHH--HHHHHHHH Q ss_conf 278555541000012335432222455433-205038987998303201056122214321037888884--33366778 Q gi|254780605|r 369 RNPLRDIAFNTIAASGPHAAIIHYQATVQS-NRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKK--YYFTLVLK 445 (609) Q Consensus 369 ~~~~~~~SF~TIva~G~N~AipHY~~~~~s-~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k--~~yt~VLk 445 (609) ++...|.+|||-+. .|-..-||.|-+.. +..++.||++=||.|+|.-|+++-+.-|+++|.+++-+- +--..+.- T Consensus 71 K~~~KGIAfPT~Is--vnncv~h~sPlksd~~~~Lk~GDvVKIdLG~HiDGfiA~vaHT~VV~~~~~~~vtG~kADvI~A 148 (398) T KOG2776 71 KDFEKGIAFPTSIS--VNNCVCHFSPLKSDADYTLKEGDVVKIDLGVHIDGFIALVAHTIVVGPAPDTPVTGRKADVIAA 148 (398) T ss_pred HHHHCCCCCCCEEC--CCCCEECCCCCCCCCCCCCCCCCEEEEEECCEECCCEEEEEEEEEECCCCCCCCCCCHHHHHHH T ss_conf 44522655560563--0010014677877876322479879997411130011001445895378888564713579999 Q ss_pred HHHHHHHHC--CCCCCCHHHH Q ss_conf 888899721--7897888899 Q gi|254780605|r 446 GMISVSTAR--FPQRTRGCDL 464 (609) Q Consensus 446 ~~ial~~a~--fp~g~~g~~l 464 (609) ||+|+..|. ..+|.+-.++ T Consensus 149 Ah~A~eaa~rllkpgn~n~~v 169 (398) T KOG2776 149 AHLAAEAALRLLKPGNTNTQV 169 (398) T ss_pred HHHHHHHHHHHHCCCCCCCHH T ss_conf 899999999984789999537 No 38 >PRK12318 methionine aminopeptidase; Provisional Probab=96.66 E-value=0.011 Score=38.79 Aligned_cols=65 Identities=12% Similarity=0.166 Sum_probs=31.0 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCEEECCCCEEEEECCC Q ss_conf 0012453018899999998754330278555541000012335--4322224554332050389879983032 Q gi|254780605|r 344 SLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPH--AAIIHYQATVQSNRLLQKDELLLLDSGA 414 (609) Q Consensus 344 ~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N--~AipHY~~~~~s~~~i~~~~~~LiDsG~ 414 (609) ++.|++-.++...++.+-.+.+...-..|.|=. +|-.-| -.+|||.+ .++.+|++|-++-|-=|- T Consensus 178 ikpG~~~~dI~~ai~~~~~~~G~~~~~~~~GHG----IG~~vHE~P~I~~~~~--~~~~~L~pGMV~TIEPgI 244 (291) T PRK12318 178 LKPNLPLYEIGEVIENCADTYGFSVVDQFVGHG----VGIKFHENPYVPHHRN--RSKIPLAPGMIFTIEPMI 244 (291) T ss_pred HCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC----CCCCCCCCCCCCCCCC--CCCCEECCCCEEEECCCC T ss_conf 079964779999999999984991176800287----2731034886564569--998783799989879921 No 39 >KOG2413 consensus Probab=96.63 E-value=0.0045 Score=41.41 Aligned_cols=105 Identities=15% Similarity=0.259 Sum_probs=72.6 Q ss_pred HHHHHHHHHCCCCCEEEEECCCCC------------HHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHH Q ss_conf 999999860233430676028420------------23443201355377870023322014786200035311215678 Q gi|254780605|r 182 KIRDICKILHQKEVGAVFICDPSS------------IAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLK 249 (609) Q Consensus 182 Ki~~ir~~L~~~~~daliit~~d~------------IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~ 249 (609) .+.++|+.|+..+++|+++.+-|. .+||.++-|+ .++|+|+. +++.|++|.++..+.-. T Consensus 11 ~~~~~~~~~~~~~i~aYi~Ps~DaH~sEy~~~~D~R~~flsGFsGs--------ag~Avit~-~~a~lwtD~RY~~QA~~ 81 (606) T KOG2413 11 ELMRLRELMKSPPIDAYILPSTDAHQSEYIADRDERRAFLSGFSGS--------AGTAVITE-EEAALWTDGRYFQQAEQ 81 (606) T ss_pred HHHHHHHHHCCCCCEEEECCCCCHHHHHHHCCHHHHHHHHCCCCCC--------CCEEEEEC-CCCEEEECCHHHHHHHH T ss_conf 9999998735799339984687133433423145666631266787--------60589834-74048875689998775 Q ss_pred HHHCCCCEEECH---HHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHH Q ss_conf 741256357074---4515778630135742897253177889753121 Q gi|254780605|r 250 ALLSAVAIVLDM---DMMDSRLVCLARTSMPILIDPKWISYRFFKVIAQ 295 (609) Q Consensus 250 ~~l~~~~~~~~~---~~~~~~l~~l~~~~~~i~id~~~~s~~~~~~l~~ 295 (609) +.-.+....... ....+.+........+|++||.-.++..+..+.+ T Consensus 82 qld~~W~l~k~~~~~~~v~~wl~~~l~~~~~vG~Dp~Lis~~~~~~~~~ 130 (606) T KOG2413 82 QLDSNWTLMKMGEDVPTVEEWLAKVLPEGSRVGIDPTLISFDAWKQLEK 130 (606) T ss_pred HHCCCCEEEECCCCCCCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHH T ss_conf 3154400300169986599999976778660364864305067776898 No 40 >cd01089 PA2G4-like Related to aminopepdidase M, this family contains proliferation-associated protein 2G4. Family members have been implicated in cell cycle control. Probab=95.99 E-value=0.064 Score=33.37 Aligned_cols=90 Identities=16% Similarity=0.133 Sum_probs=39.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC-C--CCCCCCCCCC--CCCC-----CCCCCCCCCCC Q ss_conf 88433366778888899721789788889999873667741865-4--7764323211--3560-----77886237789 Q gi|254780605|r 435 EKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGAD-F--AHGVGHGVGS--FLPV-----HEGPQGISRTN 504 (609) Q Consensus 435 e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~d-y--~HgtGHGVG~--~l~V-----HE~P~~is~~~ 504 (609) ++|++=.+|-+.+-.+...+.| |++-.+||.+|.+.+.+.|-. | .++-.+|+++ +.+| |=-|. . ..+ T Consensus 3 KmR~Ag~i~a~~l~~~~~~i~p-G~tt~eld~~~~~~i~~~~~~~~~~~~~~~~G~~~p~CiSvNe~v~HGiP~-~-~~~ 79 (228) T cd01089 3 KYKTAGQIANKVLKQVISLCVP-GAKVVDLCEKGDKLILEELGKVYKKEKKLEKGIAFPTCISVNNCVCHFSPL-K-SDA 79 (228) T ss_pred HHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEECCCCEEECCCCC-C-CCC T ss_conf 8999999999999999957869-996999999999999983043256766666777867275025444568999-8-888 Q ss_pred CCEECCCCEEEECCCEEECCCCC Q ss_conf 86005698788468603568752 Q gi|254780605|r 505 QEPLLPGMILSNEPGYYRCGAFG 527 (609) Q Consensus 505 ~~~l~~Gmv~s~EPG~Y~~g~~G 527 (609) +.+|++|-++++-=|.++.|.+| T Consensus 80 ~~~L~~GDiV~iD~g~~~dGy~~ 102 (228) T cd01089 80 TYTLKDGDVVKIDLGCHIDGYIA 102 (228) T ss_pred CCCCCCCCEEEEEEEEEECCEEE T ss_conf 73546899899977688687798 No 41 >TIGR00495 crvDNA_42K DNA-binding protein, 42 kDa; InterPro: IPR004545 Proteins of this family have been identified in a number of species as a nuclear (but not nucleolar) protein with a cell cycle dependence. Various names given to members of this family have included cell cycle protein p38-2G4, DNA-binding protein GBP16, and proliferation-associated protein 1. The proteins are closely related to methionine aminopeptidase, a cobalt-binding protein. This group sequences are classified as non-peptidase homologues in MEROPS peptidase family M24 (clan MG). . Probab=95.42 E-value=0.034 Score=35.29 Aligned_cols=100 Identities=15% Similarity=0.216 Sum_probs=66.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC----EEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHH--HHHH Q ss_conf 2785555410000123354322224554332----050389879983032010561222143210378888843--3366 Q gi|254780605|r 369 RNPLRDIAFNTIAASGPHAAIIHYQATVQSN----RLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKY--YFTL 442 (609) Q Consensus 369 ~~~~~~~SF~TIva~G~N~AipHY~~~~~s~----~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~--~yt~ 442 (609) .+-..|.+|||-++ +|-..-|..|-+... +.|+.||+|=||.|+|==|+.+=+.-|+.+|..++++-- .=.. T Consensus 73 K~~~kGiAFPT~vS--vNn~vgHfSPLkSd~eand~~Lk~GD~VKIdLG~HiDGFiA~vAhT~vig~~~~~~~tG~~ADv 150 (407) T TIGR00495 73 KELEKGIAFPTCVS--VNNCVGHFSPLKSDEEANDLVLKEGDVVKIDLGCHIDGFIALVAHTIVIGVASEEPVTGRKADV 150 (407) T ss_pred CCCCCCCCCCCCCC--CCCCHHHCCCCCCCCCHHHHHHHCCCEEEEECCCCEEEEEECCEEEEEEEECCCCCCCCHHHHH T ss_conf 00115732476417--0360332688788831334453338747981366440356415078998514587655706789 Q ss_pred HHHHHHHHHHHC--CCCCCCHHHH----HHHHHH Q ss_conf 778888899721--7897888899----998736 Q gi|254780605|r 443 VLKGMISVSTAR--FPQRTRGCDL----DSIARI 470 (609) Q Consensus 443 VLk~~ial~~a~--fp~g~~g~~l----D~~aR~ 470 (609) .--||.|+..|. ..||-+-.++ |.+|.. T Consensus 151 I~AA~~Al~aAlRlvkPG~~n~~VT~~~~KvA~~ 184 (407) T TIGR00495 151 IAAAHLALEAALRLVKPGKTNTQVTEAIEKVADS 184 (407) T ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH T ss_conf 9999999998874048879871478899999732 No 42 >PRK07281 methionine aminopeptidase; Reviewed Probab=94.45 E-value=0.29 Score=28.77 Aligned_cols=26 Identities=12% Similarity=0.184 Sum_probs=17.7 Q ss_pred CCCCCCCCCCCCEEECCCCEEEEECCC Q ss_conf 322224554332050389879983032 Q gi|254780605|r 388 AIIHYQATVQSNRLLQKDELLLLDSGA 414 (609) Q Consensus 388 AipHY~~~~~s~~~i~~~~~~LiDsG~ 414 (609) .+|||.. +..+.+|++|-++-|-=|- T Consensus 210 ~V~~y~~-~~~g~~L~~GMv~tIEPmi 235 (286) T PRK07281 210 MVPNYGT-AGRGLRLREGMVLTIEPMI 235 (286) T ss_pred CCCCCCC-CCCCCCCCCCCEEEEECEE T ss_conf 3677674-8999686469889980617 No 43 >TIGR00500 met_pdase_I methionine aminopeptidase, type I; InterPro: IPR002467 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002468 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=94.30 E-value=0.36 Score=28.15 Aligned_cols=58 Identities=22% Similarity=0.264 Sum_probs=27.6 Q ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC---------C-CCCCCCCCCC---CCEEECCCCEEE Q ss_conf 10012453018899999998754330278555541000012335---------4-3222245543---320503898799 Q gi|254780605|r 343 QSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPH---------A-AIIHYQATVQ---SNRLLQKDELLL 409 (609) Q Consensus 343 ~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N---------~-AipHY~~~~~---s~~~i~~~~~~L 409 (609) ++..|..=.++.+.++.+-++.| |.+|=.++-| = .||||-.... +|-+|++|=++- T Consensus 148 ~~kpG~~~~~ig~AI~~~~~~~G-----------F~vvr~y~GHGiG~~~HeePn~ipnY~~~~~K~nsn~~lk~GMV~~ 216 (265) T TIGR00500 148 EIKPGKRIGEIGKAIEKYAEEKG-----------FSVVREYCGHGIGRKLHEEPNQIPNYYKKFTKKNSNVRLKEGMVFA 216 (265) T ss_pred CCCCCCCHHHHHHHHHHHHHHCC-----------CEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEE T ss_conf 01479887789999999987629-----------9786145276437744557353666667777667887106875788 Q ss_pred EE Q ss_conf 83 Q gi|254780605|r 410 LD 411 (609) Q Consensus 410 iD 411 (609) |- T Consensus 217 iE 218 (265) T TIGR00500 217 IE 218 (265) T ss_pred EC T ss_conf 62 No 44 >COG0024 Map Methionine aminopeptidase [Translation, ribosomal structure and biogenesis] Probab=93.73 E-value=0.53 Score=26.94 Aligned_cols=57 Identities=16% Similarity=0.202 Sum_probs=30.4 Q ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC---------CCCCCCCCCCCCEEECCCCEEEEE Q ss_conf 100124530188999999987543302785555410000123354---------322224554332050389879983 Q gi|254780605|r 343 QSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAASGPHA---------AIIHYQATVQSNRLLQKDELLLLD 411 (609) Q Consensus 343 ~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva~G~N~---------AipHY~~~~~s~~~i~~~~~~LiD 411 (609) +++.+.+=.++.+.++++-+..+ |.++-.+..|| .+|||.... ...+|++|.++-|. T Consensus 139 ~vkpG~~l~~Ig~aIq~~~~~~G-----------~~vVr~~~GHgig~~~He~p~ip~y~~~~-~~~~l~~Gmv~aIE 204 (255) T COG0024 139 AVKPGARLGDIGRAIQEYAESRG-----------FSVVRNLTGHGIGRELHEEPSIPNYGKDG-TGVRLKEGMVFAIE 204 (255) T ss_pred HCCCCCCHHHHHHHHHHHHHHCC-----------CEEEECCCCCCCCCCCCCCCEECCCCCCC-CCCCCCCCCEEEEE T ss_conf 50699888999999999999769-----------87714664775675457997025646888-87243699799970 No 45 >PRK12897 methionine aminopeptidase; Reviewed Probab=92.62 E-value=0.91 Score=25.33 Aligned_cols=25 Identities=16% Similarity=0.274 Sum_probs=16.0 Q ss_pred CCCCCCCCCCCEEECCCCEEEEECCC Q ss_conf 22224554332050389879983032 Q gi|254780605|r 389 IIHYQATVQSNRLLQKDELLLLDSGA 414 (609) Q Consensus 389 ipHY~~~~~s~~~i~~~~~~LiDsG~ 414 (609) ++||. .+.++.+|++|-++-|-=|- T Consensus 180 i~~~~-~~~~~~~L~~GMv~tIEPgI 204 (248) T PRK12897 180 IFHFG-KQGQGPELQEGMVITIEPIV 204 (248) T ss_pred CCCCC-CCCCCCCCCCCCEEEEECEE T ss_conf 26677-59999674689789982546 No 46 >TIGR01406 dnaQ_proteo DNA polymerase III, epsilon subunit; InterPro: IPR006309 These represent DnaQ, the DNA polymerase III epsilon subunit, as found in most Proteobacteria. It consists largely of an exonuclease region as described in IPR006055 from INTERPRO. In Gram-positive bacteria, closely related regions are found both in the Gram-positive type DNA polymerase III alpha subunit and as an additional N-terminal domain of a DinG-family helicase. Both are not included in this set of sequences, as they are smaller proteins.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=89.01 E-value=0.67 Score=26.24 Aligned_cols=14 Identities=14% Similarity=0.420 Sum_probs=6.1 Q ss_pred CCEEECCCCEEEEE Q ss_conf 32050389879983 Q gi|254780605|r 398 SNRLLQKDELLLLD 411 (609) Q Consensus 398 s~~~i~~~~~~LiD 411 (609) ++|.|+=|.+=+|| T Consensus 22 GhRivEIGavE~vn 35 (246) T TIGR01406 22 GHRIVEIGAVELVN 35 (246) T ss_pred CCEEEEEEEEEEEC T ss_conf 86268882185673 No 47 >TIGR03241 arg_catab_astB succinylarginine dihydrolase. Members of this family are succinylarginine dihydrolase (EC 3.5.3.23), the second of five enzymes in the arginine succinyltransferase (AST) pathway. Probab=88.31 E-value=1.5 Score=23.90 Aligned_cols=310 Identities=16% Similarity=0.167 Sum_probs=151.8 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCCEEEEEEC------------CCCEEE------ Q ss_conf 89999999999974997999608887478606671061004414676500778957------------836799------ Q gi|254780605|r 15 FERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSAGIAIVLR------------QKSVIF------ 76 (609) Q Consensus 15 ~erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSaG~aiIt~------------~~a~L~------ 76 (609) .+-|.|.|.++.--=..+++-|..-|+..= -| =-|||||...+|--. ..+.+| T Consensus 41 lQGL~KMk~L~dlGl~QGvlpPqeRP~~~~------Lr---~lGF~GsD~~vl~~aak~aP~Ll~a~~SASsMWtANAAT 111 (443) T TIGR03241 41 KQGLLKMKALADLGFKQGVLPPQERPDIPV------LR---RLGFSGSDEQVLAKAARQAPVLLSAVSSASSMWTANAAT 111 (443) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCHHH------HH---HHCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCC T ss_conf 979999999997688650369988988078------99---828898779999999987989999886068887740055 Q ss_pred -------ECCHHHHHH----------------HHHH-----CCCCEEEEEC-------CCCCH---HHHHHHHCCCCCEE Q ss_conf -------788148999----------------9861-----9671799866-------89999---99888743799889 Q gi|254780605|r 77 -------VDGRYTLQV----------------EKEV-----DTALFTIKNI-------AIEPL---HAWISEHGFVGLRL 118 (609) Q Consensus 77 -------tDgRY~lQA----------------~~el-----~~~~~~~~~~-------~~~~~---~~wl~~~~~~g~~I 118 (609) .|||-++-. .++| |...|..+.. |+++- .-...+....|..+ T Consensus 112 VSPSaDt~DgrvHfTpANL~~~~HRsiE~~~T~~~L~~iF~d~~~F~~H~~LP~~~~fgDEGAANH~Rlc~~~g~~Gvel 191 (443) T TIGR03241 112 VSPSADTADGRVHFTVANLNNKFHRSIEHPTTTRLLKAIFNDERYFAHHDALPQVALFGDEGAANHNRLGGEYGSPGVQF 191 (443) T ss_pred CCCCCCCCCCCEEEECHHHCCCCHHCCCCHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCHHHCHHCCCCCCCCCEEE T ss_conf 06887678985675002422341111574769999999828965164068888878767744321200112148984599 Q ss_pred ------EEC----CEECCHHHHHHHHHHHH------HCCCEEEECCCHHHHH-HHC-CCCCCCC-CEEEECCHHHCCCCH Q ss_conf ------965----63413999999999853------3476199705001112-201-2763322-000120123358773 Q gi|254780605|r 119 ------GLD----SRLHSSFEVDLLQKSLD------KIEGVIVDVPYNPIDS-LWK-DRPQRLY-RKVAMQDMAYAGRES 179 (609) Q Consensus 119 ------G~D----p~l~s~~~~~~l~~~l~------~~~~~l~~~~~NlVD~-iW~-~rP~~~~-~~i~~~~~~~aG~s~ 179 (609) +|+ |.-++..+-..-.+... .....+..-+..-||. +.. |--.... +-.|.|...|. .. T Consensus 192 FVyGr~a~~~~~~p~rfPARQtlEASqAvARlH~L~~~~~vf~qQnp~aId~GvFHNDVIaV~N~~vLf~He~AF~--d~ 269 (443) T TIGR03241 192 FVYGRSAFGRGPRPKRYPARQTLEASQAVARLHGLNPAQVVFAQQNPDVIDQGVFHNDVIAVSNRNVLFHHQQAFL--NQ 269 (443) T ss_pred EEEECHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHCCCCCCCEEEECCCCEEEEHHHHHC--CH T ss_conf 9960111057998778773668999999999739994365887449888744776345588506643577686642--88 Q ss_pred HHHHHHHHHHHCCCCCEEEEECCCCC-HHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHCCCCEE Q ss_conf 88999999860233430676028420-23443201355377870023322014786200035311215678741256357 Q gi|254780605|r 180 QEKIRDICKILHQKEVGAVFICDPSS-IAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLKALLSAVAIV 258 (609) Q Consensus 180 ~eKi~~ir~~L~~~~~daliit~~d~-IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~~~l~~~~~~ 258 (609) ...++.||+++...+.+..+|--+++ |. =.|.--|-+|.|-++-..+|+..|.+.. +.+ T Consensus 270 ~~~l~~l~~~~~~~~~~~~~ieVp~~~vs------l~dAV~SYLFNSQLlt~pdg~M~Li~P~-----Ecr--------- 329 (443) T TIGR03241 270 SQVLDELRAKLAGLGQQFVAIEVPDAEVS------VADAVSSYLFNSQLLSREDGKMMLVVPE-----ECR--------- 329 (443) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCCCCC------HHHHHHHHHHCCEEEECCCCCEEEECCH-----HHH--------- T ss_conf 99999999986335788179995101177------9999887102014423599958997256-----773--------- Q ss_pred ECHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHCCCEEECCCCC-CHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 074451577863013574289725317788975312139567515432-1156661999999879999844389999998 Q gi|254780605|r 259 LDMDMMDSRLVCLARTSMPILIDPKWISYRFFKVIAQKNGVMVEGSDP-SCLLRATKNKVEIEGMQTAHIQDGVAMVYFL 337 (609) Q Consensus 259 ~~~~~~~~~l~~l~~~~~~i~id~~~~s~~~~~~l~~~~~~~i~~~~p-I~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl 337 (609) +.......+..+...+..|- .-.-+.+.++..+- ..| |..+|.+=|+.|++.+..+-+.|..-+.+.- T Consensus 330 -~~~~v~~~l~~l~~~~~PI~---~v~~~DlrQSM~NG-------GGPACLRLRVvL~~~E~~avn~~~lm~~~l~~~L~ 398 (443) T TIGR03241 330 -ENAAVWAYLNELVAGGGPID---EVRVFDLRESMRNG-------GGPACLRLRVVLNDAELAAVNPAVMMNDALFATLN 398 (443) T ss_pred -CCHHHHHHHHHHHHCCCCCC---EEEEECHHHHHHCC-------CCCEEEEEEEECCHHHHHHCCCCEECCHHHHHHHH T ss_conf -68789999999985599855---38994178785658-------98722578763688999722976133899999999 Q ss_pred HHHHHHHCCCCCCCHHHH--HHHHHHHHHHH Q ss_conf 753321001245301889--99999987543 Q gi|254780605|r 338 FWFYSQSLETITEIDIIK--KLERCREEIGC 366 (609) Q Consensus 338 ~wL~~~~~~~iTE~~ia~--~Le~~R~~~~~ 366 (609) .|+++.-.+.++.-|++. .|.+-|..... T Consensus 399 ~wV~~hYRd~L~~~DLaDP~Ll~E~r~ALDe 429 (443) T TIGR03241 399 AWVDRHYRDRLSAADLADPQLLIECRTALDE 429 (443) T ss_pred HHHHHHCCCCCCHHHCCCHHHHHHHHHHHHH T ss_conf 9999865333993453697999999999999 No 48 >TIGR02993 ectoine_eutD ectoine utilization protein EutD; InterPro: IPR014335 Members of this entry are putative peptidases, or non-peptidase homologues that belong to the MEROPS peptidase family M24 (clan MG) or hydrolases similar to Xaa-Pro aminopeptidase. They belong to ectoine utilisation operons, as found in Mesorhizobium loti, Silicibacter pomeroyi, Agrobacterium tumefaciens, Pseudomonas putida and Sinorhizobium meliloti 1021, where it is known to be induced by ectoine. The exact function is unknown.. Probab=87.62 E-value=0.84 Score=25.56 Aligned_cols=152 Identities=20% Similarity=0.331 Sum_probs=84.8 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCC----EEEEEECCCCEEEECCHHHH--HHH Q ss_conf 558999999999997499799960888747860667106100441467650----07789578367997881489--999 Q gi|254780605|r 13 KTFERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSA----GIAIVLRQKSVIFVDGRYTL--QVE 86 (609) Q Consensus 13 ~~~erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSa----G~aiIt~~~a~L~tDgRY~l--QA~ 86 (609) .-..||.+.|+.|.+.+||-+||.- + ....|+||++|=. -.+|++.+.--+|- ||-.- -|+ T Consensus 10 ey~~rl~k~r~~m~~~~~~ll~v~d----------P--snm~WltGydGWsfyvhq~v~~~~~GeP~Wy-Gr~qdanGa~ 76 (391) T TIGR02993 10 EYAARLAKTRAEMEARGIDLLIVTD----------P--SNMAWLTGYDGWSFYVHQCVLLALEGEPIWY-GRGQDANGAK 76 (391) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEC----------C--CCCCHHCCCCCEEEEEEEEEEEECCCCEEEE-CCCCCCCHHH T ss_conf 9999999999999855971899847----------6--3221101466403456567787137852441-5764412134 Q ss_pred HHH--CC-CCEEEEE----CCCCCHHHHHHHHC----CCCCEEEE--CCEECCHHHHHHHHHHHHHCCCEEEECCCHHHH Q ss_conf 861--96-7179986----68999999888743----79988996--563413999999999853347619970500111 Q gi|254780605|r 87 KEV--DT-ALFTIKN----IAIEPLHAWISEHG----FVGLRLGL--DSRLHSSFEVDLLQKSLDKIEGVIVDVPYNPID 153 (609) Q Consensus 87 ~el--~~-~~~~~~~----~~~~~~~~wl~~~~----~~g~~IG~--Dp~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD 153 (609) +-+ +. +..-.-+ ...-.+.++|.+.+ -.+..||+ |.--+|.+.+..|++.|-++. +.. ...|| T Consensus 77 ~t~~~~~~ni~GyPd~yvqsterhPmd~l~~~~~~rG~~~~~iGve~dny~f~aaaf~~lq~hlP~a~--f~~-a~~lv- 152 (391) T TIGR02993 77 RTAFLDQDNIVGYPDHYVQSTERHPMDYLSEILKDRGLDSLTIGVELDNYYFSAAAFLSLQKHLPNAR--FKD-ATALV- 152 (391) T ss_pred HHEEECCCCEECCCCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHCCCCH--HHH-HHHHH- T ss_conf 10012246412177301002456618899999984386642145663026888999999986243102--334-56767- Q ss_pred HHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCC Q ss_conf 2201276332200012012335877388999999860233 Q gi|254780605|r 154 SLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQK 193 (609) Q Consensus 154 ~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~ 193 (609) -|+ |--++...+ ++..+-.+|-+.|.+. T Consensus 153 -nWq-ravks~~e~----------~ymr~a~~ive~mh~r 180 (391) T TIGR02993 153 -NWQ-RAVKSETEL----------KYMRKAAKIVEKMHQR 180 (391) T ss_pred -HHH-HHHCCHHHH----------HHHHHHHHHHHHHHHH T ss_conf -667-651337889----------9999889999999988 No 49 >pfam05195 AMP_N Aminopeptidase P, N-terminal domain. This domain is structurally very similar to the creatinase N-terminal domain (pfam01321). However, little or no sequence similarity exists between the two families. Probab=85.01 E-value=3.1 Score=21.63 Aligned_cols=16 Identities=19% Similarity=0.125 Sum_probs=9.6 Q ss_pred CHHHHHHHHHHHHCCC Q ss_conf 7388999999860233 Q gi|254780605|r 178 ESQEKIRDICKILHQK 193 (609) Q Consensus 178 s~~eKi~~ir~~L~~~ 193 (609) .+++++++|.+.|..+ T Consensus 5 ~~~~RR~~l~~~l~~~ 20 (134) T pfam05195 5 EFAERRARLLAKLPPN 20 (134) T ss_pred HHHHHHHHHHHHCCCC T ss_conf 9999999999867998 No 50 >PRK05928 hemD uroporphyrinogen-III synthase; Reviewed Probab=84.91 E-value=3.2 Score=21.54 Aligned_cols=68 Identities=22% Similarity=0.289 Sum_probs=34.0 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCCEEEEEECCCCEEEECCHHHHHHHHHHCC Q ss_conf 89999999999974997999608887478606671061004414676500778957836799788148999986196 Q gi|254780605|r 15 FERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSAGIAIVLRQKSVIFVDGRYTLQVEKEVDT 91 (609) Q Consensus 15 ~erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSaG~aiIt~~~a~L~tDgRY~lQA~~el~~ 91 (609) .++-.+|.+.+++.|.+.+.+|--+-=-.+.....+..+.-+ +.+..+|+|-..|+ ||+.++-++.+. T Consensus 10 ~~~~~~l~~~L~~~G~~~~~~Pli~i~p~~~~~~~~~~~~~l----~~~d~iifTS~naV-----~~~~~~l~~~~~ 77 (252) T PRK05928 10 SPAGEELVELLRKLGFVALHLPLIEFEPGRSLPQLADALAFL----GEADAVIFTSKNAV-----EFLLSALKKKGL 77 (252) T ss_pred HHHHHHHHHHHHHCCCCEEEECCEEEEECCCHHHHHHHHHCC----CCCCEEEEECHHHH-----HHHHHHHHHCCC T ss_conf 788999999999889969996788888888737799998515----68788999888999-----999999985298 No 51 >TIGR00452 TIGR00452 methyltransferase, putative; InterPro: IPR010017 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalyzed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order . Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI ), shared by other AdoMet-Mtases , is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments , although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases , , , . The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences . This is a set of proteobacterial proteins, which have homology in their central region to a large number of methyltransferases active on a variety of substrates. . Probab=82.12 E-value=4.1 Score=20.78 Aligned_cols=24 Identities=21% Similarity=0.613 Sum_probs=10.7 Q ss_pred HHHHHHHHHCCCEEECCCCCCHHHH Q ss_conf 8897531213956751543211566 Q gi|254780605|r 287 YRFFKVIAQKNGVMVEGSDPSCLLR 311 (609) Q Consensus 287 ~~~~~~l~~~~~~~i~~~~pI~~lR 311 (609) |.+++.+. .+.+.+-+.||..+.+ T Consensus 134 Y~~~~M~~-~Gak~~VG~DP~~L~~ 157 (316) T TIGR00452 134 YHLWRMLG-EGAKLLVGIDPTELFL 157 (316) T ss_pred HHHHHHHH-CCCCEEEECCHHHHHH T ss_conf 68897762-0782787436578999 No 52 >pfam07385 DUF1498 Protein of unknown function (DUF1498). This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown. Probab=76.06 E-value=3 Score=21.70 Aligned_cols=54 Identities=19% Similarity=0.225 Sum_probs=36.7 Q ss_pred CCCCCCCCCEECCCCEEEECCCEEECCCCCEEEEEEEEEEECCCCC--CCCCCEEEEEECC Q ss_conf 6237789860056987884686035687521500259997278778--7766424221011 Q gi|254780605|r 498 QGISRTNQEPLLPGMILSNEPGYYRCGAFGIRIENVLCVSEPETIN--NGECLMLGFNTLT 556 (609) Q Consensus 498 ~~is~~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v~v~~~~~~~--~~~~~fl~Fe~lT 556 (609) ..+.++....|+||.-+|.+||+|-.=+ |- .-.|+|-|+.+.+ ..++.|+ |++- T Consensus 148 ~t~~aG~~l~L~PGeSVtL~pg~yH~Fw-~e--~G~VLiGEVS~vNDD~~DN~F~--epig 203 (225) T pfam07385 148 VTMKAGGLLKLKPGQSICLPPGLYHSFW-AE--GGFVLIGEVSMVNDDLTDNYFL--EPLG 203 (225) T ss_pred EEECCCCEEEECCCCEEECCCCCCEEEE-CC--CCCEEEEEEECCCCCCCCCCCC--CCCC T ss_conf 7888999788799993667998715675-38--9888998534355788675214--7758 No 53 >COG3642 Mn2+-dependent serine/threonine protein kinase [Signal transduction mechanisms] Probab=72.99 E-value=3.2 Score=21.56 Aligned_cols=31 Identities=29% Similarity=0.377 Sum_probs=23.7 Q ss_pred CCCCCCCCCCCCCCCEEECCCCEEEEECCCCE Q ss_conf 35432222455433205038987998303201 Q gi|254780605|r 385 PHAAIIHYQATVQSNRLLQKDELLLLDSGAQY 416 (609) Q Consensus 385 ~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY 416 (609) -++-++|+-.|. ||-.+..+++++||+|=.| T Consensus 109 H~~givHGDLTt-sNiIl~~~~i~~IDfGLg~ 139 (204) T COG3642 109 HKAGIVHGDLTT-SNIILSGGRIYFIDFGLGE 139 (204) T ss_pred HHCCEECCCCCC-CEEEEECCCEEEEECCCCC T ss_conf 756800578861-3289838968999778432 No 54 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=72.89 E-value=6.8 Score=19.24 Aligned_cols=77 Identities=17% Similarity=0.212 Sum_probs=52.8 Q ss_pred CCCCEEEECCEECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCC Q ss_conf 79988996563413999999999853347619970500111220127633220001201233587738899999986023 Q gi|254780605|r 113 FVGLRLGLDSRLHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQ 192 (609) Q Consensus 113 ~~g~~IG~Dp~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~ 192 (609) ..+..|-+||+|+..- .=-++..| .+...+.|||.+|..- -.+-+-..+=|-.||+.||. T Consensus 22 r~drev~iEpRLinLL-----~Fla~hpg--~VfsRDELI~~VW~Ga-------------IvTDQVVTQSIFELRKiLKd 81 (512) T PRK10153 22 RQDREVTLEPRLINLL-----VFLAQHSG--EVFSRDELIDHVWDGA-------------IVTDQVVTQSIFELRKSLRD 81 (512) T ss_pred ECCCEEEECHHHHHHH-----HHHHCCCC--CEECHHHHHHHHCCCC-------------EECCCHHHHHHHHHHHHHHC T ss_conf 6698798647789999-----99851999--5343999999862787-------------41130255889999999863 Q ss_pred CCCEE--EEECCCCCHHHHHHCCCCCC Q ss_conf 34306--76028420234432013553 Q gi|254780605|r 193 KEVGA--VFICDPSSIAWIFNIRGFDI 217 (609) Q Consensus 193 ~~~da--liit~~d~IaWLlNiRG~Di 217 (609) .+-+. ++||-|- ||+-+ T Consensus 82 g~~~~~~YIvTVPK--------RGYKL 100 (512) T PRK10153 82 GREDNPEYIVTVPK--------RGYKL 100 (512) T ss_pred CCCCCCCEEEEECC--------CCCEE T ss_conf 88678872998368--------86312 No 55 >PRK07009 consensus Probab=69.55 E-value=3.8 Score=20.97 Aligned_cols=34 Identities=29% Similarity=0.502 Sum_probs=22.6 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 8899721789788889999873667741865477643232 Q gi|254780605|r 448 ISVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHGV 487 (609) Q Consensus 448 ial~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHGV 487 (609) +.+|+.+|| |.+ ..||++||. .+.|..|++=||- T Consensus 121 L~lAR~~~P-g~~-~sLdaLc~R----~~I~~~~r~~HgA 154 (241) T PRK07009 121 LVQAKQMFP-GKR-NSLDALCDR----FGISNAHRTLHGA 154 (241) T ss_pred HHHHHHHCC-CCC-CCHHHHHHH----CCCCCCCCCCCCC T ss_conf 999999789-998-788999998----2869887330242 No 56 >pfam05770 Ins134_P3_kin Inositol 1, 3, 4-trisphosphate 5/6-kinase. This family consists of several inositol 1, 3, 4-trisphosphate 5/6-kinase proteins. Inositol 1,3,4-trisphosphate is at a branch point in inositol phosphate metabolism. It is dephosphorylated by specific phosphatases to either inositol 3,4-bisphosphate or inositol 1,3-bisphosphate. Alternatively, it is phosphorylated to inositol 1,3,4,6-tetrakisphosphate or inositol 1,3,4,5-tetrakisphosphate by inositol trisphosphate 5/6-kinase. Probab=67.52 E-value=7.9 Score=18.80 Aligned_cols=85 Identities=14% Similarity=0.133 Sum_probs=55.4 Q ss_pred CCCCEEEEC--CEECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCE--EEECCHHHCCCCHHHHHHHHHH Q ss_conf 799889965--634139999999998533476199705001112201276332200--0120123358773889999998 Q gi|254780605|r 113 FVGLRLGLD--SRLHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRK--VAMQDMAYAGRESQEKIRDICK 188 (609) Q Consensus 113 ~~g~~IG~D--p~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~--i~~~~~~~aG~s~~eKi~~ir~ 188 (609) .+..+|||- |+-...-.+..|...+.+.|+.+++++. +||-..+.| +..| |.+|+.+.+ ++.+ T Consensus 5 ~~~~~VGYal~~kK~~~f~~~~f~~~~~~~gi~~v~ID~--------~~pL~~QGPfDvilH--Kltd~~~~~---~l~~ 71 (307) T pfam05770 5 LKRYLVGYALAEKKIKSFIQPSLAELARKRGIDLVQLDP--------SRPLSEQGPFDIIIH--KLTDKEWRH---RLEE 71 (307) T ss_pred CEEEEEEEEECHHHHHHHHHHHHHHHHHHCCCEEEECCC--------CCCHHHCCCCEEEEE--ECCHHHHHH---HHHH T ss_conf 223699999877998887479999989767967998789--------998455498269999--656289999---9999 Q ss_pred HHCCCCCEEEEECCCCCHHHHHH Q ss_conf 60233430676028420234432 Q gi|254780605|r 189 ILHQKEVGAVFICDPSSIAWIFN 211 (609) Q Consensus 189 ~L~~~~~daliit~~d~IaWLlN 211 (609) .++ +..+..+|-.|++|.=|+| T Consensus 72 y~~-~hP~v~viDP~~ai~~L~d 93 (307) T pfam05770 72 FRE-AHPEVPVLDPPPAIRRLHN 93 (307) T ss_pred HHH-HCCCCEEECCHHHHHHHHC T ss_conf 999-7899789779999999878 No 57 >PRK06859 consensus Probab=66.46 E-value=4.7 Score=20.38 Aligned_cols=34 Identities=21% Similarity=0.409 Sum_probs=23.4 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 8899721789788889999873667741865477643232 Q gi|254780605|r 448 ISVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHGV 487 (609) Q Consensus 448 ial~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHGV 487 (609) +.+|+.+|| |. -..||++||. ++.|..|++=||- T Consensus 129 L~lAR~~~P-g~-~~sLdaLc~r----~~id~~~R~~HgA 162 (253) T PRK06859 129 LQMARQMYP-GK-RNNLDALCDR----LGIDNSKRTLHGA 162 (253) T ss_pred HHHHHHHCC-CC-CCCHHHHHHH----CCCCCCCCCCCCH T ss_conf 999998787-99-7799999997----3998766330132 No 58 >TIGR02669 SpoIID_LytB SpoIID/LytB domain; InterPro: IPR013486 This entry describes a region which is found, typically in two or three proteins per genome, in Cyanobacteria and Firmicutes, and sporadically in other genomes. One example is SpoIID from Bacillus subtilis. Another, also from B. subtilis, is LytB which contains this region at the C-terminus. LytB is encoded immediately upstream of an amidase, the autolysin LytC, and both these proteins show considerable homology in their N-terminal regions. Genes encoding proteins in this entry do not occur in conserved neighbourhoods, and many, such as SpoIID are monocistronic. One modeling study has suggested that SpoIID may bind DNA, but the function of these proteins is so far unknown.; GO: 0030435 sporulation. Probab=65.64 E-value=1.9 Score=23.14 Aligned_cols=11 Identities=55% Similarity=1.180 Sum_probs=9.6 Q ss_pred CCCCCCCCCCCCC Q ss_conf 7764323211356 Q gi|254780605|r 480 AHGVGHGVGSFLP 492 (609) Q Consensus 480 ~HgtGHGVG~~l~ 492 (609) ++|-||||| |+ T Consensus 284 G~G~GHGVG--mS 294 (323) T TIGR02669 284 GKGYGHGVG--MS 294 (323) T ss_pred EECCCCCCC--HH T ss_conf 511335542--01 No 59 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=63.22 E-value=7.5 Score=18.95 Aligned_cols=57 Identities=23% Similarity=0.444 Sum_probs=30.5 Q ss_pred HHCCCCCCCCCE----------EEECCHHHC-CCCHHHHH-HHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCC Q ss_conf 201276332200----------012012335-87738899-99998602334306760284202344320135537 Q gi|254780605|r 155 LWKDRPQRLYRK----------VAMQDMAYA-GRESQEKI-RDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIP 218 (609) Q Consensus 155 iW~~rP~~~~~~----------i~~~~~~~a-G~s~~eKi-~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~ 218 (609) =|..|=+++..| .-..+.-|= || .|| +-|++..+.....|+||=+-- ...|=|.||+ T Consensus 78 tWD~RGs~SSGp~LyR~gFsTDL~E~DVifGrGE---KKL~~aI~e~v~~Y~P~AvFVYsTC----v~alIGDD~~ 146 (470) T TIGR01283 78 TWDIRGSRSSGPELYRLGFSTDLTEKDVIFGRGE---KKLFHAIREIVERYHPAAVFVYSTC----VVALIGDDLE 146 (470) T ss_pred CCCCCCCCCCCHHHCCCCCCCCCCCCCEEECCHH---HHHHHHHHHHHHHCCCCEEEEECCE----ECCEECCCHH T ss_conf 7011444356144513666326602466733144---7899999999863299748994571----3102448889 No 60 >PRK08858 consensus Probab=62.77 E-value=6.3 Score=19.48 Aligned_cols=32 Identities=34% Similarity=0.632 Sum_probs=18.6 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 89972178978888999987366774186547764323 Q gi|254780605|r 449 SVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHG 486 (609) Q Consensus 449 al~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHG 486 (609) .+|+.+|| |.+. .||++||. .+.|..|++=|| T Consensus 137 ~lAR~~~P-g~~~-sLdaLc~r----~~Id~s~R~~Hg 168 (246) T PRK08858 137 AMAKKIFP-GKRN-NLDVLCER----YGIDNSHRTLHG 168 (246) T ss_pred HHHHHHCC-CCCC-CHHHHHHH----CCCCCCCCCCCC T ss_conf 99998689-9988-88999998----198988834023 No 61 >PRK09605 O-sialoglycoprotein endopeptidase/protein kinase; Reviewed Probab=62.56 E-value=6.9 Score=19.21 Aligned_cols=17 Identities=18% Similarity=0.233 Sum_probs=8.2 Q ss_pred CCCCCCCCCCCCCCCCC Q ss_conf 41000012335432222 Q gi|254780605|r 376 AFNTIAASGPHAAIIHY 392 (609) Q Consensus 376 SF~TIva~G~N~AipHY 392 (609) -+|+.--.+.||||+-| T Consensus 278 ~~p~~~~CtDNgaMIA~ 294 (536) T PRK09605 278 YVPEKRFCGDNGAMIAW 294 (536) T ss_pred ECCCHHHCCHHHHHHHH T ss_conf 87987981238999999 No 62 >TIGR01982 UbiB 2-polyprenylphenol 6-hydroxylase; InterPro: IPR010232 This entry represents the enzyme (UbiB) which catalyses the first hydroxylation step in the ubiquinone biosynthetic pathway in bacteria , . The gene is also known as AarF in certain species. It is believed that the reaction converts 2-polyprenylphenol to 6-hydroxy-2-polyprenylphenol. Members are mainly proteobacteria.; GO: 0006744 ubiquinone biosynthetic process. Probab=61.80 E-value=12 Score=17.51 Aligned_cols=66 Identities=26% Similarity=0.507 Sum_probs=42.6 Q ss_pred CCCCCCCCCCCCCEEE-CC----CCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHH-----HHHHHCC- Q ss_conf 4322224554332050-38----98799830320105612221432103788888433366778888-----8997217- Q gi|254780605|r 387 AAIIHYQATVQSNRLL-QK----DELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMI-----SVSTARF- 455 (609) Q Consensus 387 ~AipHY~~~~~s~~~i-~~----~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~i-----al~~a~f- 455 (609) =|-+| | +|-.+ ++ |.+.+||+| .+|..+++.|+.---.|+|.+ .++...| T Consensus 291 HAD~H--p---GNI~v~~d~~~Ng~i~~~DFG--------------IVG~l~~~~r~~LAE~l~Gf~~rDY~~VA~~H~~ 351 (452) T TIGR01982 291 HADLH--P---GNIFVLKDPERNGKIIALDFG--------------IVGRLSEEDRRYLAEILYGFINRDYRRVAEVHFD 351 (452) T ss_pred CCCCC--C---CCCCCCCCCCCCCEEEEEECC--------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78847--5---663324787768808998537--------------4200137889999999999985429999999986 Q ss_pred ----CCCCCHHHHHHHHHHH Q ss_conf ----8978888999987366 Q gi|254780605|r 456 ----PQRTRGCDLDSIARIF 471 (609) Q Consensus 456 ----p~g~~g~~lD~~aR~~ 471 (609) |..|--..+-..+|.+ T Consensus 352 aGyvp~~~d~~~f~~a~R~~ 371 (452) T TIGR01982 352 AGYVPSDTDMAEFEQAIRAI 371 (452) T ss_pred HCCCCHHHHHHHHHHHHHHC T ss_conf 25888435699999986200 No 63 >PRK13281 succinylarginine dihydrolase; Provisional Probab=59.25 E-value=13 Score=17.22 Aligned_cols=308 Identities=16% Similarity=0.163 Sum_probs=145.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCCEEEEEEC------------CCCEEE----- Q ss_conf 589999999999974997999608887478606671061004414676500778957------------836799----- Q gi|254780605|r 14 TFERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSAGIAIVLR------------QKSVIF----- 76 (609) Q Consensus 14 ~~erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSaG~aiIt~------------~~a~L~----- 76 (609) ..+-|.+.|.++.--=..+++-|..-||..- -| =-|||||-..++--. .-+.+| T Consensus 42 AlQGL~KMk~L~d~Gl~QgvlpPqeRP~~~~------Lr---~lGF~Gsd~~vl~~a~k~aP~Ll~a~~SASsMWtANAA 112 (443) T PRK13281 42 AKQGLLKMKALADLGFPQGVLPPQERPDLPA------LR---RLGFSGSDAQVLEKAAKQAPELLSAASSASSMWTANAA 112 (443) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCHHH------HH---HHCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHHCC T ss_conf 9979999999997688650169988987468------99---83889872999999998698999987316888775005 Q ss_pred --------ECCHHHHHHH----------------HHH-----CCCCEEEEEC-------CCCCH---HHHHHHHCCCCCE Q ss_conf --------7881489999----------------861-----9671799866-------89999---9988874379988 Q gi|254780605|r 77 --------VDGRYTLQVE----------------KEV-----DTALFTIKNI-------AIEPL---HAWISEHGFVGLR 117 (609) Q Consensus 77 --------tDgRY~lQA~----------------~el-----~~~~~~~~~~-------~~~~~---~~wl~~~~~~g~~ 117 (609) .|||-++-+. ++| |...|..+.. |+++- .-...+....|.. T Consensus 113 TVSPSaDt~DgrvHfTpANL~~~~HRsiE~~~T~~~L~~iF~d~~~F~~H~~LP~~~~fgDEGAANH~Rlc~~~g~~Gve 192 (443) T PRK13281 113 TVSPSADTADGRVHFTPANLNNKFHRSIEAPTTERLLRAIFADERHFAVHDALPQVALFGDEGAANHTRLCGEYGEPGVE 192 (443) T ss_pred CCCCCCCCCCCCEEEEHHHHCCCCHHCCCCHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCHHHHHCCCCCCCCCCEE T ss_conf 60577768898467511342245301257477999999982896525306888888887774343201011003798549 Q ss_pred E------EECC----EECCHHHHHHHHHHHH------HCCCEEEECCCHHHHH-HHC-CCCCCCC-CEEEECCHHHCCCC Q ss_conf 9------9656----3413999999999853------3476199705001112-201-2763322-00012012335877 Q gi|254780605|r 118 L------GLDS----RLHSSFEVDLLQKSLD------KIEGVIVDVPYNPIDS-LWK-DRPQRLY-RKVAMQDMAYAGRE 178 (609) Q Consensus 118 I------G~Dp----~l~s~~~~~~l~~~l~------~~~~~l~~~~~NlVD~-iW~-~rP~~~~-~~i~~~~~~~aG~s 178 (609) | +|++ .-++..+-..-.+... .....+..-+..-||. +.. |--+... +-.|.|...|. . T Consensus 193 lFVyGr~a~~~~~~p~rfPARQtleAsqAvAR~H~L~~~~~vf~qQnp~aId~GvFHNDVIaV~N~~vLf~HE~AF~--d 270 (443) T PRK13281 193 LFVYGREAFNDGPAPGRYPARQTLEASEAVARLHGLDPERTVFAQQNPDVIDQGVFHNDVIAVGNRNVLFCHEQAFL--D 270 (443) T ss_pred EEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHCCCCCCCEEEECCCCCEEEHHHHHC--C T ss_conf 99972311357878788771578999999999759995466886059888864886333577517530034787756--9 Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHCCCCEE Q ss_conf 38899999986023343067602842023443201355377870023322014786200035311215678741256357 Q gi|254780605|r 179 SQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLKALLSAVAIV 258 (609) Q Consensus 179 ~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~~~l~~~~~~ 258 (609) ....+..||+.+. .+...+-|.+-+ |. =.|.--|-+|.|-++-..+|+..|.+... .+ T Consensus 271 ~~~~l~~l~~~~~-~~~~~ieV~~~~-vs------l~DAV~SYLFNSQLlt~pdg~M~Li~P~E-----cr--------- 328 (443) T PRK13281 271 QQAVLAELRAKLD-GGFMAIEVPAAQ-VS------VADAVKSYLFNSQLLSRPDGSMLLVVPQE-----CR--------- 328 (443) T ss_pred HHHHHHHHHHHCC-CCCEEEEEECCC-CC------HHHHHHHHHHCCEEEECCCCCEEEECCHH-----HH--------- T ss_conf 9999999999738-995699963253-87------99998873102345555999679964567-----73--------- Q ss_pred ECHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHCCCEEECCCCC-CHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 074451577863013574289725317788975312139567515432-1156661999999879999844389999998 Q gi|254780605|r 259 LDMDMMDSRLVCLARTSMPILIDPKWISYRFFKVIAQKNGVMVEGSDP-SCLLRATKNKVEIEGMQTAHIQDGVAMVYFL 337 (609) Q Consensus 259 ~~~~~~~~~l~~l~~~~~~i~id~~~~s~~~~~~l~~~~~~~i~~~~p-I~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl 337 (609) ........+..+...+..|- .---+.+.++..+ ...| |..+|.+=|+.|++.+...-+.|..-+.+.- T Consensus 329 -~~~~v~~~l~~l~~~~~PI~---~v~~~DlrQSM~N-------GGGPACLRLRVvl~~~E~~avn~~~lm~~~l~~~L~ 397 (443) T PRK13281 329 -ENPAVWAYLNELVAGNGPIK---EVKVFDLRQSMRN-------GGGPACLRLRVVLNEAELAAVNPAVLMNDALFDRLN 397 (443) T ss_pred -CCHHHHHHHHHHHHCCCCCC---EEEEECHHHHHHC-------CCCCEEEEEEEECCHHHHHHCCCCEECCHHHHHHHH T ss_conf -68789999999971699864---4899423665035-------897632368652689999732976233899999999 Q ss_pred HHHHHHHCCCCCCCHHHH--HHHHHHHHHH Q ss_conf 753321001245301889--9999998754 Q gi|254780605|r 338 FWFYSQSLETITEIDIIK--KLERCREEIG 365 (609) Q Consensus 338 ~wL~~~~~~~iTE~~ia~--~Le~~R~~~~ 365 (609) .|+++.-.+.++.-|++. .|.+-|.... T Consensus 398 ~wV~~hYRd~L~~~DLaDP~Ll~E~r~ALd 427 (443) T PRK13281 398 AWVDRHYRDRLTPADLADPQLLRESRTALD 427 (443) T ss_pred HHHHHHHHCCCCHHHCCCHHHHHHHHHHHH T ss_conf 999986433499345369799999999999 No 64 >PRK06022 consensus Probab=58.79 E-value=8.1 Score=18.72 Aligned_cols=35 Identities=29% Similarity=0.546 Sum_probs=23.3 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 88997217897888899998736677418654776432321 Q gi|254780605|r 448 ISVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHGVG 488 (609) Q Consensus 448 ial~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHGVG 488 (609) +.+|+.+|| |.+. .||++||. .|.|-.|+|=||-- T Consensus 119 L~LAR~~~P-g~~~-sLdaLc~r----~~Id~~~R~~HgAL 153 (234) T PRK06022 119 LALARRKHP-MGPN-SLDALCRR----YGIDNSHRTKHGAL 153 (234) T ss_pred HHHHHHHCC-CCCC-CHHHHHHH----CCCCCCCCCCCCCH T ss_conf 999998688-9988-98999998----39997773537628 No 65 >PRK13308 ureC urease subunit alpha; Reviewed Probab=58.25 E-value=8.2 Score=18.68 Aligned_cols=64 Identities=23% Similarity=0.273 Sum_probs=42.2 Q ss_pred HHHHHHHCCCC--CCCCCCCCCCCCCCCC-------CCCCCCCCCCCCEECCCCEE------------EECCCEEECC-- Q ss_conf 73667741865--4776432321135607-------78862377898600569878------------8468603568-- Q gi|254780605|r 468 ARIFLWKYGAD--FAHGVGHGVGSFLPVH-------EGPQGISRTNQEPLLPGMIL------------SNEPGYYRCG-- 524 (609) Q Consensus 468 aR~~l~~~g~d--y~HgtGHGVG~~l~VH-------E~P~~is~~~~~~l~~Gmv~------------s~EPG~Y~~g-- 524 (609) +|+++.++-.+ -.||..|-||+ ++|- =.|..|+-.-+.+|+-||+. |-||=+|.|- T Consensus 402 vkRYiAKyTINPAIahGis~~VGS-vE~GK~ADlVLW~PafFG~KP~~VlKgG~ia~a~mGD~NaSiptpqPv~~RpmfG 480 (569) T PRK13308 402 IKRYIAKYTINPAITFGIDDHVGS-LEPGKMADIVLWRPAFFGIKPELVIKGGFPAWAAMGDANGSLMTCEPMLQRPQWG 480 (569) T ss_pred HHHHHHHEEECHHHHHCCCCCCCC-CCCCCCEEEEEECHHHCCCCCCEEEECCEEEEEECCCCCCCCCCCCCEEEEHHHH T ss_conf 865535402172565044454456-5588503487636364178862378768788984478776889986505203133 Q ss_pred CCCEEEEE Q ss_conf 75215002 Q gi|254780605|r 525 AFGIRIEN 532 (609) Q Consensus 525 ~~GiRIEn 532 (609) .+|--.-+ T Consensus 481 a~g~a~~~ 488 (569) T PRK13308 481 AFGRAKQA 488 (569) T ss_pred HCCCCCCC T ss_conf 21555455 No 66 >PRK07982 consensus Probab=57.00 E-value=9 Score=18.38 Aligned_cols=33 Identities=27% Similarity=0.466 Sum_probs=20.5 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 899721789788889999873667741865477643232 Q gi|254780605|r 449 SVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHGV 487 (609) Q Consensus 449 al~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHGV 487 (609) .+|+.+|| |.+ ..||++||+ .+.|-.|++=||- T Consensus 132 ~lAR~~~P-g~~-~sLDaLc~R----~~Id~s~R~~HgA 164 (243) T PRK07982 132 ALARKMFP-GKR-NSLDALCDR----YEIDNSKRTLHGA 164 (243) T ss_pred HHHHHHCC-CCC-CCHHHHHHH----CCCCCCCCCCCCH T ss_conf 99888788-986-899999875----4877656552333 No 67 >pfam08003 Methyltransf_9 Protein of unknown function (DUF1698). This family contains many hypothetical proteins. It also includes two putative methyltransferase proteins. Probab=56.20 E-value=15 Score=16.88 Aligned_cols=30 Identities=13% Similarity=0.220 Sum_probs=13.5 Q ss_pred CCEEEECCHHHCCCCHHHHHHHHHHHHCCC Q ss_conf 200012012335877388999999860233 Q gi|254780605|r 164 YRKVAMQDMAYAGRESQEKIRDICKILHQK 193 (609) Q Consensus 164 ~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~ 193 (609) ..-||.+..=|.-+|..+-|.+|++.|+.. T Consensus 183 FDtVFsMGVLYHrrsP~~hL~~Lk~~L~~g 212 (315) T pfam08003 183 FDTVFSMGVLYHRRSPLDHLLQLKDQLVKG 212 (315) T ss_pred CCEEEEEEEEECCCCHHHHHHHHHHHCCCC T ss_conf 155776544432689899999999852879 No 68 >PRK08816 consensus Probab=56.19 E-value=9.5 Score=18.22 Aligned_cols=33 Identities=30% Similarity=0.561 Sum_probs=21.4 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 899721789788889999873667741865477643232 Q gi|254780605|r 449 SVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHGV 487 (609) Q Consensus 449 al~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHGV 487 (609) ++++.+|| |.+ ..||++||. .+.|-.|++=||- T Consensus 123 ~lAr~~~P-g~~-nsLDaLc~r----~~id~~~R~~HgA 155 (244) T PRK08816 123 MMARERYP-GQR-NSLDALCKR----LGVDNSHRQLHGA 155 (244) T ss_pred HHHHHHCC-CCC-CCHHHHHHH----CCCCCCCCCCCCH T ss_conf 99998787-997-789999997----4978877350148 No 69 >PRK04183 glutamyl-tRNA(Gln) amidotransferase subunit D; Validated Probab=54.66 E-value=8 Score=18.74 Aligned_cols=58 Identities=16% Similarity=0.166 Sum_probs=40.5 Q ss_pred HHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCC-CCCCCHHHHHHHHCCCCCCEEECCCCC Q ss_conf 33587738899999986023343067602842023443201355-377870023322014786200035311 Q gi|254780605|r 173 AYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFD-IPCSPYPLSRAILYADGKAEIFFDKQY 243 (609) Q Consensus 173 ~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~D-i~~tPv~~a~aiI~~~~~~~Lfvd~~k 243 (609) ....+.|.+=.+++++.+ ++++|+++|+- |.| ..||-.++||++-..+ +++.|+-.++ T Consensus 131 nM~P~~W~~iA~~i~~~~-~~g~DG~VItH-----------GTDTMayTAsALSFml~~~~-KPVVlTGSQr 189 (421) T PRK04183 131 NMTPEYWQEIAEAVAEEI-ENGADGVVVAH-----------GTDTMHYTAAALSFMLRKTP-VPIVFVGAQR 189 (421) T ss_pred CCCHHHHHHHHHHHHHHH-HCCCCEEEEEE-----------CCCHHHHHHHHHHHHHHCCC-CCEEEECCCC T ss_conf 379899999999999986-66898799921-----------67529999999999963899-9889968777 No 70 >PRK09238 bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase; Validated Probab=54.53 E-value=11 Score=17.85 Aligned_cols=52 Identities=25% Similarity=0.459 Sum_probs=33.2 Q ss_pred HHHHHHCCCCCEEEEE-----------CCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEEC Q ss_conf 9998602334306760-----------284202344320135537787002332201478620003 Q gi|254780605|r 185 DICKILHQKEVGAVFI-----------CDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFF 239 (609) Q Consensus 185 ~ir~~L~~~~~dalii-----------t~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfv 239 (609) +--+.|+++|....++ |.-.||.|.+ |.|+||-|--..-.++.-..-+-+|. T Consensus 211 ~~I~~LK~kg~pla~VGDVVGTGSSRKSa~NSVlW~~---G~dIP~VPNKr~gGvv~G~kIAPIFf 273 (841) T PRK09238 211 KQIEALKKKGHPLAYVGDVVGTGSSRKSATNSVLWHM---GDDIPYVPNKRAGGVVLGGKIAPIFF 273 (841) T ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHH---CCCCCCCCCCCCCCEEECCEECCEEE T ss_conf 9999999709987996773256754211121327870---79888897357778576561233577 No 71 >PRK08444 hypothetical protein; Provisional Probab=53.89 E-value=2.7 Score=22.06 Aligned_cols=39 Identities=21% Similarity=0.101 Sum_probs=19.7 Q ss_pred CHHH-HHHHHHHHHHHHHHHHHHHHHHHHH--HCCC--CCCCHHHHHHHHHHHH Q ss_conf 9999-9879999844389999998753321--0012--4530188999999987 Q gi|254780605|r 315 NKVE-IEGMQTAHIQDGVAMVYFLFWFYSQ--SLET--ITEIDIIKKLERCREE 363 (609) Q Consensus 315 n~~E-I~~mr~Ahi~DgvAl~kfl~wL~~~--~~~~--iTE~~ia~~Le~~R~~ 363 (609) +..| ++-||.||.. |+ ... +.-| -|.-+.++.|..+|.. T Consensus 187 ~~~~Wl~i~~~AH~l-Gi---------~ttaTmmyGhvEt~e~rv~HL~~lR~l 230 (353) T PRK08444 187 SSERWLEIHKYWHKK-GK---------QSNATMLFGHIENREHRIDHMLRLRDL 230 (353) T ss_pred CHHHHHHHHHHHHHC-CC---------CCCEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 999999999999982-99---------664146778879999999999999983 No 72 >pfam04996 AstB Succinylarginine dihydrolase. This enzyme transforms N(2)-succinylglutamate into succinate and glutamate. This is the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway. Probab=53.36 E-value=16 Score=16.57 Aligned_cols=307 Identities=18% Similarity=0.201 Sum_probs=142.5 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHCHHHHHHCCCCCCEEEEEE------------CCCCEEEE----- Q ss_conf 8999999999997499799960888747860667106100441467650077895------------78367997----- Q gi|254780605|r 15 FERVHNLRSCFDSLGMDAFLVPRVDEYRGEFVDKGSERLAWLSGFTGSAGIAIVL------------RQKSVIFV----- 77 (609) Q Consensus 15 ~erL~~LR~~m~~~~idayiip~~D~h~sEyv~~~~~rl~~isGFtGSaG~aiIt------------~~~a~L~t----- 77 (609) .+-|.|.|.++.--=..+++-|..-||..- -| =-|||||...+|-- ...+.+|| T Consensus 42 lQGL~KMk~l~d~G~~QgvlpPqeRP~~~~------Lr---~lGF~GsD~~vl~~aak~aP~Ll~a~~SASsMWtANAAT 112 (439) T pfam04996 42 LQGLAKMRALADLGLPQGVLPPHERPAAHV------LR---QLGFSGSDEAVLAQVAREAPDLLAAASSASSMWTANAAT 112 (439) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCHHH------HH---HHCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCC T ss_conf 979999999997688650369988988278------99---838898779999999987989999885068887740155 Q ss_pred --------CCHHHHHHH----------------HHH-----CCCCEEEEEC-------CCCCH---HHHHHHHCCCCCEE Q ss_conf --------881489999----------------861-----9671799866-------89999---99888743799889 Q gi|254780605|r 78 --------DGRYTLQVE----------------KEV-----DTALFTIKNI-------AIEPL---HAWISEHGFVGLRL 118 (609) Q Consensus 78 --------DgRY~lQA~----------------~el-----~~~~~~~~~~-------~~~~~---~~wl~~~~~~g~~I 118 (609) |||-++-.. ++| |...|.++.. |+++- .-...+....|..+ T Consensus 113 VSPSaDt~DgrvHfTpANL~~~~HRsiE~~~T~~~L~~iF~d~~~F~~H~~LP~~~~fgDEGAANH~Rlc~~~g~~Gvel 192 (439) T pfam04996 113 VLPSADTADGRVHLTTANLNTKLHRSLEAPDTYRTLRRVFNDEARFAVHAPLPAVAHLGDEGAANHMRLCANHGAPGVNL 192 (439) T ss_pred CCCCCCCCCCCEEEECHHHCCCCHHCCCCHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCHHHHHHCCCCCCCCCEEE T ss_conf 06887678985675102422442111574769999999828965052267888888777744422110233358985599 Q ss_pred ---EEC-----CEECCHHHHHHHHHHHHHC-C--CEEEECCCHH--HHH-HHC-CCCCCC-CCEEEECCHHHCCCCHHHH Q ss_conf ---965-----6341399999999985334-7--6199705001--112-201-276332-2000120123358773889 Q gi|254780605|r 119 ---GLD-----SRLHSSFEVDLLQKSLDKI-E--GVIVDVPYNP--IDS-LWK-DRPQRL-YRKVAMQDMAYAGRESQEK 182 (609) Q Consensus 119 ---G~D-----p~l~s~~~~~~l~~~l~~~-~--~~l~~~~~Nl--VD~-iW~-~rP~~~-~~~i~~~~~~~aG~s~~eK 182 (609) |=+ |.-++..+-..-.+..... + -..+.+.+|+ ||. +.. |--... .+-.|.|...|. ...+. T Consensus 193 FVyGr~~~~~~p~rfPARQtleASqAVAR~H~L~~~~vf~qQnp~aId~GvFHNDVIaV~N~~vLf~HE~AF~--d~~~~ 270 (439) T pfam04996 193 FVHGREAWSPRPGRFPARQTLEASQAVARLHGVQATAVFAQQNPAAIDQGAFHNDVVAVGNREVLFAHEQAFA--DRPDL 270 (439) T ss_pred EEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEECCHHHHHCCCCCCCEEEECCCCEEEEHHHHHC--CHHHH T ss_conf 9973567888888777277899999999973999673786339888754776345588506543577686644--89999 Q ss_pred HHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHCCCCEEECHH Q ss_conf 99999860233430676028420234432013553778700233220147862000353112156787412563570744 Q gi|254780605|r 183 IRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDKQYINEQLKALLSAVAIVLDMD 262 (609) Q Consensus 183 i~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~~k~~~~~~~~l~~~~~~~~~~ 262 (609) ++.||+.+.. ...+-|.+- +|. =.|.--|-+|.|-++-..+|+..|.+.. +.+ +.. T Consensus 271 l~~l~~~~~~--~~~ieVp~~-~vs------l~dAV~SYLFNSQLlt~pdg~M~Li~P~-----Ecr----------~~~ 326 (439) T pfam04996 271 LAELRARLPG--FAAVEVPTR-DLS------VEDAVASYLFNSQLLSLPDGSMALVVPV-----ECR----------DNA 326 (439) T ss_pred HHHHHHHCCC--CEEEEEECC-CCC------HHHHHHHHHHCCEEEECCCCCEEEECCH-----HHH----------CCH T ss_conf 9999987379--758995101-087------9999887102013323599957997356-----873----------687 Q ss_pred HHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHCCCEEECCCCC-CHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 51577863013574289725317788975312139567515432-11566619999998799998443899999987533 Q gi|254780605|r 263 MMDSRLVCLARTSMPILIDPKWISYRFFKVIAQKNGVMVEGSDP-SCLLRATKNKVEIEGMQTAHIQDGVAMVYFLFWFY 341 (609) Q Consensus 263 ~~~~~l~~l~~~~~~i~id~~~~s~~~~~~l~~~~~~~i~~~~p-I~~lRaiKn~~EI~~mr~Ahi~DgvAl~kfl~wL~ 341 (609) .....+..+...+..|- .---+.+.++..+- ..| |..||.+=|+.|++.+..+-+.|..-+.+.-.|++ T Consensus 327 ~v~~~l~~l~~~~~PI~---~v~~~DlrQSM~NG-------GGPACLRLRVvL~e~E~~avn~~~lm~~~l~~~L~~wV~ 396 (439) T pfam04996 327 RAWRYLQTLLAADNPIT---EVKVVDLRQSMSNG-------GGPACLRLRVVLTEAERAAVNPAVLLDAALLDALTDWVD 396 (439) T ss_pred HHHHHHHHHHHCCCCCC---EEEEECHHHHHHCC-------CCCEEEEEEEECCHHHHHHCCCCEECCHHHHHHHHHHHH T ss_conf 89999999985599955---48994178785658-------987226787636788997329761338999999999999 Q ss_pred HHHCCCCCCCHHHH--HHHHHHHHHHH Q ss_conf 21001245301889--99999987543 Q gi|254780605|r 342 SQSLETITEIDIIK--KLERCREEIGC 366 (609) Q Consensus 342 ~~~~~~iTE~~ia~--~Le~~R~~~~~ 366 (609) +.-.+.++.-|++. .|.+-|..... T Consensus 397 ~hYRd~L~~~DLaDP~Ll~E~r~ALDe 423 (439) T pfam04996 397 RWYRDRLSPADLADPQLLEEAREALDE 423 (439) T ss_pred HHCCCCCCHHHCCCHHHHHHHHHHHHH T ss_conf 864333992453597999999999999 No 73 >PRK05927 hypothetical protein; Provisional Probab=53.21 E-value=1.9 Score=23.02 Aligned_cols=15 Identities=27% Similarity=0.417 Sum_probs=7.7 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 530188999999987 Q gi|254780605|r 349 TEIDIIKKLERCREE 363 (609) Q Consensus 349 TE~~ia~~Le~~R~~ 363 (609) |-.+..+.|..+|.- T Consensus 212 t~e~ri~HL~~lR~l 226 (350) T PRK05927 212 NPEDILLHLQTLRDA 226 (350) T ss_pred CHHHHHHHHHHHHHH T ss_conf 999999999999998 No 74 >PRK13309 ureC urease subunit alpha; Reviewed Probab=53.04 E-value=12 Score=17.42 Aligned_cols=62 Identities=26% Similarity=0.377 Sum_probs=41.4 Q ss_pred HHHHHHHCCCC--CCCCCCCCCCCCCCCC-------CCCCCCCCCCCCEECCCCEE------------EECCCEEECC-- Q ss_conf 73667741865--4776432321135607-------78862377898600569878------------8468603568-- Q gi|254780605|r 468 ARIFLWKYGAD--FAHGVGHGVGSFLPVH-------EGPQGISRTNQEPLLPGMIL------------SNEPGYYRCG-- 524 (609) Q Consensus 468 aR~~l~~~g~d--y~HgtGHGVG~~l~VH-------E~P~~is~~~~~~l~~Gmv~------------s~EPG~Y~~g-- 524 (609) +|+++.++-.+ -.||..|-||+ ++|- =.|.+|.-.-+.+|+-||+. |-||=+|.|- T Consensus 405 vkRYiAKyTINPAiahGis~~vGS-vE~GKlADlVlW~PafFG~KP~~VlKgG~ia~a~mGD~NaSiptpqPv~~Rpmfg 483 (572) T PRK13309 405 VLRYVAKITINPAITQGVSHVIGS-VEVGKMADLVLWDPRFFGAKPKMVIKGGMINWAAMGDPNASLPTPQPVFYRPMFG 483 (572) T ss_pred HHHHHHHHCCCHHHHCCCCCCCCC-CCCCCCEEEEEECHHHCCCCCCEEEECCEEEEECCCCCCCCCCCCCCCEECHHHH T ss_conf 888666611175766175555356-3478622487536143278862378878887851568776778876212012222 Q ss_pred CCCEEE Q ss_conf 752150 Q gi|254780605|r 525 AFGIRI 530 (609) Q Consensus 525 ~~GiRI 530 (609) .+|--. T Consensus 484 a~g~a~ 489 (572) T PRK13309 484 AMGKTL 489 (572) T ss_pred HCCCCC T ss_conf 025430 No 75 >PRK11678 putative chaperone; Provisional Probab=52.30 E-value=6 Score=19.60 Aligned_cols=18 Identities=39% Similarity=0.612 Sum_probs=9.6 Q ss_pred CCEEEEECCCCEECCCCCCEEE Q ss_conf 9879983032010561222143 Q gi|254780605|r 405 DELLLLDSGAQYVNGTTDITRT 426 (609) Q Consensus 405 ~~~~LiDsG~qY~~GtTDiTRT 426 (609) +.++.+|.|| ||+|+|=. T Consensus 209 ~~vLV~DlGG----GT~DvSlv 226 (450) T PRK11678 209 KTVLVVDIGG----GTTDCSLL 226 (450) T ss_pred CEEEEEEECC----CEEEEEEE T ss_conf 8799999089----84888747 No 76 >PRK05711 DNA polymerase III subunit epsilon; Provisional Probab=51.58 E-value=13 Score=17.37 Aligned_cols=32 Identities=34% Similarity=0.639 Sum_probs=14.9 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 89972178978888999987366774186547764323 Q gi|254780605|r 449 SVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHG 486 (609) Q Consensus 449 al~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHG 486 (609) .+|+..|| |.+ ..||++||. .|.|..|++=|| T Consensus 126 ~lAR~~~P-g~~-~sLdaLc~r----~~I~~s~R~~Hg 157 (240) T PRK05711 126 AMARRMFP-GKR-NSLDALCKR----YGIDNSHRTLHG 157 (240) T ss_pred HHHHHHCC-CCC-CCHHHHHHH----CCCCCCCCCCCC T ss_conf 99998789-998-899999998----098988756576 No 77 >PRK06631 consensus Probab=50.09 E-value=14 Score=17.13 Aligned_cols=32 Identities=22% Similarity=0.469 Sum_probs=13.8 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 89972178978888999987366774186547764323 Q gi|254780605|r 449 SVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHG 486 (609) Q Consensus 449 al~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHG 486 (609) .+++..|| |.+ ..||++||. .|.|+.|++=|| T Consensus 126 ~lAR~~~P-g~~-~sLdaLc~r----~~I~~s~r~~H~ 157 (229) T PRK06631 126 VMARSMFP-GSK-YNLDALCKR----FKVDNSGRQLHG 157 (229) T ss_pred HHHHHHCC-CCC-CCHHHHHHH----CCCCCCCCCCCC T ss_conf 99998689-988-899999998----499999855568 No 78 >PRK00683 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=49.84 E-value=6.2 Score=19.52 Aligned_cols=32 Identities=16% Similarity=0.416 Sum_probs=15.8 Q ss_pred CCEEEEECCCCEECCC--------CC-CEEEEEECCCCHHHH Q ss_conf 9879983032010561--------22-214321037888884 Q gi|254780605|r 405 DELLLLDSGAQYVNGT--------TD-ITRTIAIGDVDYEKK 437 (609) Q Consensus 405 ~~~~LiDsG~qY~~Gt--------TD-iTRT~~~G~~~~e~k 437 (609) +.++|| .||+-++.- .. +...+.+|+-.++.. T Consensus 313 ~~iilI-~GG~~K~~d~~~L~~~~~~~vk~v~~~G~~~~~i~ 353 (418) T PRK00683 313 KDVIVI-LGGRNKGGDFPSLLPVLRQTAKHVVAMGECRQEIA 353 (418) T ss_pred CCCEEE-ECCCCCCCCHHHHHHHHHHCCCEEEEECCCHHHHH T ss_conf 885899-75787667869999998631849999688999999 No 79 >PRK13985 UreB urease subunit beta; Provisional Probab=49.82 E-value=17 Score=16.48 Aligned_cols=57 Identities=30% Similarity=0.419 Sum_probs=40.0 Q ss_pred HHHHHHHHCCCC--CCCCCCCCCCCCCCC--------CCCCCCCCCCCCCEECCCCEE------------EECCCEEECC Q ss_conf 873667741865--477643232113560--------778862377898600569878------------8468603568 Q gi|254780605|r 467 IARIFLWKYGAD--FAHGVGHGVGSFLPV--------HEGPQGISRTNQEPLLPGMIL------------SNEPGYYRCG 524 (609) Q Consensus 467 ~aR~~l~~~g~d--y~HgtGHGVG~~l~V--------HE~P~~is~~~~~~l~~Gmv~------------s~EPG~Y~~g 524 (609) =+|+++.++-.+ -.||..|-||+ ++| .+ |..|+-.-+.+|+-||+. |-||=+|.|- T Consensus 400 RvkRYiAKyTINPAiahGis~~vGS-vE~GK~ADlVLW~-PafFG~KP~~VlKgG~ia~a~mGD~NaSiptpqPv~~Rpm 477 (568) T PRK13985 400 RIKRYLSKYTINPAIAHGISEYVGS-VEVGKVADLVLWS-PAFFGVKPNMIIKGGFIALSQMGDANASIPTPQPVYYREM 477 (568) T ss_pred HHHHHHHHCCCCHHHHCCCCCCCCC-CCCCCHHHEEEEC-CCCCCCCCCEEEECCEEEEECCCCCCCCCCCCCCCEECCC T ss_conf 8887754322275766583002576-5546401034205-7556877433550887767035797778899876252101 Q ss_pred C Q ss_conf 7 Q gi|254780605|r 525 A 525 (609) Q Consensus 525 ~ 525 (609) . T Consensus 478 f 478 (568) T PRK13985 478 F 478 (568) T ss_pred C T ss_conf 0 No 80 >pfam00486 Trans_reg_C Transcriptional regulatory protein, C terminal. Probab=49.59 E-value=19 Score=16.17 Aligned_cols=60 Identities=15% Similarity=0.157 Sum_probs=39.2 Q ss_pred ECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCCEE Q ss_conf 41399999999985334761997050011122012763322000120123358773889999998602334306 Q gi|254780605|r 124 LHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEVGA 197 (609) Q Consensus 124 l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~da 197 (609) ..|..+++-|..-+.+.+ ..+ ..+.+++.+|.+.+.. +-.+...-+.+||++|+..+... T Consensus 5 ~Lt~~E~~lL~~L~~~~~-~~v-sr~~l~~~iw~~~~~~------------~~~sl~~~I~rLR~kl~~~~~~~ 64 (77) T pfam00486 5 ELTPKEFDLLELLARNPG-RVV-SREELLEEVWGEDFDV------------DDNTVDVHISRLRKKLEDDPRDP 64 (77) T ss_pred ECCHHHHHHHHHHHHCCC-CCC-CHHHHHHHHCCCCCCC------------CCCCHHHHHHHHHHHHHHCCCCC T ss_conf 338999999999996799-719-7999999963876665------------53309999999999875448999 No 81 >TIGR01527 arch_NMN_Atrans nicotinamide-nucleotide adenylyltransferase; InterPro: IPR006418 This family of archaeal proteins exhibits NAD salvage biosynthesis enzyme nicotinamide-nucleotide adenylyltransferase (2.7.7.1 from EC) activity. In some cases, the enzyme was tested and found also to have the activity of nicotinate-nucleotide adenylyltransferase (2.7.7.18 from EC), an enzyme of NAD de novo biosynthesis, although with a higher Km. In some archaeal species, a number of proteins which are uncharacterised with respect to activity, are also present.; GO: 0000309 nicotinamide-nucleotide adenylyltransferase activity, 0009435 NAD biosynthetic process, 0005737 cytoplasm. Probab=48.13 E-value=15 Score=16.91 Aligned_cols=13 Identities=31% Similarity=0.337 Sum_probs=4.8 Q ss_pred CHHHHHHHHHHHH Q ss_conf 7388999999860 Q gi|254780605|r 178 ESQEKIRDICKIL 190 (609) Q Consensus 178 s~~eKi~~ir~~L 190 (609) |+-|++.=|++.| T Consensus 47 TAGERv~MI~~sl 59 (171) T TIGR01527 47 TAGERVLMITKSL 59 (171) T ss_pred CCCHHHHHHHHHH T ss_conf 6130789999999 No 82 >KOG2551 consensus Probab=47.77 E-value=18 Score=16.26 Aligned_cols=42 Identities=21% Similarity=0.309 Sum_probs=35.3 Q ss_pred CCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 2200012012335877388999999860233430676028420 Q gi|254780605|r 163 LYRKVAMQDMAYAGRESQEKIRDICKILHQKEVGAVFICDPSS 205 (609) Q Consensus 163 ~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~ 205 (609) +..-...|--.-+|+..++|+..+|+.|++. +|..+++.|.+ T Consensus 5 k~rvLcLHGfrQsg~~F~~Ktg~~rK~l~k~-~el~f~~aPh~ 46 (230) T KOG2551 5 KLRVLCLHGFRQSGKVFSEKTGSLRKLLKKL-AELVFPDAPHE 46 (230) T ss_pred CCEEEEECCHHHCCHHHHHHHHHHHHHHHHH-HEEEECCCCCC T ss_conf 7468986231221578887755689999751-11770688754 No 83 >PRK10955 DNA-binding transcriptional regulator CpxR; Provisional Probab=45.89 E-value=21 Score=15.79 Aligned_cols=10 Identities=20% Similarity=0.308 Sum_probs=5.2 Q ss_pred CCCCCCCCCC Q ss_conf 6432321135 Q gi|254780605|r 482 GVGHGVGSFL 491 (609) Q Consensus 482 gtGHGVG~~l 491 (609) -|=+|+||-| T Consensus 219 ~TvrG~GY~l 228 (232) T PRK10955 219 KTLRGRGYLM 228 (232) T ss_pred EEECCCCEEE T ss_conf 9845706599 No 84 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=45.48 E-value=8.9 Score=18.43 Aligned_cols=21 Identities=29% Similarity=0.412 Sum_probs=8.8 Q ss_pred CCEEEEECCCCEECC-CC--CCEEE Q ss_conf 987998303201056-12--22143 Q gi|254780605|r 405 DELLLLDSGAQYVNG-TT--DITRT 426 (609) Q Consensus 405 ~~~~LiDsG~qY~~G-tT--DiTRT 426 (609) |-.++||---|- .| -| |+=|. T Consensus 189 G~~~v~~~~~~~-~Gv~tDGD~RR~ 212 (272) T TIGR00393 189 GLAVVCDENEQL-KGVFTDGDLRRV 212 (272) T ss_pred CEEEEEECCCCE-EEEEECCHHHHH T ss_conf 179997155524-678714659999 No 85 >COG1056 NadR Nicotinamide mononucleotide adenylyltransferase [Coenzyme metabolism] Probab=44.98 E-value=11 Score=17.73 Aligned_cols=27 Identities=15% Similarity=0.018 Sum_probs=20.3 Q ss_pred CHHHHHHHHHHHHCCCCCEE--EEECCCC Q ss_conf 73889999998602334306--7602842 Q gi|254780605|r 178 ESQEKIRDICKILHQKEVGA--VFICDPS 204 (609) Q Consensus 178 s~~eKi~~ir~~L~~~~~da--liit~~d 204 (609) |+.|++.-|++.|++.+++. .+++-+| T Consensus 51 TagER~~mi~~~L~~~~~~~r~~~~~v~d 79 (172) T COG1056 51 TAGERIPMIRDRLREAGLDLRVYLRPVFD 79 (172) T ss_pred CCCCHHHHHHHHHHHCCCCCEEEEEECCC T ss_conf 75400699999998658774289996476 No 86 >PRK13206 ureC urease subunit alpha; Reviewed Probab=44.79 E-value=17 Score=16.39 Aligned_cols=58 Identities=29% Similarity=0.322 Sum_probs=39.5 Q ss_pred HHHHHHHHCCCC--CCCCCCCCCCCCCCCC-------CCCCCCCCCCCCEECCCCEE------------EECCCEEECCC Q ss_conf 873667741865--4776432321135607-------78862377898600569878------------84686035687 Q gi|254780605|r 467 IARIFLWKYGAD--FAHGVGHGVGSFLPVH-------EGPQGISRTNQEPLLPGMIL------------SNEPGYYRCGA 525 (609) Q Consensus 467 ~aR~~l~~~g~d--y~HgtGHGVG~~l~VH-------E~P~~is~~~~~~l~~Gmv~------------s~EPG~Y~~g~ 525 (609) -+|+++.++-.+ -.||..|-||+ ++|- =.|..|.-.-+.+|+-||+. |-||=+|.|-. T Consensus 405 RvkRYiAKyTINPAIahGish~VGS-vE~GK~ADlVLW~PafFG~KP~~VlKgG~ia~a~mGD~NaSiptpqPv~~Rpmf 483 (573) T PRK13206 405 RARRYVAKYTICPAVAHGLDHEIGS-VEVGKLADLVLWEPAFFGVRPHAVIKGGMIAWAAMGDANASIPTPQPVLPRPMF 483 (573) T ss_pred HHHHHHHHCCCCHHHHCCCCCCCCC-CCCCCCEEEEEECHHHCCCCCCEEEECCEEEEEECCCCCCCCCCCCCCEEEHHH T ss_conf 9887776446686887486533346-468851248763636418886237887888885057877688988650420322 No 87 >cd00375 Urease_alpha Urease alpha-subunit; Urease is a nickel-dependent metalloenzyme that catalyzes the hydrolysis of urea to form ammonia and carbon dioxide. Nickel-dependent ureases are found in bacteria, fungi and plants. Their primary role is to allow the use of external and internally generated urea as a nitrogen source. The enzyme consists of 3 subunits, alpha, beta and gamma, which can be fused and present on a single protein chain and which in turn forms multimers, mainly trimers. The large alpha subunit is the catalytic domain containing an active site with a bi-nickel center complexed by a carbamylated lysine. The beta and gamma subunits play a role in subunit association to form the higher order trimers. Probab=44.75 E-value=18 Score=16.37 Aligned_cols=71 Identities=31% Similarity=0.400 Sum_probs=45.0 Q ss_pred HHHHHHHHCCCC--CCCCCCCCCCCCCCCC-------CCCCCCCCCCCCEECCCCEE------------EECCCEEEC-- Q ss_conf 873667741865--4776432321135607-------78862377898600569878------------846860356-- Q gi|254780605|r 467 IARIFLWKYGAD--FAHGVGHGVGSFLPVH-------EGPQGISRTNQEPLLPGMIL------------SNEPGYYRC-- 523 (609) Q Consensus 467 ~aR~~l~~~g~d--y~HgtGHGVG~~l~VH-------E~P~~is~~~~~~l~~Gmv~------------s~EPG~Y~~-- 523 (609) =+|+++.++-.+ -.||..|-||+ ++|- =.|.+|+-.-+.+|+-||+. |-||=+|.| T Consensus 400 RvkRYiAKyTINPAiahGis~~VGS-vE~GK~ADlVLW~PafFG~KP~~VlKgG~ia~a~mGD~NaSiptpqPv~~Rpmf 478 (567) T cd00375 400 RVKRYIAKYTINPAIAHGISHEVGS-VEVGKLADLVLWEPAFFGVKPEMVLKGGFIAYAQMGDPNASIPTPQPVMMRPMF 478 (567) T ss_pred HHHHHHHHCCCCHHHHCCCCCCCCC-CCCCCCEEEEEECHHHCCCCCCEEEECCEEEEEECCCCCCCCCCCCCCEEEHHH T ss_conf 7766653155686887486543345-247851238752614327886237887888884057877688987651411111 Q ss_pred CCCCEEEEE--EEEEEE Q ss_conf 875215002--599972 Q gi|254780605|r 524 GAFGIRIEN--VLCVSE 538 (609) Q Consensus 524 g~~GiRIEn--~v~v~~ 538 (609) |.||--..+ ..+|.+ T Consensus 479 ga~g~a~~~~s~~FvS~ 495 (567) T cd00375 479 GAHGKAPAATSVTFVSQ 495 (567) T ss_pred HHCCCCCCCCEEEEECH T ss_conf 10365433455899677 No 88 >KOG2670 consensus Probab=42.83 E-value=24 Score=15.48 Aligned_cols=12 Identities=17% Similarity=0.296 Sum_probs=4.8 Q ss_pred CCHHHHHHHHHH Q ss_conf 999999879999 Q gi|254780605|r 314 KNKVEIEGMQTA 325 (609) Q Consensus 314 Kn~~EI~~mr~A 325 (609) +|++-++.++.| T Consensus 220 ~~~E~L~Li~~A 231 (433) T KOG2670 220 TNEEALDLIKEA 231 (433) T ss_pred CHHHHHHHHHHH T ss_conf 448899999999 No 89 >KOG3303 consensus Probab=40.43 E-value=26 Score=15.23 Aligned_cols=18 Identities=28% Similarity=0.296 Sum_probs=8.8 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 984438999999875332 Q gi|254780605|r 325 AHIQDGVAMVYFLFWFYS 342 (609) Q Consensus 325 Ahi~DgvAl~kfl~wL~~ 342 (609) |...+-.+..+-++|=.. T Consensus 81 AY~~~R~~~lr~~~W~~g 98 (192) T KOG3303 81 AYVYLRLARLRDLAWSYG 98 (192) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999987618 No 90 >pfam02016 Peptidase_S66 LD-carboxypeptidase. Muramoyl-tetrapeptide carboxypeptidase hydrolyses a peptide bond between a di-basic amino acid and the C-terminal D-alanine in the tetrapeptide moiety in peptidoglycan. This cleaves the bond between an L- and a D-amino acid. The function of this activity is in murein recycling. This family also includes the microcin c7 self-immunity protein. This family corresponds to Merops family S66. Probab=39.73 E-value=26 Score=15.16 Aligned_cols=24 Identities=17% Similarity=0.365 Sum_probs=13.4 Q ss_pred CHHHHHHHHHHHHCCCCCEEEEEC Q ss_conf 738899999986023343067602 Q gi|254780605|r 178 ESQEKIRDICKILHQKEVGAVFIC 201 (609) Q Consensus 178 s~~eKi~~ir~~L~~~~~daliit 201 (609) +-+++.+.|.+.+...++++++-. T Consensus 45 ~d~~Ra~dL~~a~~d~~i~aI~~~ 68 (281) T pfam02016 45 TDEERAADLHAAFADPEIDAIICA 68 (281) T ss_pred CHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 999999999998508998899992 No 91 >smart00870 Asparaginase Asparaginase, which is found in various plant, animal and bacterial cells, catalyses the deamination of asparagine to yield aspartic acid and an ammonium ion, resulting in a depletion of free circulatory asparagine in plasma PUBMED:3026924. The enzyme is effective in the treatment of human malignant lymphomas, which have a diminished capacity to produce asparagine synthetase: in order to survive, such cells absorb asparagine from blood plasma PUBMED:2407723, PUBMED:3379033 - if Asn levels have been depleted by injection of asparaginase, the lymphoma cells die. Probab=39.13 E-value=20 Score=16.04 Aligned_cols=57 Identities=16% Similarity=0.150 Sum_probs=38.3 Q ss_pred CCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCC-CCCCCHHHHHHHHCCCCCCEEECCCCC Q ss_conf 587738899999986023343067602842023443201355-377870023322014786200035311 Q gi|254780605|r 175 AGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFD-IPCSPYPLSRAILYADGKAEIFFDKQY 243 (609) Q Consensus 175 aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~D-i~~tPv~~a~aiI~~~~~~~Lfvd~~k 243 (609) +-+.|.+=.++|++.++..+.|+++||- |.| +.||-.++++++-.. .+++.|+-..+ T Consensus 58 tp~~w~~la~~I~~~~~~~~~dG~VVtH-----------GTDTme~TA~~Ls~~l~~~-~kPVVlTGsqr 115 (323) T smart00870 58 TPADWLKLAKRINEALADDGYDGVVVTH-----------GTDTLEETAYFLSLTLDSL-DKPVVLTGAMR 115 (323) T ss_pred CHHHHHHHHHHHHHHHCCCCCCEEEEEC-----------CCCHHHHHHHHHHHHHHCC-CCCEEEECCCC T ss_conf 9999999999999986056877499961-----------7635999999999997089-99889969788 No 92 >TIGR02870 spore_II_D stage II sporulation protein D; InterPro: IPR014225 This entry contains the stage II sporulation protein D (SpoIID), which is a protein involved in the endospore formation program. SpoIID is one of the three genes (spoIID, spoIIM and spoIIP, , , ), under the control of sigma E, that have been shown to be essential for the engulfment of the forespore by the mother cell. Their products are involved in degradation of the septal peptidoglycan - mutations in spoIID, spoIIM or spoIIP block sporulation at morphological stage II, prior to the engulfment stage. These three genes are absolutely conserved (sometimes even duplicated) in all endospore formers . Probab=39.05 E-value=9.6 Score=18.20 Aligned_cols=10 Identities=60% Similarity=1.189 Sum_probs=7.7 Q ss_pred CCCCCCCCCCCC Q ss_conf 764323211356 Q gi|254780605|r 481 HGVGHGVGSFLP 492 (609) Q Consensus 481 HgtGHGVG~~l~ 492 (609) -|=||||| |+ T Consensus 319 ~GyGHGVG--MS 328 (358) T TIGR02870 319 IGYGHGVG--MS 328 (358) T ss_pred ECCCCCCC--HH T ss_conf 71545555--23 No 93 >pfam04378 DUF519 Protein of unknown function (DUF519). Bacterial family of unknown function, possibly secreted. Probab=38.97 E-value=26 Score=15.19 Aligned_cols=38 Identities=16% Similarity=0.217 Sum_probs=22.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCCCC--------HHHHHHHHHH Q ss_conf 88888433366778888899721789788--------8899998736 Q gi|254780605|r 432 VDYEKKYYFTLVLKGMISVSTARFPQRTR--------GCDLDSIARI 470 (609) Q Consensus 432 ~~~e~k~~yt~VLk~~ial~~a~fp~g~~--------g~~lD~~aR~ 470 (609) |+-|+|.-|..|.++. .-+..+||.|+- ....|.+.|+ T Consensus 133 PpYE~k~ey~~v~~~l-~~a~~r~~~g~y~iWYPi~~~~~~~~~~~~ 178 (245) T pfam04378 133 PPFELKDEYQRVVAAL-REAYKRWPTGIYAIWYPIKDRRQVKRFYRA 178 (245) T ss_pred CCCCCHHHHHHHHHHH-HHHHHHCCCCEEEEEEECCCHHHHHHHHHH T ss_conf 9866656999999999-999985788379999655874999999999 No 94 >PRK08445 hypothetical protein; Provisional Probab=37.44 E-value=4.1 Score=20.77 Aligned_cols=15 Identities=27% Similarity=0.446 Sum_probs=8.9 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 530188999999987 Q gi|254780605|r 349 TEIDIIKKLERCREE 363 (609) Q Consensus 349 TE~~ia~~Le~~R~~ 363 (609) |--+.++.|..+|.- T Consensus 209 t~e~rv~HL~~lR~l 223 (348) T PRK08445 209 NDEEIIEHWEHIRDL 223 (348) T ss_pred CHHHHHHHHHHHHHH T ss_conf 999999999999999 No 95 >PRK04750 ubiB putative ubiquinone biosynthesis protein UbiB; Reviewed Probab=37.22 E-value=29 Score=14.90 Aligned_cols=69 Identities=25% Similarity=0.309 Sum_probs=37.6 Q ss_pred CCCCCCCCCCCCCCCCEEECC-----CCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHH------------HHHHHH Q ss_conf 335432222455433205038-----987998303201056122214321037888884333------------667788 Q gi|254780605|r 384 GPHAAIIHYQATVQSNRLLQK-----DELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYF------------TLVLKG 446 (609) Q Consensus 384 G~N~AipHY~~~~~s~~~i~~-----~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~y------------t~VLk~ 446 (609) |==-|-|| | +|-.+.+ |.++++|+| .+|..+++.|+.. ..|.+. T Consensus 280 GFFHADpH--P---GNi~v~~d~~~~g~~~~lDfG--------------mvG~L~~~~r~~l~~l~~a~~~~D~~~va~~ 340 (540) T PRK04750 280 GFFHADMH--P---GNIFVSYDHPENPRYIALDFG--------------IVGSLNKEDKRYLAENFLAFFNRDYRRVAEL 340 (540) T ss_pred CCEECCCC--C---HHEEECCCCCCCCCEEEEEEE--------------EECCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 86606999--5---103645777778747887401--------------3213899999999999999994059999999 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 888997217897888899998736677 Q gi|254780605|r 447 MISVSTARFPQRTRGCDLDSIARIFLW 473 (609) Q Consensus 447 ~ial~~a~fp~g~~g~~lD~~aR~~l~ 473 (609) ++. ....|+.+.-.++....|..+. T Consensus 341 ~~~--~g~v~~~~~~~~l~~~i~~~~~ 365 (540) T PRK04750 341 HVE--SGWVPPDTRVEELEFAIRAVCE 365 (540) T ss_pred HHH--HCCCCCCCCHHHHHHHHHHHHH T ss_conf 999--5899988887999999999999 No 96 >PRK05014 hscB co-chaperone HscB; Provisional Probab=37.09 E-value=10 Score=17.96 Aligned_cols=49 Identities=12% Similarity=0.285 Sum_probs=29.0 Q ss_pred EECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCC Q ss_conf 1201233587738899999986023343067602842023443201355 Q gi|254780605|r 168 AMQDMAYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFD 216 (609) Q Consensus 168 ~~~~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~D 216 (609) ..||.+|++.+..+|...+...-.-+.+-..+-..+.=..|||.+.|.| T Consensus 30 ~~HPDkf~~~~~~e~~~a~~~ss~iN~AY~tLk~pl~RA~yLL~L~G~~ 78 (171) T PRK05014 30 QVHPDKFANASEREQLAAVQQAAQINDAYQTLKHPLSRAEYMLSLHGFD 78 (171) T ss_pred HHCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCC T ss_conf 8697740589899999999999999999999748899899999982998 No 97 >PRK09125 DNA ligase; Provisional Probab=36.90 E-value=29 Score=14.86 Aligned_cols=13 Identities=23% Similarity=0.325 Sum_probs=5.1 Q ss_pred CCCCHHHHHHHHH Q ss_conf 5877388999999 Q gi|254780605|r 175 AGRESQEKIRDIC 187 (609) Q Consensus 175 aG~s~~eKi~~ir 187 (609) +|==.++|+.-+| T Consensus 38 ~~w~~SeKlDGvR 50 (277) T PRK09125 38 SGYLVSEKLDGVR 50 (277) T ss_pred CCEEEEEEEEEEE T ss_conf 2358987413799 No 98 >pfam10450 POC1 POC1 chaperone. In yeast, POC1 is a chaperone of the 20S proteasome which functions in early 20S proteasome assembly. Probab=36.62 E-value=7.4 Score=18.98 Aligned_cols=11 Identities=0% Similarity=-0.272 Sum_probs=3.8 Q ss_pred CCCCHHHHHHH Q ss_conf 37888884333 Q gi|254780605|r 430 GDVDYEKKYYF 440 (609) Q Consensus 430 G~~~~e~k~~y 440 (609) |.+++--.++| T Consensus 195 ~~~s~Y~~eC~ 205 (223) T pfam10450 195 SYPSQYSSECL 205 (223) T ss_pred CCHHHHHHHHH T ss_conf 78488899999 No 99 >PRK00294 hscB co-chaperone HscB; Provisional Probab=36.60 E-value=9.9 Score=18.09 Aligned_cols=50 Identities=10% Similarity=0.224 Sum_probs=28.5 Q ss_pred EECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCC Q ss_conf 12012335877388999999860233430676028420234432013553 Q gi|254780605|r 168 AMQDMAYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFDI 217 (609) Q Consensus 168 ~~~~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~Di 217 (609) ..||.+|++.|-.+|...+...-.-+.+-..+-..+.=-.|||.++|.|+ T Consensus 33 ~~HPDrf~~~s~~e~~~a~~~sa~iN~AY~tLk~pl~RA~yLL~l~G~~~ 82 (173) T PRK00294 33 EVHPDRFADAPEREQRLALERSAALNEAYQTLKSAPRRARYLLALSGHEV 82 (173) T ss_pred HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCC T ss_conf 86957367989999999999999999999997599999999998549888 No 100 >pfam05422 SIN1 Stress-activated map kinase interacting protein 1 (SIN1). This family consists of several stress-activated map kinase interacting protein 1 (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1 mitogen-activated protein (MAP) kinase is a member of the eukaryotic stress-activated MAP kinase (SAPK) family. Sin1 interacts with Sty1/Spc1. Cells lacking Sin1 display many, but not all, of the phenotypes of cells lacking the Sty1/Spc1 MAP kinase including sterility, multiple stress sensitivity and a cell-cycle delay. Sin1 is phosphorylated after stress but this is not Sty1/Spc1-dependent. Probab=36.22 E-value=25 Score=15.28 Aligned_cols=10 Identities=0% Similarity=0.255 Sum_probs=6.8 Q ss_pred CHHHHHHHHH Q ss_conf 7388999999 Q gi|254780605|r 178 ESQEKIRDIC 187 (609) Q Consensus 178 s~~eKi~~ir 187 (609) ++++||++|+ T Consensus 98 ntaq~lE~l~ 107 (482) T pfam05422 98 NTAQELERLD 107 (482) T ss_pred HHHHHHHHHH T ss_conf 4888998775 No 101 >TIGR03290 CoB_CoM_SS_C CoB--CoM heterodisulfide reductase, subunit C. The last step in methanogenesis leaves two coenzymes of methanogenesis, CoM and CoB, linked by a disulfide bond. Members of this protein family are the C subunit of the enzyme that reduces the heterodisulfide to CoB-SH and CoM-SH. Similar enzyme complex subunits are found in various other species, but likely act on a different substrate. Probab=35.87 E-value=27 Score=15.10 Aligned_cols=10 Identities=40% Similarity=0.634 Sum_probs=6.2 Q ss_pred CCCCCCCCCC Q ss_conf 4776432321 Q gi|254780605|r 479 FAHGVGHGVG 488 (609) Q Consensus 479 y~HgtGHGVG 488 (609) |...|||+|+ T Consensus 91 ~i~~~GhaVp 100 (144) T TIGR03290 91 FVLKTGHAVP 100 (144) T ss_pred HHHHHCCCCC T ss_conf 9999687788 No 102 >cd07356 HN_L-whirlin_R1_like First harmonin_N_like domain (repeat 1) of the long isoform of whirlin, and related domains. This subgroup contains the first of two harmonin_N_like domains of the long isoform of whirlin, and related domains. Whirlin is a postsynaptic density-95/discs-large/ZO-1 (PDZ) domain-containing scaffold protein which binds various components of the Usher protein network of the inner ear and the retina: erythrocyte protein p55, usherin, VlGR1, and myosin XVa. The long isoform of whirlin contains two harmonin_N_like domains, and three PDZ protein-binding domains, PDZ1-3. This first harmonin_N_like domain precedes PDZ1, and is a putative protein-binding module based on its sequence similarity to the N-terminal domain of harmonin. This first harmonin_N_like domain has been assayed for interaction with the cytoplasmic domain of cadherin 23 (a component of the Usher network and an interacting partner of the harmonin N-domain), however no interaction could be detected. Th Probab=35.66 E-value=30 Score=14.73 Aligned_cols=33 Identities=30% Similarity=0.538 Sum_probs=22.0 Q ss_pred HHHCCHHHHHH----HHHHHHH--HHHHH---HHHCCCHHHH Q ss_conf 67789999999----9999999--99997---4411807999 Q gi|254780605|r 567 VELLTNEEKKW----CNDYHRR--VYTSL---APLIEDQEVL 599 (609) Q Consensus 567 ~~ll~~~e~~w----ln~Yh~~--v~~~~---~p~l~~~~~~ 599 (609) ..+||+.|++. ||+||.+ |++-+ .|.|+.++.+ T Consensus 13 ~~lLt~~Er~~f~~~Ln~Y~~~RNV~~Lv~sL~~vLd~P~Kr 54 (78) T cd07356 13 TKLLSEAEREEFIHCLNDYHAKRNVYDLVQSLKVVLDTPEKR 54 (78) T ss_pred HHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHH T ss_conf 998799889999999999862054999999999980887687 No 103 >KOG0582 consensus Probab=35.07 E-value=31 Score=14.67 Aligned_cols=17 Identities=18% Similarity=0.100 Sum_probs=7.1 Q ss_pred HHHCCCCCCCCCEEEEC Q ss_conf 22012763322000120 Q gi|254780605|r 154 SLWKDRPQRLYRKVAMQ 170 (609) Q Consensus 154 ~iW~~rP~~~~~~i~~~ 170 (609) .+|.=-|.......+.+ T Consensus 98 ~LWvVmpfMa~GS~ldI 114 (516) T KOG0582 98 ELWVVMPFMAGGSLLDI 114 (516) T ss_pred EEEEEEHHHCCCCHHHH T ss_conf 66885433337728999 No 104 >PRK03578 hscB co-chaperone HscB; Provisional Probab=34.96 E-value=11 Score=17.89 Aligned_cols=49 Identities=14% Similarity=0.167 Sum_probs=26.7 Q ss_pred EECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCC Q ss_conf 1201233587738899999986023343067602842023443201355 Q gi|254780605|r 168 AMQDMAYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIRGFD 216 (609) Q Consensus 168 ~~~~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiRG~D 216 (609) ..||.+|++.|-.+|...+...-.-+.+-..+-..+.=-.|||.++|-| T Consensus 35 ~~HPDkf~~~s~~e~~~a~~~sa~iN~AY~tLk~pl~RA~YLL~l~G~~ 83 (176) T PRK03578 35 QVHPDRFAAAGDAEKRVAMQWATRANEAYRTLRDPLKRATYLLHLRGVD 83 (176) T ss_pred HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCC T ss_conf 8796757788999999999999999999999759999999999985988 No 105 >PRK13207 ureC urease subunit alpha; Reviewed Probab=34.27 E-value=29 Score=14.90 Aligned_cols=64 Identities=31% Similarity=0.441 Sum_probs=40.4 Q ss_pred HHHHHHHCCCC--CCCCCCCCCCCCCCCC-------CCCCCCCCCCCCEECCCCEE------------EECCCEEECC-- Q ss_conf 73667741865--4776432321135607-------78862377898600569878------------8468603568-- Q gi|254780605|r 468 ARIFLWKYGAD--FAHGVGHGVGSFLPVH-------EGPQGISRTNQEPLLPGMIL------------SNEPGYYRCG-- 524 (609) Q Consensus 468 aR~~l~~~g~d--y~HgtGHGVG~~l~VH-------E~P~~is~~~~~~l~~Gmv~------------s~EPG~Y~~g-- 524 (609) +|+++.++-.+ -.||..|-||+ ++|- =.|..|.-.-+.+|+-||+. |-||=+|.|- T Consensus 401 ikRYiAKyTINPAIahGis~~vGS-vE~GK~ADlVlW~PafFGvKPe~VlKgG~Ia~a~MGDpNASIPTpQPv~~RpMFG 479 (568) T PRK13207 401 VKRYIAKYTINPAIAHGISHEVGS-VEVGKLADLVLWKPAFFGVKPELVLKGGMIAWAQMGDPNASIPTPQPVHYRPMFG 479 (568) T ss_pred HHHHHHHHCCCHHHHCCCCCCCCC-CCCCCEEEEEEECHHHCCCCCCEEEECCEEEEECCCCCCCCCCCCCCCEECHHHH T ss_conf 877766621477886475322056-4576242277525364187840378768886804579777889876513112122 Q ss_pred CCCEEEEE Q ss_conf 75215002 Q gi|254780605|r 525 AFGIRIEN 532 (609) Q Consensus 525 ~~GiRIEn 532 (609) .+|--..+ T Consensus 480 a~G~A~~~ 487 (568) T PRK13207 480 AYGGALAA 487 (568) T ss_pred HCCCCCCC T ss_conf 02555556 No 106 >COG1049 AcnB Aconitase B [Energy production and conversion] Probab=33.80 E-value=32 Score=14.53 Aligned_cols=77 Identities=29% Similarity=0.437 Sum_probs=41.9 Q ss_pred HHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEE-----------CCCCCHHHHHHCCCCCCCCCCH Q ss_conf 22012763322000120123358773889999998602334306760-----------2842023443201355377870 Q gi|254780605|r 154 SLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEVGAVFI-----------CDPSSIAWIFNIRGFDIPCSPY 222 (609) Q Consensus 154 ~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~dalii-----------t~~d~IaWLlNiRG~Di~~tPv 222 (609) ..|. ||..|..-.-.+..+--|. .-+..+.++++|.-...+ |...++-|.+ |.|+||-|- T Consensus 187 dA~s-RpDipLHalaMlk~~~~gg-----~i~~i~~lk~Kg~~vayvgdvvGtGSSRkSa~NsvlW~~---g~diP~VPn 257 (852) T COG1049 187 DAWS-RPDIPLHALAMLKNAREGG-----PIKQIEALKQKGHPVAYVGDVVGTGSSRKSATNSVLWHM---GDDIPYVPN 257 (852) T ss_pred CCCC-CCCCCHHHHHHCCCCCCCC-----HHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCEEEEEC---CCCCCCCCC T ss_conf 5556-8886337776314666687-----278999877549558995461125756432301032114---787877875 Q ss_pred HHHHHHHCCCCCCEEEC Q ss_conf 02332201478620003 Q gi|254780605|r 223 PLSRAILYADGKAEIFF 239 (609) Q Consensus 223 ~~a~aiI~~~~~~~Lfv 239 (609) .++-.++.-.+-+-+|. T Consensus 258 kr~ggivlG~~IaPIFf 274 (852) T COG1049 258 KRYGGIVLGGKIAPIFF 274 (852) T ss_pred CCCCCEEECCEECCEEE T ss_conf 66665466571232354 No 107 >COG0684 MenG Demethylmenaquinone methyltransferase [Coenzyme metabolism] Probab=33.78 E-value=32 Score=14.53 Aligned_cols=77 Identities=21% Similarity=0.267 Sum_probs=43.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHH Q ss_conf 55554100001233543222245543320503898799830320105612221432103788888433366778888899 Q gi|254780605|r 372 LRDIAFNTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVS 451 (609) Q Consensus 372 ~~~~SF~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~ 451 (609) |.|..| |+--+..| .++|. +- ....+|++++||.|+ |++|.++ |.. ++.. T Consensus 45 ~~G~A~-TV~~~~~d-~~~~~-al----~~~~~GdVLVid~~g-------~~~~A~~-Gd~---------------la~~ 94 (210) T COG0684 45 FVGPAV-TVRCFPDD-WLLHV-AL----EQAGPGDVLVIDGGG-------DLRRALW-GDL---------------LATL 94 (210) T ss_pred CEEEEE-EEEEECCC-CHHHH-EE----ECCCCCCEEEEECCC-------CCCEEEH-HHH---------------HHHH T ss_conf 056779-99970797-54241-00----027999789996898-------8522203-789---------------9999 Q ss_pred HHCCCCCCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 72178978888999987366774186547 Q gi|254780605|r 452 TARFPQRTRGCDLDSIARIFLWKYGADFA 480 (609) Q Consensus 452 ~a~fp~g~~g~~lD~~aR~~l~~~g~dy~ 480 (609) ... .|..|--+|...|+.-.-..+||+ T Consensus 95 a~~--~G~~GvVidG~vRDv~~l~el~~p 121 (210) T COG0684 95 AKV--RGWAGVVIDGAVRDVDELRELDFP 121 (210) T ss_pred HHH--CCCCEEEEECEEECHHHHHHCCCC T ss_conf 987--683489970456026788536997 No 108 >cd05297 GH4_alpha_glucosidase_galactosidase Glycoside Hydrolases Family 4; Alpha-glucosidases and alpha-galactosidases. linked to 3D####ucture Probab=33.26 E-value=25 Score=15.25 Aligned_cols=44 Identities=11% Similarity=0.087 Sum_probs=21.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHH Q ss_conf 00123354322224554332050389879983032010561222143210378888843 Q gi|254780605|r 380 IAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKY 438 (609) Q Consensus 380 Iva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~ 438 (609) ++-.--||||+.- | .|..++- ...||..| -+.+.+|+.+..++- T Consensus 329 ~vNv~N~G~I~nL-p---~DavVEV--pc~V~~~G---------i~P~~vg~lP~~~~g 372 (423) T cd05297 329 NGNVPNNGLIPNL-P---DDVVVEV--PALVDRNG---------IHPEKIGPLPPQLAA 372 (423) T ss_pred EEECCCCCCCCCC-C---CCCEEEE--EEEECCCC---------CCCCCCCCCCHHHHH T ss_conf 9988899857899-9---9978999--88985997---------541446989899999 No 109 >PRK13191 putative peroxiredoxin; Provisional Probab=32.83 E-value=33 Score=14.43 Aligned_cols=48 Identities=10% Similarity=0.050 Sum_probs=26.0 Q ss_pred EEEECCHHHCCCCHHHHH--HHHHHHHCCCCCEEEEECCC---CCHHHHHHCC Q ss_conf 001201233587738899--99998602334306760284---2023443201 Q gi|254780605|r 166 KVAMQDMAYAGRESQEKI--RDICKILHQKEVGAVFICDP---SSIAWIFNIR 213 (609) Q Consensus 166 ~i~~~~~~~aG~s~~eKi--~~ir~~L~~~~~daliit~~---d~IaWLlNiR 213 (609) -+|.||..|+.....|=. ++.....++.|+..+-+|.= ...+|+-.++ T Consensus 38 VLffyP~DFTpVCtTEl~~f~~~~~eF~k~n~~vlg~S~Ds~~sH~aW~~~i~ 90 (230) T PRK13191 38 VLFSHPGDFTPVCTTEFYSFAKRYEDFKKLNTELIGLSVDSNISHIEWVNWIE 90 (230) T ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHH T ss_conf 99978898899597999999988999997599899997896888999987689 No 110 >PRK04663 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=32.35 E-value=34 Score=14.37 Aligned_cols=32 Identities=16% Similarity=0.220 Sum_probs=15.0 Q ss_pred CCCEEEEECCCCEECCC--------CCCE-EEEEECCCCHHH Q ss_conf 89879983032010561--------2221-432103788888 Q gi|254780605|r 404 KDELLLLDSGAQYVNGT--------TDIT-RTIAIGDVDYEK 436 (609) Q Consensus 404 ~~~~~LiDsG~qY~~Gt--------TDiT-RT~~~G~~~~e~ 436 (609) ++.++|| .||+-+++- .... ..+.+|+-.++. T Consensus 338 ~~~i~lI-~GG~~Kg~d~~~l~~~~~~~~~~~~~~G~~~~~~ 378 (438) T PRK04663 338 EGKLYLL-VGGVGKGADFSELKPVLATLNLQLCCFGEDGDQF 378 (438) T ss_pred CCCEEEE-ECCCCCCCCHHHHHHHHHHCCEEEEEECCCHHHH T ss_conf 7847999-6576666687999999852376999988889999 No 111 >PRK05926 hypothetical protein; Provisional Probab=32.29 E-value=5.2 Score=20.07 Aligned_cols=15 Identities=27% Similarity=0.459 Sum_probs=9.6 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 530188999999987 Q gi|254780605|r 349 TEIDIIKKLERCREE 363 (609) Q Consensus 349 TE~~ia~~Le~~R~~ 363 (609) |..+.++.|..+|.- T Consensus 235 t~~~rv~HL~~lR~l 249 (371) T PRK05926 235 TPEDIVTHMSKLRDL 249 (371) T ss_pred CHHHHHHHHHHHHHH T ss_conf 999999999999998 No 112 >KOG0996 consensus Probab=31.57 E-value=27 Score=15.08 Aligned_cols=18 Identities=17% Similarity=0.108 Sum_probs=7.9 Q ss_pred EEEECCEECCHHHHHHHH Q ss_conf 899656341399999999 Q gi|254780605|r 117 RLGLDSRLHSSFEVDLLQ 134 (609) Q Consensus 117 ~IG~Dp~l~s~~~~~~l~ 134 (609) +--+|++-.|...+..+- T Consensus 198 kY~Ingk~as~~~V~~lL 215 (1293) T KOG0996 198 KYYINGKEASFKDVTKLL 215 (1293) T ss_pred EEEECCCCCCHHHHHHHH T ss_conf 276778504299999999 No 113 >COG3737 Uncharacterized conserved protein [Function unknown] Probab=31.21 E-value=35 Score=14.25 Aligned_cols=50 Identities=20% Similarity=0.213 Sum_probs=25.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHH Q ss_conf 00001233543222245543320503898799830320105612221432103788888433366778 Q gi|254780605|r 378 NTIAASGPHAAIIHYQATVQSNRLLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLK 445 (609) Q Consensus 378 ~TIva~G~N~AipHY~~~~~s~~~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk 445 (609) -+|-++|..+.-. +..-..+.++++++|- |. +-+-++++-+.+.+..||+ T Consensus 16 ~~~~ayG~Gg~R~--------a~~sh~~SlL~lpdgv-~~---------W~v~~~~~Lt~e~f~~vl~ 65 (127) T COG3737 16 APIDAYGAGGFRF--------ADMSHRGSLLVLPDGV-CD---------WEVATLSDLTPEDFERVLA 65 (127) T ss_pred CHHHHHCCCCEEE--------CCCCCCCCEEEECCCC-CC---------CCCCCHHHCCHHHHHHHHH T ss_conf 1355635887476--------4405156578816965-44---------6656756679999999873 No 114 >PRK06995 flhF flagellar biosynthesis regulator FlhF; Validated Probab=31.19 E-value=35 Score=14.24 Aligned_cols=23 Identities=22% Similarity=0.408 Sum_probs=14.6 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 55899999999999749979996088 Q gi|254780605|r 13 KTFERVHNLRSCFDSLGMDAFLVPRV 38 (609) Q Consensus 13 ~~~erL~~LR~~m~~~~idayiip~~ 38 (609) +..|=|.+.|+ +.|=||.|+.+- T Consensus 10 ~m~eAl~~Vr~---eLG~DAVILStr 32 (404) T PRK06995 10 TSRDALRLVRE---ALGADAVILSNR 32 (404) T ss_pred CHHHHHHHHHH---HHCCCEEEEECE T ss_conf 99999999999---978993999794 No 115 >pfam01307 Plant_vir_prot Plant viral movement protein. This family includes several known plant viral movement proteins from a number of different ssRNA plant virus families including potexviruses, hordeiviruses and carlaviruses. Probab=30.99 E-value=24 Score=15.37 Aligned_cols=19 Identities=26% Similarity=0.508 Sum_probs=14.3 Q ss_pred CCCCEECCCCCCEEEEEECCCCH Q ss_conf 03201056122214321037888 Q gi|254780605|r 412 SGAQYVNGTTDITRTIAIGDVDY 434 (609) Q Consensus 412 sG~qY~~GtTDiTRT~~~G~~~~ 434 (609) -||+|.||| +.+.++.|.. T Consensus 44 hGG~YrDGT----K~I~Y~~P~~ 62 (98) T pfam01307 44 HGGSYRDGT----KRINYNSPNK 62 (98) T ss_pred CCCEECCCC----EEEEECCCCC T ss_conf 887503786----1787589987 No 116 >PRK11835 hypothetical protein; Provisional Probab=30.95 E-value=17 Score=16.53 Aligned_cols=12 Identities=25% Similarity=0.628 Sum_probs=4.6 Q ss_pred HHCCCCCCEEEC Q ss_conf 201478620003 Q gi|254780605|r 228 ILYADGKAEIFF 239 (609) Q Consensus 228 iI~~~~~~~Lfv 239 (609) ++.-+|++.||+ T Consensus 74 i~E~eGePCLFv 85 (114) T PRK11835 74 VLECEGEPCLFV 85 (114) T ss_pred EEEECCCCEEEE T ss_conf 888569626887 No 117 >PTZ00075 S-adenosyl-L-homocysteine hydrolase; Provisional Probab=30.90 E-value=35 Score=14.26 Aligned_cols=46 Identities=13% Similarity=0.128 Sum_probs=29.0 Q ss_pred ECCCCCCHHHHHHHHHCCCE-EECCCCCCHHHHHCCCHHHHHHHHHH Q ss_conf 72531778897531213956-75154321156661999999879999 Q gi|254780605|r 280 IDPKWISYRFFKVIAQKNGV-MVEGSDPSCLLRATKNKVEIEGMQTA 325 (609) Q Consensus 280 id~~~~s~~~~~~l~~~~~~-~i~~~~pI~~lRaiKn~~EI~~mr~A 325 (609) +-..+|--.+...++..+.. ++.-.|||..|.|.=+=-|+-.|.+| T Consensus 260 ~GYG~~GkG~A~~~rg~GA~ViVtEiDPi~ALqA~MdGf~V~~m~ea 306 (476) T PTZ00075 260 CGYGDVGKGCAQALRGFGARVVVTEIDPICALQAAMEGYQVVLLEDV 306 (476) T ss_pred EECCCCCHHHHHHHCCCCCEEEEECCCCHHHHHHHHCCCEEEEHHHH T ss_conf 50453012147665358978999616807899987458787569998 No 118 >TIGR02405 trehalos_R_Ecol trehalose operon repressor; InterPro: IPR012771 Trehalose is a non-reducing disaccharide which can be used as both a carbon source and an osmoprotectant in bacteria. Trehalose uptake into the cytoplasm occurs via a trehalose-specific phosphotransferase system which phosphorylates trehalase to trehalose-6-phosphate (Tre6P) during transport into the cytoplasm, and a hydrolase which hydrolyses Tre6P to glucose and glucose-6-phophate. This entry represents LacI-type TreR, a transcriptional repressor of trehalose uptake found mainly within the gamma-proteobacteria. It does not include the GntR-type TreR's such as those found in Bacillus species. It is capable of binding both the inducer Tre6P and trehalose. Binding of trehalose does not affect the repressor's affinity for its DNA binding site, while binding Tre6P substantially reduces its affinity. The repression activity of TreR is therefore regulated by the ratio of trehalose to Tre6P within the cell . The protein is composed of two domains, an N-terminal DNA-binding helix-turn-helix domain, and a C-terminal effector-binding domain which is homologous to that of LacI. The effector-binding domain is composed of two subdomains, both of which form an alpha-beta-alpha sandwhich, with the effector binding site located at the interface of these subdomains . Tre6P and trehalose bind competitively to this site, with the affinty for trehalose substantially lower than that for Tre6P.; GO: 0003677 DNA binding, 0016566 specific transcriptional repressor activity, 0005991 trehalose metabolic process, 0016481 negative regulation of transcription. Probab=30.60 E-value=11 Score=17.65 Aligned_cols=17 Identities=35% Similarity=0.692 Sum_probs=12.6 Q ss_pred CCCCEEEECCCEEECCCC Q ss_conf 569878846860356875 Q gi|254780605|r 509 LPGMILSNEPGYYRCGAF 526 (609) Q Consensus 509 ~~Gmv~s~EPG~Y~~g~~ 526 (609) -| .++||-||||..|+. T Consensus 271 ~P-n~vsiDpGY~eAGk~ 287 (311) T TIGR02405 271 FP-NTVSIDPGYAEAGKA 287 (311) T ss_pred CC-CEEEECCCHHHHHHH T ss_conf 87-637656546789999 No 119 >PHA00451 protein kinase Probab=30.56 E-value=36 Score=14.17 Aligned_cols=22 Identities=9% Similarity=-0.148 Sum_probs=13.9 Q ss_pred ECCHHHHHHHHHHHHHCCCEEE Q ss_conf 4139999999998533476199 Q gi|254780605|r 124 LHSSFEVDLLQKSLDKIEGVIV 145 (609) Q Consensus 124 l~s~~~~~~l~~~l~~~~~~l~ 145 (609) |-...+|..|...+..+|-.+. T Consensus 52 LE~qD~W~tL~~~mrDaGF~~L 73 (364) T PHA00451 52 LEYQDWWQTLGALMRDAGFNLL 73 (364) T ss_pred CHHHHHHHHHHHHHHCCCHHEE T ss_conf 0565799999999851143000 No 120 >TIGR01419 nitro_reg_IIA PTS IIA-like nitrogen-regulatory protein PtsN; InterPro: IPR006320 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . These sequences, which are about 160 residues in length, are closely related to the fructose-specific phosphotransferase (PTS) system IIA component. It is a regulatory protein found only in species with a phosphoenolpyruvate-protein phosphotransferase (enzyme I of PTS systems) and an HPr-like phosphocarrier protein, but not all species have a IIC-like permease. Members of this family are found in Proteobacteria, Chlamydia, and the spirochete Treponema pallidum , .; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system. Probab=30.18 E-value=37 Score=14.13 Aligned_cols=29 Identities=21% Similarity=0.233 Sum_probs=14.9 Q ss_pred CCC-CC-CCCCCCCCCCCCEEE-CCCCEEEEECC Q ss_conf 233-54-322224554332050-38987998303 Q gi|254780605|r 383 SGP-HA-AIIHYQATVQSNRLL-QKDELLLLDSG 413 (609) Q Consensus 383 ~G~-N~-AipHY~~~~~s~~~i-~~~~~~LiDsG 413 (609) +|- || ||||.|-.+. ... -.+-++=+|+= T Consensus 56 TG~GnGIAIPHgrl~~~--~~~~~~~aF~rLd~P 87 (149) T TIGR01419 56 TGLGNGIAIPHGRLSEG--KQRTAVGAFIRLDAP 87 (149) T ss_pred CCCCCCCCCCCCCCCCC--CCCCCEEEEEEECCC T ss_conf 89986433676507988--732650477740688 No 121 >cd02166 NMNAT_Archaea This family of archaeal proteins exhibits nicotinamide-nucleotide adenylyltransferase (NMNAT) activity utilizing the salvage pathway to synthesize NAD. In some cases, the enzyme was tested and found also to have the activity of nicotinate-nucleotide adenylyltransferase an enzyme of NAD de novo biosynthesis, although with a higher Km. In some archaeal species, a number of proteins which are uncharacterized with respect to activity, are also present. Probab=30.12 E-value=37 Score=14.12 Aligned_cols=65 Identities=14% Similarity=0.193 Sum_probs=34.7 Q ss_pred ECCEECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCCE--- Q ss_conf 65634139999999998533476199705001112201276332200012012335877388999999860233430--- Q gi|254780605|r 120 LDSRLHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEVG--- 196 (609) Q Consensus 120 ~Dp~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~d--- 196 (609) |||... ....-++..++..+-.++.+.. .+++..+.+| -|+.|+++=|+..|++.+.+ T Consensus 8 FqP~h~--GHl~~i~~a~~~~d~~ii~igs-------~~~~~~~~NP----------ft~~ER~~Mi~~al~~~~~~~~~ 68 (163) T cd02166 8 FQPFHL--GHLQVIKWILEEVDELIIGIGS-------AQESHTLRNP----------FTAGERVLMIRKALEEAGIDLSR 68 (163) T ss_pred CCCCCH--HHHHHHHHHHHHCCEEEEEECC-------CCCCCCCCCC----------CCHHHHHHHHHHHHHHCCCCCCE T ss_conf 599867--8999999999748927999777-------8667778899----------88999999999999966987651 Q ss_pred EEEECCC Q ss_conf 6760284 Q gi|254780605|r 197 AVFICDP 203 (609) Q Consensus 197 aliit~~ 203 (609) .+++.-+ T Consensus 69 ~~ivpi~ 75 (163) T cd02166 69 YYIIPVP 75 (163) T ss_pred EEEEECC T ss_conf 8998566 No 122 >TIGR02902 spore_lonB ATP-dependent protease LonB; InterPro: IPR014251 This entry represents LonB, a paralog of the ATP-dependent protease La (LonA, IPR004815 from INTERPRO). LonB proteins are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ) and are found strictly, and almost universally, in endospore-forming bacteria. This protease was shown, in Bacillus subtilis, to be expressed specifically in the forespore during sporulation, under control of sigmaF . The lonB gene, despite being located immediately upstream of lonA, was shown to be monocistronic. LonB appears to be involved in the post-translation control of sigmaH, but lonB mutation did not produce an obvious sporulation defect under the conditions tested . Note that additional paralogs of LonA and LonB occur in the Clostridium lineage and these are excluded from this entry. . Probab=30.12 E-value=36 Score=14.17 Aligned_cols=21 Identities=29% Similarity=0.270 Sum_probs=12.7 Q ss_pred CCCCEEEECCEECCHHHHHHHHH Q ss_conf 79988996563413999999999 Q gi|254780605|r 113 FVGLRLGLDSRLHSSFEVDLLQK 135 (609) Q Consensus 113 ~~g~~IG~Dp~l~s~~~~~~l~~ 135 (609) .+..|+.+| -=|-.++++|.+ T Consensus 25 Qq~nk~~id--kESkKEl~kLn~ 45 (532) T TIGR02902 25 QQTNKITID--KESKKELDKLNK 45 (532) T ss_pred CCCCEEEEE--ECCHHHHHHHHC T ss_conf 534404565--047689987611 No 123 >PRK10816 DNA-binding transcriptional regulator PhoP; Provisional Probab=29.82 E-value=37 Score=14.09 Aligned_cols=43 Identities=12% Similarity=0.181 Sum_probs=18.0 Q ss_pred HHHHCCCCC--CCHHHHHHHH---HHHHHHCCC-CCCCCCCCCCCCCCCC Q ss_conf 997217897--8888999987---366774186-5477643232113560 Q gi|254780605|r 450 VSTARFPQR--TRGCDLDSIA---RIFLWKYGA-DFAHGVGHGVGSFLPV 493 (609) Q Consensus 450 l~~a~fp~g--~~g~~lD~~a---R~~l~~~g~-dy~HgtGHGVG~~l~V 493 (609) +...++... .....||..- |+-|.+.+- ++ =-|=||+||-++| T Consensus 174 l~~~vw~~~~~~~~~sld~~I~rLRkKl~~~~~~~~-I~tvrG~GY~~~~ 222 (223) T PRK10816 174 LMLQLYPDAELRESHTIDVLMGRLRKKIQAQYPQEV-ITTVRGQGYLFEL 222 (223) T ss_pred HHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCC-EEEECCCCEEEEE T ss_conf 999854998788867799999999997433589981-8997895679875 No 124 >TIGR01820 TrpE-arch anthranilate synthase component I; InterPro: IPR010116 This model represents an archaeal clade of anthranilate synthase component I enzymes. This enzyme (4.1.3.27 from EC) is responsible for the first step of tryptophan biosynthesis from chorismate. The Sulfolobus enzyme has been reported to be part of a gene cluster for Trp biosynthesis . ; GO: 0004049 anthranilate synthase activity, 0000162 tryptophan biosynthetic process. Probab=29.25 E-value=5.2 Score=20.07 Aligned_cols=94 Identities=29% Similarity=0.385 Sum_probs=52.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEC----------CCCEEEEECCCCEECCCCCCEEEEEECCC--CHHHH-HHHHHHH- Q ss_conf 0001233543222245543320503----------89879983032010561222143210378--88884-3336677- Q gi|254780605|r 379 TIAASGPHAAIIHYQATVQSNRLLQ----------KDELLLLDSGAQYVNGTTDITRTIAIGDV--DYEKK-YYFTLVL- 444 (609) Q Consensus 379 TIva~G~N~AipHY~~~~~s~~~i~----------~~~~~LiDsG~qY~~GtTDiTRT~~~G~~--~~e~k-~~yt~VL- 444 (609) ||+|+-|.||-|| -|+.|. .-.+=|+|.| .. ||=|.--.|+. ++-++ .-|..|. T Consensus 324 PIAGT~pRGk~~e------ED~~L~~~LL~DEKERAEH~MLVDLA---RN---Dvr~Vs~~GsVkV~~fM~v~KYShVQH 391 (508) T TIGR01820 324 PIAGTRPRGKTEE------EDEELAKELLSDEKERAEHVMLVDLA---RN---DVRRVSKPGSVKVPEFMKVEKYSHVQH 391 (508) T ss_pred CCCCCCCCCCCHH------HHHHHHHHHHHHHHHHHCCCCEEECC---CC---CCCCCCCCCEEEECCCEEEEECCCEEE T ss_conf 5777679877989------99999998410022232012410002---07---754266587046246217976582257 Q ss_pred ---------HH---HHHHHHHCCCCCC-CHH----HHHHHHHHHHHHCCCC-CCCCCCCCCCCC Q ss_conf ---------88---8889972178978-888----9999873667741865-477643232113 Q gi|254780605|r 445 ---------KG---MISVSTARFPQRT-RGC----DLDSIARIFLWKYGAD-FAHGVGHGVGSF 490 (609) Q Consensus 445 ---------k~---~ial~~a~fp~g~-~g~----~lD~~aR~~l~~~g~d-y~HgtGHGVG~~ 490 (609) +. +.+.-+|.||+|| +|. .+..|. -|.+.-.. | |=|||+| T Consensus 392 IeS~V~G~L~~~~d~fDA~~A~FPAGTlSGAPK~RAMEII~--~LE~~~RG~Y----gGgVGYf 449 (508) T TIGR01820 392 IESTVVGTLKKDADAFDALKATFPAGTLSGAPKIRAMEIID--ELEKEPRGVY----GGGVGYF 449 (508) T ss_pred EEEEEEEEECCCCCCCCCEEEECCCCCCCCCCHHHHHHHHH--HHHCCCCCCC----CCEEECC T ss_conf 99889997134688101012134776547985367888888--8607888732----2313213 No 125 >TIGR01825 gly_Cac_T_rel pyridoxal phosphate-dependent acyltransferase, putative; InterPro: IPR010962 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). PLP is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination , , . PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors . Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy . PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalyzed reactions, enzymatic and non-enzymatic . This entry contains a number of enzyme families: Serine palmitoyltransferase Glycine C-acetyltransferase (2-amino-3-ketobutyrate coenzyme A ligase) 5-aminolevulinic acid synthase 8-amino-7-oxononanoate synthase All transfer the R-group (acetyl, succinyl, or 6-carboxyhexanoyl) from coenzyme A to an amino acid (Gly, Gly, Ala, respectively), with release of CO2 for the latter two reactions.; GO: 0016740 transferase activity, 0030170 pyridoxal phosphate binding. Probab=29.20 E-value=12 Score=17.41 Aligned_cols=34 Identities=6% Similarity=0.220 Sum_probs=15.2 Q ss_pred HHHHHHHHHCCCCCCEEEECCC-CCCHHHHHHHHH Q ss_conf 4515778630135742897253-177889753121 Q gi|254780605|r 262 DMMDSRLVCLARTSMPILIDPK-WISYRFFKVIAQ 295 (609) Q Consensus 262 ~~~~~~l~~l~~~~~~i~id~~-~~s~~~~~~l~~ 295 (609) +++...|..+++....+.+.+. +||......+-. T Consensus 81 ~eLE~~LA~FK~teAAl~fqSGF~~N~GvlSa~L~ 115 (392) T TIGR01825 81 EELEEKLAKFKKTEAALVFQSGFNTNQGVLSALLR 115 (392) T ss_pred HHHHHHHHHHCCCCEEEHHHCCHHHHHHHHHHHHC T ss_conf 99999987553820110103122377999987623 No 126 >TIGR01170 rplA_mito ribosomal protein L1, mitochondrial; InterPro: IPR005879 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L1 is the largest protein from the large ribosomal subunit. In Escherichia coli, L1 is known to bind to the 23S rRNA. This model describes the mitochondrial L1 protein.; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0005739 mitochondrion, 0015934 large ribosomal subunit. Probab=27.98 E-value=40 Score=13.88 Aligned_cols=48 Identities=10% Similarity=0.112 Sum_probs=31.9 Q ss_pred CCEEEECCCHHHHHHHCCCCCC-CCCEEEECCHHHCCCCHHHHHH-HHHHHHCCCCC Q ss_conf 7619970500111220127633-2200012012335877388999-99986023343 Q gi|254780605|r 141 EGVIVDVPYNPIDSLWKDRPQR-LYRKVAMQDMAYAGRESQEKIR-DICKILHQKEV 195 (609) Q Consensus 141 ~~~l~~~~~NlVD~iW~~rP~~-~~~~i~~~~~~~aG~s~~eKi~-~ir~~L~~~~~ 195 (609) ++.++.- .+||-+||.+-=.. ..+.++.|| ....-|. ++|+.|..+|+ T Consensus 80 ~a~~vGg-~dLikKI~~~ei~~sDf~~~~A~P------~~~~~L~s~~~~~Lg~Kgl 129 (155) T TIGR01170 80 GASYVGG-EDLIKKIKDGEIKVSDFDYLIAHP------DIVPELASQLAKLLGPKGL 129 (155) T ss_pred CCCEECC-HHHHCCCCCCCCCCCCCCCEEECC------HHHHHHHHHHHHHHCCCCC T ss_conf 8322351-232201448952211554168711------1467789999876046766 No 127 >pfam12095 DUF3571 Protein of unknown function (DUF3571). This family of proteins is functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 85 to 97 amino acids in length. Probab=27.97 E-value=23 Score=15.62 Aligned_cols=16 Identities=25% Similarity=0.345 Sum_probs=7.7 Q ss_pred EECCCEEECCCCCEEEE Q ss_conf 84686035687521500 Q gi|254780605|r 515 SNEPGYYRCGAFGIRIE 531 (609) Q Consensus 515 s~EPG~Y~~g~~GiRIE 531 (609) -+.||.|+. ||-||+| T Consensus 68 ei~pG~~lQ-WyaVRLE 83 (83) T pfam12095 68 EIGPGGTLQ-WYAVRLE 83 (83) T ss_pred ECCCCCEEE-EEEEECC T ss_conf 417898889-9999619 No 128 >PRK06849 hypothetical protein; Provisional Probab=27.23 E-value=41 Score=13.79 Aligned_cols=74 Identities=8% Similarity=0.064 Sum_probs=36.5 Q ss_pred HHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCC Q ss_conf 99853347619970500111220127633220001201233587738899999986023343067602842023443201 Q gi|254780605|r 134 QKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIR 213 (609) Q Consensus 134 ~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiR 213 (609) -..|...|+..+..+.+ ..|--|..+-.+..+.+|.-- .+...=++.|.+.+++.++| ++|..-+++.|+-..+ T Consensus 21 aR~l~~~Gh~V~~aD~~---~~~l~r~Sr~v~~~~~vP~P~--~d~~~y~~~Ll~Iv~~e~id-l~IP~~eev~~~a~~~ 94 (387) T PRK06849 21 ARSFHNAGHTVILADSL---KYPLSRFSRAVDGFYTIPSPK--WDPNAYIQALLSIVKRHNID-LLIPTCEEVFYLSLAK 94 (387) T ss_pred HHHHHHCCCEEEEECCC---CCCCCCHHHCEEEEEECCCCC--CCHHHHHHHHHHHHHHHCCC-EEEECCCHHHHHHHHH T ss_conf 99998789979998489---877542000011279869997--89899999999999983899-9997776899998657 No 129 >cd07025 Peptidase_S66 LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein. LD-carboxypeptidase (Muramoyltetrapeptide carboxypeptidase; EC 3.4.17.13; Merops family S66; initially described as Carboxypeptidase II) family also includes the microcin c7 self-immunity protein (MccF) as well as uncharacterized proteins including hypothetical proteins. LD-carboxypeptidase hydrolyzes the amide bond that links the dibasic amino acids to C-terminal D-amino acids. The physiological substrates of LD-carboxypeptidase are tetrapeptide fragments (such as UDP-MurNAc-tetrapeptides) that are produced when bacterial cell walls are degraded; they contain an L-configured residue (L-lysine or meso-diaminopimelic acid residue) as the penultimate residue and D-alanine as the ultimate residue. A possible role of LD-carboxypeptidase is in peptidoglycan recycling whereby the resulting tripeptide (precursor for murein synthesis) can be reconverted into peptidoglycan by attachment o Probab=27.22 E-value=41 Score=13.79 Aligned_cols=28 Identities=18% Similarity=0.396 Sum_probs=17.3 Q ss_pred HCCCCHHHHHHHHHHHHCCCCCEEEEEC Q ss_conf 3587738899999986023343067602 Q gi|254780605|r 174 YAGRESQEKIRDICKILHQKEVGAVFIC 201 (609) Q Consensus 174 ~aG~s~~eKi~~ir~~L~~~~~daliit 201 (609) |-.-+-+++.+.|.+.++..++++++-. T Consensus 42 ~~ag~~~~Ra~dl~~a~~dp~i~aI~~~ 69 (282) T cd07025 42 YLAGTDEERAADLNAAFADPEIKAIWCA 69 (282) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 7579999999999998528999899992 No 130 >PRK00141 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=26.76 E-value=37 Score=14.09 Aligned_cols=31 Identities=16% Similarity=0.277 Sum_probs=15.0 Q ss_pred CEEEEECCCCEECC-CC--------CCEEEEEECCCCHHHH Q ss_conf 87998303201056-12--------2214321037888884 Q gi|254780605|r 406 ELLLLDSGAQYVNG-TT--------DITRTIAIGDVDYEKK 437 (609) Q Consensus 406 ~~~LiDsG~qY~~G-tT--------DiTRT~~~G~~~~e~k 437 (609) .++|| .||+-+++ .+ -+-..+.+|+-.++.. T Consensus 359 ~iilI-~GG~~Kg~d~~~l~~~~~~~vk~v~l~G~~~~~i~ 398 (476) T PRK00141 359 SVIWV-AGGQLKGADIDELIATHGGRIKAALVLGADRAEIV 398 (476) T ss_pred CCEEE-ECCCCCCCCHHHHHHHHHHHCEEEEEECCCHHHHH T ss_conf 73999-44765777879999998633109999768989999 No 131 >PRK13928 rod shape-determining protein Mbl; Provisional Probab=26.64 E-value=41 Score=13.82 Aligned_cols=18 Identities=39% Similarity=0.582 Sum_probs=10.6 Q ss_pred CCCEEEEECCCCEECCCCCCEE Q ss_conf 8987998303201056122214 Q gi|254780605|r 404 KDELLLLDSGAQYVNGTTDITR 425 (609) Q Consensus 404 ~~~~~LiDsG~qY~~GtTDiTR 425 (609) +...+++|.|| ||+|++= T Consensus 146 ~~~~lV~DlGG----GT~Dvsv 163 (325) T PRK13928 146 PSGNMVVDIGG----GTTDIAV 163 (325) T ss_pred CCCEEEEEECC----CCEEEEE T ss_conf 87359999589----8378999 No 132 >pfam02388 FemAB FemAB family. The femAB operon codes for two nearly identical approximately 50-kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan. These proteins are also considered as a factor influencing the level of methicillin resistance. Probab=26.41 E-value=43 Score=13.69 Aligned_cols=28 Identities=14% Similarity=0.018 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHCCCEEEECCCHHHHHHH Q ss_conf 9999999853347619970500111220 Q gi|254780605|r 129 EVDLLQKSLDKIEGVIVDVPYNPIDSLW 156 (609) Q Consensus 129 ~~~~l~~~l~~~~~~l~~~~~NlVD~iW 156 (609) .++.+.+.+++.++..+.++-|+.-..| T Consensus 84 ~l~~Lk~~akk~~a~~lridP~~~~~~~ 111 (407) T pfam02388 84 FLKELKKYAKKKRALFLKIDPYIPYQLR 111 (407) T ss_pred HHHHHHHHHHHCCEEEEEECCCHHHHHC T ss_conf 9999999999869289997676454430 No 133 >COG1355 Predicted dioxygenase [General function prediction only] Probab=26.33 E-value=43 Score=13.68 Aligned_cols=30 Identities=20% Similarity=0.320 Sum_probs=19.8 Q ss_pred HHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCC Q ss_conf 2335877388999999860233430676028420 Q gi|254780605|r 172 MAYAGRESQEKIRDICKILHQKEVGAVFICDPSS 205 (609) Q Consensus 172 ~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~ 205 (609) .-|+|.++..--..+-+ ...+.++|-.|+. T Consensus 56 y~ySG~taa~~y~~l~~----~~~~~vVIlGPnH 85 (279) T COG1355 56 YRYSGPTAAHVYSALDE----GEPDTVVILGPNH 85 (279) T ss_pred CEECCHHHHHHHHHHHC----CCCCEEEEECCCC T ss_conf 57626338899998665----8998799989988 No 134 >COG0312 TldD Predicted Zn-dependent proteases and their inactivated homologs [General function prediction only] Probab=26.29 E-value=39 Score=13.98 Aligned_cols=42 Identities=29% Similarity=0.552 Sum_probs=26.7 Q ss_pred CCCCCCC-CCCCCCCCCCCCCCCCCCCEECCC------CEEEECCCEEECCCCC Q ss_conf 7643232-113560778862377898600569------8788468603568752 Q gi|254780605|r 481 HGVGHGV-GSFLPVHEGPQGISRTNQEPLLPG------MILSNEPGYYRCGAFG 527 (609) Q Consensus 481 HgtGHGV-G~~l~VHE~P~~is~~~~~~l~~G------mv~s~EPG~Y~~g~~G 527 (609) ..||||- ++| +-.| +.+..++.++|| |+=+++=|+|+.+.+| T Consensus 327 ~~TGnar~~~~---~~~p--~~rm~N~~i~~G~~s~eeli~~~~~Giyv~~~~g 375 (454) T COG0312 327 ESTGNARRGSY---AHVP--IPRMTNTYIEPGDYSFEELIEDVKRGLYVTNLWG 375 (454) T ss_pred CCCCCCCCCCC---CCCC--CCCCCEEEECCCCCCHHHHHHHCCCEEEEEECCC T ss_conf 95664435667---7787--6664205775899998999853397799980547 No 135 >cd07062 Peptidase_S66_mccF_like Microcin C7 self-immunity protein determines resistance to exogenous microcin C7. Microcin C7 self-immunity protein (mccF): MccF, a homolog of the LD-carboxypeptidase family, mediates resistance against exogenously added microcin C7 (MccC7), a ribosomally-encoded peptide antibiotic that contains a phosphoramidate linkage to adenosine monophosphate at its C-terminus. The plasmid-encoded mccF gene is transcribed in the opposite direction to the other five genes (mccA-E) and is required for the full expression of immunity but not for production. The catalytic triad residues (Ser, His, Glu) of LD-carboxypeptidase are also conserved in MccF, strongly suggesting that MccF shares the hydrolytic activity with LD-carboxypeptidases. Substrates of MccF have not been deduced, but could likely be microcin C7 precursors. The possible role of MccF is to defend producer cells against exogenous microcin from re-entering after having been exported. It is suggested that M Probab=26.20 E-value=43 Score=13.66 Aligned_cols=25 Identities=8% Similarity=0.243 Sum_probs=15.9 Q ss_pred CCHHHHHHHHHHHHCCCCCEEEEEC Q ss_conf 7738899999986023343067602 Q gi|254780605|r 177 RESQEKIRDICKILHQKEVGAVFIC 201 (609) Q Consensus 177 ~s~~eKi~~ir~~L~~~~~daliit 201 (609) -+-+++.+.|.+.++..++++++-. T Consensus 49 g~~~~Ra~dL~~a~~Dp~i~aI~~~ 73 (308) T cd07062 49 ASPEERAEELMAAFADPSIKAIIPT 73 (308) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEEC T ss_conf 9989999999998629998999991 No 136 >pfam10565 NMDAR2_C N-methyl D-aspartate receptor 2B3 C-terminus. This domain is found at the C-terminus of many NMDA-receptor proteins, many of which also carry the Ligated ion-channel family pfam00060 further upstream as well as the ANF_receptor family pfam01094. This region is predicted to be a large extra-cellular domain of the NMDA receptor proteins, being highly hydrophilic, and is thought to be integrally involved in the function of the receptor. The region also carries a number of potential N-glycosylation sites. Probab=25.72 E-value=40 Score=13.87 Aligned_cols=27 Identities=11% Similarity=0.085 Sum_probs=18.4 Q ss_pred HHHHHHHHHHCCCCCEEEEECCCCCHH Q ss_conf 899999986023343067602842023 Q gi|254780605|r 181 EKIRDICKILHQKEVGAVFICDPSSIA 207 (609) Q Consensus 181 eKi~~ir~~L~~~~~daliit~~d~Ia 207 (609) .|..+.++.||+.-+.+=.=-..|+|- T Consensus 246 kk~~k~kDsLKKR~~saK~~Re~sevE 272 (660) T pfam10565 246 KKRKKYKDSLKKRPASAKSRRELDEVE 272 (660) T ss_pred HHCCHHHHHHHCCCCCCCCCCCCCHHH T ss_conf 211224455532863334743001344 No 137 >PRK04308 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=25.61 E-value=30 Score=14.71 Aligned_cols=17 Identities=6% Similarity=0.106 Sum_probs=9.9 Q ss_pred HHHHHHHHHHCCCCEEEE Q ss_conf 999999999749979996 Q gi|254780605|r 18 VHNLRSCFDSLGMDAFLV 35 (609) Q Consensus 18 L~~LR~~m~~~~idayii 35 (609) ++.+| ++.++|.+..+. T Consensus 18 ~s~a~-~L~~~G~~v~~~ 34 (445) T PRK04308 18 ISMIA-YLRKNGAEVAAY 34 (445) T ss_pred HHHHH-HHHHCCCEEEEE T ss_conf 99999-999789919999 No 138 >PRK03803 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=25.61 E-value=40 Score=13.89 Aligned_cols=34 Identities=18% Similarity=0.174 Sum_probs=16.1 Q ss_pred CCCEEEEECCCCEECCC---------CCCEEEEEECCCCHHHHH Q ss_conf 89879983032010561---------222143210378888843 Q gi|254780605|r 404 KDELLLLDSGAQYVNGT---------TDITRTIAIGDVDYEKKY 438 (609) Q Consensus 404 ~~~~~LiDsG~qY~~Gt---------TDiTRT~~~G~~~~e~k~ 438 (609) ++.++|| .||+-+++- .-+...+.+|+-.+..++ T Consensus 345 ~~~iilI-~GG~~K~~d~~~l~~~~~~~v~~v~l~G~~~~~i~~ 387 (448) T PRK03803 345 EGKLVLI-AGGQGKGADFSELREPVAKYCRAVVLIGRDAELIEQ 387 (448) T ss_pred CCCEEEE-ECCCCCCCCHHHHHHHHHHCCEEEEEECCCHHHHHH T ss_conf 6868999-378666668799999997366099997959999999 No 139 >PRK02006 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=25.61 E-value=30 Score=14.78 Aligned_cols=35 Identities=14% Similarity=0.192 Sum_probs=17.0 Q ss_pred CCCEEEEECCCCEECCC--------CCCEE-EEEECCCCHHHHHH Q ss_conf 89879983032010561--------22214-32103788888433 Q gi|254780605|r 404 KDELLLLDSGAQYVNGT--------TDITR-TIAIGDVDYEKKYY 439 (609) Q Consensus 404 ~~~~~LiDsG~qY~~Gt--------TDiTR-T~~~G~~~~e~k~~ 439 (609) +..++|| .||+=++.- ....| .+.+|+-.++.++. T Consensus 390 ~~~iilI-~GG~~K~~d~~~L~~~~~~~~k~vil~G~~~~~i~~~ 433 (501) T PRK02006 390 AQRVVLI-AGGDGKGQDFSPLAAPVARWARAVVLIGRDAPAIRAA 433 (501) T ss_pred CCCEEEE-EECCCCCCCHHHHHHHHHHHCEEEEEECCCHHHHHHH T ss_conf 8974999-8057777797999999985370999989899999999 No 140 >cd00465 URO-D_CIMS_like The URO-D_CIMS_like protein superfamily includes bacterial and eukaryotic uroporphyrinogen decarboxylases (URO-D), coenzyme M methyltransferases and other putative bacterial methyltransferases, as well as cobalamine (B12) independent methionine synthases. Despite their sequence similarities, members of this family have clearly different functions. Uroporphyrinogen decarboxylase (URO-D) decarboxylates the four acetate side chains of uroporphyrinogen III (uro-III) to create coproporphyrinogen III, an important branching point of the tetrapyrrole biosynthetic pathway. The methyltransferases represented here are important for ability of methanogenic organisms to use other compounds than carbon dioxide for reduction to methane, and methionine synthases transfer a methyl group from a folate cofactor to L-homocysteine in a reaction requiring zinc. Probab=25.60 E-value=28 Score=15.00 Aligned_cols=44 Identities=11% Similarity=0.133 Sum_probs=32.0 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 667788888997217897888899998736677418654776432321 Q gi|254780605|r 441 TLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGADFAHGVGHGVG 488 (609) Q Consensus 441 t~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~dy~HgtGHGVG 488 (609) ..+|+|+++-.... .++..+..-+++.|.+.|-.|.-.+||||. T Consensus 244 ~~~lQGNLDP~~L~----~~~e~i~~~v~~il~~~g~~~IfNLGHGi~ 287 (306) T cd00465 244 KKTLVGGVDPGYLP----ATDEECIAKVEELVERLGPHYIINPDCGLG 287 (306) T ss_pred CEEEECCCCCHHHC----CCHHHHHHHHHHHHHHHCCCCEEECCCCCC T ss_conf 91587089846765----998999999999999848987797899889 No 141 >TIGR02480 fliN flagellar motor switch protein FliN; InterPro: IPR012826 Proteins that consist largely of this domain can be designated flagellar motor switch proteins: FliN. Longer proteins in which this region is a C-terminal domain typically are designated FliY. More distantly related sequences, outside of this family, are associated with type III secretion systems (TTSS) and include the surface presentation of antigens protein SpaO required or invasion of host cells by Salmonella enterica.; GO: 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0016020 membrane, 0044461 flagellin-based flagellum part. Probab=25.35 E-value=40 Score=13.87 Aligned_cols=26 Identities=31% Similarity=0.555 Sum_probs=17.4 Q ss_pred CCCCEECCCCEEEECCCEEECCCCCEEEEEEE Q ss_conf 89860056987884686035687521500259 Q gi|254780605|r 503 TNQEPLLPGMILSNEPGYYRCGAFGIRIENVL 534 (609) Q Consensus 503 ~~~~~l~~Gmv~s~EPG~Y~~g~~GiRIEn~v 534 (609) -|..+.--|-|+=+| ++|||||-+.+ T Consensus 49 VNG~~ia~GEvvV~~------dk~Giri~ei~ 74 (77) T TIGR02480 49 VNGRLIARGEVVVVE------DKFGIRITEIV 74 (77) T ss_pred ECCCEEEEEEEEEEC------CCEEEEEEEEE T ss_conf 889278889899867------96008989854 No 142 >TIGR02154 PhoB phosphate regulon transcriptional regulatory protein PhoB; InterPro: IPR011879 PhoB is a DNA-binding response regulator protein acting with PhoR in a 2-component system responding to phosphate ion. PhoB acts as a positive regulator of gene expression for phosphate-related genes such as phoA, phoS, phoE and ugpAB as well as itself . It is often found proximal to genes for the high-affinity phosphate ABC transporter (pstSCAB; GenProp0190) and presumably regulates these as well.; GO: 0000156 two-component response regulator activity, 0003677 DNA binding, 0000160 two-component signal transduction system (phosphorelay), 0006817 phosphate transport. Probab=25.28 E-value=45 Score=13.55 Aligned_cols=74 Identities=18% Similarity=0.193 Sum_probs=32.3 Q ss_pred EEECCCCC-CCHHHHHHHCCCCE-EECHHHHHHHHHHCCCCC-CEEEEC---CCCCCHHHHHHHHHCCCEEECCCCCCHH Q ss_conf 00035311-21567874125635-707445157786301357-428972---5317788975312139567515432115 Q gi|254780605|r 236 EIFFDKQY-INEQLKALLSAVAI-VLDMDMMDSRLVCLARTS-MPILID---PKWISYRFFKVIAQKNGVMVEGSDPSCL 309 (609) Q Consensus 236 ~Lfvd~~k-~~~~~~~~l~~~~~-~~~~~~~~~~l~~l~~~~-~~i~id---~~~~s~~~~~~l~~~~~~~i~~~~pI~~ 309 (609) +|.|++.- +.+-++-.+.+.+- +..-.+..+....+.... .=|++| |..+=..+...|+.... ...=||.. T Consensus 5 iLvVEDE~AirEl~~~~L~~~gy~v~~A~d~~~A~~~~~E~~PDLILLDWMLPG~SGIel~R~Lr~~~~---Tr~iPIIM 81 (226) T TIGR02154 5 ILVVEDEPAIRELIAYNLEKAGYDVVEAGDGDEALTLINERLPDLILLDWMLPGTSGIELARRLRREPE---TRAIPIIM 81 (226) T ss_pred EEEEECCHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHCCCCC---CCCCCEEE T ss_conf 567708268999999984168946998079799999986079988996147899756999987347633---14888177 Q ss_pred HHH Q ss_conf 666 Q gi|254780605|r 310 LRA 312 (609) Q Consensus 310 lRa 312 (609) |=| T Consensus 82 LTA 84 (226) T TIGR02154 82 LTA 84 (226) T ss_pred EEC T ss_conf 405 No 143 >cd00383 trans_reg_C Effector domain of response regulator. Bacteria and certain eukaryotes like protozoa and higher plants use two-component signal transduction systems to detect and respond to changes in the environment. The system consists of a sensor histidine kinase and a response regulator. The former autophosphorylates in a histidine residue on detecting an external stimulus. The phosphate is then transferred to an invariant aspartate residue in a highly conserved receiver domain of the response regulator. Phosphorylation activates a variable effector domain of the response regulator, which triggers the cellular response. The C-terminal effector domain contains DNA and RNA polymerase binding sites. Several dimers or monomers bind head to tail to small tandem repeats upstream of the genes. The RNA polymerase binding sites interact with the alpha or sigma subunite of RNA polymerase. Probab=25.20 E-value=45 Score=13.54 Aligned_cols=70 Identities=16% Similarity=0.206 Sum_probs=41.9 Q ss_pred EEEECCEE--CCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCC Q ss_conf 89965634--1399999999985334761997050011122012763322000120123358773889999998602334 Q gi|254780605|r 117 RLGLDSRL--HSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKE 194 (609) Q Consensus 117 ~IG~Dp~l--~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~ 194 (609) .+-++... .|..+++-|.--+.+.+- .+ ..+.|.+.+|.... ...-.+...-|.+||++|++.+ T Consensus 14 ~l~~~~~~i~Lt~~E~~lL~~L~~~~~~-vv-sr~~l~~~vw~~~~------------~~~~~~l~~~I~rLRkkl~~~~ 79 (95) T cd00383 14 EVTRDGEPVELTPKEFELLELLARNPGR-VL-SREQLLEAVWGDDY------------DVDDRTVDVHISRLRKKLEDDP 79 (95) T ss_pred EEEECCEEEECCHHHHHHHHHHHHCCCC-CC-CHHHHHHHHCCCCC------------CCCCCCHHHHHHHHHHHHHHCC T ss_conf 9999999987799999999999877997-45-69999998539764------------5574549999999999975268 Q ss_pred CEEEEE Q ss_conf 306760 Q gi|254780605|r 195 VGAVFI 200 (609) Q Consensus 195 ~dalii 200 (609) ...-+| T Consensus 80 ~~~~~I 85 (95) T cd00383 80 SNPRLI 85 (95) T ss_pred CCCCEE T ss_conf 999859 No 144 >PRK06724 hypothetical protein; Provisional Probab=25.18 E-value=39 Score=13.95 Aligned_cols=16 Identities=38% Similarity=0.544 Sum_probs=10.4 Q ss_pred CCCEE---ECCCCCEEEEE Q ss_conf 68603---56875215002 Q gi|254780605|r 517 EPGYY---RCGAFGIRIEN 532 (609) Q Consensus 517 EPG~Y---~~g~~GiRIEn 532 (609) -|||| .++--|||||= T Consensus 76 a~GYY~v~FeDPdGiriEV 94 (102) T PRK06724 76 SEGYYTIDFYDPNGFIIEV 94 (102) T ss_pred CCCEEEEEEECCCCEEEEE T ss_conf 8855999988899669998 No 145 >TIGR01584 citF citrate lyase, alpha subunit; InterPro: IPR006472 These sequences, from both Gram-positive and Gram-negative bacteria, represent the alpha subunit of the holoenzyme citrate lyase composed of alpha (2.8.3.10 from EC), beta, and acyl carrier protein subunits in a stoichiometric relationship of 6:6:6. Citrate lyase is an enzyme which converts citrate to oxaloacetate. In bacteria, this reaction is involved in citrate fermentation. The alpha subunit catalyzes the reaction Acetyl-CoA + citrate = acetate + (3S)-citryl-CoA. The protein from Lactococcus lactis subsp. lactis has been experimentally characterised . ; GO: 0008814 citrate CoA-transferase activity, 0006084 acetyl-CoA metabolic process, 0005737 cytoplasm, 0009346 citrate lyase complex. Probab=25.16 E-value=45 Score=13.54 Aligned_cols=117 Identities=19% Similarity=0.165 Sum_probs=56.3 Q ss_pred CCHHHHHHHHHCCCEEECCCCC----CHHHHHCCCHHHHHHHHHH--------HHHHH-----------HHHHHHHHH-H Q ss_conf 7788975312139567515432----1156661999999879999--------84438-----------999999875-3 Q gi|254780605|r 285 ISYRFFKVIAQKNGVMVEGSDP----SCLLRATKNKVEIEGMQTA--------HIQDG-----------VAMVYFLFW-F 340 (609) Q Consensus 285 ~s~~~~~~l~~~~~~~i~~~~p----I~~lRaiKn~~EI~~mr~A--------hi~Dg-----------vAl~kfl~w-L 340 (609) +|..+.|..-..=..+...-|| .-..|.-|||.||.-.|.| ..+|| +|++|||.. | T Consensus 196 ~PaSI~Q~~VD~vV~vd~vGdP~kI~sGAtR~t~dPkeLlIA~~~~~VI~aSGYfKdGFs~QtGtGGAaLAVtrfL~e~M 275 (496) T TIGR01584 196 LPASIKQTQVDYVVKVDAVGDPKKIGSGATRFTKDPKELLIAKKAADVIVASGYFKDGFSLQTGTGGAALAVTRFLKEKM 275 (496) T ss_pred CCEEEECCCCCEEEEEEEECCCCCEECCCCCCCCCCHHHHHHHHHHHEEEECCCCCCCCEEECCCHHHHHHHHHHHHHHH T ss_conf 76012003331489983423988500133006898678888642300022268313771565363137999999888988 Q ss_pred HH-HHC-----CCCCCCHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCC----CCCC----CCCCCCCCCCCCCEEECCC Q ss_conf 32-100-----12453018899999-99875433027855554100001----2335----4322224554332050389 Q gi|254780605|r 341 YS-QSL-----ETITEIDIIKKLER-CREEIGCKMRNPLRDIAFNTIAA----SGPH----AAIIHYQATVQSNRLLQKD 405 (609) Q Consensus 341 ~~-~~~-----~~iTE~~ia~~Le~-~R~~~~~~~~~~~~~~SF~TIva----~G~N----~AipHY~~~~~s~~~i~~~ 405 (609) .+ +++ .|+|.-=+ ..+|+ |... .|.--|||.-++ -++| +|.-.--|..+ ...+++= T Consensus 276 ~~~NikA~fGLGGIt~~~V-~LhEeGLi~~-------L~DvQ~FD~~A~~S~~~N~~h~Ei~As~YAnP~~k-Ga~v~~L 346 (496) T TIGR01584 276 IEKNIKASFGLGGITKQMV-DLHEEGLIDK-------LFDVQSFDLKAVESIAKNPKHVEIDASLYANPANK-GAIVNKL 346 (496) T ss_pred HHHCCEEEECCCCCCHHHH-HHHHHHHHHH-------HCCCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCC-CCCCCCC T ss_conf 8609507641677136789-8875234664-------42335742789999850898167821123688846-8520147 Q ss_pred CEEEE Q ss_conf 87998 Q gi|254780605|r 406 ELLLL 410 (609) Q Consensus 406 ~~~Li 410 (609) ++|++ T Consensus 347 d~ViL 351 (496) T TIGR01584 347 DVVIL 351 (496) T ss_pred CEEEE T ss_conf 56787 No 146 >COG3536 Uncharacterized protein conserved in bacteria [Function unknown] Probab=25.12 E-value=44 Score=13.59 Aligned_cols=14 Identities=43% Similarity=0.819 Sum_probs=10.1 Q ss_pred EEEECCCEEECCCCCEEE Q ss_conf 788468603568752150 Q gi|254780605|r 513 ILSNEPGYYRCGAFGIRI 530 (609) Q Consensus 513 v~s~EPG~Y~~g~~GiRI 530 (609) +.+||| .|.|+||| T Consensus 62 i~~i~p----~GnYavri 75 (120) T COG3536 62 IRDIEP----VGNYAVRI 75 (120) T ss_pred EEEEEE----CCCEEEEE T ss_conf 886676----37357898 No 147 >COG3710 CadC DNA-binding winged-HTH domains [Transcription] Probab=25.10 E-value=45 Score=13.53 Aligned_cols=65 Identities=18% Similarity=0.282 Sum_probs=42.7 Q ss_pred ECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCC-EEEEECC Q ss_conf 413999999999853347619970500111220127633220001201233587738899999986023343-0676028 Q gi|254780605|r 124 LHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEV-GAVFICD 202 (609) Q Consensus 124 l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~-daliit~ 202 (609) -++...++-|.--++.. ...+. .+.|+|.+|.+|-... .+-.+-|..||+.|++.+- .-++.|- T Consensus 31 ~l~~~~~~lL~~L~e~~-geVvs-k~eL~~~VW~~~~v~~-------------~~Ltq~I~~LRr~L~d~~~~~~~I~Tv 95 (148) T COG3710 31 KLGPRELKLLSLLLERA-GEVVS-KDELLDAVWPGRIVTV-------------NTLTQAISALRRALRDIGDGHRLIATV 95 (148) T ss_pred EECHHHHHHHHHHHHCC-CCEEC-HHHHHHHHCCCCCCCC-------------CHHHHHHHHHHHHHHCCCCCCEEEEEE T ss_conf 86667889999998546-86052-9999998579962644-------------549999999999863468766389983 Q ss_pred C Q ss_conf 4 Q gi|254780605|r 203 P 203 (609) Q Consensus 203 ~ 203 (609) | T Consensus 96 P 96 (148) T COG3710 96 P 96 (148) T ss_pred C T ss_conf 7 No 148 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=25.04 E-value=45 Score=13.52 Aligned_cols=66 Identities=17% Similarity=0.207 Sum_probs=36.9 Q ss_pred HHCCCC-CEEEECCEECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHH Q ss_conf 743799-8899656341399999999985334761997050011122012763322000120123358773889999998 Q gi|254780605|r 110 EHGFVG-LRLGLDSRLHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICK 188 (609) Q Consensus 110 ~~~~~g-~~IG~Dp~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~ 188 (609) +.+.+- .++-+|-.|+ +....+|-..+.+.|..++. .|. .+.+-+.|+.+ T Consensus 50 ~~~r~~g~~~P~DVHLM-v~~pd~~~~~Fa~aGA~~I~----------------------vH~------Ea~~h~~R~l~ 100 (216) T TIGR01163 50 EALRKYGTKLPIDVHLM-VENPDRYIEDFAEAGADIIT----------------------VHA------EATEHIHRLLQ 100 (216) T ss_pred HHHHHHCCCCCEEEEEC-CCCHHHHHHHHHHHCCCEEE----------------------EEC------CCCCCHHHHHH T ss_conf 98874079521266303-57857778899970899899----------------------843------77626799999 Q ss_pred HHCCCCCEEEEECCCC Q ss_conf 6023343067602842 Q gi|254780605|r 189 ILHQKEVGAVFICDPS 204 (609) Q Consensus 189 ~L~~~~~daliit~~d 204 (609) .+|+.|+.|-+.-+|+ T Consensus 101 ~Ik~~G~~AG~v~NP~ 116 (216) T TIGR01163 101 LIKELGAKAGIVLNPA 116 (216) T ss_pred HHHHCCCCEEEEECCC T ss_conf 9997189706886799 No 149 >PRK06201 hypothetical protein; Validated Probab=24.76 E-value=46 Score=13.49 Aligned_cols=58 Identities=21% Similarity=0.452 Sum_probs=34.3 Q ss_pred EECCCCEEEEECCCCEECCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH--HHHCCCC Q ss_conf 50389879983032010561222143210378888843336677888889972178978888999987366--7741865 Q gi|254780605|r 401 LLQKDELLLLDSGAQYVNGTTDITRTIAIGDVDYEKKYYFTLVLKGMISVSTARFPQRTRGCDLDSIARIF--LWKYGAD 478 (609) Q Consensus 401 ~i~~~~~~LiDsG~qY~~GtTDiTRT~~~G~~~~e~k~~yt~VLk~~ial~~a~fp~g~~g~~lD~~aR~~--l~~~g~d 478 (609) ..++|+++.+|.++. ..+ =.+|+... ..+++ .|..|.-+|...|+. |.+.|+- T Consensus 76 ~~~pGdVlVid~~g~-------~~~-a~~G~~~a--------------~~a~~---~G~~G~ViDG~vRD~~~i~~~~fP 130 (221) T PRK06201 76 EARPGDVIVVDGGGD-------LTN-ALVGEIML--------------EIAQR---RGVAGVVIDGAVRDVGAIREMGFP 130 (221) T ss_pred HCCCCCEEEEECCCC-------CCE-EEHHHHHH--------------HHHHH---CCCCEEEEECCCCCHHHHHHCCCC T ss_conf 369997899947889-------762-70089999--------------99998---698689876675899999867998 Q ss_pred -CCCCC Q ss_conf -47764 Q gi|254780605|r 479 -FAHGV 483 (609) Q Consensus 479 -y~Hgt 483 (609) |..|+ T Consensus 131 vfarg~ 136 (221) T PRK06201 131 VFARGV 136 (221) T ss_pred EEECCC T ss_conf 795620 No 150 >KOG0622 consensus Probab=24.72 E-value=46 Score=13.48 Aligned_cols=91 Identities=18% Similarity=0.176 Sum_probs=42.0 Q ss_pred CCCHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 199999987999984----------4389999998753321001245301889999999875433027855554100001 Q gi|254780605|r 313 TKNKVEIEGMQTAHI----------QDGVAMVYFLFWFYSQSLETITEIDIIKKLERCREEIGCKMRNPLRDIAFNTIAA 382 (609) Q Consensus 313 iKn~~EI~~mr~Ahi----------~DgvAl~kfl~wL~~~~~~~iTE~~ia~~Le~~R~~~~~~~~~~~~~~SF~TIva 382 (609) +-|+.|+..++++|- -|+.|.|+ -+.+-|.+-.++.+.|+ .-++.+ -..+|.||. +| T Consensus 151 fDne~el~kv~~~hP~a~llLrIatdds~a~~~------l~~KFG~~~~~~~~lLd-~ak~l~----lnvvGvsfH--vG 217 (448) T KOG0622 151 FDNEEELEKVAKSHPNANLLLRIATDDSTATCR------LNLKFGCSLDNCRHLLD-MAKELE----LNVVGVSFH--VG 217 (448) T ss_pred ECCHHHHHHHHHHCCCCEEEEEECCCCCCCCCC------CCCCCCCCHHHHHHHHH-HHHHCC----CEEEEEEEE--EC T ss_conf 258999999997499733899973688745555------66756877789999999-998718----658889987--25 Q ss_pred CCCCCCCCCCCCCCCCCEEEC-----CCCEEEEECCCCE Q ss_conf 233543222245543320503-----8987998303201 Q gi|254780605|r 383 SGPHAAIIHYQATVQSNRLLQ-----KDELLLLDSGAQY 416 (609) Q Consensus 383 ~G~N~AipHY~~~~~s~~~i~-----~~~~~LiDsG~qY 416 (609) ||-.-.-.-+++-...-..++ .-.+.++|.||.+ T Consensus 218 Sgc~d~~~y~~Ai~dAr~vfd~g~e~Gf~m~~LdiGGGf 256 (448) T KOG0622 218 SGCTDLQAYRDAISDARNVFDMGAELGFEMDILDIGGGF 256 (448) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 887878999999999999999998618627886137888 No 151 >TIGR01792 urease_alph urease, alpha subunit; InterPro: IPR005848 Urease (urea amidohydrolase, 3.5.1.5 from EC) catalyses the hydrolysis of urea to form ammonia and carbamate. The subunit composition of urease from different sources varies , but each holoenzyme consists of four structural domains : three structural domains and a nickel-binding catalytic domain common to amidohydrolases . Urease is unique among nickel metalloenzymes in that it catalyses a hydrolysis rather than a redox reaction. In Helicobacter pylori, the gamma and beta domains are fused and called the alpha subunit (IPR008223 from INTERPRO). The catalytic subunit (called beta or B) has the same organization as the Klebsiella alpha subunit. Jack bean (Canavalia ensiformis) urease has a fused gamma-beta-alpha organization (IPR008221 from INTERPRO). This entry describes the urease alpha subunit UreC (designated beta or B chain, UreB in Helicobacter species). Urease (3.5.1.5 from EC) belongs to MEROPS peptidase family M38 (clan MJ). ; GO: 0009039 urease activity, 0016151 nickel ion binding, 0006807 nitrogen compound metabolic process. Probab=24.70 E-value=22 Score=15.74 Aligned_cols=64 Identities=31% Similarity=0.432 Sum_probs=45.0 Q ss_pred HHHHHHHC--C-C-C------C----------CCCCCCCCCCCCC--------CCCCCCCCCCCCCCEECCCCEE----- Q ss_conf 73667741--8-6-5------4----------7764323211356--------0778862377898600569878----- Q gi|254780605|r 468 ARIFLWKY--G-A-D------F----------AHGVGHGVGSFLP--------VHEGPQGISRTNQEPLLPGMIL----- 514 (609) Q Consensus 468 aR~~l~~~--g-~-d------y----------~HgtGHGVG~~l~--------VHE~P~~is~~~~~~l~~Gmv~----- 514 (609) .|.+|-.- . + | | .||.+|=||+ .+ .=| |-+|+-.-+.+|+-|||- T Consensus 393 ~RGpL~gD~Ps~~NDNnRvkRYvAKYTINPAIthGi~d~vGS-ievGK~ADlvlWe-P~fFGvKP~~vlKgG~Ia~a~~G 470 (605) T TIGR01792 393 QRGPLDGDRPSEYNDNNRVKRYVAKYTINPAITHGISDYVGS-IEVGKLADLVLWE-PAFFGVKPDMVLKGGLIAWAIAG 470 (605) T ss_pred HCCCCCCCCCCCCCCCCEEEEEEEEECCCHHHHCCCCCCCCE-ECHHHHHHHHHCC-CCCCCCCCCEEECCCEEEECCCC T ss_conf 157136766667888743655662001465875562230000-2000041132108-64355466568807766512676 Q ss_pred -------EECCCEEEC--CCCCEEEEEE Q ss_conf -------846860356--8752150025 Q gi|254780605|r 515 -------SNEPGYYRC--GAFGIRIENV 533 (609) Q Consensus 515 -------s~EPG~Y~~--g~~GiRIEn~ 533 (609) |.||=+|+| |.+|==+.++ T Consensus 471 DaNaSiPTpePv~~R~mfg~~G~al~~T 498 (605) T TIGR01792 471 DANASIPTPEPVLYRPMFGAYGRALQST 498 (605) T ss_pred CCCCCCCCCCCCCCCCHHHHCCCCCCCE T ss_conf 8777878876334331255506542560 No 152 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=24.40 E-value=46 Score=13.44 Aligned_cols=58 Identities=16% Similarity=0.249 Sum_probs=39.1 Q ss_pred CCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHH Q ss_conf 05001112201276332200012012335877388999999860233430676028420234 Q gi|254780605|r 147 VPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAW 208 (609) Q Consensus 147 ~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaW 208 (609) +..-+=+.||-| | .....||+.-..-+++...+--.+|.+.|+.+| .-++.+||+=.| T Consensus 35 VqTkMS~TIfLd-P-~s~ktVyv~vrNTS~~~~~~l~~~i~~~L~akG--Y~iv~~P~~A~Y 92 (243) T PRK13731 35 VKTQMSETIWLE-P-ASERTVFLQIKNTSDKDMSGLQGKIADAVKAKG--YQVVTSPDKAYY 92 (243) T ss_pred EECEECCCEEEC-C-CCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCC--EEEECCHHHCEE T ss_conf 222000438866-8-545579999851787724567899999998589--098678657607 No 153 >COG2044 Predicted peroxiredoxins [General function prediction only] Probab=24.35 E-value=46 Score=13.43 Aligned_cols=37 Identities=14% Similarity=0.178 Sum_probs=25.4 Q ss_pred EEEECCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEECCC Q ss_conf 676028420234432013553778700233220147862000353 Q gi|254780605|r 197 AVFICDPSSIAWIFNIRGFDIPCSPYPLSRAILYADGKAEIFFDK 241 (609) Q Consensus 197 aliit~~d~IaWLlNiRG~Di~~tPv~~a~aiI~~~~~~~Lfvd~ 241 (609) .+++|+.++..|=+ |.|++.|-+....+-++.+|... T Consensus 6 ~i~~t~G~~~~~r~--------ya~f~~A~~a~smg~dV~iF~t~ 42 (120) T COG2044 6 LIVVTSGPNNPERA--------YAPFVMATAAASMGYDVTIFFTM 42 (120) T ss_pred EEEEECCCCCHHHH--------HHHHHHHHHHHHCCCCEEEEEEE T ss_conf 99981599987999--------86999999998679956999981 No 154 >KOG3892 consensus Probab=24.03 E-value=19 Score=16.11 Aligned_cols=39 Identities=13% Similarity=0.220 Sum_probs=24.9 Q ss_pred EECCHHHCCCCHHHHHHHHHHHHCCCCCEEE---EECCCCCH Q ss_conf 1201233587738899999986023343067---60284202 Q gi|254780605|r 168 AMQDMAYAGRESQEKIRDICKILHQKEVGAV---FICDPSSI 206 (609) Q Consensus 168 ~~~~~~~aG~s~~eKi~~ir~~L~~~~~dal---iit~~d~I 206 (609) |-.+..+.-+.+.|-++-+.+.|-+.|+..+ ++|+|.++ T Consensus 77 fGvDFS~dte~~~eGvAlVAr~ll~hGvtsf~Pt~~tsp~~~ 118 (407) T KOG3892 77 FGVDFSQDTEDVGEGVALVARQLLSHGVTSFCPTLVTSPPEV 118 (407) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 244345651566547999999998658987898322388520 No 155 >COG2179 Predicted hydrolase of the HAD superfamily [General function prediction only] Probab=23.72 E-value=48 Score=13.35 Aligned_cols=90 Identities=21% Similarity=0.187 Sum_probs=47.6 Q ss_pred HHHHHHHCCCCCEEEECCEECCHH------HHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCH---HHC Q ss_conf 998887437998899656341399------99999998533476199705001112201276332200012012---335 Q gi|254780605|r 105 HAWISEHGFVGLRLGLDSRLHSSF------EVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDM---AYA 175 (609) Q Consensus 105 ~~wl~~~~~~g~~IG~Dp~l~s~~------~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~---~~a 175 (609) .+||..+.-+|..+-+|-.|++.+ +.+.+...++.++++++-++.|= +.+...-.-.+.. --| T Consensus 20 ~~~L~~~Gikgvi~DlDNTLv~wd~~~~tpe~~~W~~e~k~~gi~v~vvSNn~--------e~RV~~~~~~l~v~fi~~A 91 (175) T COG2179 20 PDILKAHGIKGVILDLDNTLVPWDNPDATPELRAWLAELKEAGIKVVVVSNNK--------ESRVARAAEKLGVPFIYRA 91 (175) T ss_pred HHHHHHCCCCEEEEECCCCEECCCCCCCCHHHHHHHHHHHHCCCEEEEEECCC--------HHHHHHHHHHCCCCEEECC T ss_conf 99999759818998166751104699899999999999986597799981897--------8888766652597234022 Q ss_pred CCCHHHHHHHHHHHHCCCCCEEEEECC Q ss_conf 877388999999860233430676028 Q gi|254780605|r 176 GRESQEKIRDICKILHQKEVGAVFICD 202 (609) Q Consensus 176 G~s~~eKi~~ir~~L~~~~~daliit~ 202 (609) ++....++.+-.+.|.-.--+...|.+ T Consensus 92 ~KP~~~~fr~Al~~m~l~~~~vvmVGD 118 (175) T COG2179 92 KKPFGRAFRRALKEMNLPPEEVVMVGD 118 (175) T ss_pred CCCCHHHHHHHHHHCCCCHHHEEEECC T ss_conf 596279999999980998368799851 No 156 >KOG1684 consensus Probab=23.72 E-value=33 Score=14.44 Aligned_cols=14 Identities=7% Similarity=0.238 Sum_probs=9.1 Q ss_pred CCCCCEEEEEECCC Q ss_conf 67650077895783 Q gi|254780605|r 59 FTGSAGIAIVLRQK 72 (609) Q Consensus 59 FtGSaG~aiIt~~~ 72 (609) ++|+.+-.+||.|+ T Consensus 42 ~e~~~~~r~itLNR 55 (401) T KOG1684 42 VEGKGCARVITLNR 55 (401) T ss_pred EECCCCEEEEEECC T ss_conf 72477446899468 No 157 >TIGR03551 F420_cofH 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit. This enzyme, together with CofG, complete the biosynthesis of 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, the chromophore of coenzyme F420. The chromophore is also used in cyanobacteria DNA photolyases. Probab=23.64 E-value=44 Score=13.61 Aligned_cols=42 Identities=24% Similarity=0.076 Sum_probs=21.4 Q ss_pred CHHH-HHHHHHHHHHHHHHHHHHHHHHHHH-HCCCCCCCHHHHHHHHHHHH Q ss_conf 9999-9879999844389999998753321-00124530188999999987 Q gi|254780605|r 315 NKVE-IEGMQTAHIQDGVAMVYFLFWFYSQ-SLETITEIDIIKKLERCREE 363 (609) Q Consensus 315 n~~E-I~~mr~Ahi~DgvAl~kfl~wL~~~-~~~~iTE~~ia~~Le~~R~~ 363 (609) +..+ ++-++.||..- + ++=+ .. .+-+-|..+.++.|..+|.. T Consensus 177 ~~~~wl~~~~~Ah~lG-i---~tta---tml~G~gEt~eerv~hL~~lR~l 220 (343) T TIGR03551 177 STAEWIEIIKTAHKLG-I---PTTA---TIMYGHVETPEHWVDHLLILREI 220 (343) T ss_pred CHHHHHHHHHHHHHCC-C---CCCC---CEEECCCCCHHHHHHHHHHHHHH T ss_conf 9999999999999859-9---7202---23427889999999999999986 No 158 >COG1212 KdsB CMP-2-keto-3-deoxyoctulosonic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=23.46 E-value=48 Score=13.32 Aligned_cols=66 Identities=12% Similarity=0.287 Sum_probs=32.0 Q ss_pred CCCCCHHHHHHHHCCCCCEEEECCEECCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCC Q ss_conf 68999999888743799889965634139999999998533476199705001112201276332200012012335877 Q gi|254780605|r 99 IAIEPLHAWISEHGFVGLRLGLDSRLHSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRE 178 (609) Q Consensus 99 ~~~~~~~~wl~~~~~~g~~IG~Dp~l~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s 178 (609) .+-.++..|..++..+. |.|.-.+-. .-++..+.+++.|+.-+-++.| | +| T Consensus 25 I~GkpmI~rV~e~a~~s---~~~rvvVAT-Dde~I~~av~~~G~~avmT~~~-------------------h------~S 75 (247) T COG1212 25 IGGKPMIVRVAERALKS---GADRVVVAT-DDERIAEAVQAFGGEAVMTSKD-------------------H------QS 75 (247) T ss_pred HCCCHHHHHHHHHHHHC---CCCEEEEEC-CCHHHHHHHHHHCCEEEECCCC-------------------C------CC T ss_conf 37960789999999873---787289974-9889999999709789961788-------------------8------98 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 388999999860233 Q gi|254780605|r 179 SQEKIRDICKILHQK 193 (609) Q Consensus 179 ~~eKi~~ir~~L~~~ 193 (609) -.+++..+.+.|.-. T Consensus 76 GTdR~~Ev~~~l~~~ 90 (247) T COG1212 76 GTDRLAEVVEKLGLP 90 (247) T ss_pred CCHHHHHHHHHCCCC T ss_conf 008999999965999 No 159 >TIGR02145 Fib_succ_major Fibrobacter succinogenes major paralogous domain; InterPro: IPR011871 This domain of about 175 to 200 amino acids is found, in from one to five copies, in over 50 proteins in Fibrobacter succinogenes S85, an obligate anaerobe of the rumen. Many members of this family have an apparent lipoprotein signal sequence. Conserved cysteine residues, suggestive of disulphide bond formation, are also consistent with an extracytoplasmic location for this domain. This domain can also be found in small numbers of proteins in Chlorobium tepidum and Bacteroides thetaiotaomicron.. Probab=23.06 E-value=43 Score=13.68 Aligned_cols=20 Identities=20% Similarity=0.444 Sum_probs=10.4 Q ss_pred CCCCEEEEECCCCCHHHHHHHHC Q ss_conf 96717998668999999888743 Q gi|254780605|r 90 DTALFTIKNIAIEPLHAWISEHG 112 (609) Q Consensus 90 ~~~~~~~~~~~~~~~~~wl~~~~ 112 (609) |+..|++.++|... |+++|| T Consensus 7 DGq~YkTvkIG~Q~---WMAENL 26 (245) T TIGR02145 7 DGQTYKTVKIGSQV---WMAENL 26 (245) T ss_pred CCCCCCEEEECCEE---EHHHHC T ss_conf 89773678868807---725415 No 160 >PRK10529 DNA-binding transcriptional activator KdpE; Provisional Probab=22.77 E-value=50 Score=13.23 Aligned_cols=27 Identities=19% Similarity=0.000 Sum_probs=12.5 Q ss_pred HHHHHH---HHHHHHCCC--CCCCCCCCCCCCC Q ss_conf 999987---366774186--5477643232113 Q gi|254780605|r 463 DLDSIA---RIFLWKYGA--DFAHGVGHGVGSF 490 (609) Q Consensus 463 ~lD~~a---R~~l~~~g~--dy~HgtGHGVG~~ 490 (609) .||..- |+-|.+.+- .|.+ |=+|+||- T Consensus 191 ~idv~I~rLRkKl~~~~~~~~~I~-TvrG~GYr 222 (225) T PRK10529 191 YLRIYMGHLRQKLEQDPTRPRHFI-TETGIGYR 222 (225) T ss_pred CHHHHHHHHHHHHCCCCCCCCCEE-EECCCCEE T ss_conf 799999999997200789998289-56781656 No 161 >PRK01773 hscB co-chaperone HscB; Provisional Probab=22.68 E-value=26 Score=15.17 Aligned_cols=46 Identities=11% Similarity=0.068 Sum_probs=25.2 Q ss_pred EECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCC Q ss_conf 1201233587738899999986023343067602842023443201 Q gi|254780605|r 168 AMQDMAYAGRESQEKIRDICKILHQKEVGAVFICDPSSIAWIFNIR 213 (609) Q Consensus 168 ~~~~~~~aG~s~~eKi~~ir~~L~~~~~daliit~~d~IaWLlNiR 213 (609) ..||.+|++.|..+|...+...-.-+.+-..+-..+.=..|||-++ T Consensus 31 ~~HPDkf~~~s~~Ek~~s~~~Ss~iN~AY~tLkdpl~RA~YLL~L~ 76 (173) T PRK01773 31 SLHPDNFANSSAQEQRLAMQKSAEVNDALQILKDPILRAEAIIALH 76 (173) T ss_pred HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHC T ss_conf 8696856689999999999999999999999748888999999861 No 162 >cd00717 URO-D Uroporphyrinogen decarboxylase (URO-D) is a dimeric cytosolic enzyme that decarboxylates the four acetate side chains of uroporphyrinogen III (uro-III) to create coproporphyrinogen III, without requiring any prosthetic groups or cofactors. This reaction is located at the branching point of the tetrapyrrole biosynthetic pathway, leading to the biosynthesis of heme, chlorophyll or bacteriochlorophyll. URO-D deficiency is responsible for the human genetic diseases familial porphyria cutanea tarda (fPCT) and hepatoerythropoietic porphyria (HEP). Probab=22.45 E-value=38 Score=14.05 Aligned_cols=44 Identities=16% Similarity=0.305 Sum_probs=31.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC--CCCCCCCCCCC Q ss_conf 6677888889972178978888999987366774186--54776432321 Q gi|254780605|r 441 TLVLKGMISVSTARFPQRTRGCDLDSIARIFLWKYGA--DFAHGVGHGVG 488 (609) Q Consensus 441 t~VLk~~ial~~a~fp~g~~g~~lD~~aR~~l~~~g~--dy~HgtGHGVG 488 (609) ..+++|+++-.... | ...++...+++.|...+- .|.-.+||||- T Consensus 273 ~~~lQGNlDP~~L~---~-~~e~i~~~~~~il~~~~~~~g~IfnLGHGI~ 318 (335) T cd00717 273 KVALQGNLDPALLY---A-PKEAIEKEVKRILKAFGGAPGHIFNLGHGIL 318 (335) T ss_pred CEEEECCCCHHHHC---C-CHHHHHHHHHHHHHHHCCCCCEEEECCCCCC T ss_conf 82796689878976---9-9999999999999984899985987999869 No 163 >PRK12727 flagellar biosynthesis regulator FlhF; Provisional Probab=22.28 E-value=51 Score=13.16 Aligned_cols=23 Identities=17% Similarity=0.161 Sum_probs=14.0 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 55899999999999749979996088 Q gi|254780605|r 13 KTFERVHNLRSCFDSLGMDAFLVPRV 38 (609) Q Consensus 13 ~~~erL~~LR~~m~~~~idayiip~~ 38 (609) ++.|=|.+.|+. .|-||.|+.+- T Consensus 10 dMReAL~~VRee---LG~DAVILSNR 32 (557) T PRK12727 10 DMRTAFRMVREE---HGPDAVILSNR 32 (557) T ss_pred CHHHHHHHHHHH---HCCCEEEEECC T ss_conf 999999999998---78997997288 No 164 >COG4668 MtlA Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism] Probab=22.00 E-value=34 Score=14.34 Aligned_cols=51 Identities=24% Similarity=0.313 Sum_probs=25.1 Q ss_pred CCCCCCCCCCCCCCCCEEECCCCEEEEEC--CCCEECCCCCCEEEEEEC--CCCHHHHHH Q ss_conf 33543222245543320503898799830--320105612221432103--788888433 Q gi|254780605|r 384 GPHAAIIHYQATVQSNRLLQKDELLLLDS--GAQYVNGTTDITRTIAIG--DVDYEKKYY 439 (609) Q Consensus 384 G~N~AipHY~~~~~s~~~i~~~~~~LiDs--G~qY~~GtTDiTRT~~~G--~~~~e~k~~ 439 (609) |.-=|+||. +.++-..+.+.+++.+-. |-.+ | -|-+-.+.+| .-++|+-.+ T Consensus 54 GngIAIPHg--t~e~k~~V~kTgv~i~q~p~GV~w--g-~g~~a~~viGIAak~~EHl~~ 108 (142) T COG4668 54 GNGIAIPHG--TGEAKDLVLKTGVVIVQFPEGVDW--G-DGQIAYLVIGIAAKSDEHLQL 108 (142) T ss_pred CCCEECCCC--CHHHHHHHHHCCEEEEECCCCCCC--C-CCCEEEEEEEEECCCHHHHHH T ss_conf 474404889--758999999718499977876215--9-996699999984298689999 No 165 >PRK13299 tRNA CCA-pyrophosphorylase; Provisional Probab=21.64 E-value=35 Score=14.26 Aligned_cols=30 Identities=23% Similarity=0.547 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHH--HHHCCCHHH-HHHHHHH Q ss_conf 99999999999997--441180799-9999973 Q gi|254780605|r 576 KWCNDYHRRVYTSL--APLIEDQEV-LSWLFSV 605 (609) Q Consensus 576 ~wln~Yh~~v~~~~--~p~l~~~~~-~~wl~~~ 605 (609) .|+-+--+.+++.. ++..+.+|+ ++||++. T Consensus 361 p~iG~iL~~l~~~~l~g~~~n~~eaLl~~l~~~ 393 (394) T PRK13299 361 PWLGETLRKIEEAIVTGELENEKEAILEWLDEH 393 (394) T ss_pred HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 899999999999981699999699999999974 No 166 >COG3290 CitA Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms] Probab=21.64 E-value=26 Score=15.19 Aligned_cols=57 Identities=21% Similarity=0.232 Sum_probs=30.2 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC-CCCCCCCCCC---CCCCCCCCCCCCCCEECCCCEEEECCCEEECC Q ss_conf 8899998736677418654776-4323211356---07788623778986005698788468603568 Q gi|254780605|r 461 GCDLDSIARIFLWKYGADFAHG-VGHGVGSFLP---VHEGPQGISRTNQEPLLPGMILSNEPGYYRCG 524 (609) Q Consensus 461 g~~lD~~aR~~l~~~g~dy~Hg-tGHGVG~~l~---VHE~P~~is~~~~~~l~~Gmv~s~EPG~Y~~g 524 (609) |+-++.--|+.+.+.|+ ..++ .|||+|-||- |++.--.|.-.+. --.|++||+ |+|. T Consensus 471 G~GI~~~~~~~iFe~G~-Stk~~~~rGiGL~Lvkq~V~~~~G~I~~~s~--~~~Gt~F~i----~iP~ 531 (537) T COG3290 471 GPGIPPEVRDKIFEKGV-STKNTGGRGIGLYLVKQLVERLGGSIEVESE--KGQGTRFSI----YIPK 531 (537) T ss_pred CCCCCHHHHHHHHHCCC-CCCCCCCCCHHHHHHHHHHHHCCCEEEEEEC--CCCCEEEEE----ECCC T ss_conf 99989688777873484-1467788761288999999874966999507--898149999----8888 No 167 >KOG2679 consensus Probab=21.59 E-value=52 Score=13.07 Aligned_cols=23 Identities=4% Similarity=-0.063 Sum_probs=11.3 Q ss_pred EEECCCCCHHHHHHCCCCCCCCCCHHH Q ss_conf 760284202344320135537787002 Q gi|254780605|r 198 VFICDPSSIAWIFNIRGFDIPCSPYPL 224 (609) Q Consensus 198 liit~~d~IaWLlNiRG~Di~~tPv~~ 224 (609) ++|+..||+. .+++. -++-|-|. T Consensus 78 fvlS~GDNfY-d~G~~---~~~Dp~Fq 100 (336) T KOG2679 78 FVLSTGDNFY-DTGLT---SENDPRFQ 100 (336) T ss_pred EEEECCCCCC-CCCCC---CCCCHHHH T ss_conf 8996387310-06878---77780477 No 168 >cd01576 AcnB_Swivel Aconitase B swivel domain. Aconitate hydratase B is involved in energy metabolism as part of the TCA cycle. It catalyses the formation of cis-aconitate from citrate. This is the aconitase swivel domain, which undergoes swivelling conformational change in the enzyme mechanism. The domain structure of Aconitase B is different from other Aconitases in that he swivel domain that is found at N-terminus of B family is normally found at C-terminus for other Aconitases. In most members of the family, there is also a HEAT domain before domain 4, which is believed to play a role in protein-protein interaction. Probab=21.59 E-value=42 Score=13.75 Aligned_cols=78 Identities=26% Similarity=0.436 Sum_probs=42.3 Q ss_pred HHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCCEEEEE-----------CCCCCHHHHHHCCCCCCCCCC Q ss_conf 122012763322000120123358773889999998602334306760-----------284202344320135537787 Q gi|254780605|r 153 DSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEVGAVFI-----------CDPSSIAWIFNIRGFDIPCSP 221 (609) Q Consensus 153 D~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~dalii-----------t~~d~IaWLlNiRG~Di~~tP 221 (609) -..|. ||..|..-.-.+..+.- .-+..|. .++.++.--.++ |...++.|-+ |.|+||-| T Consensus 16 ~dA~S-RpDipLHa~aMlk~~~~-----~~~~~i~-~lk~~g~~~~~vgdvvGtGSSRkSa~Nsv~w~~---G~~ip~vp 85 (131) T cd01576 16 PDAWS-RPDIPLHALAMLKNKRE-----GEIVAIA-QLKPKGHPVAYVGDVVGTGSSRKSATNSVLWHT---GKDIPFVP 85 (131) T ss_pred CCCCC-CCCCHHHHHHHHCCCCC-----CHHHHHH-HHHHCCCEEEEEECCEECCCCCHHHHHHHHHHH---CCCCCCCC T ss_conf 54656-88714589986205434-----1789999-996649838998165766836254441619970---88788889 Q ss_pred HHHHHHHHCCCCCCEEECC Q ss_conf 0023322014786200035 Q gi|254780605|r 222 YPLSRAILYADGKAEIFFD 240 (609) Q Consensus 222 v~~a~aiI~~~~~~~Lfvd 240 (609) -+.+-.++...+-+-+|.+ T Consensus 86 nk~~ggv~~g~~IaPIFfn 104 (131) T cd01576 86 NKRAGGVVLGGKIAPIFFN 104 (131) T ss_pred CCCCCEEEECCCCCCEEEE T ss_conf 8667768868817535772 No 169 >TIGR00110 ilvD dihydroxy-acid dehydratase; InterPro: IPR004404 Two dehydratases, dihydroxy-acid dehydratase (gene ilvD or ILV3) and 6-phosphogluconate dehydratase (gene edd) have been shown to be evolutionary related . Dihydroxy-acid dehydratase catalyzes the fourth step in the biosynthesis of isoleucine and valine, the dehydratation of 2,3-dihydroxy-isovaleic acid into alpha-ketoisovaleric acid. 6-Phosphogluconate dehydratase catalyzes the first step in the Entner-Doudoroff pathway, the dehydratation of 6-phospho-D-gluconate into 6-phospho-2-dehydro-3-deoxy-D-gluconate. Another protein containing this signature is the Escherichia coli hypothetical protein yjhG. The N-terminal part of the proteins contains a cysteine that could be involved in the binding of a 2Fe-2S iron-sulphur cluster . This family represents dihydroxy-acid dehydratase (DAD). It contains a catalytically essential [4Fe-4S] cluster and catalyses the fourth step in valine and isoleucine biosynthesis.; GO: 0004160 dihydroxy-acid dehydratase activity, 0009082 branched chain family amino acid biosynthetic process. Probab=21.52 E-value=42 Score=13.73 Aligned_cols=45 Identities=16% Similarity=0.195 Sum_probs=20.2 Q ss_pred CCCCHHCCCHHHCCHHHHHHHHHHHHHHHHH-HHHHCCCHHHHHHHHHH Q ss_conf 5300110786778999999999999999999-74411807999999973 Q gi|254780605|r 558 CPIDRKLILVELLTNEEKKWCNDYHRRVYTS-LAPLIEDQEVLSWLFSV 605 (609) Q Consensus 558 vP~~~~li~~~ll~~~e~~wln~Yh~~v~~~-~~p~l~~~~~~~wl~~~ 605 (609) +=++.|.||.. ++++|++ .+=-++.... ..|-..+.+.+=||.+. T Consensus 541 IDi~~r~l~l~-~s~eEla--~Rr~~~~~~~p~~P~~~~r~~~G~La~Y 586 (601) T TIGR00110 541 IDIPNRKLDLQ-VSEEELA--ERRAKWKKPGPWEPKNREREVKGYLAKY 586 (601) T ss_pred EECCCCCCCCC-CCHHHHH--HHHHHCCCCCCCCCCCCCCCCCCHHHHH T ss_conf 70023612353-7988999--9998617741788733344542023234 No 170 >smart00862 Trans_reg_C Transcriptional regulatory protein, C terminal. This domain is almost always found associated with the response regulator receiver domain. It may play a role in DNA binding. Probab=21.49 E-value=52 Score=13.06 Aligned_cols=58 Identities=17% Similarity=0.235 Sum_probs=34.6 Q ss_pred CCHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCCCCEEEECCHHHCCCCHHHHHHHHHHHHCCCCC Q ss_conf 13999999999853347619970500111220127633220001201233587738899999986023343 Q gi|254780605|r 125 HSSFEVDLLQKSLDKIEGVIVDVPYNPIDSLWKDRPQRLYRKVAMQDMAYAGRESQEKIRDICKILHQKEV 195 (609) Q Consensus 125 ~s~~~~~~l~~~l~~~~~~l~~~~~NlVD~iW~~rP~~~~~~i~~~~~~~aG~s~~eKi~~ir~~L~~~~~ 195 (609) .|..+++-|.--+...+ ..+ ..+.+++.+|.+.+... +-.+...-+.+||+.|.+.+. T Consensus 6 Lt~~e~~lL~~L~~~~~-~~v-s~~~l~~~lw~~~~~~~-----------~~~~l~~~I~~LR~kl~~~~~ 63 (78) T smart00862 6 LTPKEFRLLELLLRNPG-RVV-SREELLEAVWGDDDDDV-----------DDNTLDVHISRLRKKLEDDGA 63 (78) T ss_pred CCHHHHHHHHHHHHCCC-CCC-CHHHHHHHHCCCCCCCC-----------CCCCHHHHHHHHHHHHHHCCC T ss_conf 48999999999984799-629-69999987628887888-----------860278999999999764589 No 171 >PRK00290 dnaK molecular chaperone DnaK; Provisional Probab=21.05 E-value=39 Score=14.00 Aligned_cols=63 Identities=21% Similarity=0.382 Sum_probs=27.7 Q ss_pred CCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCEEECCCCEEEEEC-----CCCEECCCCCCEEEEEECCC-CHHHHHHHH Q ss_conf 7855554100001233--543222245543320503898799830-----32010561222143210378-888843336 Q gi|254780605|r 370 NPLRDIAFNTIAASGP--HAAIIHYQATVQSNRLLQKDELLLLDS-----GAQYVNGTTDITRTIAIGDV-DYEKKYYFT 441 (609) Q Consensus 370 ~~~~~~SF~TIva~G~--N~AipHY~~~~~s~~~i~~~~~~LiDs-----G~qY~~GtTDiTRT~~~G~~-~~e~k~~yt 441 (609) +...++--+..||.|+ -||+.- + +..+++|+|. |-.-.|| -+|.-+.-+.+ +-...+.|| T Consensus 355 ~~~~~inPDeaVA~GAAiqa~iLs------g----~~~d~~LlDVtPlSLGiet~gg--~~~~iI~rNt~IP~~~s~~Ft 422 (631) T PRK00290 355 EPNKGVNPDEVVAVGAAIQGGVLA------G----DVKDVLLLDVTPLSLGIETLGG--VMTKLIERNTTIPTKKSQVFS 422 (631) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHC------C----CCCCCEEEECCCCCEEEEECCC--EEEEEEECCCCCCCCCEEEEE T ss_conf 977796912689987999998863------8----7667368960345313786397--479963179836821216623 Q ss_pred HHH Q ss_conf 677 Q gi|254780605|r 442 LVL 444 (609) Q Consensus 442 ~VL 444 (609) .+- T Consensus 423 T~~ 425 (631) T PRK00290 423 TAE 425 (631) T ss_pred EEC T ss_conf 404 No 172 >TIGR01363 strep_his_triad streptococcal histidine triad protein; InterPro: IPR006270 The sequences represented in this group are identified by a domain which consists of the N-terminal half of a family of Streptococcal proteins that contain a signal peptide and then up to five repeats of a region that includes a His-X-X-His-X-His (histidine triad) motif. Additional copies of the repeats are found in more poorly conserved regions. Members of this family from Streptococcus pneumoniae are suggested to cleave human C3, and the member PhpA has been shown in vaccine studies to be a protective antigen in mice . . Probab=20.82 E-value=43 Score=13.66 Aligned_cols=18 Identities=17% Similarity=0.167 Sum_probs=9.0 Q ss_pred CHHHHHCCCHHHHHHHHH Q ss_conf 115666199999987999 Q gi|254780605|r 307 SCLLRATKNKVEIEGMQT 324 (609) Q Consensus 307 I~~lRaiKn~~EI~~mr~ 324 (609) -...+-|.+..||+.=++ T Consensus 144 aa~~~NiRTK~~I~~Q~~ 161 (376) T TIGR01363 144 AAHADNIRTKEEIARQKQ 161 (376) T ss_pred CCCCCCCCHHHHHHHHHH T ss_conf 122101301789999987 No 173 >cd03575 NTR_WFIKKN NTR domain, WFIKKN subfamily; WFIKKN proteins contain a C-terminal NTR domain and are putative secreted proteins which may be multivalent protease inhibitors that act on serine proteases as well as metalloproteases. Human WFIKKN and a related protein sharing the same domain architecture were observed to have distinct tissue expression patterns. WFIKKN is also referred to as growth and differentiation factor-associated serum protein-1 (GASP-1). It inhibits the activity of mature myostatin, a specific regulator of skeletal muscle mass and a member of the TGFbeta superfamily. Probab=20.62 E-value=55 Score=12.94 Aligned_cols=26 Identities=23% Similarity=0.530 Sum_probs=19.0 Q ss_pred EEECCHHHCCCCHHHHHHHHHHHHCC Q ss_conf 01201233587738899999986023 Q gi|254780605|r 167 VAMQDMAYAGRESQEKIRDICKILHQ 192 (609) Q Consensus 167 i~~~~~~~aG~s~~eKi~~ir~~L~~ 192 (609) +..+|..|.+-++..++.+||+.+.+ T Consensus 84 avl~pdSfv~~ss~rRv~klrE~~~k 109 (109) T cd03575 84 AVLQPDSYVRASSERRVRKLREVLHK 109 (109) T ss_pred EEECCCCEECCCCHHHHHHHHHHHHC T ss_conf 77557762100438899999998509 No 174 >TIGR01607 PST-A Plasmodium subtelomeric family (PST-A); InterPro: IPR006494 These proteins represent a paralogous family of genes found in Plasmodium falciparum and Plasmodium yoelii that are closely related to various phospholipases and lysophospholipases of plants as well as generally being related to the alpha/beta-fold superfamily of hydrolases. These genes are preferentially located in the subtelomeric regions of the chromosomes of both P. falciparum and P. yoelii. . Probab=20.54 E-value=39 Score=13.99 Aligned_cols=25 Identities=16% Similarity=0.185 Sum_probs=17.3 Q ss_pred HCCCCCCCCCEEEECCHHHCCCCHH Q ss_conf 0127633220001201233587738 Q gi|254780605|r 156 WKDRPQRLYRKVAMQDMAYAGRESQ 180 (609) Q Consensus 156 W~~rP~~~~~~i~~~~~~~aG~s~~ 180 (609) |-+|=.+.-=-||-++++=.|||=. T Consensus 66 WIE~fN~~GYSVYgLDLQGHGESdG 90 (379) T TIGR01607 66 WIEKFNKNGYSVYGLDLQGHGESDG 90 (379) T ss_pred EEEECCCCCCEEEEEECCCCCCCCH T ss_conf 5651167976387534356762320 No 175 >PRK08130 putative aldolase; Validated Probab=20.43 E-value=55 Score=12.91 Aligned_cols=17 Identities=18% Similarity=0.331 Sum_probs=8.6 Q ss_pred EECCCCEEEEECCCCEE Q ss_conf 50389879983032010 Q gi|254780605|r 401 LLQKDELLLLDSGAQYV 417 (609) Q Consensus 401 ~i~~~~~~LiDsG~qY~ 417 (609) .+++.+++++|.-|+.. T Consensus 50 ~l~~~div~vd~dG~~~ 66 (213) T PRK08130 50 RLDPARLSKVDADGNWL 66 (213) T ss_pred HCCHHHHEEECCCCCCC T ss_conf 39888907655678725 No 176 >PRK12724 flagellar biosynthesis regulator FlhF; Provisional Probab=20.32 E-value=55 Score=12.89 Aligned_cols=19 Identities=16% Similarity=0.151 Sum_probs=12.4 Q ss_pred HHHHHHHHCCCCEEEEECC Q ss_conf 9999999749979996088 Q gi|254780605|r 20 NLRSCFDSLGMDAFLVPRV 38 (609) Q Consensus 20 ~LR~~m~~~~idayiip~~ 38 (609) +|+..-++.|=||.|+.+- T Consensus 14 Al~~vk~eLG~DAVILssr 32 (432) T PRK12724 14 CLMEMKMKYGSEATVISQT 32 (432) T ss_pred HHHHHHHHHCCCEEEEEEE T ss_conf 9999999978992998545 No 177 >KOG0821 consensus Probab=20.25 E-value=37 Score=14.11 Aligned_cols=14 Identities=29% Similarity=0.503 Sum_probs=11.6 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 14899998619671 Q gi|254780605|r 80 RYTLQVEKEVDTAL 93 (609) Q Consensus 80 RY~lQA~~el~~~~ 93 (609) -|-+||++|+++++ T Consensus 19 lYRLqA~K~LSQNf 32 (326) T KOG0821 19 LYRLQAAKQLSQNF 32 (326) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999886767 No 178 >TIGR02551 SpaO_YscQ type III secretion apparatus protein, YscQ/HrcQ family; InterPro: IPR013385 Proteins in this entry are encoded within type III secretion operons and are involved in many different functions. For example, YscQ in Yersinia is essential for YOPs secretion , while SpaO in Shigella is involved in the Surface Presentation of Antigens apparatus found on the virulence plasmid , and HrcQ is involved in the Harpin secretory system in organisms like Pseudomonas syringae .; GO: 0030254 protein secretion by the type III secretion system. Probab=20.10 E-value=56 Score=12.86 Aligned_cols=19 Identities=37% Similarity=0.457 Sum_probs=11.7 Q ss_pred EEECCCCEEEEECCCCEEC Q ss_conf 0503898799830320105 Q gi|254780605|r 400 RLLQKDELLLLDSGAQYVN 418 (609) Q Consensus 400 ~~i~~~~~~LiDsG~qY~~ 418 (609) +.++.||++|+|....+.. T Consensus 161 ~~l~~GDvLL~~~~~~~~~ 179 (326) T TIGR02551 161 RSLEPGDVLLLDNPAAARA 179 (326) T ss_pred HHHCCCCEEECCCCCCCCC T ss_conf 2320585663265311220 Done!