Query gi|254780915|ref|YP_003065328.1| oligoendopeptidase F [Candidatus Liberibacter asiaticus str. psy62] Match_columns 626 No_of_seqs 186 out of 1997 Neff 9.3 Searched_HMMs 39220 Date Mon May 30 01:13:31 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780915.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR02290 M3_fam_3 oligoendope 100.0 0 0 949.0 57.4 583 38-626 1-600 (600) 2 TIGR00181 pepF oligoendopeptid 100.0 0 0 922.8 51.4 587 30-626 1-611 (611) 3 COG1164 Oligoendopeptidase F [ 100.0 0 0 866.8 61.8 590 27-626 1-595 (598) 4 cd06459 M3B_Oligoendopeptidase 100.0 0 0 708.9 43.1 423 192-617 2-427 (427) 5 COG0339 Dcp Zn-dependent oligo 100.0 0 0 620.5 42.0 563 27-610 9-677 (683) 6 PRK10280 dipeptidyl carboxypep 100.0 0 0 603.1 46.2 548 57-611 25-676 (681) 7 PRK10911 oligopeptidase A; Pro 100.0 0 0 591.1 42.5 545 59-611 21-675 (680) 8 TIGR02289 M3_not_pepF oligoend 100.0 0 0 562.5 45.3 538 59-624 3-553 (553) 9 pfam01432 Peptidase_M3 Peptida 100.0 0 0 551.5 36.9 382 229-614 2-447 (448) 10 cd06455 M3A_TOP Peptidase M3 T 100.0 0 0 507.6 31.2 379 221-611 20-471 (472) 11 cd06456 M3A_DCP_Oligopeptidase 100.0 0 0 486.6 28.3 350 255-611 1-419 (422) 12 cd06457 M3A_MIP Peptidase M3 m 100.0 0 0 482.9 27.2 371 228-609 2-455 (458) 13 KOG2089 consensus 100.0 0 0 463.7 34.6 517 83-611 75-705 (718) 14 cd06258 Peptidase_M3_like The 100.0 0 0 416.3 31.8 325 258-610 2-363 (365) 15 KOG2090 consensus 100.0 0 0 383.1 39.2 553 42-617 42-693 (704) 16 pfam01401 Peptidase_M2 Angiote 99.9 8.8E-18 2.2E-22 144.9 45.2 506 81-623 10-578 (595) 17 cd06461 M2_ACE Peptidase famil 99.8 4.7E-16 1.2E-20 132.5 34.7 380 225-622 36-475 (477) 18 pfam02074 Peptidase_M32 Carbox 99.8 5.5E-14 1.4E-18 117.8 39.1 336 236-611 108-488 (494) 19 cd06460 M32_Taq Peptidase fami 99.8 2.1E-14 5.5E-19 120.7 27.9 334 237-611 9-389 (396) 20 KOG3690 consensus 99.7 4.3E-11 1.1E-15 97.1 46.3 514 82-622 29-606 (646) 21 COG2317 Zn-dependent carboxype 99.6 1.5E-12 3.8E-17 107.5 25.9 336 237-613 108-491 (497) 22 pfam08439 Peptidase_M3_N Oligo 98.2 2.4E-06 6.1E-11 63.3 5.7 70 145-214 1-70 (70) 23 COG2856 Predicted Zn peptidase 96.1 0.1 2.7E-06 30.2 9.9 52 373-426 35-88 (213) 24 pfam05960 DUF885 Bacterial pro 95.9 0.14 3.5E-06 29.3 27.4 172 398-597 346-529 (536) 25 KOG2719 consensus 93.4 0.08 2E-06 31.0 3.2 50 379-428 243-298 (428) 26 pfam01400 Astacin Astacin (Pep 92.6 0.098 2.5E-06 30.3 2.7 68 347-426 20-95 (192) 27 cd04272 ZnMc_salivary_gland_MP 92.4 0.11 2.7E-06 30.1 2.7 41 387-428 122-162 (220) 28 pfam10460 Peptidase_M30 Peptid 92.4 0.23 6E-06 27.7 4.4 51 407-458 136-186 (366) 29 cd00203 ZnMc Zinc-dependent me 92.0 0.25 6.4E-06 27.4 4.2 15 409-423 95-109 (167) 30 cd04280 ZnMc_astacin_like Zinc 91.8 0.18 4.6E-06 28.4 3.3 73 347-426 16-89 (180) 31 pfam04298 Zn_peptidase_2 Putat 91.8 0.24 6E-06 27.6 3.8 51 402-454 80-131 (222) 32 smart00235 ZnMc Zinc-dependent 91.7 0.074 1.9E-06 31.2 1.2 81 340-426 17-101 (140) 33 cd04283 ZnMc_hatching_enzyme Z 91.4 0.19 4.8E-06 28.3 3.0 38 387-426 53-92 (182) 34 pfam06114 DUF955 Domain of unk 91.0 0.25 6.5E-06 27.4 3.4 28 397-424 27-55 (121) 35 cd04282 ZnMc_meprin Zinc-depen 90.6 0.36 9.1E-06 26.3 3.8 74 346-425 61-135 (230) 36 cd04281 ZnMc_BMP1_TLD Zinc-dep 89.4 0.42 1.1E-05 25.9 3.4 18 406-423 83-100 (200) 37 KOG3607 consensus 89.4 0.32 8.1E-06 26.7 2.8 58 371-429 278-341 (716) 38 pfam01435 Peptidase_M48 Peptid 89.0 0.57 1.4E-05 24.9 3.8 48 376-426 51-104 (222) 39 PTZ00337 surface protease GP63 88.4 0.53 1.4E-05 25.1 3.3 49 376-424 187-243 (567) 40 cd04269 ZnMc_adamalysin_II_lik 88.0 0.45 1.1E-05 25.7 2.8 41 387-428 106-148 (194) 41 cd04268 ZnMc_MMP_like Zinc-dep 87.7 0.62 1.6E-05 24.6 3.4 14 409-422 93-106 (165) 42 cd04273 ZnMc_ADAMTS_like Zinc- 87.7 0.56 1.4E-05 24.9 3.1 41 386-428 117-157 (207) 43 cd04267 ZnMc_ADAM_like Zinc-de 87.4 0.21 5.4E-06 28.0 0.8 43 385-428 107-150 (192) 44 pfam09768 Peptidase_M76 Peptid 85.7 1.7 4.4E-05 21.5 4.7 53 372-428 35-88 (172) 45 pfam01421 Reprolysin Reprolysi 85.6 0.31 7.8E-06 26.8 0.8 41 387-428 106-148 (198) 46 COG0501 HtpX Zn-dependent prot 85.5 1.5 3.9E-05 21.8 4.4 49 375-424 118-171 (302) 47 PTZ00257 Glycoprotein GP63 (le 85.2 0.83 2.1E-05 23.7 2.9 50 375-424 210-270 (615) 48 PRK02391 heat shock protein Ht 84.1 2.2 5.5E-05 20.8 4.6 47 377-425 97-149 (297) 49 cd04327 ZnMc_MMP_like_3 Zinc-d 83.6 0.47 1.2E-05 25.5 1.1 19 408-426 90-108 (198) 50 cd04278 ZnMc_MMP Zinc-dependen 82.8 0.8 2E-05 23.8 2.0 16 408-423 105-120 (157) 51 cd06162 S2P-M50_PDZ_SREBP Ster 82.8 0.44 1.1E-05 25.7 0.7 42 411-452 136-181 (277) 52 cd04279 ZnMc_MMP_like_1 Zinc-d 81.9 0.57 1.4E-05 24.9 0.9 15 408-422 102-116 (156) 53 PRK04897 heat shock protein Ht 81.0 1.2 3.1E-05 22.5 2.4 47 377-425 100-152 (298) 54 pfam01742 Peptidase_M27 Clostr 80.5 1.2 3E-05 22.7 2.2 22 408-429 212-233 (408) 55 PRK03982 heat shock protein Ht 79.8 2 5.1E-05 21.0 3.2 46 378-425 89-140 (288) 56 cd04276 ZnMc_MMP_like_2 Zinc-d 79.2 0.66 1.7E-05 24.5 0.5 12 411-422 117-128 (197) 57 pfam00413 Peptidase_M10 Matrix 78.8 0.67 1.7E-05 24.4 0.5 16 408-423 105-120 (158) 58 COG2738 Predicted Zn-dependent 78.7 4 0.0001 18.9 7.9 87 364-455 44-135 (226) 59 TIGR03296 M6dom_TIGR03296 M6 f 78.3 0.23 5.9E-06 27.7 -2.0 14 409-422 164-177 (286) 60 KOG2661 consensus 78.2 2.3 5.9E-05 20.6 3.1 60 385-447 249-318 (424) 61 cd06164 S2P-M50_SpoIVFB_CBS Sp 78.0 1.4 3.5E-05 22.2 1.8 18 411-428 54-71 (227) 62 LOAD_S2Pmetalloprt consensus 77.7 1.4 3.6E-05 22.1 1.8 18 411-428 8-25 (148) 63 COG4227 Antirestriction protei 77.6 1.3 3.4E-05 22.3 1.7 85 362-449 158-247 (316) 64 PRK03001 heat shock protein Ht 77.5 2 5E-05 21.0 2.5 46 377-424 87-138 (284) 65 pfam03272 Enhancin Viral enhan 77.2 0.67 1.7E-05 24.4 0.1 16 470-485 664-679 (775) 66 cd06161 S2P-M50_SpoIVFB SpoIVF 77.1 1.5 3.8E-05 21.9 1.8 18 411-428 39-56 (208) 67 PRK05457 heat shock protein Ht 77.0 2.6 6.5E-05 20.2 3.0 46 378-425 98-149 (289) 68 cd04277 ZnMc_serralysin_like Z 76.9 0.21 5.3E-06 28.0 -2.6 13 410-422 113-125 (186) 69 PRK01265 heat shock protein Ht 75.1 3.4 8.6E-05 19.4 3.2 47 377-425 102-154 (326) 70 PRK02870 heat shock protein Ht 74.6 5.1 0.00013 18.1 5.2 47 377-425 136-188 (338) 71 cd04275 ZnMc_pappalysin_like Z 74.3 0.54 1.4E-05 25.1 -1.0 16 407-422 134-149 (225) 72 PRK03072 heat shock protein Ht 74.2 3 7.7E-05 19.7 2.8 47 377-425 90-142 (289) 73 TIGR01241 FtsH_fam ATP-depende 73.8 1.6 4E-05 21.7 1.3 74 335-433 257-339 (505) 74 pfam01434 Peptidase_M41 Peptid 73.7 3.2 8.1E-05 19.6 2.8 22 411-432 9-30 (192) 75 pfam12388 Peptidase_M57 Dual-a 73.4 1.5 3.9E-05 21.8 1.1 50 374-424 89-147 (211) 76 KOG3658 consensus 73.0 1.2 3E-05 22.7 0.4 59 363-423 335-405 (764) 77 pfam02163 Peptidase_M50 Peptid 71.8 2.4 6.2E-05 20.4 1.8 18 411-428 8-25 (205) 78 cd06163 S2P-M50_PDZ_RseP-like 71.5 2.6 6.7E-05 20.2 1.9 19 410-428 9-27 (182) 79 cd05709 S2P-M50 Site-2 proteas 69.6 2.9 7.4E-05 19.8 1.8 19 411-429 9-27 (180) 80 KOG3714 consensus 69.6 2 5.1E-05 21.0 1.0 20 407-426 156-175 (411) 81 PRK10733 hflB ATP-dependent me 69.5 4.4 0.00011 18.6 2.7 25 409-433 407-431 (644) 82 pfam04228 Zn_peptidase Putativ 68.9 6.8 0.00017 17.2 7.1 84 344-427 85-187 (292) 83 pfam05572 Peptidase_M43 Pregna 68.6 0.88 2.2E-05 23.5 -1.0 15 408-422 67-81 (152) 84 pfam01457 Peptidase_M8 Leishma 68.5 2.6 6.7E-05 20.1 1.4 50 375-424 194-254 (569) 85 pfam02031 Peptidase_M7 Strepto 67.2 2.3 5.9E-05 20.5 0.9 48 375-422 40-89 (132) 86 KOG2921 consensus 66.4 3.3 8.4E-05 19.5 1.6 37 411-447 132-172 (484) 87 CHL00176 ftsH cell division pr 65.5 5.8 0.00015 17.7 2.7 22 412-433 434-455 (631) 88 TIGR02151 IPP_isom_2 isopenten 65.3 3.3 8.3E-05 19.5 1.4 92 525-624 214-326 (349) 89 PRK10779 zinc metallopeptidase 64.9 4.1 0.00011 18.8 1.8 10 342-351 254-263 (449) 90 TIGR00054 TIGR00054 membrane-a 64.4 3.6 9.3E-05 19.2 1.5 46 407-452 335-383 (463) 91 pfam07607 DUF1570 Protein of u 63.8 7.3 0.00019 17.0 3.0 41 410-453 1-41 (128) 92 pfam05548 Peptidase_M11 Gameto 62.5 5.7 0.00014 17.8 2.2 47 374-422 102-148 (303) 93 PRK01345 heat shock protein Ht 61.8 7 0.00018 17.1 2.5 47 377-425 87-139 (314) 94 cd06160 S2P-M50_like_2 Unchara 61.6 5.1 0.00013 18.1 1.8 16 411-426 42-57 (183) 95 KOG1565 consensus 61.3 2.4 6.2E-05 20.4 0.1 21 407-427 208-228 (469) 96 cd04271 ZnMc_ADAM_fungal Zinc- 60.9 1.4 3.6E-05 22.1 -1.1 16 319-334 45-60 (228) 97 pfam10263 SprT-like SprT-like 60.7 8.2 0.00021 16.6 2.7 50 375-427 21-75 (153) 98 cd06159 S2P-M50_PDZ_Arch Uncha 59.0 6 0.00015 17.6 1.8 35 392-426 100-134 (263) 99 KOG3931 consensus 59.0 6.5 0.00016 17.4 2.0 20 407-426 102-121 (484) 100 cd06158 S2P-M50_like_1 Unchara 57.3 6.8 0.00017 17.2 1.8 19 411-429 10-28 (181) 101 pfam06167 MtfA Phosphoenolpyru 57.1 11 0.00028 15.7 4.3 62 400-461 132-202 (248) 102 cd04270 ZnMc_TACE_like Zinc-de 56.8 1.9 4.8E-05 21.2 -1.1 13 410-422 167-179 (244) 103 PHA02456 zinc metallopeptidase 56.2 6.6 0.00017 17.3 1.6 26 408-433 77-102 (141) 104 KOG3624 consensus 55.9 11 0.00029 15.6 5.9 83 411-496 519-602 (687) 105 pfam08325 WLM WLM domain. This 55.4 5.7 0.00015 17.8 1.2 18 407-424 79-97 (183) 106 COG2316 Predicted hydrolase (H 54.3 12 0.00031 15.4 4.6 67 555-623 140-210 (212) 107 pfam05547 Peptidase_M6 Immune 51.8 0.84 2.1E-05 23.7 -3.5 13 410-422 222-234 (646) 108 COG3824 Predicted Zn-dependent 51.5 3.7 9.4E-05 19.1 -0.3 51 376-429 71-126 (136) 109 COG4783 Putative Zn-dependent 50.9 9.7 0.00025 16.1 1.8 21 116-136 67-87 (484) 110 pfam12315 DUF3633 Protein of u 50.0 8.3 0.00021 16.6 1.3 19 408-426 91-109 (212) 111 smart00731 SprT SprT homologue 49.6 13 0.00033 15.2 2.3 45 379-427 27-76 (146) 112 pfam11176 DUF2962 Protein of u 49.4 9.3 0.00024 16.3 1.5 76 338-414 62-148 (155) 113 pfam09471 Peptidase_M64 IgA Pe 49.1 1.7 4.2E-05 21.6 -2.4 49 374-430 180-228 (257) 114 COG5549 Predicted Zn-dependent 47.6 8.1 0.00021 16.7 1.0 15 411-425 188-202 (236) 115 pfam12044 Metallopep Putative 46.2 11 0.00029 15.6 1.6 33 402-438 300-339 (419) 116 pfam06262 DUF1025 Domain of un 44.3 11 0.00029 15.6 1.3 10 411-420 73-82 (96) 117 pfam10462 Peptidase_M66 Peptid 42.0 4.3 0.00011 18.6 -1.2 44 411-464 193-239 (304) 118 KOG3854 consensus 41.2 19 0.00048 14.1 2.6 46 397-443 335-383 (505) 119 TIGR01108 oadA oxaloacetate de 39.1 8.1 0.00021 16.7 -0.1 24 258-281 205-228 (616) 120 pfam07998 Peptidase_M54 Peptid 38.5 19 0.00049 14.0 1.7 14 409-422 125-138 (176) 121 pfam01447 Peptidase_M4 Thermol 37.8 7.2 0.00018 17.1 -0.6 14 409-422 134-147 (150) 122 PHA00363 major capsid protein 37.2 22 0.00055 13.6 3.2 18 360-377 325-342 (557) 123 PRK13267 archaemetzincin-like 36.8 18 0.00046 14.2 1.3 14 409-422 122-135 (177) 124 pfam08112 ATP-synt_E_2 ATP syn 36.6 19 0.00048 14.1 1.4 24 576-599 8-31 (56) 125 KOG4525 consensus 36.2 20 0.0005 13.9 1.5 21 402-422 288-313 (614) 126 pfam10463 Peptidase_U49 Peptid 35.4 18 0.00045 14.3 1.1 15 411-425 99-113 (203) 127 TIGR02158 PA_CoA_Oxy3 phenylac 33.7 24 0.00062 13.3 3.5 49 471-522 83-131 (251) 128 pfam09197 Rap1-DNA-bind Rap1, 33.5 25 0.00062 13.2 1.9 33 565-597 52-84 (101) 129 COG3227 LasB Zinc metalloprote 33.2 19 0.00048 14.0 1.0 29 397-428 327-355 (507) 130 COG4784 Putative Zn-dependent 33.1 17 0.00044 14.3 0.8 24 115-138 60-83 (479) 131 KOG1885 consensus 32.4 20 0.00051 13.9 1.0 25 368-392 344-368 (560) 132 TIGR01428 HAD_type_II haloacid 30.5 27 0.0007 12.9 2.1 115 476-603 5-140 (207) 133 TIGR02269 TIGR02269 Myxococcus 30.2 28 0.0007 12.9 1.4 51 549-601 159-213 (228) 134 KOG2670 consensus 30.0 28 0.00071 12.8 1.5 115 285-418 167-285 (433) 135 KOG0734 consensus 30.0 6.6 0.00017 17.3 -1.8 24 410-433 557-580 (752) 136 pfam11350 DUF3152 Protein of u 29.9 16 0.0004 14.6 0.1 21 404-424 130-152 (189) 137 TIGR01122 ilvE_I branched-chai 28.7 11 0.00028 15.7 -0.8 22 402-423 209-232 (302) 138 pfam12057 DUF3538 Domain of un 27.8 30 0.00077 12.6 2.7 18 407-424 46-63 (118) 139 TIGR01325 O_suc_HS_sulf O-succ 27.3 18 0.00047 14.1 0.1 35 69-103 37-75 (386) 140 COG2321 Predicted metalloprote 25.9 33 0.00083 12.4 6.5 108 345-452 83-210 (295) 141 pfam01431 Peptidase_M13 Peptid 24.5 35 0.00088 12.2 7.0 15 411-425 37-51 (203) 142 pfam04838 Baculo_LEF5 Baculovi 23.8 10 0.00026 16.0 -1.7 31 399-429 23-53 (159) 143 cd07219 Pat_PNPLA1 Patatin-lik 23.2 18 0.00046 14.2 -0.5 65 343-419 139-211 (382) 144 COG0465 HflB ATP-dependent Zn 22.7 37 0.00095 11.9 2.3 31 411-441 407-438 (596) 145 pfam02102 Peptidase_M35 Deuter 22.5 12 0.00032 15.3 -1.5 17 406-422 293-309 (352) 146 pfam07203 DUF1412 Protein of u 21.1 40 0.001 11.7 1.3 16 534-550 53-68 (74) 147 PRK10735 tldD protease TldD; P 20.4 41 0.0011 11.6 2.1 32 393-429 244-276 (481) 148 KOG3538 consensus 20.2 16 0.00041 14.6 -1.3 21 407-428 314-334 (845) No 1 >TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family; InterPro: IPR011977 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists of probable oligoendopeptidases belonging to MEROPS peptidase family M3, subfamily M3B (clan MA(E)). It is related to lactococcal PepF and group B streptococcal PepB (IPR004438 from INTERPRO) but in a distinct clade with considerable sequence differences not only to IPR004438 from INTERPRO but also to IPR011976 from INTERPRO. Members are not characterised with respect to their substrates and the activity profile may differ.. Probab=100.00 E-value=0 Score=949.04 Aligned_cols=583 Identities=36% Similarity=0.617 Sum_probs=560.2 Q ss_pred CCCHHHHCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 679357278889-2789999999999999999997423310233499--8999999999999999999999999864205 Q gi|254780915|r 38 RWNLEDLYPSHD-SQEISNDMECIEHESLAFKTRWEGNLAHATNQKN--CHSLGAAIAEYERICELIGRIASYAMLSYNC 114 (626) Q Consensus 38 ~WdL~~ly~~~~-~~e~~~dl~~~~~~i~~~~~~~~~~l~~~~~~~~--~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~ 114 (626) +|||++||+|.+ +++|+++++.++..+.+|...++.++.... ..+ ...|.++|..+|++++.+.++++|++|.+|+ T Consensus 1 ~WdL~~lY~~~~~s~e~~~~~~~~~~~~~~l~~~~~~~~~~~d-~~~~~~~~~~~~l~~~e~~~~~~~~~~~ya~l~~s~ 79 (600) T TIGR02290 1 EWDLSDLYPGGSESPEFEEDLEKIKALADELESLLKEKLTPLD-AKGRLKEKLQNALKRYEALSELLSKVGAYASLLYSA 79 (600) T ss_pred CCCCCHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9881002689988978899999999999999999985124222-001128899999999999999998788999999886 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 79918999999999999999999887416988719989999998469001111799999997423317167998766300 Q gi|254780915|r 115 NLSSPTIRKFYTDINAKLADFEKVLIFFALEINTLDEALLEQSYAQDPLTLKYSAWIKNIRKIKKHLLSNDMECLLSDTS 194 (626) Q Consensus 115 dt~d~~~~~~~~~~~~~~s~~~~~l~~~~~e~~~l~e~~l~~~~~~~~~l~~y~~~~~~l~~~~~~~l~~~~E~~l~~~~ 194 (626) ||+|+++.++.++++.+.+.+++...+|+.++.+++++....++..++.++.|++++.+.++.++|.|++.+|++++.++ T Consensus 80 d~~~~~a~~~~~~~~~~~~~~~~~~~~f~~~l~~~~d~~~~~~l~~~~~l~~~~~~L~~~R~~~~~~Lse~EE~l~~~l~ 159 (600) T TIGR02290 80 DTEDPEAKKLLGELEALSSEIQQALLFFTLELARLSDEVWAELLSKDPDLAPYRFYLKELRKEAPHTLSEEEEKLIAALS 159 (600) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHC T ss_conf 06868899999999999998886301799998248688999997368431135752999998575439998999998746 Q ss_pred HHCCCHHHHHHHHHHHHHHHHHH-----CCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 10003677777764444332210-----0347523321354188876799999999998650542169999998666788 Q gi|254780915|r 195 QVGREALKRFFCENIESLRFKIN-----DQKIPLTKAYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEI 269 (626) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e 269 (626) .+|..+|+++++..++.+++... .+..++.+...++++||+++|++||+++.+.|.++..+++.|||.+.+.|.+ T Consensus 160 ~~G~~AW~~Lyd~~~~~l~~~~~~~~ge~~~l~~~~~~nl~~~pd~~vR~~A~~a~~~~~~~~~~~~a~~LN~l~g~~~~ 239 (600) T TIGR02290 160 LTGRAAWSRLYDELTGTLRIPVEEKDGEEEELSLEQALNLLRDPDAEVRKKAFKALLKAWEKEAPTLAAILNALAGDRLT 239 (600) T ss_pred CHHHHHHHHHHHHHHEEEEEEEEECCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 30588899999741112368885146620002699999862088888999999999999875002699998655578888 Q ss_pred HHHHCC---CCCHHHHHHHHCCCCHHHHHHHHHHHHHCCHH-HHHHHHHHHHHHCCCCCCCHHHHHHCCCCCC--CCCCC Q ss_conf 654278---99988976222258989999999987630628-9999999999870887531234440256554--77799 Q gi|254780915|r 270 QDRWRK---YEKIADSRHLSNNVEPYVIEALMQSVKNYYPK-TSHRYYELKKKWLKLDTMYFWDRLAPLPGTS--QDIIP 343 (626) Q Consensus 270 ~A~llG---y~s~~~~~l~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~lg~~~l~~wD~~~~~~~~~--~~~~~ 343 (626) ..++|| |.++++.++..|.|+.++|++|+.+|..+. | .++||+..|+++||+++|.+||++||++... .+.+| T Consensus 240 ~~~~R~~~~~~~~le~~l~~nri~~~tl~am~~a~~en~-pR~~~rY~~~KAklLG~~~L~~~D~~AP~~~~~~e~~~~~ 318 (600) T TIGR02290 240 EYRLRGQVYYDHPLEPRLLYNRIDQETLDAMLEAIKENY-PRLFRRYLKLKAKLLGKEKLDFYDLLAPLGDSDSEAKTYT 318 (600) T ss_pred HHHHHCCCCCCCCCCCCHHHHCCCHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHCHHHCCCCCCCCCCCCCCCCCCCCC T ss_conf 887732445676421101330388899999999999852-8058999999888718201763165467888777766075 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHH Q ss_conf 89999999986666328999999986424850201037998743547666777510103653324679999998458999 Q gi|254780915|r 344 FEVARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIH 423 (626) Q Consensus 344 ~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H 423 (626) |+||+++|.++|+.|+|+++++++++|+++|||..|||||||||||+++++.++|||+|||+|+++||.||+||||||+| T Consensus 319 f~EA~~lvl~~F~~fsP~~A~~A~~af~~~WIDa~pRpGK~gGAFC~~~~~~~~P~vl~Ny~G~~~dV~TLAHELGHA~H 398 (600) T TIGR02290 319 FDEAKELVLEAFGKFSPEMADFAEKAFEEGWIDAEPRPGKRGGAFCTGFPPSKEPRVLMNYDGSFSDVSTLAHELGHAYH 398 (600) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHH T ss_conf 89899999999876357899999999757962034783388985336888876661462458884158999898655899 Q ss_pred HHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99612568776578005999886888999999998413899899999999986206999999999999999997212016 Q gi|254780915|r 424 FVLSSETQGILTNNSSLTLAETASIFGETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRST 503 (626) Q Consensus 424 ~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~ 503 (626) ++++++ +|+.++++||++|||||+|+|+++.|.++++..++++|..||+.+++++.++++||+++..||.++|+. ++. T Consensus 399 ~~~~~d-~p~l~~~yPmtlAETASiF~E~l~~d~ll~~ak~~eeK~~lL~~~~~d~~at~~~i~~rf~FE~~~~~~-R~~ 476 (600) T TIGR02290 399 SELAKD-QPLLNARYPMTLAETASIFAEMLLFDALLKEAKTEEEKLSLLAEKLEDAIATLVRIHARFLFERRFHEA-RKE 476 (600) T ss_pred HHHHHC-CCHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH T ss_conf 998741-372540588733678999768999999988308088989999998710899998999999999999998-671 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCC--CCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 899989999999999998607763458878885001525410667--525899999999999999999567332899999 Q gi|254780915|r 504 GDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESS--FYVYAYAFGNCLVNSLYDIYKSNTVDCFKEKYL 581 (626) Q Consensus 504 ~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~--fY~y~Y~~~~~~a~~l~~~~~~~~~~~~~~~y~ 581 (626) +.||++++++||.+.+++.||++++... .++++|+++||||++| ||||+|+||+|++.+||++++++| +.|.++|. T Consensus 477 ~~Ls~~~~~~Lm~eaq~~~YG~~l~~~~-~~p~~Wa~k~HF~~~~~~FYnYpY~FG~LfvlglY~~y~~~G-~~F~~~Y~ 554 (600) T TIGR02290 477 GELSAEELCELMLEAQKESYGDALDESE-LHPYMWAYKPHFYHADVPFYNYPYTFGYLFVLGLYAKYKEEG-ESFVDKYI 554 (600) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHCCC-CCCEECCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH T ss_conf 2889899868799999998546753289-885000013613417888521345899999999999999861-53799999 Q ss_pred HHHHCCCCCCHHHHHHH-CCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99832699997999997-0889899889999999999999998609 Q gi|254780915|r 582 NILRAGNSKHYSELLLP-LNINLSDPNFWERGLQTVEKMIDDVEKM 626 (626) Q Consensus 582 ~~L~~Ggs~~~~el~k~-~G~d~~~~~~~~~~~~~~~~~i~~~e~l 626 (626) +||+.|||+.|.||++. +|+|+++|+||+.||+.|.+.||+||+| T Consensus 555 ~lL~~tG~~~~~el~~~H~G~Dl~~~dFW~~G~~~~~~~i~~f~~L 600 (600) T TIGR02290 555 ALLRDTGSMTAEELVKKHLGFDLTSPDFWQKGIDVIEEKIEEFEEL 600 (600) T ss_pred HHHHHCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 9987203678899999653899998788999999999999985039 No 2 >TIGR00181 pepF oligoendopeptidase F; InterPro: IPR004438 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), the type example being oligoendopeptidase F from Lactococcus lactis. The enzyme hydrolyses peptides of 7 and 17 amino acids with fairly broad specificity. Differences in substrate specificity should be expected in other species. The gene is duplicated in L. lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis. Probab=100.00 E-value=0 Score=922.80 Aligned_cols=587 Identities=28% Similarity=0.435 Sum_probs=556.7 Q ss_pred HHHCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44342477679357278889278999999999999999999742331023349989999999999999999999999998 Q gi|254780915|r 30 QENLGNLPRWNLEDLYPSHDSQEISNDMECIEHESLAFKTRWEGNLAHATNQKNCHSLGAAIAEYERICELIGRIASYAM 109 (626) Q Consensus 30 r~~~~~~~~WdL~~ly~~~~~~e~~~dl~~~~~~i~~~~~~~~~~l~~~~~~~~~~~l~~~l~~~e~i~~~l~rv~~ya~ 109 (626) |++||+.++|||++||++.+.|+ .+++.++..+.+|.+..+|.+. .++++|.+++...|++...++|+++|++ T Consensus 1 R~~~~~~~~WDL~~L~~n~~~w~--~~~~~~~~~~k~~~~~~~g~~~-----~s~~~f~~~l~~~e~~~~~~~r~~nY~~ 73 (611) T TIGR00181 1 RSEVPKEYKWDLDDLYKNKEEWE--KALEALEEDIKEIKKFKKGTLL-----TSKETFLEALKLEEKILELLNRLYNYAS 73 (611) T ss_pred CCCCCCCCCCCHHHHCCCHHHHH--HHHHHHHHHHHHCCCCCCCCCC-----CCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 97777544357354308978999--9999876666404432466421-----6979999999987768988645432011 Q ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 64205799189999999999999999998874169887199899999984690011117999999974233171679987 Q gi|254780915|r 110 LSYNCNLSSPTIRKFYTDINAKLADFEKVLIFFALEINTLDEALLEQSYAQDPLTLKYSAWIKNIRKIKKHLLSNDMECL 189 (626) Q Consensus 110 l~~s~dt~d~~~~~~~~~~~~~~s~~~~~l~~~~~e~~~l~e~~l~~~~~~~~~l~~y~~~~~~l~~~~~~~l~~~~E~~ 189 (626) +..++|+.|+.++++.++++...++++.+++|+.|+++.++++++..|+.+. .|+.|+..+.++.+.+||+||+++|++ T Consensus 74 ~~~~~d~~d~~~~~~~~k~~~~~~~~~~a~s~~~~e~l~~~e~~I~~wL~~~-~L~~y~~~l~~~~~~k~H~LS~~~E~l 152 (611) T TIGR00181 74 MKLSTDVTDPEVNALSQKLSNLYTKVESATSFLSPEILEIEEEKIKEWLKDS-ELADYKIALEEILREKPHVLSEEVEKL 152 (611) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCCC-CCHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 4433465478899999999999999999999730798728678999860330-207899999999975177377589999 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHCC----CCCHH---HHHHHHH--CCCHHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 663001000367777776444433221003----47523---3213541--88876799999999998650542169999 Q gi|254780915|r 190 LSDTSQVGREALKRFFCENIESLRFKINDQ----KIPLT---KAYKSFF--DSDREVRKSAAKALSHTFNKSSHIFSFIT 260 (626) Q Consensus 190 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~---~~~~~l~--~~dr~~Rk~a~~a~~~~~~~~~~~~a~il 260 (626) |+..++..+++..++..+..++++++.... ..... ....+.. ++||++||+||+++.+.+.++++|+|.+| T Consensus 153 L~~~~~~~~s~~~iy~~L~~~D~~f~~~~~~~G~~~~~~~~~~~~~l~~dP~~d~~~Rk~Ay~~~~~~~~~~knT~A~~y 232 (611) T TIGR00181 153 LSALSEVFGSPSDIYSVLTNADMDFGSIKDKKGKKKPLTNSTYENFLQKDPNKDREIRKKAYESFYKAYKKHKNTLAALY 232 (611) T ss_pred HHHHHHHHCCHHHHHHHHHHHHCCHHHHHCCCCCEEEEEHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98764431004679999877423601200378953652035778887416898878899999999999986377899999 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCC----CCCCCHHHHHHCCCC Q ss_conf 9986667886542789998897622225898999999998763062899999999998708----875312344402565 Q gi|254780915|r 261 NTLAKDEEIQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNYYPKTSHRYYELKKKWLK----LDTMYFWDRLAPLPG 336 (626) Q Consensus 261 ~~iv~~r~e~A~llGy~s~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~lg----~~~l~~wD~~~~~~~ 336 (626) ...++.....|++|||+|+.++.|..+.+|..+++.+++.+.....|.++||.+++++.+| ++++++||++.|+++ T Consensus 233 ~~~v~~~~~~A~~r~y~S~~~~~L~~d~vp~~~~~~l~d~~~~~l~~~~~~Y~~~~kk~l~~~~~l~~~~~~D~~~~Lv~ 312 (611) T TIGR00181 233 YGNVKKNVALAKLRNYESAIDASLFSDEVPKEVYDNLYDTIKKNLAPVLQRYYKLRKKVLKKKLKLDKMEPYDLYLPLVK 312 (611) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCC T ss_conf 99998888887304871078885314787389999999999997337899999999998422202200144425334544 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCHHCCCCCCCCCCCCCCCCC--CCCEEECCCCCC-HHHHH Q ss_conf 5477799899999999866663289999999864248-5020103799874354766677--751010365332-46799 Q gi|254780915|r 337 TSQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTHN-WIDAPQYDGKGSGAFAHGTIPS--VHPYILLNYLGK-PQDVM 412 (626) Q Consensus 337 ~~~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~-~iD~~~r~gKr~GA~~~~~~~~--~~~~I~~N~~~~-~~dv~ 412 (626) ...+.+|+|+|+++|.+++.+||.||...+++.+.++ |||+++..|||+||||.|. .+ .+|||+|||+++ ++||. T Consensus 313 ~~~~~~~iE~Ak~~i~ksL~~LG~EY~~~~~~a~~~~RWvD~~en~gKr~GAYS~g~-y~Gl~~pYILMNw~~~~~~~~~ 391 (611) T TIGR00181 313 EKNPKVSIEEAKELILKSLEPLGEEYVKILKRAFSEERWVDYYENKGKRSGAYSIGG-YKGLVKPYILMNWDGTVLNSVF 391 (611) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCEECCC-CCCCCCCEEEECCCCCCHHHHH T ss_conf 336501789999999998765038889999984379971753566778555112378-8986654367144675022257 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHH-HHHHHHHHHHHHH Q ss_conf 999984589999961256877657800599988688899999999841389-98999999999862-0699999999999 Q gi|254780915|r 413 TLAHELGHGIHFVLSSETQGILTNNSSLTLAETASIFGETLTFDSLLQAAS-SKEERKILLANKIE-DMLNSIVRQISFY 490 (626) Q Consensus 413 TL~HE~GHa~H~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~~~~~~~~~~~-~~~~~~~ll~~~~~-~~~~~~~~q~~~~ 490 (626) ||+||+||+||++||+++||+...+|+|++|||||+|||.|+.+|++++.. |++++..+|.++|. ++.++++||++|| T Consensus 392 TLaHE~GHS~Hs~~s~~~Qp~~~~~Y~iF~AE~ASt~NE~LL~~YLL~~~~nD~~~k~~iL~~~l~~~f~~t~~rQ~~fa 471 (611) T TIGR00181 392 TLAHELGHSMHSYFSSKHQPYPNSDYSIFYAEVASTFNELLLADYLLKNSKNDPEMKIYILLERLSDNFVGTFFRQTLFA 471 (611) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEHHHHHHHHH T ss_conf 99998556799986541178544788547887304789999998778642799899999999987379615468898599 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH----CCCCCCCCCCC-CCCEEECCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 99999972120168999899999999999986----07763458878-88500152541066752589999999999999 Q gi|254780915|r 491 DFELKLHTERRSTGDIPTHRINEIWLETQKES----LGPAFDLSDLE-YGSFWMMVPHFIESSFYVYAYAFGNCLVNSLY 565 (626) Q Consensus 491 ~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~----~g~~~~~~~~~-~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~ 565 (626) .||+.+|+.+++|+++|++.|+++|.++.++| +|+.+.+++.. .+..|+|+||||.+|||+|+||+|.++|..++ T Consensus 472 ~fE~~~~~li~~g~~lt~~~L~~~y~~l~~~Y~g~k~gd~~~~~~~~~~~l~~~riPHFY~g~FYVYkYa~G~~a~~~~~ 551 (611) T TIGR00181 472 EFEYEAYELIEEGEPLTAETLTEIYANLLKKYTGFKFGDSVKIDEGKSSGLTWMRIPHFYMGNFYVYKYATGLVAATALY 551 (611) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999999986389978758999999999734871357754224000135113211311158840678799999999999 Q ss_pred HHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999567332899999998326999979999970889899889999999999999998609 Q gi|254780915|r 566 DIYKSNTVDCFKEKYLNILRAGNSKHYSELLLPLNINLSDPNFWERGLQTVEKMIDDVEKM 626 (626) Q Consensus 566 ~~~~~~~~~~~~~~y~~~L~~Ggs~~~~el~k~~G~d~~~~~~~~~~~~~~~~~i~~~e~l 626 (626) .+++++|. ...++|++||++|||++|+|++|.+|||||+|++|+++++.|+.+||++|++ T Consensus 552 ~~ik~~~~-~a~e~yl~fLKsGGs~~PlE~lk~aGvDl~~~~~~~~~~~~~~~~~d~~e~~ 611 (611) T TIGR00181 552 EKIKEEGK-PAVEKYLKFLKSGGSKYPLETLKLAGVDLTKPQPIQAAIKIFEDLIDELEEL 611 (611) T ss_pred HHHHCCCC-CHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 98615785-1378999885158980048999866876787378999999999899999509 No 3 >COG1164 Oligoendopeptidase F [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=866.80 Aligned_cols=590 Identities=34% Similarity=0.550 Sum_probs=558.7 Q ss_pred CCCHHHCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87844342477679357278889278999999999999999999742331023349989999999999999999999999 Q gi|254780915|r 27 MDSQENLGNLPRWNLEDLYPSHDSQEISNDMECIEHESLAFKTRWEGNLAHATNQKNCHSLGAAIAEYERICELIGRIAS 106 (626) Q Consensus 27 ~~~r~~~~~~~~WdL~~ly~~~~~~e~~~dl~~~~~~i~~~~~~~~~~l~~~~~~~~~~~l~~~l~~~e~i~~~l~rv~~ 106 (626) +|+|+++|+.++|||++||++.+.|+- .+.+..+++......++ ...++..|.++|...|++.+.++++++ T Consensus 1 ~~~r~~~~~~~~WDl~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~-------~~~s~~~l~~~l~~~e~~~~~l~~~~~ 71 (598) T COG1164 1 LKNRSEVPEKYTWDLSDLYPGDELWEA--EFLEESKELLKALEFYE-------LILSAEDLLEALEEYEKLEELLGRLYA 71 (598) T ss_pred CCCCCCCCCCCCCCHHHHCCCHHHHHH--HHHHHHHHHHHHHHHHH-------HCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 997443563345664752488055689--89999999988665576-------507999999999999999999988999 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 99864205799189999999999999999998874169887199899999984690011117999999974233171679 Q gi|254780915|r 107 YAMLSYNCNLSSPTIRKFYTDINAKLADFEKVLIFFALEINTLDEALLEQSYAQDPLTLKYSAWIKNIRKIKKHLLSNDM 186 (626) Q Consensus 107 ya~l~~s~dt~d~~~~~~~~~~~~~~s~~~~~l~~~~~e~~~l~e~~l~~~~~~~~~l~~y~~~~~~l~~~~~~~l~~~~ 186 (626) |+++..+.|++|+.++++.++++...++++..++|+.+++.+++++.+..++...+.|+.|++++.++++.++|++++.+ T Consensus 72 ya~~~~s~d~~d~~~~~~~~~~~~~~~~~~~~~~~~~~el~~~~e~~~~~~l~~~~~l~~y~~~l~~~~~~k~h~ls~~~ 151 (598) T COG1164 72 YASMKLSTDTTDEEAQKLYQKLEEFYADFSSALSFFEPELLELDEEVIQSLLTSGPELADYRFYLEELFRRKPHQLSAEE 151 (598) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHH T ss_conf 99988620346788999999999999999999999899998612999999983474210234999999872865489899 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHC----CCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 98766300100036777777644443322100----34752332135418887679999999999865054216999999 Q gi|254780915|r 187 ECLLSDTSQVGREALKRFFCENIESLRFKIND----QKIPLTKAYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNT 262 (626) Q Consensus 187 E~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~ 262 (626) |+++...+..+.+++.++.....++++++..+ +..++.+...++.++||++||+||+++.+.+.++.++++.+++. T Consensus 152 Ekll~~~~~~~~~~~~~~~~~~~~dl~~~~~~~~~g~~~~~~~~~~ll~~~Dr~~Rk~A~~a~~~~~~~~~~t~a~~l~~ 231 (598) T COG1164 152 EKLLAQLSPVGNSASNIFFDLTNSDLTFPDIGDAKGKKLTLEQLLNLLEDPDREVRKAAYEALYKAYEKHRNTLAALLNT 231 (598) T ss_pred HHHHHHCCHHHCCHHHHHHHHHCCCCCCCCCCCCCCCEECHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998613021012556554220201156444565655129889887629888899999999999999847899999999 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHC-CCCCCCCC Q ss_conf 86667886542789998897622225898999999998763062899999999998708875312344402-56554777 Q gi|254780915|r 263 LAKDEEIQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNYYPKTSHRYYELKKKWLKLDTMYFWDRLAP-LPGTSQDI 341 (626) Q Consensus 263 iv~~r~e~A~llGy~s~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~lg~~~l~~wD~~~~-~~~~~~~~ 341 (626) +++.+...|+.+||++..+..+.++.++..+++++..++.+..+|+++|+.+.+++.||++++++||+++| ..+...+. T Consensus 232 ~~~~~~~~a~~r~y~~~~~~~l~~~~i~~~~~d~l~~~v~~~~~~l~~ry~~~r~k~Lgl~~l~~yD~~~p~~~~~~~~~ 311 (598) T COG1164 232 LVKVLAFLARARNYDDVIDSALLRNEVDREVVDNLIESVKEAFLPLLHRYYKLRAKVLGLEKLRPYDLYAPLLDKDPSPE 311 (598) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCCCCCCC T ss_conf 99999888877368518899887557999999999999998640589999999998719985886550798445777756 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHH Q ss_conf 99899999999866663289999999864248502010379987435476667775101036533246799999984589 Q gi|254780915|r 342 IPFEVARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHG 421 (626) Q Consensus 342 ~~~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa 421 (626) ++++++++.|.+++.+||++|.+++++++.++|||++||+||||||||.+++.+.+|||+|||+|+.+||.||+||+||+ T Consensus 312 ~s~~ea~~~v~~~l~~lg~ey~~~~~~a~~~~WiD~~~~~gKrsGaYs~~~~~~~~p~IlmN~~gt~~dV~TLaHElGHs 391 (598) T COG1164 312 YSYEEAKELVLKALAPLGPEYAKIARRAFDERWIDVYPRKGKRSGAYSIGFYKGDHPFILMNYDGTLRDVFTLAHELGHS 391 (598) T ss_pred CCHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHH T ss_conf 66999999999999864889999999998738862356879998852689988888558763789654466874650189 Q ss_pred HHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99996125687765780059998868889999999984138998999999999862069999999999999999972120 Q gi|254780915|r 422 IHFVLSSETQGILTNNSSLTLAETASIFGETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERR 501 (626) Q Consensus 422 ~H~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~ 501 (626) +|+++++++||+.+.++|+++||+||+|||+++++|+++++.+++++..++..++.++..+++||++|+.||+++|+.++ T Consensus 392 ~Hs~~s~~~qp~~~~~~~i~~AEiAS~fnE~l~~~~ll~~~~~~~~~~~il~~~l~~~~~t~~rq~~f~~FE~~~h~~~~ 471 (598) T COG1164 392 VHSYFSRKHQPYLYADYSIFLAEIASTFNEMLLFDYLLERFKDPEERLAILEEKLEGFFATLFRQTLFAEFEHRVHELIE 471 (598) T ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999605576211567436659989999999999999874899999999999874389999999999999999998776 Q ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 16899989999999999998607763458878885001525410667525899999999999999999567332899999 Q gi|254780915|r 502 STGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKSNTVDCFKEKYL 581 (626) Q Consensus 502 ~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~~~~~~~~~~~~~y~ 581 (626) +++++|+++++++|.+++++|+|+++.+++. .+.+|+++||||.+|||||+|++|+++|.++|..++.++.+++.++|. T Consensus 472 ~~~~lt~~~l~~~~~~l~~~y~gd~~~~d~~-~~~~W~~ipHfy~~pFYvy~Ya~G~~~a~~l~~~~~~~~~~~~~~~y~ 550 (598) T COG1164 472 EGEELTAEELNELYLELQKEYYGDAVKLDEL-SGLEWARIPHFYHSPFYVYQYATGQLAALALYAKILTNDAEAFEKYYI 550 (598) T ss_pred HCCCCCHHHHHHHHHHHHHHHCCCCCCCCCC-CCCHHHHCCCEEECCCEEHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 0599999999999999998745886455864-443014247253248322076999999999999986567889999999 Q ss_pred HHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 998326999979999970889899889999999999999998609 Q gi|254780915|r 582 NILRAGNSKHYSELLLPLNINLSDPNFWERGLQTVEKMIDDVEKM 626 (626) Q Consensus 582 ~~L~~Ggs~~~~el~k~~G~d~~~~~~~~~~~~~~~~~i~~~e~l 626 (626) ++|++|||++|.|+++.+|||+++|+||+++++.+..+||++++| T Consensus 551 ~~Lk~GgS~~p~ell~~~GiD~~~~~~~~~a~~~~~~~ide~~~l 595 (598) T COG1164 551 AFLKSGGSKSPLELLKIAGIDLTTPDPWEEALAEFERLIDELEEL 595 (598) T ss_pred HHHHCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 998777999979999882988787148999999999999999998 No 4 >cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid. Probab=100.00 E-value=0 Score=708.90 Aligned_cols=423 Identities=35% Similarity=0.562 Sum_probs=400.1 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHH--HHHCCCCCHHH-HHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 30010003677777764444332--21003475233-2135418887679999999999865054216999999866678 Q gi|254780915|r 192 DTSQVGREALKRFFCENIESLRF--KINDQKIPLTK-AYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEE 268 (626) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~ 268 (626) .++..+..+|..++....++..+ ...|+..++++ ...++.++||++|++||+++.+++.+++++++.||+.+++.|. T Consensus 2 ~l~~~~~~a~~~~~~~~~~~~~~~~~~~G~~~~l~~~~~~~l~~~dr~~R~~A~~~~~~~~~~~~~~~a~il~~lv~~r~ 81 (427) T cd06459 2 KLSETGNSAWSEYYDLLNSDLKFIFEFDGEELTLSQNLSNLLESPDREVRKKAFEALYKAYEKYENTLAAILNTLVKLRL 81 (427) T ss_pred CCCCCCHHHHHHHHHHHHHCEEEEEEECCEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98875599999999998742578984499787518999988659499999999999999999818999999999999999 Q ss_pred HHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCCCCCCCHHHHH Q ss_conf 86542789998897622225898999999998763062899999999998708875312344402565547779989999 Q gi|254780915|r 269 IQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNYYPKTSHRYYELKKKWLKLDTMYFWDRLAPLPGTSQDIIPFEVAR 348 (626) Q Consensus 269 e~A~llGy~s~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~lg~~~l~~wD~~~~~~~~~~~~~~~ee~~ 348 (626) +.|++|||+|++++++..+.+++++++.+...+.. ..|.++++.+++++.+|+++|+|||+++|.+....+.+++++++ T Consensus 82 ~~a~~rgy~~~~~~~l~~~~~~~~v~~~l~~~~~~-~~p~~~ry~~~k~k~lg~~~l~~~D~~~p~~~~~~~~~~~~ea~ 160 (427) T cd06459 82 TLAKLRGYDSYLEAALFNNNIPEDVYDFLIAVVKE-NVPLLHRYLKLKKKLLGLDKLRPYDLYAPLVSGNPPKYTYEEAK 160 (427) T ss_pred HHHHHCCCCCHHHHHHHCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCCHHHHH T ss_conf 99998699989999867069869999999999999-57899999999999858850159881487888888868999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99998666632899999998642485020103799874354766677751010365332467999999845899999612 Q gi|254780915|r 349 DLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 349 ~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~ 428 (626) ++|.+++++||++|+++++++++++|||++||+||||||||+++++.++|||+|||+|+++||.||+||+|||+|+++++ T Consensus 161 ~~v~~~~~~~~~e~~~~~~~~~~~~wiD~~~r~gKr~Gay~~~~~~~~~p~IlmN~~gt~~dv~TLaHElGHa~H~~l~~ 240 (427) T cd06459 161 ELVLEALSPLGPEYAEFAKRAFEERWIDVEPRKGKRSGAYCTGLPPGKHPFILMNFNGTLDDVFTLAHELGHAFHSYLSR 240 (427) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99999998418999999999976687212578999988220798999888688835898531568898631899999983 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 56877657800599988688899999999841389989999999998620699999999999999999721201689998 Q gi|254780915|r 429 ETQGILTNNSSLTLAETASIFGETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPT 508 (626) Q Consensus 429 ~~~~~~~~~~~~~~aE~~S~~~E~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~ 508 (626) ++|++.+.++|+++||+||||||++++||+++...+++++..++.+.+.++.+++++|++++.||+++|+.+++|+.+|+ T Consensus 241 ~~~~~~~~~~~~~~AE~aS~~~E~ll~~~ll~~~~~~~~k~~~l~~~~~~~~~~~~~q~~~~~FE~~~y~~~~~g~~~t~ 320 (427) T cd06459 241 DNQPYLYSDYPIFLAEIASTFNELLLFDYLLKFAKDPEEKLYLLEHLLEDIRATLPRQTMFAEFEHEVYENPEEGEPLTA 320 (427) T ss_pred CCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH T ss_conf 28971204885789999889999999999998679999999999999999999999999999999999970651699989 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCC Q ss_conf 99999999999986077634588788850015254106675258999999999999999995673328999999983269 Q gi|254780915|r 509 HRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKSNTVDCFKEKYLNILRAGN 588 (626) Q Consensus 509 ~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~~~~~~~~~~~~~y~~~L~~Gg 588 (626) +.++++|.+++++|+|+.+.+. ...+++|+++||||.+|||||+|++|+++|+++|.+++++| +.+.++|+++|++|| T Consensus 321 ~~~~~~~~~~~~~y~g~~~~~~-~~~~~~W~~i~Hfy~~pFY~y~Ya~g~l~al~l~~~~~~~~-~~~~~~Y~~~L~~gg 398 (427) T cd06459 321 EELNEIYRELEKKYGGDLVEID-EEHGYEWARIPHFYYVPFYVYPYAFGQLAALALYAKYKEDG-EGAVEKYLELLKAGG 398 (427) T ss_pred HHHHHHHHHHHHHHCCCCCCCC-CCCCCEEECCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHCCC T ss_conf 9999999999998479856558-87686640316211889985888999999999999999767-769999999986729 Q ss_pred CCCHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 99979999970889899889999999999 Q gi|254780915|r 589 SKHYSELLLPLNINLSDPNFWERGLQTVE 617 (626) Q Consensus 589 s~~~~el~k~~G~d~~~~~~~~~~~~~~~ 617 (626) |++|+|++|.+|||+++|+||+++++.|+ T Consensus 399 s~~~~el~k~aGvd~t~~~~~~~~~~~~~ 427 (427) T cd06459 399 SKSPLELLKKAGVDLTSPDFWEEAIDVIE 427 (427) T ss_pred CCCHHHHHHHHCCCCCCHHHHHHHHHHHC T ss_conf 99999999982989898099999997629 No 5 >COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=620.48 Aligned_cols=563 Identities=16% Similarity=0.190 Sum_probs=431.1 Q ss_pred CCCHHHCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87844342477679357278889278999999999999999999742331023349989999999999999999999999 Q gi|254780915|r 27 MDSQENLGNLPRWNLEDLYPSHDSQEISNDMECIEHESLAFKTRWEGNLAHATNQKNCHSLGAAIAEYERICELIGRIAS 106 (626) Q Consensus 27 ~~~r~~~~~~~~WdL~~ly~~~~~~e~~~dl~~~~~~i~~~~~~~~~~l~~~~~~~~~~~l~~~l~~~e~i~~~l~rv~~ 106 (626) ..+...+..+|..| .|.+++-.|.|..-+.+.+..|.++. .+.++++|+|+|.++++++..++|+.+ T Consensus 9 ~~~ll~~~~lP~F~--~i~~ed~~paf~~~l~~~~~~i~~i~-----------~~~~~pt~~n~v~~Le~~~~~L~rv~~ 75 (683) T COG0339 9 SNPLLPFGGLPPFD--AIKPEDVKPAFDEALAQARAEIDAIA-----------ANPDAPTWENTVEPLERAGDRLGRVWS 75 (683) T ss_pred CCCCCCCCCCCCCC--CCCHHHCCHHHHHHHHHHHHHHHHHH-----------CCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 37678867899833--37833236799999999899999997-----------489999688899899987357787999 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH-------------------HHHHHHHCCHHHHHHHHHCCCCH--- Q ss_conf 99864205799189999999999999999998874-------------------16988719989999998469001--- Q gi|254780915|r 107 YAMLSYNCNLSSPTIRKFYTDINAKLADFEKVLIF-------------------FALEINTLDEALLEQSYAQDPLT--- 164 (626) Q Consensus 107 ya~l~~s~dt~d~~~~~~~~~~~~~~s~~~~~l~~-------------------~~~e~~~l~e~~l~~~~~~~~~l--- 164 (626) .++.+.+++++ ++.++..+.+.|+++++++.+.. ++++..++.+..++.|.+....| T Consensus 76 ~~~~l~~v~~~-~elr~a~e~~~pklse~~~~l~~~~~Lf~~~~al~~~~e~~~ld~e~~r~~e~~~rdF~~sGa~L~~~ 154 (683) T COG0339 76 VVSHLNSVHNS-PELREAYEEILPKLSEFSNDLGQNPGLFARYKALWQSPESAGLDPEQKRVVENALRDFVLSGAKLPPE 154 (683) T ss_pred HHHHHHHCCCC-HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 99998713898-89999999863999999998615899999999986284414499999999999999999832789978 Q ss_pred --HHHHHHHHHH----HHHHHHCCCHHHHHHHH--HHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHCC Q ss_conf --1117999999----97423317167998766--30010003677777764444332210034752---3321354188 Q gi|254780915|r 165 --LKYSAWIKNI----RKIKKHLLSNDMECLLS--DTSQVGREALKRFFCENIESLRFKINDQKIPL---TKAYKSFFDS 233 (626) Q Consensus 165 --~~y~~~~~~l----~~~~~~~l~~~~E~~l~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~l~~~ 233 (626) +++.....++ .++.++.|.+.....+. ...+..+.+..............+..++..++ +..+.+.++. T Consensus 155 ~k~r~~~I~~ela~L~~~Fs~nvL~at~~~~~~~~~~~~LaGlp~~~l~aa~~~A~~kg~~~~~itl~~p~~~p~l~~~~ 234 (683) T COG0339 155 KKARLAEINTELAELGAQFSNNVLDATKAWALIVTDEAELAGLPESLLAAAAAAAEAKGLEGYLITLDIPSYLPVLTYAD 234 (683) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCHHHHCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHCC T ss_conf 87899999999999999997650202235300048478766999899999999998468877488610642407988456 Q ss_pred CHHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCC-HHHHHHHHHHHHHCCHHHHHH Q ss_conf 87679999999999865054-2169999998666788654278999889762222589-899999999876306289999 Q gi|254780915|r 234 DREVRKSAAKALSHTFNKSS-HIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNVE-PYVIEALMQSVKNYYPKTSHR 311 (626) Q Consensus 234 dr~~Rk~a~~a~~~~~~~~~-~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~~-~~~~~~l~~~~~~~~~~~~~~ 311 (626) ||++|+++|+|+.++..+.. ++|.+|+.+++++|.+.|+|+||.||++++|.++|++ ++.+.+++..+.....+...+ T Consensus 235 ~r~lRe~~y~A~~~ra~~~~~~~n~~ii~~~L~LR~E~A~LLGf~~yA~~~L~~kMA~~p~~Vl~fL~~l~~ka~~~a~~ 314 (683) T COG0339 235 NRALREKLYRAYVTRASEGGPNDNRAIIEEILKLRAELAKLLGFSNYAEYSLADKMAKTPEAVLNFLNDLAEKARPQAEK 314 (683) T ss_pred CHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 69999999998764210058864179999999999999987496418888588775159999999999889986899999 Q ss_pred HHHHH----HH-HCCCCCCCHHHHHHCC-----------CCCCCCCCCHHHHHHHHHHHHHH-HHHHHHHH--------- Q ss_conf 99999----98-7088753123444025-----------65547779989999999986666-32899999--------- Q gi|254780915|r 312 YYELK----KK-WLKLDTMYFWDRLAPL-----------PGTSQDIIPFEVARDLVLQSYAK-FSPQMSII--------- 365 (626) Q Consensus 312 ~~~~~----~~-~lg~~~l~~wD~~~~~-----------~~~~~~~~~~ee~~~~v~~~~~~-~~~e~~~~--------- 365 (626) ..... ++ ..|+.++.|||+.++. ..+.+++||.+.+.+++.+..+. ||..+.+. T Consensus 315 e~a~L~~~~~~~~~~~~~l~~WD~~yyaeK~r~~~y~~de~elrpYF~l~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpD 394 (683) T COG0339 315 ELAELQAFAAEEEGGLPELQPWDWAYYAEKQRQEKYAFDEEELRPYFPLNKVLEGLFEVAKRLFGITFVERKDIPVWHPD 394 (683) T ss_pred HHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCC T ss_conf 99999999975406886356002899999987652599978842157735899999999998707377877999766898 Q ss_pred ---------HHHHHHCCCCCHHCCCCCCCCCCCCCCCCC----------CCCEEECCCCC---------CHHHHHHHHHH Q ss_conf ---------998642485020103799874354766677----------75101036533---------24679999998 Q gi|254780915|r 366 ---------AEKFFTHNWIDAPQYDGKGSGAFAHGTIPS----------VHPYILLNYLG---------KPQDVMTLAHE 417 (626) Q Consensus 366 ---------~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~----------~~~~I~~N~~~---------~~~dv~TL~HE 417 (626) -+..++-.|+|+|+|+||||||||.+++.+ +..|.+|||++ +++||+||||| T Consensus 395 Vr~~~v~d~~g~~~g~fY~DlyaR~~KrgGAWM~~~~~~~~~~~~~~q~PV~ylvCNF~~p~~gkPaLls~dEV~TLFHE 474 (683) T COG0339 395 VRVFEVFDENGELIGLFYLDLYARDGKRGGAWMDDFVSQRRLDDGGGQKPVIYLVCNFTKPVGGKPALLSHDEVTTLFHE 474 (683) T ss_pred CEEEEEECCCCCEEEEEEEECCCCCCCCCCHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCCEEEHHHHHHHHHH T ss_conf 22898884899778899842366788866217887652032067886454288852677899998723107889999987 Q ss_pred HHHHHHHHHHC-CCCCCCCCCCHHHHHHHHHHHHHHHHH----------HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 45899999612-568776578005999886888999999----------9984138998999999999862069999999 Q gi|254780915|r 418 LGHGIHFVLSS-ETQGILTNNSSLTLAETASIFGETLTF----------DSLLQAASSKEERKILLANKIEDMLNSIVRQ 486 (626) Q Consensus 418 ~GHa~H~~ls~-~~~~~~~~~~~~~~aE~~S~~~E~~~~----------~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q 486 (626) ||||||+|+++ +++.++|++|+|||||+||||||+||| ||.+++++|.++..++++++.++.+.+++|| T Consensus 475 fGHgLH~mlt~v~~~~vsGt~v~wDfVElPSQ~mE~w~~~p~vL~~~a~Hy~TGe~lP~~ll~k~laaknf~~g~~t~rq 554 (683) T COG0339 475 FGHGLHHLLTRVKYPGVSGTNVPWDFVELPSQFMENWCWEPEVLAKYARHYQTGEPLPKELLDKMLAAKNFQAGLFTLRQ 554 (683) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCHHHCCHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 53688887633874566787787302226489998741579999999876336996879999999998754207899999 Q ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCC--HHHHHHHHHHHHHH Q ss_conf 9999999999721201689998999999999999860776345887888500152541066752--58999999999999 Q gi|254780915|r 487 ISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFY--VYAYAFGNCLVNSL 564 (626) Q Consensus 487 ~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY--~y~Y~~~~~~a~~l 564 (626) +.|++||+.+|+..+. ....++.+++.++.++.. +. +.+..+.+..+|+|||++||- ||+|.|++|+++|. T Consensus 555 l~fal~Dm~~H~~~~~---~~~~~i~~~~~~~~~~~~---v~-~~~~~~~~~~~F~HIFagGYsAGYYSY~WaeVLsaDa 627 (683) T COG0339 555 LEFALFDMRLHTEFDP---DANADILEFEAEVLKKVA---VL-PSIPPRRRPHSFGHIFAGGYSAGYYSYLWAEVLSADA 627 (683) T ss_pred HHHHHHHHHHHCCCCC---CCCCCHHHHHHHHHHHHC---CC-CCCCHHCCCCCCCCEECCCCCCHHHHHHHHHHHHHHH T ss_conf 9999887886504785---442589999999998707---88-8765000456554100476431268899999985679 Q ss_pred HHHHHHCC--CHHHHHHHHH-HHHCCCCCCHHHHHHHC-CCCCCCHHHHH Q ss_conf 99999567--3328999999-98326999979999970-88989988999 Q gi|254780915|r 565 YDIYKSNT--VDCFKEKYLN-ILRAGNSKHYSELLLPL-NINLSDPNFWE 610 (626) Q Consensus 565 ~~~~~~~~--~~~~~~~y~~-~L~~Ggs~~~~el~k~~-G~d~~~~~~~~ 610 (626) |..|.++| ..+.+++|++ ||+.|||++|.+++|.| |.+++-.+..+ T Consensus 628 fa~Fee~g~~~~e~G~rfrd~ILs~GGS~dp~e~f~~frGrep~~dalLr 677 (683) T COG0339 628 FAAFEEEGPFNRETGQRFRDAILSRGGSRDPMELFKAFRGREPSIDALLR 677 (683) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCHHHHHH T ss_conf 99987638877899999999999655885899999998667998569998 No 6 >PRK10280 dipeptidyl carboxypeptidase II; Provisional Probab=100.00 E-value=0 Score=603.05 Aligned_cols=548 Identities=14% Similarity=0.084 Sum_probs=402.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999742331023349989999999999999999999999998642057991899999999999999999 Q gi|254780915|r 57 MECIEHESLAFKTRWEGNLAHATNQKNCHSLGAAIAEYERICELIGRIASYAMLSYNCNLSSPTIRKFYTDINAKLADFE 136 (626) Q Consensus 57 l~~~~~~i~~~~~~~~~~l~~~~~~~~~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~dt~d~~~~~~~~~~~~~~s~~~ 136 (626) .+.+++.+.+..+..+..|..|.++.++++|.|+|.++|++++.++|+.+.++...+++++ ++.++++.++.|++++++ T Consensus 25 ~eh~~pA~~~~l~~~~a~I~~I~~~~~~pTfeNti~aLE~~g~~L~rv~~~~~~L~sv~~~-~elr~~~~~~~p~ls~~~ 103 (681) T PRK10280 25 DHHYRPAFDEGVRQKRAEIAAIALNPQAPDFNNTILALEQSGELLTRVTSVFFAMTAAHTN-DELQRLDEQFSAELAELA 103 (681) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHHHH T ss_conf 8887999999999989999999749999998999999998617999999999999815899-999999998569999997 Q ss_pred HHHHHHHHHH-------------------HHCCHHHHHHHHHCCCC-----HHHHHHHHHHHHH----HHHHCCCHHHHH Q ss_conf 9887416988-------------------71998999999846900-----1111799999997----423317167998 Q gi|254780915|r 137 KVLIFFALEI-------------------NTLDEALLEQSYAQDPL-----TLKYSAWIKNIRK----IKKHLLSNDMEC 188 (626) Q Consensus 137 ~~l~~~~~e~-------------------~~l~e~~l~~~~~~~~~-----l~~y~~~~~~l~~----~~~~~l~~~~E~ 188 (626) +.+......+ .++-+..+..|....-. .++++....++.. +..+++...... T Consensus 104 ~~i~~n~~Lf~ri~~l~~~~~~~~L~~e~~rlle~~~~dF~~sG~~L~~~~r~rl~~i~~eLs~L~~~F~~Nvl~at~~~ 183 (681) T PRK10280 104 NDIYLNGALFARVDAVWQQRESLGLDSESIRLVEVIHQRFVLAGAKLAQADKAKLKVLNTEAATLTSQFNQRLLAANKSG 183 (681) T ss_pred HHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 06546899999999998185425899999999999999999768878977899999999999999999999876764357 Q ss_pred --HHHHHHHHCCCHHHHHHHHHHHHHHHH-HHCCCC---CHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC-CCCHHHHHH Q ss_conf --766300100036777777644443322-100347---5233213541888767999999999986505-421699999 Q gi|254780915|r 189 --LLSDTSQVGREALKRFFCENIESLRFK-INDQKI---PLTKAYKSFFDSDREVRKSAAKALSHTFNKS-SHIFSFITN 261 (626) Q Consensus 189 --~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~-~~~~a~il~ 261 (626) .+....+..+.+..............+ ..++.. ..+..+.+.+++||++|+++|+|+.+++.++ .+++.+|+. T Consensus 184 ~l~v~d~~~L~Glp~~~i~~a~~~A~~~g~~g~~~itl~~ps~~p~l~~a~nr~lRe~~y~A~~~Ra~~~~~~dN~~ii~ 263 (681) T PRK10280 184 GLVVDDIHQLAGLSEQEIALAAEAAREKGLDNRWLIPLLNTTQQPALAELRDRQTRENLFAAGWTRAEKNDANDTRAIIQ 263 (681) T ss_pred EEECCCHHHHCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 25537689857999999999999998628998758852376400788853688999999999986640488643399999 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHCCC-CHHHHHHHHHHHHHCCHHHHHHHHHHHH----HHCCCCCCCHHHHHHCC-- Q ss_conf 9866678865427899988976222258-9899999999876306289999999999----87088753123444025-- Q gi|254780915|r 262 TLAKDEEIQDRWRKYEKIADSRHLSNNV-EPYVIEALMQSVKNYYPKTSHRYYELKK----KWLKLDTMYFWDRLAPL-- 334 (626) Q Consensus 262 ~iv~~r~e~A~llGy~s~~~~~l~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~----~~lg~~~l~~wD~~~~~-- 334 (626) +|+++|.+.|+|+||+||++++|..+|+ +++.+..++..+.....+.+.+...... +..|...+.|||+.++. T Consensus 264 eil~LR~e~A~LLGy~syA~~~L~~kMAksp~~V~~FL~~L~~~a~~~a~~E~~~l~~~~~~~~~~~~l~~WD~~yy~ek 343 (681) T PRK10280 264 RLVEIRAQQAKLLGFPHYAAWKIADQMAKTPEAALNFMREIVPAARQRASDELASIQAVIDKQQGGFSAQAWDWAFYAEQ 343 (681) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 99999999999859996999999876159999999999999999899999999999999997079888986569999998 Q ss_pred ---------CCCCCCCCCHHHHH-HHHHHHHH-HHHHHHHHHH------------------HHHHHCCCCCHHCCCCCCC Q ss_conf ---------65547779989999-99998666-6328999999------------------9864248502010379987 Q gi|254780915|r 335 ---------PGTSQDIIPFEVAR-DLVLQSYA-KFSPQMSIIA------------------EKFFTHNWIDAPQYDGKGS 385 (626) Q Consensus 335 ---------~~~~~~~~~~ee~~-~~v~~~~~-~~~~e~~~~~------------------~~~~~~~~iD~~~r~gKr~ 385 (626) ..+.+++||.+.+. +++....+ .||..+.+.. ...++..|+|+|+|+||+| T Consensus 344 ~r~~~~~~d~~~lr~YFpl~~Vl~~Glf~~~~~Lfgi~f~e~~~~~vwh~dV~~~~V~d~~g~~lg~fYlD~~~R~~K~~ 423 (681) T PRK10280 344 VRREKYALDEAQLKPYFELNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVWEIFDHNGVGLALFYGDFFARDSKSG 423 (681) T ss_pred HHHHHCCCCHHHCCCCCCHHHHHHHHHHHHHHHHHCEEEEECCCCCCCCCCEEEEEEECCCCCEEEEEECCCCCCCCCCC T ss_conf 78875276655405776399999999999999983868786587876578747999985899767678525566788877 Q ss_pred CCCCCCCCCC-------CCCEEECCCCC---------CHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCHHHHHHHHHH Q ss_conf 4354766677-------75101036533---------2467999999845899999612-56877657800599988688 Q gi|254780915|r 386 GAFAHGTIPS-------VHPYILLNYLG---------KPQDVMTLAHELGHGIHFVLSS-ETQGILTNNSSLTLAETASI 448 (626) Q Consensus 386 GA~~~~~~~~-------~~~~I~~N~~~---------~~~dv~TL~HE~GHa~H~~ls~-~~~~~~~~~~~~~~aE~~S~ 448 (626) ||||.+++.+ +..+++|||+. +++||+|||||||||||+||++ +++.++|++++|||||+||| T Consensus 424 gAwm~~~~~~~~~~~~~Pv~~lvcNf~~p~~~~psLL~~~eV~TlFHEFGH~lH~lls~~~y~~~sGt~v~~DfVE~PSQ 503 (681) T PRK10280 424 GAWMGNFVEQSTLNETRPVIYNVCNYQKPAAGQPALLLWDDVITLFHEFGHTLHGLFARQRYATLSGTNTPRDFVEFPSQ 503 (681) T ss_pred CCHHHCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCHH T ss_conf 66010231445568988747888256799999886556999999999998998887407875732487786228857799 Q ss_pred HHHHHHH----------HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 8999999----------998413899899999999986206999999999999999997212016899989999999999 Q gi|254780915|r 449 FGETLTF----------DSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLET 518 (626) Q Consensus 449 ~~E~~~~----------~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~ 518 (626) +||+||| ||.+++++|+++..+|+.++.++.+..+.||+.+++||+.+|......... ++..+.... T Consensus 504 lmE~w~~~~~vL~~~a~Hy~TgeplP~~l~~kl~~a~~~~~g~~~~rql~~a~~Dl~lH~~~~~~~~~---~~~~~~~~~ 580 (681) T PRK10280 504 INEHWASHPQVFARYARHYQSGEAMPDELQQKMRNASLFNKGYDMSELLSAALLDMRWHCLEENEAMQ---DVDDFELRA 580 (681) T ss_pred HHHHHHCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC---CHHHHHHHH T ss_conf 99988568999852234567999896999999999861546999999999999899995788333434---699999999 Q ss_pred HHHHCCCCCCCCCCCCCCEEECCCCCCCCCC--CHHHHHHHHHHHHHHHHHHHHCC--CHHHHHHHHH-HHHCCCCCCHH Q ss_conf 9986077634588788850015254106675--25899999999999999999567--3328999999-98326999979 Q gi|254780915|r 519 QKESLGPAFDLSDLEYGSFWMMVPHFIESSF--YVYAYAFGNCLVNSLYDIYKSNT--VDCFKEKYLN-ILRAGNSKHYS 593 (626) Q Consensus 519 ~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~f--Y~y~Y~~~~~~a~~l~~~~~~~~--~~~~~~~y~~-~L~~Ggs~~~~ 593 (626) +.+.. +.++....++..++|+|||.+|| .||+|.|++|+|+|+|..|+++| +.+.+.+|++ ||+.|||++|. T Consensus 581 l~~~~---~~~~~~~~~~~~~~F~Hif~gGY~AgYYsYlWsevlaaD~f~~F~~~g~~n~~~G~~~r~~iL~~Ggs~d~~ 657 (681) T PRK10280 581 LVAEN---MDLPAVPPRYRSSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGGLTRENGQRFREAILSRGNSEDLE 657 (681) T ss_pred HHHHC---CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCHH T ss_conf 99846---788889997789976646578807868999999999999999998769999999999999973078982799 Q ss_pred HHHHHC-CCCCCCHHHHHH Q ss_conf 999970-889899889999 Q gi|254780915|r 594 ELLLPL-NINLSDPNFWER 611 (626) Q Consensus 594 el~k~~-G~d~~~~~~~~~ 611 (626) ++++.| |.+++...+++. T Consensus 658 ~~~~~F~GR~P~~~alL~~ 676 (681) T PRK10280 658 RLYRQWRGHAPQIMPMLQH 676 (681) T ss_pred HHHHHHCCCCCCHHHHHHH T ss_conf 9999817999982799996 No 7 >PRK10911 oligopeptidase A; Provisional Probab=100.00 E-value=0 Score=591.14 Aligned_cols=545 Identities=15% Similarity=0.167 Sum_probs=399.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999974233102334998999999999999999999999999864205799189999999999999999998 Q gi|254780915|r 59 CIEHESLAFKTRWEGNLAHATNQKNCHSLGAAIAEYERICELIGRIASYAMLSYNCNLSSPTIRKFYTDINAKLADFEKV 138 (626) Q Consensus 59 ~~~~~i~~~~~~~~~~l~~~~~~~~~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~dt~d~~~~~~~~~~~~~~s~~~~~ 138 (626) .+.+.+.+..+..+..|..|..+.++++|.|+|.++|+++..++++...++...+++++ ++.++...++.++++++.+. T Consensus 21 ~~~pA~~~~i~~~~~~i~~I~~~~~~~tfeNti~~le~~~~~L~rv~~~~~~L~~v~~~-~elr~a~~~~~p~ls~~~~~ 99 (680) T PRK10911 21 HVVPAVTKALNDCREAVERVVAQGAPYTWENLCQPLAEVDDVLGRIFSPVSHLNSVKNS-PELREAYEQTLPLLSEYSTW 99 (680) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHHHHHHH T ss_conf 85999999999999999999859999998999999999758999999999999705998-99999999968999999998 Q ss_pred HHHHHHHHHH-------------------CCHHHHHHHHHCCCCH-----HHHHHHHHHHHH----HHHHCCCHHH--HH Q ss_conf 8741698871-------------------9989999998469001-----111799999997----4233171679--98 Q gi|254780915|r 139 LIFFALEINT-------------------LDEALLEQSYAQDPLT-----LKYSAWIKNIRK----IKKHLLSNDM--EC 188 (626) Q Consensus 139 l~~~~~e~~~-------------------l~e~~l~~~~~~~~~l-----~~y~~~~~~l~~----~~~~~l~~~~--E~ 188 (626) +......+.+ +-++.+..|....-.| .+++....++.. +..+++.+.. .. T Consensus 100 i~~n~~Ly~~~k~v~~~~~~~~L~~e~~rlle~~l~dF~rsG~~L~~~~r~rl~~i~~eLs~L~~~F~~Nvl~~t~~~~~ 179 (680) T PRK10911 100 VGQHEGLYQAYRDLRDGDHYATLNTAQKKAVDNALRDFELSGIGLPKEKQQRYGEIATRLSELGNQYSNNVLDATMGWTK 179 (680) T ss_pred HHCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE T ss_conf 75779999999999739332459999999999999999985898997899999999999999999999864665316457 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHCCCHHHHHHHHHHHHHHHHHC-----CCCHHHHH Q ss_conf 76630010003677777764444332210034752---33213541888767999999999986505-----42169999 Q gi|254780915|r 189 LLSDTSQVGREALKRFFCENIESLRFKINDQKIPL---TKAYKSFFDSDREVRKSAAKALSHTFNKS-----SHIFSFIT 260 (626) Q Consensus 189 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~-----~~~~a~il 260 (626) ++....+..+.+.................++..++ +..+.+.+++||++|+++|+|+..++.+. +.+|.+++ T Consensus 180 ~~~d~~eL~GlP~~~l~~a~~~a~~~~~~g~~itl~~p~~~pvl~~~~~r~lRk~~~~A~~~Ra~~~g~~~g~~dN~~il 259 (680) T PRK10911 180 LITDEAELAGMPESALAAAKAQAEAKEQEGYLLTLDIPSYLPVMTYCDNQALREEMYRAYSTRASDQGPNAGKWDNSEVM 259 (680) T ss_pred EECCHHHHCCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH T ss_conf 73368886379989999999987754898627837786366898607779999999999864241016765665559999 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHCCC-CHHHHHHHHHHHHHCCHHHHHHHHHHH----HHHCCCCCCCHHHHHHCCC Q ss_conf 99866678865427899988976222258-989999999987630628999999999----9870887531234440256 Q gi|254780915|r 261 NTLAKDEEIQDRWRKYEKIADSRHLSNNV-EPYVIEALMQSVKNYYPKTSHRYYELK----KKWLKLDTMYFWDRLAPLP 335 (626) Q Consensus 261 ~~iv~~r~e~A~llGy~s~~~~~l~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~----~~~lg~~~l~~wD~~~~~~ 335 (626) .+|+++|.++|+|+||+||++++|..+|+ +++.+..++..+.....+.+.+..... ++..|.++|.|||+.++.. T Consensus 260 ~eil~LR~E~A~LLGy~s~A~~~l~~kMAksp~~V~~fL~~L~~~~~~~a~~E~~~L~~~~~~~~~~~~l~pWD~~yy~~ 339 (680) T PRK10911 260 EEILALRHELAQLLGFENYAFKSLATKMAENPQQVLDFLTDLAKRARPQGEKELAQLRAFAKAEFGVDELQPWDIAYYSE 339 (680) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 99999999999983998499999987713999999999999999989999999999999986447967778646999999 Q ss_pred -----------CCCCCCCCHHHHHHHHHHHHHH-HHHHHHHHH------------------HHHHHCCCCCHHCCCCCCC Q ss_conf -----------5547779989999999986666-328999999------------------9864248502010379987 Q gi|254780915|r 336 -----------GTSQDIIPFEVARDLVLQSYAK-FSPQMSIIA------------------EKFFTHNWIDAPQYDGKGS 385 (626) Q Consensus 336 -----------~~~~~~~~~ee~~~~v~~~~~~-~~~e~~~~~------------------~~~~~~~~iD~~~r~gKr~ 385 (626) .+.+++||.+.+.+++....+. ||..+.+.- ...++..|+|+|+|+|||+ T Consensus 340 ~~r~~~~~~d~~~l~~YFpl~~vl~Glf~~~~~Lfgi~~~~~~~~~~wh~dV~~~~V~d~~~~~lG~~YlDl~~R~~K~~ 419 (680) T PRK10911 340 KQKQHLYSISDEQLRPYFPENKAVNGLFEVVKRIYGITAKERKDVDVWHPDVRFFELYDENNELRGSFYLDLYARENKRG 419 (680) T ss_pred HHHHHHCCCCHHHHHCCCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCCCC T ss_conf 98765322455665233228999999999999971737786688877887666999995799778999840566788979 Q ss_pred CCCCCCCCCC----------CCCEEECCCCC---------CHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCC-CHHHHHH Q ss_conf 4354766677----------75101036533---------2467999999845899999612-568776578-0059998 Q gi|254780915|r 386 GAFAHGTIPS----------VHPYILLNYLG---------KPQDVMTLAHELGHGIHFVLSS-ETQGILTNN-SSLTLAE 444 (626) Q Consensus 386 GA~~~~~~~~----------~~~~I~~N~~~---------~~~dv~TL~HE~GHa~H~~ls~-~~~~~~~~~-~~~~~aE 444 (626) ||||++++.+ +...++|||+. ++++|+|||||||||||++|++ +++.++|++ ++||||| T Consensus 420 ga~~~~~~~~~~~~~g~~~~Pv~~lvcNf~~p~~~~p~LL~~~eV~TlFHEfGH~lH~lls~~~y~~~sGt~~v~~DfVE 499 (680) T PRK10911 420 GAWMDDCVGQMRKADGSLQKPVAYLTCNFNRPVNGKPALFTHDEVITLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVE 499 (680) T ss_pred CCCHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH T ss_conf 85243453143278998315435787157899999888668999999999962999998717875565787767883231 Q ss_pred HHHHHHHHHHH----------HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 86888999999----------99841389989999999998620699999999999999999721201689998999999 Q gi|254780915|r 445 TASIFGETLTF----------DSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEI 514 (626) Q Consensus 445 ~~S~~~E~~~~----------~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~ 514 (626) +|||+||+||| ||.+++++|+++..++++++.++.+..+.+|+.+++||+++|.... +....++.++ T Consensus 500 ~PSql~E~w~~~~~vL~~~a~H~~TgeplP~~l~~~l~~a~~~~~g~~~~~ql~~a~~Dl~lH~~~~---~~~~~~~~~~ 576 (680) T PRK10911 500 LPSQFMENWCWEPEALAFISGHYETGEPLPKELLDKMLAAKNYQAALFILRQLEFGLFDFRLHAEFD---PDQGAKILET 576 (680) T ss_pred CCHHHHHHHHCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC---CCCCCCHHHH T ss_conf 6679998760679999861202668880849999999999887389999999999999999857788---6544579999 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCC--CHHHHHHHHHHHHHHHHHHHHCC--CHHHHHHHHH-HHHCCCC Q ss_conf 99999986077634588788850015254106675--25899999999999999999567--3328999999-9832699 Q gi|254780915|r 515 WLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSF--YVYAYAFGNCLVNSLYDIYKSNT--VDCFKEKYLN-ILRAGNS 589 (626) Q Consensus 515 ~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~f--Y~y~Y~~~~~~a~~l~~~~~~~~--~~~~~~~y~~-~L~~Ggs 589 (626) +.++.+++.. ++.+..+.+.++|+|+|.+|| -||+|.|++|+|+++|..|+++| +++.+++|++ ||+.||| T Consensus 577 ~~~~~~~~~~----~p~~~~~~~~~~F~Hif~ggY~AgYYsYlwsevlaad~~~~f~~~g~~~~~~G~~~r~~iL~~Ggs 652 (680) T PRK10911 577 LAEIKKQVAV----VPSPSWGRFPHAFSHIFAGGYAAGYYSYLWADVLAADAFSRFEEEGIFNRETGQSFLDNILSRGGS 652 (680) T ss_pred HHHHHHHCCC----CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCC T ss_conf 9999863475----789999987751556567880042299999999999999999865999899999999998317899 Q ss_pred CCHHHHHHHC-CCCCCCHHHHHH Q ss_conf 9979999970-889899889999 Q gi|254780915|r 590 KHYSELLLPL-NINLSDPNFWER 611 (626) Q Consensus 590 ~~~~el~k~~-G~d~~~~~~~~~ 611 (626) ++|.|+++.| |.+++...+++. T Consensus 653 ~~~~e~~~~F~GR~P~~~alL~~ 675 (680) T PRK10911 653 EEPMELFKRFRGREPQLDAMLEH 675 (680) T ss_pred CCHHHHHHHHCCCCCCHHHHHHH T ss_conf 08999999836999983789986 No 8 >TIGR02289 M3_not_pepF oligoendopeptidase, M3 family; InterPro: IPR011976 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists of probable oligoendopeptidases belonging to MEROPS peptidase family M3, subfamily M3B (clan MA(E)). The family is related to lactococcal PepF and group B streptococcal PepB (IPR004438 from INTERPRO) but in a distinct clade with considerable sequence differences not only to IPR004438 from INTERPRO but also to IPR011977 from INTERPRO. Likely substrates are small peptides and not whole proteins, as with PepF, but members are not characterised and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.. Probab=100.00 E-value=0 Score=562.45 Aligned_cols=538 Identities=18% Similarity=0.233 Sum_probs=447.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH----HHHHHHHHHHH Q ss_conf 99999999999974233102334998999999999999999999999999864205799189999----99999999999 Q gi|254780915|r 59 CIEHESLAFKTRWEGNLAHATNQKNCHSLGAAIAEYERICELIGRIASYAMLSYNCNLSSPTIRK----FYTDINAKLAD 134 (626) Q Consensus 59 ~~~~~i~~~~~~~~~~l~~~~~~~~~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~dt~d~~~~~----~~~~~~~~~s~ 134 (626) ++++.+.++..++ .+..+..++.+.|....++...+......++.+.++||+|+++.+ ..+.+.|++.+ T Consensus 3 ~~e~~~~~lle~~-------~~s~~~~E~~~~i~~~~~~~~~i~~~~~~~yI~~~~dtdde~f~~~~~~f~~~~~P~l~~ 75 (553) T TIGR02289 3 ELEKTLKELLEKF-------ISSKSVEEQEKIINEVNDVIDDIEEMITLAYIRHSVDTDDEEFYDEEEHFQDEIKPLLKR 75 (553) T ss_pred HHHHHHHHHHHHH-------CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 3489999998742-------064466899999999999999998767664440011587278999997566563678998 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 99988741698871998999999846900111179999999742331716799876630010003677777764444332 Q gi|254780915|r 135 FEKVLIFFALEINTLDEALLEQSYAQDPLTLKYSAWIKNIRKIKKHLLSNDMECLLSDTSQVGREALKRFFCENIESLRF 214 (626) Q Consensus 135 ~~~~l~~~~~e~~~l~e~~l~~~~~~~~~l~~y~~~~~~l~~~~~~~l~~~~E~~l~~~~~~~~~~~~~~~~~~~~~~~~ 214 (626) +++.+. ..+. +...+..+.. ..|.++ ....+....+-+++ .+... .-+...++-+..++++.+. T Consensus 76 y~~~~~---~~~~---espFr~~L~~----~~y~~l-~~~~~~~~klF~ek---~i~l~--~kE~~L~t~Y~~i~~~~~i 139 (553) T TIGR02289 76 YNTKFD---QKIV---ESPFREELDE----RFYELL-FKLIKNKLKLFSEK---NIPLL--QKENKLSTKYREIISNIKI 139 (553) T ss_pred HHHHHH---HHHH---CCCCHHHHHH----HHHHHH-HHHHHHHHHHCCCC---CCHHH--HCCHHHHHHHHHHHCCCEE T ss_conf 878998---7762---3877656875----222689-99999987606754---34123--0120454335455328702 Q ss_pred HHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH---HHCCCCH Q ss_conf 2100347523321354188876799999999998650542169999998666788654278999889762---2225898 Q gi|254780915|r 215 KINDQKIPLTKAYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRH---LSNNVEP 291 (626) Q Consensus 215 ~~~~~~~~~~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l---~~~~~~~ 291 (626) .+.|+..|+++...++.+|||..||+|+.|..+....++..+..||+++|+.|++.|+.+||.||.|+.. .+-+++. T Consensus 140 ~f~Ge~kTl~ql~~~L~~~~R~~RK~A~~a~~~~~~~~~~Eld~I~deLv~~R~~~A~~lGf~NY~dy~y~~~~RtDY~~ 219 (553) T TIGR02289 140 DFEGEEKTLSQLKPYLQDPDRSTRKKAWEARSEFFAEVEEELDRIYDELVEIRTKIAKNLGFENYVDYMYKLLNRTDYSA 219 (553) T ss_pred ECCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCCH T ss_conf 11301332876256607987678999999999799867899987999999999999987040105899886416787898 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHCC-CCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999987630628999999999987088753123444025-6554-7779989999999986666328999999986 Q gi|254780915|r 292 YVIEALMQSVKNYYPKTSHRYYELKKKWLKLDTMYFWDRLAPL-PGTS-QDIIPFEVARDLVLQSYAKFSPQMSIIAEKF 369 (626) Q Consensus 292 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~lg~~~l~~wD~~~~~-~~~~-~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~ 369 (626) ..+..+.+.|.+..+|+..+....+++.||+++++|||..+.. ++.. .|.-+.+...+.....++.++.++.++++.| T Consensus 220 ~d~y~y~e~v~KyvvP~~~~~~~~~~~rLgi~~l~pwD~s~~~~~gn~L~P~~~~~~~~~~~~~my~~l~~~f~~~F~~m 299 (553) T TIGR02289 220 EDVYKYRESVLKYVVPLLTKLRKRKKKRLGIEKLRPWDESAVFLDGNVLKPFGNVDFLLEKAEKMYKELSLEFKEFFNFM 299 (553) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCHHHHHHHHHH T ss_conf 99999987665411425899889988605611114201576788678668866778999999999984087899999988 Q ss_pred HHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCCHHHHHHHHHH Q ss_conf 424850201037998743547666777510103653324679999998458999996125-6877657800599988688 Q gi|254780915|r 370 FTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSSE-TQGILTNNSSLTLAETASI 448 (626) Q Consensus 370 ~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~~-~~~~~~~~~~~~~aE~~S~ 448 (626) ++++.+|+..|+||++||||..++.+..|||+|||+||.+||.||+||+|||+|.+-+.+ ..+..+..++.+.||++|+ T Consensus 300 ~~~nlLDL~~r~~Ka~GGyC~~Lp~~~~pFiFsNfngt~~Di~~l~HE~GHafh~Y~~~k~~~~~~Y~~~~~E~AEl~SM 379 (553) T TIGR02289 300 LEKNLLDLESRKGKAAGGYCTYLPKYKAPFIFSNFNGTSGDIDVLTHEAGHAFHVYESRKNDLLPEYRWPTYEAAELASM 379 (553) T ss_pred HHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHH T ss_conf 75288343137898777431535566786078647888313678888656889766642467774346864679987764 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf 89999999984138998999999999862069999999999999999972120168999899999999999986077634 Q gi|254780915|r 449 FGETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFD 528 (626) Q Consensus 449 ~~E~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~ 528 (626) .||.++|+|+-.-+.+++...+.....+.++..+++..+..+.|+|+||+.| ..||++.++.|++|.++|.-+.+. T Consensus 380 smEll~~~~~~~Fy~d~~d~~k~k~~~l~~aL~fLp~~~~vD~FQHwvYeNP----~~TP~ER~~~y~~l~k~Y~~~~~~ 455 (553) T TIGR02289 380 SMELLSMPWMDLFYTDEEDLKKAKISHLKGALSFLPYGVIVDHFQHWVYENP----NHTPEERDEKYAELEKRYQPSEVY 455 (553) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCC----CCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 2888665667642079546758999999999999988888503124144488----888578999999999731787666 Q ss_pred C--CCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCC-C Q ss_conf 5--887888500152541066752589999999999999999956733289999999832699997999997088989-9 Q gi|254780915|r 529 L--SDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKSNTVDCFKEKYLNILRAGNSKHYSELLLPLNINLS-D 605 (626) Q Consensus 529 ~--~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~~~~~~~~~~~~~y~~~L~~Ggs~~~~el~k~~G~d~~-~ 605 (626) . +..+.+.+|.+.+|+|..||||+.||+||++|+++|.++++++.+ +.+.|.++|+.|||++..|+++.+|+..- + T Consensus 456 ~~~~~~e~gt~W~~q~Hif~~PFYYIeYtlAQigALq~~~~y~ed~~~-al~~Y~~~~~~G~S~s~~~l~~~agl~~~F~ 534 (553) T TIGR02289 456 EEDLELEKGTFWLRQLHIFEVPFYYIEYTLAQIGALQIYKKYKEDPEK-ALEDYKKLCSLGGSQSFLELYETAGLTFPFS 534 (553) T ss_pred CCCCCCCCCCEEEECCCEEECCCCHHHHHHHHHHHHHHHHHHHHCHHH-HHHHHHHHHHCCCCCCHHHHHHHHCCCCCCC T ss_conf 777320224110100443613730767789999999999875207889-9999999970789987899997627747876 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8899999999999999986 Q gi|254780915|r 606 PNFWERGLQTVEKMIDDVE 624 (626) Q Consensus 606 ~~~~~~~~~~~~~~i~~~e 624 (626) ++.++..+.+|..+|++++ T Consensus 535 ~~~ikei~~~v~~~l~~i~ 553 (553) T TIGR02289 535 EEIIKEIVSFVEKLLEEIE 553 (553) T ss_pred CCHHHHHHHHHHHHHHHCC T ss_conf 6436788999999997329 No 9 >pfam01432 Peptidase_M3 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but functions within the mitochondria to remove the leader sequence. Probab=100.00 E-value=0 Score=551.49 Aligned_cols=382 Identities=23% Similarity=0.285 Sum_probs=317.7 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCC-CHHHHHHHHHHHHHCCHH Q ss_conf 5418887679999999999865054216999999866678865427899988976222258-989999999987630628 Q gi|254780915|r 229 SFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNV-EPYVIEALMQSVKNYYPK 307 (626) Q Consensus 229 ~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~-~~~~~~~l~~~~~~~~~~ 307 (626) +.+++||++||++|+|+.+++.++.++++.+|++++++|.+.|++|||+|++++++.+.|+ +++.+..++..+.+...| T Consensus 2 L~~~~dr~~Rk~a~~a~~~~~~~~~~~~~~~l~~lv~~R~~~A~llGy~s~a~~~l~~~m~~~p~~v~~fL~~l~~~~~p 81 (448) T pfam01432 2 LKESPDRETRKKAYRAFYSRAEANRNENSALLEELLKLRAELAKLLGYPSYAEASLEDKMAKIPETVYDFLEELVNKLRP 81 (448) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 88790999999999999999870756899999999999999999779998999998754278999999999999999799 Q ss_pred HHHHHHHHHHHHCC----CCCCCHHHHHHCC------------CCCCCCCCCHHHHHH-HHHHHHH-HHH---------H Q ss_conf 99999999998708----8753123444025------------655477799899999-9998666-632---------8 Q gi|254780915|r 308 TSHRYYELKKKWLK----LDTMYFWDRLAPL------------PGTSQDIIPFEVARD-LVLQSYA-KFS---------P 360 (626) Q Consensus 308 ~~~~~~~~~~~~lg----~~~l~~wD~~~~~------------~~~~~~~~~~ee~~~-~v~~~~~-~~~---------~ 360 (626) .++++++.+++.+| +++|+|||+.++. .....+++|++++.+ ++..... .|| + T Consensus 82 ~~~~e~~~l~~~~~~~~~~~~l~pwD~~y~~~~~~~~~~~~~~~~~~~~yf~~~~~~~~gl~~l~~~lfg~~~~~~~~~~ 161 (448) T pfam01432 82 LLHRELELLKKLKKKELGLEELQPWDVAYYSEKQREELYDPLDQEELRPYFPLEQVLEKGLFGLFERLFGITFVLEPLGE 161 (448) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf 99999999999973425966556122367766767763587777660576869999999999998765323432056604 Q ss_pred HHHHH----------HHHHHHCCCCCHHCCCCCCCCCCCCCCCCC---CCCEEECCCCC---------CHHHHHHHHHHH Q ss_conf 99999----------998642485020103799874354766677---75101036533---------246799999984 Q gi|254780915|r 361 QMSII----------AEKFFTHNWIDAPQYDGKGSGAFAHGTIPS---VHPYILLNYLG---------KPQDVMTLAHEL 418 (626) Q Consensus 361 e~~~~----------~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~---~~~~I~~N~~~---------~~~dv~TL~HE~ 418 (626) .+.+. .+.+++.+|+|++||+||++||||++++++ ++|+|+|||++ +++||.|||||| T Consensus 162 ~w~~dv~~~~v~d~~~~~~~g~~ylDl~~R~gK~~ga~~~~~~~~~~~p~~~i~~Nf~~~~~~~~~llt~~dV~TLfHE~ 241 (448) T pfam01432 162 VWHEDVRFYSVFDELSGGLIGEFYLDLYPRKGKRGGAYSFGLVPGRLDPVPYLLCNFTKPSSGKPSLLTHDDVETLFHEF 241 (448) T ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEECCCCCCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 56788877887620120400365885578775678885532678877873699951799989987757799999999998 Q ss_pred HHHHHHHHHCCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHH----------CCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5899999612568-7765780059998868889999999984----------1389989999999998620699999999 Q gi|254780915|r 419 GHGIHFVLSSETQ-GILTNNSSLTLAETASIFGETLTFDSLL----------QAASSKEERKILLANKIEDMLNSIVRQI 487 (626) Q Consensus 419 GHa~H~~ls~~~~-~~~~~~~~~~~aE~~S~~~E~~~~~~~~----------~~~~~~~~~~~ll~~~~~~~~~~~~~q~ 487 (626) |||+|+++|+..+ .+.++++++||||+|||+||+||||+.+ +++.+.++...++.++..+.+.++++|+ T Consensus 242 GHalH~~ls~~~~~~~sg~~~~~dfvE~pS~~~E~~~~d~~~l~~~a~h~~t~~~l~~~~i~~l~~~~~~~~~~~~~rq~ 321 (448) T pfam01432 242 GHSMHSLLSRTEYSYVSGTNVPIDFAEIPSQFNENWLWEPLLLNLLSRHYETGEPIPAELLEKLIKSKNVNAGLFLFRQL 321 (448) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCHHHCCHHHHHHHHCCHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 58899886136677646787782143043899999870889871535781103304099999999987641799999999 Q ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99999999972120168999899999999999986077634588788850015254106675258999999999999999 Q gi|254780915|r 488 SFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDI 567 (626) Q Consensus 488 ~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~ 567 (626) ++++||+++|...+.++. .+.++++|.+++++++|+.++.+......+|..++|+|+++||+|.|+ .++|+++|.+ T Consensus 322 ~~a~fd~~lh~~~~~~~~--~~~~~~~~~~l~~~~~~~~~~~~~~~~~~f~hi~~~~Y~a~yY~Y~ya--~~~a~~~~~~ 397 (448) T pfam01432 322 MFAAFDQEIHEAAEEDQK--LDFLLEEYAELNKKYYGDPVTPDEASPLSFSHIFPHGYAANYYSYLYA--TGLALDIFEK 397 (448) T ss_pred HHHHHHHHHHHCCCCCCC--CCHHHHHHHHHHHHHCCCCCCCCCCCCCEEHHHCCCCCCCCCHHHHHH--HHHHHHHHHH T ss_conf 999999999736567897--313999999999985799989998755552866588626786878999--9999999999 Q ss_pred HHHCC---CHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 99567---332899999998326999979999970889899889999999 Q gi|254780915|r 568 YKSNT---VDCFKEKYLNILRAGNSKHYSELLLPLNINLSDPNFWERGLQ 614 (626) Q Consensus 568 ~~~~~---~~~~~~~y~~~L~~Ggs~~~~el~k~~G~d~~~~~~~~~~~~ 614 (626) ++++| .+.+.+.|+.||+.|||++|.|+++.||++.++++++..+++ T Consensus 398 ~~~~~~~~~~~g~~~~~~iL~~Ggs~~p~ell~~f~gr~~~~~a~l~~~g 447 (448) T pfam01432 398 FFEQDPLNRETGLRYYLEFLSRGGSLDPLELLKKFGGRMPSADALLRALG 447 (448) T ss_pred HHHCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCHHHHHHHCC T ss_conf 99749987699999999984276982699999982899998589998677 No 10 >cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom Probab=100.00 E-value=0 Score=507.58 Aligned_cols=379 Identities=17% Similarity=0.197 Sum_probs=312.7 Q ss_pred CCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCC-CHHHHHHHHH Q ss_conf 752332135418887679999999999865054216999999866678865427899988976222258-9899999999 Q gi|254780915|r 221 IPLTKAYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNV-EPYVIEALMQ 299 (626) Q Consensus 221 ~~~~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~-~~~~~~~l~~ 299 (626) ..++..++|.++++|++|+++|+|+.+++.+ .|..++.+|+++|.+.|+|+||+||++++|..+|+ +++.+..++. T Consensus 20 ~~P~y~pvm~~a~~r~lR~~~y~A~~~R~~~---~N~~il~~il~lR~e~A~LLGf~s~A~~~L~~kMA~sp~~V~~fL~ 96 (472) T cd06455 20 KYPDYFPVMKYAKNAETRKAMWVAFQNRGGP---ENVPLLEELVALRHELARLLGYKSHADYVLEDRMAKSPETVRDFLE 96 (472) T ss_pred CCCCHHHHHCCCCCHHHHHHHHHHHHHHCHH---HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCHHHHHHHHH T ss_conf 8997899973399989999999999632566---4799999999999999997699969999998775699999999999 Q ss_pred HHHHCCHHHHHHHHHHH----HHHCCC--CCCCHHHHHHCC-----------CCCCCCCCCHHHHHHHHHHHHHH-HHHH Q ss_conf 87630628999999999----987088--753123444025-----------65547779989999999986666-3289 Q gi|254780915|r 300 SVKNYYPKTSHRYYELK----KKWLKL--DTMYFWDRLAPL-----------PGTSQDIIPFEVARDLVLQSYAK-FSPQ 361 (626) Q Consensus 300 ~~~~~~~~~~~~~~~~~----~~~lg~--~~l~~wD~~~~~-----------~~~~~~~~~~ee~~~~v~~~~~~-~~~e 361 (626) .+.....|.+.+..... ++..|. ..|.|||+.++. ..+.+++||.+.+.+++....+. ||.. T Consensus 97 ~L~~~~~~~a~~E~~~L~~~~~~~~~~~~~~l~pWD~~yy~ek~r~~~~~~d~~~l~~YFpl~~Vl~Glf~l~~~LfGi~ 176 (472) T cd06455 97 DLSKKLKPLAEKELAELLELKKKEVPEAGDRIYPWDLAYYMERVEEEKYDVDQEKIREYFPLEVVIEGMLDIYQRLFGLR 176 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHCCCCCHHHHHHHHHHHHHHHHCEE T ss_conf 99999899999999999999987558776866766799999888887535567774057728999999999999984845 Q ss_pred HHHHH-------------------HHHHHCCCCCHHCCCCCCCCCCCCCCCCC----------CCCEEECCCCC------ Q ss_conf 99999-------------------98642485020103799874354766677----------75101036533------ Q gi|254780915|r 362 MSIIA-------------------EKFFTHNWIDAPQYDGKGSGAFAHGTIPS----------VHPYILLNYLG------ 406 (626) Q Consensus 362 ~~~~~-------------------~~~~~~~~iD~~~r~gKr~GA~~~~~~~~----------~~~~I~~N~~~------ 406 (626) +.+.. ...++..|+|+|+|+||++||||++++++ +..+++|||+. T Consensus 177 f~~~~~~~~WH~dVr~~~V~d~~~g~~lG~fYlDl~~R~~K~~ga~~~~~~~~~~~~~g~~q~Pv~~lvcNf~~p~~~~p 256 (472) T cd06455 177 FEEVPDASVWHEDVRLYSVWDADTGEFLGYFYLDLHPREGKYGHAANFGLQPGFLLPDGSRQYPVAALVCNFPKPTADKP 256 (472) T ss_pred EEECCCCCCCCCCCEEEEEEECCCCCEEEEEEEECCCCCCCCCCCCEEEECCCEECCCCCEECCEEEEEECCCCCCCCCC T ss_conf 58778788778765699999889994789998514777777787715652354465899861676899844679999988 Q ss_pred ---CHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCHHHHHHHHHHHHHHHHH----------HHHHCCCCCHHHHHHHH Q ss_conf ---2467999999845899999612-568776578005999886888999999----------99841389989999999 Q gi|254780915|r 407 ---KPQDVMTLAHELGHGIHFVLSS-ETQGILTNNSSLTLAETASIFGETLTF----------DSLLQAASSKEERKILL 472 (626) Q Consensus 407 ---~~~dv~TL~HE~GHa~H~~ls~-~~~~~~~~~~~~~~aE~~S~~~E~~~~----------~~~~~~~~~~~~~~~ll 472 (626) ++++|+|||||||||||++|++ +++.++|++++|||||+|||+||+||| ||.+++++|+++..+|+ T Consensus 257 sLLs~~ev~TlFHEfGHaLH~lLs~~~y~~~sGt~v~~DfVE~PSqlmE~w~~~~~vL~~~a~Hy~Tgeplp~~l~~~l~ 336 (472) T cd06455 257 SLLRHDEVETFFHEFGHVIHHLLGRTKYARFSGTRVERDFVEAPSQMLENWCWEPEVLKRLSKHYKTGEKIPDELIERLI 336 (472) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHCCHHHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHH T ss_conf 86679999887756748888652376545544765661244286899985457889998531225789989699999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCC-CCCCH Q ss_conf 99862069999999999999999972120168999899999999999986077634588788850015254106-67525 Q gi|254780915|r 473 ANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIE-SSFYV 551 (626) Q Consensus 473 ~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~-~~fY~ 551 (626) ..+.++.+..+.+|+.+++||+++|..... .. +..+++.++.+++.+ ++.....++.++|+|+++ ++-.| T Consensus 337 ~a~~~~~g~~~~~ql~~a~~Dl~lH~~~~~--~~---d~~~~~~~~~~~~~~----~~~~~~~~~~~~F~Hi~~gY~AgY 407 (472) T cd06455 337 ASRHFNRGLFYLRQLFFALFDLALHTGDPA--DL---DTTKLYNDLREEISL----IPSTEGTHGPASFGHLAGGYDAGY 407 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--CC---CHHHHHHHHHHHHCC----CCCCCCCCCCCCCCCHHCCCCCCH T ss_conf 887564699999999999999998378865--59---999999999997179----999999987873320224855342 Q ss_pred HHHHHHHHHHHHHHHHHHHCC--CHHHHHHHHH-HHHCCCCCCHHHHHHHC-CCCCCCHHHHHH Q ss_conf 899999999999999999567--3328999999-98326999979999970-889899889999 Q gi|254780915|r 552 YAYAFGNCLVNSLYDIYKSNT--VDCFKEKYLN-ILRAGNSKHYSELLLPL-NINLSDPNFWER 611 (626) Q Consensus 552 y~Y~~~~~~a~~l~~~~~~~~--~~~~~~~y~~-~L~~Ggs~~~~el~k~~-G~d~~~~~~~~~ 611 (626) |+|.|++++|+++|..|+++| +.+.+++|++ ||+.|||++|.|+++.| |.|++...+++. T Consensus 408 YsYlWsevlAaD~~~~f~~~g~~~~e~G~~~r~~iL~~Ggs~~~~e~~~~F~GR~P~~~all~~ 471 (472) T cd06455 408 YGYLWSEVFAADMFSSFFKDGLLNPEVGLRYRDTVLAPGGSKDAADMLKDFLGREPNNDAFLKS 471 (472) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCHHHHHC T ss_conf 9999999999999999986399989999999999721789918999999846999981577616 No 11 >cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala. Probab=100.00 E-value=0 Score=486.58 Aligned_cols=350 Identities=17% Similarity=0.189 Sum_probs=286.4 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCC-CHHHHHHHHHHHHHCCHHHHHHHHHH----HHHHCCCCCCCHHH Q ss_conf 16999999866678865427899988976222258-98999999998763062899999999----99870887531234 Q gi|254780915|r 255 IFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNV-EPYVIEALMQSVKNYYPKTSHRYYEL----KKKWLKLDTMYFWD 329 (626) Q Consensus 255 ~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~----~~~~lg~~~l~~wD 329 (626) +|.+++.+|+++|.+.|+++||+||++++|..+|+ +++.+..++..+.....|.+.+.... +++..|.+++.||| T Consensus 1 dN~~~l~~il~lR~e~A~LLGy~syA~~~l~~kMA~spe~V~~FL~~L~~~~~~~a~~El~~L~~~~~~~~~~~~l~pWD 80 (422) T cd06456 1 DNRPLIEEILALRAEKAKLLGFENYAEYSLADKMAKSPEAVLEFLEDLAPKAKPQAKKELAELQAFAKEEGGEDELEPWD 80 (422) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH T ss_conf 91899999999999999985999799999998765899999999999999989999999999999997607988878768 Q ss_pred HHHCC-----------CCCCCCCCCHHHHHHHHHHHHHH-HHHHHHHHH------------------HHHHHCCCCCHHC Q ss_conf 44025-----------65547779989999999986666-328999999------------------9864248502010 Q gi|254780915|r 330 RLAPL-----------PGTSQDIIPFEVARDLVLQSYAK-FSPQMSIIA------------------EKFFTHNWIDAPQ 379 (626) Q Consensus 330 ~~~~~-----------~~~~~~~~~~ee~~~~v~~~~~~-~~~e~~~~~------------------~~~~~~~~iD~~~ 379 (626) +.++. ..+.+++||.+.+.+.+....+. ||..+.+.- ...++..|+|+|+ T Consensus 81 ~~Yy~~~~r~~~~~~d~~~l~~YFpl~~vl~Glf~l~~~LfGi~f~~~~~~~~Wh~dV~~~~V~d~~~~~lG~~YlDl~~ 160 (422) T cd06456 81 WAYYSEKLRKEKYDLDEEELRPYFPLEKVLDGLFELAERLYGITFKERTDLPVWHPDVRVYEVFDKDGSHIGLFYLDLYA 160 (422) T ss_pred HHHHHHHHHHHHCCCCHHHHHCCCCHHHHHHHHHHHHHHHHCEEEEECCCCCCCCCCCEEEEEECCCCCEEEEEEECCCC T ss_conf 99999998775424786776145538999999999999985746787788887888626999982899778999730457 Q ss_pred CCCCCCCCCCCCCCCC--------CCCEEECCCCC---------CHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCHHH Q ss_conf 3799874354766677--------75101036533---------2467999999845899999612-5687765780059 Q gi|254780915|r 380 YDGKGSGAFAHGTIPS--------VHPYILLNYLG---------KPQDVMTLAHELGHGIHFVLSS-ETQGILTNNSSLT 441 (626) Q Consensus 380 r~gKr~GA~~~~~~~~--------~~~~I~~N~~~---------~~~dv~TL~HE~GHa~H~~ls~-~~~~~~~~~~~~~ 441 (626) |+||++||||++++++ +..+++|||+. ++++|+|||||||||||++|++ +++.+++++++|| T Consensus 161 R~~K~~gAw~~~~~~~~~~~~~q~Pv~~lvcNf~~p~~~~psLL~~~ev~TlFHEfGH~lH~lLs~~~y~~~sGt~v~~D 240 (422) T cd06456 161 REGKRGGAWMNNLRSQSKNGLGQKPVAYLVCNFTKPAGGKPALLTHDEVTTLFHEFGHALHHLLTDVEYPSLGGTNVEWD 240 (422) T ss_pred CCCCCCCCEEEECCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHH T ss_conf 77766885033024556688976886999716789999988652699999999987488888750687453068667501 Q ss_pred HHHHHHHHHHHHHH----------HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 99886888999999----------99841389989999999998620699999999999999999721201689998999 Q gi|254780915|r 442 LAETASIFGETLTF----------DSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRI 511 (626) Q Consensus 442 ~aE~~S~~~E~~~~----------~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l 511 (626) |||+|||+||+||| ||.+++++|+++..+|+.++..+.+..+.+|+.+++||+++|+..... ..+.. T Consensus 241 fvE~PSql~E~w~~~~~vL~~~a~Hy~Tgeplp~~li~~l~~~~~~~~g~~~~~ql~~al~Dl~lH~~~~~~---~~~~~ 317 (422) T cd06456 241 FVELPSQFMENWAWEPEVLKLFAKHYETGEPLPDELIDKLLAARNFNSGFATVRQLEFALLDLALHSLTDPE---ILDVV 317 (422) T ss_pred HHHCCHHHHHHHCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC---CCCHH T ss_conf 774678999851376999973244466898186999999999987607999999999999999985689865---45369 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCC--CHHHHHHHHHHHHHHHHHHHHCC--CHHHHHHHHH-HHHC Q ss_conf 99999999986077634588788850015254106675--25899999999999999999567--3328999999-9832 Q gi|254780915|r 512 NEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSF--YVYAYAFGNCLVNSLYDIYKSNT--VDCFKEKYLN-ILRA 586 (626) Q Consensus 512 ~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~f--Y~y~Y~~~~~~a~~l~~~~~~~~--~~~~~~~y~~-~L~~ 586 (626) ..++.++.+++.+ ++.....++.++|+|+|.+|| .||+|.|++++|+++|.+|+++| +++.+++|++ ||+. T Consensus 318 ~~~~~~l~~~~~~----~~~~~~~~~~~~F~Hif~~gY~A~YYsYlWsevlAaD~~~~F~~~g~~~~e~G~~~r~~iL~~ 393 (422) T cd06456 318 QFELDALRKEGLV----IPPPPPRYFSNYFSHIFSGGYAAGYYSYKWAEVLDADAFSAFEEEGIFNRETGRRFRDTILSK 393 (422) T ss_pred HHHHHHHHHHCCC----CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 9999999985788----999999877876552015873344799999999999999999766999899999999998517 Q ss_pred CCCCCHHHHHHHC-CCCCCCHHHHHH Q ss_conf 6999979999970-889899889999 Q gi|254780915|r 587 GNSKHYSELLLPL-NINLSDPNFWER 611 (626) Q Consensus 587 Ggs~~~~el~k~~-G~d~~~~~~~~~ 611 (626) |||++|.|+++.| |.+++...|++. T Consensus 394 GGs~~~~e~~~~FlGR~P~~~alL~~ 419 (422) T cd06456 394 GGSRDPMELFRAFRGRDPSIEALLRR 419 (422) T ss_pred CCCCCHHHHHHHHCCCCCCHHHHHHH T ss_conf 89949999999856999984789986 No 12 >cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo Probab=100.00 E-value=0 Score=482.93 Aligned_cols=371 Identities=16% Similarity=0.151 Sum_probs=296.4 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCC-CHHHHHHHHHHHHHCCH Q ss_conf 35418887679999999999865054216999999866678865427899988976222258-98999999998763062 Q gi|254780915|r 228 KSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNV-EPYVIEALMQSVKNYYP 306 (626) Q Consensus 228 ~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~-~~~~~~~l~~~~~~~~~ 306 (626) .+.+++||++|+++|+|+.+ ....|.+++.+|+++|.+.|+++||+||+++++..+|+ +++.+..++..+..... T Consensus 2 vl~~~~n~~lR~~~y~a~~~----~~~~N~~~l~~il~lR~e~A~LLGf~syA~~~l~~kMa~sp~~V~~FL~~L~~~~~ 77 (458) T cd06457 2 LLASVPNESVRELVYRAFHS----SSAEQLELLEELLSSRAELAQLLGFESYAHRALRGKMAKNPENVMEFLTELSEKLR 77 (458) T ss_pred CCCCCCCHHHHHHHHHHHCC----CCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 67769899999999998677----86217999999999999999985999899999998863899999999999999989 Q ss_pred HHHHHHHHHHH----HHCC--CCCCCHHHHHHCCC-----------CCCCCCCCHHHHHHHHHHHHHH-HHHHHHHHH-- Q ss_conf 89999999999----8708--87531234440256-----------5547779989999999986666-328999999-- Q gi|254780915|r 307 KTSHRYYELKK----KWLK--LDTMYFWDRLAPLP-----------GTSQDIIPFEVARDLVLQSYAK-FSPQMSIIA-- 366 (626) Q Consensus 307 ~~~~~~~~~~~----~~lg--~~~l~~wD~~~~~~-----------~~~~~~~~~ee~~~~v~~~~~~-~~~e~~~~~-- 366 (626) |.+.+...... +..| ..+|.|||+.++.. .+.+++||.+.+.+.+....+. ||..+.+.. T Consensus 78 ~~~~~E~~~L~~~~~~~~~~~~~~l~pWD~~yy~~k~r~~~~~~d~~~l~~YFpl~~vl~Glf~i~~~Lfgi~~~~~~~~ 157 (458) T cd06457 78 PRAEEELAVLRDMKRKVLGKPLSSLAPWDRDYYTGQYRQSRFDSEPSNLSPYFSLGTVMEGLSRLFSRLYGIRLVPVPLA 157 (458) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCCHHHHHHHHHHHHHHHHCEEEEECCCC T ss_conf 99999999999999985489867678778999998888864368888883048289999999999999857456874678 Q ss_pred ------------------HHHHHCCCCCHHCCCCCCCCCCCCCCCCC--------------------CCCEEECCCCC-- Q ss_conf ------------------98642485020103799874354766677--------------------75101036533-- Q gi|254780915|r 367 ------------------EKFFTHNWIDAPQYDGKGSGAFAHGTIPS--------------------VHPYILLNYLG-- 406 (626) Q Consensus 367 ------------------~~~~~~~~iD~~~r~gKr~GA~~~~~~~~--------------------~~~~I~~N~~~-- 406 (626) +..++..|+|+|+|+||++||++++++++ +...++|||+. T Consensus 158 ~~evWh~dV~~~~V~d~~~~~lG~~YlDl~~R~gK~~~aa~~~ir~~~~~~~~~~~~~~~~~~~~q~Pv~~lvcNf~~p~ 237 (458) T cd06457 158 PGEVWHPDVRKLDVVHEDEGLLGVIYCDLFSRPGKPPGAAHFTIRCSRRLDDDDVAERGGRGGTYQLPVVALMCNFPPPS 237 (458) T ss_pred CCCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCCCCCCCEECCCCCEECCCCCCCCCCCCCCCEECCEEEEECCCCCCC T ss_conf 77676788359999935997799998525889988677721025432023741002223679855778789965688999 Q ss_pred -------CHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCHHHHHHHHHHHHHHHHH----------HHHHCCCCCHHHH Q ss_conf -------2467999999845899999612-568776578005999886888999999----------9984138998999 Q gi|254780915|r 407 -------KPQDVMTLAHELGHGIHFVLSS-ETQGILTNNSSLTLAETASIFGETLTF----------DSLLQAASSKEER 468 (626) Q Consensus 407 -------~~~dv~TL~HE~GHa~H~~ls~-~~~~~~~~~~~~~~aE~~S~~~E~~~~----------~~~~~~~~~~~~~ 468 (626) ++++|+|||||||||||++||+ +.+.++|++++|||||+|||+||+||| ||.+++++|+++. T Consensus 238 ~~~psLL~~~ev~TlFHEFGH~lH~lLs~~~y~~~sGt~v~~DfVE~PSql~E~w~~~~evL~~~a~Hy~Tgeplp~~l~ 317 (458) T cd06457 238 PSGPTLLSPHEVETLFHEMGHAMHSMLGRTEYQHVSGTRCATDFVEVPSILMEYFASDPRVLKLFARHYSTGEPLPEEML 317 (458) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHCHHHHHHHHHCCHHHHHHHHHCCCCCCCCCHHHH T ss_conf 99987366999999999872898875317775535577564046645199999985589999985121589994979999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCC Q ss_conf 99999986206999999999999999997212016899989999999999998607763458878885001525410667 Q gi|254780915|r 469 KILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESS 548 (626) Q Consensus 469 ~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~ 548 (626) .+++.++..+.+..+.+|+.+++||+++|+..+.... ...+.+.++..++.+- ... .....+.++|+|+++++ T Consensus 318 ~~l~~a~~~~~g~~~~~ql~~al~Dl~lH~~~~~~~~----~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~F~Hl~gY~ 390 (458) T cd06457 318 ARLLASKNSFAALETQQQIVYALLDQELHGEQPLSPT----FTSDVLRDSTEIFYGL--PYV-PGGTAWQLRFGHLVGYG 390 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC----CHHHHHHHHHHHHCCC--CCC-CCCCCCCCCCCCCCCCH T ss_conf 9999887662699999999999999998478966555----5699999999985488--889-99888777687631523 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCC--CHHHHHHHHH-HHHCCCCCCHHHHHHHC-CCCCCCHHHH Q ss_conf 525899999999999999999567--3328999999-98326999979999970-8898998899 Q gi|254780915|r 549 FYVYAYAFGNCLVNSLYDIYKSNT--VDCFKEKYLN-ILRAGNSKHYSELLLPL-NINLSDPNFW 609 (626) Q Consensus 549 fY~y~Y~~~~~~a~~l~~~~~~~~--~~~~~~~y~~-~L~~Ggs~~~~el~k~~-G~d~~~~~~~ 609 (626) .-||+|.|++|+|+++|..+.++| ++..+++||+ ||+.|||++|.|+++.| |.+++-.+.. T Consensus 391 AgYYsYlwsevlaad~f~~~f~~~~~~~~~G~~~R~~iL~~Ggs~~~~e~~~~FlGR~Psi~~~~ 455 (458) T cd06457 391 ATYYSYLFDRAIASKIWQKLFAADPLSREAGERLREELLKHGGGKDPWELLAGVLGKPPLVKGGA 455 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCC T ss_conf 41899999999999999999864999899999999986127898279999999659899887861 No 13 >KOG2089 consensus Probab=100.00 E-value=0 Score=463.74 Aligned_cols=517 Identities=16% Similarity=0.143 Sum_probs=380.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH-------------------HHH Q ss_conf 9899999999999999999999999986420579918999999999999999999887-------------------416 Q gi|254780915|r 83 NCHSLGAAIAEYERICELIGRIASYAMLSYNCNLSSPTIRKFYTDINAKLADFEKVLI-------------------FFA 143 (626) Q Consensus 83 ~~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~dt~d~~~~~~~~~~~~~~s~~~~~l~-------------------~~~ 143 (626) ..+++.|+++++.++...++-+..-......+. .+...++........+.++....+ -+. T Consensus 75 ee~s~en~vepla~ie~el~~~~~~L~f~~~vs-~~~~~R~as~~~~~~~~~~~~r~~~r~di~~~~~~i~e~~~~dsls 153 (718) T KOG2089 75 EEPSYENVVEPLAKIEVELTVVIGMLVFPQHVS-PDKELRKASTEADKKLDEFDLRLSMRKDIYNRFQAIYEKPAKDSLS 153 (718) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 443267787689999999876665310053259-8266666400004778998798662057899999999740125679 Q ss_pred HHHHHCCHHHHHHHHHCC--------CCHHHHHHHHHHHHHHHHHCCCHHH---HHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 988719989999998469--------0011117999999974233171679---98766300100036777777644443 Q gi|254780915|r 144 LEINTLDEALLEQSYAQD--------PLTLKYSAWIKNIRKIKKHLLSNDM---ECLLSDTSQVGREALKRFFCENIESL 212 (626) Q Consensus 144 ~e~~~l~e~~l~~~~~~~--------~~l~~y~~~~~~l~~~~~~~l~~~~---E~~l~~~~~~~~~~~~~~~~~~~~~~ 212 (626) +|-.++-+..+....... .++++.+..+.+++..-...+.+.. +.++....+..+.+.+.+........ T Consensus 154 pe~~RylE~~ike~k~nGL~L~~~kr~~ik~ikk~l~~l~~~f~~nvne~~~~~t~litd~~el~glPps~L~~la~~~~ 233 (718) T KOG2089 154 PEAQRYLEKLIKEGKLNGLHLDDDKREEIKEIKKELSELSIKFSKNVNEDTKKFTFLITDKKELEGLPPSFLQSLAKDED 233 (718) T ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECHHHHCCCCHHHHHHHHHCCC T ss_conf 99999999999999862777776689999999999999999988603202345615630189865499899999851647 Q ss_pred HH---HHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCC Q ss_conf 32---210034752332135418887679999999999865054216999999866678865427899988976222258 Q gi|254780915|r 213 RF---KINDQKIPLTKAYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNV 289 (626) Q Consensus 213 ~~---~~~~~~~~~~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~ 289 (626) .+ ++..........+.|.+|++|.+|+++|+|+..++....-+|..|+.+++++|.+.|+++||.+||+|+|+.+|+ T Consensus 234 ~k~sgpwkiTl~~p~~~pvmk~c~~r~tR~~v~~A~~~ra~~~~~eNs~ii~~l~~lR~~lAklLGY~t~Ad~~L~~kMa 313 (718) T KOG2089 234 DKGSGPWKITLKYPHYFPVMKHCKIRQTRKTVWRAYNSRASEGDLENSAIIEQLLKLRLELAKLLGYSTYADYSLAMKMA 313 (718) T ss_pred CCCCCCEEEEECCCCHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 77778527985661036998868768899999999742477555455189999998889888870840388889998760 Q ss_pred C-HHHHHHHHHHHHHCCHHHHHHH----HHHHH---HHCCC---CCCCHHHHHHCCC-----------CCCCCCCCHHHH Q ss_conf 9-8999999998763062899999----99999---87088---7531234440256-----------554777998999 Q gi|254780915|r 290 E-PYVIEALMQSVKNYYPKTSHRY----YELKK---KWLKL---DTMYFWDRLAPLP-----------GTSQDIIPFEVA 347 (626) Q Consensus 290 ~-~~~~~~l~~~~~~~~~~~~~~~----~~~~~---~~lg~---~~l~~wD~~~~~~-----------~~~~~~~~~ee~ 347 (626) . +.++..++..+.....|+..+. ..+++ +..|. +++..||..++.. .+..++||...+ T Consensus 314 ~ss~tv~~fl~dL~~kL~pl~~~e~~v~~elk~~e~k~~g~~~~~~~~~wD~~yy~~~~~e~~f~vd~~~LreyFPl~~v 393 (718) T KOG2089 314 KSSETVVEFLDDLSQKLRPLGIDERSVLLELKKGEAKDRGAPFDGKLTAWDLRYYMKRVEESKFDVDQEDLREYFPLPVV 393 (718) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCHHH T ss_conf 17188999999999985631078888999998889986399877640246789899888998638998898761883799 Q ss_pred HHHHHHHHH-HHHHHHHHHHH-------------------HHHHCCCCCHHCCCCCCCCCCCCCCCCC----------CC Q ss_conf 999998666-63289999999-------------------8642485020103799874354766677----------75 Q gi|254780915|r 348 RDLVLQSYA-KFSPQMSIIAE-------------------KFFTHNWIDAPQYDGKGSGAFAHGTIPS----------VH 397 (626) Q Consensus 348 ~~~v~~~~~-~~~~e~~~~~~-------------------~~~~~~~iD~~~r~gKr~GA~~~~~~~~----------~~ 397 (626) .+++....+ .||.++.+.-+ ..++..|+|.|||+||+|+|.|.+..++ +. T Consensus 394 ~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~y~v~D~~Sg~~vG~fY~D~y~RegK~gh~~~f~l~~~~~~~~ss~~~PV 473 (718) T KOG2089 394 LSGLFGIYQTLFGLKFEEATDAEVWHADVRVYTVKDSASGNPVGYFYLDPYPREGKYGHAAVFGLQPGCLQKDSSRRIPV 473 (718) T ss_pred HHHHHHHHHHHHCCEEEECCCCHHCCCCEEEEECCCCCCCCEEEEEEECCCCCCCCCCCHHHCCCCHHHHCCCCCCCCCH T ss_conf 98899999998484224358732201312688535788886434788426786664460020014511206687443205 Q ss_pred CEEECCCCC---------CHHHHHHHHHHHHHHHHHHHHCCC-CCCCCC-CCHHHHHHHHHHHHHHHHH----------H Q ss_conf 101036533---------246799999984589999961256-877657-8005999886888999999----------9 Q gi|254780915|r 398 PYILLNYLG---------KPQDVMTLAHELGHGIHFVLSSET-QGILTN-NSSLTLAETASIFGETLTF----------D 456 (626) Q Consensus 398 ~~I~~N~~~---------~~~dv~TL~HE~GHa~H~~ls~~~-~~~~~~-~~~~~~aE~~S~~~E~~~~----------~ 456 (626) ..++|||+. .++||+|+||||||+||++|++.. ..+.+. +++||+||+||||||+||| | T Consensus 474 aalv~nfS~p~~~kpsll~~~ev~t~FheFGh~~q~ll~Qa~~~~fsG~~~vewDave~psq~Lenwv~~~d~L~~lS~H 553 (718) T KOG2089 474 AALVCNFSKPQSDKPSLLGHDEVETLFHEFGHVLQHLLTQADFARFSGPRNVEWDAVEVPSQFLENWVWDPDTLRSLSKH 553 (718) T ss_pred HHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHCHHHHHHHHCCCCHHHHHHCCC T ss_conf 89886057755689986550799999998748999998127500015866689655556099998750584076540223 Q ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 98413899899999999986206999999999999999997212016899989999999999998607763458878885 Q gi|254780915|r 457 SLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGS 536 (626) Q Consensus 457 ~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~ 536 (626) |.++++.++++..+|+..+..+.+.++.||+.++.||+++|+..+. +..+.|+++..++.+ ++...... T Consensus 554 y~tge~l~eEl~~kl~~~r~~~~gl~tlrqL~~a~~D~~lht~~d~-------~~~~~~~~l~~~i~~----~p~~~~d~ 622 (718) T KOG2089 554 YKTGEPLPEELLKKLILTRTVNAGLFTLRQLVLADFDLELHTKTDA-------DLADTYRQLCQEISI----VPATPGDN 622 (718) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-------CHHHHHHHHCHHHEE----CCCCCCCC T ss_conf 3579946799999999999998799999999987633887604542-------368999995656510----67898888 Q ss_pred EEECCCCCCCCCC--CHHHHHHHHHHHHHHHHHHHH-CC--CH-HHHHHHHH-HHHCCCCCCHHHHHHHC-CCCCCCHHH Q ss_conf 0015254106675--258999999999999999995-67--33-28999999-98326999979999970-889899889 Q gi|254780915|r 537 FWMMVPHFIESSF--YVYAYAFGNCLVNSLYDIYKS-NT--VD-CFKEKYLN-ILRAGNSKHYSELLLPL-NINLSDPNF 608 (626) Q Consensus 537 ~w~~~~H~y~~~f--Y~y~Y~~~~~~a~~l~~~~~~-~~--~~-~~~~~y~~-~L~~Ggs~~~~el~k~~-G~d~~~~~~ 608 (626) +.++|+|||..|| -||+|.|++|+|+|+|..+.+ .| +. ..+.+||+ ||+.||+++|.|+++.| |.|++..+| T Consensus 623 ~pcsF~hifa~gy~A~yY~yLWsEv~aaDif~t~fe~~g~~N~~~~G~ryR~tiLa~GG~~~~~e~f~~FlGRePS~~Af 702 (718) T KOG2089 623 MPCSFGHIFAGGYAAGYYSYLWSEVLAADIFSTFFEQEGEDNIKEVGMRYRNTILAPGGGKDPMEVFKRFLGREPSQEAF 702 (718) T ss_pred CCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCHHHH T ss_conf 87553121047337778999999999988888775133785688887998866115789956899999973789976788 Q ss_pred HHH Q ss_conf 999 Q gi|254780915|r 609 WER 611 (626) Q Consensus 609 ~~~ 611 (626) ... T Consensus 703 l~s 705 (718) T KOG2089 703 LKS 705 (718) T ss_pred HHH T ss_conf 875 No 14 >cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig Probab=100.00 E-value=0 Score=416.29 Aligned_cols=325 Identities=21% Similarity=0.247 Sum_probs=264.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCC-CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCC----CCHHHHHH Q ss_conf 99999866678865427899988976222258-98999999998763062899999999998708875----31234440 Q gi|254780915|r 258 FITNTLAKDEEIQDRWRKYEKIADSRHLSNNV-EPYVIEALMQSVKNYYPKTSHRYYELKKKWLKLDT----MYFWDRLA 332 (626) Q Consensus 258 ~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~lg~~~----l~~wD~~~ 332 (626) +|+.+|+++|.+.|+++||+||+++.+...|+ +++.+..++..+.....|...+......+..+.+. ..+|+... T Consensus 2 ~il~elv~lR~e~A~lLGy~~~a~~~l~~~ma~tpe~v~~fl~~l~~~~~~~~~~e~~~l~~~~~~~~~~~~~~~~~~~~ 81 (365) T cd06258 2 ALLEELVSLRNQLARLLGYENFADYKLALQEAKSPETVEGFFEELKRKLRPLLAKLREEISAAKQKEEEIYGELPARYDV 81 (365) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 79999999999999984999999999987339999999999999999989999999999999974233554311134568 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCC-----CCCEEECCCCC- Q ss_conf 256554777998999999998666632899999998642485020103799874354766677-----75101036533- Q gi|254780915|r 333 PLPGTSQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPS-----VHPYILLNYLG- 406 (626) Q Consensus 333 ~~~~~~~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~-----~~~~I~~N~~~- 406 (626) . .....+.++.+.+.+.+ +.-.|+|+++|+||++||||+++.++ +.++|+|||++ T Consensus 82 ~-~~~~~~~f~~~~v~~~~------------------~~~fylDl~~R~gK~~~a~~~~~~~~~~~~~p~~~l~~N~~~~ 142 (365) T cd06258 82 D-SALLKEFFDAERPWEGA------------------LPFFYLDLYDRKGKYPHGFCTGLDPGFNRQDKDVRILANFTSP 142 (365) T ss_pred C-HHHCCCCCCHHHHHHHH------------------HHHHEECCCCCCCCCCCCEEECCCCCCCCCCCEEEEEECCCCC T ss_conf 9-76736658799999998------------------8877118888999998871602378877879806898268998 Q ss_pred --------CHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCHHHHHHHHHHHHHHHHHHH---------HHCCCCCHHHH Q ss_conf --------2467999999845899999612-56877657800599988688899999999---------84138998999 Q gi|254780915|r 407 --------KPQDVMTLAHELGHGIHFVLSS-ETQGILTNNSSLTLAETASIFGETLTFDS---------LLQAASSKEER 468 (626) Q Consensus 407 --------~~~dv~TL~HE~GHa~H~~ls~-~~~~~~~~~~~~~~aE~~S~~~E~~~~~~---------~~~~~~~~~~~ 468 (626) ++++|+|||||||||+|+++++ +.+.+.++++++||||+|||+||+|||+. ..+++.|.+.. T Consensus 143 ~~~~p~lL~~~ev~TlfHEfGHalh~ll~~~~~~~~~gt~~~~df~E~pS~l~E~~~~~~~~L~~~a~h~~~~~~~~~~~ 222 (365) T cd06258 143 AAPDPVLLGHDDINTLFHEFGHAVHFLLIQQRYPFQERTPTSTDFAEAQSMFLESFATDPEWLERYARHYQGGVVPDELI 222 (365) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHH T ss_conf 88898756699999999998788887435655665467778865122028999987258999976403102579979999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCC-C Q ss_conf 999999862069999999999999999972120168999899999999999986077634588788850015254106-6 Q gi|254780915|r 469 KILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIE-S 547 (626) Q Consensus 469 ~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~-~ 547 (626) .+++..+..+.+..+.+|+.++.||+++|+.. .+..+++++|.++.+++.|.. .......|++|+|+++ + T Consensus 223 ~~~~~~~~~~~~~~~~~~l~~~~~d~~lh~~~-----~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~f~Hi~~gy 293 (365) T cd06258 223 EKLIAARLPNTLYETRRILVVAKFEKALYENP-----DRELELQKLWRDLVKEILGVR----PDPSTPDPAAFPHLAGGS 293 (365) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----CCCCCHHHHHHHHHHHHCCCC----CCCCCCCCCCCCCCCCCC T ss_conf 99999984558987899999999999975388-----642017999999988627778----999887766556013788 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHCC-----CHHHHHHHHH-HHHCCCCCCHHHHHHHC-CCCCCCHHHHH Q ss_conf 7525899999999999999999567-----3328999999-98326999979999970-88989988999 Q gi|254780915|r 548 SFYVYAYAFGNCLVNSLYDIYKSNT-----VDCFKEKYLN-ILRAGNSKHYSELLLPL-NINLSDPNFWE 610 (626) Q Consensus 548 ~fY~y~Y~~~~~~a~~l~~~~~~~~-----~~~~~~~y~~-~L~~Ggs~~~~el~k~~-G~d~~~~~~~~ 610 (626) +.|||+|+||+|+|+++|..+++.+ ..+.+++|++ ||+.|||++|.|+++.| |.+++...|++ T Consensus 294 ~a~YysY~~a~vla~d~~~~f~~~~~~~~~~~~~G~~~r~~il~~G~s~~~~el~~~f~Gr~p~~~a~l~ 363 (365) T cd06258 294 PAYYYGYLLAEMLASQLRATFKKKVGYLTDNPEAGPRLREHILRPGNSEPWKELLKRATGEDPNADAFLD 363 (365) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCHHHHHH T ss_conf 8415899999999999999999846886468999999999986363899999999997785998078874 No 15 >KOG2090 consensus Probab=100.00 E-value=0 Score=383.14 Aligned_cols=553 Identities=14% Similarity=0.152 Sum_probs=364.3 Q ss_pred HHHCCCCC--CHH-HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 57278889--278-999999999999999999742331023349989999999999999999999999998642057991 Q gi|254780915|r 42 EDLYPSHD--SQE-ISNDMECIEHESLAFKTRWEGNLAHATNQKNCHSLGAAIAEYERICELIGRIASYAMLSYNCNLSS 118 (626) Q Consensus 42 ~~ly~~~~--~~e-~~~dl~~~~~~i~~~~~~~~~~l~~~~~~~~~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~dt~d 118 (626) +-||.++. +++ |..--+...+...++.....+ ...++. +|..+++|...+.+|...|.+...++. | T Consensus 42 tGLFgn~~L~t~eGF~~l~~~a~~~t~eLi~~~~~------~~~gp~----tI~~~DeiSdtlC~VaDLaEfvR~aHP-d 110 (704) T KOG2090 42 TGLFGNPELSTAEGFNRLPEAALEKTQELIDELLS------TPSGPR----TIQIFDEISDTLCRVADLAEFVRQAHP-D 110 (704) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC------CCCCCH----HHHHHHHHHHHHHHHHHHHHHHHHHCC-C T ss_conf 54657865577468999899999999999999746------999734----889898775778889789999987189-8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH--HHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 8999999999999999999887416988719989999--99846900111179999999742331716799876630010 Q gi|254780915|r 119 PTIRKFYTDINAKLADFEKVLIFFALEINTLDEALLE--QSYAQDPLTLKYSAWIKNIRKIKKHLLSNDMECLLSDTSQV 196 (626) Q Consensus 119 ~~~~~~~~~~~~~~s~~~~~l~~~~~e~~~l~e~~l~--~~~~~~~~l~~y~~~~~~l~~~~~~~l~~~~E~~l~~~~~~ 196 (626) +++....+++...+-++-+.+.-......+|....-+ .+...+++-..-+.++.+.-+..-|.-+++-++.+...+.. T Consensus 111 ~~fv~aAe~a~~~~~e~ve~LNTn~~LY~~Lk~~l~~~~~l~~~d~e~~v~~lll~DFE~sGIhL~~~kr~kfv~Ls~eI 190 (704) T KOG2090 111 PEFVEAAEEACRSMFELVESLNTNVALYQKLKKVLQDSSRLDDLDPETYVARLLLDDFEKSGIHLDPEKREKFVQLSSEI 190 (704) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 89999999999999999998525778999999986286546655788899999998898606678989999999873999 Q ss_pred CCCHHHHHHHHH-------------H--------HHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 003677777764-------------4--------4433221003475233213541888767999999999986505421 Q gi|254780915|r 197 GREALKRFFCEN-------------I--------ESLRFKINDQKIPLTKAYKSFFDSDREVRKSAAKALSHTFNKSSHI 255 (626) Q Consensus 197 ~~~~~~~~~~~~-------------~--------~~~~~~~~~~~~~~~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~ 255 (626) ......-..... . .....++.--..+...+.....++|+.+||.+|..+...-..+ T Consensus 191 ~~lg~~F~~n~~~fP~~~l~~s~~~l~~~~~~~~~~k~knF~~~t~~~~i~~ll~~~~d~~vRk~vY~~~~~~~~~q--- 267 (704) T KOG2090 191 FDLGREFQNNTDRFPDNKLPNSLERLPFSKKNFIESKRKNFDQATDPYIIYGLLSSSEDESVRKMVYNTFHSPSDIQ--- 267 (704) T ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHCCCCCHHHHHHHHCCCCCCCCCEEEEHHHHCCCCHHHHHHHHHHHCCCCHHH--- T ss_conf 99999997235447700011356642576320488875288767887332125322798899999999744796778--- Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCC-HHHHHHHHHHHHHCCHHHHHHHHHH----HHHHCC--CCCCCHH Q ss_conf 69999998666788654278999889762222589-8999999998763062899999999----998708--8753123 Q gi|254780915|r 256 FSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNVE-PYVIEALMQSVKNYYPKTSHRYYEL----KKKWLK--LDTMYFW 328 (626) Q Consensus 256 ~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~----~~~~lg--~~~l~~w 328 (626) -..|+.++..|++.|++.|+.|||++.+.+++++ +++|..+++.+.....+.+.+.+.. +++.+| -.++.|| T Consensus 268 -~~~L~~Li~sRheLa~l~G~~SfA~~al~~~la~~pk~V~~Fl~~Ls~k~~~~~~kel~~i~~mkkk~~~~~~~e~~pW 346 (704) T KOG2090 268 -VKLLEHLISSRHELAKLVGKSSFAHRALEGKLAKNPKTVRSFLEELSEKLSEKTDKELAVIRDMKKKENNNNNAEIEPW 346 (704) T ss_pred -HHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf -9999999997999998848544889986200027807899999999986227789999999998750248998756666 Q ss_pred HHHHCCC-----------CCCCCCCCHHHHHHHHHHHHHHH-HH---------------HHHHHH-----HHHHHCCCCC Q ss_conf 4440256-----------55477799899999999866663-28---------------999999-----9864248502 Q gi|254780915|r 329 DRLAPLP-----------GTSQDIIPFEVARDLVLQSYAKF-SP---------------QMSIIA-----EKFFTHNWID 376 (626) Q Consensus 329 D~~~~~~-----------~~~~~~~~~ee~~~~v~~~~~~~-~~---------------e~~~~~-----~~~~~~~~iD 376 (626) |..++.. ....+.|+...+.++.-..++.+ |. +..+.. +..++--|+| T Consensus 347 D~~YYT~~~r~~~~~~~~~i~~~fFslg~~ieGLs~L~~~LyGirl~~~~l~pGE~WhpdV~KL~vv~E~eg~lG~IY~D 426 (704) T KOG2090 347 DRPYYTSMYRQSNNSLNPSIYSPFFSLGSCIEGLSTLFQRLYGIRLIPEPLAPGEVWHPDVRKLNVVHEQEGLLGYIYCD 426 (704) T ss_pred CCHHHHHHHHCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHCEEEECCCCCCCCCCCCHHHHHEEECCCCCCEEEEEEE T ss_conf 50656667650357998440103441788998899999997571500058998654682014430134777804699987 Q ss_pred HHCCCCCCCCCCCCCCCCCC----------CCEEECCCCC---------CHHHHHHHHHHHHHHHHHHHHC-CCCCCCCC Q ss_conf 01037998743547666777----------5101036533---------2467999999845899999612-56877657 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSV----------HPYILLNYLG---------KPQDVMTLAHELGHGIHFVLSS-ETQGILTN 436 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~----------~~~I~~N~~~---------~~~dv~TL~HE~GHa~H~~ls~-~~~~~~~~ 436 (626) +|.|+||..|+..+.++.++ ...++|||.- ++.+|+|||||||||||+||++ ++|.+.|+ T Consensus 427 lf~R~gK~~g~aHFTIr~sr~l~Dg~yQlPVivL~cnf~rss~ss~t~L~~~~vetLFHEmGHAMHSmLgRT~yQhvtGT 506 (704) T KOG2090 427 LFERPGKTVGDAHFTIRGSRQLSDGTYQLPVIVLVCNFVRSSQSSPTFLSLSEVETLFHEMGHAMHSMLGRTHYQHVTGT 506 (704) T ss_pred EECCCCCCCCCCEEEEECCCCCCCCCEECEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 21247999988038863363578987513056776356410468973267999999999987999998532300033576 Q ss_pred CCHHHHHHHHHHHHHHHHHH----------HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 80059998868889999999----------98413899899999999986206999999999999999997212016899 Q gi|254780915|r 437 NSSLTLAETASIFGETLTFD----------SLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDI 506 (626) Q Consensus 437 ~~~~~~aE~~S~~~E~~~~~----------~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~ 506 (626) +++.||||+||++||.+.+| |-+++++|.++...+...+......-+.+|+.+++.|+.+|..+. +. T Consensus 507 RC~tDFaE~PSiLMEyFa~D~rvl~~faRhy~Tge~lp~~m~~~l~~s~n~faa~etq~Qv~ya~lDQ~~Hg~~~---p~ 583 (704) T KOG2090 507 RCPTDFAEIPSILMEYFANDYRVLRFFARHYSTGEPLPEDMVNRLCESRNSFAAQETQRQVFYALLDQEFHGIAC---PL 583 (704) T ss_pred CCCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---CC T ss_conf 466206651299999984377899999986289987629999999865410136899999999999998626334---56 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC--CHHHHHHHHH-H Q ss_conf 989999999999998607763458878885001525410667525899999999999999999567--3328999999-9 Q gi|254780915|r 507 PTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKSNT--VDCFKEKYLN-I 583 (626) Q Consensus 507 t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~~~~~~--~~~~~~~y~~-~ 583 (626) ...+..++..++.++++| +.. .+...+..+|+|++++|.-||+|.++.++|.-+|++...++ ...++++|++ | T Consensus 584 ~~~~~td~~~~v~rk~~~--~~~--vp~taW~lRFsHLvgYGA~YYSYL~~r~~AS~IWq~~Fe~dPfsR~aGek~r~ei 659 (704) T KOG2090 584 IAEDTTDLLSEVKRKFSG--LLY--VPPTAWQLRFSHLVGYGATYYSYLFARAIASLIWQQLFENDPFSRKAGEKFRKEI 659 (704) T ss_pred CCCCHHHHHHHHHHHCCC--CCC--CCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 554568999999986068--766--7983342102556346513679999999999999999815963224468999999 Q ss_pred HHCCCCCCHHHHHHH-CCCCCCCHHHHHHHHHHHH Q ss_conf 832699997999997-0889899889999999999 Q gi|254780915|r 584 LRAGNSKHYSELLLP-LNINLSDPNFWERGLQTVE 617 (626) Q Consensus 584 L~~Ggs~~~~el~k~-~G~d~~~~~~~~~~~~~~~ 617 (626) |+.||.++|.+++.. +|..++-.+..+ ++..+. T Consensus 660 l~hGG~~~P~~lva~~L~~~~~~~g~~~-Al~~i~ 693 (704) T KOG2090 660 LKHGGGRDPAELVADILGKPPLENGGVD-ALSKID 693 (704) T ss_pred HHHCCCCCHHHHHHHHHCCCCCCCCHHH-HHHHHH T ss_conf 9845999869999999669877551288-898873 No 16 >pfam01401 Peptidase_M2 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid peptidase domain, both of these are catalytically active. Most members are secreted membrane bound ectoenzymes. Probab=99.92 E-value=8.8e-18 Score=144.87 Aligned_cols=506 Identities=11% Similarity=0.061 Sum_probs=295.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHCCHHHHHHHH Q ss_conf 499899999999999999999999999986420579918999999999999999999887416988--719989999998 Q gi|254780915|r 81 QKNCHSLGAAIAEYERICELIGRIASYAMLSYNCNLSSPTIRKFYTDINAKLADFEKVLIFFALEI--NTLDEALLEQSY 158 (626) Q Consensus 81 ~~~~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~dt~d~~~~~~~~~~~~~~s~~~~~l~~~~~e~--~~l~e~~l~~~~ 158 (626) +.+-.....+|..+++-...+....+-|.-.+++|.++. .+.+..+.+...+++....+.-...+ ..+++..+ T Consensus 10 ~~~E~~A~~Fl~~~~~~~~~~~~~~~~A~W~y~TnIT~~-n~~~~~~~~~~~~~~~~~~~~~a~~f~~~~~~d~~~---- 84 (595) T pfam01401 10 VTDEAGASLFVEEYDRSAQVVLNEYAEANWAYNTNITTE-NSKILLEKALQSQNHTLKYGTKAKEFDVSNWQNFTL---- 84 (595) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHEEEEHHHCCCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH---- T ss_conf 879999999999999999999767656531675058889-999999999999999999999997476656899879---- Q ss_pred HCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHH--HHHHHHHCCCHH Q ss_conf 469001111799999997423317167998766300100036777777644443322100347523--321354188876 Q gi|254780915|r 159 AQDPLTLKYSAWIKNIRKIKKHLLSNDMECLLSDTSQVGREALKRFFCENIESLRFKINDQKIPLT--KAYKSFFDSDRE 236 (626) Q Consensus 159 ~~~~~l~~y~~~~~~l~~~~~~~l~~~~E~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l~~~dr~ 236 (626) ++.+..+....+..+++..-+ .+....+....++.....- ...+.-..+. -...+..+.|.+ T Consensus 85 ---------rR~l~~l~~~g~a~l~~~~~~---el~~l~s~M~~iY~~~kvC----~~~~~cl~LePdl~~Ima~SrD~d 148 (595) T pfam01401 85 ---------KRIIKKVQTLGRANLPLAELE---EYNQILLDMETIYSTAKVC----FPNGTCWSLEPDLTNIMATSRKYA 148 (595) T ss_pred ---------HHHHHHHHHCCCCCCCHHHHH---HHHHHHHHHHHHHCCCEEC----CCCCCHHHHHHHHHHHHHCCCCHH T ss_conf ---------999998864387789999999---9999999999984792430----899834312303999987069999 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHH---HHHH Q ss_conf 7999999999986505421699999986667886542789998897622225898999999998763062899---9999 Q gi|254780915|r 237 VRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNYYPKTS---HRYY 313 (626) Q Consensus 237 ~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~---~~~~ 313 (626) -..-+|.++..+ ....+...+...|.+.++-|++.||.+..++=...-. .+.+...++.+.....|+. |-|. T Consensus 149 EL~~~W~~Wr~~---vg~~~r~~y~~~V~L~N~~A~~nGf~d~g~~Wr~~Ye--~~~f~~~~e~lw~qvkPLY~~LHayV 223 (595) T pfam01401 149 ELLWAWEGWRDK---VGRAILPLYPKYVELSNEAARLNGYTDAGDSWRSWYE--SPTLEQDLERLYQELQPLYLNLHAYV 223 (595) T ss_pred HHHHHHHHHHHH---HCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999997---1878889999999999999998499988998777631--14399999999999889999999999 Q ss_pred HHHH-HHCCCCCC----------------CHHH----HHHCCCCCCCCCC---------CHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999-87088753----------------1234----4402565547779---------989999999986666328999 Q gi|254780915|r 314 ELKK-KWLKLDTM----------------YFWD----RLAPLPGTSQDII---------PFEVARDLVLQSYAKFSPQMS 363 (626) Q Consensus 314 ~~~~-~~lg~~~l----------------~~wD----~~~~~~~~~~~~~---------~~ee~~~~v~~~~~~~~~e~~ 363 (626) +.+. +.-|.+.+ +.|- ...|.+......+ +-.+..+...+-|.++|.. T Consensus 224 R~kL~~~YG~~~v~~~GpIPAHLLGnMWaQ~W~niyd~~~P~p~~~~~Dvt~~m~~q~yt~~~mf~~ae~FF~SlGl~-- 301 (595) T pfam01401 224 RRALHRHYGDQYINLRGPIPAHLLGNMWAQTWSNIYDLVVPFPSAPNLDVTEAMIKQGWTPRRMFREADEFFTSLGLL-- 301 (595) T ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHCCCC-- T ss_conf 999998666212699998556765107776777630114689999988813999986998999999999999967998-- Q ss_pred HHHHHHHHCCCCCHHCCCC--CCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC-CCHH Q ss_conf 9999864248502010379--987435476667775101036533246799999984589999961256877657-8005 Q gi|254780915|r 364 IIAEKFFTHNWIDAPQYDG--KGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSSETQGILTN-NSSL 440 (626) Q Consensus 364 ~~~~~~~~~~~iD~~~r~g--Kr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~~~~~~~~~-~~~~ 440 (626) .+-+.++++. +=.-|++| -..+|..+.+......+|.|.-..+++|..|..||+|| ++.++..++||+... +..- T Consensus 302 ~mp~~FW~~S-~~~kp~dgR~v~ChaSAwdf~~~~D~RIkmCt~v~~~df~t~HHEmGH-i~Y~~~Y~~qP~~fr~gANp 379 (595) T pfam01401 302 PVPPEFWNKS-MLEKPTDGREVVCHASAWDFYNGKDFRIKQCTTVNMEDLVTVHHEMGH-IQYFMQYKDLPVSLREGANP 379 (595) T ss_pred CCCHHHHHHH-HCCCCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHH-HHHHHHHHCCCHHHHCCCCC T ss_conf 7997898553-003688888787164067436898725885586789999999999999-99999983699776279980 Q ss_pred HHHHHHHHHHHHHH--HHH-----HHCCCC-C-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 99988688899999--999-----841389-9-89999999998620699999999999999999721201689998999 Q gi|254780915|r 441 TLAETASIFGETLT--FDS-----LLQAAS-S-KEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRI 511 (626) Q Consensus 441 ~~aE~~S~~~E~~~--~~~-----~~~~~~-~-~~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l 511 (626) -|.|.-.-..-+=+ =.| ++.... + ......|+...+..+. ++.-..+.+.+-..+.+ |.+++++. T Consensus 380 gFhEAiGd~iaLSv~tP~hL~~igLl~~~~~~~e~~In~L~~~AL~kia-fLPF~~~~D~wRw~VF~-----G~i~~~~y 453 (595) T pfam01401 380 GFHEAIGDVLALSVSTPKHLHSINLLSSEGNDYESDINFLMKMALDKIA-FIPFSYLVDQWRWRVFD-----GSITKENY 453 (595) T ss_pred HHHHHHHHHHHHHCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHC-----CCCCHHHH T ss_conf 7899999999997278899988488657677711189999999865510-17499999999999738-----98987899 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC-------------CCHHHHH Q ss_conf 999999999860776345887888500152541066752589999999999999999956-------------7332899 Q gi|254780915|r 512 NEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKSN-------------TVDCFKE 578 (626) Q Consensus 512 ~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~~~~~-------------~~~~~~~ 578 (626) |.-|+++..+|.|-..++...+.......+-|+= +..=|+.|-+|.++-.+++....+. |.+++++ T Consensus 454 N~~wW~Lr~kyqGv~pPv~R~e~dFDpgaKyHi~-~n~pY~rYFls~IlQFQf~~alC~~ag~~gPLh~Cdiy~sk~AG~ 532 (595) T pfam01401 454 NQEWWSLRLKYQGLCPPVPRNQGDFDPGAKFHIP-SVVPYIRYFVSFIIQFQFHEALCKAAGHTGPLHQCDIYQSKEAGA 532 (595) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHH T ss_conf 9999988987078369977882337944432467-885789999999999999999999738989974378869899999 Q ss_pred HHHHHHHCCCCCCHHHHHHHC-CCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 999998326999979999970-889899889999999999999998 Q gi|254780915|r 579 KYLNILRAGNSKHYSELLLPL-NINLSDPNFWERGLQTVEKMIDDV 623 (626) Q Consensus 579 ~y~~~L~~Ggs~~~~el~k~~-G~d~~~~~~~~~~~~~~~~~i~~~ 623 (626) +.+++|+.|.|++..|+++.+ |-.=-+.+.+.+=++.+..|+++- T Consensus 533 ~L~~ml~lGsSkpW~eale~~TG~~~mda~alleYF~PL~~WL~~q 578 (595) T pfam01401 533 KLADAMKLGYSRPWPEAMKDITGQPNMSASAMLNYFKPLTQWLQTE 578 (595) T ss_pred HHHHHHHCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9999997569988699999972998668699999999999999999 No 17 >cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h Probab=99.85 E-value=4.7e-16 Score=132.54 Aligned_cols=380 Identities=14% Similarity=0.078 Sum_probs=238.6 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHC Q ss_conf 32135418887679999999999865054216999999866678865427899988976222258989999999987630 Q gi|254780915|r 225 KAYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNY 304 (626) Q Consensus 225 ~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~~~~~~~~l~~~~~~~ 304 (626) -...+..+.|.+...-+|+++..+- ...+...+...|.+.++-|++.||.+..++=...-..+ .+...++.+... T Consensus 36 l~~Ima~Srd~~EL~~~W~~Wr~~v---G~~~r~~y~~~VeL~N~aA~~nGf~d~g~~Wr~~Ye~~--~f~~~~d~lw~q 110 (477) T cd06461 36 LESIMAKSRDYDELLYAWEGWRDAV---GPPMRPLYERYVELANEAAKLNGFVDAGEYWRSEYETP--DFEQDVERLWEE 110 (477) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHH---CHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCH--HHHHHHHHHHHH T ss_conf 9999886599999999999999873---83568999999999999999859998899988743636--799999999999 Q ss_pred CHHHHH---HHHHHHH-HHCCCCCC----------------CHHH----HHHCCCCCCCCCC---------CHHHHHHHH Q ss_conf 628999---9999999-87088753----------------1234----4402565547779---------989999999 Q gi|254780915|r 305 YPKTSH---RYYELKK-KWLKLDTM----------------YFWD----RLAPLPGTSQDII---------PFEVARDLV 351 (626) Q Consensus 305 ~~~~~~---~~~~~~~-~~lg~~~l----------------~~wD----~~~~~~~~~~~~~---------~~ee~~~~v 351 (626) ..|+.. -|.+.+. +.-|.+.+ +.|. .-.|.+......+ +-.+..+.. T Consensus 111 ikPLY~~LHayVR~kL~~~YG~~~v~~~GpIPAHLLGnmWaQ~W~ni~dl~~P~p~~~~~Dvt~~m~~q~~t~~~mf~~a 190 (477) T cd06461 111 LKPLYLQLHAYVRRKLRKKYGDDVVNRDGPIPAHLLGNMWAQSWSNIYDLVKPYPGKPLLDVTDAMVEQGYTAKRMFKEA 190 (477) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCHHHHHHHH T ss_conf 88999999999999999873754478999856777521665436654201357999888881599998599899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHCCC---CCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 986666328999999986424850201037---99874354766677751010365332467999999845899999612 Q gi|254780915|r 352 LQSYAKFSPQMSIIAEKFFTHNWIDAPQYD---GKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 352 ~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~---gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~ 428 (626) .+-|..+|.. .+-..++++. .+.+| +...+|..+.+....+.+|.|.-..+++|..|..||+|| ++.++.. T Consensus 191 e~FF~SLGl~--~mp~~FW~~S---~~~kP~dr~v~ChaSAwdf~~~~D~RIkmCt~v~~~df~t~HHEmgH-i~Y~l~Y 264 (477) T cd06461 191 EEFFTSLGLP--PMPPSFWTKS---MLEKPTDREVVCHASAWDFYNGKDFRIKMCTKVNMEDFVTVHHEMGH-IQYYLQY 264 (477) T ss_pred HHHHHHCCCC--CCCHHHHHHC---CCCCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHH-HHHHHHH T ss_conf 9999967997--6997787000---23478998875446353135898725885387688999999999999-9999997 Q ss_pred CCCCCCC-CCCHHHHHHHHHHHHH-------HHHHHHHHCCCC-CH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5687765-7800599988688899-------999999841389-98-999999999862069999999999999999972 Q gi|254780915|r 429 ETQGILT-NNSSLTLAETASIFGE-------TLTFDSLLQAAS-SK-EERKILLANKIEDMLNSIVRQISFYDFELKLHT 498 (626) Q Consensus 429 ~~~~~~~-~~~~~~~aE~~S~~~E-------~~~~~~~~~~~~-~~-~~~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~ 498 (626) ++||+.. .+..--|.|.-.-..- +|-.=-++.+.. +. .....|+...++.+. ++.-..+.+.+-..+.+ T Consensus 265 ~~qP~~fr~ganpgFhEAiGd~iaLSv~tP~hL~~igLl~~~~~~~~~~In~L~~~AL~kia-fLPF~~~~DkwRw~Vf~ 343 (477) T cd06461 265 KDQPVLFREGANPGFHEAVGDAIALSVSTPKHLHKIGLLDSEVDDEEADINFLLKMALDKIA-FLPFGYLLDKWRWDVFD 343 (477) T ss_pred HCCCHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHCC T ss_conf 54983320689841889999999997278898988388666678811489999999987740-08599999999987118 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC------- Q ss_conf 1201689998999999999999860776345887888500152541066752589999999999999999956------- Q gi|254780915|r 499 ERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKSN------- 571 (626) Q Consensus 499 ~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~~~~~------- 571 (626) |.+++++.|.-|+++..+|.|-..+++..+.......+-|+= ++.-|+.|-+|.++-.+++..+.+. T Consensus 344 -----g~i~~~~~N~~wW~Lr~kyqGi~pPv~R~e~dFDpgaKyHi~-~n~pYirYFls~IlqFQf~~alC~~ag~~~PL 417 (477) T cd06461 344 -----GEIPKDDYNKAWWELREKYQGVVPPVPRSEEDFDPGAKYHVP-ANTPYIRYFLSFILQFQFHKALCKAAGHTGPL 417 (477) T ss_pred -----CCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf -----989878889999999996188689988870127955000136-88669999999999999999999972888986 Q ss_pred ------CCHHHHHHHHHHHHCCCCCCHHHHHHH-CCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf ------733289999999832699997999997-088989988999999999999999 Q gi|254780915|r 572 ------TVDCFKEKYLNILRAGNSKHYSELLLP-LNINLSDPNFWERGLQTVEKMIDD 622 (626) Q Consensus 572 ------~~~~~~~~y~~~L~~Ggs~~~~el~k~-~G~d~~~~~~~~~~~~~~~~~i~~ 622 (626) |.++++++.+++|+.|.|++..|+++. .|-.==+..++.+=++.+.+|+++ T Consensus 418 h~Cdiy~sk~AG~~L~~ml~lGsS~pW~eale~~tG~~~m~a~alleYF~PL~~WL~~ 475 (477) T cd06461 418 HKCDIYGSKEAGKKLRAMLSLGSSKPWPEALEALTGEREMDASALLEYFQPLIDWLKE 475 (477) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 2146758788899999998577999759999998098865849999999999999985 No 18 >pfam02074 Peptidase_M32 Carboxypeptidase Taq (M32) metallopeptidase. Probab=99.82 E-value=5.5e-14 Score=117.77 Aligned_cols=336 Identities=16% Similarity=0.170 Sum_probs=217.2 Q ss_pred HHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHHHCCCC-CHHHHHHH--HCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 679999999999865054-216999999866678865427899-98897622--22589899999999876306289999 Q gi|254780915|r 236 EVRKSAAKALSHTFNKSS-HIFSFITNTLAKDEEIQDRWRKYE-KIADSRHL--SNNVEPYVIEALMQSVKNYYPKTSHR 311 (626) Q Consensus 236 ~~Rk~a~~a~~~~~~~~~-~~~a~il~~iv~~r~e~A~llGy~-s~~~~~l~--~~~~~~~~~~~l~~~~~~~~~~~~~~ 311 (626) ...-.+..+|.++-+++. ..+++.|.+++.+..+.|..+||+ +..|.-|. ...++.+.++.++..++...+|+.++ T Consensus 108 ~~~s~a~~~W~~AR~~nDf~~F~P~Leklv~L~re~A~~~g~~~~pYDaLLd~YEpG~t~~~ld~~F~~Lk~~L~pLi~~ 187 (494) T pfam02074 108 ETTSKAETAWEEAKAKDDFSKFEPYLDKLISLAKRAAEYLGYEEEPYDALLDLYEPGLRTRDLVKLFDELEKELRPLLEK 187 (494) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 99957899999988747899999999999999999999708899828998743489998999999999999989999999 Q ss_pred HHHHHHHHCCCCCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCC Q ss_conf 99999987088753123444025655477799899999999866663289999999864248502010379987435476 Q gi|254780915|r 312 YYELKKKWLKLDTMYFWDRLAPLPGTSQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHG 391 (626) Q Consensus 312 ~~~~~~~~lg~~~l~~wD~~~~~~~~~~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~ 391 (626) .....++ |........++.+.-..+.. ++.+.++--++.+++|.-+ |.||.+ T Consensus 188 i~~~~~~--------------~~~~~l~~~~~~~~Q~~l~~--------~v~~~~GfDf~~GRlD~S~------HPF~~~ 239 (494) T pfam02074 188 ILESGKV--------------PDEHPLEKKYPKEWQEKVNL--------WVLQKFGYDLGTGRLDVTA------HPFTTE 239 (494) T ss_pred HHHCCCC--------------CCCCCCCCCCCHHHHHHHHH--------HHHHHHCCCCCCEEEECCC------CCCCCC T ss_conf 9855658--------------98654579999999999999--------9999929587770573478------987678 Q ss_pred CCCCCCCEEECCCCC--CHHHHHHHHHHHHHHHHHHHHC---CCCCCCCCCCHHHHHHHHHHHHHHHHH------HHHH- Q ss_conf 667775101036533--2467999999845899999612---568776578005999886888999999------9984- Q gi|254780915|r 392 TIPSVHPYILLNYLG--KPQDVMTLAHELGHGIHFVLSS---ETQGILTNNSSLTLAETASIFGETLTF------DSLL- 459 (626) Q Consensus 392 ~~~~~~~~I~~N~~~--~~~dv~TL~HE~GHa~H~~ls~---~~~~~~~~~~~~~~aE~~S~~~E~~~~------~~~~- 459 (626) +.+ ...+|.-+|.. -...+.-..||.|||+.-.--. ..+ +.+...++-+.|-.|-|+|+.+. .++. T Consensus 240 ~~~-~DvRiTTry~e~d~~~~l~~~iHE~GHalYEq~l~~~~~~t-plg~~~smgiHESQSr~~En~vgrS~~F~~~~~p 317 (494) T pfam02074 240 FGI-GDVRITTRYEEFDFRRAIFGTIHEFGHALYEQQQDEAFMGT-PIAGGASLGIHESQSRFWENIIGRSKEFWEFIYP 317 (494) T ss_pred CCC-CCEEEEHHCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHCCC-CCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 897-73333310150346789999998762889985379877579-5656676564878999999998058999999999 Q ss_pred ---------CCCCCHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf ---------1389989999999998------6206999999999999999997212016899989999999999998607 Q gi|254780915|r 460 ---------QAASSKEERKILLANK------IEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLG 524 (626) Q Consensus 460 ---------~~~~~~~~~~~ll~~~------~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g 524 (626) +....++....+-..+ -.+-...-+..+.+.+.|.++- +|.+++++|-++|.+..++|.| T Consensus 318 ~l~~~f~~~~~~s~e~~y~~~n~V~ps~IRveADEvTY~lHIilRyeiEk~li-----~g~l~v~dlP~~Wn~~~~~yLG 392 (494) T pfam02074 318 VLKENLPFMEDYTLEDFYLAFNMVRPSFIRVEADEVTYNLHILLRFELERAMF-----SEEVKAEDLPEMWNDKYERYLG 392 (494) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHHHH-----CCCCCHHHHHHHHHHHHHHHCC T ss_conf 99997687667999999999710487501102033010689999999999998-----5998742328999999999849 Q ss_pred CCCCCCCCCCCCEEECCCCCCCCCCCHHH-HHHHHHHHHHHHHHHHHCCCH--H------H--HHHHH--HHHHCCCCCC Q ss_conf 76345887888500152541066752589-999999999999999956733--2------8--99999--9983269999 Q gi|254780915|r 525 PAFDLSDLEYGSFWMMVPHFIESSFYVYA-YAFGNCLVNSLYDIYKSNTVD--C------F--KEKYL--NILRAGNSKH 591 (626) Q Consensus 525 ~~~~~~~~~~~~~w~~~~H~y~~~fY~y~-Y~~~~~~a~~l~~~~~~~~~~--~------~--~~~y~--~~L~~Ggs~~ 591 (626) -... ....| ..+=-|-..+.|=|++ |+.|.+.|+++|...+++-++ . | .-.++ +|-+.|++.+ T Consensus 393 i~p~--~d~~G--~LQDvHWs~G~fGYFPtY~LG~~~AaQl~~~~~~~~p~~~~~i~~G~~~~i~~WL~e~Ih~~G~~~~ 468 (494) T pfam02074 393 IRPK--TYREG--ILQDVHWAGGSFGYFPTYSLGTIYAAQLYYKIKEDLPDFEELVARGEFEPIKSWLREKIHRWGSRYP 468 (494) T ss_pred CCCC--CCCCC--CCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 9999--98677--5352445788615356679999999999999998676798898748959999999999997325899 Q ss_pred HHHHHHH-CCCCCCCHHHHHH Q ss_conf 7999997-0889899889999 Q gi|254780915|r 592 YSELLLP-LNINLSDPNFWER 611 (626) Q Consensus 592 ~~el~k~-~G~d~~~~~~~~~ 611 (626) |.||++. .|-.++ |+++-+ T Consensus 469 ~~eLi~~~TGe~l~-~~~~~~ 488 (494) T pfam02074 469 PKELLKKATGEDVN-AEYFVR 488 (494) T ss_pred HHHHHHHHHCCCCC-HHHHHH T ss_conf 89999998689998-799999 No 19 >cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o Probab=99.75 E-value=2.1e-14 Score=120.69 Aligned_cols=334 Identities=19% Similarity=0.206 Sum_probs=217.0 Q ss_pred HHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHHHCCCC-CHHHHHHH--HCCCCHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 79999999999865054-216999999866678865427899-98897622--225898999999998763062899999 Q gi|254780915|r 237 VRKSAAKALSHTFNKSS-HIFSFITNTLAKDEEIQDRWRKYE-KIADSRHL--SNNVEPYVIEALMQSVKNYYPKTSHRY 312 (626) Q Consensus 237 ~Rk~a~~a~~~~~~~~~-~~~a~il~~iv~~r~e~A~llGy~-s~~~~~l~--~~~~~~~~~~~l~~~~~~~~~~~~~~~ 312 (626) .+-++..++.++-+++. ..+.+.|.+++.+..+.|..+|+. +..|.-|. ...++.+.++.++..+....+|+.++. T Consensus 9 a~s~a~~~W~~AR~~nDf~~f~P~L~~vv~L~re~A~~~~~~~~pYDaLLd~YEpg~t~~~ld~iF~~Lk~~l~~li~~I 88 (396) T cd06460 9 ATSEAEQAWREARKKNDFALFAPYLEKIVELSREIAEYLGYEKHPYDALLDLYEPGMTTAKLDAIFAELKAGLVPLLKKV 88 (396) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99699999999988688999899999999999999998289998689999861899989999999999999999999999 Q ss_pred HHHHHHHCCCCCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCCC Q ss_conf 99999870887531234440256554777998999999998666632899999998642485020103799874354766 Q gi|254780915|r 313 YELKKKWLKLDTMYFWDRLAPLPGTSQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHGT 392 (626) Q Consensus 313 ~~~~~~~lg~~~l~~wD~~~~~~~~~~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~ 392 (626) ....+ + +........++.+.-+.+ +.++.+.+.-.++.+.+|.- .+.||.++ T Consensus 89 ~~~~~---~-----------~~~~~~~~~~~~~~Q~~l--------~~~i~~~~GfDf~~GRlD~S------~HPF~~~~ 140 (396) T cd06460 89 LEKQK---P-----------PDDSLLSGPFPEEKQEAL--------GRELLEALGFDFDRGRLDVS------AHPFTGGL 140 (396) T ss_pred HHCCC---C-----------CCCCCCCCCCCHHHHHHH--------HHHHHHHCCCCCCCCEEECC------CCCCCCCC T ss_conf 95689---9-----------986545799999999999--------99999990958665445245------68866789 Q ss_pred CCCCCCEEECCCCC--CHHHHHHHHHHHHHHHHHHHHC---CCCCCCCCCCHHHHHHHHHHHHHHHHH------HHHH-- Q ss_conf 67775101036533--2467999999845899999612---568776578005999886888999999------9984-- Q gi|254780915|r 393 IPSVHPYILLNYLG--KPQDVMTLAHELGHGIHFVLSS---ETQGILTNNSSLTLAETASIFGETLTF------DSLL-- 459 (626) Q Consensus 393 ~~~~~~~I~~N~~~--~~~dv~TL~HE~GHa~H~~ls~---~~~~~~~~~~~~~~aE~~S~~~E~~~~------~~~~-- 459 (626) .+ ...+|.-.|.. -...+....||.|||+.-.--. ..+ +.+..+++-+.|-.|-|+|+.+. .++. T Consensus 141 ~~-~DvRiTTry~e~d~~~~l~~~iHE~GHalYEqgl~~~~~~~-P~g~~~smgiHESQSr~~En~igrS~~F~~~~~p~ 218 (396) T cd06460 141 GP-GDVRITTRYDENDFRSALFSTIHETGHALYEQGLPPELRGT-PLGGGASMGIHESQSRLWENQVGRSRAFWEFLYPK 218 (396) T ss_pred CC-CCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCHHHCCC-CCCCCCEEEECCHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 86-64012300042247999999998751678861989778568-56677214452289999999997389999999999 Q ss_pred ---------CCCCCHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf ---------13899899999999986------206999999999999999997212016899989999999999998607 Q gi|254780915|r 460 ---------QAASSKEERKILLANKI------EDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLG 524 (626) Q Consensus 460 ---------~~~~~~~~~~~ll~~~~------~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g 524 (626) +....++....+-.-+- .+-...-+..+.+.+.|.++- ++.+++++|.++|.+..++|.| T Consensus 219 l~~~f~~~~~~~s~e~~y~~~n~V~pslIRveADEvTY~lHImlRyeiEk~li-----~g~l~v~DLP~~Wn~~~~~yLG 293 (396) T cd06460 219 LKKTFPEQLKDVSLENFYRAVNRVQPSLIRVEADEVTYNLHIMLRYELEKALI-----EGDLEVADLPEAWNEKMKEYLG 293 (396) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH-----CCCCCHHHHHHHHHHHHHHHCC T ss_conf 99984702457899999999975377511120054304789999999999998-----5998731449999999999759 Q ss_pred CCCCCCCCCCCCEEECCCCCCCCCCCHH-HHHHHHHHHHHHHHHHHHCCC-----------HHHHHHHH--HHHHCCCCC Q ss_conf 7634588788850015254106675258-999999999999999995673-----------32899999--998326999 Q gi|254780915|r 525 PAFDLSDLEYGSFWMMVPHFIESSFYVY-AYAFGNCLVNSLYDIYKSNTV-----------DCFKEKYL--NILRAGNSK 590 (626) Q Consensus 525 ~~~~~~~~~~~~~w~~~~H~y~~~fY~y-~Y~~~~~~a~~l~~~~~~~~~-----------~~~~~~y~--~~L~~Ggs~ 590 (626) -... ....|+ .+=-|-..+.|=|+ +|+.|.+.|+++|...+++-+ ..... ++ +|-+.|++. T Consensus 294 i~p~--~d~~G~--LQDiHWs~G~fGYFPtY~LG~~~AaQl~~~~~~~~~~~~~~i~~G~f~~l~~-WL~e~Ih~~G~~~ 368 (396) T cd06460 294 IRPP--NDAEGC--LQDIHWSGGSFGYFPTYTLGNLYAAQLFAAAKKDLPDIDEQIERGDFSPLLE-WLRENIHQHGSRY 368 (396) T ss_pred CCCC--CCCCCC--CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCHHHHHH-HHHHHHHHCCCCC T ss_conf 8899--965564--6576100674353525688899999999999885567878987589499999-9999998602479 Q ss_pred CHHHHHHHC-CCCCCCHHHHHH Q ss_conf 979999970-889899889999 Q gi|254780915|r 591 HYSELLLPL-NINLSDPNFWER 611 (626) Q Consensus 591 ~~~el~k~~-G~d~~~~~~~~~ 611 (626) +|.||++.+ |-.++ |+++.+ T Consensus 369 ~~~eLl~~~TGe~l~-~~~~~~ 389 (396) T cd06460 369 SPDELLKKATGEPLN-PEYFLE 389 (396) T ss_pred CHHHHHHHHHCCCCC-HHHHHH T ss_conf 989999998689998-799999 No 20 >KOG3690 consensus Probab=99.74 E-value=4.3e-11 Score=97.14 Aligned_cols=514 Identities=12% Similarity=0.058 Sum_probs=270.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCC Q ss_conf 99899999999999999999999999986420579918999999999999999999887416988719989999998469 Q gi|254780915|r 82 KNCHSLGAAIAEYERICELIGRIASYAMLSYNCNLSSPTIRKFYTDINAKLADFEKVLIFFALEINTLDEALLEQSYAQD 161 (626) Q Consensus 82 ~~~~~l~~~l~~~e~i~~~l~rv~~ya~l~~s~dt~d~~~~~~~~~~~~~~s~~~~~l~~~~~e~~~l~e~~l~~~~~~~ 161 (626) .+-.+...+++.++.-.+.+-+....+...+.+|.+.+..++ ..+++...+++-..++.- .+.......++ T Consensus 29 ~~e~~~~~fL~~~~~q~~~v~~q~~la~~~y~~~~~~~~~~a-~~ea~~~~~~~~r~~a~q--------~k~~~~~~l~D 99 (646) T KOG3690 29 TDEEEAKRFLERYNAQARVVYRQEALAGWNYNTNGTEENLLA-LLEAEDKLVEFVRSLATQ--------AKKFDVAELQD 99 (646) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH-HHHHHHHHHHHHHHHHHH--------HHHCCHHHCCC T ss_conf 779999999998789999999999999888753798499999-998889999999999877--------75424864048 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHH-HHHHHHHCCCHHHHHH Q ss_conf 001111799999997423317167998766300100036777777644443322100347523-3213541888767999 Q gi|254780915|r 162 PLTLKYSAWIKNIRKIKKHLLSNDMECLLSDTSQVGREALKRFFCENIESLRFKINDQKIPLT-KAYKSFFDSDREVRKS 240 (626) Q Consensus 162 ~~l~~y~~~~~~l~~~~~~~l~~~~E~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~dr~~Rk~ 240 (626) + ..++.+..+.......|++..-+.++.+. ++....+....+-..+.+......-.+ -...+..+.|.+-+.- T Consensus 100 ~---~~rRql~~~s~lg~saL~~~d~~~~~~ll---~~m~~~y~~~~vC~y~~p~~~~L~l~P~l~~i~~~Srd~~eL~~ 173 (646) T KOG3690 100 E---RLRRQLQKVSQLGMSALSPDDLKRYNQLL---SSMSTNYNTATVCAYDQPSACCLLLEPQLQSIMANSRDYDELQW 173 (646) T ss_pred H---HHHHHHHHHHHHCCCCCCHHHHHHHHHHH---HHHHHHCCCCEEECCCCCCCCCEECCCCHHHHHHCCCCHHHHHH T ss_conf 8---89999998866332028978999999999---99985247762128889997540027317888734688899999 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHH---HHHHHH Q ss_conf 99999998650542169999998666788654278999889762222589899999999876306289999---999999 Q gi|254780915|r 241 AAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNYYPKTSHR---YYELKK 317 (626) Q Consensus 241 a~~a~~~~~~~~~~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~---~~~~~~ 317 (626) .|.++..+.... .-..+.++|.+-++.|++-||.++.+|=...-..+. .+...++.+.....|+.+. |.+.+. T Consensus 174 ~W~~wr~~~g~~---~r~~y~~yv~L~~e~A~lNg~~~~gdyW~~~yE~~~-~~~~~ld~i~~ei~PlY~~LHAYvRr~L 249 (646) T KOG3690 174 YWVEWRRKIGKA---MRQSYEEYVDLSNEAAQLNGYVNGGDYWRLAYETPG-DFEQDLDAIFEEIRPLYRQLHAYVRRKL 249 (646) T ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999986124---467799999999999986498878888999851725-6999999999998899999999999999 Q ss_pred -HHCCCCCC----------------CHH----HHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH----HHH- Q ss_conf -87088753----------------123----4440256554777998999999998666632899999998----642- Q gi|254780915|r 318 -KWLKLDTM----------------YFW----DRLAPLPGTSQDIIPFEVARDLVLQSYAKFSPQMSIIAEK----FFT- 371 (626) Q Consensus 318 -~~lg~~~l----------------~~w----D~~~~~~~~~~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~----~~~- 371 (626) ..-|-+.+ +.| |...|.+.......+.+-+.+... .+.+-.+-.++|.. ... T Consensus 250 ~~~Ygp~~i~~~gpiPahLlgnm~gq~Ws~~y~~~~P~pe~~~~dvt~~m~~qg~~--~qkmF~~a~efF~SlGl~~lp~ 327 (646) T KOG3690 250 RGAYGPDGISRDGPIPAHLLGNMWGQDWSNHYDLTTPFPERPLIDVTMEMVKQGYT--VQKMFKLAAEFFTSLGLPALPP 327 (646) T ss_pred HHHCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHCCCCCCCCCCCCHHHHHHHCCCC--HHHHHHHHHHHHHHCCCCCCCH T ss_conf 87528333798898488887656402327553002689887877657999982433--9999999999999769876896 Q ss_pred CCC-CCHHCCCCCCCCCCC----CCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC-CCHHHHHHH Q ss_conf 485-020103799874354----76667775101036533246799999984589999961256877657-800599988 Q gi|254780915|r 372 HNW-IDAPQYDGKGSGAFA----HGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSSETQGILTN-NSSLTLAET 445 (626) Q Consensus 372 ~~~-iD~~~r~gKr~GA~~----~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~~~~~~~~~-~~~~~~aE~ 445 (626) ..| --...|++-+.++-| +.++.++..+|.+.-..+..+..+.+||+|| ++.++..+.||+... .+.--|.|. T Consensus 328 sfw~~s~~~~p~d~~~v~C~~sAwdf~~~~Dfrik~Ct~vd~~~f~~~Hhem~h-iQY~l~y~~qP~llre~anPgFheA 406 (646) T KOG3690 328 SFWTNSILTRPDDRDMVVCHASAWDFYAGPDFRIKYCTKVDEEDFEQAHHEMGH-IQYYLQYKQQPFLLREGANPGFHEA 406 (646) T ss_pred HHHHHHHHCCCCCCCCEEECHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHHHH-HHHHHHHHHCCHHHHCCCCCCHHHH T ss_conf 887677725788887524356663044688731544475787899999999999-9999998745487627999728899 Q ss_pred HHHHHHHHHH--HH-----HHCCCCCHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 6888999999--99-----841389989999999998620699999-999999999999721201689998999999999 Q gi|254780915|r 446 ASIFGETLTF--DS-----LLQAASSKEERKILLANKIEDMLNSIV-RQISFYDFELKLHTERRSTGDIPTHRINEIWLE 517 (626) Q Consensus 446 ~S~~~E~~~~--~~-----~~~~~~~~~~~~~ll~~~~~~~~~~~~-~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~ 517 (626) -.-.+=+-+. +| ++..-.+.+ ....-.+++.++...+ -.+.....|..-+.- -.|.++.++.+.-|++ T Consensus 407 Igd~~als~stPrhL~slgLL~~~~~~d--~~~~IN~L~k~aL~~v~~LPf~y~~DkwR~~V--f~G~i~k~~~N~~~W~ 482 (646) T KOG3690 407 IGDAFALSVSTPRHLHSLGLLSEDLDED--DEVRINRLFKMALDKVAFLPFTYALDKWRYEV--FDGEIPKDQYNCAYWE 482 (646) T ss_pred HHHHHHHHCCCHHHHHHHCCCCCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCCCHHHHHHHH T ss_conf 9999998528878888704534233641--17899999999988986402666543468887--6387882024689987 Q ss_pred HHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC-------------------CCHHHHH Q ss_conf 999860776345887888500152541066752589999999999999999956-------------------7332899 Q gi|254780915|r 518 TQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKSN-------------------TVDCFKE 578 (626) Q Consensus 518 ~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~Y~~~~~~a~~l~~~~~~~-------------------~~~~~~~ 578 (626) +..+|-|-..+++.....+...-.-|+ ....-|..|-++.|+-.+++..+... +.+++++ T Consensus 483 lr~ky~Gv~~P~~r~~~dfD~~akf~~-~~d~~~~ryf~s~vlqFQf~kalC~~agq~~pg~pr~pLh~CDi~~sk~aG~ 561 (646) T KOG3690 483 LRNKYEGVRPPVPRTNKDFDPPAKFHI-QVDVSYTRYFLSTVLQFQFLKALCQAAGQYGPGDPRKPLHNCDIYDSKEAGE 561 (646) T ss_pred HHHHHCCCCCCCCCCCCCCCCHHHHCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCCCCCHHHHH T ss_conf 788736767997675334781354236-7551899999999999999999999846647899987750304647777889 Q ss_pred HHHHHHHCCCCCCHHHHHHHC-CCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 999998326999979999970-88989988999999999999999 Q gi|254780915|r 579 KYLNILRAGNSKHYSELLLPL-NINLSDPNFWERGLQTVEKMIDD 622 (626) Q Consensus 579 ~y~~~L~~Ggs~~~~el~k~~-G~d~~~~~~~~~~~~~~~~~i~~ 622 (626) +.+++++.|.|++..|+++.+ |--=-+..++..=++.+-.|+.+ T Consensus 562 kL~~~M~lG~S~~W~evLE~~tGe~els~~~LLeYFePL~~WLk~ 606 (646) T KOG3690 562 KLRTMMSLGSSRHWREVLEEITGEGELSGRALLEYFEPLFTWLKQ 606 (646) T ss_pred HHHHHHHCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 999999735876389999997189767848899999999999986 No 21 >COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism] Probab=99.65 E-value=1.5e-12 Score=107.53 Aligned_cols=336 Identities=20% Similarity=0.232 Sum_probs=213.8 Q ss_pred HHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH---HCCCCHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 79999999999865054-21699999986667886542789998897622---225898999999998763062899999 Q gi|254780915|r 237 VRKSAAKALSHTFNKSS-HIFSFITNTLAKDEEIQDRWRKYEKIADSRHL---SNNVEPYVIEALMQSVKNYYPKTSHRY 312 (626) Q Consensus 237 ~Rk~a~~a~~~~~~~~~-~~~a~il~~iv~~r~e~A~llGy~s~~~~~l~---~~~~~~~~~~~l~~~~~~~~~~~~~~~ 312 (626) ++-++..++-+.-.++- ..+.+-|.+++.+..+.|+.+||.-..--+|. .-..+..+++..+..++...+|+..+. T Consensus 108 ~~s~a~~aWreAr~knDf~~F~p~Lekivel~re~A~~~~~~~~pYdaLld~yEpG~t~~~i~~vF~~Lk~~L~~ll~kv 187 (497) T COG2317 108 LTSKAEHAWREAREKNDFSIFKPYLEKIVELKREFAEYRGYEEHPYDALLDLYEPGLTVRDVDRVFAELKKELVPLLDKV 187 (497) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87788999998765065666578899999999999986144358088999862688758999999999999999999999 Q ss_pred HHHHHHHCCCCCCCHHHHHHCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCC Q ss_conf 99999870887531234440256-55477799899999999866663289999999864248502010379987435476 Q gi|254780915|r 313 YELKKKWLKLDTMYFWDRLAPLP-GTSQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHG 391 (626) Q Consensus 313 ~~~~~~~lg~~~l~~wD~~~~~~-~~~~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~ 391 (626) .....+ |.. ......|+-+.-.+ ++..+.+.++..++.+.+|.-+ +.||.+ T Consensus 188 ~~~~~~--------------~~~~~~~~~~~~~e~q~~--------~~~~vl~~lGfdf~~GRlD~S~------HPF~~g 239 (497) T COG2317 188 LEKGKS--------------PRSDLLEKEKYDKEKQEA--------LGLRVLELLGFDFERGRLDVSV------HPFTTG 239 (497) T ss_pred HHCCCC--------------CCCCCCCCCCCCHHHHHH--------HHHHHHHHHCCCCCCCCCCCCC------CCCCCC T ss_conf 863678--------------753212457889899999--------9999999948985576303788------986578 Q ss_pred CCCCCCCEEECCCCCC--HHHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCHHHHHHHHHHHHHHHHHH------HH--- Q ss_conf 6677751010365332--46799999984589999961256--87765780059998868889999999------98--- Q gi|254780915|r 392 TIPSVHPYILLNYLGK--PQDVMTLAHELGHGIHFVLSSET--QGILTNNSSLTLAETASIFGETLTFD------SL--- 458 (626) Q Consensus 392 ~~~~~~~~I~~N~~~~--~~dv~TL~HE~GHa~H~~ls~~~--~~~~~~~~~~~~aE~~S~~~E~~~~~------~~--- 458 (626) ++ ....+|.-.|++. .+.+.-..||+|||+.-..-.+. -.+.+.+.+|-+.|-.|-|+|+.+-. .. T Consensus 240 ~~-~~DVRITTRy~~~df~~aL~g~iHE~GHAlYEqn~~~~l~gtPlg~g~smgiHESQSrF~En~VGRs~aFw~~~~~~ 318 (497) T COG2317 240 LP-INDVRITTRYNEQDFRSALFGTIHETGHALYEQNLDEALLGTPLGEGRSMGIHESQSRFWENQVGRSRAFWEAIYPL 318 (497) T ss_pred CC-CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 99-87536874047765699999999853157787088988728866667664301199999999964689899998999 Q ss_pred HC--------CCCCHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 41--------389989999999998620--------69999999999999999972120168999899999999999986 Q gi|254780915|r 459 LQ--------AASSKEERKILLANKIED--------MLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKES 522 (626) Q Consensus 459 ~~--------~~~~~~~~~~ll~~~~~~--------~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~ 522 (626) ++ ....++..... +.... -...-+..+.+.+.|..+- +|.+.++++-++|.+...+| T Consensus 319 ~~~~~~~~~~~~s~dd~y~~v--n~v~~~lIRveADElTY~lHIilRyEiEk~li-----~G~l~v~DlP~lWn~kme~y 391 (497) T COG2317 319 LRKHFPEQFDKYSLDDFYRAV--NRVEPSLIRVEADELTYPLHIILRYEIEKELI-----SGELKVDDLPELWNDKMEEY 391 (497) T ss_pred HHHHHHHHCCCCCHHHHHHHH--HCCCCCCEEEECCCCCEEEEEEHHHHHHHHHH-----CCCCCHHHHHHHHHHHHHHH T ss_conf 998623421267799999998--30367553650431410013302489999997-----49864413088899999997 Q ss_pred CCCCCCCCCCCCCCEEECCCCCCCCCCCHH-HHHHHHHHHHHHHHHHHHCC----------CHHHHHHHH--HHHHCCCC Q ss_conf 077634588788850015254106675258-99999999999999999567----------332899999--99832699 Q gi|254780915|r 523 LGPAFDLSDLEYGSFWMMVPHFIESSFYVY-AYAFGNCLVNSLYDIYKSNT----------VDCFKEKYL--NILRAGNS 589 (626) Q Consensus 523 ~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y-~Y~~~~~~a~~l~~~~~~~~----------~~~~~~~y~--~~L~~Ggs 589 (626) .|-... .+ .-+..+=-|-++++|-|+ +|+.|.+.|+++|...+++- +=+-...|+ +|=+.|.. T Consensus 392 LGirp~-~d---~eG~LQDIHWs~GsfGYFPsYtlG~v~AAQ~~~a~~~~~p~~d~~i~~gd~~~i~~WL~e~ih~~Gs~ 467 (497) T COG2317 392 LGIRPK-ND---AEGVLQDIHWSHGSFGYFPTYTLGNVYAAQLYAAMKKDIPDVDALIAKGDFSPIKNWLRENIHRHGSR 467 (497) T ss_pred CCCCCC-CC---CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHCCCC T ss_conf 398999-86---66511102526888676713356799999999999753666878887278088899999999971676 Q ss_pred CCHHHHHHH-CCCCCCCHHHHHHHH Q ss_conf 997999997-088989988999999 Q gi|254780915|r 590 KHYSELLLP-LNINLSDPNFWERGL 613 (626) Q Consensus 590 ~~~~el~k~-~G~d~~~~~~~~~~~ 613 (626) .+|.||++. .|-.+ +|+++-+=+ T Consensus 468 ~~p~eLi~~atGE~l-np~y~i~yL 491 (497) T COG2317 468 YPPKELLKRATGEAL-NPEYFIDYL 491 (497) T ss_pred CCCHHHHHHHHCCCC-CHHHHHHHH T ss_conf 896899998637868-879999999 No 22 >pfam08439 Peptidase_M3_N Oligopeptidase F. This domain is found to the N-terminus of the pfam01432 domain in bacterial and archaeal proteins including Oligoendopeptidase F. An example of this protein is Lactococcus lactis PepF. Probab=98.23 E-value=2.4e-06 Score=63.25 Aligned_cols=70 Identities=29% Similarity=0.311 Sum_probs=57.4 Q ss_pred HHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 8871998999999846900111179999999742331716799876630010003677777764444332 Q gi|254780915|r 145 EINTLDEALLEQSYAQDPLTLKYSAWIKNIRKIKKHLLSNDMECLLSDTSQVGREALKRFFCENIESLRF 214 (626) Q Consensus 145 e~~~l~e~~l~~~~~~~~~l~~y~~~~~~l~~~~~~~l~~~~E~~l~~~~~~~~~~~~~~~~~~~~~~~~ 214 (626) ++..++++.+..++.+++.|+.|++++.++++.++|+|++++|++++..++..+++...+....++++++ T Consensus 1 el~~l~e~~~~~~~~~~~~L~~y~~~l~~~~r~k~h~Ls~~~E~lLa~~~~~~~a~~~~y~~l~~adl~f 70 (70) T pfam08439 1 ELLALPEEKLEALLEEDPELKPYRFYLEELRRQKPHTLSEEEEKLLAALSDGLGAWSRLFDELTNADLKF 70 (70) T ss_pred CHHHCCHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCC T ss_conf 9041799999999984961899999999999818877999999999986677610999999998757989 No 23 >COG2856 Predicted Zn peptidase [Amino acid transport and metabolism] Probab=96.13 E-value=0.1 Score=30.16 Aligned_cols=52 Identities=25% Similarity=0.277 Sum_probs=32.3 Q ss_pred CCCCHHC-CCCCCCCCCCCCCCCCCCCEEECCCCC-CHHHHHHHHHHHHHHHHHHH Q ss_conf 8502010-379987435476667775101036533-24679999998458999996 Q gi|254780915|r 373 NWIDAPQ-YDGKGSGAFAHGTIPSVHPYILLNYLG-KPQDVMTLAHELGHGIHFVL 426 (626) Q Consensus 373 ~~iD~~~-r~gKr~GA~~~~~~~~~~~~I~~N~~~-~~~dv~TL~HE~GHa~H~~l 426 (626) +-+++.- ...|++.|+|.-. .. .|.|+.|-+. ..+..+||+||+||++-+-. T Consensus 35 gv~vf~~~~~~~~~d~~~~~~-~~-~~~I~iN~n~~~~r~rFtlAHELGH~llH~~ 88 (213) T COG2856 35 GVLVFELSFINKNIDAYGLYD-EE-KPVIYINANNSLERKRFTLAHELGHALLHTD 88 (213) T ss_pred CEEEEECCCCCCCCCCCEEEE-CC-CCEEEEECCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 738874024312466663566-16-8659985789878888789998758996265 No 24 >pfam05960 DUF885 Bacterial protein of unknown function (DUF885). This family consists of several hypothetical bacterial proteins several of which are putative membrane proteins. Probab=95.90 E-value=0.14 Score=29.32 Aligned_cols=172 Identities=16% Similarity=0.121 Sum_probs=98.2 Q ss_pred CEEECCCCC-----CHHHHHHHHHHH--HHHHHHHHHCCCCCC--C-CCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 101036533-----246799999984--589999961256877--6-578005999886888999999998413899899 Q gi|254780915|r 398 PYILLNYLG-----KPQDVMTLAHEL--GHGIHFVLSSETQGI--L-TNNSSLTLAETASIFGETLTFDSLLQAASSKEE 467 (626) Q Consensus 398 ~~I~~N~~~-----~~~dv~TL~HE~--GHa~H~~ls~~~~~~--~-~~~~~~~~aE~~S~~~E~~~~~~~~~~~~~~~~ 467 (626) .++..|... .+....|.+||. ||-+|..++.++.+. . ..-....++|==.+..|.+..+. +...++.. T Consensus 346 ~~~~in~~~~~~~~~~~~~~~~~HE~~PGHh~Q~~~~~~~~~~~~~R~~~~~~~~~EGWAlY~E~l~~e~--G~~~dp~~ 423 (536) T pfam05960 346 GRYYINLYDLRSRPKYSLETLAAHEAVPGHHLQIALAQELEDLPPFRRLLGFTAYVEGWALYAEELGDEM--GLYEDPYS 423 (536) T ss_pred CEEEEECCCHHHHCCCCCCHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHCCCCHHHHHHHHHHHHHHHHC--CCCCCHHH T ss_conf 6598868972342013411233202466089999999975787358998056127747999999999873--87789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 99999998620699999999999999999721201689998999999999999860776345887888500152541066 Q gi|254780915|r 468 RKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIES 547 (626) Q Consensus 468 ~~~ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~ 547 (626) +...|...+. -......|..+|.. .+|.++-.++|.+ +.|.. .......|.+ +... T Consensus 424 ~lg~l~~~l~--------ra~r~vvD~glH~~-----~ws~~~a~~~~~~----~~~~~----~~~a~~e~~r---y~~~ 479 (536) T pfam05960 424 RLGMLQDELW--------RAARLVVDTGLHAK-----GWTREQAIDYLVE----NTGLS----EGEAEAEVDR---YIVW 479 (536) T ss_pred HHHHHHHHHH--------HHHHHHHHHHHHCC-----CCCHHHHHHHHHH----HCCCC----HHHHHHHHHH---HHCC T ss_conf 9999999999--------99999952545227-----9999999999998----37998----7999999999---9729 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHC-CCH-HHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 752589999999999999999956-733-28999999983269999799999 Q gi|254780915|r 548 SFYVYAYAFGNCLVNSLYDIYKSN-TVD-CFKEKYLNILRAGNSKHYSELLL 597 (626) Q Consensus 548 ~fY~y~Y~~~~~~a~~l~~~~~~~-~~~-~~~~~y~~~L~~Ggs~~~~el~k 597 (626) |-|.-+|.+|.+.=.++-...++. |.+ ...++...+|+.|.. |..+++ T Consensus 480 Pgq~~sY~~G~~~i~~lr~~a~~~lG~~f~lk~FH~~iL~~G~~--P~~~l~ 529 (536) T pfam05960 480 PGQALSYKLGKLKILELREKAEAALGDRFDLREFHDAVLSQGAL--PLDLLE 529 (536) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCC--CHHHHH T ss_conf 62788899999999999999999758899999999999827999--889999 No 25 >KOG2719 consensus Probab=93.40 E-value=0.08 Score=30.97 Aligned_cols=50 Identities=26% Similarity=0.218 Sum_probs=37.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCC--CCH---HH-HHHHHHHHHHHHHHHHHC Q ss_conf 037998743547666777510103653--324---67-999999845899999612 Q gi|254780915|r 379 QYDGKGSGAFAHGTIPSVHPYILLNYL--GKP---QD-VMTLAHELGHGIHFVLSS 428 (626) Q Consensus 379 ~r~gKr~GA~~~~~~~~~~~~I~~N~~--~~~---~d-v~TL~HE~GHa~H~~ls~ 428 (626) .++++.+.||..|+...+...|+-+.. +.+ ++ +-.++||+||--|+.... T Consensus 243 s~rs~hsNAyfyG~~~~KRIvIyDtLl~~~~~~~~eel~AVl~HELGHW~~~H~~K 298 (428) T KOG2719 243 SKRSSHSNAYFYGLCKNKRIVIYDTLLLEEEHLNNEELVAVLAHELGHWKLNHVLK 298 (428) T ss_pred CCCCCCCCEEEEECCCCCEEEEEHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 77788887555614642258882146652203652778889998763787766999 No 26 >pfam01400 Astacin Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family have an amino terminal propeptide which is cleaved to give the active protease domain. All other linked domains are found to the carboxyl terminus of this domain. This family includes: Astacin, a digestive enzyme from Crayfish. Meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain. Proteins involved in morphogenesis, and Tolloid from drosophila. Probab=92.61 E-value=0.098 Score=30.34 Aligned_cols=68 Identities=16% Similarity=0.088 Sum_probs=34.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCC--------CCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHH Q ss_conf 9999998666632899999998642485020103799--------87435476667775101036533246799999984 Q gi|254780915|r 347 ARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYDGK--------GSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHEL 418 (626) Q Consensus 347 ~~~~v~~~~~~~~~e~~~~~~~~~~~~~iD~~~r~gK--------r~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~ 418 (626) .++.|..|+..+.. .-.|.+-||.+. ..|..|++.+......-.+|....-..+.|..||+ T Consensus 20 ~~~~I~~Am~~~~~-----------~TCirF~pr~~~~~~~~i~~~~~~gC~S~vG~~gg~Q~islg~gC~~~g~i~HEl 88 (192) T pfam01400 20 ARALIRQAMRHWEQ-----------KTCIRFVPRTSAPDNNYLFFFKGDGCYSYVGRNGGAQPVSLGNGCDKFGIIVHEL 88 (192) T ss_pred HHHHHHHHHHHHHH-----------CCCCEEEECCCCCCCEEEEEECCCCEEEECCEECCEEEEEECCCCCCCCCCHHHH T ss_conf 99999999999985-----------8875378588898876999943994536247008802588688858677017888 Q ss_pred HHHHHHHH Q ss_conf 58999996 Q gi|254780915|r 419 GHGIHFVL 426 (626) Q Consensus 419 GHa~H~~l 426 (626) |||| |++ T Consensus 89 ~HaL-Gf~ 95 (192) T pfam01400 89 GHAL-GFW 95 (192) T ss_pred HHHH-CCC T ss_conf 8884-555 No 27 >cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods. Probab=92.40 E-value=0.11 Score=30.08 Aligned_cols=41 Identities=22% Similarity=0.187 Sum_probs=24.7 Q ss_pred CCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 354766677751010365332467999999845899999612 Q gi|254780915|r 387 AFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 387 A~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~ 428 (626) ||..+.+.....-++-...+++..+.|++||+||.+ ||--+ T Consensus 122 A~vggiCs~~~~~~~~d~~~~~~~~~t~AHElGH~l-G~~HD 162 (220) T cd04272 122 AYVGGACTENRVAMGEDTPGSYYGVYTMTHELAHLL-GAPHD 162 (220) T ss_pred CCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHC-CCCCC T ss_conf 420787479960379516876310334567667642-88667 No 28 >pfam10460 Peptidase_M30 Peptidase M30. This family contains the metallopeptidase hyicolysin. Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. Probab=92.38 E-value=0.23 Score=27.65 Aligned_cols=51 Identities=18% Similarity=0.070 Sum_probs=36.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 2467999999845899999612568776578005999886888999999998 Q gi|254780915|r 407 KPQDVMTLAHELGHGIHFVLSSETQGILTNNSSLTLAETASIFGETLTFDSL 458 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~H~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~~~~~~ 458 (626) --.-+.||+||+-|..+++.-.. ..=....+..++-|.-.++||.|+.... T Consensus 136 l~~v~sTmaHE~~HM~NFYqR~v-l~g~~y~fdtWLEEmtAMmmED~~s~~i 186 (366) T pfam10460 136 LNTVLSTMAHEGTHMQNFYRRGV-LMGAQYGFDVWLEEMTAMMMEDFASSKI 186 (366) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHCCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999998788899999999887-6056402788999999999999987377 No 29 >cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease. Probab=92.00 E-value=0.25 Score=27.45 Aligned_cols=15 Identities=47% Similarity=0.815 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 679999998458999 Q gi|254780915|r 409 QDVMTLAHELGHGIH 423 (626) Q Consensus 409 ~dv~TL~HE~GHa~H 423 (626) ....|..||+|||+= T Consensus 95 ~~~gti~HElgHaLG 109 (167) T cd00203 95 EGAQTIAHELGHALG 109 (167) T ss_pred CCCCHHHHHHHHHHC T ss_conf 554248999999848 No 30 >cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development. Probab=91.84 E-value=0.18 Score=28.43 Aligned_cols=73 Identities=18% Similarity=0.240 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 999999866663289-9999998642485020103799874354766677751010365332467999999845899999 Q gi|254780915|r 347 ARDLVLQSYAKFSPQ-MSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFV 425 (626) Q Consensus 347 ~~~~v~~~~~~~~~e-~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ 425 (626) .++.|..|+..+... ...+..+--+..||.+.. .+| |++.+......-.++....-..+.|..||+|||+= + T Consensus 16 ~~~~I~~Am~~~~~~TCirF~pr~~~~~yi~f~~----~~g--C~S~vG~~~g~q~isl~~~C~~~g~i~HEl~HalG-f 88 (180) T cd04280 16 DRSLILRAMREIESNTCIRFVPRTTEKDYIRIVK----GSG--CWSYVGRVGGRQVVSLGSGCFSLGTIVHELMHALG-F 88 (180) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEEC----CCC--EEECCCCCCCEEEEEECCCCCCCCHHHHHHHHHHC-C T ss_conf 9999999999998679653588899875899965----997--13615701882368847886867633999998745-6 Q ss_pred H Q ss_conf 6 Q gi|254780915|r 426 L 426 (626) Q Consensus 426 l 426 (626) + T Consensus 89 ~ 89 (180) T cd04280 89 Y 89 (180) T ss_pred C T ss_conf 5 No 31 >pfam04298 Zn_peptidase_2 Putative neutral zinc metallopeptidase. Zinc metallopeptidase zinc binding regions have been predicted in some family members by a pattern match (Prosite:PS00142). Probab=91.75 E-value=0.24 Score=27.62 Aligned_cols=51 Identities=20% Similarity=0.218 Sum_probs=34.2 Q ss_pred CCCCC-CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 36533-246799999984589999961256877657800599988688899999 Q gi|254780915|r 402 LNYLG-KPQDVMTLAHELGHGIHFVLSSETQGILTNNSSLTLAETASIFGETLT 454 (626) Q Consensus 402 ~N~~~-~~~dv~TL~HE~GHa~H~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~~ 454 (626) -||++ +...+.+-+||.|||+++-- .+.|..-...=.+++-+.|++.-.++ T Consensus 80 ~~y~~~SlaAiaVAAHEvGHAiQha~--~Y~pL~~Rs~lvp~~~~~s~~~~~l~ 131 (222) T pfam04298 80 DVYNGRSVAAIGVAAHEVGHAIQHAE--GYAPLVLRSALVPVVNIGSNLSWILF 131 (222) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 11479889999999999838988562--47589999899999999999999999 No 32 >smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site. Probab=91.71 E-value=0.074 Score=31.22 Aligned_cols=81 Identities=23% Similarity=0.286 Sum_probs=38.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHH--HHCCCCCHHCCCCCCCCCC-CCCCCC-CCCCEEECCCCCCHHHHHHHH Q ss_conf 779989999999986666328999999986--4248502010379987435-476667-775101036533246799999 Q gi|254780915|r 340 DIIPFEVARDLVLQSYAKFSPQMSIIAEKF--FTHNWIDAPQYDGKGSGAF-AHGTIP-SVHPYILLNYLGKPQDVMTLA 415 (626) Q Consensus 340 ~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~--~~~~~iD~~~r~gKr~GA~-~~~~~~-~~~~~I~~N~~~~~~dv~TL~ 415 (626) ..++..+.++.|..|+..+...-.-.|.+. -...+|-+....+ +|.| .....+ +....-+-+ .-..+.|.. T Consensus 17 ~~~~~~~~r~~i~~A~~~w~~~Tci~F~~~~~~~~~~i~f~~~~~--~gc~~s~~~~~~g~q~~~~~~---~c~~~g~i~ 91 (140) T smart00235 17 SSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGDG--SGCTLSHAGRPGGDQHFSLGN---GCINTGVAA 91 (140) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEECCC--CCCCEEEEECCCCCEEEECCC---CCCCCCHHH T ss_conf 999979999999999999985478638965678871899986888--997457630448823345388---816666245 Q ss_pred HHHHHHHHHHH Q ss_conf 98458999996 Q gi|254780915|r 416 HELGHGIHFVL 426 (626) Q Consensus 416 HE~GHa~H~~l 426 (626) ||+|||+ |+. T Consensus 92 HEigHaL-Gl~ 101 (140) T smart00235 92 HELGHAL-GLY 101 (140) T ss_pred HHHHHHC-CCC T ss_conf 5345342-665 No 33 >cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion. Probab=91.38 E-value=0.19 Score=28.34 Aligned_cols=38 Identities=18% Similarity=0.048 Sum_probs=19.8 Q ss_pred CCCCCCCCCC--CCEEECCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 3547666777--510103653324679999998458999996 Q gi|254780915|r 387 AFAHGTIPSV--HPYILLNYLGKPQDVMTLAHELGHGIHFVL 426 (626) Q Consensus 387 A~~~~~~~~~--~~~I~~N~~~~~~dv~TL~HE~GHa~H~~l 426 (626) ..|++.+... .-.|.+...+ -..+.|..||+|||+ |++ T Consensus 53 ~GC~S~vGr~gg~Q~isl~~~g-C~~~g~i~HEl~Hal-Gf~ 92 (182) T cd04283 53 SGCWSYIGRQGGRQTVSLQKQG-CMYKGIIQHELLHAL-GFY 92 (182) T ss_pred CCEEECCCEECCEEEEEECCCC-CCCCCHHHHHHHHHH-CCC T ss_conf 9785405710981248868999-787672078888884-555 No 34 >pfam06114 DUF955 Domain of unknown function (DUF955). Family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to pfam01435. Probab=91.02 E-value=0.25 Score=27.41 Aligned_cols=28 Identities=36% Similarity=0.411 Sum_probs=22.0 Q ss_pred CCEEECCCCC-CHHHHHHHHHHHHHHHHH Q ss_conf 5101036533-246799999984589999 Q gi|254780915|r 397 HPYILLNYLG-KPQDVMTLAHELGHGIHF 424 (626) Q Consensus 397 ~~~I~~N~~~-~~~dv~TL~HE~GHa~H~ 424 (626) .+.|+.|-+. ..+..+|++||+||.+.+ T Consensus 27 ~~~I~in~~~~~~~~~f~~aHElgH~~l~ 55 (121) T pfam06114 27 NKVIFINENLSPERQRFTLAHELGHLLLH 55 (121) T ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHHHC T ss_conf 99899899999777599999999999936 No 35 >cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells. Probab=90.56 E-value=0.36 Score=26.35 Aligned_cols=74 Identities=18% Similarity=0.142 Sum_probs=36.5 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9999999866663289-999999864248502010379987435476667775101036533246799999984589999 Q gi|254780915|r 346 VARDLVLQSYAKFSPQ-MSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHF 424 (626) Q Consensus 346 e~~~~v~~~~~~~~~e-~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~ 424 (626) ..+..|..|++.+... ...+..+.-+..||.+.. .+| |++.+....-.-.++....=..+.|..||+|||+=. T Consensus 61 ~~~~~I~~Am~~~~~~TCirF~~r~~e~~yi~~~~----~~G--C~S~VG~~gg~Q~vslg~gC~~~G~i~HEl~HaLGf 134 (230) T cd04282 61 NAKGVILKAFEMYRLKSCVDFKPYEGESNYIFFFK----GSG--CWSMVGDQQGGQNLSIGAGCDYKATVEHEFLHALGF 134 (230) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEEC----CCC--CCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHCC T ss_conf 99999999999998589541486789875599956----995--114226538805665078878776679999988556 Q ss_pred H Q ss_conf 9 Q gi|254780915|r 425 V 425 (626) Q Consensus 425 ~ 425 (626) + T Consensus 135 ~ 135 (230) T cd04282 135 Y 135 (230) T ss_pred C T ss_conf 6 No 36 >cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning. Probab=89.44 E-value=0.42 Score=25.85 Aligned_cols=18 Identities=33% Similarity=0.397 Sum_probs=13.8 Q ss_pred CCHHHHHHHHHHHHHHHH Q ss_conf 324679999998458999 Q gi|254780915|r 406 GKPQDVMTLAHELGHGIH 423 (626) Q Consensus 406 ~~~~dv~TL~HE~GHa~H 423 (626) ..-..+.|..||+|||+= T Consensus 83 ~~C~~~G~i~HEl~HalG 100 (200) T cd04281 83 KNCDKFGIVVHELGHVIG 100 (200) T ss_pred CCCCCCCCHHHHHHHHHC T ss_conf 876756302999988754 No 37 >KOG3607 consensus Probab=89.41 E-value=0.32 Score=26.71 Aligned_cols=58 Identities=26% Similarity=0.229 Sum_probs=34.6 Q ss_pred HCCCCCHHCC---CCCCCC-CCCCCCCCCCCCEEECCCC--CCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 2485020103---799874-3547666777510103653--324679999998458999996125 Q gi|254780915|r 371 THNWIDAPQY---DGKGSG-AFAHGTIPSVHPYILLNYL--GKPQDVMTLAHELGHGIHFVLSSE 429 (626) Q Consensus 371 ~~~~iD~~~r---~gKr~G-A~~~~~~~~~~~~I~~N~~--~~~~dv~TL~HE~GHa~H~~ls~~ 429 (626) .+.+.-+... .||+.| ||-.++....+..-+..+. .....+.|++||+||.+ +|.-+. T Consensus 278 ~hD~a~L~~~~~~~~~~~G~a~~~~mCs~~~s~gv~~~~~~~~~~~a~v~AhelgH~l-Gm~hD~ 341 (716) T KOG3607 278 PHDAAHLLSGILFYGKYVGLAYFGGMCSPGHSGGVNKFHSDILLAFAVVLAHELGHNL-GMIHDE 341 (716) T ss_pred CCCCEEEEEEEECCCCEECEEECCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHC-CCCCCC T ss_conf 8880478876402685411032165457665643452475442067899999878652-850066 No 38 >pfam01435 Peptidase_M48 Peptidase family M48. Probab=89.00 E-value=0.57 Score=24.90 Aligned_cols=48 Identities=21% Similarity=0.190 Sum_probs=31.1 Q ss_pred CHHCCCCCCCCCCCCCCCCCCCCEEECCC-----CCCHHHH-HHHHHHHHHHHHHHH Q ss_conf 20103799874354766677751010365-----3324679-999998458999996 Q gi|254780915|r 376 DAPQYDGKGSGAFAHGTIPSVHPYILLNY-----LGKPQDV-MTLAHELGHGIHFVL 426 (626) Q Consensus 376 D~~~r~gKr~GA~~~~~~~~~~~~I~~N~-----~~~~~dv-~TL~HE~GHa~H~~l 426 (626) .++.-+++...||+.+.. ..+|+++- ..+-+++ ..++||+||.-|+-. T Consensus 51 ~v~v~~~~~~NAfa~g~g---~~~I~v~~gLl~~~~~~~El~aVlAHEigHi~~~H~ 104 (222) T pfam01435 51 EVYVVDSPVRNAFALGGG---KNKVVVTSGLLDLALTDDELAAVLGHEIGHITARHS 104 (222) T ss_pred EEEEECCCCCCEEECCCC---CCEEEEEHHHHHHCCCHHHHHHHHHHHHHHHHCCCH T ss_conf 788874899884762799---866875568965169999999999999999983469 No 39 >PTZ00337 surface protease GP63; Provisional Probab=88.38 E-value=0.53 Score=25.12 Aligned_cols=49 Identities=27% Similarity=0.207 Sum_probs=33.3 Q ss_pred CHHCCCCCCCCCCCC---CCCCCCCCEEECCCCC-----CHHHHHHHHHHHHHHHHH Q ss_conf 201037998743547---6667775101036533-----246799999984589999 Q gi|254780915|r 376 DAPQYDGKGSGAFAH---GTIPSVHPYILLNYLG-----KPQDVMTLAHELGHGIHF 424 (626) Q Consensus 376 D~~~r~gKr~GA~~~---~~~~~~~~~I~~N~~~-----~~~dv~TL~HE~GHa~H~ 424 (626) -+.++|++-.-||.. .+..++...-++||.. +...|.+.+||+.||+=+ T Consensus 187 YvaA~Pt~g~~AWA~~C~~~~dgRP~vGv~N~~P~~i~~~~~~vRvaaHEiAHALGF 243 (567) T PTZ00337 187 YANIFPTSGPTAWAVPCFLLDDGRPFAAAVNFDPRQIAVTNGYVRVAAHELGHALGF 243 (567) T ss_pred EEECCCCCCCCHHEEECCCCCCCCEEEEEEECCHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 984067788402201211067898258997118132234401578999999888636 No 40 >cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. Probab=88.04 E-value=0.45 Score=25.66 Aligned_cols=41 Identities=22% Similarity=0.182 Sum_probs=22.4 Q ss_pred CCCCCCCCCCCCEEECCCCC--CHHHHHHHHHHHHHHHHHHHHC Q ss_conf 35476667775101036533--2467999999845899999612 Q gi|254780915|r 387 AFAHGTIPSVHPYILLNYLG--KPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 387 A~~~~~~~~~~~~I~~N~~~--~~~dv~TL~HE~GHa~H~~ls~ 428 (626) ||-.+.+.......+.-..+ ...-..|++||+||.+ |+--+ T Consensus 106 A~~g~iC~~~~s~~i~~~~~~~~~~~a~~~AHElGH~l-G~~HD 148 (194) T cd04269 106 AYVGGMCSPKYSGGVVQDHSRNLLLFAVTMAHELGHNL-GMEHD 148 (194) T ss_pred ECCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHC-CCCCC T ss_conf 11577677887569998679870155799989988644-88668 No 41 >cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases. Probab=87.75 E-value=0.62 Score=24.63 Aligned_cols=14 Identities=43% Similarity=0.575 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 67999999845899 Q gi|254780915|r 409 QDVMTLAHELGHGI 422 (626) Q Consensus 409 ~dv~TL~HE~GHa~ 422 (626) .=+.|..||+||++ T Consensus 93 ~~~~v~~HEvGHtL 106 (165) T cd04268 93 RLRNTAEHELGHAL 106 (165) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999876350 No 42 >cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix. Probab=87.67 E-value=0.56 Score=24.94 Aligned_cols=41 Identities=24% Similarity=0.180 Sum_probs=22.3 Q ss_pred CCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 4354766677751010365332467999999845899999612 Q gi|254780915|r 386 GAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 386 GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~ 428 (626) =||..+.+....+--+ |-...+....|++||+||.+ ||--+ T Consensus 117 ~A~~g~~C~~~~s~~i-~~d~~~~~a~t~AHElGHnL-Gm~HD 157 (207) T cd04273 117 LAPVGGMCSPSRSCSI-NEDTGLSSAFTIAHELGHVL-GMPHD 157 (207) T ss_pred EECCCCCCCCCCCEEE-EECCCCCHHHHHHHHHHHHC-CCCCC T ss_conf 6156777889754789-93488208999999877642-98668 No 43 >cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. Probab=87.44 E-value=0.21 Score=27.99 Aligned_cols=43 Identities=19% Similarity=0.116 Sum_probs=25.2 Q ss_pred CCCCCCCCCCCCC-CEEECCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 7435476667775-1010365332467999999845899999612 Q gi|254780915|r 385 SGAFAHGTIPSVH-PYILLNYLGKPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 385 ~GA~~~~~~~~~~-~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~ 428 (626) |=||..+.+.... .-|..+...++..+.|++||+||.+ ||.-+ T Consensus 107 G~A~vg~iC~~~~s~~v~~d~~~~~~~a~t~AHElGH~l-Gm~HD 150 (192) T cd04267 107 GLAYVGSMCNPYSSVGVVEDTGFTLLTALTMAHELGHNL-GAEHD 150 (192) T ss_pred EEECCCCCCCCCCCEEEEEECCCCCEEEEEHHHHHHHHC-CCCCC T ss_conf 786464667899878999757987301212256666344-88678 No 44 >pfam09768 Peptidase_M76 Peptidase M76 family. This is a family of metalloproteases. Proteins in this family are also annotated as Ku70-binding proteins. Probab=85.66 E-value=1.7 Score=21.45 Aligned_cols=53 Identities=25% Similarity=0.266 Sum_probs=34.7 Q ss_pred CCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHH-HHHHHHHHHHHHHHHHC Q ss_conf 4850201037998743547666777510103653324679-99999845899999612 Q gi|254780915|r 372 HNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDV-MTLAHELGHGIHFVLSS 428 (626) Q Consensus 372 ~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv-~TL~HE~GHa~H~~ls~ 428 (626) ...|--.+-+++.+|||.. .....|-.|.-.+...+ .||.|||=||+-+.-.. T Consensus 35 ~~~i~c~~C~~~~~GGf~p----~~gIvlC~N~i~~~~~~e~~l~HELIHAyD~cR~k 88 (172) T pfam09768 35 DRNISCEVCDGSKSGGFDP----EKGILLCQNRIRDKAHLEDTLTHELIHAFDDLRAK 88 (172) T ss_pred CCCEEEEECCCCCCCCCCC----CCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 7887888897865556069----88978747868788999999999999999998740 No 45 >pfam01421 Reprolysin Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins, e.g. fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. Probab=85.62 E-value=0.31 Score=26.82 Aligned_cols=41 Identities=22% Similarity=0.256 Sum_probs=21.2 Q ss_pred CCCCCCCCCCCCEEECCCCC--CHHHHHHHHHHHHHHHHHHHHC Q ss_conf 35476667775101036533--2467999999845899999612 Q gi|254780915|r 387 AFAHGTIPSVHPYILLNYLG--KPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 387 A~~~~~~~~~~~~I~~N~~~--~~~dv~TL~HE~GHa~H~~ls~ 428 (626) ||..+.+.......+..+.+ ...-..|++||+||.+ ||--+ T Consensus 106 A~~ggiC~~~~s~~v~~~~~~~~~~~a~t~AHElGH~L-G~~HD 148 (198) T pfam01421 106 AYVGGMCSPKRSVGVVQDHSPIVLLVAVTMAHELGHNL-GMTHD 148 (198) T ss_pred ECCCCCCCCCCCEEEEEECCCCCCEEEEEHHHHHHHHC-CCCCC T ss_conf 21577678877679998368773111100123345115-77578 No 46 >COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones] Probab=85.55 E-value=1.5 Score=21.84 Aligned_cols=49 Identities=27% Similarity=0.303 Sum_probs=31.7 Q ss_pred CCHHCCCCCCCCCCCCCCCCCCCCEEECCC----CCCHHHH-HHHHHHHHHHHHH Q ss_conf 020103799874354766677751010365----3324679-9999984589999 Q gi|254780915|r 375 IDAPQYDGKGSGAFAHGTIPSVHPYILLNY----LGKPQDV-MTLAHELGHGIHF 424 (626) Q Consensus 375 iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~----~~~~~dv-~TL~HE~GHa~H~ 424 (626) ..++-.+.+.+.|||.+..+ ..+.|+..- ..+-+++ -.|+||+||..|. T Consensus 118 ~~v~i~~~~~pNAFa~g~~~-~~~~V~vt~gLl~~l~~dEl~aVlaHElgH~~~r 171 (302) T COG0501 118 PEVYILETPQPNAFALGGGP-KNGRVVVTTGLLDLLNDDELEAVLAHELGHIKNR 171 (302) T ss_pred CEEEEECCCCCCEEECCCCC-CCCEEEEEHHHHHCCCHHHHHHHHHHHHHHHHCC T ss_conf 65774069976722357998-8877986468862399999999999999999855 No 47 >PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional Probab=85.23 E-value=0.83 Score=23.73 Aligned_cols=50 Identities=16% Similarity=0.137 Sum_probs=30.2 Q ss_pred CCHHCCCCCCC-CCCCC---CCCCCCCCEEECCCCCC-----HH--HHHHHHHHHHHHHHH Q ss_conf 02010379987-43547---66677751010365332-----46--799999984589999 Q gi|254780915|r 375 IDAPQYDGKGS-GAFAH---GTIPSVHPYILLNYLGK-----PQ--DVMTLAHELGHGIHF 424 (626) Q Consensus 375 iD~~~r~gKr~-GA~~~---~~~~~~~~~I~~N~~~~-----~~--dv~TL~HE~GHa~H~ 424 (626) |-+-.+|...+ -||.. .+..++...=++||+.. ++ .|.|.+||++||+=+ T Consensus 210 LYvaa~Pt~~g~lAWA~~C~~~~dgRPvvGvvNi~p~~i~~~yd~~~vRv~aHEiAHALGF 270 (615) T PTZ00257 210 LYVASVPSEPGVLAWATTCQVFSDDHPAVGVMNIPAANIVSRYDQGTTRTVTHEVAHALGF 270 (615) T ss_pred EEEECCCCCCCCHHHHEECCCCCCCCCEEEEEEECHHHCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 9985157999731321011127789925899820856534545541478999999998757 No 48 >PRK02391 heat shock protein HtpX; Provisional Probab=84.07 E-value=2.2 Score=20.76 Aligned_cols=47 Identities=30% Similarity=0.437 Sum_probs=29.5 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 010379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) ++.-+...+.||.+|..+. +..|... +| +-+|+ -.++||+||--|+- T Consensus 97 vyI~~~~~pNAFAtG~~~~-~a~V~vT-~GLL~~L~~dEL~aVLAHEl~Hikn~D 149 (297) T PRK02391 97 VAVADTDVPNAFATGRSPK-NAVVCVT-TGLLRRLDPEELEAVLAHELSHVKNRD 149 (297) T ss_pred EEEECCCCCCEEEECCCCC-CEEEEEC-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 8995289976257458998-7699975-798863999999999999999997152 No 49 >cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=83.59 E-value=0.47 Score=25.49 Aligned_cols=19 Identities=32% Similarity=0.649 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 4679999998458999996 Q gi|254780915|r 408 PQDVMTLAHELGHGIHFVL 426 (626) Q Consensus 408 ~~dv~TL~HE~GHa~H~~l 426 (626) -....|..||||||+-... T Consensus 90 ~~~~~~~~he~~h~~g~~h 108 (198) T cd04327 90 PEFSRVVLHEFGHALGFIH 108 (198) T ss_pred CCCCEEHHHHHHHHHCCCC T ss_conf 3300100577676642600 No 50 >cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases). Probab=82.85 E-value=0.8 Score=23.84 Aligned_cols=16 Identities=31% Similarity=0.497 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 4679999998458999 Q gi|254780915|r 408 PQDVMTLAHELGHGIH 423 (626) Q Consensus 408 ~~dv~TL~HE~GHa~H 423 (626) .+=..|+.||+||||= T Consensus 105 ~~l~~va~HEIGHaLG 120 (157) T cd04278 105 TDLFSVAAHEIGHALG 120 (157) T ss_pred CCHHHHHHHHHHHHCC T ss_conf 0189999987565406 No 51 >cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain. Probab=82.80 E-value=0.44 Score=25.67 Aligned_cols=42 Identities=17% Similarity=0.183 Sum_probs=27.9 Q ss_pred HHHHHHHHHHHHHHHHHCCC-CCC---CCCCCHHHHHHHHHHHHHH Q ss_conf 99999984589999961256-877---6578005999886888999 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSSET-QGI---LTNNSSLTLAETASIFGET 452 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~~~-~~~---~~~~~~~~~aE~~S~~~E~ 452 (626) +.+.+||+|||+-...-+.. ..+ ..--+|..+||+.+.-.+. T Consensus 136 is~v~HElGHA~aA~~e~V~v~~~G~~~~~i~PgA~v~l~~~~L~~ 181 (277) T cd06162 136 ISGVVHEMGHGVAAVREQVRVNGFGIFFFIIYPGAYVDLFTDHLNL 181 (277) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEEEEEEEEECCCEEEECHHHHHC T ss_conf 9999999989999986287152224999998024047717999834 No 52 >cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=81.94 E-value=0.57 Score=24.92 Aligned_cols=15 Identities=33% Similarity=0.457 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 467999999845899 Q gi|254780915|r 408 PQDVMTLAHELGHGI 422 (626) Q Consensus 408 ~~dv~TL~HE~GHa~ 422 (626) .+=..+..||+||++ T Consensus 102 ~~l~~vA~HEiGHaL 116 (156) T cd04279 102 ENLQAIALHELGHAL 116 (156) T ss_pred HHHHHHHHHHHCCCC T ss_conf 209999999860101 No 53 >PRK04897 heat shock protein HtpX; Provisional Probab=80.96 E-value=1.2 Score=22.54 Aligned_cols=47 Identities=21% Similarity=0.374 Sum_probs=30.2 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 010379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) +|--+...+.||++|..+. +..|.+. +| +-+|+ -.++||+||--|+- T Consensus 100 vyi~~~~~pNAFatG~~~~-~~~V~vt-~GLL~~L~~~ELeaVlAHElgHikn~D 152 (298) T PRK04897 100 VFIIDDPSPNAFATGSDPK-NAAVAVT-TGLLEVMNREELEGVIGHEISHIRNYD 152 (298) T ss_pred EEEECCCCCCEEEEECCCC-CEEEEEC-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 7996289870699506888-8789961-798976899999999999999997577 No 54 >pfam01742 Peptidase_M27 Clostridial neurotoxin zinc protease. These toxins are zinc proteases that block neurotransmitter release by proteolytic cleavage of synaptic proteins such as synaptobrevins, syntaxin and SNAP-25. Probab=80.55 E-value=1.2 Score=22.65 Aligned_cols=22 Identities=27% Similarity=0.383 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHCC Q ss_conf 4679999998458999996125 Q gi|254780915|r 408 PQDVMTLAHELGHGIHFVLSSE 429 (626) Q Consensus 408 ~~dv~TL~HE~GHa~H~~ls~~ 429 (626) .|=+.+|.||+=|++|+++.-+ T Consensus 212 ~DPAl~LmheLIhsLh~LYGik 233 (408) T pfam01742 212 VDPALELMHELIHSLHGLYGIK 233 (408) T ss_pred CCHHHHHHHHHHHHHHHHCCCC T ss_conf 3889999999999988760766 No 55 >PRK03982 heat shock protein HtpX; Provisional Probab=79.75 E-value=2 Score=20.99 Aligned_cols=46 Identities=26% Similarity=0.335 Sum_probs=29.6 Q ss_pred HCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 10379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 378 PQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 378 ~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) +--+.-.+.||.++..+. +..|.+. +| +-||+ -.++||+||--|.- T Consensus 89 ~i~~~~~pNAFa~G~~~~-~~~V~vt-~GLL~~L~~dEl~aVlAHElgHi~n~D 140 (288) T PRK03982 89 AIVPTQTPNAFATGRDPK-HAVVAVT-EGILNLLNEDELEGVIAHELTHIKNRD 140 (288) T ss_pred EEEECCCCCEEEECCCCC-CEEEEEE-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 997089976687268999-8589964-899873899999999999999997233 No 56 >cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=79.17 E-value=0.66 Score=24.46 Aligned_cols=12 Identities=50% Similarity=0.794 Sum_probs=9.9 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999845899 Q gi|254780915|r 411 VMTLAHELGHGI 422 (626) Q Consensus 411 v~TL~HE~GHa~ 422 (626) ..+++||.||.+ T Consensus 117 r~~~aHEVGHtL 128 (197) T cd04276 117 RYLLAHEVGHTL 128 (197) T ss_pred HHHHHHHHHHHC T ss_conf 999998764031 No 57 >pfam00413 Peptidase_M10 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Probab=78.85 E-value=0.67 Score=24.42 Aligned_cols=16 Identities=31% Similarity=0.580 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 4679999998458999 Q gi|254780915|r 408 PQDVMTLAHELGHGIH 423 (626) Q Consensus 408 ~~dv~TL~HE~GHa~H 423 (626) .+=..|+.||+|||+= T Consensus 105 ~~l~~va~HEIGHaLG 120 (158) T pfam00413 105 TNLFLVAAHEIGHALG 120 (158) T ss_pred CHHHHHHHHHHCCCCC T ss_conf 1155665454020026 No 58 >COG2738 Predicted Zn-dependent protease [General function prediction only] Probab=78.75 E-value=4 Score=18.85 Aligned_cols=87 Identities=22% Similarity=0.268 Sum_probs=45.6 Q ss_pred HHHHHHH-HCCCCCHHCCCCCCCCCCCCCCCCCCCCEEE--CCCCC-CHHHHHHHHHHHHHHHHHHHHCCCCCCC-CCCC Q ss_conf 9999864-2485020103799874354766677751010--36533-2467999999845899999612568776-5780 Q gi|254780915|r 364 IIAEKFF-THNWIDAPQYDGKGSGAFAHGTIPSVHPYIL--LNYLG-KPQDVMTLAHELGHGIHFVLSSETQGIL-TNNS 438 (626) Q Consensus 364 ~~~~~~~-~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~--~N~~~-~~~dv~TL~HE~GHa~H~~ls~~~~~~~-~~~~ 438 (626) +.+++++ .++..|+...+.+ |--..-..|.+...-+ -||.| +....-.-+||-|||++.-- .+++. -.+. T Consensus 44 evAr~iLd~nGl~dV~Ve~v~--G~LTDHYDP~~kvvrLSe~~y~g~Sia~~aVAAHEVGHAiQd~~---~Y~~L~~R~~ 118 (226) T COG2738 44 EVARMILDENGLYDVPVEEVP--GTLTDHYDPRRKVVRLSEANYYGPSIAAIAVAAHEVGHAIQDQE---DYAFLVLRHA 118 (226) T ss_pred HHHHHHHHHCCCCCCEEEEEC--CCCCCCCCHHHHEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHC---CCHHHHHHHC T ss_conf 999999855587665046406--87656668045125645201478638889899987627776220---0579998640 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 05999886888999999 Q gi|254780915|r 439 SLTLAETASIFGETLTF 455 (626) Q Consensus 439 ~~~~aE~~S~~~E~~~~ 455 (626) -.+.+-+.|++.-.+++ T Consensus 119 lvPv~~~gSn~a~~l~i 135 (226) T COG2738 119 LVPVANFGSNLAPLLFI 135 (226) T ss_pred CCCEECCCCCHHHHHHH T ss_conf 24400013524699999 No 59 >TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than pfam05547. Probab=78.29 E-value=0.23 Score=27.67 Aligned_cols=14 Identities=50% Similarity=0.793 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 67999999845899 Q gi|254780915|r 409 QDVMTLAHELGHGI 422 (626) Q Consensus 409 ~dv~TL~HE~GHa~ 422 (626) ..+.|++|||||+| T Consensus 164 ~~igv~~HE~gH~l 177 (286) T TIGR03296 164 GGVGVIAHELGHDL 177 (286) T ss_pred CCEEEEEHHHHCCC T ss_conf 73556752221403 No 60 >KOG2661 consensus Probab=78.21 E-value=2.3 Score=20.57 Aligned_cols=60 Identities=18% Similarity=0.250 Sum_probs=32.8 Q ss_pred CCCCCCCCCCCCCCEEEC---CCCCCHHH-HHHHHHHHHHHHHHHHHCCCC-----CCCC-CCCHHHHHHHHH Q ss_conf 743547666777510103---65332467-999999845899999612568-----7765-780059998868 Q gi|254780915|r 385 SGAFAHGTIPSVHPYILL---NYLGKPQD-VMTLAHELGHGIHFVLSSETQ-----GILT-NNSSLTLAETAS 447 (626) Q Consensus 385 ~GA~~~~~~~~~~~~I~~---N~~~~~~d-v~TL~HE~GHa~H~~ls~~~~-----~~~~-~~~~~~~aE~~S 447 (626) +.|||- |++..+|+- |...+-|+ ...|.||+|||+-.--+-+-- ++.+ .-+.++.+-+++ T Consensus 249 pNAFvL---PgGKvfVFtgiLn~ck~ddglAtvLgHE~aHaVarH~AEki~k~~~~siLgLvlyt~~~a~~~n 318 (424) T KOG2661 249 PNAFVL---PGGKVFVFTGILNSCKDDDGLATVLGHEIAHAVARHAAEKIGKVHLLSILGLVLYTMIWAICPN 318 (424) T ss_pred CCEEEC---CCCEEEEEECHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 762662---6973999951101036838889999889999999988875316489999999999977401323 No 61 >cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha Probab=78.01 E-value=1.4 Score=22.22 Aligned_cols=18 Identities=33% Similarity=0.388 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHHC Q ss_conf 999999845899999612 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSS 428 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~ 428 (626) +..+.||+||++-...-. T Consensus 54 ~sVllHElgHal~Ar~~G 71 (227) T cd06164 54 ASVLLHELGHSLVARRYG 71 (227) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999999929 No 62 >LOAD_S2Pmetalloprt consensus Probab=77.72 E-value=1.4 Score=22.12 Aligned_cols=18 Identities=33% Similarity=0.418 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHHHHC Q ss_conf 999999845899999612 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSS 428 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~ 428 (626) +.-+.||+||++=...-. T Consensus 8 i~v~vHElgH~~~A~~~G 25 (148) T LOAD_S2Pmetall 8 VSVVVHELGHALVARRFG 25 (148) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999999969 No 63 >COG4227 Antirestriction protein [DNA replication, recombination, and repair] Probab=77.61 E-value=1.3 Score=22.29 Aligned_cols=85 Identities=22% Similarity=0.164 Sum_probs=41.4 Q ss_pred HHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHH--HHHCCCCCCCCCC-C Q ss_conf 999999864248502010379987435476667775101036533246799999984589999--9612568776578-0 Q gi|254780915|r 362 MSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHF--VLSSETQGILTNN-S 438 (626) Q Consensus 362 ~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~--~ls~~~~~~~~~~-~ 438 (626) -.+.++.++.+-=.++ |-|- .-||..+.....+-.-.--|....+-+.||+||+||.--+ -+.++.....+++ + T Consensus 158 ~ie~Aea~~~atgAvi--rhGg-drAfy~Pa~D~Iq~p~~~aF~da~~yyaTl~HElghwtgh~~rl~rdLs~~~~sr~y 234 (316) T COG4227 158 RIERAEAFFRATGAVI--RHGG-DRAFYSPASDSIQLPPFEAFRDAINYYATLLHELGHWTGHEARLDRDLSRAFGSREY 234 (316) T ss_pred HHHHHHHHHHHCCCEE--EECC-CCCCCCCCCCEECCCCHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCHHHH T ss_conf 5566999997046547--7657-532005766703069825523447699999887536557603442332243202578 Q ss_pred --HHHHHHHHHHH Q ss_conf --05999886888 Q gi|254780915|r 439 --SLTLAETASIF 449 (626) Q Consensus 439 --~~~~aE~~S~~ 449 (626) .--.+|+.|.| T Consensus 235 A~eel~aEi~a~~ 247 (316) T COG4227 235 AFEELVAEIGAAF 247 (316) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999975433 No 64 >PRK03001 heat shock protein HtpX; Provisional Probab=77.46 E-value=2 Score=21.04 Aligned_cols=46 Identities=28% Similarity=0.392 Sum_probs=30.0 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHH Q ss_conf 010379987435476667775101036533-----24679-9999984589999 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHF 424 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~ 424 (626) ++--+...+.||++|..+ .+..|.+. +| +-||+ -.++||+||--|+ T Consensus 87 v~i~~~~~pNAFAtG~~~-~~a~I~vT-~GLL~~L~~dELeaVlAHEl~Hi~n~ 138 (284) T PRK03001 87 VYLINEDQPNAFATGRNP-EHAAVAAT-TGILRVLSEREIRGVMAHELAHVKHR 138 (284) T ss_pred EEEEECCCCCEEEECCCC-CCEEEEEE-HHHHHHCCHHHHHHHHHHHHHHHHCC T ss_conf 899855886544653688-88389960-89997579999999999999999746 No 65 >pfam03272 Enhancin Viral enhancin protein. Probab=77.18 E-value=0.67 Score=24.38 Aligned_cols=16 Identities=13% Similarity=0.125 Sum_probs=8.0 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999986206999999 Q gi|254780915|r 470 ILLANKIEDMLNSIVR 485 (626) Q Consensus 470 ~ll~~~~~~~~~~~~~ 485 (626) .-|..++.....++-. T Consensus 664 ~~l~~rI~~~~~~L~~ 679 (775) T pfam03272 664 QRLLERIDNAAAWLDN 679 (775) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999870 No 66 >cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. Probab=77.11 E-value=1.5 Score=21.92 Aligned_cols=18 Identities=33% Similarity=0.421 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHHHHC Q ss_conf 999999845899999612 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSS 428 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~ 428 (626) +..+.||+||++=...-. T Consensus 39 ~sVl~HElgH~l~A~~~G 56 (208) T cd06161 39 LSVLLHELGHALVARRYG 56 (208) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999999949 No 67 >PRK05457 heat shock protein HtpX; Provisional Probab=77.00 E-value=2.6 Score=20.24 Aligned_cols=46 Identities=26% Similarity=0.374 Sum_probs=29.9 Q ss_pred HCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 10379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 378 PQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 378 ~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) +--+.-.+.||++|..+. +..|.+. +| +-||+ -.++||+||--|+- T Consensus 98 yi~~~~~pNAFAtG~~p~-~a~VaVT-~GLL~~L~~dELegVlAHElsHI~n~D 149 (289) T PRK05457 98 AIYHSPEINAFATGASKN-NSLVAVS-TGLLQNMSRDEVEAVLAHEISHIANGD 149 (289) T ss_pred EEECCCCCCEEECCCCCC-CEEEEEC-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 887489987014268988-8799985-799976799999999999999997311 No 68 >cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides. Probab=76.92 E-value=0.21 Score=28.00 Aligned_cols=13 Identities=38% Similarity=0.790 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHH Q ss_conf 7999999845899 Q gi|254780915|r 410 DVMTLAHELGHGI 422 (626) Q Consensus 410 dv~TL~HE~GHa~ 422 (626) ...|+.||+|||+ T Consensus 113 ~~~t~lHEIGHAL 125 (186) T cd04277 113 GYQTIIHEIGHAL 125 (186) T ss_pred CEEEEEEEECHHC T ss_conf 2478888810132 No 69 >PRK01265 heat shock protein HtpX; Provisional Probab=75.13 E-value=3.4 Score=19.38 Aligned_cols=47 Identities=26% Similarity=0.262 Sum_probs=29.8 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 010379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) ++--+.-.+.||.++-.+. +..|... +| +-+|+ -.++||+||--|.- T Consensus 102 vyIi~~~~PNAFAtG~~p~-~a~VaVT-~GLL~~L~~dELegVlAHEl~HI~nrD 154 (326) T PRK01265 102 VYIAEVPFPNAFAYSSPIA-GKRIAIT-LPLLKILNPDEIKAVIGHELGHLKHRD 154 (326) T ss_pred EEEECCCCCCEEECCCCCC-CEEEEEC-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 8884599977325468988-8789975-788864799999999999998984534 No 70 >PRK02870 heat shock protein HtpX; Provisional Probab=74.60 E-value=5.1 Score=18.09 Aligned_cols=47 Identities=26% Similarity=0.271 Sum_probs=30.2 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 010379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) +|--+...+.||.+|..+. +..|.+. +| +-+|+ -.++||+||--|+- T Consensus 136 vyii~~~~pNAFAtG~~~~-~a~VaVT-~GLL~~L~~dELeaVlAHEl~HI~n~D 188 (338) T PRK02870 136 VYIIDAPYMNAFASGYSEK-SAMVAVT-TGLLEKLDRDELQAVMAHELSHIRHGD 188 (338) T ss_pred EEEECCCCCCCCCCCCCCC-CEEEEEC-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 8872899977312589987-8799973-788863899999999999999997445 No 71 >cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved. Probab=74.34 E-value=0.54 Score=25.09 Aligned_cols=16 Identities=31% Similarity=0.239 Sum_probs=12.1 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 2467999999845899 Q gi|254780915|r 407 KPQDVMTLAHELGHGI 422 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~ 422 (626) +.+--.||.||+||-+ T Consensus 134 ~~~~g~TltHEvGH~l 149 (225) T cd04275 134 PYNLGDTATHEVGHWL 149 (225) T ss_pred CCCCCCEEEEECCHHC T ss_conf 7665431466622010 No 72 >PRK03072 heat shock protein HtpX; Provisional Probab=74.23 E-value=3 Score=19.72 Aligned_cols=47 Identities=23% Similarity=0.234 Sum_probs=30.4 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 010379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) ++--+.-.+.||++|..+ .+..|.+. +| +-+|+ -.|+||+||--|+- T Consensus 90 v~i~~~~~pNAFAtG~~~-~~a~V~vT-~GLL~~L~~dELeaVlAHEl~Hi~n~D 142 (289) T PRK03072 90 LYISPTMAPNAFATGRNP-RNAAVCCT-EGILQILNERELRGVLGHELSHVYNRD 142 (289) T ss_pred EEEECCCCCCEEEECCCC-CCEEEEEE-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 799549997558842798-88699964-799751899999999999999998358 No 73 >TIGR01241 FtsH_fam ATP-dependent metallopeptidase HflB; InterPro: IPR005936 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. FtsH is a membrane-anchored ATP-dependent protease that degrades misfolded or misassembled membrane proteins as well as a subset of cytoplasmic regulatory proteins. FtsH is a 647-residue protein of 70 kDa, with two putative transmembrane segments towards its N terminus which anchor the protein to the membrane, giving rise to a periplasmic domain of 70 residues and a cytoplasmic segment of 520 residues containing the ATPase and protease domains . ; GO: 0004222 metalloendopeptidase activity, 0030163 protein catabolic process, 0016020 membrane. Probab=73.81 E-value=1.6 Score=21.73 Aligned_cols=74 Identities=9% Similarity=0.084 Sum_probs=39.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 655477799899999999866---------66328999999986424850201037998743547666777510103653 Q gi|254780915|r 335 PGTSQDIIPFEVARDLVLQSY---------AKFSPQMSIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYL 405 (626) Q Consensus 335 ~~~~~~~~~~ee~~~~v~~~~---------~~~~~e~~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~ 405 (626) +-...|=|+-.+.-.+|.+|- ..--.++.+-.++.+. +. .+.+.++..=. T Consensus 257 ~Ar~TPGfSGADLaNl~NEAALlAAR~n~~~i~~~~~eeA~Drv~~-------------------G~--ekKsr~is~~e 315 (505) T TIGR01241 257 VARRTPGFSGADLANLLNEAALLAARKNKTEITMNDIEEAVDRVIA-------------------GL--EKKSRVISEKE 315 (505) T ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEECHHHHHHHHHHHHC-------------------CC--CCCCCCCCHHH T ss_conf 9701568767889999999999986179865628889878776522-------------------76--67885326777 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 3246799999984589999961256877 Q gi|254780915|r 406 GKPQDVMTLAHELGHGIHFVLSSETQGI 433 (626) Q Consensus 406 ~~~~dv~TL~HE~GHa~H~~ls~~~~~~ 433 (626) ==.|=-||-|||+=+++...+-|. T Consensus 316 ----K~~vAYHEaGHAl~G~~~~~~DpV 339 (505) T TIGR01241 316 ----KKIVAYHEAGHALVGLLLKDADPV 339 (505) T ss_pred ----CCEEEEECHHHHHHHHHCCCCCCC T ss_conf ----422201115789999735344752 No 74 >pfam01434 Peptidase_M41 Peptidase family M41. Probab=73.66 E-value=3.2 Score=19.59 Aligned_cols=22 Identities=23% Similarity=0.270 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9999998458999996125687 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSSETQG 432 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~~~~~ 432 (626) =.|-.||.|||+=+++.....+ T Consensus 9 ~~vA~HEaGHAlva~~l~~~~~ 30 (192) T pfam01434 9 RLVAYHEAGHALVGLLLPGADP 30 (192) T ss_pred HHHHHHHHHHHHHHHHCCCCCC T ss_conf 9999999999999998469998 No 75 >pfam12388 Peptidase_M57 Dual-action HEIGH metallo-peptidase. The catalytic triad for this family of proteases is HE-H-H, which in many members is in the sequence motif HEIGH. Probab=73.40 E-value=1.5 Score=21.82 Aligned_cols=50 Identities=18% Similarity=0.183 Sum_probs=23.5 Q ss_pred CCCHHCCCCCCCC--CCCCCCC-CCCCCE--EEC----CCCCCHHHHHHHHHHHHHHHHH Q ss_conf 5020103799874--3547666-777510--103----6533246799999984589999 Q gi|254780915|r 374 WIDAPQYDGKGSG--AFAHGTI-PSVHPY--ILL----NYLGKPQDVMTLAHELGHGIHF 424 (626) Q Consensus 374 ~iD~~~r~gKr~G--A~~~~~~-~~~~~~--I~~----N~~~~~~dv~TL~HE~GHa~H~ 424 (626) -|+.+....+.++ +-..+|+ ....|+ |.+ |+.-... -.++.||+||++=. T Consensus 89 di~v~~~~~~~~~g~ggsAGFP~s~G~P~~~V~i~~~~~~~~~~~-~~vi~HEiGHciGf 147 (211) T pfam12388 89 DMVVYDNSVNNPSGSGGSAGFPDSNGDPAKFVQIYDLENGSTNVN-EHVITHEIGHSIGF 147 (211) T ss_pred CEEEEECCCCCCCCCEEECCCCCCCCCCCCEEEEECCCCCCCCHH-HHHHHHHHHCEECC T ss_conf 679995455678885104128877899886699823578882033-45452320222222 No 76 >KOG3658 consensus Probab=73.01 E-value=1.2 Score=22.66 Aligned_cols=59 Identities=19% Similarity=0.283 Sum_probs=29.6 Q ss_pred HHHHHHHHHCCCCCHHCCCCCCCCCCCCCCCC---CCC------CEEECCCCC---CHHHHHHHHHHHHHHHH Q ss_conf 99999864248502010379987435476667---775------101036533---24679999998458999 Q gi|254780915|r 363 SIIAEKFFTHNWIDAPQYDGKGSGAFAHGTIP---SVH------PYILLNYLG---KPQDVMTLAHELGHGIH 423 (626) Q Consensus 363 ~~~~~~~~~~~~iD~~~r~gKr~GA~~~~~~~---~~~------~~I~~N~~~---~~~dv~TL~HE~GHa~H 423 (626) .++-...++.-|+-. |+.|-. ||-|.--.. +.. ..-+.||.. +--.-.||+||+||.+- T Consensus 335 rdFe~GtLGLAwVgs-p~~~sa-GGIC~k~~~y~~G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNfG 405 (764) T KOG3658 335 RDFEGGTLGLAWVGS-PRSNSA-GGICEKAYTYNEGKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNFG 405 (764) T ss_pred CCCCCCEEEEEEECC-CCCCCC-CCCCCCCEECCCCEEEEEECCEEEEEECCCCCCCCHHHEEEHHHHCCCCC T ss_conf 033563245677526-766777-75566532226760578513355421058857852322330223203458 No 77 >pfam02163 Peptidase_M50 Peptidase family M50. Probab=71.84 E-value=2.4 Score=20.41 Aligned_cols=18 Identities=28% Similarity=0.412 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHHC Q ss_conf 999999845899999612 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSS 428 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~ 428 (626) +.-..||+||++=..... T Consensus 8 i~i~~HE~gH~~~Ar~~G 25 (205) T pfam02163 8 ISVVVHELGHALVARRFG 25 (205) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999999929 No 78 >cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA. Probab=71.46 E-value=2.6 Score=20.16 Aligned_cols=19 Identities=32% Similarity=0.473 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHHHHC Q ss_conf 7999999845899999612 Q gi|254780915|r 410 DVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 410 dv~TL~HE~GHa~H~~ls~ 428 (626) .+..+.||+||.+-.-... T Consensus 9 ~vlV~vHElGH~~~Ar~~G 27 (182) T cd06163 9 GILIFVHELGHFLVAKLFG 27 (182) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 9999999899899999949 No 79 >cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin Probab=69.62 E-value=2.9 Score=19.85 Aligned_cols=19 Identities=26% Similarity=0.316 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 9999998458999996125 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSSE 429 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~~ 429 (626) +.-++||+||++=...-.. T Consensus 9 i~i~iHE~gH~~~A~~~G~ 27 (180) T cd05709 9 ISVTVHELGHALVARRLGV 27 (180) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999999699 No 80 >KOG3714 consensus Probab=69.61 E-value=2 Score=21.01 Aligned_cols=20 Identities=30% Similarity=0.316 Sum_probs=15.0 Q ss_pred CHHHHHHHHHHHHHHHHHHH Q ss_conf 24679999998458999996 Q gi|254780915|r 407 KPQDVMTLAHELGHGIHFVL 426 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~H~~l 426 (626) .=....|..||+|||+=.+- T Consensus 156 ~C~~~G~i~HEl~HaLGf~H 175 (411) T KOG3714 156 GCDRFGTIVHELMHALGFWH 175 (411) T ss_pred CCCCCCEEHHHHHHHHHHHC T ss_conf 74767610566776641310 No 81 >PRK10733 hflB ATP-dependent metalloprotease; Reviewed Probab=69.47 E-value=4.4 Score=18.55 Aligned_cols=25 Identities=36% Similarity=0.324 Sum_probs=18.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 6799999984589999961256877 Q gi|254780915|r 409 QDVMTLAHELGHGIHFVLSSETQGI 433 (626) Q Consensus 409 ~dv~TL~HE~GHa~H~~ls~~~~~~ 433 (626) .--.|-.||.|||+-+++.....|. T Consensus 407 ek~~vAyHEAGHAlva~~l~~~dpv 431 (644) T PRK10733 407 QKESTAYHEAGHAIIGRLVPEHDPV 431 (644) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999999999899999976898872 No 82 >pfam04228 Zn_peptidase Putative neutral zinc metallopeptidase. Members of this family have a predicted zinc binding motif characteristic of neutral zinc metallopeptidases (Prosite:PDOC00129). Probab=68.95 E-value=6.8 Score=17.22 Aligned_cols=84 Identities=14% Similarity=0.104 Sum_probs=41.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH---HHCCCCCHHCCCCCCCCCCC----C--CCCCCCCCEEECCCC--------- Q ss_conf 89999999986666328999999986---42485020103799874354----7--666777510103653--------- Q gi|254780915|r 344 FEVARDLVLQSYAKFSPQMSIIAEKF---FTHNWIDAPQYDGKGSGAFA----H--GTIPSVHPYILLNYL--------- 405 (626) Q Consensus 344 ~ee~~~~v~~~~~~~~~e~~~~~~~~---~~~~~iD~~~r~gKr~GA~~----~--~~~~~~~~~I~~N~~--------- 405 (626) -++..+.+...+........+++... +..--+=+|......+.+.. - .|+.....||=..|- T Consensus 85 ~de~~~Fvs~VLa~TEdvW~~~F~~~g~~Y~~P~lvlf~~~~~s~CG~a~sa~GPFYCP~D~~vYiD~~Ff~~l~~~~ga 164 (292) T pfam04228 85 EDEMSSFVSVVLASTEDTWTQIFADQGRTYQPPTLVLYSGVTRSACGTASSATGPFYCPADQKVYLDTSFYNELSQKLGA 164 (292) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHCCC T ss_conf 77766799999714988999999974998899858984188667766776777896689875288610089999997289 Q ss_pred -CCHHHHHHHHHHHHHHHHHHHH Q ss_conf -3246799999984589999961 Q gi|254780915|r 406 -GKPQDVMTLAHELGHGIHFVLS 427 (626) Q Consensus 406 -~~~~dv~TL~HE~GHa~H~~ls 427 (626) +++...-.++||.||-++.++. T Consensus 165 ~GdfA~aYViAHEvGHHVQnllG 187 (292) T pfam04228 165 SGDFAQAYVIAHEVGHHVQNLLG 187 (292) T ss_pred CCCHHHHHHHHHHHHHHHHHHHH T ss_conf 96068899988755057787751 No 83 >pfam05572 Peptidase_M43 Pregnancy-associated plasma protein-A. Pregnancy-associated plasma protein A (PAPP-A) is a metallo-protease belonging to Merops family M43. It cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the human ovary and the cardiovascular system. Probab=68.60 E-value=0.88 Score=23.55 Aligned_cols=15 Identities=40% Similarity=0.507 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 467999999845899 Q gi|254780915|r 408 PQDVMTLAHELGHGI 422 (626) Q Consensus 408 ~~dv~TL~HE~GHa~ 422 (626) ...-.||.||+||-+ T Consensus 67 ~~~g~TltHE~GH~l 81 (152) T pfam05572 67 TNFSSTLTHEFGHFL 81 (152) T ss_pred CCCCCCEEEECHHHH T ss_conf 666650245312241 No 84 >pfam01457 Peptidase_M8 Leishmanolysin. Probab=68.49 E-value=2.6 Score=20.15 Aligned_cols=50 Identities=14% Similarity=0.140 Sum_probs=30.5 Q ss_pred CCHHCCCCCCC-CCCCCCC---CCCCCCEEECCCCCC-----HH--HHHHHHHHHHHHHHH Q ss_conf 02010379987-4354766---677751010365332-----46--799999984589999 Q gi|254780915|r 375 IDAPQYDGKGS-GAFAHGT---IPSVHPYILLNYLGK-----PQ--DVMTLAHELGHGIHF 424 (626) Q Consensus 375 iD~~~r~gKr~-GA~~~~~---~~~~~~~I~~N~~~~-----~~--dv~TL~HE~GHa~H~ 424 (626) |-+-++|...+ =||...+ ..++...=.+||... .+ .|.|.+||+.||+=+ T Consensus 194 LYvaa~Ps~~~~~AWA~~C~~~~dgRP~vG~iN~~P~~i~~~~~~~~vrv~aHEiaHALGF 254 (569) T pfam01457 194 LYVASVPSEPGVLAWATTCQVFADGRPAVGVINIPAANITSRYDQLVTRVVTHEIAHALGF 254 (569) T ss_pred EEEEECCCCCCCEEEEEECCCCCCCCEEEEEEEECHHHCCCCCCCEEEEHHHHHHHHHHCC T ss_conf 9996057999832577551106789805899841877764135541423898888888635 No 85 >pfam02031 Peptidase_M7 Streptomyces extracellular neutral proteinase (M7) family. Probab=67.18 E-value=2.3 Score=20.53 Aligned_cols=48 Identities=29% Similarity=0.356 Sum_probs=29.6 Q ss_pred CCHHCCCCCCCCCCCCCCCCCCCCEEECCCCC--CHHHHHHHHHHHHHHH Q ss_conf 02010379987435476667775101036533--2467999999845899 Q gi|254780915|r 375 IDAPQYDGKGSGAFAHGTIPSVHPYILLNYLG--KPQDVMTLAHELGHGI 422 (626) Q Consensus 375 iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~--~~~dv~TL~HE~GHa~ 422 (626) .|+--+++--++++-.........+|++-+.. .++.+.--+||+||.+ T Consensus 40 a~~~~~~~~~~~gs~a~t~g~G~g~i~~~~~~~qgy~~~RI~aHE~GH~L 89 (132) T pfam02031 40 ADFTYYEGNDSRGSYASTDGHGRGYIFLDYRQNQQYDSTRVTAHETGHVL 89 (132) T ss_pred CCEEEEECCCCCCCCCCCCCCCCEEEEEEHHHHCCCCCHHHHHHHHCCCC T ss_conf 77799957888876564379984789724388658872311001322224 No 86 >KOG2921 consensus Probab=66.42 E-value=3.3 Score=19.46 Aligned_cols=37 Identities=16% Similarity=0.296 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCC-C---CCCCCCHHHHHHHHH Q ss_conf 999999845899999612568-7---765780059998868 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSSETQ-G---ILTNNSSLTLAETAS 447 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~~~~-~---~~~~~~~~~~aE~~S 447 (626) +...+||+|||+-...-+..- . +.-.-+|..|+++-- T Consensus 132 i~~vvHElGHalAA~segV~vngfgIfi~aiyPgafvdl~~ 172 (484) T KOG2921 132 ITVVVHELGHALAAASEGVQVNGFGIFIAAIYPGAFVDLDN 172 (484) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHH T ss_conf 77878876579998754861300589998873751210016 No 87 >CHL00176 ftsH cell division protein; Validated Probab=65.47 E-value=5.8 Score=17.73 Aligned_cols=22 Identities=32% Similarity=0.427 Sum_probs=16.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999984589999961256877 Q gi|254780915|r 412 MTLAHELGHGIHFVLSSETQGI 433 (626) Q Consensus 412 ~TL~HE~GHa~H~~ls~~~~~~ 433 (626) .|-.||.|||+=+++.....|. T Consensus 434 ~vAyHEAGHAlVa~ll~~~dpV 455 (631) T CHL00176 434 LIAYHEVGHAIVGTLLPEHDPV 455 (631) T ss_pred HHHHHHHHHHHHHHHCCCCCCE T ss_conf 9999999999999977898760 No 88 >TIGR02151 IPP_isom_2 isopentenyl-diphosphate delta-isomerase, type 2; InterPro: IPR011179 This group represents the bacterial and archaeal isopentenyl-diphosphate delta-isomerase (IPP isomerase). IPP isomerase catalyses the interconversion of isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), and is a key enzyme in the biosynthesis of isoprenoids via the mevalonate pathway. The bacterial and archaeal IPP isomerase (type 2 enzyme) differs from that found in eukaryotes (type 1 enzyme), and requires NADPH, magnesium, and FMN for activity , .; GO: 0004452 isopentenyl-diphosphate delta-isomerase activity, 0010181 FMN binding, 0008299 isoprenoid biosynthetic process, 0005737 cytoplasm. Probab=65.29 E-value=3.3 Score=19.48 Aligned_cols=92 Identities=16% Similarity=0.256 Sum_probs=51.0 Q ss_pred CCCCCCCCCCCCEEECCCCCCC-CCC----C----HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 7634588788850015254106-675----2----589999999999999999956733289999999832699997999 Q gi|254780915|r 525 PAFDLSDLEYGSFWMMVPHFIE-SSF----Y----VYAYAFGNCLVNSLYDIYKSNTVDCFKEKYLNILRAGNSKHYSEL 595 (626) Q Consensus 525 ~~~~~~~~~~~~~w~~~~H~y~-~~f----Y----~y~Y~~~~~~a~~l~~~~~~~~~~~~~~~y~~~L~~Ggs~~~~el 595 (626) +++|+- ...+..|+++=- |. ..= | .+-.-||.--+..|.+.-....++ ..++.+||=++|+|+ T Consensus 214 ~aiDv~-G~GGTswa~vE~-~Rr~~~~~~~~~r~a~~f~~WGipT~~sL~~~~~~~~~~------~~~iASGG~r~GlD~ 285 (349) T TIGR02151 214 KAIDVS-GAGGTSWAAVEN-YRRAKESNQKYERLASAFSDWGIPTAASLLEVRSIDAPD------APLIASGGLRTGLDV 285 (349) T ss_pred EEEEEC-CCCCCCHHHHHH-HHHHCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC------CCEEEECCCCCHHHH T ss_conf 088707-876755999998-875157523578887777414886689999986421247------736884677788899 Q ss_pred HHH--CCCCCC--CHHHHH--------HHHHHHHHHHHHHH Q ss_conf 997--088989--988999--------99999999999986 Q gi|254780915|r 596 LLP--LNINLS--DPNFWE--------RGLQTVEKMIDDVE 624 (626) Q Consensus 596 ~k~--~G~d~~--~~~~~~--------~~~~~~~~~i~~~e 624 (626) .|. +|.++. ..-|++ ..+..++.+++||. T Consensus 286 AKAlALGA~~~G~A~~~L~~~~~~g~e~~~~~~~~~~~eLk 326 (349) T TIGR02151 286 AKALALGADAVGMARPLLKAALDEGEEAVIEEIELIIEELK 326 (349) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 99999621188888999998852698899999999999999 No 89 >PRK10779 zinc metallopeptidase; Provisional Probab=64.89 E-value=4.1 Score=18.76 Aligned_cols=10 Identities=20% Similarity=0.036 Sum_probs=3.7 Q ss_pred CCHHHHHHHH Q ss_conf 9989999999 Q gi|254780915|r 342 IPFEVARDLV 351 (626) Q Consensus 342 ~~~ee~~~~v 351 (626) .++++....+ T Consensus 254 ~s~~~l~~~i 263 (449) T PRK10779 254 TQWVTFVMLV 263 (449) T ss_pred CCHHHHHHHH T ss_conf 6599999999 No 90 >TIGR00054 TIGR00054 membrane-associated zinc metalloprotease, putative; InterPro: IPR004387 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family contains putative zinc metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). The N-terminal region of contains a perfectly conserved motif HEXGH, where the Glu is the active site and the His residues coordinate the metal cation. The family of bacterial and plant proteins also includes a region that hits the PDZ domain (IPR001478 from INTERPRO), found in a number of proteins targeted to the membrane by binding to a peptide ligand . The family includes EcfE, which is a homolog of human site-2 protease (S2P), a membrane-bound zinc metalloprotease involved in regulated intramembrane proteolysis. In Escherichia coli EcfE activates the sigma(E) pathway of stress response through a site-2 cleavage of anti-sigma(E), RseA.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016021 integral to membrane. Probab=64.37 E-value=3.6 Score=19.16 Aligned_cols=46 Identities=9% Similarity=0.011 Sum_probs=18.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHC-CC-C-CCCCCCCHHHHHHHHHHHHHH Q ss_conf 2467999999845899999612-56-8-776578005999886888999 Q gi|254780915|r 407 KPQDVMTLAHELGHGIHFVLSS-ET-Q-GILTNNSSLTLAETASIFGET 452 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~H~~ls~-~~-~-~~~~~~~~~~~aE~~S~~~E~ 452 (626) .+..+.+=+.+.=-+.-.+|++ -. + ..-.-+-|.-.|+.+++..+. T Consensus 335 a~~~~~~~~~~~~~li~~~l~~Li~g~~~l~~lSGPVgIv~~~~~~A~~ 383 (463) T TIGR00054 335 ALAKAAEATKDIVKLILKLLGKLITGSLKLKNLSGPVGIVKGAGSSANL 383 (463) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCHHHHHHHHHHHH T ss_conf 9999999998999999998887540002220056742154551246676 No 91 >pfam07607 DUF1570 Protein of unknown function (DUF1570). A family of hypothetical proteins in Rhodopirellula baltica. Probab=63.76 E-value=7.3 Score=16.97 Aligned_cols=41 Identities=20% Similarity=0.269 Sum_probs=28.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 79999998458999996125687765780059998868889999 Q gi|254780915|r 410 DVMTLAHELGHGIHFVLSSETQGILTNNSSLTLAETASIFGETL 453 (626) Q Consensus 410 dv~TL~HE~GHa~H~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~ 453 (626) +|.|++||.-|-+.+-.. .++. ...+|+.++|=-.+++|.. T Consensus 1 ~iaT~~he~~hq~~~n~G-~~~~--~~~~p~W~~EGla~~fe~~ 41 (128) T pfam07607 1 NIATVAHEATHQLAFNCG-LQSR--LADNPMWVSEGLATYFEPP 41 (128) T ss_pred CCHHHHHHHHHHHHHHHH-HHHH--CCCCCHHHHHHHHHHHCCC T ss_conf 902688999999999730-6653--2899708987589982587 No 92 >pfam05548 Peptidase_M11 Gametolysin peptidase M11. In the unicellular biflagellated alga, Chlamydomonas reinhardtii, gametolysin, a zinc-containing metallo-protease, is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox. Probab=62.50 E-value=5.7 Score=17.78 Aligned_cols=47 Identities=17% Similarity=0.045 Sum_probs=27.6 Q ss_pred CCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 5020103799874354766677751010365332467999999845899 Q gi|254780915|r 374 WIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGI 422 (626) Q Consensus 374 ~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~ 422 (626) .+-+.|..-+=|.+.+..+ ++....+-++-.+ ..+..|++||+||-+ T Consensus 102 ~~~VLP~~~~CgWaGlA~v-~G~~~w~~~~~~~-~~~~~t~~hE~~HN~ 148 (303) T pfam05548 102 YIYVLPPGVRCPWAGLALV-PGRQTWLQTSGYG-VQRWATIMQEAIHNY 148 (303) T ss_pred EEEECCCCCCCCCCCCCCC-CCCCEEEEECCCC-CCCCHHHHHHHHHHH T ss_conf 6996378888776543325-8860799965655-566105666453332 No 93 >PRK01345 heat shock protein HtpX; Provisional Probab=61.76 E-value=7 Score=17.12 Aligned_cols=47 Identities=30% Similarity=0.372 Sum_probs=30.8 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCEEECCCCC-----CHHHH-HHHHHHHHHHHHHH Q ss_conf 010379987435476667775101036533-----24679-99999845899999 Q gi|254780915|r 377 APQYDGKGSGAFAHGTIPSVHPYILLNYLG-----KPQDV-MTLAHELGHGIHFV 425 (626) Q Consensus 377 ~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~-----~~~dv-~TL~HE~GHa~H~~ 425 (626) ++--+...+.||.++..+ .++.|.+. +| +-||+ -.++||+||--|.- T Consensus 87 v~Ii~~~~pNAFAtG~~~-~~a~VaVT-~GLL~~L~~dELeaVlAHElsHI~n~D 139 (314) T PRK01345 87 VYIIDNPQPNAFATGRNP-ENAAVAAT-TGLLQRLSPEEVAGVMAHELAHVKNRD 139 (314) T ss_pred EEEECCCCCCEEEECCCC-CCEEEEEC-HHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 999658986636743798-87699970-899975699999999999999996554 No 94 >cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains. Probab=61.64 E-value=5.1 Score=18.09 Aligned_cols=16 Identities=25% Similarity=0.264 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999998458999996 Q gi|254780915|r 411 VMTLAHELGHGIHFVL 426 (626) Q Consensus 411 v~TL~HE~GHa~H~~l 426 (626) ..-++||+||.+-.-. T Consensus 42 ~il~~HElGH~l~a~~ 57 (183) T cd06160 42 AILGIHEMGHYLAARR 57 (183) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999999 No 95 >KOG1565 consensus Probab=61.27 E-value=2.4 Score=20.42 Aligned_cols=21 Identities=29% Similarity=0.382 Sum_probs=14.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHH Q ss_conf 246799999984589999961 Q gi|254780915|r 407 KPQDVMTLAHELGHGIHFVLS 427 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~H~~ls 427 (626) -.+=..|.+||+||+|=-.-| T Consensus 208 g~~l~~Va~HEiGH~LGL~HS 228 (469) T KOG1565 208 GVDLFLVAAHEIGHALGLGHS 228 (469) T ss_pred CCCHHHHHHHHCCCCCCCCCC T ss_conf 500677754301100266777 No 96 >cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined. Probab=60.95 E-value=1.4 Score=22.06 Aligned_cols=16 Identities=25% Similarity=0.218 Sum_probs=10.7 Q ss_pred HCCCCCCCHHHHHHCC Q ss_conf 7088753123444025 Q gi|254780915|r 319 WLKLDTMYFWDRLAPL 334 (626) Q Consensus 319 ~lg~~~l~~wD~~~~~ 334 (626) -+++..|..||...|. T Consensus 45 sl~l~~L~I~~~~~p~ 60 (228) T cd04271 45 SLGLRNLTISDASCPS 60 (228) T ss_pred EEEEEEEEECCCCCCC T ss_conf 7999899973787888 No 97 >pfam10263 SprT-like SprT-like family. This family represents a domain found in eukaryotes and prokaryotes. The domain contains a characteristic motif of the zinc metallopeptidases. This family includes the bacterial SprT protein. Probab=60.70 E-value=8.2 Score=16.64 Aligned_cols=50 Identities=26% Similarity=0.161 Sum_probs=27.9 Q ss_pred CCHHCCCCCCCCCCCCCCCCCCCCEEECCC-----CCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 020103799874354766677751010365-----33246799999984589999961 Q gi|254780915|r 375 IDAPQYDGKGSGAFAHGTIPSVHPYILLNY-----LGKPQDVMTLAHELGHGIHFVLS 427 (626) Q Consensus 375 iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~-----~~~~~dv~TL~HE~GHa~H~~ls 427 (626) |.+-.|.++..|-+... .. ...|.+|- ...-+=+.||.||+=|++-.++. T Consensus 21 v~~~~r~~~taG~~~~~--~~-~~~I~ls~~ll~~~~~~~l~~tl~HEm~H~~~~~~~ 75 (153) T pfam10263 21 VVWSKRLGTTAGRARLK--RN-GAEIRLNPKLLAENPEEFLIETLLHEMCHAALFVLF 75 (153) T ss_pred EEEEHHCCCCCEEEEEC--CC-CCEEEECHHHHHHCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 89731048864378976--88-718998889985583999998889999999999761 No 98 >cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present. Probab=59.05 E-value=6 Score=17.62 Aligned_cols=35 Identities=37% Similarity=0.425 Sum_probs=18.2 Q ss_pred CCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 66777510103653324679999998458999996 Q gi|254780915|r 392 TIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVL 426 (626) Q Consensus 392 ~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~l 426 (626) ..|+..++|-.-+----=-|.-..||+|||+-.-. T Consensus 100 ~iPGv~~~ip~~~~~~al~v~~vvHE~~Hgi~ar~ 134 (263) T cd06159 100 VIPGVNIFIPLPYGIIALVVGVVVHELSHGILARV 134 (263) T ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 24798755337899999999999998888999998 No 99 >KOG3931 consensus Probab=59.02 E-value=6.5 Score=17.38 Aligned_cols=20 Identities=40% Similarity=0.444 Sum_probs=9.2 Q ss_pred CHHHHHHHHHHHHHHHHHHH Q ss_conf 24679999998458999996 Q gi|254780915|r 407 KPQDVMTLAHELGHGIHFVL 426 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~H~~l 426 (626) .-|=|+||.|||=||+-+.. T Consensus 102 RkDLVETLLHEMIHAYlFV~ 121 (484) T KOG3931 102 RKDLVETLLHEMIHAYLFVT 121 (484) T ss_pred HHHHHHHHHHHHHHHHEEEE T ss_conf 57799999999988760575 No 100 >cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains. Probab=57.25 E-value=6.8 Score=17.22 Aligned_cols=19 Identities=16% Similarity=0.165 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 9999998458999996125 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSSE 429 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~~ 429 (626) +.-..||+|||+=...-.+ T Consensus 10 ~si~~HE~aHa~~A~~~GD 28 (181) T cd06158 10 LAITLHEFAHAYVAYRLGD 28 (181) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999998499 No 101 >pfam06167 MtfA Phosphoenolpyruvate:glucose-phosphotransferase regulator. MtfA (earlier known as YeeI) is a transcription factor A that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (ptsG) system, the major glucose transporter in E.coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in Escherichia coli. Probab=57.10 E-value=11 Score=15.75 Aligned_cols=62 Identities=13% Similarity=0.111 Sum_probs=39.2 Q ss_pred EECCCCC-------CHHHHHHHHHHHHHHHHHHHHC-CCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 1036533-------2467999999845899999612-568776-578005999886888999999998413 Q gi|254780915|r 400 ILLNYLG-------KPQDVMTLAHELGHGIHFVLSS-ETQGIL-TNNSSLTLAETASIFGETLTFDSLLQA 461 (626) Q Consensus 400 I~~N~~~-------~~~dv~TL~HE~GHa~H~~ls~-~~~~~~-~~~~~~~~aE~~S~~~E~~~~~~~~~~ 461 (626) |++.|.. .-|...+.+|||-|.+...-+. +-.|.. ........+++-+.-++.++.+...++ T Consensus 132 vvLSW~dv~~g~~~~~dg~NVviHEFAH~LD~~dG~adG~P~L~~~~~~~~W~~~~~~~~~~l~~~~~~~~ 202 (248) T pfam06167 132 VILSWQDVLAGGANPGDGHNVVIHEFAHKLDMLNGAADGAPPLHRGMDYAAWAKVFSAAYDRLQEQVDAGE 202 (248) T ss_pred EEEEHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 79779999744427889860478499878642158778999899942289999999999999999998079 No 102 >cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha. Probab=56.78 E-value=1.9 Score=21.19 Aligned_cols=13 Identities=46% Similarity=0.746 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHHH Q ss_conf 7999999845899 Q gi|254780915|r 410 DVMTLAHELGHGI 422 (626) Q Consensus 410 dv~TL~HE~GHa~ 422 (626) ...|++||+||.+ T Consensus 167 ~~~~~~he~gh~~ 179 (244) T cd04270 167 SDLVTAHELGHNF 179 (244) T ss_pred EEEEEEHHHHCCC T ss_conf 0255211122026 No 103 >PHA02456 zinc metallopeptidase motif-containing protein Probab=56.17 E-value=6.6 Score=17.29 Aligned_cols=26 Identities=38% Similarity=0.303 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 46799999984589999961256877 Q gi|254780915|r 408 PQDVMTLAHELGHGIHFVLSSETQGI 433 (626) Q Consensus 408 ~~dv~TL~HE~GHa~H~~ls~~~~~~ 433 (626) -..-.||.||+.|+.+.---.-.+|+ T Consensus 77 KGC~~TL~HEL~H~WQ~RsYG~i~PI 102 (141) T PHA02456 77 KGCRDTLAHELNHAWQFRTYGLVQPI 102 (141) T ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 22077899999878865124445222 No 104 >KOG3624 consensus Probab=55.92 E-value=11 Score=15.62 Aligned_cols=83 Identities=13% Similarity=0.009 Sum_probs=38.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999998458999996125687765780059998868889999-999984138998999999999862069999999999 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSSETQGILTNNSSLTLAETASIFGETL-TFDSLLQAASSKEERKILLANKIEDMLNSIVRQISF 489 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~-~~~~~~~~~~~~~~~~~ll~~~~~~~~~~~~~q~~~ 489 (626) -.++.||+||++-.-- ...-...+... |.-.+.-+.|-+.- |+.....++.++ ....-.....++++...--++++ T Consensus 519 G~vigHEl~H~FD~~G-~~~D~~Gn~~~-ww~~~s~~~f~~~~qC~~~qy~~y~~~-~~~ng~~t~~EnIAD~~Gl~~A~ 595 (687) T KOG3624 519 GFVIGHELTHGFDDQG-RQYDESGNLRD-WWDTESESEFQERAQCLVKQYSNYLDP-RRLNGSKTLGENIADNGGLKLAY 595 (687) T ss_pred HHHHHHHHHHHCCCCC-CCCCCCCCCCC-CCCHHHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCHHHHHHHHHHHHHHHH T ss_conf 9999999853200025-43477776155-667577998887754589885123665-45675100878866377899999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780915|r 490 YDFELKL 496 (626) Q Consensus 490 ~~fe~~~ 496 (626) ..|.... T Consensus 596 ~AY~~~~ 602 (687) T KOG3624 596 RAYKKWK 602 (687) T ss_pred HHHHHHH T ss_conf 9999976 No 105 >pfam08325 WLM WLM domain. This is a predicted metallopeptidase domain called WLM (Wss1p-like metalloproteases). These are linked to the Ub-system by virtue of fusions with the UB-binding PUG (PUB), Ub-like, and Little Finger domains. More specifically, genetic evidence implicates the WLM family in de-SUMOylation. Probab=55.38 E-value=5.7 Score=17.75 Aligned_cols=18 Identities=39% Similarity=0.475 Sum_probs=12.9 Q ss_pred CHHH-HHHHHHHHHHHHHH Q ss_conf 2467-99999984589999 Q gi|254780915|r 407 KPQD-VMTLAHELGHGIHF 424 (626) Q Consensus 407 ~~~d-v~TL~HE~GHa~H~ 424 (626) ++.+ +.||.||+-|..|+ T Consensus 79 p~~~i~~tmlHELaH~~~g 97 (183) T pfam08325 79 PFESIMGTLLHELAHNVHG 97 (183) T ss_pred CHHHHHHHHHHHHHHHHCC T ss_conf 6999999999998876227 No 106 >COG2316 Predicted hydrolase (HD superfamily) [General function prediction only] Probab=54.30 E-value=12 Score=15.44 Aligned_cols=67 Identities=16% Similarity=0.202 Sum_probs=48.9 Q ss_pred HHHHHHHHHHHHHHHHC---CCHHHHHHHHH-HHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999956---73328999999-98326999979999970889899889999999999999998 Q gi|254780915|r 555 AFGNCLVNSLYDIYKSN---TVDCFKEKYLN-ILRAGNSKHYSELLLPLNINLSDPNFWERGLQTVEKMIDDV 623 (626) Q Consensus 555 ~~~~~~a~~l~~~~~~~---~~~~~~~~y~~-~L~~Ggs~~~~el~k~~G~d~~~~~~~~~~~~~~~~~i~~~ 623 (626) .+|.+.|+++-..-+.- +.+....|+++ -.++|-+++-+-....+|+++ ..+++-++..+...-+|| T Consensus 140 ~tGli~AaALirp~K~~~vv~~~~lkkrFKeK~FAaG~~Rd~vR~~E~lG~~L--ek~~e~~l~am~~~kdEL 210 (212) T COG2316 140 LTGLIQAAALIRPDKDVRVVELSSLKKRFKEKGFAAGVNRDEVRQAEELGVDL--EKHFENVLTAMQAGKDEL 210 (212) T ss_pred CHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCH--HHHHHHHHHHHHHHHHHH T ss_conf 01589999871787743100479999998765554158899999999919989--999999999999778751 No 107 >pfam05547 Peptidase_M6 Immune inhibitor A peptidase M6. The insect pathogenic Gram-positive Bacillus thuringiensis secretes immune inhibitor A, a metallopeptidase, which specifically cleaves host antibacterial proteins. A homologue of immune inhibitor A, PrtV, has been identified in the Gram-negative human pathogen Vibrio cholerae. Probab=51.76 E-value=0.84 Score=23.70 Aligned_cols=13 Identities=38% Similarity=0.662 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHH Q ss_conf 7999999845899 Q gi|254780915|r 410 DVMTLAHELGHGI 422 (626) Q Consensus 410 dv~TL~HE~GHa~ 422 (626) .|.+++|||||.+ T Consensus 222 ~~GVfaHEfGHdL 234 (646) T pfam05547 222 AAGVFAHEYGHDL 234 (646) T ss_pred CEEEEEEECCCCC T ss_conf 3478984134446 No 108 >COG3824 Predicted Zn-dependent protease [General function prediction only] Probab=51.46 E-value=3.7 Score=19.12 Aligned_cols=51 Identities=24% Similarity=0.284 Sum_probs=24.2 Q ss_pred CHHCCCCCCCCCCCCCCCCCCCCEEE--CCC-CCCHHH--HHHHHHHHHHHHHHHHHCC Q ss_conf 20103799874354766677751010--365-332467--9999998458999996125 Q gi|254780915|r 376 DAPQYDGKGSGAFAHGTIPSVHPYIL--LNY-LGKPQD--VMTLAHELGHGIHFVLSSE 429 (626) Q Consensus 376 D~~~r~gKr~GA~~~~~~~~~~~~I~--~N~-~~~~~d--v~TL~HE~GHa~H~~ls~~ 429 (626) ++-.|-.-..|+.-+.+.-++.| |+ |+- ..+..| ..||.||.|| |+=+|+. T Consensus 71 ~l~eR~~~~~G~~P~rItlYRra-ilDywae~eetlgd~vthvliHEIgH--hFGLsDd 126 (136) T COG3824 71 ALTERGSLYTGELPDRITLYRRA-LLDYWAENEETLGDQVTHVLIHEIGH--HFGLSDD 126 (136) T ss_pred CHHHCCCCCCCCCCCEEEEEHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCHH T ss_conf 30220354457898645444798-99988600366765744302353221--2387776 No 109 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=50.91 E-value=9.7 Score=16.10 Aligned_cols=21 Identities=14% Similarity=0.469 Sum_probs=10.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 991899999999999999999 Q gi|254780915|r 116 LSSPTIRKFYTDINAKLADFE 136 (626) Q Consensus 116 t~d~~~~~~~~~~~~~~s~~~ 136 (626) .+|+++++....+...+.+.+ T Consensus 67 i~D~el~~yv~~~g~rL~~~a 87 (484) T COG4783 67 IRDPELEEYVNSLGQRLAAAA 87 (484) T ss_pred CCCHHHHHHHHHHHHHHHHHC T ss_conf 688889999999879999844 No 110 >pfam12315 DUF3633 Protein of unknown function (DUF3633). This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with pfam00412. Probab=50.05 E-value=8.3 Score=16.60 Aligned_cols=19 Identities=26% Similarity=0.081 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 4679999998458999996 Q gi|254780915|r 408 PQDVMTLAHELGHGIHFVL 426 (626) Q Consensus 408 ~~dv~TL~HE~GHa~H~~l 426 (626) .=--.-|+||+.||+--+- T Consensus 91 LLTGsILAHE~mHawlRL~ 109 (212) T pfam12315 91 LLTGSILAHEMMHAWLRLN 109 (212) T ss_pred HHHHHHHHHHHHHHHHHHC T ss_conf 8886499999999999864 No 111 >smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function. Probab=49.63 E-value=13 Score=15.23 Aligned_cols=45 Identities=24% Similarity=0.197 Sum_probs=24.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCC-----CCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 03799874354766677751010365-----33246799999984589999961 Q gi|254780915|r 379 QYDGKGSGAFAHGTIPSVHPYILLNY-----LGKPQDVMTLAHELGHGIHFVLS 427 (626) Q Consensus 379 ~r~gKr~GA~~~~~~~~~~~~I~~N~-----~~~~~dv~TL~HE~GHa~H~~ls 427 (626) ...++..||.|.. ..-.|.+|- .+.-+=..|+.||+=|+.=++.. T Consensus 27 n~Rl~ttaG~~~~----~~~~I~ln~~ll~~~~~~~l~~tl~HElcH~~~~~~~ 76 (146) T smart00731 27 NKRLRKTGGRCLL----KSAEIRLNPKLLTENGRDRLRETLLHELCHAALYLFG 76 (146) T ss_pred CCHHHHHHHHEEC----CCCEEEECHHHHHHCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 7046306781005----6988997899997674889998789999999999977 No 112 >pfam11176 DUF2962 Protein of unknown function (DUF2962). This eukaryotic family of proteins has no known function. Probab=49.38 E-value=9.3 Score=16.25 Aligned_cols=76 Identities=12% Similarity=0.131 Sum_probs=42.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-----------CCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCC Q ss_conf 477799899999999866663289999999864248-----------502010379987435476667775101036533 Q gi|254780915|r 338 SQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTHN-----------WIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLG 406 (626) Q Consensus 338 ~~~~~~~ee~~~~v~~~~~~~~~e~~~~~~~~~~~~-----------~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~ 406 (626) ....++.+++.++|..-+..++.|+.++-..-..++ -++....+=+ +|=||.++......-.+-+|+| T Consensus 62 ~~~~~s~~e~~~lIe~yl~R~deELeqik~~rR~gRp~s~Re~~L~~~~e~E~~ey~-~G~~iPDLtd~~nv~~Lr~WnG 140 (155) T pfam11176 62 EKEPFTLEEIKDLIERYLHRFDEELEQLKSERRPGRPPSSREDVLEQTIEKERKEFE-TGFEVPDLTDADNVKALREWNG 140 (155) T ss_pred CCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHH-CCCCCCCCCCHHHHHHHHHCCC T ss_conf 788999999999999999458799999998416899983299999999999999983-2944787799899999987099 Q ss_pred CHHHHHHH Q ss_conf 24679999 Q gi|254780915|r 407 KPQDVMTL 414 (626) Q Consensus 407 ~~~dv~TL 414 (626) ++..+.+| T Consensus 141 ~~~~l~~l 148 (155) T pfam11176 141 DWGGLTNL 148 (155) T ss_pred CHHHCCCE T ss_conf 82318610 No 113 >pfam09471 Peptidase_M64 IgA Peptidase M64. This is a family of highly selective metallo-endopeptidases. The primary structure of the Clostridium ramosum IgA proteinase shows no significant overall similarity to any other known metallo-endopeptidase. Probab=49.15 E-value=1.7 Score=21.57 Aligned_cols=49 Identities=20% Similarity=0.170 Sum_probs=26.0 Q ss_pred CCCHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 502010379987435476667775101036533246799999984589999961256 Q gi|254780915|r 374 WIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSSET 430 (626) Q Consensus 374 ~iD~~~r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~~~ 430 (626) ++=+-.+..|+||+-..+.... .. ...-....++.|||||.+ +-|++++ T Consensus 180 ~v~vLvNt~~YGG~G~~~~~~~-----~~--~~n~~~~~v~vHE~GHsf-~gLaDEY 228 (257) T pfam09471 180 YVLVLVNTAKYGGGGYFNLYGT-----VS--ADNSQASYIAVHELGHSF-AGLADEY 228 (257) T ss_pred EEEEEECCCCCCCCCCCCCEEE-----EE--CCCCCCCCEEEEECCCCC-CCCCHHH T ss_conf 5999965776466520271688-----63--277566633488435323-4423011 No 114 >COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones] Probab=47.60 E-value=8.1 Score=16.67 Aligned_cols=15 Identities=40% Similarity=0.503 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999845899999 Q gi|254780915|r 411 VMTLAHELGHGIHFV 425 (626) Q Consensus 411 v~TL~HE~GHa~H~~ 425 (626) +-|.-||+|||+--+ T Consensus 188 ~~tarhElGhaLgi~ 202 (236) T COG5549 188 NPTARHELGHALGIW 202 (236) T ss_pred HHHHHHHHCCHHEEC T ss_conf 478887632021111 No 115 >pfam12044 Metallopep Putative peptidase family. This family of proteins is functionally uncharacterized. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. Probab=46.20 E-value=11 Score=15.60 Aligned_cols=33 Identities=24% Similarity=0.278 Sum_probs=19.5 Q ss_pred CCCCCCHHH-----HHHHHHHHHHHHHHHHHCCCCC--CCCCCC Q ss_conf 365332467-----9999998458999996125687--765780 Q gi|254780915|r 402 LNYLGKPQD-----VMTLAHELGHGIHFVLSSETQG--ILTNNS 438 (626) Q Consensus 402 ~N~~~~~~d-----v~TL~HE~GHa~H~~ls~~~~~--~~~~~~ 438 (626) ||..+|+-+ .--..||+||++ .-.+++ ++..++ T Consensus 300 ~~e~gT~We~~~i~lGA~lHEiGH~f----g~pH~~~GIM~Rgy 339 (419) T pfam12044 300 SNECGTYWECANITIGAFLHEIGHLF----GLPHQEDGIMLRGY 339 (419) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHC----CCCCCCCCEEECCC T ss_conf 67798789986631208877546424----89999883471584 No 116 >pfam06262 DUF1025 Domain of unknown function (DUF1025). Family of bacterial protein with undetermined function. Probab=44.32 E-value=11 Score=15.64 Aligned_cols=10 Identities=40% Similarity=0.757 Sum_probs=8.5 Q ss_pred HHHHHHHHHH Q ss_conf 9999998458 Q gi|254780915|r 411 VMTLAHELGH 420 (626) Q Consensus 411 v~TL~HE~GH 420 (626) -.|+.||.|| T Consensus 73 ~~tviHEiaH 82 (96) T pfam06262 73 RHVVIHEIGH 82 (96) T ss_pred HHHHHHHHHH T ss_conf 9999999887 No 117 >pfam10462 Peptidase_M66 Peptidase M66. This family of metallopeptidases contains StcE, a virulence factor found in Shiga toxigenic Escherichia coli organisms. StcE peptidase cleaves C1 esterase inhibitor. Probab=42.04 E-value=4.3 Score=18.64 Aligned_cols=44 Identities=18% Similarity=0.274 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHH---HHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999845899---999612568776578005999886888999999998413899 Q gi|254780915|r 411 VMTLAHELGHGI---HFVLSSETQGILTNNSSLTLAETASIFGETLTFDSLLQAASS 464 (626) Q Consensus 411 v~TL~HE~GHa~---H~~ls~~~~~~~~~~~~~~~aE~~S~~~E~~~~~~~~~~~~~ 464 (626) --.+.||+||++ |+--+....|+. -+|+.+=.|=||...+++++ T Consensus 193 gnefsHE~GHa~GL~Hy~~g~~~~py~----------~~~~~gS~WG~D~~~n~Fi~ 239 (304) T pfam10462 193 GNEFSHEQGHNYGLGHYPAGEAGSPYR----------SGSLLGSEWGWDANHNRFIG 239 (304) T ss_pred CCEEEECCHHHCCCCCCCCCCCCCCCC----------CCCCCCCCCCCCCCCCEECC T ss_conf 875472021014678676787888466----------65557885363222387526 No 118 >KOG3854 consensus Probab=41.25 E-value=19 Score=14.06 Aligned_cols=46 Identities=22% Similarity=0.114 Sum_probs=24.7 Q ss_pred CCEEECCCC--CCHHHH-HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH Q ss_conf 510103653--324679-99999845899999612568776578005999 Q gi|254780915|r 397 HPYILLNYL--GKPQDV-MTLAHELGHGIHFVLSSETQGILTNNSSLTLA 443 (626) Q Consensus 397 ~~~I~~N~~--~~~~dv-~TL~HE~GHa~H~~ls~~~~~~~~~~~~~~~a 443 (626) -+.|.+-.. -|.+=+ -||+||+-||--.++-+. +--+|..+-++.. T Consensus 335 ~A~IeLs~kV~tTAERir~TLiHEmCHaAaWv~dr~-e~gHGd~wKrWa~ 383 (505) T KOG3854 335 YAKIELSDKVCTTAERIRDTLIHEMCHAAAWVFDRE-ELGHGDNWKRWAY 383 (505) T ss_pred EEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCCCCHHHHHHHH T ss_conf 038983134656899999999999987888420354-2576438999999 No 119 >TIGR01108 oadA oxaloacetate decarboxylase alpha subunit; InterPro: IPR005776 This family describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea . The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.; GO: 0008948 oxaloacetate decarboxylase activity, 0006814 sodium ion transport. Probab=39.11 E-value=8.1 Score=16.66 Aligned_cols=24 Identities=8% Similarity=-0.099 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 999998666788654278999889 Q gi|254780915|r 258 FITNTLAKDEEIQDRWRKYEKIAD 281 (626) Q Consensus 258 ~il~~iv~~r~e~A~llGy~s~~~ 281 (626) .-|..++.+...+|..=+.++... T Consensus 205 tGmA~~AllkA~EAG~d~iDTAis 228 (616) T TIGR01108 205 TGMAEMALLKAIEAGADMIDTAIS 228 (616) T ss_pred HHHHHHHHHHHHHCCCCCCHHCCC T ss_conf 337999999888707880020055 No 120 >pfam07998 Peptidase_M54 Peptidase family M54. This is a family of metallopeptidases. Two human proteins have been reported to degrade synthetic substrates and peptides. Probab=38.51 E-value=19 Score=14.01 Aligned_cols=14 Identities=43% Similarity=0.650 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 67999999845899 Q gi|254780915|r 409 QDVMTLAHELGHGI 422 (626) Q Consensus 409 ~dv~TL~HE~GHa~ 422 (626) +-+-+..||+||.+ T Consensus 125 R~~Ke~~HElGH~f 138 (176) T pfam07998 125 RVVKEVTHELGHTY 138 (176) T ss_pred HHHHHHHHHHHHHC T ss_conf 99999999988862 No 121 >pfam01447 Peptidase_M4 Thermolysin metallopeptidase, catalytic domain. Probab=37.78 E-value=7.2 Score=17.06 Aligned_cols=14 Identities=36% Similarity=0.686 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 67999999845899 Q gi|254780915|r 409 QDVMTLAHELGHGI 422 (626) Q Consensus 409 ~dv~TL~HE~GHa~ 422 (626) ..+-..+|||+|++ T Consensus 134 ~~lDVv~HE~tHgv 147 (150) T pfam01447 134 GSLDVVGHELTHGV 147 (150) T ss_pred CCCCCHHHHHCCCC T ss_conf 24340000002011 No 122 >PHA00363 major capsid protein Probab=37.16 E-value=22 Score=13.63 Aligned_cols=18 Identities=22% Similarity=0.163 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHCCCCCH Q ss_conf 899999998642485020 Q gi|254780915|r 360 PQMSIIAEKFFTHNWIDA 377 (626) Q Consensus 360 ~e~~~~~~~~~~~~~iD~ 377 (626) ..|.++++..|+..--|. T Consensus 325 trY~e~i~aHfgV~~~Da 342 (557) T PHA00363 325 TRYVEIIRSHFNVQSPDA 342 (557) T ss_pred CCHHHHHHHHCCCCCCCH T ss_conf 756999997638115731 No 123 >PRK13267 archaemetzincin-like protein; Reviewed Probab=36.75 E-value=18 Score=14.21 Aligned_cols=14 Identities=43% Similarity=0.733 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 67999999845899 Q gi|254780915|r 409 QDVMTLAHELGHGI 422 (626) Q Consensus 409 ~dv~TL~HE~GHa~ 422 (626) +-+....||+||.+ T Consensus 122 R~~Ke~~HE~GH~~ 135 (177) T PRK13267 122 RVVKEVTHELGHTL 135 (177) T ss_pred HHHHHHHHHHHHHC T ss_conf 99988743557663 No 124 >pfam08112 ATP-synt_E_2 ATP synthase epsilon subunit. This family consists of epsilon subunits of the ATP synthase. The ATP synthase complex is composed of an oligomeric transmembrane sector (CF0), and a catalytic core (CF1). CF1 is composed of 5 subunits, of which the epsilon subunit functions as a potent inhibitor of ATPase activity in both soluble and bound CF1. Only when the epsilon inhibition is disabled is high ATPase activity detected in ATPase. Probab=36.62 E-value=19 Score=14.08 Aligned_cols=24 Identities=29% Similarity=0.472 Sum_probs=9.3 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHC Q ss_conf 899999998326999979999970 Q gi|254780915|r 576 FKEKYLNILRAGNSKHYSELLLPL 599 (626) Q Consensus 576 ~~~~y~~~L~~Ggs~~~~el~k~~ 599 (626) ..|+|++||++.=-..-.|+++.. T Consensus 8 tidkyinilk~kldekkneil~ki 31 (56) T pfam08112 8 TIDKYINILKNKLDEKKNEILKKI 31 (56) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 899999999998878889999997 No 125 >KOG4525 consensus Probab=36.23 E-value=20 Score=13.92 Aligned_cols=21 Identities=29% Similarity=0.306 Sum_probs=15.0 Q ss_pred CCCCCCHHHH-----HHHHHHHHHHH Q ss_conf 3653324679-----99999845899 Q gi|254780915|r 402 LNYLGKPQDV-----MTLAHELGHGI 422 (626) Q Consensus 402 ~N~~~~~~dv-----~TL~HE~GHa~ 422 (626) ||..+|+-++ ..+.||+||.+ T Consensus 288 ~Ne~~Tywec~~itlGA~~HElGH~l 313 (614) T KOG4525 288 SNERRTYWECYAITLGAVCHELGHCL 313 (614) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 33342256555657889998755304 No 126 >pfam10463 Peptidase_U49 Peptidase U49. This family contains Lit peptidase from Escherichia coli. Lit protease functions in bacterial cell death in response to infection by bacteriophage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. Probab=35.44 E-value=18 Score=14.27 Aligned_cols=15 Identities=33% Similarity=0.273 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999845899999 Q gi|254780915|r 411 VMTLAHELGHGIHFV 425 (626) Q Consensus 411 v~TL~HE~GHa~H~~ 425 (626) ..-++||++|..++- T Consensus 99 a~I~~HE~~Hi~l~h 113 (203) T pfam10463 99 AWILLHEVAHIILQH 113 (203) T ss_pred HHHHHHHHHHHHHCC T ss_conf 999999988887246 No 127 >TIGR02158 PA_CoA_Oxy3 phenylacetate-CoA oxygenase, PaaI subunit; InterPro: IPR011882 Phenylacetate-CoA oxygenase is comprised of a five gene complex responsible for the hydroxylation of phenylacetate-CoA (PA-CoA) as the second catabolic step in phenylacetic acid (PA) degradation , . Although the exact function of this enzyme has not been determined, it has been shown to be required for phenylacetic acid degradation and has been proposed to function in a multicomponent oxygenase acting on phenylacetate-CoA . . Probab=33.65 E-value=24 Score=13.25 Aligned_cols=49 Identities=22% Similarity=0.266 Sum_probs=31.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9999862069999999999999999972120168999899999999999986 Q gi|254780915|r 471 LLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKES 522 (626) Q Consensus 471 ll~~~~~~~~~~~~~q~~~~~fe~~~h~~~~~~~~~t~~~l~~~~~~~~~~~ 522 (626) .|.+-..+++.|++||..++.|+..+++.-.+... ..|.++-.++++|. T Consensus 83 YLDePhYDfa~T~~Rqfl~~~~~~~l~e~L~~S~d---~~LA~iA~KalkE~ 131 (251) T TIGR02158 83 YLDEPHYDFALTIVRQFLYDAYKVLLLEALTESSD---APLAAIAAKALKEA 131 (251) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHCH T ss_conf 46777544667999999999999999998852898---77999999986043 No 128 >pfam09197 Rap1-DNA-bind Rap1, DNA-binding. Members of this family, which are predominantly found in the yeast protein rap1, assume a secondary structure consisting of a three-helix bundle and an N-terminal arm. They contain an Arg-Asp-Arg-Lys sequence that interacts with an ACACC region in the 3' region of the DNA-binding site. Probab=33.55 E-value=25 Score=13.24 Aligned_cols=33 Identities=12% Similarity=0.229 Sum_probs=24.2 Q ss_pred HHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 999995673328999999983269999799999 Q gi|254780915|r 565 YDIYKSNTVDCFKEKYLNILRAGNSKHYSELLL 597 (626) Q Consensus 565 ~~~~~~~~~~~~~~~y~~~L~~Ggs~~~~el~k 597 (626) +.....+-..++.|||++|++.+|-....+-++ T Consensus 52 a~~~p~HT~~sWRDRfrKfv~~~g~~~Yi~yYe 84 (101) T pfam09197 52 ARKNPRHTENSWRDRFRKFVREYGIDEYIEYYE 84 (101) T ss_pred HHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 985875325558999999998628999999999 No 129 >COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism] Probab=33.24 E-value=19 Score=14.03 Aligned_cols=29 Identities=24% Similarity=0.394 Sum_probs=20.0 Q ss_pred CCEEECCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 51010365332467999999845899999612 Q gi|254780915|r 397 HPYILLNYLGKPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 397 ~~~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~ 428 (626) .-..+-+++++.| ..+||++|++-..-++ T Consensus 327 DG~~f~~~S~sLD---VvAHElTHGvtq~tA~ 355 (507) T COG3227 327 DGSFFTPFSGSLD---VVAHELTHGVTQQTAG 355 (507) T ss_pred CCCEECCCCCCCC---EEHHHHCCHHHHHCCC T ss_conf 8625045566521---0000002014443267 No 130 >COG4784 Putative Zn-dependent protease [General function prediction only] Probab=33.06 E-value=17 Score=14.31 Aligned_cols=24 Identities=13% Similarity=0.295 Sum_probs=16.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 799189999999999999999998 Q gi|254780915|r 115 NLSSPTIRKFYTDINAKLADFEKV 138 (626) Q Consensus 115 dt~d~~~~~~~~~~~~~~s~~~~~ 138 (626) .-+|++.+.+.+++-..++..+.. T Consensus 60 eY~D~Kler~Vari~g~lt~~S~~ 83 (479) T COG4784 60 EYRDPKLERMVARIVGALTAVSEN 83 (479) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 558788999999997676320258 No 131 >KOG1885 consensus Probab=32.36 E-value=20 Score=13.85 Aligned_cols=25 Identities=12% Similarity=0.014 Sum_probs=10.7 Q ss_pred HHHHCCCCCHHCCCCCCCCCCCCCC Q ss_conf 8642485020103799874354766 Q gi|254780915|r 368 KFFTHNWIDAPQYDGKGSGAFAHGT 392 (626) Q Consensus 368 ~~~~~~~iD~~~r~gKr~GA~~~~~ 392 (626) +.+...|.+-|--.|+..+.+--.+ T Consensus 344 ~~i~G~~~i~y~p~~~~~~~~eldf 368 (560) T KOG1885 344 KNITGSYKITYHPNGPEEPELELDF 368 (560) T ss_pred HHHCCCEEEEECCCCCCCCCEEEEC T ss_conf 9603743675268898887436623 No 132 >TIGR01428 HAD_type_II haloacid dehalogenase, type II; InterPro: IPR006328 These proteins catalyze the hydrolytic dehalogenation of small L-2-haloalkanoic acids to yield the corresponding D-2-hydroxyalkanoic acids . They belong to the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases, class (subfamily) I. Note that the Type I HAD enzymes have not yet been fully characterised, but clearly utilise a substantially different catalytic mechanism and are thus unlikely to be related.; GO: 0019120 hydrolase activity acting on acid halide bonds in C-halide compounds, 0008152 metabolic process. Probab=30.52 E-value=27 Score=12.90 Aligned_cols=115 Identities=17% Similarity=0.272 Sum_probs=61.6 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCHHH Q ss_conf 620699999999-99999999972120168999-8999999999999860776345887888500152541066752589 Q gi|254780915|r 476 IEDMLNSIVRQI-SFYDFELKLHTERRSTGDIP-THRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYA 553 (626) Q Consensus 476 ~~~~~~~~~~q~-~~~~fe~~~h~~~~~~~~~t-~~~l~~~~~~~~~~~~g~~~~~~~~~~~~~w~~~~H~y~~~fY~y~ 553 (626) .|+..+|++.-. ....++... .-. .+.+.++|++-+.+|.- ... -++-.|...-||-.---=.+. T Consensus 5 vFD~yGTL~D~~s~~~~~~~~~--------~~~G~~~~~~lWR~~~l~y~~-~~~----~mg~~w~~y~df~~~~~~al~ 71 (207) T TIGR01428 5 VFDVYGTLVDVHSVVERFAELF--------GGRGLEALSQLWRQKQLEYSW-LRT----LMGQNWTPYKDFWDLTAEALR 71 (207) T ss_pred EECCCCHHHHHHHHHHHHHHCC--------CCHHHHHHHHHHHHHHHHHHH-HHH----HCCCCCCCCCCHHHHHHHHHH T ss_conf 7416435765899999988515--------853489999999998888988-876----258888888888999999999 Q ss_pred HHHH---------HHHHHHHHHHHHH---CC-----CHHHHHH-HH-HHHHCCCCCCHHHHHHHCCCCC Q ss_conf 9999---------9999999999995---67-----3328999-99-9983269999799999708898 Q gi|254780915|r 554 YAFG---------NCLVNSLYDIYKS---NT-----VDCFKEK-YL-NILRAGNSKHYSELLLPLNINL 603 (626) Q Consensus 554 Y~~~---------~~~a~~l~~~~~~---~~-----~~~~~~~-y~-~~L~~Ggs~~~~el~k~~G~d~ 603 (626) |+.+ +-.+..|...++. .+ -+....+ |+ -|||+|.+.....++|++|||| T Consensus 72 ~~~~~~~~~~~~~~~~~~~L~~~~~~L~p~pD~~~gL~~L~~~G~~l~iLSNg~~~~~~~~~~~aGvdL 140 (207) T TIGR01428 72 YLLGRLGLASTLDEAAADRLAEAYLRLPPHPDVPAGLRALKERGLRLAILSNGSPAMLKSLVKHAGVDL 140 (207) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCC T ss_conf 999755877678988899998776358998661889999976432365214898678999999628875 No 133 >TIGR02269 TIGR02269 Myxococcus xanthus paralogous lipoprotein family TIGR02269; InterPro: IPR011755 This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown.. Probab=30.24 E-value=28 Score=12.87 Aligned_cols=51 Identities=6% Similarity=0.055 Sum_probs=40.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCCC--HHHHHHHHHHHHCCC--CCCHHHHHHHCCC Q ss_conf 5258999999999999999995673--328999999983269--9997999997088 Q gi|254780915|r 549 FYVYAYAFGNCLVNSLYDIYKSNTV--DCFKEKYLNILRAGN--SKHYSELLLPLNI 601 (626) Q Consensus 549 fY~y~Y~~~~~~a~~l~~~~~~~~~--~~~~~~y~~~L~~Gg--s~~~~el~k~~G~ 601 (626) --.=.|+ .+++-++++++..++. -.+.|-+++|.++-. ++.+.+++|++|- T Consensus 159 v~iHdFT--l~Ip~~~Hr~iHsG~~rGG~WN~aW~~F~~~~~G~g~~~~~~~~haG~ 213 (228) T TIGR02269 159 VKIHDFT--LVIPRHVHRRIHSGAGRGGAWNQAWRDFARRQPGRGASPEEIYRHAGE 213 (228) T ss_pred CCCCCCC--CCCHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 4302334--327277534076866888752689999998458878778899998510 No 134 >KOG2670 consensus Probab=30.03 E-value=28 Score=12.84 Aligned_cols=115 Identities=17% Similarity=0.128 Sum_probs=53.4 Q ss_pred HHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 22258989999999987630628999999999987088753123444025655477799899999999866663289999 Q gi|254780915|r 285 LSNNVEPYVIEALMQSVKNYYPKTSHRYYELKKKWLKLDTMYFWDRLAPLPGTSQDIIPFEVARDLVLQSYAKFSPQMSI 364 (626) Q Consensus 285 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~lg~~~l~~wD~~~~~~~~~~~~~~~ee~~~~v~~~~~~~~~e~~~ 364 (626) +.-|+-+.-... +....+-...+.|.+....++..|.+.--.-|- -+-..+..+-+|+.+++.++...-|.. ++ T Consensus 167 QEfMIlP~ga~s-f~eamr~GsevYh~LK~vik~kyG~~atnVGDE----GGfAPnI~~~~E~L~Li~~Ai~kagyt-~k 240 (433) T KOG2670 167 QEFMILPVGADS-FAEAMRMGSEVYHHLKSVIKEKYGADATNVGDE----GGFAPNIQTNEEALDLIKEAINKAGYT-GK 240 (433) T ss_pred HHHEECCCCCHH-HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC----CCCCCCCCCHHHHHHHHHHHHHHCCCC-CC T ss_conf 311012457156-999998628999999999998707554565666----676998544488999999999965988-85 Q ss_pred HHHHHHHCCCCCHHC----CCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHH Q ss_conf 999864248502010----379987435476667775101036533246799999984 Q gi|254780915|r 365 IAEKFFTHNWIDAPQ----YDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHEL 418 (626) Q Consensus 365 ~~~~~~~~~~iD~~~----r~gKr~GA~~~~~~~~~~~~I~~N~~~~~~dv~TL~HE~ 418 (626) +. -=+|+.+ |+||+-=-|-..- + .| .+| -+.++..-|.||| T Consensus 241 ik------IgmDvAaseF~~dgkYDLdfk~~~-~--d~---s~~-~s~~~L~dlY~~~ 285 (433) T KOG2670 241 VK------IGMDVAASEFYKDGKYDLDFKSPN-S--DP---SRW-LSGDQLADLYKSF 285 (433) T ss_pred EE------EEEEECHHHHHCCCCCCCCCCCCC-C--CC---CCC-CCHHHHHHHHHHH T ss_conf 48------987512355412775445576899-9--71---232-4889999999999 No 135 >KOG0734 consensus Probab=30.01 E-value=6.6 Score=17.31 Aligned_cols=24 Identities=25% Similarity=0.304 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 799999984589999961256877 Q gi|254780915|r 410 DVMTLAHELGHGIHFVLSSETQGI 433 (626) Q Consensus 410 dv~TL~HE~GHa~H~~ls~~~~~~ 433 (626) --.|-+||-|||+-.+++...-|+ T Consensus 557 k~~TAyHE~GHAivA~yTk~A~Pl 580 (752) T KOG0734 557 KKITAYHEGGHAIVALYTKGAMPL 580 (752) T ss_pred HHHHHHHCCCCEEEEEECCCCCCC T ss_conf 443443326725887512788664 No 136 >pfam11350 DUF3152 Protein of unknown function (DUF3152). Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function. Probab=29.95 E-value=16 Score=14.62 Aligned_cols=21 Identities=29% Similarity=0.612 Sum_probs=13.1 Q ss_pred CCCCHHHHHH--HHHHHHHHHHH Q ss_conf 5332467999--99984589999 Q gi|254780915|r 404 YLGKPQDVMT--LAHELGHGIHF 424 (626) Q Consensus 404 ~~~~~~dv~T--L~HE~GHa~H~ 424 (626) |.+++..... +=||.||++=+ T Consensus 130 f~gdl~~YR~y~INHEVGH~LG~ 152 (189) T pfam11350 130 FEGDLGSYRQYVINHEVGHAIGY 152 (189) T ss_pred CCCCHHHHHHHHHHHHHHHHCCC T ss_conf 36589999999870323201255 No 137 >TIGR01122 ilvE_I branched-chain amino acid aminotransferase; InterPro: IPR005785 Aminotransferases share certain mechanistic features with other pyridoxal-phosphate dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine residue. On the basis of sequence similarity, these various enzymes can be grouped into subfamilies. One of these, called class-IV, currently consists of proteins of about 270 to 415 amino-acid residues that share a few regions of sequence similarity. Surprisingly, the best conserved region does not include the lysine residue to which the pyridoxal-phosphate group is known to be attached, in ilvE, but is located some 40 residues at the C-terminus side of the PlP-lysine. Among the class IV aminotransferases are two phylogenetically separable groups of branched-chain amino acid aminotransferase (IlvE) (2.6.1.42 from EC). The last common ancestor of the two lineages appears also to have given rise to a family of D-amino acid aminotransferases (DAAT). This model represents the IlvE family more strongly similar to the DAAT family. ; GO: 0004084 branched-chain-amino-acid transaminase activity, 0009081 branched chain family amino acid metabolic process. Probab=28.74 E-value=11 Score=15.67 Aligned_cols=22 Identities=36% Similarity=0.416 Sum_probs=16.6 Q ss_pred CCCCC-CHHHHHHHHH-HHHHHHH Q ss_conf 36533-2467999999-8458999 Q gi|254780915|r 402 LNYLG-KPQDVMTLAH-ELGHGIH 423 (626) Q Consensus 402 ~N~~~-~~~dv~TL~H-E~GHa~H 423 (626) .+..| |-|.|.||+| |+|=-+| T Consensus 209 s~L~GITRd~VitlA~ke~G~~V~ 232 (302) T TIGR01122 209 SILKGITRDTVITLAKKELGIEVV 232 (302) T ss_pred HHCCCCCHHHHHHHHHHCCCEEEE T ss_conf 224888888999998510862688 No 138 >pfam12057 DUF3538 Domain of unknown function (DUF3538). This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 120 amino acids in length. This domain is found associated with pfam00240. This domain has a conserved SDL sequence motif. Probab=27.82 E-value=30 Score=12.58 Aligned_cols=18 Identities=28% Similarity=0.191 Sum_probs=10.5 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 246799999984589999 Q gi|254780915|r 407 KPQDVMTLAHELGHGIHF 424 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~H~ 424 (626) .++-|.--+|=+|||.|. T Consensus 46 l~n~vsE~lH~LsHa~hA 63 (118) T pfam12057 46 IFDRVSEALHYLSHAQHA 63 (118) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 889999999999999999 No 139 >TIGR01325 O_suc_HS_sulf O-succinylhomoserine sulfhydrylase; InterPro: IPR006234 These sequences represent O-succinylhomoserine sulfhydrylase, one of several related pyridoxal phosphate-dependent enzymes of cysteine and methionine metabolism. This enzyme is part of an alternative pathway of homocysteine biosynthesis, a step in methionine biosynthesis.. Probab=27.25 E-value=18 Score=14.12 Aligned_cols=35 Identities=11% Similarity=0.161 Sum_probs=23.7 Q ss_pred HHHHHHHHCC----CCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9974233102----3349989999999999999999999 Q gi|254780915|r 69 TRWEGNLAHA----TNQKNCHSLGAAIAEYERICELIGR 103 (626) Q Consensus 69 ~~~~~~l~~~----~~~~~~~~l~~~l~~~e~i~~~l~r 103 (626) ..|.|++..- .+|.+...|++=|.++|-...-.-. T Consensus 37 ~~FAge~~GfvYsRy~NPTv~~FE~Rla~LEGAe~A~At 75 (386) T TIGR01325 37 KRFAGEVPGFVYSRYANPTVKVFEERLALLEGAERAVAT 75 (386) T ss_pred HHCCCCCCCCEEECCCCCCHHHHHHHHHHHHCHHHHHHH T ss_conf 753676889666226986157889999974060378864 No 140 >COG2321 Predicted metalloprotease [General function prediction only] Probab=25.92 E-value=33 Score=12.35 Aligned_cols=108 Identities=16% Similarity=0.128 Sum_probs=52.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---HHCCCCCHHCCCCCCCCCCCCC----C--CCCCCCEEECCCCC--------- Q ss_conf 9999999986666328999999986---4248502010379987435476----6--67775101036533--------- Q gi|254780915|r 345 EVARDLVLQSYAKFSPQMSIIAEKF---FTHNWIDAPQYDGKGSGAFAHG----T--IPSVHPYILLNYLG--------- 406 (626) Q Consensus 345 ee~~~~v~~~~~~~~~e~~~~~~~~---~~~~~iD~~~r~gKr~GA~~~~----~--~~~~~~~I~~N~~~--------- 406 (626) ++....+...+.........++... ....-+=+|.+.-..+.++..+ | +.....|+=+.|.. T Consensus 83 ~e~~~f~~~vla~tEd~W~~if~~~~~~Y~~Ptlvlf~~~v~t~CG~assasGPFYCP~D~kvYlDlsFf~~m~~~fga~ 162 (295) T COG2321 83 DEIDKFVSVVLADTEDTWMQIFQESGRTYQKPTLVLFSGQVRTGCGFASSASGPFYCPADTKVYLDLSFFDEMKTKFGAS 162 (295) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCEEEEEHHHHHHHHHHHCCC T ss_conf 78888999985128889999998733034697599814763457678876778864678750786126999999983688 Q ss_pred -CHHHHHHHHHHHHHHHHHHHHCCCC-CCCCCCCHHHHHHHHHHHHHH Q ss_conf -2467999999845899999612568-776578005999886888999 Q gi|254780915|r 407 -KPQDVMTLAHELGHGIHFVLSSETQ-GILTNNSSLTLAETASIFGET 452 (626) Q Consensus 407 -~~~dv~TL~HE~GHa~H~~ls~~~~-~~~~~~~~~~~aE~~S~~~E~ 452 (626) .+-..-.++||.||-+++++.--.+ .....+.+..-+--+|+-.|. T Consensus 163 GdfAqaYViAHEVGHHVQnllGi~~kv~~~~q~~s~~~an~~sVr~EL 210 (295) T COG2321 163 GDFAQAYVIAHEVGHHVQNLLGILPKVNQNQQGMSQAEANQLSVRREL 210 (295) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 627999898865407788776036999988753303313675344201 No 141 >pfam01431 Peptidase_M13 Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be involved with milk protein cleavage. Probab=24.52 E-value=35 Score=12.18 Aligned_cols=15 Identities=40% Similarity=0.465 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999845899999 Q gi|254780915|r 411 VMTLAHELGHGIHFV 425 (626) Q Consensus 411 v~TL~HE~GHa~H~~ 425 (626) -.+++||++||+..- T Consensus 37 G~iiaHEi~HafD~~ 51 (203) T pfam01431 37 GFVLGHELMHGFDDQ 51 (203) T ss_pred HHHHHHHHHHHHCHH T ss_conf 999999998770602 No 142 >pfam04838 Baculo_LEF5 Baculoviridae late expression factor 5. Probab=23.82 E-value=10 Score=15.97 Aligned_cols=31 Identities=19% Similarity=0.246 Sum_probs=16.8 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0103653324679999998458999996125 Q gi|254780915|r 399 YILLNYLGKPQDVMTLAHELGHGIHFVLSSE 429 (626) Q Consensus 399 ~I~~N~~~~~~dv~TL~HE~GHa~H~~ls~~ 429 (626) |++.||++....-+==|+--||-+|++++-. T Consensus 23 fL~~nyp~nVKNkTFNF~nTGHlFHsLyAYv 53 (159) T pfam04838 23 YLITNYPKNVKNKTFNFANTGHLFHSLYAYI 53 (159) T ss_pred HHHHHCCCCCCCCEECCCCCCHHHHHHHHCC T ss_conf 9986170111347320477762256554216 No 143 >cd07219 Pat_PNPLA1 Patatin-like phospholipase domain containing protein 1. Members of this family share a patatin domain, initially discovered in potato tubers. Some members of PNPLA1 subfamily do not have the lipase consensus sequence Gly-X-Ser-X-Gly which is essential for hydrolase activity. This family includes PNPLA1 from Homo sapiens and Gallus gallus. Currently, there is no literature available on the physiological role, structure, or enzymatic activity of PNPLA1. It is expressed in various human tissues in low mRNA levels. Probab=23.17 E-value=18 Score=14.16 Aligned_cols=65 Identities=15% Similarity=0.272 Sum_probs=41.3 Q ss_pred CHHHHHHHHH-HHHHHHHHHHHHHHH-HHHHCCCCCHHCCCCCCCCCCCCCCCC--CCCCEEECCCCCC----HHHHHHH Q ss_conf 9899999999-866663289999999-864248502010379987435476667--7751010365332----4679999 Q gi|254780915|r 343 PFEVARDLVL-QSYAKFSPQMSIIAE-KFFTHNWIDAPQYDGKGSGAFAHGTIP--SVHPYILLNYLGK----PQDVMTL 414 (626) Q Consensus 343 ~~ee~~~~v~-~~~~~~~~e~~~~~~-~~~~~~~iD~~~r~gKr~GA~~~~~~~--~~~~~I~~N~~~~----~~dv~TL 414 (626) +-||+++-+. .+| -|-|..+.- ...+.+||| |||+.+.+. .....-+..|.|. .+|..+. T Consensus 139 SREELIqALlCScF---IPvYcGliPP~fRG~RYID---------GGfSdnlP~~~~~~TITVSPFsGesDICPrD~s~~ 206 (382) T cd07219 139 SKEELIEALYCSCF---VPVYCGLIPPTYRGVRYID---------GGFTGMQPCSFWTDSITISTFSGQQDICPRDCPAI 206 (382) T ss_pred CHHHHHHHHHHHCC---CCHHCCCCCCCCCCEEEEC---------CCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCC T ss_conf 88999999988223---3253088886537606644---------75468877567898379788888887799998742 Q ss_pred HHHHH Q ss_conf 99845 Q gi|254780915|r 415 AHELG 419 (626) Q Consensus 415 ~HE~G 419 (626) ||||- T Consensus 207 fh~~~ 211 (382) T cd07219 207 FHDFR 211 (382) T ss_pred CEEEE T ss_conf 00367 No 144 >COG0465 HflB ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones] Probab=22.65 E-value=37 Score=11.93 Aligned_cols=31 Identities=19% Similarity=0.194 Sum_probs=21.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCC-CCCCHHH Q ss_conf 999999845899999612568776-5780059 Q gi|254780915|r 411 VMTLAHELGHGIHFVLSSETQGIL-TNNSSLT 441 (626) Q Consensus 411 v~TL~HE~GHa~H~~ls~~~~~~~-~~~~~~~ 441 (626) -.|-.||.|||+-+++....-+.. -+-.|+. T Consensus 407 ~~~AYhEaghalv~~~l~~~d~v~KvtIiPrG 438 (596) T COG0465 407 KITAYHEAGHALVGLLLPDADPVHKVTIIPRG 438 (596) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCEEEECCCC T ss_conf 32578899999999727887620035522672 No 145 >pfam02102 Peptidase_M35 Deuterolysin metalloprotease (M35) family. Probab=22.51 E-value=12 Score=15.34 Aligned_cols=17 Identities=24% Similarity=0.270 Sum_probs=14.0 Q ss_pred CCHHHHHHHHHHHHHHH Q ss_conf 32467999999845899 Q gi|254780915|r 406 GKPQDVMTLAHELGHGI 422 (626) Q Consensus 406 ~~~~dv~TL~HE~GHa~ 422 (626) ...|...|+.|||-|+= T Consensus 293 h~qDqAtT~lHE~TH~~ 309 (352) T pfam02102 293 HAQDQATTTLHEFTHAP 309 (352) T ss_pred CCCCCCCEEEEEHHCCC T ss_conf 67454410122012455 No 146 >pfam07203 DUF1412 Protein of unknown function (DUF1412). This family consists of several Caenorhabditis elegans proteins of around 70-75 residues in length. The function of this family is unknown. Probab=21.15 E-value=40 Score=11.73 Aligned_cols=16 Identities=31% Similarity=0.613 Sum_probs=11.9 Q ss_pred CCCEEECCCCCCCCCCC Q ss_conf 88500152541066752 Q gi|254780915|r 534 YGSFWMMVPHFIESSFY 550 (626) Q Consensus 534 ~~~~w~~~~H~y~~~fY 550 (626) ....|++.||.| +|-| T Consensus 53 tslGWaQVPh~~-sPMf 68 (74) T pfam07203 53 TSLGWAQVPHVY-SPMF 68 (74) T ss_pred CCCCCEECCCCC-CCCC T ss_conf 655400075334-7443 No 147 >PRK10735 tldD protease TldD; Provisional Probab=20.41 E-value=41 Score=11.62 Aligned_cols=32 Identities=25% Similarity=0.270 Sum_probs=18.2 Q ss_pred CCCCCCEEECCCCCCHHHHHHHHHH-HHHHHHHHHHCC Q ss_conf 6777510103653324679999998-458999996125 Q gi|254780915|r 393 IPSVHPYILLNYLGKPQDVMTLAHE-LGHGIHFVLSSE 429 (626) Q Consensus 393 ~~~~~~~I~~N~~~~~~dv~TL~HE-~GHa~H~~ls~~ 429 (626) +.+.-|.|+-+ +-+-.|+|| +||++-.-+.++ T Consensus 244 p~G~~pVVl~~-----~~~g~l~hea~gh~~e~d~~~~ 276 (481) T PRK10735 244 PAGTMPVVLGA-----GWPGVLLHEAVGHGLEGDFNRR 276 (481) T ss_pred CCCEEEEEECC-----CHHHHHHHHHHHHHHCCCHHHC T ss_conf 97146079768-----3789999998754412002313 No 148 >KOG3538 consensus Probab=20.24 E-value=16 Score=14.58 Aligned_cols=21 Identities=33% Similarity=0.442 Sum_probs=15.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHC Q ss_conf 2467999999845899999612 Q gi|254780915|r 407 KPQDVMTLAHELGHGIHFVLSS 428 (626) Q Consensus 407 ~~~dv~TL~HE~GHa~H~~ls~ 428 (626) -+...+|++||+||-+ +|--+ T Consensus 314 gl~~aftiahE~gH~~-gm~hd 334 (845) T KOG3538 314 GLPSAFTIAHELGHNF-GMEHD 334 (845) T ss_pred CCCCCCCEECCCCCCC-CCCCC T ss_conf 8756554000025688-86567 Done!