Query gi|254780384|ref|YP_003064797.1| serralysin [Candidatus Liberibacter asiaticus str. psy62] Match_columns 665 No_of_seqs 305 out of 556 Neff 6.2 Searched_HMMs 39220 Date Sun May 29 20:42:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780384.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 cd04277 ZnMc_serralysin_like Z 100.0 1.4E-40 4.2E-45 262.3 12.2 154 39-196 23-186 (186) 2 pfam08548 Peptidase_M10_C Pept 100.0 2.1E-32 5.4E-37 213.0 12.9 127 197-326 1-137 (220) 3 pfam00413 Peptidase_M10 Matrix 99.8 3.7E-20 9.3E-25 139.1 9.6 138 23-196 2-158 (158) 4 cd04278 ZnMc_MMP Zinc-dependen 99.8 4.1E-19 1E-23 132.8 9.6 138 23-196 2-157 (157) 5 cd04279 ZnMc_MMP_like_1 Zinc-d 99.7 2.5E-16 6.4E-21 115.9 8.7 125 48-196 19-156 (156) 6 cd00203 ZnMc Zinc-dependent me 99.7 6.4E-16 1.6E-20 113.5 9.8 142 43-195 15-167 (167) 7 KOG1565 consensus 99.6 3.6E-15 9.3E-20 109.0 6.1 138 35-200 106-265 (469) 8 cd04268 ZnMc_MMP_like Zinc-dep 99.5 7.4E-14 1.9E-18 101.1 6.2 151 39-195 3-165 (165) 9 cd04327 ZnMc_MMP_like_3 Zinc-d 99.4 8.1E-12 2.1E-16 88.8 11.5 157 37-195 6-197 (198) 10 smart00235 ZnMc Zinc-dependent 99.3 2.5E-11 6.5E-16 85.7 8.1 91 38-142 8-108 (140) 11 pfam12388 Peptidase_M57 Dual-a 99.1 8.4E-10 2.1E-14 76.6 10.4 172 7-195 8-211 (211) 12 cd04281 ZnMc_BMP1_TLD Zinc-dep 98.9 2.2E-08 5.6E-13 68.0 11.5 97 35-142 10-109 (200) 13 pfam01400 Astacin Astacin (Pep 98.9 3.4E-08 8.8E-13 66.9 11.9 95 37-143 5-103 (192) 14 cd04283 ZnMc_hatching_enzyme Z 98.9 3.4E-08 8.7E-13 66.9 10.4 85 45-141 12-98 (182) 15 cd04280 ZnMc_astacin_like Zinc 98.8 1.9E-07 4.9E-12 62.4 11.5 89 42-142 6-96 (180) 16 KOG3714 consensus 98.7 1.4E-07 3.7E-12 63.1 10.3 152 33-199 80-271 (411) 17 cd04282 ZnMc_meprin Zinc-depen 98.7 2.5E-07 6.3E-12 61.7 9.7 91 39-141 49-141 (230) 18 cd04276 ZnMc_MMP_like_2 Zinc-d 98.5 3.6E-08 9.1E-13 66.8 1.4 157 36-196 6-196 (197) 19 pfam08548 Peptidase_M10_C Pept 98.4 2.9E-06 7.4E-11 55.2 10.2 55 259-313 81-135 (220) 20 COG2931 RTX toxins and related 98.4 1.9E-06 4.8E-11 56.4 8.3 60 266-325 155-217 (510) 21 COG2931 RTX toxins and related 98.2 7.3E-06 1.9E-10 52.8 7.1 127 268-398 275-406 (510) 22 pfam05572 Peptidase_M43 Pregna 97.7 1.8E-05 4.7E-10 50.4 2.3 55 85-141 34-90 (152) 23 cd04275 ZnMc_pappalysin_like Z 97.4 7.8E-05 2E-09 46.6 1.8 55 85-141 100-158 (225) 24 pfam02031 Peptidase_M7 Strepto 97.0 0.001 2.6E-08 39.8 4.7 89 39-136 3-93 (132) 25 COG5549 Predicted Zn-dependent 97.0 0.00048 1.2E-08 41.8 2.5 92 46-137 98-205 (236) 26 COG1913 Predicted Zn-dependent 96.1 0.0019 4.9E-08 38.2 0.8 17 121-137 125-141 (181) 27 cd04272 ZnMc_salivary_gland_MP 96.0 0.0072 1.8E-07 34.7 3.5 22 120-141 145-166 (220) 28 cd04267 ZnMc_ADAM_like Zinc-de 95.9 0.0073 1.9E-07 34.7 3.2 38 120-166 133-170 (192) 29 pfam05547 Peptidase_M6 Immune 95.6 0.016 4.1E-07 32.6 4.0 73 115-198 217-289 (646) 30 PRK13267 archaemetzincin-like 95.2 0.0064 1.6E-07 35.1 0.9 19 122-140 125-143 (177) 31 pfam10462 Peptidase_M66 Peptid 95.2 0.0053 1.3E-07 35.6 0.3 16 122-137 194-209 (304) 32 pfam07998 Peptidase_M54 Peptid 95.1 0.008 2E-07 34.5 0.9 21 120-140 126-146 (176) 33 cd04271 ZnMc_ADAM_fungal Zinc- 94.8 0.0088 2.2E-07 34.2 0.6 21 117-137 142-162 (228) 34 cd04269 ZnMc_adamalysin_II_lik 94.8 0.012 3E-07 33.5 1.1 19 122-140 133-151 (194) 35 pfam11350 DUF3152 Protein of u 94.7 0.022 5.7E-07 31.8 2.4 87 49-137 51-155 (189) 36 cd04273 ZnMc_ADAMTS_like Zinc- 94.5 0.015 3.8E-07 32.8 1.2 20 121-140 141-160 (207) 37 pfam01421 Reprolysin Reprolysi 94.3 0.016 4E-07 32.7 0.9 19 121-139 132-150 (198) 38 pfam05548 Peptidase_M11 Gameto 94.0 0.021 5.3E-07 32.0 1.1 32 108-139 124-155 (303) 39 pfam12044 Metallopep Putative 93.9 0.021 5.4E-07 31.9 0.9 21 119-139 312-332 (419) 40 TIGR03296 M6dom_TIGR03296 M6 f 92.6 0.078 2E-06 28.5 2.2 41 118-166 163-203 (286) 41 pfam11996 DUF3491 Protein of u 90.7 0.93 2.4E-05 22.0 6.1 32 219-250 347-380 (934) 42 TIGR00181 pepF oligoendopeptid 90.3 0.075 1.9E-06 28.6 0.2 67 44-132 331-401 (611) 43 TIGR02289 M3_not_pepF oligoend 89.8 0.099 2.5E-06 27.9 0.4 73 51-134 277-354 (553) 44 cd04270 ZnMc_TACE_like Zinc-de 89.7 0.22 5.6E-06 25.8 2.1 20 121-140 168-187 (244) 45 COG3824 Predicted Zn-dependent 89.5 0.11 2.8E-06 27.6 0.5 24 113-136 102-125 (136) 46 KOG4525 consensus 86.0 0.33 8.5E-06 24.7 1.2 80 119-208 300-383 (614) 47 KOG3607 consensus 86.0 0.25 6.5E-06 25.4 0.6 19 120-138 323-341 (716) 48 pfam00353 HemolysinCabind Hemo 85.3 0.42 1.1E-05 24.1 1.4 20 290-309 1-20 (26) 49 pfam06262 DUF1025 Domain of un 82.5 0.47 1.2E-05 23.8 0.7 21 116-136 68-88 (96) 50 cd06459 M3B_Oligoendopeptidase 81.5 0.46 1.2E-05 23.8 0.4 14 119-132 221-234 (427) 51 pfam03272 Enhancin Viral enhan 81.0 0.92 2.3E-05 22.0 1.8 26 354-379 531-556 (775) 52 pfam01457 Peptidase_M8 Leishma 81.0 0.58 1.5E-05 23.3 0.7 20 119-138 239-258 (569) 53 PTZ00257 Glycoprotein GP63 (le 80.7 0.6 1.5E-05 23.1 0.7 17 121-137 257-273 (615) 54 pfam09471 Peptidase_M64 IgA Pe 80.7 0.39 1E-05 24.3 -0.2 26 117-144 206-232 (257) 55 TIGR02290 M3_fam_3 oligoendope 78.3 0.67 1.7E-05 22.8 0.3 13 120-132 385-397 (600) 56 pfam01432 Peptidase_M3 Peptida 77.5 0.75 1.9E-05 22.6 0.4 19 57-75 141-159 (448) 57 PTZ00337 surface protease GP63 76.8 0.82 2.1E-05 22.3 0.4 22 117-138 226-247 (567) 58 pfam02382 RTX RTX N-terminal d 74.2 4.9 0.00012 17.7 4.8 20 291-310 622-641 (653) 59 cd06258 Peptidase_M3_like The 71.8 1.1 2.9E-05 21.5 0.1 17 119-135 153-170 (365) 60 KOG3538 consensus 70.9 1.4 3.6E-05 20.9 0.5 22 120-141 317-338 (845) 61 COG1164 Oligoendopeptidase F [ 70.8 1.2 3.1E-05 21.3 0.1 10 122-131 382-391 (598) 62 KOG3658 consensus 70.7 1.8 4.6E-05 20.2 1.0 20 346-365 365-384 (764) 63 pfam06594 HCBP_related Haemoly 69.7 5.8 0.00015 17.2 3.4 14 338-351 24-37 (43) 64 cd06162 S2P-M50_PDZ_SREBP Ster 69.6 1.5 3.9E-05 20.7 0.4 13 122-134 137-149 (277) 65 COG2856 Predicted Zn peptidase 66.9 1.9 4.8E-05 20.1 0.4 18 119-136 71-88 (213) 66 cd06455 M3A_TOP Peptidase M3 T 66.3 1.8 4.5E-05 20.3 0.2 36 190-231 168-204 (472) 67 cd06164 S2P-M50_SpoIVFB_CBS Sp 65.1 2.4 6.1E-05 19.5 0.7 15 122-136 55-69 (227) 68 cd06457 M3A_MIP Peptidase M3 m 64.7 2.1 5.5E-05 19.8 0.4 18 57-74 138-155 (458) 69 cd06161 S2P-M50_SpoIVFB SpoIVF 64.6 2.5 6.3E-05 19.4 0.7 16 122-137 40-55 (208) 70 pfam05279 Asp-B-Hydro_N Aspart 63.6 2.8 7.1E-05 19.1 0.8 22 52-73 15-36 (232) 71 PRK10280 dipeptidyl carboxypep 62.1 2.3 6E-05 19.6 0.2 33 190-228 373-406 (681) 72 pfam01434 Peptidase_M41 Peptid 60.9 2.9 7.3E-05 19.0 0.4 17 121-137 9-26 (192) 73 LOAD_S2Pmetalloprt consensus 60.7 3.2 8.2E-05 18.8 0.7 18 122-139 9-26 (148) 74 KOG2921 consensus 60.1 3.2 8.3E-05 18.7 0.6 13 123-135 134-146 (484) 75 PRK10911 oligopeptidase A; Pro 59.1 3.1 7.9E-05 18.8 0.4 16 123-138 264-279 (680) 76 pfam02163 Peptidase_M50 Peptid 56.7 4.1 0.00011 18.1 0.7 17 123-139 10-26 (205) 77 COG0339 Dcp Zn-dependent oligo 56.6 3.6 9.1E-05 18.5 0.3 17 123-139 267-283 (683) 78 cd05709 S2P-M50 Site-2 proteas 54.5 4.7 0.00012 17.7 0.7 18 122-139 10-27 (180) 79 cd06460 M32_Taq Peptidase fami 54.0 4.5 0.00012 17.8 0.5 17 120-136 159-178 (396) 80 cd06160 S2P-M50_like_2 Unchara 52.6 5.3 0.00013 17.5 0.7 19 120-138 41-59 (183) 81 KOG2090 consensus 46.9 6.6 0.00017 16.9 0.4 26 539-566 641-669 (704) 82 TIGR00054 TIGR00054 membrane-a 45.4 7.4 0.00019 16.5 0.5 11 121-131 16-26 (463) 83 cd02641 R3H_Smubp-2_like R3H d 43.8 10 0.00026 15.7 1.0 15 123-137 32-46 (60) 84 pfam08463 EcoEI_R_C EcoEI R pr 42.5 11 0.00028 15.5 1.0 61 518-602 130-190 (232) 85 pfam10263 SprT-like SprT-like 42.4 12 0.00032 15.2 1.3 21 119-139 57-83 (153) 86 pfam02074 Peptidase_M32 Carbox 42.3 9.1 0.00023 16.0 0.6 12 121-132 260-271 (494) 87 PRK10733 hflB ATP-dependent me 41.4 9.1 0.00023 16.0 0.4 11 606-616 583-593 (644) 88 cd06158 S2P-M50_like_1 Unchara 39.7 11 0.00028 15.5 0.7 14 124-137 13-26 (181) 89 cd06163 S2P-M50_PDZ_RseP-like 39.0 11 0.00029 15.4 0.6 14 122-135 11-24 (182) 90 TIGR02604 Piru_Ver_Nterm putat 37.0 17 0.00043 14.4 1.3 25 115-139 58-83 (459) 91 COG2317 Zn-dependent carboxype 36.5 12 0.00031 15.2 0.5 10 123-132 262-271 (497) 92 cd02639 R3H_RRM R3H domain of 36.4 15 0.00038 14.7 0.9 17 123-139 32-48 (60) 93 pfam06114 DUF955 Domain of unk 36.2 12 0.00031 15.2 0.4 14 120-133 41-54 (121) 94 smart00731 SprT SprT homologue 35.1 18 0.00047 14.2 1.2 20 119-138 58-81 (146) 95 PRK10779 zinc metallopeptidase 34.5 14 0.00035 14.9 0.5 17 120-136 15-31 (449) 96 KOG3854 consensus 34.2 13 0.00034 15.0 0.4 12 522-533 376-387 (505) 97 pfam06189 5-nucleotidase 5'-nu 33.8 24 0.00061 13.5 1.6 13 410-422 120-132 (263) 98 COG4227 Antirestriction protei 29.8 15 0.00038 14.7 0.0 23 508-530 212-235 (316) 99 TIGR01241 FtsH_fam ATP-depende 29.8 18 0.00046 14.2 0.4 26 121-150 317-343 (505) 100 cd06159 S2P-M50_PDZ_Arch Uncha 28.6 21 0.00055 13.8 0.6 12 121-132 119-130 (263) 101 pfam01742 Peptidase_M27 Clostr 27.2 19 0.00049 14.1 0.2 13 415-427 210-222 (408) 102 cd02640 R3H_NRF R3H domain of 26.9 28 0.0007 13.1 0.9 15 123-137 32-46 (60) 103 TIGR01338 phycocy_alpha phycoc 25.6 24 0.00062 13.4 0.5 70 522-599 83-157 (161) 104 COG2738 Predicted Zn-dependent 25.3 24 0.00062 13.4 0.4 83 50-136 19-108 (226) 105 COG3793 TerB Tellurite resista 24.9 32 0.00082 12.7 1.0 10 125-134 128-137 (144) 106 pfam07108 PipA PipA protein. T 24.6 28 0.00072 13.0 0.7 14 586-599 175-188 (228) 107 pfam02102 Peptidase_M35 Deuter 23.8 30 0.00077 12.9 0.7 28 556-586 320-347 (352) 108 TIGR00962 atpA ATP synthase F1 23.2 38 0.00098 12.2 1.3 30 571-600 402-433 (520) 109 PRK13262 ureE urease accessory 23.0 31 0.00079 12.8 0.6 16 124-139 105-120 (231) 110 COG4783 Putative Zn-dependent 21.7 31 0.00079 12.8 0.4 12 122-133 132-143 (484) 111 pfam09954 DUF2188 Uncharacteri 21.4 40 0.001 12.1 0.9 22 635-656 87-113 (131) 112 COG4744 Uncharacterized conser 21.0 18 0.00046 14.2 -0.9 41 158-198 40-80 (121) No 1 >cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides. Probab=100.00 E-value=1.4e-40 Score=262.31 Aligned_cols=154 Identities=40% Similarity=0.538 Sum_probs=124.7 Q ss_pred EEEEEEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCC--CCEEEECCC------CEEEEECCCCCC--CCCCCCCCCCC Q ss_conf 02688750799999999999998865747568982478--716896158------535663146775--31224433544 Q gi|254780384|r 39 RLSFDITELNPNAQEVARWALGEWSKVVDLTFEETSIH--SDIKFISSN------NGYVCTPRYDSR--YFMEINFDKAD 108 (665) Q Consensus 39 ~~t~~it~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~--~d~~f~~~~------~g~a~~p~~~~~--~~~~~~~~~~~ 108 (665) ....++.+|+++||+++|.||+.|++|++|+|+||+.. ++++|...+ .|+++.|..... ...++|++... T Consensus 23 ~~~~~~~~~s~~q~~~ir~a~~~w~~v~~i~F~ev~~~~~adI~~~~~~~~~~~~~g~a~~p~~~~~~~~~gdi~~~~~~ 102 (186) T cd04277 23 EDTTNTAALSAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSY 102 (186) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEEECCCCCCCEEEEEECCCCCCCCCCCCEEEECCCC T ss_conf 57677416899999999999999872139449993688863289996058888825999658987776756448965876 Q ss_pred CCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHH Q ss_conf 33366666734663003234655187888876667666677754101420244675235677666655200021122148 Q gi|254780384|r 109 VWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDI 188 (665) Q Consensus 109 ~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI 188 (665) .+. ....|+|.++|+||||||||||+|||+++++.+. ..++.+|+.++|||||+.+........+.+|.|||++|| T Consensus 103 ~~~-~~~~g~~~~~t~lHEIGHALGL~Hp~d~~~~~~~---~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI 178 (186) T cd04277 103 DTN-SDSPGSYGYQTIIHEIGHALGLEHPGDYNGGDPV---PPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDI 178 (186) T ss_pred CCC-CCCCCCCCEEEEEEEECHHCCCCCCCCCCCCCCC---CCCHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 557-7787754247888881013267888767777777---775335084735886557888886646646578758999 Q ss_pred HHHHHHCC Q ss_conf 99998608 Q gi|254780384|r 189 VAVQKIYG 196 (665) Q Consensus 189 ~Alq~lYG 196 (665) +|||+||| T Consensus 179 aAlQ~lYG 186 (186) T cd04277 179 AALQYLYG 186 (186) T ss_pred HHHHHHHC T ss_conf 99999629 No 2 >pfam08548 Peptidase_M10_C Peptidase M10 serralysin C terminal. Serralysins are peptidases related to mammalian matrix metallopeptidases (MMPs). The peptidase unit is found at the N terminal while this domain at the C terminal forms a corkscrew and is thought to be important for secretion of the protein through the bacterial cell wall. This domain contains the calcium ion binding domain pfam00353. Probab=100.00 E-value=2.1e-32 Score=212.98 Aligned_cols=127 Identities=29% Similarity=0.506 Sum_probs=111.3 Q ss_pred CC-CCCCCCCEEECCCCCC---------CCCEEEEEECCCCCEEEEEEECCCCCCEEECCCCCCCEECCCCCCEEEECCC Q ss_conf 76-6666773463257678---------7635898722897325998504787424744587310103555754773482 Q gi|254780384|r 197 AP-KGSLGNNIYKIEVQQK---------DSPFIQTIYDSGGDDLLDLTLIGGVKNYIDLNTESWSSIGGYEKNMTIAQHT 266 (665) Q Consensus 197 an-~~~~gdt~y~~~~~~~---------~~~~~~tI~D~gG~DTid~s~~~~~~~~IdL~~g~~S~~~g~~~n~~Ia~gt 266 (665) || ++|+|||||+|++++. ..++.+||||+||+||||+|+++..+ +|||+++.||++++.++|++||+++ T Consensus 1 an~~Tr~GdtvYgfnsntg~~~~~~~~~~~~~~~tiwD~gG~DTiD~Sg~~~~~-~IdL~~g~~S~vgg~~gN~~Ia~g~ 79 (220) T pfam08548 1 ANMTTRTGDTVYGFNSNTDRDFYSATSSSDKLIFSVWDAGGNDTFDFSGFSQNQ-RINLNEGSFSDVGGLKGNVSIAAGV 79 (220) T ss_pred CCCCCCCCCCEECCCCCCCCCEEECCCCCCCEEEEEECCCCCEEECCCCCCCCE-EEECCCCCEECCCCCCCCEEECCCC T ss_conf 986224788737268889850461367889769999938987848546985460-8966688242145653745432773 Q ss_pred EEEEEECCCCCCEEEECCCCCEEECCCCCCEEECCCCCEEEEEECCCCCEEEEEECCCEE Q ss_conf 574466177651688266674686678634588888733899823665137872058259 Q gi|254780384|r 267 VIESVVGSSGNDVVIGNSANNVFFESSGNDVFDGSSGLDTFFYYAPSYLYKIYRFENAII 326 (665) Q Consensus 267 ~IEna~g~~g~D~i~gn~~~n~l~gg~g~d~~~g~~g~Dt~~~~~~~~~~~~~~~~~~~~ 326 (665) +|||++|++|+|+|+||.++|+|.||.|+|+|+|+.|.|++. +..+.+.+....+... T Consensus 80 ~IEn~~GgsG~D~l~Gn~~~n~L~Gg~G~D~l~Gg~G~D~l~--GG~G~D~f~~~~g~Ds 137 (220) T pfam08548 80 TIENAIGGSGNDLLIGNDAANILKGGAGNDIIYGGGGADQLW--GGAGKDTFVYGAASDS 137 (220) T ss_pred EEEEEECCCCCCEEECCCCCCEEECCCCCCEEECCCCCCEEE--CCCCCEEEEECCCCCC T ss_conf 166886667676898788885578789988687778871898--8998349984076657 No 3 >pfam00413 Peptidase_M10 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Probab=99.83 E-value=3.7e-20 Score=139.12 Aligned_cols=138 Identities=30% Similarity=0.386 Sum_probs=92.8 Q ss_pred ECCCEEEEECCCCCCCEEEEEEEECC-HHHHHHHHHHHHHHHHHHCCEEEEECC-CCCEEEECCC--------------- Q ss_conf 40434675057656750268875079-999999999999886574756898247-8716896158--------------- Q gi|254780384|r 23 WSNYFIRYKPDVLTTHRLSFDITELN-PNAQEVARWALGEWSKVVDLTFEETSI-HSDIKFISSN--------------- 85 (665) Q Consensus 23 ~~~~~~~~~~d~~~~~~~t~~it~l~-~~~~~~~R~Al~~ws~va~ItF~Ev~~-~~d~~f~~~~--------------- 85 (665) |+...++++...++. .++ ..++.++|.||+.|++|++|+|+||+. .+||++.+.. T Consensus 2 W~k~~lTy~i~~~~~--------~l~~~~vr~~i~~Af~~Ws~v~~l~F~ev~~~~adI~i~F~~~~h~d~~~fDg~gg~ 73 (158) T pfam00413 2 WKKKNLTYRIVNYTP--------DLPRDEVRRAIRRAFKVWSEVTPLTFTEVPEGTADIMIGFGRGEHGDGYPFDGPGGV 73 (158) T ss_pred CCCCCEEEEEECCCC--------CCCHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCCCCE T ss_conf 988815799835799--------989999999999999999863685479777898778999446767998886798986 Q ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCC--CCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 5356631467753122443354433366--66673466300323465518788887666766667775410142024467 Q gi|254780384|r 86 NGYVCTPRYDSRYFMEINFDKADVWRYG--IGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVM 163 (665) Q Consensus 86 ~g~a~~p~~~~~~~~~~~~~~~~~~~~~--~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvM 163 (665) -++++.|... .+++.++....|..+ ...+...+++++|||||||||+|.-. .-+|| T Consensus 74 laha~~P~~~---~G~ihfd~~e~w~~~~~~~~g~~l~~va~HEIGHaLGL~Hs~~-------------------~~siM 131 (158) T pfam00413 74 LAHAFFPGPI---GGDIHFDDDEQWTVGNESPEGTNLFLVAAHEIGHALGLGHSSD-------------------PDAIM 131 (158) T ss_pred EEECCCCCCC---CCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCC-------------------CCCEE T ss_conf 7751269865---7338965786530589876651155665454020026688999-------------------86677 Q ss_pred EECCCCCCCCCCCCCEECCHHHHHHHHHHHHCC Q ss_conf 523567766665520002112214899998608 Q gi|254780384|r 164 SYFMPQDSMMDASFGYCATPMVSDIVAVQKIYG 196 (665) Q Consensus 164 SY~~~~~~~~~~~~~~~~tpm~~DI~Alq~lYG 196 (665) ....... .....++-..||.+||.||| T Consensus 132 ~p~y~~~------~~~~~~L~~dDi~gIq~LYG 158 (158) T pfam00413 132 YPYYSPY------KDPVFRLDQDDIKGIQSLYG 158 (158) T ss_pred CCCCCCC------CCCCCCCCHHHHHHHHHHCC T ss_conf 7602378------78878799899999998659 No 4 >cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases). Probab=99.80 E-value=4.1e-19 Score=132.82 Aligned_cols=138 Identities=29% Similarity=0.371 Sum_probs=89.5 Q ss_pred ECCCEEEEECCCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCC--CCEEEECC---------------C Q ss_conf 404346750576567502688750799999999999998865747568982478--71689615---------------8 Q gi|254780384|r 23 WSNYFIRYKPDVLTTHRLSFDITELNPNAQEVARWALGEWSKVVDLTFEETSIH--SDIKFISS---------------N 85 (665) Q Consensus 23 ~~~~~~~~~~d~~~~~~~t~~it~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~--~d~~f~~~---------------~ 85 (665) |+...++++...++.. --...+|+++|.||+.|++|++|+|+||+.. +||++.+. . T Consensus 2 W~k~~lty~i~~~~~~-------l~~~~~r~~i~~Af~~Ws~v~~l~F~ev~~~~~AdI~i~f~~~~h~d~~~fDG~gg~ 74 (157) T cd04278 2 WSKTNLTYRILNYPPD-------LPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGT 74 (157) T ss_pred CCCCCEEEEEECCCCC-------CCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCEEEEEECCCCCCCCCCCCCCCE T ss_conf 9887058998427999-------899999999999999986647855598158887878999866766898885788886 Q ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCCC-CCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 53566314677531224433544333666-66734663003234655187888876667666677754101420244675 Q gi|254780384|r 86 NGYVCTPRYDSRYFMEINFDKADVWRYGI-GKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMS 164 (665) Q Consensus 86 ~g~a~~p~~~~~~~~~~~~~~~~~~~~~~-~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMS 164 (665) -++++.|... .+.+.++....|.... ..+...+.+++|||||||||.|.-+ .-+||. T Consensus 75 lahA~~p~~~---~G~ihfd~~e~w~~~~~~~~~~l~~va~HEIGHaLGL~Hs~~-------------------~~siM~ 132 (157) T cd04278 75 LAHAFFPGGI---GGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSD-------------------PDSIMY 132 (157) T ss_pred EEEEECCCCC---CCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCC-------------------CCCEEC T ss_conf 8987568886---641887777662126788760189999987565406888899-------------------975333 Q ss_pred ECCCCCCCCCCCCCEECCHHHHHHHHHHHHCC Q ss_conf 23567766665520002112214899998608 Q gi|254780384|r 165 YFMPQDSMMDASFGYCATPMVSDIVAVQKIYG 196 (665) Q Consensus 165 Y~~~~~~~~~~~~~~~~tpm~~DI~Alq~lYG 196 (665) -..... .....+-..||.+||.||| T Consensus 133 p~y~~~-------~~~~~L~~dDi~giq~LYG 157 (157) T cd04278 133 PYYQGP-------VPKFKLSQDDIRGIQALYG 157 (157) T ss_pred CCCCCC-------CCCCCCCHHHHHHHHHHCC T ss_conf 503588-------8789989899999998759 No 5 >cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=99.67 E-value=2.5e-16 Score=115.95 Aligned_cols=125 Identities=22% Similarity=0.199 Sum_probs=76.4 Q ss_pred CHHHHHHHHHHHHHHHHHHCCEEEEECC---CCCEEEECCC----------CEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999999999999886574756898247---8716896158----------53566314677531224433544333666 Q gi|254780384|r 48 NPNAQEVARWALGEWSKVVDLTFEETSI---HSDIKFISSN----------NGYVCTPRYDSRYFMEINFDKADVWRYGI 114 (665) Q Consensus 48 ~~~~~~~~R~Al~~ws~va~ItF~Ev~~---~~d~~f~~~~----------~g~a~~p~~~~~~~~~~~~~~~~~~~~~~ 114 (665) -...++++|.||+.|++|++|+|+|+.. .+||++...+ -+.++.|...................... T Consensus 19 ~~~~~~Ai~~A~~~Ws~v~~L~Fve~~~~~~~adI~I~f~~~~~~~g~gg~lA~a~~p~~~~~~~~~~~~~~~~~~~~~~ 98 (156) T cd04279 19 AQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDGNRKLFNRTDINLGPGQP 98 (156) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEEEEECCCCC T ss_conf 99999999999999856389369982477777876999658999889898686716898887776655011442011688 Q ss_pred CCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHH Q ss_conf 66734663003234655187888876667666677754101420244675235677666655200021122148999986 Q gi|254780384|r 115 GKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKI 194 (665) Q Consensus 115 ~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~l 194 (665) ..+...+.+++|||||+|||.|-.+. .-.||.-..... ..+...+-..||.+||.| T Consensus 99 ~~~~~l~~vA~HEiGHaLGL~Hss~~------------------~~aiM~P~y~~~------~~~~~~L~~dDi~gIq~l 154 (156) T cd04279 99 RGAENLQAIALHELGHALGLWHHSDR------------------PEDAMYPSQGQG------PDGNPTLSARDVATLKRL 154 (156) T ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCC------------------CCCEEEEEECCC------CCCCCCCCHHHHHHHHHH T ss_conf 66020999999986010144667899------------------666663252157------777898998999999997 Q ss_pred CC Q ss_conf 08 Q gi|254780384|r 195 YG 196 (665) Q Consensus 195 YG 196 (665) || T Consensus 155 Yg 156 (156) T cd04279 155 YG 156 (156) T ss_pred CC T ss_conf 59 No 6 >cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease. Probab=99.66 E-value=6.4e-16 Score=113.51 Aligned_cols=142 Identities=21% Similarity=0.215 Sum_probs=94.0 Q ss_pred EEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCC---CCEEEE-------CCCCEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 8750799999999999998865747568982478---716896-------158535663146775312244335443336 Q gi|254780384|r 43 DITELNPNAQEVARWALGEWSKVVDLTFEETSIH---SDIKFI-------SSNNGYVCTPRYDSRYFMEINFDKADVWRY 112 (665) Q Consensus 43 ~it~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~---~d~~f~-------~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~ 112 (665) +-..++..+|++++.|++.|++++.|+|++++.. .++.+. .+..++++.+......-...+.+.... T Consensus 15 ~~~~~~~~~~~~I~~Am~~~~~~tcIrF~~~~~e~~~~~i~~~~~~~~~~~~~gc~S~vG~~~~~~~q~~~l~~~~~--- 91 (167) T cd00203 15 EEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLRGVGVLQDNQS--- 91 (167) T ss_pred HHCCCCHHHHHHHHHHHHHHHHCCCCCCEECCCCCCCCCEEEEEEEECCCCCCEEEECCCCCCCCCCCEEEECCCCC--- T ss_conf 01158999999999999999857674737788887765367764330358998448518988679985389658974--- Q ss_pred CCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCC-CCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHH Q ss_conf 6666734663003234655187888876667666-677754101420244675235677666655200021122148999 Q gi|254780384|r 113 GIGKGTILSHNALHEIGHALGLMHPGSYNGGYPV-YGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAV 191 (665) Q Consensus 113 ~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~-~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Al 191 (665) .+.+...|++|||||||||.|++.....+.. .-....+..+....|||+|..... +.+-...+-..||..| T Consensus 92 ---~~~~~~gti~HElgHaLGf~He~sr~dRD~yv~i~~~~~~~~ydy~SVMhY~~~~f-----s~gqr~~~S~~Di~~i 163 (167) T cd00203 92 ---GTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSF-----SDGQRKDFSQCDIDQI 163 (167) T ss_pred ---CCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCCEECCCCCCC-----CCCCCCCCCHHHHHHH T ss_conf ---33255424899999984887555666667876871366789776661612489886-----8766567899999999 Q ss_pred HHHC Q ss_conf 9860 Q gi|254780384|r 192 QKIY 195 (665) Q Consensus 192 q~lY 195 (665) +.|| T Consensus 164 n~lY 167 (167) T cd00203 164 NKLY 167 (167) T ss_pred HHHC T ss_conf 9639 No 7 >KOG1565 consensus Probab=99.57 E-value=3.6e-15 Score=108.96 Aligned_cols=138 Identities=29% Similarity=0.425 Sum_probs=93.5 Q ss_pred CCCCEEEEEEEEC-----CHHHHHHHHHHHHHHHHHHCCEEEEECC--CCCEEEECC---------------CCEEEEEC Q ss_conf 5675026887507-----9999999999999886574756898247--871689615---------------85356631 Q gi|254780384|r 35 LTTHRLSFDITEL-----NPNAQEVARWALGEWSKVVDLTFEETSI--HSDIKFISS---------------NNGYVCTP 92 (665) Q Consensus 35 ~~~~~~t~~it~l-----~~~~~~~~R~Al~~ws~va~ItF~Ev~~--~~d~~f~~~---------------~~g~a~~p 92 (665) +...++|+-|... ..+.+.+++.|++.|++|+.++|+||.. .+||.+.+. .-++++.| T Consensus 106 W~k~~lT~ri~n~~~~l~~~~v~~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~P 185 (469) T KOG1565 106 WNKEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFP 185 (469) T ss_pred CCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCEECCCCC T ss_conf 64231321113538888878888899999853266888866467887888525533116789988625889732045478 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCC Q ss_conf 46775312244335443336666673466300323465518788887666766667775410142024467523567766 Q gi|254780384|r 93 RYDSRYFMEINFDKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSM 172 (665) Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~ 172 (665) ... .-++..+..++.|..+...|.-.+++++|||||+|||+|--+ ...||...... T Consensus 186 g~~--~~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS~~-------------------~~aiM~P~y~~--- 241 (469) T KOG1565 186 GPG--IGGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHSSD-------------------PDAIMYPFYQP--- 241 (469) T ss_pred CCC--CCCCCCCCCCCCEECCCCCCCCHHHHHHHHCCCCCCCCCCCC-------------------CCCCCCCCCCC--- T ss_conf 888--877654675425121677550067775430110026677778-------------------45013652326--- Q ss_pred CCCCCCEECCHHHHHHHHHHHHCCCCCC Q ss_conf 6655200021122148999986087666 Q gi|254780384|r 173 MDASFGYCATPMVSDIVAVQKIYGAPKG 200 (665) Q Consensus 173 ~~~~~~~~~tpm~~DI~Alq~lYGan~~ 200 (665) ....+. +-..||.+||+|||.+.. T Consensus 242 --~~~~~~--L~~DDv~giq~lYG~~~~ 265 (469) T KOG1565 242 --DSGNFD--LSQDDVRGIQHLYGGPPR 265 (469) T ss_pred --CCCCCC--CCHHHHHHHHHHHCCCCC T ss_conf --888851--673235666987099844 No 8 >cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases. Probab=99.47 E-value=7.4e-14 Score=101.05 Aligned_cols=151 Identities=21% Similarity=0.237 Sum_probs=93.0 Q ss_pred EEEEEEEE-CCHHHHHHHHHHHHHHHHHHCCEEEEECC--CCCEEEECCC------CEEEEECCCCC---CCCCCCCCCC Q ss_conf 02688750-79999999999999886574756898247--8716896158------53566314677---5312244335 Q gi|254780384|r 39 RLSFDITE-LNPNAQEVARWALGEWSKVVDLTFEETSI--HSDIKFISSN------NGYVCTPRYDS---RYFMEINFDK 106 (665) Q Consensus 39 ~~t~~it~-l~~~~~~~~R~Al~~ws~va~ItF~Ev~~--~~d~~f~~~~------~g~a~~p~~~~---~~~~~~~~~~ 106 (665) .|+|=|.. +...-|+++|.+++.|.....+.|.++.. .+|+++..-+ ...++.|+... +.+....+.. T Consensus 3 pI~~yid~~~p~~~r~ai~~gi~~WN~af~~gFk~a~~~~~~dir~~~~~~~~~~~~~~~ygps~~dp~tgeI~~a~v~~ 82 (165) T cd04268 3 PITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLTGEILLARVYL 82 (165) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEE T ss_conf 87999379999899999999999999998720565122475566321599632689866617854599984188557876 Q ss_pred CCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHH Q ss_conf 44333666667346630032346551878888766676666777541014202446752356776666552000211221 Q gi|254780384|r 107 ADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVS 186 (665) Q Consensus 107 ~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~ 186 (665) .... . .-.+.+..+++.||+||+|||.|.+.... .+.............|||.|...... ......++.+++.+ T Consensus 83 ~~~~-~-~~~~~~~~~v~~HEvGHtLGL~Hn~~~S~---~~~~~~~~~~~~~~~SVMdY~~~n~~-~~~~~~~~~~ig~y 156 (165) T cd04268 83 YSSF-V-EYSGARLRNTAEHELGHALGLRHNFAASD---RDDNVDLLAEKGDTSSVMDYAPSNFS-IQLGDGQKYTIGPY 156 (165) T ss_pred ECHH-H-HHHHHHHHHHHHHHHHHHHCCCCCCCCCC---CCCCHHHHCCCCCCCEEECCCCCCCC-CCCCCCCCCCCCHH T ss_conf 2039-9-99999999999998763505754453201---46844654158998614268775557-68545568998568 Q ss_pred HHHHHHHHC Q ss_conf 489999860 Q gi|254780384|r 187 DIVAVQKIY 195 (665) Q Consensus 187 DI~Alq~lY 195 (665) ||.||+++| T Consensus 157 D~~aI~~~Y 165 (165) T cd04268 157 DIAAIKKLY 165 (165) T ss_pred HHHHHHHCC T ss_conf 999998539 No 9 >cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=99.38 E-value=8.1e-12 Score=88.76 Aligned_cols=157 Identities=18% Similarity=0.174 Sum_probs=84.4 Q ss_pred CCEEEEEE-EECCHHHHHHHHHHHHHHHHHHCCEEEEECC-CCCEEEECCC--CEEEEECCCCCCC-CCCCCCCCCCCCC Q ss_conf 75026887-5079999999999999886574756898247-8716896158--5356631467753-1224433544333 Q gi|254780384|r 37 THRLSFDI-TELNPNAQEVARWALGEWSKVVDLTFEETSI-HSDIKFISSN--NGYVCTPRYDSRY-FMEINFDKADVWR 111 (665) Q Consensus 37 ~~~~t~~i-t~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~-~~d~~f~~~~--~g~a~~p~~~~~~-~~~~~~~~~~~~~ 111 (665) +.+|.+-| .+.+..+|+.+|.|++.|+..++|+|+|++. .+++++.+.. ..+.+.-...-.. -.....+.+ |. T Consensus 6 G~~l~~~f~~~~~~~~r~~V~~a~~~W~~~tnirF~~~~~~~adIRI~F~~~~G~wS~VGtd~~~~~~~~~Tmnlg--~~ 83 (198) T cd04327 6 GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTPGDGYWSYVGTDALLIGADAPTMNLG--WF 83 (198) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEECC--CC T ss_conf 9889999878999999999999999996267567798888888689995549975376686523578888714546--56 Q ss_pred CCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCC----------CCCCCC-------------C------CCCCEEEE Q ss_conf 666667346630032346551878888766676666----------777541-------------0------14202446 Q gi|254780384|r 112 YGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVY----------GVDNDY-------------E------NDSFLTSV 162 (665) Q Consensus 112 ~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~----------~~~~~~-------------~------~d~~~~Tv 162 (665) .....+....-+++||||||||+.|-+......-.. .....+ . ..--..|| T Consensus 84 ~~~~~~~~~~~~~~he~~h~~g~~he~~~~~~~i~Wd~~~v~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~s~yD~~SI 163 (198) T cd04327 84 TDDTPDPEFSRVVLHEFGHALGFIHEHQSPAANIPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDDGDVAYSPYDPDSI 163 (198) T ss_pred CCCCCCCCCCEEHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCE T ss_conf 78775533001005776766426002238767877578999999853789867786654434305623345886996505 Q ss_pred EEECCCCCC-CCCCCCCEECCHHHHHHHHHHHHC Q ss_conf 752356776-666552000211221489999860 Q gi|254780384|r 163 MSYFMPQDS-MMDASFGYCATPMVSDIVAVQKIY 195 (665) Q Consensus 163 MSY~~~~~~-~~~~~~~~~~tpm~~DI~Alq~lY 195 (665) |-|..|..- ........+.++--.|++-|..+| T Consensus 164 MhY~~p~~~t~~~~~~~~n~~LS~~Dk~f~~~~Y 197 (198) T cd04327 164 MHYPFPGSLTLDGEEVPPNRTLSDKDKAFMRLLY 197 (198) T ss_pred EECCCCHHHCCCCEECCCCCCCCHHHHHHHHHHC T ss_conf 5357687562798775788858988999999858 No 10 >smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site. Probab=99.26 E-value=2.5e-11 Score=85.75 Aligned_cols=91 Identities=34% Similarity=0.553 Sum_probs=57.0 Q ss_pred CEEEEEE--EECCHH-HHHHHHHHHHHHHHHHCCEEEEECCCCC--EEEECCCC-----EEEEECCCCCCCCCCCCCCCC Q ss_conf 5026887--507999-9999999999886574756898247871--68961585-----356631467753122443354 Q gi|254780384|r 38 HRLSFDI--TELNPN-AQEVARWALGEWSKVVDLTFEETSIHSD--IKFISSNN-----GYVCTPRYDSRYFMEINFDKA 107 (665) Q Consensus 38 ~~~t~~i--t~l~~~-~~~~~R~Al~~ws~va~ItF~Ev~~~~d--~~f~~~~~-----g~a~~p~~~~~~~~~~~~~~~ 107 (665) .++++.| ..++.. .|+++|.||+.|++++.|+|+|++...+ +.|...+. +++..| . +...++ T Consensus 8 ~~v~Y~i~~~~~~~~~~r~~i~~A~~~w~~~Tci~F~~~~~~~~~~i~f~~~~~~gc~~s~~~~~--~----g~q~~~-- 79 (140) T smart00235 8 GTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGDGSGCTLSHAGRP--G----GDQHFS-- 79 (140) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEECCCCCCCEEEEECC--C----CCEEEE-- T ss_conf 97889988899997999999999999998547863896567887189998688899745763044--8----823345-- Q ss_pred CCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCC Q ss_conf 43336666673466300323465518788887666 Q gi|254780384|r 108 DVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNG 142 (665) Q Consensus 108 ~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~ 142 (665) .+.+.....|++|||||||||.|.+.... T Consensus 80 ------~~~~c~~~g~i~HEigHaLGl~He~~r~D 108 (140) T smart00235 80 ------LGNGCINTGVAAHELGHALGLYHEQSRSD 108 (140) T ss_pred ------CCCCCCCCCHHHHHHHHHCCCCCCCCCCC T ss_conf ------38881666624553453426655457763 No 11 >pfam12388 Peptidase_M57 Dual-action HEIGH metallo-peptidase. The catalytic triad for this family of proteases is HE-H-H, which in many members is in the sequence motif HEIGH. Probab=99.12 E-value=8.4e-10 Score=76.58 Aligned_cols=172 Identities=20% Similarity=0.240 Sum_probs=95.9 Q ss_pred CCCHHHHHHHHHCCEEECCCEEEEECCCCCCCEEEEEE-----EECCHHHHHHHHHHHHHHHHHHC--CEEEEE----CC Q ss_conf 55799999987358354043467505765675026887-----50799999999999998865747--568982----47 Q gi|254780384|r 7 VYTTERIADHLLRDQYWSNYFIRYKPDVLTTHRLSFDI-----TELNPNAQEVARWALGEWSKVVD--LTFEET----SI 75 (665) Q Consensus 7 ~~t~d~~ad~l~~~~~~~~~~~~~~~d~~~~~~~t~~i-----t~l~~~~~~~~R~Al~~ws~va~--ItF~Ev----~~ 75 (665) ..|.+|+.+.+..+.--..-+.+...-....++|++=. +.|+..++++.++|++.|..+ + |+|+-+ .. T Consensus 8 ~~s~~~l~~~~~~~~~~~eqYrT~nlV~~~~r~I~v~~~tg~~~~l~~~~~tal~~AI~~yN~l-~l~l~F~~t~gtn~~ 86 (211) T pfam12388 8 FLTEKELNKLELSEQNDQKQYRTNNLVTGSPRTITIIGYTGGSQALTAKGQTALDDAVNNYNNL-GLDISFRLTFGTNYQ 86 (211) T ss_pred EECHHHHHHHHCCCCCCCEEEEEEEEECCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHC-CCEEEEEEEECCCCC T ss_conf 7719997565504787600520111645898589999827865103178999999999987640-760799998435777 Q ss_pred CCCEEEECC-------CCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCC-C------ Q ss_conf 871689615-------85356631467753122443354433366666734663003234655187888876-6------ Q gi|254780384|r 76 HSDIKFISS-------NNGYVCTPRYDSRYFMEINFDKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSY-N------ 141 (665) Q Consensus 76 ~~d~~f~~~-------~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~-~------ 141 (665) ..|+..... ..+.|.+|...+.-+..+.+ ....+.+. ....+++.|||||.+||-|. || + T Consensus 87 ~~di~v~~~~~~~~~g~ggsAGFP~s~G~P~~~V~i--~~~~~~~~---~~~~~vi~HEiGHciGfRHT-DyfnRsSCG~ 160 (211) T pfam12388 87 NADMVVYDNSVNNPSGSGGSAGFPDSNGDPAKFVQI--YDLENGST---NVNEHVITHEIGHSIGFRHT-DYFDRSSCGQ 160 (211) T ss_pred CCCEEEEECCCCCCCCCEEECCCCCCCCCCCCEEEE--ECCCCCCC---CHHHHHHHHHHHCEECCCCC-CCCCCCCCCC T ss_conf 776799954556788851041288778998866998--23578882---03345452320222222113-5557444466 Q ss_pred -C--CC----CCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHHC Q ss_conf -6--76----666777541014202446752356776666552000211221489999860 Q gi|254780384|r 142 -G--GY----PVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKIY 195 (665) Q Consensus 142 -~--g~----~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~lY 195 (665) + |. .+.-+..|...|. .|||--- ++......+-..||.||++|| T Consensus 161 ~~nEG~agvGAihIpGTPtg~D~--~SiM~aC--------~~~~~~g~F~~~DitAL~~LY 211 (211) T pfam12388 161 GGNEGTGGVGAVYIPGTPTGRDN--TSIMQAC--------FSGGEDGEFNSNDITALLAMY 211 (211) T ss_pred CCCCCCCCCCCEECCCCCCCCCC--CHHHHHH--------CCCCCCCCCCCCCHHHHHHHC T ss_conf 77778776564677899989987--5487600--------048878886742098898639 No 12 >cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning. Probab=98.94 E-value=2.2e-08 Score=68.03 Aligned_cols=97 Identities=20% Similarity=0.314 Sum_probs=61.5 Q ss_pred CCCCEEEEEEE-ECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCE-EEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 56750268875-079999999999999886574756898247871-689615853-566314677531224433544333 Q gi|254780384|r 35 LTTHRLSFDIT-ELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNG-YVCTPRYDSRYFMEINFDKADVWR 111 (665) Q Consensus 35 ~~~~~~t~~it-~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g-~a~~p~~~~~~~~~~~~~~~~~~~ 111 (665) ++..++-+.|. .|+..++.+++.|++.|++.+-|+|++-+...+ +.|.....| +++.-..+.+ .+. T Consensus 10 Wpn~~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~pr~~e~~yi~f~~~~~GC~S~vG~~g~g---------~Q~-- 78 (200) T cd04281 10 WPGGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTPEENYIVFTYRPCGCCSYVGRRGNG---------PQA-- 78 (200) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCCEEECCCCCCCC---------CEE-- T ss_conf 989889899899799999999999999998679765588899987799972899777328830787---------467-- Q ss_pred CCCCCCCEEEEEHHHHHHHHHCCCCCCCCCC Q ss_conf 6666673466300323465518788887666 Q gi|254780384|r 112 YGIGKGTILSHNALHEIGHALGLMHPGSYNG 142 (665) Q Consensus 112 ~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~ 142 (665) ..++.+-...-|++|||+||||+-|-+.-.. T Consensus 79 isl~~~C~~~G~i~HEl~HalGf~HEqsR~D 109 (200) T cd04281 79 ISIGKNCDKFGIVVHELGHVIGFWHEHTRPD 109 (200) T ss_pred EECCCCCCCCCCHHHHHHHHHCCCCCCCCCC T ss_conf 6637876756302999988754655434667 No 13 >pfam01400 Astacin Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family have an amino terminal propeptide which is cleaved to give the active protease domain. All other linked domains are found to the carboxyl terminus of this domain. This family includes: Astacin, a digestive enzyme from Crayfish. Meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain. Proteins involved in morphogenesis, and Tolloid from drosophila. Probab=98.92 E-value=3.4e-08 Score=66.85 Aligned_cols=95 Identities=22% Similarity=0.354 Sum_probs=60.6 Q ss_pred CCEEEEEEE-ECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC---EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 750268875-079999999999999886574756898247871---6896158535663146775312244335443336 Q gi|254780384|r 37 THRLSFDIT-ELNPNAQEVARWALGEWSKVVDLTFEETSIHSD---IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRY 112 (665) Q Consensus 37 ~~~~t~~it-~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d---~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~ 112 (665) ..++-+.|. .+++.++++++.|++.|++.+-|+|++-+...+ +.|.....-+++.-..++. +. . T Consensus 5 ~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~pr~~~~~~~~i~~~~~~gC~S~vG~~gg~----------Q~--i 72 (192) T pfam01400 5 NGPIPYVIDSSFTGLARALIRQAMRHWEQKTCIRFVPRTSAPDNNYLFFFKGDGCYSYVGRNGGA----------QP--V 72 (192) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCEEEEEECCCCEEEECCEECCE----------EE--E T ss_conf 98897998997999999999999999985887537858889887699994399453624700880----------25--8 Q ss_pred CCCCCCEEEEEHHHHHHHHHCCCCCCCCCCC Q ss_conf 6666734663003234655187888876667 Q gi|254780384|r 113 GIGKGTILSHNALHEIGHALGLMHPGSYNGG 143 (665) Q Consensus 113 ~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g 143 (665) .++.+-...-+++|||+||||+-|-+.-... T Consensus 73 slg~gC~~~g~i~HEl~HaLGf~HEhsR~DR 103 (192) T pfam01400 73 SLGNGCDKFGIIVHELGHALGFWHEQSRPDR 103 (192) T ss_pred EECCCCCCCCCCHHHHHHHHCCCCCCCCCCC T ss_conf 8688858677017888888455554467774 No 14 >cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion. Probab=98.87 E-value=3.4e-08 Score=66.86 Aligned_cols=85 Identities=20% Similarity=0.312 Sum_probs=56.6 Q ss_pred EECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCC-CCCEEEE Q ss_conf 5079999999999999886574756898247871-6896158535663146775312244335443336666-6734663 Q gi|254780384|r 45 TELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRYGIG-KGTILSH 122 (665) Q Consensus 45 t~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~-~g~y~~~ 122 (665) ..|++.++.+++.|++.|++.+-|+|++-+...+ ++|.....-+++.-..++. ..+ .++ .+-.... T Consensus 12 ~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~e~~yi~i~~~~GC~S~vGr~gg~----Q~i--------sl~~~gC~~~g 79 (182) T cd04283 12 PQYSENERAVIEKAMQEFETLTCVRFVPRTTERDYLNIESRSGCWSYIGRQGGR----QTV--------SLQKQGCMYKG 79 (182) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCEEECCCEECCE----EEE--------EECCCCCCCCC T ss_conf 989999999999999999858967788578987579995299785405710981----248--------86899978767 Q ss_pred EHHHHHHHHHCCCCCCCCC Q ss_conf 0032346551878888766 Q gi|254780384|r 123 NALHEIGHALGLMHPGSYN 141 (665) Q Consensus 123 t~iHEIGHALGL~Hp~~~~ 141 (665) |++|||+|||||-|-+.-. T Consensus 80 ~i~HEl~HalGf~HEqsR~ 98 (182) T cd04283 80 IIQHELLHALGFYHEQTRS 98 (182) T ss_pred HHHHHHHHHHCCCCCCCCC T ss_conf 2078888884555544666 No 15 >cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development. Probab=98.76 E-value=1.9e-07 Score=62.36 Aligned_cols=89 Identities=20% Similarity=0.331 Sum_probs=58.9 Q ss_pred EEEE-ECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 8875-079999999999999886574756898247871-68961585356631467753122443354433366666734 Q gi|254780384|r 42 FDIT-ELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRYGIGKGTI 119 (665) Q Consensus 42 ~~it-~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~~g~y 119 (665) +-|. .+++..|++++.|++.|++.+-|.|++-+...+ ++|.....-+++.-..++ . +.. .++.+-. T Consensus 6 Y~i~~~~~~~~~~~I~~Am~~~~~~TCirF~pr~~~~~yi~f~~~~gC~S~vG~~~g----~------q~i--sl~~~C~ 73 (180) T cd04280 6 YVIDGSFDESDRSLILRAMREIESNTCIRFVPRTTEKDYIRIVKGSGCWSYVGRVGG----R------QVV--SLGSGCF 73 (180) T ss_pred EEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCEEECCCCCCC----E------EEE--EECCCCC T ss_conf 998898999999999999999986796535888998758999659971361570188----2------368--8478868 Q ss_pred EEEEHHHHHHHHHCCCCCCCCCC Q ss_conf 66300323465518788887666 Q gi|254780384|r 120 LSHNALHEIGHALGLMHPGSYNG 142 (665) Q Consensus 120 ~~~t~iHEIGHALGL~Hp~~~~~ 142 (665) ..-+++|||+|||||-|-+.-.. T Consensus 74 ~~g~i~HEl~HalGf~HE~~R~D 96 (180) T cd04280 74 SLGTIVHELMHALGFYHEQSRPD 96 (180) T ss_pred CCCHHHHHHHHHHCCCCCCCCCC T ss_conf 67633999998745655434667 No 16 >KOG3714 consensus Probab=98.74 E-value=1.4e-07 Score=63.09 Aligned_cols=152 Identities=20% Similarity=0.257 Sum_probs=87.2 Q ss_pred CCCCCCEEEEEEEE-CCHHHHHHHHHHHHHHHHHHCCEEEEECCC-CCEEEECCCCE-EEEECCCCCCCCCCCCCCCCCC Q ss_conf 76567502688750-799999999999998865747568982478-71689615853-5663146775312244335443 Q gi|254780384|r 33 DVLTTHRLSFDITE-LNPNAQEVARWALGEWSKVVDLTFEETSIH-SDIKFISSNNG-YVCTPRYDSRYFMEINFDKADV 109 (665) Q Consensus 33 d~~~~~~~t~~it~-l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~-~d~~f~~~~~g-~a~~p~~~~~~~~~~~~~~~~~ 109 (665) ..++...+-+-|.+ ++..+|.++|.|++.|+..+=|+|+|.+.. -+..+.....| +.++...++.. T Consensus 80 ~~Wp~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~~~~~~~~~~~~gC~S~VGr~gg~~----------- 148 (411) T KOG3714 80 RRWPNGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTPDKDYLIVFTGGGCYSYVGRRGGGQ----------- 148 (411) T ss_pred CCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEEECCCCCCCCCCCCCCCCC----------- T ss_conf 258898122498888898999999999999973956535557887776178727997313127778865----------- Q ss_pred CCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCC---------------C---------CCCCCCCCCCCEEEEEEE Q ss_conf 3366666734663003234655187888876667666---------------6---------777541014202446752 Q gi|254780384|r 110 WRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPV---------------Y---------GVDNDYENDSFLTSVMSY 165 (665) Q Consensus 110 ~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~---------------~---------~~~~~~~~d~~~~TvMSY 165 (665) -...++.+-..+-|++|||+||||+-|-|.-...+.. + ....+|+ --|||-| T Consensus 149 q~~sl~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYD----ygSvMHY 224 (411) T KOG3714 149 QLLSLGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYD----YGSVMHY 224 (411) T ss_pred CCCCCCCCCCCCCEEHHHHHHHHHHHCCCCCCCCCCEEEEEHHHCCHHHHHHHHHCCHHHHHCCCCCCC----CCCCCCC T ss_conf 321117874767610566776641310256545436279841308866655566468645421598566----7752464 Q ss_pred CCCCC---CCC----CC--C----CCEECCHHHHHHHHHHHHCCCCC Q ss_conf 35677---666----65--5----20002112214899998608766 Q gi|254780384|r 166 FMPQD---SMM----DA--S----FGYCATPMVSDIVAVQKIYGAPK 199 (665) Q Consensus 166 ~~~~~---~~~----~~--~----~~~~~tpm~~DI~Alq~lYGan~ 199 (665) ..... ... .. . -+-...+-..||.-|-.||.-.. T Consensus 225 ~~~afs~~~~~~ti~~~~~~~~~~mGqr~~~S~~Di~~iN~~Y~C~~ 271 (411) T KOG3714 225 APYAFSKNGSLPTIVPKDNGFQNTMGQRERLSFYDIRKINKLYCCPE 271 (411) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 88666658877602036665224557647687878999876208987 No 17 >cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells. Probab=98.67 E-value=2.5e-07 Score=61.70 Aligned_cols=91 Identities=22% Similarity=0.254 Sum_probs=59.8 Q ss_pred EEEEEE-EECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 026887-5079999999999999886574756898247871-68961585356631467753122443354433366666 Q gi|254780384|r 39 RLSFDI-TELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRYGIGK 116 (665) Q Consensus 39 ~~t~~i-t~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~~ 116 (665) ++-+.| ..++...++.++.|++.|++.+-|+|++-+...+ +.|.....-+++.-..++. +. ..++. T Consensus 49 ~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~e~~yi~~~~~~GC~S~VG~~gg~----------Q~--vslg~ 116 (230) T cd04282 49 PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGESNYIFFFKGSGCWSMVGDQQGG----------QN--LSIGA 116 (230) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCCCCCCCCCCCE----------EE--EEECC T ss_conf 1669988879999999999999999858954148678987559995699511422653880----------56--65078 Q ss_pred CCEEEEEHHHHHHHHHCCCCCCCCC Q ss_conf 7346630032346551878888766 Q gi|254780384|r 117 GTILSHNALHEIGHALGLMHPGSYN 141 (665) Q Consensus 117 g~y~~~t~iHEIGHALGL~Hp~~~~ 141 (665) |-....|++|||+||||+-|-+.-. T Consensus 117 gC~~~G~i~HEl~HaLGf~HEqsR~ 141 (230) T cd04282 117 GCDYKATVEHEFLHALGFYHEQSRS 141 (230) T ss_pred CCCCCCHHHHHHHHHHCCCCCCCCC T ss_conf 8787766799999885566542366 No 18 >cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin. Probab=98.49 E-value=3.6e-08 Score=66.76 Aligned_cols=157 Identities=20% Similarity=0.231 Sum_probs=82.4 Q ss_pred CCCEEEEEEEECC-HHHHHHHHHHHHHHHHHHC-CEE-----EEEC----CCCCEEE-----ECC-CCEEEEECCCCC-- Q ss_conf 6750268875079-9999999999998865747-568-----9824----7871689-----615-853566314677-- Q gi|254780384|r 36 TTHRLSFDITELN-PNAQEVARWALGEWSKVVD-LTF-----EETS----IHSDIKF-----ISS-NNGYVCTPRYDS-- 96 (665) Q Consensus 36 ~~~~~t~~it~l~-~~~~~~~R~Al~~ws~va~-ItF-----~Ev~----~~~d~~f-----~~~-~~g~a~~p~~~~-- 96 (665) +.+-|.|=|.... ..-++++|+....|....- +=| +++. +.+|++. +.+ +.+.++.|+... T Consensus 6 p~~PIvfyid~~~P~~~r~ai~~Gv~~WN~AFE~aGfknAi~vk~~P~d~d~~DiRyn~Irw~~s~~~~~ayGps~~dPr 85 (197) T cd04276 6 PKEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSPNGGWAYGPSVVDPR 85 (197) T ss_pred CCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCC T ss_conf 87887999459998899999999999999999872775416975089997977742668999637987666688757999 Q ss_pred -CCCCCCCC-------CCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCC-CC----CCCCCEEEEE Q ss_conf -53122443-------35443336666673466300323465518788887666766667775-41----0142024467 Q gi|254780384|r 97 -RYFMEINF-------DKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDN-DY----ENDSFLTSVM 163 (665) Q Consensus 97 -~~~~~~~~-------~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~-~~----~~d~~~~TvM 163 (665) +-+....+ .....|.. .-.+..-++++.||+||+|||.|=+- ++..+..+. .. ......-||| T Consensus 86 TGEIl~a~I~~~~~~~~~~~~~~~-~~~~~~lr~~~aHEVGHtLGL~HNf~---aS~~~~~~~l~~~~~t~~~g~~~SVM 161 (197) T cd04276 86 TGEILKADVILYSGFLRQDQLWYE-DLLAASLRYLLAHEVGHTLGLRHNFK---ASSDGSNEELEDPLGTKEKGATSSVM 161 (197) T ss_pred CCCEEEEEEEEEHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHCCCCCCCC---HHCCCCHHHCCCHHHHHCCCCCCEEC T ss_conf 866776578874177654799999-89999999999987640316533010---00258845402887772178754632 Q ss_pred EECCCCCC--CCCCCCCEECCHHHHHHHHHHHHCC Q ss_conf 52356776--6665520002112214899998608 Q gi|254780384|r 164 SYFMPQDS--MMDASFGYCATPMVSDIVAVQKIYG 196 (665) Q Consensus 164 SY~~~~~~--~~~~~~~~~~tpm~~DI~Alq~lYG 196 (665) -|....-. ..+....++.+++.||..||++-|. T Consensus 162 DY~~~N~~~~~~d~~~~~~~~~G~YD~~AI~~gY~ 196 (197) T cd04276 162 DYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYT 196 (197) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCC T ss_conf 57432213476546531579985042667301357 No 19 >pfam08548 Peptidase_M10_C Peptidase M10 serralysin C terminal. Serralysins are peptidases related to mammalian matrix metallopeptidases (MMPs). The peptidase unit is found at the N terminal while this domain at the C terminal forms a corkscrew and is thought to be important for secretion of the protein through the bacterial cell wall. This domain contains the calcium ion binding domain pfam00353. Probab=98.45 E-value=2.9e-06 Score=55.23 Aligned_cols=55 Identities=24% Similarity=0.434 Sum_probs=43.8 Q ss_pred CEEEECCCEEEEEECCCCCCEEEECCCCCEEECCCCCCEEECCCCCEEEEEECCC Q ss_conf 5477348257446617765168826667468667863458888873389982366 Q gi|254780384|r 259 NMTIAQHTVIESVVGSSGNDVVIGNSANNVFFESSGNDVFDGSSGLDTFFYYAPS 313 (665) Q Consensus 259 n~~Ia~gt~IEna~g~~g~D~i~gn~~~n~l~gg~g~d~~~g~~g~Dt~~~~~~~ 313 (665) --.+-.++..+.++|..++|+|.|+.++++|.|+.|+|+|.||.|.|++++.... T Consensus 81 IEn~~GgsG~D~l~Gn~~~n~L~Gg~G~D~l~Gg~G~D~l~GG~G~D~f~~~~g~ 135 (220) T pfam08548 81 IENAIGGSGNDLLIGNDAANILKGGAGNDIIYGGGGADQLWGGAGKDTFVYGAAS 135 (220) T ss_pred EEEEECCCCCCEEECCCCCCEEECCCCCCEEECCCCCCEEECCCCCEEEEECCCC T ss_conf 6688666767689878888557878998868777887189889983499840766 No 20 >COG2931 RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism] Probab=98.40 E-value=1.9e-06 Score=56.36 Aligned_cols=60 Identities=27% Similarity=0.302 Sum_probs=35.1 Q ss_pred CEEEEEECCCCCCEEEECCCCC--EEECCCCCCEEECC-CCCEEEEEECCCCCEEEEEECCCE Q ss_conf 2574466177651688266674--68667863458888-873389982366513787205825 Q gi|254780384|r 266 TVIESVVGSSGNDVVIGNSANN--VFFESSGNDVFDGS-SGLDTFFYYAPSYLYKIYRFENAI 325 (665) Q Consensus 266 t~IEna~g~~g~D~i~gn~~~n--~l~gg~g~d~~~g~-~g~Dt~~~~~~~~~~~~~~~~~~~ 325 (665) ...++++++.++|+|+++...+ +|.++.|+|++.++ .+.|+++-........+....+.. T Consensus 155 ~~~~~l~g~~g~d~l~~~~~~~~~~~~~~~G~d~~~~~~~~~d~~~~~~~~~~~~~~~~~g~d 217 (510) T COG2931 155 NGNDILIGGSGNDTLTGNSGADDRVINGGAGDDTILGGGNGDDTVFGGGSGGNATLDGGNGGD 217 (510) T ss_pred CCCCEECCCCCCCEEECCCCCCCEEEECCCCCCEEECCCCCCCEEEECCCCCCCCCCCCCCCC T ss_conf 223232157766358146666510575243640330456677416712346750012677877 No 21 >COG2931 RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism] Probab=98.18 E-value=7.3e-06 Score=52.82 Aligned_cols=127 Identities=20% Similarity=0.201 Sum_probs=77.2 Q ss_pred EEEEECCCCCCEEEECCCCCEE-ECCCCCCEEECCCCCEEEEEECCCCCEEEEEECCCEEEEECCCCCCEEEEEEE--EE Q ss_conf 7446617765168826667468-66786345888887338998236651378720582599956888725675467--99 Q gi|254780384|r 268 IESVVGSSGNDVVIGNSANNVF-FESSGNDVFDGSSGLDTFFYYAPSYLYKIYRFENAIIVYDANEHKQDFLSDVE--RL 344 (665) Q Consensus 268 IEna~g~~g~D~i~gn~~~n~l-~gg~g~d~~~g~~g~Dt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~d~~~~iE--~~ 344 (665) .+.+.|..++|++.|..+++.| .++.|+|+|.|+.|+|++++.... ..++...+...+..+ .+.|++.... .. T Consensus 275 ~d~l~G~~g~d~~~g~~g~d~~~~gg~g~d~l~gg~gnd~~~~~~~~--d~i~~~~~~~~~~gg--~g~d~~~~~~~~~~ 350 (510) T COG2931 275 NDTLKGGAGNDTLLGGAGNDTLTIGGGGNDTLDGGAGNDTLDFSGGD--DTIYGGAGNDTLDGG--AGNDTLAGNAGALA 350 (510) T ss_pred CCCCCCCCCCCEEECCCCCCEEECCCCCCEECCCCCCCCEEECCCCC--CEEEECCCCCCEECC--CCCCEEEECCCCCE T ss_conf 45334677864474456754121167662332466777679513776--337703576535325--67662550334304 Q ss_pred EECCCE--ECCCCCCCCCEEECCCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCC Q ss_conf 963840--012444442024245787516764034201154445641434586313 Q gi|254780384|r 345 RFADQC--IDSVNVKQRSILEYTASYEDLIQVIGQDVFESMKHFYEFGLSEGRDIT 398 (665) Q Consensus 345 ~f~d~t--~~~~~~~~~d~l~~~a~~ddl~~~~G~d~~~~~~hy~~~G~~EGrd~~ 398 (665) .|..+. ....+...++.+...+++|.|.+..|.|++.+..+.+.+-++-|.|.- T Consensus 351 ~~~gg~g~d~i~g~~~~~~l~Gg~gnd~l~Gg~G~D~l~gg~G~D~l~GG~G~D~f 406 (510) T COG2931 351 LLNGGDGNDTISGNDGNDTLIGGAGNDTLSGGAGSDTLVGGGGNDTLTGGAGADTF 406 (510) T ss_pred ECCCCCCCCEEECCCCCCEEECCCCCCEEECCCCCCEECCCCCCCEEECCCCCCEE T ss_conf 42588654243047764435324566556536666144168775568768765457 No 22 >pfam05572 Peptidase_M43 Pregnancy-associated plasma protein-A. Pregnancy-associated plasma protein A (PAPP-A) is a metallo-protease belonging to Merops family M43. It cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the human ovary and the cardiovascular system. Probab=97.70 E-value=1.8e-05 Score=50.39 Aligned_cols=55 Identities=25% Similarity=0.207 Sum_probs=30.0 Q ss_pred CCEEEEECCCCCCCCCC--CCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCC Q ss_conf 85356631467753122--4433544333666667346630032346551878888766 Q gi|254780384|r 85 NNGYVCTPRYDSRYFME--INFDKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYN 141 (665) Q Consensus 85 ~~g~a~~p~~~~~~~~~--~~~~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~ 141 (665) ..|++.+|......... +.++....... ....+. ..|++|||||-|||-|+|..+ T Consensus 34 ~lGyA~~P~~~~~~d~~~~v~~~~~~~G~~-~~~~~~-g~TltHE~GH~lgL~Htf~~~ 90 (152) T pfam05572 34 NSGVAWYPDSGMSADGVARVTFNGAYLGAD-STSTNF-SSTLTHEFGHFLGLRHTFEGG 90 (152) T ss_pred CCEEEECCCCCCCCCCCEEEEEECCCCCCC-CCCCCC-CCCEEEECHHHHCCCCCCCCC T ss_conf 286874899988888732899805425677-776666-650245312241743312689 No 23 >cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved. Probab=97.37 E-value=7.8e-05 Score=46.60 Aligned_cols=55 Identities=24% Similarity=0.271 Sum_probs=30.3 Q ss_pred CCEEEEECCCCCCC--CC-CCCCCCCCCCCCCCCCCCEE-EEEHHHHHHHHHCCCCCCCCC Q ss_conf 85356631467753--12-24433544333666667346-630032346551878888766 Q gi|254780384|r 85 NNGYVCTPRYDSRY--FM-EINFDKADVWRYGIGKGTIL-SHNALHEIGHALGLMHPGSYN 141 (665) Q Consensus 85 ~~g~a~~p~~~~~~--~~-~~~~~~~~~~~~~~~~g~y~-~~t~iHEIGHALGL~Hp~~~~ 141 (665) ..|++.+|...... .. .+.++.... .+.....|. -.|++|||||-|||-|.|... T Consensus 100 ~lG~a~fP~~~~~~~~~~dGvvi~~~~~--~~~~~~~~~~g~TltHEvGH~lGL~Htf~~~ 158 (225) T cd04275 100 LLGYATFPDSLVSLAFITDGVVINPSSL--PGGSAAPYNLGDTATHEVGHWLGLYHTFQGG 158 (225) T ss_pred EEEEEECCCCCCCCCCCCCEEEEECCEE--CCCCCCCCCCCCEEEEECCHHCCCEEECCCC T ss_conf 0189868988889885378799916362--7987777665431466622010631001589 No 24 >pfam02031 Peptidase_M7 Streptomyces extracellular neutral proteinase (M7) family. Probab=97.04 E-value=0.001 Score=39.83 Aligned_cols=89 Identities=19% Similarity=0.278 Sum_probs=45.0 Q ss_pred EEEEEEEECCHHHHHHHHHHHHHH-HHHHCCEEEEECCCCCEEEECCCCE-EEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 026887507999999999999988-6574756898247871689615853-56631467753122443354433366666 Q gi|254780384|r 39 RLSFDITELNPNAQEVARWALGEW-SKVVDLTFEETSIHSDIKFISSNNG-YVCTPRYDSRYFMEINFDKADVWRYGIGK 116 (665) Q Consensus 39 ~~t~~it~l~~~~~~~~R~Al~~w-s~va~ItF~Ev~~~~d~~f~~~~~g-~a~~p~~~~~~~~~~~~~~~~~~~~~~~~ 116 (665) +++++.+.--.-..+.+ .+.+.| +.|.|++++| ...+++++...+.. ..+.+..+.+ -+.++..... .. T Consensus 3 Tv~YdaS~A~~f~s~ia-~~aaiWN~sV~NV~L~~-g~~a~~~~~~~~~~~gs~a~t~g~G-~g~i~~~~~~------~q 73 (132) T pfam02031 3 TVTYDASNAPSFRSQIA-RSAQIWNSSVSNVRLQE-GSNADFTYYEGNDSRGSYASTDGHG-RGYIFLDYRQ------NQ 73 (132) T ss_pred EEEECCCCCCHHHHHHH-HHHHHHHCCCCCEEEEC-CCCCCEEEEECCCCCCCCCCCCCCC-CEEEEEEHHH------HC T ss_conf 99974777831799999-99988745577467504-8887779995788887656437998-4789724388------65 Q ss_pred CCEEEEEHHHHHHHHHCCCC Q ss_conf 73466300323465518788 Q gi|254780384|r 117 GTILSHNALHEIGHALGLMH 136 (665) Q Consensus 117 g~y~~~t~iHEIGHALGL~H 136 (665) +-..--..-|||||.|||-- T Consensus 74 gy~~~RI~aHE~GH~LGLPD 93 (132) T pfam02031 74 QYDSTRVTAHETGHVLGLPD 93 (132) T ss_pred CCCCHHHHHHHHCCCCCCCC T ss_conf 88723110013222247998 No 25 >COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones] Probab=96.97 E-value=0.00048 Score=41.84 Aligned_cols=92 Identities=24% Similarity=0.255 Sum_probs=53.4 Q ss_pred ECCHHH--HHHHHHHHHHHHHHHCCEEEEECCCCCEEEECCCC---EEEEECCCCCCCCCCC--CCCC--CC-----CCC Q ss_conf 079999--99999999988657475689824787168961585---3566314677531224--4335--44-----333 Q gi|254780384|r 46 ELNPNA--QEVARWALGEWSKVVDLTFEETSIHSDIKFISSNN---GYVCTPRYDSRYFMEI--NFDK--AD-----VWR 111 (665) Q Consensus 46 ~l~~~~--~~~~R~Al~~ws~va~ItF~Ev~~~~d~~f~~~~~---g~a~~p~~~~~~~~~~--~~~~--~~-----~~~ 111 (665) ....++ ++++.+|.|.|.+..++.-+|-++-+||++.-.+. |....+.........- +.+. .. ..+ T Consensus 98 n~~~ap~wq~a~~tava~wa~~fpl~ive~~eeaDItie~~n~pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~ 177 (236) T COG5549 98 NVEGAPRWQGAYLTAVAGWAKTFPLIIVERFEEADITIEVGNPPGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIM 177 (236) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCEEEECCEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 32116068889999999898738805640120200799941789876223236778888777865036752113430025 Q ss_pred CCC-CCCCEEEEEHHHHHHHHHCCC-CC Q ss_conf 666-667346630032346551878-88 Q gi|254780384|r 112 YGI-GKGTILSHNALHEIGHALGLM-HP 137 (665) Q Consensus 112 ~~~-~~g~y~~~t~iHEIGHALGL~-Hp 137 (665) ..+ -...+--.|+.||||||||+- |. T Consensus 178 ~~pg~~~e~L~~tarhElGhaLgi~ghs 205 (236) T COG5549 178 YPPGELRENLNPTARHELGHALGIWGHS 205 (236) T ss_pred CCCCCCHHHHHHHHHHHHCCHHEECCCC T ss_conf 7864016664478887632021111334 No 26 >COG1913 Predicted Zn-dependent proteases [General function prediction only] Probab=96.06 E-value=0.0019 Score=38.19 Aligned_cols=17 Identities=53% Similarity=0.872 Sum_probs=14.2 Q ss_pred EEEHHHHHHHHHCCCCC Q ss_conf 63003234655187888 Q gi|254780384|r 121 SHNALHEIGHALGLMHP 137 (665) Q Consensus 121 ~~t~iHEIGHALGL~Hp 137 (665) +--.+|||||.|||+|= T Consensus 125 ~KEv~HElGH~~GL~HC 141 (181) T COG1913 125 VKEVLHELGHLLGLSHC 141 (181) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 99998876534174658 No 27 >cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods. Probab=96.01 E-value=0.0072 Score=34.74 Aligned_cols=22 Identities=36% Similarity=0.390 Sum_probs=18.0 Q ss_pred EEEEHHHHHHHHHCCCCCCCCC Q ss_conf 6630032346551878888766 Q gi|254780384|r 120 LSHNALHEIGHALGLMHPGSYN 141 (665) Q Consensus 120 ~~~t~iHEIGHALGL~Hp~~~~ 141 (665) +..|+.|||||.||+.|-++.. T Consensus 145 ~~~t~AHElGH~lG~~HD~~~~ 166 (220) T cd04272 145 GVYTMTHELAHLLGAPHDGSPP 166 (220) T ss_pred HHHHHHHHHHHHCCCCCCCCCC T ss_conf 0334567667642886677888 No 28 >cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. Probab=95.92 E-value=0.0073 Score=34.70 Aligned_cols=38 Identities=29% Similarity=0.366 Sum_probs=24.5 Q ss_pred EEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC Q ss_conf 66300323465518788887666766667775410142024467523 Q gi|254780384|r 120 LSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYF 166 (665) Q Consensus 120 ~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~ 166 (665) .-.|+.|||||.||+.|-.+...-. .......-|||+. T Consensus 133 ~a~t~AHElGH~lGm~HD~~~~~~~---------~~~~~~~~IMs~~ 170 (192) T cd04267 133 TALTMAHELGHNLGAEHDGGDELAF---------ECDGGGNYIMAPV 170 (192) T ss_pred EEEEHHHHHHHHCCCCCCCCCCCCC---------CCCCCCCEECCCC T ss_conf 1212256666344886789987467---------7899998871687 No 29 >pfam05547 Peptidase_M6 Immune inhibitor A peptidase M6. The insect pathogenic Gram-positive Bacillus thuringiensis secretes immune inhibitor A, a metallopeptidase, which specifically cleaves host antibacterial proteins. A homologue of immune inhibitor A, PrtV, has been identified in the Gram-negative human pathogen Vibrio cholerae. Probab=95.61 E-value=0.016 Score=32.60 Aligned_cols=73 Identities=22% Similarity=0.095 Sum_probs=39.9 Q ss_pred CCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHH Q ss_conf 66734663003234655187888876667666677754101420244675235677666655200021122148999986 Q gi|254780384|r 115 GKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKI 194 (665) Q Consensus 115 ~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~l 194 (665) .+.+-+.-++.||.||.||| |=+|+-... ...+....|++||...-..... ..-|..+-.+.-+=+|.. T Consensus 217 qPed~~~GVfaHEfGHdLGL--PD~YDT~~~------g~g~~vg~WslMssGSw~G~~~---gt~p~~fs~~~k~~lq~~ 285 (646) T pfam05547 217 QPEDGAAGVFAHEYGHDLGL--PDEYDTQYT------GGGEPVGYWSLMSSGSWAGKIA---GTEPTSFSPQNKEFLQKQ 285 (646) T ss_pred CCCCCCEEEEEEECCCCCCC--CCCCCCCCC------CCCCCCCCEEEECCCCCCCCCC---CCCCCCCCHHHHHHHHHH T ss_conf 56677347898413444689--750136777------8998744366522675378668---988887687889999886 Q ss_pred CCCC Q ss_conf 0876 Q gi|254780384|r 195 YGAP 198 (665) Q Consensus 195 YGan 198 (665) ||-+ T Consensus 286 ~Gg~ 289 (646) T pfam05547 286 IGGN 289 (646) T ss_pred HCCC T ss_conf 4589 No 30 >PRK13267 archaemetzincin-like protein; Reviewed Probab=95.22 E-value=0.0064 Score=35.06 Aligned_cols=19 Identities=42% Similarity=0.690 Sum_probs=15.6 Q ss_pred EEHHHHHHHHHCCCCCCCC Q ss_conf 3003234655187888876 Q gi|254780384|r 122 HNALHEIGHALGLMHPGSY 140 (665) Q Consensus 122 ~t~iHEIGHALGL~Hp~~~ 140 (665) -..+|||||.+||+|=-++ T Consensus 125 Ke~~HE~GH~~GL~HC~~~ 143 (177) T PRK13267 125 KEVTHELGHTLGLEHCDNP 143 (177) T ss_pred HHHHHHHHHHCCCCCCCCC T ss_conf 8874355766376568999 No 31 >pfam10462 Peptidase_M66 Peptidase M66. This family of metallopeptidases contains StcE, a virulence factor found in Shiga toxigenic Escherichia coli organisms. StcE peptidase cleaves C1 esterase inhibitor. Probab=95.17 E-value=0.0053 Score=35.55 Aligned_cols=16 Identities=44% Similarity=0.578 Sum_probs=14.1 Q ss_pred EEHHHHHHHHHCCCCC Q ss_conf 3003234655187888 Q gi|254780384|r 122 HNALHEIGHALGLMHP 137 (665) Q Consensus 122 ~t~iHEIGHALGL~Hp 137 (665) -.++||+||+.||.|= T Consensus 194 nefsHE~GHa~GL~Hy 209 (304) T pfam10462 194 NEFSHEQGHNYGLGHY 209 (304) T ss_pred CEEEECCHHHCCCCCC T ss_conf 7547202101467867 No 32 >pfam07998 Peptidase_M54 Peptidase family M54. This is a family of metallopeptidases. Two human proteins have been reported to degrade synthetic substrates and peptides. Probab=95.05 E-value=0.008 Score=34.47 Aligned_cols=21 Identities=33% Similarity=0.607 Sum_probs=16.7 Q ss_pred EEEEHHHHHHHHHCCCCCCCC Q ss_conf 663003234655187888876 Q gi|254780384|r 120 LSHNALHEIGHALGLMHPGSY 140 (665) Q Consensus 120 ~~~t~iHEIGHALGL~Hp~~~ 140 (665) ..-..+|||||.+||+|=-++ T Consensus 126 ~~Ke~~HElGH~fGL~HC~~~ 146 (176) T pfam07998 126 VVKEVTHELGHTYGLSHCNNT 146 (176) T ss_pred HHHHHHHHHHHHCCCCCCCCC T ss_conf 999999998886277568999 No 33 >cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined. Probab=94.78 E-value=0.0088 Score=34.22 Aligned_cols=21 Identities=33% Similarity=0.447 Sum_probs=18.6 Q ss_pred CCEEEEEHHHHHHHHHCCCCC Q ss_conf 734663003234655187888 Q gi|254780384|r 117 GTILSHNALHEIGHALGLMHP 137 (665) Q Consensus 117 g~y~~~t~iHEIGHALGL~Hp 137 (665) -++.++++-|||||.||.-|- T Consensus 142 t~~~w~v~AHEiGHnfGa~HD 162 (228) T cd04271 142 TSNEWQVFAHEIGHTFGAVHD 162 (228) T ss_pred ECCCEEEEEEHHHCCCCCCCC T ss_conf 536405633013122578667 No 34 >cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. Probab=94.76 E-value=0.012 Score=33.47 Aligned_cols=19 Identities=37% Similarity=0.606 Sum_probs=15.6 Q ss_pred EEHHHHHHHHHCCCCCCCC Q ss_conf 3003234655187888876 Q gi|254780384|r 122 HNALHEIGHALGLMHPGSY 140 (665) Q Consensus 122 ~t~iHEIGHALGL~Hp~~~ 140 (665) .++.|||||.||+.|-.++ T Consensus 133 ~~~AHElGH~lG~~HD~~~ 151 (194) T cd04269 133 VTMAHELGHNLGMEHDDGG 151 (194) T ss_pred HHHHHHHHHHCCCCCCCCC T ss_conf 9998998864488668998 No 35 >pfam11350 DUF3152 Protein of unknown function (DUF3152). Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function. Probab=94.66 E-value=0.022 Score=31.79 Aligned_cols=87 Identities=20% Similarity=0.244 Sum_probs=50.7 Q ss_pred HHHHHHHHHHH---HHHHHHHCCEEEEECCC-CCEEEECCCCEE---EEECC----------CCCCCCCCC-CCCCCCCC Q ss_conf 99999999999---98865747568982478-716896158535---66314----------677531224-43354433 Q gi|254780384|r 49 PNAQEVARWAL---GEWSKVVDLTFEETSIH-SDIKFISSNNGY---VCTPR----------YDSRYFMEI-NFDKADVW 110 (665) Q Consensus 49 ~~~~~~~R~Al---~~ws~va~ItF~Ev~~~-~d~~f~~~~~g~---a~~p~----------~~~~~~~~~-~~~~~~~~ 110 (665) +.-.+++...| ..|...-.+.|+.|... .|+++...+-.. ...+. .....++.. |......+ T Consensus 51 ~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~g~~Df~i~Lasp~Tt~~lC~gl~t~~e~SC~~~~~VviN~~RW~~ga~~f 130 (189) T pfam11350 51 DAFAAMVDATLADPRGWTHDGTFAFRRVDSGAPDFRISLASPGTTRELCAGLDTGGEVSCYNGDRVVINEARWVRGAPPF 130 (189) T ss_pred HHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCHHHCCCCCCCCCEEEECCCCEEEEEHHHHHCCCCCC T ss_conf 99999999985699876668861589867899988999778660743255767687787469997998789965368653 Q ss_pred CCCCCCCCEEEEEHHHHHHHHHCCCCC Q ss_conf 366666734663003234655187888 Q gi|254780384|r 111 RYGIGKGTILSHNALHEIGHALGLMHP 137 (665) Q Consensus 111 ~~~~~~g~y~~~t~iHEIGHALGL~Hp 137 (665) +-...+|--+.+-||+||+||-.|- T Consensus 131 --~gdl~~YR~y~INHEVGH~LG~~H~ 155 (189) T pfam11350 131 --EGDLGSYRQYVINHEVGHAIGYGHE 155 (189) T ss_pred --CCCHHHHHHHHHHHHHHHHCCCCCC T ss_conf --6589999999870323201255776 No 36 >cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix. Probab=94.52 E-value=0.015 Score=32.81 Aligned_cols=20 Identities=40% Similarity=0.733 Sum_probs=16.3 Q ss_pred EEEHHHHHHHHHCCCCCCCC Q ss_conf 63003234655187888876 Q gi|254780384|r 121 SHNALHEIGHALGLMHPGSY 140 (665) Q Consensus 121 ~~t~iHEIGHALGL~Hp~~~ 140 (665) -.|+.|||||.||+.|-.+. T Consensus 141 a~t~AHElGHnLGm~HD~~~ 160 (207) T cd04273 141 AFTIAHELGHVLGMPHDGDG 160 (207) T ss_pred HHHHHHHHHHHCCCCCCCCC T ss_conf 99999987764298668999 No 37 >pfam01421 Reprolysin Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins, e.g. fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. Probab=94.28 E-value=0.016 Score=32.70 Aligned_cols=19 Identities=37% Similarity=0.681 Sum_probs=15.5 Q ss_pred EEEHHHHHHHHHCCCCCCC Q ss_conf 6300323465518788887 Q gi|254780384|r 121 SHNALHEIGHALGLMHPGS 139 (665) Q Consensus 121 ~~t~iHEIGHALGL~Hp~~ 139 (665) -.|+.|||||.||+.|-.+ T Consensus 132 a~t~AHElGH~LG~~HD~~ 150 (198) T pfam01421 132 AVTMAHELGHNLGMTHDDI 150 (198) T ss_pred EEEHHHHHHHHCCCCCCCC T ss_conf 1001233451157757899 No 38 >pfam05548 Peptidase_M11 Gametolysin peptidase M11. In the unicellular biflagellated alga, Chlamydomonas reinhardtii, gametolysin, a zinc-containing metallo-protease, is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox. Probab=94.01 E-value=0.021 Score=31.97 Aligned_cols=32 Identities=25% Similarity=0.321 Sum_probs=23.4 Q ss_pred CCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCC Q ss_conf 43336666673466300323465518788887 Q gi|254780384|r 108 DVWRYGIGKGTILSHNALHEIGHALGLMHPGS 139 (665) Q Consensus 108 ~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~ 139 (665) ..|....+.+...+.|++||.+|-+||.|.+. T Consensus 124 ~~w~~~~~~~~~~~~t~~hE~~HN~GL~Ha~~ 155 (303) T pfam05548 124 QTWLQTSGYGVQRWATIMQEAIHNYGLWHSWR 155 (303) T ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 07999656555661056664533323110446 No 39 >pfam12044 Metallopep Putative peptidase family. This family of proteins is functionally uncharacterized. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. Probab=93.85 E-value=0.021 Score=31.89 Aligned_cols=21 Identities=48% Similarity=0.661 Sum_probs=15.3 Q ss_pred EEEEEHHHHHHHHHCCCCCCC Q ss_conf 466300323465518788887 Q gi|254780384|r 119 ILSHNALHEIGHALGLMHPGS 139 (665) Q Consensus 119 y~~~t~iHEIGHALGL~Hp~~ 139 (665) .+.-.++|||||++||.|.-+ T Consensus 312 i~lGA~lHEiGH~fg~pH~~~ 332 (419) T pfam12044 312 ITIGAFLHEIGHLFGLPHQED 332 (419) T ss_pred HHHHHHHHHHHHHCCCCCCCC T ss_conf 312088775464248999988 No 40 >TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than pfam05547. Probab=92.61 E-value=0.078 Score=28.49 Aligned_cols=41 Identities=24% Similarity=-0.009 Sum_probs=25.5 Q ss_pred CEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC Q ss_conf 3466300323465518788887666766667775410142024467523 Q gi|254780384|r 118 TILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYF 166 (665) Q Consensus 118 ~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~ 166 (665) .-+.-|+.||.||.|||-.-=++..... ..--..|.+|+.. T Consensus 163 ~~~igv~~HE~gH~lGlPDlYd~~~~~~--------~~gvG~wdlM~~G 203 (286) T TIGR03296 163 DGGVGVIAHELGHDLGLPDLYDTSYDGG--------GEPVGYWSLMSSG 203 (286) T ss_pred CCCEEEEEHHHHCCCCCCCCCCCCCCCC--------CCCCCCEEECCCC T ss_conf 8735567522214038975355567888--------8888761121488 No 41 >pfam11996 DUF3491 Protein of unknown function (DUF3491). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 286 to 3225 amino acids in length. This protein is found associated with pfam04488. This protein is found associated with pfam04488. Probab=90.66 E-value=0.93 Score=21.99 Aligned_cols=32 Identities=16% Similarity=0.014 Sum_probs=12.5 Q ss_pred EEEEECCCCCEEEEE--EECCCCCCEEECCCCCC Q ss_conf 898722897325998--50478742474458731 Q gi|254780384|r 219 IQTIYDSGGDDLLDL--TLIGGVKNYIDLNTESW 250 (665) Q Consensus 219 ~~tI~D~gG~DTid~--s~~~~~~~~IdL~~g~~ 250 (665) .++-.|..+.-||+. ...+....+|||+.--. T Consensus 347 i~~~s~~~~~~~i~I~LadD~~~pqtiDls~ivP 380 (934) T pfam11996 347 IFSSSPTVQTKTIDIILADDNDHPQTIDLSSIVP 380 (934) T ss_pred EEECCCCCCCEEEEEEEECCCCCCCEEEHHHCCC T ss_conf 9804876783479999925899853678536050 No 42 >TIGR00181 pepF oligoendopeptidase F; InterPro: IPR004438 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), the type example being oligoendopeptidase F from Lactococcus lactis. The enzyme hydrolyses peptides of 7 and 17 amino acids with fairly broad specificity. Differences in substrate specificity should be expected in other species. The gene is duplicated in L. lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis. Probab=90.33 E-value=0.075 Score=28.59 Aligned_cols=67 Identities=19% Similarity=0.376 Sum_probs=33.9 Q ss_pred EEECCHHHHHHHHHHH-HH-HHHHHCCEEEEECCCCCEEEECCCCEEEEECCCC--CCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 7507999999999999-98-8657475689824787168961585356631467--753122443354433366666734 Q gi|254780384|r 44 ITELNPNAQEVARWAL-GE-WSKVVDLTFEETSIHSDIKFISSNNGYVCTPRYD--SRYFMEINFDKADVWRYGIGKGTI 119 (665) Q Consensus 44 it~l~~~~~~~~R~Al-~~-ws~va~ItF~Ev~~~~d~~f~~~~~g~a~~p~~~--~~~~~~~~~~~~~~~~~~~~~g~y 119 (665) +..|.++-...+|.|| .. |=|++ |.. |- .+|+.....+. ...++.-+|.... -- T Consensus 331 L~~LG~EY~~~~~~a~~~~RWvD~~-----en~--gK------r~GAYS~g~y~Gl~~pYILMNw~~~~---------~~ 388 (611) T TIGR00181 331 LEPLGEEYVKILKRAFSEERWVDYY-----ENK--GK------RSGAYSIGGYKGLVKPYILMNWDGTV---------LN 388 (611) T ss_pred HHHHHHHHHHHHHHHCCCCCCEEEC-----CCC--CC------CCCCEECCCCCCCCCCEEEECCCCCC---------HH T ss_conf 7650388899999843799717535-----667--78------55511237889866543671446750---------22 Q ss_pred EEEEHHHHHHHHH Q ss_conf 6630032346551 Q gi|254780384|r 120 LSHNALHEIGHAL 132 (665) Q Consensus 120 ~~~t~iHEIGHAL 132 (665) ...||+||+||++ T Consensus 389 ~~~TLaHE~GHS~ 401 (611) T TIGR00181 389 SVFTLAHELGHSM 401 (611) T ss_pred HHHHHHHHHHHHH T ss_conf 2579999855679 No 43 >TIGR02289 M3_not_pepF oligoendopeptidase, M3 family; InterPro: IPR011976 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists of probable oligoendopeptidases belonging to MEROPS peptidase family M3, subfamily M3B (clan MA(E)). The family is related to lactococcal PepF and group B streptococcal PepB (IPR004438 from INTERPRO) but in a distinct clade with considerable sequence differences not only to IPR004438 from INTERPRO but also to IPR011977 from INTERPRO. Likely substrates are small peptides and not whole proteins, as with PepF, but members are not characterised and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.. Probab=89.80 E-value=0.099 Score=27.87 Aligned_cols=73 Identities=25% Similarity=0.250 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHHHHHHCCEEEEECCCCCEEEEC--CC--CEEE-EECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHH Q ss_conf 999999999988657475689824787168961--58--5356-631467753122443354433366666734663003 Q gi|254780384|r 51 AQEVARWALGEWSKVVDLTFEETSIHSDIKFIS--SN--NGYV-CTPRYDSRYFMEINFDKADVWRYGIGKGTILSHNAL 125 (665) Q Consensus 51 ~~~~~R~Al~~ws~va~ItF~Ev~~~~d~~f~~--~~--~g~a-~~p~~~~~~~~~~~~~~~~~~~~~~~~g~y~~~t~i 125 (665) ..+.+..++...+.-+.--|.-.-...=+.+.. +- .|+- ++|.+.. .++-+|+|.. .+.--|++ T Consensus 277 ~~~~~~~my~~l~~~f~~~F~~m~~~nlLDL~~r~~Ka~GGyC~~Lp~~~~-pFiFsNfngt----------~~Di~~l~ 345 (553) T TIGR02289 277 LLEKAEKMYKELSLEFKEFFNFMLEKNLLDLESRKGKAAGGYCTYLPKYKA-PFIFSNFNGT----------SGDIDVLT 345 (553) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC-CEEEECCCCC----------CCHHHHHH T ss_conf 999999999840878999999887528834313789877743153556678-6078647888----------31367888 Q ss_pred HHHHHHHCC Q ss_conf 234655187 Q gi|254780384|r 126 HEIGHALGL 134 (665) Q Consensus 126 HEIGHALGL 134 (665) ||+|||+=. T Consensus 346 HE~GHafh~ 354 (553) T TIGR02289 346 HEAGHAFHV 354 (553) T ss_pred HHHHHHHHH T ss_conf 865688976 No 44 >cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha. Probab=89.72 E-value=0.22 Score=25.77 Aligned_cols=20 Identities=30% Similarity=0.469 Sum_probs=16.3 Q ss_pred EEEHHHHHHHHHCCCCCCCC Q ss_conf 63003234655187888876 Q gi|254780384|r 121 SHNALHEIGHALGLMHPGSY 140 (665) Q Consensus 121 ~~t~iHEIGHALGL~Hp~~~ 140 (665) -.|+.|||||.||..|-.+. T Consensus 168 ~~~~~he~gh~~~~~h~~~~ 187 (244) T cd04270 168 DLVTAHELGHNFGSPHDPDI 187 (244) T ss_pred EEEEEHHHHCCCCCCCCCCC T ss_conf 25521112202689888998 No 45 >COG3824 Predicted Zn-dependent protease [General function prediction only] Probab=89.52 E-value=0.11 Score=27.56 Aligned_cols=24 Identities=38% Similarity=0.607 Sum_probs=19.1 Q ss_pred CCCCCCEEEEEHHHHHHHHHCCCC Q ss_conf 666673466300323465518788 Q gi|254780384|r 113 GIGKGTILSHNALHEIGHALGLMH 136 (665) Q Consensus 113 ~~~~g~y~~~t~iHEIGHALGL~H 136 (665) .-..|..-.+++||||||-.||+- T Consensus 102 eetlgd~vthvliHEIgHhFGLsD 125 (136) T COG3824 102 EETLGDQVTHVLIHEIGHHFGLSD 125 (136) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 366765744302353221238777 No 46 >KOG4525 consensus Probab=86.04 E-value=0.33 Score=24.70 Aligned_cols=80 Identities=20% Similarity=0.089 Sum_probs=36.8 Q ss_pred EEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEE---CCHHHHHHH-HHHHH Q ss_conf 46630032346551878888766676666777541014202446752356776666552000---211221489-99986 Q gi|254780384|r 119 ILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYC---ATPMVSDIV-AVQKI 194 (665) Q Consensus 119 y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~---~tpm~~DI~-Alq~l 194 (665) .+.-.++|||||.||+.|.-+. ++...-++ -++.+|+--.-. .-....+..| ..-...-|. ++..+ T Consensus 300 itlGA~~HElGH~lgcpHq~~G-----vMlrgy~~--lNRsftt~e~~~---vrt~s~g~aPv~~keec~w~~lD~~Rf~ 369 (614) T KOG4525 300 ITLGAVCHELGHCLGCPHQSEG-----VMLRGYDY--LNRSFTTDEFEC---VRTGSKGLAPVLAKEECPWRILDARRFS 369 (614) T ss_pred HHHHHHHHHHHHCCCCCCCCCC-----EEECCCHH--HCCEEEECCEEE---EECCCCCCCCCCCCCCCCHHHHHHHHHC T ss_conf 5788999875530489988773-----57437311--120134020025---5227767687553022746666445420 Q ss_pred CCCCCCCCCCCEEE Q ss_conf 08766666773463 Q gi|254780384|r 195 YGAPKGSLGNNIYK 208 (665) Q Consensus 195 YGan~~~~gdt~y~ 208 (665) |-...+-.+|-+|. T Consensus 370 yhp~f~lp~dptyp 383 (614) T KOG4525 370 YHPTFSLPGDPTYP 383 (614) T ss_pred CCCCCCCCCCCCCC T ss_conf 14454689999887 No 47 >KOG3607 consensus Probab=85.98 E-value=0.25 Score=25.41 Aligned_cols=19 Identities=37% Similarity=0.730 Sum_probs=15.2 Q ss_pred EEEEHHHHHHHHHCCCCCC Q ss_conf 6630032346551878888 Q gi|254780384|r 120 LSHNALHEIGHALGLMHPG 138 (665) Q Consensus 120 ~~~t~iHEIGHALGL~Hp~ 138 (665) ...++-|||||.||+.|-= T Consensus 323 ~a~v~AhelgH~lGm~hD~ 341 (716) T KOG3607 323 FAVVLAHELGHNLGMIHDE 341 (716) T ss_pred HHHHHHHHHHHHCCCCCCC T ss_conf 7899999878652850066 No 48 >pfam00353 HemolysinCabind Hemolysin-type calcium-binding repeat (2 copies). Probab=85.27 E-value=0.42 Score=24.07 Aligned_cols=20 Identities=35% Similarity=0.683 Sum_probs=14.0 Q ss_pred ECCCCCCEEECCCCCEEEEE Q ss_conf 66786345888887338998 Q gi|254780384|r 290 FESSGNDVFDGSSGLDTFFY 309 (665) Q Consensus 290 ~gg~g~d~~~g~~g~Dt~~~ 309 (665) .|+.|+|+|+|+.|+|++.. T Consensus 1 ~Gg~GnD~L~Gg~GnD~i~~ 20 (26) T pfam00353 1 YGGAGNDTLYGGAGNDTLXX 20 (26) T ss_pred CCCCCCCCEECCCCCCCEEE T ss_conf 98766662064668761310 No 49 >pfam06262 DUF1025 Domain of unknown function (DUF1025). Family of bacterial protein with undetermined function. Probab=82.51 E-value=0.47 Score=23.80 Aligned_cols=21 Identities=43% Similarity=0.833 Sum_probs=16.4 Q ss_pred CCCEEEEEHHHHHHHHHCCCC Q ss_conf 673466300323465518788 Q gi|254780384|r 116 KGTILSHNALHEIGHALGLMH 136 (665) Q Consensus 116 ~g~y~~~t~iHEIGHALGL~H 136 (665) ...--..|++|||||=+|++. T Consensus 68 L~~~V~~tviHEiaH~fG~dd 88 (96) T pfam06262 68 LGELVRHVVIHEIGHHFGLSD 88 (96) T ss_pred HHHHHHHHHHHHHHHHCCCCH T ss_conf 999999999999887708988 No 50 >cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid. Probab=81.50 E-value=0.46 Score=23.85 Aligned_cols=14 Identities=36% Similarity=0.444 Sum_probs=11.2 Q ss_pred EEEEEHHHHHHHHH Q ss_conf 46630032346551 Q gi|254780384|r 119 ILSHNALHEIGHAL 132 (665) Q Consensus 119 y~~~t~iHEIGHAL 132 (665) ....|+.||+|||+ T Consensus 221 ~dv~TLaHElGHa~ 234 (427) T cd06459 221 DDVFTLAHELGHAF 234 (427) T ss_pred CCHHHHHHHHHHHH T ss_conf 31568898631899 No 51 >pfam03272 Enhancin Viral enhancin protein. Probab=81.02 E-value=0.92 Score=22.02 Aligned_cols=26 Identities=8% Similarity=-0.128 Sum_probs=16.6 Q ss_pred CCCCCCCEEECCCCCCCEEEEECCCC Q ss_conf 44444202424578751676403420 Q gi|254780384|r 354 VNVKQRSILEYTASYEDLIQVIGQDV 379 (665) Q Consensus 354 ~~~~~~d~l~~~a~~ddl~~~~G~d~ 379 (665) ...+.++.+....-.|.+.+.+-.|. T Consensus 531 ~S~l~~e~~~~lGl~D~~~A~~~Vd~ 556 (775) T pfam03272 531 KSPLLNETGHLLGLNDRLAATFYVDY 556 (775) T ss_pred CCCCCEEEEEEECCCCCEEEEEEEEC T ss_conf 77010007999846871389999954 No 52 >pfam01457 Peptidase_M8 Leishmanolysin. Probab=80.96 E-value=0.58 Score=23.25 Aligned_cols=20 Identities=35% Similarity=0.499 Sum_probs=15.6 Q ss_pred EEEEEHHHHHHHHHCCCCCC Q ss_conf 46630032346551878888 Q gi|254780384|r 119 ILSHNALHEIGHALGLMHPG 138 (665) Q Consensus 119 y~~~t~iHEIGHALGL~Hp~ 138 (665) ...-+..|||.||||....+ T Consensus 239 ~~vrv~aHEiaHALGFs~~~ 258 (569) T pfam01457 239 LVTRVVTHEIAHALGFSVTF 258 (569) T ss_pred EEEEHHHHHHHHHHCCCHHH T ss_conf 14238988888886358989 No 53 >PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional Probab=80.74 E-value=0.6 Score=23.14 Aligned_cols=17 Identities=35% Similarity=0.632 Sum_probs=14.0 Q ss_pred EEEHHHHHHHHHCCCCC Q ss_conf 63003234655187888 Q gi|254780384|r 121 SHNALHEIGHALGLMHP 137 (665) Q Consensus 121 ~~t~iHEIGHALGL~Hp 137 (665) --++.|||.||||.... T Consensus 257 vRv~aHEiAHALGFs~~ 273 (615) T PTZ00257 257 TRTVTHEVAHALGFSSV 273 (615) T ss_pred HHHHHHHHHHHHCCCHH T ss_conf 78999999998757998 No 54 >pfam09471 Peptidase_M64 IgA Peptidase M64. This is a family of highly selective metallo-endopeptidases. The primary structure of the Clostridium ramosum IgA proteinase shows no significant overall similarity to any other known metallo-endopeptidase. Probab=80.68 E-value=0.39 Score=24.27 Aligned_cols=26 Identities=38% Similarity=0.562 Sum_probs=17.8 Q ss_pred CCEEEEEHHHHHHHHH-CCCCCCCCCCCC Q ss_conf 7346630032346551-878888766676 Q gi|254780384|r 117 GTILSHNALHEIGHAL-GLMHPGSYNGGY 144 (665) Q Consensus 117 g~y~~~t~iHEIGHAL-GL~Hp~~~~~g~ 144 (665) ..+.-.+++||+||.+ ||.- ||.++. T Consensus 206 n~~~~~v~vHE~GHsf~gLaD--EY~y~~ 232 (257) T pfam09471 206 NSQASYIAVHELGHSFAGLAD--EYYYGG 232 (257) T ss_pred CCCCCCEEEEECCCCCCCCCH--HHCCCC T ss_conf 756663348843532344230--111686 No 55 >TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family; InterPro: IPR011977 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists of probable oligoendopeptidases belonging to MEROPS peptidase family M3, subfamily M3B (clan MA(E)). It is related to lactococcal PepF and group B streptococcal PepB (IPR004438 from INTERPRO) but in a distinct clade with considerable sequence differences not only to IPR004438 from INTERPRO but also to IPR011976 from INTERPRO. Members are not characterised with respect to their substrates and the activity profile may differ.. Probab=78.27 E-value=0.67 Score=22.85 Aligned_cols=13 Identities=38% Similarity=0.521 Sum_probs=11.1 Q ss_pred EEEEHHHHHHHHH Q ss_conf 6630032346551 Q gi|254780384|r 120 LSHNALHEIGHAL 132 (665) Q Consensus 120 ~~~t~iHEIGHAL 132 (665) .-.|+-||+|||+ T Consensus 385 dV~TLAHELGHA~ 397 (600) T TIGR02290 385 DVSTLAHELGHAY 397 (600) T ss_pred HHHHHHHHHHHHH T ss_conf 5899989865589 No 56 >pfam01432 Peptidase_M3 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but functions within the mitochondria to remove the leader sequence. Probab=77.46 E-value=0.75 Score=22.57 Aligned_cols=19 Identities=16% Similarity=0.349 Sum_probs=12.1 Q ss_pred HHHHHHHHHHCCEEEEECC Q ss_conf 9999886574756898247 Q gi|254780384|r 57 WALGEWSKVVDLTFEETSI 75 (665) Q Consensus 57 ~Al~~ws~va~ItF~Ev~~ 75 (665) -.+..+..+-+|.|+++.. T Consensus 141 gl~~l~~~lfg~~~~~~~~ 159 (448) T pfam01432 141 GLFGLFERLFGITFVLEPL 159 (448) T ss_pred HHHHHHHHHHHHHHHCCCC T ss_conf 9999987653234320566 No 57 >PTZ00337 surface protease GP63; Provisional Probab=76.79 E-value=0.82 Score=22.31 Aligned_cols=22 Identities=36% Similarity=0.481 Sum_probs=16.3 Q ss_pred CCEEEEEHHHHHHHHHCCCCCC Q ss_conf 7346630032346551878888 Q gi|254780384|r 117 GTILSHNALHEIGHALGLMHPG 138 (665) Q Consensus 117 g~y~~~t~iHEIGHALGL~Hp~ 138 (665) .....-+..|||.||||..+.. T Consensus 226 ~~~~vRvaaHEiAHALGFs~~~ 247 (567) T PTZ00337 226 TNGYVRVAAHELGHALGFARDQ 247 (567) T ss_pred CCHHHHHHHHHHHHHHCCCHHH T ss_conf 4015789999998886368889 No 58 >pfam02382 RTX RTX N-terminal domain. The RTX family of bacterial toxins are a group of cytolysins and cytotoxins. This Pfam family represents the N-terminal domain which is found in association with a glycine-rich repeat domain and hemolysinCabind pfam00353. Probab=74.19 E-value=4.9 Score=17.67 Aligned_cols=20 Identities=30% Similarity=0.531 Sum_probs=7.9 Q ss_pred CCCCCCEEECCCCCEEEEEE Q ss_conf 67863458888873389982 Q gi|254780384|r 291 ESSGNDVFDGSSGLDTFFYY 310 (665) Q Consensus 291 gg~g~d~~~g~~g~Dt~~~~ 310 (665) -+.|.-.++||.|.|+++|. T Consensus 622 ~gsGs~~idgG~G~D~V~Ys 641 (653) T pfam02382 622 LGSGSTEVDAGEGHDVVYYS 641 (653) T ss_pred ECCCEEEEECCCCCEEEEEC T ss_conf 53770688568871368853 No 59 >cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig Probab=71.80 E-value=1.1 Score=21.48 Aligned_cols=17 Identities=35% Similarity=0.430 Sum_probs=11.8 Q ss_pred EEEEEHHHHHHHHH-CCC Q ss_conf 46630032346551-878 Q gi|254780384|r 119 ILSHNALHEIGHAL-GLM 135 (665) Q Consensus 119 y~~~t~iHEIGHAL-GL~ 135 (665) .-..|+.||.|||| .|- T Consensus 153 ~ev~TlfHEfGHalh~ll 170 (365) T cd06258 153 DDINTLFHEFGHAVHFLL 170 (365) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 999999999878888743 No 60 >KOG3538 consensus Probab=70.87 E-value=1.4 Score=20.89 Aligned_cols=22 Identities=36% Similarity=0.652 Sum_probs=19.3 Q ss_pred EEEEHHHHHHHHHCCCCCCCCC Q ss_conf 6630032346551878888766 Q gi|254780384|r 120 LSHNALHEIGHALGLMHPGSYN 141 (665) Q Consensus 120 ~~~t~iHEIGHALGL~Hp~~~~ 141 (665) .-.|+.||+||-+|+.|-+..+ T Consensus 317 ~aftiahE~gH~~gm~hd~~~~ 338 (845) T KOG3538 317 SAFTIAHELGHNFGMEHDGRGN 338 (845) T ss_pred CCCCEECCCCCCCCCCCCCCCC T ss_conf 6554000025688865677578 No 61 >COG1164 Oligoendopeptidase F [Amino acid transport and metabolism] Probab=70.85 E-value=1.2 Score=21.27 Aligned_cols=10 Identities=40% Similarity=0.803 Sum_probs=3.6 Q ss_pred EEHHHHHHHH Q ss_conf 3003234655 Q gi|254780384|r 122 HNALHEIGHA 131 (665) Q Consensus 122 ~t~iHEIGHA 131 (665) .|+.||+||+ T Consensus 382 ~TLaHElGHs 391 (598) T COG1164 382 FTLAHELGHS 391 (598) T ss_pred HHHHHHHHHH T ss_conf 6874650189 No 62 >KOG3658 consensus Probab=70.71 E-value=1.8 Score=20.24 Aligned_cols=20 Identities=15% Similarity=0.252 Sum_probs=8.2 Q ss_pred ECCCEECCCCCCCCCEEECC Q ss_conf 63840012444442024245 Q gi|254780384|r 346 FADQCIDSVNVKQRSILEYT 365 (665) Q Consensus 346 f~d~t~~~~~~~~~d~l~~~ 365 (665) +..|.....+.+.-.+.+|. T Consensus 365 y~~G~~~sLNtGi~T~~NYg 384 (764) T KOG3658 365 YNEGKKRSLNTGISTSVNYG 384 (764) T ss_pred CCCCEEEEEECCEEEEEECC T ss_conf 26760578513355421058 No 63 >pfam06594 HCBP_related Haemolysin-type calcium binding protein related domain. This family consists of a number of bacteria specific domains which are found in haemolysin-type calcium binding proteins. This family is found in conjunction with pfam00353 and is often found in multiple copies. Probab=69.67 E-value=5.8 Score=17.21 Aligned_cols=14 Identities=29% Similarity=0.498 Sum_probs=12.6 Q ss_pred EEEEEEEEECCCEE Q ss_conf 75467999638400 Q gi|254780384|r 338 LSDVERLRFADQCI 351 (665) Q Consensus 338 ~~~iE~~~f~d~t~ 351 (665) ..+||.++|+|||+ T Consensus 24 ~~~iE~i~FADGT~ 37 (43) T pfam06594 24 GYGIEQIEFADGTV 37 (43) T ss_pred CCEEEEEEECCCCE T ss_conf 86376899879975 No 64 >cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain. Probab=69.58 E-value=1.5 Score=20.66 Aligned_cols=13 Identities=31% Similarity=0.728 Sum_probs=10.0 Q ss_pred EEHHHHHHHHHCC Q ss_conf 3003234655187 Q gi|254780384|r 122 HNALHEIGHALGL 134 (665) Q Consensus 122 ~t~iHEIGHALGL 134 (665) -..+||+|||+-- T Consensus 137 s~v~HElGHA~aA 149 (277) T cd06162 137 SGVVHEMGHGVAA 149 (277) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999899999 No 65 >COG2856 Predicted Zn peptidase [Amino acid transport and metabolism] Probab=66.88 E-value=1.9 Score=20.14 Aligned_cols=18 Identities=33% Similarity=0.338 Sum_probs=13.3 Q ss_pred EEEEEHHHHHHHHHCCCC Q ss_conf 466300323465518788 Q gi|254780384|r 119 ILSHNALHEIGHALGLMH 136 (665) Q Consensus 119 y~~~t~iHEIGHALGL~H 136 (665) --..|+.||+||+|=-.+ T Consensus 71 r~rFtlAHELGH~llH~~ 88 (213) T COG2856 71 RKRFTLAHELGHALLHTD 88 (213) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 888789998758996265 No 66 >cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom Probab=66.33 E-value=1.8 Score=20.32 Aligned_cols=36 Identities=11% Similarity=0.266 Sum_probs=14.9 Q ss_pred HHHHHCCCCCC-CCCCCEEECCCCCCCCCEEEEEECCCCCEEE Q ss_conf 99986087666-6677346325767876358987228973259 Q gi|254780384|r 190 AVQKIYGAPKG-SLGNNIYKIEVQQKDSPFIQTIYDSGGDDLL 231 (665) Q Consensus 190 Alq~lYGan~~-~~gdt~y~~~~~~~~~~~~~tI~D~gG~DTi 231 (665) ..+.|||..-. ...-.+ |.+.. -.+.|+|..+...| T Consensus 168 l~~~LfGi~f~~~~~~~~--WH~dV----r~~~V~d~~~g~~l 204 (472) T cd06455 168 IYQRLFGLRFEEVPDASV--WHEDV----RLYSVWDADTGEFL 204 (472) T ss_pred HHHHHHCEEEEECCCCCC--CCCCC----EEEEEEECCCCCEE T ss_conf 999984845587787887--78765----69999988999478 No 67 >cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha Probab=65.11 E-value=2.4 Score=19.53 Aligned_cols=15 Identities=40% Similarity=0.554 Sum_probs=11.4 Q ss_pred EEHHHHHHHHHCCCC Q ss_conf 300323465518788 Q gi|254780384|r 122 HNALHEIGHALGLMH 136 (665) Q Consensus 122 ~t~iHEIGHALGL~H 136 (665) ..++||+||++=-++ T Consensus 55 sVllHElgHal~Ar~ 69 (227) T cd06164 55 SVLLHELGHSLVARR 69 (227) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999999 No 68 >cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo Probab=64.66 E-value=2.1 Score=19.81 Aligned_cols=18 Identities=6% Similarity=0.017 Sum_probs=7.7 Q ss_pred HHHHHHHHHHCCEEEEEC Q ss_conf 999988657475689824 Q gi|254780384|r 57 WALGEWSKVVDLTFEETS 74 (665) Q Consensus 57 ~Al~~ws~va~ItF~Ev~ 74 (665) -.+.....+-+|+|+|+. T Consensus 138 Glf~i~~~Lfgi~~~~~~ 155 (458) T cd06457 138 GLSRLFSRLYGIRLVPVP 155 (458) T ss_pred HHHHHHHHHHCEEEEECC T ss_conf 999999998574568746 No 69 >cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. Probab=64.57 E-value=2.5 Score=19.43 Aligned_cols=16 Identities=44% Similarity=0.482 Sum_probs=12.1 Q ss_pred EEHHHHHHHHHCCCCC Q ss_conf 3003234655187888 Q gi|254780384|r 122 HNALHEIGHALGLMHP 137 (665) Q Consensus 122 ~t~iHEIGHALGL~Hp 137 (665) ..++||+||++--++- T Consensus 40 sVl~HElgH~l~A~~~ 55 (208) T cd06161 40 SVLLHELGHALVARRY 55 (208) T ss_pred HHHHHHHHHHHHHHHC T ss_conf 9999999999999994 No 70 >pfam05279 Asp-B-Hydro_N Aspartyl beta-hydroxylase N-terminal region. This family includes the N-terminal regions of the junctin, junctate and aspartyl beta-hydroxylase proteins. Junctate is an integral ER/SR membrane calcium binding protein, which comes from an alternatively spliced form of the same gene that generates aspartyl beta-hydroxylase and junctin. Aspartyl beta-hydroxylase catalyses the post-translational hydroxylation of aspartic acid or asparagine residues contained within epidermal growth factor (EGF) domains of proteins. Probab=63.60 E-value=2.8 Score=19.13 Aligned_cols=22 Identities=23% Similarity=0.277 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHCCEEEEE Q ss_conf 9999999998865747568982 Q gi|254780384|r 52 QEVARWALGEWSKVVDLTFEET 73 (665) Q Consensus 52 ~~~~R~Al~~ws~va~ItF~Ev 73 (665) =-++-.-|..|.+||=+=|-=| T Consensus 15 WfmViaLLGvWtSVaVVwFdlV 36 (232) T pfam05279 15 WFMVIALLGVWTSVAVVWFDLV 36 (232) T ss_pred HHHHHHHHHHHHHHHEEEEEEC T ss_conf 9999999887761304554400 No 71 >PRK10280 dipeptidyl carboxypeptidase II; Provisional Probab=62.10 E-value=2.3 Score=19.57 Aligned_cols=33 Identities=12% Similarity=0.257 Sum_probs=14.9 Q ss_pred HHHHHCCCCC-CCCCCCEEECCCCCCCCCEEEEEECCCCC Q ss_conf 9998608766-66677346325767876358987228973 Q gi|254780384|r 190 AVQKIYGAPK-GSLGNNIYKIEVQQKDSPFIQTIYDSGGD 228 (665) Q Consensus 190 Alq~lYGan~-~~~gdt~y~~~~~~~~~~~~~tI~D~gG~ 228 (665) ..+.|||..- -...-.+ |.+.. -.+.|+|..|. T Consensus 373 ~~~~Lfgi~f~e~~~~~v--wh~dV----~~~~V~d~~g~ 406 (681) T PRK10280 373 TANQLFGIKFVERFDIPV--YHPDV----RVWEIFDHNGV 406 (681) T ss_pred HHHHHHCEEEEECCCCCC--CCCCE----EEEEEECCCCC T ss_conf 999983868786587876--57874----79999858997 No 72 >pfam01434 Peptidase_M41 Peptidase family M41. Probab=60.89 E-value=2.9 Score=19.04 Aligned_cols=17 Identities=59% Similarity=0.882 Sum_probs=12.3 Q ss_pred EEEHHHHHHHHH-CCCCC Q ss_conf 630032346551-87888 Q gi|254780384|r 121 SHNALHEIGHAL-GLMHP 137 (665) Q Consensus 121 ~~t~iHEIGHAL-GL~Hp 137 (665) ..+.+||.|||| ++--| T Consensus 9 ~~vA~HEaGHAlva~~l~ 26 (192) T pfam01434 9 RLVAYHEAGHALVGLLLP 26 (192) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999999846 No 73 >LOAD_S2Pmetalloprt consensus Probab=60.71 E-value=3.2 Score=18.75 Aligned_cols=18 Identities=39% Similarity=0.503 Sum_probs=13.0 Q ss_pred EEHHHHHHHHHCCCCCCC Q ss_conf 300323465518788887 Q gi|254780384|r 122 HNALHEIGHALGLMHPGS 139 (665) Q Consensus 122 ~t~iHEIGHALGL~Hp~~ 139 (665) ...+||+||++=-++-|. T Consensus 9 ~v~vHElgH~~~A~~~G~ 26 (148) T LOAD_S2Pmetall 9 SVVVHELGHALVARRFGV 26 (148) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 999999999999999699 No 74 >KOG2921 consensus Probab=60.08 E-value=3.2 Score=18.73 Aligned_cols=13 Identities=46% Similarity=0.761 Sum_probs=9.7 Q ss_pred EHHHHHHHHHCCC Q ss_conf 0032346551878 Q gi|254780384|r 123 NALHEIGHALGLM 135 (665) Q Consensus 123 t~iHEIGHALGL~ 135 (665) ..+|||||||--- T Consensus 134 ~vvHElGHalAA~ 146 (484) T KOG2921 134 VVVHELGHALAAA 146 (484) T ss_pred HHHHHHHHHHHHH T ss_conf 8788765799987 No 75 >PRK10911 oligopeptidase A; Provisional Probab=59.12 E-value=3.1 Score=18.84 Aligned_cols=16 Identities=25% Similarity=0.385 Sum_probs=8.8 Q ss_pred EHHHHHHHHHCCCCCC Q ss_conf 0032346551878888 Q gi|254780384|r 123 NALHEIGHALGLMHPG 138 (665) Q Consensus 123 t~iHEIGHALGL~Hp~ 138 (665) .+=||+..-||-++=- T Consensus 264 ~LR~E~A~LLGy~s~A 279 (680) T PRK10911 264 ALRHELAQLLGFENYA 279 (680) T ss_pred HHHHHHHHHCCCCCHH T ss_conf 9999999983998499 No 76 >pfam02163 Peptidase_M50 Peptidase family M50. Probab=56.69 E-value=4.1 Score=18.09 Aligned_cols=17 Identities=41% Similarity=0.536 Sum_probs=11.6 Q ss_pred EHHHHHHHHHCCCCCCC Q ss_conf 00323465518788887 Q gi|254780384|r 123 NALHEIGHALGLMHPGS 139 (665) Q Consensus 123 t~iHEIGHALGL~Hp~~ 139 (665) ..+||+||++=-++-++ T Consensus 10 i~~HE~gH~~~Ar~~G~ 26 (205) T pfam02163 10 VVVHELGHALVARRFGV 26 (205) T ss_pred HHHHHHHHHHHHHHCCC T ss_conf 99999999999999299 No 77 >COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism] Probab=56.59 E-value=3.6 Score=18.47 Aligned_cols=17 Identities=18% Similarity=0.255 Sum_probs=8.9 Q ss_pred EHHHHHHHHHCCCCCCC Q ss_conf 00323465518788887 Q gi|254780384|r 123 NALHEIGHALGLMHPGS 139 (665) Q Consensus 123 t~iHEIGHALGL~Hp~~ 139 (665) .+=+|+-+-||-.+=-+ T Consensus 267 ~LR~E~A~LLGf~~yA~ 283 (683) T COG0339 267 KLRAELAKLLGFSNYAE 283 (683) T ss_pred HHHHHHHHHCCCCCHHH T ss_conf 99999998749641888 No 78 >cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin Probab=54.52 E-value=4.7 Score=17.75 Aligned_cols=18 Identities=39% Similarity=0.540 Sum_probs=11.9 Q ss_pred EEHHHHHHHHHCCCCCCC Q ss_conf 300323465518788887 Q gi|254780384|r 122 HNALHEIGHALGLMHPGS 139 (665) Q Consensus 122 ~t~iHEIGHALGL~Hp~~ 139 (665) ...+||+||++=-+.-++ T Consensus 10 ~i~iHE~gH~~~A~~~G~ 27 (180) T cd05709 10 SVTVHELGHALVARRLGV 27 (180) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 999999999999999699 No 79 >cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o Probab=54.02 E-value=4.5 Score=17.84 Aligned_cols=17 Identities=47% Similarity=0.706 Sum_probs=12.0 Q ss_pred EEEEHHHHHHHHH---CCCC Q ss_conf 6630032346551---8788 Q gi|254780384|r 120 LSHNALHEIGHAL---GLMH 136 (665) Q Consensus 120 ~~~t~iHEIGHAL---GL~H 136 (665) +....|||.|||| ||.- T Consensus 159 ~l~~~iHE~GHalYEqgl~~ 178 (396) T cd06460 159 ALFSTIHETGHALYEQGLPP 178 (396) T ss_pred HHHHHHHHHHHHHHHCCCCH T ss_conf 99999987516788619897 No 80 >cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains. Probab=52.63 E-value=5.3 Score=17.45 Aligned_cols=19 Identities=37% Similarity=0.587 Sum_probs=13.9 Q ss_pred EEEEHHHHHHHHHCCCCCC Q ss_conf 6630032346551878888 Q gi|254780384|r 120 LSHNALHEIGHALGLMHPG 138 (665) Q Consensus 120 ~~~t~iHEIGHALGL~Hp~ 138 (665) ....++||+||.+--++-+ T Consensus 41 ~~il~~HElGH~l~a~~~g 59 (183) T cd06160 41 LAILGIHEMGHYLAARRHG 59 (183) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999999939 No 81 >KOG2090 consensus Probab=46.94 E-value=6.6 Score=16.87 Aligned_cols=26 Identities=23% Similarity=0.322 Sum_probs=15.8 Q ss_pred HHCCCCHHHHHHHH---HHHHCCCCCCCCCC Q ss_conf 51675002477888---89975404688631 Q gi|254780384|r 539 AFADAPDLVQAGKD---HFSNFGYEEGRSIT 566 (665) Q Consensus 539 a~G~d~~~~~aa~~---HYi~~G~~EGR~~~ 566 (665) -|-.||....+|-+ |... |.++|.|- T Consensus 641 ~Fe~dPfsR~aGek~r~eil~--hGG~~~P~ 669 (704) T KOG2090 641 LFENDPFSRKAGEKFRKEILK--HGGGRDPA 669 (704) T ss_pred HHHCCCCCHHHHHHHHHHHHH--HCCCCCHH T ss_conf 981596322446899999998--45999869 No 82 >TIGR00054 TIGR00054 membrane-associated zinc metalloprotease, putative; InterPro: IPR004387 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family contains putative zinc metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). The N-terminal region of contains a perfectly conserved motif HEXGH, where the Glu is the active site and the His residues coordinate the metal cation. The family of bacterial and plant proteins also includes a region that hits the PDZ domain (IPR001478 from INTERPRO), found in a number of proteins targeted to the membrane by binding to a peptide ligand . The family includes EcfE, which is a homolog of human site-2 protease (S2P), a membrane-bound zinc metalloprotease involved in regulated intramembrane proteolysis. In Escherichia coli EcfE activates the sigma(E) pathway of stress response through a site-2 cleavage of anti-sigma(E), RseA.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016021 integral to membrane. Probab=45.41 E-value=7.4 Score=16.54 Aligned_cols=11 Identities=36% Similarity=0.501 Sum_probs=7.1 Q ss_pred EEEHHHHHHHH Q ss_conf 63003234655 Q gi|254780384|r 121 SHNALHEIGHA 131 (665) Q Consensus 121 ~~t~iHEIGHA 131 (665) ...++||+||= T Consensus 16 ~Li~vHElGHF 26 (463) T TIGR00054 16 VLIFVHELGHF 26 (463) T ss_pred HHHHHHHHHHH T ss_conf 99999988899 No 83 >cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA. Probab=43.80 E-value=10 Score=15.68 Aligned_cols=15 Identities=40% Similarity=0.651 Sum_probs=13.4 Q ss_pred EHHHHHHHHHCCCCC Q ss_conf 003234655187888 Q gi|254780384|r 123 NALHEIGHALGLMHP 137 (665) Q Consensus 123 t~iHEIGHALGL~Hp 137 (665) ..||+|.|-+||.|- T Consensus 32 ~~vH~lA~~~GL~h~ 46 (60) T cd02641 32 LLVHELAEELGLRHE 46 (60) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999998299764 No 84 >pfam08463 EcoEI_R_C EcoEI R protein C-terminal. The restriction enzyme EcoEI recognizes 5'-GAGN(7)ATGC-3' and is composed of the three proteins R, M, and S. The domain described here is found at the C-terminus of the R protein (HsdR) which is required for both nuclease and ATPase activity. Probab=42.48 E-value=11 Score=15.48 Aligned_cols=61 Identities=23% Similarity=0.337 Sum_probs=30.9 Q ss_pred CCCCCCCCCHHHHHCCHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCHHHHHHCCHHHHHHHHHHH Q ss_conf 68886544011120232166651675002477888899754046886311768876738107777606513478868873 Q gi|254780384|r 518 GRNPDAFKALEYIASHEDLINAFADAPDLVQAGKDHFSNFGYEEGRSITFNPDLYLTSYDDLWNVFGSDRSAAAQHYITH 597 (665) Q Consensus 518 GR~~~~Fda~~Ylasn~DL~~a~G~d~~~~~aa~~HYi~~G~~EGR~~~Fda~~YLa~y~Dl~~~fg~d~~~a~~Hyi~~ 597 (665) +++.+.||.++|+|=+ ..|++...=..+ +--..|++.|.+-+..| ....+.+|+.. T Consensus 130 ~~d~D~FDvL~hiAf~--------~pplTR~ERa~~-------------vkk~~~f~~y~~~ar~v---Le~lLdkY~~~ 185 (232) T pfam08463 130 GKDSDPFDLLCHIAFD--------QPPLTRKERAEQ-------------VKKRNYFTKYSEKAREV---LEALLDKYADE 185 (232) T ss_pred CCCCCHHHHHHHHHCC--------CCCCCHHHHHHH-------------HHHHHHHHHCCHHHHHH---HHHHHHHHHHH T ss_conf 7537689999999816--------998778999999-------------87666887639999999---99999999972 Q ss_pred HHHHC Q ss_conf 56407 Q gi|254780384|r 598 GMKEG 602 (665) Q Consensus 598 G~~EG 602 (665) |..|- T Consensus 186 Gi~el 190 (232) T pfam08463 186 GIQEL 190 (232) T ss_pred CCCCC T ss_conf 90030 No 85 >pfam10263 SprT-like SprT-like family. This family represents a domain found in eukaryotes and prokaryotes. The domain contains a characteristic motif of the zinc metallopeptidases. This family includes the bacterial SprT protein. Probab=42.40 E-value=12 Score=15.20 Aligned_cols=21 Identities=38% Similarity=0.469 Sum_probs=15.3 Q ss_pred EEEEEHHHHHHHHH------CCCCCCC Q ss_conf 46630032346551------8788887 Q gi|254780384|r 119 ILSHNALHEIGHAL------GLMHPGS 139 (665) Q Consensus 119 y~~~t~iHEIGHAL------GL~Hp~~ 139 (665) ....|++||+-|++ +-.+||. T Consensus 57 ~l~~tl~HEm~H~~~~~~~~~~~~~HG 83 (153) T pfam10263 57 FLIETLLHEMCHAALFVLFGGRGYPHG 83 (153) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 999888999999999976178889988 No 86 >pfam02074 Peptidase_M32 Carboxypeptidase Taq (M32) metallopeptidase. Probab=42.31 E-value=9.1 Score=16.02 Aligned_cols=12 Identities=50% Similarity=0.875 Sum_probs=7.3 Q ss_pred EEEHHHHHHHHH Q ss_conf 630032346551 Q gi|254780384|r 121 SHNALHEIGHAL 132 (665) Q Consensus 121 ~~t~iHEIGHAL 132 (665) ..-.|||.|||| T Consensus 260 l~~~iHE~GHal 271 (494) T pfam02074 260 IFGTIHEFGHAL 271 (494) T ss_pred HHHHHHHHHHHH T ss_conf 999998762889 No 87 >PRK10733 hflB ATP-dependent metalloprotease; Reviewed Probab=41.39 E-value=9.1 Score=16.02 Aligned_cols=11 Identities=27% Similarity=0.374 Sum_probs=5.0 Q ss_pred CCCCHHHHHHH Q ss_conf 66704455552 Q gi|254780384|r 606 GISGMQFKSVM 616 (665) Q Consensus 606 ~l~~~qf~~~~ 616 (665) +|+..++..-. T Consensus 583 TL~geEi~~Il 593 (644) T PRK10733 583 TIDAPQIDDLM 593 (644) T ss_pred CCCHHHHHHHH T ss_conf 85899999996 No 88 >cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains. Probab=39.74 E-value=11 Score=15.51 Aligned_cols=14 Identities=36% Similarity=0.422 Sum_probs=6.7 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 03234655187888 Q gi|254780384|r 124 ALHEIGHALGLMHP 137 (665) Q Consensus 124 ~iHEIGHALGL~Hp 137 (665) .+||++||+=-..- T Consensus 13 ~~HE~aHa~~A~~~ 26 (181) T cd06158 13 TLHEFAHAYVAYRL 26 (181) T ss_pred HHHHHHHHHHHHHC T ss_conf 99999999999984 No 89 >cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA. Probab=38.99 E-value=11 Score=15.41 Aligned_cols=14 Identities=36% Similarity=0.373 Sum_probs=9.6 Q ss_pred EEHHHHHHHHHCCC Q ss_conf 30032346551878 Q gi|254780384|r 122 HNALHEIGHALGLM 135 (665) Q Consensus 122 ~t~iHEIGHALGL~ 135 (665) ..++||.||-+=-+ T Consensus 11 lV~vHElGH~~~Ar 24 (182) T cd06163 11 LIFVHELGHFLVAK 24 (182) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999899899999 No 90 >TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain; InterPro: IPR013428 Proteins containing this domain also contain several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (IPR013427 from INTERPRO) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) (Q44091 from SWISSPROT) shows some homolgy to this domain but lacks additional sequence regions found in this entry and belongs to a different clade of the larger family of homologues. . Probab=36.98 E-value=17 Score=14.39 Aligned_cols=25 Identities=24% Similarity=0.235 Sum_probs=15.0 Q ss_pred CCCCE-EEEEHHHHHHHHHCCCCCCC Q ss_conf 66734-66300323465518788887 Q gi|254780384|r 115 GKGTI-LSHNALHEIGHALGLMHPGS 139 (665) Q Consensus 115 ~~g~y-~~~t~iHEIGHALGL~Hp~~ 139 (665) ++|.| ...|++-|+.|..||.-... T Consensus 58 gDGkyd~~~VF~~~~~~~~~L~~a~~ 83 (459) T TIGR02604 58 GDGKYDKSKVFAEELSLVTGLAVAVG 83 (459) T ss_pred CCCCCCEEEECCCCCCCCCCCCCCCC T ss_conf 88850146542468532356100258 No 91 >COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism] Probab=36.47 E-value=12 Score=15.22 Aligned_cols=10 Identities=60% Similarity=1.059 Sum_probs=5.2 Q ss_pred EHHHHHHHHH Q ss_conf 0032346551 Q gi|254780384|r 123 NALHEIGHAL 132 (665) Q Consensus 123 t~iHEIGHAL 132 (665) ..|||-|||| T Consensus 262 g~iHE~GHAl 271 (497) T COG2317 262 GTIHETGHAL 271 (497) T ss_pred HHHHHHHHHH T ss_conf 9998531577 No 92 >cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner. Probab=36.39 E-value=15 Score=14.74 Aligned_cols=17 Identities=29% Similarity=0.481 Sum_probs=13.7 Q ss_pred EHHHHHHHHHCCCCCCC Q ss_conf 00323465518788887 Q gi|254780384|r 123 NALHEIGHALGLMHPGS 139 (665) Q Consensus 123 t~iHEIGHALGL~Hp~~ 139 (665) -++|.+.|-|||.|--. T Consensus 32 rivh~La~~lGL~h~s~ 48 (60) T cd02639 32 RIVHLLASRLGLNHVSD 48 (60) T ss_pred HHHHHHHHHCCCCEEEE T ss_conf 99999998819920440 No 93 >pfam06114 DUF955 Domain of unknown function (DUF955). Family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to pfam01435. Probab=36.18 E-value=12 Score=15.22 Aligned_cols=14 Identities=36% Similarity=0.468 Sum_probs=11.1 Q ss_pred EEEEHHHHHHHHHC Q ss_conf 66300323465518 Q gi|254780384|r 120 LSHNALHEIGHALG 133 (665) Q Consensus 120 ~~~t~iHEIGHALG 133 (665) -..++.|||||.+- T Consensus 41 ~~f~~aHElgH~~l 54 (121) T pfam06114 41 QRFTLAHELGHLLL 54 (121) T ss_pred HHHHHHHHHHHHHH T ss_conf 59999999999993 No 94 >smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function. Probab=35.08 E-value=18 Score=14.19 Aligned_cols=20 Identities=35% Similarity=0.308 Sum_probs=14.1 Q ss_pred EEEEEHHHHHHHHH----CCCCCC Q ss_conf 46630032346551----878888 Q gi|254780384|r 119 ILSHNALHEIGHAL----GLMHPG 138 (665) Q Consensus 119 y~~~t~iHEIGHAL----GL~Hp~ 138 (665) ....|++||+-|+. |-.+|| T Consensus 58 ~l~~tl~HElcH~~~~~~~~~~~H 81 (146) T smart00731 58 RLRETLLHELCHAALYLFGRGYGH 81 (146) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 999878999999999997798987 No 95 >PRK10779 zinc metallopeptidase; Provisional Probab=34.52 E-value=14 Score=14.93 Aligned_cols=17 Identities=24% Similarity=0.137 Sum_probs=11.0 Q ss_pred EEEEHHHHHHHHHCCCC Q ss_conf 66300323465518788 Q gi|254780384|r 120 LSHNALHEIGHALGLMH 136 (665) Q Consensus 120 ~~~t~iHEIGHALGL~H 136 (665) +...++||.||=|=-|- T Consensus 15 ~~~V~iHElGHfl~Ak~ 31 (449) T PRK10779 15 GVLITVHEFGHFWVARR 31 (449) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999885787999998 No 96 >KOG3854 consensus Probab=34.17 E-value=13 Score=14.99 Aligned_cols=12 Identities=33% Similarity=0.481 Sum_probs=6.4 Q ss_pred CCCCCHHHHHCC Q ss_conf 654401112023 Q gi|254780384|r 522 DAFKALEYIASH 533 (665) Q Consensus 522 ~~Fda~~Ylasn 533 (665) ..|..|.|-+.. T Consensus 376 d~wKrWa~ra~~ 387 (505) T KOG3854 376 DNWKRWAYRANM 387 (505) T ss_pred HHHHHHHHHHHH T ss_conf 389999999863 No 97 >pfam06189 5-nucleotidase 5'-nucleotidase. This family consists of both eukaryotic and prokaryotic 5'-nucleotidase sequences (EC:3.1.3.5). Probab=33.80 E-value=24 Score=13.47 Aligned_cols=13 Identities=31% Similarity=0.302 Sum_probs=6.1 Q ss_pred CEEECCCCCCCCC Q ss_conf 1010014554221 Q gi|254780384|r 410 DLRNVFGSDRGAA 422 (665) Q Consensus 410 Dl~~~~g~d~~~a 422 (665) .|+-+|..|.+.+ T Consensus 120 qlRIAFDgDaVLF 132 (263) T pfam06189 120 QLRIAFDGDAVLF 132 (263) T ss_pred CEEEEECCCEEEE T ss_conf 4589736874786 No 98 >COG4227 Antirestriction protein [DNA replication, recombination, and repair] Probab=29.81 E-value=15 Score=14.71 Aligned_cols=23 Identities=30% Similarity=0.517 Sum_probs=14.1 Q ss_pred HHHHCCCHHCCCCCC-CCCCHHHH Q ss_conf 211004500268886-54401112 Q gi|254780384|r 508 RHYITHGVKEGRNPD-AFKALEYI 530 (665) Q Consensus 508 ~Hy~~~G~~EGR~~~-~Fda~~Yl 530 (665) -||.-.+.+-.|++. .|+...|- T Consensus 212 ghwtgh~~rl~rdLs~~~~sr~yA 235 (316) T COG4227 212 GHWTGHEARLDRDLSRAFGSREYA 235 (316) T ss_pred CCCCCCHHHHHHHHCCCCCHHHHH T ss_conf 365576034423322432025789 No 99 >TIGR01241 FtsH_fam ATP-dependent metallopeptidase HflB; InterPro: IPR005936 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. FtsH is a membrane-anchored ATP-dependent protease that degrades misfolded or misassembled membrane proteins as well as a subset of cytoplasmic regulatory proteins. FtsH is a 647-residue protein of 70 kDa, with two putative transmembrane segments towards its N terminus which anchor the protein to the membrane, giving rise to a periplasmic domain of 70 residues and a cytoplasmic segment of 520 residues containing the ATPase and protease domains . ; GO: 0004222 metalloendopeptidase activity, 0030163 protein catabolic process, 0016020 membrane. Probab=29.77 E-value=18 Score=14.23 Aligned_cols=26 Identities=46% Similarity=0.594 Sum_probs=17.1 Q ss_pred EEEHHHHHHHHH-CCCCCCCCCCCCCCCCCC Q ss_conf 630032346551-878888766676666777 Q gi|254780384|r 121 SHNALHEIGHAL-GLMHPGSYNGGYPVYGVD 150 (665) Q Consensus 121 ~~t~iHEIGHAL-GL~Hp~~~~~g~~~~~~~ 150 (665) ..|.-||=|||| |+--|+. +|+..++ T Consensus 317 ~~vAYHEaGHAl~G~~~~~~----DpV~KvT 343 (505) T TIGR01241 317 KIVAYHEAGHALVGLLLKDA----DPVHKVT 343 (505) T ss_pred CEEEEECHHHHHHHHHCCCC----CCCCCEE T ss_conf 22201115789999735344----7523256 No 100 >cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present. Probab=28.60 E-value=21 Score=13.77 Aligned_cols=12 Identities=25% Similarity=0.540 Sum_probs=9.3 Q ss_pred EEEHHHHHHHHH Q ss_conf 630032346551 Q gi|254780384|r 121 SHNALHEIGHAL 132 (665) Q Consensus 121 ~~t~iHEIGHAL 132 (665) -...+||.+|++ T Consensus 119 v~~vvHE~~Hgi 130 (263) T cd06159 119 VGVVVHELSHGI 130 (263) T ss_pred HHHHHHHHHHHH T ss_conf 999999888899 No 101 >pfam01742 Peptidase_M27 Clostridial neurotoxin zinc protease. These toxins are zinc proteases that block neurotransmitter release by proteolytic cleavage of synaptic proteins such as synaptobrevins, syntaxin and SNAP-25. Probab=27.23 E-value=19 Score=14.08 Aligned_cols=13 Identities=23% Similarity=0.215 Sum_probs=5.2 Q ss_pred CCCCCCCCHHHHH Q ss_conf 1455422114467 Q gi|254780384|r 415 FGSDRGAAARHYI 427 (665) Q Consensus 415 ~g~d~~~al~HYi 427 (665) |-.|....|.|=+ T Consensus 210 F~~DPAl~LmheL 222 (408) T pfam01742 210 FYVDPALELMHEL 222 (408) T ss_pred EECCHHHHHHHHH T ss_conf 2238899999999 No 102 >cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner. Probab=26.85 E-value=28 Score=13.11 Aligned_cols=15 Identities=27% Similarity=0.499 Sum_probs=13.3 Q ss_pred EHHHHHHHHHCCCCC Q ss_conf 003234655187888 Q gi|254780384|r 123 NALHEIGHALGLMHP 137 (665) Q Consensus 123 t~iHEIGHALGL~Hp 137 (665) .+||+|..-|||+|- T Consensus 32 ~~iH~~a~~~gLksk 46 (60) T cd02640 32 ALIHQIAQKYGLKSR 46 (60) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999999199221 No 103 >TIGR01338 phycocy_alpha phycocyanin, alpha subunit; InterPro: IPR006246 These sequences in this family represent the phycocyanin alpha subunit. Other, homologous phyobiliproteins of the phycobilisome include phycocyanin alpha chain and the allophycocyanin and phycoerythrin alpha and beta chains. Not included are the closely related phycoerythrocyanin alpha subunit sequences.; GO: 0006118 electron transport, 0015979 photosynthesis, 0030089 phycobilisome. Probab=25.64 E-value=24 Score=13.45 Aligned_cols=70 Identities=19% Similarity=0.267 Sum_probs=45.7 Q ss_pred CCCCCHHHHHCCHH--HHHHHCCCCHHHHHHHHHHHHCCCCC-CCCCCCCHHHHHHHCCCHHHHHHCCHHHH--HHHHHH Q ss_conf 65440111202321--66651675002477888899754046-88631176887673810777760651347--886887 Q gi|254780384|r 522 DAFKALEYIASHED--LINAFADAPDLVQAGKDHFSNFGYEE-GRSITFNPDLYLTSYDDLWNVFGSDRSAA--AQHYIT 596 (665) Q Consensus 522 ~~Fda~~Ylasn~D--L~~a~G~d~~~~~aa~~HYi~~G~~E-GR~~~Fda~~YLa~y~Dl~~~fg~d~~~a--~~Hyi~ 596 (665) ..-|.-+||.--.= +.+ + +..+|.|+-.|..| .|++...|+||.++-.=+++..|-.+..+ ..-|+. T Consensus 83 C~rdiGyylr~~ty~~~~G--~------tGPld~yl~~G~~e~nr~f~lsPsWy~eal~yik~~hGlsG~~a~ean~y~d 154 (161) T TIGR01338 83 CVRDIGYYLRMVTYALVVG--G------TGPLDEYLVAGLEEINRAFDLSPSWYVEALKYIKANHGLSGQAAVEANSYLD 154 (161) T ss_pred HHHHHHHHHHHHHHHHHHC--C------CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 5677789999999998826--8------8744789998799986653267268999999998614888612456778999 Q ss_pred HHH Q ss_conf 356 Q gi|254780384|r 597 HGM 599 (665) Q Consensus 597 ~G~ 599 (665) +-+ T Consensus 155 y~i 157 (161) T TIGR01338 155 YAI 157 (161) T ss_pred HHH T ss_conf 998 No 104 >COG2738 Predicted Zn-dependent protease [General function prediction only] Probab=25.35 E-value=24 Score=13.43 Aligned_cols=83 Identities=22% Similarity=0.181 Sum_probs=39.6 Q ss_pred HHHHHHHHHHHHHHHHHCCEEEEECCCCCEEEECCCCEEEEECCCCCCCCCCCCCCC-------CCCCCCCCCCCCEEEE Q ss_conf 999999999998865747568982478716896158535663146775312244335-------4433366666734663 Q gi|254780384|r 50 NAQEVARWALGEWSKVVDLTFEETSIHSDIKFISSNNGYVCTPRYDSRYFMEINFDK-------ADVWRYGIGKGTILSH 122 (665) Q Consensus 50 ~~~~~~R~Al~~ws~va~ItF~Ev~~~~d~~f~~~~~g~a~~p~~~~~~~~~~~~~~-------~~~~~~~~~~g~y~~~ 122 (665) -.|..++..++.||.|..-.=- +...-.+++.+++|-...|-..........|+. .+..-.++... +-- T Consensus 19 wAQ~kvks~y~KySkV~~s~g~--TGaevAr~iLd~nGl~dV~Ve~v~G~LTDHYDP~~kvvrLSe~~y~g~Sia--~~a 94 (226) T COG2738 19 WAQSKVKSTYKKYSKVRASSGL--TGAEVARMILDENGLYDVPVEEVPGTLTDHYDPRRKVVRLSEANYYGPSIA--AIA 94 (226) T ss_pred HHHHHHHHHHHHHHHCCCCCCC--CHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHEEECCCCCCCCCCHH--HHH T ss_conf 9999999889987631676898--479999999855587665046406876566680451256452014786388--898 Q ss_pred EHHHHHHHHHCCCC Q ss_conf 00323465518788 Q gi|254780384|r 123 NALHEIGHALGLMH 136 (665) Q Consensus 123 t~iHEIGHALGL~H 136 (665) ..-||.|||+-=+| T Consensus 95 VAAHEVGHAiQd~~ 108 (226) T COG2738 95 VAAHEVGHAIQDQE 108 (226) T ss_pred HHHHHHHHHHHHHC T ss_conf 99987627776220 No 105 >COG3793 TerB Tellurite resistance protein [Inorganic ion transport and metabolism] Probab=24.86 E-value=32 Score=12.71 Aligned_cols=10 Identities=70% Similarity=0.943 Sum_probs=3.7 Q ss_pred HHHHHHHHCC Q ss_conf 3234655187 Q gi|254780384|r 125 LHEIGHALGL 134 (665) Q Consensus 125 iHEIGHALGL 134 (665) +|||-|+||| T Consensus 128 l~eI~~aLGL 137 (144) T COG3793 128 LREIAGALGL 137 (144) T ss_pred HHHHHHHHCC T ss_conf 9999999388 No 106 >pfam07108 PipA PipA protein. This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis. Probab=24.61 E-value=28 Score=13.04 Aligned_cols=14 Identities=21% Similarity=0.396 Sum_probs=8.2 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 51347886887356 Q gi|254780384|r 586 DRSAAAQHYITHGM 599 (665) Q Consensus 586 d~~~a~~Hyi~~G~ 599 (665) +...++.|-|.+-+ T Consensus 175 dt~RsFiHEiVHAL 188 (228) T pfam07108 175 DTKRSFIHEVVHAL 188 (228) T ss_pred HHHHHHHHHHHHHH T ss_conf 67999999999998 No 107 >pfam02102 Peptidase_M35 Deuterolysin metalloprotease (M35) family. Probab=23.76 E-value=30 Score=12.88 Aligned_cols=28 Identities=11% Similarity=0.459 Sum_probs=13.4 Q ss_pred HCCCCCCCCCCCCHHHHHHHCCCHHHHHHCC Q ss_conf 7540468863117688767381077776065 Q gi|254780384|r 556 NFGYEEGRSITFNPDLYLTSYDDLWNVFGSD 586 (665) Q Consensus 556 ~~G~~EGR~~~Fda~~YLa~y~Dl~~~fg~d 586 (665) .||+..-|.++ ++ ..-+|+|-=..|++. T Consensus 320 aYGy~~~~~Ls--a~-qAlnNADsYalfAna 347 (352) T pfam02102 320 GYGYDAATALS--AS-QALNNADSYALYANA 347 (352) T ss_pred CCCHHHHHCCC--HH-HHHHCHHHHHHHHHH T ss_conf 33588883259--88-876581769999868 No 108 >TIGR00962 atpA ATP synthase F1, alpha subunit; InterPro: IPR005294 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis . These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents the alpha subunit found in the F1 complex of F-ATPases. In F-ATPases, there are three copies each of the alpha and beta subunits that form the catalytic core of the F1 complex, while the remaining F1 subunits (gamma, delta, epsilon) form part of the stalks. There is a substrate-binding site on each of the alpha and beta subunits, those on the beta subunits being catalytic, while those on the alpha subunits are regulatory. The alpha-subunit contains a highly conserved adenine-specific non-catalytic nucleotide-binding domain, with a conserved amino acid sequence of Gly-X-X-X-X-Gly-Lys. The alpha and beta subunits form a cylinder that is attached to the central stalk. The alpha/beta subunits undergo a sequence of conformational changes leading to the formation of ATP from ADP, which are induced by the rotation of the gamma subunit, itself is driven by the movement of protons through the F0 complex C subunit . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016469 proton-transporting two-sector ATPase complex. Probab=23.25 E-value=38 Score=12.24 Aligned_cols=30 Identities=30% Similarity=0.464 Sum_probs=17.9 Q ss_pred HHHHHCCCHHHH--HHCCHHHHHHHHHHHHHH Q ss_conf 876738107777--606513478868873564 Q gi|254780384|r 571 LYLTSYDDLWNV--FGSDRSAAAQHYITHGMK 600 (665) Q Consensus 571 ~YLa~y~Dl~~~--fg~d~~~a~~Hyi~~G~~ 600 (665) +-||.|-.|-++ ||+|.+.|.+.=|.+|.| T Consensus 402 LeLAQYRELeAFsQFaSDLD~ATk~qLerG~R 433 (520) T TIGR00962 402 LELAQYRELEAFSQFASDLDEATKAQLERGKR 433 (520) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 55754889988864012234899999860672 No 109 >PRK13262 ureE urease accessory protein UreE; Provisional Probab=22.97 E-value=31 Score=12.82 Aligned_cols=16 Identities=44% Similarity=0.584 Sum_probs=11.9 Q ss_pred HHHHHHHHHCCCCCCC Q ss_conf 0323465518788887 Q gi|254780384|r 124 ALHEIGHALGLMHPGS 139 (665) Q Consensus 124 ~iHEIGHALGL~Hp~~ 139 (665) .--|+|||||=.|=-- T Consensus 105 tafELGHalGNQHW~a 120 (231) T PRK13262 105 TCFELGHALGNQHWKA 120 (231) T ss_pred HHHHHHHHHHCCCHHH T ss_conf 9999756650232165 No 110 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=21.68 E-value=31 Score=12.79 Aligned_cols=12 Identities=42% Similarity=0.515 Sum_probs=6.8 Q ss_pred EEHHHHHHHHHC Q ss_conf 300323465518 Q gi|254780384|r 122 HNALHEIGHALG 133 (665) Q Consensus 122 ~t~iHEIGHALG 133 (665) -++-|||||-.| T Consensus 132 gViAHEigHv~q 143 (484) T COG4783 132 GVIAHEIGHVAQ 143 (484) T ss_pred HHHHHHHHHHHH T ss_conf 999999988741 No 111 >pfam09954 DUF2188 Uncharacterized protein conserved in bacteria (DUF2188). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=21.41 E-value=40 Score=12.14 Aligned_cols=22 Identities=23% Similarity=0.422 Sum_probs=13.1 Q ss_pred CCCEEEECCCC-----CCCCCCCCCCH Q ss_conf 75238801478-----86632222201 Q gi|254780384|r 635 KKVDLYTDYFG-----DGNYYDSDLYS 656 (665) Q Consensus 635 ~~g~l~~d~dg-----~g~~~~~~~~~ 656 (665) +...+||-.|| .|+.-.|+.+. T Consensus 87 ~~~hV~~edd~W~VkseGAkrAS~tfd 113 (131) T pfam09954 87 KAEHVYKHDDGWAVKSEGAKRASEVFD 113 (131) T ss_pred CCEEEEECCCCEEEECCCHHHHHHHHH T ss_conf 661688628936784044466664231 No 112 >COG4744 Uncharacterized conserved protein [Function unknown] Probab=20.95 E-value=18 Score=14.23 Aligned_cols=41 Identities=27% Similarity=0.397 Sum_probs=33.4 Q ss_pred CEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHHCCCC Q ss_conf 02446752356776666552000211221489999860876 Q gi|254780384|r 158 FLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKIYGAP 198 (665) Q Consensus 158 ~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~lYGan 198 (665) ...-||||.-+.....+.+-....+||..|..++.+.||-. T Consensus 40 LI~lv~Sy~lp~~lss~~~~~~v~npgeadmk~v~kd~Gqe 80 (121) T COG4744 40 LIALVMSYHLPELLSSNEDITIVKNPGEADMKIVIKDYGQE 80 (121) T ss_pred HHHHHHHCCCCCCCCCCCCCEEEECCCCCCEEEEEHHCCCC T ss_conf 99998751774456888871577367642206672422916 Done!