Query psy12290
Match_columns 530
No_of_seqs 307 out of 1762
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 23:33:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12290.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12290hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565|consensus 100.0 1.3E-53 2.7E-58 458.3 18.9 270 2-428 197-469 (469)
2 cd00094 HX Hemopexin-like repe 100.0 4.9E-48 1.1E-52 371.7 21.8 192 232-427 1-194 (194)
3 cd00094 HX Hemopexin-like repe 100.0 7.3E-33 1.6E-37 266.5 16.3 165 286-471 8-175 (194)
4 KOG1565|consensus 99.8 9.2E-19 2E-23 189.2 11.3 135 286-427 291-427 (469)
5 PF00045 Hemopexin: Hemopexin; 99.3 3.1E-12 6.8E-17 95.7 3.5 45 334-381 1-45 (45)
6 PF00413 Peptidase_M10: Matrix 99.2 3.1E-12 6.7E-17 117.4 2.0 53 14-82 100-153 (154)
7 cd04278 ZnMc_MMP Zinc-dependen 99.2 8.7E-12 1.9E-16 116.2 4.6 63 3-82 93-156 (157)
8 smart00120 HX Hemopexin-like r 99.1 6.8E-11 1.5E-15 87.5 5.1 44 334-380 1-44 (45)
9 cd04279 ZnMc_MMP_like_1 Zinc-d 99.0 9.7E-11 2.1E-15 109.0 3.4 53 16-82 101-155 (156)
10 PF00045 Hemopexin: Hemopexin; 99.0 2.1E-10 4.5E-15 85.8 4.4 44 239-282 1-45 (45)
11 smart00120 HX Hemopexin-like r 98.8 1E-08 2.2E-13 75.8 4.7 44 383-428 1-44 (45)
12 cd04268 ZnMc_MMP_like Zinc-dep 98.6 3.2E-08 6.9E-13 92.0 2.9 56 18-82 93-165 (165)
13 smart00235 ZnMc Zinc-dependent 98.0 2.2E-06 4.7E-11 78.1 0.7 48 20-82 87-138 (140)
14 cd00203 ZnMc Zinc-dependent me 97.8 1.6E-05 3.4E-10 73.8 3.1 52 18-82 95-167 (167)
15 cd04277 ZnMc_serralysin_like Z 97.5 7.6E-05 1.7E-09 71.4 3.5 24 16-41 110-134 (186)
16 PF12388 Peptidase_M57: Dual-a 96.8 0.00075 1.6E-08 66.7 2.5 44 25-82 137-211 (211)
17 cd04276 ZnMc_MMP_like_2 Zinc-d 95.6 0.0098 2.1E-07 58.4 3.3 27 19-48 116-143 (197)
18 cd04267 ZnMc_ADAM_like Zinc-de 94.4 0.03 6.5E-07 53.6 3.0 31 20-50 132-173 (192)
19 PF05572 Peptidase_M43: Pregna 94.4 0.012 2.7E-07 55.4 0.3 18 24-41 72-90 (154)
20 cd04327 ZnMc_MMP_like_3 Zinc-d 94.2 0.018 3.9E-07 56.0 0.9 25 18-44 91-116 (198)
21 TIGR03548 mutarot_permut cycli 93.6 2.8 6.2E-05 43.2 15.8 144 241-400 67-232 (323)
22 PHA03098 kelch-like protein; P 93.5 2.6 5.6E-05 46.6 16.3 143 241-411 337-514 (534)
23 PLN02153 epithiospecifier prot 93.2 3.3 7.2E-05 43.1 15.7 15 241-255 80-94 (341)
24 COG1913 Predicted Zn-dependent 93.0 0.049 1.1E-06 52.5 1.6 25 22-46 125-150 (181)
25 cd04273 ZnMc_ADAMTS_like Zinc- 92.9 0.055 1.2E-06 52.8 1.9 49 21-82 140-198 (207)
26 PF13574 Reprolysin_2: Metallo 92.7 0.034 7.4E-07 53.1 0.1 21 21-41 111-132 (173)
27 PLN02153 epithiospecifier prot 92.5 5.1 0.00011 41.7 16.0 15 241-255 27-41 (341)
28 TIGR03548 mutarot_permut cycli 92.0 3.6 7.7E-05 42.5 14.0 59 242-310 119-192 (323)
29 PRK13267 archaemetzincin-like 91.9 0.069 1.5E-06 51.8 1.2 22 25-46 129-151 (179)
30 PF13583 Reprolysin_4: Metallo 91.6 0.073 1.6E-06 52.4 0.9 19 19-39 137-156 (206)
31 PRK14131 N-acetylneuraminic ac 91.3 2.7 5.9E-05 44.6 12.6 84 273-363 18-117 (376)
32 cd04271 ZnMc_ADAM_fungal Zinc- 91.3 0.14 3E-06 51.3 2.6 44 24-82 148-217 (228)
33 cd04275 ZnMc_pappalysin_like Z 91.3 0.046 1E-06 54.7 -0.8 25 16-42 134-159 (225)
34 PLN02193 nitrile-specifier pro 91.0 6.5 0.00014 43.3 15.4 151 241-413 170-356 (470)
35 PHA03098 kelch-like protein; P 90.9 8.8 0.00019 42.4 16.5 146 241-412 289-468 (534)
36 TIGR03547 muta_rot_YjhT mutatr 90.9 7.9 0.00017 40.2 15.3 55 242-304 13-72 (346)
37 PHA02713 hypothetical protein; 90.7 6.1 0.00013 44.6 15.3 54 241-304 298-360 (557)
38 cd04269 ZnMc_adamalysin_II_lik 90.7 0.14 2.9E-06 49.3 1.8 29 22-50 132-167 (194)
39 PF13688 Reprolysin_5: Metallo 90.4 0.088 1.9E-06 50.4 0.3 22 21-42 142-164 (196)
40 PRK14131 N-acetylneuraminic ac 90.3 9.4 0.0002 40.5 15.5 22 241-262 33-57 (376)
41 TIGR03547 muta_rot_YjhT mutatr 90.0 5.8 0.00013 41.2 13.4 103 278-399 2-126 (346)
42 PF07998 Peptidase_M54: Peptid 89.8 0.11 2.3E-06 51.1 0.3 21 25-46 149-170 (194)
43 cd04270 ZnMc_TACE_like Zinc-de 89.1 0.18 4E-06 50.8 1.4 49 23-82 169-229 (244)
44 PLN02193 nitrile-specifier pro 89.1 17 0.00037 40.0 16.8 51 343-400 328-386 (470)
45 cd04272 ZnMc_salivary_gland_MP 87.5 0.52 1.1E-05 46.5 3.4 19 19-37 143-162 (220)
46 cd04280 ZnMc_astacin_like Zinc 86.9 0.23 5E-06 47.8 0.6 20 20-41 75-95 (180)
47 cd04283 ZnMc_hatching_enzyme Z 84.4 0.37 8.1E-06 46.8 0.7 19 20-40 78-97 (182)
48 COG5549 Predicted Zn-dependent 84.2 0.48 1E-05 47.1 1.3 48 15-82 183-232 (236)
49 PF01400 Astacin: Astacin (Pep 83.2 0.41 8.8E-06 46.5 0.3 20 22-41 80-100 (191)
50 PF02031 Peptidase_M7: Strepto 82.5 0.37 8E-06 44.4 -0.3 55 14-82 72-129 (132)
51 COG3824 Predicted Zn-dependent 81.6 0.55 1.2E-05 42.9 0.5 11 25-35 113-124 (136)
52 PF01421 Reprolysin: Reprolysi 81.2 0.53 1.1E-05 45.5 0.3 25 25-49 135-168 (199)
53 cd04281 ZnMc_BMP1_TLD Zinc-dep 80.9 0.58 1.2E-05 46.2 0.5 19 20-40 88-107 (200)
54 cd04282 ZnMc_meprin Zinc-depen 80.3 0.7 1.5E-05 46.6 0.8 18 23-40 122-140 (230)
55 PHA02713 hypothetical protein; 80.1 33 0.00071 38.8 14.0 54 241-304 346-407 (557)
56 PF10462 Peptidase_M66: Peptid 78.9 0.78 1.7E-05 48.1 0.7 14 25-38 197-211 (305)
57 KOG3658|consensus 77.7 1.5 3.3E-05 50.1 2.5 63 25-92 396-471 (764)
58 PF05548 Peptidase_M11: Gameto 74.3 1.3 2.8E-05 46.6 0.8 17 23-39 152-169 (314)
59 KOG4441|consensus 70.0 2E+02 0.0044 32.8 18.9 141 239-401 325-485 (571)
60 KOG3714|consensus 68.8 1.8 4E-05 47.1 0.5 19 20-40 160-179 (411)
61 PF12044 Metallopep: Putative 66.3 2.9 6.2E-05 45.9 1.3 24 25-50 320-344 (423)
62 PHA02790 Kelch-like protein; P 64.3 2.2E+02 0.0047 31.5 15.5 133 243-407 268-422 (480)
63 PHA02790 Kelch-like protein; P 62.9 2.5E+02 0.0053 31.1 15.6 49 342-400 406-455 (480)
64 TIGR00181 pepF oligoendopeptid 51.5 5.7 0.00012 44.9 0.5 15 18-32 375-389 (591)
65 PF06262 DUF1025: Possibl zinc 50.5 5.8 0.00013 34.9 0.3 11 25-35 77-88 (97)
66 PF08450 SGL: SMP-30/Gluconola 50.3 2.2E+02 0.0049 27.7 11.5 147 246-409 51-214 (246)
67 TIGR03296 M6dom_TIGR03296 M6 f 49.1 5.9 0.00013 41.0 0.1 19 22-40 166-185 (286)
68 cd06455 M3A_TOP Peptidase M3 T 48.2 9.8 0.00021 42.0 1.7 16 17-32 259-274 (472)
69 TIGR02290 M3_fam_3 oligoendope 46.8 7.5 0.00016 44.0 0.5 15 18-32 372-386 (587)
70 KOG3607|consensus 45.9 8.5 0.00018 45.0 0.8 17 22-38 324-341 (716)
71 PF09471 Peptidase_M64: IgA Pe 45.5 7.2 0.00016 40.2 0.1 18 19-38 216-235 (264)
72 cd06459 M3B_Oligoendopeptidase 45.3 8.3 0.00018 41.2 0.5 15 18-32 219-233 (427)
73 KOG4693|consensus 44.0 4.2E+02 0.0091 28.1 13.1 77 326-409 176-277 (392)
74 PF06114 DUF955: Domain of unk 43.9 8.2 0.00018 32.8 0.2 14 25-38 46-59 (122)
75 TIGR02289 M3_not_pepF oligoend 42.8 9.4 0.0002 43.0 0.5 15 18-32 334-348 (549)
76 KOG2719|consensus 42.5 3.2 7E-05 45.4 -3.1 27 18-46 279-305 (428)
77 PF01432 Peptidase_M3: Peptida 42.4 10 0.00022 41.3 0.7 17 16-32 237-253 (458)
78 PF04298 Zn_peptidase_2: Putat 39.6 14 0.00031 37.2 1.1 24 14-38 82-105 (222)
79 KOG4441|consensus 38.3 3.8E+02 0.0082 30.6 12.4 137 343-504 332-481 (571)
80 COG2856 Predicted Zn peptidase 38.2 12 0.00025 37.6 0.3 9 24-32 75-83 (213)
81 cd08988 GH43_ABN Glycosyl hydr 37.0 92 0.002 31.6 6.6 65 332-397 118-189 (279)
82 KOG4693|consensus 36.5 2.1E+02 0.0046 30.2 8.9 98 383-493 184-289 (392)
83 COG1164 Oligoendopeptidase F [ 36.5 14 0.00029 42.4 0.5 15 18-32 377-391 (598)
84 cd06456 M3A_DCP_Oligopeptidase 35.1 21 0.00045 39.0 1.7 19 16-34 203-223 (422)
85 cd06258 Peptidase_M3_like The 35.1 21 0.00046 37.5 1.7 16 17-32 150-165 (365)
86 KOG4499|consensus 34.1 2.3E+02 0.0051 29.4 8.7 77 335-413 160-246 (310)
87 PF13964 Kelch_6: Kelch motif 33.3 58 0.0013 24.0 3.4 21 343-363 11-39 (50)
88 cd06457 M3A_MIP Peptidase M3 m 32.4 22 0.00047 39.2 1.3 17 16-32 243-259 (458)
89 TIGR02604 Piru_Ver_Nterm putat 31.9 6.4E+02 0.014 26.7 13.6 93 294-397 81-200 (367)
90 COG5549 Predicted Zn-dependent 31.7 22 0.00048 35.7 1.0 26 26-51 158-184 (236)
91 PF02191 OLF: Olfactomedin-lik 31.3 4.8E+02 0.01 26.6 10.7 72 288-365 24-102 (250)
92 PF05547 Peptidase_M6: Immune 30.4 14 0.00031 42.7 -0.6 19 20-40 222-241 (645)
93 KOG4525|consensus 29.4 31 0.00068 38.1 1.8 26 25-52 305-331 (614)
94 KOG1924|consensus 28.7 4.4E+02 0.0095 31.7 10.6 6 255-260 642-647 (1102)
95 COG2738 Predicted Zn-dependent 28.4 21 0.00047 35.5 0.3 29 19-49 92-124 (226)
96 PF01435 Peptidase_M48: Peptid 28.1 18 0.00039 34.8 -0.2 21 18-40 88-108 (226)
97 KOG0379|consensus 27.1 9E+02 0.02 26.9 14.0 53 343-400 173-233 (482)
98 PF11350 DUF3152: Protein of u 27.0 23 0.00051 35.2 0.3 24 14-40 135-159 (203)
99 PF13398 Peptidase_M50B: Pepti 26.3 22 0.00047 34.9 -0.0 13 18-32 21-33 (200)
100 PF08450 SGL: SMP-30/Gluconola 25.3 4.3E+02 0.0092 25.7 8.9 108 240-361 90-214 (246)
101 cd05709 S2P-M50 Site-2 proteas 24.2 27 0.00059 33.0 0.2 12 19-32 8-19 (180)
102 PF02163 Peptidase_M50: Peptid 23.9 27 0.0006 33.2 0.2 20 20-41 8-29 (192)
103 KOG3538|consensus 23.2 30 0.00066 41.2 0.4 33 19-51 315-354 (845)
104 PRK10911 oligopeptidase A; Pro 22.5 29 0.00063 40.4 0.0 16 19-34 461-478 (680)
105 PF14564 Membrane_bind: Membra 20.3 92 0.002 28.1 2.7 33 333-365 50-83 (110)
No 1
>KOG1565|consensus
Probab=100.00 E-value=1.3e-53 Score=458.27 Aligned_cols=270 Identities=31% Similarity=0.644 Sum_probs=231.6
Q ss_pred CCCCCCCCCCCcccccchhhhhhhhhhhccc-cccCCCCCCCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhc
Q psy12290 2 LDDGWELPEDDKMAIQSLYGMNHSWATYGHW-ALSFKMTTIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQ 80 (530)
Q Consensus 2 ~de~W~~~~~~~~~~~~l~~va~~~HE~GH~-GL~Hs~~~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ 80 (530)
.||.|++. +..|+||++|| +|||||+ ||.||++++|||||+|+... +++.|+ +||+.|||+
T Consensus 197 ~dE~Wt~~---~~~g~~l~~Va--~HEiGH~LGL~HS~~~~aiM~P~y~~~~-------~~~~L~------~DDv~giq~ 258 (469)
T KOG1565|consen 197 KDETWTYG---DSNGVDLFLVA--AHEIGHALGLGHSSDPDAIMYPFYQPDS-------GNFDLS------QDDVRGIQH 258 (469)
T ss_pred cccceecc---CCccchhHHHh--hhhcccccccCCCCCcccccccccccCC-------CCcccC------hhhhhhhHH
Confidence 68999996 35699999999 9999999 99999999999999999732 258888 999999999
Q ss_pred cccccccccccccccccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy12290 81 PFRERTLSICDRKEEKSWAVIPRYSPSPDLTTRRPEVPRPRHPTHHHPEHHPTPESPRREPPPRHPNQYPETPQDSPRRE 160 (530)
Q Consensus 81 ly~~~~~~~~G~~~~~~~~~~~~~~~~~~~tt~~p~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (530)
|| |.... +. .+ +.
T Consensus 259 lY--------G~~~~--~~------~~-----------------------------~~---------------------- 271 (469)
T KOG1565|consen 259 LY--------GGPPR--DY------FM-----------------------------PS---------------------- 271 (469)
T ss_pred Hh--------CCCcc--cc------cc-----------------------------cc----------------------
Confidence 99 74210 00 00 00
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCcce
Q psy12290 161 PSLRQPNQYPETPHEPRRDSPPRHPNQFPETPRKPPARYPDIARAEPRTTPTTTTPPATRRGESNPGPSDKPDTCDTSYD 240 (530)
Q Consensus 161 p~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~~~~~p~~t~~~~~~~~~~~~pd~C~~~fD 240 (530)
.+ ..+. + . ++...+..|+..||
T Consensus 272 -------------------------~~--~~~~-p-------------------~-----------~~~~~~~~~s~~fd 293 (469)
T KOG1565|consen 272 -------------------------GQ--RNPT-P-------------------S-----------SSGPEPSRCSGHFD 293 (469)
T ss_pred -------------------------cc--CCCC-C-------------------C-----------CCCCCCCcCCCCcc
Confidence 00 0000 0 0 00123577888999
Q ss_pred eeEEeC-CEEEEEECCeEEEEcCCCccCCCceecccccccCCCCCccccceEEeeCCcEEEEEceeEEEecceecCCCCC
Q psy12290 241 AIAVIR-TEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPHNLTHIDAVYQRIDNHIAFFIGRQYFLFEGIRLLPGYP 319 (530)
Q Consensus 241 Avt~ir-GeiyfFKG~~~WR~~~~~~~~gyP~~Is~~WpGLP~~~~~IDAAye~~~gktyFFKG~~yW~y~~~~l~~GYP 319 (530)
||++++ |++||||+++|||+.... +.++|.+|+++|+|||.. |||||+ .++++|||||++||+|.+.++..|||
T Consensus 294 av~~~~~g~~~ffk~~~~Wr~~~~~-~~~~p~~I~~~w~glP~~---vdAa~e-~~~~~~ffkg~~~w~~~~~~~~~gyp 368 (469)
T KOG1565|consen 294 AVTTDRHGETYFFKGRHFWRLTTSG-DGREPRPISRFWPGLPGA---VDAAYE-WDGKIYFFKGAQFWSYAGYTLVPGYP 368 (469)
T ss_pred eEeeccccceEEecCceeEeeeccC-cCCCcccHhhcCCCCcch---hhhhhh-ccCcEEEecCCeeEEEecccccCCCC
Confidence 999987 999999999999999887 778999999999999986 999999 89999999999999999999999999
Q ss_pred ccccc-CCCCCcccccccEEEeccCCeEEEeeCCeEEEEeCCCCcccCCCCcccccCCCCCCCCCCEEEecCCCcEEEEE
Q psy12290 320 RPLTA-LGLPASLERIDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVGVDIDDAFQWKDGATYFFK 398 (530)
Q Consensus 320 K~Is~-~GlP~~~~~IDAAf~~~~ngktYFFkG~~YWryde~~~~vd~gYPk~Is~~w~GvP~~IDAAf~~~~G~~YFFk 398 (530)
|.|.+ .|++..+. ||||+.|..++++|||+|++|||||+..+.|+.+||+.|. .|+|+ +||+++ .++.+|||+
T Consensus 369 r~l~~~~~~~~~~~-idaa~~~~~~~kt~ff~g~~~wr~d~~~~~~d~~~p~~~~-~~~gi---~~a~~~-~~~~~Yff~ 442 (469)
T KOG1565|consen 369 KKLYRSVGFPPRGR-IDAARFWPANGKTYFFSGRRYWRYDEKSRAMDWGYPKWIH-KFPGI---GDASFS-NGGSVYFFS 442 (469)
T ss_pred cchhhccccCCCCc-eeeeEeccCCCceEEEecceeeeehhhhcccccCCCcchh-hccCc---cchhhh-cCceEEEEe
Confidence 99998 46776656 9999999999999999999999999999999999999999 99999 999999 667999999
Q ss_pred CCeEEEEecccceeeccCCCcccccccCCC
Q psy12290 399 GKGFWKFKDVSMRVEKEKPSPSAQFWMKCP 428 (530)
Q Consensus 399 G~~Ywrfd~~~~~V~~~yPr~ia~~W~gCp 428 (530)
|++||+||....+|+.+||+++ ||+|+
T Consensus 443 g~~y~~~d~~~~~v~~~~~~~~---wl~C~ 469 (469)
T KOG1565|consen 443 GPKYYRYDDRNLRVEPPYPRNI---WLGCT 469 (469)
T ss_pred CCeeEeeCchHhhccccCCcce---eccCC
Confidence 9999999999999999999987 99996
No 2
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=100.00 E-value=4.9e-48 Score=371.69 Aligned_cols=192 Identities=42% Similarity=0.973 Sum_probs=180.2
Q ss_pred CCCCCC-cceeeEEeCCEEEEEECCeEEEEcCCCccCCCceecccccccCCCCCccccceEEeeC-CcEEEEEceeEEEe
Q psy12290 232 PDTCDT-SYDAIAVIRTEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPHNLTHIDAVYQRID-NHIAFFIGRQYFLF 309 (530)
Q Consensus 232 pd~C~~-~fDAvt~irGeiyfFKG~~~WR~~~~~~~~gyP~~Is~~WpGLP~~~~~IDAAye~~~-gktyFFKG~~yW~y 309 (530)
++.|+. .+|||++++|++|||||++||+++.. ...++|++|++.|++||.. |||||+..+ +++|||||++||+|
T Consensus 1 ~~~C~~~~iDA~~~~~g~~y~FkG~~~w~~~~~-~~~~~p~~I~~~w~~~p~~---IDAa~~~~~~~~~yfFkg~~yw~~ 76 (194)
T cd00094 1 PDACDPLSFDAVTTLRGELYFFKGRYFWRLSPG-KPPGSPFLISSFWPSLPSP---VDAAFERPDTGKIYFFKGDKYWVY 76 (194)
T ss_pred CCcCCCCcCCeEEEeCCEEEEEeCCEEEEEeCC-CCCCCCeEhhhhCCCCCCC---ccEEEEECCCCEEEEECCCEEEEE
Confidence 478988 89999998899999999999999877 5678999999999999975 999999875 99999999999999
Q ss_pred cceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeCCeEEEEeCCCCcccCCCCcccccCCCCCCCCCCEEEec
Q psy12290 310 EGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVGVDIDDAFQW 389 (530)
Q Consensus 310 ~~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG~~YWryde~~~~vd~gYPk~Is~~w~GvP~~IDAAf~~ 389 (530)
++..++.||||.|+++|+|..+++|||||.|+.++++|||||++||+||...++|+.+||+.|++.|+|||.+|||||++
T Consensus 77 ~~~~~~~~~Pk~i~~~~~~~~~~~iDAA~~~~~~~~~yfFkg~~y~ry~~~~~~v~~~yP~~i~~~w~g~p~~idaa~~~ 156 (194)
T cd00094 77 TGKNLEPGYPKPISDLGFPPTVKQIDAALRWPDNGKTYFFKGDKYWRYDEKTQKMDPGYPKLIETDFPGVPDKVDAAFRW 156 (194)
T ss_pred cCcccccCCCcchhhcCCCCCCCCccEEEEEcCCCEEEEEeCCEEEEEeCCCccccCCCCcchhhcCCCcCCCcceeEEe
Confidence 99888999999999888887668999999997789999999999999999989999999999999999999999999999
Q ss_pred CCCcEEEEECCeEEEEecccceeeccCCCcccccccCC
Q psy12290 390 KDGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFWMKC 427 (530)
Q Consensus 390 ~~G~~YFFkG~~Ywrfd~~~~~V~~~yPr~ia~~W~gC 427 (530)
.+|++|||+|++||+||..+++|+.+||+.++..||+|
T Consensus 157 ~~~~~yfF~g~~y~~~d~~~~~~~~~~p~~~~~~w~~C 194 (194)
T cd00094 157 LDGYYYFFKGDQYWRFDPRSKEVRVGYPLKISSDWLGC 194 (194)
T ss_pred CCCcEEEEECCEEEEEeCccceecCCcccCccccccCC
Confidence 88999999999999999999989999999999999999
No 3
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=100.00 E-value=7.3e-33 Score=266.47 Aligned_cols=165 Identities=24% Similarity=0.466 Sum_probs=138.3
Q ss_pred cccceEEeeCCcEEEEEceeEEEecceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeCCeEEEEeCCCCccc
Q psy12290 286 HIDAVYQRIDNHIAFFIGRQYFLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSGTRYWKLDDETGRSE 365 (530)
Q Consensus 286 ~IDAAye~~~gktyFFKG~~yW~y~~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG~~YWryde~~~~vd 365 (530)
.||||++.. |++|||||++||+++.. +..|+|+.|++ ++|+.+++|||||.+..+|++|||||++||+|+... ++
T Consensus 8 ~iDA~~~~~-g~~y~FkG~~~w~~~~~-~~~~~p~~I~~-~w~~~p~~IDAa~~~~~~~~~yfFkg~~yw~~~~~~--~~ 82 (194)
T cd00094 8 SFDAVTTLR-GELYFFKGRYFWRLSPG-KPPGSPFLISS-FWPSLPSPVDAAFERPDTGKIYFFKGDKYWVYTGKN--LE 82 (194)
T ss_pred cCCeEEEeC-CEEEEEeCCEEEEEeCC-CCCCCCeEhhh-hCCCCCCCccEEEEECCCCEEEEECCCEEEEEcCcc--cc
Confidence 499999975 99999999999999987 67899999998 455555789999999777999999999999999875 88
Q ss_pred CCCCcccccCCCCCC--CCCCEEEecC-CCcEEEEECCeEEEEecccceeeccCCCcccccccCCCCCCCCCcccccccc
Q psy12290 366 PDYPRNMTENWRGVG--VDIDDAFQWK-DGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFWMKCPEISSPEDRISERRV 442 (530)
Q Consensus 366 ~gYPk~Is~~w~GvP--~~IDAAf~~~-~G~~YFFkG~~Ywrfd~~~~~V~~~yPr~ia~~W~gCp~~~~~~~~i~~~~~ 442 (530)
.+|||.|++ |.+.+ .+|||||++. +|++|||+|++||+|+....+|+.+||+.+++.|+|||.+++++ ....
T Consensus 83 ~~~Pk~i~~-~~~~~~~~~iDAA~~~~~~~~~yfFkg~~y~ry~~~~~~v~~~yP~~i~~~w~g~p~~idaa---~~~~- 157 (194)
T cd00094 83 PGYPKPISD-LGFPPTVKQIDAALRWPDNGKTYFFKGDKYWRYDEKTQKMDPGYPKLIETDFPGVPDKVDAA---FRWL- 157 (194)
T ss_pred cCCCcchhh-cCCCCCCCCccEEEEEcCCCEEEEEeCCEEEEEeCCCccccCCCCcchhhcCCCcCCCccee---EEeC-
Confidence 999999997 65544 8999999997 78999999999999999999999999999999999999877553 1111
Q ss_pred CceEEEeecCCCceecccccceEEEEecC
Q psy12290 443 GRAFSSRSTSGGLRQGASFKAQFYHVRCS 471 (530)
Q Consensus 443 g~~y~~~~t~~~~~~~~~~~~~~~~~~~~ 471 (530)
++...++| +.+||+....
T Consensus 158 ----------~~~~yfF~-g~~y~~~d~~ 175 (194)
T cd00094 158 ----------DGYYYFFK-GDQYWRFDPR 175 (194)
T ss_pred ----------CCcEEEEE-CCEEEEEeCc
Confidence 13444555 6677877544
No 4
>KOG1565|consensus
Probab=99.77 E-value=9.2e-19 Score=189.20 Aligned_cols=135 Identities=27% Similarity=0.507 Sum_probs=117.8
Q ss_pred cccceEEeeCCcEEEEEceeEEEecceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeCCeEEEEeCCCCccc
Q psy12290 286 HIDAVYQRIDNHIAFFIGRQYFLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSGTRYWKLDDETGRSE 365 (530)
Q Consensus 286 ~IDAAye~~~gktyFFKG~~yW~y~~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG~~YWryde~~~~vd 365 (530)
.+||+.....++++|||+..||++.... +.++|+.|+. .+|+.+..||||++ ..+++|||||++||.|... ++.
T Consensus 291 ~fdav~~~~~g~~~ffk~~~~Wr~~~~~-~~~~p~~I~~-~w~glP~~vdAa~e--~~~~~~ffkg~~~w~~~~~--~~~ 364 (469)
T KOG1565|consen 291 HFDAVTTDRHGETYFFKGRHFWRLTTSG-DGREPRPISR-FWPGLPGAVDAAYE--WDGKIYFFKGAQFWSYAGY--TLV 364 (469)
T ss_pred CcceEeeccccceEEecCceeEeeeccC-cCCCcccHhh-cCCCCcchhhhhhh--ccCcEEEecCCeeEEEecc--ccc
Confidence 5899998777999999999999999877 7799999998 33444458999998 5799999999999999986 578
Q ss_pred CCCCcccccCCCCCCCC-CCEEEecC-CCcEEEEECCeEEEEecccceeeccCCCcccccccCC
Q psy12290 366 PDYPRNMTENWRGVGVD-IDDAFQWK-DGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFWMKC 427 (530)
Q Consensus 366 ~gYPk~Is~~w~GvP~~-IDAAf~~~-~G~~YFFkG~~Ywrfd~~~~~V~~~yPr~ia~~W~gC 427 (530)
.||||.|.+.....+.. ||||+.|. +|++|||+|++||+||.....++.+||+.+. .|.|.
T Consensus 365 ~gypr~l~~~~~~~~~~~idaa~~~~~~~kt~ff~g~~~wr~d~~~~~~d~~~p~~~~-~~~gi 427 (469)
T KOG1565|consen 365 PGYPKKLYRSVGFPPRGRIDAARFWPANGKTYFFSGRRYWRYDEKSRAMDWGYPKWIH-KFPGI 427 (469)
T ss_pred CCCCcchhhccccCCCCceeeeEeccCCCceEEEecceeeeehhhhcccccCCCcchh-hccCc
Confidence 99999999766555543 99999997 6789999999999999999889999999998 88888
No 5
>PF00045 Hemopexin: Hemopexin; InterPro: IPR018487 Hemopexin (3.2.1.35 from EC) is a serum glycoprotein that binds haem and transports it to the liver for breakdown and iron recovery, after which the free hemopexin returns to the circulation []. Hemopexin prevents haem-mediated oxidative stress. Structurally hemopexin consists of two similar halves of approximately two hundred amino acid residues connected by a histidine-rich hinge region. Each half is itself formed by the repetition of a basic unit of some 35 to 45 residues. Hemopexin-like domains have been found in two other types of proteins, vitronectin [], a cell adhesion and spreading factor found in plasma and tissues, and matrixins MMP-1, MMP-2, MMP-3, MMP-9, MMP-10, MMP-11, MMP-12, MMP-14, MMP-15 and MMP-16, members of the matrix metalloproteinase family that cleave extracellular matrix constituents []. These zinc endopeptidases, which belong to MEROPS peptidase subfamily M10A, have a single hemopexin-like domain in their C-terminal section. It is suggested that the hemopexin domain facilitates binding to a variety of molecules and proteins, for example the HX repeats of some matrixins bind tissue inhibitor of metallopeptidases (TIMPs). This entry represents the repeat found in hempoxein and related domains.; PDB: 1SU3_B 2CLT_B 3V6N_B 3C7X_A 1GXD_A 1RTG_A 1CK7_A 1GEN_A 1ITV_A 2JXY_A ....
Probab=99.26 E-value=3.1e-12 Score=95.71 Aligned_cols=45 Identities=44% Similarity=1.139 Sum_probs=41.6
Q ss_pred cccEEEeccCCeEEEeeCCeEEEEeCCCCcccCCCCcccccCCCCCCC
Q psy12290 334 IDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVGV 381 (530)
Q Consensus 334 IDAAf~~~~ngktYFFkG~~YWryde~~~~vd~gYPk~Is~~w~GvP~ 381 (530)
|||||.+.. |++|||||++||+|++. +++.|||+.|++.|+|||.
T Consensus 1 iDAa~~~~~-g~~yfFkg~~ywr~~~~--~~~~gyP~~i~~~w~glp~ 45 (45)
T PF00045_consen 1 IDAAFQWGN-GKTYFFKGNQYWRFDES--KVDPGYPRPISDFWPGLPC 45 (45)
T ss_dssp ESEEEEETT-TEEEEEETTEEEEEETT--EETTCEEEEHHHHSTTSTS
T ss_pred CeEEEEeCC-CcEEEEECCEEEEEcCc--ccCCCcCEEhhhCCCcCCC
Confidence 799999854 99999999999999997 7899999999999999973
No 6
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.22 E-value=3.1e-12 Score=117.43 Aligned_cols=53 Identities=17% Similarity=0.246 Sum_probs=46.5
Q ss_pred ccccchhhhhhhhhhhccc-cccCCCCCCCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhccc
Q psy12290 14 MAIQSLYGMNHSWATYGHW-ALSFKMTTIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQPF 82 (530)
Q Consensus 14 ~~~~~l~~va~~~HE~GH~-GL~Hs~~~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ly 82 (530)
..+.+|+.|+ +|||||+ ||.|+.++.++|+|+|.+.. . ..|+ .+||.+||+||
T Consensus 100 ~~~~~~~~v~--~HEiGHaLGL~H~~~~~svM~~~~~~~~-~-------~~l~------~~Di~~i~~lY 153 (154)
T PF00413_consen 100 DSGNDLQSVA--IHEIGHALGLDHSNDPNSVMYPYYRGPD-N-------KTLS------EDDIDGIQYLY 153 (154)
T ss_dssp SSSEEHHHHH--HHHHHHHTTBESSSSTTSTTSSSCTSSS-S-------TSTT------HHHHHHHHHHH
T ss_pred hhhhhhhhhh--hhccccccCcCcCCCcccceeeecccCC-C-------CCCC------HHHHHHHHHHh
Confidence 3567899999 9999999 99999999999999998753 1 2566 99999999999
No 7
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.21 E-value=8.7e-12 Score=116.23 Aligned_cols=63 Identities=19% Similarity=0.318 Sum_probs=51.9
Q ss_pred CCCCCCCCCCcccccchhhhhhhhhhhccc-cccCCCCCCCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhcc
Q psy12290 3 DDGWELPEDDKMAIQSLYGMNHSWATYGHW-ALSFKMTTIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQP 81 (530)
Q Consensus 3 de~W~~~~~~~~~~~~l~~va~~~HE~GH~-GL~Hs~~~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~l 81 (530)
++.|.+.. ...+.+++.|+ +|||||+ ||.|+.++.+||||+|.+... ...|+ .+|+.+||+|
T Consensus 93 ~~~~~~~~--~~~~~~~~~~~--~HEiGHaLGL~H~~~~~~vM~~~~~~~~~-------~~~l~------~~D~~~~~~l 155 (157)
T cd04278 93 DEQWTLGS--DSGGTDLFSVA--AHEIGHALGLGHSSDPDSIMYPYYQGPVP-------KFKLS------QDDIRGIQAL 155 (157)
T ss_pred CcccccCC--CCccchHHHHH--HHHhccccccCCCCCCcCeecccccCCCc-------CCCCC------HHHHHHHHHh
Confidence 45566643 23578899999 9999999 999999999999999987521 24677 9999999999
Q ss_pred c
Q psy12290 82 F 82 (530)
Q Consensus 82 y 82 (530)
|
T Consensus 156 Y 156 (157)
T cd04278 156 Y 156 (157)
T ss_pred c
Confidence 9
No 8
>smart00120 HX Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs).
Probab=99.13 E-value=6.8e-11 Score=87.53 Aligned_cols=44 Identities=39% Similarity=0.866 Sum_probs=40.6
Q ss_pred cccEEEeccCCeEEEeeCCeEEEEeCCCCcccCCCCcccccCCCCCC
Q psy12290 334 IDAAMVWGYNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVG 380 (530)
Q Consensus 334 IDAAf~~~~ngktYFFkG~~YWryde~~~~vd~gYPk~Is~~w~GvP 380 (530)
||||+.+. +|++|||||++||+|+..+ |+.+||+.|++.|+|+|
T Consensus 1 iDAa~~~~-~~~~yfFkg~~yw~~~~~~--~~~~~p~~i~~~~~~lp 44 (45)
T smart00120 1 IDAAFELR-NGKTYFFKGDKYWRFDPKR--VDPGYPKLISSFFPGLP 44 (45)
T ss_pred CeEEEEeC-CCeEEEEeCCEEEEEcCCc--cCCCCCeEhhhcCCCCC
Confidence 79999986 8899999999999999864 88999999999999997
No 9
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.04 E-value=9.7e-11 Score=108.99 Aligned_cols=53 Identities=17% Similarity=0.176 Sum_probs=45.7
Q ss_pred ccchhhhhhhhhhhccc-cccCCCCCC-CeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhccc
Q psy12290 16 IQSLYGMNHSWATYGHW-ALSFKMTTI-EAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQPF 82 (530)
Q Consensus 16 ~~~l~~va~~~HE~GH~-GL~Hs~~~~-~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ly 82 (530)
+.+|++|| +|||||+ ||.|+.... ++|||++..... .+..|+ .+|+.+|+.||
T Consensus 101 ~~~~~~~~--~HEiGHaLGL~H~~~~~~siM~p~~~~~~~------~~~~l~------~~D~~~i~~lY 155 (156)
T cd04279 101 AENLQAIA--LHELGHALGLWHHSDRPEDAMYPSQGQGPD------GNPTLS------ARDVATLKRLY 155 (156)
T ss_pred chHHHHHH--HHHhhhhhcCCCCCCCccceeeeecccCCc------CcCCCC------HHHHHHHHHHh
Confidence 56899999 9999999 999999999 999999986422 134566 99999999999
No 10
>PF00045 Hemopexin: Hemopexin; InterPro: IPR018487 Hemopexin (3.2.1.35 from EC) is a serum glycoprotein that binds haem and transports it to the liver for breakdown and iron recovery, after which the free hemopexin returns to the circulation []. Hemopexin prevents haem-mediated oxidative stress. Structurally hemopexin consists of two similar halves of approximately two hundred amino acid residues connected by a histidine-rich hinge region. Each half is itself formed by the repetition of a basic unit of some 35 to 45 residues. Hemopexin-like domains have been found in two other types of proteins, vitronectin [], a cell adhesion and spreading factor found in plasma and tissues, and matrixins MMP-1, MMP-2, MMP-3, MMP-9, MMP-10, MMP-11, MMP-12, MMP-14, MMP-15 and MMP-16, members of the matrix metalloproteinase family that cleave extracellular matrix constituents []. These zinc endopeptidases, which belong to MEROPS peptidase subfamily M10A, have a single hemopexin-like domain in their C-terminal section. It is suggested that the hemopexin domain facilitates binding to a variety of molecules and proteins, for example the HX repeats of some matrixins bind tissue inhibitor of metallopeptidases (TIMPs). This entry represents the repeat found in hempoxein and related domains.; PDB: 1SU3_B 2CLT_B 3V6N_B 3C7X_A 1GXD_A 1RTG_A 1CK7_A 1GEN_A 1ITV_A 2JXY_A ....
Probab=99.04 E-value=2.1e-10 Score=85.82 Aligned_cols=44 Identities=36% Similarity=0.714 Sum_probs=41.1
Q ss_pred ceeeEEeC-CEEEEEECCeEEEEcCCCccCCCceecccccccCCC
Q psy12290 239 YDAIAVIR-TEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPH 282 (530)
Q Consensus 239 fDAvt~ir-GeiyfFKG~~~WR~~~~~~~~gyP~~Is~~WpGLP~ 282 (530)
||||++.+ |++|||||++|||++...+..|||++|++.|+|||.
T Consensus 1 iDAa~~~~~g~~yfFkg~~ywr~~~~~~~~gyP~~i~~~w~glp~ 45 (45)
T PF00045_consen 1 IDAAFQWGNGKTYFFKGNQYWRFDESKVDPGYPRPISDFWPGLPC 45 (45)
T ss_dssp ESEEEEETTTEEEEEETTEEEEEETTEETTCEEEEHHHHSTTSTS
T ss_pred CeEEEEeCCCcEEEEECCEEEEEcCcccCCCcCEEhhhCCCcCCC
Confidence 69999987 999999999999999887889999999999999984
No 11
>smart00120 HX Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs).
Probab=98.76 E-value=1e-08 Score=75.77 Aligned_cols=44 Identities=39% Similarity=0.841 Sum_probs=39.1
Q ss_pred CCEEEecCCCcEEEEECCeEEEEecccceeeccCCCcccccccCCC
Q psy12290 383 IDDAFQWKDGATYFFKGKGFWKFKDVSMRVEKEKPSPSAQFWMKCP 428 (530)
Q Consensus 383 IDAAf~~~~G~~YFFkG~~Ywrfd~~~~~V~~~yPr~ia~~W~gCp 428 (530)
||||+.+.+|++|||||++||+|+..+ ++.++|+.++..|.+.|
T Consensus 1 iDAa~~~~~~~~yfFkg~~yw~~~~~~--~~~~~p~~i~~~~~~lp 44 (45)
T smart00120 1 IDAAFELRNGKTYFFKGDKYWRFDPKR--VDPGYPKLISSFFPGLP 44 (45)
T ss_pred CeEEEEeCCCeEEEEeCCEEEEEcCCc--cCCCCCeEhhhcCCCCC
Confidence 799999998899999999999999765 78899999999886654
No 12
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=98.56 E-value=3.2e-08 Score=92.01 Aligned_cols=56 Identities=11% Similarity=-0.112 Sum_probs=41.8
Q ss_pred chhhhhhhhhhhccc-cccCCCC----------------CCCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhc
Q psy12290 18 SLYGMNHSWATYGHW-ALSFKMT----------------TIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQ 80 (530)
Q Consensus 18 ~l~~va~~~HE~GH~-GL~Hs~~----------------~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ 80 (530)
.++.|+ +|||||+ ||.|+.. ..|||+++....... ....-+..++.+||.+||.
T Consensus 93 ~~~~~~--~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~-------~~~~~~~~~~~~Di~ai~~ 163 (165)
T cd04268 93 RLRNTA--EHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQ-------LGDGQKYTIGPYDIAAIKK 163 (165)
T ss_pred HHHHHH--HHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccc-------cccccCCCCCHHHHHHHHh
Confidence 467788 9999999 9999998 899999987642111 1111123455999999999
Q ss_pred cc
Q psy12290 81 PF 82 (530)
Q Consensus 81 ly 82 (530)
||
T Consensus 164 lY 165 (165)
T cd04268 164 LY 165 (165)
T ss_pred cC
Confidence 98
No 13
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=97.95 E-value=2.2e-06 Score=78.13 Aligned_cols=48 Identities=17% Similarity=0.207 Sum_probs=40.6
Q ss_pred hhhhhhhhhhccc-cccCCCCCCC---eeeecccCCCCccccCCceeecCCCCCCChhHHHHhhccc
Q psy12290 20 YGMNHSWATYGHW-ALSFKMTTIE---AEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQPF 82 (530)
Q Consensus 20 ~~va~~~HE~GH~-GL~Hs~~~~~---iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ly 82 (530)
+.|+ +|||||+ ||.|+..+.+ .|++.++.... .|.|+ .+|..+++.+|
T Consensus 87 ~~~~--~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~-------~~~~~------~~~~~~~~~~y 138 (140)
T smart00235 87 TGVA--AHELGHALGLYHEQSRSDRDNYMYINYTNITR-------NFDLS------NDDSLGIPYDY 138 (140)
T ss_pred cccH--HHHHHHHhcCCcCCCCCcccCeEEEehhhhhh-------ccccc------cccCCCchhcc
Confidence 4577 9999999 9999999888 99999986522 37777 88999999999
No 14
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=97.77 E-value=1.6e-05 Score=73.84 Aligned_cols=52 Identities=8% Similarity=-0.174 Sum_probs=38.8
Q ss_pred chhhhhhhhhhhccc-cccCCCC--------------------CCCeeeecccCCCCccccCCceeecCCCCCCChhHHH
Q psy12290 18 SLYGMNHSWATYGHW-ALSFKMT--------------------TIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRI 76 (530)
Q Consensus 18 ~l~~va~~~HE~GH~-GL~Hs~~--------------------~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~ 76 (530)
.+..|+ +|||||+ ||.|... ..+||.+...... ......++ ..|+.
T Consensus 95 ~~~~~~--~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~-----~~~~~~fS------~~d~~ 161 (167)
T cd00203 95 EGAQTI--AHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFS-----DGQRKDFS------QCDID 161 (167)
T ss_pred cchhhH--HHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccC-----cccCCCcC------HHHHH
Confidence 455666 9999999 9999887 4689998765421 01124566 99999
Q ss_pred Hhhccc
Q psy12290 77 GESQPF 82 (530)
Q Consensus 77 gi~~ly 82 (530)
.||++|
T Consensus 162 ~i~~~Y 167 (167)
T cd00203 162 QINKLY 167 (167)
T ss_pred HHHhhC
Confidence 999998
No 15
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=97.49 E-value=7.6e-05 Score=71.45 Aligned_cols=24 Identities=17% Similarity=-0.078 Sum_probs=19.6
Q ss_pred ccchhhhhhhhhhhccc-cccCCCCCC
Q psy12290 16 IQSLYGMNHSWATYGHW-ALSFKMTTI 41 (530)
Q Consensus 16 ~~~l~~va~~~HE~GH~-GL~Hs~~~~ 41 (530)
|...+.|+ +|||||+ ||.|..+..
T Consensus 110 g~~~~~t~--~HEiGHaLGL~H~~~~~ 134 (186)
T cd04277 110 GSYGYQTI--IHEIGHALGLEHPGDYN 134 (186)
T ss_pred ChhhHHHH--HHHHHHHhcCCCCCcCC
Confidence 45567777 9999999 999998653
No 16
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=96.78 E-value=0.00075 Score=66.75 Aligned_cols=44 Identities=11% Similarity=0.103 Sum_probs=33.0
Q ss_pred hhhhhccc-cccCCC------------------------------CCCCeeeecccCCCCccccCCceeecCCCCCCChh
Q psy12290 25 SWATYGHW-ALSFKM------------------------------TTIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKE 73 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~------------------------------~~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~d 73 (530)
|+|||||+ ||.|+. +..|||-.-|..... -.++ ..
T Consensus 137 i~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpGTPt~~d~~SiM~ac~~~~~~--------~~f~------~~ 202 (211)
T PF12388_consen 137 ITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPGTPTGADPNSIMNACFSSGED--------GEFT------SN 202 (211)
T ss_pred HHHHhhhhccccccCcCCcccccccCCcCccccceEECCCCCCCCCCchhhhccccCCCC--------CCcC------hh
Confidence 49999999 999987 455666665543322 2445 99
Q ss_pred HHHHhhccc
Q psy12290 74 DRIGESQPF 82 (530)
Q Consensus 74 D~~gi~~ly 82 (530)
||.+++.||
T Consensus 203 Di~Al~~lY 211 (211)
T PF12388_consen 203 DITALNYLY 211 (211)
T ss_pred hHHHHHhhC
Confidence 999999998
No 17
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=95.59 E-value=0.0098 Score=58.42 Aligned_cols=27 Identities=15% Similarity=-0.012 Sum_probs=20.2
Q ss_pred hhhhhhhhhhhccc-cccCCCCCCCeeeecc
Q psy12290 19 LYGMNHSWATYGHW-ALSFKMTTIEAEFKWF 48 (530)
Q Consensus 19 l~~va~~~HE~GH~-GL~Hs~~~~~iM~p~y 48 (530)
|..++ +||+||+ ||.|.. ..|.|+|.+
T Consensus 116 ~~~~~--~he~gh~lGl~hn~-~~s~~~~~~ 143 (197)
T cd04276 116 LRYLL--AHEVGHTLGLRHNF-KASSDGSNE 143 (197)
T ss_pred HHHHH--HHHHHHHhcCcccc-cccccCchh
Confidence 44455 9999999 999965 566777743
No 18
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=94.40 E-value=0.03 Score=53.65 Aligned_cols=31 Identities=10% Similarity=-0.188 Sum_probs=22.5
Q ss_pred hhhhhhhhhhccc-cccCCCCCC----------CeeeecccC
Q psy12290 20 YGMNHSWATYGHW-ALSFKMTTI----------EAEFKWFDG 50 (530)
Q Consensus 20 ~~va~~~HE~GH~-GL~Hs~~~~----------~iM~p~y~~ 50 (530)
....++|||+||+ ||.|-.... -||.+....
T Consensus 132 ~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~ 173 (192)
T cd04267 132 LTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSG 173 (192)
T ss_pred eehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccC
Confidence 3344569999999 999987632 388876553
No 19
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=94.39 E-value=0.012 Score=55.44 Aligned_cols=18 Identities=22% Similarity=0.305 Sum_probs=10.4
Q ss_pred hhhhhhccc-cccCCCCCC
Q psy12290 24 HSWATYGHW-ALSFKMTTI 41 (530)
Q Consensus 24 ~~~HE~GH~-GL~Hs~~~~ 41 (530)
||+||+||+ ||.|.-...
T Consensus 72 TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 72 TLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp HHHHHHHHHTT---TT---
T ss_pred chhhhhhhhhcccccccCC
Confidence 359999999 999987544
No 20
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=94.16 E-value=0.018 Score=56.01 Aligned_cols=25 Identities=12% Similarity=0.187 Sum_probs=19.6
Q ss_pred chhhhhhhhhhhccc-cccCCCCCCCee
Q psy12290 18 SLYGMNHSWATYGHW-ALSFKMTTIEAE 44 (530)
Q Consensus 18 ~l~~va~~~HE~GH~-GL~Hs~~~~~iM 44 (530)
++++|+ +|||||+ ||.|...+....
T Consensus 91 ~~~~~i--~HElgHaLG~~HEh~rpdrd 116 (198)
T cd04327 91 EFSRVV--LHEFGHALGFIHEHQSPAAN 116 (198)
T ss_pred hHHHHH--HHHHHHHhcCcccccCCCCC
Confidence 345677 9999999 999988765554
No 21
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=93.55 E-value=2.8 Score=43.19 Aligned_cols=144 Identities=15% Similarity=0.169 Sum_probs=68.0
Q ss_pred eeEEeCCEEEEEECC-------eEEEEcCCCccCCCceecccccccCCCCCccccceEEeeCCcEEEEEce-------eE
Q psy12290 241 AIAVIRTEVFVFKDR-------WMWRIGDKGVLPGYPVLIHQFWSELPHNLTHIDAVYQRIDNHIAFFIGR-------QY 306 (530)
Q Consensus 241 Avt~irGeiyfFKG~-------~~WR~~~~~~~~gyP~~Is~~WpGLP~~~~~IDAAye~~~gktyFFKG~-------~y 306 (530)
+++.+++.||+|-|. .+|+++...... ...-...+.||... -.++....++++|+|-|. .+
T Consensus 67 ~~~~~~~~lyviGG~~~~~~~~~v~~~d~~~~~w---~~~~~~~~~lp~~~--~~~~~~~~~~~iYv~GG~~~~~~~~~v 141 (323)
T TIGR03548 67 ASVSVENGIYYIGGSNSSERFSSVYRITLDESKE---ELICETIGNLPFTF--ENGSACYKDGTLYVGGGNRNGKPSNKS 141 (323)
T ss_pred EEEEECCEEEEEcCCCCCCCceeEEEEEEcCCce---eeeeeEcCCCCcCc--cCceEEEECCEEEEEeCcCCCccCceE
Confidence 445568999999873 566665321100 00001123455432 122222247899999773 57
Q ss_pred EEecceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeCC------eEEEEeCCCCcccCCCCcccccCCCCCC
Q psy12290 307 FLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSGT------RYWKLDDETGRSEPDYPRNMTENWRGVG 380 (530)
Q Consensus 307 W~y~~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG~------~YWryde~~~~vd~gYPk~Is~~w~GvP 380 (530)
|+|+-.... + ..+. .+|.. ....++.. ..++++|+|.|. ..++||..+++-..--+ +. -...|
T Consensus 142 ~~yd~~~~~--W-~~~~--~~p~~-~r~~~~~~-~~~~~iYv~GG~~~~~~~~~~~yd~~~~~W~~~~~--~~--~~~~p 210 (323)
T TIGR03548 142 YLFNLETQE--W-FELP--DFPGE-PRVQPVCV-KLQNELYVFGGGSNIAYTDGYKYSPKKNQWQKVAD--PT--TDSEP 210 (323)
T ss_pred EEEcCCCCC--e-eECC--CCCCC-CCCcceEE-EECCEEEEEcCCCCccccceEEEecCCCeeEECCC--CC--CCCCc
Confidence 777642100 0 0011 12211 11223322 246899999873 35788877654221100 00 00122
Q ss_pred CC--CCEEEecCCCcEEEEECC
Q psy12290 381 VD--IDDAFQWKDGATYFFKGK 400 (530)
Q Consensus 381 ~~--IDAAf~~~~G~~YFFkG~ 400 (530)
.. --+++...++++|+|-|.
T Consensus 211 ~~~~~~~~~~~~~~~iyv~GG~ 232 (323)
T TIGR03548 211 ISLLGAASIKINESLLLCIGGF 232 (323)
T ss_pred eeccceeEEEECCCEEEEECCc
Confidence 11 123344446799999763
No 22
>PHA03098 kelch-like protein; Provisional
Probab=93.45 E-value=2.6 Score=46.58 Aligned_cols=143 Identities=14% Similarity=0.127 Sum_probs=71.5
Q ss_pred eeEEeCCEEEEEECCe-------EEEEcCCCccCCCceecccccccCCCC-CccccceEEeeCCcEEEEEc--------e
Q psy12290 241 AIAVIRTEVFVFKDRW-------MWRIGDKGVLPGYPVLIHQFWSELPHN-LTHIDAVYQRIDNHIAFFIG--------R 304 (530)
Q Consensus 241 Avt~irGeiyfFKG~~-------~WR~~~~~~~~gyP~~Is~~WpGLP~~-~~~IDAAye~~~gktyFFKG--------~ 304 (530)
+++.++|.+|+|-|.. +|+++.. ...|..++.- ...-.++....++++|+|-| +
T Consensus 337 ~~~~~~~~lyv~GG~~~~~~~~~v~~yd~~----------~~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~~~~ 406 (534)
T PHA03098 337 GVTVFNNRIYVIGGIYNSISLNTVESWKPG----------ESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLK 406 (534)
T ss_pred eEEEECCEEEEEeCCCCCEecceEEEEcCC----------CCceeeCCCcCcCCccceEEEECCEEEEECCcCCCCcccc
Confidence 4456799999998853 4444422 2234333221 01122333334789999977 3
Q ss_pred eEEEecce--ecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeCC----------eEEEEeCCCCcccCCCCccc
Q psy12290 305 QYFLFEGI--RLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSGT----------RYWKLDDETGRSEPDYPRNM 372 (530)
Q Consensus 305 ~yW~y~~~--~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG~----------~YWryde~~~~vd~gYPk~I 372 (530)
.+++|+-. +-.. +. .+|...... +|+. .++++|+|.|. ..|+||..+++...-
T Consensus 407 ~v~~yd~~t~~W~~-----~~--~~p~~r~~~-~~~~--~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~~W~~~----- 471 (534)
T PHA03098 407 TVECFSLNTNKWSK-----GS--PLPISHYGG-CAIY--HDGKIYVIGGISYIDNIKVYNIVESYNPVTNKWTEL----- 471 (534)
T ss_pred eEEEEeCCCCeeee-----cC--CCCccccCc-eEEE--ECCEEEEECCccCCCCCcccceEEEecCCCCceeeC-----
Confidence 46677632 1111 11 122111112 2332 36899999873 278899876542211
Q ss_pred ccCCCCCCCCCCEEEecCCCcEEEEECC-------eEEEEecccce
Q psy12290 373 TENWRGVGVDIDDAFQWKDGATYFFKGK-------GFWKFKDVSMR 411 (530)
Q Consensus 373 s~~w~GvP~~IDAAf~~~~G~~YFFkG~-------~Ywrfd~~~~~ 411 (530)
.. ...+..--++.. .+|++|+|.|. ..+.||..+.+
T Consensus 472 ~~--~~~~r~~~~~~~-~~~~iyv~GG~~~~~~~~~v~~yd~~~~~ 514 (534)
T PHA03098 472 SS--LNFPRINASLCI-FNNKIYVVGGDKYEYYINEIEVYDDKTNT 514 (534)
T ss_pred CC--CCcccccceEEE-ECCEEEEEcCCcCCcccceeEEEeCCCCE
Confidence 00 011111112233 36899999774 45667665543
No 23
>PLN02153 epithiospecifier protein
Probab=93.15 E-value=3.3 Score=43.11 Aligned_cols=15 Identities=20% Similarity=0.636 Sum_probs=11.2
Q ss_pred eeEEeCCEEEEEECC
Q psy12290 241 AIAVIRTEVFVFKDR 255 (530)
Q Consensus 241 Avt~irGeiyfFKG~ 255 (530)
+++.+++.||+|-|.
T Consensus 80 ~~~~~~~~iyv~GG~ 94 (341)
T PLN02153 80 RMVAVGTKLYIFGGR 94 (341)
T ss_pred EEEEECCEEEEECCC
Confidence 345578999999873
No 24
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=93.01 E-value=0.049 Score=52.53 Aligned_cols=25 Identities=16% Similarity=0.008 Sum_probs=21.9
Q ss_pred hhhhhhhhccc-cccCCCCCCCeeee
Q psy12290 22 MNHSWATYGHW-ALSFKMTTIEAEFK 46 (530)
Q Consensus 22 va~~~HE~GH~-GL~Hs~~~~~iM~p 46 (530)
+-.+.||+||. ||+|-.++.-+|.+
T Consensus 125 ~KEv~HElGH~~GL~HC~N~~CVM~F 150 (181)
T COG1913 125 VKEVLHELGHLLGLSHCPNPRCVMNF 150 (181)
T ss_pred HHHHHHHhhhhcCcccCCCCCcEEeC
Confidence 33469999999 99999999999985
No 25
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=92.87 E-value=0.055 Score=52.84 Aligned_cols=49 Identities=10% Similarity=-0.102 Sum_probs=33.9
Q ss_pred hhhhhhhhhccc-cccCCCC---------CCCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhccc
Q psy12290 21 GMNHSWATYGHW-ALSFKMT---------TIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQPF 82 (530)
Q Consensus 21 ~va~~~HE~GH~-GL~Hs~~---------~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ly 82 (530)
.+.++|||+||. ||.|-.+ ..-||.|....... .+.+| +--+.-++++.
T Consensus 140 ~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~-------~~~fS------~CS~~~~~~~l 198 (207)
T cd04273 140 SAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTG-------PFTWS------KCSRRYLTSFL 198 (207)
T ss_pred eEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCC-------CCCcC------HHHHHHHHHHH
Confidence 334469999999 9999986 37899998765311 24555 66666666555
No 26
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=92.66 E-value=0.034 Score=53.13 Aligned_cols=21 Identities=14% Similarity=0.181 Sum_probs=13.9
Q ss_pred hhhhhhhhhccc-cccCCCCCC
Q psy12290 21 GMNHSWATYGHW-ALSFKMTTI 41 (530)
Q Consensus 21 ~va~~~HE~GH~-GL~Hs~~~~ 41 (530)
.+-++|||+||. |+.|..+..
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~~ 132 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDNG 132 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS-
T ss_pred eeeeehhhhHhhcCCCCCCCCC
Confidence 555569999999 999998764
No 27
>PLN02153 epithiospecifier protein
Probab=92.49 E-value=5.1 Score=41.69 Aligned_cols=15 Identities=27% Similarity=0.576 Sum_probs=11.0
Q ss_pred eeEEeCCEEEEEECC
Q psy12290 241 AIAVIRTEVFVFKDR 255 (530)
Q Consensus 241 Avt~irGeiyfFKG~ 255 (530)
+++.++++||+|-|.
T Consensus 27 ~~~~~~~~iyv~GG~ 41 (341)
T PLN02153 27 GIAVVGDKLYSFGGE 41 (341)
T ss_pred eEEEECCEEEEECCc
Confidence 345578999999774
No 28
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=92.01 E-value=3.6 Score=42.46 Aligned_cols=59 Identities=17% Similarity=0.250 Sum_probs=32.9
Q ss_pred eEEeCCEEEEEECC-------eEEEEcCCCccCCCceecccccccCCCC-C-ccccceEEeeCCcEEEEEce------eE
Q psy12290 242 IAVIRTEVFVFKDR-------WMWRIGDKGVLPGYPVLIHQFWSELPHN-L-THIDAVYQRIDNHIAFFIGR------QY 306 (530)
Q Consensus 242 vt~irGeiyfFKG~-------~~WR~~~~~~~~gyP~~Is~~WpGLP~~-~-~~IDAAye~~~gktyFFKG~------~y 306 (530)
++.+++.||+|-|. .+|+++.. .+.|..++.- . ....++.-..++++|+|-|. ..
T Consensus 119 ~~~~~~~iYv~GG~~~~~~~~~v~~yd~~----------~~~W~~~~~~p~~~r~~~~~~~~~~~iYv~GG~~~~~~~~~ 188 (323)
T TIGR03548 119 ACYKDGTLYVGGGNRNGKPSNKSYLFNLE----------TQEWFELPDFPGEPRVQPVCVKLQNELYVFGGGSNIAYTDG 188 (323)
T ss_pred EEEECCEEEEEeCcCCCccCceEEEEcCC----------CCCeeECCCCCCCCCCcceEEEECCEEEEEcCCCCccccce
Confidence 34578999999874 45666532 2235444321 0 11233333347899999773 24
Q ss_pred EEec
Q psy12290 307 FLFE 310 (530)
Q Consensus 307 W~y~ 310 (530)
++|+
T Consensus 189 ~~yd 192 (323)
T TIGR03548 189 YKYS 192 (323)
T ss_pred EEEe
Confidence 6665
No 29
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=91.94 E-value=0.069 Score=51.78 Aligned_cols=22 Identities=14% Similarity=-0.157 Sum_probs=20.3
Q ss_pred hhhhhccc-cccCCCCCCCeeee
Q psy12290 25 SWATYGHW-ALSFKMTTIEAEFK 46 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~~~~~iM~p 46 (530)
++||+||. ||.|-.+..-+|+.
T Consensus 129 ~~HElGH~lGL~HC~~~~CvM~~ 151 (179)
T PRK13267 129 VTHELGHTLGLEHCDNPRCVMNF 151 (179)
T ss_pred HHHHHHHHcCCccCCCCCccCCC
Confidence 49999999 99999999999983
No 30
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=91.57 E-value=0.073 Score=52.38 Aligned_cols=19 Identities=21% Similarity=0.156 Sum_probs=15.7
Q ss_pred hhhhhhhhhhhccc-cccCCCC
Q psy12290 19 LYGMNHSWATYGHW-ALSFKMT 39 (530)
Q Consensus 19 l~~va~~~HE~GH~-GL~Hs~~ 39 (530)
.+.+. +|||||. ||.|..+
T Consensus 137 ~~~~~--aHEiGH~lGl~H~~~ 156 (206)
T PF13583_consen 137 GYQTF--AHEIGHNLGLRHDFD 156 (206)
T ss_pred cchHH--HHHHHHHhcCCCCcc
Confidence 34444 9999999 9999987
No 31
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=91.34 E-value=2.7 Score=44.56 Aligned_cols=84 Identities=18% Similarity=0.331 Sum_probs=45.0
Q ss_pred cccccccCCCCCccccceEEeeCCcEEEEEce---eEEEecceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEe
Q psy12290 273 IHQFWSELPHNLTHIDAVYQRIDNHIAFFIGR---QYFLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLF 349 (530)
Q Consensus 273 Is~~WpGLP~~~~~IDAAye~~~gktyFFKG~---~yW~y~~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFF 349 (530)
+...++.||..+....|+. .++++|++-|. .+|+|+-......+ ..+. .+|... ....+... .++++|+|
T Consensus 18 ~~~~l~~lP~~~~~~~~~~--~~~~iyv~gG~~~~~~~~~d~~~~~~~W-~~l~--~~p~~~-r~~~~~v~-~~~~IYV~ 90 (376)
T PRK14131 18 NAEQLPDLPVPFKNGTGAI--DNNTVYVGLGSAGTSWYKLDLNAPSKGW-TKIA--AFPGGP-REQAVAAF-IDGKLYVF 90 (376)
T ss_pred ecccCCCCCcCccCCeEEE--ECCEEEEEeCCCCCeEEEEECCCCCCCe-EECC--cCCCCC-cccceEEE-ECCEEEEE
Confidence 4455678886543333333 47899998774 67888742111111 1121 222111 12222221 36899999
Q ss_pred eC-------------CeEEEEeCCCCc
Q psy12290 350 SG-------------TRYWKLDDETGR 363 (530)
Q Consensus 350 kG-------------~~YWryde~~~~ 363 (530)
.| +.+|+||..+++
T Consensus 91 GG~~~~~~~~~~~~~~~v~~YD~~~n~ 117 (376)
T PRK14131 91 GGIGKTNSEGSPQVFDDVYKYDPKTNS 117 (376)
T ss_pred cCCCCCCCCCceeEcccEEEEeCCCCE
Confidence 77 247888877653
No 32
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=91.29 E-value=0.14 Score=51.26 Aligned_cols=44 Identities=9% Similarity=-0.195 Sum_probs=31.5
Q ss_pred hhhhhhccc-cccCCCCC-------------------------CCeeeecccCCCCccccCCceeecCCCCCCChhHHHH
Q psy12290 24 HSWATYGHW-ALSFKMTT-------------------------IEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIG 77 (530)
Q Consensus 24 ~~~HE~GH~-GL~Hs~~~-------------------------~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~g 77 (530)
|||||+||. |+.|-.+. .=||+|.... . .++++ .--+.-
T Consensus 148 t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~-~--------~~~FS------~CS~~~ 212 (228)
T cd04271 148 VFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS-G--------ITEFS------PCTIGN 212 (228)
T ss_pred ehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC-C--------CCccC------hhhHHH
Confidence 679999999 99997542 1299997443 1 24566 777777
Q ss_pred hhccc
Q psy12290 78 ESQPF 82 (530)
Q Consensus 78 i~~ly 82 (530)
|+.+.
T Consensus 213 ~~~~l 217 (228)
T cd04271 213 ICSLL 217 (228)
T ss_pred HHHHH
Confidence 77777
No 33
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=91.28 E-value=0.046 Score=54.74 Aligned_cols=25 Identities=20% Similarity=0.214 Sum_probs=20.7
Q ss_pred ccchhhhhhhhhhhccc-cccCCCCCCC
Q psy12290 16 IQSLYGMNHSWATYGHW-ALSFKMTTIE 42 (530)
Q Consensus 16 ~~~l~~va~~~HE~GH~-GL~Hs~~~~~ 42 (530)
..++-.++ +||+||+ ||.|......
T Consensus 134 ~~n~g~t~--~HEvGH~lGL~HtF~~~~ 159 (225)
T cd04275 134 PYNLGDTA--THEVGHWLGLYHTFQGGS 159 (225)
T ss_pred ccccccee--EEeccceeeeeeeecCCC
Confidence 35666777 9999999 9999987765
No 34
>PLN02193 nitrile-specifier protein
Probab=90.95 E-value=6.5 Score=43.27 Aligned_cols=151 Identities=13% Similarity=0.173 Sum_probs=74.3
Q ss_pred eeEEeCCEEEEEECC---------eEEEEcCCCccCCCceecccccccC------CCCCccccceEEeeCCcEEEEEc--
Q psy12290 241 AIAVIRTEVFVFKDR---------WMWRIGDKGVLPGYPVLIHQFWSEL------PHNLTHIDAVYQRIDNHIAFFIG-- 303 (530)
Q Consensus 241 Avt~irGeiyfFKG~---------~~WR~~~~~~~~gyP~~Is~~WpGL------P~~~~~IDAAye~~~gktyFFKG-- 303 (530)
+++.+++.||+|-|. .+|+++... +.|..+ |.. ....++.-..++++|+|-|
T Consensus 170 ~~~~~~~~iyv~GG~~~~~~~~~~~v~~yD~~~----------~~W~~~~~~g~~P~~-~~~~~~~v~~~~~lYvfGG~~ 238 (470)
T PLN02193 170 GIAQVGNKIYSFGGEFTPNQPIDKHLYVFDLET----------RTWSISPATGDVPHL-SCLGVRMVSIGSTLYVFGGRD 238 (470)
T ss_pred EEEEECCEEEEECCcCCCCCCeeCcEEEEECCC----------CEEEeCCCCCCCCCC-cccceEEEEECCEEEEECCCC
Confidence 445578899999774 367776432 123222 210 0112222224789999987
Q ss_pred -----eeEEEecceecCCCCCcccccCC-CCCcccccccEEEeccCCeEEEeeCC-------eEEEEeCCCCcccCCCCc
Q psy12290 304 -----RQYFLFEGIRLLPGYPRPLTALG-LPASLERIDAAMVWGYNSKTYLFSGT-------RYWKLDDETGRSEPDYPR 370 (530)
Q Consensus 304 -----~~yW~y~~~~l~~GYPK~Is~~G-lP~~~~~IDAAf~~~~ngktYFFkG~-------~YWryde~~~~vd~gYPk 370 (530)
+.+|+|+-.... + ..+..++ .|..... -+|.. .++++|+|-|. ..+.||..+++...--+.
T Consensus 239 ~~~~~ndv~~yD~~t~~--W-~~l~~~~~~P~~R~~-h~~~~--~~~~iYv~GG~~~~~~~~~~~~yd~~t~~W~~~~~~ 312 (470)
T PLN02193 239 ASRQYNGFYSFDTTTNE--W-KLLTPVEEGPTPRSF-HSMAA--DEENVYVFGGVSATARLKTLDSYNIVDKKWFHCSTP 312 (470)
T ss_pred CCCCCccEEEEECCCCE--E-EEcCcCCCCCCCccc-eEEEE--ECCEEEEECCCCCCCCcceEEEEECCCCEEEeCCCC
Confidence 457888742100 0 0011111 1211011 12222 36899999873 367888776543211000
Q ss_pred ccccCCCCCCCCCCEEEecCCCcEEEEEC------CeEEEEecccceee
Q psy12290 371 NMTENWRGVGVDIDDAFQWKDGATYFFKG------KGFWKFKDVSMRVE 413 (530)
Q Consensus 371 ~Is~~w~GvP~~IDAAf~~~~G~~YFFkG------~~Ywrfd~~~~~V~ 413 (530)
... ..+..--++.. .+|++|++-| +..|.||..+++..
T Consensus 313 --~~~--~~~R~~~~~~~-~~gkiyviGG~~g~~~~dv~~yD~~t~~W~ 356 (470)
T PLN02193 313 --GDS--FSIRGGAGLEV-VQGKVWVVYGFNGCEVDDVHYYDPVQDKWT 356 (470)
T ss_pred --CCC--CCCCCCcEEEE-ECCcEEEEECCCCCccCceEEEECCCCEEE
Confidence 000 01111122333 3678999876 56888888776553
No 35
>PHA03098 kelch-like protein; Provisional
Probab=90.92 E-value=8.8 Score=42.43 Aligned_cols=146 Identities=15% Similarity=0.175 Sum_probs=71.5
Q ss_pred eeEEeCCEEEEEECC--------eEEEEcCCCccCCCceecccccccCCCCC-ccccceEEeeCCcEEEEEce-------
Q psy12290 241 AIAVIRTEVFVFKDR--------WMWRIGDKGVLPGYPVLIHQFWSELPHNL-THIDAVYQRIDNHIAFFIGR------- 304 (530)
Q Consensus 241 Avt~irGeiyfFKG~--------~~WR~~~~~~~~gyP~~Is~~WpGLP~~~-~~IDAAye~~~gktyFFKG~------- 304 (530)
+++.+++.+|+|-|. .+|+++.. ..-|..+|.-. ..-.++....++++|+|-|.
T Consensus 289 ~~~~~~~~lyv~GG~~~~~~~~~~v~~yd~~----------~~~W~~~~~~~~~R~~~~~~~~~~~lyv~GG~~~~~~~~ 358 (534)
T PHA03098 289 GSVVLNNVIYFIGGMNKNNLSVNSVVSYDTK----------TKSWNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLN 358 (534)
T ss_pred eEEEECCEEEEECCCcCCCCeeccEEEEeCC----------CCeeeECCCCCcccccceEEEECCEEEEEeCCCCCEecc
Confidence 455678999999773 35555532 22343333210 01222222347899999874
Q ss_pred eEEEecceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeC--------CeEEEEeCCCCcccCCCCcccccCC
Q psy12290 305 QYFLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSG--------TRYWKLDDETGRSEPDYPRNMTENW 376 (530)
Q Consensus 305 ~yW~y~~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG--------~~YWryde~~~~vd~gYPk~Is~~w 376 (530)
.+|+|+-..-. + ..+. .+|.. .-.++.. ..++++|+|.| +..++||..+++....-|-.
T Consensus 359 ~v~~yd~~~~~--W-~~~~--~lp~~--r~~~~~~-~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p----- 425 (534)
T PHA03098 359 TVESWKPGESK--W-REEP--PLIFP--RYNPCVV-NVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLP----- 425 (534)
T ss_pred eEEEEcCCCCc--e-eeCC--CcCcC--CccceEE-EECCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCC-----
Confidence 25555531100 0 0011 12211 1122222 13689999988 45788998765432211110
Q ss_pred CCCCCCCCEEEecCCCcEEEEECCe----------EEEEeccccee
Q psy12290 377 RGVGVDIDDAFQWKDGATYFFKGKG----------FWKFKDVSMRV 412 (530)
Q Consensus 377 ~GvP~~IDAAf~~~~G~~YFFkG~~----------Ywrfd~~~~~V 412 (530)
.+..--+|.. .+|++|++-|.. .++||..+.+.
T Consensus 426 --~~r~~~~~~~-~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~~W 468 (534)
T PHA03098 426 --ISHYGGCAIY-HDGKIYVIGGISYIDNIKVYNIVESYNPVTNKW 468 (534)
T ss_pred --ccccCceEEE-ECCEEEEECCccCCCCCcccceEEEecCCCCce
Confidence 1111123333 367999997732 56777655443
No 36
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=90.85 E-value=7.9 Score=40.20 Aligned_cols=55 Identities=22% Similarity=0.239 Sum_probs=30.6
Q ss_pred eEEeCCEEEEEECC---eEEEEcCCCccCCCceecccccccCCCCC--ccccceEEeeCCcEEEEEce
Q psy12290 242 IAVIRTEVFVFKDR---WMWRIGDKGVLPGYPVLIHQFWSELPHNL--THIDAVYQRIDNHIAFFIGR 304 (530)
Q Consensus 242 vt~irGeiyfFKG~---~~WR~~~~~~~~gyP~~Is~~WpGLP~~~--~~IDAAye~~~gktyFFKG~ 304 (530)
++.+.++||++-|. .+|+++... -.+-|..|+.-- ....++....++++|++-|.
T Consensus 13 ~~~~~~~vyv~GG~~~~~~~~~d~~~--------~~~~W~~l~~~p~~~R~~~~~~~~~~~iYv~GG~ 72 (346)
T TIGR03547 13 GAIIGDKVYVGLGSAGTSWYKLDLKK--------PSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGGI 72 (346)
T ss_pred EEEECCEEEEEccccCCeeEEEECCC--------CCCCceECCCCCCCCcccceEEEECCEEEEEeCC
Confidence 44568999999774 577776321 112344333210 12333333357899999873
No 37
>PHA02713 hypothetical protein; Provisional
Probab=90.75 E-value=6.1 Score=44.57 Aligned_cols=54 Identities=20% Similarity=0.294 Sum_probs=30.2
Q ss_pred eeEEeCCEEEEEECC--------eEEEEcCCCccCCCceecccccccCCCCC-ccccceEEeeCCcEEEEEce
Q psy12290 241 AIAVIRTEVFVFKDR--------WMWRIGDKGVLPGYPVLIHQFWSELPHNL-THIDAVYQRIDNHIAFFIGR 304 (530)
Q Consensus 241 Avt~irGeiyfFKG~--------~~WR~~~~~~~~gyP~~Is~~WpGLP~~~-~~IDAAye~~~gktyFFKG~ 304 (530)
+++.+++.||++-|. .+|+++.. .+.|..+|.-. ..-.++.-..+|++|++-|.
T Consensus 298 ~~a~l~~~IYviGG~~~~~~~~~~v~~Yd~~----------~n~W~~~~~m~~~R~~~~~~~~~g~IYviGG~ 360 (557)
T PHA02713 298 ASAIVDNEIIIAGGYNFNNPSLNKVYKINIE----------NKIHVELPPMIKNRCRFSLAVIDDTIYAIGGQ 360 (557)
T ss_pred EEEEECCEEEEEcCCCCCCCccceEEEEECC----------CCeEeeCCCCcchhhceeEEEECCEEEEECCc
Confidence 456679999999773 24555432 23454444321 11222323347899999874
No 38
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=90.67 E-value=0.14 Score=49.28 Aligned_cols=29 Identities=7% Similarity=-0.151 Sum_probs=21.9
Q ss_pred hhhhhhhhccc-cccCCC------CCCCeeeecccC
Q psy12290 22 MNHSWATYGHW-ALSFKM------TTIEAEFKWFDG 50 (530)
Q Consensus 22 va~~~HE~GH~-GL~Hs~------~~~~iM~p~y~~ 50 (530)
+.++|||+||. ||.|-. ...-||+|....
T Consensus 132 a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~ 167 (194)
T cd04269 132 AVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS 167 (194)
T ss_pred HHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC
Confidence 44579999999 999944 246799987653
No 39
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=90.40 E-value=0.088 Score=50.43 Aligned_cols=22 Identities=14% Similarity=-0.044 Sum_probs=12.2
Q ss_pred hhhhhhhhhccc-cccCCCCCCC
Q psy12290 21 GMNHSWATYGHW-ALSFKMTTIE 42 (530)
Q Consensus 21 ~va~~~HE~GH~-GL~Hs~~~~~ 42 (530)
.+.++||||||. |+.|-.+...
T Consensus 142 ~~~~~AHEiGH~lGa~HD~~~~~ 164 (196)
T PF13688_consen 142 GAITFAHEIGHNLGAPHDGDYSS 164 (196)
T ss_dssp HHHHHHHHHHHHTT-----SSS-
T ss_pred eehhhHHhHHHhcCCCCCCCCCC
Confidence 334469999999 9999877654
No 40
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=90.27 E-value=9.4 Score=40.53 Aligned_cols=22 Identities=14% Similarity=0.069 Sum_probs=16.6
Q ss_pred eeEEeCCEEEEEECC---eEEEEcC
Q psy12290 241 AIAVIRTEVFVFKDR---WMWRIGD 262 (530)
Q Consensus 241 Avt~irGeiyfFKG~---~~WR~~~ 262 (530)
|++.+++.||+|-|. .+|+++-
T Consensus 33 ~~~~~~~~iyv~gG~~~~~~~~~d~ 57 (376)
T PRK14131 33 TGAIDNNTVYVGLGSAGTSWYKLDL 57 (376)
T ss_pred eEEEECCEEEEEeCCCCCeEEEEEC
Confidence 455679999999774 6787764
No 41
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=90.03 E-value=5.8 Score=41.20 Aligned_cols=103 Identities=19% Similarity=0.478 Sum_probs=52.3
Q ss_pred ccCCCCCccccceEEeeCCcEEEEEc---eeEEEecceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeCC--
Q psy12290 278 SELPHNLTHIDAVYQRIDNHIAFFIG---RQYFLFEGIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSGT-- 352 (530)
Q Consensus 278 pGLP~~~~~IDAAye~~~gktyFFKG---~~yW~y~~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG~-- 352 (530)
|.||..+...=||. .++++|++-| +.+|+|+-......+ +.+. .+|.. ....+++.. .++++|++.|.
T Consensus 2 ~~lp~~~~~~~~~~--~~~~vyv~GG~~~~~~~~~d~~~~~~~W-~~l~--~~p~~-~R~~~~~~~-~~~~iYv~GG~~~ 74 (346)
T TIGR03547 2 PDLPVGFKNGTGAI--IGDKVYVGLGSAGTSWYKLDLKKPSKGW-QKIA--DFPGG-PRNQAVAAA-IDGKLYVFGGIGK 74 (346)
T ss_pred CCCCccccCceEEE--ECCEEEEEccccCCeeEEEECCCCCCCc-eECC--CCCCC-CcccceEEE-ECCEEEEEeCCCC
Confidence 56665533222333 3689999877 468888732111111 0111 22211 122333322 36899999873
Q ss_pred -----------eEEEEeCCCCcccCCCCcccccCCCCC----CCCC-C-EEEecCCCcEEEEEC
Q psy12290 353 -----------RYWKLDDETGRSEPDYPRNMTENWRGV----GVDI-D-DAFQWKDGATYFFKG 399 (530)
Q Consensus 353 -----------~YWryde~~~~vd~gYPk~Is~~w~Gv----P~~I-D-AAf~~~~G~~YFFkG 399 (530)
..|+||..+++ |..+ |... . +|+...+|++|++-|
T Consensus 75 ~~~~~~~~~~~~v~~Yd~~~~~------------W~~~~~~~p~~~~~~~~~~~~~g~IYviGG 126 (346)
T TIGR03547 75 ANSEGSPQVFDDVYRYDPKKNS------------WQKLDTRSPVGLLGASGFSLHNGQAYFTGG 126 (346)
T ss_pred CCCCCcceecccEEEEECCCCE------------EecCCCCCCCcccceeEEEEeCCEEEEEcC
Confidence 47888877653 3222 1111 1 223234789999976
No 42
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=89.76 E-value=0.11 Score=51.12 Aligned_cols=21 Identities=14% Similarity=-0.054 Sum_probs=15.1
Q ss_pred hhhhhccc-cccCCCCCCCeeee
Q psy12290 25 SWATYGHW-ALSFKMTTIEAEFK 46 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~~~~~iM~p 46 (530)
++||+||. ||+|=.+ .-+|++
T Consensus 149 a~HElGH~~GL~HC~~-~CvM~~ 170 (194)
T PF07998_consen 149 AVHELGHLFGLDHCEN-RCVMNF 170 (194)
T ss_dssp HHHHHHHHTT----SS-TSTTS-
T ss_pred HHHHHHHHcCCcCCCC-CCccCC
Confidence 49999999 9999999 999987
No 43
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=89.12 E-value=0.18 Score=50.80 Aligned_cols=49 Identities=10% Similarity=0.010 Sum_probs=31.0
Q ss_pred hhhhhhhccc-cccCCCC-----C------CCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhccc
Q psy12290 23 NHSWATYGHW-ALSFKMT-----T------IEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQPF 82 (530)
Q Consensus 23 a~~~HE~GH~-GL~Hs~~-----~------~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ly 82 (530)
.+||||+||. |+.|-.. + .=||+|.-..... .....++ .--+.-|++..
T Consensus 169 ~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~-----~~~~~FS------~CS~~~~~~~L 229 (244)
T cd04270 169 LVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDK-----ENNKKFS------PCSKKSISKVL 229 (244)
T ss_pred HHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCC-----CCCCCCC------HhHHHHHHHHH
Confidence 4589999999 9999765 1 3599987532110 1124455 66666665555
No 44
>PLN02193 nitrile-specifier protein
Probab=89.08 E-value=17 Score=40.01 Aligned_cols=51 Identities=10% Similarity=0.062 Sum_probs=29.4
Q ss_pred CCeEEEeeC------CeEEEEeCCCCcccCCCCcccccCCCCCC--CCCCEEEecCCCcEEEEECC
Q psy12290 343 NSKTYLFSG------TRYWKLDDETGRSEPDYPRNMTENWRGVG--VDIDDAFQWKDGATYFFKGK 400 (530)
Q Consensus 343 ngktYFFkG------~~YWryde~~~~vd~gYPk~Is~~w~GvP--~~IDAAf~~~~G~~YFFkG~ 400 (530)
++++|++-| +..|.||..+++-..- . ....+| ...-+|... ++++|+|-|.
T Consensus 328 ~gkiyviGG~~g~~~~dv~~yD~~t~~W~~~-----~-~~g~~P~~R~~~~~~~~-~~~iyv~GG~ 386 (470)
T PLN02193 328 QGKVWVVYGFNGCEVDDVHYYDPVQDKWTQV-----E-TFGVRPSERSVFASAAV-GKHIVIFGGE 386 (470)
T ss_pred CCcEEEEECCCCCccCceEEEECCCCEEEEe-----c-cCCCCCCCcceeEEEEE-CCEEEEECCc
Confidence 678999877 6689999887543210 0 001122 122233333 5799999873
No 45
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=87.49 E-value=0.52 Score=46.49 Aligned_cols=19 Identities=16% Similarity=0.254 Sum_probs=16.0
Q ss_pred hhhhhhhhhhhccc-cccCC
Q psy12290 19 LYGMNHSWATYGHW-ALSFK 37 (530)
Q Consensus 19 l~~va~~~HE~GH~-GL~Hs 37 (530)
+..+.++|||+||. |+.|-
T Consensus 143 ~~~~~~~AHElGH~lG~~HD 162 (220)
T cd04272 143 YYGVYTMTHELAHLLGAPHD 162 (220)
T ss_pred cccHHHHHHHHHHHhCCCCC
Confidence 45567789999999 99995
No 46
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=86.92 E-value=0.23 Score=47.79 Aligned_cols=20 Identities=10% Similarity=-0.031 Sum_probs=16.1
Q ss_pred hhhhhhhhhhccc-cccCCCCCC
Q psy12290 20 YGMNHSWATYGHW-ALSFKMTTI 41 (530)
Q Consensus 20 ~~va~~~HE~GH~-GL~Hs~~~~ 41 (530)
++++ +|||||+ ||.|...+.
T Consensus 75 ~g~v--~HE~~HalG~~HEh~R~ 95 (180)
T cd04280 75 LGTI--VHELMHALGFYHEQSRP 95 (180)
T ss_pred Cchh--HHHHHHHhcCcchhccc
Confidence 3555 9999999 999987654
No 47
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=84.44 E-value=0.37 Score=46.84 Aligned_cols=19 Identities=16% Similarity=0.030 Sum_probs=15.7
Q ss_pred hhhhhhhhhhccc-cccCCCCC
Q psy12290 20 YGMNHSWATYGHW-ALSFKMTT 40 (530)
Q Consensus 20 ~~va~~~HE~GH~-GL~Hs~~~ 40 (530)
++++ +|||||+ ||.|-..+
T Consensus 78 ~G~i--~HEl~HaLG~~HEhsR 97 (182)
T cd04283 78 KGII--QHELLHALGFYHEQTR 97 (182)
T ss_pred cchH--HHHHHHHhCCcccccc
Confidence 3555 9999999 99998765
No 48
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=84.20 E-value=0.48 Score=47.09 Aligned_cols=48 Identities=10% Similarity=0.012 Sum_probs=36.2
Q ss_pred cccchhhhhhhhhhhccc-ccc-CCCCCCCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhccc
Q psy12290 15 AIQSLYGMNHSWATYGHW-ALS-FKMTTIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQPF 82 (530)
Q Consensus 15 ~~~~l~~va~~~HE~GH~-GL~-Hs~~~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ly 82 (530)
.+-+|...| -||+||+ ||. ||+-+ +.|++..+.. -++ ..|+.-+..+|
T Consensus 183 ~~e~L~~ta--rhElGhaLgi~ghsd~k-s~aly~~s~~-----------~is------~rdv~tL~r~Y 232 (236)
T COG5549 183 LRENLNPTA--RHELGHALGIWGHSDLK-SDALYGISWQ-----------GIS------RRDVKTLERKY 232 (236)
T ss_pred chhhhhHHH--HHhhcchheecccccch-hhhhcchhhc-----------ccC------HHHHHHHHHHh
Confidence 456777888 9999999 998 87754 5666655532 255 99999999999
No 49
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=83.23 E-value=0.41 Score=46.54 Aligned_cols=20 Identities=10% Similarity=0.061 Sum_probs=14.2
Q ss_pred hhhhhhhhccc-cccCCCCCC
Q psy12290 22 MNHSWATYGHW-ALSFKMTTI 41 (530)
Q Consensus 22 va~~~HE~GH~-GL~Hs~~~~ 41 (530)
+++++|||||+ ||.|-..+.
T Consensus 80 ~~~i~HEl~HaLG~~HEh~Rp 100 (191)
T PF01400_consen 80 VGTILHELGHALGFWHEHQRP 100 (191)
T ss_dssp HHHHHHHHHHHHTB--GGGST
T ss_pred ccchHHHHHHHHhhhhhhhcc
Confidence 44459999999 999988664
No 50
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=82.46 E-value=0.37 Score=44.44 Aligned_cols=55 Identities=20% Similarity=0.128 Sum_probs=26.7
Q ss_pred ccccchhhhhhhhhhhccc-ccc-CCCCC-CCeeeecccCCCCccccCCceeecCCCCCCChhHHHHhhccc
Q psy12290 14 MAIQSLYGMNHSWATYGHW-ALS-FKMTT-IEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIGESQPF 82 (530)
Q Consensus 14 ~~~~~l~~va~~~HE~GH~-GL~-Hs~~~-~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~gi~~ly 82 (530)
..+.|-.-|+ +||+||. ||- |-.-+ .+||-= +..+.-|+|. .-+ ..++..+.+||
T Consensus 72 ~qgy~~~RIa--aHE~GHiLGLPD~y~GpCS~LMSG---~~aG~sCtN~---~Pn------a~E~a~V~~~~ 129 (132)
T PF02031_consen 72 NQGYNSTRIA--AHELGHILGLPDHYPGPCSELMSG---GSAGTSCTNA---YPN------AAERARVNSLF 129 (132)
T ss_dssp HHHS-HHHHH--HHHHHHHHT----TTS-TT-GGGT---TTT-TT-------S--------HHHHHHHHHHC
T ss_pred hhCCccceee--eehhccccCCCCCCCCccHHhhcC---CCCCCCCCCC---CCC------HHHHHHHHHHh
Confidence 3456667788 9999999 995 33322 334432 1233446662 223 67777777777
No 51
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=81.60 E-value=0.55 Score=42.87 Aligned_cols=11 Identities=36% Similarity=0.395 Sum_probs=10.0
Q ss_pred hhhhhccc-ccc
Q psy12290 25 SWATYGHW-ALS 35 (530)
Q Consensus 25 ~~HE~GH~-GL~ 35 (530)
++|||||+ ||+
T Consensus 113 liHEIgHhFGLs 124 (136)
T COG3824 113 LIHEIGHHFGLS 124 (136)
T ss_pred hhhhhhhhcCCC
Confidence 59999999 996
No 52
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=81.19 E-value=0.53 Score=45.46 Aligned_cols=25 Identities=8% Similarity=-0.165 Sum_probs=17.4
Q ss_pred hhhhhccc-cccCCCC--------CCCeeeeccc
Q psy12290 25 SWATYGHW-ALSFKMT--------TIEAEFKWFD 49 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~~--------~~~iM~p~y~ 49 (530)
+|||+||. |+.|-.. ..-||.+...
T Consensus 135 ~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~ 168 (199)
T PF01421_consen 135 IAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIG 168 (199)
T ss_dssp HHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSS
T ss_pred HHHHHHHhcCCCCCCCCcCCcCCCCCcccccccc
Confidence 69999999 9999877 3668888544
No 53
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=80.94 E-value=0.58 Score=46.20 Aligned_cols=19 Identities=21% Similarity=0.349 Sum_probs=15.5
Q ss_pred hhhhhhhhhhccc-cccCCCCC
Q psy12290 20 YGMNHSWATYGHW-ALSFKMTT 40 (530)
Q Consensus 20 ~~va~~~HE~GH~-GL~Hs~~~ 40 (530)
++++ +|||||+ ||-|-..+
T Consensus 88 ~Gti--~HEl~HaLGf~HEhsR 107 (200)
T cd04281 88 FGIV--VHELGHVIGFWHEHTR 107 (200)
T ss_pred CchH--HHHHHHHhcCcchhcc
Confidence 3555 9999999 99998754
No 54
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=80.29 E-value=0.7 Score=46.62 Aligned_cols=18 Identities=6% Similarity=-0.029 Sum_probs=14.9
Q ss_pred hhhhhhhccc-cccCCCCC
Q psy12290 23 NHSWATYGHW-ALSFKMTT 40 (530)
Q Consensus 23 a~~~HE~GH~-GL~Hs~~~ 40 (530)
.+++|||||+ ||-|-..+
T Consensus 122 Gti~HEl~HalGf~HEqsR 140 (230)
T cd04282 122 ATVEHEFLHALGFYHEQSR 140 (230)
T ss_pred chHHHHHHHHhCCcccccc
Confidence 3449999999 99998765
No 55
>PHA02713 hypothetical protein; Provisional
Probab=80.07 E-value=33 Score=38.81 Aligned_cols=54 Identities=19% Similarity=0.210 Sum_probs=28.7
Q ss_pred eeEEeCCEEEEEECCe-------EEEEcCCCccCCCceecccccccCCCCC-ccccceEEeeCCcEEEEEce
Q psy12290 241 AIAVIRTEVFVFKDRW-------MWRIGDKGVLPGYPVLIHQFWSELPHNL-THIDAVYQRIDNHIAFFIGR 304 (530)
Q Consensus 241 Avt~irGeiyfFKG~~-------~WR~~~~~~~~gyP~~Is~~WpGLP~~~-~~IDAAye~~~gktyFFKG~ 304 (530)
+++.++|.||++-|.. +.+++ ...+.|..++.-. ....++.-.-+|++|++-|.
T Consensus 346 ~~~~~~g~IYviGG~~~~~~~~sve~Yd----------p~~~~W~~~~~mp~~r~~~~~~~~~g~IYviGG~ 407 (557)
T PHA02713 346 SLAVIDDTIYAIGGQNGTNVERTIECYT----------MGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGR 407 (557)
T ss_pred eEEEECCEEEEECCcCCCCCCceEEEEE----------CCCCeEEECCCCCcccccccEEEECCEEEEEeCC
Confidence 4556799999998741 22222 1234565444311 11112222247999999773
No 56
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=78.91 E-value=0.78 Score=48.11 Aligned_cols=14 Identities=21% Similarity=0.057 Sum_probs=10.3
Q ss_pred hhhhhccc-cccCCC
Q psy12290 25 SWATYGHW-ALSFKM 38 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~ 38 (530)
++||+||+ ||.|..
T Consensus 197 f~HE~GH~~GL~H~p 211 (305)
T PF10462_consen 197 FSHELGHNFGLGHYP 211 (305)
T ss_dssp HHHHHHHTTT--SST
T ss_pred eehhhhhhcCCCCCC
Confidence 49999999 999954
No 57
>KOG3658|consensus
Probab=77.69 E-value=1.5 Score=50.07 Aligned_cols=63 Identities=14% Similarity=0.150 Sum_probs=40.0
Q ss_pred hhhhhccc-cccCCCC----------CCCeeeecccCCCCccccCCceeecCCCCCCChhHHHH--hhcccccccccccc
Q psy12290 25 SWATYGHW-ALSFKMT----------TIEAEFKWFDGGNSTICTNLSTEVLSPRNQRKKEDRIG--ESQPFRERTLSICD 91 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~~----------~~~iM~p~y~~~~~~~~~~~~~~~l~~~~~~~~dD~~g--i~~ly~~~~~~~~G 91 (530)
+||||||. |-.|-.+ -+=|||++-..+.. .|+.+|..= -..+-.++.. .+.-+.++.-++||
T Consensus 396 ~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~---~NN~kFSpC--S~ksI~~VL~~k~~~CFqE~~~sfCG 470 (764)
T KOG3658|consen 396 LAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDK---PNNKKFSPC--SLKSISKVLESKKRNCFQERESSFCG 470 (764)
T ss_pred ehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcC---ccCcccCcc--hHHHHHHHHHHhhhhhccCccccccC
Confidence 69999999 9999776 23489998775543 245555431 1111223222 25567788889999
Q ss_pred c
Q psy12290 92 R 92 (530)
Q Consensus 92 ~ 92 (530)
.
T Consensus 471 N 471 (764)
T KOG3658|consen 471 N 471 (764)
T ss_pred C
Confidence 4
No 58
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=74.30 E-value=1.3 Score=46.61 Aligned_cols=17 Identities=18% Similarity=0.036 Sum_probs=14.4
Q ss_pred hhhhhhhccc-cccCCCC
Q psy12290 23 NHSWATYGHW-ALSFKMT 39 (530)
Q Consensus 23 a~~~HE~GH~-GL~Hs~~ 39 (530)
+++.||+||- ||.||..
T Consensus 152 ~~~~HElgHN~GL~Ha~~ 169 (314)
T PF05548_consen 152 ATIMHELGHNLGLWHAGR 169 (314)
T ss_pred HHHHHHhhhhccccccCC
Confidence 3459999999 9999973
No 59
>KOG4441|consensus
Probab=70.05 E-value=2e+02 Score=32.78 Aligned_cols=141 Identities=18% Similarity=0.119 Sum_probs=71.8
Q ss_pred ceeeEEeCCEEEEEECCe--------EEEEcCCCccCCCceecccccccCCCCC-ccccceEEeeCCcEEEEEceeEEEe
Q psy12290 239 YDAIAVIRTEVFVFKDRW--------MWRIGDKGVLPGYPVLIHQFWSELPHNL-THIDAVYQRIDNHIAFFIGRQYFLF 309 (530)
Q Consensus 239 fDAvt~irGeiyfFKG~~--------~WR~~~~~~~~gyP~~Is~~WpGLP~~~-~~IDAAye~~~gktyFFKG~~yW~y 309 (530)
-.+++.++|.+|+.-|.. +|+++... +-|..+++-. .+.+++....+|.+|..-|.--+.+
T Consensus 325 ~~~~~~~~~~lYv~GG~~~~~~~l~~ve~YD~~~----------~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg~~~ 394 (571)
T KOG4441|consen 325 RVGVAVLNGKLYVVGGYDSGSDRLSSVERYDPRT----------NQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDGEKS 394 (571)
T ss_pred cccEEEECCEEEEEccccCCCcccceEEEecCCC----------CceeccCCccCccccceeEEECCEEEEEeccccccc
Confidence 456667889999887754 45554322 1276665421 2455555556899998877553322
Q ss_pred cceecCCCC-CcccccCCCCC--cccccccEEEeccCCeEEEeeC--------CeEEEEeCCCCcccCCCCcccccCCCC
Q psy12290 310 EGIRLLPGY-PRPLTALGLPA--SLERIDAAMVWGYNSKTYLFSG--------TRYWKLDDETGRSEPDYPRNMTENWRG 378 (530)
Q Consensus 310 ~~~~l~~GY-PK~Is~~GlP~--~~~~IDAAf~~~~ngktYFFkG--------~~YWryde~~~~vd~gYPk~Is~~w~G 378 (530)
-. .++ -| |+ ...|..=. ....--++.. ..+|++|.+.| +.+-+||..+++....=|......+.|
T Consensus 395 l~-svE-~YDp~-~~~W~~va~m~~~r~~~gv~-~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g 470 (571)
T KOG4441|consen 395 LN-SVE-CYDPV-TNKWTPVAPMLTRRSGHGVA-VLGGKLYIIGGGDGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFG 470 (571)
T ss_pred cc-cEE-EecCC-CCcccccCCCCcceeeeEEE-EECCEEEEEcCcCCCccccceEEEEcCCCCceeecCCcccccccce
Confidence 11 011 01 11 12242211 0011112221 24799999988 456678877654332222222222111
Q ss_pred CCCCCCEEEecCCCcEEEEECCe
Q psy12290 379 VGVDIDDAFQWKDGATYFFKGKG 401 (530)
Q Consensus 379 vP~~IDAAf~~~~G~~YFFkG~~ 401 (530)
+|+. +|++|...|..
T Consensus 471 ------~a~~--~~~iYvvGG~~ 485 (571)
T KOG4441|consen 471 ------VAVL--NGKIYVVGGFD 485 (571)
T ss_pred ------EEEE--CCEEEEECCcc
Confidence 3443 68999998843
No 60
>KOG3714|consensus
Probab=68.84 E-value=1.8 Score=47.07 Aligned_cols=19 Identities=11% Similarity=0.072 Sum_probs=16.5
Q ss_pred hhhhhhhhhhccc-cccCCCCC
Q psy12290 20 YGMNHSWATYGHW-ALSFKMTT 40 (530)
Q Consensus 20 ~~va~~~HE~GH~-GL~Hs~~~ 40 (530)
++++ +|||+|+ |+-|-..+
T Consensus 160 ~G~i--~HEl~HaLGf~HehsR 179 (411)
T KOG3714|consen 160 FGTI--VHELMHALGFWHEHSR 179 (411)
T ss_pred Cchh--HHHHHHHhhhhhccCc
Confidence 7888 9999999 99998744
No 61
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=66.31 E-value=2.9 Score=45.88 Aligned_cols=24 Identities=13% Similarity=0.145 Sum_probs=19.7
Q ss_pred hhhhhccc-cccCCCCCCCeeeecccC
Q psy12290 25 SWATYGHW-ALSFKMTTIEAEFKWFDG 50 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~~~~~iM~p~y~~ 50 (530)
++||+||+ ||.|..+ -||--=|..
T Consensus 320 ~lHEiGH~fg~pH~~~--GIM~Rdy~~ 344 (423)
T PF12044_consen 320 FLHEIGHLFGCPHQED--GIMLRDYDR 344 (423)
T ss_pred HHHHHHHhcCCCCCCC--CeeeCCchh
Confidence 69999999 9999977 688765554
No 62
>PHA02790 Kelch-like protein; Provisional
Probab=64.32 E-value=2.2e+02 Score=31.51 Aligned_cols=133 Identities=11% Similarity=0.033 Sum_probs=61.8
Q ss_pred EEeCCEEEEEECC-------eEEEEcCCCccCCCceecccccccCCCCC-ccccceEEeeCCcEEEEEce----eEEEec
Q psy12290 243 AVIRTEVFVFKDR-------WMWRIGDKGVLPGYPVLIHQFWSELPHNL-THIDAVYQRIDNHIAFFIGR----QYFLFE 310 (530)
Q Consensus 243 t~irGeiyfFKG~-------~~WR~~~~~~~~gyP~~Is~~WpGLP~~~-~~IDAAye~~~gktyFFKG~----~yW~y~ 310 (530)
+.+++.+|++-|. .+|+++.. ..-|-.++.-- ....++.-..++++|+.-|. .+++|+
T Consensus 268 ~~~~~~lyviGG~~~~~~~~~v~~Ydp~----------~~~W~~~~~m~~~r~~~~~v~~~~~iYviGG~~~~~sve~yd 337 (480)
T PHA02790 268 THVGEVVYLIGGWMNNEIHNNAIAVNYI----------SNNWIPIPPMNSPRLYASGVPANNKLYVVGGLPNPTSVERWF 337 (480)
T ss_pred EEECCEEEEEcCCCCCCcCCeEEEEECC----------CCEEEECCCCCchhhcceEEEECCEEEEECCcCCCCceEEEE
Confidence 3467888888773 35555432 12344443210 11222222247899998773 345554
Q ss_pred ceecCCCCCcccccCCCCCcccccccEEEeccCCeEEEeeCC-----eEEEEeCCCCcccCCCCcccccCCCCCCC----
Q psy12290 311 GIRLLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSGT-----RYWKLDDETGRSEPDYPRNMTENWRGVGV---- 381 (530)
Q Consensus 311 ~~~l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG~-----~YWryde~~~~vd~gYPk~Is~~w~GvP~---- 381 (530)
... ..+ ..+. .+|...... |+.. .+|++|++.|. ..-+||..++ .|.-++.
T Consensus 338 p~~--n~W-~~~~--~l~~~r~~~-~~~~--~~g~IYviGG~~~~~~~ve~ydp~~~------------~W~~~~~m~~~ 397 (480)
T PHA02790 338 HGD--AAW-VNMP--SLLKPRCNP-AVAS--INNVIYVIGGHSETDTTTEYLLPNHD------------QWQFGPSTYYP 397 (480)
T ss_pred CCC--CeE-EECC--CCCCCCccc-EEEE--ECCEEEEecCcCCCCccEEEEeCCCC------------EEEeCCCCCCc
Confidence 210 000 0011 222211111 2222 47899999873 2334555443 3433320
Q ss_pred CCC-EEEecCCCcEEEEECCeEEEEec
Q psy12290 382 DID-DAFQWKDGATYFFKGKGFWKFKD 407 (530)
Q Consensus 382 ~ID-AAf~~~~G~~YFFkG~~Ywrfd~ 407 (530)
.-. +|.. -+|++|.+-|. .-.||.
T Consensus 398 r~~~~~~~-~~~~IYv~GG~-~e~ydp 422 (480)
T PHA02790 398 HYKSCALV-FGRRLFLVGRN-AEFYCE 422 (480)
T ss_pred cccceEEE-ECCEEEEECCc-eEEecC
Confidence 111 2333 36899999763 334554
No 63
>PHA02790 Kelch-like protein; Provisional
Probab=62.89 E-value=2.5e+02 Score=31.09 Aligned_cols=49 Identities=6% Similarity=-0.046 Sum_probs=26.3
Q ss_pred cCCeEEEeeCCeEEEEeCCCCcccCCCCcccccCCCCCCC-CCCEEEecCCCcEEEEECC
Q psy12290 342 YNSKTYLFSGTRYWKLDDETGRSEPDYPRNMTENWRGVGV-DIDDAFQWKDGATYFFKGK 400 (530)
Q Consensus 342 ~ngktYFFkG~~YWryde~~~~vd~gYPk~Is~~w~GvP~-~IDAAf~~~~G~~YFFkG~ 400 (530)
.+|++|++.|. .-.||..+++-... ..++. ...+....-+|++|.+-|.
T Consensus 406 ~~~~IYv~GG~-~e~ydp~~~~W~~~---------~~m~~~r~~~~~~v~~~~IYviGG~ 455 (480)
T PHA02790 406 FGRRLFLVGRN-AEFYCESSNTWTLI---------DDPIYPRDNPELIIVDNKLLLIGGF 455 (480)
T ss_pred ECCEEEEECCc-eEEecCCCCcEeEc---------CCCCCCccccEEEEECCEEEEECCc
Confidence 36899999764 45677665432111 11111 1222222236899999873
No 64
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=51.49 E-value=5.7 Score=44.86 Aligned_cols=15 Identities=20% Similarity=0.122 Sum_probs=13.3
Q ss_pred chhhhhhhhhhhccc
Q psy12290 18 SLYGMNHSWATYGHW 32 (530)
Q Consensus 18 ~l~~va~~~HE~GH~ 32 (530)
++..|.|++||+||+
T Consensus 375 ~~~dv~TLaHElGHa 389 (591)
T TIGR00181 375 TLNSVFTLAHELGHS 389 (591)
T ss_pred CcchHHHHHHHhhhH
Confidence 467889999999999
No 65
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=50.53 E-value=5.8 Score=34.93 Aligned_cols=11 Identities=18% Similarity=0.151 Sum_probs=8.5
Q ss_pred hhhhhccc-ccc
Q psy12290 25 SWATYGHW-ALS 35 (530)
Q Consensus 25 ~~HE~GH~-GL~ 35 (530)
++||+||. |++
T Consensus 77 lvhEiah~fG~~ 88 (97)
T PF06262_consen 77 LVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHTT--
T ss_pred HHHHHHHHcCCC
Confidence 69999999 985
No 66
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=50.26 E-value=2.2e+02 Score=27.71 Aligned_cols=147 Identities=14% Similarity=0.153 Sum_probs=73.1
Q ss_pred CCEEEEEECCeEEEEcCCCccCCCceecccccccCCCCCccc-cceEEeeCCcEEEEEc----------eeEEEecceec
Q psy12290 246 RTEVFVFKDRWMWRIGDKGVLPGYPVLIHQFWSELPHNLTHI-DAVYQRIDNHIAFFIG----------RQYFLFEGIRL 314 (530)
Q Consensus 246 rGeiyfFKG~~~WR~~~~~~~~gyP~~Is~~WpGLP~~~~~I-DAAye~~~gktyFFKG----------~~yW~y~~~~l 314 (530)
+|.+|+-...-...++.. .+.-..+.+...+- ..+... |.++ .++|++||=.- .++|+++...
T Consensus 51 ~g~l~v~~~~~~~~~d~~---~g~~~~~~~~~~~~-~~~~~~ND~~v-d~~G~ly~t~~~~~~~~~~~~g~v~~~~~~~- 124 (246)
T PF08450_consen 51 DGRLYVADSGGIAVVDPD---TGKVTVLADLPDGG-VPFNRPNDVAV-DPDGNLYVTDSGGGGASGIDPGSVYRIDPDG- 124 (246)
T ss_dssp TSEEEEEETTCEEEEETT---TTEEEEEEEEETTC-SCTEEEEEEEE--TTS-EEEEEECCBCTTCGGSEEEEEEETTS-
T ss_pred CCEEEEEEcCceEEEecC---CCcEEEEeeccCCC-cccCCCceEEE-cCCCCEEEEecCCCccccccccceEEECCCC-
Confidence 577777776666666422 12122232221111 011123 4444 35788777421 4577776541
Q ss_pred CCCCCccc-ccCCCCCcccccccEEEeccCCeEEEe---eCCeEEEEeCCCCcccCCCCcccccCCCCCCCCCCEEEecC
Q psy12290 315 LPGYPRPL-TALGLPASLERIDAAMVWGYNSKTYLF---SGTRYWKLDDETGRSEPDYPRNMTENWRGVGVDIDDAFQWK 390 (530)
Q Consensus 315 ~~GYPK~I-s~~GlP~~~~~IDAAf~~~~ngktYFF---kG~~YWryde~~~~vd~gYPk~Is~~w~GvP~~IDAAf~~~ 390 (530)
-.+.+ ..+..| .+ +.+..+++++++ ...+.|+|+..........++.+. .+.+.....|......
T Consensus 125 ---~~~~~~~~~~~p------NG-i~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~g~pDG~~vD~ 193 (246)
T PF08450_consen 125 ---KVTVVADGLGFP------NG-IAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFI-DFPGGPGYPDGLAVDS 193 (246)
T ss_dssp ---EEEEEEEEESSE------EE-EEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEE-E-SSSSCEEEEEEEBT
T ss_pred ---eEEEEecCcccc------cc-eEECCcchheeecccccceeEEEeccccccceeeeeeEE-EcCCCCcCCCcceEcC
Confidence 01112 223333 22 233456665443 367899999754322111223232 2333334588888878
Q ss_pred CCcEEEE--ECCeEEEEeccc
Q psy12290 391 DGATYFF--KGKGFWKFKDVS 409 (530)
Q Consensus 391 ~G~~YFF--kG~~Ywrfd~~~ 409 (530)
+|.+|.- .+.+.++|+...
T Consensus 194 ~G~l~va~~~~~~I~~~~p~G 214 (246)
T PF08450_consen 194 DGNLWVADWGGGRIVVFDPDG 214 (246)
T ss_dssp TS-EEEEEETTTEEEEEETTS
T ss_pred CCCEEEEEcCCCEEEEECCCc
Confidence 9999875 789999999873
No 67
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=49.10 E-value=5.9 Score=40.96 Aligned_cols=19 Identities=21% Similarity=0.032 Sum_probs=14.7
Q ss_pred hhhhhhhhccc-cccCCCCC
Q psy12290 22 MNHSWATYGHW-ALSFKMTT 40 (530)
Q Consensus 22 va~~~HE~GH~-GL~Hs~~~ 40 (530)
|.+++||+||. ||-+-.+.
T Consensus 166 igv~~HE~gH~lGLPDlY~~ 185 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYDT 185 (286)
T ss_pred eeeeehhhhcccCCCccccC
Confidence 44459999999 99886653
No 68
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=48.17 E-value=9.8 Score=42.01 Aligned_cols=16 Identities=13% Similarity=0.208 Sum_probs=13.8
Q ss_pred cchhhhhhhhhhhccc
Q psy12290 17 QSLYGMNHSWATYGHW 32 (530)
Q Consensus 17 ~~l~~va~~~HE~GH~ 32 (530)
.++-.|.||+||+||+
T Consensus 259 l~~~~V~TLfHEfGHa 274 (472)
T cd06455 259 LRHDEVETFFHEFGHV 274 (472)
T ss_pred CCHHHHHHHHHHHHHH
Confidence 4477899999999999
No 69
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=46.85 E-value=7.5 Score=44.02 Aligned_cols=15 Identities=20% Similarity=0.067 Sum_probs=13.2
Q ss_pred chhhhhhhhhhhccc
Q psy12290 18 SLYGMNHSWATYGHW 32 (530)
Q Consensus 18 ~l~~va~~~HE~GH~ 32 (530)
++..|.||+||+||+
T Consensus 372 ~~~~v~TL~HE~GHa 386 (587)
T TIGR02290 372 SRRDVSTLAHELGHA 386 (587)
T ss_pred CchhHHHHHHHhhHH
Confidence 467788999999999
No 70
>KOG3607|consensus
Probab=45.92 E-value=8.5 Score=44.97 Aligned_cols=17 Identities=12% Similarity=-0.048 Sum_probs=14.9
Q ss_pred hhhhhhhhccc-cccCCC
Q psy12290 22 MNHSWATYGHW-ALSFKM 38 (530)
Q Consensus 22 va~~~HE~GH~-GL~Hs~ 38 (530)
+++||||+||. |+.|-.
T Consensus 324 a~v~AhelgH~lGm~hD~ 341 (716)
T KOG3607|consen 324 AVVLAHELGHNLGMIHDE 341 (716)
T ss_pred HHHHHHHHHhhcCccccc
Confidence 44589999999 999987
No 71
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=45.47 E-value=7.2 Score=40.18 Aligned_cols=18 Identities=17% Similarity=0.211 Sum_probs=9.4
Q ss_pred hhhhhhhhhhhccc--cccCCC
Q psy12290 19 LYGMNHSWATYGHW--ALSFKM 38 (530)
Q Consensus 19 l~~va~~~HE~GH~--GL~Hs~ 38 (530)
.-.|+ +||+||+ ||+=-.
T Consensus 216 ~~~v~--vHE~GHsf~~LaDEY 235 (264)
T PF09471_consen 216 FKQVV--VHEFGHSFGGLADEY 235 (264)
T ss_dssp HHHHH--HHHHHHHTT------
T ss_pred cccee--eeecccccccccccc
Confidence 34566 9999998 786433
No 72
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=45.32 E-value=8.3 Score=41.16 Aligned_cols=15 Identities=20% Similarity=0.078 Sum_probs=12.6
Q ss_pred chhhhhhhhhhhccc
Q psy12290 18 SLYGMNHSWATYGHW 32 (530)
Q Consensus 18 ~l~~va~~~HE~GH~ 32 (530)
++..|.||+||+||+
T Consensus 219 ~~~~v~tl~HE~GHa 233 (427)
T cd06459 219 TLDDVFTLAHELGHA 233 (427)
T ss_pred ChhhHHHHHHHhhHH
Confidence 466788899999997
No 73
>KOG4693|consensus
Probab=44.04 E-value=4.2e+02 Score=28.09 Aligned_cols=77 Identities=21% Similarity=0.425 Sum_probs=41.6
Q ss_pred CCCCcccccccEEEeccCCeEEEeeCC--eEEEEeCCCCcccCCCCcc------cccCCCCCC--------CCCCEEEec
Q psy12290 326 GLPASLERIDAAMVWGYNSKTYLFSGT--RYWKLDDETGRSEPDYPRN------MTENWRGVG--------VDIDDAFQW 389 (530)
Q Consensus 326 GlP~~~~~IDAAf~~~~ngktYFFkG~--~YWryde~~~~vd~gYPk~------Is~~w~GvP--------~~IDAAf~~ 389 (530)
|.|..-...-+|... ++..|+|.|. +.=-|.... +.|=.. -.+.|...| ..-.++|.+
T Consensus 176 g~PprwRDFH~a~~~--~~~MYiFGGR~D~~gpfHs~~----e~Yc~~i~~ld~~T~aW~r~p~~~~~P~GRRSHS~fvY 249 (392)
T KOG4693|consen 176 GDPPRWRDFHTASVI--DGMMYIFGGRSDESGPFHSIH----EQYCDTIMALDLATGAWTRTPENTMKPGGRRSHSTFVY 249 (392)
T ss_pred CCCchhhhhhhhhhc--cceEEEeccccccCCCccchh----hhhcceeEEEeccccccccCCCCCcCCCcccccceEEE
Confidence 555544445566654 5899999763 222221111 111111 113455443 245678886
Q ss_pred CCCcEEEEEC---------CeEEEEeccc
Q psy12290 390 KDGATYFFKG---------KGFWKFKDVS 409 (530)
Q Consensus 390 ~~G~~YFFkG---------~~Ywrfd~~~ 409 (530)
||++|.|-| +..|.||..+
T Consensus 250 -ng~~Y~FGGYng~ln~HfndLy~FdP~t 277 (392)
T KOG4693|consen 250 -NGKMYMFGGYNGTLNVHFNDLYCFDPKT 277 (392)
T ss_pred -cceEEEecccchhhhhhhcceeeccccc
Confidence 799999976 3457777654
No 74
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=43.88 E-value=8.2 Score=32.79 Aligned_cols=14 Identities=21% Similarity=0.064 Sum_probs=10.2
Q ss_pred hhhhhccccccCCC
Q psy12290 25 SWATYGHWALSFKM 38 (530)
Q Consensus 25 ~~HE~GH~GL~Hs~ 38 (530)
++|||||.=|+|..
T Consensus 46 laHELgH~~~~~~~ 59 (122)
T PF06114_consen 46 LAHELGHILLHHGD 59 (122)
T ss_dssp HHHHHHHHHHHH-H
T ss_pred HHHHHHHHHhhhcc
Confidence 48999999665554
No 75
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=42.79 E-value=9.4 Score=42.95 Aligned_cols=15 Identities=13% Similarity=-0.031 Sum_probs=12.8
Q ss_pred chhhhhhhhhhhccc
Q psy12290 18 SLYGMNHSWATYGHW 32 (530)
Q Consensus 18 ~l~~va~~~HE~GH~ 32 (530)
.+..|.||+||+||+
T Consensus 334 t~~dv~TL~HElGHa 348 (549)
T TIGR02289 334 TSGDIDVLTHEAGHA 348 (549)
T ss_pred ChhHHHHHHHHhhHH
Confidence 466788899999999
No 76
>KOG2719|consensus
Probab=42.48 E-value=3.2 Score=45.40 Aligned_cols=27 Identities=22% Similarity=0.303 Sum_probs=22.6
Q ss_pred chhhhhhhhhhhccccccCCCCCCCeeee
Q psy12290 18 SLYGMNHSWATYGHWALSFKMTTIEAEFK 46 (530)
Q Consensus 18 ~l~~va~~~HE~GH~GL~Hs~~~~~iM~p 46 (530)
.+.+|. +||+|||-+.|-....++|+=
T Consensus 279 el~AVl--~HELGHW~~~H~~K~~ii~~~ 305 (428)
T KOG2719|consen 279 ELVAVL--AHELGHWKLNHVLKNIIIMQI 305 (428)
T ss_pred HHHHHH--HHHhhHHHHhhHHHHHHHHHH
Confidence 467777 999999999999888877763
No 77
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=42.41 E-value=10 Score=41.28 Aligned_cols=17 Identities=18% Similarity=0.309 Sum_probs=13.8
Q ss_pred ccchhhhhhhhhhhccc
Q psy12290 16 IQSLYGMNHSWATYGHW 32 (530)
Q Consensus 16 ~~~l~~va~~~HE~GH~ 32 (530)
..++..|-|++||+||+
T Consensus 237 ll~~~~v~tLfHE~GHa 253 (458)
T PF01432_consen 237 LLSHDDVETLFHEFGHA 253 (458)
T ss_dssp B-SHHHHHHHHHHHHHH
T ss_pred ccChhhHHHHHHHHhHH
Confidence 34667889999999999
No 78
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=39.64 E-value=14 Score=37.23 Aligned_cols=24 Identities=13% Similarity=0.059 Sum_probs=16.3
Q ss_pred ccccchhhhhhhhhhhccccccCCC
Q psy12290 14 MAIQSLYGMNHSWATYGHWALSFKM 38 (530)
Q Consensus 14 ~~~~~l~~va~~~HE~GH~GL~Hs~ 38 (530)
..+.++-+||.-+||.||+ +.|..
T Consensus 82 y~~~SiaAvaVAAHEvGHA-iQ~a~ 105 (222)
T PF04298_consen 82 YNGRSIAAVAVAAHEVGHA-IQHAE 105 (222)
T ss_pred CCCCCHHHHHHHHHHHhHH-Hhccc
Confidence 4567776544449999998 55543
No 79
>KOG4441|consensus
Probab=38.31 E-value=3.8e+02 Score=30.64 Aligned_cols=137 Identities=17% Similarity=0.124 Sum_probs=70.7
Q ss_pred CCeEEEeeCC--------eEEEEeCCCCcccCCCCcccccCCCCCC----CCCCEEEecCCCcEEEEECCeEEEEecccc
Q psy12290 343 NSKTYLFSGT--------RYWKLDDETGRSEPDYPRNMTENWRGVG----VDIDDAFQWKDGATYFFKGKGFWKFKDVSM 410 (530)
Q Consensus 343 ngktYFFkG~--------~YWryde~~~~vd~gYPk~Is~~w~GvP----~~IDAAf~~~~G~~YFFkG~~Ywrfd~~~~ 410 (530)
+|++|...|. ..|+||...+ .|..++ ...++++..-+|.+|..-|..-+..- .
T Consensus 332 ~~~lYv~GG~~~~~~~l~~ve~YD~~~~------------~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg~~~l---~ 396 (571)
T KOG4441|consen 332 NGKLYVVGGYDSGSDRLSSVERYDPRTN------------QWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDGEKSL---N 396 (571)
T ss_pred CCEEEEEccccCCCcccceEEEecCCCC------------ceeccCCccCccccceeEEECCEEEEEecccccccc---c
Confidence 5688887653 3688887654 366554 35666666568999999886622221 2
Q ss_pred eeeccCCCcccccccCCCCCCCCCcc-ccccccCceEEEeecCCCceecccccceEEEEecCCCCCCCccccccceeeCc
Q psy12290 411 RVEKEKPSPSAQFWMKCPEISSPEDR-ISERRVGRAFSSRSTSGGLRQGASFKAQFYHVRCSGKFGNGPVGRKSSISENP 489 (530)
Q Consensus 411 ~V~~~yPr~ia~~W~gCp~~~~~~~~-i~~~~~g~~y~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (530)
.|+..-|+ .+.|--+..-...+.. .----.|..|-+-|.++..+ .++ .-++| +...+-|...+.=.+-+
T Consensus 397 svE~YDp~--~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~-~l~-sve~Y---DP~t~~W~~~~~M~~~R--- 466 (571)
T KOG4441|consen 397 SVECYDPV--TNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSN-CLN-SVECY---DPETNTWTLIAPMNTRR--- 466 (571)
T ss_pred cEEEecCC--CCcccccCCCCcceeeeEEEEECCEEEEEcCcCCCcc-ccc-eEEEE---cCCCCceeecCCccccc---
Confidence 35444454 5678887643322111 11122355666666655333 332 22333 22222222222222222
Q ss_pred cccceeeecceeEEE
Q psy12290 490 AIKALKKWNIAWYCT 504 (530)
Q Consensus 490 ~~~a~~~~~~~~~~~ 504 (530)
..-++-..|.++|.-
T Consensus 467 ~~~g~a~~~~~iYvv 481 (571)
T KOG4441|consen 467 SGFGVAVLNGKIYVV 481 (571)
T ss_pred ccceEEEECCEEEEE
Confidence 122366788888863
No 80
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=38.25 E-value=12 Score=37.56 Aligned_cols=9 Identities=22% Similarity=0.080 Sum_probs=7.6
Q ss_pred hhhhhhccc
Q psy12290 24 HSWATYGHW 32 (530)
Q Consensus 24 ~~~HE~GH~ 32 (530)
|+|||+||+
T Consensus 75 tlAHELGH~ 83 (213)
T COG2856 75 TLAHELGHA 83 (213)
T ss_pred HHHHHHhHH
Confidence 357999999
No 81
>cd08988 GH43_ABN Glycosyl hydrolase family 43. This glycosyl hydrolase family 43 (GH43) includes mostly enzymes with alpha-L-arabinofuranosidase (AFN; EC 3.2.1.55) and endo-alpha-L-arabinanase (ABN; EC 3.2.1.99) activities. These are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid and another aspartate that is responsible for pKa modulation and orienting the catalytic acid. The GH43 ABN enzymes hydrolyze alpha-1,5-L-arabinofuranoside linkages while the ABF enzymes cleave arabinose side chains so that the combined actions of these two enzymes reduce arabinan to L-arabinose and/or arabinooligosaccharides. These arabinan-degrading enzymes are important in the food industry for efficient production of L-arabinose from agricultural waste; L-arabinose is often used as a bioactive sweetener. A common structural feature of GH43 enzymes is a 5-bla
Probab=36.96 E-value=92 Score=31.62 Aligned_cols=65 Identities=23% Similarity=0.445 Sum_probs=34.7
Q ss_pred cccccEEEeccCCeEEEeeCC-----eEEEEeCCCCcc-cCCCCcccccCCCCCCCCCCEEEecC-CCcEEEE
Q psy12290 332 ERIDAAMVWGYNSKTYLFSGT-----RYWKLDDETGRS-EPDYPRNMTENWRGVGVDIDDAFQWK-DGATYFF 397 (530)
Q Consensus 332 ~~IDAAf~~~~ngktYFFkG~-----~YWryde~~~~v-d~gYPk~Is~~w~GvP~~IDAAf~~~-~G~~YFF 397 (530)
..||+++....+|+.|++-|+ +.-+.+.....+ ..+..+.|...-. -..-++|....+ +|..|+|
T Consensus 118 ~~iDp~~f~DdDG~~Yl~~g~~~~gi~~~eL~~d~~~~~~~~~~~~i~~~~~-~~~~~Egp~i~k~~g~YYl~ 189 (279)
T cd08988 118 NAIDPAIIFDQHGQPWLSFGSFWGGIKLFELDKDTMKPAEPGELHSIAGRER-SSAAIEAPFILYRGDYYYLF 189 (279)
T ss_pred CccCCceEEcCCCCEEEEecccCCCEEEEEECcccCCccCCCcceEEeccCC-CCCceEeeEEEEcCCeEEEE
Confidence 469986655578999997554 223566554443 2344444432100 012356654443 6777766
No 82
>KOG4693|consensus
Probab=36.52 E-value=2.1e+02 Score=30.17 Aligned_cols=98 Identities=18% Similarity=0.072 Sum_probs=48.1
Q ss_pred CCEEEecCCCcEEEEEC--CeEEEEecccc----eeeccCCCcccccccCCCCC-CCCCccccccccCceEEEeecCCCc
Q psy12290 383 IDDAFQWKDGATYFFKG--KGFWKFKDVSM----RVEKEKPSPSAQFWMKCPEI-SSPEDRISERRVGRAFSSRSTSGGL 455 (530)
Q Consensus 383 IDAAf~~~~G~~YFFkG--~~Ywrfd~~~~----~V~~~yPr~ia~~W~gCp~~-~~~~~~i~~~~~g~~y~~~~t~~~~ 455 (530)
..+|... +|.+|+|-| ++.=-|..... ++. +=......|..||.+ .-+.|+-+ -+.... + +
T Consensus 184 FH~a~~~-~~~MYiFGGR~D~~gpfHs~~e~Yc~~i~--~ld~~T~aW~r~p~~~~~P~GRRS-------HS~fvY-n-g 251 (392)
T KOG4693|consen 184 FHTASVI-DGMMYIFGGRSDESGPFHSIHEQYCDTIM--ALDLATGAWTRTPENTMKPGGRRS-------HSTFVY-N-G 251 (392)
T ss_pred hhhhhhc-cceEEEeccccccCCCccchhhhhcceeE--EEeccccccccCCCCCcCCCcccc-------cceEEE-c-c
Confidence 3455443 589999965 33333322211 111 001124689999765 34554311 111122 1 2
Q ss_pred eecccccceEEEEecCCCCCCCcccccccee-eCccccc
Q psy12290 456 RQGASFKAQFYHVRCSGKFGNGPVGRKSSIS-ENPAIKA 493 (530)
Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~a 493 (530)
.++. |++.-=.++.--+|...-.|.++.|+ ..+.+|+
T Consensus 252 ~~Y~-FGGYng~ln~HfndLy~FdP~t~~W~~I~~~Gk~ 289 (392)
T KOG4693|consen 252 KMYM-FGGYNGTLNVHFNDLYCFDPKTSMWSVISVRGKY 289 (392)
T ss_pred eEEE-ecccchhhhhhhcceeecccccchheeeeccCCC
Confidence 2233 33311123333367777789999999 6666665
No 83
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=36.46 E-value=14 Score=42.42 Aligned_cols=15 Identities=20% Similarity=0.107 Sum_probs=12.8
Q ss_pred chhhhhhhhhhhccc
Q psy12290 18 SLYGMNHSWATYGHW 32 (530)
Q Consensus 18 ~l~~va~~~HE~GH~ 32 (530)
.+..|.||+||+||+
T Consensus 377 t~~dV~TLaHElGHs 391 (598)
T COG1164 377 TLRDVFTLAHELGHS 391 (598)
T ss_pred chhHHHHHHHHccHH
Confidence 466788899999999
No 84
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=35.08 E-value=21 Score=39.04 Aligned_cols=19 Identities=16% Similarity=0.298 Sum_probs=15.5
Q ss_pred ccchhhhhhhhhhhccc--cc
Q psy12290 16 IQSLYGMNHSWATYGHW--AL 34 (530)
Q Consensus 16 ~~~l~~va~~~HE~GH~--GL 34 (530)
..++..|.|+.|||||+ +|
T Consensus 203 lL~~~~v~tLfHEfGHalH~~ 223 (422)
T cd06456 203 LLTHDEVTTLFHEFGHALHHL 223 (422)
T ss_pred ccCHHHHHHHHHHHHHHHHHH
Confidence 34777888999999999 55
No 85
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=35.06 E-value=21 Score=37.53 Aligned_cols=16 Identities=19% Similarity=0.312 Sum_probs=12.8
Q ss_pred cchhhhhhhhhhhccc
Q psy12290 17 QSLYGMNHSWATYGHW 32 (530)
Q Consensus 17 ~~l~~va~~~HE~GH~ 32 (530)
...-.|.|++||+||+
T Consensus 150 l~~~~v~tl~HE~GHa 165 (365)
T cd06258 150 LGHDDINTLFHEFGHA 165 (365)
T ss_pred CCHHHHHHHHHHHhHH
Confidence 3555677889999999
No 86
>KOG4499|consensus
Probab=34.13 E-value=2.3e+02 Score=29.43 Aligned_cols=77 Identities=17% Similarity=0.315 Sum_probs=43.0
Q ss_pred ccEEEeccCC-eEEEeeCCeE----EEEeCCCCcccCCCCcccccCCCCCC---CCCCEEEecCCCcEEE--EECCeEEE
Q psy12290 335 DAAMVWGYNS-KTYLFSGTRY----WKLDDETGRSEPDYPRNMTENWRGVG---VDIDDAFQWKDGATYF--FKGKGFWK 404 (530)
Q Consensus 335 DAAf~~~~ng-ktYFFkG~~Y----Wryde~~~~vd~gYPk~Is~~w~GvP---~~IDAAf~~~~G~~YF--FkG~~Ywr 404 (530)
.-++.|..+. +.|++-...| |.||-.++.+.. |+.|-+.=..-| --.|..-.+.+|.+|. |.|.+.++
T Consensus 160 sNgl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~sn--r~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~~V~~ 237 (310)
T KOG4499|consen 160 SNGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSN--RKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGGTVQK 237 (310)
T ss_pred CccccccccCcEEEEEccCceEEeeeecCCCcccccC--cceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCcEEEE
Confidence 3455675444 4566554444 777777665433 444432211111 1123322234788775 89999999
Q ss_pred Eecccceee
Q psy12290 405 FKDVSMRVE 413 (530)
Q Consensus 405 fd~~~~~V~ 413 (530)
+|..+.++.
T Consensus 238 ~dp~tGK~L 246 (310)
T KOG4499|consen 238 VDPTTGKIL 246 (310)
T ss_pred ECCCCCcEE
Confidence 999886654
No 87
>PF13964 Kelch_6: Kelch motif
Probab=33.26 E-value=58 Score=24.03 Aligned_cols=21 Identities=33% Similarity=0.515 Sum_probs=16.1
Q ss_pred CCeEEEeeCC--------eEEEEeCCCCc
Q psy12290 343 NSKTYLFSGT--------RYWKLDDETGR 363 (530)
Q Consensus 343 ngktYFFkG~--------~YWryde~~~~ 363 (530)
++++|+|.|. .+|+||..+++
T Consensus 11 ~~~iyv~GG~~~~~~~~~~v~~yd~~t~~ 39 (50)
T PF13964_consen 11 GGKIYVFGGYDNSGKYSNDVERYDPETNT 39 (50)
T ss_pred CCEEEEECCCCCCCCccccEEEEcCCCCc
Confidence 6799999874 45888887754
No 88
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=32.39 E-value=22 Score=39.23 Aligned_cols=17 Identities=18% Similarity=0.192 Sum_probs=13.9
Q ss_pred ccchhhhhhhhhhhccc
Q psy12290 16 IQSLYGMNHSWATYGHW 32 (530)
Q Consensus 16 ~~~l~~va~~~HE~GH~ 32 (530)
-..+-.|.||.|||||+
T Consensus 243 lL~~~~v~TLfHEfGHa 259 (458)
T cd06457 243 LLSPHEVETLFHEMGHA 259 (458)
T ss_pred CcCHHHHHHHHHHHhHH
Confidence 34566788999999999
No 89
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=31.92 E-value=6.4e+02 Score=26.66 Aligned_cols=93 Identities=12% Similarity=0.256 Sum_probs=48.0
Q ss_pred eCCcEEEEEceeEEEecce---ecCCCCCcccccCCCCCc---ccccccEEEeccCCeEEEeeC----------------
Q psy12290 294 IDNHIAFFIGRQYFLFEGI---RLLPGYPRPLTALGLPAS---LERIDAAMVWGYNSKTYLFSG---------------- 351 (530)
Q Consensus 294 ~~gktyFFKG~~yW~y~~~---~l~~GYPK~Is~~GlP~~---~~~IDAAf~~~~ngktYFFkG---------------- 351 (530)
.++ +|+-...+.|+|... ....|-.+.|.+ +++.. ..+.-..+.+..+|++||=.|
T Consensus 81 ~~G-lyV~~~~~i~~~~d~~gdg~ad~~~~~l~~-~~~~~~~~~~~~~~~l~~gpDG~LYv~~G~~~~~~~~~~~~~~~~ 158 (367)
T TIGR02604 81 VGG-VYVATPPDILFLRDKDGDDKADGEREVLLS-GFGGQINNHHHSLNSLAWGPDGWLYFNHGNTLASKVTRPGTSDES 158 (367)
T ss_pred cCC-EEEeCCCeEEEEeCCCCCCCCCCccEEEEE-ccCCCCCcccccccCceECCCCCEEEecccCCCceeccCCCccCc
Confidence 355 887777788988531 122233334433 44431 011112344567889998555
Q ss_pred -----CeEEEEeCCCCcccCCCCcccccCCCCCCCCCCEEEecCCCcEEEE
Q psy12290 352 -----TRYWKLDDETGRSEPDYPRNMTENWRGVGVDIDDAFQWKDGATYFF 397 (530)
Q Consensus 352 -----~~YWryde~~~~vd~gYPk~Is~~w~GvP~~IDAAf~~~~G~~YFF 397 (530)
...|||+.....++ .++ .|+.....-||. .+|.+|+-
T Consensus 159 ~~~~~g~i~r~~pdg~~~e-----~~a---~G~rnp~Gl~~d-~~G~l~~t 200 (367)
T TIGR02604 159 RQGLGGGLFRYNPDGGKLR-----VVA---HGFQNPYGHSVD-SWGDVFFC 200 (367)
T ss_pred ccccCceEEEEecCCCeEE-----EEe---cCcCCCccceEC-CCCCEEEE
Confidence 24678777654432 122 144334445554 36777654
No 90
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=31.66 E-value=22 Score=35.70 Aligned_cols=26 Identities=12% Similarity=-0.024 Sum_probs=23.1
Q ss_pred hhhhccc-cccCCCCCCCeeeecccCC
Q psy12290 26 WATYGHW-ALSFKMTTIEAEFKWFDGG 51 (530)
Q Consensus 26 ~HE~GH~-GL~Hs~~~~~iM~p~y~~~ 51 (530)
-|++||. ||+|++.+..||++.+..+
T Consensus 158 ~efl~~~~~~~~lnhr~~i~~~pg~~~ 184 (236)
T COG5549 158 YEFLGHALGLGHLNHRGDIMYPPGELR 184 (236)
T ss_pred HHHhhhccCcccccccccccCCcccch
Confidence 6889999 9999999999999987653
No 91
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=31.30 E-value=4.8e+02 Score=26.62 Aligned_cols=72 Identities=17% Similarity=0.151 Sum_probs=39.3
Q ss_pred cceEEeeCCcEEEEE---ceeEEEecce-ec-CCCCCcccccCCCCCcccccccEEEeccCCeEEEeeC--CeEEEEeCC
Q psy12290 288 DAVYQRIDNHIAFFI---GRQYFLFEGI-RL-LPGYPRPLTALGLPASLERIDAAMVWGYNSKTYLFSG--TRYWKLDDE 360 (530)
Q Consensus 288 DAAye~~~gktyFFK---G~~yW~y~~~-~l-~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYFFkG--~~YWryde~ 360 (530)
|++ ....+++|++. ++.+..|... .+ ..+-+.... .||-...+-..++ .||.+|+=+. +..-|||..
T Consensus 24 D~~-~~~~~~iy~~~~~~~~~v~ey~~~~~f~~~~~~~~~~--~Lp~~~~GtG~vV---YngslYY~~~~s~~IvkydL~ 97 (250)
T PF02191_consen 24 DPS-PSDSEKIYVTSGFSGNTVYEYRNYEDFLRNGRSSRTY--KLPYPWQGTGHVV---YNGSLYYNKYNSRNIVKYDLT 97 (250)
T ss_pred CCC-CCCCCCEEEECccCCCEEEEEcCHhHHhhcCCCceEE--EEeceeccCCeEE---ECCcEEEEecCCceEEEEECc
Confidence 444 23356777754 4567777652 23 333222222 2332223334444 2688888554 778999998
Q ss_pred CCccc
Q psy12290 361 TGRSE 365 (530)
Q Consensus 361 ~~~vd 365 (530)
++.+.
T Consensus 98 t~~v~ 102 (250)
T PF02191_consen 98 TRSVV 102 (250)
T ss_pred CCcEE
Confidence 87654
No 92
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=30.41 E-value=14 Score=42.69 Aligned_cols=19 Identities=26% Similarity=0.230 Sum_probs=14.2
Q ss_pred hhhhhhhhhhccc-cccCCCCC
Q psy12290 20 YGMNHSWATYGHW-ALSFKMTT 40 (530)
Q Consensus 20 ~~va~~~HE~GH~-GL~Hs~~~ 40 (530)
.+|. +|||||. ||-=-.+.
T Consensus 222 iGVf--aHEfGH~LGLPDlYDT 241 (645)
T PF05547_consen 222 IGVF--AHEFGHDLGLPDLYDT 241 (645)
T ss_pred eEEE--EeeccccCCCCCcccC
Confidence 4455 9999999 99755554
No 93
>KOG4525|consensus
Probab=29.41 E-value=31 Score=38.09 Aligned_cols=26 Identities=15% Similarity=0.146 Sum_probs=20.8
Q ss_pred hhhhhccc-cccCCCCCCCeeeecccCCC
Q psy12290 25 SWATYGHW-ALSFKMTTIEAEFKWFDGGN 52 (530)
Q Consensus 25 ~~HE~GH~-GL~Hs~~~~~iM~p~y~~~~ 52 (530)
|.||+||. |+.|-++ -+|---|...+
T Consensus 305 ~~HElGH~lgcpHq~~--GvMlrgy~~lN 331 (614)
T KOG4525|consen 305 VCHELGHCLGCPHQSE--GVMLRGYDYLN 331 (614)
T ss_pred HHHHhhhccCCCCCCC--ceEecCchhhc
Confidence 79999999 9999764 58977776544
No 94
>KOG1924|consensus
Probab=28.66 E-value=4.4e+02 Score=31.72 Aligned_cols=6 Identities=17% Similarity=0.611 Sum_probs=2.6
Q ss_pred CeEEEE
Q psy12290 255 RWMWRI 260 (530)
Q Consensus 255 ~~~WR~ 260 (530)
+-||.-
T Consensus 642 ~cFWvk 647 (1102)
T KOG1924|consen 642 NCFWVK 647 (1102)
T ss_pred cceeee
Confidence 344543
No 95
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=28.38 E-value=21 Score=35.45 Aligned_cols=29 Identities=10% Similarity=-0.126 Sum_probs=17.0
Q ss_pred hhhhhhhhhhhccc-cccCCC---CCCCeeeeccc
Q psy12290 19 LYGMNHSWATYGHW-ALSFKM---TTIEAEFKWFD 49 (530)
Q Consensus 19 l~~va~~~HE~GH~-GL~Hs~---~~~~iM~p~y~ 49 (530)
=.+|| +||.||+ -=.|.. .-...|.|.-.
T Consensus 92 ~~aVA--AHEVGHAiQd~~~Y~~L~~R~~lvPv~~ 124 (226)
T COG2738 92 AIAVA--AHEVGHAIQDQEDYAFLVLRHALVPVAN 124 (226)
T ss_pred HHHHH--HHHhhHHHhhhcccHHHHHhhcccceec
Confidence 34677 9999999 444433 22334555443
No 96
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=28.12 E-value=18 Score=34.77 Aligned_cols=21 Identities=14% Similarity=0.115 Sum_probs=16.8
Q ss_pred chhhhhhhhhhhccccccCCCCC
Q psy12290 18 SLYGMNHSWATYGHWALSFKMTT 40 (530)
Q Consensus 18 ~l~~va~~~HE~GH~GL~Hs~~~ 40 (530)
.|..|. +||+||.-..|....
T Consensus 88 el~aVl--aHElgH~~~~h~~~~ 108 (226)
T PF01435_consen 88 ELAAVL--AHELGHIKHRHILKS 108 (226)
T ss_dssp HHHHHH--HHHHHHHHTTHCCCC
T ss_pred HHHHHH--HHHHHHHHcCCcchH
Confidence 466777 999999977887665
No 97
>KOG0379|consensus
Probab=27.10 E-value=9e+02 Score=26.89 Aligned_cols=53 Identities=17% Similarity=0.033 Sum_probs=27.2
Q ss_pred CCeEEEeeC--------CeEEEEeCCCCcccCCCCcccccCCCCCCCCCCEEEecCCCcEEEEECC
Q psy12290 343 NSKTYLFSG--------TRYWKLDDETGRSEPDYPRNMTENWRGVGVDIDDAFQWKDGATYFFKGK 400 (530)
Q Consensus 343 ngktYFFkG--------~~YWryde~~~~vd~gYPk~Is~~w~GvP~~IDAAf~~~~G~~YFFkG~ 400 (530)
..++|+|.| +.+|.||..+.+.. +.......-.|..-.++..+. +++++|-|.
T Consensus 173 g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~----~~~~~g~~P~pR~gH~~~~~~-~~~~v~gG~ 233 (482)
T KOG0379|consen 173 GTKLVVFGGIGGTGDSLNDLHIYDLETSTWS----ELDTQGEAPSPRYGHAMVVVG-NKLLVFGGG 233 (482)
T ss_pred CCEEEEECCccCcccceeeeeeeccccccce----ecccCCCCCCCCCCceEEEEC-CeEEEEecc
Confidence 478999976 35677887765421 111112222233444555544 455555443
No 98
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=26.97 E-value=23 Score=35.20 Aligned_cols=24 Identities=21% Similarity=0.210 Sum_probs=17.6
Q ss_pred ccccchhhhhhhhhhhccc-cccCCCCC
Q psy12290 14 MAIQSLYGMNHSWATYGHW-ALSFKMTT 40 (530)
Q Consensus 14 ~~~~~l~~va~~~HE~GH~-GL~Hs~~~ 40 (530)
...+.-|.|+ ||.||+ |-+|-.=+
T Consensus 135 l~~YRqYvIN---HEVGH~LGh~H~~Cp 159 (203)
T PF11350_consen 135 LASYRQYVIN---HEVGHALGHGHEPCP 159 (203)
T ss_pred HHHHHHHhhh---hhhhhhcccCCCcCC
Confidence 3445556666 999999 99998744
No 99
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=26.32 E-value=22 Score=34.87 Aligned_cols=13 Identities=23% Similarity=0.061 Sum_probs=9.9
Q ss_pred chhhhhhhhhhhccc
Q psy12290 18 SLYGMNHSWATYGHW 32 (530)
Q Consensus 18 ~l~~va~~~HE~GH~ 32 (530)
.++.|. +||+||+
T Consensus 21 ~~l~t~--~HE~gHa 33 (200)
T PF13398_consen 21 RLLVTF--VHELGHA 33 (200)
T ss_pred HHHHHH--HHHHHHH
Confidence 344555 9999999
No 100
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=25.31 E-value=4.3e+02 Score=25.73 Aligned_cols=108 Identities=14% Similarity=0.151 Sum_probs=56.6
Q ss_pred eeeEEeCCEEEEEEC----------CeEEEEcCCCccCCCceecccccccCCCCCccccceEEeeCCcEEEEE---ceeE
Q psy12290 240 DAIAVIRTEVFVFKD----------RWMWRIGDKGVLPGYPVLIHQFWSELPHNLTHIDAVYQRIDNHIAFFI---GRQY 306 (530)
Q Consensus 240 DAvt~irGeiyfFKG----------~~~WR~~~~~~~~gyP~~Is~~WpGLP~~~~~IDAAye~~~gktyFFK---G~~y 306 (530)
|.++.-+|.+||=-. ..+||+...+. +...+.++.. -..+.-.++++++++. ..++
T Consensus 90 D~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~~~-------~~~~~~~~~~----pNGi~~s~dg~~lyv~ds~~~~i 158 (246)
T PF08450_consen 90 DVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPDGK-------VTVVADGLGF----PNGIAFSPDGKTLYVADSFNGRI 158 (246)
T ss_dssp EEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETTSE-------EEEEEEEESS----EEEEEEETTSSEEEEEETTTTEE
T ss_pred eEEEcCCCCEEEEecCCCccccccccceEEECCCCe-------EEEEecCccc----ccceEECCcchheeeccccccee
Confidence 444445778776432 34788876521 2222323321 2333334577766554 4678
Q ss_pred EEeccee--cCCCCCcccccCCCCCcccccccEEEeccCCeEEE--eeCCeEEEEeCCC
Q psy12290 307 FLFEGIR--LLPGYPRPLTALGLPASLERIDAAMVWGYNSKTYL--FSGTRYWKLDDET 361 (530)
Q Consensus 307 W~y~~~~--l~~GYPK~Is~~GlP~~~~~IDAAf~~~~ngktYF--FkG~~YWryde~~ 361 (530)
|+|+-.. ...+.++.+.+ ++......|...+. .+|.+|. +.+++.++||...
T Consensus 159 ~~~~~~~~~~~~~~~~~~~~--~~~~~g~pDG~~vD-~~G~l~va~~~~~~I~~~~p~G 214 (246)
T PF08450_consen 159 WRFDLDADGGELSNRRVFID--FPGGPGYPDGLAVD-SDGNLWVADWGGGRIVVFDPDG 214 (246)
T ss_dssp EEEEEETTTCCEEEEEEEEE---SSSSCEEEEEEEB-TTS-EEEEEETTTEEEEEETTS
T ss_pred EEEeccccccceeeeeeEEE--cCCCCcCCCcceEc-CCCCEEEEEcCCCEEEEECCCc
Confidence 8887431 11112333322 22211247777765 5889998 5789999999874
No 101
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=24.24 E-value=27 Score=33.03 Aligned_cols=12 Identities=25% Similarity=0.315 Sum_probs=9.6
Q ss_pred hhhhhhhhhhhccc
Q psy12290 19 LYGMNHSWATYGHW 32 (530)
Q Consensus 19 l~~va~~~HE~GH~ 32 (530)
++.|. +||+||+
T Consensus 8 ~i~i~--iHE~gH~ 19 (180)
T cd05709 8 LISVT--VHELGHA 19 (180)
T ss_pred HHHHH--HHHHHHH
Confidence 34566 9999999
No 102
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=23.87 E-value=27 Score=33.17 Aligned_cols=20 Identities=15% Similarity=0.069 Sum_probs=12.2
Q ss_pred hhhhhhhhhhccc--cccCCCCCC
Q psy12290 20 YGMNHSWATYGHW--ALSFKMTTI 41 (530)
Q Consensus 20 ~~va~~~HE~GH~--GL~Hs~~~~ 41 (530)
.++. +||+||+ +..+-....
T Consensus 8 i~i~--~HE~gH~~~a~~~G~~~~ 29 (192)
T PF02163_consen 8 ISIV--LHELGHALAARLYGDKVP 29 (192)
T ss_dssp HHHH--HHHHHHHHHHHTTT--B-
T ss_pred cccc--cccccccccccccccccc
Confidence 3455 9999999 655544443
No 103
>KOG3538|consensus
Probab=23.23 E-value=30 Score=41.19 Aligned_cols=33 Identities=15% Similarity=-0.052 Sum_probs=23.8
Q ss_pred hhhhhhhhhhhccc-cccCCCCCC------CeeeecccCC
Q psy12290 19 LYGMNHSWATYGHW-ALSFKMTTI------EAEFKWFDGG 51 (530)
Q Consensus 19 l~~va~~~HE~GH~-GL~Hs~~~~------~iM~p~y~~~ 51 (530)
|-+..|++||+||. |+.|-...+ .||.|-....
T Consensus 315 l~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~~ 354 (845)
T KOG3538|consen 315 LPSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCGG 354 (845)
T ss_pred CccceeeeeccccccCcccCCcCCCCCCcccccCcccccC
Confidence 33444569999999 999987665 4777766544
No 104
>PRK10911 oligopeptidase A; Provisional
Probab=22.51 E-value=29 Score=40.40 Aligned_cols=16 Identities=13% Similarity=0.196 Sum_probs=12.9
Q ss_pred hhhhhhhhhhhccc--cc
Q psy12290 19 LYGMNHSWATYGHW--AL 34 (530)
Q Consensus 19 l~~va~~~HE~GH~--GL 34 (530)
.-.|.|+.|||||+ +|
T Consensus 461 ~~~v~tlfHEfGHalH~~ 478 (680)
T PRK10911 461 HDEVITLFHEFGHGLHHM 478 (680)
T ss_pred HHHHHHHHHHHhHHHHHH
Confidence 35677899999998 55
No 105
>PF14564 Membrane_bind: Membrane binding; PDB: 1YHP_A 2B1O_A.
Probab=20.26 E-value=92 Score=28.08 Aligned_cols=33 Identities=21% Similarity=0.366 Sum_probs=24.8
Q ss_pred ccccEEEeccCC-eEEEeeCCeEEEEeCCCCccc
Q psy12290 333 RIDAAMVWGYNS-KTYLFSGTRYWKLDDETGRSE 365 (530)
Q Consensus 333 ~IDAAf~~~~ng-ktYFFkG~~YWryde~~~~vd 365 (530)
.|.+++.....+ ..|.+.|.-|++||..++.++
T Consensus 50 ~iv~~I~Vr~~~~g~~v~nGsvYFkY~~s~g~V~ 83 (110)
T PF14564_consen 50 EIVCQISVRDTDWGQYVANGSVYFKYNPSTGEVS 83 (110)
T ss_dssp EEEEEEEEEESST-SEEEEEEEEEEEETTTTEEE
T ss_pred cEEEEEEEEeccCCceEecceEEEEECCCCCeEE
Confidence 466666654444 799999999999999887765
Done!