Query psy14093
Match_columns 857
No_of_seqs 357 out of 2673
Neff 6.3
Searched_HMMs 46136
Date Fri Aug 16 19:21:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14093.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14093hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3658|consensus 100.0 2.7E-97 6E-102 831.3 33.7 526 1-634 129-697 (764)
2 KOG3607|consensus 100.0 1E-60 2.3E-65 563.8 27.4 375 1-545 118-511 (716)
3 KOG3607|consensus 100.0 3.2E-45 7E-50 432.5 12.6 225 589-852 268-526 (716)
4 KOG3658|consensus 100.0 8.1E-42 1.8E-46 384.7 13.7 243 555-827 274-573 (764)
5 cd04270 ZnMc_TACE_like Zinc-de 100.0 8.5E-41 1.8E-45 354.7 20.0 235 96-424 2-243 (244)
6 cd04273 ZnMc_ADAMTS_like Zinc- 100.0 2.2E-31 4.8E-36 276.3 12.9 191 95-415 1-196 (207)
7 cd04269 ZnMc_adamalysin_II_lik 100.0 9.4E-31 2E-35 268.6 15.5 187 95-420 1-194 (194)
8 PF01421 Reprolysin: Reprolysi 100.0 3.7E-31 7.9E-36 272.8 11.0 191 95-422 1-199 (199)
9 cd04272 ZnMc_salivary_gland_MP 100.0 1.9E-28 4.1E-33 256.7 15.5 189 96-415 2-209 (220)
10 cd04271 ZnMc_ADAM_fungal Zinc- 100.0 1.5E-28 3.2E-33 258.4 14.2 200 96-415 2-215 (228)
11 cd04267 ZnMc_ADAM_like Zinc-de 100.0 8.6E-28 1.9E-32 246.2 15.2 185 95-415 1-189 (192)
12 KOG3538|consensus 99.9 2.1E-25 4.5E-30 274.0 22.0 343 2-539 107-467 (845)
13 PF13688 Reprolysin_5: Metallo 99.9 7.9E-25 1.7E-29 224.8 9.1 183 94-399 2-196 (196)
14 smart00050 DISIN Homologues of 99.8 4.8E-20 1E-24 159.5 6.0 75 432-519 1-75 (75)
15 smart00050 DISIN Homologues of 99.8 2.2E-19 4.7E-24 155.4 4.2 71 717-812 1-75 (75)
16 PF00200 Disintegrin: Disinteg 99.8 2.4E-19 5.2E-24 156.4 4.3 76 432-520 1-76 (76)
17 PF00200 Disintegrin: Disinteg 99.8 3.5E-19 7.6E-24 155.3 3.3 72 717-813 1-76 (76)
18 PF13574 Reprolysin_2: Metallo 99.7 2.9E-19 6.3E-24 180.7 0.3 103 258-415 64-169 (173)
19 PF13583 Reprolysin_4: Metallo 99.7 6.7E-16 1.4E-20 160.5 15.0 42 94-135 2-43 (206)
20 cd04270 ZnMc_TACE_like Zinc-de 99.7 2E-16 4.4E-21 168.5 8.8 145 558-709 52-243 (244)
21 cd00203 ZnMc Zinc-dependent me 99.6 5.6E-15 1.2E-19 147.5 13.1 106 260-415 51-165 (167)
22 cd04273 ZnMc_ADAMTS_like Zinc- 99.3 2E-12 4.4E-17 134.5 6.2 94 604-704 94-207 (207)
23 PF01421 Reprolysin: Reprolysi 99.3 7E-13 1.5E-17 137.0 2.0 105 588-706 75-198 (199)
24 cd04269 ZnMc_adamalysin_II_lik 99.3 3.1E-12 6.7E-17 131.6 6.1 92 604-705 87-194 (194)
25 cd04271 ZnMc_ADAM_fungal Zinc- 99.2 1.2E-11 2.6E-16 130.7 5.3 105 591-704 82-228 (228)
26 PF13582 Reprolysin_3: Metallo 99.1 3.3E-11 7.1E-16 114.8 3.9 61 264-365 64-124 (124)
27 cd04272 ZnMc_salivary_gland_MP 99.1 9.1E-11 2E-15 123.3 5.2 95 603-704 94-220 (220)
28 cd04267 ZnMc_ADAM_like Zinc-de 98.8 2.5E-09 5.4E-14 109.8 5.1 85 602-695 87-191 (192)
29 PF13688 Reprolysin_5: Metallo 98.3 1.3E-07 2.9E-12 97.3 0.7 103 558-672 45-161 (196)
30 KOG3475|consensus 98.2 5.6E-07 1.2E-11 78.1 2.0 75 779-853 1-75 (92)
31 PF08516 ADAM_CR: ADAM cystein 98.2 3.6E-07 7.7E-12 86.9 0.2 39 815-853 1-44 (117)
32 PTZ00073 60S ribosomal protein 98.2 1.5E-06 3.2E-11 77.0 3.4 75 779-853 1-75 (91)
33 PF13574 Reprolysin_2: Metallo 98.1 9.4E-07 2E-11 89.7 2.1 87 601-696 64-172 (173)
34 cd00203 ZnMc Zinc-dependent me 97.6 4.9E-05 1.1E-09 75.8 4.4 58 609-672 54-115 (167)
35 PRK13267 archaemetzincin-like 97.5 7.7E-05 1.7E-09 76.2 3.3 100 240-386 47-152 (179)
36 KOG3538|consensus 97.3 0.00037 7.9E-09 87.2 7.4 112 604-722 270-406 (845)
37 PF13582 Reprolysin_3: Metallo 96.8 0.00042 9E-09 65.9 1.2 58 606-670 63-124 (124)
38 smart00608 ACR ADAM Cysteine-R 96.8 0.0005 1.1E-08 67.4 1.7 30 823-852 14-43 (137)
39 PF13583 Reprolysin_4: Metallo 96.8 0.0011 2.3E-08 69.5 4.2 86 606-699 93-202 (206)
40 PRK04179 rpl37e 50S ribosomal 96.1 0.004 8.6E-08 51.9 2.3 54 779-832 1-56 (62)
41 cd04279 ZnMc_MMP_like_1 Zinc-d 95.7 0.009 1.9E-07 59.4 3.8 42 348-400 104-145 (156)
42 PF00413 Peptidase_M10: Matrix 95.6 0.0028 6E-08 62.4 -0.4 31 347-388 104-134 (154)
43 smart00608 ACR ADAM Cysteine-R 95.5 0.0094 2E-07 58.5 2.7 23 522-544 1-27 (137)
44 cd04278 ZnMc_MMP Zinc-dependen 94.6 0.016 3.5E-07 57.7 1.8 40 348-400 107-146 (157)
45 TIGR02232 myxo_disulf_rpt Myxo 94.6 0.034 7.5E-07 42.4 2.9 34 709-745 2-35 (38)
46 smart00235 ZnMc Zinc-dependent 94.2 0.021 4.6E-07 55.5 1.6 18 350-367 88-105 (140)
47 cd04277 ZnMc_serralysin_like Z 94.1 0.043 9.3E-07 56.2 3.5 40 348-387 113-157 (186)
48 cd04268 ZnMc_MMP_like Zinc-dep 94.1 0.021 4.5E-07 56.9 1.2 40 348-387 94-138 (165)
49 COG1913 Predicted Zn-dependent 93.9 0.095 2.1E-06 52.9 5.4 27 349-386 125-151 (181)
50 cd04275 ZnMc_pappalysin_like Z 93.9 0.2 4.3E-06 53.3 8.2 21 348-368 137-157 (225)
51 COG2126 RPL37A Ribosomal prote 93.6 0.049 1.1E-06 44.9 2.2 53 779-831 1-54 (61)
52 PF12388 Peptidase_M57: Dual-a 93.4 0.049 1.1E-06 56.9 2.6 28 347-374 132-161 (211)
53 PF05548 Peptidase_M11: Gameto 92.9 0.09 1.9E-06 58.6 3.8 19 349-367 151-169 (314)
54 PF05572 Peptidase_M43: Pregna 92.6 0.049 1.1E-06 54.5 1.1 24 348-372 69-92 (154)
55 cd04276 ZnMc_MMP_like_2 Zinc-d 92.2 0.093 2E-06 54.6 2.6 26 348-373 116-141 (197)
56 TIGR02232 myxo_disulf_rpt Myxo 92.1 0.16 3.5E-06 38.8 3.0 37 424-473 2-38 (38)
57 PRK13267 archaemetzincin-like 91.8 0.093 2E-06 53.8 2.1 85 583-679 47-150 (179)
58 PF02031 Peptidase_M7: Strepto 91.1 0.076 1.7E-06 50.9 0.5 23 349-372 78-100 (132)
59 PF01907 Ribosomal_L37e: Ribos 90.1 0.12 2.5E-06 42.5 0.7 52 780-831 1-53 (55)
60 PF10462 Peptidase_M66: Peptid 88.4 0.22 4.9E-06 55.0 1.6 19 348-366 193-211 (305)
61 cd04327 ZnMc_MMP_like_3 Zinc-d 87.8 0.26 5.7E-06 51.2 1.6 22 349-370 93-114 (198)
62 KOG1565|consensus 86.5 0.4 8.7E-06 56.3 2.3 30 348-388 211-240 (469)
63 PF08516 ADAM_CR: ADAM cystein 86.1 0.34 7.4E-06 46.3 1.2 31 522-552 1-42 (117)
64 cd04283 ZnMc_hatching_enzyme Z 85.6 0.35 7.6E-06 49.8 1.1 18 350-367 79-96 (182)
65 PF07998 Peptidase_M54: Peptid 85.0 0.42 9.2E-06 49.6 1.4 19 348-366 145-163 (194)
66 cd04281 ZnMc_BMP1_TLD Zinc-dep 84.6 0.4 8.7E-06 50.1 1.0 18 350-367 89-106 (200)
67 cd04280 ZnMc_astacin_like Zinc 84.2 0.41 9E-06 49.0 0.9 18 350-367 76-93 (180)
68 PF01400 Astacin: Astacin (Pep 82.7 0.45 9.7E-06 49.3 0.4 20 349-368 80-99 (191)
69 COG4783 Putative Zn-dependent 81.4 0.55 1.2E-05 54.3 0.5 18 346-363 128-145 (484)
70 TIGR03296 M6dom_TIGR03296 M6 f 80.7 0.53 1.2E-05 51.8 0.1 37 349-386 166-203 (286)
71 cd04282 ZnMc_meprin Zinc-depen 80.5 0.73 1.6E-05 49.2 1.0 18 350-367 122-139 (230)
72 PF13948 DUF4215: Domain of un 75.4 1.3 2.8E-05 35.4 0.9 21 708-728 18-38 (47)
73 KOG3714|consensus 72.6 1.3 2.8E-05 51.3 0.4 18 350-367 161-178 (411)
74 PF12044 Metallopep: Putative 70.5 2.7 5.8E-05 48.7 2.3 17 351-367 319-335 (423)
75 COG3824 Predicted Zn-dependent 70.0 1.7 3.8E-05 41.3 0.5 20 347-366 108-127 (136)
76 PF06114 DUF955: Domain of unk 68.2 1.9 4E-05 39.7 0.3 14 347-360 41-54 (122)
77 PF12725 DUF3810: Protein of u 66.2 3 6.4E-05 46.8 1.5 26 342-367 190-215 (318)
78 PF13948 DUF4215: Domain of un 65.4 3 6.6E-05 33.3 1.0 20 423-444 18-37 (47)
79 COG2856 Predicted Zn peptidase 60.4 3.5 7.7E-05 43.5 0.7 13 348-360 72-84 (213)
80 PF09471 Peptidase_M64: IgA Pe 58.5 4.9 0.00011 43.9 1.4 20 347-366 215-234 (264)
81 COG3590 PepO Predicted metallo 57.8 4.5 9.8E-05 47.9 1.0 27 334-367 480-509 (654)
82 KOG4525|consensus 56.3 7.3 0.00016 44.6 2.3 17 351-367 304-320 (614)
83 PF00413 Peptidase_M10: Matrix 55.7 3.9 8.5E-05 40.0 0.1 29 649-681 104-132 (154)
84 PF12115 Salp15: Salivary prot 55.3 7.5 0.00016 37.2 1.9 21 520-540 93-114 (118)
85 cd04268 ZnMc_MMP_like Zinc-dep 51.8 7.3 0.00016 38.6 1.3 19 650-671 94-112 (165)
86 PF01435 Peptidase_M48: Peptid 50.3 7.2 0.00015 40.5 1.0 18 346-363 87-104 (226)
87 PRK01345 heat shock protein Ht 50.2 6.2 0.00013 44.2 0.5 15 346-360 122-136 (317)
88 PF05547 Peptidase_M6: Immune 50.0 4.4 9.6E-05 49.4 -0.7 17 349-365 222-238 (645)
89 PF06262 DUF1025: Possibl zinc 49.1 8.8 0.00019 35.6 1.3 20 346-365 71-90 (97)
90 COG0501 HtpX Zn-dependent prot 48.9 6.7 0.00015 42.8 0.6 15 346-360 155-169 (302)
91 TIGR02289 M3_not_pepF oligoend 47.3 14 0.00029 44.6 2.8 14 347-360 336-349 (549)
92 TIGR00181 pepF oligoendopeptid 45.0 16 0.00034 44.4 2.9 14 347-360 377-390 (591)
93 PRK03001 M48 family peptidase; 43.7 10 0.00022 41.7 1.0 15 346-360 122-136 (283)
94 PRK02870 heat shock protein Ht 43.3 9.9 0.00022 43.0 0.8 18 346-365 171-188 (336)
95 PRK03982 heat shock protein Ht 43.2 11 0.00023 41.7 1.0 15 346-360 123-137 (288)
96 PRK05457 heat shock protein Ht 42.9 10 0.00022 41.9 0.8 15 346-360 132-146 (284)
97 PRK01265 heat shock protein Ht 42.0 9.8 0.00021 42.8 0.5 14 346-359 138-151 (324)
98 PRK03072 heat shock protein Ht 41.5 12 0.00025 41.4 1.0 16 346-361 125-140 (288)
99 cd04279 ZnMc_MMP_like_1 Zinc-d 39.9 10 0.00022 37.6 0.2 29 649-680 103-131 (156)
100 COG1913 Predicted Zn-dependent 39.3 11 0.00024 38.4 0.4 23 653-679 127-149 (181)
101 PF01863 DUF45: Protein of unk 39.2 17 0.00037 37.5 1.7 24 344-367 160-183 (205)
102 PRK04897 heat shock protein Ht 39.0 12 0.00027 41.4 0.7 15 346-360 135-149 (298)
103 PF11350 DUF3152: Protein of u 38.2 16 0.00035 38.2 1.3 20 347-366 138-157 (203)
104 cd04278 ZnMc_MMP Zinc-dependen 38.1 13 0.00029 36.8 0.7 30 649-682 106-135 (157)
105 PF05572 Peptidase_M43: Pregna 38.1 10 0.00022 38.1 -0.2 18 650-670 69-86 (154)
106 cd04275 ZnMc_pappalysin_like Z 37.7 23 0.0005 37.8 2.5 45 623-670 98-154 (225)
107 PF01431 Peptidase_M13: Peptid 37.4 13 0.00029 38.4 0.6 44 347-421 35-83 (206)
108 PF04298 Zn_peptidase_2: Putat 36.8 20 0.00044 38.1 1.8 14 347-360 88-101 (222)
109 TIGR02290 M3_fam_3 oligoendope 36.7 25 0.00055 42.7 2.9 14 347-360 374-387 (587)
110 PRK02391 heat shock protein Ht 34.9 17 0.00037 40.3 1.0 14 346-359 131-144 (296)
111 PTZ00337 surface protease GP63 32.7 44 0.00095 40.5 3.9 19 347-365 228-246 (567)
112 PF04228 Zn_peptidase: Putativ 31.2 21 0.00046 39.5 1.0 13 347-359 169-181 (292)
113 COG1164 Oligoendopeptidase F [ 30.2 19 0.00042 43.8 0.5 16 347-362 379-394 (598)
114 PF12315 DUF3633: Protein of u 29.9 23 0.0005 37.2 0.9 23 343-365 88-110 (212)
115 KOG2719|consensus 29.4 30 0.00065 40.2 1.8 14 346-359 278-291 (428)
116 PF02031 Peptidase_M7: Strepto 29.3 15 0.00033 35.7 -0.5 24 653-679 80-104 (132)
117 cd06456 M3A_DCP_Oligopeptidase 29.0 84 0.0018 36.7 5.4 14 347-360 207-220 (422)
118 PF13485 Peptidase_MA_2: Pepti 28.0 25 0.00055 32.4 0.7 16 348-363 25-40 (128)
119 PF07998 Peptidase_M54: Peptid 27.6 18 0.0004 37.7 -0.3 89 581-679 60-169 (194)
120 smart00235 ZnMc Zinc-dependent 27.4 16 0.00035 35.3 -0.7 16 652-670 88-103 (140)
121 COG2738 Predicted Zn-dependent 27.0 30 0.00065 36.0 1.1 26 337-365 84-109 (226)
122 cd06459 M3B_Oligoendopeptidase 27.0 26 0.00055 40.3 0.7 14 347-360 221-234 (427)
123 KOG3624|consensus 26.3 35 0.00075 42.3 1.7 14 347-360 517-530 (687)
124 cd06455 M3A_TOP Peptidase M3 T 26.2 27 0.00058 41.3 0.7 14 347-360 262-275 (472)
125 PF01447 Peptidase_M4: Thermol 26.0 35 0.00075 34.2 1.3 11 349-359 136-146 (150)
126 COG4784 Putative Zn-dependent 25.7 26 0.00057 39.3 0.4 19 346-365 122-140 (479)
127 PF01432 Peptidase_M3: Peptida 25.6 27 0.00059 40.9 0.6 14 347-360 241-254 (458)
128 KOG1047|consensus 25.5 31 0.00067 41.1 1.0 14 347-360 287-300 (613)
129 PF12388 Peptidase_M57: Dual-a 24.3 22 0.00048 37.5 -0.4 18 650-670 133-150 (211)
130 KOG2090|consensus 23.8 94 0.002 37.6 4.5 14 347-360 478-491 (704)
131 PF01457 Peptidase_M8: Leishma 23.7 40 0.00086 40.5 1.5 18 347-364 209-226 (521)
132 PF13699 DUF4157: Domain of un 23.2 54 0.0012 29.2 1.9 13 348-360 61-73 (79)
133 cd04277 ZnMc_serralysin_like Z 23.2 23 0.00049 36.2 -0.6 17 652-671 115-131 (186)
134 PF14891 Peptidase_M91: Effect 23.0 30 0.00065 35.2 0.2 21 347-367 102-122 (174)
135 PF08325 WLM: WLM domain; Int 22.6 42 0.00091 34.8 1.2 22 346-367 80-101 (186)
136 PF11403 Yeast_MT: Yeast metal 21.4 1.3E+02 0.0028 22.5 3.2 37 490-528 4-40 (40)
137 PF05548 Peptidase_M11: Gameto 20.8 40 0.00087 37.8 0.7 17 652-671 152-168 (314)
138 COG5549 Predicted Zn-dependent 20.1 42 0.00092 35.3 0.6 19 349-367 188-207 (236)
No 1
>KOG3658|consensus
Probab=100.00 E-value=2.7e-97 Score=831.28 Aligned_cols=526 Identities=39% Similarity=0.636 Sum_probs=421.9
Q ss_pred CEEEEEcCCeEEEEEeCCCCCCCCCC-ccEEEEEecccccccccC-CCCCCCCCCCCCCcCCCCccccCcCCCCCCCccc
Q psy14093 1 MTASIHTPEETYHIEPSWRHLPHLGN-QSMVAYRASDVKLSWDHA-HDKPASPCGYVKEEFNTTDFDLDMEEDDPDMPHT 78 (857)
Q Consensus 1 l~G~I~~~~~~Y~IEPl~~~~~~~~~-~~HivYr~sd~~~~~~~~-~~~~~~~Cg~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (857)
+.|.|++++++|+|||+|+|+++... +++|||+.+|++...... ....+..||+.+..........+....++.....
T Consensus 129 ~~g~I~t~~~~y~IEPa~~y~~d~~~p~~~viYke~di~~~~~~~~~~~~p~~~g~~k~~~~el~~k~~~~~~p~n~~~~ 208 (764)
T KOG3658|consen 129 FEGSIQTEGDTYYIEPAWRYFEDDKLPFHSVIYKESDIKNVHRLVATAGSPKVCGYSKRRRRELLKKGEVDRIPPNELTA 208 (764)
T ss_pred EEEEEEecCceEEeccccccccCCCCccceEEEehhhhcchhhcccccCCCcccchhhhhhhhcccccccccCChhhhhh
Confidence 57999999999999999999988765 999999999998542211 1123677887663222211111111111111122
Q ss_pred --ccccccCCCCCCCCCcceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEE
Q psy14093 79 --RSKRQVDPAYEYKPTKTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIK 156 (857)
Q Consensus 79 --r~kR~~~~~~~~~~~~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~ 156 (857)
|.||...... .+.+++|.|+||||++||++||..+.+++++||+.+|.+||.||++++|+++ .|+.+|+|+|+
T Consensus 209 ~~r~~R~~d~~~--~~~kntC~LylqADh~fy~~mg~~t~e~~~~~l~~hI~rVn~IY~~T~f~~~---~g~kni~F~Ik 283 (764)
T KOG3658|consen 209 NGRSKRRKDLLS--DPMKNTCSLYLQADHRFYRYMGRGTRETTTNYLINHIDRVNEIYRNTNFDDR---GGFKNIGFVIK 283 (764)
T ss_pred hhhhhhhccccc--ccccceeeEEEEecHHHHHHHcCCchHHHHHHHHHHHHHHHHHhcCCccccC---CCccceEEEEE
Confidence 5566554321 3679999999999999999999888999999999999999999999999754 45678888888
Q ss_pred EEEEeccchhhccccccccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhcccc
Q psy14093 157 KIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGE 236 (857)
Q Consensus 157 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~ 236 (857)
+|.|+++++..+.++.||||+.+
T Consensus 284 rIkI~~~~~~~~~~~~~~N~f~~--------------------------------------------------------- 306 (764)
T KOG3658|consen 284 RIKILDSPACVRPGEDHYNMFCE--------------------------------------------------------- 306 (764)
T ss_pred EEEecCchhhcccCcCccccchh---------------------------------------------------------
Confidence 88888777777777777777431
Q ss_pred ccccccccccchhhhccccccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCc
Q psy14093 237 AHYNMIRDKWDVRNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSS 316 (857)
Q Consensus 237 ~~~~~~f~~w~~~~ll~~~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~ 316 (857)
......++++++|. .+||+||||||||++||++|||||||||+|+.+++||||++.+..
T Consensus 307 -------e~~~v~~fl~l~s~-~~~sdfCLAylfT~rdFe~GtLGLAwVgsp~~~saGGIC~k~~~y------------- 365 (764)
T KOG3658|consen 307 -------EVKDVLEFLSLNSE-EKHSDFCLAYLFTYRDFEGGTLGLAWVGSPRSNSAGGICEKAYTY------------- 365 (764)
T ss_pred -------hHHHHHHHhhhhhh-ccccchhheeeeeecccccceeeeEEeccCccCCCCcccccceec-------------
Confidence 11223345555554 478999999999999999999999999999999999999986532
Q ss_pred cccccccceeecccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCC
Q psy14093 317 SQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNK 396 (857)
Q Consensus 317 ~~~~~~~~~~~~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~ 396 (857)
.+|..+|+|+||+|++| ||++||++++.+|+|||||||||++||++. +|.|+.+++|+|||++++++|+++||.
T Consensus 366 ----~~G~~~sLNtGi~T~~N-Yg~~Vp~kvs~lt~AHEiGHNfGSpHDp~~-ECsPg~~~~GnyiM~a~AtSGd~~NN~ 439 (764)
T KOG3658|consen 366 ----NEGKKRSLNTGISTSVN-YGKRVPTKVSDLTLAHEIGHNFGSPHDPDI-ECSPGESQGGNYIMYAYATSGDKPNNK 439 (764)
T ss_pred ----cCcceEEeecceeeeee-cCCccCcchhheeehhhhccccCCCCCCCC-ccCCCCCCCCcEEEEEecccCcCccCc
Confidence 23667799999999999 999999999999999999999999999975 999999999999999999999999999
Q ss_pred CCCCCCCCCCcccHHHHhc-------cccccCCCcccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCC-----CC
Q psy14093 397 TFSSHVDGTNFCLAHLFTH-------QSFWSRGSSILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHS-----IL 464 (857)
Q Consensus 397 ~FS~~~~~~~~CS~~~i~~-------~Cl~~~~~~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~-----~~ 464 (857)
+||+ ||+++|++ .||.++..++|||+|||+| ||||||... .|.|.||++. ..
T Consensus 440 kFSp-------CS~ksI~~VL~~k~~~CFqE~~~sfCGN~iVE~G--EECDcG~~~-------~~~D~CC~p~~~~~~k~ 503 (764)
T KOG3658|consen 440 KFSP-------CSLKSISKVLESKKRNCFQERESSFCGNGIVEPG--EECDCGFST-------DCKDSCCDPQPNLDEKP 503 (764)
T ss_pred ccCc-------chHHHHHHHHHHhhhhhccCccccccCCccccCC--cccCccccc-------ccCCcccCCCCCCCCCC
Confidence 9999 99999998 5999999999999999999 999999986 6999999931 25
Q ss_pred CcccCCCccCCCCCCccc-CCcccCCCceeecCCCCCCCCCcccCCCCCCCCCCCCCCCCCcCCCCceecCCccchHhhh
Q psy14093 465 SELFPVLLYSDKNSPCCQ-NCKFMAVGMKCRDAQYATCEQESRCTGSSSVCPPSAPMSDNTGCLERGKCRGGKCIPFCET 543 (857)
Q Consensus 465 C~l~~ga~Cs~~~g~CC~-~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s~~Cp~~~~~~dgt~C~~~~~C~~G~C~~~C~~ 543 (857)
|+|++||||||+|||||. +|+|+++++.||.+++.+|..+++|+|.+++||++.|++|||+|.+.+.|++|+|.++|+.
T Consensus 504 C~lk~gaqCSpsqgpCC~~~Cqf~~~~~kc~~~d~~~C~~~s~CnG~~aeCP~s~~~~d~t~C~~~~~C~~G~C~gs~c~ 583 (764)
T KOG3658|consen 504 CTLKPGAQCSPSQGPCCTPNCQFHTSGEKCREADEATCKGESTCNGFSAECPPSPPKPDGTVCNETGVCINGKCIGSCCL 583 (764)
T ss_pred ceeCCCCccCCCCCCccCCcceeecccCeeeccccccCcCcccccCCccCCcCCCCCCCCCcccccceEeCCcCccHHHH
Confidence 999999999999999995 9999999999998887899999999999999999999999999999999999999986665
Q ss_pred -cccccccccccC-------------CCCcceee-e-------eEEEcCCCCccC--CCccccCcccccccccccccccc
Q psy14093 544 -QNQQSCMCDVNG-------------YRGMGFVI-K-------KIVVHSDATRVR--QGEAHYNMIRDKWDVRNLLETFS 599 (857)
Q Consensus 544 -~~~~~C~c~~~~-------------~~g~C~pi-~-------~~~il~~GT~C~--~G~c~~n~c~r~W~~~~lL~~~~ 599 (857)
+|+++|+|+... ..++|+|+ + ..++|++|+||+ .|+|+.+..++.-+....|.|+.
T Consensus 584 ~~glesC~c~~~~~~~sC~lcCr~~~~~stC~~~v~~~~~~~~~~~~L~pGsPC~~~~GyCdvf~KCr~vd~dgpl~rl~ 663 (764)
T KOG3658|consen 584 MQGLESCFCTETDLDKSCKLCCRKPDDNSTCVPTVEANRKNFDVGIYLRPGSPCNNYKGYCDVFGKCRKVDADGPLARLK 663 (764)
T ss_pred hhCcceeeeccCCchhhHHHHhcCCCCCCCcccccCccccccccceecCCCCcCcCCcceeccccceeeecCCchHHHHH
Confidence 899999988633 56888884 2 245899999999 89999988888777776666654
Q ss_pred cCCC-CCCc-ceEEEEEeeeecccCCCcceeeeecCC
Q psy14093 600 SHVD-GTNF-CLAHLFTHQSFWSRGSSILGLAYIASP 634 (857)
Q Consensus 600 ~~~~-hD~~-c~AhLft~~df~~~~~~tvGlA~vgs~ 634 (857)
..+. -..+ -+|..++. .|| ....+|+|++-.|
T Consensus 664 ~li~~~~~~~~~~~w~~~-~w~--~v~i~gi~~i~~m 697 (764)
T KOG3658|consen 664 DLIFQLETIETFAEWIVL-NWL--AVNIVGIVLIVLM 697 (764)
T ss_pred HHHhhhhhhhhhHHHHHh-hhH--HhHhHHHHHHHHH
Confidence 2110 0100 01222222 344 3567777776555
No 2
>KOG3607|consensus
Probab=100.00 E-value=1e-60 Score=563.80 Aligned_cols=375 Identities=26% Similarity=0.453 Sum_probs=299.5
Q ss_pred CEEEEEcCCeEEEEEeCCCCCCCCCCccEEEEEecccccccccCCCCCCCCCCCCCCcCCCCccccCcCCCCCC-Ccccc
Q psy14093 1 MTASIHTPEETYHIEPSWRHLPHLGNQSMVAYRASDVKLSWDHAHDKPASPCGYVKEEFNTTDFDLDMEEDDPD-MPHTR 79 (857)
Q Consensus 1 l~G~I~~~~~~Y~IEPl~~~~~~~~~~~HivYr~sd~~~~~~~~~~~~~~~Cg~~~~~~~~~~~~~~~~~~~~~-~~~~r 79 (857)
|+|+|...+..|.|||+ ..+..|.|+||+....... .....||..+..... ... ....+. .+..+
T Consensus 118 l~g~~~~~~~~Y~Iepl----~~s~~f~h~iy~~~~~~~~------~~~~~~g~~~~~~~~--~~~--~~~~~~~~~~~~ 183 (716)
T KOG3607|consen 118 LRGLFLFENISYSIEPL----EGSDSFEHLIYKLLSLETD------NSALVCGHQEAHIIT--KQM--ELPLRSKRQERR 183 (716)
T ss_pred cceeEEECceeEEEEEC----CCCCCcceeeecccccccc------ccccccccchhhccc--ccc--cccCcccccccc
Confidence 68999999999999999 4567899999999876632 335678877543332 000 000111 11122
Q ss_pred cccccCCCCCCCCCcceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEE
Q psy14093 80 SKRQVDPAYEYKPTKTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIV 159 (857)
Q Consensus 80 ~kR~~~~~~~~~~~~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~ 159 (857)
.+|. ++...+|+||+||+|+.+|++++. ++.++.++++.++|+||.||+.. +++|.++
T Consensus 184 ~~~~------~~~~~kyvEl~vV~D~~~~~~~~~-~~~~v~~~~~~vvn~~d~~y~~l--------------ni~i~lv- 241 (716)
T KOG3607|consen 184 EKRT------VIEHTKYVELYVVVDNDLYKKYGS-NLSKVRSFAKEVVNLVDSIYKQL--------------NIRIVLV- 241 (716)
T ss_pred cccc------cccccceEEEEEEEehHHHHHccc-cHHHHHHHHHHhhhHHhhhHhhc--------------cceEEEE-
Confidence 2332 246899999999999999999985 78899999999999999999962 3444443
Q ss_pred EeccchhhccccccccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccc
Q psy14093 160 VHSDATRVRQGEAHYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY 239 (857)
Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~ 239 (857)
++|+|+++| +|.+..++..+|.
T Consensus 242 ----------------------------------------~lE~Wt~~d-------------ki~~~~~~~~tL~----- 263 (716)
T KOG3607|consen 242 ----------------------------------------GLEIWTDGN-------------KIDVSEDLRETLH----- 263 (716)
T ss_pred ----------------------------------------EEEecCCCC-------------eecccccHHHHHH-----
Confidence 235555555 6777778888887
Q ss_pred cccccccchhhhccccccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCcccc
Q psy14093 240 NMIRDKWDVRNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQY 319 (857)
Q Consensus 240 ~~~f~~w~~~~ll~~~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~ 319 (857)
.|..|+...+.. ...|| .|||||+..|.+.++|+||+| |||++.++ +++. .
T Consensus 264 --~F~~wr~~~l~~----r~~hD---~a~L~~~~~~~~~~~G~a~~~--------~mCs~~~s--~gv~---------~- 314 (716)
T KOG3607|consen 264 --NFLKWRKSYLTT----RLPHD---AAHLLSGILFYGKYVGLAYFG--------GMCSPGHS--GGVN---------K- 314 (716)
T ss_pred --HHHHHHHhhccc----cCCCC---ceEEEEeeeccCceeceeecc--------cccCcccc--ccee---------e-
Confidence 699999976664 35789 999999999999999999999 99999763 3332 0
Q ss_pred ccccceeecccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCC
Q psy14093 320 FKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFS 399 (857)
Q Consensus 320 ~~~~~~~~~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS 399 (857)
+.... +...|.|+||||||||||.||...+.|.+ .+.||| +..+. .+.+.+||
T Consensus 315 -------------------~~~~~-~~~~a~v~AhelgH~lGm~hD~~~c~C~~----~~~CiM-~~~~~--~~~~~~FS 367 (716)
T KOG3607|consen 315 -------------------FHSDI-LLAFAVVLAHELGHNLGMIHDESSCSCPS----SGLCIM-PEETG--FYIPKKFS 367 (716)
T ss_pred -------------------cCccc-chhHHHHHHHHHHhhcCcccccccccCCC----CCcccc-ccccC--cCcccccc
Confidence 11111 23579999999999999999997777763 367999 55442 36789999
Q ss_pred CCCCCCCcccHHHHhc-------cccccCC-------CcccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCCCCC
Q psy14093 400 SHVDGTNFCLAHLFTH-------QSFWSRG-------SSILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILS 465 (857)
Q Consensus 400 ~~~~~~~~CS~~~i~~-------~Cl~~~~-------~~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C 465 (857)
+ ||++++.. .||.+.| .++||||+||+| ||||||+.+ .|.+.||++. ||
T Consensus 368 ~-------CS~~~~~~~~~~~~~~cl~n~p~p~~~~~~~~CGNg~vE~g--EECDCG~~~-------eC~~~cC~~~-tC 430 (716)
T KOG3607|consen 368 N-------CSYQDFESFLLKGGGSCLLNLPSPEKIFSGPVCGNGVVEEG--EECDCGTPE-------ECSNLCCCAS-TC 430 (716)
T ss_pred c-------chHHHHHHHHhcCCcceEecCCCcccccCCCccCCcccccC--CcCCCCChH-------HcCccccccc-cc
Confidence 9 99999998 4988744 589999999999 999999998 4777799988 99
Q ss_pred cccCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCCCCCCCCCCCCCCCcCCC-CceecCCccch---Hh
Q psy14093 466 ELFPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSSSVCPPSAPMSDNTGCLE-RGKCRGGKCIP---FC 541 (857)
Q Consensus 466 ~l~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s~~Cp~~~~~~dgt~C~~-~~~C~~G~C~~---~C 541 (857)
+|+++|+|+. |+||.+|+|+|+|++||++. ++||++|||+|+|+.||+|.+++||++|.. .++||+|.|.+ +|
T Consensus 431 kL~pga~Ca~--G~CC~~C~~~p~G~~CR~~~-~eCDlpE~C~G~S~~Cp~d~~~~dG~~C~~~~g~Cy~G~C~t~~~QC 507 (716)
T KOG3607|consen 431 KLKPGAQCAL--GLCCKDCKFLPAGTVCRQAK-NECDLPEYCNGTSSQCPPDLYVQDGIPCQGGQGYCYDGRCHTRDRQC 507 (716)
T ss_pred cccCCCCCCC--CCchhccccccCcceecccC-CccccccccCCCCCCCCCCceeECCEecCCCcceecCCCCCCHHHHH
Confidence 9999999995 99999999999999999987 899999999999999999999999999998 68999999996 89
Q ss_pred hhcc
Q psy14093 542 ETQN 545 (857)
Q Consensus 542 ~~~~ 545 (857)
+++.
T Consensus 508 ~~i~ 511 (716)
T KOG3607|consen 508 QKIW 511 (716)
T ss_pred HHHh
Confidence 9963
No 3
>KOG3607|consensus
Probab=100.00 E-value=3.2e-45 Score=432.53 Aligned_cols=225 Identities=27% Similarity=0.469 Sum_probs=196.8
Q ss_pred ccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceec---cccccccchhhheeeeeeec
Q psy14093 589 WDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIH---ENLKYFDTLHSSHISHTIVK 664 (857)
Q Consensus 589 W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~---~~~~~~~~~~aheigh~l~~ 664 (857)
|+...+..+++ || .||||++..| .+.++|+||+|+| ++.+++||.. ++...++.++||||||+|
T Consensus 268 wr~~~l~~r~~----hD---~a~L~~~~~~---~~~~~G~a~~~~mCs~~~s~gv~~~~~~~~~~~a~v~AhelgH~l-- 335 (716)
T KOG3607|consen 268 WRKSYLTTRLP----HD---AAHLLSGILF---YGKYVGLAYFGGMCSPGHSGGVNKFHSDILLAFAVVLAHELGHNL-- 335 (716)
T ss_pred HHHhhccccCC----CC---ceEEEEeeec---cCceeceeecccccCcccccceeecCcccchhHHHHHHHHHHhhc--
Confidence 54444444444 89 9999999888 4899999999999 7778999874 356788899999999999
Q ss_pred ccccccCCC------CCCcccC--------CCCCCCCHHHHHHHhhcCCCCcCCCC-------CccccCCcccccccccc
Q psy14093 665 RGVQESNHP------FNKIKEV--------TFKTLGNLRSIRKVLEAKSGKCFSEP-------EESFCGNLRVEGDEECD 723 (857)
Q Consensus 665 ~g~~~d~~~------~gcim~~--------~fss~CS~~~~~~~l~~~~~~Cl~~~-------~~~~CGNgiVE~gEeCD 723 (857)
||.||... ..|||.. .| |.||+++|..++.++.+.||+|. ..++||||+||+|||||
T Consensus 336 -Gm~hD~~~c~C~~~~~CiM~~~~~~~~~~~F-S~CS~~~~~~~~~~~~~~cl~n~p~p~~~~~~~~CGNg~vE~gEECD 413 (716)
T KOG3607|consen 336 -GMIHDESSCSCPSSGLCIMPEETGFYIPKKF-SNCSYQDFESFLLKGGGSCLLNLPSPEKIFSGPVCGNGVVEEGEECD 413 (716)
T ss_pred -CcccccccccCCCCCccccccccCcCccccc-ccchHHHHHHHHhcCCcceEecCCCcccccCCCccCCcccccCCcCC
Confidence 99999733 3588852 45 47999999999999999999982 56889999999999999
Q ss_pred CCCCCCCCCCCCCC-CCCccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCCCc---cc
Q psy14093 724 AGLLGTEDNDSCCD-KVCKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHTLC---RR 799 (857)
Q Consensus 724 CG~~~~c~~d~CC~-~tC~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~~C---e~ 799 (857)
||++++|. |.||+ +||||+|| |+||.|+ || ++|+|+|+ |++| |+ +.++| ||
T Consensus 414 CG~~~eC~-~~cC~~~tCkL~pg-----a~Ca~G~--------CC-~~C~~~p~-----G~~C---R~--~~~eCDlpE~ 468 (716)
T KOG3607|consen 414 CGTPEECS-NLCCCASTCKLKPG-----AQCALGL--------CC-KDCKFLPA-----GTVC---RQ--AKNECDLPEY 468 (716)
T ss_pred CCChHHcC-ccccccccccccCC-----CCCCCCC--------ch-hccccccC-----ccee---cc--cCCccccccc
Confidence 99999995 55666 79999999 9999999 99 99999999 9999 87 77888 99
Q ss_pred cCCCcccccccccccCCCCCCCCccc-----CCChhhhhhcccCCcccchhhhHHHhh
Q psy14093 800 CGKSSYHIQKKKCAQCGYPCPRLRHY-----NWSVKAQRRKTTGTGRMRHLKIVRRRF 852 (857)
Q Consensus 800 C~G~s~~Cp~d~~~~~G~~C~~~~~~-----c~~~~~qc~~~~G~~~~~a~~~c~~~~ 852 (857)
|+|+|++||+|+|++||+||+..++| |.++++||++|||++|+.|++.||+.+
T Consensus 469 C~G~S~~Cp~d~~~~dG~~C~~~~g~Cy~G~C~t~~~QC~~i~g~~a~~a~~~Cy~~v 526 (716)
T KOG3607|consen 469 CNGTSSQCPPDLYVQDGIPCQGGQGYCYDGRCHTRDRQCQKIWGPGARAAPKYCYEKV 526 (716)
T ss_pred cCCCCCCCCCCceeECCEecCCCcceecCCCCCCHHHHHHHHhCCCcccCchhhhhhc
Confidence 99999999999999999999987788 679999999999999999999999874
No 4
>KOG3658|consensus
Probab=100.00 E-value=8.1e-42 Score=384.67 Aligned_cols=243 Identities=36% Similarity=0.635 Sum_probs=204.7
Q ss_pred CCCCcceeeeeEEEcCCCCccCCCccccCccc-ccccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecC
Q psy14093 555 GYRGMGFVIKKIVVHSDATRVRQGEAHYNMIR-DKWDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIAS 633 (857)
Q Consensus 555 ~~~g~C~pi~~~~il~~GT~C~~G~c~~n~c~-r~W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs 633 (857)
.++++.|+|++|.|+.+++--..+.-++|+-+ ...++.+||+..+ ..+|+++|+|||||++|| +++||||||||+
T Consensus 274 g~kni~F~IkrIkI~~~~~~~~~~~~~~N~f~~e~~~v~~fl~l~s-~~~~sdfCLAylfT~rdF---e~GtLGLAwVgs 349 (764)
T KOG3658|consen 274 GFKNIGFVIKRIKILDSPACVRPGEDHYNMFCEEVKDVLEFLSLNS-EEKHSDFCLAYLFTYRDF---EGGTLGLAWVGS 349 (764)
T ss_pred CccceEEEEEEEEecCchhhcccCcCccccchhhHHHHHHHhhhhh-hccccchhheeeeeeccc---ccceeeeEEecc
Confidence 37899999999999988875556778888887 6667777777654 567899999999999999 799999999999
Q ss_pred CCCCcccceeccc-------cccccc-------------------hhhheeeeeeecccccccCC-CC--------CCcc
Q psy14093 634 PRPYSIGGSIHEN-------LKYFDT-------------------LHSSHISHTIVKRGVQESNH-PF--------NKIK 678 (857)
Q Consensus 634 ~~~~~s~Gv~~~~-------~~~~~~-------------------~~aheigh~l~~~g~~~d~~-~~--------gcim 678 (857)
+..+.+|||+... ..++++ ++|||||||+ |..||-. .+ .=||
T Consensus 350 p~~~saGGIC~k~~~y~~G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNf---GSpHDp~~ECsPg~~~~GnyiM 426 (764)
T KOG3658|consen 350 PRSNSAGGICEKAYTYNEGKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNF---GSPHDPDIECSPGESQGGNYIM 426 (764)
T ss_pred CccCCCCcccccceeccCcceEEeecceeeeeecCCccCcchhheeehhhhcccc---CCCCCCCCccCCCCCCCCcEEE
Confidence 8667899998532 123332 4599999999 8788753 11 1256
Q ss_pred c-----------CCCCCCCCHHHHHHHhhcCCCCcCCCCCccccCCccccccccccCCCCCCCCCCCCCC-------CCC
Q psy14093 679 E-----------VTFKTLGNLRSIRKVLEAKSGKCFSEPEESFCGNLRVEGDEECDAGLLGTEDNDSCCD-------KVC 740 (857)
Q Consensus 679 ~-----------~~fss~CS~~~~~~~l~~~~~~Cl~~~~~~~CGNgiVE~gEeCDCG~~~~c~~d~CC~-------~tC 740 (857)
. .+| |+||+++|+++|..++..||+++..+||||+|||+|||||||...+| .|.||+ ..|
T Consensus 427 ~a~AtSGd~~NN~kF-SpCS~ksI~~VL~~k~~~CFqE~~~sfCGN~iVE~GEECDcG~~~~~-~D~CC~p~~~~~~k~C 504 (764)
T KOG3658|consen 427 YAYATSGDKPNNKKF-SPCSLKSISKVLESKKRNCFQERESSFCGNGIVEPGEECDCGFSTDC-KDSCCDPQPNLDEKPC 504 (764)
T ss_pred EEecccCcCccCccc-CcchHHHHHHHHHHhhhhhccCccccccCCccccCCcccCccccccc-CCcccCCCCCCCCCCc
Confidence 2 478 58999999999999999999999999999999999999999999998 999999 689
Q ss_pred ccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCCCc---cccCCCcccccccccccCCC
Q psy14093 741 KLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHTLC---RRCGKSSYHIQKKKCAQCGY 817 (857)
Q Consensus 741 ~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~~C---e~C~G~s~~Cp~d~~~~~G~ 817 (857)
||||| ||| ||+|++||+.+|||+++ |..| |. ....+| .+|+|.+++||+...+.||+
T Consensus 505 ~lk~g-----aqC------SpsqgpCC~~~Cqf~~~-----~~kc---~~-~d~~~C~~~s~CnG~~aeCP~s~~~~d~t 564 (764)
T KOG3658|consen 505 TLKPG-----AQC------SPSQGPCCTPNCQFHTS-----GEKC---RE-ADEATCKGESTCNGFSAECPPSPPKPDGT 564 (764)
T ss_pred eeCCC-----Ccc------CCCCCCccCCcceeecc-----cCee---ec-cccccCcCcccccCCccCCcCCCCCCCCC
Confidence 99999 655 89999999779999999 9999 64 245689 89999999999999999999
Q ss_pred CCCCCcccCC
Q psy14093 818 PCPRLRHYNW 827 (857)
Q Consensus 818 ~C~~~~~~c~ 827 (857)
+|.. .+.|.
T Consensus 565 ~C~~-~~~C~ 573 (764)
T KOG3658|consen 565 VCNE-TGVCI 573 (764)
T ss_pred cccc-cceEe
Confidence 9984 66654
No 5
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=100.00 E-value=8.5e-41 Score=354.74 Aligned_cols=235 Identities=51% Similarity=0.881 Sum_probs=183.6
Q ss_pred eeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhcccccccc
Q psy14093 96 RCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN 175 (857)
Q Consensus 96 ~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~~ 175 (857)
+|.|+||||++||++++++++++++.||+++||+||.||++++|+.+ +..+|+|.|.+|+|+
T Consensus 2 ~C~l~vVaD~~~~~~~~~~~~~~v~~~~~~~vn~vn~iY~~~n~~~~----~l~~i~i~~~~i~I~-------------- 63 (244)
T cd04270 2 TCKLLLVADHRFYKYMGRGEEETTINYLISHIDRVDDIYRNTDWDGG----GFKGIGFQIKRIRIH-------------- 63 (244)
T ss_pred ccEEEEEEcHHHHHHhCCChHHHHHHHHHHHHHHHHHHhcccccCCC----cccceEEEEEEEEEE--------------
Confidence 59999999999999999644788999999999999999999886422 123566666655332
Q ss_pred ccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhcccc
Q psy14093 176 MIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEVF 255 (857)
Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~~ 255 (857)
.+.+ +..++. ..++..|..|+...+++.+
T Consensus 64 -----------------------------~~~~--------------------~~~~~~--~~~~~~f~~w~~~~~~~~~ 92 (244)
T cd04270 64 -----------------------------TTPD--------------------EVDPGN--KFYNKSFPNWGVEKFLVKL 92 (244)
T ss_pred -----------------------------cCcc--------------------cccCCC--cccccCccchhHHHHhhhh
Confidence 1111 111111 1122358999998888766
Q ss_pred ccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccc
Q psy14093 256 SREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSS 335 (857)
Q Consensus 256 s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~ 335 (857)
+....+|+||+|||||++||.++++||||+|++..+..||||++.++... +...++|+||++.
T Consensus 93 ~~~~~~d~~c~A~LlT~~df~~~tlGlA~vg~~~~~~~gGIC~~~~s~~~-----------------~~~~~~n~gl~t~ 155 (244)
T cd04270 93 LLEQFSDDVCLAHLFTYRDFDMGTLGLAYVGSPRDNSAGGICEKAYYYSN-----------------GKKKYLNTGLTTT 155 (244)
T ss_pred hhhcCCccceEEEEEeccccCCCceeeeeecccccCCCCcccCCcccccC-----------------CcceeeecceEee
Confidence 55455666899999999999999999999999888889999999764311 1224789999988
Q ss_pred cccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHhc
Q psy14093 336 RNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFTH 415 (857)
Q Consensus 336 ~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~~ 415 (857)
.+ ||.+++++.+++||||||||||||+||++.+.|.|...++++|||++.+++|+.+++++||+ ||+++|++
T Consensus 156 ~~-~~~~~~~~~~a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~-------CS~~~~~~ 227 (244)
T cd04270 156 VN-YGKRVPTKESDLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSP-------CSKKSISK 227 (244)
T ss_pred ec-cCCccchhHHHHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCH-------hHHHHHHH
Confidence 88 89888888889999999999999999997667877655467999999988888889999999 99999987
Q ss_pred -------cccccCCCc
Q psy14093 416 -------QSFWSRGSS 424 (857)
Q Consensus 416 -------~Cl~~~~~~ 424 (857)
.||.++..+
T Consensus 228 ~L~~~~~~CL~~~~~~ 243 (244)
T cd04270 228 VLEVKSNSCFVERSQS 243 (244)
T ss_pred HHccCCCCCcCCCCCC
Confidence 599887544
No 6
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=99.97 E-value=2.2e-31 Score=276.35 Aligned_cols=191 Identities=24% Similarity=0.361 Sum_probs=142.4
Q ss_pred ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093 95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY 174 (857)
Q Consensus 95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~ 174 (857)
+|+||+||+|+.+|++++. +.+++|++.+||+||.||+++++. ++|+|.|+.|+
T Consensus 1 ~~vEl~vv~D~~~~~~~~~---~~~~~y~~~~~n~v~~~y~~~s~~--------~~i~i~l~~i~--------------- 54 (207)
T cd04273 1 RYVETLVVADSKMVEFHHG---EDLEHYILTLMNIVASLYKDPSLG--------NSINIVVVRLI--------------- 54 (207)
T ss_pred CeEEEEEEECHHHHHHcCh---HHHHHHHHHHHHHHHHHhcCcccC--------CCeEEEEEEEE---------------
Confidence 5899999999999998763 458899999999999999986532 45666666653
Q ss_pred cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093 175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV 254 (857)
Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~ 254 (857)
+|++.++ .+.+..++..+|. .|.+|+.+.+...
T Consensus 55 -----i~~~~~~-----------------------------------~~~~~~~~~~~L~-------~F~~w~~~~~~~~ 87 (207)
T cd04273 55 -----VLEDEES-----------------------------------GLLISGNAQKSLK-------SFCRWQKKLNPPN 87 (207)
T ss_pred -----EecCCCC-----------------------------------cCcccCCHHHHHH-------HHHHHHHHcCCcc
Confidence 3433221 1223456666665 7999998776643
Q ss_pred cccccCCCccceEEEeeccccC-----CCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecc
Q psy14093 255 FSREYSHKDFCLAHLFTDLKFE-----GGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLN 329 (857)
Q Consensus 255 ~s~~~~~D~~cla~L~T~~df~-----~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 329 (857)
.....++| +|+|||++||. ++++|+||+| |||++.++. ++. .+
T Consensus 88 ~~~~~~~D---~a~llt~~d~~~~~~~~~~~G~A~~g--------giC~~~~s~--~i~-------------------~~ 135 (207)
T cd04273 88 DSDPEHHD---HAILLTRQDICRSNGNCDTLGLAPVG--------GMCSPSRSC--SIN-------------------ED 135 (207)
T ss_pred cccccccc---eEEEEeeecccccCCCCCceEEeccc--------cCCCCCcce--EEE-------------------cC
Confidence 22224677 99999999996 6899999999 999987632 222 11
Q ss_pred cCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCccc
Q psy14093 330 SGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCL 409 (857)
Q Consensus 330 ~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS 409 (857)
.|+ ..|.|+||||||+|||.||++.+.|++.. +++|||++..... .++++||+ ||
T Consensus 136 ~~~--------------~~a~~~aHElGH~LG~~HD~~~~~C~~~~--~~~~IM~~~~~~~--~~~~~fS~-------CS 190 (207)
T cd04273 136 TGL--------------SSAFTIAHELGHVLGMPHDGDGNSCGPEG--KDGHIMSPTLGAN--TGPFTWSK-------CS 190 (207)
T ss_pred CCc--------------eeEEeeeeechhhcCCCCCCCCCCCCCCC--CCceeecccccCC--CCCCCcCH-------HH
Confidence 111 35789999999999999999878897642 2479999987642 36899999 99
Q ss_pred HHHHhc
Q psy14093 410 AHLFTH 415 (857)
Q Consensus 410 ~~~i~~ 415 (857)
+++|.+
T Consensus 191 ~~~~~~ 196 (207)
T cd04273 191 RRYLTS 196 (207)
T ss_pred HHHHHH
Confidence 999987
No 7
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.97 E-value=9.4e-31 Score=268.65 Aligned_cols=187 Identities=28% Similarity=0.495 Sum_probs=143.2
Q ss_pred ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093 95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY 174 (857)
Q Consensus 95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~ 174 (857)
+|+|++||+|+.+|+++++ +++++++|++.++|.||.||++. +|+|.|..|
T Consensus 1 ~~iE~~vv~D~~~~~~~~~-n~~~~~~~v~~~~n~vn~~y~~~------------~i~v~l~~i---------------- 51 (194)
T cd04269 1 KYVELVVVVDNSLYKKYGS-NLSKVRQRVIEIVNIVDSIYRPL------------NIRVVLVGL---------------- 51 (194)
T ss_pred CeEEEEEEEeHHHHHHhCC-CHHHHHHHHHHHHHHHHHHHcCC------------CeEEEEEEE----------------
Confidence 5899999999999999985 68889999999999999999983 466666665
Q ss_pred cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093 175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV 254 (857)
Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~ 254 (857)
+||++++++ .+..++..+|. .|.+|+...+..
T Consensus 52 ----~i~~~~~~~------------------------------------~~~~~~~~~L~-------~f~~w~~~~~~~- 83 (194)
T cd04269 52 ----EIWTDKDKI------------------------------------SVSGDAGETLN-------RFLDWKRSNLLP- 83 (194)
T ss_pred ----EEeecCCcc------------------------------------cccCCHHHHHH-------HHHHHHHHhhcc-
Confidence 346555543 23345555555 688998876653
Q ss_pred cccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCcc
Q psy14093 255 FSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSS 334 (857)
Q Consensus 255 ~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~ 334 (857)
...+| +|+|||+++|.++++|+||+| |+|++.++ .+.+. .
T Consensus 84 ---~~~~D---~a~Llt~~~~~~~~~G~A~~g--------~iC~~~~~-~~v~~------------------~------- 123 (194)
T cd04269 84 ---RKPHD---NAQLLTGRDFDGNTVGLAYVG--------GMCSPKYS-GGVVQ------------------D------- 123 (194)
T ss_pred ---ccCCC---ceEEEEeeecCCCceeeeecC--------CccCCCcc-eEEEE------------------e-------
Confidence 24677 999999999988999999999 99997552 22221 0
Q ss_pred ccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHh
Q psy14093 335 SRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFT 414 (857)
Q Consensus 335 ~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~ 414 (857)
.+.. ....+.|+||||||+|||.||++.+.|+. ++|||++.... ++.+||+ ||+++|.
T Consensus 124 ----~~~~--~~~~a~~~AHElGH~lG~~HD~~~c~C~~-----~~cIM~~~~~~----~~~~fS~-------CS~~~~~ 181 (194)
T cd04269 124 ----HSRN--LLLFAVTMAHELGHNLGMEHDDGGCTCGR-----STCIMAPSPSS----LTDAFSN-------CSYEDYQ 181 (194)
T ss_pred ----CCcc--hHHHHHHHHHHHHhhcCCCcCCCCCCCCC-----CCeEecCCCCC----CCCCCCh-------hhHHHHH
Confidence 0100 12468999999999999999998777764 57999998653 6899999 9999998
Q ss_pred c-------ccccc
Q psy14093 415 H-------QSFWS 420 (857)
Q Consensus 415 ~-------~Cl~~ 420 (857)
+ +||.+
T Consensus 182 ~~l~~~~~~CL~~ 194 (194)
T cd04269 182 KFLSRGGGQCLLN 194 (194)
T ss_pred HHHhcCCCCccCC
Confidence 7 57763
No 8
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=99.97 E-value=3.7e-31 Score=272.84 Aligned_cols=191 Identities=27% Similarity=0.464 Sum_probs=136.3
Q ss_pred ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093 95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY 174 (857)
Q Consensus 95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~ 174 (857)
||+||+||+|+.+|++|++ +++.+++|+++++|+|+.+|++. +|.|.|..+
T Consensus 1 kyiEl~vvvD~~~~~~~~~-n~~~~~~~~~~i~n~v~~~y~~l------------~i~v~l~~l---------------- 51 (199)
T PF01421_consen 1 KYIELLVVVDNSMYQYHGS-NVTKVIQYVLTIVNIVDSIYQQL------------NIRVVLVGL---------------- 51 (199)
T ss_dssp EEEEEEEEE-HHHHHHTTT-GHHHHHHHHHHHHHHHHHHHGGG------------TEEEEEEEE----------------
T ss_pred CeEEEEEEEeHHHHHHhcc-cHHHHHHHHHHHHHHHhhhcccC------------CeEEEEEEE----------------
Confidence 6899999999999999985 78889999999999999999983 355555554
Q ss_pred cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093 175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV 254 (857)
Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~ 254 (857)
+||+++|+ |.+..++..+|. .|.+|+...+.+
T Consensus 52 ----eiw~~~d~------------------------------------i~~~~~~~~~L~-------~F~~w~~~~~~~- 83 (199)
T PF01421_consen 52 ----EIWTEEDK------------------------------------INISNDADSTLE-------NFCNWQKSELPP- 83 (199)
T ss_dssp ----EEESSSTS------------------------------------S---SSHHHHHH-------HHHHHHHHTHHH-
T ss_pred ----EEcccCCc------------------------------------eeeecchHHHHH-------HHHHHHHhhccc-
Confidence 44555553 444566777776 699999987663
Q ss_pred cccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCcc
Q psy14093 255 FSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSS 334 (857)
Q Consensus 255 ~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~ 334 (857)
..+|| +|+|||++++..+++|+||+| |||++.++ +++. ++
T Consensus 84 ---~~~~D---~a~Llt~~~~~~~~~G~a~~~--------~~C~~~~s--~~i~---------~~--------------- 123 (199)
T PF01421_consen 84 ---RIHHD---HAHLLTGKDFPDSTVGLAYVG--------GMCSPSRS--CGIV---------ED--------------- 123 (199)
T ss_dssp ---HS--S---EEEEEESS-CSCCBSEEE-TT---------TTSTTTS--EEEE---------E----------------
T ss_pred ---ccccc---eeEEeeecccccceeeeEeCC--------CCCCcCCC--CcEe---------ee---------------
Confidence 34678 999999999987899999999 99999763 2332 10
Q ss_pred ccccCCccccccchhhHHhhhhhhhCCCCCCCC-CCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHH
Q psy14093 335 SRNHYGQRVITREADLVTAHEFGHNWGSEHDPD-MPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLF 413 (857)
Q Consensus 335 ~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~-~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i 413 (857)
.+ ......|.++||||||+|||.||+. .+.|.+ ..+|||++.... .....||+ ||++++
T Consensus 124 ----~~--~~~~~~a~~~AHelGH~lGm~HD~~~~C~C~~----~~~cIM~~~~~~---~~~~~fS~-------CS~~~~ 183 (199)
T PF01421_consen 124 ----HS--RSGLSFAVIIAHELGHNLGMPHDGDNGCKCPR----RDGCIMSPSIGP---SSSDKFSN-------CSRRQF 183 (199)
T ss_dssp ----SS--SSHHHHHHHHHHHHHHHTT---TTTCTHCSSS----SSSSTTSSSSSS---SSTSSS-H-------HHHHHH
T ss_pred ----cc--chhHHHHHHHHHHHHHhcCCCCCCCCcCCcCC----CCCccccccccC---CCCCCcCH-------HHHHHH
Confidence 00 0112578999999999999999998 677853 378999996542 34569999 999999
Q ss_pred hc-------cccccCC
Q psy14093 414 TH-------QSFWSRG 422 (857)
Q Consensus 414 ~~-------~Cl~~~~ 422 (857)
.+ .||.+.|
T Consensus 184 ~~~l~~~~~~CL~n~P 199 (199)
T PF01421_consen 184 EEFLSSGKSSCLLNKP 199 (199)
T ss_dssp HHHHHHHTTGGGSS--
T ss_pred HHHHccCCCCccCCCC
Confidence 98 6999876
No 9
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=99.96 E-value=1.9e-28 Score=256.69 Aligned_cols=189 Identities=20% Similarity=0.262 Sum_probs=134.5
Q ss_pred eeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhcccccccc
Q psy14093 96 RCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN 175 (857)
Q Consensus 96 ~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~~ 175 (857)
|+||+||+|+.+|++++ +.++++.|++.+||.||.||+++. .++|.|.|+.|
T Consensus 2 yvEl~iV~D~~~~~~~~--~~~~~~~yv~~~~n~vn~~Y~~~~---------~p~I~i~lv~i----------------- 53 (220)
T cd04272 2 YPELFVVVDYDHQSEFF--SNEQLIRYLAVMVNAANLRYRDLK---------SPRIRLLLVGI----------------- 53 (220)
T ss_pred cceEEEEecHHHHHhcC--CHHHHHHHHHHHHHHHHHHhhhcC---------CCcEEEEEEEE-----------------
Confidence 79999999999999954 578899999999999999999853 24677777776
Q ss_pred ccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhcccc
Q psy14093 176 MIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEVF 255 (857)
Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~~ 255 (857)
+||+..+++.+. .+.+. -..++.++|. .|.+|+.+.+.
T Consensus 54 ---~i~~~~~~~~~~------------~~~~~-----------------~~~~~~~tL~-------~F~~~~~~~~~--- 91 (220)
T cd04272 54 ---TISKDPDFEPYI------------HPINY-----------------GYIDAAETLE-------NFNEYVKKKRD--- 91 (220)
T ss_pred ---EeccCcccceee------------ccCCc-----------------ccccHHHHHH-------HHHHHHhccCC---
Confidence 446655543220 00000 0123445665 68999886543
Q ss_pred ccccCCCccceEEEeecccc--------CCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceee
Q psy14093 256 SREYSHKDFCLAHLFTDLKF--------EGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLY 327 (857)
Q Consensus 256 s~~~~~D~~cla~L~T~~df--------~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (857)
..++| +|+|||++|| ..+++|+||+| |||++.. .+.+.
T Consensus 92 --~~~~D---~~~LlT~~~~~~~~~g~~~~~~~G~A~~g--------~~C~~~~--~~~~~------------------- 137 (220)
T cd04272 92 --YFNPD---VVFLVTGLDMSTYSGGSLQTGTGGYAYVG--------GACTENR--VAMGE------------------- 137 (220)
T ss_pred --CCccc---EEEEEeccceeeccCcccccCccceEeec--------CccCCCc--eeEee-------------------
Confidence 23667 9999999998 35789999999 9999542 22221
Q ss_pred cccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCC-----------CCCCCCCCCCceEEEeeccCCCCCCCC
Q psy14093 328 LNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMP-----------ECSPSASQGGSYLMYTYSVSGYDVNNK 396 (857)
Q Consensus 328 ~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~-----------~C~~~~~~~g~~IM~~~~~~g~~~n~~ 396 (857)
+.+. +...+.|+||||||+|||+||++.. .|++ .++|||++... ..+++
T Consensus 138 -d~~~------------~~~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~----~~~~IM~~~~~---~~~~~ 197 (220)
T cd04272 138 -DTPG------------SYYGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPW----DDGYIMSYVVN---GERQY 197 (220)
T ss_pred -cCCC------------CcccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCC----CCCeEEeeccC---CCCCC
Confidence 1110 0134899999999999999998653 2543 35699998643 35788
Q ss_pred CCCCCCCCCCcccHHHHhc
Q psy14093 397 TFSSHVDGTNFCLAHLFTH 415 (857)
Q Consensus 397 ~FS~~~~~~~~CS~~~i~~ 415 (857)
+||+ ||+++|..
T Consensus 198 ~fS~-------CS~~~~~~ 209 (220)
T cd04272 198 RFSQ-------CSQRQIRN 209 (220)
T ss_pred ccCc-------chHHHHHH
Confidence 9999 99999987
No 10
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=99.96 E-value=1.5e-28 Score=258.43 Aligned_cols=200 Identities=18% Similarity=0.294 Sum_probs=136.7
Q ss_pred eeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhcccccccc
Q psy14093 96 RCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN 175 (857)
Q Consensus 96 ~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~~ 175 (857)
.+-+.||||+.||++|+ +.++++.||+++||+||.||+++ .+|+|.|++|
T Consensus 2 ~~~~~vvaD~~~~~~~~--~~~~~~~~v~~ivN~vn~~Y~~~-----------~nI~v~Lvgl----------------- 51 (228)
T cd04271 2 VALIGVAADCSYTKSFG--SVEEARRNILNNVNSASQLYESS-----------FNISLGLRNL----------------- 51 (228)
T ss_pred eEEEEEEechHHHHHcC--CHHHHHHHHHHHHHHHHHHhhCc-----------ceEEEEEEEE-----------------
Confidence 46688999999999997 36889999999999999999963 2466776665
Q ss_pred ccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhcccc
Q psy14093 176 MIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEVF 255 (857)
Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~~ 255 (857)
++|++++++++ +....|+.. +.+..++..+|. .|.+|+.+.+
T Consensus 52 ---ei~~~~d~~~~---------~~~~~~~~~---------------~~~~~~~~~tL~-------~F~~Wr~~~~---- 93 (228)
T cd04271 52 ---TISDASCPSTA---------VDSAPWNLP---------------CNSRIDIDDRLS-------IFSQWRGQQP---- 93 (228)
T ss_pred ---EEecCCCCCcc---------ccccccccc---------------ccccCCHHHHHH-------HHHHHHhcCC----
Confidence 55777776543 122234321 112345666776 7999987532
Q ss_pred ccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccc
Q psy14093 256 SREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSS 335 (857)
Q Consensus 256 s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~ 335 (857)
.+| ..+++|+|+.. .++++||||+| |||++.++ .+ +. ++ +.+.++..
T Consensus 94 ----~~d-~a~~~l~t~~~-~~~~lGlA~vg--------~mC~~~~s-~~-~~---------~~-------~~~~~~~~- 140 (228)
T cd04271 94 ----DDG-NAFWTLMTACP-SGSEVGVAWLG--------QLCRTGAS-DQ-GN---------ET-------VAGTNVVV- 140 (228)
T ss_pred ----CCC-cEEeEeecccc-CCCceeEeecc--------CCCCCCcC-CC-cc---------cc-------ccccceee-
Confidence 233 12449999986 57999999999 99999763 22 21 11 11111211
Q ss_pred cccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCC-----------CC---CCCCceEEEeeccCCCCCCCCCCCCC
Q psy14093 336 RNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSP-----------SA---SQGGSYLMYTYSVSGYDVNNKTFSSH 401 (857)
Q Consensus 336 ~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~-----------~~---~~~g~~IM~~~~~~g~~~n~~~FS~~ 401 (857)
.+...+.||||||||||||.||++...|.+ +. .+.++|||++.+. . ++++||+
T Consensus 141 --------~~~~~~~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~---~-~~~~FS~- 207 (228)
T cd04271 141 --------RTSNEWQVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSS---S-GITEFSP- 207 (228)
T ss_pred --------cccccceehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcC---C-CCCccCh-
Confidence 111235799999999999999987655632 11 1247899999743 2 7899999
Q ss_pred CCCCCcccHHHHhc
Q psy14093 402 VDGTNFCLAHLFTH 415 (857)
Q Consensus 402 ~~~~~~CS~~~i~~ 415 (857)
||+++|++
T Consensus 208 ------CS~~~~~~ 215 (228)
T cd04271 208 ------CTIGNICS 215 (228)
T ss_pred ------hhHHHHHH
Confidence 99999986
No 11
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.95 E-value=8.6e-28 Score=246.23 Aligned_cols=185 Identities=30% Similarity=0.487 Sum_probs=134.4
Q ss_pred ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093 95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY 174 (857)
Q Consensus 95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~ 174 (857)
+||||+||+|+.+|++|++ +.+.++.||+.+||.||.+|++++++ .+|+|.|..|+
T Consensus 1 ~~vE~~vv~D~~~~~~~~~-~~~~~~~~i~~~in~vn~~y~~~~~~--------~~i~~~l~~i~--------------- 56 (192)
T cd04267 1 REIELVVVADHRMVSYFNS-DENILQAYITELINIANSIYRSTNLR--------LGIRISLEGLQ--------------- 56 (192)
T ss_pred CEEEEEEEEcHHHHHHhcC-CHHHHHHHHHHHHHHHHHHHhcCccC--------CCeEEEEEEEE---------------
Confidence 5899999999999999974 67889999999999999999996521 45666666653
Q ss_pred cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093 175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV 254 (857)
Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~ 254 (857)
+|+..+.+.. +..++..+|. .|..|+... .
T Consensus 57 -----i~~~~~~~~~-----------------------------------~~~~~~~~l~-------~f~~~~~~~-~-- 86 (192)
T cd04267 57 -----ILKGEQFAPP-----------------------------------IDSDASNTLN-------SFSFWRAEG-P-- 86 (192)
T ss_pred -----EecCCCccCc-----------------------------------cCccHHHHHH-------HHHHHHhcC-C--
Confidence 3443332110 0123334444 588887754 1
Q ss_pred cccccCCCccceEEEeecccc-CCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCc
Q psy14093 255 FSREYSHKDFCLAHLFTDLKF-EGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLS 333 (857)
Q Consensus 255 ~s~~~~~D~~cla~L~T~~df-~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~ 333 (857)
.++| +++|||+++| ..+++|+||++ ++|++.++ .+.+. ..
T Consensus 87 ----~~~d---~~~l~t~~~~~~~~~~G~a~~~--------~~C~~~~~-~~v~~--------------------~~--- 127 (192)
T cd04267 87 ----IRHD---NAVLLTAQDFIEGDILGLAYVG--------SMCNPYSS-VGVVE--------------------DT--- 127 (192)
T ss_pred ----CCCc---eEEEEeeeccCCCCeeeeeccc--------CCCCCCCC-eEEEe--------------------cC---
Confidence 3678 9999999998 78999999999 99998652 22222 11
Q ss_pred cccccCCccccccchhhHHhhhhhhhCCCCCCCCCC---CCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccH
Q psy14093 334 SSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMP---ECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLA 410 (857)
Q Consensus 334 ~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~---~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~ 410 (857)
+ + +...+.|+||||||+|||.||+..+ .|.+ +++|||++.... .++.+||+ ||+
T Consensus 128 ---~-~-----~~~~~~~~aHElGH~lG~~HD~~~~~~~~c~~----~~~~IM~~~~~~---~~~~~FS~-------CS~ 184 (192)
T cd04267 128 ---G-F-----TLLTALTMAHELGHNLGAEHDGGDELAFECDG----GGNYIMAPVDSG---LNSYRFSQ-------CSI 184 (192)
T ss_pred ---C-c-----ceeehhhhhhhHHhhcCCcCCCCCCccccCCC----CCCeEEcccccC---CCCCccCh-------hhH
Confidence 1 0 1246889999999999999998643 3332 467999997542 57899999 999
Q ss_pred HHHhc
Q psy14093 411 HLFTH 415 (857)
Q Consensus 411 ~~i~~ 415 (857)
.+|.+
T Consensus 185 ~~i~~ 189 (192)
T cd04267 185 GSIRE 189 (192)
T ss_pred HHHHH
Confidence 99874
No 12
>KOG3538|consensus
Probab=99.93 E-value=2.1e-25 Score=274.03 Aligned_cols=343 Identities=24% Similarity=0.359 Sum_probs=224.1
Q ss_pred EEEEEcCCeEEEEEeCCCCCCCCCC-ccEEEEEecccccccccCCCCCCCCCCCCCCcCCCCccccCcCCCCCCCccccc
Q psy14093 2 TASIHTPEETYHIEPSWRHLPHLGN-QSMVAYRASDVKLSWDHAHDKPASPCGYVKEEFNTTDFDLDMEEDDPDMPHTRS 80 (857)
Q Consensus 2 ~G~I~~~~~~Y~IEPl~~~~~~~~~-~~HivYr~sd~~~~~~~~~~~~~~~Cg~~~~~~~~~~~~~~~~~~~~~~~~~r~ 80 (857)
.|+|+++++.|+|||+.+....... ..|++||..++..- ...+..+.. ..... .........+.|.
T Consensus 107 ~g~~~~~~~~~~iepl~~~~~~~~~~~~h~~~~r~~~~~~-----~~~~~~~~~--~~~~~------~~~~~~~~~~~~~ 173 (845)
T KOG3538|consen 107 HGSIQLQGHDYFIEPLPRGPAAPVSGRAHLVYRRLDVELA-----SEVLDGTRD--QKCDP------SPGDGAGRGRCRG 173 (845)
T ss_pred ccceeccCcceeEeeccCCCcccccCCCcccccchhhccc-----ccCcccccc--ccccc------CChhhcccccccc
Confidence 4899999999999999876544333 58999999333210 000111110 00000 0000011111222
Q ss_pred ccccCCCCCCCCCcceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEE
Q psy14093 81 KRQVDPAYEYKPTKTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVV 160 (857)
Q Consensus 81 kR~~~~~~~~~~~~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v 160 (857)
||.+. ..+ +|.+||+|..+|.+|+. .+..|++++||+|..||++.+.. ..|++.|+.+++
T Consensus 174 ~r~~~-------~~~-vetlvvaD~~~~~~~~~----~~~~~~lt~~n~v~~l~~~~sl~--------~~i~i~vvr~~~ 233 (845)
T KOG3538|consen 174 RRSVS-------RHR-VETLVVADKCMVAFHGE----SVERYVLTLMNIVASLYKDPSLG--------NHINIVVVRLII 233 (845)
T ss_pred ccccc-------cce-eeEEEEecHHHHHhcch----hhHHHHHHHHHHHHHHhcCCCCC--------CceEEEEEEEEE
Confidence 34443 233 99999999999999982 37899999999999999997643 567777776644
Q ss_pred eccchhhccccccccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhcccccccc
Q psy14093 161 HSDATRVRQGEAHYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN 240 (857)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~ 240 (857)
.... ++ ...|..++..+|.
T Consensus 234 l~~~---------------------------------------------~~----------~~~i~~~~~~sl~------ 252 (845)
T KOG3538|consen 234 LLKE---------------------------------------------PS----------ALNIEGNAGKSLD------ 252 (845)
T ss_pred EecC---------------------------------------------CC----------CceeccCcccchh------
Confidence 2110 01 1334455554554
Q ss_pred ccccccchhhhccccccccCCCccceEEEeeccccC------CCceeeeeccCCCCCCcccccCCCccccceEEeccCCC
Q psy14093 241 MIRDKWDVRNLLEVFSREYSHKDFCLAHLFTDLKFE------GGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGL 314 (857)
Q Consensus 241 ~~f~~w~~~~ll~~~s~~~~~D~~cla~L~T~~df~------~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~ 314 (857)
.|++|+....... . ..++| +|+|||+.||. .+++|+|.++ |||.+.++.. .
T Consensus 253 -~fC~wq~~~n~~~-d-~~~~D---~ailltr~d~~~~~~~~~~~~G~A~v~--------gmC~~~~sC~-i-------- 309 (845)
T KOG3538|consen 253 -SFCKWQTSLNSEE-D-TEHHD---HAILLTRQDFCEQQNGPCDTLGVAPVG--------GMCSPSRSCS-I-------- 309 (845)
T ss_pred -hhcccccccCCCc-c-ccccc---eeEEeeeeecccccCCCCccccceeec--------ceeCCcccee-e--------
Confidence 7999998654431 1 13567 99999998875 4789999999 9999988642 1
Q ss_pred CccccccccceeecccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCC
Q psy14093 315 SSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVN 394 (857)
Q Consensus 315 ~~~~~~~~~~~~~~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n 394 (857)
..++|| .+++|+|||+||+|||.||+....|.+. ..||++....+. .
T Consensus 310 ------------~e~~gl--------------~~aftiahE~GH~~gm~hd~~~~~C~~~-----~~im~~~~~~~~--~ 356 (845)
T KOG3538|consen 310 ------------NEDTGL--------------PSAFTIAHELGHNFGMEHDGRGNSCGPT-----FHVMSSDTCGGN--H 356 (845)
T ss_pred ------------ecCCCC--------------ccceeeeeccccccCcccCCcCCCCCCc-----ccccCcccccCC--C
Confidence 123333 2589999999999999999987779864 289999876653 4
Q ss_pred CCCCCCCCCCCCcccHHHHhc-------cccccCCCcccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcc
Q psy14093 395 NKTFSSHVDGTNFCLAHLFTH-------QSFWSRGSSILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSEL 467 (857)
Q Consensus 395 ~~~FS~~~~~~~~CS~~~i~~-------~Cl~~~~~~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l 467 (857)
...||+ ||+..+.. .||.+.+. ...+.-+. .-+|..+ +++.+|++
T Consensus 357 ~~~WS~-------CS~~~~t~fl~~~~~~Cl~d~p~---~~~~~~p~----~lPg~~y--------------~~~~qC~~ 408 (845)
T KOG3538|consen 357 ELTWSE-------CSREYLTRFLDDGEGNCLSDQPF---SALPKLPS----QLPGQTY--------------DADEQCEF 408 (845)
T ss_pred CccCCc-------chHHHHHHHhccCccceeecCCc---cccCCCCc----cCCcccc--------------CCCCcccc
Confidence 555999 99999998 59998774 22222222 2234322 23346777
Q ss_pred cCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCCC--CCCCCC-CCCCCCcCCC-CceecCCccch
Q psy14093 468 FPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSSS--VCPPSA-PMSDNTGCLE-RGKCRGGKCIP 539 (857)
Q Consensus 468 ~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s~--~Cp~~~-~~~dgt~C~~-~~~C~~G~C~~ 539 (857)
..|.. =..|.......+|+.. ||.+... .|.+.. |+.|||.|+. .++|.+|.|++
T Consensus 409 ~fg~~--------~~~c~~~~~~~~c~~l---------WC~~~~g~~~C~t~~~p~~~gt~Cg~~~~~c~~g~c~~ 467 (845)
T KOG3538|consen 409 QFGPK--------SKECPDMRAPDPCREL---------WCHTDGGSSSCSTKHLPAADGTECGEFGKWCRKGRCTP 467 (845)
T ss_pred ccCCc--------ceeccccCccchhhhh---------cccCCCCCCccccCcCCcccCCCcCCcCCcccCCcccC
Confidence 76542 2235544433567653 7765443 698854 9999999997 99999999996
No 13
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=99.91 E-value=7.9e-25 Score=224.82 Aligned_cols=183 Identities=40% Similarity=0.671 Sum_probs=98.1
Q ss_pred cceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHH-HhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccc
Q psy14093 94 KTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRV-HKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEA 172 (857)
Q Consensus 94 ~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V-~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~ 172 (857)
+++|+|+||+|++|++.|++ +. +++.||+.+|+.| +.+|++ .+|+|+|..|.|++.+
T Consensus 2 ~~t~~l~v~~D~~f~~~~~~-~~-~~~~~i~~~~~~v~n~~~~~------------~~I~~~i~~i~i~~~~-------- 59 (196)
T PF13688_consen 2 KTTCELLVVADYEFYQAFGS-DQ-AVISYIASLFNQVSNLYYRT------------SNIQFQISGITIWTSS-------- 59 (196)
T ss_dssp SEEEEEEEEE-HHHHHHTST--H-HHHHHHHHHHHHHHHHHHT----------------EEEEEEEEE--S---------
T ss_pred ceEEEEEEEEcHHHHHHhCC-cH-HHHHHHHHHHHHHHHHHhhc------------cCceEEEEEEEEecCC--------
Confidence 56899999999999999986 33 7999999999999 777776 2688999988664322
Q ss_pred cccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhc
Q psy14093 173 HYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLL 252 (857)
Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll 252 (857)
++.... . | -..+....+|
T Consensus 60 ------------~~~~~~-----------~-~--------------------------------------~~~~~~~~~L 77 (196)
T PF13688_consen 60 ------------DPYTNN-----------V-S--------------------------------------NSNIDAEDTL 77 (196)
T ss_dssp ------------B---TT-------------T--------------------------------------SSS--HHHHH
T ss_pred ------------CCCccc-----------c-c--------------------------------------ccccCHHHHH
Confidence 211000 0 0 0112222233
Q ss_pred ccc----ccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeec
Q psy14093 253 EVF----SREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYL 328 (857)
Q Consensus 253 ~~~----s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (857)
+.| +.....+.+|+++|||++||.++ +|+||+|++..+..+++|+.... . +
T Consensus 78 ~~f~~~~~~~~~~~~~dl~~l~t~~d~~~g-lGlA~~g~~~~~~~~~~~~~~~~----~-------------------~- 132 (196)
T PF13688_consen 78 QDFNNDFSSWRDSSYYDLAHLFTGRDFSGG-LGLAYVGSSCSNTSGGICSSSVS----G-------------------S- 132 (196)
T ss_dssp HHH--HHHTHHHHTTSSEEEEEE----GGG-EEEESSSSSSTT-SSSSS--E-E----E-------------------E-
T ss_pred HHHhhhhhhccCCCCCCEEEEEecccCCCC-ceeEeccccCcCCccceeccCcc----c-------------------c-
Confidence 333 22122234469999999999877 99999993333333333332110 0 0
Q ss_pred ccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCC----CCCCC---CCCCCceEEEeeccCCCCCCCCCCC
Q psy14093 329 NSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMP----ECSPS---ASQGGSYLMYTYSVSGYDVNNKTFS 399 (857)
Q Consensus 329 n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~----~C~~~---~~~~g~~IM~~~~~~g~~~n~~~FS 399 (857)
++..+++...++|+||||||+||+.||++.. .|++. ..++++|||++... +|+.+||
T Consensus 133 ----------~~~~~~~~~~~~~~AHEiGH~lGa~HD~~~~~~~~~c~~~~~~~~~~~~~IM~y~~~----~n~~~FS 196 (196)
T PF13688_consen 133 ----------YGQVPPTYNGAITFAHEIGHNLGAPHDGDYSSTSSSCPPSSSTCPAGGGYIMSYCAS----PNNNKFS 196 (196)
T ss_dssp ----------TTEE--HHHHHHHHHHHHHHHTT-----SSS---TTTS-----GGGT---TTSSS------GGGGS--
T ss_pred ----------cceECCCCceehhhHHhHHHhcCCCCCCCCCCCCCCCCCCccccCCCCCeEEcccCC----CCcccCC
Confidence 2222334567999999999999999999753 67663 23468999999865 6899998
No 14
>smart00050 DISIN Homologues of snake disintegrins. Snake disintegrins inhibit the binding of ligands to integrin receptors. They contain a 'RGD' sequence, identical to the recognition site of many adhesion proteins. Molecules containing both disintegrin and metalloprotease domains are known as ADAMs.
Probab=99.80 E-value=4.8e-20 Score=159.45 Aligned_cols=75 Identities=29% Similarity=0.656 Sum_probs=69.6
Q ss_pred CCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcccCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCC
Q psy14093 432 ASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSELFPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSS 511 (857)
Q Consensus 432 e~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s 511 (857)
|+| ||||||+.. .+.|+||++. +|+|++||+|+ +|+||.+|+|+++|++||+++ ++||++|||+|.|
T Consensus 1 E~g--EeCDCG~~~-------~~~d~CC~~~-tC~l~~ga~Cs--~G~CC~~C~~~~~g~~CR~~~-~~Cd~~e~C~G~s 67 (75)
T smart00050 1 EEG--EECDCGSPK-------ECTNPCCDPA-TCKLKPGAQCA--SGPCCDNCKFKPAGTLCRPSV-DECDLPEYCNGTS 67 (75)
T ss_pred CcC--cccCCCChh-------hcCCCCcCCC-cceecCCCeec--CCCCCCcceEccCCCEecCcc-CCCCCccccCCCC
Confidence 678 999999986 4799999997 89999999999 499999999999999999997 6999999999999
Q ss_pred CCCCCCCC
Q psy14093 512 SVCPPSAP 519 (857)
Q Consensus 512 ~~Cp~~~~ 519 (857)
++||+++|
T Consensus 68 ~~CP~~~~ 75 (75)
T smart00050 68 ADCPPDVY 75 (75)
T ss_pred CcCcCCCC
Confidence 99999865
No 15
>smart00050 DISIN Homologues of snake disintegrins. Snake disintegrins inhibit the binding of ligands to integrin receptors. They contain a 'RGD' sequence, identical to the recognition site of many adhesion proteins. Molecules containing both disintegrin and metalloprotease domains are known as ADAMs.
Probab=99.76 E-value=2.2e-19 Score=155.37 Aligned_cols=71 Identities=34% Similarity=0.754 Sum_probs=65.5
Q ss_pred cccccccCCCCCCCCCCCCCCC-CCccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCC
Q psy14093 717 EGDEECDAGLLGTEDNDSCCDK-VCKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHT 795 (857)
Q Consensus 717 E~gEeCDCG~~~~c~~d~CC~~-tC~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~ 795 (857)
|+|||||||+.++| .|+||++ +|+|++| |+|++|+ || .+|+|+++ |++| |+ +.+
T Consensus 1 E~gEeCDCG~~~~~-~d~CC~~~tC~l~~g-----a~Cs~G~--------CC-~~C~~~~~-----g~~C---R~--~~~ 55 (75)
T smart00050 1 EEGEECDCGSPKEC-TNPCCDPATCKLKPG-----AQCASGP--------CC-DNCKFKPA-----GTLC---RP--SVD 55 (75)
T ss_pred CcCcccCCCChhhc-CCCCcCCCcceecCC-----CeecCCC--------CC-CcceEccC-----CCEe---cC--ccC
Confidence 78999999999886 8999997 7999999 8998888 99 69999999 9999 87 666
Q ss_pred Cc---cccCCCccccccccc
Q psy14093 796 LC---RRCGKSSYHIQKKKC 812 (857)
Q Consensus 796 ~C---e~C~G~s~~Cp~d~~ 812 (857)
+| |||+|+|+.||+|+|
T Consensus 56 ~Cd~~e~C~G~s~~CP~~~~ 75 (75)
T smart00050 56 ECDLPEYCNGTSADCPPDVY 75 (75)
T ss_pred CCCCccccCCCCCcCcCCCC
Confidence 88 999999999999986
No 16
>PF00200 Disintegrin: Disintegrin; InterPro: IPR001762 Disintegrins are a family of small proteins from viper venoms that function as potent inhibitors of both platelet aggregation and integrin-dependent cell adhesion [, ]. Integrin receptors are involved in cell-cell and cell-extracellular matrix interactions, serving as the final common pathway leading to aggregation via formation of platelet-platelet bridges, which are essential in thrombosis and haemostasis. Disintegrins contain an RGD (Arg-Gly-Asp) or KGD (Lys-Gly-Asp) sequence motif that binds specifically to integrin IIb-IIIa receptors on the platelet surface, thereby blocking the binding of fibrinogen to the receptor-glycoprotein complex of activated platelets. Disintegrins act as receptor antagonists, inhibiting aggregation induced by ADP, thrombin, platelet-activating factor and collagen []. The role of disintegrin in preventing blood coagulation renders it of medical interest, particularly with regard to its use as an anti-coagulant []. Disintegrins from different snake species have been characterised: albolabrin, applagin, barbourin, batroxostatin, bitistatin, obtustatin [], schistatin [], echistatin [], elegantin, eristicophin, flavoridin [], halysin, kistrin, tergeminin, salmosin [] and triflavin. Disintegrin-like proteins are found in various species ranging from slime mold to humans. Some other proteins known to contain a disintegrin domain are: Some snake venom zinc metalloproteinases [] consist of an N-terminal catalytic domain fused to a disintegrin domain. Such is the case for trimerelysin I (HR1B), atrolysin-e (Ht-e) and trigramin. It has been suggested that these proteinases are able to cleave themselves from the disintegrin domains and that the latter may arise from such a post-translational processing. The beta-subunit of guinea pig sperm surface protein PH30 []. PH30 is a protein involved in sperm-egg fusion. The beta subunit contains a disintegrin at the N-terminal extremity. Mammalian epididymial apical protein 1 (EAP I) []. EAP I is associated with the sperm membrane and may play a role in sperm maturation. Structurally, EAP I consists of an N-terminal domain, followed by a zinc metalloproteinase domain, a disintegrin domain, and a large C-terminal domain that contains a transmembrane region. ; PDB: 1L3X_A 3C05_D 3K7N_A 3K7L_A 2ECH_A 1RO3_A 2PJF_A 2PJG_A 1Q7J_A 1Q7I_A ....
Probab=99.76 E-value=2.4e-19 Score=156.37 Aligned_cols=76 Identities=34% Similarity=0.670 Sum_probs=67.0
Q ss_pred CCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcccCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCC
Q psy14093 432 ASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSELFPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSS 511 (857)
Q Consensus 432 e~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s 511 (857)
|+| ||||||+... |.|+||++. +|+|+++|+|+ +|+||.+|+|+++|++||+++ ++||++|||+|.|
T Consensus 1 E~g--EeCDCG~~~~-------C~~~CC~~~-~C~l~~ga~C~--~g~CC~~C~~~~~g~~Cr~~~-~~Cd~~e~C~G~s 67 (76)
T PF00200_consen 1 EEG--EECDCGSPEE-------CNDPCCDAT-TCKLKPGAQCS--NGPCCNNCQFKPSGTVCRPAT-NECDLPEYCNGSS 67 (76)
T ss_dssp -TT--TS-STSSTTT-------HTCTTEETT-TTEESTTSSTS--SSTTEETTEESSTTEEEEECS-STTSEEEEECSSS
T ss_pred CcC--ccCCCCChhh-------ccCCCcCcc-ccccccccccc--cCcccCcccccccccccCccc-CcCCCCCcCCCcC
Confidence 678 9999999874 489999986 79999999994 599999999999999999997 7999999999999
Q ss_pred CCCCCCCCC
Q psy14093 512 SVCPPSAPM 520 (857)
Q Consensus 512 ~~Cp~~~~~ 520 (857)
+.||+++||
T Consensus 68 ~~CP~d~~k 76 (76)
T PF00200_consen 68 AECPPDVYK 76 (76)
T ss_dssp SSS-SSTT-
T ss_pred CcCcCCCCC
Confidence 999999885
No 17
>PF00200 Disintegrin: Disintegrin; InterPro: IPR001762 Disintegrins are a family of small proteins from viper venoms that function as potent inhibitors of both platelet aggregation and integrin-dependent cell adhesion [, ]. Integrin receptors are involved in cell-cell and cell-extracellular matrix interactions, serving as the final common pathway leading to aggregation via formation of platelet-platelet bridges, which are essential in thrombosis and haemostasis. Disintegrins contain an RGD (Arg-Gly-Asp) or KGD (Lys-Gly-Asp) sequence motif that binds specifically to integrin IIb-IIIa receptors on the platelet surface, thereby blocking the binding of fibrinogen to the receptor-glycoprotein complex of activated platelets. Disintegrins act as receptor antagonists, inhibiting aggregation induced by ADP, thrombin, platelet-activating factor and collagen []. The role of disintegrin in preventing blood coagulation renders it of medical interest, particularly with regard to its use as an anti-coagulant []. Disintegrins from different snake species have been characterised: albolabrin, applagin, barbourin, batroxostatin, bitistatin, obtustatin [], schistatin [], echistatin [], elegantin, eristicophin, flavoridin [], halysin, kistrin, tergeminin, salmosin [] and triflavin. Disintegrin-like proteins are found in various species ranging from slime mold to humans. Some other proteins known to contain a disintegrin domain are: Some snake venom zinc metalloproteinases [] consist of an N-terminal catalytic domain fused to a disintegrin domain. Such is the case for trimerelysin I (HR1B), atrolysin-e (Ht-e) and trigramin. It has been suggested that these proteinases are able to cleave themselves from the disintegrin domains and that the latter may arise from such a post-translational processing. The beta-subunit of guinea pig sperm surface protein PH30 []. PH30 is a protein involved in sperm-egg fusion. The beta subunit contains a disintegrin at the N-terminal extremity. Mammalian epididymial apical protein 1 (EAP I) []. EAP I is associated with the sperm membrane and may play a role in sperm maturation. Structurally, EAP I consists of an N-terminal domain, followed by a zinc metalloproteinase domain, a disintegrin domain, and a large C-terminal domain that contains a transmembrane region. ; PDB: 1L3X_A 3C05_D 3K7N_A 3K7L_A 2ECH_A 1RO3_A 2PJF_A 2PJG_A 1Q7J_A 1Q7I_A ....
Probab=99.75 E-value=3.5e-19 Score=155.35 Aligned_cols=72 Identities=36% Similarity=0.781 Sum_probs=63.8
Q ss_pred cccccccCCCCCCCCCCCCCCC-CCccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCC
Q psy14093 717 EGDEECDAGLLGTEDNDSCCDK-VCKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHT 795 (857)
Q Consensus 717 E~gEeCDCG~~~~c~~d~CC~~-tC~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~ 795 (857)
|+|||||||++.+| .|+||++ +|+|++| |+|+.|+ || .+|+|+++ |++| |+ +.+
T Consensus 1 E~gEeCDCG~~~~C-~~~CC~~~~C~l~~g-----a~C~~g~--------CC-~~C~~~~~-----g~~C---r~--~~~ 55 (76)
T PF00200_consen 1 EEGEECDCGSPEEC-NDPCCDATTCKLKPG-----AQCSNGP--------CC-NNCQFKPS-----GTVC---RP--ATN 55 (76)
T ss_dssp -TTTS-STSSTTTH-TCTTEETTTTEESTT-----SSTSSST--------TE-ETTEESST-----TEEE---EE--CSS
T ss_pred CcCccCCCCChhhc-cCCCcCccccccccc-----cccccCc--------cc-Cccccccc-----cccc---Cc--ccC
Confidence 78999999999999 8999996 9999999 8998888 99 69999999 9999 88 667
Q ss_pred Cc---cccCCCcccccccccc
Q psy14093 796 LC---RRCGKSSYHIQKKKCA 813 (857)
Q Consensus 796 ~C---e~C~G~s~~Cp~d~~~ 813 (857)
+| |||+|.|+.||+|+|+
T Consensus 56 ~Cd~~e~C~G~s~~CP~d~~k 76 (76)
T PF00200_consen 56 ECDLPEYCNGSSAECPPDVYK 76 (76)
T ss_dssp TTSEEEEECSSSSSS-SSTT-
T ss_pred cCCCCCcCCCcCCcCcCCCCC
Confidence 88 9999999999999985
No 18
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=99.73 E-value=2.9e-19 Score=180.72 Aligned_cols=103 Identities=40% Similarity=0.663 Sum_probs=51.1
Q ss_pred ccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccccc
Q psy14093 258 EYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRN 337 (857)
Q Consensus 258 ~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~~n 337 (857)
...+++||++||| .++..+.+||||+| ++|++.+... |++. +
T Consensus 64 ~~~~~~~Di~~l~--~~~~~g~~Gla~~g--------~~c~~~~~~~--------------------------g~~~--~ 105 (173)
T PF13574_consen 64 WIGSSNYDIGHLF--GTFSGGGGGLAYVG--------GVCSKGYKAT--------------------------GLSS--P 105 (173)
T ss_dssp ----TCSS-CCEE--EEESSGCCSSEEE---------CCCCTCCCE---------------------------CCCS---
T ss_pred cCCCCCCCEEEEe--eccCCCCccEEEEe--------EeCcCCCEEE--------------------------EEEc--C
Confidence 4456678899999 44467899999999 9999865321 1111 1
Q ss_pred cCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCC---CCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHh
Q psy14093 338 HYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSP---SASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFT 414 (857)
Q Consensus 338 ~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~---~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~ 414 (857)
+|.. ..+.|+||||||+||++||.+. .|.+ .....+.|||++. + .+++.+||+ ||+++|+
T Consensus 106 -~~~~----~~~~~~aHElGH~lGa~Hd~~~-~~~~~~~~~~~sg~~IMsy~---~-~~~~~~fS~-------cS~~~i~ 168 (173)
T PF13574_consen 106 -YGDF----FGIDTFAHELGHQLGAPHDFDN-GCNPGSPTCPGSGGYIMSYA---G-ICNNDKFSP-------CSIRQIR 168 (173)
T ss_dssp --TTS----HHHHHHHHHHHHHHT---SSSS-----SS--T-STTT-TT-SS-----GGGG-------------------
T ss_pred -CCce----eeeeeehhhhHhhcCCCCCCCC-CCCCCCCcCCCCCCEEEeeC---C-CCCCccccc-------ccccccc
Confidence 2221 2578999999999999999975 3222 2234589999976 2 478999999 9999997
Q ss_pred c
Q psy14093 415 H 415 (857)
Q Consensus 415 ~ 415 (857)
.
T Consensus 169 ~ 169 (173)
T PF13574_consen 169 A 169 (173)
T ss_dssp -
T ss_pred c
Confidence 5
No 19
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=99.67 E-value=6.7e-16 Score=160.49 Aligned_cols=42 Identities=24% Similarity=0.423 Sum_probs=37.5
Q ss_pred cceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhcc
Q psy14093 94 KTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYN 135 (857)
Q Consensus 94 ~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~ 135 (857)
.+.++|+|++|+.+++.|++.+++.+..+|..+++.+|.+|.
T Consensus 2 ~~~~~l~v~~d~~~~~~~~~~s~~~v~~~i~~~v~~~N~~Y~ 43 (206)
T PF13583_consen 2 LRVYRLAVVYDYSFYSWFFGGSIEDVKAFIASMVNFANEAYN 43 (206)
T ss_pred ceEEEEEEEECHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHh
Confidence 467999999999999999434688899999999999999994
No 20
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=99.65 E-value=2e-16 Score=168.47 Aligned_cols=145 Identities=35% Similarity=0.566 Sum_probs=103.6
Q ss_pred CcceeeeeEEEcCCCCccCCCccccCcccccccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCCCCC
Q psy14093 558 GMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASPRPY 637 (857)
Q Consensus 558 g~C~pi~~~~il~~GT~C~~G~c~~n~c~r~W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~~~~ 637 (857)
++-+.+.++.|..++.+-....-.++..-..|+...+++.+.....||.+|+|||||+++| .++++||||+|+++..
T Consensus 52 ~i~i~~~~i~I~~~~~~~~~~~~~~~~~f~~w~~~~~~~~~~~~~~~d~~c~A~LlT~~df---~~~tlGlA~vg~~~~~ 128 (244)
T cd04270 52 GIGFQIKRIRIHTTPDEVDPGNKFYNKSFPNWGVEKFLVKLLLEQFSDDVCLAHLFTYRDF---DMGTLGLAYVGSPRDN 128 (244)
T ss_pred ceEEEEEEEEEEcCcccccCCCcccccCccchhHHHHhhhhhhhcCCccceEEEEEecccc---CCCceeeeeecccccC
Confidence 4556667777777665432222222234567888777754332223555799999999999 5789999999998655
Q ss_pred cccceeccccc--------------------------cccchhhheeeeeeecccccccCC--CC--------CCcccC-
Q psy14093 638 SIGGSIHENLK--------------------------YFDTLHSSHISHTIVKRGVQESNH--PF--------NKIKEV- 680 (857)
Q Consensus 638 ~s~Gv~~~~~~--------------------------~~~~~~aheigh~l~~~g~~~d~~--~~--------gcim~~- 680 (857)
.++|||.+... .++.+|||||||+| ||.||.. .+ +|||+.
T Consensus 129 ~~gGIC~~~~s~~~~~~~~~n~gl~t~~~~~~~~~~~~~a~t~AHElGHnl---Gm~HD~~~~~C~~~~~~~~~cIM~~~ 205 (244)
T cd04270 129 SAGGICEKAYYYSNGKKKYLNTGLTTTVNYGKRVPTKESDLVTAHELGHNF---GSPHDPDIAECAPGESQGGNYIMYAR 205 (244)
T ss_pred CCCcccCCcccccCCcceeeecceEeeeccCCccchhHHHHHHHHHHHHhc---CCCCCCCCCCCCCCCCCCCceEecCc
Confidence 67888754221 13468999999999 9999976 22 489962
Q ss_pred ----------CCCCCCCHHHHHHHhhcCCCCcCCCCCcc
Q psy14093 681 ----------TFKTLGNLRSIRKVLEAKSGKCFSEPEES 709 (857)
Q Consensus 681 ----------~fss~CS~~~~~~~l~~~~~~Cl~~~~~~ 709 (857)
.| |.||+++|.++|..+.+.||.+...+
T Consensus 206 ~~~~~~~~~~~F-S~CS~~~~~~~L~~~~~~CL~~~~~~ 243 (244)
T cd04270 206 ATSGDKENNKKF-SPCSKKSISKVLEVKSNSCFVERSQS 243 (244)
T ss_pred cccCCCCCCCCC-CHhHHHHHHHHHccCCCCCcCCCCCC
Confidence 36 47999999999999999999987654
No 21
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.60 E-value=5.6e-15 Score=147.48 Aligned_cols=106 Identities=29% Similarity=0.359 Sum_probs=74.1
Q ss_pred CCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccccccC
Q psy14093 260 SHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHY 339 (857)
Q Consensus 260 ~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~~n~~ 339 (857)
.+| +++++++.++.++.+|+||+| ++|.+... .++. . ... .
T Consensus 51 ~~d---i~~~~~~~~~~~~~~g~a~~~--------~~c~~~~~--~~~~--------------------~-----~~~-~ 91 (167)
T cd00203 51 KAD---IAILVTRQDFDGGTGGWAYLG--------RVCDSLRG--VGVL--------------------Q-----DNQ-S 91 (167)
T ss_pred cCc---EEEEeccccCCCCceEEEecC--------CccCCCCC--cEEE--------------------e-----cCC-c
Confidence 457 999999999888899999999 89976541 1211 0 000 0
Q ss_pred CccccccchhhHHhhhhhhhCCCCCCCCCCCCCC---------CCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccH
Q psy14093 340 GQRVITREADLVTAHEFGHNWGSEHDPDMPECSP---------SASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLA 410 (857)
Q Consensus 340 G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~---------~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~ 410 (857)
+ ....+.|+||||||+||+.||.+.+.+.. .......+||++....+..+...+||+ ||+
T Consensus 92 ~----~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~-------~d~ 160 (167)
T cd00203 92 G----TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQ-------CDI 160 (167)
T ss_pred c----cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCH-------HHH
Confidence 0 12457899999999999999987543220 011246899999876543357899999 999
Q ss_pred HHHhc
Q psy14093 411 HLFTH 415 (857)
Q Consensus 411 ~~i~~ 415 (857)
++|.+
T Consensus 161 ~~i~~ 165 (167)
T cd00203 161 DQINK 165 (167)
T ss_pred HHHHh
Confidence 98864
No 22
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=99.30 E-value=2e-12 Score=134.50 Aligned_cols=94 Identities=18% Similarity=0.214 Sum_probs=77.6
Q ss_pred CCCcceEEEEEeeeecc--cCCCcceeeeecCC-CCCcccceeccccccccchhhheeeeeeecccccccCCC-------
Q psy14093 604 GTNFCLAHLFTHQSFWS--RGSSILGLAYIASP-RPYSIGGSIHENLKYFDTLHSSHISHTIVKRGVQESNHP------- 673 (857)
Q Consensus 604 hD~~c~AhLft~~df~~--~~~~tvGlA~vgs~-~~~~s~Gv~~~~~~~~~~~~aheigh~l~~~g~~~d~~~------- 673 (857)
|| +|+|||+++|.. ..++++|+||+|++ ++..+++|..+..+..+.++||||||+| ||.||+..
T Consensus 94 ~D---~a~llt~~d~~~~~~~~~~~G~A~~ggiC~~~~s~~i~~~~~~~~a~~~aHElGH~L---G~~HD~~~~~C~~~~ 167 (207)
T cd04273 94 HD---HAILLTRQDICRSNGNCDTLGLAPVGGMCSPSRSCSINEDTGLSSAFTIAHELGHVL---GMPHDGDGNSCGPEG 167 (207)
T ss_pred cc---eEEEEeeecccccCCCCCceEEeccccCCCCCcceEEEcCCCceeEEeeeeechhhc---CCCCCCCCCCCCCCC
Confidence 56 999999999910 01589999999999 5678888887777778889999999999 99999751
Q ss_pred -CCCcccC---------CCCCCCCHHHHHHHhhcCCCCcCC
Q psy14093 674 -FNKIKEV---------TFKTLGNLRSIRKVLEAKSGKCFS 704 (857)
Q Consensus 674 -~gcim~~---------~fss~CS~~~~~~~l~~~~~~Cl~ 704 (857)
-+|||+. .| |.||+++|.++|..+...||+
T Consensus 168 ~~~~IM~~~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~ 207 (207)
T cd04273 168 KDGHIMSPTLGANTGPFTW-SKCSRRYLTSFLDTGDGNCLL 207 (207)
T ss_pred CCceeecccccCCCCCCCc-CHHHHHHHHHHHccCCCcccC
Confidence 1699962 36 589999999999998899985
No 23
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=99.29 E-value=7e-13 Score=136.98 Aligned_cols=105 Identities=21% Similarity=0.386 Sum_probs=75.8
Q ss_pred cccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc---cccccchhhheeeeeee
Q psy14093 588 KWDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN---LKYFDTLHSSHISHTIV 663 (857)
Q Consensus 588 ~W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~---~~~~~~~~aheigh~l~ 663 (857)
.|+...+..+.. || .|+|||++++ ...++|+||+|+| ++..++||..+. ...++.++||||||+|
T Consensus 75 ~w~~~~~~~~~~----~D---~a~Llt~~~~---~~~~~G~a~~~~~C~~~~s~~i~~~~~~~~~~~a~~~AHelGH~l- 143 (199)
T PF01421_consen 75 NWQKSELPPRIH----HD---HAHLLTGKDF---PDSTVGLAYVGGMCSPSRSCGIVEDHSRSGLSFAVIIAHELGHNL- 143 (199)
T ss_dssp HHHHHTHHHHS------S---EEEEEESS-C---SCCBSEEE-TT-TTSTTTSEEEEE-SSSSHHHHHHHHHHHHHHHT-
T ss_pred HHHHhhcccccc----cc---eeEEeeeccc---ccceeeeEeCCCCCCcCCCCcEeeeccchhHHHHHHHHHHHHHhc-
Confidence 355444443333 78 9999999999 3559999999999 666888888654 6678889999999999
Q ss_pred cccccccCC-C--C----CCcccC--------CCCCCCCHHHHHHHhhcCCCCcCCCC
Q psy14093 664 KRGVQESNH-P--F----NKIKEV--------TFKTLGNLRSIRKVLEAKSGKCFSEP 706 (857)
Q Consensus 664 ~~g~~~d~~-~--~----gcim~~--------~fss~CS~~~~~~~l~~~~~~Cl~~~ 706 (857)
||.||+. . + +|||+. .| |.||..++.+++.++.+.||+|.
T Consensus 144 --Gm~HD~~~~C~C~~~~~cIM~~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~n~ 198 (199)
T PF01421_consen 144 --GMPHDGDNGCKCPRRDGCIMSPSIGPSSSDKF-SNCSRRQFEEFLSSGKSSCLLNK 198 (199)
T ss_dssp --T---TTTCTHCSSSSSSSTTSSSSSSSSTSSS--HHHHHHHHHHHHHHTTGGGSS-
T ss_pred --CCCCCCCCcCCcCCCCCccccccccCCCCCCc-CHHHHHHHHHHHccCCCCccCCC
Confidence 9999987 2 2 789974 35 57999999999999889999874
No 24
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.28 E-value=3.1e-12 Score=131.57 Aligned_cols=92 Identities=23% Similarity=0.447 Sum_probs=76.9
Q ss_pred CCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc---cccccchhhheeeeeeecccccccCCCC-----
Q psy14093 604 GTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN---LKYFDTLHSSHISHTIVKRGVQESNHPF----- 674 (857)
Q Consensus 604 hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~---~~~~~~~~aheigh~l~~~g~~~d~~~~----- 674 (857)
|| +|||||+++| .++++|+||+|++ ++..+++|.... ....+.++||||||+| ||.||+..+
T Consensus 87 ~D---~a~Llt~~~~---~~~~~G~A~~g~iC~~~~~~~v~~~~~~~~~~~a~~~AHElGH~l---G~~HD~~~c~C~~~ 157 (194)
T cd04269 87 HD---NAQLLTGRDF---DGNTVGLAYVGGMCSPKYSGGVVQDHSRNLLLFAVTMAHELGHNL---GMEHDDGGCTCGRS 157 (194)
T ss_pred CC---ceEEEEeeec---CCCceeeeecCCccCCCcceEEEEeCCcchHHHHHHHHHHHHhhc---CCCcCCCCCCCCCC
Confidence 67 9999999998 5799999999999 447778887543 4566789999999999 999998543
Q ss_pred CCcccC-------CCCCCCCHHHHHHHhhcCCCCcCCC
Q psy14093 675 NKIKEV-------TFKTLGNLRSIRKVLEAKSGKCFSE 705 (857)
Q Consensus 675 gcim~~-------~fss~CS~~~~~~~l~~~~~~Cl~~ 705 (857)
+|||.. .| |.||.+++.++|..+...||++
T Consensus 158 ~cIM~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~~ 194 (194)
T cd04269 158 TCIMAPSPSSLTDAF-SNCSYEDYQKFLSRGGGQCLLN 194 (194)
T ss_pred CeEecCCCCCCCCCC-ChhhHHHHHHHHhcCCCCccCC
Confidence 689963 45 5799999999999988999974
No 25
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=99.20 E-value=1.2e-11 Score=130.67 Aligned_cols=105 Identities=17% Similarity=0.196 Sum_probs=73.7
Q ss_pred ccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceecccc---cccc-----chhhheeeee
Q psy14093 591 VRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHENL---KYFD-----TLHSSHISHT 661 (857)
Q Consensus 591 ~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~~---~~~~-----~~~aheigh~ 661 (857)
+..|.+|......||. .++||+++..+ ++++||||+|+| ++..++|+..+.. ...+ .++||||||+
T Consensus 82 L~~F~~Wr~~~~~~d~-a~~~l~t~~~~----~~~lGlA~vg~mC~~~~s~~~~~~~~~~~~~~~~~~~~~t~AHElGHn 156 (228)
T cd04271 82 LSIFSQWRGQQPDDGN-AFWTLMTACPS----GSEVGVAWLGQLCRTGASDQGNETVAGTNVVVRTSNEWQVFAHEIGHT 156 (228)
T ss_pred HHHHHHHHhcCCCCCc-EEeEeeccccC----CCceeEeeccCCCCCCcCCCccccccccceeecccccceehhhhhhhh
Confidence 4455555443334551 14499999865 899999999999 5656666653221 1111 3899999999
Q ss_pred eecccccccCCC--C----------------------CCcccC-------CCCCCCCHHHHHHHhhcCC--CCcCC
Q psy14093 662 IVKRGVQESNHP--F----------------------NKIKEV-------TFKTLGNLRSIRKVLEAKS--GKCFS 704 (857)
Q Consensus 662 l~~~g~~~d~~~--~----------------------gcim~~-------~fss~CS~~~~~~~l~~~~--~~Cl~ 704 (857)
| ||.||+.. + +|||+. .| |.||+.+|.++|..+. +.||.
T Consensus 157 L---Gm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~~~~~F-S~CS~~~~~~~l~~~~~~~~CL~ 228 (228)
T cd04271 157 F---GAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSSGITEF-SPCTIGNICSLLGRNPVRTSCLS 228 (228)
T ss_pred c---CCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCCCCCcc-ChhhHHHHHHHHhcCCCCCcccC
Confidence 9 99999642 1 469973 46 4799999999998865 89984
No 26
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=99.12 E-value=3.3e-11 Score=114.81 Aligned_cols=61 Identities=41% Similarity=0.672 Sum_probs=34.0
Q ss_pred cceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccccccCCccc
Q psy14093 264 FCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRV 343 (857)
Q Consensus 264 ~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~~n~~G~~v 343 (857)
+++++||+..++ .+..|+||+| ++|...... +... .. ..++
T Consensus 64 aD~v~~~~~~~~-~~~~G~A~~~--------~~~~~~~~~-~~~~--------------------~~------~~~~--- 104 (124)
T PF13582_consen 64 ADLVHLFTGNDL-NGAGGIAYIG--------GVCSNNSGY-GVNY--------------------DS------SPYG--- 104 (124)
T ss_dssp -TTSEEEEEESS-GGB-EEEECT-----------TTT--T-T-SE--------------------EE------EE-----
T ss_pred CCEEEEeccCCC-CCCceEEEeC--------CccccCCce-EEEc--------------------cc------CCCC---
Confidence 339999999986 6788999999 999986531 1110 00 0011
Q ss_pred cccchhhHHhhhhhhhCCCCCC
Q psy14093 344 ITREADLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 344 ~~~~~~~t~AHElGHnlG~~HD 365 (857)
.....|+||||||+||+.||
T Consensus 105 --~~~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 105 --DSGVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp --TTSTTHHHHHHHHHTT----
T ss_pred --cccceEeeehhhHhcCCCCC
Confidence 12358999999999999998
No 27
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=99.08 E-value=9.1e-11 Score=123.31 Aligned_cols=95 Identities=17% Similarity=0.206 Sum_probs=73.4
Q ss_pred CCCCcceEEEEEeeee----cc-cCCCcceeeeecCCCCCcccceecccc--ccccchhhheeeeeeecccccccCCC--
Q psy14093 603 DGTNFCLAHLFTHQSF----WS-RGSSILGLAYIASPRPYSIGGSIHENL--KYFDTLHSSHISHTIVKRGVQESNHP-- 673 (857)
Q Consensus 603 ~hD~~c~AhLft~~df----~~-~~~~tvGlA~vgs~~~~~s~Gv~~~~~--~~~~~~~aheigh~l~~~g~~~d~~~-- 673 (857)
.|| +|+|||+++| .. ...+++|+||+|+++...++|++.++. ...+.++||||||+| ||.||+..
T Consensus 94 ~~D---~~~LlT~~~~~~~~~g~~~~~~~G~A~~g~~C~~~~~~~~~d~~~~~~~~~~~AHElGH~l---G~~HD~~~~~ 167 (220)
T cd04272 94 NPD---VVFLVTGLDMSTYSGGSLQTGTGGYAYVGGACTENRVAMGEDTPGSYYGVYTMTHELAHLL---GAPHDGSPPP 167 (220)
T ss_pred ccc---EEEEEeccceeeccCcccccCccceEeecCccCCCceeEeecCCCCcccHHHHHHHHHHHh---CCCCCCCCCC
Confidence 357 9999999987 10 014678999999995456778776653 355789999999999 99998532
Q ss_pred ---------------CCCcccC--------CCCCCCCHHHHHHHhhcCCCCcCC
Q psy14093 674 ---------------FNKIKEV--------TFKTLGNLRSIRKVLEAKSGKCFS 704 (857)
Q Consensus 674 ---------------~gcim~~--------~fss~CS~~~~~~~l~~~~~~Cl~ 704 (857)
.+|||+. .| |.||..+|..+|..+...||.
T Consensus 168 ~~~~~~~~~~~C~~~~~~IM~~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~ 220 (220)
T cd04272 168 SWVKGHPGSLDCPWDDGYIMSYVVNGERQYRF-SQCSQRQIRNVFRRLGASCLH 220 (220)
T ss_pred ccCCCCCccCcCCCCCCeEEeeccCCCCCCcc-CcchHHHHHHHHcCCCccccC
Confidence 2589973 36 579999999999998899984
No 28
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=98.84 E-value=2.5e-09 Score=109.79 Aligned_cols=85 Identities=29% Similarity=0.410 Sum_probs=67.2
Q ss_pred CCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc--cccccchhhheeeeeeecccccccCCC-----
Q psy14093 602 VDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN--LKYFDTLHSSHISHTIVKRGVQESNHP----- 673 (857)
Q Consensus 602 ~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~--~~~~~~~~aheigh~l~~~g~~~d~~~----- 673 (857)
..|| +++|||+.+| ...+++|+||++++ .+..++||+.+. ....+.++||||||+| ||.||+..
T Consensus 87 ~~~d---~~~l~t~~~~--~~~~~~G~a~~~~~C~~~~~~~v~~~~~~~~~~~~~~aHElGH~l---G~~HD~~~~~~~~ 158 (192)
T cd04267 87 IRHD---NAVLLTAQDF--IEGDILGLAYVGSMCNPYSSVGVVEDTGFTLLTALTMAHELGHNL---GAEHDGGDELAFE 158 (192)
T ss_pred CCCc---eEEEEeeecc--CCCCeeeeecccCCCCCCCCeEEEecCCcceeehhhhhhhHHhhc---CCcCCCCCCcccc
Confidence 3577 9999999997 23688999999999 566778887543 3566779999999999 99999752
Q ss_pred ----CCCcccC--------CCCCCCCHHHHHHHh
Q psy14093 674 ----FNKIKEV--------TFKTLGNLRSIRKVL 695 (857)
Q Consensus 674 ----~gcim~~--------~fss~CS~~~~~~~l 695 (857)
-+|||+. .| |.||+.+|.++|
T Consensus 159 c~~~~~~IM~~~~~~~~~~~F-S~CS~~~i~~~l 191 (192)
T cd04267 159 CDGGGNYIMAPVDSGLNSYRF-SQCSIGSIREFL 191 (192)
T ss_pred CCCCCCeEEcccccCCCCCcc-ChhhHHHHHHHh
Confidence 1579962 46 579999999886
No 29
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=98.32 E-value=1.3e-07 Score=97.28 Aligned_cols=103 Identities=26% Similarity=0.422 Sum_probs=53.4
Q ss_pred CcceeeeeEEEcCCCCccCCCccccCccccccccccccccc----ccCCCCCCcceEEEEEeeeecccCCCcceeeeecC
Q psy14093 558 GMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLETF----SSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIAS 633 (857)
Q Consensus 558 g~C~pi~~~~il~~GT~C~~G~c~~n~c~r~W~~~~lL~~~----~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs 633 (857)
+.-+.+..+.+....++=... .....++...+|+.| +.....+..|++||||+++| .++ +|+||+|+
T Consensus 45 ~I~~~i~~i~i~~~~~~~~~~-----~~~~~~~~~~~L~~f~~~~~~~~~~~~~dl~~l~t~~d~---~~g-lGlA~~g~ 115 (196)
T PF13688_consen 45 NIQFQISGITIWTSSDPYTNN-----VSNSNIDAEDTLQDFNNDFSSWRDSSYYDLAHLFTGRDF---SGG-LGLAYVGS 115 (196)
T ss_dssp --EEEEEEEEE--S-B---TT------TSSS--HHHHHHHH--HHHTHHHHTTSSEEEEEE-------GGG-EEEESSSS
T ss_pred CceEEEEEEEEecCCCCCccc-----ccccccCHHHHHHHHhhhhhhccCCCCCCEEEEEecccC---CCC-ceeEeccc
Confidence 556778888876655431111 134456666777666 22111233469999999999 344 99999999
Q ss_pred CCCCcccceeccc----------cccccchhhheeeeeeecccccccCC
Q psy14093 634 PRPYSIGGSIHEN----------LKYFDTLHSSHISHTIVKRGVQESNH 672 (857)
Q Consensus 634 ~~~~~s~Gv~~~~----------~~~~~~~~aheigh~l~~~g~~~d~~ 672 (857)
++....+|++... ......++||||||+| |..||+.
T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~AHEiGH~l---Ga~HD~~ 161 (196)
T PF13688_consen 116 SCSNTSGGICSSSVSGSYGQVPPTYNGAITFAHEIGHNL---GAPHDGD 161 (196)
T ss_dssp SSTT-SSSSS--E-EEETTEE--HHHHHHHHHHHHHHHT---T-----S
T ss_pred cCcCCccceeccCcccccceECCCCceehhhHHhHHHhc---CCCCCCC
Confidence 9655566666433 1233458899999999 9999864
No 30
>KOG3475|consensus
Probab=98.21 E-value=5.6e-07 Score=78.10 Aligned_cols=75 Identities=73% Similarity=1.287 Sum_probs=72.8
Q ss_pred cccCccccccccCccCCCccccCCCcccccccccccCCCCCCCCcccCCChhhhhhcccCCcccchhhhHHHhhc
Q psy14093 779 ATKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLRHYNWSVKAQRRKTTGTGRMRHLKIVRRRFS 853 (857)
Q Consensus 779 ~~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~C~~~~~~c~~~~~qc~~~~G~~~~~a~~~c~~~~~ 853 (857)
|+|||...|+|.++++.+|..|.-.|.+-+...+.++|+|=+.++.|.|+.+...+...|+|.|..+++.|+||+
T Consensus 1 MtKGT~sFGKr~nkshtlC~RCG~~syH~QKstC~~CGYpaak~R~ynws~KAkRR~TtGTGrMr~lk~v~rrfr 75 (92)
T KOG3475|consen 1 MTKGTSSFGKRHNKSHTLCRRCGRRSYHIQKSTCSSCGYPAAKKRSYNWSAKAKRRRTTGTGRMRYLKIVYRRFR 75 (92)
T ss_pred CCCcccchhhccccchHHHHHhCchhhhhhcccccccCCcchhccCCCcchhhhhccccccchhHHHHHHHHHHh
Confidence 678999999999999999999999999999999999999999999999999999999999999999999999997
No 31
>PF08516 ADAM_CR: ADAM cysteine-rich; InterPro: IPR006586 An ADAM is a transmembrane protein that contains a disintegrin and metalloprotease domain (MEROPS peptidase family M12B). All members of the ADAM family display a common domain organisation - a pro-domain, the metalloprotease, disintigrin, cysteine-rich, epidermal-growth factor like, and transmembrane domains and a C-terminal cytoplasmic tail. They possess four potential functions: proteolysis, cell adhesion, cell fusion, and cell signalling. ADAMs are membrane-anchored proteases that proteolytically modify cell surface and extracellular matrix (ECM) in order to alter cell behaviour. They are responsible for the proteolytic cleavage of transmembrane proteins and release of their extracellular domain [, ]. The ADAM cysteine-rich domain is not found in plant, archaeal, bacterial or viral proteins. The cysteine-rich domain complements the binding capacity of the disintegrin domain, and perhaps imparts specificity to disintegrin domain-mediated interactions. It has been shown that the cysteine-rich domain of ADAM13 regulates the protein's metalloprotease activity []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2DW2_A 2DW0_B 2DW1_A 3G5C_B 2ERQ_B 2ERO_A 2ERP_B 3K7L_A 2E3X_A 3K7N_A.
Probab=98.18 E-value=3.6e-07 Score=86.92 Aligned_cols=39 Identities=18% Similarity=0.192 Sum_probs=30.6
Q ss_pred CCCCCCCCccc-----CCChhhhhhcccCCcccchhhhHHHhhc
Q psy14093 815 CGYPCPRLRHY-----NWSVKAQRRKTTGTGRMRHLKIVRRRFS 853 (857)
Q Consensus 815 ~G~~C~~~~~~-----c~~~~~qc~~~~G~~~~~a~~~c~~~~~ 853 (857)
||+||.+.++| |++++.||+.|||++|++|+++||+.+-
T Consensus 1 nG~pC~~~~~yCy~G~C~t~~~QC~~lwG~~a~~A~~~Cy~~~N 44 (117)
T PF08516_consen 1 NGTPCSNGQGYCYNGRCPTRDQQCQELWGPGARVAPDSCYEEVN 44 (117)
T ss_dssp TT-EETTTTEEEBTTBEE-HHHHHHHHH-TTEEE--GGGGGGGG
T ss_pred CCcccCCCCeEEECCcCCCHhHHHHHHhCCCccccchhHHHHHh
Confidence 79999987788 6799999999999999999999997753
No 32
>PTZ00073 60S ribosomal protein L37; Provisional
Probab=98.15 E-value=1.5e-06 Score=76.98 Aligned_cols=75 Identities=71% Similarity=1.245 Sum_probs=72.4
Q ss_pred cccCccccccccCccCCCccccCCCcccccccccccCCCCCCCCcccCCChhhhhhcccCCcccchhhhHHHhhc
Q psy14093 779 ATKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLRHYNWSVKAQRRKTTGTGRMRHLKIVRRRFS 853 (857)
Q Consensus 779 ~~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~C~~~~~~c~~~~~qc~~~~G~~~~~a~~~c~~~~~ 853 (857)
|++||..+|+|.++++..|..|.-.|.+-+...+..+|+|.+.++.|-|+.+...+...|+|.|..++..++||+
T Consensus 1 MtKGT~SfGKr~~ktHtlCrRCG~~syH~qK~~CasCGypsak~R~YnWs~Ka~rr~ttGtGrmr~lk~v~rrf~ 75 (91)
T PTZ00073 1 MTKGTGSFGKRHGKTHTLCRRCGKRSFHVQKKRCASCGYPSAKMRRYNWSVKAKRRRTTGTGRCRYLKTVARRAK 75 (91)
T ss_pred CCCCCcccccccCcCcchhcccCccccccccccchhcCCchhhccccchhhhhccccCCccchhHHHHHHHHHHh
Confidence 678999999999999999999999999999999999999999999999999999999999999999999999996
No 33
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=98.13 E-value=9.4e-07 Score=89.73 Aligned_cols=87 Identities=22% Similarity=0.282 Sum_probs=38.8
Q ss_pred CCCCCCcceEEEEEeeeecccCCCcceeeeecCCCCCc--ccceecccc-ccccchhhheeeeeeecccccccCCCC---
Q psy14093 601 HVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASPRPYS--IGGSIHENL-KYFDTLHSSHISHTIVKRGVQESNHPF--- 674 (857)
Q Consensus 601 ~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~~~~~--s~Gv~~~~~-~~~~~~~aheigh~l~~~g~~~d~~~~--- 674 (857)
...++++|++||| .++ .++..||||+|+++... ..|+..+.. .+..-++||||||+| |+.||...-
T Consensus 64 ~~~~~~~Di~~l~--~~~---~~g~~Gla~~g~~c~~~~~~~g~~~~~~~~~~~~~~aHElGH~l---Ga~Hd~~~~~~~ 135 (173)
T PF13574_consen 64 WIGSSNYDIGHLF--GTF---SGGGGGLAYVGGVCSKGYKATGLSSPYGDFFGIDTFAHELGHQL---GAPHDFDNGCNP 135 (173)
T ss_dssp ----TCSS-CCEE--EEE---SSGCCSSEEE-CCCCTCCCE-CCCS--TTSHHHHHHHHHHHHHH---T---SSSS----
T ss_pred cCCCCCCCEEEEe--ecc---CCCCccEEEEeEeCcCCCEEEEEEcCCCceeeeeeehhhhHhhc---CCCCCCCCCCCC
Confidence 3446778899999 444 37899999999994322 234432211 123346999999999 999987431
Q ss_pred ---------CCccc-------CCCCCCCCHHHHHHHhh
Q psy14093 675 ---------NKIKE-------VTFKTLGNLRSIRKVLE 696 (857)
Q Consensus 675 ---------gcim~-------~~fss~CS~~~~~~~l~ 696 (857)
+-||. ..| |+||+++|..+|.
T Consensus 136 ~~~~~~~sg~~IMsy~~~~~~~~f-S~cS~~~i~~~l~ 172 (173)
T PF13574_consen 136 GSPTCPGSGGYIMSYAGICNNDKF-SPCSIRQIRAVLG 172 (173)
T ss_dssp SS--T-STTT-TT-SS-GGGG-----------------
T ss_pred CCCcCCCCCCEEEeeCCCCCCccc-ccccccccccccC
Confidence 24775 267 5899999998875
No 34
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=97.62 E-value=4.9e-05 Score=75.84 Aligned_cols=58 Identities=22% Similarity=0.307 Sum_probs=45.1
Q ss_pred eEEEEEeeeecccCCCcceeeeecCC-CCCcccceecccc---ccccchhhheeeeeeecccccccCC
Q psy14093 609 LAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHENL---KYFDTLHSSHISHTIVKRGVQESNH 672 (857)
Q Consensus 609 ~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~~---~~~~~~~aheigh~l~~~g~~~d~~ 672 (857)
++++++..++ .+++.|+||++++ .+....++..... .....+++|||||.| ||.|+..
T Consensus 54 i~~~~~~~~~---~~~~~g~a~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~HElGH~L---Gl~H~~~ 115 (167)
T cd00203 54 IAILVTRQDF---DGGTGGWAYLGRVCDSLRGVGVLQDNQSGTKEGAQTIAHELGHAL---GFYHDHD 115 (167)
T ss_pred EEEEeccccC---CCCceEEEecCCccCCCCCcEEEecCCcccccchhhHHHHHHHHh---CCCccCc
Confidence 9999999987 4688999999998 5555555554332 356779999999999 9988754
No 35
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.46 E-value=7.7e-05 Score=76.15 Aligned_cols=100 Identities=24% Similarity=0.379 Sum_probs=56.5
Q ss_pred cccccccchhhhcccccccc--CCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCcc
Q psy14093 240 NMIRDKWDVRNLLEVFSREY--SHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSS 317 (857)
Q Consensus 240 ~~~f~~w~~~~ll~~~s~~~--~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~ 317 (857)
+....++....+++...... ..+ .+.++|..|+.. -|++||. |+|.+... .+++. .
T Consensus 47 ~~~r~Q~~a~~iL~~l~~~~~~~~~---~vl~vt~~DLy~--~~~nfvf--------G~a~~~~~--~aVvS-------~ 104 (179)
T PRK13267 47 DWERGQYRAEKFLPLLSRIGRFNGD---KNIGITDCDLYY--RGLNFVF--------GLAYPNLR--GAVIS-------T 104 (179)
T ss_pred CCCCceecHHHHHHHHHhhCCCCCc---eEEEEEccccCC--CCCCcee--------EEeecCCC--EEEEE-------c
Confidence 33445566666665544322 334 788899999753 3677877 88887542 23330 0
Q ss_pred ccccccceeecccCCccccccCCcc----ccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEee
Q psy14093 318 QYFKNGYTLYLNSGLSSSRNHYGQR----VITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTY 386 (857)
Q Consensus 318 ~~~~~~~~~~~n~Gl~~~~n~~G~~----v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~ 386 (857)
. -++..+ ||.. +...-...+++|||||+||+.| |. ...|+|+.+
T Consensus 105 ~--------Rl~~~~------~~~~~~~~l~~~R~~k~~~HElGH~lGL~H------C~-----~~~CvM~~s 152 (179)
T PRK13267 105 Y--------RLRPEF------YGNKPDSELFEERVRKEVTHELGHTLGLEH------CD-----NPRCVMNFS 152 (179)
T ss_pred c--------ccChhh------ccccchHHHHHHHHHHHHHHHHHHHcCCcc------CC-----CCCccCCCC
Confidence 0 011110 2211 1111234458999999999999 43 246899854
No 36
>KOG3538|consensus
Probab=97.31 E-value=0.00037 Score=87.23 Aligned_cols=112 Identities=17% Similarity=0.242 Sum_probs=84.3
Q ss_pred CCCcceEEEEEeeeecc---cCCCcceeeeecCC-CCCcccceeccccccccchhhheeeeeeecccccccCCCCC----
Q psy14093 604 GTNFCLAHLFTHQSFWS---RGSSILGLAYIASP-RPYSIGGSIHENLKYFDTLHSSHISHTIVKRGVQESNHPFN---- 675 (857)
Q Consensus 604 hD~~c~AhLft~~df~~---~~~~tvGlA~vgs~-~~~~s~Gv~~~~~~~~~~~~aheigh~l~~~g~~~d~~~~g---- 675 (857)
|| .|.|||..||-. -...++|+|.+++| .+.++.-|+.+..+..+.++|||+||++ ||.||+....
T Consensus 270 ~D---~ailltr~d~~~~~~~~~~~~G~A~v~gmC~~~~sC~i~e~~gl~~aftiahE~GH~~---gm~hd~~~~~C~~~ 343 (845)
T KOG3538|consen 270 HD---HAILLTRQDFCEQQNGPCDTLGVAPVGGMCSPSRSCSINEDTGLPSAFTIAHELGHNF---GMEHDGRGNSCGPT 343 (845)
T ss_pred cc---eeEEeeeeecccccCCCCccccceeecceeCCccceeeecCCCCccceeeeecccccc---CcccCCcCCCCCCc
Confidence 57 999999888721 23478999999999 7888888887777778889999999999 9999974322
Q ss_pred -CcccC--------CCCCCCCHHHHHHHhhcCCCCcCCCCC--------ccccCCccccccccc
Q psy14093 676 -KIKEV--------TFKTLGNLRSIRKVLEAKSGKCFSEPE--------ESFCGNLRVEGDEEC 722 (857)
Q Consensus 676 -cim~~--------~fss~CS~~~~~~~l~~~~~~Cl~~~~--------~~~CGNgiVE~gEeC 722 (857)
.||.. ...|.||...+..||......||.... ....| ......|||
T Consensus 344 ~~im~~~~~~~~~~~~WS~CS~~~~t~fl~~~~~~Cl~d~p~~~~~~~p~~lPg-~~y~~~~qC 406 (845)
T KOG3538|consen 344 FHVMSSDTCGGNHELTWSECSREYLTRFLDDGEGNCLSDQPFSALPKLPSQLPG-QTYDADEQC 406 (845)
T ss_pred ccccCcccccCCCCccCCcchHHHHHHHhccCccceeecCCccccCCCCccCCc-cccCCCCcc
Confidence 36752 224689999999999988789998542 12344 345556666
No 37
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=96.83 E-value=0.00042 Score=65.92 Aligned_cols=58 Identities=24% Similarity=0.256 Sum_probs=34.5
Q ss_pred CcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc---cccccchhhheeeeeeeccccccc
Q psy14093 606 NFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN---LKYFDTLHSSHISHTIVKRGVQES 670 (857)
Q Consensus 606 ~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~---~~~~~~~~aheigh~l~~~g~~~d 670 (857)
++.+++||+..++ .+..|+||++++ .+....++.... ......+++|||||+| |+.||
T Consensus 63 ~aD~v~~~~~~~~----~~~~G~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGH~l---Gl~Hd 124 (124)
T PF13582_consen 63 GADLVHLFTGNDL----NGAGGIAYIGGVCSNNSGYGVNYDSSPYGDSGVDTFAHEIGHNL---GLNHD 124 (124)
T ss_dssp T-TTSEEEEEESS----GGB-EEEECT---TTT--TT-SEEEEE--TTSTTHHHHHHHHHT---T----
T ss_pred CCCEEEEeccCCC----CCCceEEEeCCccccCCceEEEcccCCCCcccceEeeehhhHhc---CCCCC
Confidence 3449999999887 688999999988 444333443221 2234578999999999 98886
No 38
>smart00608 ACR ADAM Cysteine-Rich Domain.
Probab=96.83 E-value=0.0005 Score=67.37 Aligned_cols=30 Identities=10% Similarity=-0.064 Sum_probs=27.5
Q ss_pred cccCCChhhhhhcccCCcccchhhhHHHhh
Q psy14093 823 RHYNWSVKAQRRKTTGTGRMRHLKIVRRRF 852 (857)
Q Consensus 823 ~~~c~~~~~qc~~~~G~~~~~a~~~c~~~~ 852 (857)
+|.|.+++.||+.|||++|+.|++.||+.+
T Consensus 14 ~G~C~t~~~QC~~lfG~~a~~A~~~Cy~~~ 43 (137)
T smart00608 14 NGRCPTRDNQCQALFGPGAKVAPDSCYEEL 43 (137)
T ss_pred CCCcCCcchhhHhhhCCCCcccChhHhHHH
Confidence 356999999999999999999999999865
No 39
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=96.82 E-value=0.0011 Score=69.46 Aligned_cols=86 Identities=22% Similarity=0.281 Sum_probs=57.0
Q ss_pred CcceEEEEEeeeecccCCCcceeeeecCCC-CCcccceeccc---cccccchhhheeeeeeecccccccCCCC-------
Q psy14093 606 NFCLAHLFTHQSFWSRGSSILGLAYIASPR-PYSIGGSIHEN---LKYFDTLHSSHISHTIVKRGVQESNHPF------- 674 (857)
Q Consensus 606 ~~c~AhLft~~df~~~~~~tvGlA~vgs~~-~~~s~Gv~~~~---~~~~~~~~aheigh~l~~~g~~~d~~~~------- 674 (857)
++++++||+.... ..+..|+||+++.. .+..+++.... ...+ -+++|||||+| |+.|+...-
T Consensus 93 ~~D~v~l~~~~~~---~~~~~GlA~~~~~~~~~~~~~~~~~~~~~~~~~-~~~aHEiGH~l---Gl~H~~~~~~~~~~~~ 165 (206)
T PF13583_consen 93 NADLVILLTDTNK---DSGTCGLAYVGSVCWDNAKAGAYSDPDLSANGY-QTFAHEIGHNL---GLRHDFDYSGDPYSYA 165 (206)
T ss_pred CCCEEEEEecccc---CCCCCCeeeeccccccccceEEEEecccccccc-hHHHHHHHHHh---cCCCCcccCCCccccc
Confidence 3449999998754 37899999999984 44444444321 1122 35999999999 988876321
Q ss_pred ------CCcccC-------CCCCCCCHHHHHHHhhcCC
Q psy14093 675 ------NKIKEV-------TFKTLGNLRSIRKVLEAKS 699 (857)
Q Consensus 675 ------gcim~~-------~fss~CS~~~~~~~l~~~~ 699 (857)
.-||.. .| |+|++.++..++..+.
T Consensus 166 ~~~~~~~tIMsY~~~~~~~~F-S~~~i~~i~~~~~~~~ 202 (206)
T PF13583_consen 166 YECDSGQTIMSYASRPSRPYF-SPPSIEYIREVVICGD 202 (206)
T ss_pred ccCCCCCEEeCCCCCCCCCCC-CchhHHHHhCCCcCcc
Confidence 247752 35 4788888876654443
No 40
>PRK04179 rpl37e 50S ribosomal protein L37e; Reviewed
Probab=96.05 E-value=0.004 Score=51.92 Aligned_cols=54 Identities=50% Similarity=1.046 Sum_probs=49.4
Q ss_pred cccCccccccccCc-cCCCccccCCCcccccccccccCCC-CCCCCcccCCChhhh
Q psy14093 779 ATKGTSSFGKRRNK-THTLCRRCGKSSYHIQKKKCAQCGY-PCPRLRHYNWSVKAQ 832 (857)
Q Consensus 779 ~~~g~~c~~~r~~~-~~~~Ce~C~G~s~~Cp~d~~~~~G~-~C~~~~~~c~~~~~q 832 (857)
|++||..+|+|.++ ++..|..|.-.|.+-+...++.+|+ |...++.|-|+....
T Consensus 1 MtKGT~SfGKr~~k~tHt~CrRCG~~syh~qK~~CasCGygps~k~R~YnWs~Ka~ 56 (62)
T PRK04179 1 MSKGTPSMGKRNKGKTHIRCRRCGRHSYNVRKKYCAACGFGRSKRIRRYSWQNKKV 56 (62)
T ss_pred CCCCCcccccccCCcccchhcccCcccccccccchhhcCCCcccccccccHHHHhh
Confidence 67899999999865 8999999999999999999999999 999999999987653
No 41
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=95.73 E-value=0.009 Score=59.44 Aligned_cols=42 Identities=31% Similarity=0.384 Sum_probs=29.4
Q ss_pred hhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCC
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSS 400 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~ 400 (857)
...|++|||||.||+.|.... ..-||++....+. ......+.
T Consensus 104 ~~~~~~HEiGHaLGL~H~~~~----------~~siM~p~~~~~~-~~~~~l~~ 145 (156)
T cd04279 104 LQAIALHELGHALGLWHHSDR----------PEDAMYPSQGQGP-DGNPTLSA 145 (156)
T ss_pred HHHHHHHHhhhhhcCCCCCCC----------ccceeeeecccCC-cCcCCCCH
Confidence 467999999999999999752 1369999865431 12335555
No 42
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=95.62 E-value=0.0028 Score=62.39 Aligned_cols=31 Identities=35% Similarity=0.629 Sum_probs=24.8
Q ss_pred chhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeecc
Q psy14093 347 EADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSV 388 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~ 388 (857)
....|++|||||.||+.|..+ ..-||++...
T Consensus 104 ~~~~v~~HEiGHaLGL~H~~~-----------~~svM~~~~~ 134 (154)
T PF00413_consen 104 DLQSVAIHEIGHALGLDHSND-----------PNSVMYPYYR 134 (154)
T ss_dssp EHHHHHHHHHHHHTTBESSSS-----------TTSTTSSSCT
T ss_pred hhhhhhhhccccccCcCcCCC-----------cccceeeecc
Confidence 356899999999999999864 2358999754
No 43
>smart00608 ACR ADAM Cysteine-Rich Domain.
Probab=95.47 E-value=0.0094 Score=58.47 Aligned_cols=23 Identities=35% Similarity=0.810 Sum_probs=20.8
Q ss_pred CCCcCCCC-ceecCCccch---Hhhhc
Q psy14093 522 DNTGCLER-GKCRGGKCIP---FCETQ 544 (857)
Q Consensus 522 dgt~C~~~-~~C~~G~C~~---~C~~~ 544 (857)
||+||.++ ++||+|+|.+ ||+.+
T Consensus 1 dG~pC~~~~~yCy~G~C~t~~~QC~~l 27 (137)
T smart00608 1 DGTPCDNGQGYCYNGRCPTRDNQCQAL 27 (137)
T ss_pred CCcCcCCCCEEeECCCcCCcchhhHhh
Confidence 79999986 8999999996 89885
No 44
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=94.63 E-value=0.016 Score=57.71 Aligned_cols=40 Identities=35% Similarity=0.466 Sum_probs=28.2
Q ss_pred hhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCC
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSS 400 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~ 400 (857)
...|+.|||||.||+.|-.+ ..-||++..... .+...+|+
T Consensus 107 ~~~~~~HEiGHaLGL~H~~~-----------~~~vM~~~~~~~--~~~~~l~~ 146 (157)
T cd04278 107 LFSVAAHEIGHALGLGHSSD-----------PDSIMYPYYQGP--VPKFKLSQ 146 (157)
T ss_pred HHHHHHHHhccccccCCCCC-----------CcCeecccccCC--CcCCCCCH
Confidence 45799999999999999764 235999874321 13455665
No 45
>TIGR02232 myxo_disulf_rpt Myxococcus cysteine-rich repeat. This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
Probab=94.57 E-value=0.034 Score=42.36 Aligned_cols=34 Identities=35% Similarity=0.776 Sum_probs=26.6
Q ss_pred cccCCccccccccccCCCCCCCCCCCCCCCCCccccC
Q psy14093 709 SFCGNLRVEGDEECDAGLLGTEDNDSCCDKVCKLRRN 745 (857)
Q Consensus 709 ~~CGNgiVE~gEeCDCG~~~~c~~d~CC~~tC~Lk~g 745 (857)
+.||||+|..+||||=|-... .|- |.+.|++-++
T Consensus 2 ~~CGdG~~~~~EeCDDGN~~~--~DG-Cs~~C~iE~~ 35 (38)
T TIGR02232 2 PTCGDGIIEPGEECDDGNTTS--GDG-CSATCRLEEG 35 (38)
T ss_pred CcCCCCcCcCCccccCCCCcC--CCC-cCcccccCCC
Confidence 679999999999999996532 233 4678998776
No 46
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=94.24 E-value=0.021 Score=55.53 Aligned_cols=18 Identities=44% Similarity=0.667 Sum_probs=16.5
Q ss_pred hHHhhhhhhhCCCCCCCC
Q psy14093 350 LVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 350 ~t~AHElGHnlG~~HD~~ 367 (857)
.|++|||||.||+.|...
T Consensus 88 ~~~~HEigHaLGl~H~~~ 105 (140)
T smart00235 88 GVAAHELGHALGLYHEQS 105 (140)
T ss_pred ccHHHHHHHHhcCCcCCC
Confidence 389999999999999975
No 47
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=94.09 E-value=0.043 Score=56.21 Aligned_cols=40 Identities=28% Similarity=0.282 Sum_probs=27.7
Q ss_pred hhhHHhhhhhhhCCCCCCCCCCCCCCC-----CCCCCceEEEeec
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDMPECSPS-----ASQGGSYLMYTYS 387 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~~~C~~~-----~~~~g~~IM~~~~ 387 (857)
.-.|+.|||||.||+.|..+...-.+. ......-||++..
T Consensus 113 ~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~ 157 (186)
T cd04277 113 GYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNS 157 (186)
T ss_pred hHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecC
Confidence 467999999999999999865332221 1123456999864
No 48
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=94.07 E-value=0.021 Score=56.89 Aligned_cols=40 Identities=20% Similarity=0.228 Sum_probs=26.1
Q ss_pred hhhHHhhhhhhhCCCCCCCCCCCCCCC-----CCCCCceEEEeec
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDMPECSPS-----ASQGGSYLMYTYS 387 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~~~C~~~-----~~~~g~~IM~~~~ 387 (857)
...|++|||||.||+.|-.....-.+. ....-.-||++..
T Consensus 94 ~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~ 138 (165)
T cd04268 94 LRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAP 138 (165)
T ss_pred HHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCc
Confidence 467999999999999999864322210 0001356888864
No 49
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=93.93 E-value=0.095 Score=52.88 Aligned_cols=27 Identities=30% Similarity=0.347 Sum_probs=21.3
Q ss_pred hhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEee
Q psy14093 349 DLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTY 386 (857)
Q Consensus 349 ~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~ 386 (857)
.--+.|||||.||+.|=++ ..|+|+.+
T Consensus 125 ~KEv~HElGH~~GL~HC~N-----------~~CVM~FS 151 (181)
T COG1913 125 VKEVLHELGHLLGLSHCPN-----------PRCVMNFS 151 (181)
T ss_pred HHHHHHHhhhhcCcccCCC-----------CCcEEeCC
Confidence 3457899999999999643 46899865
No 50
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=93.90 E-value=0.2 Score=53.27 Aligned_cols=21 Identities=29% Similarity=0.248 Sum_probs=18.3
Q ss_pred hhhHHhhhhhhhCCCCCCCCC
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDM 368 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~ 368 (857)
...|++|||||-||+.|--..
T Consensus 137 ~g~t~~HEvGH~lGL~HtF~~ 157 (225)
T cd04275 137 LGDTATHEVGHWLGLYHTFQG 157 (225)
T ss_pred ccceeEEeccceeeeeeeecC
Confidence 357999999999999999764
No 51
>COG2126 RPL37A Ribosomal protein L37E [Translation, ribosomal structure and biogenesis]
Probab=93.59 E-value=0.049 Score=44.89 Aligned_cols=53 Identities=58% Similarity=1.195 Sum_probs=47.4
Q ss_pred cccCccccccccCccCCCccccCCCcccccccccccCCCC-CCCCcccCCChhh
Q psy14093 779 ATKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYP-CPRLRHYNWSVKA 831 (857)
Q Consensus 779 ~~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~-C~~~~~~c~~~~~ 831 (857)
|++||..+|+|.++++..|..|.-.|.+.....+..+|+| -..++.|-|..+.
T Consensus 1 MskGT~S~GKrnk~tH~~CRRCGr~syhv~k~~CaaCGfgrs~r~R~y~W~~K~ 54 (61)
T COG2126 1 MSKGTPSFGKRNKKTHIRCRRCGRRSYHVRKKYCAACGFGRSARMRSYNWQAKK 54 (61)
T ss_pred CCCCCccccccCCcceehhhhccchheeeccceecccCCCCccccccchhhhhh
Confidence 6789999999998889999999999999999999999999 7778888776553
No 52
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=93.44 E-value=0.049 Score=56.93 Aligned_cols=28 Identities=29% Similarity=0.470 Sum_probs=21.3
Q ss_pred chhhHHhhhhhhhCCCCCCCC--CCCCCCC
Q psy14093 347 EADLVTAHEFGHNWGSEHDPD--MPECSPS 374 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~HD~~--~~~C~~~ 374 (857)
..+.||+|||||.+|+.|-+- ...|...
T Consensus 132 ~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~ 161 (211)
T PF12388_consen 132 VIEHVITHEIGHCIGFRHTDYFNRSSCGSG 161 (211)
T ss_pred HHHHHHHHHhhhhccccccCcCCccccccc
Confidence 357899999999999999762 1267543
No 53
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=92.93 E-value=0.09 Score=58.59 Aligned_cols=19 Identities=37% Similarity=0.416 Sum_probs=16.6
Q ss_pred hhHHhhhhhhhCCCCCCCC
Q psy14093 349 DLVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 349 ~~t~AHElGHnlG~~HD~~ 367 (857)
..|+.||||||||+.|=..
T Consensus 151 ~~~~~HElgHN~GL~Ha~~ 169 (314)
T PF05548_consen 151 WATIMHELGHNLGLWHAGR 169 (314)
T ss_pred HHHHHHHhhhhccccccCC
Confidence 3489999999999999863
No 54
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=92.61 E-value=0.049 Score=54.49 Aligned_cols=24 Identities=29% Similarity=0.346 Sum_probs=15.9
Q ss_pred hhhHHhhhhhhhCCCCCCCCCCCCC
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDMPECS 372 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~~~C~ 372 (857)
.+.|++||+||-||+.|--.. .|.
T Consensus 69 ~g~TltHEvGH~LGL~HtF~~-~C~ 92 (154)
T PF05572_consen 69 FGKTLTHEVGHWLGLYHTFGG-GCD 92 (154)
T ss_dssp SSHHHHHHHHHHTT---TT----TS
T ss_pred cccchhhhhhhhhcccccccC-CCC
Confidence 478999999999999999753 375
No 55
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=92.23 E-value=0.093 Score=54.65 Aligned_cols=26 Identities=27% Similarity=0.396 Sum_probs=20.6
Q ss_pred hhhHHhhhhhhhCCCCCCCCCCCCCC
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDMPECSP 373 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~~~C~~ 373 (857)
...+++|||||.||+.|.-......|
T Consensus 116 ~~~~~~he~gh~lGl~hn~~~s~~~~ 141 (197)
T cd04276 116 LRYLLAHEVGHTLGLRHNFKASSDGS 141 (197)
T ss_pred HHHHHHHHHHHHhcCcccccccccCc
Confidence 45799999999999999865544544
No 56
>TIGR02232 myxo_disulf_rpt Myxococcus cysteine-rich repeat. This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
Probab=92.05 E-value=0.16 Score=38.78 Aligned_cols=37 Identities=14% Similarity=0.079 Sum_probs=26.8
Q ss_pred cccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcccCCCcc
Q psy14093 424 SILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSELFPVLLY 473 (857)
Q Consensus 424 ~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l~~ga~C 473 (857)
+.|||+++..+ ||||-|-.. -.|-| ... |++-++..|
T Consensus 2 ~~CGdG~~~~~--EeCDDGN~~--------~~DGC-s~~--C~iE~~~~C 38 (38)
T TIGR02232 2 PTCGDGIIEPG--EECDDGNTT--------SGDGC-SAT--CRLEEGFAC 38 (38)
T ss_pred CcCCCCcCcCC--ccccCCCCc--------CCCCc-Ccc--cccCCCCCC
Confidence 68999999999 999999653 23444 333 777776655
No 57
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=91.84 E-value=0.093 Score=53.83 Aligned_cols=85 Identities=15% Similarity=0.207 Sum_probs=55.7
Q ss_pred CcccccccccccccccccCC--CCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccccc-----------
Q psy14093 583 NMIRDKWDVRNLLETFSSHV--DGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHENLK----------- 648 (857)
Q Consensus 583 n~c~r~W~~~~lL~~~~~~~--~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~~~----------- 648 (857)
+.-+..++...+|+++.... +.+ .+.++|..|.+. -|+.|+.+. .+...+||++...+
T Consensus 47 ~~~r~Q~~a~~iL~~l~~~~~~~~~---~vl~vt~~DLy~-----~~~nfvfG~a~~~~~~aVvS~~Rl~~~~~~~~~~~ 118 (179)
T PRK13267 47 DWERGQYRAEKFLPLLSRIGRFNGD---KNIGITDCDLYY-----RGLNFVFGLAYPNLRGAVISTYRLRPEFYGNKPDS 118 (179)
T ss_pred CCCCceecHHHHHHHHHhhCCCCCc---eEEEEEccccCC-----CCCCceeEEeecCCCEEEEEccccChhhccccchH
Confidence 44445566667776665432 233 678889988642 367888888 66667777742111
Q ss_pred -----cccchhhheeeeeeecccccccCCCCCCccc
Q psy14093 649 -----YFDTLHSSHISHTIVKRGVQESNHPFNKIKE 679 (857)
Q Consensus 649 -----~~~~~~aheigh~l~~~g~~~d~~~~gcim~ 679 (857)
.+...++||+||++ |+.|-. +..|+|+
T Consensus 119 ~l~~~R~~k~~~HElGH~l---GL~HC~-~~~CvM~ 150 (179)
T PRK13267 119 ELFEERVRKEVTHELGHTL---GLEHCD-NPRCVMN 150 (179)
T ss_pred HHHHHHHHHHHHHHHHHHc---CCccCC-CCCccCC
Confidence 11224799999999 999974 5689997
No 58
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=91.11 E-value=0.076 Score=50.95 Aligned_cols=23 Identities=35% Similarity=0.512 Sum_probs=14.4
Q ss_pred hhHHhhhhhhhCCCCCCCCCCCCC
Q psy14093 349 DLVTAHEFGHNWGSEHDPDMPECS 372 (857)
Q Consensus 349 ~~t~AHElGHnlG~~HD~~~~~C~ 372 (857)
-.|+||||||.||++-.-.+ -|+
T Consensus 78 ~RIaaHE~GHiLGLPD~y~G-pCS 100 (132)
T PF02031_consen 78 TRIAAHELGHILGLPDHYPG-PCS 100 (132)
T ss_dssp HHHHHHHHHHHHT----TTS--TT
T ss_pred ceeeeehhccccCCCCCCCC-ccH
Confidence 46999999999999866543 465
No 59
>PF01907 Ribosomal_L37e: Ribosomal protein L37e; InterPro: IPR001569 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. A number of eukaryotic and archaeal ribosomal proteins can be grouped on the basis of sequence similarities. One of these families consists of proteins of 56 to 96 amino-acid residues that share a highly conserved region located in the N-terminal part.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A19_A 4A1D_A 4A18_A 4A1B_A 1S1I_Y 3O5H_d 3IZS_l 3O58_d 2ZKR_2 3IZR_l ....
Probab=90.12 E-value=0.12 Score=42.46 Aligned_cols=52 Identities=79% Similarity=1.421 Sum_probs=41.2
Q ss_pred ccCccccccccCccCCCccccCCCcccccccccccCCCCCCCCc-ccCCChhh
Q psy14093 780 TKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLR-HYNWSVKA 831 (857)
Q Consensus 780 ~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~C~~~~-~~c~~~~~ 831 (857)
++||..+|+|.++++..|..|.-.|.+-....+..+|+|=..++ .|-|+.+.
T Consensus 1 tKGT~S~Gkr~~ktH~~CrRCG~~syH~qK~~CasCGyp~~kkrr~ynWs~Ka 53 (55)
T PF01907_consen 1 TKGTSSFGKRHNKTHTLCRRCGRRSYHIQKKTCASCGYPAAKKRRKYNWSAKA 53 (55)
T ss_dssp -TTHHHHTTS-S-SEEE-TTTSSEEEETTTTEETTTBTTTSSS----SSSSHH
T ss_pred CCCCCCccccCCccEeeecccCCeeeecCCCcccccCCCcccccccccchhhh
Confidence 36999999999889999999999999999999999999998877 88887654
No 60
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=88.39 E-value=0.22 Score=55.03 Aligned_cols=19 Identities=42% Similarity=0.667 Sum_probs=14.5
Q ss_pred hhhHHhhhhhhhCCCCCCC
Q psy14093 348 ADLVTAHEFGHNWGSEHDP 366 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~ 366 (857)
...+++|||||+||+.|=+
T Consensus 193 ~~~~f~HE~GH~~GL~H~p 211 (305)
T PF10462_consen 193 YGNEFSHELGHNFGLGHYP 211 (305)
T ss_dssp SHHHHHHHHHHTTT--SST
T ss_pred ccceeehhhhhhcCCCCCC
Confidence 4689999999999999943
No 61
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=87.84 E-value=0.26 Score=51.17 Aligned_cols=22 Identities=41% Similarity=0.675 Sum_probs=18.2
Q ss_pred hhHHhhhhhhhCCCCCCCCCCC
Q psy14093 349 DLVTAHEFGHNWGSEHDPDMPE 370 (857)
Q Consensus 349 ~~t~AHElGHnlG~~HD~~~~~ 370 (857)
..|+.|||||.||+.|....++
T Consensus 93 ~~~i~HElgHaLG~~HEh~rpd 114 (198)
T cd04327 93 SRVVLHEFGHALGFIHEHQSPA 114 (198)
T ss_pred HHHHHHHHHHHhcCcccccCCC
Confidence 3589999999999999976543
No 62
>KOG1565|consensus
Probab=86.47 E-value=0.4 Score=56.26 Aligned_cols=30 Identities=40% Similarity=0.618 Sum_probs=25.3
Q ss_pred hhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeecc
Q psy14093 348 ADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSV 388 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~ 388 (857)
+.+|+||||||.||+.|-.+ ..-||+|...
T Consensus 211 l~~Va~HEiGH~LGL~HS~~-----------~~aiM~P~y~ 240 (469)
T KOG1565|consen 211 LFLVAAHEIGHALGLGHSSD-----------PDAIMYPFYQ 240 (469)
T ss_pred hHHHhhhhcccccccCCCCC-----------cccccccccc
Confidence 57899999999999999854 3579999865
No 63
>PF08516 ADAM_CR: ADAM cysteine-rich; InterPro: IPR006586 An ADAM is a transmembrane protein that contains a disintegrin and metalloprotease domain (MEROPS peptidase family M12B). All members of the ADAM family display a common domain organisation - a pro-domain, the metalloprotease, disintigrin, cysteine-rich, epidermal-growth factor like, and transmembrane domains and a C-terminal cytoplasmic tail. They possess four potential functions: proteolysis, cell adhesion, cell fusion, and cell signalling. ADAMs are membrane-anchored proteases that proteolytically modify cell surface and extracellular matrix (ECM) in order to alter cell behaviour. They are responsible for the proteolytic cleavage of transmembrane proteins and release of their extracellular domain [, ]. The ADAM cysteine-rich domain is not found in plant, archaeal, bacterial or viral proteins. The cysteine-rich domain complements the binding capacity of the disintegrin domain, and perhaps imparts specificity to disintegrin domain-mediated interactions. It has been shown that the cysteine-rich domain of ADAM13 regulates the protein's metalloprotease activity []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2DW2_A 2DW0_B 2DW1_A 3G5C_B 2ERQ_B 2ERO_A 2ERP_B 3K7L_A 2E3X_A 3K7N_A.
Probab=86.06 E-value=0.34 Score=46.28 Aligned_cols=31 Identities=29% Similarity=0.733 Sum_probs=22.8
Q ss_pred CCCcCCCC-ceecCCccch---Hhhhc-------cccccccc
Q psy14093 522 DNTGCLER-GKCRGGKCIP---FCETQ-------NQQSCMCD 552 (857)
Q Consensus 522 dgt~C~~~-~~C~~G~C~~---~C~~~-------~~~~C~c~ 552 (857)
||+||.++ ++||+|+|.+ ||+.+ +.+.|+=.
T Consensus 1 nG~pC~~~~~yCy~G~C~t~~~QC~~lwG~~a~~A~~~Cy~~ 42 (117)
T PF08516_consen 1 NGTPCSNGQGYCYNGRCPTRDQQCQELWGPGARVAPDSCYEE 42 (117)
T ss_dssp TT-EETTTTEEEBTTBEE-HHHHHHHHH-TTEEE--GGGGGG
T ss_pred CCcccCCCCeEEECCcCCCHhHHHHHHhCCCccccchhHHHH
Confidence 79999986 8999999996 89996 23667643
No 64
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=85.55 E-value=0.35 Score=49.76 Aligned_cols=18 Identities=28% Similarity=0.366 Sum_probs=16.3
Q ss_pred hHHhhhhhhhCCCCCCCC
Q psy14093 350 LVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 350 ~t~AHElGHnlG~~HD~~ 367 (857)
-|++|||||.||+.|.-.
T Consensus 79 G~i~HEl~HaLG~~HEhs 96 (182)
T cd04283 79 GIIQHELLHALGFYHEQT 96 (182)
T ss_pred chHHHHHHHHhCCccccc
Confidence 489999999999999974
No 65
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=85.02 E-value=0.42 Score=49.59 Aligned_cols=19 Identities=32% Similarity=0.490 Sum_probs=13.1
Q ss_pred hhhHHhhhhhhhCCCCCCC
Q psy14093 348 ADLVTAHEFGHNWGSEHDP 366 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~HD~ 366 (857)
+.-++.||+||.||+.|=.
T Consensus 145 ~~Kea~HElGH~~GL~HC~ 163 (194)
T PF07998_consen 145 VCKEAVHELGHLFGLDHCE 163 (194)
T ss_dssp HHHHHHHHHHHHTT----S
T ss_pred HHHHHHHHHHHHcCCcCCC
Confidence 4678999999999999964
No 66
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=84.61 E-value=0.4 Score=50.10 Aligned_cols=18 Identities=33% Similarity=0.484 Sum_probs=16.2
Q ss_pred hHHhhhhhhhCCCCCCCC
Q psy14093 350 LVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 350 ~t~AHElGHnlG~~HD~~ 367 (857)
-|++|||||.||+.|.-.
T Consensus 89 Gti~HEl~HaLGf~HEhs 106 (200)
T cd04281 89 GIVVHELGHVIGFWHEHT 106 (200)
T ss_pred chHHHHHHHHhcCcchhc
Confidence 489999999999999964
No 67
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=84.19 E-value=0.41 Score=49.04 Aligned_cols=18 Identities=28% Similarity=0.340 Sum_probs=16.4
Q ss_pred hHHhhhhhhhCCCCCCCC
Q psy14093 350 LVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 350 ~t~AHElGHnlG~~HD~~ 367 (857)
.|+.|||||.||+.|...
T Consensus 76 g~v~HE~~HalG~~HEh~ 93 (180)
T cd04280 76 GTIVHELMHALGFYHEQS 93 (180)
T ss_pred chhHHHHHHHhcCcchhc
Confidence 589999999999999974
No 68
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=82.68 E-value=0.45 Score=49.26 Aligned_cols=20 Identities=30% Similarity=0.342 Sum_probs=15.4
Q ss_pred hhHHhhhhhhhCCCCCCCCC
Q psy14093 349 DLVTAHEFGHNWGSEHDPDM 368 (857)
Q Consensus 349 ~~t~AHElGHnlG~~HD~~~ 368 (857)
.-|++|||||.||+.|.-..
T Consensus 80 ~~~i~HEl~HaLG~~HEh~R 99 (191)
T PF01400_consen 80 VGTILHELGHALGFWHEHQR 99 (191)
T ss_dssp HHHHHHHHHHHHTB--GGGS
T ss_pred ccchHHHHHHHHhhhhhhhc
Confidence 45899999999999999753
No 69
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=81.38 E-value=0.55 Score=54.35 Aligned_cols=18 Identities=39% Similarity=0.359 Sum_probs=15.0
Q ss_pred cchhhHHhhhhhhhCCCC
Q psy14093 346 READLVTAHEFGHNWGSE 363 (857)
Q Consensus 346 ~~~~~t~AHElGHnlG~~ 363 (857)
.+.+-|+||||||..|=.
T Consensus 128 sElagViAHEigHv~qrH 145 (484)
T COG4783 128 SELAGVIAHEIGHVAQRH 145 (484)
T ss_pred HHHHHHHHHHHHHHhhhh
Confidence 367999999999988743
No 70
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=80.66 E-value=0.53 Score=51.82 Aligned_cols=37 Identities=30% Similarity=0.228 Sum_probs=23.4
Q ss_pred hhHHhhhhhhhCCCCCCCCCC-CCCCCCCCCCceEEEee
Q psy14093 349 DLVTAHEFGHNWGSEHDPDMP-ECSPSASQGGSYLMYTY 386 (857)
Q Consensus 349 ~~t~AHElGHnlG~~HD~~~~-~C~~~~~~~g~~IM~~~ 386 (857)
..|+|||+||.||+++-=+.. .-.. ..-+.-.||+..
T Consensus 166 igv~~HE~gH~lGLPDlY~~~~~~~~-~~vG~w~lM~~G 203 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYDTSYDGGG-EPVGYWSLMSSG 203 (286)
T ss_pred eeeeehhhhcccCCCccccCcCCCCC-CCccccccccCC
Confidence 569999999999998764321 1110 012455699873
No 71
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=80.46 E-value=0.73 Score=49.20 Aligned_cols=18 Identities=33% Similarity=0.484 Sum_probs=16.2
Q ss_pred hHHhhhhhhhCCCCCCCC
Q psy14093 350 LVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 350 ~t~AHElGHnlG~~HD~~ 367 (857)
-|++|||||.||+.|.-.
T Consensus 122 Gti~HEl~HalGf~HEqs 139 (230)
T cd04282 122 ATVEHEFLHALGFYHEQS 139 (230)
T ss_pred chHHHHHHHHhCCccccc
Confidence 489999999999999974
No 72
>PF13948 DUF4215: Domain of unknown function (DUF4215)
Probab=75.37 E-value=1.3 Score=35.39 Aligned_cols=21 Identities=43% Similarity=0.781 Sum_probs=18.4
Q ss_pred ccccCCccccccccccCCCCC
Q psy14093 708 ESFCGNLRVEGDEECDAGLLG 728 (857)
Q Consensus 708 ~~~CGNgiVE~gEeCDCG~~~ 728 (857)
.++||++||...||||=|-..
T Consensus 18 ~~~CGDgii~~~E~CDD~N~~ 38 (47)
T PF13948_consen 18 EPICGDGIIVGDEQCDDGNSI 38 (47)
T ss_pred ccccCCCeEECCcccCCCCcc
Confidence 588999999999999998543
No 73
>KOG3714|consensus
Probab=72.64 E-value=1.3 Score=51.33 Aligned_cols=18 Identities=28% Similarity=0.294 Sum_probs=16.1
Q ss_pred hHHhhhhhhhCCCCCCCC
Q psy14093 350 LVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 350 ~t~AHElGHnlG~~HD~~ 367 (857)
-|++|||+|.||..|...
T Consensus 161 G~i~HEl~HaLGf~Hehs 178 (411)
T KOG3714|consen 161 GTIVHELMHALGFWHEHS 178 (411)
T ss_pred chhHHHHHHHhhhhhccC
Confidence 489999999999999864
No 74
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=70.55 E-value=2.7 Score=48.72 Aligned_cols=17 Identities=41% Similarity=0.741 Sum_probs=15.3
Q ss_pred HHhhhhhhhCCCCCCCC
Q psy14093 351 VTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 351 t~AHElGHnlG~~HD~~ 367 (857)
-+-|||||.||++|-.+
T Consensus 319 A~lHEiGH~fg~pH~~~ 335 (423)
T PF12044_consen 319 AFLHEIGHLFGCPHQED 335 (423)
T ss_pred HHHHHHHHhcCCCCCCC
Confidence 47899999999999985
No 75
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=69.96 E-value=1.7 Score=41.31 Aligned_cols=20 Identities=35% Similarity=0.639 Sum_probs=17.3
Q ss_pred chhhHHhhhhhhhCCCCCCC
Q psy14093 347 EADLVTAHEFGHNWGSEHDP 366 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~HD~ 366 (857)
....|+-|||||-||+.-|.
T Consensus 108 ~vthvliHEIgHhFGLsDdd 127 (136)
T COG3824 108 QVTHVLIHEIGHHFGLSDDD 127 (136)
T ss_pred HhhhhhhhhhhhhcCCChhH
Confidence 45789999999999998775
No 76
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=68.19 E-value=1.9 Score=39.74 Aligned_cols=14 Identities=36% Similarity=0.565 Sum_probs=11.7
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
.-.+++||||||.+
T Consensus 41 ~~~f~laHELgH~~ 54 (122)
T PF06114_consen 41 RQRFTLAHELGHIL 54 (122)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 34789999999975
No 77
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=66.23 E-value=3 Score=46.77 Aligned_cols=26 Identities=15% Similarity=0.307 Sum_probs=21.4
Q ss_pred cccccchhhHHhhhhhhhCCCCCCCC
Q psy14093 342 RVITREADLVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 342 ~v~~~~~~~t~AHElGHnlG~~HD~~ 367 (857)
.+|......|+||||+|.+|.-..++
T Consensus 190 ~~p~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 190 DLPPYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred CCCcccccHHHHHHHHHHhCCCCHHH
Confidence 34555678999999999999998875
No 78
>PF13948 DUF4215: Domain of unknown function (DUF4215)
Probab=65.38 E-value=3 Score=33.31 Aligned_cols=20 Identities=20% Similarity=0.129 Sum_probs=17.8
Q ss_pred CcccCccccCCCCCCccCCCcc
Q psy14093 423 SSILGLAYIASPRPYSIGGGIL 444 (857)
Q Consensus 423 ~~~CGN~ive~g~~EeCDcG~~ 444 (857)
.++||++||... ||||-|-.
T Consensus 18 ~~~CGDgii~~~--E~CDD~N~ 37 (47)
T PF13948_consen 18 EPICGDGIIVGD--EQCDDGNS 37 (47)
T ss_pred ccccCCCeEECC--cccCCCCc
Confidence 579999999999 99999854
No 79
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=60.37 E-value=3.5 Score=43.52 Aligned_cols=13 Identities=38% Similarity=0.657 Sum_probs=11.4
Q ss_pred hhhHHhhhhhhhC
Q psy14093 348 ADLVTAHEFGHNW 360 (857)
Q Consensus 348 ~~~t~AHElGHnl 360 (857)
--+|+||||||.|
T Consensus 72 ~rFtlAHELGH~l 84 (213)
T COG2856 72 KRFTLAHELGHAL 84 (213)
T ss_pred HHHHHHHHHhHHH
Confidence 3689999999987
No 80
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=58.53 E-value=4.9 Score=43.88 Aligned_cols=20 Identities=40% Similarity=0.694 Sum_probs=13.4
Q ss_pred chhhHHhhhhhhhCCCCCCC
Q psy14093 347 EADLVTAHEFGHNWGSEHDP 366 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~HD~ 366 (857)
.+..|+.||+||.||--=|.
T Consensus 215 ~~~~v~vHE~GHsf~~LaDE 234 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLADE 234 (264)
T ss_dssp THHHHHHHHHHHHTT-----
T ss_pred cccceeeeeccccccccccc
Confidence 35789999999999998886
No 81
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=57.82 E-value=4.5 Score=47.88 Aligned_cols=27 Identities=44% Similarity=0.806 Sum_probs=19.5
Q ss_pred cccccCCccccccchhhHHhhhhhhhC---CCCCCCC
Q psy14093 334 SSRNHYGQRVITREADLVTAHEFGHNW---GSEHDPD 367 (857)
Q Consensus 334 ~~~n~~G~~v~~~~~~~t~AHElGHnl---G~~HD~~ 367 (857)
+..| || -...||+|||||-| |+..|++
T Consensus 480 ~a~N-YG------gIGaVIgHEI~HgFDdqGakfD~~ 509 (654)
T COG3590 480 SAAN-YG------GIGAVIGHEIGHGFDDQGAKFDGD 509 (654)
T ss_pred hhhc-cc------CccceehhhhcccccCCccccCCC
Confidence 3446 77 45789999999987 5556654
No 82
>KOG4525|consensus
Probab=56.26 E-value=7.3 Score=44.60 Aligned_cols=17 Identities=35% Similarity=0.755 Sum_probs=15.3
Q ss_pred HHhhhhhhhCCCCCCCC
Q psy14093 351 VTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 351 t~AHElGHnlG~~HD~~ 367 (857)
.|-|||||.||.+|.++
T Consensus 304 A~~HElGH~lgcpHq~~ 320 (614)
T KOG4525|consen 304 AVCHELGHCLGCPHQSE 320 (614)
T ss_pred HHHHHhhhccCCCCCCC
Confidence 46799999999999976
No 83
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=55.72 E-value=3.9 Score=39.99 Aligned_cols=29 Identities=17% Similarity=0.261 Sum_probs=21.9
Q ss_pred cccchhhheeeeeeecccccccCCCCCCcccCC
Q psy14093 649 YFDTLHSSHISHTIVKRGVQESNHPFNKIKEVT 681 (857)
Q Consensus 649 ~~~~~~aheigh~l~~~g~~~d~~~~gcim~~~ 681 (857)
.+..++.|||||.| |+.|. +.-..+|...
T Consensus 104 ~~~~v~~HEiGHaL---GL~H~-~~~~svM~~~ 132 (154)
T PF00413_consen 104 DLQSVAIHEIGHAL---GLDHS-NDPNSVMYPY 132 (154)
T ss_dssp EHHHHHHHHHHHHT---TBESS-SSTTSTTSSS
T ss_pred hhhhhhhhcccccc---CcCcC-CCcccceeee
Confidence 44568899999999 99876 3457788743
No 84
>PF12115 Salp15: Salivary protein of 15kDa inhibits CD4+ T cell activation; InterPro: IPR021971 This is a family of 15kDa salivary proteins from Acari Arachnids that is induced on feeding and assists the parasite to remain attached to its arthropod host. By repressing calcium fluxes triggered by TCR engagement, Salp15 inhibits CD4+ T cell activation. Salp15 shows weak similarity to Inhibin A, a member of the TGF-beta superfamily that inhibits the production of cytokines and the proliferation of T cells.
Probab=55.26 E-value=7.5 Score=37.20 Aligned_cols=21 Identities=33% Similarity=0.835 Sum_probs=18.6
Q ss_pred CCCCCcCC-CCceecCCccchH
Q psy14093 520 MSDNTGCL-ERGKCRGGKCIPF 540 (857)
Q Consensus 520 ~~dgt~C~-~~~~C~~G~C~~~ 540 (857)
++|||||+ .+.+|.+|.|+..
T Consensus 93 LpdgtPCg~~~~~C~~G~Cv~~ 114 (118)
T PF12115_consen 93 LPDGTPCGPNGQTCKNGKCVGH 114 (118)
T ss_pred CCCCCccCCCcCEECCCccCCC
Confidence 89999999 5789999999863
No 85
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=51.80 E-value=7.3 Score=38.57 Aligned_cols=19 Identities=11% Similarity=0.373 Sum_probs=15.2
Q ss_pred ccchhhheeeeeeecccccccC
Q psy14093 650 FDTLHSSHISHTIVKRGVQESN 671 (857)
Q Consensus 650 ~~~~~aheigh~l~~~g~~~d~ 671 (857)
+-.+++|||||.| |+.|-.
T Consensus 94 ~~~~~~HEiGHaL---GL~H~~ 112 (165)
T cd04268 94 LRNTAEHELGHAL---GLRHNF 112 (165)
T ss_pred HHHHHHHHHHHHh---cccccC
Confidence 4467899999999 987653
No 86
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=50.33 E-value=7.2 Score=40.51 Aligned_cols=18 Identities=39% Similarity=0.405 Sum_probs=14.8
Q ss_pred cchhhHHhhhhhhhCCCC
Q psy14093 346 READLVTAHEFGHNWGSE 363 (857)
Q Consensus 346 ~~~~~t~AHElGHnlG~~ 363 (857)
.+.+.|+||||||...=.
T Consensus 87 ~el~aVlaHElgH~~~~h 104 (226)
T PF01435_consen 87 DELAAVLAHELGHIKHRH 104 (226)
T ss_dssp HHHHHHHHHHHHHHHTTH
T ss_pred HHHHHHHHHHHHHHHcCC
Confidence 478999999999997433
No 87
>PRK01345 heat shock protein HtpX; Provisional
Probab=50.25 E-value=6.2 Score=44.22 Aligned_cols=15 Identities=40% Similarity=0.379 Sum_probs=13.4
Q ss_pred cchhhHHhhhhhhhC
Q psy14093 346 READLVTAHEFGHNW 360 (857)
Q Consensus 346 ~~~~~t~AHElGHnl 360 (857)
.+.+-|+||||||.-
T Consensus 122 dEL~aVlAHElgHi~ 136 (317)
T PRK01345 122 EEVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHHHH
Confidence 478999999999987
No 88
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=49.97 E-value=4.4 Score=49.41 Aligned_cols=17 Identities=47% Similarity=0.520 Sum_probs=14.8
Q ss_pred hhHHhhhhhhhCCCCCC
Q psy14093 349 DLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 349 ~~t~AHElGHnlG~~HD 365 (857)
.-|+|||+||.||++--
T Consensus 222 iGVfaHEfGH~LGLPDl 238 (645)
T PF05547_consen 222 IGVFAHEFGHDLGLPDL 238 (645)
T ss_pred eEEEEeeccccCCCCCc
Confidence 45999999999999854
No 89
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=49.08 E-value=8.8 Score=35.57 Aligned_cols=20 Identities=20% Similarity=0.356 Sum_probs=14.9
Q ss_pred cchhhHHhhhhhhhCCCCCC
Q psy14093 346 READLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 346 ~~~~~t~AHElGHnlG~~HD 365 (857)
.....|+-||+||.||+.-|
T Consensus 71 ~~I~~tlvhEiah~fG~~~e 90 (97)
T PF06262_consen 71 ELIRDTLVHEIAHHFGISDE 90 (97)
T ss_dssp HHHHHHHHHHHHHHTT--HH
T ss_pred HHHHHHHHHHHHHHcCCCHH
Confidence 35678999999999998754
No 90
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=48.90 E-value=6.7 Score=42.83 Aligned_cols=15 Identities=47% Similarity=0.579 Sum_probs=13.1
Q ss_pred cchhhHHhhhhhhhC
Q psy14093 346 READLVTAHEFGHNW 360 (857)
Q Consensus 346 ~~~~~t~AHElGHnl 360 (857)
.+...|+|||+||..
T Consensus 155 dEl~aVlaHElgHi~ 169 (302)
T COG0501 155 DELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHHh
Confidence 578999999999974
No 91
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=47.33 E-value=14 Score=44.62 Aligned_cols=14 Identities=36% Similarity=0.406 Sum_probs=12.1
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
-.+.|+|||+||.+
T Consensus 336 ~dv~TL~HElGHa~ 349 (549)
T TIGR02289 336 GDIDVLTHEAGHAF 349 (549)
T ss_pred hHHHHHHHHhhHHH
Confidence 35789999999987
No 92
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=45.00 E-value=16 Score=44.41 Aligned_cols=14 Identities=36% Similarity=0.627 Sum_probs=12.0
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
-.+.|+|||+||.+
T Consensus 377 ~dv~TLaHElGHa~ 390 (591)
T TIGR00181 377 NSVFTLAHELGHSM 390 (591)
T ss_pred chHHHHHHHhhhHH
Confidence 35889999999985
No 93
>PRK03001 M48 family peptidase; Provisional
Probab=43.72 E-value=10 Score=41.65 Aligned_cols=15 Identities=47% Similarity=0.454 Sum_probs=13.2
Q ss_pred cchhhHHhhhhhhhC
Q psy14093 346 READLVTAHEFGHNW 360 (857)
Q Consensus 346 ~~~~~t~AHElGHnl 360 (857)
.+.+.|+||||||.-
T Consensus 122 ~El~aVlAHElgHi~ 136 (283)
T PRK03001 122 REIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHh
Confidence 578999999999975
No 94
>PRK02870 heat shock protein HtpX; Provisional
Probab=43.35 E-value=9.9 Score=42.97 Aligned_cols=18 Identities=39% Similarity=0.477 Sum_probs=14.5
Q ss_pred cchhhHHhhhhhhhCCCCCC
Q psy14093 346 READLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 346 ~~~~~t~AHElGHnlG~~HD 365 (857)
.+.+-|+||||||. ..+|
T Consensus 171 dEL~aVlAHELgHi--k~~d 188 (336)
T PRK02870 171 DELQAVMAHELSHI--RHGD 188 (336)
T ss_pred HHHHHHHHHHHHHH--Hccc
Confidence 57899999999998 4444
No 95
>PRK03982 heat shock protein HtpX; Provisional
Probab=43.22 E-value=11 Score=41.67 Aligned_cols=15 Identities=40% Similarity=0.353 Sum_probs=13.1
Q ss_pred cchhhHHhhhhhhhC
Q psy14093 346 READLVTAHEFGHNW 360 (857)
Q Consensus 346 ~~~~~t~AHElGHnl 360 (857)
.+.+.|+||||||.-
T Consensus 123 ~El~AVlAHElgHi~ 137 (288)
T PRK03982 123 DELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHHH
Confidence 478999999999975
No 96
>PRK05457 heat shock protein HtpX; Provisional
Probab=42.92 E-value=10 Score=41.86 Aligned_cols=15 Identities=40% Similarity=0.446 Sum_probs=12.9
Q ss_pred cchhhHHhhhhhhhC
Q psy14093 346 READLVTAHEFGHNW 360 (857)
Q Consensus 346 ~~~~~t~AHElGHnl 360 (857)
.+...|+||||||.-
T Consensus 132 ~El~aVlAHElgHi~ 146 (284)
T PRK05457 132 DEVEAVLAHEISHIA 146 (284)
T ss_pred HHHHHHHHHHHHHHH
Confidence 578999999999963
No 97
>PRK01265 heat shock protein HtpX; Provisional
Probab=41.96 E-value=9.8 Score=42.80 Aligned_cols=14 Identities=43% Similarity=0.655 Sum_probs=12.5
Q ss_pred cchhhHHhhhhhhh
Q psy14093 346 READLVTAHEFGHN 359 (857)
Q Consensus 346 ~~~~~t~AHElGHn 359 (857)
.+...|+||||||.
T Consensus 138 ~El~aVlAHElgHi 151 (324)
T PRK01265 138 DEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHH
Confidence 57889999999995
No 98
>PRK03072 heat shock protein HtpX; Provisional
Probab=41.49 E-value=12 Score=41.43 Aligned_cols=16 Identities=38% Similarity=0.528 Sum_probs=13.5
Q ss_pred cchhhHHhhhhhhhCC
Q psy14093 346 READLVTAHEFGHNWG 361 (857)
Q Consensus 346 ~~~~~t~AHElGHnlG 361 (857)
.+...|+||||||.-.
T Consensus 125 ~El~aVlAHElgHi~~ 140 (288)
T PRK03072 125 RELRGVLGHELSHVYN 140 (288)
T ss_pred HHHHHHHHHHHHHHhc
Confidence 5789999999999753
No 99
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=39.91 E-value=10 Score=37.59 Aligned_cols=29 Identities=7% Similarity=0.140 Sum_probs=20.4
Q ss_pred cccchhhheeeeeeecccccccCCCCCCcccC
Q psy14093 649 YFDTLHSSHISHTIVKRGVQESNHPFNKIKEV 680 (857)
Q Consensus 649 ~~~~~~aheigh~l~~~g~~~d~~~~gcim~~ 680 (857)
.+..++.|||||.| |+.|....-..+|..
T Consensus 103 ~~~~~~~HEiGHaL---GL~H~~~~~~siM~p 131 (156)
T cd04279 103 NLQAIALHELGHAL---GLWHHSDRPEDAMYP 131 (156)
T ss_pred HHHHHHHHHhhhhh---cCCCCCCCccceeee
Confidence 34568899999999 987763322377763
No 100
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=39.25 E-value=11 Score=38.42 Aligned_cols=23 Identities=9% Similarity=0.109 Sum_probs=19.0
Q ss_pred hhhheeeeeeecccccccCCCCCCccc
Q psy14093 653 LHSSHISHTIVKRGVQESNHPFNKIKE 679 (857)
Q Consensus 653 ~~aheigh~l~~~g~~~d~~~~gcim~ 679 (857)
-+.||+||.+ |..|- .|-.|+|+
T Consensus 127 Ev~HElGH~~---GL~HC-~N~~CVM~ 149 (181)
T COG1913 127 EVLHELGHLL---GLSHC-PNPRCVMN 149 (181)
T ss_pred HHHHHhhhhc---CcccC-CCCCcEEe
Confidence 4589999999 98887 35689997
No 101
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=39.20 E-value=17 Score=37.45 Aligned_cols=24 Identities=25% Similarity=0.278 Sum_probs=20.3
Q ss_pred cccchhhHHhhhhhhhCCCCCCCC
Q psy14093 344 ITREADLVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 344 ~~~~~~~t~AHElGHnlG~~HD~~ 367 (857)
|......|++|||.|..-++|...
T Consensus 160 P~~~idYVvvHEL~Hl~~~nHs~~ 183 (205)
T PF01863_consen 160 PPEVIDYVVVHELCHLRHPNHSKR 183 (205)
T ss_pred CccHHHHHHHHHHHHhccCCCCHH
Confidence 445678999999999999999863
No 102
>PRK04897 heat shock protein HtpX; Provisional
Probab=38.99 E-value=12 Score=41.43 Aligned_cols=15 Identities=33% Similarity=0.379 Sum_probs=13.2
Q ss_pred cchhhHHhhhhhhhC
Q psy14093 346 READLVTAHEFGHNW 360 (857)
Q Consensus 346 ~~~~~t~AHElGHnl 360 (857)
.+...|+||||||.-
T Consensus 135 ~El~aVlAHElgHi~ 149 (298)
T PRK04897 135 EELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHHh
Confidence 578899999999975
No 103
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=38.17 E-value=16 Score=38.17 Aligned_cols=20 Identities=40% Similarity=0.722 Sum_probs=17.2
Q ss_pred chhhHHhhhhhhhCCCCCCC
Q psy14093 347 EADLVTAHEFGHNWGSEHDP 366 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~HD~ 366 (857)
.-..+|=||+||-||-.|.+
T Consensus 138 YRqYvINHEVGH~LGh~H~~ 157 (203)
T PF11350_consen 138 YRQYVINHEVGHALGHGHEP 157 (203)
T ss_pred HHHHhhhhhhhhhcccCCCc
Confidence 45789999999999988875
No 104
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=38.11 E-value=13 Score=36.78 Aligned_cols=30 Identities=17% Similarity=0.237 Sum_probs=21.7
Q ss_pred cccchhhheeeeeeecccccccCCCCCCcccCCC
Q psy14093 649 YFDTLHSSHISHTIVKRGVQESNHPFNKIKEVTF 682 (857)
Q Consensus 649 ~~~~~~aheigh~l~~~g~~~d~~~~gcim~~~f 682 (857)
.+-.++.|||||.| |+.|- +..+.||...+
T Consensus 106 ~~~~~~~HEiGHaL---GL~H~-~~~~~vM~~~~ 135 (157)
T cd04278 106 DLFSVAAHEIGHAL---GLGHS-SDPDSIMYPYY 135 (157)
T ss_pred hHHHHHHHHhcccc---ccCCC-CCCcCeecccc
Confidence 34457899999999 98764 34578886433
No 105
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=38.11 E-value=10 Score=38.08 Aligned_cols=18 Identities=17% Similarity=0.429 Sum_probs=12.7
Q ss_pred ccchhhheeeeeeeccccccc
Q psy14093 650 FDTLHSSHISHTIVKRGVQES 670 (857)
Q Consensus 650 ~~~~~aheigh~l~~~g~~~d 670 (857)
.+.+++||+||-| |+.|.
T Consensus 69 ~g~TltHEvGH~L---GL~Ht 86 (154)
T PF05572_consen 69 FGKTLTHEVGHWL---GLYHT 86 (154)
T ss_dssp SSHHHHHHHHHHT---T---T
T ss_pred cccchhhhhhhhh---ccccc
Confidence 4568999999999 87775
No 106
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=37.75 E-value=23 Score=37.76 Aligned_cols=45 Identities=13% Similarity=0.276 Sum_probs=27.9
Q ss_pred CCcceeeeecCC-CC--Ccccceec-c--------ccccccchhhheeeeeeeccccccc
Q psy14093 623 SSILGLAYIASP-RP--YSIGGSIH-E--------NLKYFDTLHSSHISHTIVKRGVQES 670 (857)
Q Consensus 623 ~~tvGlA~vgs~-~~--~~s~Gv~~-~--------~~~~~~~~~aheigh~l~~~g~~~d 670 (857)
.+.+|.|+.=.. .. ....||+- . ....+..+++||+||-| |+.|.
T Consensus 98 ~~~lG~a~fP~~~~~~~~~~dGvvi~~~~~~~~~~~~~n~g~t~~HEvGH~l---GL~Ht 154 (225)
T cd04275 98 GGLLGYATFPDSLVSLAFITDGVVINPSSLPGGSAAPYNLGDTATHEVGHWL---GLYHT 154 (225)
T ss_pred CCcCEEEECCCcccCCccccceEEEeccccCCCCcccccccceeEEecccee---eeeee
Confidence 577898875443 21 23345541 1 12234568999999999 88776
No 107
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=37.37 E-value=13 Score=38.42 Aligned_cols=44 Identities=25% Similarity=0.326 Sum_probs=27.0
Q ss_pred chhhHHhhhhhhhC---CCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHhc--cccccC
Q psy14093 347 EADLVTAHEFGHNW---GSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFTH--QSFWSR 421 (857)
Q Consensus 347 ~~~~t~AHElGHnl---G~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~~--~Cl~~~ 421 (857)
.++.+|||||+|.| |...|.+ | .....||+ -+...+.+ +||.+.
T Consensus 35 ~lG~ilahel~hafd~~g~~~D~~----------------------g--~~~~wWs~-------~~~~~~~~~~~Cl~~q 83 (206)
T PF01431_consen 35 GLGFILAHELMHAFDPEGINYDED----------------------G--NLRNWWSE-------ESRSNFKERAQCLRDQ 83 (206)
T ss_dssp THHHHHHHHHHHCTSTTGGGB-TT----------------------S---BS--S-H-------HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHhHhhcCcc----------------------c--ccccccch-------hhHHHHHHHHHHHHHH
Confidence 56899999999987 3343432 1 12356888 77777776 688753
No 108
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=36.80 E-value=20 Score=38.08 Aligned_cols=14 Identities=50% Similarity=0.593 Sum_probs=11.7
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
.+.-|.|||.||.+
T Consensus 88 aAvaVAAHEvGHAi 101 (222)
T PF04298_consen 88 AAVAVAAHEVGHAI 101 (222)
T ss_pred HHHHHHHHHHhHHH
Confidence 46779999999975
No 109
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=36.75 E-value=25 Score=42.65 Aligned_cols=14 Identities=36% Similarity=0.577 Sum_probs=12.0
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
..+.|+|||+||.+
T Consensus 374 ~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 374 RDVSTLAHELGHAY 387 (587)
T ss_pred hhHHHHHHHhhHHH
Confidence 35789999999987
No 110
>PRK02391 heat shock protein HtpX; Provisional
Probab=34.89 E-value=17 Score=40.33 Aligned_cols=14 Identities=43% Similarity=0.570 Sum_probs=12.4
Q ss_pred cchhhHHhhhhhhh
Q psy14093 346 READLVTAHEFGHN 359 (857)
Q Consensus 346 ~~~~~t~AHElGHn 359 (857)
.+...|+||||||.
T Consensus 131 ~El~aVlaHElgHi 144 (296)
T PRK02391 131 DELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHHH
Confidence 47889999999995
No 111
>PTZ00337 surface protease GP63; Provisional
Probab=32.68 E-value=44 Score=40.52 Aligned_cols=19 Identities=42% Similarity=0.538 Sum_probs=15.7
Q ss_pred chhhHHhhhhhhhCCCCCC
Q psy14093 347 EADLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~HD 365 (857)
....|++|||.|.||...+
T Consensus 228 ~~~~v~~HEi~HALGFs~~ 246 (567)
T PTZ00337 228 GDVRVAAHELGHALGFVRD 246 (567)
T ss_pred HHHHHHHHHHHHHHccCHH
Confidence 3467999999999999654
No 112
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=31.22 E-value=21 Score=39.54 Aligned_cols=13 Identities=46% Similarity=0.833 Sum_probs=10.6
Q ss_pred chhhHHhhhhhhh
Q psy14093 347 EADLVTAHEFGHN 359 (857)
Q Consensus 347 ~~~~t~AHElGHn 359 (857)
..+.|||||.||-
T Consensus 169 a~ayVlAHEyGHH 181 (292)
T PF04228_consen 169 AQAYVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHHHHH
Confidence 4577999999983
No 113
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=30.25 E-value=19 Score=43.84 Aligned_cols=16 Identities=38% Similarity=0.493 Sum_probs=13.0
Q ss_pred chhhHHhhhhhhhCCC
Q psy14093 347 EADLVTAHEFGHNWGS 362 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~ 362 (857)
-.+.|+|||+||.+=+
T Consensus 379 ~dV~TLaHElGHs~Hs 394 (598)
T COG1164 379 RDVFTLAHELGHSVHS 394 (598)
T ss_pred hHHHHHHHHccHHHHH
Confidence 4689999999998643
No 114
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=29.92 E-value=23 Score=37.21 Aligned_cols=23 Identities=22% Similarity=0.273 Sum_probs=16.9
Q ss_pred ccccchhhHHhhhhhhhCCCCCC
Q psy14093 343 VITREADLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 343 v~~~~~~~t~AHElGHnlG~~HD 365 (857)
+|-..+..++|||++|.|=.-..
T Consensus 88 LPrll~gsiLAHE~mHa~Lrl~g 110 (212)
T PF12315_consen 88 LPRLLTGSILAHELMHAWLRLNG 110 (212)
T ss_pred CCHHHHhhHHHHHHHHHHhcccC
Confidence 34445688999999999855544
No 115
>KOG2719|consensus
Probab=29.45 E-value=30 Score=40.16 Aligned_cols=14 Identities=50% Similarity=0.567 Sum_probs=11.8
Q ss_pred cchhhHHhhhhhhh
Q psy14093 346 READLVTAHEFGHN 359 (857)
Q Consensus 346 ~~~~~t~AHElGHn 359 (857)
.+.+.|+|||+||-
T Consensus 278 eel~AVl~HELGHW 291 (428)
T KOG2719|consen 278 EELVAVLAHELGHW 291 (428)
T ss_pred HHHHHHHHHHhhHH
Confidence 36788999999994
No 116
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=29.29 E-value=15 Score=35.66 Aligned_cols=24 Identities=13% Similarity=0.461 Sum_probs=13.5
Q ss_pred hhhheeeeeeecccccccC-CCCCCccc
Q psy14093 653 LHSSHISHTIVKRGVQESN-HPFNKIKE 679 (857)
Q Consensus 653 ~~aheigh~l~~~g~~~d~-~~~gcim~ 679 (857)
+.||||||+| |+.+.- ..++-+|+
T Consensus 80 IaaHE~GHiL---GLPD~y~GpCS~LMS 104 (132)
T PF02031_consen 80 IAAHELGHIL---GLPDHYPGPCSELMS 104 (132)
T ss_dssp HHHHHHHHHH---T----TTS-TT-GGG
T ss_pred eeeehhcccc---CCCCCCCCccHHhhc
Confidence 7899999999 876542 23455664
No 117
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=28.97 E-value=84 Score=36.70 Aligned_cols=14 Identities=36% Similarity=0.551 Sum_probs=11.9
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
..+.|++||+||.+
T Consensus 207 ~~v~tLfHEfGHal 220 (422)
T cd06456 207 DEVTTLFHEFGHAL 220 (422)
T ss_pred HHHHHHHHHHHHHH
Confidence 45789999999976
No 118
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=27.99 E-value=25 Score=32.36 Aligned_cols=16 Identities=44% Similarity=0.727 Sum_probs=12.5
Q ss_pred hhhHHhhhhhhhCCCC
Q psy14093 348 ADLVTAHEFGHNWGSE 363 (857)
Q Consensus 348 ~~~t~AHElGHnlG~~ 363 (857)
...|++||++|.+=..
T Consensus 25 ~~~~l~HE~~H~~~~~ 40 (128)
T PF13485_consen 25 LDRVLAHELAHQWFGN 40 (128)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4589999999997333
No 119
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=27.60 E-value=18 Score=37.71 Aligned_cols=89 Identities=18% Similarity=0.258 Sum_probs=42.1
Q ss_pred ccCcccccccccccccccccC---CCCCCc----c-eEEEEEeeeecccC-CCcceeeeecCCCCCcccceecc--cc--
Q psy14093 581 HYNMIRDKWDVRNLLETFSSH---VDGTNF----C-LAHLFTHQSFWSRG-SSILGLAYIASPRPYSIGGSIHE--NL-- 647 (857)
Q Consensus 581 ~~n~c~r~W~~~~lL~~~~~~---~~hD~~----c-~AhLft~~df~~~~-~~tvGlA~vgs~~~~~s~Gv~~~--~~-- 647 (857)
.++.-...+....++.++... ...|.+ . .+..+|..|.+.+. .-+.|+|..+ ...+|++. ..
T Consensus 60 a~~~~r~Qy~a~~ll~~L~~~~~~~~~d~~~k~~gi~~l~it~~DlY~~~~nfVFG~A~~~-----~~~aVvS~~~~~fy 134 (194)
T PF07998_consen 60 AYNWYRKQYDAEALLDRLKSLREEYPDDKVTKIFGITVLGITDRDLYSPGLNFVFGLARPG-----GGVAVVSTSRNEFY 134 (194)
T ss_dssp GEETTTTEEEHHHHHHHHHT--S----SEE------SEEEEESS-EEETTESEBSEEEECC-----SSEEEEEGGCGGGG
T ss_pred cCCcccceeeHHHHHHHHHHHHHhhhhhhhhhccccceEEEeccccCCCCCceEEEEeecC-----CCeEEEEEeccccc
Confidence 344445566666767665432 122311 0 12556777753211 2345555542 23334321 10
Q ss_pred --------ccccchhhheeeeeeecccccccCCCCCCccc
Q psy14093 648 --------KYFDTLHSSHISHTIVKRGVQESNHPFNKIKE 679 (857)
Q Consensus 648 --------~~~~~~~aheigh~l~~~g~~~d~~~~gcim~ 679 (857)
.-+...+.||+||.+ |+.|-.. .|+|+
T Consensus 135 ~~~~~l~~~R~~Kea~HElGH~~---GL~HC~~--~CvM~ 169 (194)
T PF07998_consen 135 GEDEELFLERVCKEAVHELGHLF---GLDHCEN--RCVMN 169 (194)
T ss_dssp TSSHHHHHHHHHHHHHHHHHHHT---T----SS--TSTTS
T ss_pred cccHHHHHHHHHHHHHHHHHHHc---CCcCCCC--CCccC
Confidence 112346789999999 9999854 99997
No 120
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=27.44 E-value=16 Score=35.26 Aligned_cols=16 Identities=13% Similarity=0.441 Sum_probs=13.8
Q ss_pred chhhheeeeeeeccccccc
Q psy14093 652 TLHSSHISHTIVKRGVQES 670 (857)
Q Consensus 652 ~~~aheigh~l~~~g~~~d 670 (857)
.+++|||||.| |+.|.
T Consensus 88 ~~~~HEigHaL---Gl~H~ 103 (140)
T smart00235 88 GVAAHELGHAL---GLYHE 103 (140)
T ss_pred ccHHHHHHHHh---cCCcC
Confidence 47899999999 98765
No 121
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=27.04 E-value=30 Score=35.98 Aligned_cols=26 Identities=42% Similarity=0.629 Sum_probs=18.2
Q ss_pred ccCCccccccchhhHHhhhhhhhCCCCCC
Q psy14093 337 NHYGQRVITREADLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 337 n~~G~~v~~~~~~~t~AHElGHnlG~~HD 365 (857)
|++|..| ....|.|||+||.+==.||
T Consensus 84 ~y~g~Si---a~~aVAAHEVGHAiQd~~~ 109 (226)
T COG2738 84 NYYGPSI---AAIAVAAHEVGHAIQDQED 109 (226)
T ss_pred ccCCccH---HHHHHHHHHhhHHHhhhcc
Confidence 4455444 3567999999998865555
No 122
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=26.96 E-value=26 Score=40.35 Aligned_cols=14 Identities=36% Similarity=0.717 Sum_probs=11.5
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
....|++||+||.+
T Consensus 221 ~~v~tl~HE~GHa~ 234 (427)
T cd06459 221 DDVFTLAHELGHAF 234 (427)
T ss_pred hhHHHHHHHhhHHH
Confidence 35789999999964
No 123
>KOG3624|consensus
Probab=26.32 E-value=35 Score=42.31 Aligned_cols=14 Identities=29% Similarity=0.579 Sum_probs=12.4
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
..+.|+||||+|.|
T Consensus 517 ~iG~vigHEl~H~F 530 (687)
T KOG3624|consen 517 GIGFVIGHELTHGF 530 (687)
T ss_pred HHHHHHHHHHhhcc
Confidence 57899999999966
No 124
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=26.19 E-value=27 Score=41.29 Aligned_cols=14 Identities=36% Similarity=0.422 Sum_probs=12.0
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
..+.|++||+||.+
T Consensus 262 ~~V~TLfHEfGHal 275 (472)
T cd06455 262 DEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHH
Confidence 45889999999976
No 125
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=26.04 E-value=35 Score=34.16 Aligned_cols=11 Identities=45% Similarity=0.446 Sum_probs=9.5
Q ss_pred hhHHhhhhhhh
Q psy14093 349 DLVTAHEFGHN 359 (857)
Q Consensus 349 ~~t~AHElGHn 359 (857)
.-|+||||+|-
T Consensus 136 lDVvaHEltHG 146 (150)
T PF01447_consen 136 LDVVAHELTHG 146 (150)
T ss_dssp HHHHHHHHHHH
T ss_pred cceeeeccccc
Confidence 56999999995
No 126
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=25.69 E-value=26 Score=39.30 Aligned_cols=19 Identities=42% Similarity=0.574 Sum_probs=14.5
Q ss_pred cchhhHHhhhhhhhCCCCCC
Q psy14093 346 READLVTAHEFGHNWGSEHD 365 (857)
Q Consensus 346 ~~~~~t~AHElGHnlG~~HD 365 (857)
.+.|.|+||||||.- ++|-
T Consensus 122 sEvAAVl~HEmgHVt-AnHg 140 (479)
T COG4784 122 SEVAAVLAHEMGHVT-ANHG 140 (479)
T ss_pred HHHHHHHHhhhhhee-cchh
Confidence 367999999999964 4443
No 127
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=25.62 E-value=27 Score=40.86 Aligned_cols=14 Identities=36% Similarity=0.501 Sum_probs=11.7
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
..+.|++||+||.+
T Consensus 241 ~~v~tLfHE~GHa~ 254 (458)
T PF01432_consen 241 DDVETLFHEFGHAM 254 (458)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHhHHH
Confidence 35789999999965
No 128
>KOG1047|consensus
Probab=25.46 E-value=31 Score=41.11 Aligned_cols=14 Identities=43% Similarity=0.703 Sum_probs=12.0
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
..+.|+||||+|+|
T Consensus 287 sl~~vIaHEIAHSW 300 (613)
T KOG1047|consen 287 SLVDVIAHEIAHSW 300 (613)
T ss_pred chhhHHHHHhhhhh
Confidence 34789999999987
No 129
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=24.31 E-value=22 Score=37.50 Aligned_cols=18 Identities=22% Similarity=0.532 Sum_probs=14.7
Q ss_pred ccchhhheeeeeeeccccccc
Q psy14093 650 FDTLHSSHISHTIVKRGVQES 670 (857)
Q Consensus 650 ~~~~~aheigh~l~~~g~~~d 670 (857)
...+|+|||||.+ |+.|-
T Consensus 133 ~~hvi~HEiGH~I---GfRHT 150 (211)
T PF12388_consen 133 IEHVITHEIGHCI---GFRHT 150 (211)
T ss_pred HHHHHHHHhhhhc---ccccc
Confidence 4458999999999 88764
No 130
>KOG2090|consensus
Probab=23.83 E-value=94 Score=37.60 Aligned_cols=14 Identities=29% Similarity=0.321 Sum_probs=11.5
Q ss_pred chhhHHhhhhhhhC
Q psy14093 347 EADLVTAHEFGHNW 360 (857)
Q Consensus 347 ~~~~t~AHElGHnl 360 (857)
..+-|+-|||||.+
T Consensus 478 ~~vetLFHEmGHAM 491 (704)
T KOG2090|consen 478 SEVETLFHEMGHAM 491 (704)
T ss_pred HHHHHHHHHHHHHH
Confidence 45789999999964
No 131
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=23.74 E-value=40 Score=40.46 Aligned_cols=18 Identities=28% Similarity=0.255 Sum_probs=14.6
Q ss_pred chhhHHhhhhhhhCCCCC
Q psy14093 347 EADLVTAHEFGHNWGSEH 364 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~H 364 (857)
....|++|||+|.||..-
T Consensus 209 ~~~~~~~HEi~HaLGFs~ 226 (521)
T PF01457_consen 209 EFFRTVIHEIAHALGFSS 226 (521)
T ss_dssp HHHHHHHHHHHHHTT-SH
T ss_pred cccceeeeeeeeeeeecc
Confidence 456799999999999876
No 132
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=23.23 E-value=54 Score=29.17 Aligned_cols=13 Identities=31% Similarity=0.391 Sum_probs=10.7
Q ss_pred hhhHHhhhhhhhC
Q psy14093 348 ADLVTAHEFGHNW 360 (857)
Q Consensus 348 ~~~t~AHElGHnl 360 (857)
...++||||.|.+
T Consensus 61 ~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 61 GRALLAHELAHVV 73 (79)
T ss_pred cchhHhHHHHHHH
Confidence 3568999999975
No 133
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=23.18 E-value=23 Score=36.17 Aligned_cols=17 Identities=18% Similarity=0.393 Sum_probs=14.1
Q ss_pred chhhheeeeeeecccccccC
Q psy14093 652 TLHSSHISHTIVKRGVQESN 671 (857)
Q Consensus 652 ~~~aheigh~l~~~g~~~d~ 671 (857)
.++.|||||.| |+.|-.
T Consensus 115 ~t~~HEiGHaL---GL~H~~ 131 (186)
T cd04277 115 QTIIHEIGHAL---GLEHPG 131 (186)
T ss_pred HHHHHHHHHHh---cCCCCC
Confidence 47899999999 987754
No 134
>PF14891 Peptidase_M91: Effector protein
Probab=22.97 E-value=30 Score=35.23 Aligned_cols=21 Identities=19% Similarity=0.137 Sum_probs=17.6
Q ss_pred chhhHHhhhhhhhCCCCCCCC
Q psy14093 347 EADLVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 347 ~~~~t~AHElGHnlG~~HD~~ 367 (857)
..++++||||+|.+-+.++..
T Consensus 102 ~p~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 102 PPFVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHHHCCCC
Confidence 457899999999998888764
No 135
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=22.63 E-value=42 Score=34.80 Aligned_cols=22 Identities=27% Similarity=0.268 Sum_probs=19.2
Q ss_pred cchhhHHhhhhhhhCCCCCCCC
Q psy14093 346 READLVTAHEFGHNWGSEHDPD 367 (857)
Q Consensus 346 ~~~~~t~AHElGHnlG~~HD~~ 367 (857)
..+..||-|||.|+.=++||..
T Consensus 80 ~~i~~t~lHELaH~~~~~H~~~ 101 (186)
T PF08325_consen 80 ETILGTMLHELAHNVHGPHDDK 101 (186)
T ss_pred HHHHHHHHHHHHhcccCCccHH
Confidence 4567899999999999999974
No 136
>PF11403 Yeast_MT: Yeast metallothionein; InterPro: IPR022710 Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=21.39 E-value=1.3e+02 Score=22.46 Aligned_cols=37 Identities=24% Similarity=0.659 Sum_probs=17.6
Q ss_pred CceeecCCCCCCCCCcccCCCCCCCCCCCCCCCCCcCCC
Q psy14093 490 GMKCRDAQYATCEQESRCTGSSSVCPPSAPMSDNTGCLE 528 (857)
Q Consensus 490 g~~Cr~~~~~~C~~~~~C~G~s~~Cp~~~~~~dgt~C~~ 528 (857)
|+.|+-.- +.|...|.|-- |-.||+.-...|--||++
T Consensus 4 ghecqcqc-gscknneqcqk-scscptgcnsddkcpcgn 40 (40)
T PF11403_consen 4 GHECQCQC-GSCKNNEQCQK-SCSCPTGCNSDDKCPCGN 40 (40)
T ss_dssp SS----SS-STTTT-TTSTT-S-SS-TTTTSSTT--TT-
T ss_pred Cceeeeec-CCccChHHHhh-cCCCCCCCCCCCcCCCCC
Confidence 44444332 45666666654 567888777888778863
No 137
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=20.79 E-value=40 Score=37.80 Aligned_cols=17 Identities=12% Similarity=0.315 Sum_probs=14.8
Q ss_pred chhhheeeeeeecccccccC
Q psy14093 652 TLHSSHISHTIVKRGVQESN 671 (857)
Q Consensus 652 ~~~aheigh~l~~~g~~~d~ 671 (857)
.++.||||||+ |+.|-+
T Consensus 152 ~~~~HElgHN~---GL~Ha~ 168 (314)
T PF05548_consen 152 ATIMHELGHNL---GLWHAG 168 (314)
T ss_pred HHHHHHhhhhc---cccccC
Confidence 37899999999 998875
No 138
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=20.11 E-value=42 Score=35.28 Aligned_cols=19 Identities=32% Similarity=0.447 Sum_probs=16.0
Q ss_pred hhHHhhhhhhhCCCC-CCCC
Q psy14093 349 DLVTAHEFGHNWGSE-HDPD 367 (857)
Q Consensus 349 ~~t~AHElGHnlG~~-HD~~ 367 (857)
.-|.-|||||.||.. |-+-
T Consensus 188 ~~tarhElGhaLgi~ghsd~ 207 (236)
T COG5549 188 NPTARHELGHALGIWGHSDL 207 (236)
T ss_pred hHHHHHhhcchheecccccc
Confidence 568889999999999 8753
Done!