Query 037955
Match_columns 733
No_of_seqs 236 out of 1611
Neff 7.8
Searched_HMMs 46136
Date Fri Mar 29 06:05:56 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/037955.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/037955hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0339 Dcp Zn-dependent oligo 100.0 4E-147 8E-152 1213.7 68.6 649 47-724 14-682 (683)
2 PRK10280 dipeptidyl carboxypep 100.0 5E-139 1E-143 1204.5 71.0 645 48-724 14-680 (681)
3 PRK10911 oligopeptidase A; Pro 100.0 7E-133 2E-137 1158.1 71.4 647 49-724 9-679 (680)
4 KOG2090 Metalloendopeptidase f 100.0 4E-132 8E-137 1061.9 62.6 634 46-703 39-681 (704)
5 KOG2089 Metalloendopeptidase f 100.0 5E-129 1E-133 1051.9 57.7 649 50-727 36-712 (718)
6 cd06457 M3A_MIP Peptidase M3 m 100.0 1E-101 3E-106 866.1 46.9 435 267-701 2-448 (458)
7 cd06455 M3A_TOP Peptidase M3 T 100.0 2E-95 4E-100 820.6 52.1 468 190-721 1-472 (472)
8 cd06456 M3A_DCP_Oligopeptidase 100.0 1.1E-95 2E-100 807.1 46.9 418 290-723 1-422 (422)
9 PF01432 Peptidase_M3: Peptida 100.0 1.5E-89 3.2E-94 774.8 39.9 444 267-723 1-458 (458)
10 cd06258 Peptidase_M3_like The 100.0 4E-64 8.8E-69 551.9 40.5 358 293-720 2-364 (365)
11 TIGR02289 M3_not_pepF oligoend 100.0 6.4E-55 1.4E-59 500.9 46.4 505 74-701 8-526 (549)
12 TIGR00181 pepF oligoendopeptid 100.0 2.3E-55 5E-60 510.6 41.8 530 59-702 18-568 (591)
13 cd06459 M3B_Oligoendopeptidase 100.0 1.6E-51 3.4E-56 461.9 31.1 383 251-719 27-420 (427)
14 TIGR02290 M3_fam_3 oligoendope 100.0 2.6E-49 5.7E-54 458.6 47.9 506 87-720 49-572 (587)
15 COG1164 Oligoendopeptidase F [ 100.0 7.2E-41 1.6E-45 383.1 44.9 506 86-703 51-573 (598)
16 cd06461 M2_ACE Peptidase famil 99.9 7.7E-26 1.7E-30 251.7 30.5 380 261-720 33-465 (477)
17 cd06460 M32_Taq Peptidase fami 99.9 3.6E-23 7.9E-28 224.3 33.1 339 290-722 28-392 (396)
18 PF02074 Peptidase_M32: Carbox 99.8 6.1E-16 1.3E-20 172.4 45.7 413 125-721 44-490 (494)
19 PF01401 Peptidase_M2: Angiote 99.8 4.1E-16 9E-21 177.9 38.8 504 89-721 18-568 (595)
20 COG2317 Zn-dependent carboxype 99.7 4.7E-14 1E-18 150.9 37.4 416 125-722 44-492 (497)
21 KOG3690 Angiotensin I-converti 99.6 1.3E-10 2.7E-15 127.7 42.0 489 114-721 61-597 (646)
22 cd04277 ZnMc_serralysin_like Z 87.7 2.1 4.5E-05 42.3 7.9 31 400-434 38-68 (186)
23 COG2856 Predicted Zn peptidase 87.1 0.47 1E-05 47.9 2.8 31 478-517 57-87 (213)
24 cd04278 ZnMc_MMP Zinc-dependen 80.9 7.3 0.00016 37.3 8.1 15 500-514 105-119 (157)
25 PF06114 DUF955: Domain of unk 80.4 1.4 3.1E-05 39.3 2.9 33 479-520 28-60 (122)
26 PF14247 DUF4344: Domain of un 80.1 1.2 2.6E-05 45.2 2.4 19 501-519 91-109 (220)
27 PF00413 Peptidase_M10: Matrix 76.1 1.4 3E-05 41.8 1.5 22 399-420 25-46 (154)
28 cd04327 ZnMc_MMP_like_3 Zinc-d 75.9 14 0.00029 37.0 8.6 16 502-517 92-107 (198)
29 PF13058 DUF3920: Protein of u 69.2 3.5 7.7E-05 36.4 2.2 25 484-516 66-90 (126)
30 PF13398 Peptidase_M50B: Pepti 69.2 3.8 8.3E-05 41.1 2.8 22 501-522 21-42 (200)
31 cd04268 ZnMc_MMP_like Zinc-dep 68.6 2.9 6.2E-05 40.2 1.8 23 399-421 18-40 (165)
32 PF14891 Peptidase_M91: Effect 68.6 3.5 7.6E-05 40.4 2.4 21 501-521 102-122 (174)
33 PF13583 Reprolysin_4: Metallo 68.5 4.4 9.4E-05 40.9 3.1 23 398-420 32-54 (206)
34 cd04279 ZnMc_MMP_like_1 Zinc-d 68.3 2.8 6.1E-05 40.1 1.6 15 501-515 103-117 (156)
35 PF10460 Peptidase_M30: Peptid 65.4 6.3 0.00014 43.1 3.7 45 501-549 138-184 (366)
36 smart00235 ZnMc Zinc-dependent 61.4 4.1 8.9E-05 38.1 1.3 11 504-514 88-98 (140)
37 PRK13267 archaemetzincin-like 60.0 11 0.00024 37.1 4.1 18 497-514 120-137 (179)
38 PF13688 Reprolysin_5: Metallo 58.8 4.8 0.00011 39.9 1.3 15 501-515 141-155 (196)
39 PF13574 Reprolysin_2: Metallo 57.9 6.7 0.00015 38.4 2.1 13 502-514 111-123 (173)
40 PF13582 Reprolysin_3: Metallo 57.7 5.1 0.00011 36.5 1.2 13 502-514 107-119 (124)
41 PF13485 Peptidase_MA_2: Pepti 55.4 15 0.00033 32.9 3.9 36 503-544 26-61 (128)
42 cd00203 ZnMc Zinc-dependent me 54.5 6.5 0.00014 37.7 1.4 24 398-421 24-47 (167)
43 PF04228 Zn_peptidase: Putativ 52.9 10 0.00022 40.3 2.7 19 501-519 169-187 (292)
44 PF05572 Peptidase_M43: Pregna 51.5 7.7 0.00017 37.2 1.3 15 501-515 68-82 (154)
45 PF02031 Peptidase_M7: Strepto 47.1 13 0.00029 34.1 2.0 16 500-515 75-90 (132)
46 cd04280 ZnMc_astacin_like Zinc 45.9 10 0.00023 37.3 1.3 14 502-515 74-87 (180)
47 PF01400 Astacin: Astacin (Pep 42.5 15 0.00033 36.4 1.9 14 502-515 79-92 (191)
48 cd04272 ZnMc_salivary_gland_MP 41.3 16 0.00034 37.2 1.8 15 500-514 143-157 (220)
49 cd04283 ZnMc_hatching_enzyme Z 41.1 15 0.00032 36.4 1.5 16 502-517 77-92 (182)
50 PF02163 Peptidase_M50: Peptid 41.0 17 0.00037 35.8 2.0 24 503-526 8-31 (192)
51 PF12388 Peptidase_M57: Dual-a 40.8 13 0.00028 37.4 1.1 14 504-517 135-148 (211)
52 KOG3658 Tumor necrosis factor- 40.4 21 0.00045 41.6 2.7 65 440-514 338-404 (764)
53 cd04270 ZnMc_TACE_like Zinc-de 40.2 15 0.00033 38.0 1.6 13 502-514 167-179 (244)
54 cd04271 ZnMc_ADAM_fungal Zinc- 38.9 9.4 0.0002 39.2 -0.2 14 501-514 144-157 (228)
55 KOG2719 Metalloprotease [Gener 37.6 35 0.00075 38.0 3.8 55 453-520 243-298 (428)
56 PF04298 Zn_peptidase_2: Putat 37.4 22 0.00048 36.1 2.1 18 501-518 88-105 (222)
57 cd06163 S2P-M50_PDZ_RseP-like 37.0 22 0.00047 35.2 2.0 21 502-522 9-29 (182)
58 cd05709 S2P-M50 Site-2 proteas 36.5 22 0.00048 34.7 2.0 20 503-522 9-28 (180)
59 cd04267 ZnMc_ADAM_like Zinc-de 35.6 15 0.00033 36.2 0.7 14 501-514 132-145 (192)
60 PF07023 DUF1315: Protein of u 35.4 1E+02 0.0022 26.9 5.5 47 139-186 7-58 (93)
61 cd04281 ZnMc_BMP1_TLD Zinc-dep 35.1 20 0.00044 35.9 1.5 16 502-517 87-102 (200)
62 PF09471 Peptidase_M64: IgA Pe 34.4 19 0.00041 37.8 1.2 19 501-519 215-233 (264)
63 cd06161 S2P-M50_SpoIVFB SpoIVF 33.9 26 0.00055 35.4 2.0 19 502-520 38-56 (208)
64 COG4086 Predicted secreted pro 33.5 5.1E+02 0.011 27.3 11.0 84 118-204 166-254 (299)
65 PF01435 Peptidase_M48: Peptid 33.3 30 0.00065 34.8 2.4 21 498-518 84-105 (226)
66 cd04276 ZnMc_MMP_like_2 Zinc-d 33.3 24 0.00052 35.4 1.6 13 503-515 117-129 (197)
67 cd04269 ZnMc_adamalysin_II_lik 33.2 23 0.0005 35.1 1.5 14 501-514 130-143 (194)
68 COG2738 Predicted Zn-dependent 32.2 33 0.00071 33.9 2.2 17 501-517 91-107 (226)
69 PF05960 DUF885: Bacterial pro 31.8 1.3E+02 0.0029 34.8 7.8 165 500-697 370-544 (549)
70 PF13699 DUF4157: Domain of un 31.4 28 0.00061 29.3 1.5 16 501-516 60-75 (79)
71 TIGR03296 M6dom_TIGR03296 M6 f 30.5 11 0.00023 40.2 -1.5 14 502-515 165-178 (286)
72 cd06159 S2P-M50_PDZ_Arch Uncha 28.0 36 0.00079 35.7 2.0 18 502-519 118-135 (263)
73 cd04282 ZnMc_meprin Zinc-depen 27.8 31 0.00067 35.4 1.4 17 502-518 120-136 (230)
74 cd06164 S2P-M50_SpoIVFB_CBS Sp 25.7 43 0.00093 34.3 2.0 17 503-519 54-70 (227)
75 cd07633 BAR_OPHN1 The Bin/Amph 24.2 7.5E+02 0.016 25.0 11.2 120 61-207 12-132 (207)
76 KOG2857 Predicted MYND Zn-fing 24.1 5.2E+02 0.011 24.3 8.3 43 62-105 75-117 (157)
77 cd06162 S2P-M50_PDZ_SREBP Ster 23.7 49 0.0011 34.9 2.0 17 502-518 135-151 (277)
78 KOG3133 40 kDa farnesylated pr 23.7 3.1E+02 0.0068 28.5 7.6 64 128-205 145-208 (267)
79 PF08463 EcoEI_R_C: EcoEI R pr 22.4 3.2E+02 0.007 26.0 7.4 22 158-179 102-123 (164)
80 PRK11546 zraP zinc resistance 22.2 1.8E+02 0.0039 27.5 5.2 38 156-197 42-79 (143)
81 PF05505 Ebola_NP: Ebola nucle 22.2 3E+02 0.0065 31.4 7.6 65 118-197 323-393 (717)
82 PF06160 EzrA: Septation ring 22.1 1.3E+03 0.028 27.0 16.0 71 125-202 197-268 (560)
83 PF01447 Peptidase_M4: Thermol 21.6 61 0.0013 30.9 2.0 15 501-515 134-148 (150)
84 cd06160 S2P-M50_like_2 Unchara 21.5 59 0.0013 32.1 2.0 20 503-522 42-61 (183)
85 PF01742 Peptidase_M27: Clostr 20.8 75 0.0016 34.9 2.6 23 497-519 209-231 (408)
86 PF05548 Peptidase_M11: Gameto 20.6 49 0.0011 35.7 1.3 15 501-515 149-163 (314)
87 PF01421 Reprolysin: Reprolysi 20.6 54 0.0012 32.5 1.5 14 501-514 130-143 (199)
88 TIGR00488 putative HD superfam 20.3 6.7E+02 0.014 23.7 9.0 99 501-600 34-146 (158)
No 1
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=100.00 E-value=3.6e-147 Score=1213.68 Aligned_cols=649 Identities=29% Similarity=0.514 Sum_probs=613.3
Q ss_pred CCCCCCCCCCCCCcCChHHHHHHHHHHHHHHHHHHHcC---CChHHHHHHHHHhhHHHhhhhchhhhhhccCCchHHHHH
Q 037955 47 VPTGLYGFDHLKSPNGFQRFVDDAIERSSELVNYISEM---PSSVEIIRAMDEISDAVCSVVDSAELCRQTHPDREFVEE 123 (733)
Q Consensus 47 ~~~gl~~~~~l~~p~~~~~~~~~~i~~~~~~i~~i~~~---~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~e~r~a 123 (733)
+-.|+|.|..+ .|+++.|++++++++|++.|++|... |||+|+|.+|+++++.||++++++++++++|+|+++|++
T Consensus 14 ~~~~lP~F~~i-~~ed~~pAf~~~l~~~~~~i~~i~~~~~~pt~~n~v~~Le~~~~~L~rv~~~~~~l~~v~~~~elr~a 92 (683)
T COG0339 14 PFGGLPPFDAI-KPEDVKPAFDEALAQARAEIDAIAANPDAPTWENTVEPLERAGDRLGRVWSVVSHLNSVHNSPELREA 92 (683)
T ss_pred CcCCCCCcccC-ChhhccHHHHHHHHHHHHHHHHHHcCCCCCcHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCCHHHHHH
Confidence 34579999999 59999999999999999999999864 699999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHhhCCHHHHHHHHHHh--hcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHH
Q 037955 124 ASKASMRISEYLHYLNTNHTLYDAVKKAE--LDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCRE 201 (733)
Q Consensus 124 a~~~~~~l~~~~~~l~~~~~Ly~~l~~~~--~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~ 201 (733)
++++.+++++|++.|++|..||++++++. .+...||+|++|+++..+++|+++|+.|++++|+|+.+|++|+++|+++
T Consensus 93 ~e~~~pklse~~~~l~~~~~Lf~~~~al~~~~e~~~ld~e~~r~~e~~~rdF~~sGa~L~~~~k~r~~~I~~ela~L~~~ 172 (683)
T COG0339 93 YEEILPKLSEFSNDLGQNPGLFARYKALWQSPESAGLDPEQKRVVENALRDFVLSGAKLPPEKKARLAEINTELAELGAQ 172 (683)
T ss_pred HHHhhHHHHHHHHHhhcCHHHHHHHHHHhcCcccccCCHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999993 3466799999999999999999999999999999999999999999999
Q ss_pred HHhhcccCCC-C---c-ccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHH
Q 037955 202 FNQNIINDPG-H---V-DIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEV 276 (733)
Q Consensus 202 F~~ni~~~~~-~---v-~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~ 276 (733)
|++|+.+++. + + +..+|+|||++.+++++..+ ..++.+||.||++.|++.|+|++|.||++
T Consensus 173 Fs~nvL~at~~~~~~~~~~~~LaGlp~~~l~aa~~~A--------------~~kg~~~~~itl~~p~~~p~l~~~~~r~l 238 (683)
T COG0339 173 FSNNVLDATKAWALIVTDEAELAGLPESLLAAAAAAA--------------EAKGLEGYLITLDIPSYLPVLTYADNRAL 238 (683)
T ss_pred HHhhhccccccceeecCcHHHhcCCCHHHHHHHHHHH--------------HhcCCCCeEEeecccchhHHHHhcccHHH
Confidence 9999997773 2 2 24579999999999998763 33344789999999999999999999999
Q ss_pred HHHHHHHhccC-----CcchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHH
Q 037955 277 RKMVYIQGHSV-----PQANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEA 351 (733)
Q Consensus 277 Rk~~~~a~~~~-----~~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~ 351 (733)
|+++|+|+..+ ..+|.+++.+++++|.|+|+||||+|||+|+|.++||++|+.|.+||++|+.+.+|.+++|++.
T Consensus 239 Re~~y~A~~~ra~~~~~~~n~~ii~~~L~LR~E~A~LLGf~~yA~~~L~~kMA~~p~~Vl~fL~~l~~ka~~~a~~e~a~ 318 (683)
T COG0339 239 REKLYRAYVTRASEGGPNDNRAIIEEILKLRAELAKLLGFSNYAEYSLADKMAKTPEAVLNFLNDLAEKARPQAEKELAE 318 (683)
T ss_pred HHHHHHHHhhhccccCCcccHHHHHHHHHHHHHHHHHcCcccHHHHhHHHHhhcCHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 99999998753 3458899999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHh-cCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccc
Q 037955 352 IKNFKRKS-CGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPD 430 (733)
Q Consensus 352 L~~~~~~~-~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~d 430 (733)
|+.++++. .| ..+++|||+.||++++|+.+|++|.+++++|||++.|++|+|+++++||||+|++.. ++++||||
T Consensus 319 L~~~~~~~~~~--~~~l~~WD~~yyaeK~r~~~y~~de~elrpYF~l~~Vl~GlF~~~~rLfGI~~~e~~--~~~vwHpD 394 (683)
T COG0339 319 LQAFAAEEEGG--LPELQPWDWAYYAEKQRQEKYAFDEEELRPYFPLNKVLEGLFEVAKRLFGITFVERK--DIPVWHPD 394 (683)
T ss_pred HHHHHHhhccC--CcccchhhHHHHHHHHHhhhcCCCHHHhhhcCChhHHHHHHHHHHHHHcCeEEEECC--CCCccCCC
Confidence 99999885 44 678999999999999999999999999999999999999999999999999999974 46899999
Q ss_pred cceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCCcChhhHHHHHHH
Q 037955 431 VLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVRLNHHEVETLFHE 509 (733)
Q Consensus 431 V~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~lL~~~~v~TLfHE 509 (733)
|++|+|+|.+|..+|.||+|+|+|+||++||||+++++++.+.+|+.+.||++++|||++|.++ |+||+|+||+|||||
T Consensus 395 Vr~~~v~d~~g~~~g~fY~DlyaR~~KrgGAWM~~~~~~~~~~~~~~q~PV~ylvCNF~~p~~gkPaLls~dEV~TLFHE 474 (683)
T COG0339 395 VRVFEVFDENGELIGLFYLDLYARDGKRGGAWMDDFVSQRRLDDGGGQKPVIYLVCNFTKPVGGKPALLSHDEVTTLFHE 474 (683)
T ss_pred ceEEEEEcCCCCEEEEEEeecccCCCCccchHHHHhhhcccccCCCcccceEEEeccCCCCCCCCCceeeHHHHHHHHHH
Confidence 9999999999999999999999999999999999999999998899999999999999999876 599999999999999
Q ss_pred HHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHH
Q 037955 510 FGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQ 589 (733)
Q Consensus 510 fGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~q 589 (733)
|||+||+||++++|+.+|||+|+|||||+||||||||||+|.+|..||+||+||+|||++++++|.+++++++|+++++|
T Consensus 475 fGHgLH~mlt~v~~~~vsGt~v~wDfVElPSQ~mE~w~~~p~vL~~~a~Hy~TGe~lP~~ll~k~laaknf~~g~~t~rq 554 (683)
T COG0339 475 FGHGLHHLLTRVKYPGVSGTNVPWDFVELPSQFMENWCWEPEVLAKYARHYQTGEPLPKELLDKMLAAKNFQAGLFTLRQ 554 (683)
T ss_pred hhhHHHHHhhcCCccccCCCCCCcchhhccHHHHHHhhcCHHHHHHHHHhhccCCcCHHHHHHHHHHHHhhhhHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhHHHhhcCC-CCCCCCHHHHHHHHHhhccCCCCCCCCcccccccccc--cccccchhHHHHHHHHHHHHHHHHHc
Q 037955 590 IFYALVDQTLFGER-LGQTRDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFI--NYGAGYYSYLYAKCFAATIWQKLCQE 666 (733)
Q Consensus 590 l~~a~fD~~lh~~~-~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~--~Y~a~YYsYl~s~v~A~di~~~~~~~ 666 (733)
+.+++|||.+|+.. |....++.++++++.+++...+.++..+|.++|+||| ||+||||||+|++|+++|.|+.|++.
T Consensus 555 l~fal~Dm~~H~~~~~~~~~~i~~~~~~~~~~~~v~~~~~~~~~~~~F~HIFagGYsAGYYSY~WaeVLsaDafa~Fee~ 634 (683)
T COG0339 555 LEFALFDMRLHTEFDPDANADILEFEAEVLKKVAVLPSIPPRRRPHSFGHIFAGGYSAGYYSYLWAEVLSADAFAAFEEE 634 (683)
T ss_pred HHHHHHHHHhhccCCcccccCHHHHHHHHHHHhCCCCCcchhhccccccceecCcccchhHHHHHHHHHhhHHHHHHHhc
Confidence 99999999999974 4456889999999999998887777889999999999 69999999999999999999999999
Q ss_pred CCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcCCC
Q 037955 667 DPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVKLM 724 (733)
Q Consensus 667 ~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g~~ 724 (733)
|++|+++|++||+.||+.|||++|++++++|+|| +|+++|+|++.|+.
T Consensus 635 g~~~~e~G~rfrd~ILs~GGS~dp~e~f~~frGr----------ep~~dalLr~~Gl~ 682 (683)
T COG0339 635 GPFNRETGQRFRDAILSRGGSRDPMELFKAFRGR----------EPSIDALLRHRGLA 682 (683)
T ss_pred CCCCHHHHHHHHHHHHhccCCcCHHHHHHHHhcC----------CCChhHHHHhcCCC
Confidence 9999999999999999999999999999999999 69999999999985
No 2
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=100.00 E-value=4.8e-139 Score=1204.53 Aligned_cols=645 Identities=23% Similarity=0.363 Sum_probs=593.8
Q ss_pred CCCCCCCCCCCCcCChHHHHHHHHHHHHHHHHHHHcC---CChHHHHHHHHHhhHHHhhhhchhhhhhccCCchHHHHHH
Q 037955 48 PTGLYGFDHLKSPNGFQRFVDDAIERSSELVNYISEM---PSSVEIIRAMDEISDAVCSVVDSAELCRQTHPDREFVEEA 124 (733)
Q Consensus 48 ~~gl~~~~~l~~p~~~~~~~~~~i~~~~~~i~~i~~~---~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~e~r~aa 124 (733)
+.|++.|+.+ +|++|.++++++|+.+++.|++|... |||+|+|.+||++++.||.+++++.+++++|||+++|+++
T Consensus 14 ~~~~p~f~~i-~~e~~~~a~~~~~~~~~~~i~~i~~~~~~~t~~n~i~~ld~~~~~l~~~~~~~~~l~~v~~~~~~r~a~ 92 (681)
T PRK10280 14 PYLAPHFDQI-ADHHYRPAFDEGVRQKRAEIAAIALNPQAPDFNNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLD 92 (681)
T ss_pred CCCCCCCCcC-CHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH
Confidence 3478999999 69999999999999999999999753 5899999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHhhc--CCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHH
Q 037955 125 SKASMRISEYLHYLNTNHTLYDAVKKAELD--GHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREF 202 (733)
Q Consensus 125 ~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~--~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F 202 (733)
+++.+++++|++++++|++||++|+++... ...|++|++|++++++++|+++|++|++++|+++++|+.+|++|+++|
T Consensus 93 ~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~~~l~~e~~r~l~~~l~dF~~sG~~L~~~~r~r~~~l~~~l~~L~~~F 172 (681)
T PRK10280 93 EQFSAELAELANDIYLNGELFARVDAVWQQRESLGLDSESIRLVEVIHQRFVLAGAKLAQADKAKLKVLNTEAATLTSQF 172 (681)
T ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 999999999999999999999999999322 237999999999999999999999999999999999999999999999
Q ss_pred HhhcccCCC--C--c-ccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCC-CEEEecCCCcHHHHhhhCCCHHH
Q 037955 203 NQNIINDPG--H--V-DIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEK-GFRITTDSRILQSILQWTSDDEV 276 (733)
Q Consensus 203 ~~ni~~~~~--~--v-~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~i~l~~~~~~~vL~~~~d~~~ 276 (733)
++|+.+++. . | +..+|+|||+++++.++..+ ...+.. +|.||++++++.+||++|+||++
T Consensus 173 ~~n~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a--------------~~~g~~g~~~itl~~~~~~p~l~~~~dr~~ 238 (681)
T PRK10280 173 NQRLLAANKSGGLVVNDIHQLAGLSEQEIALAAEAA--------------REKGLDNRWLIPLLNTTQQPALAELRDRQT 238 (681)
T ss_pred HHhHhhhhhcCeeecCCHHHHcCCCHHHHHHHHHHH--------------HhcCCCCcEEEeCcCCcHhHHHhcCCCHHH
Confidence 999987763 2 2 23579999999999987642 222223 39999999999999999999999
Q ss_pred HHHHHHHhccCC-----cchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHH
Q 037955 277 RKMVYIQGHSVP-----QANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEA 351 (733)
Q Consensus 277 Rk~~~~a~~~~~-----~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~ 351 (733)
||++|.|+.+.. .+|.++|.+|+++|+++|++|||+||++|++.++|++||++|.+||++|...++|.+++|++.
T Consensus 239 Re~~~~A~~~r~~~~~~~dn~~il~~ll~lR~e~A~lLGf~~yA~~~l~~kMa~spe~V~~FL~~L~~~~~~~a~~E~~~ 318 (681)
T PRK10280 239 RENLFAAGWTRAEKGDANDTRAIIQRLVEIRAQQAKLLGFPHYAAWKIADQMAKTPEAALNFMREIVPAARQRASDELAS 318 (681)
T ss_pred HHHHHHHHHhhhccCCchhHHHHHHHHHHHHHHHHHHcCCCcHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 999999998754 268999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHH-HHHHHHHHHhhCceeEeecCCCCCccccc
Q 037955 352 IKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCI-EGLKMLAESLFGVTFHSVPLAPGESWHPD 430 (733)
Q Consensus 352 L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl-~gl~~l~~~lfgi~~~~~~~~~~~~wh~d 430 (733)
|++++++..| ...|+|||+.||++++++++|++|++++++|||++.|+ +|||.++++||||+|++++. .++||||
T Consensus 319 L~~~~~~~~g--~~~l~pWD~~yy~ek~r~~~~~~d~~~l~~YFpl~~Vl~~Glf~l~~~LfGi~f~~~~~--~~vWh~d 394 (681)
T PRK10280 319 IQAVIDKQQG--GFSAQAWDWAFYAEQVRREKYALDEAQLKPYFELNTVLNEGVFWTANQLFGIKFVERFD--IPVYHPD 394 (681)
T ss_pred HHHHHHHhcC--CCCCChhHHHHHHHHHHHHhcCCCHHHcCCcCcHHHHHHHhHHHHHHHHcCeEEEECCC--CCCCCCC
Confidence 9999987766 45799999999999999999999999999999999999 79999999999999999764 4699999
Q ss_pred cceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCCcChhhHHHHHHH
Q 037955 431 VLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVRLNHHEVETLFHE 509 (733)
Q Consensus 431 V~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~lL~~~~v~TLfHE 509 (733)
|++|+|+|++|..+|++|+|+|+|+||++||||.+++.+... .++.|+++++|||++|.++ |+||+|+||+|||||
T Consensus 395 V~~~~V~d~~g~~lG~fY~Dl~~R~gK~~gawm~~~~~~~~~---~~~~Pv~~lvcNf~~p~~~~p~LL~~~eV~TlFHE 471 (681)
T PRK10280 395 VRVWEIFDHNGVGLALFYGDFFARDSKSGGAWMGNFVEQSTL---NETRPVIYNVCNYQKPAAGQPALLLWDDVITLFHE 471 (681)
T ss_pred eeEEEEEcCCCCEEEEEEecCCCCCCCCCCCCCccccccccc---CCCCCeEEEECCCCCCCCCCCCccCHHHHHHHHHH
Confidence 999999998889999999999999999999999888754332 2467999999999998754 599999999999999
Q ss_pred HHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHH
Q 037955 510 FGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQ 589 (733)
Q Consensus 510 fGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~q 589 (733)
||||||++|++++|++++||+|+|||||+||||||+|||+|+||+.||+||+||+|||++++++|++++++++|+++++|
T Consensus 472 fGHalH~lls~~~y~~~sGt~v~~DfVE~PSq~mE~w~~~~~vL~~~a~Hy~TgepiP~~l~~~l~~ar~~~~g~~~~~q 551 (681)
T PRK10280 472 FGHTLHGLFARQRYATLSGTNTPRDFVEFPSQINEHWASHPQVFARYARHYQSGEAMPDELQEKMRNASLFNKGYDMSEL 551 (681)
T ss_pred HHHHHHHHHhCCCccccCCCCCCcchhcCcHHHHHHHhcCHHHHHHHhhccCCCCCCCHHHHHHHHHhhCcchHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhHHHhhcCCCC-CCCCHHHHHHHHHhhccC-CCCCCCCcccccccccc--cccccchhHHHHHHHHHHHHHHHHH
Q 037955 590 IFYALVDQTLFGERLG-QTRDTSSIVADMKRQHTS-WNHVEGTHWHIRFSHFI--NYGAGYYSYLYAKCFAATIWQKLCQ 665 (733)
Q Consensus 590 l~~a~fD~~lh~~~~~-~~~~~~~l~~~l~~~~~~-~~~~~~~~~~~~f~Hl~--~Y~a~YYsYl~s~v~A~di~~~~~~ 665 (733)
+.+|+|||++|+..+. ...++..++.++.+++.. ++.+++++|+++|+||| ||+||||||+||+|+|+|+|+.|++
T Consensus 552 l~~al~D~~lH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F~Hif~ggY~AgYYsYlwaevlaaD~f~~f~~ 631 (681)
T PRK10280 552 LSAALLDMRWHCLEENEAMQDVDDFELRALVAENLDLPAVPPRYRSSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVE 631 (681)
T ss_pred HHHHHHhHHHhccCcccccccHHHHHHHHHHHhCCCCCCCCCCCCCCcccccccCCcchhhHHHHHHHHHHHHHHHHHHh
Confidence 9999999999975322 124788888788777664 56777889999999999 6999999999999999999999999
Q ss_pred cCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcCCC
Q 037955 666 EDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVKLM 724 (733)
Q Consensus 666 ~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g~~ 724 (733)
+|++|+++|++||++||++|||+||+++|++|+|| +|+++|||+++|+.
T Consensus 632 ~g~~n~~~G~~fr~~iL~~GGs~d~~~~~~~FlGR----------~P~~~alL~~~Gl~ 680 (681)
T PRK10280 632 QGGLTRENGQRFREAILSRGNSTDLERLYRQWRGH----------APQIMPMLQHRGLN 680 (681)
T ss_pred cCCCCHHHHHHHHHHHhhcCCCcCHHHHHHHhcCC----------CCChHHHHHhcCCC
Confidence 99999999999999999999999999999999999 69999999999985
No 3
>PRK10911 oligopeptidase A; Provisional
Probab=100.00 E-value=7.3e-133 Score=1158.13 Aligned_cols=647 Identities=27% Similarity=0.459 Sum_probs=601.6
Q ss_pred CCCCCCCCCCCcCChHHHHHHHHHHHHHHHHHHHcC---CChHHHHHHHHHhhHHHhhhhchhhhhhccCCchHHHHHHH
Q 037955 49 TGLYGFDHLKSPNGFQRFVDDAIERSSELVNYISEM---PSSVEIIRAMDEISDAVCSVVDSAELCRQTHPDREFVEEAS 125 (733)
Q Consensus 49 ~gl~~~~~l~~p~~~~~~~~~~i~~~~~~i~~i~~~---~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~e~r~aa~ 125 (733)
.++|.|+.+ +|++|.++++++|++|++.|++|... +||+|+|.+|+++++.|+.+++++.++++||||+++|++++
T Consensus 9 ~~~p~f~~i-~~~~~~~~~~~~i~~~~~~i~~i~~~~~~~t~~n~i~~l~~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~ 87 (680)
T PRK10911 9 FSLPPFSAI-KPEHVVPAVTKALNDCREAVERVVAQGAPYTWENLCQPLAEVDDVLGRIFSPVSHLNSVKNSPELREAYE 87 (680)
T ss_pred CCCCCCCcC-CHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHH
Confidence 467889998 79999999999999999999999653 38999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhhCCHHHHHHHHHHhh--cCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q 037955 126 KASMRISEYLHYLNTNHTLYDAVKKAEL--DGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFN 203 (733)
Q Consensus 126 ~~~~~l~~~~~~l~~~~~Ly~~l~~~~~--~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~ 203 (733)
++.+++++|.+++++|++||++|+++.. ....|++|++|++++++++|+++|++|++++|+++++|+.+|++|+++|+
T Consensus 88 ~~~~~l~~~~~~~~~~~~Ly~~~~~~~~~~~~~~l~~e~~r~l~~~~~~F~~sG~~L~~~~r~~~~~i~~~l~~l~~~F~ 167 (680)
T PRK10911 88 QTLPLLSEYSTWVGQHEGLYQAYRDLRDGDHYATLNTAQKKAVDNALRDFELSGIGLPKEKQQRYGEIAARLSELGNQYS 167 (680)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHhccccccCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999999999999999942 22479999999999999999999999999999999999999999999999
Q ss_pred hhcccCCCC----c-ccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHH
Q 037955 204 QNIINDPGH----V-DIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRK 278 (733)
Q Consensus 204 ~ni~~~~~~----v-~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk 278 (733)
+|+.+++.. + +..+|+|||+++++.++..+ ...+.+||.||++.|++.+||++|+|+++||
T Consensus 168 ~n~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a--------------~~~~~~g~~~tl~~p~~~p~L~~~~drelRk 233 (680)
T PRK10911 168 NNVLDATMGWTKLITDEAELAGMPESALAAAKAQA--------------EAKEQEGYLLTLDIPSYLPVMTYCDNQALRE 233 (680)
T ss_pred HhhhhhhccceeecCCHHHHCCCCHHHHHHHHHHH--------------HhcCCCCeEEEeecCcHHHHHhhCCCHHHHH
Confidence 999977742 2 23579999999999987642 2233357999999999999999999999999
Q ss_pred HHHHHhccCC---------cchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHH
Q 037955 279 MVYIQGHSVP---------QANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEF 349 (733)
Q Consensus 279 ~~~~a~~~~~---------~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~ 349 (733)
++|+|+.+.. .+|.++|.+|+++|+++|++|||+||++|++.++|++||++|.+||++|.+.++|.+++|+
T Consensus 234 ~~y~A~~~r~~~~~~~~~~~dN~~il~eil~lR~e~AkLLGy~syAd~~L~~kma~spe~V~~fL~~l~~~~~p~a~~El 313 (680)
T PRK10911 234 EMYRAYSTRASDQGPNAGKWDNSEVMEEILALRHELAQLLGFENYADKSLATKMAENPQQVLDFLTDLAKRARPQGEKEL 313 (680)
T ss_pred HHHHHHHhhhhhccCcccccchHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 9999998543 2689999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCcccc
Q 037955 350 EAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHP 429 (733)
Q Consensus 350 ~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~ 429 (733)
+.|++++++.+| .++|+|||+.||.+++++++|++|++++++|||++.|++||+.++++||||+|++++ ++++|||
T Consensus 314 ~~L~~~~k~~~g--~~~L~pWD~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~gl~~~~~~Lfgi~~~e~~--~~~vwh~ 389 (680)
T PRK10911 314 AQLRAFAKAEFG--VDELQPWDIAYYSEKQKQHLYSISDEQLRPYFPENKAVNGLFEVVKRIYGITAKERK--DVDVWHP 389 (680)
T ss_pred HHHHHHHHhhcC--CCCCChhhHHHHhhHHHHhhcCCCHHHhcccCcHHHHHHHHHHHHHHHcCeeEEecC--CCCCCCC
Confidence 999999988777 568999999999999999999999999999999999999999999999999998875 4689999
Q ss_pred ccceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCCcChhhHHHHHH
Q 037955 430 DVLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVRLNHHEVETLFH 508 (733)
Q Consensus 430 dV~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~lL~~~~v~TLfH 508 (733)
||++|+|+|++|+++|++|+|+|+|+||++||||++++++....+|.++.|+++++|||++|.++ |+||+|+||.||||
T Consensus 390 dV~~~~v~d~~~~~iG~~y~D~~~R~gK~~ga~~~~~~~~~~~~~g~~~~Pv~~l~~Nf~~p~~~~p~LL~~~~v~tlfH 469 (680)
T PRK10911 390 DVRFFELYDENNELRGSFYLDLYARENKRGGAWMDDCVGQMRKADGSLQKPVAYLTCNFNRPVNGKPALFTHDEVITLFH 469 (680)
T ss_pred cceEEEEEeCCCCeEEEEEeeccCCCCCCCCccccccccccccCCCceeCCeEEEECCCCCCCCCCCcccCHHHHHHHHH
Confidence 99999999988899999999999999999999999998887777888899999999999998754 59999999999999
Q ss_pred HHHHHHHHhhhcCCCCccCCCc-ccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHH
Q 037955 509 EFGHALHSLLSRTDYQHFSGTR-VALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQ 587 (733)
Q Consensus 509 EfGHalH~lls~~~~~~~sgt~-~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~ 587 (733)
|||||||++|++++|+.++|++ +++||||+|||+||+|||+|++|+.||+||.||+|+|++++++|+++++++.++.++
T Consensus 470 EfGHalH~~ls~~~~~~~sGt~~~~~D~vE~pS~~~E~~~~~~~vL~~~a~H~~tgeplp~~l~~~l~~~~~~~~~~~~~ 549 (680)
T PRK10911 470 EFGHGLHHMLTRIETAGVSGISGVPWDAVELPSQFMENWCWEPEALAFISGHYETGEPLPKELLDKMLAAKNYQAALFIL 549 (680)
T ss_pred HHhHHHHHHHhCCCcCcCCCcCCCCchHhhccHHHHHHHhcCHHHHHHHHHHhcCCCCCCHHHHHHHHHhhhhhhHHHHH
Confidence 9999999999999999999976 899999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhHHHhhcCCCC-CCCCHHHHHHHHHhhccCCCCCCCCcccccccccc--cccccchhHHHHHHHHHHHHHHHH
Q 037955 588 RQIFYALVDQTLFGERLG-QTRDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFI--NYGAGYYSYLYAKCFAATIWQKLC 664 (733)
Q Consensus 588 ~ql~~a~fD~~lh~~~~~-~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~--~Y~a~YYsYl~s~v~A~di~~~~~ 664 (733)
+|+.+++||+++|+..+. ...++.++++++++++..++..++.+|+++|+||| ||+|+||||+||+|+|+|+|+.|+
T Consensus 550 rql~~a~~D~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F~Hif~gGY~AgYYsYlwa~vla~d~~~~f~ 629 (680)
T PRK10911 550 RQLEFGLFDFRLHAEFDPDQGAKILETLAEIKKQVAVVPSPSWGRFPHAFSHIFAGGYAAGYYSYLWADVLAADAFSRFE 629 (680)
T ss_pred HHHHHHHHHHHHhhccccccccCHHHHHHHHHHHcCCCCCCCCCcCCccccccCcCCcccchHHHHHHHHHHHHHHHHHH
Confidence 999999999999975332 12378999999999998777777888999999999 799999999999999999999999
Q ss_pred HcCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcCCC
Q 037955 665 QEDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVKLM 724 (733)
Q Consensus 665 ~~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g~~ 724 (733)
++|++|+++|++||++||++|||++|++++++|+|| +|+++|||+++|+.
T Consensus 630 ~~g~~~~~~g~~~r~~iL~~Ggs~~p~~~~~~F~GR----------~P~~~all~~~gl~ 679 (680)
T PRK10911 630 EEGIFNRETGQSFLDNILSRGGSEEPMELFKRFRGR----------EPQLDAMLEHYGIK 679 (680)
T ss_pred hcCCCCHHHHHHHHHHHHhCcCCcCHHHHHHHhcCC----------CCChHHHHHhcCCC
Confidence 999999999999999999999999999999999999 69999999999984
No 4
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=3.6e-132 Score=1061.87 Aligned_cols=634 Identities=44% Similarity=0.755 Sum_probs=588.8
Q ss_pred CCCCCCCCCCCCCCcCChHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHhhHHHhhhhchhhhhhccCCchHHHHHHH
Q 037955 46 GVPTGLYGFDHLKSPNGFQRFVDDAIERSSELVNYISEMPSSVEIIRAMDEISDAVCSVVDSAELCRQTHPDREFVEEAS 125 (733)
Q Consensus 46 ~~~~gl~~~~~l~~p~~~~~~~~~~i~~~~~~i~~i~~~~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~e~r~aa~ 125 (733)
...+||||+|.|++|++|..+.+.++++++++|+.+.+.++...+|..+|++||.||+|+|.++|+|..|||+++++||+
T Consensus 39 ~~~tGLFgn~~L~t~eGF~~l~~~a~~~t~eLi~~~~~~~~g~~ti~~~DeiSd~lC~V~DLaEfvR~aHPd~~fv~aAe 118 (704)
T KOG2090|consen 39 GDETGLFGNPELSTAEGFNRLPEAALEKTQELIDELLSTPSGPRTIQIFDEISDTLCRVADLAEFVRQAHPDPEFVEAAE 118 (704)
T ss_pred cccccccCCcccCChhHHHHHHHHHHHHHHHHHHHHhcCCCCchHHHHHHHHhHHHHHHHHHHHHHHHhCCCHHHHHHHH
Confidence 45699999999999999999999999999999999999888888999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhhCCHHHHHHHHHHhhcC---CCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHH
Q 037955 126 KASMRISEYLHYLNTNHTLYDAVKKAELDG---HLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREF 202 (733)
Q Consensus 126 ~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~---~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F 202 (733)
+|+..+.++++.+|+|..||++||.+.++. ..+++|.. +++.++.|||++||||++++|+++++|+.+|.+||.+|
T Consensus 119 ~a~~~~~e~ve~LNTn~~LY~~Lk~~l~~~~~l~~~d~e~~-v~~lll~DFE~sGIhL~~~kr~kfv~Ls~eI~~lg~~F 197 (704)
T KOG2090|consen 119 EACRSMFELVESLNTNVALYQKLKKVLQDSSRLDDLDPETY-VARLLLDDFEKSGIHLDPEKREKFVQLSSEIFDLGREF 197 (704)
T ss_pred HHHHHHHHHHHHhccCHHHHHHHHHHhcCcccccccCHHHH-HHHHHHHHHHhhcccCCHHHHHHHHHHhHHHHHHHHHH
Confidence 999999999999999999999999985443 34677766 99999999999999999999999999999999999999
Q ss_pred HhhcccCC-CCcccccCCCCC---HHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHH
Q 037955 203 NQNIINDP-GHVDIFPESRIP---KHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRK 278 (733)
Q Consensus 203 ~~ni~~~~-~~v~~~~l~glp---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk 278 (733)
.+|+..-| ..+.. -+..|| +..++.. .+++.+..++..+..+++++.|+.+||
T Consensus 198 ~~n~~~~P~~~l~~-s~~~l~~~~~~~~~~k----------------------~knF~~~t~~~~i~~ll~~~~d~~vRk 254 (704)
T KOG2090|consen 198 QNNTDRFPDNKLPN-SLERLPFSKKNFIESK----------------------RKNFDQATDPYIIYGLLSSSEDESVRK 254 (704)
T ss_pred HhccccCCccccch-hhhhccCcchhhHHHH----------------------hcCCCcCCCCeeeehhhhcCCCHHHHH
Confidence 99987533 11211 122222 1221111 123444445666778899999999999
Q ss_pred HHHHHhccCCcchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH
Q 037955 279 MVYIQGHSVPQANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRK 358 (733)
Q Consensus 279 ~~~~a~~~~~~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~ 358 (733)
.+|..+++....+..+|+.+|..|+++|++.||+|||+++++++|+++|+.|.+||+.|.++++|...+|+..|..+|++
T Consensus 255 ~vY~~~~~~~~~q~~~L~~Li~sRheLa~l~G~~SfA~~al~~~~a~~pk~V~~Fl~~Ls~k~~~~~~kel~~i~~mk~k 334 (704)
T KOG2090|consen 255 MVYNTFHSPSDIQVKLLEHLISSRHELAKLVGKSSFAHRALEGKLAKNPKTVRSFLEELSEKLSEKTDKELAVIRDMKKK 334 (704)
T ss_pred HHHHHhcCCcHHHHHHHHHHHHHHHHHHHHhCcchHHHHHhhccccCChHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEc
Q 037955 359 SCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQH 438 (733)
Q Consensus 359 ~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d 438 (733)
.++....+++|||..||..+++++.++++.....+|||++.|++|+..++++||||+|.+.+..|||+|||||++++|++
T Consensus 335 ~~~~~~~e~~~WD~~YYT~~~r~~~~~~~~~~~~~fFslg~~ieGLs~L~~~LyGirl~~~~l~pGE~WhpdV~KL~vv~ 414 (704)
T KOG2090|consen 335 ENNNNNAEIEPWDRPYYTSMYRQSNNSLNPSIYSPFFSLGSCIEGLSTLFQRLYGIRLIPEPLAPGEVWHPDVRKLNVVH 414 (704)
T ss_pred hcCCCCCccccccchhhhhHhhcccCCCCcceeeccccHHHHHHHHHHHHHHHhCeeeeccCCCCccccCchhhhheeec
Confidence 98877789999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCCcChhhHHHHHHHHHHHHHHh
Q 037955 439 PEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVRLNHHEVETLFHEFGHALHSL 517 (733)
Q Consensus 439 ~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~lL~~~~v~TLfHEfGHalH~l 517 (733)
++.+.+|+||+|+|.|+||..|+++|+||||+.++||++|.|+++++|||..+... +++|++.+|+|||||||||||++
T Consensus 415 E~eg~lG~IY~Dlf~R~gK~~g~aHFTIr~sr~l~Dg~yQlPVi~L~cnf~rss~~s~t~L~~~~vetLFHEmGHAMHSm 494 (704)
T KOG2090|consen 415 EQEGLLGYIYCDLFERPGKTVGDAHFTIRGSRQLSDGTYQLPVIVLVCNFVRSSQSSPTFLSLSEVETLFHEMGHAMHSM 494 (704)
T ss_pred CCCCceeEEEEEeeccCCCCCCCceEEeeccccCCCCCeeceeeEEeecccccccCCCcccCHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999987754 49999999999999999999999
Q ss_pred hhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHHHHHHHhHH
Q 037955 518 LSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQIFYALVDQ 597 (733)
Q Consensus 518 ls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~ 597 (733)
|++|+||+++||+||+||+|+||.+||+|++|+.||..|++||.|++++|+++++++..+++++++.++++|+.++++|+
T Consensus 495 LGrT~YQhvtGTRc~tDfaEiPSiLMEyFa~D~rVl~~~aRhy~t~e~l~~~mv~~l~~s~n~~Aa~e~q~Qv~ya~~Dq 574 (704)
T KOG2090|consen 495 LGRTHYQHVTGTRCPTDFAEIPSILMEYFANDYRVLRFFARHYSTGEPLPEDMVNRLCESRNSFAAQETQRQVFYALLDQ 574 (704)
T ss_pred hccchhccccCcccchhHhhhhHHHHHHHhcchHHHHHHHHHcCCCCCCCHHHHHHHHHHhccchhHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhhcC-CCCCCCCHHHHHHHHHhhccCCCCCCCCcccccccccccccccchhHHHHHHHHHHHHHHHHHcCCCCHHHHHH
Q 037955 598 TLFGE-RLGQTRDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFINYGAGYYSYLYAKCFAATIWQKLCQEDPLSLTTGTT 676 (733)
Q Consensus 598 ~lh~~-~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~~Y~a~YYsYl~s~v~A~di~~~~~~~~~~~~~~G~~ 676 (733)
.+|.. .+....+..+++.++.+++++++..+++.|+.+|+|++||||.||||++++++|.-||+..|+++|+++++|++
T Consensus 575 ~fhg~~~~~~~~~~~~~~~~v~~k~~~~~~~~~taw~~rFsHl~gYGA~YYSYL~~r~~AS~IWq~~Fe~dPfsR~aGek 654 (704)
T KOG2090|consen 575 EFHGIACPLIAEDTTDLLSEVKRKFSGLLYVPPTAWQLRFSHLVGYGATYYSYLFARAIASLIWQQLFENDPFSRKAGEK 654 (704)
T ss_pred HHhcccccccccchhHHHHHHHHhcCCCCCCCCCccccchhhhhccCchHHHHHHHHHHHHHHHHHHHhcCccchhhhHH
Confidence 99994 44445589999999999999998899999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhcCCCCCCHHHHHHHhcCCCcc
Q 037955 677 LRTKILQHGGAKEPADMLNDLVGDGIL 703 (733)
Q Consensus 677 ~r~~vL~~Ggs~~~~ell~~flGrd~~ 703 (733)
||++||++||+++|+++++++||.+++
T Consensus 655 ~r~eil~hGG~~~P~~lva~~L~~~~~ 681 (704)
T KOG2090|consen 655 FRKEILKHGGGRDPAELVADILGKPPL 681 (704)
T ss_pred HHHHHHHhcCCCChHHHHHHHhcCCCC
Confidence 999999999999999999999999644
No 5
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=4.8e-129 Score=1051.86 Aligned_cols=649 Identities=28% Similarity=0.480 Sum_probs=604.6
Q ss_pred CCCCCCCCCCcCChHHH--HHHHHHHHHHHHHHHHc----CCChHHHHHHHHHhhHHHhhhhchhhhhhccCCchHHHHH
Q 037955 50 GLYGFDHLKSPNGFQRF--VDDAIERSSELVNYISE----MPSSVEIIRAMDEISDAVCSVVDSAELCRQTHPDREFVEE 123 (733)
Q Consensus 50 gl~~~~~l~~p~~~~~~--~~~~i~~~~~~i~~i~~----~~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~e~r~a 123 (733)
|-..++.+.|++++.-. +.++|++..++.+.++. .++++|+|+|+.++.+++..++.++.+..+|++++++|.+
T Consensus 36 g~~~~~~~~s~~~i~~~~~~~eii~~~~~v~d~v~~~~~ee~s~en~vepla~ie~el~~~~~~L~f~~~vs~~~~~R~a 115 (718)
T KOG2089|consen 36 GANDPPWLLSASDIEVRPGTREIIQQSKAVYDSVATLKLEEPSYENVVEPLAKIEVELTVVIGMLVFPQHVSPDKELRKA 115 (718)
T ss_pred ccCCCCCCCCHhHHhhHHHHHHHHHHHHHHHHHhhhccccccchHHHHhHHHHHHHHHHHHHhhhhhhhhcCCchhhhhh
Confidence 33567777789999877 99999999999999963 3799999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHhhCCHHHHHHHHHHh--hcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHH
Q 037955 124 ASKASMRISEYLHYLNTNHTLYDAVKKAE--LDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCRE 201 (733)
Q Consensus 124 a~~~~~~l~~~~~~l~~~~~Ly~~l~~~~--~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~ 201 (733)
++++.+++.++..++.+++++|.+++++. +....++||++|+++++++++++||++|++++|+++++|+++|++++..
T Consensus 116 s~~~~~~~~~~~~r~~~r~di~~~~~~i~e~~~~dslspe~~RylE~~ike~k~nGL~L~~~kr~~ik~ikk~l~~l~~~ 195 (718)
T KOG2089|consen 116 STEADKKLDEFDLRLSMRKDIYNRFQAIYEKPAKDSLSPEAQRYLEKLIKEGKLNGLHLDDDKREEIKEIKKELSELSIK 195 (718)
T ss_pred hhcchHHHHHHHHHhhhhhhHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHhcCCCCChhHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999983 4567899999999999999999999999999999999999999999999
Q ss_pred HHhhcccCC---CCc--ccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHH
Q 037955 202 FNQNIINDP---GHV--DIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEV 276 (733)
Q Consensus 202 F~~ni~~~~---~~v--~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~ 276 (733)
|++|+++++ +++ +..+++|+|++++..++.+. ++.+...|+||+++++|.|||++|+++.+
T Consensus 196 f~~nvne~~~~~t~litd~~el~glPps~L~~la~~~--------------~~k~sgpwkiTl~~p~~~pvmk~c~~r~t 261 (718)
T KOG2089|consen 196 FSKNVNEDTKKFTFLITDKKELEGLPPSFLQSLAKDE--------------DDKGSGPWKITLKYPHYFPVMKHCKIRQT 261 (718)
T ss_pred HHhhhhhcccccceeeecHHHhccCCHHHHHHHhhcc--------------cCCCCCceEEEecccchhHHHHhCCcHHH
Confidence 999999888 555 33489999999999998851 22222349999999999999999999999
Q ss_pred HHHHHHHhccCCc----chHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHH
Q 037955 277 RKMVYIQGHSVPQ----ANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAI 352 (733)
Q Consensus 277 Rk~~~~a~~~~~~----~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L 352 (733)
|++||+|++..+. +|..+|++|+.+|.++|++|||.|||+|+|+.+||+|+++|..||++|.++++|.+..|...+
T Consensus 262 R~~v~~A~~~ra~~~~~eNs~ii~~l~~lR~~lAklLGY~t~Ad~~L~~kMa~ss~tv~~fl~dL~~kL~pl~~~e~~v~ 341 (718)
T KOG2089|consen 262 RKTVWRAYNSRASEGDLENSAIIEQLLKLRLELAKLLGYSTYADYSLAMKMAKSSETVVEFLDDLSQKLRPLGIDERSVL 341 (718)
T ss_pred HHHHHHHHhccCCcccccccHHHHHHHHHHHHHHHHhCchhHHHHHHHHHhhcCcHHHHHHHHHHHHHhhhcchhHHHHH
Confidence 9999999987654 599999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHH---hcCC-CCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccc
Q 037955 353 KNFKRK---SCGQ-KYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWH 428 (733)
Q Consensus 353 ~~~~~~---~~g~-~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh 428 (733)
.++++. ..|. -...++.||+.||.+++....|+||++.+++|||++.|++|||.+++.||||.|++++. .++||
T Consensus 342 ~elk~~e~k~~g~~~~~~~~~wD~~yy~~~~~e~~f~vd~~~LreyFPl~~v~~Gl~~i~q~LFglkf~e~~d--a~vWh 419 (718)
T KOG2089|consen 342 LELKKGEAKDRGAPFDGKLTAWDLRYYMKRVEESKFDVDQEDLREYFPLPVVLSGLFGIYQTLFGLKFEEATD--AEVWH 419 (718)
T ss_pred HHHHHHHHHhcCCCccccchhhHHHHHHHHHHHHhcCCCHHHHHhhCCcHHHHHHHHHHHHHHhCceeeecCC--chhcc
Confidence 888874 3342 13579999999999999999999999999999999999999999999999999999985 68999
Q ss_pred cccceeEEEcC-CCCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCCC-CCcChhhHHHH
Q 037955 429 PDVLKLSLQHP-EEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNLS-VRLNHHEVETL 506 (733)
Q Consensus 429 ~dV~~~~v~d~-~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~-~lL~~~~v~TL 506 (733)
+||++|.|.|. +|.++|+||+|+|+|+||++|++||.+++++...+|+++.|+++++|||++|.... +||.|++|+|+
T Consensus 420 ~dVr~y~v~D~~Sg~~vG~fY~D~y~RegK~gh~~~f~l~~~~~~~~ss~~~PVaalv~nfS~p~~~kpsll~~~ev~t~ 499 (718)
T KOG2089|consen 420 ADVRVYTVKDSASGNPVGYFYLDPYPREGKYGHAAVFGLQPGCLQKDSSRRIPVAALVCNFSKPQSDKPSLLGHDEVETL 499 (718)
T ss_pred cceeEEeccCCCCCceeeEEEeccCCCccccchhhhhccchhhhccCCccccchHHHHHhcCCcccCCCCccchHHHHHH
Confidence 99999999985 48999999999999999999999999999999999999999999999999998654 99999999999
Q ss_pred HHHHHHHHHHhhhcCCCCccCCC-cccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHH
Q 037955 507 FHEFGHALHSLLSRTDYQHFSGT-RVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATE 585 (733)
Q Consensus 507 fHEfGHalH~lls~~~~~~~sgt-~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~ 585 (733)
||||||+||++|+++.|..++|+ +++|||+|+||||||||+|++.+|..+|+||+||+++|++++++|+.++..++|+.
T Consensus 500 FheFGh~~q~ll~Qa~~~~fsG~~~vewDave~psq~Lenwv~~~d~L~~lS~Hy~tge~l~eEl~~kl~~~r~~~~gl~ 579 (718)
T KOG2089|consen 500 FHEFGHVLQHLLTQADFARFSGPRNVEWDAVEVPSQFLENWVWDPDTLRSLSKHYKTGEPLPEELLKKLILTRTVNAGLF 579 (718)
T ss_pred HHHHhHHHHHHHhcCccccccCcccCCcchhhchHHHHHHhccCchHhhhhcccccCCCccHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999 79999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCCCCCCcccccccccc--cccccchhHHHHHHHHHHHHHHH
Q 037955 586 LQRQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFI--NYGAGYYSYLYAKCFAATIWQKL 663 (733)
Q Consensus 586 ~~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~--~Y~a~YYsYl~s~v~A~di~~~~ 663 (733)
++|||+++.||+++|+... .+..+.|++++++|+++|..++++.+|+|+||| ||+|+||+|+||+|+|+|||+.+
T Consensus 580 tlrqL~~a~~D~~lht~~d---~~~~~~~~~l~~~i~~~p~~~~d~~pcsF~hifa~gy~A~yY~yLWsEv~aaDif~t~ 656 (718)
T KOG2089|consen 580 TLRQLVLADFDLELHTKTD---ADLADTYRQLCQEISIVPATPGDNMPCSFGHIFAGGYAAGYYSYLWSEVLAADIFSTF 656 (718)
T ss_pred HHHHHHHHhhhHHHhhccc---cchHHHHHHhchhheecCCCCCCCCCccccchhcCchHHHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999998744 367799999999999999999999999999999 69999999999999999999988
Q ss_pred HH-cCCCCH-HHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcCCChhh
Q 037955 664 CQ-EDPLSL-TTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVKLMEDK 727 (733)
Q Consensus 664 ~~-~~~~~~-~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g~~~~~ 727 (733)
|+ +|+.|. ++|.+||++||++||+.+++|+|++|+|| +|+.+|||+++||....
T Consensus 657 fe~~g~~N~~~~G~ryR~tiLa~GG~~~~~e~f~~FlGR----------ePS~~Afl~s~gl~~~~ 712 (718)
T KOG2089|consen 657 FEQEGEDNIKEVGMRYRNTILAPGGGKDPMEVFKRFLGR----------EPSQEAFLKSLGLQEAS 712 (718)
T ss_pred hhhcCCccHHHHHHHHHHhhhcCCCCccHHHHHHHhhCC----------CCChhHHHHhhcccccc
Confidence 87 889998 99999999999999999999999999999 59999999999997543
No 6
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, whose members show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both nuclear- and mitochondrially encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=100.00 E-value=1.3e-101 Score=866.13 Aligned_cols=435 Identities=46% Similarity=0.820 Sum_probs=410.9
Q ss_pred HhhhCCCHHHHHHHHHHhccCCcchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHH
Q 037955 267 ILQWTSDDEVRKMVYIQGHSVPQANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKAD 346 (733)
Q Consensus 267 vL~~~~d~~~Rk~~~~a~~~~~~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~ 346 (733)
||++|+|+++||++|.|+.+...+|.++|.+|+++|+++|++|||+||++|++.++|++||++|.+||.+|.++++|.++
T Consensus 2 ~l~~~~d~~~Rk~a~~a~~~~~~~n~~il~~l~~lR~e~A~llGy~sya~~~l~~~ma~spe~v~~fL~~l~~~~~p~~~ 81 (458)
T cd06457 2 LLASVPNESVRELVYRAFHSSSAEQLELLEELLSSRAELAQLLGFESYAHRALRGKMAKNPENVMEFLTELSEKLRPRAE 81 (458)
T ss_pred ccccCCCHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHH
Confidence 68999999999999999999888999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCc
Q 037955 347 EEFEAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGES 426 (733)
Q Consensus 347 ~e~~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~ 426 (733)
+|++.|++++++..|...+.|.|||+.||.++++++++++|++++++|||++.|++||+.++++||||+|++++..++++
T Consensus 82 ~e~~~L~~~~~~~~~~~~~~L~pwD~~yy~~~~~~~~~~~d~~~l~~YFpl~~vl~gl~~~~~~lfgi~~~~~~~~~~~~ 161 (458)
T cd06457 82 EELAVLRDMKRKVLGKPLSSLAPWDRDYYTGQYRQSRFDSEPSNLSPYFSLGTVMEGLSRLFSRLYGIRLVPVPLAPGEV 161 (458)
T ss_pred HHHHHHHHHHHHhcCCCcCCCCHHHHHHHHHHHHHhhcCCChHHhcccCcHHHHHHHHHHHHHHHhCeEEEecCCCCCCC
Confidence 99999999998775422357999999999999999999999999999999999999999999999999999998767899
Q ss_pred cccccceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCC----------CCccccEEEEEccCCCCCCC-C
Q 037955 427 WHPDVLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSE----------TEYQLPVVALICNFPGSHNL-S 495 (733)
Q Consensus 427 wh~dV~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~----------g~~~~P~v~l~~Nf~~~~~~-~ 495 (733)
|||||++|+|+|++|+++|++|+|+|+|+||++||||+++++++...+ |.+|.|+++++|||++|.++ |
T Consensus 162 Wh~dV~~~~v~d~~~~~lG~~YlDl~~R~~K~~~a~~~~~~~~~~~~~~~~~~~~~~~g~~q~Pv~~lvcnf~~p~~~~p 241 (458)
T cd06457 162 WHPDVRKLDVVHEDEGLLGVIYCDLFSRPGKPPGAAHFTIRCSRRLDDDDVAERGGRGGTYQLPVVALMCNFPPPSPSGP 241 (458)
T ss_pred cCccceEEEEEeCCCCEEEEEEeecCCCCCCCCCcceeccccccccCcccccccccCCCceeCCeEEEECCCCCCCCCCC
Confidence 999999999999888899999999999999999999999999987665 78899999999999998765 5
Q ss_pred CCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHH
Q 037955 496 VRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQ 575 (733)
Q Consensus 496 ~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~ 575 (733)
+||+|++|.|||||||||||++|++++|+.++||++++||||+||||||+|+|+|++|+.||+||+|++++|++++++++
T Consensus 242 ~lL~~~~v~TLfHEfGHalH~~ls~~~~~~~sgt~~~~d~vE~pS~~~E~~~~~~~~L~~~a~h~~t~e~ip~~l~~~l~ 321 (458)
T cd06457 242 TLLSPHEVETLFHEMGHAMHSMLGRTEYQHVSGTRCATDFVEVPSILMEYFASDPRVLKLFARHYSTGEPLPEEMLARLL 321 (458)
T ss_pred CCcCHHHHHHHHHHHhHHHHHHHcCCCccccCCCCCCcchhhcCHHHHHHHHhhHHHHHHHhcccCCCCcCcHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhhhhhhHHHHHHHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCCCCC-CcccccccccccccccchhHHHHHH
Q 037955 576 GARDMFAATELQRQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWNHVEG-THWHIRFSHFINYGAGYYSYLYAKC 654 (733)
Q Consensus 576 ~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~~~~-~~~~~~f~Hl~~Y~a~YYsYl~s~v 654 (733)
++++++.++.+++|+.++.||+++|+..+....++.++|.++.+++.+.+..++ .+|+++|+||+||+|+||+|+||++
T Consensus 322 ~~~~~~~~~~~~rq~~~a~fD~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~Hl~gy~a~YYsYl~a~v 401 (458)
T cd06457 322 ASKNSFAALETQQQIVYALLDQELHGEQPLSPTFTSDVLRDSTEIFYGLPYVPGGTAWQLRFGHLVGYGATYYSYLFDRA 401 (458)
T ss_pred HhhhhhHHHHHHHHHHHHHHHHHHhCCCcccccCHHHHHHHHHHHhcCCCCCCCCCCCccccccccCccccchHHHHHHH
Confidence 999999999999999999999999976544344678999999999988776665 7788999999999999999999999
Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCC
Q 037955 655 FAATIWQKLCQEDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDG 701 (733)
Q Consensus 655 ~A~di~~~~~~~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd 701 (733)
+|+++|+.+|+++++|+++|++||+.||++|||++|+|++++|+|||
T Consensus 402 la~di~~~~f~~~~~n~~~g~~y~~~iL~~Ggs~~p~e~l~~flGrd 448 (458)
T cd06457 402 IASKIWQKLFAADPLSREAGERLREELLKHGGGKDPWELLAGVLGKP 448 (458)
T ss_pred HHHHHHHHHHHhCCCCHHHHHHHHHHHccCCCCcCHHHHHHHHcCCC
Confidence 99999998888899999999999999999999999999999999995
No 7
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=100.00 E-value=2e-95 Score=820.61 Aligned_cols=468 Identities=37% Similarity=0.677 Sum_probs=437.9
Q ss_pred HHHHHHHHHHHHHHhhcccCCCCcccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhh
Q 037955 190 QLNMDIFQLCREFNQNIINDPGHVDIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQ 269 (733)
Q Consensus 190 ~l~~~i~~L~~~F~~ni~~~~~~v~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~ 269 (733)
+++++|.+|+++|++|+.+ +.+++.++|+
T Consensus 1 ~~~~~~~~~~~~f~~~~~~---------------------------------------------------~~~~~~~~l~ 29 (472)
T cd06455 1 ALKKELSELCQEFSKNLNE---------------------------------------------------KYPDYFPVMK 29 (472)
T ss_pred ChhHHHHHHHHHHHHhhhh---------------------------------------------------cCccHHHHHh
Confidence 3788999999999999872 2367999999
Q ss_pred hCCCHHHHHHHHHHhccC-CcchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHH
Q 037955 270 WTSDDEVRKMVYIQGHSV-PQANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEE 348 (733)
Q Consensus 270 ~~~d~~~Rk~~~~a~~~~-~~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e 348 (733)
+++|+++||++|+++.+. ..+|.++|.+++++|+++|+++||+||+++++.++|++||++|.+||+.|.+.++|.+++|
T Consensus 30 ~~~d~~~Rk~~~~a~~~~~~~~n~~~l~~ll~~R~~~A~l~Gy~s~a~~~l~~~m~~s~e~v~~fl~~l~~~~~p~~~~e 109 (472)
T cd06455 30 YAKNAETRKAMWVAFQNRGGPENVPLLEELVALRHELARLLGYKSHADYVLEDRMAKSPETVRDFLEDLSKKLKPLAEKE 109 (472)
T ss_pred cCCCHHHHHHHHHHHHhhcchhHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 999999999999999987 7899999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccc
Q 037955 349 FEAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWH 428 (733)
Q Consensus 349 ~~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh 428 (733)
++.|.+++++.+|....+++|||+.||.++++++++++|++.+++|||++.|++|++.++.++||++|++++. .++||
T Consensus 110 ~~~l~~~k~~~~~~~~~~l~~wD~~y~~~~~~~~~~~~~~~~~~~yf~~~~~~~~i~~~~~~lfg~~~~~~~~--~~~w~ 187 (472)
T cd06455 110 LAELLELKKKEVPEAGDRIYPWDLAYYMERVEEEKYDVDQEKIREYFPLEVVIEGMLDIYQRLFGLRFEEVPD--ASVWH 187 (472)
T ss_pred HHHHHHHHHHhcCCCCCCCCHhhHHHHHHHHHHHhcCCCHHHHhccCcHHHHHHHHHHHHHHHhCeEEEeCCC--CCCCC
Confidence 9999999998877322269999999999999999999999999999999999999999999999999999874 37999
Q ss_pred cccceeEEEcCC-CCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCCcChhhHHHH
Q 037955 429 PDVLKLSLQHPE-EGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVRLNHHEVETL 506 (733)
Q Consensus 429 ~dV~~~~v~d~~-~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~lL~~~~v~TL 506 (733)
|||++|+|+|++ |+++|++|||+++|+||++||||+++++++...+|.+|.|+++|+|||+++.++ |+||++++|.||
T Consensus 188 ~dv~~~~v~d~~~~~~~g~~ylD~~~R~gK~~Ga~~~~~~~~~~~~~g~~~~P~~~i~~Nf~~~~~~~p~ll~~~~V~TL 267 (472)
T cd06455 188 EDVRLYSVWDADTGEFLGYFYLDLHPREGKYGHAANFGLQPGFLLPDGSRQYPVAALVCNFPKPTADKPSLLRHDEVETF 267 (472)
T ss_pred CcceEEEEEECCCCCEEEEEEeecCCCCCCCCCccccccccceecCCCCEeCCEEEEECcCCCCCCCCCCCCCHHHHHHH
Confidence 999999999965 789999999999999999999999999998888898899999999999998754 599999999999
Q ss_pred HHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHH
Q 037955 507 FHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATEL 586 (733)
Q Consensus 507 fHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~ 586 (733)
|||||||||++|++++++.++|+++++||||+|||+||+|+|+|++|..+++|++|++++|++++++++++++++.++++
T Consensus 268 fHEfGHalH~~ls~~~~~~~sg~~~~~d~aE~pS~~~E~~~~~~~~l~~l~~h~~t~e~i~~~li~~~~~~~~~~~~~~~ 347 (472)
T cd06455 268 FHEFGHVIHHLLGRTKYARFSGTRVERDFVEAPSQMLENWCWEPEVLKRLSKHYKTGEKIPDELIERLIASRHFNRGLFY 347 (472)
T ss_pred HHHHHHHHHHHhcCCCccccCCCcCChhhhhcchHHHHHHhcCHHHHHHHhhccCCCCCCCHHHHHHHHHhhhhhHHHHH
Confidence 99999999999999999899999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCCCCCCccccccccccc-ccccchhHHHHHHHHHHHHHHHHH
Q 037955 587 QRQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFIN-YGAGYYSYLYAKCFAATIWQKLCQ 665 (733)
Q Consensus 587 ~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~~-Y~a~YYsYl~s~v~A~di~~~~~~ 665 (733)
++|+.++.||+++|...+. +.++.++|.++++++.+.+..++.+|+++|+||++ |+++||+|+||+++|+++|+++++
T Consensus 348 ~~q~~~a~fd~~~h~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~H~~~~Y~a~yY~Y~~a~~la~~~~~~~~~ 426 (472)
T cd06455 348 LRQLFFALFDLALHTGDPA-DLDTTKLYNDLREEISLIPSTEGTHGPASFGHLAGGYDAGYYGYLWSEVFAADMFSSFFK 426 (472)
T ss_pred HHHHHHHHHHHHHhCCCCC-CCCHHHHHHHHHHHhCCCCCCCCCCccccccccccCcccchHHHHHHHHHHHHHHHHHHh
Confidence 9999999999999986543 36899999999999998766666788899999996 799999999999999999999999
Q ss_pred cCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHc
Q 037955 666 EDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEV 721 (733)
Q Consensus 666 ~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~ 721 (733)
+|++|+++|++|+++||++|||++|.|++++|+|| +|+++||++++
T Consensus 427 ~~~~~~~~g~~y~~~~L~~Ggs~~p~ell~~flGr----------~~~~~a~~~~~ 472 (472)
T cd06455 427 DGLLNPEVGLRYRDTVLAPGGSKDAADMLKDFLGR----------EPNNDAFLKSL 472 (472)
T ss_pred cCCCCHHHHHHHHHHHhcCcCCcCHHHHHHHHhCC----------CCChhHHhhcC
Confidence 99999999999998999999999999999999999 59999999975
No 8
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=100.00 E-value=1.1e-95 Score=807.06 Aligned_cols=418 Identities=34% Similarity=0.614 Sum_probs=395.3
Q ss_pred chHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCCCCc
Q 037955 290 ANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVHLEP 369 (733)
Q Consensus 290 ~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~l~p 369 (733)
+|.++|++|+++|+++|++|||+||++|+++++|++||++|.+||++|.++++|.+++|++.|.+++++..+ ...|+|
T Consensus 1 ~N~~~l~~l~~lR~e~A~llG~~s~a~~~l~~~ma~sp~~v~~Fl~~l~~~~~p~~~~e~~~L~~~~~~~~~--~~~l~~ 78 (422)
T cd06456 1 DNRPLIEEILALRAEKAKLLGFENYAEYSLADKMAKSPEAVLEFLEDLAPKAKPQAKKELAELQAFAKEEGG--EDELEP 78 (422)
T ss_pred ChHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCCCH
Confidence 589999999999999999999999999999999999999999999999999999999999999999998765 457999
Q ss_pred CCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccceee
Q 037955 370 WDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYL 449 (733)
Q Consensus 370 WD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~yl 449 (733)
||+.||.++.+++.+++|++++++|||++.|++||+.++++||||+|++++. .++|||||++|+|+|++++++|++|+
T Consensus 79 wD~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~gl~~~~~~lfgi~~~~~~~--~~~Wh~dV~~~~v~d~~~~~lG~~yl 156 (422)
T cd06456 79 WDWAYYSEKLRKEKYDLDEEELRPYFPLEKVLDGLFELAERLYGITFKERTD--LPVWHPDVRVYEVFDKDGSHIGLFYL 156 (422)
T ss_pred HHHHHHHHHHHHHhcCCCHHHHhccCCHHHHHHHHHHHHHHHcCeeEEECCC--CCCCCCCceEEEEEeCCCCeEEEEEE
Confidence 9999999999999999999999999999999999999999999999999874 38999999999999988899999999
Q ss_pred ecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCC
Q 037955 450 DLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSG 528 (733)
Q Consensus 450 Dl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sg 528 (733)
|+|+|+||++||||+++++++. +|.++.|+++++|||++|.++ |+||++++|.|||||||||+|+++++++|+.++|
T Consensus 157 Dl~~R~~K~~ga~~~~~~~~~~--~~~~~~P~~~l~~nf~~~~~~~p~lL~~~~v~tLfHEfGHalH~~ls~~~~~~l~~ 234 (422)
T cd06456 157 DLYAREGKRGGAWMNNLRSQSK--NGLGQKPVAYLVCNFTKPAGGKPALLTHDEVTTLFHEFGHALHHLLTDVEYPSLGG 234 (422)
T ss_pred eccCCCCCCCCceeeccccccc--CCCCCCCEEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhcCCccccCC
Confidence 9999999999999999988765 678899999999999988754 4999999999999999999999999999999999
Q ss_pred CcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHHHHHHHhHHHhhcCCCCC-C
Q 037955 529 TRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQIFYALVDQTLFGERLGQ-T 607 (733)
Q Consensus 529 t~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~-~ 607 (733)
+++++||||+|||+||+|+|+|++|+.+++||+||+++|++++++++++++++.++++++|+.++.||+++|+..+.. .
T Consensus 235 ~~~~~d~~E~pS~~~E~~~~d~~vL~~~s~h~~t~~~lp~~l~~~~~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~ 314 (422)
T cd06456 235 TNVEWDFVELPSQFMENWAWEPEVLKLFAKHYETGEPLPDELIDKLLAARNFNSGFATVRQLEFALLDLALHSLTDPEIL 314 (422)
T ss_pred CcCchhHhhccHHHHHHHhcCHHHHHHHHhhcCCCCCCCHHHHHHHHHHhhHhHHHHHHHHHHHHHHHHHHhcCCccccc
Confidence 999999999999999999999999999999999999999999999999999999999999999999999999864432 3
Q ss_pred CCHHHHHHHHHhhccCCCCCCCCcccccccccc--cccccchhHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHhcCC
Q 037955 608 RDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFI--NYGAGYYSYLYAKCFAATIWQKLCQEDPLSLTTGTTLRTKILQHG 685 (733)
Q Consensus 608 ~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~--~Y~a~YYsYl~s~v~A~di~~~~~~~~~~~~~~G~~~r~~vL~~G 685 (733)
..+..++.++.+++.+++..++++|+++|+||| ||+|+||||+||+++|+|||+.|+++|++|+++|++||++||++|
T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~Hi~~~gY~A~YYsYlws~vla~di~~~f~~~~~~~~~~G~~~r~~iL~~G 394 (422)
T cd06456 315 DVVQFELDALRKEGLVIPPPPPRYFSNYFSHIFSGGYAAGYYSYKWAEVLDADAFSAFEEEGIFNRETGRRFRDTILSKG 394 (422)
T ss_pred chHHHHHHHHHHhcCCCCCCCCCCCCCccCcCCCCCcchhhHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhcC
Confidence 568889999988888777777888889999998 799999999999999999999999999999999999999999999
Q ss_pred CCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcCC
Q 037955 686 GAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVKL 723 (733)
Q Consensus 686 gs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g~ 723 (733)
||++|.+++++|+|| +|+++|||+.+|+
T Consensus 395 gs~~~~e~~~~F~Gr----------~p~~~a~l~~~gl 422 (422)
T cd06456 395 GSRDPMELFRAFRGR----------DPSIEALLRRRGL 422 (422)
T ss_pred CCcCHHHHHHHhcCC----------CCChHHHHHHcCC
Confidence 999999999999999 6999999999986
No 9
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. 
In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys, and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium-sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC); Neurolysin, mitochondrial precursor (3.4.24.16 from EC); Thimet oligopeptidase (3.4.24.15 from EC); Dipeptidyl carboxypeptidase (3.4.15.5 from EC); Oligopeptidase A (3.4.24.70 from EC); Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=100.00 E-value=1.5e-89 Score=774.83 Aligned_cols=444 Identities=43% Similarity=0.741 Sum_probs=402.6
Q ss_pred HhhhCCCHHHHHHHHHHhccCCc------chHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHh
Q 037955 267 ILQWTSDDEVRKMVYIQGHSVPQ------ANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKM 340 (733)
Q Consensus 267 vL~~~~d~~~Rk~~~~a~~~~~~------~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~ 340 (733)
||++|+|+++||++|+++.+... +|..+|.+++++|+++|+++||+||+++++..+|+++|++|.+||.++.++
T Consensus 1 ~l~~~~d~~~Rk~~~~a~~~~~~~~~~~~~n~~~l~~l~~~R~~~A~llGy~s~~~~~l~~~m~~~~~~v~~fL~~~~~~ 80 (458)
T PF01432_consen 1 VLKYSPDRELRKKAYRAYYSRYSDNQPTLDNAAILNELLKLRNELAKLLGYPSYAEYSLQDKMAKNPENVLDFLDELVKK 80 (458)
T ss_dssp HHHC-S-HHHHHHHHHHHHTTTHHTHHHHHHHHHHHHHHHHHHHHHHHTT-SSHHHHHHTTSTTTSHHHHHHHHHHHHHH
T ss_pred CCCcCcCHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHhhhcCChHHHHHHHHHHHHH
Confidence 68899999999999999998765 789999999999999999999999999999999999999999999999999
Q ss_pred hhHHHHHHHHHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeec
Q 037955 341 IKPKADEEFEAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVP 420 (733)
Q Consensus 341 ~~p~~~~e~~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~ 420 (733)
++|.++++++.+++++++.+|.. .+|+|||++||.+++++..+++++..+++|||+++|++||+.+++++|||+|++++
T Consensus 81 ~~p~~~~~~~~l~~~~~~~~g~~-~~l~~wD~~y~~~~~~~~~~~~~~~~~~~~f~~~~~~~g~~~l~~~lfgi~~~~~~ 159 (458)
T PF01432_consen 81 LKPLLERELELLKKLKKKRLGLE-KKLRPWDVAYYMEQYRQERYDLDEEELSPYFPLEEVLEGLFELAERLFGIRFEEVP 159 (458)
T ss_dssp HHHHHHHHHHHHHHHHHHHTT-S-SSBBGGGHHHHHHHHHHHHTSSSHHHHGGG-BHHHHHHHHHHHHHHHHTEEEEECT
T ss_pred HHHHHHHHHHHHHHHHHHhcCCc-cccccchhHHHhhHHHHHHhccchhhcCCcCcHHHHHHHHHHHHHHHhcEEEEecc
Confidence 99999999999999999988832 49999999999999999999999999999999999999999999999999999998
Q ss_pred CCCCCccccccceeEEEcC--CCCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCC
Q 037955 421 LAPGESWHPDVLKLSLQHP--EEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVR 497 (733)
Q Consensus 421 ~~~~~~wh~dV~~~~v~d~--~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~l 497 (733)
...+++|||||++|+|+|. ++.++|++|+|+|+|+||++||||+++++++ .+|+++.|+++|+|||++|..+ ++|
T Consensus 160 ~~~~~~w~~dv~~~~v~d~~~~~~~ig~~ylDl~~R~gK~~ga~~~~~~~~~--~~~~~~~~~~~i~~n~~~~~~~~~~l 237 (458)
T PF01432_consen 160 DADGEVWHPDVRKFEVWDEIFSGMFIGYIYLDLYPRPGKRSGAFCFTLRPSR--SDGERQLPVPYIFCNFTGPSAGKPSL 237 (458)
T ss_dssp CGGHHHSSTT-EEEEEEETHTTHCECEEEEEEES--TTS-SS-EEEEEEC-B--TTSTCECEEEEEEEEE-S-BTTC--B
T ss_pred cccccceecceeEEEEeehhhcccchhcccccchhcCCCCCCceeCCccCcc--ccccCCCCceEEEecCCCCCCCCCCc
Confidence 6557899999999999997 6788999999999999999999999999988 7899999999999999998765 499
Q ss_pred cChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHh
Q 037955 498 LNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGA 577 (733)
Q Consensus 498 L~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~ 577 (733)
|++++|.|||||||||||++|+++++++++|+++++|++|+||+|||+|+|+|.+++.+++|+++++++|+++++.+.++
T Consensus 238 l~~~~v~tLfHE~GHa~H~~ls~~~~~~~sg~~~~~d~aE~~S~~~E~~~~~~~~l~~~~~~~~~~~~i~~~~l~~~~~~ 317 (458)
T PF01432_consen 238 LSHDDVETLFHEFGHAMHSLLSRTKYQHLSGTRVPMDFAEFPSQFMENWLWDPLVLKAFSRHYETGEPIPEELLEDLIAS 317 (458)
T ss_dssp -SHHHHHHHHHHHHHHHHHHHCCCSSGGGSTTSS-CHHCHHHHHHHHHHGGCHHHHHHH-BSTTTHHHHHHHHHHHHHHT
T ss_pred cChhhHHHHHHHHhHHHHHHHhccccccccCCchhHHHHhcchHHHHHhhhchhhhhhhccChhhhhhHHHHHHHhhhhh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhhhHHHHHHHHHHHHhHHHhhcCCCCCC---CCHHHHHHHHHhhccCCCCCCCCcccccccccc--cccccchhHHHH
Q 037955 578 RDMFAATELQRQIFYALVDQTLFGERLGQT---RDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFI--NYGAGYYSYLYA 652 (733)
Q Consensus 578 ~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~---~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~--~Y~a~YYsYl~s 652 (733)
++.+.++.+.+|+.++.||+.+|+..+... .++.++|.++++++.+.+..++.+|+++|+||+ +|+++||+|+||
T Consensus 318 ~~~~~~~~~~~ql~~a~fd~~~h~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~f~hl~~~~Y~a~yY~Y~ya 397 (458)
T PF01432_consen 318 RNFFAAIFLFRQLLFALFDQELHESPEDGEPESEDLNELYRELQKEYYGDPSDPDDYFPASFSHLFSHFYAAGYYSYLYA 397 (458)
T ss_dssp TTTTHHHHHHHHHHHHHHHHHHCCHCCCTHHHHHHHHHHHHHHHHHCCTTTCBTTCCGGGGGGGGHCCCGTTTTTCHHHH
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHhccccCCccchhHHHHHHHHHHHhcCcchhccccccceehhhccCccCCCCchhHHH
Confidence 999999999999999999999998755322 357899999999999988778888889999998 799999999999
Q ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcCC
Q 037955 653 KCFAATIWQKLCQEDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVKL 723 (733)
Q Consensus 653 ~v~A~di~~~~~~~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g~ 723 (733)
+++|+++|+.+++++++|+++|.++++++|++|||++|.|++++|+|+ +|++++|++.+|+
T Consensus 398 ~~~a~~~~~~~~~~~~~n~~~g~~~~~~~L~~Ggs~~~~e~l~~~~g~----------~~~~~~~~~~~~~ 458 (458)
T PF01432_consen 398 EVLAADLFSQFFEEDPLNRETGRRFYKEFLSPGGSKDPLELLKKFLGR----------EPSPDAFLKALGL 458 (458)
T ss_dssp HHHHHHHHHHHHHCHTTCHHHHHHHHHHHCTTTTSS-HHHHHHHCCSS----------TTSSHHHHHHHT-
T ss_pred HHHHHHHHHHHHhcCCcchHHHHHHHHHHhcCCCCCCHHHHHHHhCCC----------CCChHHHHHHcCC
Confidence 999999999999989999999999999999999999999999999999 6999999999986
No 10
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=100.00 E-value=4e-64 Score=551.85 Aligned_cols=358 Identities=29% Similarity=0.452 Sum_probs=326.1
Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCCCCcCCH
Q 037955 293 EVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVHLEPWDE 372 (733)
Q Consensus 293 ~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~l~pWD~ 372 (733)
++|++++++|+++|+++||+||+++++..+|++||++|.+||++|.+.++|.++++++.+++.+++..+ ...+.||
T Consensus 2 ~~l~~lv~lr~e~A~~lGy~~~~d~~l~~~~~~~~e~v~~f~~~l~~~~~p~~~~~~~~l~~~~~~~~~--~~~~~~~-- 77 (365)
T cd06258 2 ALLEELVSLRNQLARLLGYENFADYKLALQEAKSPETVEGFFEELKRKLRPLLAKLREEISAAKQKEEE--IYGELPA-- 77 (365)
T ss_pred hHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--Ccccccc--
Confidence 579999999999999999999999999999999999999999999999999999999999988766554 4468888
Q ss_pred HHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccceeeecc
Q 037955 373 AYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDLY 452 (733)
Q Consensus 373 ~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl~ 452 (733)
.+++|++.+++|||++.++++++.++ |+|++
T Consensus 78 ----------~~~~~~~~~~~~f~~~~~~~~~~~~~---------------------------------------~lD~~ 108 (365)
T cd06258 78 ----------RYDVDSALLKEFFDAERPWEGALPFF---------------------------------------YLDLY 108 (365)
T ss_pred ----------cccCCHhhccCcCChHHHHHHHHHhH---------------------------------------hccCC
Confidence 45688999999999999999876533 99999
Q ss_pred cCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCC-CCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcc
Q 037955 453 SRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNL-SVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRV 531 (733)
Q Consensus 453 ~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~-~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~ 531 (733)
+|+||++||||+++++++ +++.|.+.|+|||+++.++ ++||++++|.|||||||||+|+++++++|+.++|+++
T Consensus 109 ~R~gK~~~a~~~~~~~~~-----~~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tl~HE~GHa~h~~l~~~~~~~~~g~~~ 183 (365)
T cd06258 109 DRKGKYPHGFCTGLDPGF-----NRQDKDVRILANFTSPAAPDPVLLGHDDINTLFHEFGHAVHFLLIQQRYPFQERTPT 183 (365)
T ss_pred CCCCCCCCCeeccccCCC-----CCCCCeEEEEccCCCCCCCCCCcCCHHHHHHHHHHHhHHHHHHHhcCCCCcCCCCCC
Confidence 999999999999987665 3467999999999998755 4999999999999999999999999999888899999
Q ss_pred cchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHHHHHHHhHHHhhcCCCCCCCCHH
Q 037955 532 ALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQIFYALVDQTLFGERLGQTRDTS 611 (733)
Q Consensus 532 ~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~ 611 (733)
++||+|+||++||+|++++.+|+.+++|++++ ++|.++++++++....+.++.+.+|+.++.||+++|..... ..+++
T Consensus 184 ~~~~~E~~S~~~E~~~~~~~~L~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~-~~~~~ 261 (365)
T cd06258 184 STDFAEAQSMFLESFATDPEWLERYARHYQGG-VVPDELIEKLIAARLPNTLYETRRILVVAKFEKALYENPDR-ELELQ 261 (365)
T ss_pred CccHHhccHHHHHHHHCCHHHHHHHhhhcCCC-CCcHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCCc-cCCHH
Confidence 99999999999999999999999999999988 89999999999988888889999999999999999986442 34799
Q ss_pred HHHHHHHhhccCCCCCCCCccccccccccc-ccccchhHHHHHHHHHHHHHHHHHcC---CCCHHHHHHHHHHHhcCCCC
Q 037955 612 SIVADMKRQHTSWNHVEGTHWHIRFSHFIN-YGAGYYSYLYAKCFAATIWQKLCQED---PLSLTTGTTLRTKILQHGGA 687 (733)
Q Consensus 612 ~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~~-Y~a~YYsYl~s~v~A~di~~~~~~~~---~~~~~~G~~~r~~vL~~Ggs 687 (733)
++|+++.+++.++++.+++.++++|+|+++ |+++||+|+||+++|++||+.+++++ +.++++|++||++||++|||
T Consensus 262 ~~~~~l~~~~~g~~~~~~~~~~~~~~H~~~gy~~yyy~Y~~~~v~a~qi~~~~~~~~~~~~~~~~~g~~l~~~il~~G~s 341 (365)
T cd06258 262 KLWRDLVKEILGVRPDPSTPDPAAFPHLAGGSPAYYYGYLLAEMLASQLRATFKKKVGYLTDNPEAGPRLREHILRPGNS 341 (365)
T ss_pred HHHHHHHHHhcCCCCCCCCCCCCccchhccCcccchHHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHccCcCC
Confidence 999999999988766666677789999994 99999999999999999999999875 68899999999889999999
Q ss_pred CCHHHHHHHhcCCCcccccCCCCCCChHHHHHH
Q 037955 688 KEPADMLNDLVGDGILRYCNGGIVPDITSFSDE 720 (733)
Q Consensus 688 ~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~ 720 (733)
++|.|++++|+|+ +||+++|+++
T Consensus 342 ~~~~el~~~~~G~----------~~~~~a~~~~ 364 (365)
T cd06258 342 EPWKELLKRATGE----------DPNADAFLDH 364 (365)
T ss_pred cCHHHHHHHHcCC----------CCChHHHHhc
Confidence 9999999999999 6999999985
No 11
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=100.00 E-value=6.4e-55 Score=500.87 Aligned_cols=505 Identities=18% Similarity=0.211 Sum_probs=374.1
Q ss_pred HHHHHHHHHcC-CChHHHHHHHHHhhHHHhhhhchhhhhhccCCch-HHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHH
Q 037955 74 SSELVNYISEM-PSSVEIIRAMDEISDAVCSVVDSAELCRQTHPDR-EFVEEASKASMRISEYLHYLNTNHTLYDAVKKA 151 (733)
Q Consensus 74 ~~~~i~~i~~~-~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~-e~r~aa~~~~~~l~~~~~~l~~~~~Ly~~l~~~ 151 (733)
.+++++++... ....+++..++++.+.+......+.+..++++++ ++.+........+......+. ..|.+++..
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~s~dt~d~~~~~~~~~~~~~~~p~~~~~~--~~~~~~~~~- 84 (549)
T TIGR02289 8 LQELLEKFISSKLEQENWINEENDVNDDIEEMITSAYIRHSVDTDDEEFYDLEEHFQDEIKPLLKRYN--TKFDQKIIE- 84 (549)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCHHHHHHHHHHHHHhCcHHHHHH--HHHHHHHHc-
Confidence 33344444333 4457899999999999999999999999999966 666555444444433333221 133332211
Q ss_pred hhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccCCCCCHHHHHhhcc
Q 037955 152 ELDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQNIINDPGHVDIFPESRIPKHIHHLLKP 231 (733)
Q Consensus 152 ~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~ni~~~~~~v~~~~l~glp~~~~~~~~~ 231 (733)
..--++..+++...+.+..+...--++++ ...+..+.+.+++.|.+.+..-.
T Consensus 85 ---~~~~~~l~~~~~~~l~~~~~~~~~~~s~~----~~~~~~~~~~~~~~y~~l~~~~~--------------------- 136 (549)
T TIGR02289 85 ---SPFREELDERFYELLFKLIKNKLKLFNEK----NIPLLQKENILSTKYREIISNIS--------------------- 136 (549)
T ss_pred ---CcchhHHHHHHHHHHHHHHHhhhhhcCHH----HHHHHHHHHHHHHHHHHHHhccE---------------------
Confidence 11112334455555555555543344444 66677888999999999654111
Q ss_pred ccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHhccCC----cchHHHHHHHHHHHHHHHH
Q 037955 232 ICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRKMVYIQGHSVP----QANHEVLHELIAARNELAQ 307 (733)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~~~~~----~~n~~~l~~ll~lR~e~A~ 307 (733)
..++|..+|++ .+.++|+ ++|+++||++|.|+.... ..+..++.+++++|+++|+
T Consensus 137 ------------------~~~~Ge~~~l~--~~~~~l~-~~dr~~Rk~a~~a~~~~~~~~~~~~~~il~~lv~~R~~~Ak 195 (549)
T TIGR02289 137 ------------------IIFEGEEKTLS--QLKPYLQ-DPNRSTRKKAWYARSEFFAVVEEELDRIYDELVQIRTQIAL 195 (549)
T ss_pred ------------------EEECCEeccHH--HhhHHhh-CCCHHHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHH
Confidence 12346677774 4888898 599999999999987533 3467899999999999999
Q ss_pred HcCCCCHHHHHh--ccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccC
Q 037955 308 IMGYRSYAEFIV--MPNMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYN 385 (733)
Q Consensus 308 llGy~sya~~~l--~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~ 385 (733)
++||+||+++++ .++|+.+|++|.+|++.|.+.+.|.+++ +.+.+++.+| .+++.+||+.| .
T Consensus 196 l~Gy~s~~e~~~~~~~r~~~~~e~v~~~l~~v~~~~~p~~~~----~~~~k~k~lG--~~~l~~wD~~~----------~ 259 (549)
T TIGR02289 196 NLGFSNYRDYMYKLKNRTDYSAEDCYKYRESILKYVVPLWTK----LRKIKKKRLG--IKTLRPWDESA----------V 259 (549)
T ss_pred HCCCCCHHHHHHHHhcccCCCHHHHHHHHHHHHHHhHHHHHH----HHHHHHHhCC--CCcCChhhccC----------C
Confidence 999999999995 5799999999999999999999999875 6667777778 56899999853 2
Q ss_pred CChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccceeeecccCCCCccCccccc
Q 037955 386 LDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFA 465 (733)
Q Consensus 386 ~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~ 465 (733)
++.....+|||.+.++++++.++.++ |.++.+ +.+ .+++..|+|+++|+||++||||++
T Consensus 260 ~~~~~~~~~~~~e~~~~~~~~~~~~l-~~~~~e-----------------~~~---~~~~~~~iD~~~r~gK~~Gayc~~ 318 (549)
T TIGR02289 260 FLDGNVKPFGNVDFLLEKAEKMYKEL-SLEFYE-----------------FFN---FMREKNLLDLESRKGKAAGGYCTY 318 (549)
T ss_pred CCCCCcCCCCCHHHHHHHHHHHhcCC-CcHHHH-----------------HHH---HHHhcCccccCCCCCCCCCcccCC
Confidence 34445678999999999998877654 322211 111 257888999999999999999988
Q ss_pred ccCceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHH
Q 037955 466 IKGGRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEY 545 (733)
Q Consensus 466 i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~ 545 (733)
+.+ ...|+| +|||+++. +||.||+||||||+|+++++.. +.......++|++|+||++||+
T Consensus 319 ~~~--------~~~P~I--~~Nf~~t~--------~dv~TL~HElGHa~H~~~s~~~-~~~~~~~~~~~~aE~aS~~~E~ 379 (549)
T TIGR02289 319 LPK--------YKAPFI--FSNFNGTS--------GDIDVLTHEAGHAFHVYESRKD-LLPEYRWPTYEAAELASMSMEL 379 (549)
T ss_pred CCC--------CCCcEE--EEeCCCCh--------hHHHHHHHHhhHHHHHHHhcCC-cccccccCcchhhhhhHHHHHH
Confidence 753 246865 79999976 8999999999999999999863 3233334689999999999999
Q ss_pred hhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCC
Q 037955 546 YAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWN 625 (733)
Q Consensus 546 ~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~ 625 (733)
|+|++. ..++.++++.+..+++.|..+..+ +.+|..+..|...+|...+.+..+++++|.++.++|.+..
T Consensus 380 l~~~~l-----~~~~~~~e~~~~~~~~~L~~~~~~-----~~~~~~~d~fe~~vy~~~~lt~~e~~~~~~~l~~~y~~~~ 449 (549)
T TIGR02289 380 LSMPWW-----DLFYTDEKDLKKAKISHLKGALSF-----LPYGVIVDHFQHWVYENPNHTPEERDEKYAELEKRYQPST 449 (549)
T ss_pred HHHHHH-----HHHcCCcchHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHccCCCCCHHHHHHHHHHHHHHhCCCc
Confidence 999873 346777788888888887665543 3466777888888888655677789999999999987421
Q ss_pred C------CCCCcccccccccccccccchhHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcC
Q 037955 626 H------VEGTHWHIRFSHFINYGAGYYSYLYAKCFAATIWQKLCQEDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVG 699 (733)
Q Consensus 626 ~------~~~~~~~~~f~Hl~~Y~a~YYsYl~s~v~A~di~~~~~~~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flG 699 (733)
. .....||.+++|+|++++|||+|.||+++|.++|..+.+++ ..+.+.| .++|+.|||++|.|+++.+ |
T Consensus 450 ~~~~~~~~~~~~~W~~~~H~~~~pFYyy~Y~~a~~~a~~l~~~~~~~~---~~~~~~Y-~~~L~~Ggs~~~~ell~~a-G 524 (549)
T TIGR02289 450 VYIAGLELEIGTFWLRQLHIFEVPFYYIEYTIAQIGALQIYKIYKEDP---EKALKDY-KKLCSAGGSQSFLELYETA-G 524 (549)
T ss_pred ccCCccccccCCcceeeceeecCCCcchhhHHHHHHHHHHHHHHHhCH---HHHHHHH-HHHHhccCCcCHHHHHHHh-C
Confidence 1 12233447999999888999999999999999999987643 3455677 5899999999999999995 9
Q ss_pred CC
Q 037955 700 DG 701 (733)
Q Consensus 700 rd 701 (733)
.|
T Consensus 525 id 526 (549)
T TIGR02289 525 LT 526 (549)
T ss_pred CC
Confidence 87
No 12
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather than activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=100.00 E-value=2.3e-55 Score=510.62 Aligned_cols=530 Identities=18% Similarity=0.197 Sum_probs=401.2
Q ss_pred CcCChHHHHHHHHHHHHHHHHHHHc-----CCChHHHHHHHHHhhHHHhhhhchhhhhhccC-CchHHHHHHHHHHHHHH
Q 037955 59 SPNGFQRFVDDAIERSSELVNYISE-----MPSSVEIIRAMDEISDAVCSVVDSAELCRQTH-PDREFVEEASKASMRIS 132 (733)
Q Consensus 59 ~p~~~~~~~~~~i~~~~~~i~~i~~-----~~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~-~d~e~r~aa~~~~~~l~ 132 (733)
+.+++...++.+-+.+.++ .++.. .+++.+.+..+++++..+.++...+.+...++ .|++.++...++...++
T Consensus 18 ~~~~~~~~~~~l~~~~~~l-~~~~~~~~~~~~~l~~~l~~~e~~~~~~~~l~~Ya~l~~s~d~~d~~~~~~~~~~~~l~s 96 (591)
T TIGR00181 18 NKEEWELFLEALEEDIKEI-KAFKKGLLHSKETFLEALALEEKILILLNRLYNYASMKLSTDVTDPEANAISQKLSNLYT 96 (591)
T ss_pred CHHHHHHHHHHHHHHHHHH-HHHhCcccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccCCCCHHHHHHHHHHHHHHH
Confidence 3455555555544333332 32222 13677899999999999999999999999999 59999999999999999
Q ss_pred HHHHHhh-CCHHHHHHHHHH-hhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhcccCC
Q 037955 133 EYLHYLN-TNHTLYDAVKKA-ELDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQNIINDP 210 (733)
Q Consensus 133 ~~~~~l~-~~~~Ly~~l~~~-~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~ni~~~~ 210 (733)
++...+. .+..|++.-+.. ..--+. +++-+.+...+.+.++..|..|+++..+.+.+++...+..+..|......|.
T Consensus 97 ~~~~~~sf~~~eL~~l~~~~~~~~l~~-~~~L~~y~~~l~~~~r~k~h~Ls~e~E~lLa~~~~~~~a~~~~f~~l~~ad~ 175 (591)
T TIGR00181 97 KVASATSFFEPEILEIEEKIIKEWLKD-PEELADYKRALEEIFRDKPHILSEEVEKLLSALSEVFGSPSDIYSTLTNADM 175 (591)
T ss_pred HHHHHHHHHHHHHHhCCHHHHHHHHhc-ChhhHHHHHHHHHHHHhCCCCCCHHHHHHHHhCchhhccHHHHHHHHHhccC
Confidence 9988876 788888864433 111111 2345677777778899999999999989999999988889999998654332
Q ss_pred CCcccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHhccCCc-
Q 037955 211 GHVDIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRKMVYIQGHSVPQ- 289 (733)
Q Consensus 211 ~~v~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~~~~~~- 289 (733)
. + .. -...+|..++++.+.+.++|++++|+++||++|+++...+.
T Consensus 176 ~---f--------------~~-----------------~~~~~G~~~~l~~~~~~~~l~~~~dr~~Rk~a~~a~~~~~~~ 221 (591)
T TIGR00181 176 D---F--------------GS-----------------IEDYKGKKYPITNSTYENFLQKNKDREIRKKAYESFYKAYRK 221 (591)
T ss_pred C---C--------------Cc-----------------eecCCCcEEeccHHHHHHHHhcCCCHHHHHHHHHHHHHHHHH
Confidence 1 1 00 00114678999999999999999999999999999887543
Q ss_pred ---chHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCC
Q 037955 290 ---ANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVH 366 (733)
Q Consensus 290 ---~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~ 366 (733)
.|..+|.++++.|.+.|+++||+||+++++..+|. |++|..+|.++.....|.+++ +.+++++.+| .++
T Consensus 222 ~~~~~a~~l~~~v~~r~~~A~lrGy~s~~~~~l~~~~i--~~~v~~~l~~~v~~~~~~~~~----y~~~k~k~lg--~~~ 293 (591)
T TIGR00181 222 HKNTFAALYYGNVQKNVFLAKLRNYESYIDASLFSDEV--PREVYDNLYDTIKKNAPVLQR----YYKLRKKVLK--LDK 293 (591)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHhccCCC--CHHHHHHHHHHHHHhhHHHHH----HHHHHHHHhC--CCc
Confidence 46789999999999999999999999999999885 888888888877777787765 6667777778 678
Q ss_pred CCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccc
Q 037955 367 LEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGY 446 (733)
Q Consensus 367 l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~ 446 (733)
+++||+. +++.+ ...+.++++++.+.|++.+ +.||.+ |.+.|+.. ++.
T Consensus 294 l~~~D~~--~~~~~---------~~~~~~~~~~a~~~v~~~~-~~~g~~-----------~~~~~~~~---------~~~ 341 (591)
T TIGR00181 294 MEPYDLY--LPLVK---------EKNPKFSIEEAKELILKSL-EPLGEE-----------YIKILKRA---------FNE 341 (591)
T ss_pred ccHHHcc--CCCCC---------CcCCcccHHHHHHHHHHHH-hccCHH-----------HHHHHHHH---------hhC
Confidence 9999984 33221 1235788999888777755 356643 43333331 233
Q ss_pred eeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhcCCCCcc
Q 037955 447 LYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSRTDYQHF 526 (733)
Q Consensus 447 ~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~ 526 (733)
.|||+++|+||++||||+++.+ ..|+| +|||+++. ++|.||||||||++|+++++.. ++.
T Consensus 342 ~wiD~~~r~gK~~Ga~~~~~~~---------~~p~i--l~N~~~~~--------~dv~TLaHElGHa~H~~~~~~~-~~~ 401 (591)
T TIGR00181 342 RWVDYAENKGKRSGAYSIGGYK---------VKPYI--LMNWDGTL--------NSVFTLAHELGHSMHSYFSSKH-QPY 401 (591)
T ss_pred CCeeecCCCCCCCCcccCCCCC---------CCCeE--EEecCCCc--------chHHHHHHHhhhHHHHHHHccC-CCC
Confidence 5999999999999999977521 35765 88999876 8999999999999999999764 445
Q ss_pred CCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHHHHHHHhHHHhhcC----
Q 037955 527 SGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQIFYALVDQTLFGE---- 602 (733)
Q Consensus 527 sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~---- 602 (733)
.+.++++++||+||++||+|++++.. .|+ ++++.+..++.+++. +++. .+++|+.++.||+.+|..
T Consensus 402 ~~~~~~~~~aE~aS~~~E~l~~~~l~-----~~~-~~~~~k~~~l~~~l~--~~~~--~~~~q~~~~~Fe~~~~~~~~~~ 471 (591)
T TIGR00181 402 PNSDYSIFYAEIASTFNELLLADYLL-----KNS-NDPEMKIYILLERIS--NFFG--TFTRQTLFAEFEYEAYELIEEG 471 (591)
T ss_pred ccCCCCchhhhHhHHHHHHHHHHHHH-----hhC-CCHHHHHHHHHHHHH--HHHH--HHHHHHHHHHHHHHHHHHHHcC
Confidence 56689999999999999999999852 455 455566667766655 3333 479999999999999974
Q ss_pred CCCCCCCHHHHHHHHHhhccCCCC--CC-CCcccccccccccccccchhHHHHH--HHHHHHHHHHHHcCCCCHHHHHHH
Q 037955 603 RLGQTRDTSSIVADMKRQHTSWNH--VE-GTHWHIRFSHFINYGAGYYSYLYAK--CFAATIWQKLCQEDPLSLTTGTTL 677 (733)
Q Consensus 603 ~~~~~~~~~~l~~~l~~~~~~~~~--~~-~~~~~~~f~Hl~~Y~a~YYsYl~s~--v~A~di~~~~~~~~~~~~~~G~~~ 677 (733)
.+.+..+++++|.+++++|.+... .+ ...+|.+++|+| .+ ||+|.||. ++|..+|.++.++++. ++++|
T Consensus 472 ~~lt~~~l~~~~~~~~~~~~g~~~~~~~~~~~~w~~~~Hfy--~~-fY~Y~Ya~g~~~a~~l~~~~~~~~~~---~~~~Y 545 (591)
T TIGR00181 472 EPLTAETLNEIYANLLKKYFGDLVKIDEGAGLTWMRIPHFY--MG-FYVYKYATGQVAATALYEKIKEEGKG---AVEKY 545 (591)
T ss_pred CCCCHHHHHHHHHHHHHHHcCCccccCCcccceeeecCccc--CC-CcCHHHHHHHHHHHHHHHHHHHCCcc---HHHHH
Confidence 355667899999999999866321 11 234557999985 44 77777765 5566669999887653 48999
Q ss_pred HHHHhcCCCCCCHHHHHHHhcCCCc
Q 037955 678 RTKILQHGGAKEPADMLNDLVGDGI 702 (733)
Q Consensus 678 r~~vL~~Ggs~~~~ell~~flGrd~ 702 (733)
+ .+|+.|||++|.|+++.+ |.|+
T Consensus 546 ~-~~L~~Ggs~~p~ell~~~-Gvd~ 568 (591)
T TIGR00181 546 L-KFLKSGGSKYPLETLKIA-GVDL 568 (591)
T ss_pred H-HHHhCcCCCCHHHHHHHc-CcCC
Confidence 8 999999999999999995 9973
No 13
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the Bacillus amyloliquefaciens PepF oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when overexpressed from a multicopy plasmid.
Probab=100.00 E-value=1.6e-51 Score=461.89 Aligned_cols=383 Identities=24% Similarity=0.329 Sum_probs=299.5
Q ss_pred CCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHhccCCc----chHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCC
Q 037955 251 KEKGFRITTDSRILQSILQWTSDDEVRKMVYIQGHSVPQ----ANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASS 326 (733)
Q Consensus 251 ~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~~~~~~----~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~s 326 (733)
..+|..++++. .+.++++ ++|+++||++|.++.+... .+.++|.+++++|+++|+++||+||+++++..+| +
T Consensus 27 ~~~G~~~~l~~-~~~~~~~-~~d~~~Rk~a~~a~~~~~~~~~~~~~~~l~~lv~~r~~~A~~~Gy~~~~~~~l~~~~--~ 102 (427)
T cd06459 27 EFDGEELTLSQ-NLSNLLE-SPDREVRKKAFEALYKAYEKYENTLAAILNTLVKLRLTLAKLRGYDSYLEAALFNNN--I 102 (427)
T ss_pred cCCCeEEeccH-HHHHHHh-CcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHhhccCC--C
Confidence 34678999988 7888777 6899999999999876543 4668999999999999999999999999999998 8
Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHH
Q 037955 327 PEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKM 406 (733)
Q Consensus 327 pe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~ 406 (733)
|++|.+||.++.+.+.|..++.. +.+++.+| .+++.|||+.|+....+ .+.||++++++++.+
T Consensus 103 ~~~~~~~l~~v~~~~~p~~~~~~----~~~~~~lg--~~~l~~wD~~~~~~~~~-----------~~~~~~~~~~~~v~~ 165 (427)
T cd06459 103 PEDVYDFLIAVVKENVPLLHRYL----KLKKKLLG--LDKLRPYDLYAPLVSGN-----------PPKYTYEEAKELVLE 165 (427)
T ss_pred CHHHHHHHHHHHHHhcHHHHHHH----HHHHHHhC--CCcCcHhhcCCCCCCCC-----------CCcCcHHHHHHHHHH
Confidence 99999999999999999888844 44556667 57899999976533221 258999999998888
Q ss_pred HHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEc
Q 037955 407 LAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALIC 486 (733)
Q Consensus 407 l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~ 486 (733)
+++.+ | |++.. +. +.+++..|+|+++|+||++||||+++.++ ..|++ +|
T Consensus 166 ~~~~l-~---------------~~~~~--~~---~~~~~~~~iD~~~r~gK~~gaf~~~~~~~--------~~p~i--~~ 214 (427)
T cd06459 166 ALSPL-G---------------PEYAE--FA---KRAFEERWIDVEPRKGKRSGAYCTGLPPG--------KHPFI--LM 214 (427)
T ss_pred HHHcc-C---------------HHHHH--HH---HHHhhCCCeeccCCCCCCCCeecCCCCCC--------CCCeE--Ee
Confidence 77543 3 22211 11 12467789999999999999999876421 34654 89
Q ss_pred cCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCC
Q 037955 487 NFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIV 566 (733)
Q Consensus 487 Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~l 566 (733)
||+++. ++|.|||||||||+|+++++.. +...++++++|++|+||++||+|++++. ..|+.+.+ .
T Consensus 215 n~~~~~--------~~v~tl~HE~GHa~h~~~~~~~-~~~~~~~~~~~~~E~~S~~~E~~~~~~l-----~~~~~~~~-~ 279 (427)
T cd06459 215 NFNGTL--------DDVFTLAHELGHAFHSYLSRDN-QPYLYSDYPIFLAEIASTFNELLLFDYL-----LKFAKDPE-E 279 (427)
T ss_pred cCCCCh--------hhHHHHHHHhhHHHHHHHHccC-CCcccCCCCchhhHHHHHHHHHHHHHHH-----HHhCCCHH-H
Confidence 999865 8999999999999999999875 5546678999999999999999999876 24664421 2
Q ss_pred hHHHHHHHHHhhhhhhHHHHHHHHHHHHhHHHhhcCC----CCCCCCHHHHHHHHHhhccCCCC---CCCCccccccccc
Q 037955 567 PEKLVKSMQGARDMFAATELQRQIFYALVDQTLFGER----LGQTRDTSSIVADMKRQHTSWNH---VEGTHWHIRFSHF 639 (733)
Q Consensus 567 p~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~~----~~~~~~~~~l~~~l~~~~~~~~~---~~~~~~~~~f~Hl 639 (733)
+..++..+ ...+.+ .+++|+.++.||+++|... +.+..++.++|+++.++|.+... .....+|.+|+|+
T Consensus 280 ~~~~~~~~---~~~~~~-~~~~~~~~~~fe~~l~~~~~~~~~~~~~~~~~~~~~~~~~y~g~~~~~~~~~~~~w~~~~H~ 355 (427)
T cd06459 280 KLYLLEHL---LEDIRA-TLPRQTMFAEFEHEVYENPEEGEPLTAEELNEIYRELEKKYGGDLVEIDEEHGYEWARIPHF 355 (427)
T ss_pred HHHHHHHH---HHHHHH-HHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHhcCCccccCCccCeeeeecCee
Confidence 22233333 232322 5789999999999999742 23456789999999999988543 2233333699999
Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHH
Q 037955 640 INYGAGYYSYLYAKCFAATIWQKLCQEDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSD 719 (733)
Q Consensus 640 ~~Y~a~YYsYl~s~v~A~di~~~~~~~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~ 719 (733)
+.+++|||+|+||.++|.++|..+.+++ +.++.+|+ ++|+.||+++|.|+++.+ |.| +....+++
T Consensus 356 ~~~~fYyy~Y~~g~~~a~~l~~~~~~~~---~~~~~~y~-~~L~~Ggs~~~~ell~~~-g~d----------~~~~~~~~ 420 (427)
T cd06459 356 YYVPFYVYPYAFGQLAALALYAKYKEDG---EGAVEKYL-ELLKAGGSKSPLELLKKA-GVD----------LTSPDFWE 420 (427)
T ss_pred cCCCCcChHHHHHHHHHHHHHHHHHHcC---ccHHHHHH-HHHHccCCCCHHHHHHHc-CcC----------CCChHHHH
Confidence 9777899999999999999999998864 35678996 999999999999999985 986 55666655
No 14
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=100.00 E-value=2.6e-49 Score=458.62 Aligned_cols=506 Identities=18% Similarity=0.182 Sum_probs=349.2
Q ss_pred hHHHHHHHHHhhHHHhhhhchhhhhhccCC-chHHHHHHHHHHHHHHHHHHHhh-CCHHHHH----HHHHHhhcCCCCCH
Q 037955 87 SVEIIRAMDEISDAVCSVVDSAELCRQTHP-DREFVEEASKASMRISEYLHYLN-TNHTLYD----AVKKAELDGHLLSK 160 (733)
Q Consensus 87 ~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~-d~e~r~aa~~~~~~l~~~~~~l~-~~~~Ly~----~l~~~~~~~~~L~~ 160 (733)
+.+.+..++++...+.++...+.+..++++ |+++++....+.....++...+. -...|.+ .++.+. + ++
T Consensus 49 l~~~l~~~e~~~~~~~~l~~Ya~l~~~~Dt~d~~~~~~~~~~~~l~~~~~~~~sf~~~el~~l~~~~l~~~~-~----~~ 123 (587)
T TIGR02290 49 LQNALKRYEALGELLSKVGAYASLLYSADTSDPEAKALLGELEALSSEIQQALLFFTLELMRLSDEVWAALL-S----DP 123 (587)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHH-h----Cc
Confidence 456777778888888888888988888888 77776666555555544443321 1112221 011111 1 11
Q ss_pred HHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHH-HHHHHHHHHHhhcccCCCCcccccCCCCCHHHHHhhccccccCCCC
Q 037955 161 EAHRAANHLRIDFEKGGIHLCADKLDRVNQLNM-DIFQLCREFNQNIINDPGHVDIFPESRIPKHIHHLLKPICRLTSGP 239 (733)
Q Consensus 161 e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~-~i~~L~~~F~~ni~~~~~~v~~~~l~glp~~~~~~~~~~~~~~~~~ 239 (733)
+-+++-..+.+-++....-|+++.-+-+.++.. -....+..|.+... +- .+ +
T Consensus 124 ~L~~y~~~l~~~~~~~~h~Ls~e~E~ll~~~~~~~~~~~~~~y~~l~~-~~---~~-~---------------------- 176 (587)
T TIGR02290 124 DLAPYRFYLKELRKEAPHTLSEEEEKLINALSLTGRAAWSRLYDELTG-TL---RI-P---------------------- 176 (587)
T ss_pred hHHHHHHHHHHHHHcCCcCCCHHHHHHHHhcccchhhhHHHHHHHHHc-cC---cc-e----------------------
Confidence 222222232233444455566655333434431 22455566665332 21 11 0
Q ss_pred CcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHhccCCcchH----HHHHHHHHHHHHHHHHcCCCCHH
Q 037955 240 SRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRKMVYIQGHSVPQANH----EVLHELIAARNELAQIMGYRSYA 315 (733)
Q Consensus 240 ~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~~~~~~~n~----~~l~~ll~lR~e~A~llGy~sya 315 (733)
..+ .+|....++.+.+..+|. ++|+++||++|+++......+. .+|..+++.|...|+++||+||.
T Consensus 177 -------~~~--~~G~~~~~t~~~~~~~l~-~~dr~~Rk~a~~~~~~~~~~~~~~~a~~l~~~v~~~~~~a~~rgy~s~~ 246 (587)
T TIGR02290 177 -------VDG--KDGEEETLSLEQALNLLR-DPDAEVRKKAFKALLKAWEKNAPTLAAILNALAGDRLTEYRLRGYDHPL 246 (587)
T ss_pred -------eec--CCCceEEcCHHHHHHHHh-CcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCHH
Confidence 011 245666777788888775 7899999999999988766554 46688888999999999999999
Q ss_pred HHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCC
Q 037955 316 EFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYF 395 (733)
Q Consensus 316 ~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yf 395 (733)
++++..+.. +++.+.+|++.+.+. .|..++ +.+.+++.+| .+++++||+.+. . . ....+.+
T Consensus 247 e~~l~~~~i-~~~~~~~l~~~v~~~-~p~~~~----y~~~k~~~lg--l~~l~~~D~~~p--~--------~-~~~~~~~ 307 (587)
T TIGR02290 247 EPRLLYNRI-DQETLDAMLEAIKEN-YPLFRR----YYKLKAKLLG--KEKLDFYDLYAP--L--------G-DSSAKTY 307 (587)
T ss_pred HHhhccCCC-CHHHHHHHHHHHHHh-hHHHHH----HHHHHHHHhC--CCcccHHHcCCC--C--------C-CCCCCee
Confidence 999988544 699999999999877 687665 4556667778 678999998421 1 1 1124577
Q ss_pred CHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCCC
Q 037955 396 PLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSET 475 (733)
Q Consensus 396 pl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g 475 (733)
+++++.+.+.+.+.. |+|++..+.. .++...|+|+++|+||++||||.++.++
T Consensus 308 ~~e~a~~~v~~~~~~----------------l~~e~~~~~~-----~~~~~~~iD~~~r~gK~~Ga~~~~~~~~------ 360 (587)
T TIGR02290 308 TFDEAKELVLEAFGK----------------FSPEMADFAE-----KAFEEGWIDAEPRPGKRGGAFCTGFPPS------ 360 (587)
T ss_pred cHHHHHHHHHHHHHh----------------cCHHHHHHHH-----HHHHcCCcccCCCCCCCCCcccCCCCCC------
Confidence 999888877765532 2333222211 1345569999999999999999876432
Q ss_pred CccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHH
Q 037955 476 EYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRR 555 (733)
Q Consensus 476 ~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~ 555 (733)
..|+ |+|||+++. ++|.|||||||||+|++++++ +...++++++|++|+||++||+|+|++. |+.
T Consensus 361 --~~p~--i~~N~~~~~--------~~v~TL~HE~GHa~H~~ls~~--~~~~~~~~~~~~aE~~S~~~E~l~~~~l-l~~ 425 (587)
T TIGR02290 361 --KEPR--VLMNYDGSR--------RDVSTLAHELGHAYHSELAKD--QPLLNARYPMTLAETASIFAEMLLFDAL-LKE 425 (587)
T ss_pred --CCCE--EEEecCCCc--------hhHHHHHHHhhHHHHHHHHcc--CCcccCCCCchhhhHHHHHHHHHHHHHH-Hhh
Confidence 2475 589999976 899999999999999999977 4466778999999999999999999985 443
Q ss_pred HHhcccCCCCChHHHHHHHHHhhhhhhHH-HHHHHHHHHHhHHHhhcC---CCCCCCCHHHHHHHHHhhccCCC--CCC-
Q 037955 556 FAKHYLTGEIVPEKLVKSMQGARDMFAAT-ELQRQIFYALVDQTLFGE---RLGQTRDTSSIVADMKRQHTSWN--HVE- 628 (733)
Q Consensus 556 ~s~h~~tge~lp~~l~~~l~~~~~~~~~~-~~~~ql~~a~fD~~lh~~---~~~~~~~~~~l~~~l~~~~~~~~--~~~- 628 (733)
+ . +++....+... ....++ .+++|+.++.||+.+|.. ...+..++.++|.++.+++.+.. ..+
T Consensus 426 ~----~-----~~~~~~~~~~~-~l~~~~~~~~~q~~~~~fE~~l~~~~~~~~lt~~~l~~~~~~~~~~~~g~~~~~~~~ 495 (587)
T TIGR02290 426 A----K-----TDEEKLSLLAE-KLEDAIATLVRIHARFLFERRFHEARKEGELSADDICDLMLEAQKESYGDALDESEL 495 (587)
T ss_pred C----C-----CHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHhCcccccCCC
Confidence 2 1 12223333332 333344 468999999999999963 22455678999999999876532 221
Q ss_pred CCcccccccccccccccchhHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCC
Q 037955 629 GTHWHIRFSHFINYGAGYYSYLYAKCFAATIWQKLCQEDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNG 708 (733)
Q Consensus 629 ~~~~~~~f~Hl~~Y~a~YYsYl~s~v~A~di~~~~~~~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~ 708 (733)
...+|++++|++..+.|||+|+||.++|.++|..+.+++.. +-.+|+ .||+.||+++|.|++++ +|+|
T Consensus 496 ~~~~w~~~~H~y~~~fY~y~Y~~g~~~a~~l~~~~~~~~~~---~~~~y~-~~L~~Ggs~~p~ell~~-~G~d------- 563 (587)
T TIGR02290 496 HPYMWAYKPHFYHAPFYNYPYTFGYLFVLGLYAKYREEGES---FVPKYI-ALLRDTGSMTPEELVKK-FGFD------- 563 (587)
T ss_pred CCceeecCCeecCCCCcChhhHHHHHHHHHHHHHHHHCChh---HHHHHH-HHHHCcCCCCHHHHHHH-hCcC-------
Confidence 34556899999866677799999999999999999876531 235675 89999999999999999 7995
Q ss_pred CCCCChHHHHHH
Q 037955 709 GIVPDITSFSDE 720 (733)
Q Consensus 709 g~~p~~~a~l~~ 720 (733)
|+..+|++.
T Consensus 564 ---~~~~~~~~~ 572 (587)
T TIGR02290 564 ---LTSPDFWQK 572 (587)
T ss_pred ---CCChHHHHH
Confidence 888888753
No 15
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=100.00 E-value=7.2e-41 Score=383.15 Aligned_cols=506 Identities=20% Similarity=0.259 Sum_probs=359.7
Q ss_pred ChHHHHHHHHHhhHHHhhhhchhhhhhccCC-chHHHHHHHHHHHHHHHHHHHh-hCCHHHHHHHHHH----hhcCCCCC
Q 037955 86 SSVEIIRAMDEISDAVCSVVDSAELCRQTHP-DREFVEEASKASMRISEYLHYL-NTNHTLYDAVKKA----ELDGHLLS 159 (733)
Q Consensus 86 ~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~-d~e~r~aa~~~~~~l~~~~~~l-~~~~~Ly~~l~~~----~~~~~~L~ 159 (733)
.+.+.+...+++...+.++...+.+...+.. |+++......+.....++...+ +-+..+.+.-.++ ....+.|
T Consensus 51 ~l~~~l~~~e~~~~~l~~~~~ya~~~~s~d~~d~~~~~~~~~~~~~~~~~~~~~~~~~~el~~~~e~~~~~~l~~~~~l- 129 (598)
T COG1164 51 DLLEALEEYEKLEELLGRLYAYASMKLSTDTTDEEAQKLYQKLEEFYADFSSALSFFEPELLELDEEVIQSLLTSGPEL- 129 (598)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHhcCcch-
Confidence 4566777888888999999999999888887 6666666666666666655442 2344444433332 1111112
Q ss_pred HHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccCCCCCHHHHHhhccccccCCCC
Q 037955 160 KEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQNIINDPGHVDIFPESRIPKHIHHLLKPICRLTSGP 239 (733)
Q Consensus 160 ~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~ni~~~~~~v~~~~l~glp~~~~~~~~~~~~~~~~~ 239 (733)
..-+..++.+ +++.-..|++...+-+..+......-...|......+ +.+
T Consensus 130 ~~y~~~l~~~---~~~k~h~ls~~~Ekll~~~~~~~~~~~~~~~~~~~~d---l~~------------------------ 179 (598)
T COG1164 130 ADYRFYLEEL---FRRKPHQLSAEEEKLLAQLSPVGNSASNIFFDLTNSD---LTF------------------------ 179 (598)
T ss_pred hhhHHHHHHH---HhhCcccCCHHHHHHHHhhhhhhcchhHHhhhhhccc---ccc------------------------
Confidence 1223344444 3344556677665666666554544444444432211 111
Q ss_pred CcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHhccCCcchH----HHHHHHHHHHHHHHHHcCCCCHH
Q 037955 240 SRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRKMVYIQGHSVPQANH----EVLHELIAARNELAQIMGYRSYA 315 (733)
Q Consensus 240 ~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~~~~~~~n~----~~l~~ll~lR~e~A~llGy~sya 315 (733)
......+|.++|+. .+..+|+ ++||++||++|+++.....++. .++..+++.+...|+.+||+|+.
T Consensus 180 -------~~~~~~~g~~~~~~--~~~~ll~-~~Dr~~Rk~A~~a~~~~~~~~~~t~a~~l~~~~~~~~~~a~~r~y~~~~ 249 (598)
T COG1164 180 -------PDIGDAKGKKLTLE--QLLNLLE-DPDREVRKAAYEALYKAYEKHRNTLAALLNTLVKVLAFLARARNYDDVI 249 (598)
T ss_pred -------ccccccccceecHH--HHHHHhh-CCCHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhcCcchHH
Confidence 01112346778876 4556665 9999999999999988766554 45666666788899999999999
Q ss_pred HHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCC
Q 037955 316 EFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYF 395 (733)
Q Consensus 316 ~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yf 395 (733)
+..+.++-. +.+.|..+.+.+.+...|...+ +.+++++.+| .++++|||+. .+ +......+-+
T Consensus 250 ~~~l~~~~i-~~~~~d~l~~~v~~~~~~l~~r----y~~~r~k~Lg--l~~l~~yD~~--~p--------~~~~~~~~~~ 312 (598)
T COG1164 250 DSALLRNEV-DREVVDNLIESVKEAFLPLLHR----YYKLRAKVLG--LEKLRPYDLY--AP--------LLDKDPSPEY 312 (598)
T ss_pred HHHHHhCCC-CHHHHHHHHHHHHHhcchHHHH----HHHHHHHHhC--cccCCHHhhc--CC--------cccCCCCccc
Confidence 999988432 6888888888888788888877 6667788888 6899999983 22 1122446788
Q ss_pred CHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCCC
Q 037955 396 PLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSET 475 (733)
Q Consensus 396 pl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g 475 (733)
+.+++++-+.+.++.| |+.+...++.. +...|+|+++|+||++||||.+.-.
T Consensus 313 s~~ea~~~v~~~l~~l------------g~ey~~~~~~a---------~~~~WiD~~~~~gKrsGaYs~~~~~------- 364 (598)
T COG1164 313 SYEEAKELVLKALAPL------------GPEYAKIARRA---------FDERWIDVYPRKGKRSGAYSIGFYK------- 364 (598)
T ss_pred cHHHHHHHHHHHHHhh------------CHHHHHHHHHH---------HhcCCeeccCCCCCCCCcccCCCCC-------
Confidence 9999988877766544 34455544443 3447999999999999999976522
Q ss_pred CccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHH
Q 037955 476 EYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRR 555 (733)
Q Consensus 476 ~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~ 555 (733)
...|+| ++||.+.. +||.||+||+||++|+.+++.. +++.....++.+||+||+++|.+++++.+-.
T Consensus 365 -~~~p~I--lmN~~gt~--------~dV~TLaHElGHs~Hs~~s~~~-qp~~~~~~~i~~AEiAS~fnE~l~~~~ll~~- 431 (598)
T COG1164 365 -GDHPFI--LMNYDGTL--------RDVFTLAHELGHSVHSYFSRKH-QPYLYADYSIFLAEIASTFNEMLLFDYLLER- 431 (598)
T ss_pred -CCCCeE--EEeCCCch--------hHHHHHHHHccHHHHHHHHhcc-CCccccCCchHHHHHHHHHHHHHHHHHHHHH-
Confidence 146876 78999977 8999999999999999999875 6656667889999999999999999988632
Q ss_pred HHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHHHHHHHhHHHhhcC----CCCCCCCHHHHHHHHHhhccCCC--CCCC
Q 037955 556 FAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQIFYALVDQTLFGE----RLGQTRDTSSIVADMKRQHTSWN--HVEG 629 (733)
Q Consensus 556 ~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~----~~~~~~~~~~l~~~l~~~~~~~~--~~~~ 629 (733)
+ +. +++.+.-+......+.++ +.||+.++.|++.+|.. .+++...++++|.++.++|.+.. ..++
T Consensus 432 ----~--~~--~~~~~~il~~~l~~~~~t-~~rq~~f~~FE~~~h~~~~~~~~lt~~~l~~~~~~l~~~y~gd~~~~d~~ 502 (598)
T COG1164 432 ----F--KD--PEERLAILEEKLEGFFAT-LFRQTLFAEFEHRVHELIEEGEELTAEELNELYLELQKEYYGDAVKLDEL 502 (598)
T ss_pred ----c--CC--HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHhcCCccccCcc
Confidence 2 22 244444444444444444 88999999999999986 45677889999999999999853 2222
Q ss_pred -CcccccccccccccccchhHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcc
Q 037955 630 -THWHIRFSHFINYGAGYYSYLYAKCFAATIWQKLCQEDPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGIL 703 (733)
Q Consensus 630 -~~~~~~f~Hl~~Y~a~YYsYl~s~v~A~di~~~~~~~~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~ 703 (733)
..+|++++||+.-+++||.|..|.++|..+|..+..+++ +.-+.+..++|+.|||++|.|+++ ..|.|+.
T Consensus 503 ~~~~W~~ipHfy~~pFYvy~Ya~G~~~a~~l~~~~~~~~~---~~~~~~y~~~Lk~GgS~~p~ell~-~~GiD~~ 573 (598)
T COG1164 503 SGLEWARIPHFYHSPFYVYQYATGQLAALALYAKILTNDA---EAFEKYYIAFLKSGGSKSPLELLK-IAGIDLT 573 (598)
T ss_pred cccchhhcCeeeecCceehHhHHHHHHHHHHHHHHHhccH---HHHHHHHHHHHhCCCCCCHHHHHH-HcCCCCC
Confidence 344579999995588889999999999999999998763 223445468999999999999999 5699843
No 16
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc-dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor octapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=99.95 E-value=7.7e-26 Score=251.67 Aligned_cols=380 Identities=14% Similarity=0.093 Sum_probs=245.9
Q ss_pred CCcHHHHhhhCCCHHHHHHHHHHhcc-CCcchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHH
Q 037955 261 SRILQSILQWTSDDEVRKMVYIQGHS-VPQANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSK 339 (733)
Q Consensus 261 ~~~~~~vL~~~~d~~~Rk~~~~a~~~-~~~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~ 339 (733)
.+.+.++|+.++|+++|+.+|+++.. ....-..++.+++++|++.|+.+||+||.+|. |...+++++.+|++.|++
T Consensus 33 ~~~l~~i~~~s~D~~~r~~aW~~w~~~~g~~~~~~y~~lV~lrn~~A~~~Gf~n~~~~w---R~~Ye~~d~~~~~e~i~~ 109 (477)
T cd06461 33 EPDLESIMAKSRDYDELLYAWEGWRDAVGPPMRPLYERYVELANEAAKLNGFVDAGEYW---RSEYETPDFEQDVERLWE 109 (477)
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHcCCCCHHHHh---HhcCCHHHHHHHHHHHHH
Confidence 47799999889999999999999864 34566789999999999999999999999999 555678999999999999
Q ss_pred hhhHHHHHHHHHHHHHHHHhcCC-------CCCCCCcCCHHHHHHHhhhh---------------ccCCChhhccCCCCH
Q 037955 340 MIKPKADEEFEAIKNFKRKSCGQ-------KYVHLEPWDEAYYTAMMKSS---------------AYNLDACVVASYFPL 397 (733)
Q Consensus 340 ~~~p~~~~e~~~L~~~~~~~~g~-------~~~~l~pWD~~y~~~~~~~~---------------~~~~d~~~~~~yfpl 397 (733)
.++|+.++ |+.+.++.++. +.+.+-||++ ..++..+. ..+++++-+..-++.
T Consensus 110 ~v~Ply~~----Lh~~~r~~L~~~y~~~~~d~~~~iPahl--~gn~~~q~W~~~~~~~~p~p~~~~~dv~~~l~~~~~t~ 183 (477)
T cd06461 110 ELKPLYLQ----LHAYVRRKLRKKYGDDVVNRDGPIPAHL--LGNMWAQSWSNIYDLVKPYPGKPLLDVTDAMVEQGYTA 183 (477)
T ss_pred HHHHHHHH----HHHHHHHHHHHhcccccCCCCCCCcccc--ccCccccccccccccccCCCCCCCCChHHHHHHcCCCH
Confidence 99999766 66665554432 1133445655 22222221 011221112211111
Q ss_pred HHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccceeeec---------ccCCCCccCcccccccC
Q 037955 398 GQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDL---------YSRAGKYTGCANFAIKG 468 (733)
Q Consensus 398 ~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl---------~~R~gK~~ga~~~~i~~ 468 (733)
..|+..+++.|. .+| ..|+ ..|+||. |++|..-.-
T Consensus 184 ----~~mf~~~~~ff~-----------------------------smg--L~dl~~~fw~~s~~~rp~k~-~~~C~~saw 227 (477)
T cd06461 184 ----KRMFKEAEEFFT-----------------------------SLG--LPPMPPSFWTKSMLEKPTDR-EVVCHASAW 227 (477)
T ss_pred ----HHHHHHHHHHHH-----------------------------HCC--CCcCChHHHhhccccCCCCC-CCccCcccc
Confidence 233333333332 111 3444 7899999 778766111
Q ss_pred ceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhh-hcCCCCccCCCcccc--hhhhhhHHHH--
Q 037955 469 GRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLL-SRTDYQHFSGTRVAL--DFAETPSNLF-- 543 (733)
Q Consensus 469 ~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~ll-s~~~~~~~sgt~~~~--D~~E~pS~~~-- 543 (733)
. . .....|.| .+|+..+. +|+.|+.||+||+...+. .+..+-...|.+.+. .++|+.|+.|
T Consensus 228 d--~--~~~~d~rI--~~c~~~t~--------~D~~t~~HE~GH~~yy~~y~~~p~~~r~~anp~fheav~e~~smS~~t 293 (477)
T cd06461 228 D--F--YNGKDFRI--KMCTKVNM--------EDFVTVHHEMGHIQYYLQYKDQPVLFREGANPGFHEAVGDAIALSVST 293 (477)
T ss_pred c--C--CCCCCcce--eeCCCCCH--------HHHHHHHHHHHHHHHHHHhccCCHHHhCCCCCChHHHHHHHHHHhcCC
Confidence 0 0 11234665 78988776 899999999999996544 333333345667777 9999999999
Q ss_pred -HHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHhhhhhhHHHHHHHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhcc
Q 037955 544 -EYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGARDMFAATELQRQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHT 622 (733)
Q Consensus 544 -E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~ 622 (733)
|++..-+. +..+ .+.....+ ..+...+.....+..-..++.+ |-..++.. ..+..+.++.|++|.++|.
T Consensus 294 pe~L~~~~l-l~~~--~~~~~~~i-~~l~~~al~~i~fLP~~~~vD~-----fr~~vy~~-~~~~~e~n~~WweL~~~yq 363 (477)
T cd06461 294 PKHLHKIGL-LDSE--VDDEEADI-NFLLKMALDKIAFLPFGYLLDK-----WRWDVFDG-EIPKDDYNKAWWELREKYQ 363 (477)
T ss_pred HHHHhhccc-cccc--CCCcHHHH-HHHHHHHHHHHHHhhHHHHHHH-----HHHHHhcC-CCCHHHHHHHHHHHHHHhc
Confidence 88874332 2221 01111111 2333333333333221111111 12223444 3455789999999999999
Q ss_pred CCCCCCC----CcccccccccccccccchhHHHHHHHHHHHHHHHHHc----CCC-------CHHHHHHHHHHHhcCCCC
Q 037955 623 SWNHVEG----THWHIRFSHFINYGAGYYSYLYAKCFAATIWQKLCQE----DPL-------SLTTGTTLRTKILQHGGA 687 (733)
Q Consensus 623 ~~~~~~~----~~~~~~f~Hl~~Y~a~YYsYl~s~v~A~di~~~~~~~----~~~-------~~~~G~~~r~~vL~~Ggs 687 (733)
++.++.. ......+-|+.. ...|..|..|++++.++|..++.. +|+ ++++|+.|+ ++|+.|||
T Consensus 364 gv~pP~~r~~~~fd~gak~h~~~-~~PYi~YflA~ilqfQf~~~lc~~a~~~~plh~C~~~~s~~Ag~~l~-~~l~lG~S 441 (477)
T cd06461 364 GVVPPVPRSEEDFDPGAKYHVPA-NTPYIRYFLSFILQFQFHKALCKAAGHTGPLHKCDIYGSKEAGKKLR-AMLSLGSS 441 (477)
T ss_pred CCCCCCCCCccccCccceeeccC-CCcHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccChHHHHHHHH-HHHhCcCC
Confidence 8633321 112345556654 344999999999999999997642 444 678999997 89999999
Q ss_pred CCHHHHHHHhcCCCcccccCCCCCCChHHHHHH
Q 037955 688 KEPADMLNDLVGDGILRYCNGGIVPDITSFSDE 720 (733)
Q Consensus 688 ~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~ 720 (733)
++..|+++.+.|.. +.+..++++.
T Consensus 442 ~~w~e~le~~~g~~---------~~s~~~l~ey 465 (477)
T cd06461 442 KPWPEALEALTGER---------EMDASALLEY 465 (477)
T ss_pred CCHHHHHHHhcCCC---------CCChHHHHHH
Confidence 99999999998874 5666667764
No 17
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=99.93 E-value=3.6e-23 Score=224.30 Aligned_cols=339 Identities=14% Similarity=0.156 Sum_probs=229.7
Q ss_pred chHHHHHHHHHHHHHHHHHcCC-CCHHHHHhcc-CcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCCCCCCC
Q 037955 290 ANHEVLHELIAARNELAQIMGY-RSYAEFIVMP-NMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQKYVHL 367 (733)
Q Consensus 290 ~n~~~l~~ll~lR~e~A~llGy-~sya~~~l~~-~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~~~~~l 367 (733)
.-.+.|++++.+..++|..+|+ .+..|..|.. ..--+.+.+..++.+|.+.+.|...+ +.. + +...+
T Consensus 28 ~f~p~L~~~v~l~re~a~~~~~~~~~YdaLLd~ye~g~~~~~id~~F~~lk~~l~~l~~~----v~~---~--~~~~~-- 96 (396)
T cd06460 28 LFAPYLEKIVELSREIAEYLGYEKHPYDALLDLYEPGMTTAKLDAIFAELKAGLVPLLKK----VLE---K--QKPPD-- 96 (396)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHhhCCCCCHHHHHHHHHHHHHHHHHHHHH----HHh---c--CCCCC--
Confidence 3468999999999999999999 6777888777 22228999999999999888887755 211 0 10000
Q ss_pred CcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCCcccce
Q 037955 368 EPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYL 447 (733)
Q Consensus 368 ~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ 447 (733)
...+...||.+....-..++. +.+|.. +...
T Consensus 97 --------------------~~~~~~~~~~~~Q~~~~~~~~-~~~g~d----------------------------f~~g 127 (396)
T cd06460 97 --------------------DSLLSGPFPEEKQEALGRELL-EALGFD----------------------------FDRG 127 (396)
T ss_pred --------------------ccccCCCCCHHHHHHHHHHHH-HHhCCc----------------------------ccCC
Confidence 001234567666544333333 334422 2335
Q ss_pred eeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHh-hhcCCCCcc
Q 037955 448 YLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSL-LSRTDYQHF 526 (733)
Q Consensus 448 ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~l-ls~~~~~~~ 526 (733)
|+|..++| ||..+.+ ..+. |.+||.+... .+.|.|+.||+|||+|.. +...-....
T Consensus 128 riD~s~hp------F~~~~~~---------~dvR--Itt~y~~~d~------~~~l~t~iHE~GHalye~~l~~~~~~~p 184 (396)
T cd06460 128 RLDVSAHP------FTGGLGP---------GDVR--ITTRYDENDF------RSALFSTIHETGHALYEQGLPPELRGTP 184 (396)
T ss_pred eeecCCCC------CCCCCCC---------CCce--EEeeeCCcch------HHHHHHHHHHhhHHHHHhcCCHhHccCC
Confidence 78877765 6543311 1244 4789987621 279999999999999998 542111122
Q ss_pred CCCcccchhhhhhHHHHHHhhhcH-----HHHHHHHhcccC--CCCChHHHHHHHHHhh-hhh-----hHHHHHHHHHHH
Q 037955 527 SGTRVALDFAETPSNLFEYYAWDY-----RVLRRFAKHYLT--GEIVPEKLVKSMQGAR-DMF-----AATELQRQIFYA 593 (733)
Q Consensus 527 sgt~~~~D~~E~pS~~~E~~~~~~-----~~L~~~s~h~~t--ge~lp~~l~~~l~~~~-~~~-----~~~~~~~ql~~a 593 (733)
.|...++-++|.+|.|+|+.+.+. .++..+..++.. +.--++++...+.... .+. .....++++++.
T Consensus 185 l~~~~sm~ihESqS~f~E~~vgrs~~F~~~l~~~l~~~f~~~~~~~~~e~~~~~~n~v~ps~iRveADEvty~lHiilR~ 264 (396)
T cd06460 185 LGGGASMGIHESQSRLWENQVGRSRAFWEFLYPKLKKTFPEQLKDVSLENFYRAVNRVQPSLIRVEADEVTYNLHIMLRY 264 (396)
T ss_pred cccCCCchhHHHHHHHHHHHHhcCHHHHHHHHHHHHHHcccccccCCHHHHHHHHhhcCccceeeecchhhHHHHHHHHH
Confidence 344569999999999999988763 234445445422 1112444322222212 111 113578999999
Q ss_pred HhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCCCCCCcccccccccc-cccccchhHHHHHHHHHHHHHHHHHcCC-CCH
Q 037955 594 LVDQTLFGERLGQTRDTSSIVADMKRQHTSWNHVEGTHWHIRFSHFI-NYGAGYYSYLYAKCFAATIWQKLCQEDP-LSL 671 (733)
Q Consensus 594 ~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~-~Y~a~YYsYl~s~v~A~di~~~~~~~~~-~~~ 671 (733)
.|+..+..+ +.+..++.++|++..++|.|+.++.+...+.+-.|.+ |.-+|+|+|+.|.++|+++|+.+.++.+ +..
T Consensus 265 e~Ek~l~~G-~l~v~dLp~~W~~~~~eylG~~~~~d~~g~lQd~HW~~g~fGYfptYalG~l~Aaql~~~~~~~~~~~~~ 343 (396)
T cd06460 265 ELEKALIEG-DLEVADLPEAWNEKMKEYLGIRPPNDAEGCLQDIHWSGGSFGYFPTYTLGNLYAAQLFAAAKKDLPDIDE 343 (396)
T ss_pred HHHHHHHcC-CCCHHHHHHHHHHHHHHHcCCCCCCCccccccccccccCcCCcCchhHHHHHHHHHHHHHHHHhCccHHH
Confidence 999999987 6677899999999999999976554333335667776 4333569999999999999999877643 222
Q ss_pred H--------HHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcC
Q 037955 672 T--------TGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVK 722 (733)
Q Consensus 672 ~--------~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g 722 (733)
. ...++|+.|++.||+.+|.|++++++|. +.|++.|+++|.
T Consensus 344 ~i~~g~f~~~~~wL~e~i~~~G~~~~p~eLl~~atG~----------~l~~~~f~~yL~ 392 (396)
T cd06460 344 QIERGDFSPLLEWLRENIHQHGSRYSPDELLKKATGE----------PLNPEYFLEYLE 392 (396)
T ss_pred HHhccChHHHHHHHHHHHHhccCCCCHHHHHHHHhCC----------CCChHHHHHHHH
Confidence 2 5688999999999999999999999999 599999999873
No 18
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 °C. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=99.82 E-value=6.1e-16 Score=172.41 Aligned_cols=413 Identities=17% Similarity=0.217 Sum_probs=258.2
Q ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHh
Q 037955 125 SKASMRISEYLHYLNTNHTLYDAVKKAELDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQ 204 (733)
Q Consensus 125 ~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~ 204 (733)
.++...|+....++.+++.+-+.|..+....+.|++.+++.++.+.++++++ ..+|.+.-+++.++..+-...-.+..+
T Consensus 44 aeqla~Ls~~~hel~T~~~~~elL~~l~~~~~~l~~~e~a~lr~~~r~~~~~-~~iP~elv~~~~~~~s~a~~~W~~AR~ 122 (494)
T PF02074_consen 44 AEQLATLSGLIHELLTSPEIGELLEELEEEKEDLDEWERANLREMRRDYERA-TKIPEELVEELARLTSEAEQAWEEARE 122 (494)
T ss_dssp HHHHHHHHHHHHHHHTSHHHHHHHHHHHCSGSSTTHHHHHHHHHHHHHHHHH-HCS-HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHcCHHHHHHHHHHhccccCCCHHHHHHHHHHHHHHHHH-hCCCHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4566778888999999999999999995555679999999999999999884 355666555555444432222222111
Q ss_pred hcccCCCCcccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHh
Q 037955 205 NIINDPGHVDIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRKMVYIQG 284 (733)
Q Consensus 205 ni~~~~~~v~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~ 284 (733)
+
T Consensus 123 ~------------------------------------------------------------------------------- 123 (494)
T PF02074_consen 123 N------------------------------------------------------------------------------- 123 (494)
T ss_dssp C-------------------------------------------------------------------------------
T ss_pred h-------------------------------------------------------------------------------
Confidence 0
Q ss_pred ccCCcchHHHHHHHHHHHHHHHHHcCCC-CHHHHHhcc---CcCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhc
Q 037955 285 HSVPQANHEVLHELIAARNELAQIMGYR-SYAEFIVMP---NMASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSC 360 (733)
Q Consensus 285 ~~~~~~n~~~l~~ll~lR~e~A~llGy~-sya~~~l~~---~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~ 360 (733)
+....-.+.|++++.++.++|..+|++ +-.|..|.. .| +.+.|..++++|.+.+.|.+++ +.+.
T Consensus 124 -nDf~~F~P~Le~iv~l~re~a~~~~~~~~~YDaLLd~yEpg~--t~~~ld~~F~~lk~~l~~l~~~----i~~~----- 191 (494)
T PF02074_consen 124 -NDFSAFAPYLEKIVELQREIAEYLGYELSPYDALLDDYEPGM--TTEKLDEIFAELKAFLVPLLQK----ILEK----- 191 (494)
T ss_dssp -T-HHHHHHHHHHHHHHHHHHHHHCTSTTSHHHHHHHHHSTT----HHHHHHHHHHHHHHHHHHHHH----HHCH-----
T ss_pred -CCHHHHHHHHHHHHHHHHHHHHHhcCCCCcHHHHHhhhCCCC--CHHHHHHHHHHHHHHHHHHHHH----HHhc-----
Confidence 001123589999999999999999986 677888776 35 7999999999999998887765 2111
Q ss_pred CCCCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCC
Q 037955 361 GQKYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPE 440 (733)
Q Consensus 361 g~~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~ 440 (733)
+ .. .+...+..-||.+....-...++ +.+|..|.
T Consensus 192 ~---~~-------------------~~~~~l~~~~~~~~Q~~~~~~~~-~~~G~d~~----------------------- 225 (494)
T PF02074_consen 192 Q---KQ-------------------PDDSFLHGPFPEEKQKAFSRELL-EYLGFDFD----------------------- 225 (494)
T ss_dssp T---CC-------------------HTGGGGGSB--HHHHHHHHHHHH-HHHT--GC-----------------------
T ss_pred C---CC-------------------CCCccCCCCCCHHHHHHHHHHHH-HHcCCCcc-----------------------
Confidence 1 00 11122344577666654444433 45564332
Q ss_pred CCcccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhc
Q 037955 441 EGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSR 520 (733)
Q Consensus 441 ~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~ 520 (733)
...+|. ..|.||.++.++. |-+...+.+... .+.+....||.||||...-..
T Consensus 226 -----~grld~------s~HPFt~~~~~~D-----------vRiTTry~e~d~------~~~l~s~iHE~GHalYEq~~~ 277 (494)
T PF02074_consen 226 -----RGRLDE------SAHPFTTGFGPND-----------VRITTRYDEDDF------LSALFSTIHETGHALYEQGLP 277 (494)
T ss_dssp -----GEEEEE-------SS-EEEEEETTE-----------EEEEE--BTTBT------HHHHHHHHHHHHHHHHHHTS-
T ss_pred -----ceEEec------CCCCCCCCCCCCc-----------eeeecccccccH------HHHHHHHHHHHHHHHHHcCCC
Confidence 224432 4466766553322 334556655421 267888889999999976554
Q ss_pred CCCC-ccCCCcccchhhhhhHHHHHHhhhc-HHHHH----HHHhcccCC-CCChHHHHHHHHHhhhhh------hHHHHH
Q 037955 521 TDYQ-HFSGTRVALDFAETPSNLFEYYAWD-YRVLR----RFAKHYLTG-EIVPEKLVKSMQGARDMF------AATELQ 587 (733)
Q Consensus 521 ~~~~-~~sgt~~~~D~~E~pS~~~E~~~~~-~~~L~----~~s~h~~tg-e~lp~~l~~~l~~~~~~~------~~~~~~ 587 (733)
..+. ...|..+++-+.|..|-|+||.+.. ..++. .+-.++... .--++++...++..+..+ .-+..+
T Consensus 278 ~~~~~tpl~~~~smgiHESQSrf~En~vgrS~~Fw~~~~p~l~~~f~~~~~v~~e~~y~~~N~V~ps~IR~eADElTY~l 357 (494)
T PF02074_consen 278 PEYYGTPLGEGVSMGIHESQSRFWENMVGRSRAFWEFLYPKLQEHFPQLKDVSLENFYRAINRVKPSLIRVEADELTYPL 357 (494)
T ss_dssp GGGTTSTTSS-S-HHHHHHHHHHHHTTTTTSHHHHHHHHHHHHHHCGGGTT-SHHHHHHHHT-----S-GGG--TTTHHH
T ss_pred hhHccCccccccccchHHHHHHHHHHHHccCHHHHHHHHHHHHHHhCcccCCCHHHHHHHHccCCCcccchhhHHHHHHH
Confidence 3322 2234557899999999999996543 33322 222333222 223455554444332221 124567
Q ss_pred HHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCCCC-------CCcccccccccccccccchhHHHHHHHHHHHH
Q 037955 588 RQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWNHVE-------GTHWHIRFSHFINYGAGYYSYLYAKCFAATIW 660 (733)
Q Consensus 588 ~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~~~-------~~~~~~~f~Hl~~Y~a~YYsYl~s~v~A~di~ 660 (733)
..++...++.++-.+ ..+..|+.++|++..++|.|+.+.. +.||. .-.|| ||-+|..|.++|+++|
T Consensus 358 HIilRyEiEk~li~g-~l~v~dLP~~Wn~km~eyLGi~p~~d~eG~LQDvHWs---~G~fG---YFPtY~LG~~~AaQl~ 430 (494)
T PF02074_consen 358 HIILRYEIEKALING-ELSVDDLPEAWNEKMEEYLGITPPNDAEGVLQDVHWS---SGSFG---YFPTYALGNIYAAQLF 430 (494)
T ss_dssp HHHHHHHHHHHHHTT-SS-GGGHHHHHHHHHHHHHS---SSCTTTTTS-STTT---TT-TS----THHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcC-CCChhhhHHHHHHHHHHHcCCCCCCCCcccccccccc---CCccc---cchhhHHHHHHHHHHH
Confidence 778888899999876 3456899999999999999975432 24563 22333 3679999999999999
Q ss_pred HHHHHc----------CCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHc
Q 037955 661 QKLCQE----------DPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEV 721 (733)
Q Consensus 661 ~~~~~~----------~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~ 721 (733)
+...++ |.+.+ .-.++++.|.+.|+..+|.|++++.+|. ++|++.|+++|
T Consensus 431 ~~~~~~~pd~~~~i~~G~f~~-i~~WL~e~Ih~~G~~y~~~eLl~~aTGe----------~l~~~~~~~YL 490 (494)
T PF02074_consen 431 AAMKKDIPDLDEQIAKGDFSP-IRDWLRENIHQYGSRYTPDELLKKATGE----------PLNPEYFIDYL 490 (494)
T ss_dssp HHHHHHHTTHHHHHHTT--HH-HHHHHHHHTGGGTTSS-HHHHHHHHHSS----------S--THHHHHHH
T ss_pred HHHHHHccCHHHHHHcCChHH-HHHHHHHHHHHhccCCCHHHHHHHHhCC----------CCCHHHHHHHH
Confidence 987543 33433 5589999999999999999999999999 69999999986
No 19
>PF01401 Peptidase_M2: Angiotensin-converting enzyme This Prosite motif covers only the active site.; InterPro: IPR001548 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M2 (clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The catalytic residues and zinc ligands have been identified, the zinc ion being ligated to two His residues within the motif HEXXH, showing that the enzyme belongs to the E sub-group of metalloproteases. Peptidyl-dipeptidase A (angiotensin-converting enzyme) is a mammalian enzyme responsible for cleavage of dipeptides from the C-termini of proteins, notably converting angiotensin I to angiotensin II. The enzyme exists in two differentially transcribed forms, the most common of which is from lung endothelium; this contains two homologous domains that have arisen by gene duplication. The testis-specific form contains only the C-terminal domain, arising from a duplicated promoter region present in intron 12 of the gene. Both enzymatic forms are membrane proteins that are anchored by means of a C-terminal transmembrane domain. Both domains of the endothelial enzyme are active, but have differing kinetic constants. 
A number of insect enzymes have been shown to be similar to peptidyl-dipeptidase A, these containing a single catalytic domain.; GO: 0008237 metallopeptidase activity, 0008241 peptidyl-dipeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 2YDM_A 3BKL_A 2C6N_B 1UZE_A 3BKK_A 2C6F_B 2IUX_A 2IUL_A 2XYD_B 3NXQ_B ....
Probab=99.79 E-value=4.1e-16 Score=177.92 Aligned_cols=504 Identities=14% Similarity=0.203 Sum_probs=300.6
Q ss_pred HHHHHHHHhhHHHhhhhchhhhhhccCCchHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHhhcCCCC-CHHHHHHHH
Q 037955 89 EIIRAMDEISDAVCSVVDSAELCRQTHPDREFVEEASKASMRISEYLHYLNTNHTLYDAVKKAELDGHLL-SKEAHRAAN 167 (733)
Q Consensus 89 ~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~e~r~aa~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~L-~~e~~r~l~ 167 (733)
..+..++.-...+|.....+.-...+.-.++-.....++...+.+|..+... ..+.. +...+ ++..+|.++
T Consensus 18 ~FL~~yn~ea~~v~~~~a~A~W~y~TNITdeN~~~~~ea~~~~a~F~k~~~~------~AkkF--d~~~~~d~~lkRql~ 89 (595)
T PF01401_consen 18 AFLEEYNREAEQVYYQSAEAQWNYNTNITDENQQLMAEASLEYAKFQKEYAK------EAKKF--DWTKFKDDTLKRQLK 89 (595)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHTS-SHHHHHHHHHHHHHHHHHHHHHHH------HHTTS--HGGGGSSHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHhhhhHhhcCCChHHHHHHHHHHHHHHHHHHHHHH------HHHhc--CcccCCCHHHHHHHH
Confidence 4555566666667777777777677777777777777777777777653322 11111 11122 344566655
Q ss_pred HHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccCCCCCHHHHHhhccccccCCCCCcchhhhh
Q 037955 168 HLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQNIINDPGHVDIFPESRIPKHIHHLLKPICRLTSGPSRESLISW 247 (733)
Q Consensus 168 ~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~ni~~~~~~v~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (733)
.+.. .....|++++.+++.++..+ +...|+..-. -.
T Consensus 90 ~L~~---lG~AaL~~~k~~el~~i~s~---M~~iYst~kv------C~-------------------------------- 125 (595)
T PF01401_consen 90 KLSK---LGPAALPPEKLEELNKILSE---MESIYSTAKV------CP-------------------------------- 125 (595)
T ss_dssp HHTS----GGGGSTHHHHHHHHHHHHH---HHHHHHH-EE------EE--------------------------------
T ss_pred HHhh---cccccCCHHHHHHHHHHHHH---HHHHhccccc------cC--------------------------------
Confidence 5432 23356788888888777664 4444655211 00
Q ss_pred cccCCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHhccC-CcchHHHHHHHHHHHHHHHHHcCCCCHHHHHhcc-CcCC
Q 037955 248 DNKKEKGFRITTDSRILQSILQWTSDDEVRKMVYIQGHSV-PQANHEVLHELIAARNELAQIMGYRSYAEFIVMP-NMAS 325 (733)
Q Consensus 248 ~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~~~~-~~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~-~ma~ 325 (733)
.++..+.++ |++..+|+.+.|.+.++-+|.++.+. .....+.+.+++++-++.|++.||++..+|.... .|
T Consensus 126 ----~~~~cl~Le-Pdl~~ima~Srd~~eL~~~W~~Wr~~vg~~~r~~y~~~V~L~N~aA~~nG~~d~g~~Wr~~ye~-- 198 (595)
T PF01401_consen 126 ----YDGKCLSLE-PDLEEIMATSRDYDELLYAWEGWRDAVGPPMRPLYERYVELSNEAARLNGFKDTGEYWRSSYEM-- 198 (595)
T ss_dssp ----SSSEEEETT-THHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSSSHHHHHHGGG-C--
T ss_pred ----CCCccccch-hhHHHHHhccCCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhcc--
Confidence 012335554 68999999999999999999999863 4567899999999999999999999999998766 33
Q ss_pred CHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH----hcCCC----C--------CCC--CcCCHHHHHHH-h-hhhccC
Q 037955 326 SPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRK----SCGQK----Y--------VHL--EPWDEAYYTAM-M-KSSAYN 385 (733)
Q Consensus 326 spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~----~~g~~----~--------~~l--~pWD~~y~~~~-~-~~~~~~ 385 (733)
++....++.|++.++|+-++ |+.+.+. .+|.. . +.+ +-|.--|-.-. . .+..++
T Consensus 199 --~~~~~~~~~lw~~i~PLY~~----LHayVR~~L~~~Yg~~~v~~~gpIPAHLLGnmwaq~W~ni~~~~~P~p~~~~~d 272 (595)
T PF01401_consen 199 --PNFEQELERLWQQIKPLYKQ----LHAYVRRKLREKYGDDVVPPDGPIPAHLLGNMWAQSWSNIYDLVLPYPDKPSLD 272 (595)
T ss_dssp --TTHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHTTTTS-TTS-EEGGGSSSTTSS-GGGGHHHH-STTTS--S-
T ss_pred --ccHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHcCcccCCCCCCccHHHHHHHHHHHHhhhhccccCCCCCCCCC
Confidence 35778899999999998876 6666543 33421 0 122 22333221111 0 011233
Q ss_pred CChhhccCCCCHHHHHHHHHHHHHHhhC-ceeEeecCCCCCccccccceeEEEcCCCCcccceeeecccCCCC----ccC
Q 037955 386 LDACVVASYFPLGQCIEGLKMLAESLFG-VTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDLYSRAGK----YTG 460 (733)
Q Consensus 386 ~d~~~~~~yfpl~~vl~gl~~l~~~lfg-i~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl~~R~gK----~~g 460 (733)
++...+++-+. ...|+.+++++|. +.|.++| .+-|..++..- +..|+ .+.
T Consensus 273 vt~~l~~q~~t----~~~mf~~ae~ff~SlGl~~mp---~~FW~~S~~~k------------------p~d~r~~~Chas 327 (595)
T PF01401_consen 273 VTEELVKQGYT----AKKMFKTAEEFFTSLGLPPMP---PTFWEKSMFEK------------------PTDGRDVVCHAS 327 (595)
T ss_dssp THHHHHHTT------HHHHHHHHHHHHHHTTS-------HHHHHHSB-S--------------------SSSS----S-E
T ss_pred cHHHHHHcccc----HHHHHHHHHHHHHhCCCCcCC---HHHHHHHHhcC------------------CCCCCCCCcCcc
Confidence 44333333332 2567777777764 5566654 24565443110 11121 122
Q ss_pred cccccccCceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhH
Q 037955 461 CANFAIKGGRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPS 540 (733)
Q Consensus 461 a~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS 540 (733)
||.|.-+. -+-+.+|. -++.+++.|..||+||..+.+.-+.. +.+-....-..|.|.-+
T Consensus 328 awdf~~~~-----------d~Rik~Ct---------~v~~~df~t~hhemghi~Y~~~y~~q-p~~fr~gan~gfhEAig 386 (595)
T PF01401_consen 328 AWDFYNGD-----------DFRIKMCT---------EVTMEDFLTAHHEMGHIQYYMQYKDQ-PVLFREGANPGFHEAIG 386 (595)
T ss_dssp EEEESSSS-----------EEEEE-------------SSHHHHHHHHHHHHHHHHHHHTTTS--GGGSS-SSHHHHHHHH
T ss_pred hhhccCCC-----------ccceeecc---------cCCHHHHHHHHHHHHHHHHHHHHhhC-ChhhhcCCCchHHHHHH
Confidence 33332211 11123453 23469999999999999998775432 22222234568999999
Q ss_pred HHHHHhhhcHHHHHHHHhcccCCCCChHHHHHHHHHh-hhh---hhHHHHHHHHHHHHhHHHhhcCCCCCCCCHHHHHHH
Q 037955 541 NLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVKSMQGA-RDM---FAATELQRQIFYALVDQTLFGERLGQTRDTSSIVAD 616 (733)
Q Consensus 541 ~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~~l~~~-~~~---~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~ 616 (733)
.++-.-+..|..|+++.---.. ..-.+..++.|... ..- ..-..++.++.+..|+ +. .+..+.++-|++
T Consensus 387 d~ials~~tp~hL~~~gLl~~~-~~~~~~~in~L~~~AL~ki~~lPf~~~~d~wRw~vf~-----g~-i~~~~~n~~wW~ 459 (595)
T PF01401_consen 387 DTIALSVSTPKHLKRIGLLDNV-TDDEESDINFLLKQALDKIAFLPFGYLMDKWRWDVFS-----GS-IPPDQYNEAWWE 459 (595)
T ss_dssp HHHHHHHTSHHHHHHTTSSSTT-TTSHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHT-----TS-S-CCGHHHHHHH
T ss_pred HHHHHHcCCHHHHHHcCCCCCC-CcchHHHHHHHHHHhhhHHHhccHHHHHHHHHHHHhc-----CC-CCHHHHHHHHHH
Confidence 9999999999999987654222 22234444444222 111 1111233444444443 32 234679999999
Q ss_pred HHhhccCCC-CCCC--Ccc-cccccccccccccchhHHHHHHHHHHHHHHHHHc----CC-------CCHHHHHHHHHHH
Q 037955 617 MKRQHTSWN-HVEG--THW-HIRFSHFINYGAGYYSYLYAKCFAATIWQKLCQE----DP-------LSLTTGTTLRTKI 681 (733)
Q Consensus 617 l~~~~~~~~-~~~~--~~~-~~~f~Hl~~Y~a~YYsYl~s~v~A~di~~~~~~~----~~-------~~~~~G~~~r~~v 681 (733)
++.+|.|+. +++. ..+ +....|+.. ...|..|..|.++--++++.++.. || -++++|++++ ++
T Consensus 460 lr~~yqGi~pP~~r~~~~fD~~ak~hi~~-~~py~ryfls~ilqFQ~~~~lC~~ag~~~pLh~Cdiy~s~~aG~~l~-~m 537 (595)
T PF01401_consen 460 LREKYQGIVPPVERSESDFDPGAKYHIPA-NTPYIRYFLSTILQFQFYKALCEAAGHEGPLHRCDIYGSKEAGRKLK-KM 537 (595)
T ss_dssp HHHHHHSEE-SS-EETTS-GGGGSHHHHT-T--SHHHHHHHHHHHHHHHHHHHHTTTTSSGGG---TT-HHHHHHHH-HH
T ss_pred HHHHcCCccCCCCCCcccCCcchhhhhcc-ccccHHHHHHHHHHHHHHHHHHHhcCCCCCccccccccchhhHHHHH-HH
Confidence 999999974 3321 122 345668773 356999999999999999998753 34 2688999996 89
Q ss_pred hcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHc
Q 037955 682 LQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEV 721 (733)
Q Consensus 682 L~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~ 721 (733)
|+.|+|+++.++++.++|.+ .-|.+++|+++
T Consensus 538 l~~G~S~~W~~~l~~~tg~~---------~~~a~~ll~YF 568 (595)
T PF01401_consen 538 LSLGSSKPWPEALEELTGER---------EMSASALLEYF 568 (595)
T ss_dssp HTTTTSS-HHHHHHHHHSSS---------S--SHHHHHHH
T ss_pred hcCCCCCCHHHHHHHHhCCC---------CcChHHHHHHH
Confidence 99999999999999999976 78888888875
No 20
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=99.72 E-value=4.7e-14 Score=150.89 Aligned_cols=416 Identities=16% Similarity=0.161 Sum_probs=262.5
Q ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHh
Q 037955 125 SKASMRISEYLHYLNTNHTLYDAVKKAELDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQ 204 (733)
Q Consensus 125 ~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~ 204 (733)
.++...|+....++.+.+.+=+.|.+++... .|++.++.+++.+.++++.+ .-+|++.-+++..+..+-...-.+-..
T Consensus 44 s~a~a~Ls~l~~~l~t~~e~~~ll~~a~~e~-~L~e~~~~~vre~~r~~~~~-~~iP~e~~~e~s~~~s~a~~aWreAr~ 121 (497)
T COG2317 44 SEAIAELSVLRHELLTSPEFAELLEKAEEEK-DLSEIEAGVVRELKREYEKA-KKIPEELVKEYSKLTSKAEHAWREARE 121 (497)
T ss_pred HHHHHHHHHHHHHHhccHHHHHHHHHhhhcc-CccHHHHHHHHHHHHHHHHH-hcCCHHHHHHHHHHHHHHHHHHHHHhh
Confidence 4567778888888999999999999985554 79999999999999999885 456766555554444422221111000
Q ss_pred hcccCCCCcccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCCHHHHHHHHHHh
Q 037955 205 NIINDPGHVDIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSDDEVRKMVYIQG 284 (733)
Q Consensus 205 ni~~~~~~v~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d~~~Rk~~~~a~ 284 (733)
..
T Consensus 122 --------------------------------------------kn---------------------------------- 123 (497)
T COG2317 122 --------------------------------------------KN---------------------------------- 123 (497)
T ss_pred --------------------------------------------cc----------------------------------
Confidence 00
Q ss_pred ccCCcchHHHHHHHHHHHHHHHHHcCCC-CHHHHHhccC-cCCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhcCC
Q 037955 285 HSVPQANHEVLHELIAARNELAQIMGYR-SYAEFIVMPN-MASSPEVVKSFLLEMSKMIKPKADEEFEAIKNFKRKSCGQ 362 (733)
Q Consensus 285 ~~~~~~n~~~l~~ll~lR~e~A~llGy~-sya~~~l~~~-ma~spe~V~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~g~ 362 (733)
...--.+.|++|+.+-.+.|+.+||. +-.+..|..- ---+.++|..++.+|.+.+.|++.+-+ .. +
T Consensus 124 --Df~~F~p~Lekiv~l~re~A~~~~~~~~pYdaLld~yEpG~t~~~i~~vF~~Lk~~L~~ll~kv~----~~-----~- 191 (497)
T COG2317 124 --DFSIFKPYLEKIVELKREFAEYRGYEEHPYDALLDLYEPGLTVRDVDRVFAELKKELVPLLDKVL----EK-----G- 191 (497)
T ss_pred --cHhhhhHHHHHHHHHHHHHHHhcccccCcHHHHHHhhcCCCcHHHHHHHHHHHHHHHHHHHHHHH----hc-----c-
Confidence 01122478899999999999999996 4556666551 122789999999999999988776622 11 1
Q ss_pred CCCCCCcCCHHHHHHHhhhhccCCChhhccCCCCHHHHHHHHHHHHHHhhCceeEeecCCCCCccccccceeEEEcCCCC
Q 037955 363 KYVHLEPWDEAYYTAMMKSSAYNLDACVVASYFPLGQCIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKLSLQHPEEG 442 (733)
Q Consensus 363 ~~~~l~pWD~~y~~~~~~~~~~~~d~~~~~~yfpl~~vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~~v~d~~~~ 442 (733)
...++..+ ...-|+.+...+-...++ +++|..|+
T Consensus 192 --~~~~~~~~------------------~~~~~~~e~q~~~~~~vl-~~lGfdf~------------------------- 225 (497)
T COG2317 192 --KSPRSDLL------------------EKEKYDKEKQEALGLRVL-ELLGFDFE------------------------- 225 (497)
T ss_pred --CCCccccc------------------ccCCCCHHHHHHHHHHHH-HHhCCCcc-------------------------
Confidence 01111111 012344444333333323 55554332
Q ss_pred cccceeeecccCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhcCC
Q 037955 443 EMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSRTD 522 (733)
Q Consensus 443 ~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~ 522 (733)
..-+ .-..|.||..+-++. |-|...|.+... .+-+....||+|||+.-.-....
T Consensus 226 ---~GRl------D~S~HPF~~g~~~~D-----------VRITTRy~~~df------~~aL~g~iHE~GHAlYEqn~~~~ 279 (497)
T COG2317 226 ---RGRL------DVSVHPFTTGLPIND-----------VRITTRYNEQDF------RSALFGTIHETGHALYEQNLDEA 279 (497)
T ss_pred ---Cccc------cCCCCCccCCCCCCc-----------eeEEeecCCccH------HHHHHHHHHhhhhHHHHcCCCHH
Confidence 1223 335677876653211 334667765432 15778889999999986544322
Q ss_pred C-CccCCCcccchhhhhhHHHHHHhhhc-HHHHHHHH----hcccCC-CCChH-HHHHHHHHhhhhh------hHHHHHH
Q 037955 523 Y-QHFSGTRVALDFAETPSNLFEYYAWD-YRVLRRFA----KHYLTG-EIVPE-KLVKSMQGARDMF------AATELQR 588 (733)
Q Consensus 523 ~-~~~sgt~~~~D~~E~pS~~~E~~~~~-~~~L~~~s----~h~~tg-e~lp~-~l~~~l~~~~~~~------~~~~~~~ 588 (733)
+ -...|+..++-+.|-.|-||||.+-. +.+++.+. .|+... +..+. ++.......+..+ .-+.-+.
T Consensus 280 l~gtPlg~g~smgiHESQSrF~En~VGRs~aFw~~~~~~~~~~~~~~~~~~s~dd~y~~vn~v~~~lIRveADElTY~lH 359 (497)
T COG2317 280 LLGTPLGEGRSMGIHESQSRFWENQVGRSRAFWEAIYPLLRKHFPEQFDKYSLDDFYRAVNRVEPSLIRVEADELTYPLH 359 (497)
T ss_pred HcCCcccCCcccccccHHHHHHHHHhccCHHHHHHHHHHHHHhhhhhcccCCHHHHHHHHhcccCcceeeeccccceeee
Confidence 1 11234557899999999999998753 33333332 232100 11222 2222222211111 1123445
Q ss_pred HHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCCCC-------CCcccccccccccccccchhHHHHHHHHHHHHH
Q 037955 589 QIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWNHVE-------GTHWHIRFSHFINYGAGYYSYLYAKCFAATIWQ 661 (733)
Q Consensus 589 ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~~~-------~~~~~~~f~Hl~~Y~a~YYsYl~s~v~A~di~~ 661 (733)
.++...++..+-++ .....|+.++|++..++|.|+.+.. +.||. ++- || ||-+|..|.++|+++|.
T Consensus 360 IilRyEiEk~li~G-~l~v~DlP~lWn~kme~yLGirp~~d~eG~LQDIHWs--~Gs-fG---YFPsYtlG~v~AAQ~~~ 432 (497)
T COG2317 360 IILRYEIEKELISG-ELKVDDLPELWNDKMEEYLGIRPKNDAEGVLQDIHWS--HGS-FG---YFPTYTLGNVYAAQLYA 432 (497)
T ss_pred eehhHHHHHHHHcC-CcchhhhHHHHHHHHHHHcCCCCCCcccccccccccc--CCC-cC---cCchhhhHHHHHHHHHH
Confidence 56667777777766 3456799999999999999975432 24663 222 22 35799999999999998
Q ss_pred HHHHc----------CCCCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccccCCCCCCChHHHHHHcC
Q 037955 662 KLCQE----------DPLSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRYCNGGIVPDITSFSDEVK 722 (733)
Q Consensus 662 ~~~~~----------~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~~~~g~~p~~~a~l~~~g 722 (733)
...++ |.+++ ...++|++|.+.|...+|.|++++.+|. +.|++.|+++|.
T Consensus 433 a~~~~~p~~d~~i~~gd~~~-i~~WL~e~ih~~Gs~~~p~eLi~~atGE----------~lnp~y~i~yL~ 492 (497)
T COG2317 433 AMKKDIPDVDALIAKGDFSP-IKNWLRENIHRHGSRYPPKELLKRATGE----------ALNPEYFIDYLE 492 (497)
T ss_pred HHHhhcccHHHHHHcCCcHH-HHHHHHHHHHHccCcCCcHHHHHHhhCC----------cCCHHHHHHHHH
Confidence 87653 33444 5689999999999999999999999998 699999999873
No 21
>KOG3690 consensus Angiotensin I-converting enzymes - M2 family peptidases [Amino acid transport and metabolism]
Probab=99.56 E-value=1.3e-10 Score=127.73 Aligned_cols=489 Identities=18% Similarity=0.246 Sum_probs=290.1
Q ss_pred cCCchHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHH
Q 037955 114 THPDREFVEEASKASMRISEYLHYLNTNHTLYDAVKKAELDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNM 193 (733)
Q Consensus 114 ~~~d~e~r~aa~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~ 193 (733)
+.-+++...+-.++..++.+|...+.++.+.|+.. ..-|+-.+|.|+++-. ...+ .|+++.-+++..+..
T Consensus 61 ~~~~~~~~~a~~ea~~~~~~~~r~~a~q~k~~~~~-------~l~D~~~rRql~~~s~-lg~s--aL~~~d~~~~~~ll~ 130 (646)
T KOG3690|consen 61 TNGTEENLLALLEAEDKLVEFVRSLATQAKKFDVA-------ELQDERLRRQLQKVSQ-LGMS--ALSPDDLKRYNQLLS 130 (646)
T ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHHhhccHh-------hccCHHHHHHHHHHhh-hccc--cCCHHHHHHHHHHHH
Confidence 34455567777888888888888777766665521 1124556777777532 2233 566666566666555
Q ss_pred HHHHHHHHHHhhcccCCCCcccccCCCCCHHHHHhhccccccCCCCCcchhhhhcccCCCCEEEecCCCcHHHHhhhCCC
Q 037955 194 DIFQLCREFNQNIINDPGHVDIFPESRIPKHIHHLLKPICRLTSGPSRESLISWDNKKEKGFRITTDSRILQSILQWTSD 273 (733)
Q Consensus 194 ~i~~L~~~F~~ni~~~~~~v~~~~l~glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~l~~~~~~~vL~~~~d 273 (733)
++.+.+.. .+ | +.+ +. +.+ ..+.+ .|.+..+|.++.|
T Consensus 131 ---~m~~~y~~----~~--v-------------------C~y------------~~-p~~-~~L~l-~P~l~~i~~~Srd 167 (646)
T KOG3690|consen 131 ---SMSTNYNT----AT--V-------------------CAY------------DQ-PSA-CCLLL-EPQLQSIMANSRD 167 (646)
T ss_pred ---HHHhhcCC----ce--e-------------------eCC------------CC-CCC-cceec-CccHHHHHhccCC
Confidence 33333322 11 1 000 00 001 11222 4789999999999
Q ss_pred HHHHHHHHHHhccCC-cchHHHHHHHHHHHHHHHHHcCCCCHHHHHhccCcCCCHHHHHHHHHHHHHhhhHHHHHHHHHH
Q 037955 274 DEVRKMVYIQGHSVP-QANHEVLHELIAARNELAQIMGYRSYAEFIVMPNMASSPEVVKSFLLEMSKMIKPKADEEFEAI 352 (733)
Q Consensus 274 ~~~Rk~~~~a~~~~~-~~n~~~l~~ll~lR~e~A~llGy~sya~~~l~~~ma~spe~V~~fL~~l~~~~~p~~~~e~~~L 352 (733)
-+-+.-.|.+|.+.. ......+.+++.+-++.|++-||.++.+|....-- ++-+..+-|+.+...++|..+. |
T Consensus 168 ~~eL~~~W~~wr~~~g~~~r~~y~~yv~L~~e~A~lNg~~~~gdyW~~~yE--~~~~~~~~ld~i~~ei~PlY~~----L 241 (646)
T KOG3690|consen 168 YDELQWYWVEWRRKIGKAMRQSYEEYVDLSNEAAQLNGYVNGGDYWRLAYE--TPGDFEQDLDAIFEEIRPLYRQ----L 241 (646)
T ss_pred HHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhc--CchhHHHHHHHHHHHHHHHHHH----H
Confidence 999999999998743 34567999999999999999999999999987632 2336777888899999998765 6
Q ss_pred HHHHHH----hcCCC-CCCC-------------CcCCHHHHHHHhhhhc--cCCChhhccCCCCHHHHHHHHHHHHHHhh
Q 037955 353 KNFKRK----SCGQK-YVHL-------------EPWDEAYYTAMMKSSA--YNLDACVVASYFPLGQCIEGLKMLAESLF 412 (733)
Q Consensus 353 ~~~~~~----~~g~~-~~~l-------------~pWD~~y~~~~~~~~~--~~~d~~~~~~yfpl~~vl~gl~~l~~~lf 412 (733)
+.+.+. .+|.. .... +.|---|....--.++ .++..+...+-.. .+.||..+.+.|
T Consensus 242 HAYvRr~L~~~Ygp~~i~~~gpiPahLlgnm~gq~Ws~~y~~~~P~pe~~~~dvt~~m~~qg~~----~qkmF~~a~efF 317 (646)
T KOG3690|consen 242 HAYVRRKLRGAYGPDGISRDGPIPAHLLGNMWGQDWSNHYDLTTPFPERPLIDVTMEMVKQGYT----VQKMFKLAAEFF 317 (646)
T ss_pred HHHHHHHHHHhcCcccCCCCCCcCHHHHhhhhccchHhhhhcccCCCCCCCCchHHHHHHcccc----HHHHHHHHHHHH
Confidence 666543 23320 0111 1122222111111111 1111111111111 156777777766
Q ss_pred C-ceeEeecCCCCCccccccceeEEEcCCCCcccceeeecccCCCCccCcccccccCceecCCCCcccc-EEEEEccCCC
Q 037955 413 G-VTFHSVPLAPGESWHPDVLKLSLQHPEEGEMGYLYLDLYSRAGKYTGCANFAIKGGRRLSETEYQLP-VVALICNFPG 490 (733)
Q Consensus 413 g-i~~~~~~~~~~~~wh~dV~~~~v~d~~~~~lg~~ylDl~~R~gK~~ga~~~~i~~~~~~~~g~~~~P-~v~l~~Nf~~ 490 (733)
- +.|-.+| .+.|...+. .|++-..++.|..-.-.+ +..| +-+-.|
T Consensus 318 ~SlGl~~lp---~sfw~~s~~--------------------~~p~d~~~v~C~~sAwdf------~~~~Dfrik~C---- 364 (646)
T KOG3690|consen 318 TSLGLPALP---PSFWTNSIL--------------------TRPDDRDMVVCHASAWDF------YAGPDFRIKYC---- 364 (646)
T ss_pred HHcCCCcCC---hHHHhHHHh--------------------cCCCCCCceeechhhhcc------ccCCCceeeec----
Confidence 3 3333332 345543321 223322223221100000 0111 111234
Q ss_pred CCCCCCCcChhhHHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHH
Q 037955 491 SHNLSVRLNHHEVETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKL 570 (733)
Q Consensus 491 ~~~~~~lL~~~~v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l 570 (733)
+.++..++.+.+||+||.-.++.-+.. +.+......--|.|...-.+-.-+..|..|+++.---..+..-.+..
T Consensus 365 -----t~vd~~~f~~~Hhem~hiQY~l~y~~q-P~llre~anPgFheAIgd~~als~stPrhL~slgLL~~~~~~d~~~~ 438 (646)
T KOG3690|consen 365 -----TKVDEEDFEQAHHEMGHIQYYLQYKQQ-PFLLREGANPGFHEAIGDAFALSVSTPRHLHSLGLLSEDLDEDDEVR 438 (646)
T ss_pred -----CcCCHHHHHHHHHHHHHHHHHHHHHhc-cHHhhCCCCCcHHHHHHHHHHHhcCCHHHHHHhcccccccccchHHH
Confidence 355668999999999999988887653 33333444567889999999999999999887653321222111334
Q ss_pred HHHHHHh-hhhh---hHHHHHHHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCC-CCCC--cc-cccccccccc
Q 037955 571 VKSMQGA-RDMF---AATELQRQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWNH-VEGT--HW-HIRFSHFINY 642 (733)
Q Consensus 571 ~~~l~~~-~~~~---~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~-~~~~--~~-~~~f~Hl~~Y 642 (733)
|++|-+. .... .-...+..+.+..|| + .....+.+.-|++++.+|.|+.+ ++.+ .+ +...-|+ .-
T Consensus 439 IN~L~k~aL~~v~~LPf~y~~DkwR~~Vf~-----G-~i~k~~~N~~~W~lr~ky~Gv~~P~~r~~~dfD~~akf~~-~~ 511 (646)
T KOG3690|consen 439 INRLFKMALDKVAFLPFTYALDKWRYEVFD-----G-EIPKDQYNCAYWELRNKYEGVRPPVPRTNKDFDPPAKFHI-QV 511 (646)
T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhHHHHhc-----C-CCCcchhhHHHHHHHHHhcCcCCCCcCccccCCchhhhcc-Cc
Confidence 4544222 1111 112233444444444 2 22346799999999999999743 3321 11 1233443 33
Q ss_pred cccchhHHHHHHHHHHHHHHHHHc-C---C-------------CCHHHHHHHHHHHhcCCCCCCHHHHHHHhcCCCcccc
Q 037955 643 GAGYYSYLYAKCFAATIWQKLCQE-D---P-------------LSLTTGTTLRTKILQHGGAKEPADMLNDLVGDGILRY 705 (733)
Q Consensus 643 ~a~YYsYl~s~v~A~di~~~~~~~-~---~-------------~~~~~G~~~r~~vL~~Ggs~~~~ell~~flGrd~~~~ 705 (733)
...|-.|..|.|+--++++.++.. | | .++++|+++| .+++.|.|+++.|.++.++|..
T Consensus 512 d~~~~ryf~s~vlqFQf~kalC~~agq~~pg~pr~pLh~CDi~~sk~aG~kL~-~~M~lG~S~~W~evLE~~tGe~---- 586 (646)
T KOG3690|consen 512 DVSYTRYFLSTVLQFQFLKALCQAAGQYGPGDPRKPLHNCDIYDSKEAGEKLR-TMMSLGSSRHWREVLEEITGEG---- 586 (646)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCchhccccCCHHHHHHHH-HHHHccCCccHHHHHHHHcCCC----
Confidence 445789999999999999988642 1 2 3689999997 8999999999999999999987
Q ss_pred cCCCCCCChHHHHHHc
Q 037955 706 CNGGIVPDITSFSDEV 721 (733)
Q Consensus 706 ~~~g~~p~~~a~l~~~ 721 (733)
+-|.+++|+++
T Consensus 587 -----els~~~LLeYF 597 (646)
T KOG3690|consen 587 -----ELSGRALLEYF 597 (646)
T ss_pred -----cCChHHHHHHH
Confidence 78888888875
No 22
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in Gram-negative bacteria that does not require N-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain C-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD and which forms a parallel beta roll, may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may each have a broad spectrum of substrates, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=87.74 E-value=2.1 Score=42.33 Aligned_cols=31 Identities=13% Similarity=-0.050 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHhhCceeEeecCCCCCcccccccee
Q 037955 400 CIEGLKMLAESLFGVTFHSVPLAPGESWHPDVLKL 434 (733)
Q Consensus 400 vl~gl~~l~~~lfgi~~~~~~~~~~~~wh~dV~~~ 434 (733)
.+...+....++-+|+|++++.. ...++++.
T Consensus 38 ~i~~A~~~w~~~~~l~F~e~~~~----~~adI~i~ 68 (186)
T cd04277 38 AARDALEAWEDVADIDFVEVSDN----SGADIRFG 68 (186)
T ss_pred HHHHHHHHHHhhcCceeEECCCC----CcceEEEE
Confidence 34555566778889999998632 34455443
No 23
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=87.05 E-value=0.47 Score=47.92 Aligned_cols=31 Identities=35% Similarity=0.390 Sum_probs=23.5
Q ss_pred cccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHh
Q 037955 478 QLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSL 517 (733)
Q Consensus 478 ~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~l 517 (733)
+.|++++..|-+.. ....||+||+||++=|-
T Consensus 57 ~~~~I~iN~n~~~~---------r~rFtlAHELGH~llH~ 87 (213)
T COG2856 57 EKPVIYINANNSLE---------RKRFTLAHELGHALLHT 87 (213)
T ss_pred cCceEEEeCCCCHH---------HHHHHHHHHHhHHHhcc
Confidence 34888887776332 69999999999997543
No 24
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=80.87 E-value=7.3 Score=37.29 Aligned_cols=15 Identities=40% Similarity=0.527 Sum_probs=13.0
Q ss_pred hhhHHHHHHHHHHHH
Q 037955 500 HHEVETLFHEFGHAL 514 (733)
Q Consensus 500 ~~~v~TLfHEfGHal 514 (733)
.+-..|+.||+||+|
T Consensus 105 ~~~~~~~~HEiGHaL 119 (157)
T cd04278 105 TDLFSVAAHEIGHAL 119 (157)
T ss_pred chHHHHHHHHhcccc
Confidence 357889999999997
No 25
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=80.41 E-value=1.4 Score=39.33 Aligned_cols=33 Identities=27% Similarity=0.366 Sum_probs=23.1
Q ss_pred ccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHHHHhhhc
Q 037955 479 LPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHALHSLLSR 520 (733)
Q Consensus 479 ~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~lls~ 520 (733)
.+++++-.|- .+. ....|++||+||.+.+--..
T Consensus 28 ~~~I~in~~~-~~~--------~~~f~laHELgH~~~~~~~~ 60 (122)
T PF06114_consen 28 NPIIFINSNL-SPE--------RQRFTLAHELGHILLHHGDE 60 (122)
T ss_dssp TTEEEEESSS--HH--------HHHHHHHHHHHHHHHHH-HH
T ss_pred CCEEEECCCC-CHH--------HHHHHHHHHHHHHHhhhccc
Confidence 4777776652 222 78999999999999876553
No 26
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=80.13 E-value=1.2 Score=45.24 Aligned_cols=19 Identities=47% Similarity=0.670 Sum_probs=16.8
Q ss_pred hhHHHHHHHHHHHHHHhhh
Q 037955 501 HEVETLFHEFGHALHSLLS 519 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~lls 519 (733)
.-+.||+||+|||+.+.+.
T Consensus 91 ~~~~~l~HE~GHAlI~~~~ 109 (220)
T PF14247_consen 91 NVLFTLYHELGHALIDDLD 109 (220)
T ss_pred HHHHHHHHHHHHHHHHHhc
Confidence 4789999999999998874
No 27
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix and are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive. 
The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=76.14 E-value=1.4 Score=41.82 Aligned_cols=22 Identities=14% Similarity=0.105 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHhhCceeEeec
Q 037955 399 QCIEGLKMLAESLFGVTFHSVP 420 (733)
Q Consensus 399 ~vl~gl~~l~~~lfgi~~~~~~ 420 (733)
+++...+..-....+++|++++
T Consensus 25 ~~i~~A~~~W~~~~~~~F~~~~ 46 (154)
T PF00413_consen 25 DAIRQAFQAWNDVAPLNFTEVS 46 (154)
T ss_dssp HHHHHHHHHHHTTSSEEEEEES
T ss_pred HHHHHHHHHHHhcCCceEEecc
Confidence 3455666667788889999886
No 28
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=75.86 E-value=14 Score=37.02 Aligned_cols=16 Identities=44% Similarity=0.698 Sum_probs=12.8
Q ss_pred hHHHHHHHHHHHHHHh
Q 037955 502 EVETLFHEFGHALHSL 517 (733)
Q Consensus 502 ~v~TLfHEfGHalH~l 517 (733)
...|+.|||||||=..
T Consensus 92 ~~~~i~HElgHaLG~~ 107 (198)
T cd04327 92 FSRVVLHEFGHALGFI 107 (198)
T ss_pred HHHHHHHHHHHHhcCc
Confidence 4579999999998543
No 29
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=69.23 E-value=3.5 Score=36.43 Aligned_cols=25 Identities=32% Similarity=0.717 Sum_probs=19.0
Q ss_pred EEccCCCCCCCCCCcChhhHHHHHHHHHHHHHH
Q 037955 484 LICNFPGSHNLSVRLNHHEVETLFHEFGHALHS 516 (733)
Q Consensus 484 l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHalH~ 516 (733)
.+|-|..-. .-+.||+|||-|+|++
T Consensus 66 Fv~~we~y~--------qvlktllhefrh~mQh 90 (126)
T PF13058_consen 66 FVCMWEEYE--------QVLKTLLHEFRHAMQH 90 (126)
T ss_pred EEEehHHHH--------HHHHHHHHHHHHHHHh
Confidence 357666543 4689999999999984
No 30
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=69.18 E-value=3.8 Score=41.11 Aligned_cols=22 Identities=41% Similarity=0.611 Sum_probs=18.9
Q ss_pred hhHHHHHHHHHHHHHHhhhcCC
Q 037955 501 HEVETLFHEFGHALHSLLSRTD 522 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~lls~~~ 522 (733)
.-+.|++||+||++=.+++-.+
T Consensus 21 ~~l~t~~HE~gHal~a~l~G~~ 42 (200)
T PF13398_consen 21 RLLVTFVHELGHALAALLTGGR 42 (200)
T ss_pred HHHHHHHHHHHHHHHHHHhCCC
Confidence 5678999999999999998655
No 31
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=68.64 E-value=2.9 Score=40.18 Aligned_cols=23 Identities=13% Similarity=0.068 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHhhCceeEeecC
Q 037955 399 QCIEGLKMLAESLFGVTFHSVPL 421 (733)
Q Consensus 399 ~vl~gl~~l~~~lfgi~~~~~~~ 421 (733)
.++...+...++...|+|+++..
T Consensus 18 ~~i~~A~~~W~~~~~i~F~e~~~ 40 (165)
T cd04268 18 AAILDAIEAWNKAFAIGFKNAND 40 (165)
T ss_pred HHHHHHHHHHHHHhCcCceeccc
Confidence 44555556677888999998753
No 32
>PF14891 Peptidase_M91: Effector protein
Probab=68.59 E-value=3.5 Score=40.40 Aligned_cols=21 Identities=24% Similarity=0.305 Sum_probs=18.1
Q ss_pred hhHHHHHHHHHHHHHHhhhcC
Q 037955 501 HEVETLFHEFGHALHSLLSRT 521 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~lls~~ 521 (733)
.-+..|+|||.||.|.+-...
T Consensus 102 ~p~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 102 PPFVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHHHCCCC
Confidence 578999999999999887654
No 33
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=68.51 E-value=4.4 Score=40.91 Aligned_cols=23 Identities=17% Similarity=0.137 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHhhCceeEeec
Q 037955 398 GQCIEGLKMLAESLFGVTFHSVP 420 (733)
Q Consensus 398 ~~vl~gl~~l~~~lfgi~~~~~~ 420 (733)
..++..+.+++.+=+||+|+-+.
T Consensus 32 ~~~v~~~N~~Y~s~v~i~~~Lv~ 54 (206)
T PF13583_consen 32 ASMVNFANEAYNSDVGIRFRLVH 54 (206)
T ss_pred HHHHHHHHHHHhcCCCCEEEEec
Confidence 45556677777888899997663
No 34
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=68.32 E-value=2.8 Score=40.08 Aligned_cols=15 Identities=40% Similarity=0.585 Sum_probs=12.3
Q ss_pred hhHHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHALH 515 (733)
Q Consensus 501 ~~v~TLfHEfGHalH 515 (733)
+-..|+.|||||+|=
T Consensus 103 ~~~~~~~HEiGHaLG 117 (156)
T cd04279 103 NLQAIALHELGHALG 117 (156)
T ss_pred HHHHHHHHHhhhhhc
Confidence 456789999999983
No 35
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine residues and one glutamate residue.
Probab=65.42 E-value=6.3 Score=43.07 Aligned_cols=45 Identities=22% Similarity=0.083 Sum_probs=30.5
Q ss_pred hhHHHHHHHHHHHHHHhhhcCCCCccCCC--cccchhhhhhHHHHHHhhhc
Q 037955 501 HEVETLFHEFGHALHSLLSRTDYQHFSGT--RVALDFAETPSNLFEYYAWD 549 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~lls~~~~~~~sgt--~~~~D~~E~pS~~~E~~~~~ 549 (733)
.-+.||+|||=|.+|..-.... .|. ...+=+=|.-|+++|.++..
T Consensus 138 ~~~sTlAHEfQHmInfy~~~v~----~g~~~~~dtWLnE~lS~~aEdl~s~ 184 (366)
T PF10460_consen 138 TVYSTLAHEFQHMINFYQRGVL----HGKQYAMDTWLNEMLSMSAEDLYSS 184 (366)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh----cCCCcccccHHHHHHHHHHHHHHhc
Confidence 4689999999999997543221 232 23334458899999987743
No 36
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/active site.
Probab=61.38 E-value=4.1 Score=38.07 Aligned_cols=11 Identities=55% Similarity=0.797 Sum_probs=10.1
Q ss_pred HHHHHHHHHHH
Q 037955 504 ETLFHEFGHAL 514 (733)
Q Consensus 504 ~TLfHEfGHal 514 (733)
.|+.||+||||
T Consensus 88 ~~~~HEigHaL 98 (140)
T smart00235 88 GVAAHELGHAL 98 (140)
T ss_pred ccHHHHHHHHh
Confidence 48999999998
No 37
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=60.05 E-value=11 Score=37.08 Aligned_cols=18 Identities=28% Similarity=0.506 Sum_probs=12.9
Q ss_pred CcChhhHHHHHHHHHHHH
Q 037955 497 RLNHHEVETLFHEFGHAL 514 (733)
Q Consensus 497 lL~~~~v~TLfHEfGHal 514 (733)
++...-..+..||+||.+
T Consensus 120 l~~~R~~k~~~HElGH~l 137 (179)
T PRK13267 120 LFEERVRKEVTHELGHTL 137 (179)
T ss_pred HHHHHHHHHHHHHHHHHc
Confidence 344455566899999996
No 38
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=58.80 E-value=4.8 Score=39.92 Aligned_cols=15 Identities=40% Similarity=0.567 Sum_probs=13.4
Q ss_pred hhHHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHALH 515 (733)
Q Consensus 501 ~~v~TLfHEfGHalH 515 (733)
..+.|++||+||.|=
T Consensus 141 ~~~~~~AHEiGH~lG 155 (196)
T PF13688_consen 141 NGAITFAHEIGHNLG 155 (196)
T ss_dssp HHHHHHHHHHHHHTT
T ss_pred ceehhhHHhHHHhcC
Confidence 799999999999863
No 39
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=57.91 E-value=6.7 Score=38.35 Aligned_cols=13 Identities=46% Similarity=0.902 Sum_probs=12.1
Q ss_pred hHHHHHHHHHHHH
Q 037955 502 EVETLFHEFGHAL 514 (733)
Q Consensus 502 ~v~TLfHEfGHal 514 (733)
++.+++||+||.+
T Consensus 111 ~~~~~aHElGH~l 123 (173)
T PF13574_consen 111 GIDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHHH
T ss_pred eeeeehhhhHhhc
Confidence 8899999999987
No 40
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=57.68 E-value=5.1 Score=36.46 Aligned_cols=13 Identities=54% Similarity=0.882 Sum_probs=11.2
Q ss_pred hHHHHHHHHHHHH
Q 037955 502 EVETLFHEFGHAL 514 (733)
Q Consensus 502 ~v~TLfHEfGHal 514 (733)
...|+.||+||.|
T Consensus 107 ~~~~~~HEiGH~l 119 (124)
T PF13582_consen 107 GVDTFAHEIGHNL 119 (124)
T ss_dssp STTHHHHHHHHHT
T ss_pred cceEeeehhhHhc
Confidence 4589999999986
No 41
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=55.42 E-value=15 Score=32.86 Aligned_cols=36 Identities=19% Similarity=0.139 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHHHhhhcCCCCccCCCcccchhhhhhHHHHH
Q 037955 503 VETLFHEFGHALHSLLSRTDYQHFSGTRVALDFAETPSNLFE 544 (733)
Q Consensus 503 v~TLfHEfGHalH~lls~~~~~~~sgt~~~~D~~E~pS~~~E 544 (733)
..+|.||++|.+|.-.... ....|.=+.|.-+..+|
T Consensus 26 ~~~l~HE~~H~~~~~~~~~------~~~~~~W~~EG~A~y~~ 61 (128)
T PF13485_consen 26 DRVLAHELAHQWFGNYFGG------DDNAPRWFNEGLAEYVE 61 (128)
T ss_pred HHHHHHHHHHHHHHHHcCC------CccCchHHHHHHHHHHh
Confidence 4999999999999877542 12344445666666655
No 42
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=54.51 E-value=6.5 Score=37.71 Aligned_cols=24 Identities=13% Similarity=0.040 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHhhCceeEeecC
Q 037955 398 GQCIEGLKMLAESLFGVTFHSVPL 421 (733)
Q Consensus 398 ~~vl~gl~~l~~~lfgi~~~~~~~ 421 (733)
..++..++.+.+...+|+|+++..
T Consensus 24 ~~~v~~a~~~w~~~~~i~f~~~~~ 47 (167)
T cd00203 24 QSLILIAMQIWRDYLNIRFVLVGV 47 (167)
T ss_pred HHHHHHHHHHHHhhhCceEEEecc
Confidence 344566667777778999998753
No 43
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases; however, there is no evidence to support their function as metallopeptidases.
Probab=52.89 E-value=10 Score=40.31 Aligned_cols=19 Identities=37% Similarity=0.681 Sum_probs=16.3
Q ss_pred hhHHHHHHHHHHHHHHhhh
Q 037955 501 HEVETLFHEFGHALHSLLS 519 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~lls 519 (733)
...-+|+||+||.++.++.
T Consensus 169 a~ayVlAHEyGHHVQ~l~G 187 (292)
T PF04228_consen 169 AQAYVLAHEYGHHVQNLLG 187 (292)
T ss_pred HHHHHHHHHHHHHHHHHhh
Confidence 4666899999999999885
No 44
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system.; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=51.49 E-value=7.7 Score=37.21 Aligned_cols=15 Identities=47% Similarity=0.463 Sum_probs=12.2
Q ss_pred hhHHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHALH 515 (733)
Q Consensus 501 ~~v~TLfHEfGHalH 515 (733)
..-.||.||+||-|-
T Consensus 68 ~~g~TltHEvGH~LG 82 (154)
T PF05572_consen 68 NFGKTLTHEVGHWLG 82 (154)
T ss_dssp -SSHHHHHHHHHHTT
T ss_pred ccccchhhhhhhhhc
Confidence 567999999999864
No 45
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16 kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases; it is capable of hydrolysing milk proteins. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=47.07 E-value=13 Score=34.11 Aligned_cols=16 Identities=31% Similarity=0.449 Sum_probs=12.9
Q ss_pred hhhHHHHHHHHHHHHH
Q 037955 500 HHEVETLFHEFGHALH 515 (733)
Q Consensus 500 ~~~v~TLfHEfGHalH 515 (733)
++.+....||+||.|=
T Consensus 75 y~~~RIaaHE~GHiLG 90 (132)
T PF02031_consen 75 YNSTRIAAHELGHILG 90 (132)
T ss_dssp S-HHHHHHHHHHHHHT
T ss_pred CccceeeeehhccccC
Confidence 4678899999999974
No 46
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino-terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from Drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=45.92 E-value=10 Score=37.29 Aligned_cols=14 Identities=43% Similarity=0.612 Sum_probs=11.9
Q ss_pred hHHHHHHHHHHHHH
Q 037955 502 EVETLFHEFGHALH 515 (733)
Q Consensus 502 ~v~TLfHEfGHalH 515 (733)
.+.|+.|||||||=
T Consensus 74 ~~g~v~HE~~HalG 87 (180)
T cd04280 74 SLGTIVHELMHALG 87 (180)
T ss_pred cCchhHHHHHHHhc
Confidence 46799999999984
No 47
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH. The astacin family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains.
The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93). The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=42.49 E-value=15 Score=36.45 Aligned_cols=14 Identities=57% Similarity=0.935 Sum_probs=12.3
Q ss_pred hHHHHHHHHHHHHH
Q 037955 502 EVETLFHEFGHALH 515 (733)
Q Consensus 502 ~v~TLfHEfGHalH 515 (733)
.+.|+.|||||||=
T Consensus 79 ~~~~i~HEl~HaLG 92 (191)
T PF01400_consen 79 SVGTILHELGHALG 92 (191)
T ss_dssp SHHHHHHHHHHHHT
T ss_pred CccchHHHHHHHHh
Confidence 67899999999984
No 48
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=41.31 E-value=16 Score=37.21 Aligned_cols=15 Identities=40% Similarity=0.434 Sum_probs=13.3
Q ss_pred hhhHHHHHHHHHHHH
Q 037955 500 HHEVETLFHEFGHAL 514 (733)
Q Consensus 500 ~~~v~TLfHEfGHal 514 (733)
+..+.|++||+||.|
T Consensus 143 ~~~~~~~AHElGH~l 157 (220)
T cd04272 143 YYGVYTMTHELAHLL 157 (220)
T ss_pred cccHHHHHHHHHHHh
Confidence 357999999999998
No 49
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionarily related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=41.08 E-value=15 Score=36.35 Aligned_cols=16 Identities=31% Similarity=0.206 Sum_probs=12.6
Q ss_pred hHHHHHHHHHHHHHHh
Q 037955 502 EVETLFHEFGHALHSL 517 (733)
Q Consensus 502 ~v~TLfHEfGHalH~l 517 (733)
.+-|..|||||||=..
T Consensus 77 ~~G~i~HEl~HaLG~~ 92 (182)
T cd04283 77 YKGIIQHELLHALGFY 92 (182)
T ss_pred ccchHHHHHHHHhCCc
Confidence 3568999999998533
No 50
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=41.03 E-value=17 Score=35.83 Aligned_cols=24 Identities=33% Similarity=0.599 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHhhhcCCCCcc
Q 037955 503 VETLFHEFGHALHSLLSRTDYQHF 526 (733)
Q Consensus 503 v~TLfHEfGHalH~lls~~~~~~~ 526 (733)
+..++||+||++=......+...+
T Consensus 8 i~i~~HE~gH~~~a~~~G~~~~~~ 31 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDKVPRF 31 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--B--E
T ss_pred cccccccccccccccccccccccc
Confidence 567899999999887766554434
No 51
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH.
Probab=40.79 E-value=13 Score=37.43 Aligned_cols=14 Identities=29% Similarity=0.503 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHHh
Q 037955 504 ETLFHEFGHALHSL 517 (733)
Q Consensus 504 ~TLfHEfGHalH~l 517 (733)
.++.||+||++=.-
T Consensus 135 hvi~HEiGH~IGfR 148 (211)
T PF12388_consen 135 HVITHEIGHCIGFR 148 (211)
T ss_pred HHHHHHhhhhcccc
Confidence 48999999999533
No 52
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=40.45 E-value=21 Score=41.60 Aligned_cols=65 Identities=22% Similarity=0.353 Sum_probs=35.3
Q ss_pred CCCcccceeeecccCCCCccCccc--ccccCceecCCCCccccEEEEEccCCCCCCCCCCcChhhHHHHHHHHHHHH
Q 037955 440 EEGEMGYLYLDLYSRAGKYTGCAN--FAIKGGRRLSETEYQLPVVALICNFPGSHNLSVRLNHHEVETLFHEFGHAL 514 (733)
Q Consensus 440 ~~~~lg~~ylDl~~R~gK~~ga~~--~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~~~~v~TLfHEfGHal 514 (733)
+++.||--|+-- +|.|-.+|-+. .+..-|+..+-. .= +.-+-|+...... -..=.||+||+||.+
T Consensus 338 e~GtLGLAwVgs-p~~~saGGIC~k~~~y~~G~~~sLN---tG-i~T~~NYg~~Vp~-----kvs~lt~AHEiGHNf 404 (764)
T KOG3658|consen 338 EGGTLGLAWVGS-PRSNSAGGICEKAYTYNEGKKRSLN---TG-ISTSVNYGKRVPT-----KVSDLTLAHEIGHNF 404 (764)
T ss_pred ccceeeeEEecc-CccCCCCcccccceeccCcceEEee---cc-eeeeeecCCccCc-----chhheeehhhhcccc
Confidence 577899999954 67775555431 122212211100 11 1124567665432 244568999999986
No 53
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=40.17 E-value=15 Score=38.02 Aligned_cols=13 Identities=31% Similarity=0.432 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHHH
Q 037955 502 EVETLFHEFGHAL 514 (733)
Q Consensus 502 ~v~TLfHEfGHal 514 (733)
-..|++||+||.+
T Consensus 167 ~a~t~AHElGHnl 179 (244)
T cd04270 167 SDLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHHhc
Confidence 3489999999987
No 54
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=38.91 E-value=9.4 Score=39.16 Aligned_cols=14 Identities=36% Similarity=0.700 Sum_probs=12.2
Q ss_pred hhHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHAL 514 (733)
Q Consensus 501 ~~v~TLfHEfGHal 514 (733)
..+.|++||+||.|
T Consensus 144 ~~~~t~AHElGHnL 157 (228)
T cd04271 144 NEWQVFAHEIGHTF 157 (228)
T ss_pred ccceehhhhhhhhc
Confidence 45689999999998
No 55
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=37.62 E-value=35 Score=38.03 Aligned_cols=55 Identities=16% Similarity=0.192 Sum_probs=31.3
Q ss_pred cCCCCccCcccccccCceecCCCCccccEEEEEccCCCCCCCCCCcC-hhhHHHHHHHHHHHHHHhhhc
Q 037955 453 SRAGKYTGCANFAIKGGRRLSETEYQLPVVALICNFPGSHNLSVRLN-HHEVETLFHEFGHALHSLLSR 520 (733)
Q Consensus 453 ~R~gK~~ga~~~~i~~~~~~~~g~~~~P~v~l~~Nf~~~~~~~~lL~-~~~v~TLfHEfGHalH~lls~ 520 (733)
+++++.+.||..++-+.. .+++.-++.... ..++ ..=+..++||+||--|.-..+
T Consensus 243 s~rs~hsNAyfyG~~~~K----------RIvIyDtLl~~~---~~~~~eel~AVl~HELGHW~~~H~~K 298 (428)
T KOG2719|consen 243 SKRSSHSNAYFYGLCKNK----------RIVIYDTLLLEE---EHLNNEELVAVLAHELGHWKLNHVLK 298 (428)
T ss_pred CCCCCCCCeeeeeccccc----------eEEEehhhhhhh---hccccHHHHHHHHHHhhHHHHhhHHH
Confidence 356667778876664332 244555555200 0011 134567899999998865543
No 56
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=37.37 E-value=22 Score=36.07 Aligned_cols=18 Identities=33% Similarity=0.438 Sum_probs=15.3
Q ss_pred hhHHHHHHHHHHHHHHhh
Q 037955 501 HEVETLFHEFGHALHSLL 518 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~ll 518 (733)
..+.+-+||.|||+++--
T Consensus 88 aAvaVAAHEvGHAiQ~a~ 105 (222)
T PF04298_consen 88 AAVAVAAHEVGHAIQHAE 105 (222)
T ss_pred HHHHHHHHHHhHHHhccc
Confidence 678888999999998753
No 57
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=36.96 E-value=22 Score=35.16 Aligned_cols=21 Identities=29% Similarity=0.392 Sum_probs=16.1
Q ss_pred hHHHHHHHHHHHHHHhhhcCC
Q 037955 502 EVETLFHEFGHALHSLLSRTD 522 (733)
Q Consensus 502 ~v~TLfHEfGHalH~lls~~~ 522 (733)
-+..+.||+||.+-......+
T Consensus 9 ~~~v~iHElGH~~~Ar~~Gv~ 29 (182)
T cd06163 9 GILIFVHELGHFLVAKLFGVK 29 (182)
T ss_pred HHHHHHHHHHHHHHHHHcCCe
Confidence 356789999999987766544
No 58
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=36.51 E-value=22 Score=34.73 Aligned_cols=20 Identities=30% Similarity=0.414 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHhhhcCC
Q 037955 503 VETLFHEFGHALHSLLSRTD 522 (733)
Q Consensus 503 v~TLfHEfGHalH~lls~~~ 522 (733)
+..+.||+||++=......+
T Consensus 9 i~i~iHE~gH~~~A~~~G~~ 28 (180)
T cd05709 9 ISVTVHELGHALVARRLGVK 28 (180)
T ss_pred HHHHHHHHHHHHHHHHcCCC
Confidence 56789999999976665443
No 59
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=35.61 E-value=15 Score=36.21 Aligned_cols=14 Identities=43% Similarity=0.624 Sum_probs=12.5
Q ss_pred hhHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHAL 514 (733)
Q Consensus 501 ~~v~TLfHEfGHal 514 (733)
....|++||+||.|
T Consensus 132 ~~~~~~aHElGH~l 145 (192)
T cd04267 132 LTALTMAHELGHNL 145 (192)
T ss_pred eehhhhhhhHHhhc
Confidence 57789999999997
No 60
>PF07023 DUF1315: Protein of unknown function (DUF1315); InterPro: IPR009749 This family consists of several bacterial proteins of around 90 residues in length. The function of this family is unknown.
Probab=35.44 E-value=1e+02 Score=26.88 Aligned_cols=47 Identities=17% Similarity=0.273 Sum_probs=30.7
Q ss_pred hCCHHHHHHHHHHh-----hcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHH
Q 037955 139 NTNHTLYDAVKKAE-----LDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLD 186 (733)
Q Consensus 139 ~~~~~Ly~~l~~~~-----~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~ 186 (733)
++.+++|++|+... +++..|++||+...-..+--++.. .+++++.|-
T Consensus 7 ~mtPevY~rL~~AVElGKWpDG~~LT~eQre~~mQaVm~yq~~-~n~~~e~~t 58 (93)
T PF07023_consen 7 SMTPEVYERLKQAVELGKWPDGRALTPEQRESCMQAVMLYQAR-HNLPEEHRT 58 (93)
T ss_pred hCCHHHHHHHHHHHHhCcCCCCCCCCHHHHHHHHHHHHHHHHH-cCCCHHHcc
Confidence 46789999999872 466789999887555444444432 235665543
No 61
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The Drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=35.09 E-value=20 Score=35.93 Aligned_cols=16 Identities=25% Similarity=0.501 Sum_probs=12.5
Q ss_pred hHHHHHHHHHHHHHHh
Q 037955 502 EVETLFHEFGHALHSL 517 (733)
Q Consensus 502 ~v~TLfHEfGHalH~l 517 (733)
.+-|+.||+||||=..
T Consensus 87 ~~Gti~HEl~HaLGf~ 102 (200)
T cd04281 87 KFGIVVHELGHVIGFW 102 (200)
T ss_pred cCchHHHHHHHHhcCc
Confidence 3568999999998543
No 62
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=34.39 E-value=19 Score=37.77 Aligned_cols=19 Identities=32% Similarity=0.569 Sum_probs=13.1
Q ss_pred hhHHHHHHHHHHHHHHhhh
Q 037955 501 HEVETLFHEFGHALHSLLS 519 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~lls 519 (733)
.-..++.|||||++..|--
T Consensus 215 ~~~~v~vHE~GHsf~~LaD 233 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLAD 233 (264)
T ss_dssp THHHHHHHHHHHHTT----
T ss_pred cccceeeeecccccccccc
Confidence 3578999999999987653
No 63
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=33.90 E-value=26 Score=35.39 Aligned_cols=19 Identities=37% Similarity=0.466 Sum_probs=14.2
Q ss_pred hHHHHHHHHHHHHHHhhhc
Q 037955 502 EVETLFHEFGHALHSLLSR 520 (733)
Q Consensus 502 ~v~TLfHEfGHalH~lls~ 520 (733)
=+..++||+||++=...-.
T Consensus 38 ~~~v~iHElgH~~~A~~~G 56 (208)
T cd06161 38 FLSVLLHELGHALVARRYG 56 (208)
T ss_pred HHHHHHHHHHHHHHHHHcC
Confidence 3567899999998755543
No 64
>COG4086 Predicted secreted protein [Function unknown]
Probab=33.46 E-value=5.1e+02 Score=27.34 Aligned_cols=84 Identities=11% Similarity=0.143 Sum_probs=53.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhCCH----HHHHHHHHHhhcCCCCCHHHH-HHHHHHHHHHHHcCCCCCHHHHHHHHHHH
Q 037955 118 REFVEEASKASMRISEYLHYLNTNH----TLYDAVKKAELDGHLLSKEAH-RAANHLRIDFEKGGIHLCADKLDRVNQLN 192 (733)
Q Consensus 118 ~e~r~aa~~~~~~l~~~~~~l~~~~----~Ly~~l~~~~~~~~~L~~e~~-r~l~~~~~~f~~~G~~L~~~~r~~~~~l~ 192 (733)
++-...|++-....++..++++-+. .|+..+|.-....+..||++. ++++.... .-|++|++.+...+..+.
T Consensus 166 e~~KqvaneEL~~~sel~~k~G~d~~r~a~l~~~VK~~~a~~~~~~~~dirkvv~dv~~---~ynvnltd~qvn~i~~~~ 242 (299)
T COG4086 166 EANKQVANEELVATSELGDKIGDDPRRAAALMAEVKEEVAKQKVDDPADIRKVVDDVAN---NYNVNLTDTQVNQIVNLF 242 (299)
T ss_pred HHHHHHHHHHHHHHHHhhhhcCCCHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHH---HcCCCCCHHHHHHHHHHH
Confidence 4455667777777888888887665 455555543222333567654 34444432 348889999888888888
Q ss_pred HHHHHHHHHHHh
Q 037955 193 MDIFQLCREFNQ 204 (733)
Q Consensus 193 ~~i~~L~~~F~~ 204 (733)
..++.|-..|.+
T Consensus 243 ~~~~~~n~d~~k 254 (299)
T COG4086 243 LAMSNLNIDWTK 254 (299)
T ss_pred HHhhccCccHHH
Confidence 777776555555
No 65
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belongs to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamilies are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. 
The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degrade casein and cleave solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=33.34 E-value=30 Score=34.79 Aligned_cols=21 Identities=38% Similarity=0.648 Sum_probs=15.6
Q ss_pred cChhhHH-HHHHHHHHHHHHhh
Q 037955 498 LNHHEVE-TLFHEFGHALHSLL 518 (733)
Q Consensus 498 L~~~~v~-TLfHEfGHalH~ll 518 (733)
++.+++. .|+||+||..|.-.
T Consensus 84 ~~~~el~aVlaHElgH~~~~h~ 105 (226)
T PF01435_consen 84 LSEDELAAVLAHELGHIKHRHI 105 (226)
T ss_dssp SSHHHHHHHHHHHHHHHHTTHC
T ss_pred ccHHHHHHHHHHHHHHHHcCCc
Confidence 3446776 78999999987543
No 66
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=33.34 E-value=24 Score=35.35 Aligned_cols=13 Identities=46% Similarity=0.544 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHH
Q 037955 503 VETLFHEFGHALH 515 (733)
Q Consensus 503 v~TLfHEfGHalH 515 (733)
..++.||+||+|=
T Consensus 117 ~~~~~he~gh~lG 129 (197)
T cd04276 117 RYLLAHEVGHTLG 129 (197)
T ss_pred HHHHHHHHHHHhc
Confidence 3689999999973
No 67
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=33.18 E-value=23 Score=35.07 Aligned_cols=14 Identities=43% Similarity=0.555 Sum_probs=12.5
Q ss_pred hhHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHAL 514 (733)
Q Consensus 501 ~~v~TLfHEfGHal 514 (733)
.-+.+++||+||.+
T Consensus 130 ~~a~~~AHElGH~l 143 (194)
T cd04269 130 LFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHhhc
Confidence 56899999999998
No 68
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=32.20 E-value=33 Score=33.91 Aligned_cols=17 Identities=29% Similarity=0.518 Sum_probs=14.3
Q ss_pred hhHHHHHHHHHHHHHHh
Q 037955 501 HEVETLFHEFGHALHSL 517 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~l 517 (733)
.....-+||.|||+++-
T Consensus 91 a~~aVAAHEVGHAiQd~ 107 (226)
T COG2738 91 AAIAVAAHEVGHAIQDQ 107 (226)
T ss_pred HHHHHHHHHhhHHHhhh
Confidence 57778899999999864
No 69
>PF05960 DUF885: Bacterial protein of unknown function (DUF885); InterPro: IPR010281 This family consists of hypothetical bacterial proteins.; PDB: 3O0Y_B 3U24_A 3IUK_A.
Probab=31.83 E-value=1.3e+02 Score=34.85 Aligned_cols=165 Identities=19% Similarity=0.198 Sum_probs=84.4
Q ss_pred hhhHHHHHHHH--HHHHHHhhhcCCCCc-cCCCc-c-c--chhhhhhHHHHHHhhhcHHHHHHHHhcccCCCCChHHHHH
Q 037955 500 HHEVETLFHEF--GHALHSLLSRTDYQH-FSGTR-V-A--LDFAETPSNLFEYYAWDYRVLRRFAKHYLTGEIVPEKLVK 572 (733)
Q Consensus 500 ~~~v~TLfHEf--GHalH~lls~~~~~~-~sgt~-~-~--~D~~E~pS~~~E~~~~~~~~L~~~s~h~~tge~lp~~l~~ 572 (733)
+.-..|++||. ||-+|....+.. +. .+..+ . . .-|+|.=....|.++++.. .-| +.-|...+-
T Consensus 370 ~~~~~~~~HE~~PGHh~Q~~~~~~~-~~~~~~~r~~~~~~~~~~EGWAlY~E~l~~~e~------G~~---~~~p~~~lg 439 (549)
T PF05960_consen 370 YSLYTTAAHEAYPGHHLQIAYANEH-PDKLPPFRRLLGSYTAFVEGWALYAEQLMYDEL------GLY---DDDPLDRLG 439 (549)
T ss_dssp TCHHHHHHHHSTTTHHHHHHHHHHT-CCGS-HHHHHC---HHHHHHHHHHHHHHHHHHT------TTT---CCSHHHHHH
T ss_pred hhHHHHHHhhcCCcHHHHHHHHHHc-cccCcHHHHhccccCchhhHHHHHHHHHHHHhc------CCC---CCCHHHHHH
Confidence 46779999998 999998887653 21 11111 1 1 2555555555555443321 112 122443333
Q ss_pred HHHHhhhhhhHHHHHHHHHHHHhHHHhhcCCCCCCCCHHHHHHHHHhhccCCCCCC-CCcccccccccccccccchhHHH
Q 037955 573 SMQGARDMFAATELQRQIFYALVDQTLFGERLGQTRDTSSIVADMKRQHTSWNHVE-GTHWHIRFSHFINYGAGYYSYLY 651 (733)
Q Consensus 573 ~l~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~l~~~l~~~~~~~~~~~-~~~~~~~f~Hl~~Y~a~YYsYl~ 651 (733)
.+. ..++..+...+|.-+|.... +..+.. +...+..++.... ..-+ .-+...++.+=+|.+
T Consensus 440 ~l~---------~~l~ra~r~vvD~glH~~~w----t~e~a~-~~l~~~~~~~~~~a~~ev----~ry~~~Pgq~~sY~~ 501 (549)
T PF05960_consen 440 QLN---------DELWRAARLVVDTGLHYGGW----TREQAI-DYLVENTGFSEEEAESEV----DRYISSPGQALSYKV 501 (549)
T ss_dssp HHH---------HHHHHHHHHHHHHHHHCCB------HHHHH-HHHHHHS-S-HHHHHHHH----HHHHHSTTGGGHHHH
T ss_pred HHH---------HHHHHHHHHHHHHhhhcCCC----CHHHHH-HHHHHhcCCChHHHHHHH----HHHHhCcHHHHHHHH
Confidence 322 23445667889999998632 233332 2333333332110 0001 111224444459999
Q ss_pred HHHHHHHHHHHHHHc--CCCCHHHHHHHHHHHhcCCCCCCHHHHHHHh
Q 037955 652 AKCFAATIWQKLCQE--DPLSLTTGTTLRTKILQHGGAKEPADMLNDL 697 (733)
Q Consensus 652 s~v~A~di~~~~~~~--~~~~~~~G~~~r~~vL~~Ggs~~~~ell~~f 697 (733)
|+.-=.++-+...+. +.|+. +.|-+.||+.|.. |..++++.
T Consensus 502 G~l~i~~LR~~a~~~lG~~F~l---k~FHd~iL~~G~~--Pl~~l~~~ 544 (549)
T PF05960_consen 502 GYLEILELREEAEEELGDKFDL---KEFHDAILSNGPL--PLDVLEEE 544 (549)
T ss_dssp HHHHHHHHHHHHHHHHGGG--H---HHHHHHHHCT-S----HHHHHHH
T ss_pred HHHHHHHHHHHHHHHhCCCCCH---HHHHHHHHhCCCC--CHHHHHHH
Confidence 998888887777643 34663 7888899998843 56666653
No 70
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=31.43 E-value=28 Score=29.34 Aligned_cols=16 Identities=25% Similarity=0.405 Sum_probs=14.3
Q ss_pred hhHHHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHALHS 516 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~ 516 (733)
.+...|.||+.|.++.
T Consensus 60 ~~~~llaHEl~Hv~Qq 75 (79)
T PF13699_consen 60 EGRALLAHELAHVVQQ 75 (79)
T ss_pred CcchhHhHHHHHHHhh
Confidence 5889999999999875
No 71
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=30.53 E-value=11 Score=40.19 Aligned_cols=14 Identities=43% Similarity=0.686 Sum_probs=11.9
Q ss_pred hHHHHHHHHHHHHH
Q 037955 502 EVETLFHEFGHALH 515 (733)
Q Consensus 502 ~v~TLfHEfGHalH 515 (733)
.+-|+.|||||.|-
T Consensus 165 ~igv~~HE~gH~lG 178 (286)
T TIGR03296 165 GVGVIAHELGHDLG 178 (286)
T ss_pred ceeeeehhhhcccC
Confidence 58999999999763
No 72
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=28.04 E-value=36 Score=35.69 Aligned_cols=18 Identities=22% Similarity=0.425 Sum_probs=13.2
Q ss_pred hHHHHHHHHHHHHHHhhh
Q 037955 502 EVETLFHEFGHALHSLLS 519 (733)
Q Consensus 502 ~v~TLfHEfGHalH~lls 519 (733)
-+..+.||+||++=...-
T Consensus 118 ~isv~iHElgHa~~Ar~~ 135 (263)
T cd06159 118 VVGVVVHELSHGILARVE 135 (263)
T ss_pred HHHHHHHHHHHHHHHHHc
Confidence 345789999999865543
No 73
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=27.76 E-value=31 Score=35.40 Aligned_cols=17 Identities=41% Similarity=0.286 Sum_probs=13.0
Q ss_pred hHHHHHHHHHHHHHHhh
Q 037955 502 EVETLFHEFGHALHSLL 518 (733)
Q Consensus 502 ~v~TLfHEfGHalH~ll 518 (733)
.+-|..||+||||=..-
T Consensus 120 ~~Gti~HEl~HalGf~H 136 (230)
T cd04282 120 YKATVEHEFLHALGFYH 136 (230)
T ss_pred CCchHHHHHHHHhCCcc
Confidence 35688999999985443
No 74
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=25.69 E-value=43 Score=34.32 Aligned_cols=17 Identities=35% Similarity=0.557 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHhhh
Q 037955 503 VETLFHEFGHALHSLLS 519 (733)
Q Consensus 503 v~TLfHEfGHalH~lls 519 (733)
+..++||+||++=...-
T Consensus 54 ~~v~iHElgH~~~A~~~ 70 (227)
T cd06164 54 ASVLLHELGHSLVARRY 70 (227)
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 56789999999865543
No 75
>cd07633 BAR_OPHN1 The Bin/Amphiphysin/Rvs (BAR) domain of Oligophrenin-1. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Oligophrenin-1 (OPHN1) is a GTPase activating protein (GAP) with activity towards RhoA, Rac, and Cdc42, that is expressed in developing spinal cord and in adult brain areas with high plasticity. It plays a role in regulating the actin cytoskeleton as well as morphology changes in axons and dendrites, and may also function in modulating neuronal connectivity. Mutations in the OPHN1 gene cause X-linked mental retardation associated with cerebellar hypoplasia, lateral ventricle enlargement and epilepsy. OPHN1 contains an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, and a Rho GAP domain. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=24.22 E-value=7.5e+02 Score=24.96 Aligned_cols=120 Identities=13% Similarity=0.212 Sum_probs=65.8
Q ss_pred CChHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHhhHHHhhhhchhhhhhccCCchHHHHHHHHHHHHHHHHHHHhh-
Q 037955 61 NGFQRFVDDAIERSSELVNYISEMPSSVEIIRAMDEISDAVCSVVDSAELCRQTHPDREFVEEASKASMRISEYLHYLN- 139 (733)
Q Consensus 61 ~~~~~~~~~~i~~~~~~i~~i~~~~~~~~~v~~ld~~~~~l~~~~~~~~~~~~~~~d~e~r~aa~~~~~~l~~~~~~l~- 139 (733)
+.....++.+|..|..+|++...- ......+. +.|+... .+++-...+|+|...+ +|..+++....++.
T Consensus 12 e~~~~~IkkliK~~~~li~a~K~~---s~A~r~Fa---~~L~df~--f~~igd~~tdde~~I~--~sL~~F~~~L~~ie~ 81 (207)
T cd07633 12 ERTNKFIKDVIKDGNALISAIKEY---SSAVQKFS---QTLQSFQ--FDFIGDTLTDDEINIA--ESFKEFAELLQEVEE 81 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH---HHHHHHH--hhcCCCcccchHHHHH--HHHHHHHHHHHHHHH
Confidence 556778888888898888875321 11111111 1111111 1344455567776543 55555555555432
Q ss_pred CCHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 037955 140 TNHTLYDAVKKAELDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQNII 207 (733)
Q Consensus 140 ~~~~Ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~ni~ 207 (733)
.+..|++ ..++.+..-+..|++.+|.--.+.|.+|..... .++..-.+|+.
T Consensus 82 ~r~~l~d--------------~aq~s~~~~L~~F~Kedi~~~Ke~KK~FdK~se---~~~~aL~k~a~ 132 (207)
T cd07633 82 ERMMMVQ--------------NASDLLIKPLENFRKEQIGFTKERKKKFEKDSE---KFYSLLDRHVN 132 (207)
T ss_pred HHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh---HHHHHHHHHhc
Confidence 2333333 235556666777888777665667777776666 45555555554
No 76
>KOG2857 consensus Predicted MYND Zn-finger protein/hormone receptor interactor [Transcription]
Probab=24.09 E-value=5.2e+02 Score=24.32 Aligned_cols=43 Identities=21% Similarity=0.269 Sum_probs=28.8
Q ss_pred ChHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHhhHHHhhhh
Q 037955 62 GFQRFVDDAIERSSELVNYISEMPSSVEIIRAMDEISDAVCSVV 105 (733)
Q Consensus 62 ~~~~~~~~~i~~~~~~i~~i~~~~~~~~~v~~ld~~~~~l~~~~ 105 (733)
-+.....+++..+. .+..+...|...+.|..+|+++|.+..+.
T Consensus 75 rV~~~~~kal~sss-eLrsll~dp~Lr~Li~sid~~~n~~kel~ 117 (157)
T KOG2857|consen 75 RVPSLALKALPSSS-ELRSLLEDPHLRNLIESIDELLNIMKELV 117 (157)
T ss_pred cchHHHHHhccchH-HHHHHHhCHHHHHHHHHHHHHhhHHHHHH
Confidence 34444455555453 44555677888999999999888877544
No 77
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=23.74 E-value=49 Score=34.94 Aligned_cols=17 Identities=24% Similarity=0.536 Sum_probs=12.8
Q ss_pred hHHHHHHHHHHHHHHhh
Q 037955 502 EVETLFHEFGHALHSLL 518 (733)
Q Consensus 502 ~v~TLfHEfGHalH~ll 518 (733)
=+..+.||+||++=...
T Consensus 135 ~isvvvHElgHal~A~~ 151 (277)
T cd06162 135 LISGVVHEMGHGVAAVR 151 (277)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34678999999986544
No 78
>KOG3133 consensus 40 kDa farnesylated protein associated with peroxisomes [Intracellular trafficking, secretion, and vesicular transport]
Probab=23.70 E-value=3.1e+02 Score=28.55 Aligned_cols=64 Identities=14% Similarity=0.190 Sum_probs=48.4
Q ss_pred HHHHHHHHHHhhCCHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhh
Q 037955 128 SMRISEYLHYLNTNHTLYDAVKKAELDGHLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQLCREFNQN 205 (733)
Q Consensus 128 ~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~L~~~F~~n 205 (733)
.+.+...+..+.+-+-||.-+|.+..++ =+ =++.||..|+++..+||.+...=|..++..|..-
T Consensus 145 e~~m~~iMqqllSKEILyeplKEl~~~Y----------Pk----wLeen~e~l~~E~~erYqkQ~~~i~~i~~~~e~e 208 (267)
T KOG3133|consen 145 EPIMESIMQQLLSKEILYEPLKELGANY----------PK----WLEENGESLSKEDKERYQKQFELIKEIESVYENE 208 (267)
T ss_pred HHHHHHHHHHHHHHHHhhhhHHHHHHHh----------hH----HHHhcccccCHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 4556666666777777888777773222 22 2677999999999999999999999999999774
No 79
>PF08463 EcoEI_R_C: EcoEI R protein C-terminal; InterPro: IPR013670 There are four classes of restriction endonucleases: types I, II, III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but do not hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exist as part of a complex with a modification methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type I restriction endonucleases are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. Type I enzymes have three different subunits - M (modification), S (specificity) and R (restriction) - that form multifunctional enzymes with restriction (3.1.21.3 from EC), methylase (2.1.1.72 from EC) and ATPase activities [, ]. The S subunit is required for both restriction and modification and is responsible for recognition of the DNA sequence specific for the system. The M subunit is necessary for modification, and the R subunit is required for restriction. These enzymes use S-Adenosyl-L-methionine (AdoMet) as the methyl group donor in the methylation reaction, and have a requirement for ATP.
They recognise asymmetric DNA sequences split into two domains of specific sequence, one 3-4 bp long and another 4-5 bp long, separated by a nonspecific spacer 6-8 bp in length. Cleavage occurs a considerable distance from the recognition sites, rarely less than 400 bp away and up to 7000 bp away. Adenosyl residues are methylated, one on each strand of the recognition sequence. These enzymes are widespread in eubacteria and archaea. In enteric bacteria they have been subdivided into four families: types IA, IB, IC and ID. Type III restriction endonucleases (3.1.21.5 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. Type III enzymes are hetero-oligomeric, multifunctional proteins composed of two subunits, Res and Mod. The Mod subunit recognises the DNA sequence specific for the system and is a modification methyltransferase; as such it is functionally equivalent to the M and S subunits of type I restriction endonuclease. Res is required for restriction, although it has no enzymatic activity on its own. Type III enzymes recognise short 5-6 bp long asymmetric DNA sequences and cleave 25-27 bp downstream to leave short, single-stranded 5' protrusions. They require the presence of two inversely oriented unmethylated recognition sites for restriction to occur. These enzymes methylate only one strand of the DNA, at the N-6 position of adenosyl residues, so newly replicated DNA will have only one strand methylated, which is sufficient to protect against restriction. Type III enzymes belong to the beta-subfamily of N6 adenine methyltransferases, containing the nine motifs that characterise this family, including motif I, the AdoMet binding pocket (FXGXG), and motif IV, the catalytic region (S/D/N (PP) Y/F) [, ]. This entry represents the C-terminal domain found in both the R subunit of type I enzymes and the Res subunit of type III enzymes.
The type I enzyme represented is EcoEI, which recognises 5'-GAGN(7)ATGC-3'; the R protein (HsdR) is required for both nuclease and ATPase activity [, ].; GO: 0003677 DNA binding, 0003824 catalytic activity, 0006304 DNA modification
Probab=22.45 E-value=3.2e+02 Score=26.02 Aligned_cols=22 Identities=14% Similarity=0.218 Sum_probs=15.1
Q ss_pred CCHHHHHHHHHHHHHHHHcCCC
Q 037955 158 LSKEAHRAANHLRIDFEKGGIH 179 (733)
Q Consensus 158 L~~e~~r~l~~~~~~f~~~G~~ 179 (733)
++++++.+|+.++.-+..+|+-
T Consensus 102 ~~~~Q~~~L~~i~~~~~~~G~~ 123 (164)
T PF08463_consen 102 FNAEQREFLERILDYYAQNGII 123 (164)
T ss_pred CCHHHHHHHHHHHHHHHHhCcc
Confidence 6667777777777777776653
No 80
>PRK11546 zraP zinc resistance protein; Provisional
Probab=22.24 E-value=1.8e+02 Score=27.53 Aligned_cols=38 Identities=13% Similarity=0.087 Sum_probs=24.5
Q ss_pred CCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHH
Q 037955 156 HLLSKEAHRAANHLRIDFEKGGIHLCADKLDRVNQLNMDIFQ 197 (733)
Q Consensus 156 ~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~ 197 (733)
.+|++|++..++.++.+|... ..+-|+++...+.|+..
T Consensus 42 ~~LT~EQQa~~q~I~~~f~~~----t~~LRqqL~aKr~ELnA 79 (143)
T PRK11546 42 APLTTEQQAAWQKIHNDFYAQ----TSALRQQLVSKRYEYNA 79 (143)
T ss_pred ccCCHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHH
Confidence 368999999998888888764 33444444444444433
No 81
>PF05505 Ebola_NP: Ebola nucleoprotein; InterPro: IPR008609 This family consists of Ebola virus sp., Lake Victoria marburgvirus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus [].; GO: 0019074 viral RNA genome packaging, 0019013 viral nucleocapsid
Probab=22.16 E-value=3e+02 Score=31.40 Aligned_cols=65 Identities=17% Similarity=0.230 Sum_probs=42.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHcCCCCCH------HHHHHHHHH
Q 037955 118 REFVEEASKASMRISEYLHYLNTNHTLYDAVKKAELDGHLLSKEAHRAANHLRIDFEKGGIHLCA------DKLDRVNQL 191 (733)
Q Consensus 118 ~e~r~aa~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~------~~r~~~~~l 191 (733)
..+|+||++|.+.|..|.... ++ +.-.||+++++.|..+... ++.|.... -.|+|+.+|
T Consensus 323 QQLREAAteAEkqLQqyaEsr-----------El--d~lglDdqEkKIL~~FHqk--KnEIsfqQT~AmvtlrkErLakL 387 (717)
T PF05505_consen 323 QQLREAATEAEKQLQQYAESR-----------EL--DHLGLDDQEKKILMNFHQK--KNEISFQQTNAMVTLRKERLAKL 387 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHhh-----------hh--hhhccchHHHHHHHHHHhh--cccchHhHhHHHHHHHHHHHHHH
Confidence 348999999999999987521 11 2235889999888887653 45554432 146667666
Q ss_pred HHHHHH
Q 037955 192 NMDIFQ 197 (733)
Q Consensus 192 ~~~i~~ 197 (733)
...|..
T Consensus 388 teaIt~ 393 (717)
T PF05505_consen 388 TEAITS 393 (717)
T ss_pred HHHHHH
Confidence 666653
No 82
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=22.07 E-value=1.3e+03 Score=26.99 Aligned_cols=71 Identities=14% Similarity=0.164 Sum_probs=47.3
Q ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHcCCCCCH-HHHHHHHHHHHHHHHHHHHH
Q 037955 125 SKASMRISEYLHYLNTNHTLYDAVKKAELDGHLLSKEAHRAANHLRIDFEKGGIHLCA-DKLDRVNQLNMDIFQLCREF 202 (733)
Q Consensus 125 ~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~-~~r~~~~~l~~~i~~L~~~F 202 (733)
.++...+......+..=++||..++.. + |++-.-|+.-++.|+..|.+|+. +--+++..+..++.++...-
T Consensus 197 ~~l~~~~~~l~~~~e~IP~l~~~l~~~------~-P~ql~eL~~gy~~m~~~gy~l~~~~i~~~i~~i~~~l~~~~~~L 268 (560)
T PF06160_consen 197 EKLKEETDELEEIMEDIPKLYKELQKE------F-PDQLEELKEGYREMEEEGYYLEHLDIEEEIEQIEEQLEEALALL 268 (560)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHH------h-HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444555666655544 1 56677778889999999999987 56778888888877765544
No 83
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. 
In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belongs to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft [].
This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=21.59 E-value=61 Score=30.93 Aligned_cols=15 Identities=20% Similarity=0.501 Sum_probs=11.9
Q ss_pred hhHHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHALH 515 (733)
Q Consensus 501 ~~v~TLfHEfGHalH 515 (733)
..+-.+.|||+|++=
T Consensus 134 ~~lDVvaHEltHGVt 148 (150)
T PF01447_consen 134 SSLDVVAHELTHGVT 148 (150)
T ss_dssp G-HHHHHHHHHHHHH
T ss_pred cccceeeeccccccc
Confidence 358889999999973
No 84
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=21.49 E-value=59 Score=32.12 Aligned_cols=20 Identities=25% Similarity=0.227 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHhhhcCC
Q 037955 503 VETLFHEFGHALHSLLSRTD 522 (733)
Q Consensus 503 v~TLfHEfGHalH~lls~~~ 522 (733)
+..+.||+||++=......+
T Consensus 42 ~~l~iHElgH~~~A~~~G~~ 61 (183)
T cd06160 42 AILGIHEMGHYLAARRHGVK 61 (183)
T ss_pred HHHHHHHHHHHHHHHHCCCC
Confidence 45569999999876654433
No 85
>PF01742 Peptidase_M27: Clostridial neurotoxin zinc protease This family is a subset of the Prosite family; InterPro: IPR000395 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belongs to MEROPS peptidase family M27 (clan MA(E)). A number of the proteins have been classified as non-peptidase homologues as they have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the family. There are seven antigenically distinct forms of botulinum neurotoxin, designated A, B, C1, D, E, F and G. The seven neurotoxins are potent protein toxins that inhibit neurotransmitter release from peripheral cholinergic synapses []. On binding to the neuronal synapses, the molecules are internalised and move by retrograde transport up the axon into the spinal cord, where they can move between post- and presynaptic neurons. The toxin inhibits neurotransmitter release by acting as a zinc endopeptidase that cleaves synaptic proteins such as synaptobrevins, syntaxin and SNAP-25 []. The protein toxins exist as disulphide-linked heterodimers of light and heavy chains. The light chain has the pharmacological activity, while the N- and C-termini of the heavy chain mediate channel formation and toxin binding [].
The light chain exhibits a high level of sequence similarity to tetanus toxin (TeTx). Alignment of all characterised neurotoxin sequences reveals the presence of highly conserved amino acid domains interspersed with amino acid tracts with little overall similarity. The most divergent region corresponds to the C-terminal extremity of each toxin, which may reflect differences in specificity of binding to neurone acceptor sites []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0009405 pathogenesis; PDB: 2QN0_A 3D3X_A 3NF3_A 2ISG_A 2ISE_B 2IMC_B 2IMA_B 2ILP_A 2IMB_A 2ISH_A ....
Probab=20.76 E-value=75 Score=34.91 Aligned_cols=23 Identities=30% Similarity=0.463 Sum_probs=18.3
Q ss_pred CcChhhHHHHHHHHHHHHHHhhh
Q 037955 497 RLNHHEVETLFHEFGHALHSLLS 519 (733)
Q Consensus 497 lL~~~~v~TLfHEfGHalH~lls 519 (733)
..=-|=+.+|+||+=|+||.|.+
T Consensus 209 ~F~~DPAl~LmheLIh~Lh~LYG 231 (408)
T PF01742_consen 209 EFYADPALELMHELIHSLHGLYG 231 (408)
T ss_dssp EEE--HHHHHHHHHHHHHHHHTT
T ss_pred heecCHHHHHHHHHHHHHHhhcc
Confidence 34457889999999999999986
No 86
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belongs to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii. Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=20.56 E-value=49 Score=35.70 Aligned_cols=15 Identities=40% Similarity=0.771 Sum_probs=12.2
Q ss_pred hhHHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHALH 515 (733)
Q Consensus 501 ~~v~TLfHEfGHalH 515 (733)
.+..++|||+||-+=
T Consensus 149 ~~~~~~~HElgHN~G 163 (314)
T PF05548_consen 149 QDWATIMHELGHNLG 163 (314)
T ss_pred ccHHHHHHHhhhhcc
Confidence 356799999999873
No 87
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belongs to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=20.56 E-value=54 Score=32.54 Aligned_cols=14 Identities=36% Similarity=0.522 Sum_probs=12.1
Q ss_pred hhHHHHHHHHHHHH
Q 037955 501 HEVETLFHEFGHAL 514 (733)
Q Consensus 501 ~~v~TLfHEfGHal 514 (733)
.-..+++||+||.|
T Consensus 130 ~~a~~~AHelGH~l 143 (199)
T PF01421_consen 130 SFAVIIAHELGHNL 143 (199)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHhc
Confidence 56789999999986
No 88
>TIGR00488 putative HD superfamily hydrolase of NAD metabolism. The function of this protein family is unknown. Members of this family of uncharacterized proteins from the Mycoplasmas are longer at the amino end, fused to a region of nicotinamide nucleotide adenylyltransferase, an NAD salvage biosynthesis enzyme. Members are putative metal-dependent phosphohydrolases for NAD metabolism.
Probab=20.26 E-value=6.7e+02 Score=23.73 Aligned_cols=99 Identities=15% Similarity=0.113 Sum_probs=54.9
Q ss_pred hhHHHHHHHHHHHHHH--hh---hcCCCC-ccCCCcccchhhhhhHHHHHHhh-h-cHHHHHHHHhcccCCCCChHHHH-
Q 037955 501 HEVETLFHEFGHALHS--LL---SRTDYQ-HFSGTRVALDFAETPSNLFEYYA-W-DYRVLRRFAKHYLTGEIVPEKLV- 571 (733)
Q Consensus 501 ~~v~TLfHEfGHalH~--ll---s~~~~~-~~sgt~~~~D~~E~pS~~~E~~~-~-~~~~L~~~s~h~~tge~lp~~l~- 571 (733)
.-+.-|.|.+|..+-. ++ .+...+ .........+-.++.+.+++.|. . +++++..+..|..+.++.+ .+.
T Consensus 34 a~~AGLLHDIGk~~~~~~~~~~~~~~~~~~~~~~~~~~~~H~~vGa~ll~~w~~~~~~~i~~aI~~H~~~~~~~~-~l~~ 112 (158)
T TIGR00488 34 AEIAGAYHDLAKFLPKEQLKQIAKREKMPAHLLYPSPKLLHAYVGAYILKREFGVQDEDILDAIRNHTSGPPGMS-LLDM 112 (158)
T ss_pred HHHHHHHHHHhccCCHHHHHHHHHHcCCCchhhcccccccHHHHHHHHHHHHhCCCcHHHHHHHHHhCCCCCCCC-HHHH
Confidence 4677899999984311 11 111000 01112234568899999999986 5 6778888888875544333 111
Q ss_pred -----HHHHHhhhhhhHHHHHHHHHHHHhHHHhh
Q 037955 572 -----KSMQGARDMFAATELQRQIFYALVDQTLF 600 (733)
Q Consensus 572 -----~~l~~~~~~~~~~~~~~ql~~a~fD~~lh 600 (733)
+.+...+..+.+.+-+|.+.+-.+|..+.
T Consensus 113 iV~lAD~i~~~~~~~~~~~~~~~~~~~~l~~~~~ 146 (158)
T TIGR00488 113 IIYVADKLEPNRGAGIEIDELRKLAKTDLKQAYI 146 (158)
T ss_pred HHHhHHHHhhcccccccHHHHHHHHHHHHHHHHH
Confidence 22322232244555667776666666554
Done!