Query 005310
Match_columns 703
No_of_seqs 225 out of 1830
Neff 7.7
Searched_HMMs 46136
Date Thu Mar 28 21:27:25 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/005310.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/005310hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0339 Dcp Zn-dependent oligo 100.0 4E-153 8E-158 1258.2 65.4 642 52-698 18-679 (683)
2 PRK10280 dipeptidyl carboxypep 100.0 4E-146 1E-150 1258.2 70.9 642 49-698 14-677 (681)
3 KOG2089 Metalloendopeptidase f 100.0 2E-140 5E-145 1137.5 56.6 649 49-699 35-707 (718)
4 PRK10911 oligopeptidase A; Pro 100.0 8E-139 2E-143 1202.9 70.7 641 52-698 11-676 (680)
5 KOG2090 Metalloendopeptidase f 100.0 7E-128 2E-132 1026.8 57.9 638 47-700 42-691 (704)
6 cd06457 M3A_MIP Peptidase M3 m 100.0 2E-105 5E-110 892.8 46.0 436 255-695 2-455 (458)
7 cd06455 M3A_TOP Peptidase M3 T 100.0 8E-101 2E-105 860.7 51.8 467 194-697 1-471 (472)
8 cd06456 M3A_DCP_Oligopeptidase 100.0 4E-100 9E-105 838.9 47.0 415 279-698 1-420 (422)
9 PF01432 Peptidase_M3: Peptida 100.0 3.1E-92 6.8E-97 792.5 42.7 440 255-698 1-456 (458)
10 cd06258 Peptidase_M3_like The 100.0 2.7E-66 6E-71 566.1 40.0 357 282-697 2-364 (365)
11 TIGR00181 pepF oligoendopeptid 100.0 2.8E-59 6E-64 540.7 39.5 538 59-697 18-576 (591)
12 TIGR02289 M3_not_pepF oligoend 100.0 3.7E-54 7.9E-59 492.3 43.6 501 76-690 10-528 (549)
13 cd06459 M3B_Oligoendopeptidase 100.0 6.1E-52 1.3E-56 462.5 27.8 380 240-699 29-423 (427)
14 TIGR02290 M3_fam_3 oligoendope 100.0 1E-49 2.2E-54 459.8 44.4 530 63-698 16-573 (587)
15 COG1164 Oligoendopeptidase F [ 100.0 4.4E-40 9.5E-45 375.2 40.9 508 88-697 49-580 (598)
16 cd06461 M2_ACE Peptidase famil 99.9 1.8E-24 3.9E-29 239.9 34.1 370 248-696 32-464 (477)
17 cd06460 M32_Taq Peptidase fami 99.9 7.4E-19 1.6E-23 190.2 33.6 333 281-699 30-392 (396)
18 PF01401 Peptidase_M2: Angiote 99.6 3.8E-12 8.2E-17 145.2 36.4 495 96-697 21-567 (595)
19 PF02074 Peptidase_M32: Carbox 99.4 1.6E-09 3.4E-14 121.1 39.9 416 128-698 43-490 (494)
20 COG2317 Zn-dependent carboxype 99.3 3.9E-08 8.4E-13 105.9 36.9 411 129-698 44-491 (497)
21 KOG3690 Angiotensin I-converti 99.0 1.3E-06 2.8E-11 96.5 35.6 485 118-697 61-596 (646)
22 cd04277 ZnMc_serralysin_like Z 90.1 1.2 2.7E-05 43.8 8.2 30 387-418 38-67 (186)
23 COG2856 Predicted Zn peptidase 88.2 0.45 9.9E-06 47.9 3.5 33 464-506 57-89 (213)
24 cd04278 ZnMc_MMP Zinc-dependen 83.1 5.6 0.00012 37.9 8.2 16 487-502 105-120 (157)
25 PF06114 DUF955: Domain of unk 82.5 1.5 3.2E-05 39.1 3.7 33 465-507 28-60 (122)
26 PF14247 DUF4344: Domain of un 82.2 0.89 1.9E-05 46.0 2.3 31 488-524 91-121 (220)
27 PF00413 Peptidase_M10: Matrix 78.8 0.98 2.1E-05 42.7 1.3 23 386-408 25-47 (154)
28 PF13058 DUF3920: Protein of u 75.6 2 4.3E-05 37.9 2.1 25 470-503 66-90 (126)
29 PF13398 Peptidase_M50B: Pepti 74.0 2.7 5.9E-05 42.0 3.0 24 487-510 20-43 (200)
30 cd04268 ZnMc_MMP_like Zinc-dep 70.8 2.4 5.2E-05 40.6 1.7 32 386-419 18-49 (165)
31 cd04279 ZnMc_MMP_like_1 Zinc-d 70.1 2.4 5.3E-05 40.4 1.6 15 488-502 103-117 (156)
32 smart00235 ZnMc Zinc-dependent 64.1 3.4 7.3E-05 38.5 1.2 11 491-501 88-98 (140)
33 PF14891 Peptidase_M91: Effect 63.7 4.9 0.00011 39.2 2.3 22 487-508 101-122 (174)
34 PF13574 Reprolysin_2: Metallo 63.0 4.7 0.0001 39.3 2.1 14 489-502 111-124 (173)
35 PF13688 Reprolysin_5: Metallo 62.7 4.1 9E-05 40.2 1.6 17 486-502 139-155 (196)
36 cd04327 ZnMc_MMP_like_3 Zinc-d 61.8 4.2 9.1E-05 40.5 1.5 16 489-504 92-107 (198)
37 PF13582 Reprolysin_3: Metallo 61.6 4.1 8.9E-05 36.9 1.3 13 489-501 107-119 (124)
38 PF04228 Zn_peptidase: Putativ 60.2 6.6 0.00014 41.6 2.7 22 486-507 167-188 (292)
39 PF04298 Zn_peptidase_2: Putat 57.5 9.4 0.0002 38.6 3.1 20 486-505 86-105 (222)
40 cd00203 ZnMc Zinc-dependent me 56.8 5.5 0.00012 38.0 1.3 24 385-408 24-47 (167)
41 PF13583 Reprolysin_4: Metallo 55.8 5.4 0.00012 40.1 1.1 14 488-501 136-149 (206)
42 PRK13267 archaemetzincin-like 55.5 27 0.00058 34.3 5.8 22 427-452 71-92 (179)
43 PF05572 Peptidase_M43: Pregna 53.9 7 0.00015 37.4 1.5 17 486-502 66-82 (154)
44 COG0501 HtpX Zn-dependent prot 53.4 13 0.00028 39.3 3.6 21 484-504 151-172 (302)
45 PF10460 Peptidase_M30: Peptid 53.3 12 0.00027 40.7 3.4 44 489-536 139-184 (366)
46 PRK04778 septation ring format 52.4 2E+02 0.0044 33.6 13.5 74 122-202 194-268 (569)
47 cd04280 ZnMc_astacin_like Zinc 49.9 8.1 0.00018 37.9 1.2 13 490-502 75-87 (180)
48 PF13485 Peptidase_MA_2: Pepti 48.3 20 0.00043 31.9 3.5 17 491-507 27-43 (128)
49 PF06160 EzrA: Septation ring 48.1 2.4E+02 0.0053 32.9 13.2 75 123-204 191-266 (560)
50 PF02031 Peptidase_M7: Strepto 46.7 13 0.00028 34.2 1.9 17 486-502 74-90 (132)
51 cd04272 ZnMc_salivary_gland_MP 46.7 11 0.00024 38.1 1.7 16 486-501 142-157 (220)
52 cd04283 ZnMc_hatching_enzyme Z 45.4 11 0.00024 37.1 1.4 15 490-504 78-92 (182)
53 PF12388 Peptidase_M57: Dual-a 45.4 10 0.00022 38.2 1.1 12 491-502 135-146 (211)
54 PF01400 Astacin: Astacin (Pep 45.3 13 0.00027 36.9 1.8 14 489-502 79-92 (191)
55 cd07633 BAR_OPHN1 The Bin/Amph 43.6 1.5E+02 0.0032 29.7 8.9 115 59-200 10-124 (207)
56 cd04270 ZnMc_TACE_like Zinc-de 42.7 13 0.00029 38.3 1.5 12 490-501 168-179 (244)
57 cd04271 ZnMc_ADAM_fungal Zinc- 42.6 7.7 0.00017 39.7 -0.2 14 488-501 144-157 (228)
58 cd07602 BAR_RhoGAP_OPHN1-like 40.9 2.2E+02 0.0047 28.7 9.7 122 59-207 10-131 (207)
59 cd04267 ZnMc_ADAM_like Zinc-de 39.7 12 0.00026 36.9 0.6 15 487-501 131-145 (192)
60 cd04281 ZnMc_BMP1_TLD Zinc-dep 39.2 16 0.00034 36.6 1.4 14 490-503 88-101 (200)
61 PF09471 Peptidase_M64: IgA Pe 38.0 16 0.00035 38.2 1.3 19 489-507 216-234 (264)
62 cd04269 ZnMc_adamalysin_II_lik 37.7 18 0.00038 35.7 1.5 15 487-501 129-143 (194)
63 cd06163 S2P-M50_PDZ_RseP-like 37.6 20 0.00044 35.2 1.9 20 490-509 10-29 (182)
64 cd05709 S2P-M50 Site-2 proteas 37.0 21 0.00045 34.7 1.9 24 490-513 9-32 (180)
65 COG2738 Predicted Zn-dependent 36.3 28 0.00062 34.3 2.6 19 486-504 89-107 (226)
66 cd04276 ZnMc_MMP_like_2 Zinc-d 35.9 20 0.00044 35.8 1.6 12 491-502 118-129 (197)
67 PF13699 DUF4157: Domain of un 35.8 20 0.00044 30.1 1.3 19 486-504 58-76 (79)
68 cd07636 BAR_GRAF The Bin/Amphi 35.7 2.5E+02 0.0053 28.3 9.2 122 59-207 10-131 (207)
69 cd06161 S2P-M50_SpoIVFB SpoIVF 35.2 23 0.0005 35.6 1.9 18 490-507 39-56 (208)
70 PF02163 Peptidase_M50: Peptid 34.2 26 0.00056 34.4 2.0 24 490-513 8-31 (192)
71 PF01435 Peptidase_M48: Peptid 33.2 28 0.00061 34.8 2.2 23 485-507 84-107 (226)
72 TIGR03296 M6dom_TIGR03296 M6 f 32.5 9.6 0.00021 40.4 -1.4 13 489-501 165-177 (286)
73 cd04282 ZnMc_meprin Zinc-depen 31.6 24 0.00052 36.1 1.3 15 490-504 121-135 (230)
74 PRK02870 heat shock protein Ht 30.7 31 0.00068 37.4 2.1 20 485-504 168-188 (336)
75 PRK03982 heat shock protein Ht 30.6 30 0.00066 36.6 2.0 21 485-505 120-141 (288)
76 KOG2719 Metalloprotease [Gener 30.2 54 0.0012 36.4 3.8 54 440-507 244-298 (428)
77 PRK03001 M48 family peptidase; 29.8 31 0.00068 36.4 1.9 21 485-505 119-140 (283)
78 cd06159 S2P-M50_PDZ_Arch Uncha 28.4 35 0.00075 35.7 1.9 18 490-507 119-136 (263)
79 COG4086 Predicted secreted pro 28.0 4.6E+02 0.01 27.6 9.7 78 122-202 166-248 (299)
80 PRK01345 heat shock protein Ht 27.7 39 0.00085 36.4 2.2 21 485-505 119-140 (317)
81 PRK04897 heat shock protein Ht 27.2 38 0.00082 36.1 2.0 21 485-505 132-153 (298)
82 cd06164 S2P-M50_SpoIVFB_CBS Sp 26.7 39 0.00085 34.5 1.9 18 490-507 54-71 (227)
83 cd00923 Cyt_c_Oxidase_Va Cytoc 26.7 3.4E+02 0.0074 24.0 7.2 62 118-187 38-100 (103)
84 TIGR03042 PS_II_psbQ_bact phot 26.5 4.5E+02 0.0098 24.8 8.7 13 57-69 29-41 (142)
85 PRK03072 heat shock protein Ht 26.4 42 0.00091 35.6 2.1 22 485-506 122-144 (288)
86 PF01742 Peptidase_M27: Clostr 26.2 48 0.001 36.3 2.5 25 483-507 208-232 (408)
87 PRK01265 heat shock protein Ht 26.1 42 0.00092 36.2 2.1 21 485-505 135-156 (324)
88 PTZ00337 surface protease GP63 25.0 78 0.0017 36.9 4.1 33 464-501 209-241 (567)
89 PF09610 Myco_arth_vir_N: Myco 24.4 32 0.0007 23.6 0.5 23 13-35 5-27 (33)
90 PRK05457 heat shock protein Ht 24.2 51 0.0011 34.9 2.3 20 485-504 129-149 (284)
91 cd06162 S2P-M50_PDZ_SREBP Ster 24.1 47 0.001 35.0 1.9 17 490-506 136-152 (277)
92 PF05055 DUF677: Protein of un 23.9 2.7E+02 0.006 30.2 7.8 50 65-114 232-282 (336)
93 cd07635 BAR_GRAF2 The Bin/Amph 23.9 3.4E+02 0.0074 27.3 7.8 121 60-207 11-131 (207)
94 cd07601 BAR_APPL The Bin/Amphi 22.9 6.5E+02 0.014 25.5 9.7 122 57-207 8-129 (215)
95 PF01447 Peptidase_M4: Thermol 22.9 54 0.0012 31.2 2.0 14 489-502 135-148 (150)
96 PF01421 Reprolysin: Reprolysi 22.6 54 0.0012 32.4 2.0 16 486-501 128-143 (199)
97 PF01418 HTH_6: Helix-turn-hel 22.5 70 0.0015 26.5 2.3 18 292-309 52-69 (77)
98 PF05823 Gp-FAR-1: Nematode fa 22.3 1.6E+02 0.0036 28.0 5.1 33 146-178 54-90 (154)
99 cd04275 ZnMc_pappalysin_like Z 21.4 19 0.00041 36.7 -1.6 17 486-502 134-150 (225)
100 cd06160 S2P-M50_like_2 Unchara 21.1 60 0.0013 32.0 1.9 18 491-508 43-60 (183)
101 PF05548 Peptidase_M11: Gameto 20.9 48 0.001 35.6 1.2 15 488-502 149-163 (314)
102 PRK02391 heat shock protein Ht 20.8 61 0.0013 34.5 2.0 21 485-505 128-149 (296)
103 TIGR00054 RIP metalloprotease 20.5 56 0.0012 36.6 1.8 20 489-508 14-33 (420)
104 PF08328 ASL_C: Adenylosuccina 20.3 2.2E+02 0.0048 25.8 5.1 42 146-196 69-110 (115)
105 cd04273 ZnMc_ADAMTS_like Zinc- 20.2 21 0.00046 35.7 -1.6 14 488-501 139-152 (207)
106 PF14559 TPR_19: Tetratricopep 20.2 1.8E+02 0.004 22.4 4.3 54 643-696 10-63 (68)
No 1
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=100.00 E-value=3.5e-153 Score=1258.18 Aligned_cols=642 Identities=30% Similarity=0.507 Sum_probs=610.9
Q ss_pred CCcccCCChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHHHHH
Q 005310 52 SIVRVNLSASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEA 131 (703)
Q Consensus 52 ~~~~~~~s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~ 131 (703)
.|+...+.|+++.|+++++|+.+++.+++|... ++.+||+|+|.||+++++.||.+++.+.++++||+|+++|+++++|
T Consensus 18 lP~F~~i~~ed~~pAf~~~l~~~~~~i~~i~~~-~~~pt~~n~v~~Le~~~~~L~rv~~~~~~l~~v~~~~elr~a~e~~ 96 (683)
T COG0339 18 LPPFDAIKPEDVKPAFDEALAQARAEIDAIAAN-PDAPTWENTVEPLERAGDRLGRVWSVVSHLNSVHNSPELREAYEEI 96 (683)
T ss_pred CCCcccCChhhccHHHHHHHHHHHHHHHHHHcC-CCCCcHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCCHHHHHHHHHh
Confidence 666677999999999999999999999999887 4789999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHhcC--CCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhh
Q 005310 132 ERKIDAHMLSCSNREDVYRVVKAVAATG--EWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYVRN 209 (703)
Q Consensus 132 ~~~l~~~~~~l~~~~~ly~~l~~~~~~~--~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~~n 209 (703)
.++|++|++.|++|.+||.+++++++.. ..||+|++|+++..+++|+++|+.|+++.|+|+.+|++|++.|+++|++|
T Consensus 97 ~pklse~~~~l~~~~~Lf~~~~al~~~~e~~~ld~e~~r~~e~~~rdF~~sGa~L~~~~k~r~~~I~~ela~L~~~Fs~n 176 (683)
T COG0339 97 LPKLSEFSNDLGQNPGLFARYKALWQSPESAGLDPEQKRVVENALRDFVLSGAKLPPEKKARLAEINTELAELGAQFSNN 176 (683)
T ss_pred hHHHHHHHHHhhcCHHHHHHHHHHhcCcccccCCHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 9999999999999999999999999843 46899999999999999999999999999999999999999999999999
Q ss_pred ccccccc--eEe-cccccCCCCHHHHHHHhhcc----CCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC----C
Q 005310 210 LNDDKTF--LLF-SEADLLGLPPEFLKSLDKAE----NGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG----G 278 (703)
Q Consensus 210 ~~~~~~~--~~~-~~~eL~glp~~~l~~~~~~~----~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~----~ 278 (703)
+.+++.. +.+ ++++|.|+|+..+++++... .++|.|||..|++.|+|++|.||++||++|+|+.+|++ .
T Consensus 177 vL~at~~~~~~~~~~~~LaGlp~~~l~aa~~~A~~kg~~~~~itl~~p~~~p~l~~~~~r~lRe~~y~A~~~ra~~~~~~ 256 (683)
T COG0339 177 VLDATKAWALIVTDEAELAGLPESLLAAAAAAAEAKGLEGYLITLDIPSYLPVLTYADNRALREKLYRAYVTRASEGGPN 256 (683)
T ss_pred hccccccceeecCcHHHhcCCCHHHHHHHHHHHHhcCCCCeEEeecccchhHHHHhcccHHHHHHHHHHHhhhccccCCc
Confidence 9999975 334 37899999999999887643 24599999999999999999999999999999999986 2
Q ss_pred chHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHh-cCCCCCCcc
Q 005310 279 INLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKE-EGELPFGIE 357 (703)
Q Consensus 279 ~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~-~g~~~l~pW 357 (703)
+|.+++.+++++|.|+|+||||+|||+|+|.++||+||+.|.+||++|..+.+|.+++|++.|+++++++ .|..+++||
T Consensus 257 ~n~~ii~~~L~LR~E~A~LLGf~~yA~~~L~~kMA~~p~~Vl~fL~~l~~ka~~~a~~e~a~L~~~~~~~~~~~~~l~~W 336 (683)
T COG0339 257 DNRAIIEEILKLRAELAKLLGFSNYAEYSLADKMAKTPEAVLNFLNDLAEKARPQAEKELAELQAFAAEEEGGLPELQPW 336 (683)
T ss_pred ccHHHHHHHHHHHHHHHHHcCcccHHHHhHHHHhhcCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhccCCcccchh
Confidence 5889999999999999999999999999999999999999999999999999999999999999999985 788899999
Q ss_pred chhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecc
Q 005310 358 DLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDI 437 (703)
Q Consensus 358 D~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl 437 (703)
|+.||.+++|+.+|++|+++++||||++.||+|+|.++++||||+|+|..++++||||||+|+|+| .+|..+|.||+|+
T Consensus 337 D~~yyaeK~r~~~y~~de~elrpYF~l~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpDVr~~~v~d-~~g~~~g~fY~Dl 415 (683)
T COG0339 337 DWAYYAEKQRQEKYAFDEEELRPYFPLNKVLEGLFEVAKRLFGITFVERKDIPVWHPDVRVFEVFD-ENGELIGLFYLDL 415 (683)
T ss_pred hHHHHHHHHHhhhcCCCHHHhhhcCChhHHHHHHHHHHHHHcCeEEEECCCCCccCCCceEEEEEc-CCCCEEEEEEeec
Confidence 999999999999999999999999999999999999999999999999999999999999999999 5799999999999
Q ss_pred cccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCc
Q 005310 438 YKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLR 517 (703)
Q Consensus 438 ~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~ 517 (703)
|+|+||++||||++++++....+|..|.||++++|||.+|.+++|+||+|+||.||||||||+||+||++++|+.+|||+
T Consensus 416 yaR~~KrgGAWM~~~~~~~~~~~~~~q~PV~ylvCNF~~p~~gkPaLls~dEV~TLFHEfGHgLH~mlt~v~~~~vsGt~ 495 (683)
T COG0339 416 YARDGKRGGAWMDDFVSQRRLDDGGGQKPVIYLVCNFTKPVGGKPALLSHDEVTTLFHEFGHGLHHLLTRVKYPGVSGTN 495 (683)
T ss_pred ccCCCCccchHHHHhhhcccccCCCcccceEEEeccCCCCCCCCCceeeHHHHHHHHHHhhhHHHHHhhcCCccccCCCC
Confidence 99999999999999999988888889999999999999999999999999999999999999999999999999999999
Q ss_pred cchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCC-c---c
Q 005310 518 VDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSAD-N---V 593 (703)
Q Consensus 518 ~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~-~---~ 593 (703)
|+|||||+|||+||||||+|++|..+++||+| |+|||++++++|+++|||++|+.+++|+.+|+|||.+|+.. | .
T Consensus 496 v~wDfVElPSQ~mE~w~~~p~vL~~~a~Hy~T-Ge~lP~~ll~k~laaknf~~g~~t~rql~fal~Dm~~H~~~~~~~~~ 574 (683)
T COG0339 496 VPWDFVELPSQFMENWCWEPEVLAKYARHYQT-GEPLPKELLDKMLAAKNFQAGLFTLRQLEFALFDMRLHTEFDPDANA 574 (683)
T ss_pred CCcchhhccHHHHHHhhcCHHHHHHHHHhhcc-CCcCHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHhhccCCccccc
Confidence 99999999999999999999999999999999 99999999999999999999999999999999999999876 2 4
Q ss_pred cHHHHHHHHhcccccCCCCCCCCCCcccccccc-CcccCCcccHHHHHHHHHHHHHHHHhc-CCCCHHHHHHHHHHhccC
Q 005310 594 DIVELFKHLHPKVMLGLPMLEGTNPASCFPRSA-IGFESACYSRIWSEVFAADIFACKFRD-GLLNQRVGMQFRNNVLAP 671 (703)
Q Consensus 594 ~~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~-~~Y~a~YYsYl~a~vlA~~i~~~~f~~-~~~~~~~g~~~r~~vL~~ 671 (703)
++.+++.++.++ ...++.++..+|.++|+||| |||+||||||+||+|+++|.| ++|++ |++|+++|++||+.||+.
T Consensus 575 ~i~~~~~~~~~~-~~v~~~~~~~~~~~~F~HIFagGYsAGYYSY~WaeVLsaDaf-a~Fee~g~~~~e~G~rfrd~ILs~ 652 (683)
T COG0339 575 DILEFEAEVLKK-VAVLPSIPPRRRPHSFGHIFAGGYSAGYYSYLWAEVLSADAF-AAFEEEGPFNRETGQRFRDAILSR 652 (683)
T ss_pred CHHHHHHHHHHH-hCCCCCcchhhccccccceecCcccchhHHHHHHHHHhhHHH-HHHHhcCCCCHHHHHHHHHHHHhc
Confidence 899999999999 66565666688889999999 789999999999999999999 77776 899999999999999999
Q ss_pred CCCccHHHHHHHhhCCCCChHHHHHHh
Q 005310 672 GGSKEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 672 Ggs~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
|||++|++++++|+||+|+++++++.+
T Consensus 653 GGS~dp~e~f~~frGrep~~dalLr~~ 679 (683)
T COG0339 653 GGSRDPMELFKAFRGREPSIDALLRHR 679 (683)
T ss_pred cCCcCHHHHHHHHhcCCCChhHHHHhc
Confidence 999999999999999999999999987
No 2
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=100.00 E-value=4.4e-146 Score=1258.17 Aligned_cols=642 Identities=22% Similarity=0.345 Sum_probs=596.8
Q ss_pred CCCCCcccCCChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHH
Q 005310 49 LQGSIVRVNLSASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSS 128 (703)
Q Consensus 49 ~p~~~~~~~~s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa 128 (703)
|.+.|+.-.++|++|.++++++|+.+++.+++|.+. +..+||+|||.|+|++++.++.+++.+.++++||||+++|+++
T Consensus 14 ~~~~p~f~~i~~e~~~~a~~~~~~~~~~~i~~i~~~-~~~~t~~n~i~~ld~~~~~l~~~~~~~~~l~~v~~~~~~r~a~ 92 (681)
T PRK10280 14 PYLAPHFDQIADHHYRPAFDEGVRQKRAEIAAIALN-PQAPDFNNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLD 92 (681)
T ss_pred CCCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHHHcC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH
Confidence 344555555799999999999999999999999875 4457999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCC--CCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHH
Q 005310 129 AEAERKIDAHMLSCSNREDVYRVVKAVAATGE--WVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQY 206 (703)
Q Consensus 129 ~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~--~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f 206 (703)
++|.+++++|.+++++|++||++++++++... .|++|++|++++++++|+++|++|++++|+++++|+.+|+.|+++|
T Consensus 93 ~~~~~~l~~~~~~l~~~~~Ly~~l~~~~~~~~~~~l~~e~~r~l~~~l~dF~~sG~~L~~~~r~r~~~l~~~l~~L~~~F 172 (681)
T PRK10280 93 EQFSAELAELANDIYLNGELFARVDAVWQQRESLGLDSESIRLVEVIHQRFVLAGAKLAQADKAKLKVLNTEAATLTSQF 172 (681)
T ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999976433 7999999999999999999999999999999999999999999999
Q ss_pred Hhhccccccc--eEe-cccccCCCCHHHHHHHhhc-----cCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccCC
Q 005310 207 VRNLNDDKTF--LLF-SEADLLGLPPEFLKSLDKA-----ENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCGG 278 (703)
Q Consensus 207 ~~n~~~~~~~--~~~-~~~eL~glp~~~l~~~~~~-----~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~~ 278 (703)
++|+.++++. +.+ +++||+|||+++++++++. .+|+|.||++++++.|+|++|+||++||++|+|+.+|++.
T Consensus 173 ~~n~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a~~~g~~g~~~itl~~~~~~p~l~~~~dr~~Re~~~~A~~~r~~~ 252 (681)
T PRK10280 173 NQRLLAANKSGGLVVNDIHQLAGLSEQEIALAAEAAREKGLDNRWLIPLLNTTQQPALAELRDRQTRENLFAAGWTRAEK 252 (681)
T ss_pred HHhHhhhhhcCeeecCCHHHHcCCCHHHHHHHHHHHHhcCCCCcEEEeCcCCcHhHHHhcCCCHHHHHHHHHHHHhhhcc
Confidence 9999999874 455 5789999999999998763 2456999999999999999999999999999999999863
Q ss_pred ----chHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCC
Q 005310 279 ----INLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPF 354 (703)
Q Consensus 279 ----~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l 354 (703)
+|.++|.+|+++|+++|++|||+||++|++.++|++||++|.+||++|...++|.+++|++.|++++++..|...+
T Consensus 253 ~~~~dn~~il~~ll~lR~e~A~lLGf~~yA~~~l~~kMa~spe~V~~FL~~L~~~~~~~a~~E~~~L~~~~~~~~g~~~l 332 (681)
T PRK10280 253 GDANDTRAIIQRLVEIRAQQAKLLGFPHYAAWKIADQMAKTPEAALNFMREIVPAARQRASDELASIQAVIDKQQGGFSA 332 (681)
T ss_pred CCchhHHHHHHHHHHHHHHHHHHcCCCcHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Confidence 7899999999999999999999999999999999999999999999999999999999999999999877677789
Q ss_pred CccchhhhHHHHHhhhcCCChhhhccCCChHHHH-HHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeee
Q 005310 355 GIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVL-SGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYF 433 (703)
Q Consensus 355 ~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl-~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ 433 (703)
+|||+.||.+++++++|++|+++++||||++.|+ +|||.++++||||+|+++++.++|||||++|+|+| ++|.++|+|
T Consensus 333 ~pWD~~yy~ek~r~~~~~~d~~~l~~YFpl~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~~~V~d-~~g~~lG~f 411 (681)
T PRK10280 333 QAWDWAFYAEQVRREKYALDEAQLKPYFELNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVWEIFD-HNGVGLALF 411 (681)
T ss_pred ChhHHHHHHHHHHHHhcCCCHHHcCCcCcHHHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeEEEEEc-CCCCEEEEE
Confidence 9999999999999999999999999999999999 79999999999999999988899999999999999 567899999
Q ss_pred eecccccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCcccc
Q 005310 434 YLDIYKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARF 513 (703)
Q Consensus 434 ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~ 513 (703)
|+|+|+|+||++||||.+++.+.. ..+|.|+++++|||++|++++|+||+|+||+||||||||+||++|++++|+++
T Consensus 412 Y~Dl~~R~gK~~gawm~~~~~~~~---~~~~~Pv~~lvcNf~~p~~~~p~LL~~~eV~TlFHEfGHalH~lls~~~y~~~ 488 (681)
T PRK10280 412 YGDFFARDSKSGGAWMGNFVEQST---LNETRPVIYNVCNYQKPAAGQPALLLWDDVITLFHEFGHTLHGLFARQRYATL 488 (681)
T ss_pred EecCCCCCCCCCCCCCcccccccc---cCCCCCeEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhCCCcccc
Confidence 999999999999999998875432 24568999999999999999999999999999999999999999999999999
Q ss_pred CCCccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCCc-
Q 005310 514 SGLRVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSADN- 592 (703)
Q Consensus 514 sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~~- 592 (703)
|||+|+|||||+||||||+|||+|+||+.||+||+| |||||++++++|+++|++++|+.+++|+.+|+|||.+|+..+
T Consensus 489 sGt~v~~DfVE~PSq~mE~w~~~~~vL~~~a~Hy~T-gepiP~~l~~~l~~ar~~~~g~~~~~ql~~al~D~~lH~~~~~ 567 (681)
T PRK10280 489 SGTNTPRDFVEFPSQINEHWASHPQVFARYARHYQS-GEAMPDELQEKMRNASLFNKGYDMSELLSAALLDMRWHCLEEN 567 (681)
T ss_pred CCCCCCcchhcCcHHHHHHHhcCHHHHHHHhhccCC-CCCCCHHHHHHHHHhhCcchHHHHHHHHHHHHHhHHHhccCcc
Confidence 999999999999999999999999999999999999 999999999999999999999999999999999999997632
Q ss_pred ---ccHHHHHHHHhccccc-CCCCCCCCCCcccccccc-CcccCCcccHHHHHHHHHHHHHHHHhc-CCCCHHHHHHHHH
Q 005310 593 ---VDIVELFKHLHPKVML-GLPMLEGTNPASCFPRSA-IGFESACYSRIWSEVFAADIFACKFRD-GLLNQRVGMQFRN 666 (703)
Q Consensus 593 ---~~~~~l~~~l~~~~~~-~~~~~~~~~~~~~f~Hl~-~~Y~a~YYsYl~a~vlA~~i~~~~f~~-~~~~~~~g~~~r~ 666 (703)
.++..++.++..+ +. .+|.+++++|+++|+||| |||+||||||+||+|+|+|+| +.|.+ |++|+++|++||+
T Consensus 568 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~F~Hif~ggY~AgYYsYlwaevlaaD~f-~~f~~~g~~n~~~G~~fr~ 645 (681)
T PRK10280 568 EAMQDVDDFELRALVA-ENLDLPAVPPRYRSSYFAHIFGGGYAAGYYAYLWTQMLADDGY-QWFVEQGGLTRENGQRFRE 645 (681)
T ss_pred cccccHHHHHHHHHHH-hCCCCCCCCCCCCCCcccccccCCcchhhHHHHHHHHHHHHHH-HHHHhcCCCCHHHHHHHHH
Confidence 2677777777766 43 256677888999999999 689999999999999999999 66655 8899999999999
Q ss_pred HhccCCCCccHHHHHHHhhCCCCChHHHHHHh
Q 005310 667 NVLAPGGSKEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 667 ~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
+||++|||++|++++++||||+|+++||++++
T Consensus 646 ~iL~~GGs~d~~~~~~~FlGR~P~~~alL~~~ 677 (681)
T PRK10280 646 AILSRGNSTDLERLYRQWRGHAPQIMPMLQHR 677 (681)
T ss_pred HHhhcCCCcCHHHHHHHhcCCCCChHHHHHhc
Confidence 99999999999999999999999999999986
No 3
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=2.4e-140 Score=1137.47 Aligned_cols=649 Identities=41% Similarity=0.706 Sum_probs=621.5
Q ss_pred CCCCCcccCCChhhHHHH--HHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHH
Q 005310 49 LQGSIVRVNLSASEILKL--ADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCK 126 (703)
Q Consensus 49 ~p~~~~~~~~s~~~i~~~--~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~ 126 (703)
+++.+++++.|+++|.-. +.++|.++.+..|.++.++.+.++|+|++.|+.++++++..++++|.|+.+|++++++|+
T Consensus 35 ~g~~~~~~~~s~~~i~~~~~~~eii~~~~~v~d~v~~~~~ee~s~en~vepla~ie~el~~~~~~L~f~~~vs~~~~~R~ 114 (718)
T KOG2089|consen 35 LGANDPPWLLSASDIEVRPGTREIIQQSKAVYDSVATLKLEEPSYENVVEPLAKIEVELTVVIGMLVFPQHVSPDKELRK 114 (718)
T ss_pred cccCCCCCCCCHhHHhhHHHHHHHHHHHHHHHHHhhhccccccchHHHHhHHHHHHHHHHHHHhhhhhhhhcCCchhhhh
Confidence 444667888999999888 999999999999999877788999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHhhCCHHHHHHHHHHHh--cCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHH
Q 005310 127 SSAEAERKIDAHMLSCSNREDVYRVVKAVAA--TGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSL 204 (703)
Q Consensus 127 aa~e~~~~l~~~~~~l~~~~~ly~~l~~~~~--~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~ 204 (703)
|+.++.+++.++..++.+++++|..+..+++ ..+.+++|++|++++.++++++||++|++++|+++.++++++++++.
T Consensus 115 as~~~~~~~~~~~~r~~~r~di~~~~~~i~e~~~~dslspe~~RylE~~ike~k~nGL~L~~~kr~~ik~ikk~l~~l~~ 194 (718)
T KOG2089|consen 115 ASTEADKKLDEFDLRLSMRKDIYNRFQAIYEKPAKDSLSPEAQRYLEKLIKEGKLNGLHLDDDKREEIKEIKKELSELSI 194 (718)
T ss_pred hhhcchHHHHHHHHHhhhhhhHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHhcCCCCChhHHHHHHHHHHHHHHHHH
Confidence 9999999999999999999999999999886 45789999999999999999999999999999999999999999999
Q ss_pred HHHhhccccc---cceEecccccCCCCHHHHHHHhhccC----CceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC
Q 005310 205 QYVRNLNDDK---TFLLFSEADLLGLPPEFLKSLDKAEN----GIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG 277 (703)
Q Consensus 205 ~f~~n~~~~~---~~~~~~~~eL~glp~~~l~~~~~~~~----g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~ 277 (703)
+|++|+++++ +.+.+|..+|+|+|+++|..++++++ |+|+||+++|+|.|||++|++|++|++||+||..||+
T Consensus 195 ~f~~nvne~~~~~t~litd~~el~glPps~L~~la~~~~~k~sgpwkiTl~~p~~~pvmk~c~~r~tR~~v~~A~~~ra~ 274 (718)
T KOG2089|consen 195 KFSKNVNEDTKKFTFLITDKKELEGLPPSFLQSLAKDEDDKGSGPWKITLKYPHYFPVMKHCKIRQTRKTVWRAYNSRAS 274 (718)
T ss_pred HHHhhhhhcccccceeeecHHHhccCCHHHHHHHhhcccCCCCCceEEEecccchhHHHHhCCcHHHHHHHHHHHhccCC
Confidence 9999999999 45677899999999999999999765 4599999999999999999999999999999999997
Q ss_pred C---chHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH---hcC-
Q 005310 278 G---INLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRK---EEG- 350 (703)
Q Consensus 278 ~---~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~---~~g- 350 (703)
. +|..+|++|+++|.++|++|||.+||||+|..+|++|+++|..||++|.++++|.+.+|...+.++++. ..|
T Consensus 275 ~~~~eNs~ii~~l~~lR~~lAklLGY~t~Ad~~L~~kMa~ss~tv~~fl~dL~~kL~pl~~~e~~v~~elk~~e~k~~g~ 354 (718)
T KOG2089|consen 275 EGDLENSAIIEQLLKLRLELAKLLGYSTYADYSLAMKMAKSSETVVEFLDDLSQKLRPLGIDERSVLLELKKGEAKDRGA 354 (718)
T ss_pred cccccccHHHHHHHHHHHHHHHHhCchhHHHHHHHHHhhcCcHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHhcCC
Confidence 6 799999999999999999999999999999999999999999999999999999999999999998873 344
Q ss_pred --CCCCCccchhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCc
Q 005310 351 --ELPFGIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSND 428 (703)
Q Consensus 351 --~~~l~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~ 428 (703)
...+..||+.||.+++...+|++|+.+|++|||+..|++||+.+++.||||+|+++++.++||+|||+|.|.|..+|.
T Consensus 355 ~~~~~~~~wD~~yy~~~~~e~~f~vd~~~LreyFPl~~v~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~y~v~D~~Sg~ 434 (718)
T KOG2089|consen 355 PFDGKLTAWDLRYYMKRVEESKFDVDQEDLREYFPLPVVLSGLFGIYQTLFGLKFEEATDAEVWHADVRVYTVKDSASGN 434 (718)
T ss_pred CccccchhhHHHHHHHHHHHHhcCCCHHHHHhhCCcHHHHHHHHHHHHHHhCceeeecCCchhcccceeEEeccCCCCCc
Confidence 457999999999999999999999999999999999999999999999999999999999999999999999977799
Q ss_pred eeeeeeecccccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcC
Q 005310 429 LLGYFYLDIYKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRA 508 (703)
Q Consensus 429 ~lG~~ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~ 508 (703)
++|+||+|+|+|+||++|++++.+++++...+|+++.||++++|||++|..++|+||+|+||+|+||||||+||++|+++
T Consensus 435 ~vG~fY~D~y~RegK~gh~~~f~l~~~~~~~~ss~~~PVaalv~nfS~p~~~kpsll~~~ev~t~FheFGh~~q~ll~Qa 514 (718)
T KOG2089|consen 435 PVGYFYLDPYPREGKYGHAAVFGLQPGCLQKDSSRRIPVAALVCNFSKPQSDKPSLLGHDEVETLFHEFGHVLQHLLTQA 514 (718)
T ss_pred eeeEEEeccCCCccccchhhhhccchhhhccCCccccchHHHHHhcCCcccCCCCccchHHHHHHHHHHhHHHHHHHhcC
Confidence 99999999999999999999999999999899999999999999999999999999999999999999999999999999
Q ss_pred CccccCCC-ccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Q 005310 509 SFARFSGL-RVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVI 587 (703)
Q Consensus 509 ~~~~~sg~-~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~ 587 (703)
.|..+||+ +++|||||+||||||||+|++++|..+|+||+| |+|||++++++|+.+|.+++|+.++|||.+|.||+.+
T Consensus 515 ~~~~fsG~~~vewDave~psq~Lenwv~~~d~L~~lS~Hy~t-ge~l~eEl~~kl~~~r~~~~gl~tlrqL~~a~~D~~l 593 (718)
T KOG2089|consen 515 DFARFSGPRNVEWDAVEVPSQFLENWVWDPDTLRSLSKHYKT-GEPLPEELLKKLILTRTVNAGLFTLRQLVLADFDLEL 593 (718)
T ss_pred ccccccCcccCCcchhhchHHHHHHhccCchHhhhhcccccC-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHH
Confidence 99999999 799999999999999999999999999999999 9999999999999999999999999999999999999
Q ss_pred hcCCcccHHHHHHHHhcccccCCCCCCCCCCcccccccc-CcccCCcccHHHHHHHHHHHHHHHHh-cCCCCH-HHHHHH
Q 005310 588 HSADNVDIVELFKHLHPKVMLGLPMLEGTNPASCFPRSA-IGFESACYSRIWSEVFAADIFACKFR-DGLLNQ-RVGMQF 664 (703)
Q Consensus 588 H~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~-~~Y~a~YYsYl~a~vlA~~i~~~~f~-~~~~~~-~~g~~~ 664 (703)
|+....+..+.|++++++ +.++|..++++++|+|+||| +||+|+||+|+||+|+|+|||+++|+ +|+.|. ++|+||
T Consensus 594 ht~~d~~~~~~~~~l~~~-i~~~p~~~~d~~pcsF~hifa~gy~A~yY~yLWsEv~aaDif~t~fe~~g~~N~~~~G~ry 672 (718)
T KOG2089|consen 594 HTKTDADLADTYRQLCQE-ISIVPATPGDNMPCSFGHIFAGGYAAGYYSYLWSEVLAADIFSTFFEQEGEDNIKEVGMRY 672 (718)
T ss_pred hhccccchHHHHHHhchh-heecCCCCCCCCCccccchhcCchHHHHHHHHHHHHHHHHHHHHHhhhcCCccHHHHHHHH
Confidence 999888888999999999 88899999999999999999 58999999999999999999988999 488898 999999
Q ss_pred HHHhccCCCCccHHHHHHHhhCCCCChHHHHHHhh
Q 005310 665 RNNVLAPGGSKEPIEILSDFLGREPSIQAFVDSRA 699 (703)
Q Consensus 665 r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~~ 699 (703)
|++||++||+.+|+|+|++|+||+||.+||++++.
T Consensus 673 R~tiLa~GG~~~~~e~f~~FlGRePS~~Afl~s~g 707 (718)
T KOG2089|consen 673 RNTILAPGGGKDPMEVFKRFLGREPSQEAFLKSLG 707 (718)
T ss_pred HHhhhcCCCCccHHHHHHHhhCCCCChhHHHHhhc
Confidence 99999999999999999999999999999999873
No 4
>PRK10911 oligopeptidase A; Provisional
Probab=100.00 E-value=8.2e-139 Score=1202.87 Aligned_cols=641 Identities=27% Similarity=0.494 Sum_probs=601.5
Q ss_pred CCcccCCChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHHHHH
Q 005310 52 SIVRVNLSASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEA 131 (703)
Q Consensus 52 ~~~~~~~s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~ 131 (703)
+|+...++|++|.++++++|+++++.+++|.+.+ ..+||+|||.|++++.+.++.++..+.++.+||||+++|+++++|
T Consensus 11 ~p~f~~i~~~~~~~~~~~~i~~~~~~i~~i~~~~-~~~t~~n~i~~l~~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~ 89 (680)
T PRK10911 11 LPPFSAIKPEHVVPAVTKALNDCREAVERVVAQG-APYTWENLCQPLAEVDDVLGRIFSPVSHLNSVKNSPELREAYEQT 89 (680)
T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHHHHHHHHcCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHH
Confidence 3444456999999999999999999999997643 346999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHhcC--CCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhh
Q 005310 132 ERKIDAHMLSCSNREDVYRVVKAVAATG--EWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYVRN 209 (703)
Q Consensus 132 ~~~l~~~~~~l~~~~~ly~~l~~~~~~~--~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~~n 209 (703)
.+++++|.+++++|++||++++++.+.. ..|++|++|++++++++|+++|++|++++|+++++|+.+|++|+++|++|
T Consensus 90 ~~~l~~~~~~~~~~~~Ly~~~~~~~~~~~~~~l~~e~~r~l~~~~~~F~~sG~~L~~~~r~~~~~i~~~l~~l~~~F~~n 169 (680)
T PRK10911 90 LPLLSEYSTWVGQHEGLYQAYRDLRDGDHYATLNTAQKKAVDNALRDFELSGIGLPKEKQQRYGEIAARLSELGNQYSNN 169 (680)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHhccccccCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 9999999999999999999999997522 37999999999999999999999999999999999999999999999999
Q ss_pred ccccccce--Ee-cccccCCCCHHHHHHHhhcc-----CCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccCC---
Q 005310 210 LNDDKTFL--LF-SEADLLGLPPEFLKSLDKAE-----NGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCGG--- 278 (703)
Q Consensus 210 ~~~~~~~~--~~-~~~eL~glp~~~l~~~~~~~-----~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~~--- 278 (703)
+.+++... .+ ++++|+|+|+++++.+++.. +| |.|++++|++.|+|++|+|+++||++|+|+.+|++.
T Consensus 170 ~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a~~~~~~g-~~~tl~~p~~~p~L~~~~drelRk~~y~A~~~r~~~~~~ 248 (680)
T PRK10911 170 VLDATMGWTKLITDEAELAGMPESALAAAKAQAEAKEQEG-YLLTLDIPSYLPVMTYCDNQALREEMYRAYSTRASDQGP 248 (680)
T ss_pred hhhhhccceeecCCHHHHCCCCHHHHHHHHHHHHhcCCCC-eEEEeecCcHHHHHhhCCCHHHHHHHHHHHHhhhhhccC
Confidence 99998753 33 67899999999999987632 34 999999999999999999999999999999998853
Q ss_pred -----chHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCC
Q 005310 279 -----INLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELP 353 (703)
Q Consensus 279 -----~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~ 353 (703)
+|.++|.+|+++|+++|++|||+||++|++.++|++||++|.+||+++.++++|.+++|++.|++++++..|.++
T Consensus 249 ~~~~~dN~~il~eil~lR~e~AkLLGy~syAd~~L~~kma~spe~V~~fL~~l~~~~~p~a~~El~~L~~~~k~~~g~~~ 328 (680)
T PRK10911 249 NAGKWDNSEVMEEILALRHELAQLLGFENYADKSLATKMAENPQQVLDFLTDLAKRARPQGEKELAQLRAFAKAEFGVDE 328 (680)
T ss_pred cccccchHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC
Confidence 689999999999999999999999999999999999999999999999999999999999999999988778889
Q ss_pred CCccchhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeee
Q 005310 354 FGIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYF 433 (703)
Q Consensus 354 l~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ 433 (703)
++|||+.||.+++++.+|++|+++++||||++.|++||+.++++||||+|++++++++|||||++|+|+| ++|.+||+|
T Consensus 329 L~pWD~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~gl~~~~~~Lfgi~~~e~~~~~vwh~dV~~~~v~d-~~~~~iG~~ 407 (680)
T PRK10911 329 LQPWDIAYYSEKQKQHLYSISDEQLRPYFPENKAVNGLFEVVKRIYGITAKERKDVDVWHPDVRFFELYD-ENNELRGSF 407 (680)
T ss_pred CChhhHHHHhhHHHHhhcCCCHHHhcccCcHHHHHHHHHHHHHHHcCeeEEecCCCCCCCCcceEEEEEe-CCCCeEEEE
Confidence 9999999999999999999999999999999999999999999999999999888999999999999999 567899999
Q ss_pred eecccccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCcccc
Q 005310 434 YLDIYKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARF 513 (703)
Q Consensus 434 ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~ 513 (703)
|+|+|+|+||++||||+++++++...+|.+|.|+++|+|||++|.+++|+||+|+||+||||||||+||++|++++|+++
T Consensus 408 y~D~~~R~gK~~ga~~~~~~~~~~~~~g~~~~Pv~~l~~Nf~~p~~~~p~LL~~~~v~tlfHEfGHalH~~ls~~~~~~~ 487 (680)
T PRK10911 408 YLDLYARENKRGGAWMDDCVGQMRKADGSLQKPVAYLTCNFNRPVNGKPALFTHDEVITLFHEFGHGLHHMLTRIETAGV 487 (680)
T ss_pred EeeccCCCCCCCCccccccccccccCCCceeCCeEEEECCCCCCCCCCCcccCHHHHHHHHHHHhHHHHHHHhCCCcCcC
Confidence 99999999999999999998877767788899999999999999999999999999999999999999999999999999
Q ss_pred CCCc-cchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCCc
Q 005310 514 SGLR-VDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSADN 592 (703)
Q Consensus 514 sg~~-~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~~ 592 (703)
|||+ ++|||||+|||+||+|||+|+||+.|++||+| |+|||++++++++++|++++|+.+++|+.+|+||+.+|+..+
T Consensus 488 sGt~~~~~D~vE~pS~~~E~~~~~~~vL~~~a~H~~t-geplp~~l~~~l~~~~~~~~~~~~~rql~~a~~D~~~H~~~~ 566 (680)
T PRK10911 488 SGISGVPWDAVELPSQFMENWCWEPEALAFISGHYET-GEPLPKELLDKMLAAKNYQAALFILRQLEFGLFDFRLHAEFD 566 (680)
T ss_pred CCcCCCCchHhhccHHHHHHHhcCHHHHHHHHHHhcC-CCCCCHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHhhccc
Confidence 9986 89999999999999999999999999999999 999999999999999999999999999999999999997532
Q ss_pred ----ccHHHHHHHHhcccccCCCCCCCCCCcccccccc-CcccCCcccHHHHHHHHHHHHHHHHhc-CCCCHHHHHHHHH
Q 005310 593 ----VDIVELFKHLHPKVMLGLPMLEGTNPASCFPRSA-IGFESACYSRIWSEVFAADIFACKFRD-GLLNQRVGMQFRN 666 (703)
Q Consensus 593 ----~~~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~-~~Y~a~YYsYl~a~vlA~~i~~~~f~~-~~~~~~~g~~~r~ 666 (703)
.++.+++++++++ +..+|.++++.|+++|+||| |||+||||||+||+++|+|+| +.|.+ |++|+++|++||+
T Consensus 567 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~F~Hif~gGY~AgYYsYlwa~vla~d~~-~~f~~~g~~~~~~g~~~r~ 644 (680)
T PRK10911 567 PDQGAKILETLAEIKKQ-VAVVPSPSWGRFPHAFSHIFAGGYAAGYYSYLWADVLAADAF-SRFEEEGIFNRETGQSFLD 644 (680)
T ss_pred cccccCHHHHHHHHHHH-cCCCCCCCCCcCCccccccCcCCcccchHHHHHHHHHHHHHH-HHHHhcCCCCHHHHHHHHH
Confidence 2789999999999 76667777788889999999 789999999999999999999 66655 8999999999999
Q ss_pred HhccCCCCccHHHHHHHhhCCCCChHHHHHHh
Q 005310 667 NVLAPGGSKEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 667 ~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
+||++|||++|++++++|+||+|+++||++.+
T Consensus 645 ~iL~~Ggs~~p~~~~~~F~GR~P~~~all~~~ 676 (680)
T PRK10911 645 NILSRGGSEEPMELFKRFRGREPQLDAMLEHY 676 (680)
T ss_pred HHHhCcCCcCHHHHHHHhcCCCCChHHHHHhc
Confidence 99999999999999999999999999999986
No 5
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=7.4e-128 Score=1026.78 Aligned_cols=638 Identities=26% Similarity=0.436 Sum_probs=578.7
Q ss_pred cCCCCCCcccCCChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHH
Q 005310 47 KDLQGSIVRVNLSASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCK 126 (703)
Q Consensus 47 ~~~p~~~~~~~~s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~ 126 (703)
.++.+-| .+.++++|..+.+.++++++++++.+.+-+... .+|..+|++||.+|.++|.++|+|.+|||+++++
T Consensus 42 tGLFgn~--~L~t~eGF~~l~~~a~~~t~eLi~~~~~~~~g~----~ti~~~DeiSd~lC~V~DLaEfvR~aHPd~~fv~ 115 (704)
T KOG2090|consen 42 TGLFGNP--ELSTAEGFNRLPEAALEKTQELIDELLSTPSGP----RTIQIFDEISDTLCRVADLAEFVRQAHPDPEFVE 115 (704)
T ss_pred ccccCCc--ccCChhHHHHHHHHHHHHHHHHHHHHhcCCCCc----hHHHHHHHHhHHHHHHHHHHHHHHHhCCCHHHHH
Confidence 3455544 888999999999999999999999998843332 3589999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCC---CCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHH
Q 005310 127 SSAEAERKIDAHMLSCSNREDVYRVVKAVAATGE---WVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELS 203 (703)
Q Consensus 127 aa~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~---~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~ 203 (703)
||++|+..|.++++.||+|..||+++|.+.++.+ .+++|.. +.+.++.|||++||||++++|+++++|+.+|..|+
T Consensus 116 aAe~a~~~~~e~ve~LNTn~~LY~~Lk~~l~~~~~l~~~d~e~~-v~~lll~DFE~sGIhL~~~kr~kfv~Ls~eI~~lg 194 (704)
T KOG2090|consen 116 AAEEACRSMFELVESLNTNVALYQKLKKVLQDSSRLDDLDPETY-VARLLLDDFEKSGIHLDPEKREKFVQLSSEIFDLG 194 (704)
T ss_pred HHHHHHHHHHHHHHHhccCHHHHHHHHHHhcCcccccccCHHHH-HHHHHHHHHHhhcccCCHHHHHHHHHHhHHHHHHH
Confidence 9999999999999999999999999999877544 3466665 99999999999999999999999999999999999
Q ss_pred HHHHhhccccc-cceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccCCchHH
Q 005310 204 LQYVRNLNDDK-TFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCGGINLS 282 (703)
Q Consensus 204 ~~f~~n~~~~~-~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~~~n~~ 282 (703)
.+|.+|...-. +.+..+..-|-+.-+..++...+ ++.+...+.....++..|.|+.+||++|.++.+.. ..+..
T Consensus 195 ~~F~~n~~~~P~~~l~~s~~~l~~~~~~~~~~k~k----nF~~~t~~~~i~~ll~~~~d~~vRk~vY~~~~~~~-~~q~~ 269 (704)
T KOG2090|consen 195 REFQNNTDRFPDNKLPNSLERLPFSKKNFIESKRK----NFDQATDPYIIYGLLSSSEDESVRKMVYNTFHSPS-DIQVK 269 (704)
T ss_pred HHHHhccccCCccccchhhhhccCcchhhHHHHhc----CCCcCCCCeeeehhhhcCCCHHHHHHHHHHhcCCc-HHHHH
Confidence 99999997533 21211222232222333222221 24555455556679999999999999999998644 78999
Q ss_pred HHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcC--CCCCCccchh
Q 005310 283 VLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEG--ELPFGIEDLL 360 (703)
Q Consensus 283 ~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g--~~~l~pWD~~ 360 (703)
+|+.++..|+++|++.||.||||++++++|+++|+.|.+||+.|.++++|.+.+|++.|..+|+++.+ ..++.|||.+
T Consensus 270 ~L~~Li~sRheLa~l~G~~SfA~~al~~~~a~~pk~V~~Fl~~Ls~k~~~~~~kel~~i~~mk~k~~~~~~~e~~~WD~~ 349 (704)
T KOG2090|consen 270 LLEHLISSRHELAKLVGKSSFAHRALEGKLAKNPKTVRSFLEELSEKLSEKTDKELAVIRDMKKKENNNNNAEIEPWDRP 349 (704)
T ss_pred HHHHHHHHHHHHHHHhCcchHHHHHhhccccCChHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhcCCCCCccccccch
Confidence 99999999999999999999999999999999999999999999999999999999999999988754 5689999999
Q ss_pred hhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEe--cccccccccceEEEEEecCCCceeeeeeeccc
Q 005310 361 YYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEV--LDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIY 438 (703)
Q Consensus 361 y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~--~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~ 438 (703)
||..++++..++++...+.+||+++.|++|+..++++||||+|.+. .+||+|||||++..|.++ ++..+|+||+|+|
T Consensus 350 YYT~~~r~~~~~~~~~~~~~fFslg~~ieGLs~L~~~LyGirl~~~~l~pGE~WhpdV~KL~vv~E-~eg~lG~IY~Dlf 428 (704)
T KOG2090|consen 350 YYTSMYRQSNNSLNPSIYSPFFSLGSCIEGLSTLFQRLYGIRLIPEPLAPGEVWHPDVRKLNVVHE-QEGLLGYIYCDLF 428 (704)
T ss_pred hhhhHhhcccCCCCcceeeccccHHHHHHHHHHHHHHHhCeeeeccCCCCccccCchhhhheeecC-CCCceeEEEEEee
Confidence 9999999999999999999999999999999999999999999986 478999999999999995 4559999999999
Q ss_pred ccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCcc
Q 005310 439 KREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRV 518 (703)
Q Consensus 439 ~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~ 518 (703)
+|+||..|+|+|+||||+..+||+||+|+++|+|||..+....|++|++.+|+|||||||||||+||++|+|||++||||
T Consensus 429 ~R~gK~~g~aHFTIr~sr~l~Dg~yQlPVi~L~cnf~rss~~s~t~L~~~~vetLFHEmGHAMHSmLGrT~YQhvtGTRc 508 (704)
T KOG2090|consen 429 ERPGKTVGDAHFTIRGSRQLSDGTYQLPVIVLVCNFVRSSQSSPTFLSLSEVETLFHEMGHAMHSMLGRTHYQHVTGTRC 508 (704)
T ss_pred ccCCCCCCCceEEeeccccCCCCCeeceeeEEeecccccccCCCcccCHHHHHHHHHHHHHHHHHHhccchhccccCccc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred chhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcC-Cc---cc
Q 005310 519 DPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSA-DN---VD 594 (703)
Q Consensus 519 ~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~-~~---~~ 594 (703)
|+||+|+||.+||+|++|+.||..|++||+| ++++|.+++.++..+++++++.++++|++||++||.+|+. .+ .+
T Consensus 509 ~tDfaEiPSiLMEyFa~D~rVl~~~aRhy~t-~e~l~~~mv~~l~~s~n~~Aa~e~q~Qv~ya~~Dq~fhg~~~~~~~~~ 587 (704)
T KOG2090|consen 509 PTDFAEIPSILMEYFANDYRVLRFFARHYST-GEPLPEDMVNRLCESRNSFAAQETQRQVFYALLDQEFHGIACPLIAED 587 (704)
T ss_pred chhHhhhhHHHHHHHhcchHHHHHHHHHcCC-CCCCCHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHhcccccccccc
Confidence 9999999999999999999999999999999 9999999999999999999999999999999999999994 23 37
Q ss_pred HHHHHHHHhcccccCCCCCCCCCCccccccccCcccCCcccHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCCC
Q 005310 595 IVELFKHLHPKVMLGLPMLEGTNPASCFPRSAIGFESACYSRIWSEVFAADIFACKFRDGLLNQRVGMQFRNNVLAPGGS 674 (703)
Q Consensus 595 ~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs 674 (703)
..+++.++.++ +.+++..+++.|+.+|+|++ ||||+||||++++++|.-||++.|++||+++++|++||++||++||+
T Consensus 588 ~~~~~~~v~~k-~~~~~~~~~taw~~rFsHl~-gYGA~YYSYL~~r~~AS~IWq~~Fe~dPfsR~aGek~r~eil~hGG~ 665 (704)
T KOG2090|consen 588 TTDLLSEVKRK-FSGLLYVPPTAWQLRFSHLV-GYGATYYSYLFARAIASLIWQQLFENDPFSRKAGEKFRKEILKHGGG 665 (704)
T ss_pred hhHHHHHHHHh-cCCCCCCCCCccccchhhhh-ccCchHHHHHHHHHHHHHHHHHHHhcCccchhhhHHHHHHHHHhcCC
Confidence 88999999999 89988888999999999998 99999999999999999999999999999999999999999999999
Q ss_pred ccHHHHHHHhhCCCCChHHHHHHhhh
Q 005310 675 KEPIEILSDFLGREPSIQAFVDSRAE 700 (703)
Q Consensus 675 ~~~~ell~~flGr~p~~~a~l~~~~~ 700 (703)
++|+++++++||++|..+.+..+..+
T Consensus 666 ~~P~~lva~~L~~~~~~~g~~~Al~~ 691 (704)
T KOG2090|consen 666 RDPAELVADILGKPPLENGGVDALSK 691 (704)
T ss_pred CChHHHHHHHhcCCCCccchHHHHHH
Confidence 99999999999999999887776543
No 6
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, whose members show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both nuclear- and mitochondrially encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=100.00 E-value=2.2e-105 Score=892.78 Aligned_cols=436 Identities=30% Similarity=0.522 Sum_probs=413.9
Q ss_pred HHhcCCCHHHHHHHHHHHhcccCCchHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHH
Q 005310 255 VLELCKVGRTRRLVAVAYGKRCGGINLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLA 334 (703)
Q Consensus 255 ~l~~~~d~~~Rk~~~~a~~~r~~~~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~ 334 (703)
+|++|+|+++||++|.|+.++. .+|.++|.+|+++|+++|++|||+||++|++.++|++||++|.+||+++.++++|.+
T Consensus 2 ~l~~~~d~~~Rk~a~~a~~~~~-~~n~~il~~l~~lR~e~A~llGy~sya~~~l~~~ma~spe~v~~fL~~l~~~~~p~~ 80 (458)
T cd06457 2 LLASVPNESVRELVYRAFHSSS-AEQLELLEELLSSRAELAQLLGFESYAHRALRGKMAKNPENVMEFLTELSEKLRPRA 80 (458)
T ss_pred ccccCCCHHHHHHHHHHHhhhh-hhHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHH
Confidence 7899999999999999999987 899999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhcC--CCCCCccchhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecc--cc
Q 005310 335 SRELMMLKDLKRKEEG--ELPFGIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLD--AV 410 (703)
Q Consensus 335 ~~el~~L~~~k~~~~g--~~~l~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~--~e 410 (703)
++|++.|++++++..| .+++.|||+.||.+++++++|++|++++++|||++.|++||+.++++||||+|+++++ ++
T Consensus 81 ~~e~~~L~~~~~~~~~~~~~~L~pwD~~yy~~~~~~~~~~~d~~~l~~YFpl~~vl~gl~~~~~~lfgi~~~~~~~~~~~ 160 (458)
T cd06457 81 EEELAVLRDMKRKVLGKPLSSLAPWDRDYYTGQYRQSRFDSEPSNLSPYFSLGTVMEGLSRLFSRLYGIRLVPVPLAPGE 160 (458)
T ss_pred HHHHHHHHHHHHHhcCCCcCCCCHHHHHHHHHHHHHhhcCCChHHhcccCcHHHHHHHHHHHHHHHhCeEEEecCCCCCC
Confidence 9999999999987754 5679999999999999999999999999999999999999999999999999999864 68
Q ss_pred cccccceEEEEEecCCCceeeeeeecccccCCCcCCccccccCCCcccCC----------CCccccEEEEeccCCCCCCC
Q 005310 411 VWHSDVRVFSVFDLSSNDLLGYFYLDIYKREGKYVHTCVVALQNGALSST----------GARQIPVALLISQIQKDIVG 480 (703)
Q Consensus 411 ~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~gK~~ga~~~~l~~~~~~~~----------g~~q~P~~~l~~Nf~~~~~~ 480 (703)
+|||||++|+|+| ++|.+||+||+|+|+|+||++||||+++++++...+ |.+|.|+++|+|||++|+++
T Consensus 161 ~Wh~dV~~~~v~d-~~~~~lG~~YlDl~~R~~K~~~a~~~~~~~~~~~~~~~~~~~~~~~g~~q~Pv~~lvcnf~~p~~~ 239 (458)
T cd06457 161 VWHPDVRKLDVVH-EDEGLLGVIYCDLFSRPGKPPGAAHFTIRCSRRLDDDDVAERGGRGGTYQLPVVALMCNFPPPSPS 239 (458)
T ss_pred CcCccceEEEEEe-CCCCEEEEEEeecCCCCCCCCCcceeccccccccCcccccccccCCCceeCCeEEEECCCCCCCCC
Confidence 9999999999999 467799999999999999999999999999987655 78899999999999999999
Q ss_pred CCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHH
Q 005310 481 HPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICK 560 (703)
Q Consensus 481 ~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~ 560 (703)
+|+||+|+||.||||||||+||++|++++|++++||++++||||+|||+||+||||+++|+.||+||+| +++||+++++
T Consensus 240 ~p~lL~~~~v~TLfHEfGHalH~~ls~~~~~~~sgt~~~~d~vE~pS~~~E~~~~~~~~L~~~a~h~~t-~e~ip~~l~~ 318 (458)
T cd06457 240 GPTLLSPHEVETLFHEMGHAMHSMLGRTEYQHVSGTRCATDFVEVPSILMEYFASDPRVLKLFARHYST-GEPLPEEMLA 318 (458)
T ss_pred CCCCcCHHHHHHHHHHHhHHHHHHHcCCCccccCCCCCCcchhhcCHHHHHHHHhhHHHHHHHhcccCC-CCcCcHHHHH
Confidence 999999999999999999999999999999999999999999999999999999999999999999999 9999999999
Q ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHhcCCcc---cHHHHHHHHhcccccCCCCCCC-CCCccccccccCcccCCcccH
Q 005310 561 SLKRRRYAFSALKLKQEILYCIFDQVIHSADNV---DIVELFKHLHPKVMLGLPMLEG-TNPASCFPRSAIGFESACYSR 636 (703)
Q Consensus 561 ~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~~~---~~~~l~~~l~~~~~~~~~~~~~-~~~~~~f~Hl~~~Y~a~YYsY 636 (703)
+++++|+++.|+.+++|+.+|+||+.+|...+. ++.++|.++.++ +.++|.+++ +.|+++|+||+ ||+|+||||
T Consensus 319 ~l~~~~~~~~~~~~~rq~~~a~fD~~~H~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~f~Hl~-gy~a~YYsY 396 (458)
T cd06457 319 RLLASKNSFAALETQQQIVYALLDQELHGEQPLSPTFTSDVLRDSTEI-FYGLPYVPGGTAWQLRFGHLV-GYGATYYSY 396 (458)
T ss_pred HHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCcccccCHHHHHHHHHHH-hcCCCCCCCCCCCcccccccc-CccccchHH
Confidence 999999999999999999999999999976442 567999999999 888887776 67778999998 899999999
Q ss_pred HHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHH
Q 005310 637 IWSEVFAADIFACKFRDGLLNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFV 695 (703)
Q Consensus 637 l~a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l 695 (703)
+||+++|+|+|++.|++|++|+++|++||++||++|||++|++++++||||+|+++...
T Consensus 397 l~a~vla~di~~~~f~~~~~n~~~g~~y~~~iL~~Ggs~~p~e~l~~flGrdp~~~~~~ 455 (458)
T cd06457 397 LFDRAIASKIWQKLFAADPLSREAGERLREELLKHGGGKDPWELLAGVLGKPPLVKGGA 455 (458)
T ss_pred HHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHccCCCCcCHHHHHHHHcCCCCCCCccc
Confidence 99999999999878999999999999999999999999999999999999999998764
No 7
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=100.00 E-value=7.9e-101 Score=860.69 Aligned_cols=467 Identities=45% Similarity=0.785 Sum_probs=440.9
Q ss_pred HHHHHHHHHHHHHHhhccccccceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHh
Q 005310 194 RLRDQIDELSLQYVRNLNDDKTFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYG 273 (703)
Q Consensus 194 ~l~~~i~~l~~~f~~n~~~~~~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~ 273 (703)
+|+.+|..|+++|++|+.+ +++++.++|++|+|+++||++|+++.
T Consensus 1 ~~~~~~~~~~~~f~~~~~~-----------------------------------~~~~~~~~l~~~~d~~~Rk~~~~a~~ 45 (472)
T cd06455 1 ALKKELSELCQEFSKNLNE-----------------------------------KYPDYFPVMKYAKNAETRKAMWVAFQ 45 (472)
T ss_pred ChhHHHHHHHHHHHHhhhh-----------------------------------cCccHHHHHhcCCCHHHHHHHHHHHH
Confidence 3688999999999999853 56889999999999999999999999
Q ss_pred cccCCchHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCC-
Q 005310 274 KRCGGINLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGEL- 352 (703)
Q Consensus 274 ~r~~~~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~- 352 (703)
+++..+|.++|.+++++|+++|+++||+||+++++.++|++||++|.+||+.+.+.++|.+++|++.|.++|++..|.+
T Consensus 46 ~~~~~~n~~~l~~ll~~R~~~A~l~Gy~s~a~~~l~~~m~~s~e~v~~fl~~l~~~~~p~~~~e~~~l~~~k~~~~~~~~ 125 (472)
T cd06455 46 NRGGPENVPLLEELVALRHELARLLGYKSHADYVLEDRMAKSPETVRDFLEDLSKKLKPLAEKELAELLELKKKEVPEAG 125 (472)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Confidence 9955899999999999999999999999999999999999999999999999999999999999999999998877765
Q ss_pred -CCCccchhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceee
Q 005310 353 -PFGIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLG 431 (703)
Q Consensus 353 -~l~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG 431 (703)
++.|||+.||.++++++++++|+.++++|||++.|++|++.+++++||++|+++...++|||||++|+|+|+.+|.++|
T Consensus 126 ~~l~~wD~~y~~~~~~~~~~~~~~~~~~~yf~~~~~~~~i~~~~~~lfg~~~~~~~~~~~w~~dv~~~~v~d~~~~~~~g 205 (472)
T cd06455 126 DRIYPWDLAYYMERVEEEKYDVDQEKIREYFPLEVVIEGMLDIYQRLFGLRFEEVPDASVWHEDVRLYSVWDADTGEFLG 205 (472)
T ss_pred CCCCHhhHHHHHHHHHHHhcCCCHHHHhccCcHHHHHHHHHHHHHHHhCeEEEeCCCCCCCCCcceEEEEEECCCCCEEE
Confidence 6999999999999999889999999999999999999999999999999999998779999999999999965578999
Q ss_pred eeeecccccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCcc
Q 005310 432 YFYLDIYKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFA 511 (703)
Q Consensus 432 ~~ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~ 511 (703)
++|||+++|+||++||||++++++....+|.+|.|+++|+|||++|.+++|+||++++|.||||||||+||++|++++++
T Consensus 206 ~~ylD~~~R~gK~~Ga~~~~~~~~~~~~~g~~~~P~~~i~~Nf~~~~~~~p~ll~~~~V~TLfHEfGHalH~~ls~~~~~ 285 (472)
T cd06455 206 YFYLDLHPREGKYGHAANFGLQPGFLLPDGSRQYPVAALVCNFPKPTADKPSLLRHDEVETFFHEFGHVIHHLLGRTKYA 285 (472)
T ss_pred EEEeecCCCCCCCCCccccccccceecCCCCEeCCEEEEECcCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHhcCCCcc
Confidence 99999999999999999999999887778888999999999999999999999999999999999999999999999999
Q ss_pred ccCCCccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCC
Q 005310 512 RFSGLRVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSAD 591 (703)
Q Consensus 512 ~~sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~ 591 (703)
+++|+++++||||+|||+||+|+|||++|..+++|++| +++||++++++++++++++.++++++|+++|+||+.+|...
T Consensus 286 ~~sg~~~~~d~aE~pS~~~E~~~~~~~~l~~l~~h~~t-~e~i~~~li~~~~~~~~~~~~~~~~~q~~~a~fd~~~h~~~ 364 (472)
T cd06455 286 RFSGTRVERDFVEAPSQMLENWCWEPEVLKRLSKHYKT-GEKIPDELIERLIASRHFNRGLFYLRQLFFALFDLALHTGD 364 (472)
T ss_pred ccCCCcCChhhhhcchHHHHHHhcCHHHHHHHhhccCC-CCCCCHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCC
Confidence 99999999999999999999999999999999999999 99999999999999999999999999999999999999876
Q ss_pred c--ccHHHHHHHHhcccccCCCCCCCCCCccccccccCcccCCcccHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhc
Q 005310 592 N--VDIVELFKHLHPKVMLGLPMLEGTNPASCFPRSAIGFESACYSRIWSEVFAADIFACKFRDGLLNQRVGMQFRNNVL 669 (703)
Q Consensus 592 ~--~~~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL 669 (703)
+ .++.++|.++.++ +.|.+..+++.|+++|+|++++|+++||+|+||+++|+++|++.+++|++|+++|++|+++||
T Consensus 365 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~f~H~~~~Y~a~yY~Y~~a~~la~~~~~~~~~~~~~~~~~g~~y~~~~L 443 (472)
T cd06455 365 PADLDTTKLYNDLREE-ISLIPSTEGTHGPASFGHLAGGYDAGYYGYLWSEVFAADMFSSFFKDGLLNPEVGLRYRDTVL 443 (472)
T ss_pred CCCCCHHHHHHHHHHH-hCCCCCCCCCCccccccccccCcccchHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHh
Confidence 5 3899999999999 888777666677789999996688899999999999999996656668899999999998999
Q ss_pred cCCCCccHHHHHHHhhCCCCChHHHHHH
Q 005310 670 APGGSKEPIEILSDFLGREPSIQAFVDS 697 (703)
Q Consensus 670 ~~Ggs~~~~ell~~flGr~p~~~a~l~~ 697 (703)
++|||++|++++++||||+|+.++|++.
T Consensus 444 ~~Ggs~~p~ell~~flGr~~~~~a~~~~ 471 (472)
T cd06455 444 APGGSKDAADMLKDFLGREPNNDAFLKS 471 (472)
T ss_pred cCcCCcCHHHHHHHHhCCCCChhHHhhc
Confidence 9999999999999999999999999985
No 8
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=100.00 E-value=4.1e-100 Score=838.85 Aligned_cols=415 Identities=34% Similarity=0.630 Sum_probs=393.4
Q ss_pred chHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccc
Q 005310 279 INLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIED 358 (703)
Q Consensus 279 ~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD 358 (703)
+|.++|++|+++|+++|++|||+||++|++.++|++||++|.+||++|.++++|.+++|++.|.+++++..+...+.|||
T Consensus 1 ~N~~~l~~l~~lR~e~A~llG~~s~a~~~l~~~ma~sp~~v~~Fl~~l~~~~~p~~~~e~~~L~~~~~~~~~~~~l~~wD 80 (422)
T cd06456 1 DNRPLIEEILALRAEKAKLLGFENYAEYSLADKMAKSPEAVLEFLEDLAPKAKPQAKKELAELQAFAKEEGGEDELEPWD 80 (422)
T ss_pred ChHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCHHH
Confidence 58999999999999999999999999999999999999999999999999999999999999999999887777899999
Q ss_pred hhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeeccc
Q 005310 359 LLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIY 438 (703)
Q Consensus 359 ~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~ 438 (703)
+.||.++.++.++++|++++++|||++.|++|++.++++|||++|++++..++|||||++|+|+| ++|.+||+||+|+|
T Consensus 81 ~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~gl~~~~~~lfgi~~~~~~~~~~Wh~dV~~~~v~d-~~~~~lG~~ylDl~ 159 (422)
T cd06456 81 WAYYSEKLRKEKYDLDEEELRPYFPLEKVLDGLFELAERLYGITFKERTDLPVWHPDVRVYEVFD-KDGSHIGLFYLDLY 159 (422)
T ss_pred HHHHHHHHHHHhcCCCHHHHhccCCHHHHHHHHHHHHHHHcCeeEEECCCCCCCCCCceEEEEEe-CCCCeEEEEEEecc
Confidence 99999999999999999999999999999999999999999999999988899999999999999 46679999999999
Q ss_pred ccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCcc
Q 005310 439 KREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRV 518 (703)
Q Consensus 439 ~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~ 518 (703)
+|+||++||||++++++.. +|.+|.|+++++|||++|.++.|+||++++|.|||||||||||++|++++|++++|+++
T Consensus 160 ~R~~K~~ga~~~~~~~~~~--~~~~~~P~~~l~~nf~~~~~~~p~lL~~~~v~tLfHEfGHalH~~ls~~~~~~l~~~~~ 237 (422)
T cd06456 160 AREGKRGGAWMNNLRSQSK--NGLGQKPVAYLVCNFTKPAGGKPALLTHDEVTTLFHEFGHALHHLLTDVEYPSLGGTNV 237 (422)
T ss_pred CCCCCCCCceeeccccccc--CCCCCCCEEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhcCCccccCCCcC
Confidence 9999999999999988764 67889999999999999999999999999999999999999999999999999999999
Q ss_pred chhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCCc----cc
Q 005310 519 DPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSADN----VD 594 (703)
Q Consensus 519 ~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~~----~~ 594 (703)
++||||+|||+||+|+|||++|+.+++||+| |+|||++++++++++++++.|+.+++|+.+|+||+.+|+..+ ..
T Consensus 238 ~~d~~E~pS~~~E~~~~d~~vL~~~s~h~~t-~~~lp~~l~~~~~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~ 316 (422)
T cd06456 238 EWDFVELPSQFMENWAWEPEVLKLFAKHYET-GEPLPDELIDKLLAARNFNSGFATVRQLEFALLDLALHSLTDPEILDV 316 (422)
T ss_pred chhHhhccHHHHHHHhcCHHHHHHHHhhcCC-CCCCCHHHHHHHHHHhhHhHHHHHHHHHHHHHHHHHHhcCCcccccch
Confidence 9999999999999999999999999999999 999999999999999999999999999999999999998653 35
Q ss_pred HHHHHHHHhcccccCCCCCCCCCCcccccccc-CcccCCcccHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCC
Q 005310 595 IVELFKHLHPKVMLGLPMLEGTNPASCFPRSA-IGFESACYSRIWSEVFAADIFACKFRDGLLNQRVGMQFRNNVLAPGG 673 (703)
Q Consensus 595 ~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~-~~Y~a~YYsYl~a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Gg 673 (703)
+.+++.++.++ +.++|..+++.|+++|+||+ +||+|+||||+||+++|+|||...+++|++|+++|++||++||++||
T Consensus 317 ~~~~~~~~~~~-~~~~~~~~~~~~~~~f~Hi~~~gY~A~YYsYlws~vla~di~~~f~~~~~~~~~~G~~~r~~iL~~Gg 395 (422)
T cd06456 317 VQFELDALRKE-GLVIPPPPPRYFSNYFSHIFSGGYAAGYYSYKWAEVLDADAFSAFEEEGIFNRETGRRFRDTILSKGG 395 (422)
T ss_pred HHHHHHHHHHh-cCCCCCCCCCCCCCccCcCCCCCcchhhHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhcCC
Confidence 77888888888 77778777888889999999 78999999999999999999944444588999999999999999999
Q ss_pred CccHHHHHHHhhCCCCChHHHHHHh
Q 005310 674 SKEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 674 s~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
|++|.+++++|+||+||++||++.+
T Consensus 396 s~~~~e~~~~F~Gr~p~~~a~l~~~ 420 (422)
T cd06456 396 SRDPMELFRAFRGRDPSIEALLRRR 420 (422)
T ss_pred CcCHHHHHHHhcCCCCChHHHHHHc
Confidence 9999999999999999999999986
No 9
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. 
In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium-sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=100.00 E-value=3.1e-92 Score=792.54 Aligned_cols=440 Identities=38% Similarity=0.669 Sum_probs=400.2
Q ss_pred HHhcCCCHHHHHHHHHHHhcccCC-----chHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHH
Q 005310 255 VLELCKVGRTRRLVAVAYGKRCGG-----INLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISAS 329 (703)
Q Consensus 255 ~l~~~~d~~~Rk~~~~a~~~r~~~-----~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~ 329 (703)
||++|+|+++||++|+++.+++.. +|..+|.+++++|+++|+++||+||+++++.++|+++|++|.+||.++.++
T Consensus 1 ~l~~~~d~~~Rk~~~~a~~~~~~~~~~~~~n~~~l~~l~~~R~~~A~llGy~s~~~~~l~~~m~~~~~~v~~fL~~~~~~ 80 (458)
T PF01432_consen 1 VLKYSPDRELRKKAYRAYYSRYSDNQPTLDNAAILNELLKLRNELAKLLGYPSYAEYSLQDKMAKNPENVLDFLDELVKK 80 (458)
T ss_dssp HHHC-S-HHHHHHHHHHHHTTTHHTHHHHHHHHHHHHHHHHHHHHHHHTT-SSHHHHHHTTSTTTSHHHHHHHHHHHHHH
T ss_pred CCCcCcCHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHhhhcCChHHHHHHHHHHHHH
Confidence 689999999999999999998732 789999999999999999999999999999999999999999999999999
Q ss_pred hHHHHHHHHHHHHHHHHHhcCCC-CCCccchhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecc
Q 005310 330 LTDLASRELMMLKDLKRKEEGEL-PFGIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLD 408 (703)
Q Consensus 330 ~~p~~~~el~~L~~~k~~~~g~~-~l~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~ 408 (703)
++|.+++|++.+++++++..|.+ +++|||++||.+++++..++++..++++|||++.|++|++.+++++|||+|+++.+
T Consensus 81 ~~p~~~~~~~~l~~~~~~~~g~~~~l~~wD~~y~~~~~~~~~~~~~~~~~~~~f~~~~~~~g~~~l~~~lfgi~~~~~~~ 160 (458)
T PF01432_consen 81 LKPLLERELELLKKLKKKRLGLEKKLRPWDVAYYMEQYRQERYDLDEEELSPYFPLEEVLEGLFELAERLFGIRFEEVPD 160 (458)
T ss_dssp HHHHHHHHHHHHHHHHHHHTT-SSSBBGGGHHHHHHHHHHHHTSSSHHHHGGG-BHHHHHHHHHHHHHHHHTEEEEECTC
T ss_pred HHHHHHHHHHHHHHHHHHhcCCccccccchhHHHhhHHHHHHhccchhhcCCcCcHHHHHHHHHHHHHHHhcEEEEeccc
Confidence 99999999999999999888887 99999999999999999999999999999999999999999999999999999976
Q ss_pred c--ccccccceEEEEEec-CCCceeeeeeecccccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCcc
Q 005310 409 A--VVWHSDVRVFSVFDL-SSNDLLGYFYLDIYKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLL 485 (703)
Q Consensus 409 ~--e~W~~dV~~~~v~d~-~~g~~lG~~ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL 485 (703)
. ++|||||++|+|+|. .++.++|++|+|+++|+||++||||+++++++ .+|.+|+|+++++|||++|.++.|+||
T Consensus 161 ~~~~~w~~dv~~~~v~d~~~~~~~ig~~ylDl~~R~gK~~ga~~~~~~~~~--~~~~~~~~~~~i~~n~~~~~~~~~~ll 238 (458)
T PF01432_consen 161 ADGEVWHPDVRKFEVWDEIFSGMFIGYIYLDLYPRPGKRSGAFCFTLRPSR--SDGERQLPVPYIFCNFTGPSAGKPSLL 238 (458)
T ss_dssp GGHHHSSTT-EEEEEEETHTTHCECEEEEEEES--TTS-SS-EEEEEEC-B--TTSTCECEEEEEEEEE-S-BTTC--B-
T ss_pred ccccceecceeEEEEeehhhcccchhcccccchhcCCCCCCceeCCccCcc--ccccCCCCceEEEecCCCCCCCCCCcc
Confidence 5 899999999999993 25778999999999999999999999999988 689999999999999999999999999
Q ss_pred ChhhHHHHHHHHHHHHHHhhhcCCccccCCCccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHH
Q 005310 486 RFSEVVNFFHEFGHVVQQICNRASFARFSGLRVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRR 565 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~ 565 (703)
++++|.|||||||||||++|+++++++++|+++++||||+|||+||+|+||+.++..+++|+++ +++||++++++++++
T Consensus 239 ~~~~v~tLfHE~GHa~H~~ls~~~~~~~sg~~~~~d~aE~~S~~~E~~~~~~~~l~~~~~~~~~-~~~i~~~~l~~~~~~ 317 (458)
T PF01432_consen 239 SHDDVETLFHEFGHAMHSLLSRTKYQHLSGTRVPMDFAEFPSQFMENWLWDPLVLKAFSRHYET-GEPIPEELLEDLIAS 317 (458)
T ss_dssp SHHHHHHHHHHHHHHHHHHHCCCSSGGGSTTSS-CHHCHHHHHHHHHHGGCHHHHHHH-BSTTT-HHHHHHHHHHHHHHT
T ss_pred ChhhHHHHHHHHhHHHHHHHhccccccccCCchhHHHHhcchHHHHHhhhchhhhhhhccChhh-hhhHHHHHHHhhhhh
Confidence 9999999999999999999999999999999999999999999999999999999999999999 999999999999999
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHhcCCcc------cHHHHHHHHhcccccCCCCCCCCCCcccccccc-CcccCCcccHHH
Q 005310 566 RYAFSALKLKQEILYCIFDQVIHSADNV------DIVELFKHLHPKVMLGLPMLEGTNPASCFPRSA-IGFESACYSRIW 638 (703)
Q Consensus 566 ~~~~~~~~~~~ql~~a~~D~~~H~~~~~------~~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~-~~Y~a~YYsYl~ 638 (703)
|+.+.++.+.+|+.++.||+.+|...+. ++.++|.++.++ +.+.+.+++..|+++|+|++ ++|+++||+|+|
T Consensus 318 ~~~~~~~~~~~ql~~a~fd~~~h~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~~~~f~hl~~~~Y~a~yY~Y~y 396 (458)
T PF01432_consen 318 RNFFAAIFLFRQLLFALFDQELHESPEDGEPESEDLNELYRELQKE-YYGDPSDPDDYFPASFSHLFSHFYAAGYYSYLY 396 (458)
T ss_dssp TTTTHHHHHHHHHHHHHHHHHHCCHCCCTHHHHHHHHHHHHHHHHH-CCTTTCBTTCCGGGGGGGGHCCCGTTTTTCHHH
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHhccccCCccchhHHHHHHHHHHH-hcCcchhccccccceehhhccCccCCCCchhHH
Confidence 9999999999999999999999987652 578999999999 88988888888888999997 689999999999
Q ss_pred HHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHHHHh
Q 005310 639 SEVFAADIFACKFRDGLLNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 639 a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
|+++|+++|++.+++|++|+++|.+|++++|++|||++|.+++++|+||+|++++|++.+
T Consensus 397 a~~~a~~~~~~~~~~~~~n~~~g~~~~~~~L~~Ggs~~~~e~l~~~~g~~~~~~~~~~~~ 456 (458)
T PF01432_consen 397 AEVLAADLFSQFFEEDPLNRETGRRFYKEFLSPGGSKDPLELLKKFLGREPSPDAFLKAL 456 (458)
T ss_dssp HHHHHHHHHHHHHHCHTTCHHHHHHHHHHHCTTTTSS-HHHHHHHCCSSTTSSHHHHHHH
T ss_pred HHHHHHHHHHHHHhcCCcchHHHHHHHHHHhcCCCCCCHHHHHHHhCCCCCChHHHHHHc
Confidence 999999999666666899999999999999999999999999999999999999999876
No 10
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=100.00 E-value=2.7e-66 Score=566.10 Aligned_cols=357 Identities=28% Similarity=0.451 Sum_probs=327.0
Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccchhh
Q 005310 282 SVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIEDLLY 361 (703)
Q Consensus 282 ~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD~~y 361 (703)
++|++|+++|+++|+++||+||+||.+.++|++||++|.+|+++|.+.++|.++++++.+++++++..+...+.||
T Consensus 2 ~~l~~lv~lr~e~A~~lGy~~~~d~~l~~~~~~~~e~v~~f~~~l~~~~~p~~~~~~~~l~~~~~~~~~~~~~~~~---- 77 (365)
T cd06258 2 ALLEELVSLRNQLARLLGYENFADYKLALQEAKSPETVEGFFEELKRKLRPLLAKLREEISAAKQKEEEIYGELPA---- 77 (365)
T ss_pred hHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccc----
Confidence 5789999999999999999999999999999999999999999999999999999999999887665555567888
Q ss_pred hHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccC
Q 005310 362 YVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKRE 441 (703)
Q Consensus 362 ~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~ 441 (703)
.+++|+..+++|||++.|++|++.++ |+|+++|+
T Consensus 78 --------~~~~~~~~~~~~f~~~~~~~~~~~~~--------------------------------------~lD~~~R~ 111 (365)
T cd06258 78 --------RYDVDSALLKEFFDAERPWEGALPFF--------------------------------------YLDLYDRK 111 (365)
T ss_pred --------cccCCHhhccCcCChHHHHHHHHHhH--------------------------------------hccCCCCC
Confidence 46889999999999999999987653 89999999
Q ss_pred CCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCccchh
Q 005310 442 GKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRVDPD 521 (703)
Q Consensus 442 gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~D 521 (703)
||++||||+++++++ .++.|.++|+|||++|.+++|+||++++|.|||||||||+|+++++++++.++|+++++|
T Consensus 112 gK~~~a~~~~~~~~~-----~~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tl~HE~GHa~h~~l~~~~~~~~~g~~~~~~ 186 (365)
T cd06258 112 GKYPHGFCTGLDPGF-----NRQDKDVRILANFTSPAAPDPVLLGHDDINTLFHEFGHAVHFLLIQQRYPFQERTPTSTD 186 (365)
T ss_pred CCCCCCeeccccCCC-----CCCCCeEEEEccCCCCCCCCCCcCCHHHHHHHHHHHhHHHHHHHhcCCCCcCCCCCCCcc
Confidence 999999999987664 457899999999999999999999999999999999999999999999998899999999
Q ss_pred hhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCCc--ccHHHHH
Q 005310 522 FVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSADN--VDIVELF 599 (703)
Q Consensus 522 fvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~~--~~~~~l~ 599 (703)
|+|+||++||+|++++++|..+++|+++ + ++|.+++++++..+..+.++.+.+|+.++.||+.+|.... .+++++|
T Consensus 187 ~~E~~S~~~E~~~~~~~~L~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~ 264 (365)
T cd06258 187 FAEAQSMFLESFATDPEWLERYARHYQG-G-VVPDELIEKLIAARLPNTLYETRRILVVAKFEKALYENPDRELELQKLW 264 (365)
T ss_pred HHhccHHHHHHHHCCHHHHHHHhhhcCC-C-CCcHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCCccCCHHHHH
Confidence 9999999999999999999999999998 6 8999999999999988999999999999999999997654 3799999
Q ss_pred HHHhcccccCCCCCCCCCCccccccccCcccCCcccHHHHHHHHHHHHHHHHhc-C---CCCHHHHHHHHHHhccCCCCc
Q 005310 600 KHLHPKVMLGLPMLEGTNPASCFPRSAIGFESACYSRIWSEVFAADIFACKFRD-G---LLNQRVGMQFRNNVLAPGGSK 675 (703)
Q Consensus 600 ~~l~~~~~~~~~~~~~~~~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i~~~~f~~-~---~~~~~~g~~~r~~vL~~Ggs~ 675 (703)
+++.++ +.|++..+++.+..+|+|+++||+++||+|+||+++|+++| +.|++ + +.++++|++||++||++|||+
T Consensus 265 ~~l~~~-~~g~~~~~~~~~~~~~~H~~~gy~~yyy~Y~~~~v~a~qi~-~~~~~~~~~~~~~~~~g~~l~~~il~~G~s~ 342 (365)
T cd06258 265 RDLVKE-ILGVRPDPSTPDPAAFPHLAGGSPAYYYGYLLAEMLASQLR-ATFKKKVGYLTDNPEAGPRLREHILRPGNSE 342 (365)
T ss_pred HHHHHH-hcCCCCCCCCCCCCccchhccCcccchHHHHHHHHHHHHHH-HHHHhcCCCCCCCHHHHHHHHHHHccCcCCc
Confidence 999999 88877666666667999998669999999999999999999 55554 3 578999999998899999999
Q ss_pred cHHHHHHHhhCCCCChHHHHHH
Q 005310 676 EPIEILSDFLGREPSIQAFVDS 697 (703)
Q Consensus 676 ~~~ell~~flGr~p~~~a~l~~ 697 (703)
+|.|++++|+|++||+++|++.
T Consensus 343 ~~~el~~~~~G~~~~~~a~~~~ 364 (365)
T cd06258 343 PWKELLKRATGEDPNADAFLDH 364 (365)
T ss_pred CHHHHHHHHcCCCCChHHHHhc
Confidence 9999999999999999999975
No 11
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the larger M3, or thimet (for thiol-dependent metallopeptidase), oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of between 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather than activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=100.00 E-value=2.8e-59 Score=540.70 Aligned_cols=538 Identities=16% Similarity=0.160 Sum_probs=411.0
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHHhC-CCCCCCCHHHHHHHHHhhhhhhcccccccccccccC-CcHHHHHHHHHHHHHHH
Q 005310 59 SASEILKLADKIVSKSKEVHDAVAS-VPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVC-TSDDVCKSSAEAERKID 136 (703)
Q Consensus 59 s~~~i~~~~~~~i~~~~~~~~~i~~-~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~-~d~e~r~aa~e~~~~l~ 136 (703)
+.+++....+.+-+....+ ..+.. ...+.++|.+++..+++++..+..+..-+.+...+. .|++.++..+++..+.+
T Consensus 18 ~~~~~~~~~~~l~~~~~~l-~~~~~~~~~~~~~l~~~l~~~e~~~~~~~~l~~Ya~l~~s~d~~d~~~~~~~~~~~~l~s 96 (591)
T TIGR00181 18 NKEEWELFLEALEEDIKEI-KAFKKGLLHSKETFLEALALEEKILILLNRLYNYASMKLSTDVTDPEANAISQKLSNLYT 96 (591)
T ss_pred CHHHHHHHHHHHHHHHHHH-HHHhCcccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccCCCCHHHHHHHHHHHHHHH
Confidence 4566666666654444333 33332 223456899999999999888888877777777777 69999999999999999
Q ss_pred HHHHHhh-CCHHHHHHHHHHHhcCCCC--CHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhcccc
Q 005310 137 AHMLSCS-NREDVYRVVKAVAATGEWV--SPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYVRNLNDD 213 (703)
Q Consensus 137 ~~~~~l~-~~~~ly~~l~~~~~~~~~L--~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~~n~~~~ 213 (703)
++...+. .++.|++.-+...+. .+ +++-+.+...+.+.++..|+.|+++..+.+.+++...+..+..|......+
T Consensus 97 ~~~~~~sf~~~eL~~l~~~~~~~--~l~~~~~L~~y~~~l~~~~r~k~h~Ls~e~E~lLa~~~~~~~a~~~~f~~l~~ad 174 (591)
T TIGR00181 97 KVASATSFFEPEILEIEEKIIKE--WLKDPEELADYKRALEEIFRDKPHILSEEVEKLLSALSEVFGSPSDIYSTLTNAD 174 (591)
T ss_pred HHHHHHHHHHHHHHhCCHHHHHH--HHhcChhhHHHHHHHHHHHHhCCCCCCHHHHHHHHhCchhhccHHHHHHHHHhcc
Confidence 9999887 678888765444221 12 234556666667778888999999999999999999999999999765444
Q ss_pred ccceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC---CchHHHHHHHHHH
Q 005310 214 KTFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG---GINLSVLESLVEL 290 (703)
Q Consensus 214 ~~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~---~~n~~~l~~ll~l 290 (703)
.. | |. ....+|+ .++++.+++.++|++++|+++||++|+++.+++. ..|..+|..+++.
T Consensus 175 ~~---f--------~~------~~~~~G~-~~~l~~~~~~~~l~~~~dr~~Rk~a~~a~~~~~~~~~~~~a~~l~~~v~~ 236 (591)
T TIGR00181 175 MD---F--------GS------IEDYKGK-KYPITNSTYENFLQKNKDREIRKKAYESFYKAYRKHKNTFAALYYGNVQK 236 (591)
T ss_pred CC---C--------Cc------eecCCCc-EEeccHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 32 1 10 0012674 8999999999999999999999999999998874 2567899999999
Q ss_pred HHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccchhhhHHHHHhhh
Q 005310 291 RHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIEDLLYYVRRVEELQ 370 (703)
Q Consensus 291 R~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD~~y~~~~~~~~~ 370 (703)
|.+.|+++||+||+++++.++|+ |++|.++|.++..+..|.+++ +.++|++..|.+++.+||+. .+..+..
T Consensus 237 r~~~A~lrGy~s~~~~~l~~~~i--~~~v~~~l~~~v~~~~~~~~~----y~~~k~k~lg~~~l~~~D~~--~~~~~~~- 307 (591)
T TIGR00181 237 NVFLAKLRNYESYIDASLFSDEV--PREVYDNLYDTIKKNAPVLQR----YYKLRKKVLKLDKMEPYDLY--LPLVKEK- 307 (591)
T ss_pred HHHHHHhcCCCCHHHHHhccCCC--CHHHHHHHHHHHHHhhHHHHH----HHHHHHHHhCCCcccHHHcc--CCCCCCc-
Confidence 99999999999999999999885 899999999999888898877 45567777788899999994 3333222
Q ss_pred cCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccCCCcCCcccc
Q 005310 371 FDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKREGKYVHTCVV 450 (703)
Q Consensus 371 ~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~gK~~ga~~~ 450 (703)
.+.++.+...+-+..-+ +.||.+| ...|+. .++ ++ |||+++|+||++||||+
T Consensus 308 --------~~~~~~~~a~~~v~~~~-~~~g~~~---------~~~~~~--~~~--~~------wiD~~~r~gK~~Ga~~~ 359 (591)
T TIGR00181 308 --------NPKFSIEEAKELILKSL-EPLGEEY---------IKILKR--AFN--ER------WVDYAENKGKRSGAYSI 359 (591)
T ss_pred --------CCcccHHHHHHHHHHHH-hccCHHH---------HHHHHH--Hhh--CC------CeeecCCCCCCCCcccC
Confidence 22344444333333333 3577544 333433 122 23 99999999999999999
Q ss_pred ccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCccchhhhhccHHHH
Q 005310 451 ALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRVDPDFVEIPAQLL 530 (703)
Q Consensus 451 ~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~DfvE~pS~l~ 530 (703)
++. ...|+ ++|||++ ++++|.||||||||++|+++++..++.+++ ++++|+||+||++|
T Consensus 360 ~~~---------~~~p~--il~N~~~---------~~~dv~TLaHElGHa~H~~~~~~~~~~~~~-~~~~~~aE~aS~~~ 418 (591)
T TIGR00181 360 GGY---------KVKPY--ILMNWDG---------TLNSVFTLAHELGHSMHSYFSSKHQPYPNS-DYSIFYAEIASTFN 418 (591)
T ss_pred CCC---------CCCCe--EEEecCC---------CcchHHHHHHHhhhHHHHHHHccCCCCccC-CCCchhhhHhHHHH
Confidence 862 13564 4789998 999999999999999999999765554444 78999999999999
Q ss_pred HHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCC-------cccHHHHHHHHh
Q 005310 531 ENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSAD-------NVDIVELFKHLH 603 (703)
Q Consensus 531 E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~-------~~~~~~l~~~l~ 603 (703)
|+|++++ +.+|+ + +++++..++++++. +++. .+++|++++.||+.+|... +.+++++|.+++
T Consensus 419 E~l~~~~-----l~~~~-~-~~~~k~~~l~~~l~--~~~~--~~~~q~~~~~Fe~~~~~~~~~~~~lt~~~l~~~~~~~~ 487 (591)
T TIGR00181 419 ELLLADY-----LLKNS-N-DPEMKIYILLERIS--NFFG--TFTRQTLFAEFEYEAYELIEEGEPLTAETLNEIYANLL 487 (591)
T ss_pred HHHHHHH-----HHhhC-C-CHHHHHHHHHHHHH--HHHH--HHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHH
Confidence 9999997 56667 4 57788899988887 3443 6799999999999999641 248899999999
Q ss_pred cccccCCCC--CC--CCCCccccccccCcccCCcccHHHHHHHHHHH--HHHHHhcCCCCHHHHHHHHHHhccCCCCccH
Q 005310 604 PKVMLGLPM--LE--GTNPASCFPRSAIGFESACYSRIWSEVFAADI--FACKFRDGLLNQRVGMQFRNNVLAPGGSKEP 677 (703)
Q Consensus 604 ~~~~~~~~~--~~--~~~~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i--~~~~f~~~~~~~~~g~~~r~~vL~~Ggs~~~ 677 (703)
++ ++|... .+ +..| .+|+|+ |.+ ||+|.||.++++++ | ..+++++ +.++++|+ ++|+.|||++|
T Consensus 488 ~~-~~g~~~~~~~~~~~~w-~~~~Hf---y~~-fY~Y~Ya~g~~~a~~l~-~~~~~~~--~~~~~~Y~-~~L~~Ggs~~p 557 (591)
T TIGR00181 488 KK-YFGDLVKIDEGAGLTW-MRIPHF---YMG-FYVYKYATGQVAATALY-EKIKEEG--KGAVEKYL-KFLKSGGSKYP 557 (591)
T ss_pred HH-HcCCccccCCccccee-eecCcc---cCC-CcCHHHHHHHHHHHHHH-HHHHHCC--ccHHHHHH-HHHhCcCCCCH
Confidence 99 656422 22 2345 589996 665 99999988666555 7 7777643 56799998 89999999999
Q ss_pred HHHHHHhhCCCCChHHHHHH
Q 005310 678 IEILSDFLGREPSIQAFVDS 697 (703)
Q Consensus 678 ~ell~~flGr~p~~~a~l~~ 697 (703)
.|+++.+ |-+++..+|++.
T Consensus 558 ~ell~~~-Gvd~~~~~~~~~ 576 (591)
T TIGR00181 558 LETLKIA-GVDLTKPQPWQA 576 (591)
T ss_pred HHHHHHc-CcCCCChHHHHH
Confidence 9999995 999999888876
No 12
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=100.00 E-value=3.7e-54 Score=492.26 Aligned_cols=501 Identities=18% Similarity=0.175 Sum_probs=366.4
Q ss_pred HHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcH-HHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHH
Q 005310 76 EVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSD-DVCKSSAEAERKIDAHMLSCSNREDVYRVVKA 154 (703)
Q Consensus 76 ~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~-e~r~aa~e~~~~l~~~~~~l~~~~~ly~~l~~ 154 (703)
++++.+.+.. ..+.++|..++++.+.+......+.+--++++++ +..+..+.....+......+.. .+.+.
T Consensus 10 ~~~~~~~~~~---~~~~~~~~~~~~l~~~~~~~~~~~~i~~s~dt~d~~~~~~~~~~~~~~~p~~~~~~~--~~~~~--- 81 (549)
T TIGR02289 10 ELLEKFISSK---LEQENWINEENDVNDDIEEMITSAYIRHSVDTDDEEFYDLEEHFQDEIKPLLKRYNT--KFDQK--- 81 (549)
T ss_pred HHHHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCHHHHHHHHHHHHHhCcHHHHHHH--HHHHH---
Confidence 3555555432 5678899999999988888777755544455544 4444444433333333322211 22221
Q ss_pred HHhcCCCC-CHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhccccccceEecccccCCCCHHHHH
Q 005310 155 VAATGEWV-SPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYVRNLNDDKTFLLFSEADLLGLPPEFLK 233 (703)
Q Consensus 155 ~~~~~~~L-~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~~n~~~~~~~~~~~~~eL~glp~~~l~ 233 (703)
..+ ... ++...++...+.+..+....-++++ ...+.++.+.+++.|.+.+.+.+ +.+
T Consensus 82 ~~~--~~~~~~l~~~~~~~l~~~~~~~~~~~s~~----~~~~~~~~~~~~~~y~~l~~~~~--~~~-------------- 139 (549)
T TIGR02289 82 IIE--SPFREELDERFYELLFKLIKNKLKLFNEK----NIPLLQKENILSTKYREIISNIS--IIF-------------- 139 (549)
T ss_pred HHc--CcchhHHHHHHHHHHHHHHHhhhhhcCHH----HHHHHHHHHHHHHHHHHHHhccE--EEE--------------
Confidence 111 222 3445555555556666655555554 66777888999999998876443 222
Q ss_pred HHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC---CchHHHHHHHHHHHHHHHHHcCCCCHHHhHh--
Q 005310 234 SLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG---GINLSVLESLVELRHKFARLLGYSNYADYAL-- 308 (703)
Q Consensus 234 ~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~---~~n~~~l~~ll~lR~~~A~llGy~sya~~~l-- 308 (703)
+|+ .+++ +.+.++|++ +||++||++|.|+.+++. ..+..++.+++++|+++|+++||+||+++++
T Consensus 140 ------~Ge-~~~l--~~~~~~l~~-~dr~~Rk~a~~a~~~~~~~~~~~~~~il~~lv~~R~~~Akl~Gy~s~~e~~~~~ 209 (549)
T TIGR02289 140 ------EGE-EKTL--SQLKPYLQD-PNRSTRKKAWYARSEFFAVVEEELDRIYDELVQIRTQIALNLGFSNYRDYMYKL 209 (549)
T ss_pred ------CCE-eccH--HHhhHHhhC-CCHHHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
Confidence 564 6666 458889987 999999999999988764 3567899999999999999999999999995
Q ss_pred hhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccchhhhHHHHHhhhcCCChhhhccCCChHHHH
Q 005310 309 DLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVL 388 (703)
Q Consensus 309 ~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl 388 (703)
.+||+.+|++|.+|++.+.+.+.|.+++ +.+++++..|.+++.+||+.| .++.....+||+.+.++
T Consensus 210 ~~r~~~~~e~v~~~l~~v~~~~~p~~~~----~~~~k~k~lG~~~l~~wD~~~----------~~~~~~~~~~~~~e~~~ 275 (549)
T TIGR02289 210 KNRTDYSAEDCYKYRESILKYVVPLWTK----LRKIKKKRLGIKTLRPWDESA----------VFLDGNVKPFGNVDFLL 275 (549)
T ss_pred hcccCCCHHHHHHHHHHHHHHhHHHHHH----HHHHHHHhCCCCcCChhhccC----------CCCCCCcCCCCCHHHHH
Confidence 5899999999999999999999999876 555677778888999999953 34445568999999999
Q ss_pred HHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccCCCcCCccccccCCCcccCCCCccccEE
Q 005310 389 SGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKREGKYVHTCVVALQNGALSSTGARQIPVA 468 (703)
Q Consensus 389 ~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~ 468 (703)
+++..+++.+ |.++.++ ++ ..++..|+|+++|+||++||||+++++ .+.|+
T Consensus 276 ~~~~~~~~~l-~~~~~e~---------------~~----~~~~~~~iD~~~r~gK~~Gayc~~~~~--------~~~P~- 326 (549)
T TIGR02289 276 EKAEKMYKEL-SLEFYEF---------------FN----FMREKNLLDLESRKGKAAGGYCTYLPK--------YKAPF- 326 (549)
T ss_pred HHHHHHhcCC-CcHHHHH---------------HH----HHHhcCccccCCCCCCCCCcccCCCCC--------CCCcE-
Confidence 9999888764 5443221 11 133444999999999999999999753 24685
Q ss_pred EEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCC-ccchhhhhccHHHHHHhHhcHHHHHHhhccc
Q 005310 469 LLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGL-RVDPDFVEIPAQLLENWCYESFSLKLISGFH 547 (703)
Q Consensus 469 ~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~-~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~ 547 (703)
++|||++ +++||.||+||||||+|+++++.. ..... ..++|++|+||++||+|+|++ +..++
T Consensus 327 -I~~Nf~~---------t~~dv~TL~HElGHa~H~~~s~~~--~~~~~~~~~~~~aE~aS~~~E~l~~~~-----l~~~~ 389 (549)
T TIGR02289 327 -IFSNFNG---------TSGDIDVLTHEAGHAFHVYESRKD--LLPEYRWPTYEAAELASMSMELLSMPW-----WDLFY 389 (549)
T ss_pred -EEEeCCC---------ChhHHHHHHHHhhHHHHHHHhcCC--cccccccCcchhhhhhHHHHHHHHHHH-----HHHHc
Confidence 4689998 999999999999999999999752 22222 257999999999999999996 56677
Q ss_pred CCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCC---cccHHHHHHHHhcccccCC---CC----CCCCC
Q 005310 548 QDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSAD---NVDIVELFKHLHPKVMLGL---PM----LEGTN 617 (703)
Q Consensus 548 ~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~---~~~~~~l~~~l~~~~~~~~---~~----~~~~~ 617 (703)
++ +++.+..++..|..+.. .+.+|+++..|...+|... +.+++++|.+++++ |.+- +. ..+..
T Consensus 390 ~~-~e~~~~~~~~~L~~~~~-----~~~~~~~~d~fe~~vy~~~~lt~~e~~~~~~~l~~~-y~~~~~~~~~~~~~~~~~ 462 (549)
T TIGR02289 390 TD-EKDLKKAKISHLKGALS-----FLPYGVIVDHFQHWVYENPNHTPEERDEKYAELEKR-YQPSTVYIAGLELEIGTF 462 (549)
T ss_pred CC-cchHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHccCCCCCHHHHHHHHHHHHHH-hCCCcccCCccccccCCc
Confidence 77 78888888877765432 2346666777777777643 35889999999988 6631 11 12355
Q ss_pred CccccccccCcccCCcccHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCC
Q 005310 618 PASCFPRSAIGFESACYSRIWSEVFAADIFACKFRDGLLNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPS 690 (703)
Q Consensus 618 ~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~ 690 (703)
| .+++|++ .++..||+|.||+++|.++| +.+.+++ +.+.++|. ++|+.|||++|.++++.+ |-+..
T Consensus 463 W-~~~~H~~-~~pFYyy~Y~~a~~~a~~l~-~~~~~~~--~~~~~~Y~-~~L~~Ggs~~~~ell~~a-Gid~~ 528 (549)
T TIGR02289 463 W-LRQLHIF-EVPFYYIEYTIAQIGALQIY-KIYKEDP--EKALKDYK-KLCSAGGSQSFLELYETA-GLTFP 528 (549)
T ss_pred c-eeeceee-cCCCcchhhHHHHHHHHHHH-HHHHhCH--HHHHHHHH-HHHhccCCcCHHHHHHHh-CCCCC
Confidence 7 4899997 22223679999999999999 8888765 77888885 899999999999999996 75544
No 13
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the Bacillus amyloliquefaciens PepF oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when overexpressed from a multicopy plasmid.
Probab=100.00 E-value=6.1e-52 Score=462.53 Aligned_cols=380 Identities=21% Similarity=0.299 Sum_probs=300.5
Q ss_pred CCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC---CchHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCH
Q 005310 240 NGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG---GINLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSS 316 (703)
Q Consensus 240 ~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~---~~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tp 316 (703)
+| +.++++. .+.+++++ +|+++||++|.++.+++. ..+..+|.+++++|+++|+++||+||+++.+.++| +|
T Consensus 29 ~G-~~~~l~~-~~~~~~~~-~d~~~Rk~a~~a~~~~~~~~~~~~~~~l~~lv~~r~~~A~~~Gy~~~~~~~l~~~~--~~ 103 (427)
T cd06459 29 DG-EELTLSQ-NLSNLLES-PDREVRKKAFEALYKAYEKYENTLAAILNTLVKLRLTLAKLRGYDSYLEAALFNNN--IP 103 (427)
T ss_pred CC-eEEeccH-HHHHHHhC-cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHhhccCC--CC
Confidence 67 5899988 78888876 899999999999988873 25678999999999999999999999999999998 89
Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccchhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHh
Q 005310 317 SKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIEDLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQ 396 (703)
Q Consensus 317 e~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~ 396 (703)
++|.+||.++.+.+.|.++++.+ .+++..|.+++.|||+.|+....+ .+.|+.+..++++..+++
T Consensus 104 ~~~~~~l~~v~~~~~p~~~~~~~----~~~~~lg~~~l~~wD~~~~~~~~~-----------~~~~~~~~~~~~v~~~~~ 168 (427)
T cd06459 104 EDVYDFLIAVVKENVPLLHRYLK----LKKKLLGLDKLRPYDLYAPLVSGN-----------PPKYTYEEAKELVLEALS 168 (427)
T ss_pred HHHHHHHHHHHHHhcHHHHHHHH----HHHHHhCCCcCcHhhcCCCCCCCC-----------CCcCcHHHHHHHHHHHHH
Confidence 99999999999999999999654 445556788999999976644322 256777787888887765
Q ss_pred hhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccCCCcCCccccccCCCcccCCCCccccEEEEeccCCC
Q 005310 397 DLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQK 476 (703)
Q Consensus 397 ~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~ 476 (703)
.+ |.++.. +.+ ..++..|+|+++||||++||||+++.++ ..| +++|||++
T Consensus 169 ~l-~~~~~~---------------~~~----~~~~~~~iD~~~r~gK~~gaf~~~~~~~--------~~p--~i~~n~~~ 218 (427)
T cd06459 169 PL-GPEYAE---------------FAK----RAFEERWIDVEPRKGKRSGAYCTGLPPG--------KHP--FILMNFNG 218 (427)
T ss_pred cc-CHHHHH---------------HHH----HHhhCCCeeccCCCCCCCCeecCCCCCC--------CCC--eEEecCCC
Confidence 43 422211 111 2334449999999999999999987422 246 55789998
Q ss_pred CCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchH
Q 005310 477 DIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKD 556 (703)
Q Consensus 477 ~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~ 556 (703)
++++|.|||||||||+|+++++..++ ..+.++++||+|+||++||+|++++ +.+|+++ . ..+.
T Consensus 219 ---------~~~~v~tl~HE~GHa~h~~~~~~~~~-~~~~~~~~~~~E~~S~~~E~~~~~~-----l~~~~~~-~-~~~~ 281 (427)
T cd06459 219 ---------TLDDVFTLAHELGHAFHSYLSRDNQP-YLYSDYPIFLAEIASTFNELLLFDY-----LLKFAKD-P-EEKL 281 (427)
T ss_pred ---------ChhhHHHHHHHhhHHHHHHHHccCCC-cccCCCCchhhHHHHHHHHHHHHHH-----HHHhCCC-H-HHHH
Confidence 89999999999999999999986544 3445689999999999999999996 4567765 2 3444
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCC-------cccHHHHHHHHhcccccCCCC----CCCCCCccccccc
Q 005310 557 EICKSLKRRRYAFSALKLKQEILYCIFDQVIHSAD-------NVDIVELFKHLHPKVMLGLPM----LEGTNPASCFPRS 625 (703)
Q Consensus 557 ~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~-------~~~~~~l~~~l~~~~~~~~~~----~~~~~~~~~f~Hl 625 (703)
.++..++.. .. ..+.+|++++.||+.+|... +.++.++|+++.++ |.|... ..+..|+ +|+|+
T Consensus 282 ~~~~~~~~~---~~-~~~~~~~~~~~fe~~l~~~~~~~~~~~~~~~~~~~~~~~~~-y~g~~~~~~~~~~~~w~-~~~H~ 355 (427)
T cd06459 282 YLLEHLLED---IR-ATLPRQTMFAEFEHEVYENPEEGEPLTAEELNEIYRELEKK-YGGDLVEIDEEHGYEWA-RIPHF 355 (427)
T ss_pred HHHHHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHH-hcCCccccCCccCeeee-ecCee
Confidence 555554432 11 16789999999999999742 34789999999999 877533 2235574 89999
Q ss_pred c-CcccCCcccHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHHHHhh
Q 005310 626 A-IGFESACYSRIWSEVFAADIFACKFRDGLLNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFVDSRA 699 (703)
Q Consensus 626 ~-~~Y~a~YYsYl~a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~~ 699 (703)
+ ++| +||+|+||.++|.++| ..|++++ +.++++|+ ++|+.||+++|.|+++.+ |.+++...|++.-.
T Consensus 356 ~~~~f--Yyy~Y~~g~~~a~~l~-~~~~~~~--~~~~~~y~-~~L~~Ggs~~~~ell~~~-g~d~~~~~~~~~~~ 423 (427)
T cd06459 356 YYVPF--YVYPYAFGQLAALALY-AKYKEDG--EGAVEKYL-ELLKAGGSKSPLELLKKA-GVDLTSPDFWEEAI 423 (427)
T ss_pred cCCCC--cChHHHHHHHHHHHHH-HHHHHcC--ccHHHHHH-HHHHccCCCCHHHHHHHc-CcCCCChHHHHHHH
Confidence 7 233 4889999999999999 7887754 67899997 899999999999999975 99999999998754
No 14
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=100.00 E-value=1e-49 Score=459.82 Aligned_cols=530 Identities=18% Similarity=0.155 Sum_probs=354.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhCC--CC---CCCCHHHHHHHHHhhhhhhcccccccccccccCC-cHHHHHHHHHHHHHHH
Q 005310 63 ILKLADKIVSKSKEVHDAVASV--PL---DKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCT-SDDVCKSSAEAERKID 136 (703)
Q Consensus 63 i~~~~~~~i~~~~~~~~~i~~~--~~---~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~-d~e~r~aa~e~~~~l~ 136 (703)
+...++.+-+...++....... +. +..++.+.+.-++++...+..+..-+.+...+.+ |++.++..+.+.....
T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~l~~~l~~~e~~~~~~~~l~~Ya~l~~~~Dt~d~~~~~~~~~~~~l~~ 95 (587)
T TIGR02290 16 FEEDLKKIKALADEFESLYQEKLTPLDAKGKEKLQNALKRYEALGELLSKVGAYASLLYSADTSDPEAKALLGELEALSS 95 (587)
T ss_pred HHHHHHHHHHHHHHHHHHHhcccccccccCHHHHHHHHHHHHHHHHHHHHHHHHHhHHHhhcCCCHHHHHHHHHHHHHHH
Confidence 6555555544444433322211 01 1234777777777777777666666555555654 6777666666666555
Q ss_pred HHHHHhhC-CHHHHH----HHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHH-HHHHHHHHHHhhc
Q 005310 137 AHMLSCSN-REDVYR----VVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRD-QIDELSLQYVRNL 210 (703)
Q Consensus 137 ~~~~~l~~-~~~ly~----~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~-~i~~l~~~f~~n~ 210 (703)
.+...+.. .+.|.+ .++++.+ +++-.++-..+.+-++...+-|+++..+-+.++.. -.......|.+..
T Consensus 96 ~~~~~~sf~~~el~~l~~~~l~~~~~-----~~~L~~y~~~l~~~~~~~~h~Ls~e~E~ll~~~~~~~~~~~~~~y~~l~ 170 (587)
T TIGR02290 96 EIQQALLFFTLELMRLSDEVWAALLS-----DPDLAPYRFYLKELRKEAPHTLSEEEEKLINALSLTGRAAWSRLYDELT 170 (587)
T ss_pred HHHHHHHHHHHHHHhCCHHHHHHHHh-----CchHHHHHHHHHHHHHcCCcCCCHHHHHHHHhcccchhhhHHHHHHHHH
Confidence 55443322 112221 1112111 12222222233334444577778777777766652 4456677777554
Q ss_pred cccccceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccCCch----HHHHHH
Q 005310 211 NDDKTFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCGGIN----LSVLES 286 (703)
Q Consensus 211 ~~~~~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~~~n----~~~l~~ 286 (703)
++- .+. ..| .+|+ ...++.+.+..+|.+ +|+++||++|+++.+.. ..+ ..+|..
T Consensus 171 ~~~----~~~---~~~------------~~G~-~~~~t~~~~~~~l~~-~dr~~Rk~a~~~~~~~~-~~~~~~~a~~l~~ 228 (587)
T TIGR02290 171 GTL----RIP---VDG------------KDGE-EETLSLEQALNLLRD-PDAEVRKKAFKALLKAW-EKNAPTLAAILNA 228 (587)
T ss_pred ccC----cce---eec------------CCCc-eEEcCHHHHHHHHhC-cCHHHHHHHHHHHHHHH-HHHHHHHHHHHHH
Confidence 311 111 111 2675 667788888888766 89999999999998876 344 356688
Q ss_pred HHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccchhhhHHHH
Q 005310 287 LVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIEDLLYYVRRV 366 (703)
Q Consensus 287 ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD~~y~~~~~ 366 (703)
+++.|...|+++||+||.++++..+.. +++.+..|++.+.+. .|.+++.+ +++++..|.+++++||+.+....
T Consensus 229 ~v~~~~~~a~~rgy~s~~e~~l~~~~i-~~~~~~~l~~~v~~~-~p~~~~y~----~~k~~~lgl~~l~~~D~~~p~~~- 301 (587)
T TIGR02290 229 LAGDRLTEYRLRGYDHPLEPRLLYNRI-DQETLDAMLEAIKEN-YPLFRRYY----KLKAKLLGKEKLDFYDLYAPLGD- 301 (587)
T ss_pred HHHHHHHHHHhhCCCCHHHHhhccCCC-CHHHHHHHHHHHHHh-hHHHHHHH----HHHHHHhCCCcccHHHcCCCCCC-
Confidence 899999999999999999999987665 699999999998887 68887744 45666788899999999532221
Q ss_pred HhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccCCCcCC
Q 005310 367 EELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKREGKYVH 446 (703)
Q Consensus 367 ~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~gK~~g 446 (703)
...+.++.+...+.+...++ -||-. +..-++. .++ +| |+|+++|+||++|
T Consensus 302 ----------~~~~~~~~e~a~~~v~~~~~-~l~~e---------~~~~~~~--~~~--~~------~iD~~~r~gK~~G 351 (587)
T TIGR02290 302 ----------SSAKTYTFDEAKELVLEAFG-KFSPE---------MADFAEK--AFE--EG------WIDAEPRPGKRGG 351 (587)
T ss_pred ----------CCCCeecHHHHHHHHHHHHH-hcCHH---------HHHHHHH--HHH--cC------CcccCCCCCCCCC
Confidence 11233344444444443332 23321 2222221 122 34 9999999999999
Q ss_pred ccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCccchhhhhcc
Q 005310 447 TCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRVDPDFVEIP 526 (703)
Q Consensus 447 a~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~DfvE~p 526 (703)
|||.++.++ ..| +++|||++ ++++|.|||||||||+|+++++.. ...++++++|+||+|
T Consensus 352 a~~~~~~~~--------~~p--~i~~N~~~---------~~~~v~TL~HE~GHa~H~~ls~~~--~~~~~~~~~~~aE~~ 410 (587)
T TIGR02290 352 AFCTGFPPS--------KEP--RVLMNYDG---------SRRDVSTLAHELGHAYHSELAKDQ--PLLNARYPMTLAETA 410 (587)
T ss_pred cccCCCCCC--------CCC--EEEEecCC---------CchhHHHHHHHhhHHHHHHHHccC--CcccCCCCchhhhHH
Confidence 999987532 247 46789998 899999999999999999999763 344566899999999
Q ss_pred HHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHhcCC------cccHHHHH
Q 005310 527 AQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSAL-KLKQEILYCIFDQVIHSAD------NVDIVELF 599 (703)
Q Consensus 527 S~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~-~~~~ql~~a~~D~~~H~~~------~~~~~~l~ 599 (703)
|++||+|+|++ .|+. +.+ ++....++.. ++..++ .+++|++++.||+.+|... +.++.++|
T Consensus 411 S~~~E~l~~~~-ll~~----~~~------~~~~~~~~~~-~l~~~~~~~~~q~~~~~fE~~l~~~~~~~~lt~~~l~~~~ 478 (587)
T TIGR02290 411 SIFAEMLLFDA-LLKE----AKT------DEEKLSLLAE-KLEDAIATLVRIHARFLFERRFHEARKEGELSADDICDLM 478 (587)
T ss_pred HHHHHHHHHHH-HHhh----CCC------HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHH
Confidence 99999999998 3433 222 1222333333 455555 4699999999999999632 24788999
Q ss_pred HHHhcccccCC--CCCC-C-CCCcccccccc-CcccCCcccHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCCC
Q 005310 600 KHLHPKVMLGL--PMLE-G-TNPASCFPRSA-IGFESACYSRIWSEVFAADIFACKFRDGLLNQRVGMQFRNNVLAPGGS 674 (703)
Q Consensus 600 ~~l~~~~~~~~--~~~~-~-~~~~~~f~Hl~-~~Y~a~YYsYl~a~vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs 674 (703)
.++.++ +.|. +..+ . ..| .+|+|++ +|| +||.|.||.++|.++| +.|.+++ +.+.++|+ +||+.|||
T Consensus 479 ~~~~~~-~~g~~~~~~~~~~~~w-~~~~H~y~~~f--Y~y~Y~~g~~~a~~l~-~~~~~~~--~~~~~~y~-~~L~~Ggs 550 (587)
T TIGR02290 479 LEAQKE-SYGDALDESELHPYMW-AYKPHFYHAPF--YNYPYTFGYLFVLGLY-AKYREEG--ESFVPKYI-ALLRDTGS 550 (587)
T ss_pred HHHHHH-HhCcccccCCCCCcee-ecCCeecCCCC--cChhhHHHHHHHHHHH-HHHHHCC--hhHHHHHH-HHHHCcCC
Confidence 999988 6553 2222 2 345 5899996 233 2468999999999999 7776643 45677886 89999999
Q ss_pred ccHHHHHHHhhCCCCChHHHHHHh
Q 005310 675 KEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 675 ~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
++|.|++++ +|++|+.++|++.-
T Consensus 551 ~~p~ell~~-~G~d~~~~~~~~~~ 573 (587)
T TIGR02290 551 MTPEELVKK-FGFDLTSPDFWQKG 573 (587)
T ss_pred CCHHHHHHH-hCcCCCChHHHHHH
Confidence 999999999 89999999999763
No 15
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=100.00 E-value=4.4e-40 Score=375.21 Aligned_cols=508 Identities=18% Similarity=0.214 Sum_probs=346.1
Q ss_pred CCCHHHHHHHHHhhhhhhcccccccccccccC-CcHHHHHHHHHHHHHHHHHHHHhhC-CHHHHHHHHHHHhcCCCCCH-
Q 005310 88 KVTFMNVISPLAELEAQQFPLVQSCVFPKLVC-TSDDVCKSSAEAERKIDAHMLSCSN-REDVYRVVKAVAATGEWVSP- 164 (703)
Q Consensus 88 ~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~-~d~e~r~aa~e~~~~l~~~~~~l~~-~~~ly~~l~~~~~~~~~L~~- 164 (703)
..+|.+.+...+++...+..+..-+.+...+. .|++..+..+.+.....++...+.. +..+.+.-+++.+ ..|..
T Consensus 49 ~~~l~~~l~~~e~~~~~l~~~~~ya~~~~s~d~~d~~~~~~~~~~~~~~~~~~~~~~~~~~el~~~~e~~~~--~~l~~~ 126 (598)
T COG1164 49 AEDLLEALEEYEKLEELLGRLYAYASMKLSTDTTDEEAQKLYQKLEEFYADFSSALSFFEPELLELDEEVIQ--SLLTSG 126 (598)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHH--HHHhcC
Confidence 45677777777777777766666655544454 3667766666666666665544432 2333333233221 11222
Q ss_pred HHHHHHHHHHHHHHH-cCCCCCHHHHHHHHHHHHHHHHHHHHHHhhccccccceEecccccCCCCHHHHHHHhhccCCce
Q 005310 165 EAKRYIQSLVRDFER-SGLNLTISKREEVQRLRDQIDELSLQYVRNLNDDKTFLLFSEADLLGLPPEFLKSLDKAENGIF 243 (703)
Q Consensus 165 e~~r~l~~~~~~f~~-~G~~L~~~~r~~~~~l~~~i~~l~~~f~~n~~~~~~~~~~~~~eL~glp~~~l~~~~~~~~g~~ 243 (703)
+..+..+..+++..+ .-+.|++.+.+-+..+......-...|....+.+. .+.. ..+.+|+
T Consensus 127 ~~l~~y~~~l~~~~~~k~h~ls~~~Ekll~~~~~~~~~~~~~~~~~~~~dl---~~~~--------------~~~~~g~- 188 (598)
T COG1164 127 PELADYRFYLEELFRRKPHQLSAEEEKLLAQLSPVGNSASNIFFDLTNSDL---TFPD--------------IGDAKGK- 188 (598)
T ss_pred cchhhhHHHHHHHHhhCcccCCHHHHHHHHhhhhhhcchhHHhhhhhcccc---cccc--------------ccccccc-
Confidence 233344444444444 57788888888888887777666666654432221 1111 0122676
Q ss_pred EEEccCccHHHHHhcCCCHHHHHHHHHHHhcccCCch----HHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHH
Q 005310 244 KVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCGGIN----LSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKV 319 (703)
Q Consensus 244 ~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~~~n----~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V 319 (703)
+++++ .+..+|++ +||++||++|+++.... .++ ..++..+++.+...|+.+||+++.+.++.++-. +.+.|
T Consensus 189 ~~~~~--~~~~ll~~-~Dr~~Rk~A~~a~~~~~-~~~~~t~a~~l~~~~~~~~~~a~~r~y~~~~~~~l~~~~i-~~~~~ 263 (598)
T COG1164 189 KLTLE--QLLNLLED-PDREVRKAAYEALYKAY-EKHRNTLAALLNTLVKVLAFLARARNYDDVIDSALLRNEV-DREVV 263 (598)
T ss_pred eecHH--HHHHHhhC-CCHHHHHHHHHHHHHHH-HHhhHHHHHHHHHHHHHHHHHHhhcCcchHHHHHHHhCCC-CHHHH
Confidence 77776 56666666 99999999999998766 443 456677778889999999999999999987654 67778
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccchhhhHHHH---HhhhcCCChhhhccCCChHHHHHHHHHHHh
Q 005310 320 FEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIEDLLYYVRRV---EELQFDLDFGAVKQYFPVNLVLSGIFKVFQ 396 (703)
Q Consensus 320 ~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD~~y~~~~~---~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~ 396 (703)
..+.+.+.+...|.+++.. +++++..|.+++.|||+. .+.. ....++++++ .+.|++++..
T Consensus 264 d~l~~~v~~~~~~l~~ry~----~~r~k~Lgl~~l~~yD~~--~p~~~~~~~~~~s~~ea-------~~~v~~~l~~--- 327 (598)
T COG1164 264 DNLIESVKEAFLPLLHRYY----KLRAKVLGLEKLRPYDLY--APLLDKDPSPEYSYEEA-------KELVLKALAP--- 327 (598)
T ss_pred HHHHHHHHHhcchHHHHHH----HHHHHHhCcccCCHHhhc--CCcccCCCCccccHHHH-------HHHHHHHHHh---
Confidence 8888888878999999855 466777899999999993 3311 1122333333 3445555533
Q ss_pred hhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccCCCcCCccccccCCCcccCCCCccccEEEEeccCCC
Q 005310 397 DLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQK 476 (703)
Q Consensus 397 ~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~ 476 (703)
|| +.+...++. .++ .+ |+|+++|+||++||||.+... | ..|.| ++||.+
T Consensus 328 --lg---------~ey~~~~~~--a~~--~~------WiD~~~~~gKrsGaYs~~~~~------~--~~p~I--lmN~~g 376 (598)
T COG1164 328 --LG---------PEYAKIARR--AFD--ER------WIDVYPRKGKRSGAYSIGFYK------G--DHPFI--LMNYDG 376 (598)
T ss_pred --hC---------HHHHHHHHH--HHh--cC------CeeccCCCCCCCCcccCCCCC------C--CCCeE--EEeCCC
Confidence 33 233333443 122 34 999999999999999998531 1 46865 569998
Q ss_pred CCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCCccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchH
Q 005310 477 DIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGLRVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKD 556 (703)
Q Consensus 477 ~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~ 556 (703)
+.+||.||+||+||++|+.+++...+.+.+ ..++.+||+||+++|.+.+++ +...+++ +....
T Consensus 377 ---------t~~dV~TLaHElGHs~Hs~~s~~~qp~~~~-~~~i~~AEiAS~fnE~l~~~~-----ll~~~~~--~~~~~ 439 (598)
T COG1164 377 ---------TLRDVFTLAHELGHSVHSYFSRKHQPYLYA-DYSIFLAEIASTFNEMLLFDY-----LLERFKD--PEERL 439 (598)
T ss_pred ---------chhHHHHHHHHccHHHHHHHHhccCCcccc-CCchHHHHHHHHHHHHHHHHH-----HHHHcCC--HHHHH
Confidence 999999999999999999999876553443 467999999999999999997 3333443 23344
Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCCc-------ccHHHHHHHHhcccccCC--CCCCC--CCCccccccc
Q 005310 557 EICKSLKRRRYAFSALKLKQEILYCIFDQVIHSADN-------VDIVELFKHLHPKVMLGL--PMLEG--TNPASCFPRS 625 (703)
Q Consensus 557 ~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~~-------~~~~~l~~~l~~~~~~~~--~~~~~--~~~~~~f~Hl 625 (703)
.++.+.+.. +. -.+.||++++.|++.+|...+ ..++++|.++.++ |.|. ...++ ..| .+++|+
T Consensus 440 ~il~~~l~~--~~--~t~~rq~~f~~FE~~~h~~~~~~~~lt~~~l~~~~~~l~~~-y~gd~~~~d~~~~~~W-~~ipHf 513 (598)
T COG1164 440 AILEEKLEG--FF--ATLFRQTLFAEFEHRVHELIEEGEELTAEELNELYLELQKE-YYGDAVKLDELSGLEW-ARIPHF 513 (598)
T ss_pred HHHHHHHHH--HH--HHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHH-hcCCccccCcccccch-hhcCee
Confidence 555554442 22 257999999999999998732 3889999999999 7774 33333 557 599999
Q ss_pred cCcccCCcccHHHHH--HHHHHHHHHHHhcCCCCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHHHH
Q 005310 626 AIGFESACYSRIWSE--VFAADIFACKFRDGLLNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFVDS 697 (703)
Q Consensus 626 ~~~Y~a~YYsYl~a~--vlA~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~ 697 (703)
+ .++||+|.||. ++|..++ .++..+. ++.-++|..++|+.|||.+|.|+++ -.|-+++..++++.
T Consensus 514 y---~~pFYvy~Ya~G~~~a~~l~-~~~~~~~--~~~~~~~y~~~Lk~GgS~~p~ell~-~~GiD~~~~~~~~~ 580 (598)
T COG1164 514 Y---HSPFYVYQYATGQLAALALY-AKILTND--AEAFEKYYIAFLKSGGSKSPLELLK-IAGIDLTTPDPWEE 580 (598)
T ss_pred e---ecCceehHhHHHHHHHHHHH-HHHHhcc--HHHHHHHHHHHHhCCCCCCHHHHHH-HcCCCCCCchHHHH
Confidence 5 33566555555 6666667 6666643 5555666568999999999999999 88999988787765
No 16
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc-dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor octapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=99.94 E-value=1.8e-24 Score=239.91 Aligned_cols=370 Identities=15% Similarity=0.129 Sum_probs=242.2
Q ss_pred cCccHHHHHhcCCCHHHHHHHHHHHhcccCCchHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHH
Q 005310 248 KSHHVAAVLELCKVGRTRRLVAVAYGKRCGGINLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEIS 327 (703)
Q Consensus 248 ~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~~~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~ 327 (703)
+.+.+.++|..++|++.|+.+|+++.....+.-.+++.++|++|++.|+.+||+||.+|. |...+++++.+|++.|.
T Consensus 32 ~~~~l~~i~~~s~D~~~r~~aW~~w~~~~g~~~~~~y~~lV~lrn~~A~~~Gf~n~~~~w---R~~Ye~~d~~~~~e~i~ 108 (477)
T cd06461 32 LEPDLESIMAKSRDYDELLYAWEGWRDAVGPPMRPLYERYVELANEAAKLNGFVDAGEYW---RSEYETPDFEQDVERLW 108 (477)
T ss_pred CHHHHHHHHHhCCCHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHcCCCCHHHHh---HhcCCHHHHHHHHHHHH
Confidence 457889999888999999999999875444667899999999999999999999999998 55668899999999999
Q ss_pred HHhHHHHHHHHHHHHHHHHHh----cC----C-CCCCccchh------hhHHHHHh-------hhcCCChhhhccCCChH
Q 005310 328 ASLTDLASRELMMLKDLKRKE----EG----E-LPFGIEDLL------YYVRRVEE-------LQFDLDFGAVKQYFPVN 385 (703)
Q Consensus 328 ~~~~p~~~~el~~L~~~k~~~----~g----~-~~l~pWD~~------y~~~~~~~-------~~~~~d~~~l~~yf~~~ 385 (703)
+.++|..++ |.+..++. .| + +.+-||++. +|.+...- ...++++.-++.-++.
T Consensus 109 ~~v~Ply~~----Lh~~~r~~L~~~y~~~~~d~~~~iPahl~gn~~~q~W~~~~~~~~p~p~~~~~dv~~~l~~~~~t~- 183 (477)
T cd06461 109 EELKPLYLQ----LHAYVRRKLRKKYGDDVVNRDGPIPAHLLGNMWAQSWSNIYDLVKPYPGKPLLDVTDAMVEQGYTA- 183 (477)
T ss_pred HHHHHHHHH----HHHHHHHHHHHhcccccCCCCCCCccccccCccccccccccccccCCCCCCCCChHHHHHHcCCCH-
Confidence 999998655 33333322 22 1 234455551 11111100 0011111112221111
Q ss_pred HHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecc---------cccCCCcCCccccc----c
Q 005310 386 LVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDI---------YKREGKYVHTCVVA----L 452 (703)
Q Consensus 386 ~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl---------~~R~gK~~ga~~~~----l 452 (703)
..|+..+++.|. .+| ..|+ ++||||+ |++|.+ +
T Consensus 184 ---~~mf~~~~~ff~----------------------------smg--L~dl~~~fw~~s~~~rp~k~-~~~C~~sawd~ 229 (477)
T cd06461 184 ---KRMFKEAEEFFT----------------------------SLG--LPPMPPSFWTKSMLEKPTDR-EVVCHASAWDF 229 (477)
T ss_pred ---HHHHHHHHHHHH----------------------------HCC--CCcCChHHHhhccccCCCCC-CCccCcccccC
Confidence 223333333331 111 4556 8999999 777776 4
Q ss_pred CCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhh-hcCCccccCCCccch--hhhhccHHH
Q 005310 453 QNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQIC-NRASFARFSGLRVDP--DFVEIPAQL 529 (703)
Q Consensus 453 ~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~ll-s~~~~~~~sg~~~~~--DfvE~pS~l 529 (703)
. ..+.|.+ .+|+.. +.+|+.|+.||+||+...+. .+.++-...|.+.+. .++|+.|+.
T Consensus 230 ~--------~~~d~rI--~~c~~~---------t~~D~~t~~HE~GH~~yy~~y~~~p~~~r~~anp~fheav~e~~smS 290 (477)
T cd06461 230 Y--------NGKDFRI--KMCTKV---------NMEDFVTVHHEMGHIQYYLQYKDQPVLFREGANPGFHEAVGDAIALS 290 (477)
T ss_pred C--------CCCCcce--eeCCCC---------CHHHHHHHHHHHHHHHHHHHhccCCHHHhCCCCCChHHHHHHHHHHh
Confidence 2 2335544 568887 99999999999999995434 334444456666667 889999999
Q ss_pred H---HHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhcCC-----cccHHHHHHH
Q 005310 530 L---ENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQEILYCIFDQVIHSAD-----NVDIVELFKH 601 (703)
Q Consensus 530 ~---E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~a~~D~~~H~~~-----~~~~~~l~~~ 601 (703)
| |++..-+ +|..+ .+.. ...| ..+..++.. .+.++ .+..++|...|... +.+.++.|++
T Consensus 291 ~~tpe~L~~~~-ll~~~--~~~~-~~~i-~~l~~~al~------~i~fL--P~~~~vD~fr~~vy~~~~~~~e~n~~Wwe 357 (477)
T cd06461 291 VSTPKHLHKIG-LLDSE--VDDE-EADI-NFLLKMALD------KIAFL--PFGYLLDKWRWDVFDGEIPKDDYNKAWWE 357 (477)
T ss_pred cCCHHHHhhcc-ccccc--CCCc-HHHH-HHHHHHHHH------HHHHh--hHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Confidence 9 9987422 22221 0111 1112 233332222 12222 55678888877543 3588999999
Q ss_pred HhcccccCCCCCCC---CC-CccccccccCcccCCcccHHHHHHHHHHHHHHHHhc-----CCC-------CHHHHHHHH
Q 005310 602 LHPKVMLGLPMLEG---TN-PASCFPRSAIGFESACYSRIWSEVFAADIFACKFRD-----GLL-------NQRVGMQFR 665 (703)
Q Consensus 602 l~~~~~~~~~~~~~---~~-~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i~~~~f~~-----~~~-------~~~~g~~~r 665 (703)
|.++ |.|+..|.. .. ....+-|+.. ..-|..|..|.+++-++| ..+.+ ||+ ++++|++|+
T Consensus 358 L~~~-yqgv~pP~~r~~~~fd~gak~h~~~--~~PYi~YflA~ilqfQf~-~~lc~~a~~~~plh~C~~~~s~~Ag~~l~ 433 (477)
T cd06461 358 LREK-YQGVVPPVPRSEEDFDPGAKYHVPA--NTPYIRYFLSFILQFQFH-KALCKAAGHTGPLHKCDIYGSKEAGKKLR 433 (477)
T ss_pred HHHH-hcCCCCCCCCCccccCccceeeccC--CCcHHHHHHHHHHHHHHH-HHHHHhccCCCCcccccccChHHHHHHHH
Confidence 9999 888632221 11 1233456642 333889999999999999 66532 443 478999997
Q ss_pred HHhccCCCCccHHHHHHHhhCC-CCChHHHHH
Q 005310 666 NNVLAPGGSKEPIEILSDFLGR-EPSIQAFVD 696 (703)
Q Consensus 666 ~~vL~~Ggs~~~~ell~~flGr-~p~~~a~l~ 696 (703)
++|+.|||++..|+++.+.|. +.+..++++
T Consensus 434 -~~l~lG~S~~w~e~le~~~g~~~~s~~~l~e 464 (477)
T cd06461 434 -AMLSLGSSKPWPEALEALTGEREMDASALLE 464 (477)
T ss_pred -HHHhCcCCCCHHHHHHHhcCCCCCChHHHHH
Confidence 899999999999999999876 667777766
No 17
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=99.86 E-value=7.4e-19 Score=190.22 Aligned_cols=333 Identities=18% Similarity=0.183 Sum_probs=210.3
Q ss_pred HHHHHHHHHHHHHHHHHcCC-CCHHHhHhhh-hccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcC-CCCCCcc
Q 005310 281 LSVLESLVELRHKFARLLGY-SNYADYALDL-RMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEG-ELPFGIE 357 (703)
Q Consensus 281 ~~~l~~ll~lR~~~A~llGy-~sya~~~l~~-~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g-~~~l~pW 357 (703)
.+.|++++.+..+.|..+|+ .+--|.-|.. ..--|.+.+..+..+|...+.|.+++ + ..+... ...+
T Consensus 30 ~p~L~~~v~l~re~a~~~~~~~~~YdaLLd~ye~g~~~~~id~~F~~lk~~l~~l~~~----v---~~~~~~~~~~~--- 99 (396)
T cd06460 30 APYLEKIVELSREIAEYLGYEKHPYDALLDLYEPGMTTAKLDAIFAELKAGLVPLLKK----V---LEKQKPPDDSL--- 99 (396)
T ss_pred HHHHHHHHHHHHHHHHHhcCCCChHHHHHHhhCCCCCHHHHHHHHHHHHHHHHHHHHH----H---HhcCCCCCccc---
Confidence 58899999999999999998 4444544442 11127899999999999888887655 2 111111 1111
Q ss_pred chhhhHHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecc
Q 005310 358 DLLYYVRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDI 437 (703)
Q Consensus 358 D~~y~~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl 437 (703)
+..-||.+.-.. +..-+-+.+|..| + .| |+|.
T Consensus 100 --------------------~~~~~~~~~Q~~-~~~~~~~~~g~df-------------------~--~g------riD~ 131 (396)
T cd06460 100 --------------------LSGPFPEEKQEA-LGRELLEALGFDF-------------------D--RG------RLDV 131 (396)
T ss_pred --------------------cCCCCCHHHHHH-HHHHHHHHhCCcc-------------------c--CC------eeec
Confidence 011122222111 1111112345322 1 34 7998
Q ss_pred cccCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHh-hhcC-CccccCC
Q 005310 438 YKREGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQI-CNRA-SFARFSG 515 (703)
Q Consensus 438 ~~R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~l-ls~~-~~~~~sg 515 (703)
.++| ||.++.+ .. +.+++||.+.. -.+.|.|+.||+|||+|.. +... ..+.+ |
T Consensus 132 s~hp------F~~~~~~---------~d--vRItt~y~~~d-------~~~~l~t~iHE~GHalye~~l~~~~~~~pl-~ 186 (396)
T cd06460 132 SAHP------FTGGLGP---------GD--VRITTRYDEND-------FRSALFSTIHETGHALYEQGLPPELRGTPL-G 186 (396)
T ss_pred CCCC------CCCCCCC---------CC--ceEEeeeCCcc-------hHHHHHHHHHHhhHHHHHhcCCHhHccCCc-c
Confidence 8765 5554311 12 35678999721 2789999999999999999 6521 11222 2
Q ss_pred CccchhhhhccHHHHHHhHhcH-----HHHHHhhcccCCCCccchHHHHHHHHHH-H-hHH-----HHHHHHHHHHHHHH
Q 005310 516 LRVDPDFVEIPAQLLENWCYES-----FSLKLISGFHQDITTPIKDEICKSLKRR-R-YAF-----SALKLKQEILYCIF 583 (703)
Q Consensus 516 ~~~~~DfvE~pS~l~E~~~~~~-----~~L~~l~~h~~t~ge~ip~~l~~~l~~~-~-~~~-----~~~~~~~ql~~a~~ 583 (703)
...++-+.|..|.|+|+.+... ..+..+..++....+....+-+-.+... + .+. .....+++++...|
T Consensus 187 ~~~sm~ihESqS~f~E~~vgrs~~F~~~l~~~l~~~f~~~~~~~~~e~~~~~~n~v~ps~iRveADEvty~lHiilR~e~ 266 (396)
T cd06460 187 GGASMGIHESQSRLWENQVGRSRAFWEFLYPKLKKTFPEQLKDVSLENFYRAVNRVQPSLIRVEADEVTYNLHIMLRYEL 266 (396)
T ss_pred cCCCchhHHHHHHHHHHHHhcCHHHHHHHHHHHHHHcccccccCCHHHHHHHHhhcCccceeeecchhhHHHHHHHHHHH
Confidence 3358899999999999988662 3444454444321122222223222211 0 111 11246788999999
Q ss_pred HHHHhcCCc--ccHHHHHHHHhcccccCCCCCCC-CCCccccccccCcccCCcccHHHHHHHHHHHHHHHHhc-CC-CCH
Q 005310 584 DQVIHSADN--VDIVELFKHLHPKVMLGLPMLEG-TNPASCFPRSAIGFESACYSRIWSEVFAADIFACKFRD-GL-LNQ 658 (703)
Q Consensus 584 D~~~H~~~~--~~~~~l~~~l~~~~~~~~~~~~~-~~~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i~~~~f~~-~~-~~~ 658 (703)
+..+....- .|+.++|++..++ |.|+..+.. ..| .+-+|...|.-+.||+|..+.++|+++| ...++ .+ +..
T Consensus 267 Ek~l~~G~l~v~dLp~~W~~~~~e-ylG~~~~~d~~g~-lQd~HW~~g~fGYfptYalG~l~Aaql~-~~~~~~~~~~~~ 343 (396)
T cd06460 267 EKALIEGDLEVADLPEAWNEKMKE-YLGIRPPNDAEGC-LQDIHWSGGSFGYFPTYTLGNLYAAQLF-AAAKKDLPDIDE 343 (396)
T ss_pred HHHHHcCCCCHHHHHHHHHHHHHH-HcCCCCCCCcccc-ccccccccCcCCcCchhHHHHHHHHHHH-HHHHHhCccHHH
Confidence 998887532 4899999999999 888755433 345 3567775332234789999999999999 55544 22 222
Q ss_pred H--------HHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHHHHhh
Q 005310 659 R--------VGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFVDSRA 699 (703)
Q Consensus 659 ~--------~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~~ 699 (703)
. ..+-++++|++.||+.+|.|++++.+|.++|+++|++...
T Consensus 344 ~i~~g~f~~~~~wL~e~i~~~G~~~~p~eLl~~atG~~l~~~~f~~yL~ 392 (396)
T cd06460 344 QIERGDFSPLLEWLRENIHQHGSRYSPDELLKKATGEPLNPEYFLEYLE 392 (396)
T ss_pred HHhccChHHHHHHHHHHHHhccCCCCHHHHHHHHhCCCCChHHHHHHHH
Confidence 2 5566778899999999999999999999999999998743
No 18
>PF01401 Peptidase_M2: Angiotensin-converting enzyme This Prosite motif covers only the active site.; InterPro: IPR001548 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M2 (clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The catalytic residues and zinc ligands have been identified, the zinc ion being ligated to two His residues within the motif HEXXH, showing that the enzyme belongs to the E sub-group of metalloproteases. Peptidyl-dipeptidase A (angiotensin-converting enzyme) is a mammalian enzyme responsible for cleavage of dipeptides from the C-termini of proteins, notably converting angiotensin I to angiotensin II. The enzyme exists in two differentially transcribed forms, the most common of which is from lung endothelium; this contains two homologous domains that have arisen by gene duplication. The testis-specific form contains only the C-terminal domain, arising from a duplicated promoter region present in intron 12 of the gene. Both enzymatic forms are membrane proteins that are anchored by means of a C-terminal transmembrane domain. Both domains of the endothelial enzyme are active, but have differing kinetic constants.
A number of insect enzymes have been shown to be similar to peptidyl-dipeptidase A, these containing a single catalytic domain.; GO: 0008237 metallopeptidase activity, 0008241 peptidyl-dipeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 2YDM_A 3BKL_A 2C6N_B 1UZE_A 3BKK_A 2C6F_B 2IUX_A 2IUL_A 2XYD_B 3NXQ_B ....
Probab=99.59 E-value=3.8e-12 Score=145.18 Aligned_cols=495 Identities=16% Similarity=0.186 Sum_probs=279.2
Q ss_pred HHHHhhhhhhcccccccccccccCCcHHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCCCC-CHHHHHHHHHHH
Q 005310 96 SPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEAERKIDAHMLSCSNREDVYRVVKAVAATGEWV-SPEAKRYIQSLV 174 (703)
Q Consensus 96 ~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~~L-~~e~~r~l~~~~ 174 (703)
.-++.-...+|.....+.+-..+.-+++-.....++.....+|..+... ..+.+. ...+ ++..+|.++.+.
T Consensus 21 ~~yn~ea~~v~~~~a~A~W~y~TNITdeN~~~~~ea~~~~a~F~k~~~~------~AkkFd--~~~~~d~~lkRql~~L~ 92 (595)
T PF01401_consen 21 EEYNREAEQVYYQSAEAQWNYNTNITDENQQLMAEASLEYAKFQKEYAK------EAKKFD--WTKFKDDTLKRQLKKLS 92 (595)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHTS-SHHHHHHHHHHHHHHHHHHHHHHH------HHTTSH--GGGGSSHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHhhhhHhhcCCChHHHHHHHHHHHHHHHHHHHHHH------HHHhcC--cccCCCHHHHHHHHHHh
Confidence 3333333344444444455455554555566666667777777643322 111121 1233 344455444432
Q ss_pred HHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHhhccccccceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHH
Q 005310 175 RDFERSGLNLTISKREEVQRLRDQIDELSLQYVRNLNDDKTFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAA 254 (703)
Q Consensus 175 ~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~~n~~~~~~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~ 254 (703)
. .....|++++.+++.++..++.. .|+..- +.. .+|+ .+.+ .|++..
T Consensus 93 ~---lG~AaL~~~k~~el~~i~s~M~~---iYst~k------vC~-------------------~~~~-cl~L-ePdl~~ 139 (595)
T PF01401_consen 93 K---LGPAALPPEKLEELNKILSEMES---IYSTAK------VCP-------------------YDGK-CLSL-EPDLEE 139 (595)
T ss_dssp S----GGGGSTHHHHHHHHHHHHHHHH---HHHH-E------EEE-------------------SSSE-EEET-TTHHHH
T ss_pred h---cccccCCHHHHHHHHHHHHHHHH---Hhcccc------ccC-------------------CCCc-cccc-hhhHHH
Confidence 2 22356788888888888665544 455221 110 0233 4555 478999
Q ss_pred HHhcCCCHHHHHHHHHHHhcccCCchHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHH
Q 005310 255 VLELCKVGRTRRLVAVAYGKRCGGINLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLA 334 (703)
Q Consensus 255 ~l~~~~d~~~Rk~~~~a~~~r~~~~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~ 334 (703)
+|..+.|.+.++-+|.+|.+..++...+.+.++|++-++-|++.||++..+|-... - ..++..+.++.|.+.++|.-
T Consensus 140 ima~Srd~~eL~~~W~~Wr~~vg~~~r~~y~~~V~L~N~aA~~nG~~d~g~~Wr~~-y--e~~~~~~~~~~lw~~i~PLY 216 (595)
T PF01401_consen 140 IMATSRDYDELLYAWEGWRDAVGPPMRPLYERYVELSNEAARLNGFKDTGEYWRSS-Y--EMPNFEQELERLWQQIKPLY 216 (595)
T ss_dssp HHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSSSHHHHHHGG-G---CTTHHHHHHHHHHHHHHHH
T ss_pred HHhccCCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHH-h--ccccHHHHHHHHHHHHHHHH
Confidence 99999999999999999987544678899999999999999999999999996543 1 12357788999999999965
Q ss_pred HHHHHHHHHHHHHh----cCC--------------CCCCccchhhhHHHHHh----hhcCCChhhhccCCChHHHHHHHH
Q 005310 335 SRELMMLKDLKRKE----EGE--------------LPFGIEDLLYYVRRVEE----LQFDLDFGAVKQYFPVNLVLSGIF 392 (703)
Q Consensus 335 ~~el~~L~~~k~~~----~g~--------------~~l~pWD~~y~~~~~~~----~~~~~d~~~l~~yf~~~~vl~gl~ 392 (703)
++ |..+.+.. -|. ..+...|+.=....+.. ..+++..+-+++-+ ....|+
T Consensus 217 ~~----LHayVR~~L~~~Yg~~~v~~~gpIPAHLLGnmwaq~W~ni~~~~~P~p~~~~~dvt~~l~~q~~----t~~~mf 288 (595)
T PF01401_consen 217 KQ----LHAYVRRKLREKYGDDVVPPDGPIPAHLLGNMWAQSWSNIYDLVLPYPDKPSLDVTEELVKQGY----TAKKMF 288 (595)
T ss_dssp HH----HHHHHHHHHHHHHTTTTS-TTS-EEGGGSSSTTSS-GGGGHHHH-STTTS--S-THHHHHHTT------HHHHH
T ss_pred HH----HHHHHHHHHHHHcCcccCCCCCCccHHHHHHHHHHHHhhhhccccCCCCCCCCCcHHHHHHccc----cHHHHH
Confidence 54 32222211 121 12333333222222211 11233333233322 235677
Q ss_pred HHHhhhhc-ceEEEecccccccccceEEEEEecCCCceeeeeeecccccC--CC--cCCccccccCCCcccCCCCccccE
Q 005310 393 KVFQDLFG-LRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKRE--GK--YVHTCVVALQNGALSSTGARQIPV 467 (703)
Q Consensus 393 ~l~~~lfg-l~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~--gK--~~ga~~~~l~~~~~~~~g~~q~P~ 467 (703)
...+++|. +.|.++++ +-|..++- .|| |+ ..|+..+.+.. +. -+
T Consensus 289 ~~ae~ff~SlGl~~mp~-~FW~~S~~---------------------~kp~d~r~~~Chasawdf~~------~~---d~ 337 (595)
T PF01401_consen 289 KTAEEFFTSLGLPPMPP-TFWEKSMF---------------------EKPTDGRDVVCHASAWDFYN------GD---DF 337 (595)
T ss_dssp HHHHHHHHHTTS----H-HHHHHSB----------------------S--SSSS----S-EEEEESS------SS---EE
T ss_pred HHHHHHHHhCCCCcCCH-HHHHHHHh---------------------cCCCCCCCCCcCcchhhccC------CC---cc
Confidence 77777654 55555542 45644332 121 21 11211112111 10 11
Q ss_pred EEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcC-CccccCCCccchhhhhccHHHHHHhHhcHHHHHHhhcc
Q 005310 468 ALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRA-SFARFSGLRVDPDFVEIPAQLLENWCYESFSLKLISGF 546 (703)
Q Consensus 468 ~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~-~~~~~sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h 546 (703)
-+..|. -.+.++..|..||+||..+.+.-+. ++..-.| .-.-|-|.-+.++-.-+..|..|.++.--
T Consensus 338 Rik~Ct----------~v~~~df~t~hhemghi~Y~~~y~~qp~~fr~g--an~gfhEAigd~ials~~tp~hL~~~gLl 405 (595)
T PF01401_consen 338 RIKMCT----------EVTMEDFLTAHHEMGHIQYYMQYKDQPVLFREG--ANPGFHEAIGDTIALSVSTPKHLKRIGLL 405 (595)
T ss_dssp EEE--------------SSHHHHHHHHHHHHHHHHHHHTTTS-GGGSS---SSHHHHHHHHHHHHHHHTSHHHHHHTTSS
T ss_pred ceeecc----------cCCHHHHHHHHHHHHHHHHHHHHhhCChhhhcC--CCchHHHHHHHHHHHHcCCHHHHHHcCCC
Confidence 133453 2489999999999999999876542 2222233 34788999999999999999999887543
Q ss_pred cCCCCccchHHHHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHhcC-----CcccHHHHHHHHhcccccCCC--CCCC-CC
Q 005310 547 HQDITTPIKDEICKSLKRRRYAFSALKLKQEILY-CIFDQVIHSA-----DNVDIVELFKHLHPKVMLGLP--MLEG-TN 617 (703)
Q Consensus 547 ~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ql~~-a~~D~~~H~~-----~~~~~~~l~~~l~~~~~~~~~--~~~~-~~ 617 (703)
.+ ...-.+.-++.|. ..|+..+-.|-| -++|+.=... .+.+.++-|.++..+ |.|+. .+.. +.
T Consensus 406 -~~-~~~~~~~~in~L~-----~~AL~ki~~lPf~~~~d~wRw~vf~g~i~~~~~n~~wW~lr~~-yqGi~pP~~r~~~~ 477 (595)
T PF01401_consen 406 -DN-VTDDEESDINFLL-----KQALDKIAFLPFGYLMDKWRWDVFSGSIPPDQYNEAWWELREK-YQGIVPPVERSESD 477 (595)
T ss_dssp -ST-TTTSHHHHHHHHH-----HHHHHHCCHHHHHHHHHHHHHHHHTTSS-CCGHHHHHHHHHHH-HHSEE-SS-EETTS
T ss_pred -CC-CCcchHHHHHHHH-----HHhhhHHHhccHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH-cCCccCCCCCCccc
Confidence 21 1112333444443 234443333333 3445544321 235899999999999 99963 2211 22
Q ss_pred C-ccccccccCcccCCcccHHHHHHHHHHHHHHHHhc-----CCC-------CHHHHHHHHHHhccCCCCccHHHHHHHh
Q 005310 618 P-ASCFPRSAIGFESACYSRIWSEVFAADIFACKFRD-----GLL-------NQRVGMQFRNNVLAPGGSKEPIEILSDF 684 (703)
Q Consensus 618 ~-~~~f~Hl~~~Y~a~YYsYl~a~vlA~~i~~~~f~~-----~~~-------~~~~g~~~r~~vL~~Ggs~~~~ell~~f 684 (703)
+ +....|+.. ...|..|-.|.++--+++ +.+.+ ||+ ++++|++++ ++|+.|+|++..+.++.+
T Consensus 478 fD~~ak~hi~~--~~py~ryfls~ilqFQ~~-~~lC~~ag~~~pLh~Cdiy~s~~aG~~l~-~ml~~G~S~~W~~~l~~~ 553 (595)
T PF01401_consen 478 FDPGAKYHIPA--NTPYIRYFLSTILQFQFY-KALCEAAGHEGPLHRCDIYGSKEAGRKLK-KMLSLGSSKPWPEALEEL 553 (595)
T ss_dssp -GGGGSHHHHT--T--SHHHHHHHHHHHHHH-HHHHHHTTTTSSGGG---TT-HHHHHHHH-HHHTTTTSS-HHHHHHHH
T ss_pred CCcchhhhhcc--ccccHHHHHHHHHHHHHH-HHHHHhcCCCCCccccccccchhhHHHHH-HHhcCCCCCCHHHHHHHH
Confidence 2 234458752 335899999999999998 54432 442 489999997 799999999999999999
Q ss_pred hC-CCCChHHHHHH
Q 005310 685 LG-REPSIQAFVDS 697 (703)
Q Consensus 685 lG-r~p~~~a~l~~ 697 (703)
+| ++.|.+++++-
T Consensus 554 tg~~~~~a~~ll~Y 567 (595)
T PF01401_consen 554 TGEREMSASALLEY 567 (595)
T ss_dssp HSSSS--SHHHHHH
T ss_pred hCCCCcChHHHHHH
Confidence 99 58888888764
No 19
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 degrees Celsius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=99.44 E-value=1.6e-09 Score=121.08 Aligned_cols=416 Identities=17% Similarity=0.210 Sum_probs=231.0
Q ss_pred HHHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q 005310 128 SAEAERKIDAHMLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYV 207 (703)
Q Consensus 128 a~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~ 207 (703)
=.++...|+....++.+++.+-+.|..+....+.|++.+++.++.+.++++++ ..+|.+.-+++.++..+-.
T Consensus 43 Raeqla~Ls~~~hel~T~~~~~elL~~l~~~~~~l~~~e~a~lr~~~r~~~~~-~~iP~elv~~~~~~~s~a~------- 114 (494)
T PF02074_consen 43 RAEQLATLSGLIHELLTSPEIGELLEELEEEKEDLDEWERANLREMRRDYERA-TKIPEELVEELARLTSEAE------- 114 (494)
T ss_dssp HHHHHHHHHHHHHHHHTSHHHHHHHHHHHCSGSSTTHHHHHHHHHHHHHHHHH-HCS-HHHHHHHHHHHHHHH-------
T ss_pred HHHHHHHHHHHHHHHHcCHHHHHHHHHHhccccCCCHHHHHHHHHHHHHHHHH-hCCCHHHHHHHHHHHHHHH-------
Confidence 34666778889999999999999999887654678888888888888888775 3344443333333332221
Q ss_pred hhccccccceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC-CchHHHHHH
Q 005310 208 RNLNDDKTFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG-GINLSVLES 286 (703)
Q Consensus 208 ~n~~~~~~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~-~~n~~~l~~ 286 (703)
.+|..-.+... ..-.+.|++
T Consensus 115 -----------------------------------------------------------~~W~~AR~~nDf~~F~P~Le~ 135 (494)
T PF02074_consen 115 -----------------------------------------------------------QAWEEARENNDFSAFAPYLEK 135 (494)
T ss_dssp -----------------------------------------------------------HHHHHHHHCT-HHHHHHHHHH
T ss_pred -----------------------------------------------------------HHHHHHHhhCCHHHHHHHHHH
Confidence 22322111110 122578999
Q ss_pred HHHHHHHHHHHcCCCC-HHHhHhhh---hccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCCCccchhhh
Q 005310 287 LVELRHKFARLLGYSN-YADYALDL---RMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPFGIEDLLYY 362 (703)
Q Consensus 287 ll~lR~~~A~llGy~s-ya~~~l~~---~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l~pWD~~y~ 362 (703)
++.++.++|..+|++. --|.-|.. .| |.+.+..+..+|...+.|.+++ + ..+ +.. | |.
T Consensus 136 iv~l~re~a~~~~~~~~~YDaLLd~yEpg~--t~~~ld~~F~~lk~~l~~l~~~----i---~~~--~~~---~-~~--- 197 (494)
T PF02074_consen 136 IVELQREIAEYLGYELSPYDALLDDYEPGM--TTEKLDEIFAELKAFLVPLLQK----I---LEK--QKQ---P-DD--- 197 (494)
T ss_dssp HHHHHHHHHHHCTSTTSHHHHHHHHHSTT----HHHHHHHHHHHHHHHHHHHHH----H---HCH--TCC---H-TG---
T ss_pred HHHHHHHHHHHhcCCCCcHHHHHhhhCCCC--CHHHHHHHHHHHHHHHHHHHHH----H---Hhc--CCC---C-CC---
Confidence 9999999999999885 44544442 23 7899999999999999997655 2 111 100 0 00
Q ss_pred HHHHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccCC
Q 005310 363 VRRVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKREG 442 (703)
Q Consensus 363 ~~~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~g 442 (703)
..+..-||.+.-.+-...++ +.+|..| +.| .+|.
T Consensus 198 -------------~~l~~~~~~~~Q~~~~~~~~-~~~G~d~---------------------~~g------rld~----- 231 (494)
T PF02074_consen 198 -------------SFLHGPFPEEKQKAFSRELL-EYLGFDF---------------------DRG------RLDE----- 231 (494)
T ss_dssp -------------GGGGSB--HHHHHHHHHHHH-HHHT--G---------------------CGE------EEEE-----
T ss_pred -------------ccCCCCCCHHHHHHHHHHHH-HHcCCCc---------------------cce------EEec-----
Confidence 01111122222111111222 3446432 122 3453
Q ss_pred CcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCcc-ccCCCccchh
Q 005310 443 KYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFA-RFSGLRVDPD 521 (703)
Q Consensus 443 K~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~-~~sg~~~~~D 521 (703)
..|.+|.++-++ . +-+..++.+.. =.+.+....||.||||...-....+. .--|..+.+-
T Consensus 232 -s~HPFt~~~~~~------D-----vRiTTry~e~d-------~~~~l~s~iHE~GHalYEq~~~~~~~~tpl~~~~smg 292 (494)
T PF02074_consen 232 -SAHPFTTGFGPN------D-----VRITTRYDEDD-------FLSALFSTIHETGHALYEQGLPPEYYGTPLGEGVSMG 292 (494)
T ss_dssp --SS-EEEEEETT------E-----EEEEE--BTTB-------THHHHHHHHHHHHHHHHHHTS-GGGTTSTTSS-S-HH
T ss_pred -CCCCCCCCCCCC------c-----eeeeccccccc-------HHHHHHHHHHHHHHHHHHcCCChhHccCccccccccc
Confidence 235555554211 1 34455666522 24678888899999998875543221 1122336788
Q ss_pred hhhccHHHHHHhHh-cHHHHHH----hhcccCCCCccch-HHHHHHHHHHHh-HH-----HHHHHHHHHHHHHHHHHHhc
Q 005310 522 FVEIPAQLLENWCY-ESFSLKL----ISGFHQDITTPIK-DEICKSLKRRRY-AF-----SALKLKQEILYCIFDQVIHS 589 (703)
Q Consensus 522 fvE~pS~l~E~~~~-~~~~L~~----l~~h~~t~ge~ip-~~l~~~l~~~~~-~~-----~~~~~~~ql~~a~~D~~~H~ 589 (703)
+-|--|-|+||.+. ...+... +-.++.. ...++ +++...+-..+. +. .....+.-++.-.++..+-.
T Consensus 293 iHESQSrf~En~vgrS~~Fw~~~~p~l~~~f~~-~~~v~~e~~y~~~N~V~ps~IR~eADElTY~lHIilRyEiEk~li~ 371 (494)
T PF02074_consen 293 IHESQSRFWENMVGRSRAFWEFLYPKLQEHFPQ-LKDVSLENFYRAINRVKPSLIRVEADELTYPLHIILRYEIEKALIN 371 (494)
T ss_dssp HHHHHHHHHHTTTTTSHHHHHHHHHHHHHHCGG-GTT-SHHHHHHHHT-----S-GGG--TTTHHHHHHHHHHHHHHHHT
T ss_pred hHHHHHHHHHHHHccCHHHHHHHHHHHHHHhCc-ccCCCHHHHHHHHccCCCcccchhhHHHHHHHHHHHHHHHHHHHHc
Confidence 88999999999543 2332222 2223322 22233 232222211110 00 00112334555566666655
Q ss_pred CC--cccHHHHHHHHhcccccCCCCCCCCCCccccccccCcccCCcc-cHHHHHHHHHHHHHHHHhc-----------CC
Q 005310 590 AD--NVDIVELFKHLHPKVMLGLPMLEGTNPASCFPRSAIGFESACY-SRIWSEVFAADIFACKFRD-----------GL 655 (703)
Q Consensus 590 ~~--~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~~~Y~a~YY-sYl~a~vlA~~i~~~~f~~-----------~~ 655 (703)
.. ..|+-++|++..++ |.|+..+....-...=.|=.+|+= ||+ +|..+.++|+++| ...++ |.
T Consensus 372 g~l~v~dLP~~Wn~km~e-yLGi~p~~d~eG~LQDvHWs~G~f-GYFPtY~LG~~~AaQl~-~~~~~~~pd~~~~i~~G~ 448 (494)
T PF02074_consen 372 GELSVDDLPEAWNEKMEE-YLGITPPNDAEGVLQDVHWSSGSF-GYFPTYALGNIYAAQLF-AAMKKDIPDLDEQIAKGD 448 (494)
T ss_dssp TSS-GGGHHHHHHHHHHH-HHS---SSCTTTTTS-STTTTT-T-S-THHHHHHHHHHHHHH-HHHHHHHTTHHHHHHTT-
T ss_pred CCCChhhhHHHHHHHHHH-HcCCCCCCCCccccccccccCCcc-ccchhhHHHHHHHHHHH-HHHHHHccCHHHHHHcCC
Confidence 43 35899999999999 888743332111112234332321 355 9999999999999 44432 22
Q ss_pred CCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHHHHh
Q 005310 656 LNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 656 ~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
+ .....=++++|-+.|+..+|.|++++..|.++|++.|++-.
T Consensus 449 f-~~i~~WL~e~Ih~~G~~y~~~eLl~~aTGe~l~~~~~~~YL 490 (494)
T PF02074_consen 449 F-SPIRDWLRENIHQYGSRYTPDELLKKATGEPLNPEYFIDYL 490 (494)
T ss_dssp --HHHHHHHHHHTGGGTTSS-HHHHHHHHHSSS--THHHHHHH
T ss_pred h-HHHHHHHHHHHHHhccCCCHHHHHHHHhCCCCCHHHHHHHH
Confidence 2 34666678899999999999999999999999999999864
No 20
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=99.28 E-value=3.9e-08 Score=105.90 Aligned_cols=411 Identities=17% Similarity=0.186 Sum_probs=234.6
Q ss_pred HHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHh
Q 005310 129 AEAERKIDAHMLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYVR 208 (703)
Q Consensus 129 ~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~~ 208 (703)
.++...|+.....+.+++.+-+.|.++.++. .|++.++.+++.+.++++.+ ..+|++.-+++..+..+-..
T Consensus 44 s~a~a~Ls~l~~~l~t~~e~~~ll~~a~~e~-~L~e~~~~~vre~~r~~~~~-~~iP~e~~~e~s~~~s~a~~------- 114 (497)
T COG2317 44 SEAIAELSVLRHELLTSPEFAELLEKAEEEK-DLSEIEAGVVRELKREYEKA-KKIPEELVKEYSKLTSKAEH------- 114 (497)
T ss_pred HHHHHHHHHHHHHHhccHHHHHHHHHhhhcc-CccHHHHHHHHHHHHHHHHH-hcCCHHHHHHHHHHHHHHHH-------
Confidence 4566778888889999999988888886543 78999988888888888875 45555554444444333221
Q ss_pred hccccccceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC-CchHHHHHHH
Q 005310 209 NLNDDKTFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG-GINLSVLESL 287 (703)
Q Consensus 209 n~~~~~~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~-~~n~~~l~~l 287 (703)
+|+.-..... .--.+-|++|
T Consensus 115 -----------------------------------------------------------aWreAr~knDf~~F~p~Leki 135 (497)
T COG2317 115 -----------------------------------------------------------AWREAREKNDFSIFKPYLEKI 135 (497)
T ss_pred -----------------------------------------------------------HHHHHhhcccHhhhhHHHHHH
Confidence 1111000000 1124678899
Q ss_pred HHHHHHHHHHcCCCCHHHhHhhhhc--cCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcCCCCC-CccchhhhHH
Q 005310 288 VELRHKFARLLGYSNYADYALDLRM--AKSSSKVFEFLEEISASLTDLASRELMMLKDLKRKEEGELPF-GIEDLLYYVR 364 (703)
Q Consensus 288 l~lR~~~A~llGy~sya~~~l~~~m--a~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~~~g~~~l-~pWD~~y~~~ 364 (703)
+++-.+.|..+||.-..-=+|-+.- --+.++|..++.+|.+.+.|.+.+ +.+..+......-. .++|. +
T Consensus 136 v~l~re~A~~~~~~~~pYdaLld~yEpG~t~~~i~~vF~~Lk~~L~~ll~k----v~~~~~~~~~~~~~~~~~~~----e 207 (497)
T COG2317 136 VELKREFAEYRGYEEHPYDALLDLYEPGLTVRDVDRVFAELKKELVPLLDK----VLEKGKSPRSDLLEKEKYDK----E 207 (497)
T ss_pred HHHHHHHHHhcccccCcHHHHHHhhcCCCcHHHHHHHHHHHHHHHHHHHHH----HHhccCCCcccccccCCCCH----H
Confidence 9999999999999884333333322 127889999999999999987666 22111000000000 12222 1
Q ss_pred HHHhhhcCCChhhhccCCChHHHHHHHHHHHhhhhcceEEEecccccccccceEEEEEecCCCceeeeeeecccccCCCc
Q 005310 365 RVEELQFDLDFGAVKQYFPVNLVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVFSVFDLSSNDLLGYFYLDIYKREGKY 444 (703)
Q Consensus 365 ~~~~~~~~~d~~~l~~yf~~~~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~~v~d~~~g~~lG~~ylDl~~R~gK~ 444 (703)
++ ..+-..++ +++|..|. .|+ +|. .
T Consensus 208 ~q------------------~~~~~~vl----~~lGfdf~---------------------~GR------lD~------S 232 (497)
T COG2317 208 KQ------------------EALGLRVL----ELLGFDFE---------------------RGR------LDV------S 232 (497)
T ss_pred HH------------------HHHHHHHH----HHhCCCcc---------------------Ccc------ccC------C
Confidence 11 11111222 34564321 232 553 3
Q ss_pred CCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhcCCccccCCC----ccch
Q 005310 445 VHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNRASFARFSGL----RVDP 520 (703)
Q Consensus 445 ~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~~~~~~~sg~----~~~~ 520 (703)
.|.+|.++..+ . +-+..++.+.. =.+-+....||+|||+...-... .+.|| ...+
T Consensus 233 ~HPF~~g~~~~------D-----VRITTRy~~~d-------f~~aL~g~iHE~GHAlYEqn~~~---~l~gtPlg~g~sm 291 (497)
T COG2317 233 VHPFTTGLPIN------D-----VRITTRYNEQD-------FRSALFGTIHETGHALYEQNLDE---ALLGTPLGEGRSM 291 (497)
T ss_pred CCCccCCCCCC------c-----eeEEeecCCcc-------HHHHHHHHHHhhhhHHHHcCCCH---HHcCCcccCCccc
Confidence 46677776321 1 34556776532 23567888999999998764432 23343 3567
Q ss_pred hhhhccHHHHHHhHhc-HHHHHHhhc----ccCCCCccchHHHHHHHHHHHhHHH-------H--H-HHHHHHHHHHHHH
Q 005310 521 DFVEIPAQLLENWCYE-SFSLKLISG----FHQDITTPIKDEICKSLKRRRYAFS-------A--L-KLKQEILYCIFDQ 585 (703)
Q Consensus 521 DfvE~pS~l~E~~~~~-~~~L~~l~~----h~~t~ge~ip~~l~~~l~~~~~~~~-------~--~-~~~~ql~~a~~D~ 585 (703)
-+-|--|-|+||.+-. +.+.+.+.. |+ ++.+++.-.+.+...-|... | + .-+.-++.-.++.
T Consensus 292 giHESQSrF~En~VGRs~aFw~~~~~~~~~~~---~~~~~~~s~dd~y~~vn~v~~~lIRveADElTY~lHIilRyEiEk 368 (497)
T COG2317 292 GIHESQSRFWENQVGRSRAFWEAIYPLLRKHF---PEQFDKYSLDDFYRAVNRVEPSLIRVEADELTYPLHIILRYEIEK 368 (497)
T ss_pred ccccHHHHHHHHHhccCHHHHHHHHHHHHHhh---hhhcccCCHHHHHHHHhcccCcceeeeccccceeeeeehhHHHHH
Confidence 8889999999998653 333333221 22 11111111222211100000 0 0 0011112222333
Q ss_pred HHhcCC--cccHHHHHHHHhcccccCCCCCCCCCCcccccccc-CcccCCcc-cHHHHHHHHHHHHHHHHh---------
Q 005310 586 VIHSAD--NVDIVELFKHLHPKVMLGLPMLEGTNPASCFPRSA-IGFESACY-SRIWSEVFAADIFACKFR--------- 652 (703)
Q Consensus 586 ~~H~~~--~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~f~Hl~-~~Y~a~YY-sYl~a~vlA~~i~~~~f~--------- 652 (703)
.+-+.. -.|+-++|++..++ |.|+.......-...=-|-. |.. ||+ +|..+.++|+++|+..-+
T Consensus 369 ~li~G~l~v~DlP~lWn~kme~-yLGirp~~d~eG~LQDIHWs~Gsf--GYFPsYtlG~v~AAQ~~~a~~~~~p~~d~~i 445 (497)
T COG2317 369 ELISGELKVDDLPELWNDKMEE-YLGIRPKNDAEGVLQDIHWSHGSF--GYFPTYTLGNVYAAQLYAAMKKDIPDVDALI 445 (497)
T ss_pred HHHcCCcchhhhHHHHHHHHHH-HcCCCCCCccccccccccccCCCc--CcCchhhhHHHHHHHHHHHHHhhcccHHHHH
Confidence 332221 13788899999999 89875432211111111221 223 355 999999999999944333
Q ss_pred -cCCCCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHHHHh
Q 005310 653 -DGLLNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFVDSR 698 (703)
Q Consensus 653 -~~~~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~~~ 698 (703)
+|.+ .....=++++|-+.|...+|.|++++.+|.+.|++.|++..
T Consensus 446 ~~gd~-~~i~~WL~e~ih~~Gs~~~p~eLi~~atGE~lnp~y~i~yL 491 (497)
T COG2317 446 AKGDF-SPIKNWLRENIHRHGSRYPPKELLKRATGEALNPEYFIDYL 491 (497)
T ss_pred HcCCc-HHHHHHHHHHHHHccCcCCcHHHHHHhhCCcCCHHHHHHHH
Confidence 2223 34666678899999999999999999999999999999874
No 21
>KOG3690 consensus Angiotensin I-converting enzymes - M2 family peptidases [Amino acid transport and metabolism]
Probab=99.04 E-value=1.3e-06 Score=96.54 Aligned_cols=485 Identities=15% Similarity=0.138 Sum_probs=276.1
Q ss_pred cCCcHHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHH
Q 005310 118 VCTSDDVCKSSAEAERKIDAHMLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRD 197 (703)
Q Consensus 118 v~~d~e~r~aa~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~ 197 (703)
+.-+++...+-.++..++.+|.-.+.++.+.|+... --|+-.+|.|+++-..- .+ .|+++.-+++..|..
T Consensus 61 ~~~~~~~~~a~~ea~~~~~~~~r~~a~q~k~~~~~~-------l~D~~~rRql~~~s~lg-~s--aL~~~d~~~~~~ll~ 130 (646)
T KOG3690|consen 61 TNGTEENLLALLEAEDKLVEFVRSLATQAKKFDVAE-------LQDERLRRQLQKVSQLG-MS--ALSPDDLKRYNQLLS 130 (646)
T ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHHhhccHhh-------ccCHHHHHHHHHHhhhc-cc--cCCHHHHHHHHHHHH
Confidence 333455777888999999999988888666665211 12445555655543322 22 466666666666655
Q ss_pred HHHHHHHHHHhhccccccceEecccccCCCCHHHHHHHhhccCCceEEEccCccHHHHHhcCCCHHHHHHHHHHHhcccC
Q 005310 198 QIDELSLQYVRNLNDDKTFLLFSEADLLGLPPEFLKSLDKAENGIFKVTLKSHHVAAVLELCKVGRTRRLVAVAYGKRCG 277 (703)
Q Consensus 198 ~i~~l~~~f~~n~~~~~~~~~~~~~eL~glp~~~l~~~~~~~~g~~~i~l~~~~~~~~l~~~~d~~~Rk~~~~a~~~r~~ 277 (703)
.++. .|+ +++ +. ..+ -|. + ..+. -.|.+..+|.+++|-+-+.-.|.+|.++..
T Consensus 131 ~m~~---~y~----~~~--vC-~y~----~p~----------~--~~L~-l~P~l~~i~~~Srd~~eL~~~W~~wr~~~g 183 (646)
T KOG3690|consen 131 SMST---NYN----TAT--VC-AYD----QPS----------A--CCLL-LEPQLQSIMANSRDYDELQWYWVEWRRKIG 183 (646)
T ss_pred HHHh---hcC----Cce--ee-CCC----CCC----------C--ccee-cCccHHHHHhccCCHHHHHHHHHHHHHHhh
Confidence 5443 232 222 10 000 010 1 1333 368889999999999999999999998775
Q ss_pred CchHHHHHHHHHHHHHHHHHcCCCCHHHhHhhhhccCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH-hcC---CCC
Q 005310 278 GINLSVLESLVELRHKFARLLGYSNYADYALDLRMAKSSSKVFEFLEEISASLTDLASRELMMLKDLKRK-EEG---ELP 353 (703)
Q Consensus 278 ~~n~~~l~~ll~lR~~~A~llGy~sya~~~l~~~ma~tpe~V~~fL~~l~~~~~p~~~~el~~L~~~k~~-~~g---~~~ 353 (703)
..-.+.+.+.+.+-++.|++=||.++.+|=... -.++.+...-|+.+...++|..+. +-..-+.+-. ..| ...
T Consensus 184 ~~~r~~y~~yv~L~~e~A~lNg~~~~gdyW~~~--yE~~~~~~~~ld~i~~ei~PlY~~-LHAYvRr~L~~~Ygp~~i~~ 260 (646)
T KOG3690|consen 184 KAMRQSYEEYVDLSNEAAQLNGYVNGGDYWRLA--YETPGDFEQDLDAIFEEIRPLYRQ-LHAYVRRKLRGAYGPDGISR 260 (646)
T ss_pred hhhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHH--hcCchhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCcccCCC
Confidence 555678999999999999999999999987653 234447788889999999996544 2222222111 111 111
Q ss_pred CCccchhh--------hHHHHH-----hhhcCCChhh--hccCCChHHHHHHHHHHHhhhhc-ceEEEecccccccccce
Q 005310 354 FGIEDLLY--------YVRRVE-----ELQFDLDFGA--VKQYFPVNLVLSGIFKVFQDLFG-LRFEEVLDAVVWHSDVR 417 (703)
Q Consensus 354 l~pWD~~y--------~~~~~~-----~~~~~~d~~~--l~~yf~~~~vl~gl~~l~~~lfg-l~~~~~~~~e~W~~dV~ 417 (703)
..|.-..- |.+.+. .++-.+|... .++=. ..+.||....+.|- +.|-.+++ +-|...++
T Consensus 261 ~gpiPahLlgnm~gq~Ws~~y~~~~P~pe~~~~dvt~~m~~qg~----~~qkmF~~a~efF~SlGl~~lp~-sfw~~s~~ 335 (646)
T KOG3690|consen 261 DGPIPAHLLGNMWGQDWSNHYDLTTPFPERPLIDVTMEMVKQGY----TVQKMFKLAAEFFTSLGLPALPP-SFWTNSIL 335 (646)
T ss_pred CCCcCHHHHhhhhccchHhhhhcccCCCCCCCCchHHHHHHccc----cHHHHHHHHHHHHHHcCCCcCCh-HHHhHHHh
Confidence 11111100 111110 0001111111 11100 11555665555542 22222221 23433332
Q ss_pred EEEEEecCCCceeeeeeecccccCCCcCCccccccCCCcccCCCCcccc-EEEEeccCCCCCCCCCCccChhhHHHHHHH
Q 005310 418 VFSVFDLSSNDLLGYFYLDIYKREGKYVHTCVVALQNGALSSTGARQIP-VALLISQIQKDIVGHPSLLRFSEVVNFFHE 496 (703)
Q Consensus 418 ~~~v~d~~~g~~lG~~ylDl~~R~gK~~ga~~~~l~~~~~~~~g~~q~P-~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHE 496 (703)
.|++-..++.|..-- . + .++.| +-+-.| +.++..+..+..||
T Consensus 336 ---------------------~~p~d~~~v~C~~sA--w---d-f~~~~Dfrik~C----------t~vd~~~f~~~Hhe 378 (646)
T KOG3690|consen 336 ---------------------TRPDDRDMVVCHASA--W---D-FYAGPDFRIKYC----------TKVDEEDFEQAHHE 378 (646)
T ss_pred ---------------------cCCCCCCceeechhh--h---c-cccCCCceeeec----------CcCCHHHHHHHHHH
Confidence 233333333333210 0 0 01111 112223 35688999999999
Q ss_pred HHHHHHHhhhcCCccccCCCccchhhhhccHHHHHHhHhcHHHHHHhhcccCCCCccchHHHHHHHHHHHhHHHHHHHHH
Q 005310 497 FGHVVQQICNRASFARFSGLRVDPDFVEIPAQLLENWCYESFSLKLISGFHQDITTPIKDEICKSLKRRRYAFSALKLKQ 576 (703)
Q Consensus 497 fGHalH~lls~~~~~~~sg~~~~~DfvE~pS~l~E~~~~~~~~L~~l~~h~~t~ge~ip~~l~~~l~~~~~~~~~~~~~~ 576 (703)
+||.=..+.-+. .+.+--...-.-|-|.-.-.+-.-+..|..|.++.-- +..+..--+..+++|.+ .|+..+-
T Consensus 379 m~hiQY~l~y~~-qP~llre~anPgFheAIgd~~als~stPrhL~slgLL-~~~~~~d~~~~IN~L~k-----~aL~~v~ 451 (646)
T KOG3690|consen 379 MGHIQYYLQYKQ-QPFLLREGANPGFHEAIGDAFALSVSTPRHLHSLGLL-SEDLDEDDEVRINRLFK-----MALDKVA 451 (646)
T ss_pred HHHHHHHHHHHh-ccHHhhCCCCCcHHHHHHHHHHHhcCCHHHHHHhccc-ccccccchHHHHHHHHH-----HHHHHHh
Confidence 999877776543 2222112234567788888888888899888876422 11011111344555432 2333322
Q ss_pred H-HHHHHHHHHHhcCC----cc-cHHHHHHHHhcccccCCCCC-CC----CCCccccccccCcccCCcccHHHHHHHHHH
Q 005310 577 E-ILYCIFDQVIHSAD----NV-DIVELFKHLHPKVMLGLPML-EG----TNPASCFPRSAIGFESACYSRIWSEVFAAD 645 (703)
Q Consensus 577 q-l~~a~~D~~~H~~~----~~-~~~~l~~~l~~~~~~~~~~~-~~----~~~~~~f~Hl~~~Y~a~YYsYl~a~vlA~~ 645 (703)
- .+.-.+|..-+..- +. +.+.-|++|..+ |.|+..| +. -..++.| |+-..+ .|-.|..|.|+--+
T Consensus 452 ~LPf~y~~DkwR~~Vf~G~i~k~~~N~~~W~lr~k-y~Gv~~P~~r~~~dfD~~akf-~~~~d~--~~~ryf~s~vlqFQ 527 (646)
T KOG3690|consen 452 FLPFTYALDKWRYEVFDGEIPKDQYNCAYWELRNK-YEGVRPPVPRTNKDFDPPAKF-HIQVDV--SYTRYFLSTVLQFQ 527 (646)
T ss_pred hhhhhhhhhhhHHHHhcCCCCcchhhHHHHHHHHH-hcCcCCCCcCccccCCchhhh-ccCcch--HHHHHHHHHHHHHH
Confidence 2 33445666655432 22 589999999999 8885322 11 1223333 442223 46788889988888
Q ss_pred HHHHHHhc-----CC-------------CCHHHHHHHHHHhccCCCCccHHHHHHHhhCC-CCChHHHHHH
Q 005310 646 IFACKFRD-----GL-------------LNQRVGMQFRNNVLAPGGSKEPIEILSDFLGR-EPSIQAFVDS 697 (703)
Q Consensus 646 i~~~~f~~-----~~-------------~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr-~p~~~a~l~~ 697 (703)
+. ..|.. +| .++++|+++| ++++.|.|++..|.++.++|. +.|..++++-
T Consensus 528 f~-kalC~~agq~~pg~pr~pLh~CDi~~sk~aG~kL~-~~M~lG~S~~W~evLE~~tGe~els~~~LLeY 596 (646)
T KOG3690|consen 528 FL-KALCQAAGQYGPGDPRKPLHNCDIYDSKEAGEKLR-TMMSLGSSRHWREVLEEITGEGELSGRALLEY 596 (646)
T ss_pred HH-HHHHHHhcccCCCCCCCCchhccccCCHHHHHHHH-HHHHccCCccHHHHHHHHcCCCcCChHHHHHH
Confidence 87 55531 22 2479999997 799999999999999999996 5888888764
No 22
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in Gram-negative bacteria that does not require N-terminal signal sequences, which are cleaved after the transmembrane translocation. A calcium-binding domain C-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll, may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may each have a broad spectrum of substrates, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=90.14 E-value=1.2 Score=43.79 Aligned_cols=30 Identities=23% Similarity=0.410 Sum_probs=20.6
Q ss_pred HHHHHHHHHhhhhcceEEEecccccccccceE
Q 005310 387 VLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRV 418 (703)
Q Consensus 387 vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~ 418 (703)
.+...+...+++-+|+|+++.+. ...++++
T Consensus 38 ~i~~A~~~w~~~~~l~F~e~~~~--~~adI~i 67 (186)
T cd04277 38 AARDALEAWEDVADIDFVEVSDN--SGADIRF 67 (186)
T ss_pred HHHHHHHHHHhhcCceeEECCCC--CcceEEE
Confidence 44555566677889999998654 4566665
No 23
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=88.21 E-value=0.45 Score=47.89 Aligned_cols=33 Identities=18% Similarity=0.215 Sum_probs=24.8
Q ss_pred cccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhh
Q 005310 464 QIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICN 506 (703)
Q Consensus 464 q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls 506 (703)
+.|++++..|-+ ...+..||+||+||++-|--.
T Consensus 57 ~~~~I~iN~n~~----------~~r~rFtlAHELGH~llH~~~ 89 (213)
T COG2856 57 EKPVIYINANNS----------LERKRFTLAHELGHALLHTDL 89 (213)
T ss_pred cCceEEEeCCCC----------HHHHHHHHHHHHhHHHhcccc
Confidence 348888876653 457899999999999865443
No 24
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=83.09 E-value=5.6 Score=37.92 Aligned_cols=16 Identities=25% Similarity=0.250 Sum_probs=13.2
Q ss_pred hhhHHHHHHHHHHHHH
Q 005310 487 FSEVVNFFHEFGHVVQ 502 (703)
Q Consensus 487 ~~~V~TLfHEfGHalH 502 (703)
.+-..|+.||+||||=
T Consensus 105 ~~~~~~~~HEiGHaLG 120 (157)
T cd04278 105 TDLFSVAAHEIGHALG 120 (157)
T ss_pred chHHHHHHHHhccccc
Confidence 4567899999999974
No 25
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=82.55 E-value=1.5 Score=39.08 Aligned_cols=33 Identities=18% Similarity=0.376 Sum_probs=23.6
Q ss_pred ccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHHHHhhhc
Q 005310 465 IPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQICNR 507 (703)
Q Consensus 465 ~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~lls~ 507 (703)
.|++++-.|-+ ......|++||+||.+.+--..
T Consensus 28 ~~~I~in~~~~----------~~~~~f~laHELgH~~~~~~~~ 60 (122)
T PF06114_consen 28 NPIIFINSNLS----------PERQRFTLAHELGHILLHHGDE 60 (122)
T ss_dssp TTEEEEESSS-----------HHHHHHHHHHHHHHHHHHH-HH
T ss_pred CCEEEECCCCC----------HHHHHHHHHHHHHHHHhhhccc
Confidence 46677655522 5677899999999999887654
No 26
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=82.23 E-value=0.89 Score=46.02 Aligned_cols=31 Identities=19% Similarity=0.414 Sum_probs=22.0
Q ss_pred hhHHHHHHHHHHHHHHhhhcCCccccCCCccchhhhh
Q 005310 488 SEVVNFFHEFGHVVQQICNRASFARFSGLRVDPDFVE 524 (703)
Q Consensus 488 ~~V~TLfHEfGHalH~lls~~~~~~~sg~~~~~DfvE 524 (703)
.-+.||+||+|||+.+.+.= + +.|. +-|.|-
T Consensus 91 ~~~~~l~HE~GHAlI~~~~l---P-v~Gr--EEDaaD 121 (220)
T PF14247_consen 91 NVLFTLYHELGHALIDDLDL---P-VLGR--EEDAAD 121 (220)
T ss_pred HHHHHHHHHHHHHHHHHhcC---C-cccc--hhhHHH
Confidence 35689999999999998843 2 4453 456663
No 27
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix and are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive. 
The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=78.82 E-value=0.98 Score=42.69 Aligned_cols=23 Identities=30% Similarity=0.482 Sum_probs=16.1
Q ss_pred HHHHHHHHHHhhhhcceEEEecc
Q 005310 386 LVLSGIFKVFQDLFGLRFEEVLD 408 (703)
Q Consensus 386 ~vl~gl~~l~~~lfgl~~~~~~~ 408 (703)
.++...+..-+...+++|+++.+
T Consensus 25 ~~i~~A~~~W~~~~~~~F~~~~~ 47 (154)
T PF00413_consen 25 DAIRQAFQAWNDVAPLNFTEVSD 47 (154)
T ss_dssp HHHHHHHHHHHTTSSEEEEEESS
T ss_pred HHHHHHHHHHHhcCCceEEeccC
Confidence 34555666666778889988864
No 28
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=75.56 E-value=2 Score=37.92 Aligned_cols=25 Identities=20% Similarity=0.397 Sum_probs=18.3
Q ss_pred EeccCCCCCCCCCCccChhhHHHHHHHHHHHHHH
Q 005310 470 LISQIQKDIVGHPSLLRFSEVVNFFHEFGHVVQQ 503 (703)
Q Consensus 470 l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHalH~ 503 (703)
.+|-|.. ...=+.||+|||-|+|++
T Consensus 66 Fv~~we~---------y~qvlktllhefrh~mQh 90 (126)
T PF13058_consen 66 FVCMWEE---------YEQVLKTLLHEFRHAMQH 90 (126)
T ss_pred EEEehHH---------HHHHHHHHHHHHHHHHHh
Confidence 4577764 233467999999999994
No 29
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=74.03 E-value=2.7 Score=42.03 Aligned_cols=24 Identities=29% Similarity=0.607 Sum_probs=19.9
Q ss_pred hhhHHHHHHHHHHHHHHhhhcCCc
Q 005310 487 FSEVVNFFHEFGHVVQQICNRASF 510 (703)
Q Consensus 487 ~~~V~TLfHEfGHalH~lls~~~~ 510 (703)
+.-+.|++||+||++=.+++..+.
T Consensus 20 ~~~l~t~~HE~gHal~a~l~G~~v 43 (200)
T PF13398_consen 20 FRLLVTFVHELGHALAALLTGGRV 43 (200)
T ss_pred HHHHHHHHHHHHHHHHHHHhCCCc
Confidence 456789999999999999986543
No 30
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=70.78 E-value=2.4 Score=40.55 Aligned_cols=32 Identities=16% Similarity=0.299 Sum_probs=19.8
Q ss_pred HHHHHHHHHHhhhhcceEEEecccccccccceEE
Q 005310 386 LVLSGIFKVFQDLFGLRFEEVLDAVVWHSDVRVF 419 (703)
Q Consensus 386 ~vl~gl~~l~~~lfgl~~~~~~~~e~W~~dV~~~ 419 (703)
..+..-+...++...|+|+++.+. -..++++.
T Consensus 18 ~~i~~A~~~W~~~~~i~F~e~~~~--~~~di~i~ 49 (165)
T cd04268 18 AAILDAIEAWNKAFAIGFKNANDV--DPADIRYS 49 (165)
T ss_pred HHHHHHHHHHHHHhCcCceecccc--CccCceEE
Confidence 344444566667779999987653 33556643
No 31
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=70.15 E-value=2.4 Score=40.37 Aligned_cols=15 Identities=27% Similarity=0.417 Sum_probs=12.2
Q ss_pred hhHHHHHHHHHHHHH
Q 005310 488 SEVVNFFHEFGHVVQ 502 (703)
Q Consensus 488 ~~V~TLfHEfGHalH 502 (703)
+-..|+.||+||+|=
T Consensus 103 ~~~~~~~HEiGHaLG 117 (156)
T cd04279 103 NLQAIALHELGHALG 117 (156)
T ss_pred HHHHHHHHHhhhhhc
Confidence 345789999999984
No 32
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=64.14 E-value=3.4 Score=38.51 Aligned_cols=11 Identities=36% Similarity=0.486 Sum_probs=10.0
Q ss_pred HHHHHHHHHHH
Q 005310 491 VNFFHEFGHVV 501 (703)
Q Consensus 491 ~TLfHEfGHal 501 (703)
.|+.||+||||
T Consensus 88 ~~~~HEigHaL 98 (140)
T smart00235 88 GVAAHELGHAL 98 (140)
T ss_pred ccHHHHHHHHh
Confidence 38999999998
No 33
>PF14891 Peptidase_M91: Effector protein
Probab=63.65 E-value=4.9 Score=39.24 Aligned_cols=22 Identities=18% Similarity=0.253 Sum_probs=18.7
Q ss_pred hhhHHHHHHHHHHHHHHhhhcC
Q 005310 487 FSEVVNFFHEFGHVVQQICNRA 508 (703)
Q Consensus 487 ~~~V~TLfHEfGHalH~lls~~ 508 (703)
..-+..|+|||.||.|.+-...
T Consensus 101 ~~p~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 101 RPPFVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHHHHCCCC
Confidence 4568899999999999988764
No 34
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=63.00 E-value=4.7 Score=39.27 Aligned_cols=14 Identities=36% Similarity=0.669 Sum_probs=12.5
Q ss_pred hHHHHHHHHHHHHH
Q 005310 489 EVVNFFHEFGHVVQ 502 (703)
Q Consensus 489 ~V~TLfHEfGHalH 502 (703)
++.|++||+||.|=
T Consensus 111 ~~~~~aHElGH~lG 124 (173)
T PF13574_consen 111 GIDTFAHELGHQLG 124 (173)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred eeeeehhhhHhhcC
Confidence 89999999999873
No 35
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=62.67 E-value=4.1 Score=40.23 Aligned_cols=17 Identities=29% Similarity=0.690 Sum_probs=14.7
Q ss_pred ChhhHHHHHHHHHHHHH
Q 005310 486 RFSEVVNFFHEFGHVVQ 502 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH 502 (703)
+...+.|++||+||.|=
T Consensus 139 ~~~~~~~~AHEiGH~lG 155 (196)
T PF13688_consen 139 TYNGAITFAHEIGHNLG 155 (196)
T ss_dssp HHHHHHHHHHHHHHHTT
T ss_pred CCceehhhHHhHHHhcC
Confidence 56889999999999864
No 36
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=61.77 E-value=4.2 Score=40.54 Aligned_cols=16 Identities=38% Similarity=0.511 Sum_probs=12.8
Q ss_pred hHHHHHHHHHHHHHHh
Q 005310 489 EVVNFFHEFGHVVQQI 504 (703)
Q Consensus 489 ~V~TLfHEfGHalH~l 504 (703)
...|+.|||||||=..
T Consensus 92 ~~~~i~HElgHaLG~~ 107 (198)
T cd04327 92 FSRVVLHEFGHALGFI 107 (198)
T ss_pred HHHHHHHHHHHHhcCc
Confidence 4579999999998543
No 37
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=61.60 E-value=4.1 Score=36.91 Aligned_cols=13 Identities=46% Similarity=0.710 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHHH
Q 005310 489 EVVNFFHEFGHVV 501 (703)
Q Consensus 489 ~V~TLfHEfGHal 501 (703)
.+.|+.||+||.|
T Consensus 107 ~~~~~~HEiGH~l 119 (124)
T PF13582_consen 107 GVDTFAHEIGHNL 119 (124)
T ss_dssp STTHHHHHHHHHT
T ss_pred cceEeeehhhHhc
Confidence 3489999999986
No 38
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases; however, there is no evidence to support their function as metallopeptidases.
Probab=60.16 E-value=6.6 Score=41.61 Aligned_cols=22 Identities=32% Similarity=0.567 Sum_probs=18.3
Q ss_pred ChhhHHHHHHHHHHHHHHhhhc
Q 005310 486 RFSEVVNFFHEFGHVVQQICNR 507 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH~lls~ 507 (703)
.+....+|+||+||.++.++.-
T Consensus 167 ~~a~ayVlAHEyGHHVQ~l~Gi 188 (292)
T PF04228_consen 167 DFAQAYVLAHEYGHHVQNLLGI 188 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHhhh
Confidence 4556778999999999999864
No 39
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases; however, there is no evidence to support their function as metallopeptidases.
Probab=57.47 E-value=9.4 Score=38.59 Aligned_cols=20 Identities=30% Similarity=0.404 Sum_probs=17.4
Q ss_pred ChhhHHHHHHHHHHHHHHhh
Q 005310 486 RFSEVVNFFHEFGHVVQQIC 505 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH~ll 505 (703)
|...|-+=.||.|||+++--
T Consensus 86 SiaAvaVAAHEvGHAiQ~a~ 105 (222)
T PF04298_consen 86 SIAAVAVAAHEVGHAIQHAE 105 (222)
T ss_pred CHHHHHHHHHHHhHHHhccc
Confidence 67888999999999999763
No 40
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=56.79 E-value=5.5 Score=38.00 Aligned_cols=24 Identities=17% Similarity=0.281 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHhhhhcceEEEecc
Q 005310 385 NLVLSGIFKVFQDLFGLRFEEVLD 408 (703)
Q Consensus 385 ~~vl~gl~~l~~~lfgl~~~~~~~ 408 (703)
...+...+.+.++..+++|+++..
T Consensus 24 ~~~v~~a~~~w~~~~~i~f~~~~~ 47 (167)
T cd00203 24 QSLILIAMQIWRDYLNIRFVLVGV 47 (167)
T ss_pred HHHHHHHHHHHHhhhCceEEEecc
Confidence 344566667777778999988753
No 41
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=55.80 E-value=5.4 Score=40.06 Aligned_cols=14 Identities=36% Similarity=0.608 Sum_probs=11.8
Q ss_pred hhHHHHHHHHHHHH
Q 005310 488 SEVVNFFHEFGHVV 501 (703)
Q Consensus 488 ~~V~TLfHEfGHal 501 (703)
....|+.||+||+|
T Consensus 136 ~~~~~~aHEiGH~l 149 (206)
T PF13583_consen 136 NGYQTFAHEIGHNL 149 (206)
T ss_pred ccchHHHHHHHHHh
Confidence 45688999999987
No 42
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=55.46 E-value=27 Score=34.33 Aligned_cols=22 Identities=32% Similarity=0.489 Sum_probs=15.4
Q ss_pred CceeeeeeecccccCCCcCCcccccc
Q 005310 427 NDLLGYFYLDIYKREGKYVHTCVVAL 452 (703)
Q Consensus 427 g~~lG~~ylDl~~R~gK~~ga~~~~l 452 (703)
...+|....|+|++ | -+|.+++
T Consensus 71 ~~vl~vt~~DLy~~-~---~nfvfG~ 92 (179)
T PRK13267 71 DKNIGITDCDLYYR-G---LNFVFGL 92 (179)
T ss_pred ceEEEEEccccCCC-C---CCceeEE
Confidence 46889999999983 4 2355554
No 43
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys, and at least one other residue, which may play an electrophilic role, is required for catalysis. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system.; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=53.90 E-value=7 Score=37.39 Aligned_cols=17 Identities=24% Similarity=0.382 Sum_probs=12.9
Q ss_pred ChhhHHHHHHHHHHHHH
Q 005310 486 RFSEVVNFFHEFGHVVQ 502 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH 502 (703)
....-.||.||+||-|-
T Consensus 66 ~~~~g~TltHEvGH~LG 82 (154)
T PF05572_consen 66 QYNFGKTLTHEVGHWLG 82 (154)
T ss_dssp TS-SSHHHHHHHHHHTT
T ss_pred ccccccchhhhhhhhhc
Confidence 34556999999999865
No 44
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=53.44 E-value=13 Score=39.26 Aligned_cols=21 Identities=33% Similarity=0.491 Sum_probs=17.1
Q ss_pred ccChhhHH-HHHHHHHHHHHHh
Q 005310 484 LLRFSEVV-NFFHEFGHVVQQI 504 (703)
Q Consensus 484 lL~~~~V~-TLfHEfGHalH~l 504 (703)
.|+-+|+. .|.||+||..|.-
T Consensus 151 ~l~~dEl~aVlaHElgHi~~rd 172 (302)
T COG0501 151 LLNDDELEAVLAHELGHIKNRH 172 (302)
T ss_pred hCCHHHHHHHHHHHHHHHhccc
Confidence 46778887 7899999998854
No 45
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys, and at least one other residue, which may play an electrophilic role, is required for catalysis. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine residues and one glutamate residue.
Probab=53.28 E-value=12 Score=40.74 Aligned_cols=44 Identities=18% Similarity=0.192 Sum_probs=28.8
Q ss_pred hHHHHHHHHHHHHHHhhhcCCccccCCCc--cchhhhhccHHHHHHhHhc
Q 005310 489 EVVNFFHEFGHVVQQICNRASFARFSGLR--VDPDFVEIPAQLLENWCYE 536 (703)
Q Consensus 489 ~V~TLfHEfGHalH~lls~~~~~~~sg~~--~~~DfvE~pS~l~E~~~~~ 536 (703)
-+.||+|||=|.+|..-.... .|.. ..+=.=|.-|+++|.++..
T Consensus 139 ~~sTlAHEfQHmInfy~~~v~----~g~~~~~dtWLnE~lS~~aEdl~s~ 184 (366)
T PF10460_consen 139 VYSTLAHEFQHMINFYQRGVL----HGKQYAMDTWLNEMLSMSAEDLYSS 184 (366)
T ss_pred HHHHHHHHHHHHHHHHHHHHh----cCCCcccccHHHHHHHHHHHHHHhc
Confidence 467999999999997543211 1211 2222339999999998655
No 46
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=52.35 E-value=2e+02 Score=33.59 Aligned_cols=74 Identities=18% Similarity=0.255 Sum_probs=55.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCH-HHHHHHHHHHHHHH
Q 005310 122 DDVCKSSAEAERKIDAHMLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTI-SKREEVQRLRDQID 200 (703)
Q Consensus 122 ~e~r~aa~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~-~~r~~~~~l~~~i~ 200 (703)
-+-++...+....+..+...+..=|+||.-++... |++..-|+.-+++|+..|.+|+. +.-.++..++.++.
T Consensus 194 ~~A~e~l~~l~~~~~~l~~~~~~iP~l~~~~~~~~-------P~ql~el~~gy~~m~~~gy~~~~~~i~~~i~~l~~~i~ 266 (569)
T PRK04778 194 VEAREILDQLEEELAALEQIMEEIPELLKELQTEL-------PDQLQELKAGYRELVEEGYHLDHLDIEKEIQDLKEQID 266 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------hHHHHHHHHHHHHHHHcCCCCCCCChHHHHHHHHHHHH
Confidence 34455555666666666666666678888777654 57777889999999999999986 57788888888887
Q ss_pred HH
Q 005310 201 EL 202 (703)
Q Consensus 201 ~l 202 (703)
..
T Consensus 267 ~~ 268 (569)
T PRK04778 267 EN 268 (569)
T ss_pred HH
Confidence 74
No 47
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain; proteins involved in (bone) morphogenesis; tolloid from Drosophila; and the sea urchin SPAN protein, which may also play a role in development.
Probab=49.88 E-value=8.1 Score=37.91 Aligned_cols=13 Identities=23% Similarity=0.388 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHH
Q 005310 490 VVNFFHEFGHVVQ 502 (703)
Q Consensus 490 V~TLfHEfGHalH 502 (703)
+.|+.|||||||=
T Consensus 75 ~g~v~HE~~HalG 87 (180)
T cd04280 75 LGTIVHELMHALG 87 (180)
T ss_pred CchhHHHHHHHhc
Confidence 5799999999974
No 48
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=48.28 E-value=20 Score=31.87 Aligned_cols=17 Identities=18% Similarity=-0.027 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHhhhc
Q 005310 491 VNFFHEFGHVVQQICNR 507 (703)
Q Consensus 491 ~TLfHEfGHalH~lls~ 507 (703)
.+|.||++|.+|.....
T Consensus 27 ~~l~HE~~H~~~~~~~~ 43 (128)
T PF13485_consen 27 RVLAHELAHQWFGNYFG 43 (128)
T ss_pred HHHHHHHHHHHHHHHcC
Confidence 89999999999988754
No 49
>PF06160 EzrA: Septation ring formation regulator, EzrA; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation.; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=48.07 E-value=2.4e+02 Score=32.90 Aligned_cols=75 Identities=15% Similarity=0.245 Sum_probs=57.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCH-HHHHHHHHHHHHHHH
Q 005310 123 DVCKSSAEAERKIDAHMLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTI-SKREEVQRLRDQIDE 201 (703)
Q Consensus 123 e~r~aa~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~-~~r~~~~~l~~~i~~ 201 (703)
+-++...+....+..+...+..=|++|..++... |.+..-|+.-++.|+..|.+|+. +.-+++..+..++..
T Consensus 191 ~A~eil~~l~~~~~~l~~~~e~IP~l~~~l~~~~-------P~ql~eL~~gy~~m~~~gy~l~~~~i~~~i~~i~~~l~~ 263 (560)
T PF06160_consen 191 EAREILEKLKEETDELEEIMEDIPKLYKELQKEF-------PDQLEELKEGYREMEEEGYYLEHLDIEEEIEQIEEQLEE 263 (560)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHh-------HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHH
Confidence 4455556666666666666666778888776654 57778888999999999999987 778889999888877
Q ss_pred HHH
Q 005310 202 LSL 204 (703)
Q Consensus 202 l~~ 204 (703)
...
T Consensus 264 ~~~ 266 (560)
T PF06160_consen 264 ALA 266 (560)
T ss_pred HHH
Confidence 654
No 50
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys, and at least one other residue, which may play an electrophilic role, is required for catalysis. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16 kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases; it is capable of hydrolysing milk proteins. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=46.72 E-value=13 Score=34.15 Aligned_cols=17 Identities=24% Similarity=0.469 Sum_probs=13.8
Q ss_pred ChhhHHHHHHHHHHHHH
Q 005310 486 RFSEVVNFFHEFGHVVQ 502 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH 502 (703)
.++.+....||+||.|=
T Consensus 74 gy~~~RIaaHE~GHiLG 90 (132)
T PF02031_consen 74 GYNSTRIAAHELGHILG 90 (132)
T ss_dssp HS-HHHHHHHHHHHHHT
T ss_pred CCccceeeeehhccccC
Confidence 57778899999999874
No 51
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=46.70 E-value=11 Score=38.14 Aligned_cols=16 Identities=25% Similarity=0.544 Sum_probs=14.0
Q ss_pred ChhhHHHHHHHHHHHH
Q 005310 486 RFSEVVNFFHEFGHVV 501 (703)
Q Consensus 486 ~~~~V~TLfHEfGHal 501 (703)
++..+.|++||+||.|
T Consensus 142 ~~~~~~~~AHElGH~l 157 (220)
T cd04272 142 SYYGVYTMTHELAHLL 157 (220)
T ss_pred CcccHHHHHHHHHHHh
Confidence 4567999999999998
No 52
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=45.42 E-value=11 Score=37.06 Aligned_cols=15 Identities=20% Similarity=0.034 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHHHh
Q 005310 490 VVNFFHEFGHVVQQI 504 (703)
Q Consensus 490 V~TLfHEfGHalH~l 504 (703)
+-|+.|||||||=..
T Consensus 78 ~G~i~HEl~HaLG~~ 92 (182)
T cd04283 78 KGIIQHELLHALGFY 92 (182)
T ss_pred cchHHHHHHHHhCCc
Confidence 468999999998533
No 53
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=45.40 E-value=10 Score=38.15 Aligned_cols=12 Identities=33% Similarity=0.553 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHH
Q 005310 491 VNFFHEFGHVVQ 502 (703)
Q Consensus 491 ~TLfHEfGHalH 502 (703)
.++.||+||++=
T Consensus 135 hvi~HEiGH~IG 146 (211)
T PF12388_consen 135 HVITHEIGHCIG 146 (211)
T ss_pred HHHHHHhhhhcc
Confidence 389999999996
No 54
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys, and at least one other residue, which may play an electrophilic role, is required for catalysis. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. This group of metallopeptidases belongs to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA, and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH. The astacin family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains.
The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93). The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=45.33 E-value=13 Score=36.92 Aligned_cols=14 Identities=36% Similarity=0.688 Sum_probs=11.9
Q ss_pred hHHHHHHHHHHHHH
Q 005310 489 EVVNFFHEFGHVVQ 502 (703)
Q Consensus 489 ~V~TLfHEfGHalH 502 (703)
.+.|+.|||||||=
T Consensus 79 ~~~~i~HEl~HaLG 92 (191)
T PF01400_consen 79 SVGTILHELGHALG 92 (191)
T ss_dssp SHHHHHHHHHHHHT
T ss_pred CccchHHHHHHHHh
Confidence 46799999999984
No 55
>cd07633 BAR_OPHN1 The Bin/Amphiphysin/Rvs (BAR) domain of Oligophrenin-1. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Oligophrenin-1 (OPHN1) is a GTPase activating protein (GAP) with activity towards RhoA, Rac, and Cdc42, that is expressed in developing spinal cord and in adult brain areas with high plasticity. It plays a role in regulating the actin cytoskeleton as well as morphology changes in axons and dendrites, and may also function in modulating neuronal connectivity. Mutations in the OPHN1 gene cause X-linked mental retardation associated with cerebellar hypoplasia, lateral ventricle enlargement and epilepsy. OPHN1 contains an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, and a Rho GAP domain. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=43.56 E-value=1.5e+02 Score=29.74 Aligned_cols=115 Identities=10% Similarity=0.171 Sum_probs=69.7
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHHHHHHHHHHHH
Q 005310 59 SASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEAERKIDAH 138 (703)
Q Consensus 59 s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~~~~l~~~ 138 (703)
..+.+...++.+|+.|+.++++..........|.+.+..+.. .++-...+|+|.. ..+|..+++..
T Consensus 10 ele~~~~~IkkliK~~~~li~a~K~~s~A~r~Fa~~L~df~f------------~~igd~~tdde~~--I~~sL~~F~~~ 75 (207)
T cd07633 10 ELERTNKFIKDVIKDGNALISAIKEYSSAVQKFSQTLQSFQF------------DFIGDTLTDDEIN--IAESFKEFAEL 75 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------------hcCCCcccchHHH--HHHHHHHHHHH
Confidence 457788899999999999998876543334456655543332 4566677787753 33555555555
Q ss_pred HHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHH
Q 005310 139 MLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQID 200 (703)
Q Consensus 139 ~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~ 200 (703)
..++..-.. .|=+...+.+..-+..|+..+|.--.+.|++|.+...+..
T Consensus 76 L~~ie~~r~-------------~l~d~aq~s~~~~L~~F~Kedi~~~Ke~KK~FdK~se~~~ 124 (207)
T cd07633 76 LQEVEEERM-------------MMVQNASDLLIKPLENFRKEQIGFTKERKKKFEKDSEKFY 124 (207)
T ss_pred HHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHH
Confidence 544433111 1112344556667778888766655566666666665443
No 56
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=42.68 E-value=13 Score=38.34 Aligned_cols=12 Identities=33% Similarity=0.398 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHH
Q 005310 490 VVNFFHEFGHVV 501 (703)
Q Consensus 490 V~TLfHEfGHal 501 (703)
..|++||+||.+
T Consensus 168 a~t~AHElGHnl 179 (244)
T cd04270 168 DLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHhc
Confidence 479999999987
No 57
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=42.59 E-value=7.7 Score=39.67 Aligned_cols=14 Identities=43% Similarity=0.624 Sum_probs=12.2
Q ss_pred hhHHHHHHHHHHHH
Q 005310 488 SEVVNFFHEFGHVV 501 (703)
Q Consensus 488 ~~V~TLfHEfGHal 501 (703)
..+.|++||+||.|
T Consensus 144 ~~~~t~AHElGHnL 157 (228)
T cd04271 144 NEWQVFAHEIGHTF 157 (228)
T ss_pred ccceehhhhhhhhc
Confidence 45689999999998
No 58
>cd07602 BAR_RhoGAP_OPHN1-like The Bin/Amphiphysin/Rvs (BAR) domain of Oligophrenin1-like Rho GTPase Activating Proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. This subfamily is composed of Rho and Rac GTPase activating proteins (GAPs) with similarity to oligophrenin1 (OPHN1). Members contain an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, and a Rho GAP domain. Some members contain a C-terminal SH3 domain. Vertebrates harbor at least three Rho GAPs in this subfamily including OPHN1, GTPase Regulator Associated with Focal adhesion kinase (GRAF), GRAF2, and an uncharacterized protein called GAP10-like. OPHN1, GRAF and GRAF2 show GAP activity towards RhoA and Cdc42. In addition, OPHN1 is active towards Rac. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The BAR domains of OPHN1
Probab=40.87 E-value=2.2e+02 Score=28.72 Aligned_cols=122 Identities=9% Similarity=0.137 Sum_probs=71.1
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHHHHHHHHHHHH
Q 005310 59 SASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEAERKIDAH 138 (703)
Q Consensus 59 s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~~~~l~~~ 138 (703)
..+.+...++.+|+.|+..+++..........|.+.+..+- ..++-...+|++.. ..++..+++..
T Consensus 10 ele~l~~~ikkLiK~ck~~i~a~k~~~~a~~~F~~~L~~f~------------~~~~g~~~tDDe~~--i~~~L~kF~~~ 75 (207)
T cd07602 10 ELERTNKAIKELIKECKNLISATKNLSKAQRSFAQTLQNFK------------FECIGETQTDDEIE--IAESLKEFGRL 75 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------hhccCCcCcccHHH--HHHHHHHHHHH
Confidence 45778889999999999999988654233345555544333 13455555666554 23566655555
Q ss_pred HHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q 005310 139 MLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYV 207 (703)
Q Consensus 139 ~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~ 207 (703)
..++..-.. .|-....+.+-.-+..|+...++--.+.|++|.....+......+|.
T Consensus 76 l~ei~~~r~-------------~L~~q~~~~l~~pL~~F~k~dl~~~ke~kk~FdK~se~~~~al~k~~ 131 (207)
T cd07602 76 IETVEDERD-------------RMLENAEEQLIEPLEKFRKEQIGGAKEEKKKFDKETEKFCSSLEKHL 131 (207)
T ss_pred HHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 555543111 12123344555666777776554334567777777666665555444
No 59
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=39.74 E-value=12 Score=36.85 Aligned_cols=15 Identities=27% Similarity=0.563 Sum_probs=12.9
Q ss_pred hhhHHHHHHHHHHHH
Q 005310 487 FSEVVNFFHEFGHVV 501 (703)
Q Consensus 487 ~~~V~TLfHEfGHal 501 (703)
+....|++||+||.|
T Consensus 131 ~~~~~~~aHElGH~l 145 (192)
T cd04267 131 LLTALTMAHELGHNL 145 (192)
T ss_pred eeehhhhhhhHHhhc
Confidence 456789999999998
No 60
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=39.17 E-value=16 Score=36.62 Aligned_cols=14 Identities=36% Similarity=0.508 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHH
Q 005310 490 VVNFFHEFGHVVQQ 503 (703)
Q Consensus 490 V~TLfHEfGHalH~ 503 (703)
+-|+.||+||||=.
T Consensus 88 ~Gti~HEl~HaLGf 101 (200)
T cd04281 88 FGIVVHELGHVIGF 101 (200)
T ss_pred CchHHHHHHHHhcC
Confidence 45899999999843
No 61
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=37.98 E-value=16 Score=38.20 Aligned_cols=19 Identities=26% Similarity=0.406 Sum_probs=12.6
Q ss_pred hHHHHHHHHHHHHHHhhhc
Q 005310 489 EVVNFFHEFGHVVQQICNR 507 (703)
Q Consensus 489 ~V~TLfHEfGHalH~lls~ 507 (703)
-..++.|||||++..+--.
T Consensus 216 ~~~v~vHE~GHsf~~LaDE 234 (264)
T PF09471_consen 216 FKQVVVHEFGHSFGGLADE 234 (264)
T ss_dssp HHHHHHHHHHHHTT-----
T ss_pred ccceeeeeccccccccccc
Confidence 4688999999999987643
No 62
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=37.67 E-value=18 Score=35.74 Aligned_cols=15 Identities=33% Similarity=0.514 Sum_probs=12.9
Q ss_pred hhhHHHHHHHHHHHH
Q 005310 487 FSEVVNFFHEFGHVV 501 (703)
Q Consensus 487 ~~~V~TLfHEfGHal 501 (703)
..-+.+++||+||.|
T Consensus 129 ~~~a~~~AHElGH~l 143 (194)
T cd04269 129 LLFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHHhhc
Confidence 355899999999998
No 63
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=37.57 E-value=20 Score=35.23 Aligned_cols=20 Identities=25% Similarity=0.436 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHhhhcCC
Q 005310 490 VVNFFHEFGHVVQQICNRAS 509 (703)
Q Consensus 490 V~TLfHEfGHalH~lls~~~ 509 (703)
+..+.||+||.+-......+
T Consensus 10 ~~v~iHElGH~~~Ar~~Gv~ 29 (182)
T cd06163 10 ILIFVHELGHFLVAKLFGVK 29 (182)
T ss_pred HHHHHHHHHHHHHHHHcCCe
Confidence 56789999999988777544
No 64
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=37.04 E-value=21 Score=34.74 Aligned_cols=24 Identities=29% Similarity=0.390 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHhhhcCCcccc
Q 005310 490 VVNFFHEFGHVVQQICNRASFARF 513 (703)
Q Consensus 490 V~TLfHEfGHalH~lls~~~~~~~ 513 (703)
+..+.||+||++=......+...+
T Consensus 9 i~i~iHE~gH~~~A~~~G~~~~~~ 32 (180)
T cd05709 9 ISVTVHELGHALVARRLGVKVARF 32 (180)
T ss_pred HHHHHHHHHHHHHHHHcCCCchhe
Confidence 457899999999777665443333
No 65
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=36.27 E-value=28 Score=34.25 Aligned_cols=19 Identities=26% Similarity=0.448 Sum_probs=15.9
Q ss_pred ChhhHHHHHHHHHHHHHHh
Q 005310 486 RFSEVVNFFHEFGHVVQQI 504 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH~l 504 (703)
|....-.-+||-|||+++-
T Consensus 89 Sia~~aVAAHEVGHAiQd~ 107 (226)
T COG2738 89 SIAAIAVAAHEVGHAIQDQ 107 (226)
T ss_pred cHHHHHHHHHHhhHHHhhh
Confidence 5567778899999999975
No 66
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=35.86 E-value=20 Score=35.76 Aligned_cols=12 Identities=33% Similarity=0.520 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHH
Q 005310 491 VNFFHEFGHVVQ 502 (703)
Q Consensus 491 ~TLfHEfGHalH 502 (703)
.++.||+||+|=
T Consensus 118 ~~~~he~gh~lG 129 (197)
T cd04276 118 YLLAHEVGHTLG 129 (197)
T ss_pred HHHHHHHHHHhc
Confidence 589999999974
No 67
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=35.77 E-value=20 Score=30.11 Aligned_cols=19 Identities=37% Similarity=0.319 Sum_probs=15.2
Q ss_pred ChhhHHHHHHHHHHHHHHh
Q 005310 486 RFSEVVNFFHEFGHVVQQI 504 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH~l 504 (703)
+..+...|.||+.|+++.-
T Consensus 58 s~~~~~llaHEl~Hv~Qq~ 76 (79)
T PF13699_consen 58 SPEGRALLAHELAHVVQQR 76 (79)
T ss_pred CCCcchhHhHHHHHHHhhc
Confidence 3457788999999999853
No 68
>cd07636 BAR_GRAF The Bin/Amphiphysin/Rvs (BAR) domain of GTPase Regulator Associated with Focal adhesion kinase. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. GTPase Regulator Associated with Focal adhesion kinase (GRAF), also called Rho GTPase activating protein 26 (ARHGAP26), is a GAP with activity towards RhoA and Cdc42 and is only weakly active towards Rac1. It influences Rho-mediated cytoskeletal rearrangements and binds focal adhesion kinase (FAK), which is a critical component of integrin signaling. GRAF contains an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, a Rho GAP domain, and a C-terminal SH3 domain. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The BAR domain of GRAF directly interacts with its Rho GAP domain and inhibits its activity. Autoinhibited GRAF is capable o
Probab=35.68 E-value=2.5e+02 Score=28.33 Aligned_cols=122 Identities=9% Similarity=0.165 Sum_probs=67.5
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHHHHHHHHHHHH
Q 005310 59 SASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEAERKIDAH 138 (703)
Q Consensus 59 s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~~~~l~~~ 138 (703)
..+...+.+..+|+.|+..+++..........|.+.+..+-. .++-...+|+++.-+ ++..+++..
T Consensus 10 ~le~~~k~ik~liK~~k~~i~A~k~~~~a~~~Fa~sL~~f~~------------~~~gd~~~dDe~~I~--~~L~kF~~~ 75 (207)
T cd07636 10 ELDKTNKFIKELIKDGKSLIAALKNLSSAKRKFADSLNEFKF------------QCIGDAETDDEICIA--RSLQEFAAV 75 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------------hcCCCcccccHHHHH--HHHHHHHHH
Confidence 456778888999999999888876542223455554443331 234444455554332 555555554
Q ss_pred HHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q 005310 139 MLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYV 207 (703)
Q Consensus 139 ~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~ 207 (703)
..++..-.. .|=....+.+..-+..|+...|.--.+.|++|-+..........++.
T Consensus 76 L~ei~~~r~-------------~L~~qa~~~l~~~L~~F~kedi~~~Ke~kK~FdK~se~~~~al~k~~ 131 (207)
T cd07636 76 LRNLEDERT-------------RMIENASEVLITPLEKFRKEQIGAAKEAKKKYDKETEKYCAVLEKHL 131 (207)
T ss_pred HHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHhhhhhHHHHHHHHHh
Confidence 444433111 11122344555667788877665555667777776666555444333
No 69
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=35.25 E-value=23 Score=35.58 Aligned_cols=18 Identities=22% Similarity=0.372 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHhhhc
Q 005310 490 VVNFFHEFGHVVQQICNR 507 (703)
Q Consensus 490 V~TLfHEfGHalH~lls~ 507 (703)
+..++||+||++=...-.
T Consensus 39 ~~v~iHElgH~~~A~~~G 56 (208)
T cd06161 39 LSVLLHELGHALVARRYG 56 (208)
T ss_pred HHHHHHHHHHHHHHHHcC
Confidence 457899999998776654
No 70
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=34.17 E-value=26 Score=34.40 Aligned_cols=24 Identities=25% Similarity=0.455 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHhhhcCCcccc
Q 005310 490 VVNFFHEFGHVVQQICNRASFARF 513 (703)
Q Consensus 490 V~TLfHEfGHalH~lls~~~~~~~ 513 (703)
+..++||+||++=......+...+
T Consensus 8 i~i~~HE~gH~~~a~~~G~~~~~~ 31 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDKVPRF 31 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--B--E
T ss_pred cccccccccccccccccccccccc
Confidence 567899999999988876555444
No 71
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belongs to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamilies are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. 
The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degrade casein and cleave solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=33.21 E-value=28 Score=34.78 Aligned_cols=23 Identities=26% Similarity=0.408 Sum_probs=17.5
Q ss_pred cChhhHH-HHHHHHHHHHHHhhhc
Q 005310 485 LRFSEVV-NFFHEFGHVVQQICNR 507 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~lls~ 507 (703)
++-+|+. .|.||+||..|.-...
T Consensus 84 ~~~~el~aVlaHElgH~~~~h~~~ 107 (226)
T PF01435_consen 84 LSEDELAAVLAHELGHIKHRHILK 107 (226)
T ss_dssp SSHHHHHHHHHHHHHHHHTTHCCC
T ss_pred ccHHHHHHHHHHHHHHHHcCCcch
Confidence 4667776 8999999998765443
No 72
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=32.53 E-value=9.6 Score=40.35 Aligned_cols=13 Identities=38% Similarity=0.485 Sum_probs=11.3
Q ss_pred hHHHHHHHHHHHH
Q 005310 489 EVVNFFHEFGHVV 501 (703)
Q Consensus 489 ~V~TLfHEfGHal 501 (703)
.+-|+.|||||+|
T Consensus 165 ~igv~~HE~gH~l 177 (286)
T TIGR03296 165 GVGVIAHELGHDL 177 (286)
T ss_pred ceeeeehhhhccc
Confidence 4789999999976
No 73
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=31.61 E-value=24 Score=36.12 Aligned_cols=15 Identities=27% Similarity=0.244 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHh
Q 005310 490 VVNFFHEFGHVVQQI 504 (703)
Q Consensus 490 V~TLfHEfGHalH~l 504 (703)
+-|+.||+||||=..
T Consensus 121 ~Gti~HEl~HalGf~ 135 (230)
T cd04282 121 KATVEHEFLHALGFY 135 (230)
T ss_pred CchHHHHHHHHhCCc
Confidence 468999999998543
No 74
>PRK02870 heat shock protein HtpX; Provisional
Probab=30.75 E-value=31 Score=37.39 Aligned_cols=20 Identities=25% Similarity=0.293 Sum_probs=15.8
Q ss_pred cChhhHH-HHHHHHHHHHHHh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQI 504 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~l 504 (703)
|+-+|+. .++||+||.-|.-
T Consensus 168 L~~dEL~aVlAHELgHik~~d 188 (336)
T PRK02870 168 LDRDELQAVMAHELSHIRHGD 188 (336)
T ss_pred CCHHHHHHHHHHHHHHHHccc
Confidence 4667766 7899999998854
No 75
>PRK03982 heat shock protein HtpX; Provisional
Probab=30.63 E-value=30 Score=36.58 Aligned_cols=21 Identities=24% Similarity=0.267 Sum_probs=16.0
Q ss_pred cChhhHH-HHHHHHHHHHHHhh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQIC 505 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~ll 505 (703)
|+-+|+. .|.||+||.-|.-.
T Consensus 120 l~~~El~AVlAHElgHi~~~h~ 141 (288)
T PRK03982 120 LNEDELEGVIAHELTHIKNRDT 141 (288)
T ss_pred CCHHHHHHHHHHHHHHHHcCCH
Confidence 3556765 78999999998654
No 76
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=30.18 E-value=54 Score=36.44 Aligned_cols=54 Identities=13% Similarity=0.151 Sum_probs=31.2
Q ss_pred cCCCcCCccccccCCCcccCCCCccccEEEEeccCCCCCCCCCCccChhhH-HHHHHHHHHHHHHhhhc
Q 005310 440 REGKYVHTCVVALQNGALSSTGARQIPVALLISQIQKDIVGHPSLLRFSEV-VNFFHEFGHVVQQICNR 507 (703)
Q Consensus 440 R~gK~~ga~~~~l~~~~~~~~g~~q~P~~~l~~Nf~~~~~~~p~lL~~~~V-~TLfHEfGHalH~lls~ 507 (703)
++++.+.||..++-+..+ +++.-++-.. + ..++-+|| -.++||+||--|....+
T Consensus 244 ~rs~hsNAyfyG~~~~KR----------IvIyDtLl~~-~---~~~~~eel~AVl~HELGHW~~~H~~K 298 (428)
T KOG2719|consen 244 KRSSHSNAYFYGLCKNKR----------IVIYDTLLLE-E---EHLNNEELVAVLAHELGHWKLNHVLK 298 (428)
T ss_pred CCCCCCCeeeeeccccce----------EEEehhhhhh-h---hccccHHHHHHHHHHhhHHHHhhHHH
Confidence 467778899888733221 3333333210 0 00233454 48999999999887665
No 77
>PRK03001 M48 family peptidase; Provisional
Probab=29.82 E-value=31 Score=36.38 Aligned_cols=21 Identities=29% Similarity=0.305 Sum_probs=16.2
Q ss_pred cChhhHH-HHHHHHHHHHHHhh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQIC 505 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~ll 505 (703)
|+-+|+. .|.||+||.-|.-.
T Consensus 119 l~~~El~aVlAHElgHi~~~h~ 140 (283)
T PRK03001 119 LSEREIRGVMAHELAHVKHRDI 140 (283)
T ss_pred CCHHHHHHHHHHHHHHHhCCCh
Confidence 4567766 78999999998643
No 78
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=28.43 E-value=35 Score=35.72 Aligned_cols=18 Identities=22% Similarity=0.217 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHhhhc
Q 005310 490 VVNFFHEFGHVVQQICNR 507 (703)
Q Consensus 490 V~TLfHEfGHalH~lls~ 507 (703)
+..+.||+||++=...-.
T Consensus 119 isv~iHElgHa~~Ar~~G 136 (263)
T cd06159 119 VGVVVHELSHGILARVEG 136 (263)
T ss_pred HHHHHHHHHHHHHHHHcC
Confidence 347899999998766543
No 79
>COG4086 Predicted secreted protein [Function unknown]
Probab=27.98 E-value=4.6e+02 Score=27.55 Aligned_cols=78 Identities=15% Similarity=0.114 Sum_probs=50.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhCCH----HHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHc-CCCCCHHHHHHHHHHH
Q 005310 122 DDVCKSSAEAERKIDAHMLSCSNRE----DVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERS-GLNLTISKREEVQRLR 196 (703)
Q Consensus 122 ~e~r~aa~e~~~~l~~~~~~l~~~~----~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~-G~~L~~~~r~~~~~l~ 196 (703)
++-...|++-....++..++.+.++ .|+..+|.-..+....++++. ++.+++-..+ |+.|++.+...+..+.
T Consensus 166 e~~KqvaneEL~~~sel~~k~G~d~~r~a~l~~~VK~~~a~~~~~~~~di---rkvv~dv~~~ynvnltd~qvn~i~~~~ 242 (299)
T COG4086 166 EANKQVANEELVATSELGDKIGDDPRRAAALMAEVKEEVAKQKVDDPADI---RKVVDDVANNYNVNLTDTQVNQIVNLF 242 (299)
T ss_pred HHHHHHHHHHHHHHHHhhhhcCCCHHHHHHHHHHHHHHHHhCCCCCHHHH---HHHHHHHHHHcCCCCCHHHHHHHHHHH
Confidence 3445667777778888888888776 355555543223333566665 4555566555 8889988777666666
Q ss_pred HHHHHH
Q 005310 197 DQIDEL 202 (703)
Q Consensus 197 ~~i~~l 202 (703)
..++.+
T Consensus 243 ~~~~~~ 248 (299)
T COG4086 243 LAMSNL 248 (299)
T ss_pred HHhhcc
Confidence 655553
No 80
>PRK01345 heat shock protein HtpX; Provisional
Probab=27.73 E-value=39 Score=36.36 Aligned_cols=21 Identities=33% Similarity=0.352 Sum_probs=16.2
Q ss_pred cChhhHH-HHHHHHHHHHHHhh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQIC 505 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~ll 505 (703)
|+-+|+. .|.||+||..|.-.
T Consensus 119 L~~dEL~aVlAHElgHi~~~d~ 140 (317)
T PRK01345 119 LSPEEVAGVMAHELAHVKNRDT 140 (317)
T ss_pred CCHHHHHHHHHHHHHHHHcCCH
Confidence 4557766 78999999998654
No 81
>PRK04897 heat shock protein HtpX; Provisional
Probab=27.17 E-value=38 Score=36.11 Aligned_cols=21 Identities=19% Similarity=0.151 Sum_probs=15.8
Q ss_pred cChhhHH-HHHHHHHHHHHHhh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQIC 505 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~ll 505 (703)
|+-+|+. .|+||+||.-|.=.
T Consensus 132 l~~~El~aVlAHElgHi~~~d~ 153 (298)
T PRK04897 132 MNREELEGVIGHEISHIRNYDI 153 (298)
T ss_pred CCHHHHHHHHHHHHHHHhcCCH
Confidence 4567766 78999999987543
No 82
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=26.67 E-value=39 Score=34.48 Aligned_cols=18 Identities=22% Similarity=0.320 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHhhhc
Q 005310 490 VVNFFHEFGHVVQQICNR 507 (703)
Q Consensus 490 V~TLfHEfGHalH~lls~ 507 (703)
+..++||+||++=...-.
T Consensus 54 ~~v~iHElgH~~~A~~~G 71 (227)
T cd06164 54 ASVLLHELGHSLVARRYG 71 (227)
T ss_pred HHHHHHHHHHHHHHHHcC
Confidence 457899999998765543
No 83
>cd00923 Cyt_c_Oxidase_Va Cytochrome c oxidase subunit Va. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Va is one of three mammalian subunits that lacks a transmembrane region. Subunit Va is located on the matrix side of the membrane and binds thyroid hormone T2, releasing allosteric inhibition caused by the binding of ATP to subunit IV and allowing high turnover at elevated intramitochondrial ATP/ADP ratios.
Probab=26.67 E-value=3.4e+02 Score=24.03 Aligned_cols=62 Identities=13% Similarity=0.207 Sum_probs=40.8
Q ss_pred cCCcHHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHhcCCCCCHH-HHHHHHHHHHHHHHcCCCCCHH
Q 005310 118 VCTSDDVCKSSAEAERKIDAHMLSCSNREDVYRVVKAVAATGEWVSPE-AKRYIQSLVRDFERSGLNLTIS 187 (703)
Q Consensus 118 v~~d~e~r~aa~e~~~~l~~~~~~l~~~~~ly~~l~~~~~~~~~L~~e-~~r~l~~~~~~f~~~G~~L~~~ 187 (703)
+-|+|++..|+-.|+.+++.|...+ .+++.+|.-. .. ..+ -..+++.+---++.-||.++++
T Consensus 38 lVP~P~ii~aaLrAcRRvND~alAV----R~lE~vK~K~---~~-~~~~y~~~lqeikp~l~ELGI~t~Ee 100 (103)
T cd00923 38 LVPEPKVIEAALRACRRVNDFALAV----RILEAIKDKC---GA-HKEIYPYILQEIKPTLKELGISTPEE 100 (103)
T ss_pred cCCCcHHHHHHHHHHHHhhhHHHHH----HHHHHHHHHc---cC-chhhHHHHHHHHhHHHHHHCCCCHHH
Confidence 4589999999999999999998777 5566555433 11 222 2334555544555568877553
No 84
>TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein, through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of cyanobacterial photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacterial protein PsbQ by homology.
Probab=26.53 E-value=4.5e+02 Score=24.82 Aligned_cols=13 Identities=8% Similarity=0.327 Sum_probs=9.1
Q ss_pred CCChhhHHHHHHH
Q 005310 57 NLSASEILKLADK 69 (703)
Q Consensus 57 ~~s~~~i~~~~~~ 69 (703)
++||++|.+.-..
T Consensus 29 tysp~~l~~i~~~ 41 (142)
T TIGR03042 29 TYSPAQLAQIQRQ 41 (142)
T ss_pred CCCHHHHHHHHHH
Confidence 5788888765444
No 85
>PRK03072 heat shock protein HtpX; Provisional
Probab=26.44 E-value=42 Score=35.58 Aligned_cols=22 Identities=27% Similarity=0.196 Sum_probs=16.7
Q ss_pred cChhhHH-HHHHHHHHHHHHhhh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQICN 506 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~lls 506 (703)
|+-+|+. .++||+||.-|.=.-
T Consensus 122 l~~~El~aVlAHElgHi~~~d~~ 144 (288)
T PRK03072 122 LNERELRGVLGHELSHVYNRDIL 144 (288)
T ss_pred CCHHHHHHHHHHHHHHHhcCCHH
Confidence 4667776 789999999875443
No 86
>PF01742 Peptidase_M27: Clostridial neurotoxin zinc protease This family is a subset of the Prosite family; InterPro: IPR000395 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M27 (clan MA(E)). A number of the proteins have been classified as non-peptidase homologues as they have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the family. There are seven antigenically distinct forms of botulinum neurotoxin, designated A, B, C1, D, E, F and G. The seven neurotoxins are potent protein toxins that inhibit neurotransmitter release from peripheral cholinergic synapses []. On binding to the neuronal synapses, the molecules are internalised and move by retrograde transport up the axon into the spinal cord, where they can move between post- and presynaptic neurons. The toxin inhibits neurotransmitter release by acting as a zinc endopeptidase that cleaves synaptic proteins such as synaptobrevins, syntaxin and SNAP-25 []. The protein toxins exist as disulphide-linked heterodimers of light and heavy chains. The light chain has the pharmacological activity, while the N- and C-termini of the heavy chain mediate channel formation and toxin binding []. 
The light chain exhibits a high level of sequence similarity to tetanus toxin (TeTx). Alignment of all characterised neurotoxin sequences reveals the presence of highly conserved amino acid domains interspersed with amino acid tracts with little overall similarity. The most divergent region corresponds to the C-terminal extremity of each toxin, which may reflect differences in specificity of binding to neurone acceptor sites []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0009405 pathogenesis; PDB: 2QN0_A 3D3X_A 3NF3_A 2ISG_A 2ISE_B 2IMC_B 2IMA_B 2ILP_A 2IMB_A 2ISH_A ....
Probab=26.19 E-value=48 Score=36.26 Aligned_cols=25 Identities=16% Similarity=0.243 Sum_probs=19.4
Q ss_pred CccChhhHHHHHHHHHHHHHHhhhc
Q 005310 483 SLLRFSEVVNFFHEFGHVVQQICNR 507 (703)
Q Consensus 483 ~lL~~~~V~TLfHEfGHalH~lls~ 507 (703)
+..-.|=+.+|+||+=|+||.+.+-
T Consensus 208 ~~F~~DPAl~LmheLIh~Lh~LYGi 232 (408)
T PF01742_consen 208 SEFYADPALELMHELIHSLHGLYGI 232 (408)
T ss_dssp SEEE--HHHHHHHHHHHHHHHHTTT
T ss_pred hheecCHHHHHHHHHHHHHHhhccc
Confidence 3445566899999999999999985
No 87
>PRK01265 heat shock protein HtpX; Provisional
Probab=26.06 E-value=42 Score=36.18 Aligned_cols=21 Identities=29% Similarity=0.320 Sum_probs=16.5
Q ss_pred cChhhHH-HHHHHHHHHHHHhh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQIC 505 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~ll 505 (703)
|+-+|+. .+.||+||.-|.-.
T Consensus 135 l~~~El~aVlAHElgHik~~d~ 156 (324)
T PRK01265 135 LNRDEIKAVAGHELGHLKHRDV 156 (324)
T ss_pred CCHHHHHHHHHHHHHHHHcccH
Confidence 4668876 78999999887644
No 88
>PTZ00337 surface protease GP63; Provisional
Probab=24.99 E-value=78 Score=36.89 Aligned_cols=33 Identities=21% Similarity=0.292 Sum_probs=23.0
Q ss_pred cccEEEEeccCCCCCCCCCCccChhhHHHHHHHHHHHH
Q 005310 464 QIPVALLISQIQKDIVGHPSLLRFSEVVNFFHEFGHVV 501 (703)
Q Consensus 464 q~P~~~l~~Nf~~~~~~~p~lL~~~~V~TLfHEfGHal 501 (703)
..|++.+ .||++..- | .+-..+.+++||+.|||
T Consensus 209 ~RPi~G~-in~np~~i--~--~~~~~~~v~~HEi~HAL 241 (567)
T PTZ00337 209 GRPFAAA-VNFDPRQI--A--VTNGDVRVAAHELGHAL 241 (567)
T ss_pred CCceEEE-EEECHHHc--c--chhHHHHHHHHHHHHHH
Confidence 4687654 68886321 1 14567899999999999
No 89
>PF09610 Myco_arth_vir_N: Mycoplasma virulence signal region (Myco_arth_vir_N); InterPro: IPR011732 This entry represents the N-terminal region of a family of large, virulence-associated proteins in Mycoplasma arthritidis and smaller proteins in Mycoplasma capricolum. It includes a probable signal sequence or signal anchor, which, in most instances, has four consecutive Lys residues before the hydrophobic stretch.
Probab=24.41 E-value=32 Score=23.64 Aligned_cols=23 Identities=13% Similarity=0.111 Sum_probs=16.9
Q ss_pred hhhhhhHHHHhhHHHHHHHHhhh
Q 005310 13 QKRERNVLAFTGATALAVIAVNF 35 (703)
Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~ 35 (703)
+|+.-+++..+++|++++.+...
T Consensus 5 KKKK~~Il~la~~a~l~as~s~g 27 (33)
T PF09610_consen 5 KKKKIKILTLALTASLLASGSFG 27 (33)
T ss_pred hhhhhhhhhHHHHHHHHHceeee
Confidence 46667788888888888766543
No 90
>PRK05457 heat shock protein HtpX; Provisional
Probab=24.25 E-value=51 Score=34.90 Aligned_cols=20 Identities=25% Similarity=0.311 Sum_probs=15.6
Q ss_pred cChhhHH-HHHHHHHHHHHHh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQI 504 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~l 504 (703)
|+-+|+. .+.||+||.-|.-
T Consensus 129 L~~~El~aVlAHElgHi~~~d 149 (284)
T PRK05457 129 MSRDEVEAVLAHEISHIANGD 149 (284)
T ss_pred CCHHHHHHHHHHHHHHHHcCC
Confidence 4667776 7899999988743
No 91
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=24.07 E-value=47 Score=34.99 Aligned_cols=17 Identities=29% Similarity=0.550 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHhhh
Q 005310 490 VVNFFHEFGHVVQQICN 506 (703)
Q Consensus 490 V~TLfHEfGHalH~lls 506 (703)
+..+.||+||++=...-
T Consensus 136 isvvvHElgHal~A~~~ 152 (277)
T cd06162 136 ISGVVHEMGHGVAAVRE 152 (277)
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 45789999999876643
No 92
>PF05055 DUF677: Protein of unknown function (DUF677); InterPro: IPR007749 This entry contains proteins belonging to the UPF0496 family, found in plants. This family includes AT14A like proteins from Arabidopsis thaliana. At14a contains a small domain that has sequence similarities to integrins from fungi, insects and humans. Transcripts of At14a are found in all Arabidopsis tissues and the protein localises partly to the plasma membrane [].
Probab=23.95 E-value=2.7e+02 Score=30.20 Aligned_cols=50 Identities=20% Similarity=0.118 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHHHhCCC-CCCCCHHHHHHHHHhhhhhhccccccccc
Q 005310 65 KLADKIVSKSKEVHDAVASVP-LDKVTFMNVISPLAELEAQQFPLVQSCVF 114 (703)
Q Consensus 65 ~~~~~~i~~~~~~~~~i~~~~-~~~~t~~n~i~~l~~l~~~l~~~~~~~~~ 114 (703)
......+++.++.++.+..-+ ....+++++=.-++++.+++......+.+
T Consensus 232 ~k~~~al~~~~~~l~~aakGtyI~~~DldTIsrLV~RL~deIE~~~~~v~f 282 (336)
T PF05055_consen 232 KKYEEALKKQKEQLDAAAKGTYILIKDLDTISRLVDRLEDEIEHMKALVDF 282 (336)
T ss_pred HHHHHHHHHHHHHHHHHHhccchHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 334445666666666664311 00123333333344444555444444444
No 93
>cd07635 BAR_GRAF2 The Bin/Amphiphysin/Rvs (BAR) domain of GTPase Regulator Associated with Focal adhesion 2. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. GTPase Regulator Associated with Focal adhesion kinase 2 (GRAF2), also called Rho GTPase activating protein 10 (ARHGAP10) or PS-GAP, is a GAP with activity towards Cdc42 and RhoA which regulates caspase-activated p21-activated protein kinase-2 (PAK-2p34). GRAF2 interacts with PAK-2p34, leading to its stabilization and decrease of cell death. It is highly expressed in skeletal muscle and also interacts with PKNbeta, which is a target of Rho. GRAF2 contains an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, a Rho GAP domain, and a C-terminal SH3 domain. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The BAR domain of the related prote
Probab=23.89 E-value=3.4e+02 Score=27.34 Aligned_cols=121 Identities=9% Similarity=0.158 Sum_probs=66.4
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHHHHHHHHHHHHH
Q 005310 60 ASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEAERKIDAHM 139 (703)
Q Consensus 60 ~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~~~~l~~~~ 139 (703)
-+...+.++.+|+.|+..+++..........|.+.+..+-. .++-...+|++.- ..++.+.+..+.
T Consensus 11 le~~~k~i~kLiK~c~~~i~a~k~~~~a~~~Fa~~L~~f~~------------~~~gd~~~dde~~--i~~sl~ef~~~~ 76 (207)
T cd07635 11 LERTNRFIKELLKDGKNLIAATKSLSAAQRKFAHSLRDFKF------------EFIGDAETDDERC--IDASLQEFSNFL 76 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------------hcCCCcccchhHH--HHHHHHHHHHHH
Confidence 35667888899999999888876542334556665554432 2344455555322 245555555555
Q ss_pred HHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q 005310 140 LSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYV 207 (703)
Q Consensus 140 ~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~ 207 (703)
.++..-.. .|-..-.+.+..-+..|+...+.--.+.|++|.....+......+|.
T Consensus 77 ~el~d~r~-------------~L~~~~~~~l~~pL~~F~kedl~~~Ke~KK~FdK~se~~~~Al~K~~ 131 (207)
T cd07635 77 KNLEEQRE-------------IMALNVTETLIKPLERFRKEQLGAVKEEKKKFDKETEKNYSLLEKHL 131 (207)
T ss_pred HHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHH
Confidence 54433111 11112234455556677776554334567777766666655554444
No 94
>cd07601 BAR_APPL The Bin/Amphiphysin/Rvs (BAR) domain of Adaptor protein, Phosphotyrosine interaction, PH domain and Leucine zipper containing proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Adaptor protein, Phosphotyrosine interaction, PH domain and Leucine zipper containing (APPL) proteins are effectors of the small GTPase Rab5 that function in endosome-mediated signaling. They contain BAR, pleckstrin homology (PH) and phosphotyrosine binding (PTB) domains. They form homo- and hetero-oligomers that are mediated by their BAR domains, and are localized to cytoplasmic membranes. Vertebrates contain two APPL proteins, APPL1 and APPL2. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=22.88 E-value=6.5e+02 Score=25.50 Aligned_cols=122 Identities=12% Similarity=0.165 Sum_probs=64.2
Q ss_pred CCChhhHHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHhhhhhhcccccccccccccCCcHHHHHHHHHHHHHHH
Q 005310 57 NLSASEILKLADKIVSKSKEVHDAVASVPLDKVTFMNVISPLAELEAQQFPLVQSCVFPKLVCTSDDVCKSSAEAERKID 136 (703)
Q Consensus 57 ~~s~~~i~~~~~~~i~~~~~~~~~i~~~~~~~~t~~n~i~~l~~l~~~l~~~~~~~~~~~~v~~d~e~r~aa~e~~~~l~ 136 (703)
.-+.+++...++.+++.|++++++..........|.+.+..+..- .|. ..+|+++.. ++..+++
T Consensus 8 E~d~~~L~~~~~kL~K~c~~~~~a~~~~~~A~~~F~~~L~ef~~~-----------~f~--~~~dDe~~~---~~l~kFs 71 (215)
T cd07601 8 EEDALQLSSYMNQLLQACKRVYDAQNELKSATQALSKKLGEYEKQ-----------KFE--LGRDDEILV---STLKQFS 71 (215)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------ccC--CCCCcHHHH---HHHHHHH
Confidence 346788999999999999999998765422234455555444321 111 125555443 2555555
Q ss_pred HHHHHhhCCHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHH
Q 005310 137 AHMLSCSNREDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLRDQIDELSLQYV 207 (703)
Q Consensus 137 ~~~~~l~~~~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~~~i~~l~~~f~ 207 (703)
....++..-.. . |-..-.+.+-.-+..|....+.=-.+.|++|............+|.
T Consensus 72 ~~l~El~~~~~---~----------L~~q~~~~l~~pL~~F~k~Dl~~vKe~kK~FdK~s~~~d~al~K~~ 129 (215)
T cd07601 72 KVVDELSTMHS---T----------LSSQLADTVLHPISQFMESDLAEIMTLKELFKAASNDHDGVLSKYS 129 (215)
T ss_pred HHHHHHHHHHH---H----------HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhHHHHHHHHHHh
Confidence 55555543111 1 1112233444555666665333222456666555555544443333
No 95
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. 
In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. 
This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=22.87 E-value=54 Score=31.19 Aligned_cols=14 Identities=29% Similarity=0.271 Sum_probs=11.4
Q ss_pred hHHHHHHHHHHHHH
Q 005310 489 EVVNFFHEFGHVVQ 502 (703)
Q Consensus 489 ~V~TLfHEfGHalH 502 (703)
.+..+.|||+|++-
T Consensus 135 ~lDVvaHEltHGVt 148 (150)
T PF01447_consen 135 SLDVVAHELTHGVT 148 (150)
T ss_dssp -HHHHHHHHHHHHH
T ss_pred ccceeeeccccccc
Confidence 47888999999974
No 96
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. 
The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=22.58 E-value=54 Score=32.42 Aligned_cols=16 Identities=38% Similarity=0.470 Sum_probs=13.1
Q ss_pred ChhhHHHHHHHHHHHH
Q 005310 486 RFSEVVNFFHEFGHVV 501 (703)
Q Consensus 486 ~~~~V~TLfHEfGHal 501 (703)
.+.-..+++||+||.|
T Consensus 128 ~~~~a~~~AHelGH~l 143 (199)
T PF01421_consen 128 GLSFAVIIAHELGHNL 143 (199)
T ss_dssp HHHHHHHHHHHHHHHT
T ss_pred hHHHHHHHHHHHHHhc
Confidence 4556789999999986
No 97
>PF01418 HTH_6: Helix-turn-helix domain, rpiR family; InterPro: IPR000281 This domain contains a helix-turn-helix motif []. Every member of this family is N-terminal to a SIS domain IPR001347 from INTERPRO. Members of this family are probably regulators of genes involved in phosphosugar metabolism.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2O3F_B 3IWF_B.
Probab=22.47 E-value=70 Score=26.50 Aligned_cols=18 Identities=22% Similarity=0.772 Sum_probs=15.4
Q ss_pred HHHHHHcCCCCHHHhHhh
Q 005310 292 HKFARLLGYSNYADYALD 309 (703)
Q Consensus 292 ~~~A~llGy~sya~~~l~ 309 (703)
.++++.+||++|.|+...
T Consensus 52 ~Rf~kkLG~~gf~efk~~ 69 (77)
T PF01418_consen 52 VRFCKKLGFSGFKEFKIA 69 (77)
T ss_dssp HHHHHHCTTTCHHHHHHH
T ss_pred HHHHHHhCCCCHHHHHHH
Confidence 456999999999998875
No 98
>PF05823 Gp-FAR-1: Nematode fatty acid retinoid binding protein (Gp-FAR-1); InterPro: IPR008632 Parasitic nematodes produce at least two structurally novel classes of small helix-rich retinol- and fatty-acid-binding proteins that have no counterparts in their plant or animal hosts and thus represent potential targets for new nematicides. Gp-FAR-1 is a member of the nematode-specific fatty-acid- and retinol-binding (FAR) family of proteins but localises to the surface of the organism, placing it in a strategic position for interaction with the host. Gp-FAR-1 functions as a broad-spectrum retinol- and fatty-acid-binding protein, and it is thought that it is involved in the evasion of primary host plant defence systems [].; GO: 0008289 lipid binding; PDB: 2W9Y_A.
Probab=22.28 E-value=1.6e+02 Score=28.05 Aligned_cols=33 Identities=21% Similarity=0.406 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHh----cCCCCCHHHHHHHHHHHHHHH
Q 005310 146 EDVYRVVKAVAA----TGEWVSPEAKRYIQSLVRDFE 178 (703)
Q Consensus 146 ~~ly~~l~~~~~----~~~~L~~e~~r~l~~~~~~f~ 178 (703)
+.||..+..+.. +-+.|+||.+.+++.++...+
T Consensus 54 P~L~~k~~~l~~~~k~ki~~L~peak~Fv~~li~~~~ 90 (154)
T PF05823_consen 54 PSLYEKAEKLRDKLKKKIDKLSPEAKAFVKELIAKAR 90 (154)
T ss_dssp HHHHHHHHHHHHHHHHTTTT--HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Confidence 445554443322 457899999999988877654
No 99
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=21.38 E-value=19 Score=36.72 Aligned_cols=17 Identities=24% Similarity=0.249 Sum_probs=13.6
Q ss_pred ChhhHHHHHHHHHHHHH
Q 005310 486 RFSEVVNFFHEFGHVVQ 502 (703)
Q Consensus 486 ~~~~V~TLfHEfGHalH 502 (703)
..+.-.|+.||+||.|-
T Consensus 134 ~~n~g~t~~HEvGH~lG 150 (225)
T cd04275 134 PYNLGDTATHEVGHWLG 150 (225)
T ss_pred cccccceeEEeccceee
Confidence 44556799999999885
No 100
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=21.09 E-value=60 Score=31.99 Aligned_cols=18 Identities=22% Similarity=0.292 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHhhhcC
Q 005310 491 VNFFHEFGHVVQQICNRA 508 (703)
Q Consensus 491 ~TLfHEfGHalH~lls~~ 508 (703)
..+.||+||++=...-..
T Consensus 43 ~l~iHElgH~~~A~~~G~ 60 (183)
T cd06160 43 ILGIHEMGHYLAARRHGV 60 (183)
T ss_pred HHHHHHHHHHHHHHHCCC
Confidence 456999999987665543
No 101
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. 
The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belongs to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii. Gametolysin is a zinc-containing metalloprotease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=20.85 E-value=48 Score=35.65 Aligned_cols=15 Identities=27% Similarity=0.585 Sum_probs=11.7
Q ss_pred hhHHHHHHHHHHHHH
Q 005310 488 SEVVNFFHEFGHVVQ 502 (703)
Q Consensus 488 ~~V~TLfHEfGHalH 502 (703)
.+..|+|||+||-+=
T Consensus 149 ~~~~~~~HElgHN~G 163 (314)
T PF05548_consen 149 QDWATIMHELGHNLG 163 (314)
T ss_pred ccHHHHHHHhhhhcc
Confidence 345699999999863
No 102
>PRK02391 heat shock protein HtpX; Provisional
Probab=20.76 E-value=61 Score=34.51 Aligned_cols=21 Identities=29% Similarity=0.311 Sum_probs=15.7
Q ss_pred cChhhHH-HHHHHHHHHHHHhh
Q 005310 485 LRFSEVV-NFFHEFGHVVQQIC 505 (703)
Q Consensus 485 L~~~~V~-TLfHEfGHalH~ll 505 (703)
|+-+|+. .+.||+||.-|.=.
T Consensus 128 L~~~El~aVlaHElgHi~~~di 149 (296)
T PRK02391 128 LDPDELEAVLAHELSHVKNRDV 149 (296)
T ss_pred CCHHHHHHHHHHHHHHHHcCCH
Confidence 4557765 78999999887643
No 103
>TIGR00054 RIP metalloprotease RseP. A model that also detects fragments matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=20.46 E-value=56 Score=36.62 Aligned_cols=20 Identities=30% Similarity=0.463 Sum_probs=15.6
Q ss_pred hHHHHHHHHHHHHHHhhhcC
Q 005310 489 EVVNFFHEFGHVVQQICNRA 508 (703)
Q Consensus 489 ~V~TLfHEfGHalH~lls~~ 508 (703)
-+..++||+||.+-.-....
T Consensus 14 ~~~v~~HE~gH~~~a~~~g~ 33 (420)
T TIGR00054 14 AVLIFVHELGHFLAARLCGI 33 (420)
T ss_pred HHHHHHHhHHHHHHHHHcCC
Confidence 46788999999988776543
No 104
>PF08328 ASL_C: Adenylosuccinate lyase C-terminal; InterPro: IPR013539 This domain is found at the C terminus of adenylosuccinate lyase (ASL; PurB in Escherichia coli). It has been identified in bacteria, eukaryotes and archaea and is found together with the lyase domain IPR000362 from INTERPRO. ASL catalyses the cleavage of succinylaminoimidazole carboxamide ribotide to aminoimidazole carboxamide ribotide and fumarate and the cleavage of adenylosuccinate to adenylate and fumarate []. ; GO: 0004018 N6-(1,2-dicarboxyethyl)AMP AMP-lyase (fumarate-forming) activity, 0006188 IMP biosynthetic process; PDB: 2HVG_A 2QGA_C 2PTS_A 2PTR_A 2PTQ_B 3BHG_A 3GZH_A.
Probab=20.26 E-value=2.2e+02 Score=25.77 Aligned_cols=42 Identities=19% Similarity=0.385 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHH
Q 005310 146 EDVYRVVKAVAATGEWVSPEAKRYIQSLVRDFERSGLNLTISKREEVQRLR 196 (703)
Q Consensus 146 ~~ly~~l~~~~~~~~~L~~e~~r~l~~~~~~f~~~G~~L~~~~r~~~~~l~ 196 (703)
++=|+.+|++- .+...+.+.. ++|.. +..++++.++++.+|.
T Consensus 69 ~~pYE~LK~lT-Rg~~it~~~l-------~~fI~-~L~ip~~~k~~L~~lt 110 (115)
T PF08328_consen 69 PNPYEKLKELT-RGKKITKEDL-------REFIE-SLDIPEEAKARLLALT 110 (115)
T ss_dssp SSHHHHHHHHH-TTS---HHHH-------HHHHH-TSSS-HHHHHHHHH--
T ss_pred CCHHHHHHHHH-cCCCCCHHHH-------HHHHH-hCCCCHHHHHHHHhcC
Confidence 45589999886 3445665554 78887 7889999999988764
No 105
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=20.18 E-value=21 Score=35.71 Aligned_cols=14 Identities=43% Similarity=0.743 Sum_probs=11.9
Q ss_pred hhHHHHHHHHHHHH
Q 005310 488 SEVVNFFHEFGHVV 501 (703)
Q Consensus 488 ~~V~TLfHEfGHal 501 (703)
..+.+++||+||.|
T Consensus 139 ~~a~~~aHElGH~L 152 (207)
T cd04273 139 SSAFTIAHELGHVL 152 (207)
T ss_pred eeEEeeeeechhhc
Confidence 45689999999987
No 106
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=20.16 E-value=1.8e+02 Score=22.44 Aligned_cols=54 Identities=17% Similarity=0.276 Sum_probs=42.4
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHhccCCCCccHHHHHHHhhCCCCChHHHHH
Q 005310 643 AADIFACKFRDGLLNQRVGMQFRNNVLAPGGSKEPIEILSDFLGREPSIQAFVD 696 (703)
Q Consensus 643 A~~i~~~~f~~~~~~~~~g~~~r~~vL~~Ggs~~~~ell~~flGr~p~~~a~l~ 696 (703)
|..++.+.+..+|.|.++.-.+-+..++.|.-....++++..+-..|+...+..
T Consensus 10 A~~~~~~~l~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~~~ 63 (68)
T PF14559_consen 10 AIELLEKALQRNPDNPEARLLLAQCYLKQGQYDEAEELLERLLKQDPDNPEYQQ 63 (68)
T ss_dssp HHHHHHHHHHHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHHHH
T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHHHH
Confidence 455565566668888988888888899999999999999999999999654443
Done!