Query psy3708
Match_columns 240
No_of_seqs 183 out of 1556
Neff 9.2
Searched_HMMs 46136
Date Fri Aug 16 23:30:49 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3708.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3708hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1046|consensus 100.0 1.3E-47 2.7E-52 361.7 15.8 214 1-220 176-406 (882)
2 PF01433 Peptidase_M1: Peptida 100.0 8.4E-45 1.8E-49 318.0 16.6 208 1-216 147-369 (390)
3 TIGR02412 pepN_strep_liv amino 100.0 2.8E-43 6.1E-48 331.9 20.6 190 1-198 141-345 (831)
4 TIGR02414 pepN_proteo aminopep 100.0 4E-40 8.6E-45 308.3 18.5 181 1-183 129-326 (863)
5 PRK14015 pepN aminopeptidase N 100.0 5.7E-40 1.2E-44 308.0 19.4 181 1-183 142-339 (875)
6 COG0308 PepN Aminopeptidase N 100.0 1.6E-37 3.5E-42 293.1 17.6 195 1-199 157-367 (859)
7 TIGR02411 leuko_A4_hydro leuko 100.0 5.4E-37 1.2E-41 279.6 15.5 171 1-187 147-326 (601)
8 KOG1047|consensus 100.0 1.3E-30 2.8E-35 225.6 12.3 203 2-218 153-364 (613)
9 KOG1932|consensus 99.9 4.6E-22 1E-26 183.7 15.0 186 2-200 198-397 (1180)
10 KOG1046|consensus 99.1 6.9E-11 1.5E-15 112.9 3.8 56 185-240 250-305 (882)
11 PF13485 Peptidase_MA_2: Pepti 98.6 4.5E-08 9.7E-13 71.8 3.0 41 138-178 23-63 (128)
12 TIGR02414 pepN_proteo aminopep 98.4 1.8E-07 3.8E-12 89.2 3.6 50 191-240 215-264 (863)
13 TIGR02412 pepN_strep_liv amino 98.4 2.3E-07 5.1E-12 88.8 4.0 49 192-240 222-270 (831)
14 PRK14015 pepN aminopeptidase N 98.2 1E-06 2.2E-11 84.3 3.8 49 192-240 229-277 (875)
15 COG0308 PepN Aminopeptidase N 98.2 9.6E-07 2.1E-11 84.8 3.2 51 189-240 238-288 (859)
16 PF01433 Peptidase_M1: Peptida 98.1 2.5E-06 5.3E-11 74.9 3.3 51 190-240 226-276 (390)
17 COG3975 Predicted protease wit 97.9 0.00011 2.4E-09 65.2 9.7 110 71-182 168-300 (558)
18 PF07607 DUF1570: Protein of u 97.2 0.0002 4.4E-09 52.6 1.7 38 142-180 3-43 (128)
19 PF05299 Peptidase_M61: M61 gl 97.0 0.00046 1E-08 50.3 2.5 44 140-183 4-58 (122)
20 TIGR02411 leuko_A4_hydro leuko 97.0 0.00062 1.3E-08 63.2 3.9 41 197-239 226-267 (601)
21 PF10460 Peptidase_M30: Peptid 96.7 0.01 2.3E-07 51.3 8.3 49 137-185 136-188 (366)
22 PF04450 BSP: Peptidase of pla 96.2 0.0061 1.3E-07 48.7 3.8 40 137-179 93-132 (205)
23 PF10026 DUF2268: Predicted Zn 95.6 0.067 1.5E-06 42.5 7.6 49 136-184 61-113 (195)
24 PRK04860 hypothetical protein; 94.6 0.074 1.6E-06 40.8 5.0 65 89-153 4-76 (160)
25 PF10023 DUF2265: Predicted am 94.0 0.27 5.9E-06 42.1 7.5 40 138-183 163-202 (337)
26 PF01447 Peptidase_M4: Thermol 92.8 0.51 1.1E-05 35.8 6.6 66 85-151 67-146 (150)
27 COG4324 Predicted aminopeptida 91.4 0.22 4.7E-06 41.0 3.3 39 138-182 195-233 (376)
28 PF12315 DUF3633: Protein of u 89.9 0.45 9.8E-06 37.8 3.8 40 141-182 94-133 (212)
29 COG3227 LasB Zinc metalloprote 89.8 0.72 1.6E-05 41.1 5.3 138 85-225 269-426 (507)
30 PF12725 DUF3810: Protein of u 89.1 0.8 1.7E-05 39.3 5.0 32 139-182 195-226 (318)
31 PF01435 Peptidase_M48: Peptid 88.0 0.39 8.6E-06 38.5 2.4 63 97-159 36-108 (226)
32 smart00731 SprT SprT homologue 87.7 1.1 2.3E-05 33.8 4.5 30 125-154 44-73 (146)
33 PRK05457 heat shock protein Ht 86.6 1.4 3E-05 37.1 5.0 59 95-155 80-149 (284)
34 PRK04897 heat shock protein Ht 85.3 1.3 2.9E-05 37.5 4.3 60 94-155 82-152 (298)
35 PRK01345 heat shock protein Ht 84.5 1.7 3.6E-05 37.3 4.5 60 94-155 69-139 (317)
36 PF13699 DUF4157: Domain of un 84.4 2.4 5.1E-05 28.4 4.3 53 97-152 6-73 (79)
37 PRK03001 M48 family peptidase; 84.0 1.6 3.6E-05 36.7 4.3 56 94-151 69-135 (283)
38 PRK03072 heat shock protein Ht 82.8 2.2 4.8E-05 36.0 4.6 56 94-151 72-138 (288)
39 PRK03982 heat shock protein Ht 81.5 2.8 6E-05 35.4 4.7 17 136-152 121-137 (288)
40 PF10263 SprT-like: SprT-like 81.4 3.1 6.7E-05 31.4 4.6 30 125-154 45-74 (157)
41 PRK02391 heat shock protein Ht 81.4 2.6 5.7E-05 35.7 4.5 60 94-155 78-148 (296)
42 KOG2661|consensus 80.3 2.5 5.4E-05 36.1 3.9 21 135-155 270-290 (424)
43 PRK01265 heat shock protein Ht 80.1 2.8 6.1E-05 36.0 4.3 55 95-151 86-151 (324)
44 PRK02870 heat shock protein Ht 79.9 3.7 8.1E-05 35.5 5.0 27 125-151 158-184 (336)
45 PRK04351 hypothetical protein; 76.9 3.5 7.6E-05 31.2 3.5 49 101-152 14-73 (149)
46 PF13574 Reprolysin_2: Metallo 75.6 1.8 3.8E-05 33.6 1.7 12 141-152 112-123 (173)
47 PF06114 DUF955: Domain of unk 72.4 2.9 6.3E-05 29.4 2.1 31 123-157 29-59 (122)
48 PF01431 Peptidase_M13: Peptid 72.1 6.4 0.00014 31.2 4.2 64 113-176 7-75 (206)
49 COG3590 PepO Predicted metallo 72.0 1.7 3.7E-05 39.8 0.8 65 112-176 457-526 (654)
50 PF01863 DUF45: Protein of unk 71.1 15 0.00032 28.9 6.1 65 88-155 108-179 (205)
51 PF08325 WLM: WLM domain; Int 70.0 3.5 7.6E-05 32.4 2.2 25 132-156 74-98 (186)
52 COG2856 Predicted Zn peptidase 69.5 12 0.00027 30.1 5.2 54 123-180 59-117 (213)
53 COG1451 Predicted metal-depend 63.7 21 0.00045 29.0 5.5 65 88-155 119-190 (223)
54 KOG2719|consensus 63.7 11 0.00024 33.5 4.2 49 101-151 226-291 (428)
55 COG0501 HtpX Zn-dependent prot 62.5 4.5 9.8E-05 33.9 1.6 25 134-158 151-175 (302)
56 PF13688 Reprolysin_5: Metallo 60.3 5.5 0.00012 31.2 1.6 15 138-152 140-154 (196)
57 PHA02456 zinc metallopeptidase 60.0 6 0.00013 28.1 1.5 26 124-153 67-92 (141)
58 PF08014 DUF1704: Domain of un 57.4 89 0.0019 27.2 8.6 51 123-179 150-213 (349)
59 PRK13267 archaemetzincin-like 55.0 13 0.00027 29.1 2.8 16 137-152 122-137 (179)
60 PF05569 Peptidase_M56: BlaR1 52.7 21 0.00046 30.1 4.0 29 123-151 179-207 (299)
61 PF13582 Reprolysin_3: Metallo 50.9 8.8 0.00019 27.5 1.3 11 142-152 109-119 (124)
62 COG4783 Putative Zn-dependent 50.8 11 0.00023 34.0 1.9 21 137-157 127-147 (484)
63 cd04269 ZnMc_adamalysin_II_lik 49.9 12 0.00025 29.4 1.9 15 138-152 129-143 (194)
64 COG3091 SprT Zn-dependent meta 49.4 21 0.00045 27.0 3.0 23 138-161 59-85 (156)
65 PF01421 Reprolysin: Reprolysi 48.6 29 0.00062 27.3 4.0 15 137-151 128-142 (199)
66 cd04270 ZnMc_TACE_like Zinc-de 46.0 13 0.00028 30.6 1.6 15 140-155 167-181 (244)
67 COG2719 SpoVR Uncharacterized 42.5 17 0.00037 32.4 1.9 50 141-196 250-299 (495)
68 COG5504 Predicted Zn-dependent 42.2 76 0.0016 26.3 5.4 46 138-183 138-189 (280)
69 KOG3607|consensus 41.5 71 0.0015 30.8 5.9 78 75-152 243-335 (716)
70 PF13203 DUF2201_N: Putative m 41.3 56 0.0012 27.4 4.9 39 123-161 43-81 (292)
71 cd04267 ZnMc_ADAM_like Zinc-de 40.4 13 0.00029 28.9 0.9 14 139-152 132-145 (192)
72 COG4900 Predicted metallopepti 39.7 70 0.0015 22.9 4.3 11 141-151 81-91 (133)
73 PF09865 DUF2092: Predicted pe 38.3 69 0.0015 25.8 4.7 58 52-114 55-112 (214)
74 cd04272 ZnMc_salivary_gland_MP 37.7 20 0.00043 28.8 1.5 13 140-152 145-157 (220)
75 PF14891 Peptidase_M91: Effect 35.9 23 0.00051 27.3 1.6 15 139-153 102-116 (174)
76 PF00413 Peptidase_M10: Matrix 35.5 25 0.00054 26.1 1.7 13 139-151 104-116 (154)
77 PF02031 Peptidase_M7: Strepto 35.4 25 0.00054 25.9 1.5 14 138-151 75-88 (132)
78 PF09768 Peptidase_M76: Peptid 33.8 1.8E+02 0.0038 22.6 6.1 18 135-152 66-83 (173)
79 cd04268 ZnMc_MMP_like Zinc-dep 33.0 35 0.00077 25.6 2.2 21 89-109 15-35 (165)
80 PF05572 Peptidase_M43: Pregna 31.6 28 0.0006 26.4 1.4 13 141-154 70-82 (154)
81 PF08219 TOM13: Outer membrane 30.9 33 0.00072 22.6 1.4 11 142-152 54-64 (77)
82 PF13583 Reprolysin_4: Metallo 30.8 26 0.00056 27.9 1.1 11 142-152 139-149 (206)
83 cd04279 ZnMc_MMP_like_1 Zinc-d 30.5 34 0.00074 25.7 1.7 35 73-107 2-39 (156)
84 PF12388 Peptidase_M57: Dual-a 30.2 32 0.00068 27.7 1.5 32 73-104 39-71 (211)
85 cd04271 ZnMc_ADAM_fungal Zinc- 29.2 20 0.00044 29.1 0.3 11 142-152 147-157 (228)
86 PF04228 Zn_peptidase: Putativ 29.2 35 0.00076 28.9 1.7 12 140-151 170-181 (292)
87 PF13402 M60-like: Peptidase M 27.3 2.2E+02 0.0047 23.9 6.3 94 86-184 144-258 (307)
88 PF07998 Peptidase_M54: Peptid 27.3 52 0.0011 26.1 2.3 16 137-152 142-157 (194)
89 PF07765 KIP1: KIP1-like prote 27.0 1.7E+02 0.0036 19.3 4.2 23 152-176 3-25 (74)
90 COG3824 Predicted Zn-dependent 26.8 34 0.00074 24.8 1.0 15 138-152 107-121 (136)
91 cd00203 ZnMc Zinc-dependent me 25.9 35 0.00075 25.6 1.0 22 88-109 21-42 (167)
92 PF14521 Aspzincin_M35: Lysine 25.9 43 0.00093 25.1 1.5 10 141-150 97-106 (148)
93 PF13265 DUF4056: Protein of u 25.7 31 0.00068 28.5 0.8 9 146-155 154-162 (270)
94 TIGR00181 pepF oligoendopeptid 25.5 77 0.0017 29.6 3.4 44 140-183 378-426 (591)
95 PF02163 Peptidase_M50: Peptid 25.0 41 0.00089 26.1 1.3 22 140-161 7-28 (192)
96 PF06262 DUF1025: Possibl zinc 24.9 68 0.0015 22.3 2.2 17 136-152 69-85 (97)
97 COG4784 Putative Zn-dependent 24.6 53 0.0011 28.6 1.9 32 135-166 119-150 (479)
98 PF03272 Enhancin: Viral enhan 23.1 7.3E+02 0.016 24.4 10.7 108 73-181 143-277 (775)
99 PF04298 Zn_peptidase_2: Putat 22.5 2.6E+02 0.0056 22.7 5.4 12 139-150 88-99 (222)
100 COG1164 Oligoendopeptidase F [ 22.5 99 0.0021 29.1 3.5 44 140-184 380-429 (598)
101 cd04278 ZnMc_MMP Zinc-dependen 22.4 48 0.001 24.9 1.2 14 138-151 105-118 (157)
102 PRK11767 SpoVR family protein; 21.5 61 0.0013 29.6 1.8 46 135-186 247-292 (498)
103 PF10186 Atg14: UV radiation r 20.9 1.4E+02 0.0029 24.9 3.8 28 85-112 195-222 (302)
104 cd04273 ZnMc_ADAMTS_like Zinc- 20.3 19 0.00042 28.6 -1.5 13 140-152 140-152 (207)
105 PF10463 Peptidase_U49: Peptid 20.2 1.1E+02 0.0025 24.5 2.9 20 139-158 100-119 (206)
106 PF06744 DUF1215: Protein of u 20.1 59 0.0013 23.5 1.2 29 199-227 41-71 (125)
107 PF08696 Dna2: DNA replication 20.0 1.6E+02 0.0035 23.5 3.8 68 142-214 94-164 (209)
No 1
>KOG1046|consensus
Probab=100.00 E-value=1.3e-47 Score=361.70 Aligned_cols=214 Identities=46% Similarity=0.772 Sum_probs=192.6
Q ss_pred CceEEEEEEEecCCcEEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEEEe
Q psy3708 1 MKSTFTISLGHPKKLTSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTWAR 80 (240)
Q Consensus 1 ~Katf~l~i~~p~~~~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~~~ 80 (240)
+||+|+|+|.||++++++|||+++++ ...++++++++|..|++||+|++||+||+|...+. .+.+|+++++|++
T Consensus 176 ~KAtF~Itl~hp~~~~aLSNm~v~~~--~~~~~~~~~~~F~~Tp~MstYLvAf~V~~f~~~e~----~~~~~v~vrv~a~ 249 (882)
T KOG1046|consen 176 FKATFTITLVHPKGYTALSNMPVIKE--EPVDDGWKTTTFEKTPKMSTYLVAFAVGDFVYVET----ITKSGVPVRVYAR 249 (882)
T ss_pred ccCceEEEEEecCCceEeecCccccc--ccccCCeeEEEEEecCCCchhhheeeeecccccee----ecCCCceEEEEeC
Confidence 69999999999999999999999983 55666699999999999999999999999999988 4778899999999
Q ss_pred CCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc---------------ceeeEcCCCCChhhHHHHHHHHH
Q psy3708 81 RDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL---------------SALLFDDVRSSIHNEYSIANTIA 145 (240)
Q Consensus 81 ~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~---------------~~~~~~~~~~~~~~~~~~~~~ia 145 (240)
|+...+.+++++.++++|+||+++||++||++|+|+|++|+| ..+++++..+...++.+++.+||
T Consensus 250 p~~~~~~~~al~~~~~~L~~~e~~f~i~yPLpK~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss~~~k~~va~vIa 329 (882)
T KOG1046|consen 250 PEKINQGQFALEVATKVLEFYEDYFGIPYPLPKLDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSSSSNKQRVAEVIA 329 (882)
T ss_pred hHHhhHHHHHHHHHHHHHHHHHHHhCCCCCCccccEEecCCccccchhcCcceeeeehhhccCCCcCcHHHHHHHHHHHH
Confidence 999999999999999999999999999999999999999998 66888898899999999999999
Q ss_pred HHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhcccccchhhhhHhhcchhhHH--HHHHHhccCCCCCCccE
Q psy3708 146 HELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILFDSQVDFAAEAGPEFLN--FFERYFNIRYPLPKQDM 220 (240)
Q Consensus 146 HEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~p~~k~d~ 220 (240)
||+|||||||+||++||+|+|||||||+|++++.++..+|+|...++|..+.....+. ....-+.+..|+...+.
T Consensus 330 HElAHQWFGNLVTm~wW~dLWLnEGfAt~~~~~~v~~~~p~~~~~~~~~~~~l~~~l~~D~l~~shpi~~~v~~~~e 406 (882)
T KOG1046|consen 330 HELAHQWFGNLVTMKWWNDLWLNEGFATYVEYLAVDHLFPEWDIWEQFLLENLERVLSLDALASSHPISVPVESPSE 406 (882)
T ss_pred HHHHHHHhcCcccHhhhhhhhhcccHHHHHHHHhhccCCcchhhHHHHHHHHHHHHhhhhcccccCCeeeecCCcch
Confidence 9999999999999999999999999999999999999999999998887777766553 33344445555555543
No 2
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=100.00 E-value=8.4e-45 Score=317.98 Aligned_cols=208 Identities=42% Similarity=0.714 Sum_probs=175.8
Q ss_pred CceEEEEEEEecCCcEEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEEEe
Q psy3708 1 MKSTFTISLGHPKKLTSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTWAR 80 (240)
Q Consensus 1 ~Katf~l~i~~p~~~~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~~~ 80 (240)
+||+|+++|++|++++|+|||++++. ...++|+++++|..++|||+|++||++|+|...+.. ..+++++++|++
T Consensus 147 ~ka~f~~~i~~p~~~~~~sng~~~~~--~~~~~~~~~~~f~~t~p~~~yl~a~~vg~~~~~~~~----~~~~~~v~~~~~ 220 (390)
T PF01433_consen 147 FKATFDLTITHPKDYTALSNGPLEEE--ESNDDGWKTTTFETTPPMPTYLFAFAVGDFESVEVT----TKSGVPVRVYAR 220 (390)
T ss_dssp SEEEEEEEEEEETTTEEEESSEEEEE--EEETTTEEEEEEEEEEEEEGGG--EEEESEEEEEEE----TTTEEEEEEEEE
T ss_pred ccceEEEeeeccccceeecccccccc--ccccccceeEeeecccccCchhhhhhcCcccccccc----cccccchheeeh
Confidence 59999999999999999999999984 445578999999999999999999999999998873 445799999999
Q ss_pred CCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc--------c-------eeeEcCCCCChhhHHHHHHHHH
Q psy3708 81 RDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL--------S-------ALLFDDVRSSIHNEYSIANTIA 145 (240)
Q Consensus 81 ~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~--------~-------~~~~~~~~~~~~~~~~~~~~ia 145 (240)
|+.....+.+++.+.+++++|+++||.|||++++++|++|++ + .++++++..+...+..+..+||
T Consensus 221 ~~~~~~~~~~~~~~~~~l~~~~~~~g~~yp~~k~~~v~~p~~~~~~me~~g~i~~~~~~l~~~~~~~~~~~~~~~~~~ia 300 (390)
T PF01433_consen 221 PGDEEQLQFALDIAPKALEYYEEYFGIPYPFKKLDIVAVPDFPFGGMENWGLITYRESYLLYDPDISTIGDKQEIASLIA 300 (390)
T ss_dssp CTCGGGHHHHHHHHHHHHHHHHHHHTS--SSSEEEEEEEST-SSSEE--TTEEEEEGGGTS-STTTS-HHHHHHHHHHHH
T ss_pred hhhHHHHHHHHHhhHHHHHHHHhhccccceecceeEEEEeccccccccccccccccccccccCcccccchhhhhhHHHHH
Confidence 999999999999999999999999999999999999999986 2 2334455566667778889999
Q ss_pred HHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhcccccchhhhhHhhcchhhHHHHHHHhccCCCCC
Q psy3708 146 HELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILFDSQVDFAAEAGPEFLNFFERYFNIRYPLP 216 (240)
Q Consensus 146 HEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~p~~ 216 (240)
||+|||||||+|++++|+|+||+||||+|++++++++.++++.+.+.+..+..+..++. +......|+.
T Consensus 301 hElahqWfGn~vt~~~w~d~WL~Eg~a~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--d~~~~~~pl~ 369 (390)
T PF01433_consen 301 HELAHQWFGNLVTPKWWSDLWLNEGFATYLEYLILEKLFGEWQMMELFLVQEMQRALRE--DALPNSHPLS 369 (390)
T ss_dssp HHHHTTTBTTTEEESSGGGHHHHHHHHHHHHHHHHHHHHGHHHHHHHHHHHHHHHHHHH--HTSTTCCCSS
T ss_pred HHHHHHHhccCCccccchhhhHHHHHHHHHHHHhHhhccCcccchhhhhhhhHHHHHHH--hhcCCCcceE
Confidence 99999999999999999999999999999999999999999888888877777555543 3334555655
No 3
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=100.00 E-value=2.8e-43 Score=331.95 Aligned_cols=190 Identities=32% Similarity=0.534 Sum_probs=163.0
Q ss_pred CceEEEEEEEecCCcEEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEEEe
Q psy3708 1 MKSTFTISLGHPKKLTSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTWAR 80 (240)
Q Consensus 1 ~Katf~l~i~~p~~~~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~~~ 80 (240)
+||+|+++|++|++|+|+|||++.++ ...++.++++|..++|||+|++||++|+|...+. +.+|+++++|++
T Consensus 141 ~KAtf~ltit~p~~~~v~sNg~~~~~---~~~~~~~~~~F~~t~pmstYL~a~~vG~f~~~~~-----~~~gvpi~v~~~ 212 (831)
T TIGR02412 141 LKANFKFSVKAPEDWTVISNSRETDV---TPEPADRRWEFPETPKLSTYLTAVAAGPYHSVQD-----ESRSYPLGIYAR 212 (831)
T ss_pred CceeEEEEEEECCCceEECCCccccc---cccCCCeEEEecCCCCcccceEEEEEeceEEEee-----cCCCEEEEEEEC
Confidence 69999999999999999999998763 2245677899999999999999999999999886 457899999999
Q ss_pred CCccch--HHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc--------ceeeEcCC-----CCChhhHHHHHHHHH
Q psy3708 81 RDVIDQ--VDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL--------SALLFDDV-----RSSIHNEYSIANTIA 145 (240)
Q Consensus 81 ~~~~~~--~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~--------~~~~~~~~-----~~~~~~~~~~~~~ia 145 (240)
|+.... .+++++.++++|++|+++||+|||++|+++|++|++ +.+.|.+. ..+...+..+..+|+
T Consensus 213 ~~~~~~~~~~~al~~~~~~l~~~e~~fg~pYP~~k~d~V~vP~f~~GaMEn~Glit~~e~~l~~~~~~~~~~~~~~~via 292 (831)
T TIGR02412 213 RSLAQYLDADAIFTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAMENAGCVTFAENFLHRAEATRAEKENRAGVIL 292 (831)
T ss_pred cchhhhhhHHHHHHHHHHHHHHHHHHhCCCCCcccCCEEEcCCCCCCcccccceeeechhhccCCcCCHHHHHHHHHHHH
Confidence 987653 568999999999999999999999999999999987 33444332 122345567788999
Q ss_pred HHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhcccccchhhhhHhhcc
Q psy3708 146 HELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILFDSQVDFAAEAG 198 (240)
Q Consensus 146 HEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~~~~~~~~~~~~ 198 (240)
||+|||||||+||++||+|+|||||||+|+++++.++.+|.|.....|.....
T Consensus 293 HElAHqWFGnlVT~~wW~dlWLnEGFAty~e~~~~~~~~~~~~~~~~f~~~~~ 345 (831)
T TIGR02412 293 HEMAHMWFGDLVTMRWWNDLWLNESFAEYMGTLASAEATEYTDAWTTFAAQGK 345 (831)
T ss_pred HHHHHHHhCCEeccccccchhHHHHHHHHHHHHHHHhcCCcchHHHHHHHHHH
Confidence 99999999999999999999999999999999999999888887777755443
No 4
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=100.00 E-value=4e-40 Score=308.33 Aligned_cols=181 Identities=24% Similarity=0.345 Sum_probs=154.5
Q ss_pred CceEEEEEEEecCC-c-EEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEE
Q psy3708 1 MKSTFTISLGHPKK-L-TSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTW 78 (240)
Q Consensus 1 ~Katf~l~i~~p~~-~-~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~ 78 (240)
+||+|+++|++|++ | +++|||+++.+ ....+|+++++|+.++|||+|++||++|+|..++....+.+..++++++|
T Consensus 129 ~kAtf~vtI~~p~~~y~v~lSNg~~~~~--~~~~~g~~~~~f~~t~pmptYLfA~vaGdf~~~~~~~~t~sg~~v~l~iy 206 (863)
T TIGR02414 129 VMSRYTVTITADKKKYPVLLSNGNKIAS--GELPDGRHWAEWEDPFPKPSYLFALVAGDLDVLEDTFTTKSGREVALRVY 206 (863)
T ss_pred CceEEEEEEEECCCcceEEEeCCccccc--eecCCCeEEEEEeCCCCcChhHheEEEeCCEEEEEEeeccCCCceEEEEE
Confidence 59999999999996 7 67899988763 34468899999999999999999999999999875332234457899999
Q ss_pred EeCCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc---------------ceeeEcCCCCChhhHHHHHHH
Q psy3708 79 ARRDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL---------------SALLFDDVRSSIHNEYSIANT 143 (240)
Q Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~---------------~~~~~~~~~~~~~~~~~~~~~ 143 (240)
++|+....++++++.++++|++||++||.|||++++++|++|++ ..++.++...+...+..+..+
T Consensus 207 ~~p~~~~~~~~al~~~~~~L~~~E~~fG~pYPl~k~diVavpdf~~GaMEN~GLi~f~e~~lL~~~~~~td~~~~~i~~V 286 (863)
T TIGR02414 207 VEEGNKDKCDHAMESLKKAMKWDEEVFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESV 286 (863)
T ss_pred EccCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhccEEecCCCCCccccccceeccccceEEeCCCCCCHHHHHHHHHH
Confidence 99998888999999999999999999999999999999999986 223445544455566778899
Q ss_pred HHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhc
Q psy3708 144 IAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDV 183 (240)
Q Consensus 144 iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~ 183 (240)
|+||+|||||||+||++||.++|||||||+|++..+.+..
T Consensus 287 IaHElaHqWfGNlVT~~~W~~LWLnEGfAty~e~~~~~~~ 326 (863)
T TIGR02414 287 IAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSADM 326 (863)
T ss_pred HHHHHHHHHhcceeeecchhhhhhhhhHHHHHHHHHHHHh
Confidence 9999999999999999999999999999999997665544
No 5
>PRK14015 pepN aminopeptidase N; Provisional
Probab=100.00 E-value=5.7e-40 Score=307.99 Aligned_cols=181 Identities=23% Similarity=0.351 Sum_probs=153.1
Q ss_pred CceEEEEEEEecC-Cc-EEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEE
Q psy3708 1 MKSTFTISLGHPK-KL-TSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTW 78 (240)
Q Consensus 1 ~Katf~l~i~~p~-~~-~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~ 78 (240)
+||+|+++|++|+ .| +++|||+++++ ....+|+++++|..++|||+|++||++|+|..++......+.+++++++|
T Consensus 142 ~KAtf~itI~~p~~~~~~~lSNG~l~~~--~~~~~g~~~~~w~~~~PmpsYL~Al~aGdf~~~~d~~~~~~g~~vpl~iy 219 (875)
T PRK14015 142 VLARYTVRIEADKAKYPVLLSNGNLVES--GELPDGRHWATWEDPFPKPSYLFALVAGDLDVLEDTFTTRSGREVALEIY 219 (875)
T ss_pred CCeeEEEEEEEccccCeEEecCCccccc--eeccCCeEEEEEEeCCCcccceEEEEEeCCEEEEEEeeccCCCeEEEEEE
Confidence 5999999999999 48 68999998873 33467899999999999999999999999999875322223456999999
Q ss_pred EeCCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc--------c-------eeeEcCCCCChhhHHHHHHH
Q psy3708 79 ARRDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL--------S-------ALLFDDVRSSIHNEYSIANT 143 (240)
Q Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~--------~-------~~~~~~~~~~~~~~~~~~~~ 143 (240)
++|+....++++++.++++|++||++||.|||++++++|++|++ + .++.++...+......+..+
T Consensus 220 ~~p~~~~~~~~al~~~~~~L~~~E~~FG~pYP~~k~diVavp~f~~GaMEN~Gl~~f~~~~lL~~~~~~t~~~~~~i~~v 299 (875)
T PRK14015 220 VEPGNLDKCDHAMDSLKKSMKWDEERFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESV 299 (875)
T ss_pred EeCCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhhCEEeCCCCCCcccccccccccccceEecCcccCCHHHHHHHHHH
Confidence 99999888999999999999999999999999999999999986 2 23333333444555667889
Q ss_pred HHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhc
Q psy3708 144 IAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDV 183 (240)
Q Consensus 144 iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~ 183 (240)
|+||+|||||||+||++||.|+|||||||+|++..+.+..
T Consensus 300 IaHElaHqWFGNlVT~~~W~dLWLnEGFAty~e~~~~~~~ 339 (875)
T PRK14015 300 IAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSADL 339 (875)
T ss_pred HHHHHHHHHHhCcceecchhhhhhhhHHHHHHHHHHHHHh
Confidence 9999999999999999999999999999999987765554
No 6
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=100.00 E-value=1.6e-37 Score=293.15 Aligned_cols=195 Identities=35% Similarity=0.591 Sum_probs=172.0
Q ss_pred CceEEEEEEEecCCcEEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEEEe
Q psy3708 1 MKSTFTISLGHPKKLTSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTWAR 80 (240)
Q Consensus 1 ~Katf~l~i~~p~~~~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~~~ 80 (240)
.||+|+++|+.++++.++|||+.+. .....+|+++++|..++|||+|++|+++|+|..++......+ +++++.+|++
T Consensus 157 ~katf~~~i~~~k~~~~iSN~~~~~--~~~~~~g~~~~~f~~~~~mptYL~al~~G~~~~~~~~~~~~~-~~v~l~iy~~ 233 (859)
T COG0308 157 VKATFTLTIRADKGPKLISNGNLID--GGTLVDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRS-RDVPLEIYVP 233 (859)
T ss_pred CcceeEEEEEecCcceeeecCCccc--cccccCCcEEEEEcCCCCcchHhhheeeecceeeeeeeccCC-CCeeEEEEec
Confidence 5899999999999999999999998 345567799999999999999999999999988887432222 5899999999
Q ss_pred CCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc---------------ceeeEcCCCCChhhHHHHHHHHH
Q psy3708 81 RDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL---------------SALLFDDVRSSIHNEYSIANTIA 145 (240)
Q Consensus 81 ~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~---------------~~~~~~~~~~~~~~~~~~~~~ia 145 (240)
++....+++.++.+++++++|+++||.|||+++ ++|.+|+| ..++.++........+++..+|+
T Consensus 234 ~g~~~~a~~~~~~~~~~~~~~e~~fg~~y~l~~-~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~via 312 (859)
T COG0308 234 PGVLDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIA 312 (859)
T ss_pred CcchhhhhhhHHHHHHHhhhHHHhcCCCCCCcc-cEEeccCCCCccccccceeEEeeeEEeeCcccchhHHHHHHHHHHH
Confidence 988889999999999999999999999999999 99999988 23455555666677788899999
Q ss_pred HHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhccc-ccchhhhhHhhcch
Q psy3708 146 HELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHI-LFDSQVDFAAEAGP 199 (240)
Q Consensus 146 HEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~-~~~~~~~~~~~~~~ 199 (240)
||+|||||||+||++||.++|||||||+|+++.+.+.+++ +|...+.|......
T Consensus 313 HElaHqWfGnlVT~~~W~~lWLnEgfat~~e~~~~~~~~~~~~~~~~~~~~~~~~ 367 (859)
T COG0308 313 HELAHQWFGNLVTMKWWDDLWLNEGFATFREVLWSEDLGGRAWKRWEDFRTLRTS 367 (859)
T ss_pred HHHhhhcccceeeccCHHHHHHhhhhHHHHHHHHHHHhcchHHHHHHHHHHHhhh
Confidence 9999999999999999999999999999999999999998 78887777666554
No 7
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=100.00 E-value=5.4e-37 Score=279.63 Aligned_cols=171 Identities=25% Similarity=0.424 Sum_probs=139.8
Q ss_pred CceEEEEEEEecCCcEEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEEEe
Q psy3708 1 MKSTFTISLGHPKKLTSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTWAR 80 (240)
Q Consensus 1 ~Katf~l~i~~p~~~~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~~~ 80 (240)
+||+|+++|++| +.+++||....+ . .++..+++|..++|||+|++||++|+|..... |.++++|++
T Consensus 147 ~Katf~~~I~~P--~~av~sg~~~~~--~--~~~~~~~~F~~t~pmptYLia~avG~~~~~~~--------g~~~~v~~~ 212 (601)
T TIGR02411 147 VKSTYTAEVESP--LPVLMSGIPDGE--T--SNDPGKYLFKQKVPIPAYLIALASGDLASAPI--------GPRSSVYSE 212 (601)
T ss_pred cceEEEEEEeeC--cceeccCCcccc--c--cCCCceEEEEeCCCcchhhheeeeccceeccc--------CCceEEEcc
Confidence 599999999999 889888766552 1 23456789999999999999999999986543 456899999
Q ss_pred CCccchHHHHHH-HHHHHHHHHHHHhCCCCCCCCCCeEEc-CCc-------ceeeEcCCCCChhhHHHHHHHHHHHhhhh
Q psy3708 81 RDVIDQVDFAAE-AGPEFLNFFERYFNIRYPLPKQDMVAI-PSL-------SALLFDDVRSSIHNEYSIANTIAHELAHQ 151 (240)
Q Consensus 81 ~~~~~~~~~~~~-~~~~~l~~~e~~~g~~yp~~~l~~v~~-p~~-------~~~~~~~~~~~~~~~~~~~~~iaHEiaHq 151 (240)
|+....+++.+. .++++|+++|+++| |||++|+|+|++ |++ .++.+.......+++ ....+||||+|||
T Consensus 213 p~~~~~~~~~~~~~~~~~l~~~e~~~~-pYp~~k~d~vvlpp~f~~GgMEN~~ltf~~~~ll~~d~-s~~~viaHElAHq 290 (601)
T TIGR02411 213 PEQLEKCQYEFEHDTENFIKTAEDLIF-PYEWGQYDLLVLPPSFPYGGMENPNLTFATPTLIAGDR-SNVDVIAHELAHS 290 (601)
T ss_pred chhHHHHHHHHHHhHHHHHHHHHHhCC-CCcCccceEEEecCcccccccccccceeeccccccCCh-hhhhhHHHHHHhh
Confidence 988877777777 89999999999877 999999999987 555 233343322222222 2357999999999
Q ss_pred hhCCCCCccccchhhhhhhHHHHHHHHhhhhccccc
Q psy3708 152 WFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILF 187 (240)
Q Consensus 152 WfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~ 187 (240)
||||+||++||+|+|||||||+|++++++++++|++
T Consensus 291 WfGNlVT~~~W~d~WLnEGfaty~e~~~~~~~~~e~ 326 (601)
T TIGR02411 291 WSGNLVTNCSWEHFWLNEGWTVYLERRIVGRLYGEK 326 (601)
T ss_pred ccCceeecCCchHHHHHhhHHHHHHHHHHHHhcCcH
Confidence 999999999999999999999999999999998875
No 8
>KOG1047|consensus
Probab=99.97 E-value=1.3e-30 Score=225.55 Aligned_cols=203 Identities=28% Similarity=0.443 Sum_probs=161.1
Q ss_pred ceEEEEEEEecCCcEEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEEEeC
Q psy3708 2 KSTFTISLGHPKKLTSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTWARR 81 (240)
Q Consensus 2 Katf~l~i~~p~~~~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~~~~ 81 (240)
|.||+..|.+|.++.+++++-... ..+..+|+...+|....|+|+|++||++|+....+. |.+-+||+.|
T Consensus 153 K~ty~a~v~vp~~l~a~mSai~~~--~~~~~~~~~~f~f~q~~pIP~YLiai~~G~L~s~eI--------gpRs~VwaEp 222 (613)
T KOG1047|consen 153 KSTYTAEVEVPMGLTALMSAIPAG--EKPGSNGRAIFRFKQEVPIPSYLIAIAVGDLESREI--------GPRSRVWAEP 222 (613)
T ss_pred eeEEEEEEEcCCcceeeeeccccc--cCCCCCCcceEEEEeccCchhhhHHHhhcccccccc--------CCccceecch
Confidence 789999999999999999987765 356677899999999999999999999999988777 3558999999
Q ss_pred CccchHHHHHH-HHHHHHHHHHHHhCCCCCCCCCCeEEcC-Cc-------ceeeEcCCCCChhhHHHHHHHHHHHhhhhh
Q psy3708 82 DVIDQVDFAAE-AGPEFLNFFERYFNIRYPLPKQDMVAIP-SL-------SALLFDDVRSSIHNEYSIANTIAHELAHQW 152 (240)
Q Consensus 82 ~~~~~~~~~~~-~~~~~l~~~e~~~g~~yp~~~l~~v~~p-~~-------~~~~~~~~~~~~~~~~~~~~~iaHEiaHqW 152 (240)
.....+++-+. .+.++|+.-|+.+| ||++.++|++++| +| +.+.|..-.-..+++ ....+|||||||-|
T Consensus 223 ~~~~a~~~ef~~~~e~~L~~Ae~l~G-pY~WgryDllvlPpSFP~gGMENPcltF~TpTllaGDr-sl~~vIaHEIAHSW 300 (613)
T KOG1047|consen 223 CLLDACQEEFAGETEDFLKAAEKLFG-PYVWGRYDLLVLPPSFPFGGMENPCLTFVTPTLLAGDR-SLVDVIAHEIAHSW 300 (613)
T ss_pred hhhHHHHHHHHhhhHHHHHHHHHHcC-CcccccceEEEecCCCCcccccCcceeeecchhhcCCc-chhhHHHHHhhhhh
Confidence 98877766665 89999999999999 9999999999997 45 333333222222222 34679999999999
Q ss_pred hCCCCCccccchhhhhhhHHHHHHHHhhhhcccccchhhhhHhhcchhhHHHHHHHhccCCCCCCc
Q psy3708 153 FGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILFDSQVDFAAEAGPEFLNFFERYFNIRYPLPKQ 218 (240)
Q Consensus 153 fG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~p~~k~ 218 (240)
+||+||...|++.||||||++|+|.+++..++++. ..+|..-.+-.-++...+.++...+++|.
T Consensus 301 tGNlVTN~sWehfWLNEGfTvylErrI~g~~~g~~--~~~f~a~~gw~~L~~~~d~~g~~~~~tkL 364 (613)
T KOG1047|consen 301 TGNLVTNASWEHFWLNEGFTVYLERRIVGRLYGEA--YRQFEALIGWRELRPSMDLFGETSEFTKL 364 (613)
T ss_pred cccccccCccchhhhcccchhhhhhhhhhhhcchh--HHHHHHhcChhhhhhHHHhcCCCcccchh
Confidence 99999999999999999999999999999887632 22333333323345555667777777665
No 9
>KOG1932|consensus
Probab=99.88 E-value=4.6e-22 Score=183.67 Aligned_cols=186 Identities=19% Similarity=0.311 Sum_probs=157.5
Q ss_pred ceEEEEEEEecCCcEEEecCCccccccccCCCCeEEEEEecCCCccceeEEeEEeceeEeecCCCCCCCCCceEEEEEeC
Q psy3708 2 KSTFTISLGHPKKLTSISNMPLERSEDISTKPGWVLDVFQKSVPMSTYLVAFIISDFTFRQSNTSNPDLSRVQFRTWARR 81 (240)
Q Consensus 2 Katf~l~i~~p~~~~~~sng~~~~~~~~~~~~~~~~~~f~~t~p~~~y~~a~~vg~f~~~~~~~~~~~~~~~~i~v~~~~ 81 (240)
++|+++.+++|++++++++|.+.+....+ +.+.++.+|..+.|+++..++||+|.|..... ..+++|..+|.|
T Consensus 198 ~~tWeLeftvp~~~~av~~geLl~~v~~~-D~~Kkt~~ys~tvPvA~~~I~~AiG~F~~~~~------P~~~~i~~f~LP 270 (1180)
T KOG1932|consen 198 RCTWELEFTVPKNLVAVSCGELLEQVETP-DLRKKTYHYSLTVPVAPSNIGFAIGPFKSYVE------PSMIDITHFCLP 270 (1180)
T ss_pred cceEEEEEEecccceeccchhhhheeecc-cccccEEEEEEeccCCccccceeeccccccCC------CccCcceeEecC
Confidence 57999999999999999999999854333 55678999999999999999999999998854 347889999999
Q ss_pred CccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc--------------ceeeEcCCCCChhhHHHHHHHHHHH
Q psy3708 82 DVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL--------------SALLFDDVRSSIHNEYSIANTIAHE 147 (240)
Q Consensus 82 ~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~--------------~~~~~~~~~~~~~~~~~~~~~iaHE 147 (240)
+.+.-.+...-...++++|||+.+|..|||+.+.+|++|.- ++++|+.... .+.......+|--
T Consensus 271 ~~~~~v~nt~~~l~k~iefye~~ls~rYPF~~~k~VFvd~~~~~i~~~asl~I~st~lLy~~~iI--Dq~~~tr~~La~a 348 (1180)
T KOG1932|consen 271 GLEPLVKNTTVYLHKAIEFYEEELSSRYPFSCYKTVFVDEAAVEISSYASLSIFSTSLLYSKNII--DQTFLTRRKLAWA 348 (1180)
T ss_pred cchHHhhhHHHHHHHHHHHHHHHhccCCCcceeeEEEecCCcceeeecceeeeeeccccchHhhh--hHHHHHHHHHHHH
Confidence 99999999999999999999999998899999999999864 4455655443 3334445789999
Q ss_pred hhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhcccccchhhhhHhhcchh
Q psy3708 148 LAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILFDSQVDFAAEAGPE 200 (240)
Q Consensus 148 iaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (240)
+|.||||-.+++..|+|.||-+|+|.|+..+++++..++ ++|..+...+
T Consensus 349 LA~Q~fg~yIsp~~wsD~Wl~~GiagYl~~l~~kk~lGN----NEyry~lKk~ 397 (1180)
T KOG1932|consen 349 LASQWFGVYISPVDWSDFWLLKGIAGYLTGLFVKKFLGN----NEYRYQLKKA 397 (1180)
T ss_pred HHHhhhEEEeeccchhhhHHHHhHHHHHHHHHHHHHhCc----hHHHHHHHHH
Confidence 999999999999999999999999999999999998553 4454444433
No 10
>KOG1046|consensus
Probab=99.08 E-value=6.9e-11 Score=112.94 Aligned_cols=56 Identities=61% Similarity=1.032 Sum_probs=51.9
Q ss_pred cccchhhhhHhhcchhhHHHHHHHhccCCCCCCccEEEeCCCCCcccccccccccC
Q psy3708 185 ILFDSQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPDFSAGAMENWGLITYR 240 (240)
Q Consensus 185 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~p~~k~d~~~~~~~~~~~~e~~~~~~~~ 240 (240)
|+-.....|+.+...+.+.+++++|++.||++|.|++++|||+.+|||||||+|||
T Consensus 250 p~~~~~~~~al~~~~~~L~~~e~~f~i~yPLpK~D~iavPdf~~GAMENwGLvtyr 305 (882)
T KOG1046|consen 250 PEKINQGQFALEVATKVLEFYEDYFGIPYPLPKLDLVAVPDFSAGAMENWGLVTYR 305 (882)
T ss_pred hHHhhHHHHHHHHHHHHHHHHHHHhCCCCCCccccEEecCCccccchhcCcceeee
Confidence 45556668899999999999999999999999999999999999999999999996
No 11
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=98.56 E-value=4.5e-08 Score=71.79 Aligned_cols=41 Identities=41% Similarity=0.724 Sum_probs=35.5
Q ss_pred HHHHHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHH
Q psy3708 138 YSIANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQ 178 (240)
Q Consensus 138 ~~~~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~ 178 (240)
.....+++||++|+|+++.++.......|++||+|+|++..
T Consensus 23 ~~~~~~l~HE~~H~~~~~~~~~~~~~~~W~~EG~A~y~~~~ 63 (128)
T PF13485_consen 23 DWLDRVLAHELAHQWFGNYFGGDDNAPRWFNEGLAEYVEGR 63 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHcCCCccCchHHHHHHHHHHhcC
Confidence 34557999999999999998876777899999999999943
No 12
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=98.40 E-value=1.8e-07 Score=89.25 Aligned_cols=50 Identities=36% Similarity=0.509 Sum_probs=46.8
Q ss_pred hhhHhhcchhhHHHHHHHhccCCCCCCccEEEeCCCCCcccccccccccC
Q psy3708 191 VDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPDFSAGAMENWGLITYR 240 (240)
Q Consensus 191 ~~~~~~~~~~~l~~~~~~~~~~~p~~k~d~~~~~~~~~~~~e~~~~~~~~ 240 (240)
..+..+..++.+++++++|+.+||++|.|++++|+++.+|||||||++|+
T Consensus 215 ~~~al~~~~~~L~~~E~~fG~pYPl~k~diVavpdf~~GaMEN~GLi~f~ 264 (863)
T TIGR02414 215 CDHAMESLKKAMKWDEEVFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFN 264 (863)
T ss_pred HHHHHHHHHHHHHHHHHHhCCCCChhhccEEecCCCCCccccccceeccc
Confidence 35778888999999999999999999999999999999999999999986
No 13
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=98.39 E-value=2.3e-07 Score=88.79 Aligned_cols=49 Identities=41% Similarity=0.668 Sum_probs=45.7
Q ss_pred hhHhhcchhhHHHHHHHhccCCCCCCccEEEeCCCCCcccccccccccC
Q psy3708 192 DFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPDFSAGAMENWGLITYR 240 (240)
Q Consensus 192 ~~~~~~~~~~l~~~~~~~~~~~p~~k~d~~~~~~~~~~~~e~~~~~~~~ 240 (240)
.+..+...+.+++++++|+++||++|.|++++|++..+||||||+++|+
T Consensus 222 ~~al~~~~~~l~~~e~~fg~pYP~~k~d~V~vP~f~~GaMEn~Glit~~ 270 (831)
T TIGR02412 222 DAIFTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAMENAGCVTFA 270 (831)
T ss_pred HHHHHHHHHHHHHHHHHhCCCCCcccCCEEEcCCCCCCcccccceeeec
Confidence 4677888889999999999999999999999999999999999999985
No 14
>PRK14015 pepN aminopeptidase N; Provisional
Probab=98.20 E-value=1e-06 Score=84.32 Aligned_cols=49 Identities=35% Similarity=0.505 Sum_probs=45.7
Q ss_pred hhHhhcchhhHHHHHHHhccCCCCCCccEEEeCCCCCcccccccccccC
Q psy3708 192 DFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPDFSAGAMENWGLITYR 240 (240)
Q Consensus 192 ~~~~~~~~~~l~~~~~~~~~~~p~~k~d~~~~~~~~~~~~e~~~~~~~~ 240 (240)
.+..+..++.+++++++|+..||++|.|++.+|+++.+|||||||++|+
T Consensus 229 ~~al~~~~~~L~~~E~~FG~pYP~~k~diVavp~f~~GaMEN~Gl~~f~ 277 (875)
T PRK14015 229 DHAMDSLKKSMKWDEERFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFN 277 (875)
T ss_pred HHHHHHHHHHHHHHHHHhCCCCChhhhCEEeCCCCCCcccccccccccc
Confidence 4667788899999999999999999999999999999999999999985
No 15
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=98.18 E-value=9.6e-07 Score=84.76 Aligned_cols=51 Identities=51% Similarity=1.064 Sum_probs=46.8
Q ss_pred hhhhhHhhcchhhHHHHHHHhccCCCCCCccEEEeCCCCCcccccccccccC
Q psy3708 189 SQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPDFSAGAMENWGLITYR 240 (240)
Q Consensus 189 ~~~~~~~~~~~~~l~~~~~~~~~~~p~~k~d~~~~~~~~~~~~e~~~~~~~~ 240 (240)
....|..+...+.+++++++|+..||+++ +.|++|+|+.+|||||||+||+
T Consensus 238 ~~a~~~~~~~~~~~~~~e~~fg~~y~l~~-~~V~v~~f~~GaMEN~Gl~tf~ 288 (859)
T COG0308 238 DRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGAMENWGLVTFR 288 (859)
T ss_pred hhhhhhHHHHHHHhhhHHHhcCCCCCCcc-cEEeccCCCCccccccceeEEe
Confidence 33467888888999999999999999999 9999999999999999999986
No 16
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=98.08 E-value=2.5e-06 Score=74.87 Aligned_cols=51 Identities=57% Similarity=1.138 Sum_probs=40.9
Q ss_pred hhhhHhhcchhhHHHHHHHhccCCCCCCccEEEeCCCCCcccccccccccC
Q psy3708 190 QVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPDFSAGAMENWGLITYR 240 (240)
Q Consensus 190 ~~~~~~~~~~~~l~~~~~~~~~~~p~~k~d~~~~~~~~~~~~e~~~~~~~~ 240 (240)
...+..+.....+++++++++.+||+.|.|.+++|++..+||||||+++|+
T Consensus 226 ~~~~~~~~~~~~l~~~~~~~g~~yp~~k~~~v~~p~~~~~~me~~g~i~~~ 276 (390)
T PF01433_consen 226 QLQFALDIAPKALEYYEEYFGIPYPFKKLDIVAVPDFPFGGMENWGLITYR 276 (390)
T ss_dssp GHHHHHHHHHHHHHHHHHHHTS--SSSEEEEEEEST-SSSEE--TTEEEEE
T ss_pred HHHHHHHhhHHHHHHHHhhccccceecceeEEEEecccccccccccccccc
Confidence 334677788889999999999999999999999999999999999999873
No 17
>COG3975 Predicted protease with the C-terminal PDZ domain [General function prediction only]
Probab=97.87 E-value=0.00011 Score=65.18 Aligned_cols=110 Identities=16% Similarity=0.198 Sum_probs=75.8
Q ss_pred CCceEEEEEeCCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCC---c--------ceeeEcCCCCChhh-HH
Q psy3708 71 SRVQFRTWARRDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPS---L--------SALLFDDVRSSIHN-EY 138 (240)
Q Consensus 71 ~~~~i~v~~~~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~---~--------~~~~~~~~~~~~~~-~~ 138 (240)
.+..+.++..-. ..+.+...+..++++..-.+.|| +-||.++.++..-. + +.+.+...-.+..+ .+
T Consensus 168 ~ph~~~~~g~~p-~~d~~~~~~~~k~ii~~~~~vFg-~~~~~~Y~Fl~~~s~q~~GGlEH~~St~l~~~r~~~~~~~ky~ 245 (558)
T COG3975 168 APHTIALRGELP-NFDKERLASDTKKIIEAEIKVFG-SAPFDKYVFLLHLSDQIYGGLEHRRSTALIYDRFGFTDQDKYQ 245 (558)
T ss_pred CceeEEEeeccc-cccHHHHHHHHHHHHHHHHHHhc-CCCccceEEEEEecCCCCCCceeccccccccccccccchhHHH
Confidence 344455554321 34667788888999998899999 78999987776421 1 33444432222222 35
Q ss_pred HHHHHHHHHhhhhhhCCCCCccccc-----------hhhhhhhHHHHHHHHhhhh
Q psy3708 139 SIANTIAHELAHQWFGNLVTMKWWT-----------DLWLNEGFATYMAAQALND 182 (240)
Q Consensus 139 ~~~~~iaHEiaHqWfG~~v~~~~w~-----------d~WL~EG~a~y~~~~~~~~ 182 (240)
....+++||..|-|-+-.+.+.... -+|+.|||++|...++.-.
T Consensus 246 ~~l~llsHEyfH~WNvKrIrpa~l~p~~~d~en~t~~lW~~EG~T~Yy~~ll~lR 300 (558)
T COG3975 246 DLLGLLSHEYFHAWNVKRIRPAALEPFNLDKENYTPLLWFSEGFTSYYDRLLALR 300 (558)
T ss_pred HHHHHHHHHHHHhccceeccccccCCccccccCCCcceeeecCchHHHHHHHHHH
Confidence 6678999999999998888776443 4799999999999776544
No 18
>PF07607 DUF1570: Protein of unknown function (DUF1570); InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=97.16 E-value=0.0002 Score=52.64 Aligned_cols=38 Identities=39% Similarity=0.593 Sum_probs=28.3
Q ss_pred HHHHHHhhhhhhCCCCCc---cccchhhhhhhHHHHHHHHhh
Q psy3708 142 NTIAHELAHQWFGNLVTM---KWWTDLWLNEGFATYMAAQAL 180 (240)
Q Consensus 142 ~~iaHEiaHqWfG~~v~~---~~w~d~WL~EG~a~y~~~~~~ 180 (240)
.+++||.+||=..|.=-. ..| =.|+.||||+|+|-.-.
T Consensus 3 ~T~~HEa~HQl~~N~Gl~~r~~~~-P~Wv~EGlA~yFE~~~~ 43 (128)
T PF07607_consen 3 ATIAHEATHQLAFNTGLHPRLADW-PRWVSEGLATYFETPGM 43 (128)
T ss_pred hHHHHHHHHHHHHHccccccCCCC-chHHHHhHHHHcCCCcc
Confidence 489999999997763222 222 26999999999997655
No 19
>PF05299 Peptidase_M61: M61 glycyl aminopeptidase; InterPro: IPR007963 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M61 (glycyl aminopeptidase family, clan MA(E)).The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example is glycyl aminopeptidase from Sphingomonas capsulata.
Probab=97.02 E-value=0.00046 Score=50.29 Aligned_cols=44 Identities=25% Similarity=0.457 Sum_probs=35.1
Q ss_pred HHHHHHHHhhhhhhCCCCCcccc-----------chhhhhhhHHHHHHHHhhhhc
Q psy3708 140 IANTIAHELAHQWFGNLVTMKWW-----------TDLWLNEGFATYMAAQALNDV 183 (240)
Q Consensus 140 ~~~~iaHEiaHqWfG~~v~~~~w-----------~d~WL~EG~a~y~~~~~~~~~ 183 (240)
...+++||..|.|-+-.+.|..- +.+|+-||+++|++.+++-+.
T Consensus 4 ~l~l~sHEffH~WnvkrirP~~l~p~dy~~~~~t~~LWv~EG~T~Y~~~l~l~Ra 58 (122)
T PF05299_consen 4 FLGLLSHEFFHSWNVKRIRPAELGPFDYEKPNYTELLWVYEGFTSYYGDLLLVRA 58 (122)
T ss_pred hhhhhhhhccccccceEeccccccCCCCCCCCCCCCEeeeeCcHHHHHHHHHHHc
Confidence 34699999999999776666543 367999999999998876654
No 20
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=97.02 E-value=0.00062 Score=63.20 Aligned_cols=41 Identities=29% Similarity=0.503 Sum_probs=33.6
Q ss_pred cchhhHHHHHHHhccCCCCCCccEEEe-CCCCCccccccccccc
Q psy3708 197 AGPEFLNFFERYFNIRYPLPKQDMVAI-PDFSAGAMENWGLITY 239 (240)
Q Consensus 197 ~~~~~l~~~~~~~~~~~p~~k~d~~~~-~~~~~~~~e~~~~~~~ 239 (240)
...++++.++++ ..+||.+|.|++.+ |++..+||||||| +|
T Consensus 226 ~~~~~l~~~e~~-~~pYp~~k~d~vvlpp~f~~GgMEN~~l-tf 267 (601)
T TIGR02411 226 DTENFIKTAEDL-IFPYEWGQYDLLVLPPSFPYGGMENPNL-TF 267 (601)
T ss_pred hHHHHHHHHHHh-CCCCcCccceEEEecCcccccccccccc-ee
Confidence 445677777765 78999999999877 7899999999996 44
No 21
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=96.67 E-value=0.01 Score=51.26 Aligned_cols=49 Identities=24% Similarity=0.289 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHhhhhh-hC-CCCCcc--ccchhhhhhhHHHHHHHHhhhhccc
Q psy3708 137 EYSIANTIAHELAHQW-FG-NLVTMK--WWTDLWLNEGFATYMAAQALNDVHI 185 (240)
Q Consensus 137 ~~~~~~~iaHEiaHqW-fG-~~v~~~--~w~d~WL~EG~a~y~~~~~~~~~~~ 185 (240)
...+..+||||+-|.= |. +.|... .-.|.|||||+|.-+|.++..+..+
T Consensus 136 ~~~~~sTlAHEfQHmInfy~~~v~~g~~~~~dtWLnE~lS~~aEdl~s~~~~~ 188 (366)
T PF10460_consen 136 PDTVYSTLAHEFQHMINFYQRGVLHGKQYAMDTWLNEMLSMSAEDLYSSKIDP 188 (366)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccHHHHHHHHHHHHHHhcCCCc
Confidence 3457789999999964 32 233332 2369999999999999998887754
No 22
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=96.17 E-value=0.0061 Score=48.71 Aligned_cols=40 Identities=28% Similarity=0.291 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHh
Q psy3708 137 EYSIANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQA 179 (240)
Q Consensus 137 ~~~~~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~ 179 (240)
+.++..+|.||++|-|=.+--+ -.-.||.||+|.|+.+.+
T Consensus 93 ~~Ei~Gvl~HE~~H~~Q~~~~~---~~P~~liEGIADyVRl~a 132 (205)
T PF04450_consen 93 RDEIIGVLYHEMVHCWQWDGRG---TAPGGLIEGIADYVRLKA 132 (205)
T ss_pred HHHHHHHHHHHHHHHhhcCCCC---CCChhheecHHHHHHHHc
Confidence 3577899999999955433211 224599999999998773
No 23
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=95.62 E-value=0.067 Score=42.46 Aligned_cols=49 Identities=22% Similarity=0.220 Sum_probs=35.1
Q ss_pred hHHHHHHHHHHHhhhhhhCCCCC----ccccchhhhhhhHHHHHHHHhhhhcc
Q psy3708 136 NEYSIANTIAHELAHQWFGNLVT----MKWWTDLWLNEGFATYMAAQALNDVH 184 (240)
Q Consensus 136 ~~~~~~~~iaHEiaHqWfG~~v~----~~~w~d~WL~EG~a~y~~~~~~~~~~ 184 (240)
....+..++|||.-|.+--..+. ...--|.-+.||+|++++.....+..
T Consensus 61 ~~~~l~~~iaHE~hH~~r~~~~~~~~~~~TLld~~I~EGlAe~f~~~~~g~~~ 113 (195)
T PF10026_consen 61 SLEELPALIAHEYHHNCRYEQIGWDPEDTTLLDSLIMEGLAEYFAEELYGEEY 113 (195)
T ss_pred cHHHHHHHHHHHHHHHHHHhccCCCCCCCCHHHHHHHhhHHHHHHHHHcCCCC
Confidence 33477789999999986443332 22344678899999999988776654
No 24
>PRK04860 hypothetical protein; Provisional
Probab=94.63 E-value=0.074 Score=40.76 Aligned_cols=65 Identities=22% Similarity=0.220 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcC--Cc------ceeeEcCCCCChhhHHHHHHHHHHHhhhhhh
Q psy3708 89 FAAEAGPEFLNFFERYFNIRYPLPKQDMVAIP--SL------SALLFDDVRSSIHNEYSIANTIAHELAHQWF 153 (240)
Q Consensus 89 ~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p--~~------~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWf 153 (240)
.+...+...+..-+++||.|++-+++.+-.-- .. ..+-+++..-.......+..+|+||+||.|-
T Consensus 4 ~~~~~~~~~~~~a~~~f~~~f~~p~~~f~~R~rtaG~~~l~~~~I~~Np~ll~~~~~~~l~~~v~HEl~H~~~ 76 (160)
T PRK04860 4 AVMRRLRECLAQANLYFKRTFPEPKVSYTQRGTSAGTAWLQSNEIRLNPVLLLENQQAFIDEVVPHELAHLLV 76 (160)
T ss_pred HHHHHHHHHHHHHHHHhCCCCCCCEEEEeecchhhcchhHhcCCeeeCHHHHhhCcHHHHHhHHHHHHHHHHH
Confidence 44556777788888899988887654332210 00 2334444332333445677899999999874
No 25
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=94.00 E-value=0.27 Score=42.11 Aligned_cols=40 Identities=38% Similarity=0.385 Sum_probs=30.7
Q ss_pred HHHHHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhc
Q psy3708 138 YSIANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDV 183 (240)
Q Consensus 138 ~~~~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~ 183 (240)
..++.+|-||+|||=+. | -.|.=+||+||++.+....+..
T Consensus 163 ~~LA~LIfHELaHq~~Y--v----~~dt~FNEsfAtfVe~~G~~~w 202 (337)
T PF10023_consen 163 GELARLIFHELAHQTLY--V----KGDTAFNESFATFVEREGARRW 202 (337)
T ss_pred hHHHHHHHHHHhhceee--c----CCCchhhHHHHHHHHHHHHHHH
Confidence 57789999999999442 2 2366789999999997766553
No 26
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=92.76 E-value=0.51 Score=35.79 Aligned_cols=66 Identities=17% Similarity=0.257 Sum_probs=38.9
Q ss_pred chHHHHHHHHHHHHHHHHHHhCCCCCCCC--CCeEEcC----Cc-------ceeeEcCCCC-ChhhHHHHHHHHHHHhhh
Q psy3708 85 DQVDFAAEAGPEFLNFFERYFNIRYPLPK--QDMVAIP----SL-------SALLFDDVRS-SIHNEYSIANTIAHELAH 150 (240)
Q Consensus 85 ~~~~~~~~~~~~~l~~~e~~~g~~yp~~~--l~~v~~p----~~-------~~~~~~~~~~-~~~~~~~~~~~iaHEiaH 150 (240)
..+..+...+.++.+||.++|| .-++.. ..++..- .+ ..+.|.+... .-........++|||++|
T Consensus 67 ~~~vdA~~~~~~v~d~y~~~~g-r~siD~~G~~~~~~Vhyg~~~~NAfW~g~~m~yGdG~~~~f~~~~~~lDVvaHEltH 145 (150)
T PF01447_consen 67 SAAVDAHYNAGKVYDYYKNVFG-RNSIDGNGMPIISRVHYGKNYNNAFWNGSQMVYGDGDGQIFKPFASSLDVVAHELTH 145 (150)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHS-S-STTSS-S-EEEEESESSSTT-EEE-SSSEEEE---SSSBS-GGG-HHHHHHHHHH
T ss_pred cHHHHHHHhHHHHHHHHHHHHC-CCCcCCCCcEEEEEEeCCCCccCccccCCEEEEECCCCcccccCccccceeeecccc
Confidence 3456677778999999999999 666642 3333321 12 6678876654 112222335699999999
Q ss_pred h
Q psy3708 151 Q 151 (240)
Q Consensus 151 q 151 (240)
-
T Consensus 146 G 146 (150)
T PF01447_consen 146 G 146 (150)
T ss_dssp H
T ss_pred c
Confidence 5
No 27
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=91.44 E-value=0.22 Score=41.03 Aligned_cols=39 Identities=36% Similarity=0.347 Sum_probs=30.9
Q ss_pred HHHHHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhh
Q psy3708 138 YSIANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALND 182 (240)
Q Consensus 138 ~~~~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~ 182 (240)
..+..+|-||+|||=+.- -+|.=+||+||+..|-..+++
T Consensus 195 ~~lA~LIFHELAHQk~Y~------~~DtAFNEsFAtaVEt~Gvr~ 233 (376)
T COG4324 195 TYLASLIFHELAHQKIYV------NNDTAFNESFATAVETSGVRK 233 (376)
T ss_pred HHHHHHHHHHHhhheEee------cCcchHhHHHHHHHHHHhHHH
Confidence 467889999999996642 236678999999999776665
No 28
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=89.89 E-value=0.45 Score=37.79 Aligned_cols=40 Identities=25% Similarity=0.261 Sum_probs=30.4
Q ss_pred HHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhh
Q psy3708 141 ANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALND 182 (240)
Q Consensus 141 ~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~ 182 (240)
..++|||+.|-|.-- ..-.--+..+-||+++-+++++++.
T Consensus 94 gsiLAHE~mHa~Lrl--~g~~~L~~~vEEGiCqvla~~wL~~ 133 (212)
T PF12315_consen 94 GSILAHELMHAWLRL--NGFPNLSPEVEEGICQVLAYLWLES 133 (212)
T ss_pred hhHHHHHHHHHHhcc--cCCCCCChHHHHHHHHHHHHHHHhh
Confidence 469999999999731 1111225689999999999999986
No 29
>COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism]
Probab=89.76 E-value=0.72 Score=41.11 Aligned_cols=138 Identities=14% Similarity=0.118 Sum_probs=78.4
Q ss_pred chHHHHHHHHHHHHHHHHHHhCCCCCCCC--CCeEEcCCc-----------ceeeEcCCCCChhhH-HHHHHHHHHHhhh
Q psy3708 85 DQVDFAAEAGPEFLNFFERYFNIRYPLPK--QDMVAIPSL-----------SALLFDDVRSSIHNE-YSIANTIAHELAH 150 (240)
Q Consensus 85 ~~~~~~~~~~~~~l~~~e~~~g~~yp~~~--l~~v~~p~~-----------~~~~~~~~~~~~~~~-~~~~~~iaHEiaH 150 (240)
..+..+-..+....+||.++|| .-.++. ++++..--+ ..++|.++....-.. .....++|||+.|
T Consensus 269 ~a~~dAh~~~g~vyD~yk~~fg-r~S~Dn~g~~l~s~vHyG~~ynNAfWdG~qMvyGDGDG~~f~~~S~sLDVvAHElTH 347 (507)
T COG3227 269 EAAVDAHYNAGKVYDYYKNTFG-RNSYDNNGMPLVSTVHYGKNYNNAFWDGDQMVYGDGDGSFFTPFSGSLDVVAHELTH 347 (507)
T ss_pred hhhHHHHhhcchHHHHHHHHhc-ccCcCCCCCceEEEEeeccccccccccCceeEeecCCcceecccccccceehhhhcc
Confidence 4455666778899999999999 434433 344432222 567777654222111 1123599999999
Q ss_pred hhhC---CCCCccccchhhhhhhHHHHHHHHhhhhc-cc-ccch-hhhhHhhcchhhHHHHHHHhccCCCCCCccEEEeC
Q psy3708 151 QWFG---NLVTMKWWTDLWLNEGFATYMAAQALNDV-HI-LFDS-QVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIP 224 (240)
Q Consensus 151 qWfG---~~v~~~~w~d~WL~EG~a~y~~~~~~~~~-~~-~~~~-~~~~~~~~~~~~l~~~~~~~~~~~p~~k~d~~~~~ 224 (240)
.--+ +++--.+ .-=|||+|+.-+.-++.... +. +|.+ .+.|......+.||.+.+=....-|--..|-++.+
T Consensus 348 Gvtq~tA~L~Y~~q--sGALNEsfSDvfG~~i~~~~~~~~DW~IGddi~~p~~~g~aLRsl~~P~~~gqPdh~~~yy~g~ 425 (507)
T COG3227 348 GVTQQTAGLIYRGQ--SGALNESFSDVFGTLIEQYVKNQVDWLIGDDIYTPGIKGDALRSLSNPGRAGQPDHMKNYYTGT 425 (507)
T ss_pred hhhhhccCceecCC--CCchhhHHHHHHHHHHHHHhccCcceEeccccccCCCCCchhhcccCccccCCcchhhceeecC
Confidence 5544 4444332 23699999998875443332 11 4643 34566666777777655422222244444455555
Q ss_pred C
Q psy3708 225 D 225 (240)
Q Consensus 225 ~ 225 (240)
+
T Consensus 426 ~ 426 (507)
T COG3227 426 D 426 (507)
T ss_pred C
Confidence 5
No 30
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=89.07 E-value=0.8 Score=39.26 Aligned_cols=32 Identities=44% Similarity=0.545 Sum_probs=23.3
Q ss_pred HHHHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhh
Q psy3708 139 SIANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALND 182 (240)
Q Consensus 139 ~~~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~ 182 (240)
..-.++|||+|||= | ...|.=|+|++++...+
T Consensus 195 ~~P~T~~HElAHq~-G-----------~a~E~EANFiayLac~~ 226 (318)
T PF12725_consen 195 SLPFTICHELAHQL-G-----------FASEDEANFIAYLACIN 226 (318)
T ss_pred cccHHHHHHHHHHh-C-----------CCCHHHHHHHHHHHHhc
Confidence 34469999999993 1 23477799999887654
No 31
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=87.98 E-value=0.39 Score=38.48 Aligned_cols=63 Identities=19% Similarity=0.267 Sum_probs=32.1
Q ss_pred HHHHHHHHhCCCCCCCCCCeEEcCCc-------ce---eeEcCCCCChhhHHHHHHHHHHHhhhhhhCCCCCc
Q psy3708 97 FLNFFERYFNIRYPLPKQDMVAIPSL-------SA---LLFDDVRSSIHNEYSIANTIAHELAHQWFGNLVTM 159 (240)
Q Consensus 97 ~l~~~e~~~g~~yp~~~l~~v~~p~~-------~~---~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~~v~~ 159 (240)
.++-..+..|...|.+++-++..+.. .. ++.....-...+..++..++|||++|---++....
T Consensus 36 ~v~~l~~~~~~~~~~~~v~v~~~~~~NA~~~g~~~~~~I~v~~~ll~~~~~~el~aVlaHElgH~~~~h~~~~ 108 (226)
T PF01435_consen 36 IVEELARRAGLGIPPPRVYVIDSPSPNAFATGGGPRKRIVVTSGLLESLSEDELAAVLAHELGHIKHRHILKS 108 (226)
T ss_dssp HHHHHHHHHHCTSS--EEEEE--SSEEEEEETTTC--EEEEEHHHHHHSSHHHHHHHHHHHHHHHHTTHCCCC
T ss_pred HHHHHHHHhcCCCCCCeEEEEcCCCCcEEEEccCCCcEEEEeChhhhcccHHHHHHHHHHHHHHHHcCCcchH
Confidence 33333334444555555544444433 11 33333322334456788899999999887654433
No 32
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=87.68 E-value=1.1 Score=33.78 Aligned_cols=30 Identities=17% Similarity=0.195 Sum_probs=19.5
Q ss_pred eeEcCCCCChhhHHHHHHHHHHHhhhhhhC
Q psy3708 125 LLFDDVRSSIHNEYSIANTIAHELAHQWFG 154 (240)
Q Consensus 125 ~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG 154 (240)
|.+++..........+..+|.||++|.+..
T Consensus 44 I~ln~~l~~~~~~~~l~~~l~HEm~H~~~~ 73 (146)
T smart00731 44 IRLNPKLLTENGRDRLRETLLHELCHAALY 73 (146)
T ss_pred EEeCHHHHhhccHHHHHhhHHHHHHHHHHH
Confidence 444444333233356778999999999975
No 33
>PRK05457 heat shock protein HtpX; Provisional
Probab=86.62 E-value=1.4 Score=37.15 Aligned_cols=59 Identities=17% Similarity=0.193 Sum_probs=34.4
Q ss_pred HHHHHHHHHHhCCCCCCCCCCeEEcCCcc-----------eeeEcCCCCChhhHHHHHHHHHHHhhhhhhCC
Q psy3708 95 PEFLNFFERYFNIRYPLPKQDMVAIPSLS-----------ALLFDDVRSSIHNEYSIANTIAHELAHQWFGN 155 (240)
Q Consensus 95 ~~~l~~~e~~~g~~yp~~~l~~v~~p~~~-----------~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~ 155 (240)
.+.++-+.+..|. |.+++-++..+... .+.++...-..-+.+++..++|||++|.=-|+
T Consensus 80 ~~~v~~la~~~g~--p~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L~~~El~aVlAHElgHi~~~d 149 (284)
T PRK05457 80 VETVARQARQAGI--GMPEVAIYHSPEINAFATGASKNNSLVAVSTGLLQNMSRDEVEAVLAHEISHIANGD 149 (284)
T ss_pred HHHHHHHHHhCCC--CCCCEEEEeCCCceEEEecCCCCCeEEEeehHHhhhCCHHHHHHHHHHHHHHHHcCC
Confidence 3455555666774 56676655443221 13344433223345788899999999975444
No 34
>PRK04897 heat shock protein HtpX; Provisional
Probab=85.33 E-value=1.3 Score=37.53 Aligned_cols=60 Identities=20% Similarity=0.201 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHhCCCCCCCCCCeEEcCCcc-----------eeeEcCCCCChhhHHHHHHHHHHHhhhhhhCC
Q psy3708 94 GPEFLNFFERYFNIRYPLPKQDMVAIPSLS-----------ALLFDDVRSSIHNEYSIANTIAHELAHQWFGN 155 (240)
Q Consensus 94 ~~~~l~~~e~~~g~~yp~~~l~~v~~p~~~-----------~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~ 155 (240)
..+.++-+.+..|. |.|++-++..+... .+.+....-..-+.+++..++|||++|-=-|+
T Consensus 82 L~~~v~~la~~~gi--p~p~v~v~~~~~~NAfa~G~~~~~~~v~vt~gLl~~l~~~El~aVlAHElgHi~~~d 152 (298)
T PRK04897 82 LWHIVEDMAMVAQI--PMPRVFIIDDPSPNAFATGSSPKNAAVAVTTGLLAIMNREELEGVIGHEISHIRNYD 152 (298)
T ss_pred HHHHHHHHHHHcCC--CCCcEEEecCCCCceEEeccCCCCcEEEeehHHHhhCCHHHHHHHHHHHHHHHhcCC
Confidence 44555666667774 45666555433221 23333322222345678899999999954333
No 35
>PRK01345 heat shock protein HtpX; Provisional
Probab=84.46 E-value=1.7 Score=37.33 Aligned_cols=60 Identities=20% Similarity=0.158 Sum_probs=33.4
Q ss_pred HHHHHHHHHHHhCCCCCCCCCCeEEcCCcc-----------eeeEcCCCCChhhHHHHHHHHHHHhhhhhhCC
Q psy3708 94 GPEFLNFFERYFNIRYPLPKQDMVAIPSLS-----------ALLFDDVRSSIHNEYSIANTIAHELAHQWFGN 155 (240)
Q Consensus 94 ~~~~l~~~e~~~g~~yp~~~l~~v~~p~~~-----------~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~ 155 (240)
..++++-+.+..|+| .+++-++-.+... .+......-...+.+++..++|||++|.==++
T Consensus 69 L~~~v~~La~~agi~--~p~v~vid~~~~NAFa~G~~~~~~~V~vt~gLL~~L~~dEL~aVlAHElgHi~~~d 139 (317)
T PRK01345 69 LYRMVRDLARRAGLP--MPKVYIIDNPQPNAFATGRNPENAAVAATTGLLQRLSPEEVAGVMAHELAHVKNRD 139 (317)
T ss_pred HHHHHHHHHHHcCCC--CCcEEEEcCCCcceEEecCCCCCeEEEechHHHhhCCHHHHHHHHHHHHHHHHcCC
Confidence 345566666677755 4565444332221 23333333223345688899999999954333
No 36
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=84.43 E-value=2.4 Score=28.38 Aligned_cols=53 Identities=23% Similarity=0.236 Sum_probs=29.8
Q ss_pred HHHHHHHHhCCCCCCCCCCeEEcCCc---------------ceeeEcCCCCChhhHHHHHHHHHHHhhhhh
Q psy3708 97 FLNFFERYFNIRYPLPKQDMVAIPSL---------------SALLFDDVRSSIHNEYSIANTIAHELAHQW 152 (240)
Q Consensus 97 ~l~~~e~~~g~~yp~~~l~~v~~p~~---------------~~~~~~~~~~~~~~~~~~~~~iaHEiaHqW 152 (240)
+...+|..||.+ +.++++-.-|.- ..+.+.+....... ..-..+++||++|-+
T Consensus 6 ~r~~~e~~~G~d--l~~Vrvh~~~~a~~~~~~~~A~A~T~G~~I~f~~g~~~~~s-~~~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 6 IRSRLERAFGAD--LSDVRVHTGPAASRAAAALGARAFTVGNDIYFAPGKYNPDS-PEGRALLAHELAHVV 73 (79)
T ss_pred HHHHHHHHhCCC--ccceEEEeCCchhhhhhccCCeEEEECCEEEEcCCCcCCCC-CCcchhHhHHHHHHH
Confidence 456688899844 666665544322 45666443311111 112359999999954
No 37
>PRK03001 M48 family peptidase; Provisional
Probab=84.03 E-value=1.6 Score=36.66 Aligned_cols=56 Identities=20% Similarity=0.162 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHhCCCCCCCCCCeEEcCCcc-----------eeeEcCCCCChhhHHHHHHHHHHHhhhh
Q psy3708 94 GPEFLNFFERYFNIRYPLPKQDMVAIPSLS-----------ALLFDDVRSSIHNEYSIANTIAHELAHQ 151 (240)
Q Consensus 94 ~~~~l~~~e~~~g~~yp~~~l~~v~~p~~~-----------~~~~~~~~~~~~~~~~~~~~iaHEiaHq 151 (240)
..+.++-+.+..|+|. +++-++..+... .+...+..-..-+.+++..++|||++|-
T Consensus 69 L~~~v~~la~~~g~~~--p~v~v~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l~~~El~aVlAHElgHi 135 (283)
T PRK03001 69 FYRMVRELAQRAGLPM--PKVYLINEDQPNAFATGRNPEHAAVAATTGILRVLSEREIRGVMAHELAHV 135 (283)
T ss_pred HHHHHHHHHHHcCCCC--CeEEEecCCCcceEEecCCCCCeEEEecHHHHhhCCHHHHHHHHHHHHHHH
Confidence 4455556666777554 455443322111 1333333222234567889999999994
No 38
>PRK03072 heat shock protein HtpX; Provisional
Probab=82.82 E-value=2.2 Score=36.04 Aligned_cols=56 Identities=18% Similarity=0.213 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHhCCCCCCCCCCeEEcCCcceeeE-----------cCCCCChhhHHHHHHHHHHHhhhh
Q psy3708 94 GPEFLNFFERYFNIRYPLPKQDMVAIPSLSALLF-----------DDVRSSIHNEYSIANTIAHELAHQ 151 (240)
Q Consensus 94 ~~~~l~~~e~~~g~~yp~~~l~~v~~p~~~~~~~-----------~~~~~~~~~~~~~~~~iaHEiaHq 151 (240)
..+.++-+.+..|. |.+++-++..+....+.+ ....-..-+.+++..++|||++|-
T Consensus 72 L~~~v~~la~~~g~--p~p~vyv~~~~~~NAFa~G~~~~~~~v~vt~gLl~~l~~~El~aVlAHElgHi 138 (288)
T PRK03072 72 MYRIVRELSTAARQ--PMPRLYISPTAAPNAFATGRNPRNAAVCCTEGILQILNERELRGVLGHELSHV 138 (288)
T ss_pred HHHHHHHHHHHcCC--CCCCEEEecCCCCceEEecCCCCCcEEEecHHHHHhCCHHHHHHHHHHHHHHH
Confidence 44566666677775 456665444333222222 222222234567889999999994
No 39
>PRK03982 heat shock protein HtpX; Provisional
Probab=81.53 E-value=2.8 Score=35.38 Aligned_cols=17 Identities=47% Similarity=0.573 Sum_probs=14.2
Q ss_pred hHHHHHHHHHHHhhhhh
Q psy3708 136 NEYSIANTIAHELAHQW 152 (240)
Q Consensus 136 ~~~~~~~~iaHEiaHqW 152 (240)
+.+++..++|||++|-=
T Consensus 121 ~~~El~AVlAHElgHi~ 137 (288)
T PRK03982 121 NEDELEGVIAHELTHIK 137 (288)
T ss_pred CHHHHHHHHHHHHHHHH
Confidence 45678899999999954
No 40
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=81.41 E-value=3.1 Score=31.43 Aligned_cols=30 Identities=20% Similarity=0.167 Sum_probs=20.3
Q ss_pred eeEcCCCCChhhHHHHHHHHHHHhhhhhhC
Q psy3708 125 LLFDDVRSSIHNEYSIANTIAHELAHQWFG 154 (240)
Q Consensus 125 ~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG 154 (240)
+.+++..........+..+|.||++|.|..
T Consensus 45 I~ls~~~~~~~~~~~~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 45 IRLSPKLLDRNPEEELIDTLLHEMAHAAAY 74 (157)
T ss_pred EEECHHHHHhhHHHHHHHHHHHHHHHHHhh
Confidence 444443333334567788999999999983
No 41
>PRK02391 heat shock protein HtpX; Provisional
Probab=81.36 E-value=2.6 Score=35.72 Aligned_cols=60 Identities=12% Similarity=0.062 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHhCCCCCCCCCCeEEcCCcce-----------eeEcCCCCChhhHHHHHHHHHHHhhhhhhCC
Q psy3708 94 GPEFLNFFERYFNIRYPLPKQDMVAIPSLSA-----------LLFDDVRSSIHNEYSIANTIAHELAHQWFGN 155 (240)
Q Consensus 94 ~~~~l~~~e~~~g~~yp~~~l~~v~~p~~~~-----------~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~ 155 (240)
..+.++-+.+..|.| .+++-++..+.... +++.+..-..-+.+++..++|||++|--=++
T Consensus 78 L~~~v~~la~~~~~~--~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L~~~El~aVlaHElgHi~~~d 148 (296)
T PRK02391 78 LHAMVERLCALADLP--KPRVAVADSDVPNAFATGRSPKNAVVCVTTGLMRRLDPDELEAVLAHELSHVKNRD 148 (296)
T ss_pred HHHHHHHHHHHcCCC--CCcEEEEeCCCCceEEecCCCCCcEEEecHHHHhhCCHHHHHHHHHHHHHHHHcCC
Confidence 334455555677754 45655554332211 3333322222244678899999999954333
No 42
>KOG2661|consensus
Probab=80.30 E-value=2.5 Score=36.14 Aligned_cols=21 Identities=24% Similarity=0.446 Sum_probs=15.9
Q ss_pred hhHHHHHHHHHHHhhhhhhCC
Q psy3708 135 HNEYSIANTIAHELAHQWFGN 155 (240)
Q Consensus 135 ~~~~~~~~~iaHEiaHqWfG~ 155 (240)
.+...++.+++||+|||=-+.
T Consensus 270 k~ddglAtvLgHE~aHaVarH 290 (424)
T KOG2661|consen 270 KDDDGLATVLGHEIAHAVARH 290 (424)
T ss_pred cChHHHHHHHHHHHHHHHHHH
Confidence 344577889999999986543
No 43
>PRK01265 heat shock protein HtpX; Provisional
Probab=80.10 E-value=2.8 Score=36.00 Aligned_cols=55 Identities=22% Similarity=0.250 Sum_probs=30.5
Q ss_pred HHHHHHHHHHhCCCCCCCCCCeEEcCCcc-----------eeeEcCCCCChhhHHHHHHHHHHHhhhh
Q psy3708 95 PEFLNFFERYFNIRYPLPKQDMVAIPSLS-----------ALLFDDVRSSIHNEYSIANTIAHELAHQ 151 (240)
Q Consensus 95 ~~~l~~~e~~~g~~yp~~~l~~v~~p~~~-----------~~~~~~~~~~~~~~~~~~~~iaHEiaHq 151 (240)
.+.++-+.+..|. |.+++-++..+... .+...+..-..-+.+++..++|||++|-
T Consensus 86 ~~~v~~la~~~g~--~~p~vyv~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l~~~El~aVlAHElgHi 151 (324)
T PRK01265 86 YSIVAEVAKYNGI--RVPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKILNRDEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHcCC--CCCeEEEecCCCCCeEEeccCCCCCEEEEehHHHhhCCHHHHHHHHHHHHHHH
Confidence 3455555666775 45565544333221 2333333223334567889999999994
No 44
>PRK02870 heat shock protein HtpX; Provisional
Probab=79.90 E-value=3.7 Score=35.47 Aligned_cols=27 Identities=19% Similarity=0.202 Sum_probs=17.7
Q ss_pred eeEcCCCCChhhHHHHHHHHHHHhhhh
Q psy3708 125 LLFDDVRSSIHNEYSIANTIAHELAHQ 151 (240)
Q Consensus 125 ~~~~~~~~~~~~~~~~~~~iaHEiaHq 151 (240)
+...++.-..-+..++..++|||++|-
T Consensus 158 Ivvt~GLL~~L~~dEL~aVlAHELgHi 184 (336)
T PRK02870 158 VAITTGLLEKLDRDELQAVMAHELSHI 184 (336)
T ss_pred EEEehHHhhhCCHHHHHHHHHHHHHHH
Confidence 333333333335578889999999995
No 45
>PRK04351 hypothetical protein; Provisional
Probab=76.92 E-value=3.5 Score=31.21 Aligned_cols=49 Identities=18% Similarity=0.151 Sum_probs=27.9
Q ss_pred HHHHhCCCCCCCCCCeEEcCCc-----------ceeeEcCCCCChhhHHHHHHHHHHHhhhhh
Q psy3708 101 FERYFNIRYPLPKQDMVAIPSL-----------SALLFDDVRSSIHNEYSIANTIAHELAHQW 152 (240)
Q Consensus 101 ~e~~~g~~yp~~~l~~v~~p~~-----------~~~~~~~~~~~~~~~~~~~~~iaHEiaHqW 152 (240)
-+++|+.|+|. .+..-..+ ..+-+++..........+..+|+||++|-.
T Consensus 14 s~~~F~~~f~~---~v~~n~RlrttgG~~~l~~~~I~lnp~ll~~~~~~~l~~vv~HElcH~~ 73 (149)
T PRK04351 14 SLEYFGKPFRH---QAYFNKRLRTTGGRYLLKDHHIEFNPKMLEEYGLEELIGIIKHELCHYH 73 (149)
T ss_pred HHHHhCCCCCc---EEEEeccchhhhheeecCCCeEEeCHHHHhhccHHHHHhhHHHHHHHHH
Confidence 35688877773 22222222 223344443333335677889999999954
No 46
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=75.55 E-value=1.8 Score=33.60 Aligned_cols=12 Identities=58% Similarity=0.817 Sum_probs=10.6
Q ss_pred HHHHHHHhhhhh
Q psy3708 141 ANTIAHELAHQW 152 (240)
Q Consensus 141 ~~~iaHEiaHqW 152 (240)
..++|||++||+
T Consensus 112 ~~~~aHElGH~l 123 (173)
T PF13574_consen 112 IDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHH
T ss_pred eeeehhhhHhhc
Confidence 458999999998
No 47
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=72.37 E-value=2.9 Score=29.43 Aligned_cols=31 Identities=23% Similarity=0.190 Sum_probs=19.3
Q ss_pred ceeeEcCCCCChhhHHHHHHHHHHHhhhhhhCCCC
Q psy3708 123 SALLFDDVRSSIHNEYSIANTIAHELAHQWFGNLV 157 (240)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~~v 157 (240)
..++++++.+. .+...+++||++|.+++..-
T Consensus 29 ~~I~in~~~~~----~~~~f~laHELgH~~~~~~~ 59 (122)
T PF06114_consen 29 PIIFINSNLSP----ERQRFTLAHELGHILLHHGD 59 (122)
T ss_dssp TEEEEESSS-H----HHHHHHHHHHHHHHHHHH-H
T ss_pred CEEEECCCCCH----HHHHHHHHHHHHHHHhhhcc
Confidence 34555555322 23346999999999997654
No 48
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=72.06 E-value=6.4 Score=31.18 Aligned_cols=64 Identities=19% Similarity=0.313 Sum_probs=35.7
Q ss_pred CCCeEEcCC-c-ceeeEcCCCCChhhHHHHHHHHHHHhhhhhhCCCCCcc---ccchhhhhhhHHHHHH
Q psy3708 113 KQDMVAIPS-L-SALLFDDVRSSIHNEYSIANTIAHELAHQWFGNLVTMK---WWTDLWLNEGFATYMA 176 (240)
Q Consensus 113 ~l~~v~~p~-~-~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~~v~~~---~w~d~WL~EG~a~y~~ 176 (240)
..+.|.+|- . ...+|..+.........+..+|||||+|-.-...+..+ ...+.|-.+....|.+
T Consensus 7 ~~N~i~ip~~~l~~P~f~~~~p~~~~yg~lG~ilahel~hafd~~g~~~D~~g~~~~wWs~~~~~~~~~ 75 (206)
T PF01431_consen 7 RFNSIVIPAGILQPPFFDPNYPPALNYGGLGFILAHELMHAFDPEGINYDEDGNLRNWWSEESRSNFKE 75 (206)
T ss_dssp TTTEEEEEGGGSSTTT--TTS-HHHHHHTHHHHHHHHHHHCTSTTGGGB-TTS-BS--S-HHHHHHHHH
T ss_pred ccCEEEecHHHhCCccCCCCCCHHHHHHHHHHHHHHHHHHHHHHhHhhcCcccccccccchhhHHHHHH
Confidence 455666664 2 66677777766677777888999999997644333321 1223354555555544
No 49
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=71.96 E-value=1.7 Score=39.76 Aligned_cols=65 Identities=25% Similarity=0.378 Sum_probs=42.4
Q ss_pred CCCCeEEcCCc--ceeeEcCCCCChhhHHHHHHHHHHHhhhhhhCCCCC---ccccchhhhhhhHHHHHH
Q psy3708 112 PKQDMVAIPSL--SALLFDDVRSSIHNEYSIANTIAHELAHQWFGNLVT---MKWWTDLWLNEGFATYMA 176 (240)
Q Consensus 112 ~~l~~v~~p~~--~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~~v~---~~~w~d~WL~EG~a~y~~ 176 (240)
+..+.|+.|-. ...+|+++-++..+...+..+|+|||+|..=-+.-. .....|-|.+|--+.|-+
T Consensus 457 p~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFDdqGakfD~~GnL~dWWT~eD~~aF~~ 526 (654)
T COG3590 457 PQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFDDQGAKFDGDGNLNDWWTDEDAAAFKE 526 (654)
T ss_pred CCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhcccccCCccccCCCCcHHhhcCHHHHHHHHH
Confidence 34455666643 677788888888888888899999999976221111 112334577787777754
No 50
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=71.07 E-value=15 Score=28.92 Aligned_cols=65 Identities=22% Similarity=0.194 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc-------ceeeEcCCCCChhhHHHHHHHHHHHhhhhhhCC
Q psy3708 88 DFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL-------SALLFDDVRSSIHNEYSIANTIAHELAHQWFGN 155 (240)
Q Consensus 88 ~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~-------~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~ 155 (240)
+.+.......+..+++.+|.++ +++.+--...- ..+.++-.... ....-+..+|+||++|.-.-|
T Consensus 108 ~~~~~~l~~~~~~~~~~~~~~~--~~i~ir~~ksrWGsc~~~~~I~ln~~L~~-~P~~~idYVvvHEL~Hl~~~n 179 (205)
T PF01863_consen 108 KQAKEYLPERLKKYAKKLGLPP--PKIKIRDMKSRWGSCSSKGNITLNWRLVM-APPEVIDYVVVHELCHLRHPN 179 (205)
T ss_pred HHHHHHHHHHHHHHHHHcCCCc--ceEEEeehhhccccCCCCCcEEeeccccc-CCccHHHHHHHHHHHHhccCC
Confidence 3455666777888888888543 33322211111 33444433222 223456789999999997654
No 51
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=69.96 E-value=3.5 Score=32.44 Aligned_cols=25 Identities=32% Similarity=0.347 Sum_probs=19.6
Q ss_pred CChhhHHHHHHHHHHHhhhhhhCCC
Q psy3708 132 SSIHNEYSIANTIAHELAHQWFGNL 156 (240)
Q Consensus 132 ~~~~~~~~~~~~iaHEiaHqWfG~~ 156 (240)
...-....+..++.||+||.++|+-
T Consensus 74 ~~fl~~~~i~~t~lHELaH~~~~~H 98 (186)
T PF08325_consen 74 GGFLPYETILGTMLHELAHNVHGPH 98 (186)
T ss_pred CCEeeHHHHHHHHHHHHHhcccCCc
Confidence 3344556788999999999998873
No 52
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=69.46 E-value=12 Score=30.09 Aligned_cols=54 Identities=22% Similarity=0.110 Sum_probs=32.8
Q ss_pred ceeeEcCCCCChhhHHHHHHHHHHHhhhhhhCCCC-----CccccchhhhhhhHHHHHHHHhh
Q psy3708 123 SALLFDDVRSSIHNEYSIANTIAHELAHQWFGNLV-----TMKWWTDLWLNEGFATYMAAQAL 180 (240)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~~v-----~~~~w~d~WL~EG~a~y~~~~~~ 180 (240)
+.++++.+.+. .+-..++|||++|-++.... ....+...=..|..|++++..++
T Consensus 59 ~~I~iN~n~~~----~r~rFtlAHELGH~llH~~~~~~~~~~~~~~~~~~~E~~AN~FAa~lL 117 (213)
T COG2856 59 PVIYINANNSL----ERKRFTLAHELGHALLHTDLNTRFDAEPTLQQDRKIEAEANAFAAELL 117 (213)
T ss_pred ceEEEeCCCCH----HHHHHHHHHHHhHHHhccccchhhhcccccchhHHHHHHHHHHHHHHh
Confidence 34555555422 23346999999999998654 22223344456888888775543
No 53
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=63.74 E-value=21 Score=28.98 Aligned_cols=65 Identities=18% Similarity=0.106 Sum_probs=38.3
Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc-------ceeeEcCCCCChhhHHHHHHHHHHHhhhhhhCC
Q psy3708 88 DFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL-------SALLFDDVRSSIHNEYSIANTIAHELAHQWFGN 155 (240)
Q Consensus 88 ~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~-------~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~ 155 (240)
+.+.+.....++.+.+.+|.++.--.+. ....- +.+.++-.. ......-+..+++||+||-=..|
T Consensus 119 ~~~~~~l~~~~~~~~~~l~~~~~~~~ik--~~k~~WGScs~~~~i~~~~~l-~~~p~~~i~YVvvHELaHLke~n 190 (223)
T COG1451 119 EILREILEIRLKEYAKKLGVPPRAIKLK--NMKRRWGSCSKAGEIRFNWRL-VMAPEEVIDYVVVHELAHLKEKN 190 (223)
T ss_pred HHHHHHHHHHHHHHHHHhCCCccceeee--eccceeeeecCCCcEEeehhh-hcCCHHHHHHHHHHHHHHHhhhh
Confidence 3555667777888888898655432222 11111 333343322 22333466789999999998877
No 54
>KOG2719|consensus
Probab=63.72 E-value=11 Score=33.49 Aligned_cols=49 Identities=22% Similarity=0.356 Sum_probs=28.8
Q ss_pred HHHHhCCCCCCCCCCeEEc--CCc-------------ceeeEcCCCC--ChhhHHHHHHHHHHHhhhh
Q psy3708 101 FERYFNIRYPLPKQDMVAI--PSL-------------SALLFDDVRS--SIHNEYSIANTIAHELAHQ 151 (240)
Q Consensus 101 ~e~~~g~~yp~~~l~~v~~--p~~-------------~~~~~~~~~~--~~~~~~~~~~~iaHEiaHq 151 (240)
+.+-.| +|.+++.++-. +.- .-.+|+.-.. .....+++..+++||++|=
T Consensus 226 la~s~g--fp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~~~~~~~eel~AVl~HELGHW 291 (428)
T KOG2719|consen 226 LADSVG--FPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLEEEHLNNEELVAVLAHELGHW 291 (428)
T ss_pred HHHhcC--CCceEEEEEecCCCCCCCCeeeeeccccceEEEehhhhhhhhccccHHHHHHHHHHhhHH
Confidence 334455 88889888874 221 1233433221 1123467889999999993
No 55
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=62.49 E-value=4.5 Score=33.94 Aligned_cols=25 Identities=24% Similarity=0.332 Sum_probs=18.9
Q ss_pred hhhHHHHHHHHHHHhhhhhhCCCCC
Q psy3708 134 IHNEYSIANTIAHELAHQWFGNLVT 158 (240)
Q Consensus 134 ~~~~~~~~~~iaHEiaHqWfG~~v~ 158 (240)
.-+.+++..+++||++|.=-++.+.
T Consensus 151 ~l~~dEl~aVlaHElgHi~~rd~~~ 175 (302)
T COG0501 151 LLNDDELEAVLAHELGHIKNRHTLV 175 (302)
T ss_pred hCCHHHHHHHHHHHHHHHhcccHHH
Confidence 3455688899999999987666553
No 56
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=60.29 E-value=5.5 Score=31.23 Aligned_cols=15 Identities=47% Similarity=0.636 Sum_probs=12.2
Q ss_pred HHHHHHHHHHhhhhh
Q psy3708 138 YSIANTIAHELAHQW 152 (240)
Q Consensus 138 ~~~~~~iaHEiaHqW 152 (240)
.....++||||+|.+
T Consensus 140 ~~~~~~~AHEiGH~l 154 (196)
T PF13688_consen 140 YNGAITFAHEIGHNL 154 (196)
T ss_dssp HHHHHHHHHHHHHHT
T ss_pred CceehhhHHhHHHhc
Confidence 344579999999997
No 57
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=60.03 E-value=6 Score=28.14 Aligned_cols=26 Identities=31% Similarity=0.461 Sum_probs=16.8
Q ss_pred eeeEcCCCCChhhHHHHHHHHHHHhhhhhh
Q psy3708 124 ALLFDDVRSSIHNEYSIANTIAHELAHQWF 153 (240)
Q Consensus 124 ~~~~~~~~~~~~~~~~~~~~iaHEiaHqWf 153 (240)
-+..+++.... .+...++||++|-|=
T Consensus 67 ~i~IDP~~~~K----GC~~TL~HEL~H~WQ 92 (141)
T PHA02456 67 WIEIDPDYANK----GCRDTLAHELNHAWQ 92 (141)
T ss_pred EEEECCccccc----chHHHHHHHHHHHHh
Confidence 34455554332 345699999999993
No 58
>PF08014 DUF1704: Domain of unknown function (DUF1704); InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=57.42 E-value=89 Score=27.24 Aligned_cols=51 Identities=14% Similarity=0.072 Sum_probs=29.5
Q ss_pred ceeeEcCCCCChhhHHHHHHHHHHHhhhh-------------hhCCCCCccccchhhhhhhHHHHHHHHh
Q psy3708 123 SALLFDDVRSSIHNEYSIANTIAHELAHQ-------------WFGNLVTMKWWTDLWLNEGFATYMAAQA 179 (240)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~iaHEiaHq-------------WfG~~v~~~~w~d~WL~EG~a~y~~~~~ 179 (240)
..+....+.. -...++..++.||++=+ |++...-+.. =..||+|.+.|++.
T Consensus 150 ~~l~I~~~~~--fs~~~l~~L~~HEigvH~lt~~Ng~~QPl~~l~~Glp~~~----~TQEGLAvl~E~l~ 213 (349)
T PF08014_consen 150 DRLKINKNAM--FSERDLEALLHHEIGVHLLTTLNGRAQPLKILSLGLPGYT----PTQEGLAVLSEYLS 213 (349)
T ss_pred CeeEEcCCCC--cCHHHHHHHHHHhhhhhhccccccccCCcHHhCCCCCCCC----CCchHHHHHHHHHh
Confidence 3455554432 23346778999999543 3332222211 23599999999774
No 59
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=54.99 E-value=13 Score=29.14 Aligned_cols=16 Identities=31% Similarity=0.563 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHhhhhh
Q psy3708 137 EYSIANTIAHELAHQW 152 (240)
Q Consensus 137 ~~~~~~~iaHEiaHqW 152 (240)
..++...++||++|.+
T Consensus 122 ~~R~~k~~~HElGH~l 137 (179)
T PRK13267 122 EERVRKEVTHELGHTL 137 (179)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 3456677999999983
No 60
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=52.72 E-value=21 Score=30.06 Aligned_cols=29 Identities=21% Similarity=0.269 Sum_probs=18.7
Q ss_pred ceeeEcCCCCChhhHHHHHHHHHHHhhhh
Q psy3708 123 SALLFDDVRSSIHNEYSIANTIAHELAHQ 151 (240)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~iaHEiaHq 151 (240)
+.++.+.+....-+.++...++.||++|-
T Consensus 179 p~I~lP~~~~~~~~~~el~~il~HEl~Hi 207 (299)
T PF05569_consen 179 PVIVLPESLLEDLSEEELRAILLHELAHI 207 (299)
T ss_pred eEEEecCccccccCHHHHHHHHHHHHHHH
Confidence 45555655332334456678999999993
No 61
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=50.85 E-value=8.8 Score=27.50 Aligned_cols=11 Identities=45% Similarity=0.818 Sum_probs=9.7
Q ss_pred HHHHHHhhhhh
Q psy3708 142 NTIAHELAHQW 152 (240)
Q Consensus 142 ~~iaHEiaHqW 152 (240)
.+++||++|+.
T Consensus 109 ~~~~HEiGH~l 119 (124)
T PF13582_consen 109 DTFAHEIGHNL 119 (124)
T ss_dssp THHHHHHHHHT
T ss_pred eEeeehhhHhc
Confidence 69999999974
No 62
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=50.80 E-value=11 Score=34.04 Aligned_cols=21 Identities=38% Similarity=0.468 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHhhhhhhCCCC
Q psy3708 137 EYSIANTIAHELAHQWFGNLV 157 (240)
Q Consensus 137 ~~~~~~~iaHEiaHqWfG~~v 157 (240)
..++..+||||++|-==+..+
T Consensus 127 esElagViAHEigHv~qrH~a 147 (484)
T COG4783 127 ESELAGVIAHEIGHVAQRHLA 147 (484)
T ss_pred HHHHHHHHHHHHHHHhhhhHH
Confidence 357789999999995434433
No 63
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=49.94 E-value=12 Score=29.37 Aligned_cols=15 Identities=47% Similarity=0.592 Sum_probs=12.3
Q ss_pred HHHHHHHHHHhhhhh
Q psy3708 138 YSIANTIAHELAHQW 152 (240)
Q Consensus 138 ~~~~~~iaHEiaHqW 152 (240)
.....++|||++|+.
T Consensus 129 ~~~a~~~AHElGH~l 143 (194)
T cd04269 129 LLFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHHhhc
Confidence 345679999999986
No 64
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=49.36 E-value=21 Score=27.01 Aligned_cols=23 Identities=39% Similarity=0.400 Sum_probs=15.4
Q ss_pred HHHHHHHHHHhhh----hhhCCCCCccc
Q psy3708 138 YSIANTIAHELAH----QWFGNLVTMKW 161 (240)
Q Consensus 138 ~~~~~~iaHEiaH----qWfG~~v~~~~ 161 (240)
..+..+|.||+|| |=|| .+-+..
T Consensus 59 ~f~~~vV~HELaHl~ly~~~g-r~~phg 85 (156)
T COG3091 59 DFIEQVVPHELAHLHLYQEFG-RYKPHG 85 (156)
T ss_pred HHHHHHHHHHHHHHHHHHHcC-CCCCCc
Confidence 3567899999998 3355 455543
No 65
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=48.60 E-value=29 Score=27.26 Aligned_cols=15 Identities=53% Similarity=0.649 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHhhhh
Q psy3708 137 EYSIANTIAHELAHQ 151 (240)
Q Consensus 137 ~~~~~~~iaHEiaHq 151 (240)
......++|||++|.
T Consensus 128 ~~~~a~~~AHelGH~ 142 (199)
T PF01421_consen 128 GLSFAVIIAHELGHN 142 (199)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHh
Confidence 345667999999995
No 66
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=45.95 E-value=13 Score=30.57 Aligned_cols=15 Identities=40% Similarity=0.636 Sum_probs=12.0
Q ss_pred HHHHHHHHhhhhhhCC
Q psy3708 140 IANTIAHELAHQWFGN 155 (240)
Q Consensus 140 ~~~~iaHEiaHqWfG~ 155 (240)
...++|||++|.+ |.
T Consensus 167 ~a~t~AHElGHnl-Gm 181 (244)
T cd04270 167 SDLVTAHELGHNF-GS 181 (244)
T ss_pred HHHHHHHHHHHhc-CC
Confidence 4569999999997 53
No 67
>COG2719 SpoVR Uncharacterized conserved protein [Function unknown]
Probab=42.50 E-value=17 Score=32.37 Aligned_cols=50 Identities=20% Similarity=0.129 Sum_probs=34.6
Q ss_pred HHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhcccccchhhhhHhh
Q psy3708 141 ANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILFDSQVDFAAE 196 (240)
Q Consensus 141 ~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~~~~~~~~~~ 196 (240)
.-.|..++|.-.+... +.--||||.|+|-.+.++.+++.+.+..+.+..+
T Consensus 250 il~ivR~ea~YF~PQ~------qTkVMNEGWAtfWHytiln~lydE~~~~~~~~lE 299 (495)
T COG2719 250 ILRIVRKEAQYFYPQR------QTKVMNEGWATFWHYTILNHLYDEGKLTERAMLE 299 (495)
T ss_pred HHHHHHHHHHHhcchH------HHHHhhhhHHHHHHHHHHHhhhhhcccChHHHHH
Confidence 3466777775554332 2346899999999999999888766665555443
No 68
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=42.18 E-value=76 Score=26.30 Aligned_cols=46 Identities=20% Similarity=0.172 Sum_probs=27.5
Q ss_pred HHHHHHHHHHhhhhhhCCCCCcc----ccchhhhhhhHHHHH--HHHhhhhc
Q psy3708 138 YSIANTIAHELAHQWFGNLVTMK----WWTDLWLNEGFATYM--AAQALNDV 183 (240)
Q Consensus 138 ~~~~~~iaHEiaHqWfG~~v~~~----~w~d~WL~EG~a~y~--~~~~~~~~ 183 (240)
.++-.+||||.=|..=-..++-. .--|.-+-||||++. |...-+++
T Consensus 138 ~~v~aliaHE~HH~~R~~~i~~~eg~vtLle~lV~EGLAE~av~E~~Gr~~l 189 (280)
T COG5504 138 TSVPALIAHEYHHNCRLRYIDYGEGSVTLLEALVMEGLAEHAVFELFGRGEL 189 (280)
T ss_pred cchHHHHHHHHHhhheecccccCCCceeHHHHHHHHHHHHHHHHHHhCcccc
Confidence 35667999999996522222221 122567789999993 44444444
No 69
>KOG3607|consensus
Probab=41.53 E-value=71 Score=30.77 Aligned_cols=78 Identities=21% Similarity=0.293 Sum_probs=39.8
Q ss_pred EEEEEeCCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCCCeEEcCCc----ceeeEc-----C----C--CCChhhHHH
Q psy3708 75 FRTWARRDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQDMVAIPSL----SALLFD-----D----V--RSSIHNEYS 139 (240)
Q Consensus 75 i~v~~~~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l~~v~~p~~----~~~~~~-----~----~--~~~~~~~~~ 139 (240)
+++|..++.......+.+.....+.|=..++....|.+...++..-.+ .++.|. + . .........
T Consensus 243 lE~Wt~~dki~~~~~~~~tL~~F~~wr~~~l~~r~~hD~a~L~~~~~~~~~~~G~a~~~~mCs~~~s~gv~~~~~~~~~~ 322 (716)
T KOG3607|consen 243 LEIWTDGNKIDVSEDLRETLHNFLKWRKSYLTTRLPHDAAHLLSGILFYGKYVGLAYFGGMCSPGHSGGVNKFHSDILLA 322 (716)
T ss_pred EEecCCCCeecccccHHHHHHHHHHHHHhhccccCCCCceEEEEeeeccCceeceeecccccCcccccceeecCcccchh
Confidence 567777776655555556666666666655553334444433332122 111110 0 0 000111345
Q ss_pred HHHHHHHHhhhhh
Q psy3708 140 IANTIAHELAHQW 152 (240)
Q Consensus 140 ~~~~iaHEiaHqW 152 (240)
.+.++||||+|.-
T Consensus 323 ~a~v~AhelgH~l 335 (716)
T KOG3607|consen 323 FAVVLAHELGHNL 335 (716)
T ss_pred HHHHHHHHHHhhc
Confidence 6789999999963
No 70
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=41.34 E-value=56 Score=27.42 Aligned_cols=39 Identities=10% Similarity=-0.008 Sum_probs=29.6
Q ss_pred ceeeEcCCCCChhhHHHHHHHHHHHhhhhhhCCCCCccc
Q psy3708 123 SALLFDDVRSSIHNEYSIANTIAHELAHQWFGNLVTMKW 161 (240)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~iaHEiaHqWfG~~v~~~~ 161 (240)
..+.|++.....-....+..+++||+-|--|+.......
T Consensus 43 ~~l~~nP~~~~~l~~~~~~~~l~HevlH~~~~H~~r~~~ 81 (292)
T PF13203_consen 43 RRLYYNPEFLESLSPEERVGLLLHEVLHCLLRHPWRRGG 81 (292)
T ss_pred cEEEECcHHHhcCCHHHHHHHHHHHHHHHHccchhhhcc
Confidence 578888876444445567789999999999998766665
No 71
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=40.40 E-value=13 Score=28.94 Aligned_cols=14 Identities=50% Similarity=0.577 Sum_probs=11.1
Q ss_pred HHHHHHHHHhhhhh
Q psy3708 139 SIANTIAHELAHQW 152 (240)
Q Consensus 139 ~~~~~iaHEiaHqW 152 (240)
....++|||++|..
T Consensus 132 ~~~~~~aHElGH~l 145 (192)
T cd04267 132 LTALTMAHELGHNL 145 (192)
T ss_pred eehhhhhhhHHhhc
Confidence 44569999999975
No 72
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=39.73 E-value=70 Score=22.89 Aligned_cols=11 Identities=45% Similarity=0.625 Sum_probs=9.5
Q ss_pred HHHHHHHhhhh
Q psy3708 141 ANTIAHELAHQ 151 (240)
Q Consensus 141 ~~~iaHEiaHq 151 (240)
..+++||++|-
T Consensus 81 vKviiHEllHI 91 (133)
T COG4900 81 VKVIIHELLHI 91 (133)
T ss_pred HHHHHHHHhcC
Confidence 46999999995
No 73
>PF09865 DUF2092: Predicted periplasmic protein (DUF2092); InterPro: IPR019207 This entry represents various hypothetical prokaryotic proteins of unknown function.
Probab=38.28 E-value=69 Score=25.82 Aligned_cols=58 Identities=12% Similarity=0.068 Sum_probs=37.0
Q ss_pred EeEEeceeEeecCCCCCCCCCceEEEEEeCCccchHHHHHHHHHHHHHHHHHHhCCCCCCCCC
Q psy3708 52 AFIISDFTFRQSNTSNPDLSRVQFRTWARRDVIDQVDFAAEAGPEFLNFFERYFNIRYPLPKQ 114 (240)
Q Consensus 52 a~~vg~f~~~~~~~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~l~~~e~~~g~~yp~~~l 114 (240)
+-..|+....+. -.+|-.+.+|.+....-....+-..+..+++++.+.+|+..|...+
T Consensus 55 ~~~~gd~~~~~~-----~yDGkt~Tl~~~~~n~Ya~~~aP~tid~~i~~l~~~~gi~~P~aDl 112 (214)
T PF09865_consen 55 IDRRGDGADREF-----YYDGKTFTLYDPNQNVYAQADAPGTIDAAIDYLRDKYGIELPLADL 112 (214)
T ss_pred EEEEcCCcceEE-----EECCCEEEEEcCcCCeEEeccCCCCHHHHHHHHHHhhCCCccHHHh
Confidence 334666665555 4566678777765554322333345567889999999988887443
No 74
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=37.69 E-value=20 Score=28.79 Aligned_cols=13 Identities=46% Similarity=0.501 Sum_probs=11.0
Q ss_pred HHHHHHHHhhhhh
Q psy3708 140 IANTIAHELAHQW 152 (240)
Q Consensus 140 ~~~~iaHEiaHqW 152 (240)
...++|||++|..
T Consensus 145 ~~~~~AHElGH~l 157 (220)
T cd04272 145 GVYTMTHELAHLL 157 (220)
T ss_pred cHHHHHHHHHHHh
Confidence 3579999999985
No 75
>PF14891 Peptidase_M91: Effector protein
Probab=35.93 E-value=23 Score=27.33 Aligned_cols=15 Identities=27% Similarity=0.270 Sum_probs=12.0
Q ss_pred HHHHHHHHHhhhhhh
Q psy3708 139 SIANTIAHELAHQWF 153 (240)
Q Consensus 139 ~~~~~iaHEiaHqWf 153 (240)
....+|+|||+|-|=
T Consensus 102 ~p~v~L~HEL~HA~~ 116 (174)
T PF14891_consen 102 PPFVVLYHELIHAYD 116 (174)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345699999999874
No 76
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=35.52 E-value=25 Score=26.07 Aligned_cols=13 Identities=23% Similarity=0.542 Sum_probs=10.7
Q ss_pred HHHHHHHHHhhhh
Q psy3708 139 SIANTIAHELAHQ 151 (240)
Q Consensus 139 ~~~~~iaHEiaHq 151 (240)
.+..++.|||+|-
T Consensus 104 ~~~~v~~HEiGHa 116 (154)
T PF00413_consen 104 DLQSVAIHEIGHA 116 (154)
T ss_dssp EHHHHHHHHHHHH
T ss_pred hhhhhhhhccccc
Confidence 3557999999996
No 77
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=35.44 E-value=25 Score=25.85 Aligned_cols=14 Identities=43% Similarity=0.643 Sum_probs=10.8
Q ss_pred HHHHHHHHHHhhhh
Q psy3708 138 YSIANTIAHELAHQ 151 (240)
Q Consensus 138 ~~~~~~iaHEiaHq 151 (240)
....++.+||++|-
T Consensus 75 y~~~RIaaHE~GHi 88 (132)
T PF02031_consen 75 YNSTRIAAHELGHI 88 (132)
T ss_dssp S-HHHHHHHHHHHH
T ss_pred Cccceeeeehhccc
Confidence 34567999999996
No 78
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=33.83 E-value=1.8e+02 Score=22.64 Aligned_cols=18 Identities=28% Similarity=0.604 Sum_probs=14.4
Q ss_pred hhHHHHHHHHHHHhhhhh
Q psy3708 135 HNEYSIANTIAHELAHQW 152 (240)
Q Consensus 135 ~~~~~~~~~iaHEiaHqW 152 (240)
.....+..+|+||+.|-|
T Consensus 66 ~~~~~l~~~l~HELIHay 83 (173)
T PF09768_consen 66 RSQGHLEDTLTHELIHAY 83 (173)
T ss_pred CCHHHHHHHHHHHHHHHH
Confidence 344567789999999987
No 79
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=32.96 E-value=35 Score=25.60 Aligned_cols=21 Identities=14% Similarity=0.194 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHhCCCC
Q psy3708 89 FAAEAGPEFLNFFERYFNIRY 109 (240)
Q Consensus 89 ~~~~~~~~~l~~~e~~~g~~y 109 (240)
...+.+.++++..++...+.+
T Consensus 15 ~~r~~i~~A~~~W~~~~~i~F 35 (165)
T cd04268 15 KLRAAILDAIEAWNKAFAIGF 35 (165)
T ss_pred HHHHHHHHHHHHHHHHhCcCc
Confidence 344667778888888775443
No 80
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=31.58 E-value=28 Score=26.44 Aligned_cols=13 Identities=46% Similarity=1.227 Sum_probs=10.2
Q ss_pred HHHHHHHhhhhhhC
Q psy3708 141 ANTIAHELAHQWFG 154 (240)
Q Consensus 141 ~~~iaHEiaHqWfG 154 (240)
..+++||++| |+|
T Consensus 70 g~TltHEvGH-~LG 82 (154)
T PF05572_consen 70 GKTLTHEVGH-WLG 82 (154)
T ss_dssp SHHHHHHHHH-HTT
T ss_pred ccchhhhhhh-hhc
Confidence 3699999998 554
No 81
>PF08219 TOM13: Outer membrane protein TOM13; InterPro: IPR013262 The TOM13 family of proteins are mitochondrial outer membrane proteins that mediate the assembly of beta-barrel proteins [].; GO: 0005741 mitochondrial outer membrane
Probab=30.92 E-value=33 Score=22.64 Aligned_cols=11 Identities=36% Similarity=0.561 Sum_probs=9.3
Q ss_pred HHHHHHhhhhh
Q psy3708 142 NTIAHELAHQW 152 (240)
Q Consensus 142 ~~iaHEiaHqW 152 (240)
.++|||+++.|
T Consensus 54 El~AhE~~fr~ 64 (77)
T PF08219_consen 54 ELFAHEIAFRL 64 (77)
T ss_pred HHHHHHHHHHh
Confidence 59999999865
No 82
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=30.76 E-value=26 Score=27.93 Aligned_cols=11 Identities=45% Similarity=0.773 Sum_probs=9.7
Q ss_pred HHHHHHhhhhh
Q psy3708 142 NTIAHELAHQW 152 (240)
Q Consensus 142 ~~iaHEiaHqW 152 (240)
.+++|||+|..
T Consensus 139 ~~~aHEiGH~l 149 (206)
T PF13583_consen 139 QTFAHEIGHNL 149 (206)
T ss_pred hHHHHHHHHHh
Confidence 57999999986
No 83
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=30.49 E-value=34 Score=25.66 Aligned_cols=35 Identities=9% Similarity=-0.029 Sum_probs=23.4
Q ss_pred ceEEEEEeCCcc---chHHHHHHHHHHHHHHHHHHhCC
Q psy3708 73 VQFRTWARRDVI---DQVDFAAEAGPEFLNFFERYFNI 107 (240)
Q Consensus 73 ~~i~v~~~~~~~---~~~~~~~~~~~~~l~~~e~~~g~ 107 (240)
.++++|..+... .......+.+.++++..++..++
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l 39 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPL 39 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCe
Confidence 467777776543 23455677888888888877543
No 84
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=30.18 E-value=32 Score=27.68 Aligned_cols=32 Identities=9% Similarity=-0.099 Sum_probs=17.7
Q ss_pred ceEEEEEeCCccch-HHHHHHHHHHHHHHHHHH
Q psy3708 73 VQFRTWARRDVIDQ-VDFAAEAGPEFLNFFERY 104 (240)
Q Consensus 73 ~~i~v~~~~~~~~~-~~~~~~~~~~~l~~~e~~ 104 (240)
..|.|+..+..... .......+.+|+..|...
T Consensus 39 ~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNal 71 (211)
T PF12388_consen 39 RTITIIGYTGGSQSLNSAWRTALDEAINNYNAL 71 (211)
T ss_pred CEEEEEeCCCccccccHHHHHHHHHHHHHHHhh
Confidence 44666665532221 233445667788888765
No 85
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=29.16 E-value=20 Score=29.09 Aligned_cols=11 Identities=36% Similarity=0.806 Sum_probs=9.4
Q ss_pred HHHHHHhhhhh
Q psy3708 142 NTIAHELAHQW 152 (240)
Q Consensus 142 ~~iaHEiaHqW 152 (240)
.++||||+|.-
T Consensus 147 ~t~AHElGHnL 157 (228)
T cd04271 147 QVFAHEIGHTF 157 (228)
T ss_pred eehhhhhhhhc
Confidence 38999999984
No 86
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=29.16 E-value=35 Score=28.94 Aligned_cols=12 Identities=42% Similarity=0.601 Sum_probs=10.0
Q ss_pred HHHHHHHHhhhh
Q psy3708 140 IANTIAHELAHQ 151 (240)
Q Consensus 140 ~~~~iaHEiaHq 151 (240)
...+||||.+|+
T Consensus 170 ~ayVlAHEyGHH 181 (292)
T PF04228_consen 170 QAYVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHHHH
Confidence 456899999994
No 87
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=27.32 E-value=2.2e+02 Score=23.86 Aligned_cols=94 Identities=11% Similarity=-0.024 Sum_probs=47.9
Q ss_pred hHHHHHHHHHHHHHHHHHHhCCCCCC--------CCCCeEEcCCc---------ceeeEcCCCCCh-hhHHH---HHHHH
Q psy3708 86 QVDFAAEAGPEFLNFFERYFNIRYPL--------PKQDMVAIPSL---------SALLFDDVRSSI-HNEYS---IANTI 144 (240)
Q Consensus 86 ~~~~~~~~~~~~l~~~e~~~g~~yp~--------~~l~~v~~p~~---------~~~~~~~~~~~~-~~~~~---~~~~i 144 (240)
+....++...++++...+..|.+.+. .+.+++.-... ..+.+....... ..... -..-+
T Consensus 144 d~~~ll~~~D~ii~~~~el~Gl~~~~~~~~~~~~~~~r~v~~v~~~~g~m~a~g~~i~~~~~~~~~~l~~~~~~~~~WG~ 223 (307)
T PF13402_consen 144 DPEELLRFWDRIIDAEYELAGLDKSSPGPENNPMPNNRFVFDVQISAGYMHASGYPIGFPPNWMNELLNPNPLRKGGWGP 223 (307)
T ss_dssp SSHHHHHHHHHHHHHHHHHTT-BSS--GGGB--S--EEEEEETT----SEEEETTEEEEETT--HHHH-HHHHHHH-HHH
T ss_pred hHHHHHHHHHHHHHHHHHHhCCCcccCCccccCcccceEEEeccccccceeecCCcEEeeCcHHhcccCHhHcCCCCeee
Confidence 35677778888999999999976632 22255554433 223444332111 11111 13468
Q ss_pred HHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhcc
Q psy3708 145 AHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVH 184 (240)
Q Consensus 145 aHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~ 184 (240)
.||++|+-=- .+=.|.. +.|.-...++....+...
T Consensus 224 ~HE~GH~~Q~---~~~~~~g--~~EvTnNi~sl~~~~~~~ 258 (307)
T PF13402_consen 224 WHELGHNHQQ---GPWTWSG--MGEVTNNIYSLYVQEKFG 258 (307)
T ss_dssp HHHHHHHH-B---GGG--TT---TTTTHHHHHHHHHHHTT
T ss_pred hhhhhhhcCc---cccccCC--CCchhhHHHHHHHHHHHh
Confidence 9999996311 1111222 567777777766666553
No 88
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=27.32 E-value=52 Score=26.09 Aligned_cols=16 Identities=25% Similarity=0.418 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHhhhhh
Q psy3708 137 EYSIANTIAHELAHQW 152 (240)
Q Consensus 137 ~~~~~~~iaHEiaHqW 152 (240)
..++...+.||++|.|
T Consensus 142 ~~R~~Kea~HElGH~~ 157 (194)
T PF07998_consen 142 LERVCKEAVHELGHLF 157 (194)
T ss_dssp HHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHc
Confidence 4677889999999986
No 89
>PF07765 KIP1: KIP1-like protein; InterPro: IPR011684 This is a group of sequences found exclusively in plants. They are similar to kinase interacting protein 1 (KIP1), which has been found to interact with the kinase domain of PRK1, a receptor-like kinase []. This particular region contains two coiled-coils, which are described as motifs involved in protein-protein interactions []. It has also been suggested that the coiled-coils of the protein allow it to dimerise in vivo [].
Probab=27.00 E-value=1.7e+02 Score=19.33 Aligned_cols=23 Identities=22% Similarity=0.592 Sum_probs=18.4
Q ss_pred hhCCCCCccccchhhhhhhHHHHHH
Q psy3708 152 WFGNLVTMKWWTDLWLNEGFATYMA 176 (240)
Q Consensus 152 WfG~~v~~~~w~d~WL~EG~a~y~~ 176 (240)
||++.+.++. .-||.+.+++-=+
T Consensus 3 ww~sHi~~~~--skWL~~~l~dmd~ 25 (74)
T PF07765_consen 3 WWDSHISPKQ--SKWLQENLSDMDE 25 (74)
T ss_pred hhhhcCCCCC--CHHHHHHHHHHHH
Confidence 8898888875 6799999887643
No 90
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=26.80 E-value=34 Score=24.83 Aligned_cols=15 Identities=20% Similarity=0.623 Sum_probs=11.7
Q ss_pred HHHHHHHHHHhhhhh
Q psy3708 138 YSIANTIAHELAHQW 152 (240)
Q Consensus 138 ~~~~~~iaHEiaHqW 152 (240)
..+..++.|||+|+.
T Consensus 107 d~vthvliHEIgHhF 121 (136)
T COG3824 107 DQVTHVLIHEIGHHF 121 (136)
T ss_pred hHhhhhhhhhhhhhc
Confidence 345678999999984
No 91
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=25.90 E-value=35 Score=25.64 Aligned_cols=22 Identities=18% Similarity=0.288 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHhCCCC
Q psy3708 88 DFAAEAGPEFLNFFERYFNIRY 109 (240)
Q Consensus 88 ~~~~~~~~~~l~~~e~~~g~~y 109 (240)
......+..+++.+++..++.+
T Consensus 21 ~~~~~~v~~a~~~w~~~~~i~f 42 (167)
T cd00203 21 AQIQSLILIAMQIWRDYLNIRF 42 (167)
T ss_pred HHHHHHHHHHHHHHHhhhCceE
Confidence 5566777888888888766543
No 92
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=25.88 E-value=43 Score=25.15 Aligned_cols=10 Identities=40% Similarity=0.697 Sum_probs=9.0
Q ss_pred HHHHHHHhhh
Q psy3708 141 ANTIAHELAH 150 (240)
Q Consensus 141 ~~~iaHEiaH 150 (240)
..+|+||++|
T Consensus 97 ~~TLiHE~SH 106 (148)
T PF14521_consen 97 EGTLIHEWSH 106 (148)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHhhhh
Confidence 4699999999
No 93
>PF13265 DUF4056: Protein of unknown function (DUF4056)
Probab=25.75 E-value=31 Score=28.47 Aligned_cols=9 Identities=67% Similarity=1.553 Sum_probs=6.8
Q ss_pred HHhhhhhhCC
Q psy3708 146 HELAHQWFGN 155 (240)
Q Consensus 146 HEiaHqWfG~ 155 (240)
|||| ||||-
T Consensus 154 HEIA-QWyG~ 162 (270)
T PF13265_consen 154 HEIA-QWYGY 162 (270)
T ss_pred HHHH-HHcCc
Confidence 6666 89985
No 94
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=25.49 E-value=77 Score=29.62 Aligned_cols=44 Identities=20% Similarity=0.181 Sum_probs=29.2
Q ss_pred HHHHHHHHhhh---hhhCCC--CCccccchhhhhhhHHHHHHHHhhhhc
Q psy3708 140 IANTIAHELAH---QWFGNL--VTMKWWTDLWLNEGFATYMAAQALNDV 183 (240)
Q Consensus 140 ~~~~iaHEiaH---qWfG~~--v~~~~w~d~WL~EG~a~y~~~~~~~~~ 183 (240)
-..+++||++| +|+.+- -.....-.+++-|--+++.|.+..+.+
T Consensus 378 dv~TLaHElGHa~H~~~~~~~~~~~~~~~~~~~aE~aS~~~E~l~~~~l 426 (591)
T TIGR00181 378 SVFTLAHELGHSMHSYFSSKHQPYPNSDYSIFYAEIASTFNELLLADYL 426 (591)
T ss_pred hHHHHHHHhhhHHHHHHHccCCCCccCCCCchhhhHhHHHHHHHHHHHH
Confidence 34699999997 455431 122223356789999999998876554
No 95
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=25.04 E-value=41 Score=26.12 Aligned_cols=22 Identities=27% Similarity=0.324 Sum_probs=15.7
Q ss_pred HHHHHHHHhhhhhhCCCCCccc
Q psy3708 140 IANTIAHELAHQWFGNLVTMKW 161 (240)
Q Consensus 140 ~~~~iaHEiaHqWfG~~v~~~~ 161 (240)
...++.||++|-+.+...+.+.
T Consensus 7 ~i~i~~HE~gH~~~a~~~G~~~ 28 (192)
T PF02163_consen 7 LISIVLHELGHALAARLYGDKV 28 (192)
T ss_dssp HHHHHHHHHHHHHHHHTTT--B
T ss_pred cccccccccccccccccccccc
Confidence 3468999999999876665554
No 96
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=24.86 E-value=68 Score=22.33 Aligned_cols=17 Identities=35% Similarity=0.565 Sum_probs=12.7
Q ss_pred hHHHHHHHHHHHhhhhh
Q psy3708 136 NEYSIANTIAHELAHQW 152 (240)
Q Consensus 136 ~~~~~~~~iaHEiaHqW 152 (240)
-...+..++.||+||..
T Consensus 69 L~~~I~~tlvhEiah~f 85 (97)
T PF06262_consen 69 LAELIRDTLVHEIAHHF 85 (97)
T ss_dssp HHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 33456779999999973
No 97
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=24.55 E-value=53 Score=28.55 Aligned_cols=32 Identities=16% Similarity=0.146 Sum_probs=19.0
Q ss_pred hhHHHHHHHHHHHhhhhhhCCCCCccccchhh
Q psy3708 135 HNEYSIANTIAHELAHQWFGNLVTMKWWTDLW 166 (240)
Q Consensus 135 ~~~~~~~~~iaHEiaHqWfG~~v~~~~w~d~W 166 (240)
.+..++..+|+||++|-=-...|--..-+..|
T Consensus 119 nd~sEvAAVl~HEmgHVtAnHgi~rQ~~e~a~ 150 (479)
T COG4784 119 NDSSEVAAVLAHEMGHVTANHGIQRQQREAAE 150 (479)
T ss_pred CCHHHHHHHHHhhhhheecchhHHHHHHHHHH
Confidence 33457788999999995443333333333333
No 98
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=23.05 E-value=7.3e+02 Score=24.39 Aligned_cols=108 Identities=9% Similarity=0.114 Sum_probs=53.9
Q ss_pred ceEEEEEeCCccch--------HHHHHHHHHHHHHHHHHHhCCCC-CCCCCCeEEcCC-c--------ceeeEcCCCCCh
Q psy3708 73 VQFRTWARRDVIDQ--------VDFAAEAGPEFLNFFERYFNIRY-PLPKQDMVAIPS-L--------SALLFDDVRSSI 134 (240)
Q Consensus 73 ~~i~v~~~~~~~~~--------~~~~~~~~~~~l~~~e~~~g~~y-p~~~l~~v~~p~-~--------~~~~~~~~~~~~ 134 (240)
-.|.+..++..... +....+.=.+++++|.++.|... |-...+.-.-.. | ++-.|+......
T Consensus 143 ~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~~~~~n~~~kYF~KAD~~G~G~AYY~~~w~a~ 222 (775)
T PF03272_consen 143 DYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSDPVDKNFNNKYFAKADKSGPGAAYYGSNWTAQ 222 (775)
T ss_pred CEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCcccccccccceEEEecCCCCCCccccccceec
Confidence 34666666655433 33455566778888888888322 211111111001 1 334444332111
Q ss_pred hh---HHHH-----HHHHHHHhhhhhhCCCCCcc-ccchhhhhhhHHHHHHHHhhh
Q psy3708 135 HN---EYSI-----ANTIAHELAHQWFGNLVTMK-WWTDLWLNEGFATYMAAQALN 181 (240)
Q Consensus 135 ~~---~~~~-----~~~iaHEiaHqWfG~~v~~~-~w~d~WL~EG~a~y~~~~~~~ 181 (240)
.. ...+ ...+-|||+|.+=+..+... .+.+.|= -=||.++++..+.
T Consensus 223 ss~s~~~~L~~~~~nW~~LHEIgHgYd~~F~~n~~~~~EVWn-NI~~d~yQ~~~~~ 277 (775)
T PF03272_consen 223 SSSSLSFYLNPSPTNWGALHEIGHGYDFGFTRNGTYLNEVWN-NILADRYQYTYMT 277 (775)
T ss_pred CchhHHHHhCCCCCCchhhhhhhhhcceeEeeCCcchhhhhh-hhhhhhhhhhhcC
Confidence 00 0111 24678999999866666332 3445552 3566666665554
No 99
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=22.53 E-value=2.6e+02 Score=22.73 Aligned_cols=12 Identities=33% Similarity=0.545 Sum_probs=9.9
Q ss_pred HHHHHHHHHhhh
Q psy3708 139 SIANTIAHELAH 150 (240)
Q Consensus 139 ~~~~~iaHEiaH 150 (240)
....+-|||++|
T Consensus 88 aAvaVAAHEvGH 99 (222)
T PF04298_consen 88 AAVAVAAHEVGH 99 (222)
T ss_pred HHHHHHHHHHhH
Confidence 445799999999
No 100
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=22.52 E-value=99 Score=29.12 Aligned_cols=44 Identities=23% Similarity=0.287 Sum_probs=31.9
Q ss_pred HHHHHHHHhhh---hhhCCCCCc---cccchhhhhhhHHHHHHHHhhhhcc
Q psy3708 140 IANTIAHELAH---QWFGNLVTM---KWWTDLWLNEGFATYMAAQALNDVH 184 (240)
Q Consensus 140 ~~~~iaHEiaH---qWfG~~v~~---~~w~d~WL~EG~a~y~~~~~~~~~~ 184 (240)
-..+++||++| ++|-.--.+ .++. +-+-|--+++.|.+..+.+.
T Consensus 380 dV~TLaHElGHs~Hs~~s~~~qp~~~~~~~-i~~AEiAS~fnE~l~~~~ll 429 (598)
T COG1164 380 DVFTLAHELGHSVHSYFSRKHQPYLYADYS-IFLAEIASTFNEMLLFDYLL 429 (598)
T ss_pred HHHHHHHHccHHHHHHHHhccCCccccCCc-hHHHHHHHHHHHHHHHHHHH
Confidence 35699999998 777655444 3343 77889999999887766653
No 101
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=22.39 E-value=48 Score=24.90 Aligned_cols=14 Identities=29% Similarity=0.484 Sum_probs=11.2
Q ss_pred HHHHHHHHHHhhhh
Q psy3708 138 YSIANTIAHELAHQ 151 (240)
Q Consensus 138 ~~~~~~iaHEiaHq 151 (240)
.....++.|||+|-
T Consensus 105 ~~~~~~~~HEiGHa 118 (157)
T cd04278 105 TDLFSVAAHEIGHA 118 (157)
T ss_pred chHHHHHHHHhccc
Confidence 35667999999985
No 102
>PRK11767 SpoVR family protein; Provisional
Probab=21.45 E-value=61 Score=29.56 Aligned_cols=46 Identities=20% Similarity=0.175 Sum_probs=30.9
Q ss_pred hhHHHHHHHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhcccc
Q psy3708 135 HNEYSIANTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHIL 186 (240)
Q Consensus 135 ~~~~~~~~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~ 186 (240)
..-++-.-.|+.+.|.-.+...- --=||||.|+|--+.++..++.+
T Consensus 247 e~WQReIl~IVR~ea~YF~PQ~q------TKIMNEGWAsyWH~~im~~l~d~ 292 (498)
T PRK11767 247 EPWQREILRIVRKIAQYFYPQRQ------TQVMNEGWATFWHYTILNHLYDE 292 (498)
T ss_pred chHHHHHHHHHHHHHHhccchhh------hhHHhhHhHHHHHHHHHHHhhhc
Confidence 33344445677777766554432 23589999999999999877543
No 103
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=20.87 E-value=1.4e+02 Score=24.85 Aligned_cols=28 Identities=18% Similarity=0.507 Sum_probs=24.7
Q ss_pred chHHHHHHHHHHHHHHHHHHhCCCCCCC
Q psy3708 85 DQVDFAAEAGPEFLNFFERYFNIRYPLP 112 (240)
Q Consensus 85 ~~~~~~~~~~~~~l~~~e~~~g~~yp~~ 112 (240)
..+..++..+...+...+.++|+|.||+
T Consensus 195 ~~isaALgyvahlv~lls~yL~v~Lpy~ 222 (302)
T PF10186_consen 195 EEISAALGYVAHLVSLLSRYLGVPLPYP 222 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCCCC
Confidence 4567899999999999999999988885
No 104
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=20.28 E-value=19 Score=28.57 Aligned_cols=13 Identities=62% Similarity=0.624 Sum_probs=10.5
Q ss_pred HHHHHHHHhhhhh
Q psy3708 140 IANTIAHELAHQW 152 (240)
Q Consensus 140 ~~~~iaHEiaHqW 152 (240)
...++|||++|.-
T Consensus 140 ~a~~~aHElGH~L 152 (207)
T cd04273 140 SAFTIAHELGHVL 152 (207)
T ss_pred eEEeeeeechhhc
Confidence 3569999999974
No 105
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=20.17 E-value=1.1e+02 Score=24.46 Aligned_cols=20 Identities=40% Similarity=0.451 Sum_probs=16.6
Q ss_pred HHHHHHHHHhhhhhhCCCCC
Q psy3708 139 SIANTIAHELAHQWFGNLVT 158 (240)
Q Consensus 139 ~~~~~iaHEiaHqWfG~~v~ 158 (240)
.+..++.||++|-=+|....
T Consensus 100 A~~fil~HE~~Hv~~~h~~~ 119 (206)
T PF10463_consen 100 AIAFILLHELAHVVLGHEGD 119 (206)
T ss_pred HHHHHHHHHHHHHHHcCccc
Confidence 34579999999999998766
No 106
>PF06744 DUF1215: Protein of unknown function (DUF1215); InterPro: IPR010623 This domain represents a conserved region situated towards the C-terminal end of several hypothetical bacterial proteins of unknown function. A few members resemble the ImcF protein, which has been proposed [] to be involved in Vibrio cholerae cell surface reorganisation that results in increased adherence to epithelial cells line and increased conjugation frequency.
Probab=20.06 E-value=59 Score=23.55 Aligned_cols=29 Identities=28% Similarity=0.534 Sum_probs=20.9
Q ss_pred hhhHHHHHHHhccCCCCCCccE--EEeCCCC
Q psy3708 199 PEFLNFFERYFNIRYPLPKQDM--VAIPDFS 227 (240)
Q Consensus 199 ~~~l~~~~~~~~~~~p~~k~d~--~~~~~~~ 227 (240)
.++..+|++.+..|||+.+... +++.||.
T Consensus 41 ~~V~~~~~~~i~gRYPF~~~s~~dv~l~Df~ 71 (125)
T PF06744_consen 41 AEVYPFCRQAIAGRYPFDPDSSRDVSLADFA 71 (125)
T ss_pred HHHHHHHHHHhcCCCCCCCCCcccCCHHHHH
Confidence 3456788899999999985553 6666655
No 107
>PF08696 Dna2: DNA replication factor Dna2; InterPro: IPR014808 Dna2 is a DNA replication factor with single-stranded DNA-dependent ATPase, ATP-dependent nuclease, (5'-flap endonuclease) and helicase activities. It is required for Okazaki fragment processing and is involved in DNA repair pathways []. ; GO: 0003677 DNA binding, 0004003 ATP-dependent DNA helicase activity, 0005524 ATP binding, 0006260 DNA replication
Probab=20.03 E-value=1.6e+02 Score=23.52 Aligned_cols=68 Identities=19% Similarity=0.174 Sum_probs=35.5
Q ss_pred HHHHHHhhhhhhCCCCCccccchhhhhhhHHHHHHHHhhhhccccc---chhhhhHhhcchhhHHHHHHHhccCCC
Q psy3708 142 NTIAHELAHQWFGNLVTMKWWTDLWLNEGFATYMAAQALNDVHILF---DSQVDFAAEAGPEFLNFFERYFNIRYP 214 (240)
Q Consensus 142 ~~iaHEiaHqWfG~~v~~~~w~d~WL~EG~a~y~~~~~~~~~~~~~---~~~~~~~~~~~~~~l~~~~~~~~~~~p 214 (240)
..|.||+.+.-+ ..+.|+..++.+-....++ .++..+|.-. .....-+....+.+.++.+++...+-+
T Consensus 94 GtIvHelfQ~~l----~~~~~~~~~l~~~~~~~l~-~~~~~ly~~~~~~~~~~~~l~~~~~~i~~w~~~~~~~~~~ 164 (209)
T PF08696_consen 94 GTIVHELFQKAL----RTNDFDLEFLEELADRILE-KYLEELYALGETEDEAREELEEYLPNIESWAKQYVKKSPS 164 (209)
T ss_pred eeeHHHHHHHHH----hccCcchHHHHHHHHHHHH-HHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHccCCCC
Confidence 466777665544 6677777777666665555 4444443211 111122233333444567776665443
Done!