Query psy3038
Match_columns 499
No_of_seqs 179 out of 404
Neff 4.7
Searched_HMMs 46136
Date Fri Aug 16 23:54:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3038.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3038hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01401 Peptidase_M2: Angiote 100.0 6E-153 1E-157 1240.4 25.3 434 2-477 55-595 (595)
2 KOG3690|consensus 100.0 7E-144 1E-148 1141.9 31.0 445 1-481 72-628 (646)
3 cd06461 M2_ACE Peptidase famil 100.0 5E-114 1E-118 917.0 29.0 357 63-461 119-477 (477)
4 COG2317 Zn-dependent carboxype 100.0 2.2E-41 4.7E-46 353.8 19.2 266 123-451 202-492 (497)
5 PF02074 Peptidase_M32: Carbox 100.0 4.5E-42 9.8E-47 366.9 10.4 269 119-451 198-491 (494)
6 cd06460 M32_Taq Peptidase fami 100.0 2.5E-37 5.3E-42 323.2 18.6 268 121-451 100-392 (396)
7 cd06258 Peptidase_M3_like The 100.0 4.4E-33 9.5E-38 284.8 22.5 290 63-450 54-365 (365)
8 cd06459 M3B_Oligoendopeptidase 99.9 2.8E-21 6E-26 200.3 19.7 257 123-449 152-422 (427)
9 TIGR02289 M3_not_pepF oligoend 99.7 2.8E-16 6E-21 170.7 20.6 267 124-459 268-547 (549)
10 TIGR02290 M3_fam_3 oligoendope 99.4 4.6E-12 1E-16 138.8 20.7 226 165-459 349-583 (587)
11 COG1164 Oligoendopeptidase F [ 99.4 3.8E-11 8.3E-16 132.5 20.4 231 165-462 354-594 (598)
12 PF01432 Peptidase_M3: Peptida 99.2 1.2E-10 2.6E-15 123.5 12.9 230 164-450 198-456 (458)
13 TIGR00181 pepF oligoendopeptid 99.1 1.2E-09 2.5E-14 119.9 17.4 229 165-459 353-587 (591)
14 cd06455 M3A_TOP Peptidase M3 T 99.1 2.8E-09 6E-14 114.5 19.7 274 114-449 145-471 (472)
15 cd06456 M3A_DCP_Oligopeptidase 99.0 1.2E-08 2.7E-13 108.5 19.3 200 188-450 205-420 (422)
16 PRK10280 dipeptidyl carboxypep 98.6 8.2E-07 1.8E-11 99.9 16.2 203 188-449 461-676 (681)
17 cd06457 M3A_MIP Peptidase M3 m 98.6 7.6E-07 1.7E-11 95.6 15.2 199 188-446 245-454 (458)
18 PRK10911 oligopeptidase A; Pro 98.6 1.1E-06 2.4E-11 98.9 15.7 202 188-450 460-676 (680)
19 COG0339 Dcp Zn-dependent oligo 98.4 7.1E-06 1.5E-10 91.4 15.5 208 189-449 465-678 (683)
20 KOG2090|consensus 98.0 7.3E-05 1.6E-09 82.3 13.1 131 290-448 546-687 (704)
21 KOG2089|consensus 97.9 0.00019 4.1E-09 79.8 13.6 205 189-451 493-707 (718)
22 PF13485 Peptidase_MA_2: Pepti 49.6 14 0.0003 31.1 2.5 50 176-234 8-59 (128)
23 PF00413 Peptidase_M10: Matrix 46.9 11 0.00023 33.9 1.5 18 190-210 104-121 (154)
24 cd00203 ZnMc Zinc-dependent me 46.5 27 0.00059 31.6 4.0 77 127-203 23-108 (167)
25 PF06114 DUF955: Domain of unk 45.4 26 0.00056 29.2 3.5 30 178-207 28-58 (122)
26 cd04268 ZnMc_MMP_like Zinc-dep 41.8 16 0.00034 33.3 1.7 14 190-203 93-106 (165)
27 PF14247 DUF4344: Domain of un 39.2 35 0.00075 34.3 3.8 49 165-216 45-115 (220)
28 cd04279 ZnMc_MMP_like_1 Zinc-d 37.4 30 0.00065 31.7 2.9 13 191-203 104-116 (156)
29 PF04700 Baculo_gp41: Structur 37.3 83 0.0018 31.0 5.8 60 324-388 6-78 (186)
30 cd04278 ZnMc_MMP Zinc-dependen 36.7 16 0.00034 33.6 0.9 14 190-203 106-119 (157)
31 cd04277 ZnMc_serralysin_like Z 35.3 23 0.00049 33.4 1.7 16 189-204 111-126 (186)
32 PF13398 Peptidase_M50B: Pepti 34.2 25 0.00055 34.1 1.9 22 189-210 20-41 (200)
33 PF13582 Reprolysin_3: Metallo 32.9 22 0.00047 30.9 1.1 12 192-203 108-119 (124)
34 smart00235 ZnMc Zinc-dependent 32.2 23 0.00051 31.6 1.2 26 176-203 73-98 (140)
35 PRK04897 heat shock protein Ht 31.2 94 0.002 32.1 5.6 67 128-204 80-150 (298)
36 PF14891 Peptidase_M91: Effect 30.7 35 0.00077 32.3 2.2 19 189-207 101-119 (174)
37 PF13574 Reprolysin_2: Metallo 30.6 34 0.00074 32.4 2.1 13 191-203 111-123 (173)
38 PRK03982 heat shock protein Ht 30.5 86 0.0019 32.1 5.1 64 128-204 68-138 (288)
39 COG2856 Predicted Zn peptidase 29.4 49 0.0011 33.0 3.0 27 177-204 59-85 (213)
40 smart00731 SprT SprT homologue 28.9 36 0.00077 31.3 1.9 30 178-207 41-75 (146)
41 PRK05457 heat shock protein Ht 28.8 1.1E+02 0.0023 31.6 5.5 65 129-204 78-147 (284)
42 PF09345 DUF1987: Domain of un 26.2 61 0.0013 28.6 2.8 23 447-469 24-46 (99)
43 cd04271 ZnMc_ADAM_fungal Zinc- 25.8 21 0.00046 35.5 -0.2 14 190-203 144-157 (228)
44 PRK03001 M48 family peptidase; 25.8 1.2E+02 0.0027 30.8 5.3 65 129-204 68-137 (283)
45 PF02031 Peptidase_M7: Strepto 25.0 51 0.0011 30.8 2.1 24 180-203 64-89 (132)
46 PF13688 Reprolysin_5: Metallo 24.9 46 0.001 31.3 1.9 16 188-203 139-154 (196)
47 PF10216 ChpXY: CO2 hydration 24.9 40 0.00086 35.7 1.5 63 265-340 15-79 (353)
48 cd04270 ZnMc_TACE_like Zinc-de 24.7 38 0.00083 33.9 1.4 12 192-203 168-179 (244)
49 cd04327 ZnMc_MMP_like_3 Zinc-d 24.4 46 0.001 32.0 1.8 15 190-204 91-105 (198)
50 PRK03072 heat shock protein Ht 21.6 1.6E+02 0.0034 30.4 5.1 37 303-342 201-237 (288)
51 KOG3728|consensus 20.9 2E+02 0.0044 29.7 5.5 85 124-225 58-145 (308)
52 PRK02870 heat shock protein Ht 20.6 1.7E+02 0.0036 31.2 5.2 68 127-204 114-186 (336)
53 PRK02391 heat shock protein Ht 20.5 1.8E+02 0.004 30.1 5.4 64 129-204 77-146 (296)
No 1
>PF01401 Peptidase_M2: Angiotensin-converting enzyme This Prosite motif covers only the active site.; InterPro: IPR001548 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M2 (clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The catalytic residues and zinc ligands have been identified, the zinc ion being ligated to two His residues within the motif HEXXH, showing that the enzyme belongs to the E sub-group of metalloproteases []. Pepetidyl-dipeptidase A (angiotensin-converting enzyme) is a mammalian enzyme responsible for cleavage of dipeptides from the C-termini of proteins, notably converting angiotensin I to angiotensin II []. The enzyme exists in two differentially transcribed forms, the most common of which is from lung endothelium; this contains two homologous domains that have arisen by gene duplication []. The testis-specific form contains only the C-terminal domain, arising from a duplicated promoter region present in intron 12 of the gene []. Both enzymatic forms are membrane proteins that are anchored by means of a C-terminal transmembrane domain. Both domains of the endothelial enzyme are active, but have differing kinetic constants []. ]. A number of insect enzymes have been shown to be similar to peptidyl-dipeptidase A, these containing a single catalytic domain.; GO: 0008237 metallopeptidase activity, 0008241 peptidyl-dipeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 2YDM_A 3BKL_A 2C6N_B 1UZE_A 3BKK_A 2C6F_B 2IUX_A 2IUL_A 2XYD_B 3NXQ_B ....
Probab=100.00 E-value=5.9e-153 Score=1240.40 Aligned_cols=434 Identities=56% Similarity=1.060 Sum_probs=377.5
Q ss_pred ccchHHHHHHHHHHHHhccCCCCCCCCHHHHHHHHHHhhhcccCCCHHHHHH-----------------------HHhc-
Q psy3038 2 NKSAEHAQFTKDQWKELIKYPWQTYSNQDLRRQYKMYSILGSAALPYEKFQK-----------------------CLKA- 57 (499)
Q Consensus 2 e~~~~~~~f~~~~~~~a~~f~~~~~~d~~~kRql~~l~~~g~~aL~~~~~~~-----------------------Cl~l- 57 (499)
+++++.++|.++.+++|++|++..++|+++||||++|+.+|+++||++++++ |+.|
T Consensus 55 ea~~~~a~F~k~~~~~AkkFd~~~~~d~~lkRql~~L~~lG~AaL~~~k~~el~~i~s~M~~iYst~kvC~~~~~cl~Le 134 (595)
T PF01401_consen 55 EASLEYAKFQKEYAKEAKKFDWTKFKDDTLKRQLKKLSKLGPAALPPEKLEELNKILSEMESIYSTAKVCPYDGKCLSLE 134 (595)
T ss_dssp HHHHHHHHHHHHHHHHHTTSHGGGGSSHHHHHHHHHHTS-GGGGSTHHHHHHHHHHHHHHHHHHHH-EEEESSSEEEETT
T ss_pred HHHHHHHHHHHHHHHHHHhcCcccCCCHHHHHHHHHHhhcccccCCHHHHHHHHHHHHHHHHHhccccccCCCCccccch
Confidence 5788999999999999999999999999999999999999999999999888 5555
Q ss_pred ---------------------------cc---------------------------------------------------
Q psy3038 58 ---------------------------NV--------------------------------------------------- 59 (499)
Q Consensus 58 ---------------------------g~--------------------------------------------------- 59 (499)
|+
T Consensus 135 Pdl~~ima~Srd~~eL~~~W~~Wr~~vg~~~r~~y~~~V~L~N~aA~~nG~~d~g~~Wr~~ye~~~~~~~~~~lw~~i~P 214 (595)
T PF01401_consen 135 PDLEEIMATSRDYDELLYAWEGWRDAVGPPMRPLYERYVELSNEAARLNGFKDTGEYWRSSYEMPNFEQELERLWQQIKP 214 (595)
T ss_dssp THHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSSSHHHHHHGGG-CTTHHHHHHHHHHHHHH
T ss_pred hhHHHHHhccCCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccccHHHHHHHHHHHHHH
Confidence 11
Q ss_pred Cc-c-chhhchhhHhhhCCCcccCCCCCccccccchhhhcccccccccccCCCCCCCCchHHHHHcCCCHHHHHHHHHHH
Q psy3038 60 QY-R-ETFDRRRLNEKYGDKVVNRRGPIPAHLLGNMWAQTWGNIYDIVVPYPEKTPPDVSAELVRQGYTVHKMFRTAEEF 137 (499)
Q Consensus 60 ~y-~-hayvR~~L~~~Yg~~~v~~~gpiPAHLlGnmw~q~W~~~~~~~~P~p~~~~~dvt~~l~~q~yt~~~mf~~ae~f 137 (499)
.| | |||||++|+++||+++|+.+||||||||||||||+|+|||++++|||+++.+|||++|++||||+++||++||+|
T Consensus 215 LY~~LHayVR~~L~~~Yg~~~v~~~gpIPAHLLGnmwaq~W~ni~~~~~P~p~~~~~dvt~~l~~q~~t~~~mf~~ae~f 294 (595)
T PF01401_consen 215 LYKQLHAYVRRKLREKYGDDVVPPDGPIPAHLLGNMWAQSWSNIYDLVLPYPDKPSLDVTEELVKQGYTAKKMFKTAEEF 294 (595)
T ss_dssp HHHHHHHHHHHHHHHHHTTTTS-TTS-EEGGGSSSTTSS-GGGGHHHH-STTTS--S-THHHHHHTT--HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHcCcccCCCCCCccHHHHHHHHHHHHhhhhccccCCCCCCCCCcHHHHHHccccHHHHHHHHHHH
Confidence 12 5 999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHcCCCCCChHHhhhcccccCCC-CcccccCcccccCCCCcceeeecCCCChhhHHHHHHHHHHHHHHHHhcCCCCccc
Q psy3038 138 FTSINMSAMPPEFWERSMLEKPQG-REVVCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHHEMGHVEYFIQYKDQPMAFR 216 (499)
Q Consensus 138 f~slGl~~mp~~FW~~S~~~kp~~-r~~~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hHEmGHi~Y~~~y~~qP~~fr 216 (499)
|+||||++||++||++|||+||.+ |+++|||||||||+++||||||||++|++||.|+|||||||+||++|++||++||
T Consensus 295 f~SlGl~~mp~~FW~~S~~~kp~d~r~~~Chasawdf~~~~d~Rik~Ct~v~~~df~t~hhemghi~Y~~~y~~qp~~fr 374 (595)
T PF01401_consen 295 FTSLGLPPMPPTFWEKSMFEKPTDGRDVVCHASAWDFYNGDDFRIKMCTEVTMEDFLTAHHEMGHIQYYMQYKDQPVLFR 374 (595)
T ss_dssp HHHTTS----HHHHHHSB-S--SSSS----S-EEEEESSSSEEEEE----SSHHHHHHHHHHHHHHHHHHHTTTS-GGGS
T ss_pred HHhCCCCcCCHHHHHHHHhcCCCCCCCCCcCcchhhccCCCccceeecccCCHHHHHHHHHHHHHHHHHHHHhhCChhhh
Confidence 999999999999999999999987 9999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCCcCccccccccccccCCCchhhhHHHHHHHh
Q psy3038 217 EGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINSDKTIFMVEAIFLF 296 (499)
Q Consensus 217 ~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~ 296 (499)
+|||||||| ||||+ ||| |++ ||+|| +++||+++..++.+ .+||+||
T Consensus 375 ~gan~gfhE----Aigd~----------ial------s~~--------tp~hL---~~~gLl~~~~~~~~---~~in~L~ 420 (595)
T PF01401_consen 375 EGANPGFHE----AIGDT----------IAL------SVS--------TPKHL---KRIGLLDNVTDDEE---SDINFLL 420 (595)
T ss_dssp S-SSHHHHH----HHHHH----------HHH------HHT--------SHHHH---HHTTSSSTTTTSHH---HHHHHHH
T ss_pred cCCCchHHH----HHHHH----------HHH------HcC--------CHHHH---HHcCCCCCCCcchH---HHHHHHH
Confidence 999999999 99999 999 999 99999 99999998888888 8999999
Q ss_pred --ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceecCCCchhHHHHH
Q psy3038 297 --KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVASVPYIRYFVSF 374 (499)
Q Consensus 297 --aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~~~pYirYfl~~ 374 (499)
||+||+|||| +|++|+|||+||+|+|++++||++||+||++||||+||++|++.||||||||||++|+||||||||+
T Consensus 421 ~~AL~ki~~lPf-~~~~d~wRw~vf~g~i~~~~~n~~wW~lr~~yqGi~pP~~r~~~~fD~~ak~hi~~~~py~ryfls~ 499 (595)
T PF01401_consen 421 KQALDKIAFLPF-GYLMDKWRWDVFSGSIPPDQYNEAWWELREKYQGIVPPVERSESDFDPGAKYHIPANTPYIRYFLST 499 (595)
T ss_dssp HHHHHHCCHHHH-HHHHHHHHHHHHTTSS-CCGHHHHHHHHHHHHHSEE-SS-EETTS-GGGGSHHHHTT--SHHHHHHH
T ss_pred HHhhhHHHhccH-HHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHcCCccCCCCCCcccCCcchhhhhccccccHHHHHHH
Confidence 9999999999 9999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHHHHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHHHHH
Q psy3038 375 VIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKNMLAMGSSKPWPDAMEAITGQREMDASALLQYFEPLY 454 (499)
Q Consensus 375 ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~PL~ 454 (499)
|||||||++||+++||. +|||+||||+||+||++|++||++|+|+||+|||+.+||+++||++|||+||+||+
T Consensus 500 ilqFQ~~~~lC~~ag~~-------~pLh~Cdiy~s~~aG~~l~~ml~~G~S~~W~~~l~~~tg~~~~~a~~ll~YF~PL~ 572 (595)
T PF01401_consen 500 ILQFQFYKALCEAAGHE-------GPLHRCDIYGSKEAGRKLKKMLSLGSSKPWPEALEELTGEREMSASALLEYFQPLM 572 (595)
T ss_dssp HHHHHHHHHHHHHTTTT-------SSGGG---TT-HHHHHHHHHHHTTTTSS-HHHHHHHHHSSSS--SHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhcCCC-------CCccccccccchhhHHHHHHHhcCCCCCCHHHHHHHHhCCCCcChHHHHHHHHHHH
Confidence 99999999999999997 99999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhhcCCccCCCCCCCcc
Q psy3038 455 KWLEEENARTGEHIGWEATDKKV 477 (499)
Q Consensus 455 ~WL~~~N~~~~~~iGW~~~~~~~ 477 (499)
+||++||++++++|||+.+.|+|
T Consensus 573 ~wL~~~n~~~~~~~GW~~~~w~~ 595 (595)
T PF01401_consen 573 DWLKEQNKKNGECVGWDEYNWTP 595 (595)
T ss_dssp HHHHHHHHHTTS-SS-CSTT--S
T ss_pred HHHHHHHHhcCCcCCCCcCCcCc
Confidence 99999999999999999888875
No 2
>KOG3690|consensus
Probab=100.00 E-value=6.6e-144 Score=1141.85 Aligned_cols=445 Identities=46% Similarity=0.907 Sum_probs=428.0
Q ss_pred CccchHHHHHHHHHHHHhccCCCCCCCCHHHHHHHHHHhhhcccCCCHHHHHH--------------------------H
Q psy3038 1 MNKSAEHAQFTKDQWKELIKYPWQTYSNQDLRRQYKMYSILGSAALPYEKFQK--------------------------C 54 (499)
Q Consensus 1 ~e~~~~~~~f~~~~~~~a~~f~~~~~~d~~~kRql~~l~~~g~~aL~~~~~~~--------------------------C 54 (499)
+|+..++++|.+..+.+|++|+..+++|+++|||+++++++|++||+++++++ |
T Consensus 72 ~ea~~~~~~~~r~~a~q~k~~~~~~l~D~~~rRql~~~s~lg~saL~~~d~~~~~~ll~~m~~~y~~~~vC~y~~p~~~~ 151 (646)
T KOG3690|consen 72 LEAEDKLVEFVRSLATQAKKFDVAELQDERLRRQLQKVSQLGMSALSPDDLKRYNQLLSSMSTNYNTATVCAYDQPSACC 151 (646)
T ss_pred HHHHHHHHHHHHHHHHHHhhccHhhccCHHHHHHHHHHhhhccccCCHHHHHHHHHHHHHHHhhcCCceeeCCCCCCCcc
Confidence 36788999999999999999999999999999999999999999999999999 3
Q ss_pred Hhc----------------------------cc-----------------------------------------------
Q psy3038 55 LKA----------------------------NV----------------------------------------------- 59 (499)
Q Consensus 55 l~l----------------------------g~----------------------------------------------- 59 (499)
|.| |+
T Consensus 152 L~l~P~l~~i~~~Srd~~eL~~~W~~wr~~~g~~~r~~y~~yv~L~~e~A~lNg~~~~gdyW~~~yE~~~~~~~~ld~i~ 231 (646)
T KOG3690|consen 152 LLLEPQLQSIMANSRDYDELQWYWVEWRRKIGKAMRQSYEEYVDLSNEAAQLNGYVNGGDYWRLAYETPGDFEQDLDAIF 231 (646)
T ss_pred eecCccHHHHHhccCCHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCchhHHHHHHHHH
Confidence 544 11
Q ss_pred -----Cc-c-chhhchhhHhhhCCCcccCCCCCccccccchhhhcccccccccccCCCCCCCCchHHHHHcCCCHHHHHH
Q psy3038 60 -----QY-R-ETFDRRRLNEKYGDKVVNRRGPIPAHLLGNMWAQTWGNIYDIVVPYPEKTPPDVSAELVRQGYTVHKMFR 132 (499)
Q Consensus 60 -----~y-~-hayvR~~L~~~Yg~~~v~~~gpiPAHLlGnmw~q~W~~~~~~~~P~p~~~~~dvt~~l~~q~yt~~~mf~ 132 (499)
.| + ||||||+|++.||++.|+++||||||||||||||+|+|+|++++|||+++.+|||.+|++||||+++||+
T Consensus 232 ~ei~PlY~~LHAYvRr~L~~~Ygp~~i~~~gpiPahLlgnm~gq~Ws~~y~~~~P~pe~~~~dvt~~m~~qg~~~qkmF~ 311 (646)
T KOG3690|consen 232 EEIRPLYRQLHAYVRRKLRGAYGPDGISRDGPIPAHLLGNMWGQDWSNHYDLTTPFPERPLIDVTMEMVKQGYTVQKMFK 311 (646)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCcccCCCCCCcCHHHHhhhhccchHhhhhcccCCCCCCCCchHHHHHHccccHHHHHH
Confidence 22 4 9999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHcCCCCCChHHhhhcccccCCCCc-ccccCcccccCCCCcceeeecCCCChhhHHHHHHHHHHHHHHHHhcCC
Q psy3038 133 TAEEFFTSINMSAMPPEFWERSMLEKPQGRE-VVCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHHEMGHVEYFIQYKDQ 211 (499)
Q Consensus 133 ~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~-~~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hHEmGHi~Y~~~y~~q 211 (499)
+|++||+||||+.||++||++||+++|++++ ++|||+||||+++.||||||||+++++||+++|||||||||+++|++|
T Consensus 312 ~a~efF~SlGl~~lp~sfw~~s~~~~p~d~~~v~C~~sAwdf~~~~Dfrik~Ct~vd~~~f~~~Hhem~hiQY~l~y~~q 391 (646)
T KOG3690|consen 312 LAAEFFTSLGLPALPPSFWTNSILTRPDDRDMVVCHASAWDFYAGPDFRIKYCTKVDEEDFEQAHHEMGHIQYYLQYKQQ 391 (646)
T ss_pred HHHHHHHHcCCCcCChHHHhHHHhcCCCCCCceeechhhhccccCCCceeeecCcCCHHHHHHHHHHHHHHHHHHHHHhc
Confidence 9999999999999999999999999999965 899999999999999999999999999999999999999999999999
Q ss_pred CCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCCcCccccccccccccCCC-chhhhHH
Q psy3038 212 PMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINS-DKTIFMV 290 (499)
Q Consensus 212 P~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~-d~e~f~~ 290 (499)
|++||+|||||||| ||||+ ||| |++ ||+|| +++||+.+..+ +.+ .
T Consensus 392 P~llre~anPgFhe----AIgd~----------~al------s~s--------tPrhL---~slgLL~~~~~~d~~---~ 437 (646)
T KOG3690|consen 392 PFLLREGANPGFHE----AIGDA----------FAL------SVS--------TPRHL---HSLGLLSEDLDEDDE---V 437 (646)
T ss_pred cHHhhCCCCCcHHH----HHHHH----------HHH------hcC--------CHHHH---HHhcccccccccchH---H
Confidence 99999999999999 99999 999 999 99999 99999985443 336 8
Q ss_pred HHHHHh--ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceecCCCch
Q psy3038 291 EAIFLF--KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVASVPYI 368 (499)
Q Consensus 291 ~IN~L~--aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~~~pYi 368 (499)
+||+|+ ||++|++||| +|++|+|||+||+|+|+++++||+||+||.+|+||+||++|++.||||+||+|+..++||+
T Consensus 438 ~IN~L~k~aL~~v~~LPf-~y~~DkwR~~Vf~G~i~k~~~N~~~W~lr~ky~Gv~~P~~r~~~dfD~~akf~~~~d~~~~ 516 (646)
T KOG3690|consen 438 RINRLFKMALDKVAFLPF-TYALDKWRYEVFDGEIPKDQYNCAYWELRNKYEGVRPPVPRTNKDFDPPAKFHIQVDVSYT 516 (646)
T ss_pred HHHHHHHHHHHHHhhhhh-hhhhhhhHHHHhcCCCCcchhhHHHHHHHHHhcCcCCCCcCccccCCchhhhccCcchHHH
Confidence 999999 9999999999 9999999999999999999999999999999999999999999999999999999999999
Q ss_pred hHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHHHHhCCCCCChHHHHHHHhCCCCCCHHHHHH
Q psy3038 369 RYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKNMLAMGSSKPWPDAMEAITGQREMDASALLQ 448 (499)
Q Consensus 369 rYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W~e~le~~TG~~~lsa~~ll~ 448 (499)
|||+|+|||||||++||+++|++.|+||. .|||+||||+||+||++|++||++|+|.||+||||.+||+++||+++||+
T Consensus 517 ryf~s~vlqFQf~kalC~~agq~~pg~pr-~pLh~CDi~~sk~aG~kL~~~M~lG~S~~W~evLE~~tGe~els~~~LLe 595 (646)
T KOG3690|consen 517 RYFLSTVLQFQFLKALCQAAGQYGPGDPR-KPLHNCDIYDSKEAGEKLRTMMSLGSSRHWREVLEEITGEGELSGRALLE 595 (646)
T ss_pred HHHHHHHHHHHHHHHHHHHhcccCCCCCC-CCchhccccCCHHHHHHHHHHHHccCCccHHHHHHHHcCCCcCChHHHHH
Confidence 99999999999999999999999999999 99999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHhhcCCccCCCCCCCccccCh
Q psy3038 449 YFEPLYKWLEEENARTGEHIGWEATDKKVFRSD 481 (499)
Q Consensus 449 YF~PL~~WL~~~N~~~~~~iGW~~~~~~~~~~~ 481 (499)
||+||++||+++|+++++.|||+++++++....
T Consensus 596 YFePL~~WLk~~N~~~~~~vGW~~~~~~~~d~~ 628 (646)
T KOG3690|consen 596 YFEPLFTWLKQENERLNVEVGWDGEPKTRQDGD 628 (646)
T ss_pred HHHHHHHHHHhhhhhcCcccCCCCCcCCCCCCc
Confidence 999999999999999999999999988877544
No 3
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=100.00 E-value=5e-114 Score=916.97 Aligned_cols=357 Identities=61% Similarity=1.161 Sum_probs=349.9
Q ss_pred chhhchhhHhhhCCCcccCCCCCccccccchhhhcccccccccccCCCCCCCCchHHHHHcCCCHHHHHHHHHHHHHHcC
Q psy3038 63 ETFDRRRLNEKYGDKVVNRRGPIPAHLLGNMWAQTWGNIYDIVVPYPEKTPPDVSAELVRQGYTVHKMFRTAEEFFTSIN 142 (499)
Q Consensus 63 hayvR~~L~~~Yg~~~v~~~gpiPAHLlGnmw~q~W~~~~~~~~P~p~~~~~dvt~~l~~q~yt~~~mf~~ae~ff~slG 142 (499)
|+|||++|...||.++++++|||||||+||||+|+|+|||+++.|||+++.+||+++|++|+||+++||+.+++||+|||
T Consensus 119 h~~~r~~L~~~y~~~~~d~~~~iPahl~gn~~~q~W~~~~~~~~p~p~~~~~dv~~~l~~~~~t~~~mf~~~~~ff~smg 198 (477)
T cd06461 119 HAYVRRKLRKKYGDDVVNRDGPIPAHLLGNMWAQSWSNIYDLVKPYPGKPLLDVTDAMVEQGYTAKRMFKEAEEFFTSLG 198 (477)
T ss_pred HHHHHHHHHHhcccccCCCCCCCccccccCccccccccccccccCCCCCCCCChHHHHHHcCCCHHHHHHHHHHHHHHCC
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCChHHhhhcccccCCCCcccccCcccccCCCCcceeeecCCCChhhHHHHHHHHHHHHHHHHhcCCCCcccCCCCCC
Q psy3038 143 MSAMPPEFWERSMLEKPQGREVVCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHHEMGHVEYFIQYKDQPMAFREGANPG 222 (499)
Q Consensus 143 l~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hHEmGHi~Y~~~y~~qP~~fr~gAn~g 222 (499)
|..||++||++||++||.++++.||+|||||++.+|+||+||+++|++||.|+|||||||+||++|++||++||.|||||
T Consensus 199 L~dl~~~fw~~s~~~rp~k~~~~C~~sawd~~~~~d~rI~~c~~~t~~D~~t~~HE~GH~~yy~~y~~~p~~~r~~anp~ 278 (477)
T cd06461 199 LPPMPPSFWTKSMLEKPTDREVVCHASAWDFYNGKDFRIKMCTKVNMEDFVTVHHEMGHIQYYLQYKDQPVLFREGANPG 278 (477)
T ss_pred CCcCChHHHhhccccCCCCCCCccCcccccCCCCCCcceeeCCCCCHHHHHHHHHHHHHHHHHHHhccCCHHHhCCCCCC
Confidence 99999999999999999989999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCCcCccccccccccccCCCchhhhHHHHHHHh--cccc
Q psy3038 223 KNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINSDKTIFMVEAIFLF--KIDT 300 (499)
Q Consensus 223 fhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~--aL~~ 300 (499)
||| ||||+ +|| |++ ||+|| .++|+++....+.+ .+||+|+ +|++
T Consensus 279 fhe----av~e~----------~sm------S~~--------tpe~L---~~~~ll~~~~~~~~---~~i~~l~~~al~~ 324 (477)
T cd06461 279 FHE----AVGDA----------IAL------SVS--------TPKHL---HKIGLLDSEVDDEE---ADINFLLKMALDK 324 (477)
T ss_pred hHH----HHHHH----------HHH------hcC--------CHHHH---hhcccccccCCCcH---HHHHHHHHHHHHH
Confidence 999 99999 999 999 99999 99999997645555 6899999 9999
Q ss_pred ccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceecCCCchhHHHHHHHHHHH
Q psy3038 301 FLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVASVPYIRYFVSFVIQFQF 380 (499)
Q Consensus 301 i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~~~pYirYfl~~ilqfQf 380 (499)
|+|||| ++++|+|||.||+|+++++++|++||+|+++||||+||++|++.+||||||+|++.++|||+||||+|+||||
T Consensus 325 i~fLP~-~~~vD~fr~~vy~~~~~~~e~n~~WweL~~~yqgv~pP~~r~~~~fd~gak~h~~~~~PYi~YflA~ilqfQf 403 (477)
T cd06461 325 IAFLPF-GYLLDKWRWDVFDGEIPKDDYNKAWWELREKYQGVVPPVPRSEEDFDPGAKYHVPANTPYIRYFLSFILQFQF 403 (477)
T ss_pred HHHhhH-HHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhcCCCCCCCCCccccCccceeeccCCCcHHHHHHHHHHHHHH
Confidence 999999 9999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHHHHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH
Q psy3038 381 HRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKNMLAMGSSKPWPDAMEAITGQREMDASALLQYFEPLYKWLEEE 460 (499)
Q Consensus 381 ~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~PL~~WL~~~ 460 (499)
|++||+++||. +|||+||||+|++||+.|++||++|+|+||+|+||.+||+++||+++|++||+||.+||+++
T Consensus 404 ~~~lc~~a~~~-------~plh~C~~~~s~~Ag~~l~~~l~lG~S~~w~e~le~~~g~~~~s~~~l~eyf~pl~~wL~~~ 476 (477)
T cd06461 404 HKALCKAAGHT-------GPLHKCDIYGSKEAGKKLRAMLSLGSSKPWPEALEALTGEREMDASALLEYFQPLIDWLKEQ 476 (477)
T ss_pred HHHHHHhccCC-------CCcccccccChHHHHHHHHHHHhCcCCCCHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHhh
Confidence 99999999997 99999999999999999999999999999999999999999999999999999999999998
Q ss_pred H
Q psy3038 461 N 461 (499)
Q Consensus 461 N 461 (499)
|
T Consensus 477 n 477 (477)
T cd06461 477 N 477 (477)
T ss_pred C
Confidence 7
No 4
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=100.00 E-value=2.2e-41 Score=353.83 Aligned_cols=266 Identities=17% Similarity=0.236 Sum_probs=244.1
Q ss_pred cCCCHHHHHHHHHHHHHHcCCCCCChHHhhhcccccCCC-CcccccCcccccCCCCcceeeecCCCC--hhhHHHHHHHH
Q psy3038 123 QGYTVHKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQG-REVVCHASAWDFHDGKDFRIKMCTRVN--EEDLFTIHHEM 199 (499)
Q Consensus 123 q~yt~~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~-r~~~CH~sawdf~~~~D~RIk~Ct~~~--~~d~~t~hHEm 199 (499)
-.|+..+|.++++.+..-||||. ..| .|++.||||..+. ++|||||++.+.+ .+.+++++||.
T Consensus 202 ~~~~~e~q~~~~~~vl~~lGfdf-------------~~GRlD~S~HPF~~g~~-~~DVRITTRy~~~df~~aL~g~iHE~ 267 (497)
T COG2317 202 EKYDKEKQEALGLRVLELLGFDF-------------ERGRLDVSVHPFTTGLP-INDVRITTRYNEQDFRSALFGTIHET 267 (497)
T ss_pred CCCCHHHHHHHHHHHHHHhCCCc-------------cCccccCCCCCccCCCC-CCceeEEeecCCccHHHHHHHHHHhh
Confidence 37999999999999999999998 677 5999999999998 8999999999887 78999999999
Q ss_pred HHHHHHHHhc--CCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHH----HHhhhhcCCCcCccccc
Q psy3038 200 GHVEYFIQYK--DQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVM----RLLAMQKFVPFPLVINR 273 (499)
Q Consensus 200 GHi~Y~~~y~--~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~----~~~~~q~~tP~hL~~~~ 273 (499)
||+.|+++.. -..+++.+|+++|+|| ++|+. ||| .||||.+ .||.+++++|..+ .
T Consensus 268 GHAlYEqn~~~~l~gtPlg~g~smgiHE----SQSrF---~En---------~VGRs~aFw~~~~~~~~~~~~~~~---~ 328 (497)
T COG2317 268 GHALYEQNLDEALLGTPLGEGRSMGIHE----SQSRF---WEN---------QVGRSRAFWEAIYPLLRKHFPEQF---D 328 (497)
T ss_pred hhHHHHcCCCHHHcCCcccCCccccccc----HHHHH---HHH---------HhccCHHHHHHHHHHHHHhhhhhc---c
Confidence 9999999976 4577788999999999 99998 999 8999988 7899999988777 5
Q ss_pred cccccccCCCchhhhHHHHHHHh-------------ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcC
Q psy3038 274 KFGIRTKINSDKTIFMVEAIFLF-------------KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQ 340 (499)
Q Consensus 274 ~igLl~~~~~d~e~f~~~IN~L~-------------aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yq 340 (499)
.+.+.+ |+..+|.+. .||+| ||| . .+..+|+|++..+|+++.|.+.+++|+
T Consensus 329 ~~s~dd--------~y~~vn~v~~~lIRveADElTY~lHIi--lRy-E-----iEk~li~G~l~v~DlP~lWn~kme~yL 392 (497)
T COG2317 329 KYSLDD--------FYRAVNRVEPSLIRVEADELTYPLHII--LRY-E-----IEKELISGELKVDDLPELWNDKMEEYL 392 (497)
T ss_pred cCCHHH--------HHHHHhcccCcceeeeccccceeeeee--hhH-H-----HHHHHHcCCcchhhhHHHHHHHHHHHc
Confidence 555433 788899876 89999 999 8 799999999999999999999999999
Q ss_pred cccCCCCCCCCCCCccCccceecC-C-CchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHH
Q psy3038 341 GVEPPTHRSEDDFDPGAKYHIVAS-V-PYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKN 418 (499)
Q Consensus 341 GI~pP~~R~e~dfDp~ak~Hi~~~-~-pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~ 418 (499)
||+|+++++++.+| +||+.+ + ||++|.||+|++.|||.++.+..+.+ + ..+.++||. .+.+||++
T Consensus 393 Girp~~d~eG~LQD----IHWs~GsfGYFPsYtlG~v~AAQ~~~a~~~~~p~~-----d-~~i~~gd~~---~i~~WL~e 459 (497)
T COG2317 393 GIRPKNDAEGVLQD----IHWSHGSFGYFPTYTLGNVYAAQLYAAMKKDIPDV-----D-ALIAKGDFS---PIKNWLRE 459 (497)
T ss_pred CCCCCCcccccccc----ccccCCCcCcCchhhhHHHHHHHHHHHHHhhcccH-----H-HHHHcCCcH---HHHHHHHH
Confidence 99999999999999 999976 5 99999999999999999999998776 4 668899998 99999999
Q ss_pred -HHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHH
Q psy3038 419 -MLAMGSSKPWPDAMEAITGQREMDASALLQYFE 451 (499)
Q Consensus 419 -mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~ 451 (499)
+++.||.++.+|+++++||+. ++++++++|.+
T Consensus 460 ~ih~~Gs~~~p~eLi~~atGE~-lnp~y~i~yL~ 492 (497)
T COG2317 460 NIHRHGSRYPPKELLKRATGEA-LNPEYFIDYLE 492 (497)
T ss_pred HHHHccCcCCcHHHHHHhhCCc-CCHHHHHHHHH
Confidence 999999999999999999987 99999998865
No 5
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=100.00 E-value=4.5e-42 Score=366.94 Aligned_cols=269 Identities=16% Similarity=0.244 Sum_probs=220.9
Q ss_pred HHHHcCCCHHHHHHHHHHHHHHcCCCCCChHHhhhcccccCCC-CcccccCcccccCCCCcceeeecCCCC--hhhHHHH
Q psy3038 119 ELVRQGYTVHKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQG-REVVCHASAWDFHDGKDFRIKMCTRVN--EEDLFTI 195 (499)
Q Consensus 119 ~l~~q~yt~~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~-r~~~CH~sawdf~~~~D~RIk~Ct~~~--~~d~~t~ 195 (499)
..+.+.|+..+|-+++.++.+.||++. ..| .|++-|||+..++ ++|+||++....+ ...++++
T Consensus 198 ~~l~~~~~~~~Q~~~~~~~~~~~G~d~-------------~~grld~s~HPFt~~~~-~~DvRiTTry~e~d~~~~l~s~ 263 (494)
T PF02074_consen 198 SFLHGPFPEEKQKAFSRELLEYLGFDF-------------DRGRLDESAHPFTTGFG-PNDVRITTRYDEDDFLSALFST 263 (494)
T ss_dssp GGGGSB--HHHHHHHHHHHHHHHT--G-------------CGEEEEE-SS-EEEEEE-TTEEEEEE--BTTBTHHHHHHH
T ss_pred ccCCCCCCHHHHHHHHHHHHHHcCCCc-------------cceEEecCCCCCCCCCC-CCceeeecccccccHHHHHHHH
Confidence 345678999999999999999999998 557 5889999999998 8999999999988 8899999
Q ss_pred HHHHHHHHHHHHhc--CCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHH----HHhhhhcCCCcCc
Q psy3038 196 HHEMGHVEYFIQYK--DQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVM----RLLAMQKFVPFPL 269 (499)
Q Consensus 196 hHEmGHi~Y~~~y~--~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~----~~~~~q~~tP~hL 269 (499)
.||+||+.|+++.+ ...+++..|+++|+|| |+|++ ||| +||||.+ .||.+|+.+ ..|
T Consensus 264 iHE~GHalYEq~~~~~~~~tpl~~~~smgiHE----SQSrf---~En---------~vgrS~~Fw~~~~p~l~~~f-~~~ 326 (494)
T PF02074_consen 264 IHETGHALYEQGLPPEYYGTPLGEGVSMGIHE----SQSRF---WEN---------MVGRSRAFWEFLYPKLQEHF-PQL 326 (494)
T ss_dssp HHHHHHHHHHHTS-GGGTTSTTSS-S-HHHHH----HHHHH---HHT---------TTTTSHHHHHHHHHHHHHHC-GGG
T ss_pred HHHHHHHHHHcCCChhHccCccccccccchHH----HHHHH---HHH---------HHccCHHHHHHHHHHHHHHh-Ccc
Confidence 99999999999975 4577888999999999 99999 999 8999998 889999988 446
Q ss_pred cccccccccccCCCchhhhHHHHHHHh-------------ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHH
Q psy3038 270 VINRKFGIRTKINSDKTIFMVEAIFLF-------------KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLR 336 (499)
Q Consensus 270 ~~~~~igLl~~~~~d~e~f~~~IN~L~-------------aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr 336 (499)
..+.+ +.||..||++- .||+| ||| + .+.++|+|+++++|+++.|++.+
T Consensus 327 ---~~v~~--------e~~y~~~N~V~ps~IR~eADElTY~lHIi--lRy-E-----iEk~li~g~l~v~dLP~~Wn~km 387 (494)
T PF02074_consen 327 ---KDVSL--------ENFYRAINRVKPSLIRVEADELTYPLHII--LRY-E-----IEKALINGELSVDDLPEAWNEKM 387 (494)
T ss_dssp ---TT-SH--------HHHHHHHT-----S-GGG--TTTHHHHHH--HHH-H-----HHHHHHTTSS-GGGHHHHHHHHH
T ss_pred ---cCCCH--------HHHHHHHccCCCcccchhhHHHHHHHHHH--HHH-H-----HHHHHHcCCCChhhhHHHHHHHH
Confidence 55543 34999999876 89999 999 8 79999999999999999999999
Q ss_pred HhcCcccCCCCCCCCCCCccCccceecC-C-CchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhH
Q psy3038 337 HDYQGVEPPTHRSEDDFDPGAKYHIVAS-V-PYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGN 414 (499)
Q Consensus 337 ~~yqGI~pP~~R~e~dfDp~ak~Hi~~~-~-pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~ 414 (499)
++|.||+||.+++++.+| +||+++ + ||++|.||+|++.|||.+++++.+.+ + .-+.++||. .+.+
T Consensus 388 ~eyLGi~p~~d~eG~LQD----vHWs~G~fGYFPtY~LG~~~AaQl~~~~~~~~pd~-----~-~~i~~G~f~---~i~~ 454 (494)
T PF02074_consen 388 EEYLGITPPNDAEGVLQD----VHWSSGSFGYFPTYALGNIYAAQLFAAMKKDIPDL-----D-EQIAKGDFS---PIRD 454 (494)
T ss_dssp HHHHS---SSCTTTTTS-----STTTTT-TS-THHHHHHHHHHHHHHHHHHHHHTTH-----H-HHHHTT--H---HHHH
T ss_pred HHHcCCCCCCCCcccccc----ccccCCccccchhhHHHHHHHHHHHHHHHHHccCH-----H-HHHHcCChH---HHHH
Confidence 999999999999999999 999988 5 99999999999999999999998765 3 457788987 9999
Q ss_pred HHHH-HHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHH
Q psy3038 415 LLKN-MLAMGSSKPWPDAMEAITGQREMDASALLQYFE 451 (499)
Q Consensus 415 ~L~~-mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~ 451 (499)
||++ +++.|++++..|+|+++||++ +|++++++|++
T Consensus 455 WL~e~Ih~~G~~y~~~eLl~~aTGe~-l~~~~~~~YL~ 491 (494)
T PF02074_consen 455 WLRENIHQYGSRYTPDELLKKATGEP-LNPEYFIDYLK 491 (494)
T ss_dssp HHHHHTGGGTTSS-HHHHHHHHHSSS---THHHHHHHH
T ss_pred HHHHHHHHhccCCCHHHHHHHHhCCC-CCHHHHHHHHH
Confidence 9999 999999999999999999997 99999999975
No 6
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=100.00 E-value=2.5e-37 Score=323.15 Aligned_cols=268 Identities=17% Similarity=0.251 Sum_probs=233.6
Q ss_pred HHcCCCHHHHHHHHHHHHHHcCCCCCChHHhhhcccccCCC-CcccccCcccccCCCCcceeeecCCCC--hhhHHHHHH
Q psy3038 121 VRQGYTVHKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQG-REVVCHASAWDFHDGKDFRIKMCTRVN--EEDLFTIHH 197 (499)
Q Consensus 121 ~~q~yt~~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~-r~~~CH~sawdf~~~~D~RIk~Ct~~~--~~d~~t~hH 197 (499)
..+.|+..+|.+++.++...+|++. ..+ .|+++||||..++ +.|+||+|....+ ..+++|++|
T Consensus 100 ~~~~~~~~~Q~~~~~~~~~~~g~df-------------~~griD~s~hpF~~~~~-~~dvRItt~y~~~d~~~~l~t~iH 165 (396)
T cd06460 100 LSGPFPEEKQEALGRELLEALGFDF-------------DRGRLDVSAHPFTGGLG-PGDVRITTRYDENDFRSALFSTIH 165 (396)
T ss_pred cCCCCCHHHHHHHHHHHHHHhCCcc-------------cCCeeecCCCCCCCCCC-CCCceEEeeeCCcchHHHHHHHHH
Confidence 3567899999999999999999998 566 5899999999998 6799999999998 999999999
Q ss_pred HHHHHHHHHHhc--CCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHH----HHhhhhcCCCcCccc
Q psy3038 198 EMGHVEYFIQYK--DQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVM----RLLAMQKFVPFPLVI 271 (499)
Q Consensus 198 EmGHi~Y~~~y~--~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~----~~~~~q~~tP~hL~~ 271 (499)
|+||+.|+++++ ....++..++++|+|| ++|.+ ||| +||||.+ .+|.+|+.+|..+
T Consensus 166 E~GHalye~~l~~~~~~~pl~~~~sm~ihE----SqS~f---~E~---------~vgrs~~F~~~l~~~l~~~f~~~~-- 227 (396)
T cd06460 166 ETGHALYEQGLPPELRGTPLGGGASMGIHE----SQSRL---WEN---------QVGRSRAFWEFLYPKLKKTFPEQL-- 227 (396)
T ss_pred HhhHHHHHhcCCHhHccCCcccCCCchhHH----HHHHH---HHH---------HHhcCHHHHHHHHHHHHHHccccc--
Confidence 999999999874 1234445889999999 99999 999 8899866 7788998888877
Q ss_pred cccccccccCCCchhhhHHHHHHHh-------------ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHh
Q psy3038 272 NRKFGIRTKINSDKTIFMVEAIFLF-------------KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHD 338 (499)
Q Consensus 272 ~~~igLl~~~~~d~e~f~~~IN~L~-------------aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~ 338 (499)
..+.+ +.|+..+|++- .+|+| |+| . ++..+++|.++++|+++.|++++++
T Consensus 228 -~~~~~--------e~~~~~~n~v~ps~iRveADEvty~lHii--lR~-e-----~Ek~l~~G~l~v~dLp~~W~~~~~e 290 (396)
T cd06460 228 -KDVSL--------ENFYRAVNRVQPSLIRVEADEVTYNLHIM--LRY-E-----LEKALIEGDLEVADLPEAWNEKMKE 290 (396)
T ss_pred -ccCCH--------HHHHHHHhhcCccceeeecchhhHHHHHH--HHH-H-----HHHHHHcCCCCHHHHHHHHHHHHHH
Confidence 54442 22666677643 66777 888 7 7889999999999999999999999
Q ss_pred cCcccCCCCCCCCCCCccCccceecC-C-CchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHH
Q psy3038 339 YQGVEPPTHRSEDDFDPGAKYHIVAS-V-PYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLL 416 (499)
Q Consensus 339 yqGI~pP~~R~e~dfDp~ak~Hi~~~-~-pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L 416 (499)
|.||.||.++.+..+| .||+.+ + ||+.|.||++++.||++++++..+.+ . ..+.++||. .+.+||
T Consensus 291 ylG~~~~~d~~g~lQd----~HW~~g~fGYfptYalG~l~Aaql~~~~~~~~~~~-----~-~~i~~g~f~---~~~~wL 357 (396)
T cd06460 291 YLGIRPPNDAEGCLQD----IHWSGGSFGYFPTYTLGNLYAAQLFAAAKKDLPDI-----D-EQIERGDFS---PLLEWL 357 (396)
T ss_pred HcCCCCCCCccccccc----cccccCcCCcCchhHHHHHHHHHHHHHHHHhCccH-----H-HHHhccChH---HHHHHH
Confidence 9999999999999999 799987 7 99999999999999999999986554 1 337788986 799999
Q ss_pred HH-HHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHH
Q psy3038 417 KN-MLAMGSSKPWPDAMEAITGQREMDASALLQYFE 451 (499)
Q Consensus 417 ~~-mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~ 451 (499)
++ +++.|+++++.|+|+.+||+. |+++++++|++
T Consensus 358 ~e~i~~~G~~~~p~eLl~~atG~~-l~~~~f~~yL~ 392 (396)
T cd06460 358 RENIHQHGSRYSPDELLKKATGEP-LNPEYFLEYLE 392 (396)
T ss_pred HHHHHhccCCCCHHHHHHHHhCCC-CChHHHHHHHH
Confidence 99 999999999999999999997 99999999975
No 7
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=100.00 E-value=4.4e-33 Score=284.84 Aligned_cols=290 Identities=21% Similarity=0.292 Sum_probs=228.4
Q ss_pred chhhchhhHhhhCCCcccCCCCCccccccchhhhcccccccccccCCCCCCCCchHHHHHcCCCHHHHHHHHHHHHHHcC
Q psy3038 63 ETFDRRRLNEKYGDKVVNRRGPIPAHLLGNMWAQTWGNIYDIVVPYPEKTPPDVSAELVRQGYTVHKMFRTAEEFFTSIN 142 (499)
Q Consensus 63 hayvR~~L~~~Yg~~~v~~~gpiPAHLlGnmw~q~W~~~~~~~~P~p~~~~~dvt~~l~~q~yt~~~mf~~ae~ff~slG 142 (499)
++.++.+|.+..+.+ ....+.+|+ ..||++.++++.++..++++.+.++| -
T Consensus 54 ~~~~~~~l~~~~~~~-~~~~~~~~~-------------------------~~~~~~~~~~~~f~~~~~~~~~~~~~---~ 104 (365)
T cd06258 54 LAKLREEISAAKQKE-EEIYGELPA-------------------------RYDVDSALLKEFFDAERPWEGALPFF---Y 104 (365)
T ss_pred HHHHHHHHHHHHHhh-cCCcccccc-------------------------cccCCHhhccCcCChHHHHHHHHHhH---h
Confidence 788888888777643 344556665 55788999999999999999999988 2
Q ss_pred CCCCChHHhhhcccccCCCCcccccCcccccCC-----CCcceeeecCCC---------ChhhHHHHHHHHHHHHHHHHh
Q psy3038 143 MSAMPPEFWERSMLEKPQGREVVCHASAWDFHD-----GKDFRIKMCTRV---------NEEDLFTIHHEMGHVEYFIQY 208 (499)
Q Consensus 143 l~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~-----~~D~RIk~Ct~~---------~~~d~~t~hHEmGHi~Y~~~y 208 (499)
+|. ..|+ ...|||+|+++.. ..|+||.+|... +++++.|++|||||++|.+..
T Consensus 105 lD~----------~~R~---gK~~~a~~~~~~~~~~~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tl~HE~GHa~h~~l~ 171 (365)
T cd06258 105 LDL----------YDRK---GKYPHGFCTGLDPGFNRQDKDVRILANFTSPAAPDPVLLGHDDINTLFHEFGHAVHFLLI 171 (365)
T ss_pred ccC----------CCCC---CCCCCCeeccccCCCCCCCCeEEEEccCCCCCCCCCCcCCHHHHHHHHHHHhHHHHHHHh
Confidence 221 2222 2589999999983 689999999998 999999999999999999987
Q ss_pred cCCCC-cccCCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCCcCccccccccccccCC-Cchh
Q psy3038 209 KDQPM-AFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKIN-SDKT 286 (499)
Q Consensus 209 ~~qP~-~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~-~d~e 286 (499)
+.++. +.+.+++++||| +.|.+ +|+|+ . +|.+| ++++...... -..
T Consensus 172 ~~~~~~~~g~~~~~~~~E----~~S~~---~E~~~-------------~--------~~~~L---~~~~~~~~~~~~~~- 219 (365)
T cd06258 172 QQRYPFQERTPTSTDFAE----AQSMF---LESFA-------------T--------DPEWL---ERYARHYQGGVVPD- 219 (365)
T ss_pred cCCCCcCCCCCCCccHHh----ccHHH---HHHHH-------------C--------CHHHH---HHHhhhcCCCCCcH-
Confidence 76544 578999999999 99999 77732 2 33444 4433222111 111
Q ss_pred hhHHHHHHHh---ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceec
Q psy3038 287 IFMVEAIFLF---KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVA 363 (499)
Q Consensus 287 ~f~~~IN~L~---aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~ 363 (499)
..++.+. ....+..++. .++.+.|++.++++.....++|+.||++.++|+|+.| .+...+++.+.||..
T Consensus 220 ---~~~~~~~~~~~~~~~~~~~~-~~~~~~~d~~l~~~~~~~~~~~~~~~~l~~~~~g~~~----~~~~~~~~~~~H~~~ 291 (365)
T cd06258 220 ---ELIEKLIAARLPNTLYETRR-ILVVAKFEKALYENPDRELELQKLWRDLVKEILGVRP----DPSTPDPAAFPHLAG 291 (365)
T ss_pred ---HHHHHHHHhhhhhHHHHHHH-HHHHHHHHHHHhCCCCccCCHHHHHHHHHHHhcCCCC----CCCCCCCCccchhcc
Confidence 2233333 3456677888 9999999999999877666899999999999999955 233466778899987
Q ss_pred --CCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHH-HHhCCCCCChHHHHHHHhCCCC
Q psy3038 364 --SVPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKN-MLAMGSSKPWPDAMEAITGQRE 440 (499)
Q Consensus 364 --~~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~-mm~~GsS~~W~e~le~~TG~~~ 440 (499)
+.+|++|++|.++++||++++|+..| |+.+++++|++|++ +++.|+|++|.|+++++||+.
T Consensus 292 gy~~yyy~Y~~~~v~a~qi~~~~~~~~~---------------~~~~~~~~g~~l~~~il~~G~s~~~~el~~~~~G~~- 355 (365)
T cd06258 292 GSPAYYYGYLLAEMLASQLRATFKKKVG---------------YLTDNPEAGPRLREHILRPGNSEPWKELLKRATGED- 355 (365)
T ss_pred CcccchHHHHHHHHHHHHHHHHHHhcCC---------------CCCCCHHHHHHHHHHHccCcCCcCHHHHHHHHcCCC-
Confidence 57999999999999999999999742 34567899999999 999999999999999999997
Q ss_pred CCHHHHHHHH
Q psy3038 441 MDASALLQYF 450 (499)
Q Consensus 441 lsa~~ll~YF 450 (499)
+|+++|++||
T Consensus 356 ~~~~a~~~~~ 365 (365)
T cd06258 356 PNADAFLDHF 365 (365)
T ss_pred CChHHHHhcC
Confidence 9999999996
No 8
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=99.87 E-value=2.8e-21 Score=200.28 Aligned_cols=257 Identities=15% Similarity=0.105 Sum_probs=183.8
Q ss_pred cCCCHHHHHHHHHHHHHHcCCC--CCChHHhhhcc---cccCCCCcccccCcccccCCCCcceeeecCCCChhhHHHHHH
Q psy3038 123 QGYTVHKMFRTAEEFFTSINMS--AMPPEFWERSM---LEKPQGREVVCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHH 197 (499)
Q Consensus 123 q~yt~~~mf~~ae~ff~slGl~--~mp~~FW~~S~---~~kp~~r~~~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hH 197 (499)
..|+.+++.+.+.+.|..+|.+ .+-..+|++.- ..+|. ..-+|+|+.+..+.|.||.||...+.+|+.|++|
T Consensus 152 ~~~~~~~~~~~v~~~~~~l~~~~~~~~~~~~~~~~iD~~~r~g---K~~gaf~~~~~~~~~p~i~~n~~~~~~~v~tl~H 228 (427)
T cd06459 152 PKYTYEEAKELVLEALSPLGPEYAEFAKRAFEERWIDVEPRKG---KRSGAYCTGLPPGKHPFILMNFNGTLDDVFTLAH 228 (427)
T ss_pred CcCcHHHHHHHHHHHHHccCHHHHHHHHHHhhCCCeeccCCCC---CCCCeecCCCCCCCCCeEEecCCCChhhHHHHHH
Confidence 4577778888888888887722 11112233221 12221 2345688888877899999999999999999999
Q ss_pred HHHHHHHHHHhc-CCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCCcCcccccccc
Q psy3038 198 EMGHVEYFIQYK-DQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFG 276 (499)
Q Consensus 198 EmGHi~Y~~~y~-~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~ig 276 (499)
|+||+.|++.-+ .+|++.+ +....|+| +.|-+ +|+|+.+--+ ++. +
T Consensus 229 E~GHa~h~~~~~~~~~~~~~-~~~~~~~E----~~S~~---~E~~~~~~l~-------------------~~~------~ 275 (427)
T cd06459 229 ELGHAFHSYLSRDNQPYLYS-DYPIFLAE----IASTF---NELLLFDYLL-------------------KFA------K 275 (427)
T ss_pred HhhHHHHHHHHccCCCcccC-CCCchhhH----HHHHH---HHHHHHHHHH-------------------HhC------C
Confidence 999988866544 5665555 68889999 88777 7774332111 001 1
Q ss_pred ccccCCCchhhhHHHHHHHh-cccccc-ccchhHHHHHHHHHhhcCC-----CCChhhHhHHHHHHHHhcCcccCCCCCC
Q psy3038 277 IRTKINSDKTIFMVEAIFLF-KIDTFL-INTFVNIMMDLWRWGVFKK-----DISHEEYNKHWWKLRHDYQGVEPPTHRS 349 (499)
Q Consensus 277 Ll~~~~~d~e~f~~~IN~L~-aL~~i~-~LPf~~~~~D~wRw~vf~G-----~i~~~~~N~~WW~lr~~yqGI~pP~~R~ 349 (499)
+..+ ..-.+. .++.+. +++- .++.+.+++.++++ .++++++++.||++.++|.|+.|+ .+.
T Consensus 276 ------~~~~----~~~~~~~~~~~~~~~~~~-~~~~~~fe~~l~~~~~~~~~~~~~~~~~~~~~~~~~y~g~~~~-~~~ 343 (427)
T cd06459 276 ------DPEE----KLYLLEHLLEDIRATLPR-QTMFAEFEHEVYENPEEGEPLTAEELNEIYRELEKKYGGDLVE-IDE 343 (427)
T ss_pred ------CHHH----HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHhcCCccc-cCC
Confidence 0011 111111 445555 6777 88999999999984 367889999999999999999876 445
Q ss_pred CCCCCccCccceec-CCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHHHHhCCCCCCh
Q psy3038 350 EDDFDPGAKYHIVA-SVPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKNMLAMGSSKPW 428 (499)
Q Consensus 350 e~dfDp~ak~Hi~~-~~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W 428 (499)
+..+....+.|+.. +++|+.|++|.++++|+++.+|++ .+.+++..+++|+.|+|+++
T Consensus 344 ~~~~~w~~~~H~~~~~fYyy~Y~~g~~~a~~l~~~~~~~---------------------~~~~~~~y~~~L~~Ggs~~~ 402 (427)
T cd06459 344 EHGYEWARIPHFYYVPFYVYPYAFGQLAALALYAKYKED---------------------GEGAVEKYLELLKAGGSKSP 402 (427)
T ss_pred ccCeeeeecCeecCCCCcChHHHHHHHHHHHHHHHHHHc---------------------CccHHHHHHHHHHccCCCCH
Confidence 55666678899986 688889999999999999999984 12456777799999999999
Q ss_pred HHHHHHHhCCCCCCHHHHHHH
Q psy3038 429 PDAMEAITGQREMDASALLQY 449 (499)
Q Consensus 429 ~e~le~~TG~~~lsa~~ll~Y 449 (499)
.|+++.+ |-...+++.+.+=
T Consensus 403 ~ell~~~-g~d~~~~~~~~~~ 422 (427)
T cd06459 403 LELLKKA-GVDLTSPDFWEEA 422 (427)
T ss_pred HHHHHHc-CcCCCChHHHHHH
Confidence 9999986 5444777666543
No 9
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=99.72 E-value=2.8e-16 Score=170.74 Aligned_cols=267 Identities=17% Similarity=0.135 Sum_probs=178.3
Q ss_pred CCCHHHHHHHHHHHHHHcCCCCCChHHhhhcccccCC-C---Cc-ccccCcccccCCCCcceeeecCCCChhhHHHHHHH
Q psy3038 124 GYTVHKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQ-G---RE-VVCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHHE 198 (499)
Q Consensus 124 ~yt~~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~-~---r~-~~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hHE 198 (499)
.++.+.+.+.+.+.|..+|... .+|.+. |++... | |+ ..-+|+|+.+......+|.|+...+..|+.|++||
T Consensus 268 ~~~~e~~~~~~~~~~~~l~~~~--~e~~~~-~~~~~~iD~~~r~gK~~Gayc~~~~~~~~P~I~~Nf~~t~~dv~TL~HE 344 (549)
T TIGR02289 268 FGNVDFLLEKAEKMYKELSLEF--YEFFNF-MREKNLLDLESRKGKAAGGYCTYLPKYKAPFIFSNFNGTSGDIDVLTHE 344 (549)
T ss_pred CCCHHHHHHHHHHHhcCCCcHH--HHHHHH-HHhcCccccCCCCCCCCCcccCCCCCCCCcEEEEeCCCChhHHHHHHHH
Confidence 3678888888888888775442 355543 333322 1 22 24567777776567789999999999999999999
Q ss_pred HHHHHHHH-HhcCCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCCcCccccccccc
Q psy3038 199 MGHVEYFI-QYKDQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGI 277 (499)
Q Consensus 199 mGHi~Y~~-~y~~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igL 277 (499)
+||+...+ ....+|...+... .- ++++ -|+ +.. . ++-.+|. +..+
T Consensus 345 lGHa~H~~~s~~~~~~~~~~~~-~~--------~aE~----------aS~------~~E-~-----l~~~~l~--~~~~- 390 (549)
T TIGR02289 345 AGHAFHVYESRKDLLPEYRWPT-YE--------AAEL----------ASM------SME-L-----LSMPWWD--LFYT- 390 (549)
T ss_pred hhHHHHHHHhcCCcccccccCc-ch--------hhhh----------hHH------HHH-H-----HHHHHHH--HHcC-
Confidence 99998654 3334555555322 22 3333 222 111 0 0111120 1111
Q ss_pred cccCCCchhhhHHHHHHHhccccccccchhHHHHHHHHHhhcCC-CCChhhHhHHHHHHHHhcCcccCCCCCCCCC--CC
Q psy3038 278 RTKINSDKTIFMVEAIFLFKIDTFLINTFVNIMMDLWRWGVFKK-DISHEEYNKHWWKLRHDYQGVEPPTHRSEDD--FD 354 (499)
Q Consensus 278 l~~~~~d~e~f~~~IN~L~aL~~i~~LPf~~~~~D~wRw~vf~G-~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~d--fD 354 (499)
+..+..+ ..+..|. ..+.++|+ ..++|.+++.|+++ .++++++++.|+++.++|.| +....... ..
T Consensus 391 --~~e~~~~---~~~~~L~--~~~~~~~~-~~~~d~fe~~vy~~~~lt~~e~~~~~~~l~~~y~~---~~~~~~~~~~~~ 459 (549)
T TIGR02289 391 --DEKDLKK---AKISHLK--GALSFLPY-GVIVDHFQHWVYENPNHTPEERDEKYAELEKRYQP---STVYIAGLELEI 459 (549)
T ss_pred --CcchHHH---HHHHHHH--HHHHHHHH-HHHHHHHHHHHccCCCCCHHHHHHHHHHHHHHhCC---CcccCCcccccc
Confidence 0011111 2233333 34456799 99999999999996 78999999999999999973 22222111 12
Q ss_pred cc---Ccccee-cCCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHHHHhCCCCCChHH
Q psy3038 355 PG---AKYHIV-ASVPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKNMLAMGSSKPWPD 430 (499)
Q Consensus 355 p~---ak~Hi~-~~~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W~e 430 (499)
++ .+.|+. .++||+.|.+|.++++|||+...++ .+.+.+...++++.|+|+++.|
T Consensus 460 ~~~W~~~~H~~~~pFYyy~Y~~a~~~a~~l~~~~~~~---------------------~~~~~~~Y~~~L~~Ggs~~~~e 518 (549)
T TIGR02289 460 GTFWLRQLHIFEVPFYYIEYTIAQIGALQIYKIYKED---------------------PEKALKDYKKLCSAGGSQSFLE 518 (549)
T ss_pred CCcceeeceeecCCCcchhhHHHHHHHHHHHHHHHhC---------------------HHHHHHHHHHHHhccCCcCHHH
Confidence 22 356987 4799999999999999999876442 2366677889999999999999
Q ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHH
Q psy3038 431 AMEAITGQREMDASALLQYFEPLYKWLEE 459 (499)
Q Consensus 431 ~le~~TG~~~lsa~~ll~YF~PL~~WL~~ 459 (499)
+++.+.....++++++.+.++++.+||++
T Consensus 519 ll~~aGid~~f~~~~~~~~~~~~~~~l~~ 547 (549)
T TIGR02289 519 LYETAGLTFPFSEERIKEIVSFVEKLLEE 547 (549)
T ss_pred HHHHhCCCCCCCcccHHHHHHHHHHHHhh
Confidence 99995444456899999999999999974
No 10
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=99.45 E-value=4.6e-12 Score=138.79 Aligned_cols=226 Identities=15% Similarity=0.076 Sum_probs=154.3
Q ss_pred cccCcccccCCCCcceeeecCCCChhhHHHHHHHHHHHHHHHHhcCCCCcccCCCCCCCCccccccccchhhhccccchh
Q psy3038 165 VCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHHEMGHVEYFIQYKDQPMAFREGANPGKNTRGWTALKDGLSFYESWVAE 244 (499)
Q Consensus 165 ~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hHEmGHi~Y~~~y~~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~ 244 (499)
.-+|+|+.+..+.+.||.|....+..|+.|+.||+||+.+.+.-+.+|... .+....++| .-|-+ .|+|+.+
T Consensus 349 ~~Ga~~~~~~~~~~p~i~~N~~~~~~~v~TL~HE~GHa~H~~ls~~~~~~~-~~~~~~~aE----~~S~~---~E~l~~~ 420 (587)
T TIGR02290 349 RGGAFCTGFPPSKEPRVLMNYDGSRRDVSTLAHELGHAYHSELAKDQPLLN-ARYPMTLAE----TASIF---AEMLLFD 420 (587)
T ss_pred CCCcccCCCCCCCCCEEEEecCCCchhHHHHHHHhhHHHHHHHHccCCccc-CCCCchhhh----HHHHH---HHHHHHH
Confidence 447788887766678999999999999999999999999875446788655 467788999 66655 6663322
Q ss_pred eeccccchhhHHHHhhhhcCCCcCccccccccccccCCCchhhhHHHHHHHh--ccccc-cccchhHHHHHHHHHhhc--
Q psy3038 245 IGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINSDKTIFMVEAIFLF--KIDTF-LINTFVNIMMDLWRWGVF-- 319 (499)
Q Consensus 245 iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~--aL~~i-~~LPf~~~~~D~wRw~vf-- 319 (499)
-.| +.+.+ +. ..+.++ .+..+ .++.- ..+.+.++..++
T Consensus 421 ~ll--------------~~~~~-----------------~~-----~~~~~~~~~l~~~~~~~~~-q~~~~~fE~~l~~~ 463 (587)
T TIGR02290 421 ALL--------------KEAKT-----------------DE-----EKLSLLAEKLEDAIATLVR-IHARFLFERRFHEA 463 (587)
T ss_pred HHH--------------hhCCC-----------------HH-----HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH
Confidence 111 11111 11 112222 11111 12233 566677888888
Q ss_pred --CCCCChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceec-CCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCC
Q psy3038 320 --KKDISHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVA-SVPYIRYFVSFVIQFQFHRALCEKAGEFDPNDP 396 (499)
Q Consensus 320 --~G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~-~~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~ 396 (499)
+|.++++++++.|+++.++|-|..++.. .....--..+.|+.. ++|+..|.+|.+++-|++..+++. |.
T Consensus 464 ~~~~~lt~~~l~~~~~~~~~~~~g~~~~~~-~~~~~~w~~~~H~y~~~fY~y~Y~~g~~~a~~l~~~~~~~-~~------ 535 (587)
T TIGR02290 464 RKEGELSADDICDLMLEAQKESYGDALDES-ELHPYMWAYKPHFYHAPFYNYPYTFGYLFVLGLYAKYREE-GE------ 535 (587)
T ss_pred HHcCCCCHHHHHHHHHHHHHHHhCcccccC-CCCCceeecCCeecCCCCcChhhHHHHHHHHHHHHHHHHC-Ch------
Confidence 4889999999999999999888754332 111122245689986 588888999999999999988763 10
Q ss_pred CCCCCccccCcCcHHHhHHHHHHHhCCCCCChHHHHHHHhCCCCCCH-HHHHHHHHHHHHHHHH
Q psy3038 397 SKQPLHECDIYRSTEAGNLLKNMLAMGSSKPWPDAMEAITGQREMDA-SALLQYFEPLYKWLEE 459 (499)
Q Consensus 397 ~~~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W~e~le~~TG~~~lsa-~~ll~YF~PL~~WL~~ 459 (499)
.+-+..+++++.|+|++..|+++. +|.. ++. +++.+=++=+.+|+++
T Consensus 536 --------------~~~~~y~~~L~~Ggs~~p~ell~~-~G~d-~~~~~~~~~~~~~~~~~v~e 583 (587)
T TIGR02290 536 --------------SFVPKYIALLRDTGSMTPEELVKK-FGFD-LTSPDFWQKGIDVIEEKIDE 583 (587)
T ss_pred --------------hHHHHHHHHHHCcCCCCHHHHHHH-hCcC-CCChHHHHHHHHHHHHHHHH
Confidence 122335689999999999999999 8987 655 4444666666666654
No 11
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=99.36 E-value=3.8e-11 Score=132.55 Aligned_cols=231 Identities=16% Similarity=0.165 Sum_probs=167.4
Q ss_pred cccCcccccCCCCcceeeecCCCChhhHHHHHHHHHHHHHH-HHhcCCCCcccCCCCCCCCccccccccchhhhccccch
Q psy3038 165 VCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHHEMGHVEYF-IQYKDQPMAFREGANPGKNTRGWTALKDGLSFYESWVA 243 (499)
Q Consensus 165 ~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hHEmGHi~Y~-~~y~~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~ 243 (499)
.-.|+|.+++.+...+|.|....++.|+.|+.||+||+.+. ...+.||++.++.+ .-+-
T Consensus 354 rsGaYs~~~~~~~~p~IlmN~~gt~~dV~TLaHElGHs~Hs~~s~~~qp~~~~~~~-i~~A------------------- 413 (598)
T COG1164 354 RSGAYSIGFYKGDHPFILMNYDGTLRDVFTLAHELGHSVHSYFSRKHQPYLYADYS-IFLA------------------- 413 (598)
T ss_pred CCCcccCCCCCCCCCeEEEeCCCchhHHHHHHHHccHHHHHHHHhccCCccccCCc-hHHH-------------------
Confidence 55677777887789999999999999999999999999995 55678998877543 2222
Q ss_pred heeccccchhhHHHHhhhhcCCCcCccccccccccccCCCchhhhHHHHHHHh--ccccccccchhHHHHHHHHHhhcC-
Q psy3038 244 EIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINSDKTIFMVEAIFLF--KIDTFLINTFVNIMMDLWRWGVFK- 320 (499)
Q Consensus 244 ~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~--aL~~i~~LPf~~~~~D~wRw~vf~- 320 (499)
|||= +++.. ++-.|| +....+..+ .+-.|. --+.+.+++- ..+.+.+++.+++
T Consensus 414 EiAS--~fnE~---------l~~~~l--------l~~~~~~~~----~~~il~~~l~~~~~t~~r-q~~f~~FE~~~h~~ 469 (598)
T COG1164 414 EIAS--TFNEM---------LLFDYL--------LERFKDPEE----RLAILEEKLEGFFATLFR-QTLFAEFEHRVHEL 469 (598)
T ss_pred HHHH--HHHHH---------HHHHHH--------HHHcCCHHH----HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH
Confidence 2221 11111 122333 222222222 121122 3345566777 8888999998885
Q ss_pred ----CCCChhhHhHHHHHHHHhcCcccCCCCCC-CCCCCccCccceec-CCCchhHHHHHHHHHHHHHHHHHHhCCCCCC
Q psy3038 321 ----KDISHEEYNKHWWKLRHDYQGVEPPTHRS-EDDFDPGAKYHIVA-SVPYIRYFVSFVIQFQFHRALCEKAGEFDPN 394 (499)
Q Consensus 321 ----G~i~~~~~N~~WW~lr~~yqGI~pP~~R~-e~dfDp~ak~Hi~~-~~pYirYfl~~ilqfQf~~alC~~ag~~~~~ 394 (499)
|.++++++|+.|+++-++|.|=.+..+.. +..+ +.+.|+.. |+|+..|.+|.+.+.|+|+.+..+-
T Consensus 470 ~~~~~~lt~~~l~~~~~~l~~~y~gd~~~~d~~~~~~W--~~ipHfy~~pFYvy~Ya~G~~~a~~l~~~~~~~~------ 541 (598)
T COG1164 470 IEEGEELTAEELNELYLELQKEYYGDAVKLDELSGLEW--ARIPHFYHSPFYVYQYATGQLAALALYAKILTND------ 541 (598)
T ss_pred HhhcCCCCHHHHHHHHHHHHHHhcCCccccCcccccch--hhcCeeeecCceehHhHHHHHHHHHHHHHHHhcc------
Confidence 57899999999999999999988665554 2333 56679886 8999999999999999999998861
Q ss_pred CCCCCCCccccCcCcHHHhHHHHHHHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHh
Q psy3038 395 DPSKQPLHECDIYRSTEAGNLLKNMLAMGSSKPWPDAMEAITGQREMDASALLQYFEPLYKWLEEENA 462 (499)
Q Consensus 395 ~~~~~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~PL~~WL~~~N~ 462 (499)
....-+.+.++|+.|+|.+..|+++ ..|-.--+.+++-+=++=+..||++-.+
T Consensus 542 --------------~~~~~~~y~~~Lk~GgS~~p~ell~-~~GiD~~~~~~~~~a~~~~~~~ide~~~ 594 (598)
T COG1164 542 --------------AEAFEKYYIAFLKSGGSKSPLELLK-IAGIDLTTPDPWEEALAEFERLIDELEE 594 (598)
T ss_pred --------------HHHHHHHHHHHHhCCCCCCHHHHHH-HcCCCCCCchHHHHHHHHHHHHHHHHHH
Confidence 1244566778999999999999999 7899856667777777778888876544
No 12
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=99.20 E-value=1.2e-10 Score=123.54 Aligned_cols=230 Identities=17% Similarity=0.182 Sum_probs=152.6
Q ss_pred ccccCcccccCCC--------CcceeeecCCC---------ChhhHHHHHHHHHHHHHHHHhc-CCCCcccCCCCCCCCc
Q psy3038 164 VVCHASAWDFHDG--------KDFRIKMCTRV---------NEEDLFTIHHEMGHVEYFIQYK-DQPMAFREGANPGKNT 225 (499)
Q Consensus 164 ~~CH~sawdf~~~--------~D~RIk~Ct~~---------~~~d~~t~hHEmGHi~Y~~~y~-~qP~~fr~gAn~gfhE 225 (499)
...|+.+..+..+ ...+|.|+... +.+|+.|+.||+||+...+--+ .+|.+.......-|-|
T Consensus 198 K~~ga~~~~~~~~~~~~~~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tLfHE~GHa~H~~ls~~~~~~~sg~~~~~d~aE 277 (458)
T PF01432_consen 198 KRSGAFCFTLRPSRSDGERQLPVPYIFCNFTGPSAGKPSLLSHDDVETLFHEFGHAMHSLLSRTKYQHLSGTRVPMDFAE 277 (458)
T ss_dssp S-SS-EEEEEEC-BTTSTCECEEEEEEEEE-S-BTTC--B-SHHHHHHHHHHHHHHHHHHHCCCSSGGGSTTSS-CHHCH
T ss_pred CCCCceeCCccCccccccCCCCceEEEecCCCCCCCCCCccChhhHHHHHHHHhHHHHHHHhccccccccCCchhHHHHh
Confidence 3677788888865 67899999999 9999999999999999876533 4445555556666667
Q ss_pred cccccccchhhhccccchheeccccchhhHHHHhhhhcCCCcCccccccccccccCCCchhhhHHHHHHHh---cccccc
Q psy3038 226 RGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINSDKTIFMVEAIFLF---KIDTFL 302 (499)
Q Consensus 226 ~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~---aL~~i~ 302 (499)
..|-+ .|+|+.+-.+ -.. +. +|-. .+.-+.. ..++.+. ......
T Consensus 278 ----~~S~~---~E~~~~~~~~------l~~-------~~-~~~~--~~~~i~~----------~~l~~~~~~~~~~~~~ 324 (458)
T PF01432_consen 278 ----FPSQF---MENWLWDPLV------LKA-------FS-RHYE--TGEPIPE----------ELLEDLIASRNFFAAI 324 (458)
T ss_dssp ----HHHHH---HHHHGGCHHH------HHH-------H--BSTT--THHHHHH----------HHHHHHHHTTTTTHHH
T ss_pred ----cchHH---HHHhhhchhh------hhh-------hc-cChh--hhhhHHH----------HHHHhhhhhhhhhHHH
Confidence 55555 6775554222 011 11 1110 0001111 2344443 222223
Q ss_pred ccchhHHHHHHHHHhhcCCCCCh----hhHhHHHHHHHHhcCcccCCCCCCCCCCCccCcccee-c--CCCchhHHHHHH
Q psy3038 303 INTFVNIMMDLWRWGVFKKDISH----EEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIV-A--SVPYIRYFVSFV 375 (499)
Q Consensus 303 ~LPf~~~~~D~wRw~vf~G~i~~----~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~-~--~~pYirYfl~~i 375 (499)
++.. .+++..+...+++..-.. +++++.|.++.++|.|+..+. ...-+....|+. . +.+|..|..|.+
T Consensus 325 ~~~~-ql~~a~fd~~~h~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~~~~f~hl~~~~Y~a~yY~Y~ya~~ 399 (458)
T PF01432_consen 325 FLFR-QLLFALFDQELHESPEDGEPESEDLNELYRELQKEYYGDPSDP----DDYFPASFSHLFSHFYAAGYYSYLYAEV 399 (458)
T ss_dssp HHHH-HHHHHHHHHHHCCHCCCTHHHHHHHHHHHHHHHHHCCTTTCBT----TCCGGGGGGGGHCCCGTTTTTCHHHHHH
T ss_pred HHHH-HHHHHHHHHHHHhccccCCccchhHHHHHHHHHHHhcCcchhc----cccccceehhhccCccCCCCchhHHHHH
Confidence 4555 677788999999864332 479999999999999955553 222233446885 2 248889999999
Q ss_pred HHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHH-HHhCCCCCChHHHHHHHhCCCCCCHHHHHHHH
Q psy3038 376 IQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKN-MLAMGSSKPWPDAMEAITGQREMDASALLQYF 450 (499)
Q Consensus 376 lqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~-mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF 450 (499)
++.++++.+-+. +|+ ++++|..+++ +++.|+|+++.++++...|.. .+++++++-+
T Consensus 400 ~a~~~~~~~~~~-----------~~~-------n~~~g~~~~~~~L~~Ggs~~~~e~l~~~~g~~-~~~~~~~~~~ 456 (458)
T PF01432_consen 400 LAADLFSQFFEE-----------DPL-------NRETGRRFYKEFLSPGGSKDPLELLKKFLGRE-PSPDAFLKAL 456 (458)
T ss_dssp HHHHHHHHHHHC-----------HTT-------CHHHHHHHHHHHCTTTTSS-HHHHHHHCCSST-TSSHHHHHHH
T ss_pred HHHHHHHHHHhc-----------CCc-------chHHHHHHHHHHhcCCCCCCHHHHHHHhCCCC-CChHHHHHHc
Confidence 999999998885 222 3688888887 999999999999999987775 9999988765
No 13
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=99.14 E-value=1.2e-09 Score=119.85 Aligned_cols=229 Identities=15% Similarity=0.088 Sum_probs=149.8
Q ss_pred cccCcccccCCCCcceeeecCCCChhhHHHHHHHHHHHHHHH-HhcCCCCcccCCCCCCCCccccccccchhhhccccch
Q psy3038 165 VCHASAWDFHDGKDFRIKMCTRVNEEDLFTIHHEMGHVEYFI-QYKDQPMAFREGANPGKNTRGWTALKDGLSFYESWVA 243 (499)
Q Consensus 165 ~CH~sawdf~~~~D~RIk~Ct~~~~~d~~t~hHEmGHi~Y~~-~y~~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~ 243 (499)
.-+|+|+.++. ...+|.|+...+++|+.|+.||+||+...+ ...+||...+ ....-+-| .-|-+ -|+|+.
T Consensus 353 ~~Ga~~~~~~~-~~p~il~N~~~~~~dv~TLaHElGHa~H~~~~~~~~~~~~~-~~~~~~aE----~aS~~---~E~l~~ 423 (591)
T TIGR00181 353 RSGAYSIGGYK-VKPYILMNWDGTLNSVFTLAHELGHSMHSYFSSKHQPYPNS-DYSIFYAE----IASTF---NELLLA 423 (591)
T ss_pred CCCcccCCCCC-CCCeEEEecCCCcchHHHHHHHhhhHHHHHHHccCCCCccC-CCCchhhh----HhHHH---HHHHHH
Confidence 44788888885 688999999999999999999999998754 3467887655 34455555 22222 233111
Q ss_pred heeccccchhhHHHHhhhhcCCCcCccccccccccccCCCchhhhHHHHHHHhccccccccchhHHHHHHHHHhhcC---
Q psy3038 244 EIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINSDKTIFMVEAIFLFKIDTFLINTFVNIMMDLWRWGVFK--- 320 (499)
Q Consensus 244 ~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~aL~~i~~LPf~~~~~D~wRw~vf~--- 320 (499)
+ || +... ++.+.=..-++..+. +.+.++.- ..+.+.++..+++
T Consensus 424 ~-----------------------~l--------~~~~-~~~~~k~~~l~~~l~-~~~~~~~~-q~~~~~Fe~~~~~~~~ 469 (591)
T TIGR00181 424 D-----------------------YL--------LKNS-NDPEMKIYILLERIS-NFFGTFTR-QTLFAEFEYEAYELIE 469 (591)
T ss_pred H-----------------------HH--------HhhC-CCHHHHHHHHHHHHH-HHHHHHHH-HHHHHHHHHHHHHHHH
Confidence 1 11 1111 111100001111111 13334455 6777888888885
Q ss_pred --CCCChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceecCCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCC
Q psy3038 321 --KDISHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVASVPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSK 398 (499)
Q Consensus 321 --G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~~~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~ 398 (499)
+.++++++++.|-++.++|-|..+. ......+--..+.|+..++|+..|.+|.+..-|+|+...+.
T Consensus 470 ~~~~lt~~~l~~~~~~~~~~~~g~~~~-~~~~~~~~w~~~~Hfy~~fY~Y~Ya~g~~~a~~l~~~~~~~----------- 537 (591)
T TIGR00181 470 EGEPLTAETLNEIYANLLKKYFGDLVK-IDEGAGLTWMRIPHFYMGFYVYKYATGQVAATALYEKIKEE----------- 537 (591)
T ss_pred cCCCCCHHHHHHHHHHHHHHHcCCccc-cCCcccceeeecCcccCCCcCHHHHHHHHHHHHHHHHHHHC-----------
Confidence 5789999999999999998885221 11111111234568776689999999999999999866542
Q ss_pred CCCccccCcCcHHHhHHHHHHHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH
Q psy3038 399 QPLHECDIYRSTEAGNLLKNMLAMGSSKPWPDAMEAITGQREMDASALLQYFEPLYKWLEE 459 (499)
Q Consensus 399 ~pLh~Cdiy~sk~aG~~L~~mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~PL~~WL~~ 459 (499)
.+.+++..+++|+.|+|++..|+++.+ |-.-=+++++.+=++=+.+|+++
T Consensus 538 ----------~~~~~~~Y~~~L~~Ggs~~p~ell~~~-Gvd~~~~~~~~~~~~~~~~~i~~ 587 (591)
T TIGR00181 538 ----------GKGAVEKYLKFLKSGGSKYPLETLKIA-GVDLTKPQPWQAAINIFSDWIDE 587 (591)
T ss_pred ----------CccHHHHHHHHHhCcCCCCHHHHHHHc-CcCCCChHHHHHHHHHHHHHHHH
Confidence 113466677999999999999999984 98733445566767777777765
No 14
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=99.13 E-value=2.8e-09 Score=114.50 Aligned_cols=274 Identities=17% Similarity=0.213 Sum_probs=165.8
Q ss_pred CCchHHHHHcCCCHHHHHHHHHHHHHH-cCCCCCCh---HHhhhc-----ccccC------------CCCc-ccccCccc
Q psy3038 114 PDVSAELVRQGYTVHKMFRTAEEFFTS-INMSAMPP---EFWERS-----MLEKP------------QGRE-VVCHASAW 171 (499)
Q Consensus 114 ~dvt~~l~~q~yt~~~mf~~ae~ff~s-lGl~~mp~---~FW~~S-----~~~kp------------~~r~-~~CH~saw 171 (499)
.+++.+-++.-++.....+...+.+.. +|...-+- +-|..+ ++.+. -.|+ ...+|.++
T Consensus 145 ~~~~~~~~~~yf~~~~~~~~i~~~~~~lfg~~~~~~~~~~~w~~dv~~~~v~d~~~~~~~g~~ylD~~~R~gK~~Ga~~~ 224 (472)
T cd06455 145 YDVDQEKIREYFPLEVVIEGMLDIYQRLFGLRFEEVPDASVWHEDVRLYSVWDADTGEFLGYFYLDLHPREGKYGHAANF 224 (472)
T ss_pred cCCCHHHHhccCcHHHHHHHHHHHHHHHhCeEEEeCCCCCCCCCcceEEEEEECCCCCEEEEEEeecCCCCCCCCCcccc
Confidence 455566667778888888888888886 67654322 333322 11111 0122 24556666
Q ss_pred ccCCCCc----------ceeeecCCCC---------hhhHHHHHHHHHHHHHHHHhcCCCCcccCCCCCC--CCcccccc
Q psy3038 172 DFHDGKD----------FRIKMCTRVN---------EEDLFTIHHEMGHVEYFIQYKDQPMAFREGANPG--KNTRGWTA 230 (499)
Q Consensus 172 df~~~~D----------~RIk~Ct~~~---------~~d~~t~hHEmGHi~Y~~~y~~qP~~fr~gAn~g--fhE~~~~A 230 (499)
.+..+.. ..|.|....+ .+++.|+.||+||+...+--.. ....-.|.+.. |-| .
T Consensus 225 ~~~~~~~~~~g~~~~P~~~i~~Nf~~~~~~~p~ll~~~~V~TLfHEfGHalH~~ls~~-~~~~~sg~~~~~d~aE----~ 299 (472)
T cd06455 225 GLQPGFLLPDGSRQYPVAALVCNFPKPTADKPSLLRHDEVETFFHEFGHVIHHLLGRT-KYARFSGTRVERDFVE----A 299 (472)
T ss_pred ccccceecCCCCEeCCEEEEECcCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHhcCC-CccccCCCcCChhhhh----c
Confidence 6653321 2455555544 6999999999999998553222 22223444444 777 5
Q ss_pred ccchhhhccccchheeccccchhhHHHHhhhhcCCCcCccccccccccccCCCchhhhHHHHHHHh-------ccccccc
Q psy3038 231 LKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPFPLVINRKFGIRTKINSDKTIFMVEAIFLF-------KIDTFLI 303 (499)
Q Consensus 231 igd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~-------aL~~i~~ 303 (499)
-|-+ -|+|+.+-.....+.+-.. |++.+ . + ..|+.+. ++..+-
T Consensus 300 pS~~---~E~~~~~~~~l~~l~~h~~--------t~e~i--------~-------~---~li~~~~~~~~~~~~~~~~~- 349 (472)
T cd06455 300 PSQM---LENWCWEPEVLKRLSKHYK--------TGEKI--------P-------D---ELIERLIASRHFNRGLFYLR- 349 (472)
T ss_pred chHH---HHHHhcCHHHHHHHhhccC--------CCCCC--------C-------H---HHHHHHHHhhhhhHHHHHHH-
Confidence 5555 6775433111000000000 11111 1 1 2344444 222222
Q ss_pred cchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceecC--CCchhHHHHHHHHHHHH
Q psy3038 304 NTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVAS--VPYIRYFVSFVIQFQFH 381 (499)
Q Consensus 304 LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~~--~pYirYfl~~ilqfQf~ 381 (499)
.++.-.+...+++..-..+++++.|.++.++|.|+..... .. =++..-|+..+ .+|-.|..|.+++..++
T Consensus 350 ----q~~~a~fd~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~f~H~~~~Y~a~yY~Y~~a~~la~~~~ 421 (472)
T cd06455 350 ----QLFFALFDLALHTGDPADLDTTKLYNDLREEISLIPSTEG---TH-GPASFGHLAGGYDAGYYGYLWSEVFAADMF 421 (472)
T ss_pred ----HHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHhCCCCCCCC---CC-ccccccccccCcccchHHHHHHHHHHHHHH
Confidence 2333457777776432226899999999999999754211 11 12445698754 36779999999999999
Q ss_pred HHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHH-HHhCCCCCChHHHHHHHhCCCCCCHHHHHHH
Q psy3038 382 RALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKN-MLAMGSSKPWPDAMEAITGQREMDASALLQY 449 (499)
Q Consensus 382 ~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~-mm~~GsS~~W~e~le~~TG~~~lsa~~ll~Y 449 (499)
+.+.+. +|+ ++++|+.+++ +|+.|+|++..|+++.++|.. ++.++|+++
T Consensus 422 ~~~~~~-----------~~~-------~~~~g~~y~~~~L~~Ggs~~p~ell~~flGr~-~~~~a~~~~ 471 (472)
T cd06455 422 SSFFKD-----------GLL-------NPEVGLRYRDTVLAPGGSKDAADMLKDFLGRE-PNNDAFLKS 471 (472)
T ss_pred HHHHhc-----------CCC-------CHHHHHHHHHHHhcCcCCcCHHHHHHHHhCCC-CChhHHhhc
Confidence 887654 222 4689999985 999999999999999999976 999999864
No 15
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=99.04 E-value=1.2e-08 Score=108.48 Aligned_cols=200 Identities=19% Similarity=0.183 Sum_probs=127.4
Q ss_pred ChhhHHHHHHHHHHHHHHHH-hcCCCCcccCCCCCC--CCccccccccchhhhccccchheeccccchhhHHHHhhhhcC
Q psy3038 188 NEEDLFTIHHEMGHVEYFIQ-YKDQPMAFREGANPG--KNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKF 264 (499)
Q Consensus 188 ~~~d~~t~hHEmGHi~Y~~~-y~~qP~~fr~gAn~g--fhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~ 264 (499)
+.+++.|+.||+||+...+- -..++.+ .|.+.. |-| .-|-+ .|+|+.+-.+...+.+-..
T Consensus 205 ~~~~v~tLfHEfGHalH~~ls~~~~~~l--~~~~~~~d~~E----~pS~~---~E~~~~d~~vL~~~s~h~~-------- 267 (422)
T cd06456 205 THDEVTTLFHEFGHALHHLLTDVEYPSL--GGTNVEWDFVE----LPSQF---MENWAWEPEVLKLFAKHYE-------- 267 (422)
T ss_pred CHHHHHHHHHHHHHHHHHHHhcCCcccc--CCCcCchhHhh----ccHHH---HHHHhcCHHHHHHHHhhcC--------
Confidence 78999999999999997653 2344433 344333 556 44444 6776655211001100000
Q ss_pred CCcCccccccccccccCCCchhhhHHHHHHHh-------ccccccccchhHHHHHHHHHhhcCCCCC--hhhHhHHHHHH
Q psy3038 265 VPFPLVINRKFGIRTKINSDKTIFMVEAIFLF-------KIDTFLINTFVNIMMDLWRWGVFKKDIS--HEEYNKHWWKL 335 (499)
Q Consensus 265 tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~-------aL~~i~~LPf~~~~~D~wRw~vf~G~i~--~~~~N~~WW~l 335 (499)
|.+.+ .+ .-|+.+. ++..+- .+..-.+...+++.... .+.+...|.++
T Consensus 268 t~~~l---------------p~---~l~~~~~~~~~~~~~~~~~~-----ql~~a~fD~~lh~~~~~~~~~~~~~~~~~~ 324 (422)
T cd06456 268 TGEPL---------------PD---ELIDKLLAARNFNSGFATVR-----QLEFALLDLALHSLTDPEILDVVQFELDAL 324 (422)
T ss_pred CCCCC---------------CH---HHHHHHHHHhhHhHHHHHHH-----HHHHHHHHHHHhcCCcccccchHHHHHHHH
Confidence 11111 11 2344444 333332 23334566777754322 35678888888
Q ss_pred HHhcCcccCCCCCCCCCCCccCccceec-C--CCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHH
Q psy3038 336 RHDYQGVEPPTHRSEDDFDPGAKYHIVA-S--VPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEA 412 (499)
Q Consensus 336 r~~yqGI~pP~~R~e~dfDp~ak~Hi~~-~--~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~a 412 (499)
++++.|+.+...+ .-+++..|+.+ + .-|-+|..|.+++..+|.++.+. ++ -++++
T Consensus 325 ~~~~~~~~~~~~~----~~~~~f~Hi~~~gY~A~YYsYlws~vla~di~~~f~~~-----------~~-------~~~~~ 382 (422)
T cd06456 325 RKEGLVIPPPPPR----YFSNYFSHIFSGGYAAGYYSYKWAEVLDADAFSAFEEE-----------GI-------FNRET 382 (422)
T ss_pred HHhcCCCCCCCCC----CCCCccCcCCCCCcchhhHHHHHHHHHHHHHHHHHHhc-----------CC-------CCHHH
Confidence 8888877632221 12234469973 4 37899999999999999888764 22 24699
Q ss_pred hHHHHH-HHhCCCCCChHHHHHHHhCCCCCCHHHHHHHH
Q psy3038 413 GNLLKN-MLAMGSSKPWPDAMEAITGQREMDASALLQYF 450 (499)
Q Consensus 413 G~~L~~-mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF 450 (499)
|+.+++ +|+.|.|++..++++..+|.. .|.++|+++.
T Consensus 383 G~~~r~~iL~~Ggs~~~~e~~~~F~Gr~-p~~~a~l~~~ 420 (422)
T cd06456 383 GRRFRDTILSKGGSRDPMELFRAFRGRD-PSIEALLRRR 420 (422)
T ss_pred HHHHHHHHhhcCCCcCHHHHHHHhcCCC-CChHHHHHHc
Confidence 999996 999999999999999999975 9999999864
No 16
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=98.60 E-value=8.2e-07 Score=99.93 Aligned_cols=203 Identities=12% Similarity=0.076 Sum_probs=129.2
Q ss_pred ChhhHHHHHHHHHHHHHHHHhc-CCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCC
Q psy3038 188 NEEDLFTIHHEMGHVEYFIQYK-DQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVP 266 (499)
Q Consensus 188 ~~~d~~t~hHEmGHi~Y~~~y~-~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP 266 (499)
+.+++.|+.|||||+..-|--+ ..+.+=+...-.-|-| .=|-+ .|||+.+-.....+.+-.. |.
T Consensus 461 ~~~eV~TlFHEfGHalH~lls~~~y~~~sGt~v~~DfVE----~PSq~---mE~w~~~~~vL~~~a~Hy~--------Tg 525 (681)
T PRK10280 461 LWDDVITLFHEFGHTLHGLFARQRYATLSGTNTPRDFVE----FPSQI---NEHWASHPQVFARYARHYQ--------SG 525 (681)
T ss_pred CHHHHHHHHHHHHHHHHHHHhCCCccccCCCCCCcchhc----CcHHH---HHHHhcCHHHHHHHhhccC--------CC
Confidence 3679999999999999965532 3344434445667777 66666 8998887333111111111 22
Q ss_pred cCccccccccccccCCCchhhhHHHHHHHh-------ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhc
Q psy3038 267 FPLVINRKFGIRTKINSDKTIFMVEAIFLF-------KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDY 339 (499)
Q Consensus 267 ~hL~~~~~igLl~~~~~d~e~f~~~IN~L~-------aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~y 339 (499)
+.| .+ .-|++|. ++...-.|-| + ++|+--....... ...+....+++.++++
T Consensus 526 epi---------------P~---~l~~~l~~ar~~~~g~~~~~ql~~-a-l~D~~lH~~~~~~-~~~~~~~~~~~~~~~~ 584 (681)
T PRK10280 526 EAM---------------PD---ELQEKMRNASLFNKGYDMSELLSA-A-LLDMRWHCLEENE-AMQDVDDFELRALVAE 584 (681)
T ss_pred CCC---------------CH---HHHHHHHHhhCcchHHHHHHHHHH-H-HHhHHHhccCccc-ccccHHHHHHHHHHHh
Confidence 222 11 3355555 5555544444 3 2333222111111 1235666666777776
Q ss_pred Ccc-cCCCCCCCCCCCccCccceec-C--CCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHH
Q psy3038 340 QGV-EPPTHRSEDDFDPGAKYHIVA-S--VPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNL 415 (499)
Q Consensus 340 qGI-~pP~~R~e~dfDp~ak~Hi~~-~--~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~ 415 (499)
|+ .|+++. ..+ ++..-||.+ + .-|.+|..|.|++.-+|.++++. +|+ ++++|+.
T Consensus 585 -~~~~~~~~~--~~~-~~~F~Hif~ggY~AgYYsYlwaevlaaD~f~~f~~~-----------g~~-------n~~~G~~ 642 (681)
T PRK10280 585 -NLDLPAVPP--RYR-SSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQ-----------GGL-------TRENGQR 642 (681)
T ss_pred -CCCCCCCCC--CCC-CCcccccccCCcchhhHHHHHHHHHHHHHHHHHHhc-----------CCC-------CHHHHHH
Confidence 55 333332 222 344469985 4 38889999999999999888864 333 4799999
Q ss_pred HHH-HHhCCCCCChHHHHHHHhCCCCCCHHHHHHH
Q psy3038 416 LKN-MLAMGSSKPWPDAMEAITGQREMDASALLQY 449 (499)
Q Consensus 416 L~~-mm~~GsS~~W~e~le~~TG~~~lsa~~ll~Y 449 (499)
+|+ +|+.|.|++..++++...|.. -+.++||+.
T Consensus 643 fr~~iL~~GGs~d~~~~~~~FlGR~-P~~~alL~~ 676 (681)
T PRK10280 643 FREAILSRGNSTDLERLYRQWRGHA-PQIMPMLQH 676 (681)
T ss_pred HHHHHhhcCCCcCHHHHHHHhcCCC-CChHHHHHh
Confidence 999 999999999999999999975 899999864
No 17
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=98.60 E-value=7.6e-07 Score=95.65 Aligned_cols=199 Identities=16% Similarity=0.136 Sum_probs=122.6
Q ss_pred ChhhHHHHHHHHHHHHHHHHhc-CCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCC
Q psy3038 188 NEEDLFTIHHEMGHVEYFIQYK-DQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVP 266 (499)
Q Consensus 188 ~~~d~~t~hHEmGHi~Y~~~y~-~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP 266 (499)
+.+++.|+.||+||+...+--. ..+.+-....-.-|-| .-|-+ .|+|+.+-.+ -.. .-
T Consensus 245 ~~~~v~TLfHEfGHalH~~ls~~~~~~~sgt~~~~d~vE----~pS~~---~E~~~~~~~~------L~~--------~a 303 (458)
T cd06457 245 SPHEVETLFHEMGHAMHSMLGRTEYQHVSGTRCATDFVE----VPSIL---MEYFASDPRV------LKL--------FA 303 (458)
T ss_pred CHHHHHHHHHHHhHHHHHHHcCCCccccCCCCCCcchhh----cCHHH---HHHHHhhHHH------HHH--------Hh
Confidence 4689999999999999965533 2333323233566778 66666 7887766333 111 11
Q ss_pred cCccccccccccccCCCchhhhHHHHHHHh-------ccccccccchhHHHHHHHHHhhcCCCC-ChhhHhHHHHHHHHh
Q psy3038 267 FPLVINRKFGIRTKINSDKTIFMVEAIFLF-------KIDTFLINTFVNIMMDLWRWGVFKKDI-SHEEYNKHWWKLRHD 338 (499)
Q Consensus 267 ~hL~~~~~igLl~~~~~d~e~f~~~IN~L~-------aL~~i~~LPf~~~~~D~wRw~vf~G~i-~~~~~N~~WW~lr~~ 338 (499)
+|. .. | ..-.+ .-++.+. ++..+ + .++.-.+...+.+... ...+..+.|.++.++
T Consensus 304 ~h~---~t-~-----e~ip~---~l~~~l~~~~~~~~~~~~~---r--q~~~a~fD~~~H~~~~~~~~~~~~~~~~~~~~ 366 (458)
T cd06457 304 RHY---ST-G-----EPLPE---EMLARLLASKNSFAALETQ---Q--QIVYALLDQELHGEQPLSPTFTSDVLRDSTEI 366 (458)
T ss_pred ccc---CC-C-----CcCcH---HHHHHHHHhhhhhHHHHHH---H--HHHHHHHHHHHhCCCcccccCHHHHHHHHHHH
Confidence 222 10 0 00011 2344444 23222 2 2333446666664321 122456889999999
Q ss_pred cCcccCCCCCCCCCCCccCccceec-CCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHH
Q psy3038 339 YQGVEPPTHRSEDDFDPGAKYHIVA-SVPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLK 417 (499)
Q Consensus 339 yqGI~pP~~R~e~dfDp~ak~Hi~~-~~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~ 417 (499)
|.|+..+.. ...+ ++..-|+.. +..|..|..|.+++.+++....+. +++ ++++|+.++
T Consensus 367 ~~~~~~~~~--~~~~-~~~f~Hl~gy~a~YYsYl~a~vla~di~~~~f~~-----------~~~-------n~~~g~~y~ 425 (458)
T cd06457 367 FYGLPYVPG--GTAW-QLRFGHLVGYGATYYSYLFDRAIASKIWQKLFAA-----------DPL-------SREAGERLR 425 (458)
T ss_pred hcCCCCCCC--CCCC-ccccccccCccccchHHHHHHHHHHHHHHHHHHh-----------CCC-------CHHHHHHHH
Confidence 999754432 1111 234468872 345778999999999999866654 221 468999999
Q ss_pred H-HHhCCCCCChHHHHHHHhCCCCCCHHHH
Q psy3038 418 N-MLAMGSSKPWPDAMEAITGQREMDASAL 446 (499)
Q Consensus 418 ~-mm~~GsS~~W~e~le~~TG~~~lsa~~l 446 (499)
+ +|+.|+|++..|+++.++|.. ++.+.+
T Consensus 426 ~~iL~~Ggs~~p~e~l~~flGrd-p~~~~~ 454 (458)
T cd06457 426 EELLKHGGGKDPWELLAGVLGKP-PLVKGG 454 (458)
T ss_pred HHHccCCCCcCHHHHHHHHcCCC-CCCCcc
Confidence 6 999999999999999999986 665543
No 18
>PRK10911 oligopeptidase A; Provisional
Probab=98.56 E-value=1.1e-06 Score=98.94 Aligned_cols=202 Identities=16% Similarity=0.185 Sum_probs=125.7
Q ss_pred ChhhHHHHHHHHHHHHHHHH-hcCCCCccc-CCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCC
Q psy3038 188 NEEDLFTIHHEMGHVEYFIQ-YKDQPMAFR-EGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFV 265 (499)
Q Consensus 188 ~~~d~~t~hHEmGHi~Y~~~-y~~qP~~fr-~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~t 265 (499)
+.+++.|+.|||||+..-+- -...|.+-. .+.-.-|-| .=|-+ .|+|+.+-.....+ +.+|. |
T Consensus 460 ~~~~v~tlfHEfGHalH~~ls~~~~~~~sGt~~~~~D~vE----~pS~~---~E~~~~~~~vL~~~---a~H~~-----t 524 (680)
T PRK10911 460 THDEVITLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVE----LPSQF---MENWCWEPEALAFI---SGHYE-----T 524 (680)
T ss_pred CHHHHHHHHHHHhHHHHHHHhCCCcCcCCCcCCCCchHhh----ccHHH---HHHHhcCHHHHHHH---HHHhc-----C
Confidence 35999999999999998653 234444422 123457777 55555 88877662220111 11110 2
Q ss_pred CcCccccccccccccCCCchhhhHHHHHHHh-------ccccccccchhHHHHHHHHHhhcCCCC--ChhhHhHHHHHHH
Q psy3038 266 PFPLVINRKFGIRTKINSDKTIFMVEAIFLF-------KIDTFLINTFVNIMMDLWRWGVFKKDI--SHEEYNKHWWKLR 336 (499)
Q Consensus 266 P~hL~~~~~igLl~~~~~d~e~f~~~IN~L~-------aL~~i~~LPf~~~~~D~wRw~vf~G~i--~~~~~N~~WW~lr 336 (499)
.+.| .+ ..++.|. ++..+-.|-| -.+...+.+... ...+..+.|-+++
T Consensus 525 gepl---------------p~---~l~~~l~~~~~~~~~~~~~rql~~-----a~~D~~~H~~~~~~~~~~~~~~~~~~~ 581 (680)
T PRK10911 525 GEPL---------------PK---ELLDKMLAAKNYQAALFILRQLEF-----GLFDFRLHAEFDPDQGAKILETLAEIK 581 (680)
T ss_pred CCCC---------------CH---HHHHHHHHhhhhhhHHHHHHHHHH-----HHHHHHHhhccccccccCHHHHHHHHH
Confidence 2222 11 2344444 4444333333 224444443211 1125678888888
Q ss_pred HhcCcccCCCCCCCCCCCccCccceec-C--CCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHh
Q psy3038 337 HDYQGVEPPTHRSEDDFDPGAKYHIVA-S--VPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAG 413 (499)
Q Consensus 337 ~~yqGI~pP~~R~e~dfDp~ak~Hi~~-~--~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG 413 (499)
+++ |+.|+...+ .+ |..--||.+ + .-|.+|..|.+++.-+|..+.+. +|+ ++++|
T Consensus 582 ~~~-~~~~~~~~~--~~-~~~F~Hif~gGY~AgYYsYlwa~vla~d~~~~f~~~-----------g~~-------~~~~g 639 (680)
T PRK10911 582 KQV-AVVPSPSWG--RF-PHAFSHIFAGGYAAGYYSYLWADVLAADAFSRFEEE-----------GIF-------NRETG 639 (680)
T ss_pred HHc-CCCCCCCCC--cC-CccccccCcCCcccchHHHHHHHHHHHHHHHHHHhc-----------CCC-------CHHHH
Confidence 888 555553322 11 122359874 3 47899999999999999888864 332 46999
Q ss_pred HHHHH-HHhCCCCCChHHHHHHHhCCCCCCHHHHHHHH
Q psy3038 414 NLLKN-MLAMGSSKPWPDAMEAITGQREMDASALLQYF 450 (499)
Q Consensus 414 ~~L~~-mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF 450 (499)
+.+++ +|+.|.|++..++++...|.. -|.++||+..
T Consensus 640 ~~~r~~iL~~Ggs~~p~~~~~~F~GR~-P~~~all~~~ 676 (680)
T PRK10911 640 QSFLDNILSRGGSEEPMELFKRFRGRE-PQLDAMLEHY 676 (680)
T ss_pred HHHHHHHHhCcCCcCHHHHHHHhcCCC-CChHHHHHhc
Confidence 99986 999999999999999999975 8999999763
No 19
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=98.37 E-value=7.1e-06 Score=91.39 Aligned_cols=208 Identities=19% Similarity=0.224 Sum_probs=135.2
Q ss_pred hhhHHHHHHHHHHHHHHHHhc-CCCCcccCCCCCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCCCc
Q psy3038 189 EEDLFTIHHEMGHVEYFIQYK-DQPMAFREGANPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFVPF 267 (499)
Q Consensus 189 ~~d~~t~hHEmGHi~Y~~~y~-~qP~~fr~gAn~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~tP~ 267 (499)
.++..|+.|||||...-|--. +.|.+=+.+.---|-| -=|- |+|||..|......+.+-.. |-+
T Consensus 465 ~dEV~TLFHEfGHgLH~mlt~v~~~~vsGt~v~wDfVE----lPSQ---~mE~w~~~p~vL~~~a~Hy~--------TGe 529 (683)
T COG0339 465 HDEVTTLFHEFGHGLHHLLTRVKYPGVSGTNVPWDFVE----LPSQ---FMENWCWEPEVLAKYARHYQ--------TGE 529 (683)
T ss_pred HHHHHHHHHHhhhHHHHHhhcCCccccCCCCCCcchhh----ccHH---HHHHhhcCHHHHHHHHHhhc--------cCC
Confidence 689999999999999976643 4555533333444555 3344 49999999544222211221 222
Q ss_pred CccccccccccccCCCchhhhHHHHHHHhccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcccCCCC
Q psy3038 268 PLVINRKFGIRTKINSDKTIFMVEAIFLFKIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGVEPPTH 347 (499)
Q Consensus 268 hL~~~~~igLl~~~~~d~e~f~~~IN~L~aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI~pP~~ 347 (499)
.| .+ .|++. ....=|+.-++.++-+|-| + ++|+ +|..-.......+.-+-+-+.+++.-=+.+|..
T Consensus 530 ~l---P~-~ll~k-------~laaknf~~g~~t~rql~f-a-l~Dm-~~H~~~~~~~~~~i~~~~~~~~~~~~v~~~~~~ 595 (683)
T COG0339 530 PL---PK-ELLDK-------MLAAKNFQAGLFTLRQLEF-A-LFDM-RLHTEFDPDANADILEFEAEVLKKVAVLPSIPP 595 (683)
T ss_pred cC---HH-HHHHH-------HHHHHhhhhHHHHHHHHHH-H-HHHH-HhhccCCcccccCHHHHHHHHHHHhCCCCCcch
Confidence 22 00 01110 1122222226666666666 5 3443 333322334456666777788777766644444
Q ss_pred -CCCCCCCccCccceecC-C--CchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHH-HHhC
Q psy3038 348 -RSEDDFDPGAKYHIVAS-V--PYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKN-MLAM 422 (499)
Q Consensus 348 -R~e~dfDp~ak~Hi~~~-~--pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~-mm~~ 422 (499)
|-...| -||.++ . =|.+|.-++||..=-|.++.+. +|+ +++.|+.+++ +|+.
T Consensus 596 ~~~~~~F-----~HIFagGYsAGYYSY~WaeVLsaDafa~Fee~-----------g~~-------~~e~G~rfrd~ILs~ 652 (683)
T COG0339 596 RRRPHSF-----GHIFAGGYSAGYYSYLWAEVLSADAFAAFEEE-----------GPF-------NRETGQRFRDAILSR 652 (683)
T ss_pred hhccccc-----cceecCcccchhHHHHHHHHHhhHHHHHHHhc-----------CCC-------CHHHHHHHHHHHHhc
Confidence 344567 499875 3 7889999999999999999986 333 4699999999 9999
Q ss_pred CCCCChHHHHHHHhCCCCCCHHHHHHH
Q psy3038 423 GSSKPWPDAMEAITGQREMDASALLQY 449 (499)
Q Consensus 423 GsS~~W~e~le~~TG~~~lsa~~ll~Y 449 (499)
|.|++..++.++.-|.. -+.++||+.
T Consensus 653 GGS~dp~e~f~~frGre-p~~dalLr~ 678 (683)
T COG0339 653 GGSRDPMELFKAFRGRE-PSIDALLRH 678 (683)
T ss_pred cCCcCHHHHHHHHhcCC-CChhHHHHh
Confidence 99999999999999984 899999864
No 20
>KOG2090|consensus
Probab=97.98 E-value=7.3e-05 Score=82.33 Aligned_cols=131 Identities=18% Similarity=0.193 Sum_probs=91.1
Q ss_pred HHHHHHh-------ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcccCCCCCCC--CCCCccCccc
Q psy3038 290 VEAIFLF-------KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGVEPPTHRSE--DDFDPGAKYH 360 (499)
Q Consensus 290 ~~IN~L~-------aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI~pP~~R~e--~dfDp~ak~H 360 (499)
.-+|+|. |.+..-.+- +-++|+.=.+.-.-.+ .++-....-++.+++.| -++++-+. ..| -|
T Consensus 546 ~mv~~l~~s~n~~Aa~e~q~Qv~--ya~~Dq~fhg~~~~~~-~~~~~~~~~~v~~k~~~-~~~~~~taw~~rF-----sH 616 (704)
T KOG2090|consen 546 DMVNRLCESRNSFAAQETQRQVF--YALLDQEFHGIACPLI-AEDTTDLLSEVKRKFSG-LLYVPPTAWQLRF-----SH 616 (704)
T ss_pred HHHHHHHHHhccchhHHHHHHHH--HHHHHHHHhccccccc-ccchhHHHHHHHHhcCC-CCCCCCCccccch-----hh
Confidence 4567776 555553333 3355655444211112 22666777889999999 33333222 233 36
Q ss_pred eec-CCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHH-HHhCCCCCChHHHHHHHhCC
Q psy3038 361 IVA-SVPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKN-MLAMGSSKPWPDAMEAITGQ 438 (499)
Q Consensus 361 i~~-~~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~-mm~~GsS~~W~e~le~~TG~ 438 (499)
..+ +.=|.+|.++..++.=|++++|++ .|+ |.++|+++|+ +++.|.+++..++++.+.|.
T Consensus 617 l~gYGA~YYSYL~~r~~AS~IWq~~Fe~-----------dPf-------sR~aGek~r~eil~hGG~~~P~~lva~~L~~ 678 (704)
T KOG2090|consen 617 LVGYGATYYSYLFARAIASLIWQQLFEN-----------DPF-------SRKAGEKFRKEILKHGGGRDPAELVADILGK 678 (704)
T ss_pred hhccCchHHHHHHHHHHHHHHHHHHHhc-----------Ccc-------chhhhHHHHHHHHHhcCCCChHHHHHHHhcC
Confidence 553 567789999999999999999997 455 6699999977 99999999999999999999
Q ss_pred CCCCHHHHHH
Q psy3038 439 REMDASALLQ 448 (499)
Q Consensus 439 ~~lsa~~ll~ 448 (499)
+ |.++.+.+
T Consensus 679 ~-~~~~g~~~ 687 (704)
T KOG2090|consen 679 P-PLENGGVD 687 (704)
T ss_pred C-CCccchHH
Confidence 7 76665543
No 21
>KOG2089|consensus
Probab=97.86 E-value=0.00019 Score=79.84 Aligned_cols=205 Identities=19% Similarity=0.198 Sum_probs=130.5
Q ss_pred hhhHHHHHHHHHHHHHHHHhcCCCCcccCCC---CCCCCccccccccchhhhccccchheeccccchhhHHHHhhhhcCC
Q psy3038 189 EEDLFTIHHEMGHVEYFIQYKDQPMAFREGA---NPGKNTRGWTALKDGLSFYESWVAEIGLAGTVGSVVMRLLAMQKFV 265 (499)
Q Consensus 189 ~~d~~t~hHEmGHi~Y~~~y~~qP~~fr~gA---n~gfhE~~~~Aigd~~~~~e~~~~~iaL~~~~grS~~~~~~~q~~t 265 (499)
.+++.|..||.||+.-.+- ...++.-=.|. -.-|-| +-|.+ .||||-+--. + ++.+
T Consensus 493 ~~ev~t~FheFGh~~q~ll-~Qa~~~~fsG~~~vewDave----~psq~---Lenwv~~~d~--L--~~lS--------- 551 (718)
T KOG2089|consen 493 HDEVETLFHEFGHVLQHLL-TQADFARFSGPRNVEWDAVE----VPSQF---LENWVWDPDT--L--RSLS--------- 551 (718)
T ss_pred hHHHHHHHHHHhHHHHHHH-hcCccccccCcccCCcchhh----chHHH---HHHhccCchH--h--hhhc---------
Confidence 5799999999999997543 44444433333 345666 66666 7888666332 0 1222
Q ss_pred CcCccccccccccccCCCchhhhHHHHHHHhccccccccchhHHHHHHHHHhhcCCCC---ChhhHhHHHHHHHHhcCcc
Q psy3038 266 PFPLVINRKFGIRTKINSDKTIFMVEAIFLFKIDTFLINTFVNIMMDLWRWGVFKKDI---SHEEYNKHWWKLRHDYQGV 342 (499)
Q Consensus 266 P~hL~~~~~igLl~~~~~d~e~f~~~IN~L~aL~~i~~LPf~~~~~D~wRw~vf~G~i---~~~~~N~~WW~lr~~yqGI 342 (499)
+|.- .+-.|. + .-+|.|.+.......= ...-+.-|..|+=++ +.-+....|.+|..+|++
T Consensus 552 -~Hy~--tge~l~-------e---El~~kl~~~r~~~~gl---~tlrqL~~a~~D~~lht~~d~~~~~~~~~l~~~i~~- 614 (718)
T KOG2089|consen 552 -KHYK--TGEPLP-------E---ELLKKLILTRTVNAGL---FTLRQLVLADFDLELHTKTDADLADTYRQLCQEISI- 614 (718)
T ss_pred -cccc--CCCccH-------H---HHHHHHHHHHHHHHHH---HHHHHHHHHhhhHHHhhccccchHHHHHHhchhhee-
Confidence 3330 111111 1 2355555222221111 111222333333221 223445779999999999
Q ss_pred cCCCCCCCCCCCccCccceec-C--CCchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccccCcCcHHHhHHHHH-
Q psy3038 343 EPPTHRSEDDFDPGAKYHIVA-S--VPYIRYFVSFVIQFQFHRALCEKAGEFDPNDPSKQPLHECDIYRSTEAGNLLKN- 418 (499)
Q Consensus 343 ~pP~~R~e~dfDp~ak~Hi~~-~--~pYirYfl~~ilqfQf~~alC~~ag~~~~~~~~~~pLh~Cdiy~sk~aG~~L~~- 418 (499)
.|+.+ .+-=|.+.-||.+ + .-|.+|.-|.|++.=+|.+.-+..|+. +.+++|..+|+
T Consensus 615 ~p~~~---~d~~pcsF~hifa~gy~A~yY~yLWsEv~aaDif~t~fe~~g~~----------------N~~~~G~ryR~t 675 (718)
T KOG2089|consen 615 VPATP---GDNMPCSFGHIFAGGYAAGYYSYLWSEVLAADIFSTFFEQEGED----------------NIKEVGMRYRNT 675 (718)
T ss_pred cCCCC---CCCCCccccchhcCchHHHHHHHHHHHHHHHHHHHHHhhhcCCc----------------cHHHHHHHHHHh
Confidence 44443 3333555679998 4 378899999999999999877765553 45699999999
Q ss_pred HHhCCCCCChHHHHHHHhCCCCCCHHHHHHHHH
Q psy3038 419 MLAMGSSKPWPDAMEAITGQREMDASALLQYFE 451 (499)
Q Consensus 419 mm~~GsS~~W~e~le~~TG~~~lsa~~ll~YF~ 451 (499)
+|+.|++++-.|++++..|.. -|++|||.-..
T Consensus 676 iLa~GG~~~~~e~f~~FlGRe-PS~~Afl~s~g 707 (718)
T KOG2089|consen 676 ILAPGGGKDPMEVFKRFLGRE-PSQEAFLKSLG 707 (718)
T ss_pred hhcCCCCccHHHHHHHhhCCC-CChhHHHHhhc
Confidence 999999999999999999986 99999997654
No 22
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=49.60 E-value=14 Score=31.14 Aligned_cols=50 Identities=18% Similarity=0.041 Sum_probs=29.1
Q ss_pred CCcceeeecCCCC-hhh-HHHHHHHHHHHHHHHHhcCCCCcccCCCCCCCCccccccccch
Q psy3038 176 GKDFRIKMCTRVN-EED-LFTIHHEMGHVEYFIQYKDQPMAFREGANPGKNTRGWTALKDG 234 (499)
Q Consensus 176 ~~D~RIk~Ct~~~-~~d-~~t~hHEmGHi~Y~~~y~~qP~~fr~gAn~gfhE~~~~Aigd~ 234 (499)
+..-+|..+.... .+. ..++.||++|..-...... ...+..-|+| ++++.
T Consensus 8 ~~~~~i~~~~~~~~~~~~~~~l~HE~~H~~~~~~~~~-----~~~~~~W~~E----G~A~y 59 (128)
T PF13485_consen 8 PGFNRIVVYFQGSDEDWLDRVLAHELAHQWFGNYFGG-----DDNAPRWFNE----GLAEY 59 (128)
T ss_pred cCCCEEEEecCCCCHHHHHHHHHHHHHHHHHHHHcCC-----CccCchHHHH----HHHHH
Confidence 4455677665444 333 3899999999975444321 1233444666 66555
No 23
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=46.90 E-value=11 Score=33.87 Aligned_cols=18 Identities=22% Similarity=0.246 Sum_probs=13.4
Q ss_pred hhHHHHHHHHHHHHHHHHhcC
Q psy3038 190 EDLFTIHHEMGHVEYFIQYKD 210 (499)
Q Consensus 190 ~d~~t~hHEmGHi~Y~~~y~~ 210 (499)
+-..++.||+||+. +..|
T Consensus 104 ~~~~v~~HEiGHaL---GL~H 121 (154)
T PF00413_consen 104 DLQSVAIHEIGHAL---GLDH 121 (154)
T ss_dssp EHHHHHHHHHHHHT---TBES
T ss_pred hhhhhhhhcccccc---CcCc
Confidence 34578999999986 5554
No 24
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=46.48 E-value=27 Score=31.64 Aligned_cols=77 Identities=16% Similarity=0.116 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHHHH-cCCCCCChHH----hhhcccc-cCC-CCcccccCcccccCCCCcceeeecCC-C-ChhhHHHHHH
Q psy3038 127 VHKMFRTAEEFFTS-INMSAMPPEF----WERSMLE-KPQ-GREVVCHASAWDFHDGKDFRIKMCTR-V-NEEDLFTIHH 197 (499)
Q Consensus 127 ~~~mf~~ae~ff~s-lGl~~mp~~F----W~~S~~~-kp~-~r~~~CH~sawdf~~~~D~RIk~Ct~-~-~~~d~~t~hH 197 (499)
...+++.|-+.+.+ +++....-.- ....++. ... +....|++..-..|++..-.|..... . ...-..++.|
T Consensus 23 ~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~H 102 (167)
T cd00203 23 IQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLRGVGVLQDNQSGTKEGAQTIAH 102 (167)
T ss_pred HHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCCCcEEEecCCcccccchhhHHH
Confidence 46788888888887 6666422110 0111111 111 12345665544444332222222111 1 2467889999
Q ss_pred HHHHHH
Q psy3038 198 EMGHVE 203 (499)
Q Consensus 198 EmGHi~ 203 (499)
|+||+.
T Consensus 103 ElGH~L 108 (167)
T cd00203 103 ELGHAL 108 (167)
T ss_pred HHHHHh
Confidence 999997
No 25
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=45.38 E-value=26 Score=29.24 Aligned_cols=30 Identities=27% Similarity=0.424 Sum_probs=20.5
Q ss_pred cceeeecCCCC-hhhHHHHHHHHHHHHHHHH
Q psy3038 178 DFRIKMCTRVN-EEDLFTIHHEMGHVEYFIQ 207 (499)
Q Consensus 178 D~RIk~Ct~~~-~~d~~t~hHEmGHi~Y~~~ 207 (499)
.-.|-.....+ ...-+++.||+||+.....
T Consensus 28 ~~~I~in~~~~~~~~~f~laHELgH~~~~~~ 58 (122)
T PF06114_consen 28 NPIIFINSNLSPERQRFTLAHELGHILLHHG 58 (122)
T ss_dssp TTEEEEESSS-HHHHHHHHHHHHHHHHHHH-
T ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHHhhhc
Confidence 34455555544 6778999999999998544
No 26
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=41.76 E-value=16 Score=33.26 Aligned_cols=14 Identities=36% Similarity=0.370 Sum_probs=12.4
Q ss_pred hhHHHHHHHHHHHH
Q psy3038 190 EDLFTIHHEMGHVE 203 (499)
Q Consensus 190 ~d~~t~hHEmGHi~ 203 (499)
..+.++.||+||+.
T Consensus 93 ~~~~~~~HEiGHaL 106 (165)
T cd04268 93 RLRNTAEHELGHAL 106 (165)
T ss_pred HHHHHHHHHHHHHh
Confidence 56899999999996
No 27
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=39.22 E-value=35 Score=34.25 Aligned_cols=49 Identities=31% Similarity=0.427 Sum_probs=35.7
Q ss_pred cccCcccccCCCCcceeeecCCCC----------------------hhhHHHHHHHHHHHHHHHHhcCCCCccc
Q psy3038 165 VCHASAWDFHDGKDFRIKMCTRVN----------------------EEDLFTIHHEMGHVEYFIQYKDQPMAFR 216 (499)
Q Consensus 165 ~CH~sawdf~~~~D~RIk~Ct~~~----------------------~~d~~t~hHEmGHi~Y~~~y~~qP~~fr 216 (499)
.| .-.--|+++.+-.|+||+.-- -.-+.|+.||+||+... .-++|++=|
T Consensus 45 ~C-Ge~nA~ydPe~~~I~iCYEf~~~~~~~f~~~~~~~~~~~~~~~~~~~~~l~HE~GHAlI~--~~~lPv~Gr 115 (220)
T PF14247_consen 45 EC-GEDNAFYDPENRSITICYEFVDEILDRFAKANDPDEEYGQAAIGNVLFTLYHELGHALID--DLDLPVLGR 115 (220)
T ss_pred ec-CCCCCccCCCCCEEEECHHHHHHHHHHHHhCCcCcchHHHHHHHHHHHHHHHHHHHHHHH--HhcCCcccc
Confidence 48 767778999999999997521 12367899999999863 346676633
No 28
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=37.42 E-value=30 Score=31.66 Aligned_cols=13 Identities=31% Similarity=0.286 Sum_probs=10.7
Q ss_pred hHHHHHHHHHHHH
Q psy3038 191 DLFTIHHEMGHVE 203 (499)
Q Consensus 191 d~~t~hHEmGHi~ 203 (499)
-+-++.||+||+.
T Consensus 104 ~~~~~~HEiGHaL 116 (156)
T cd04279 104 LQAIALHELGHAL 116 (156)
T ss_pred HHHHHHHHhhhhh
Confidence 3467899999997
No 29
>PF04700 Baculo_gp41: Structural glycoprotein p40/gp41 conserved region; InterPro: IPR006790 This is a family of viral structural glycoproteins [] from the baculoviridae.; GO: 0005198 structural molecule activity, 0019012 virion
Probab=37.26 E-value=83 Score=30.98 Aligned_cols=60 Identities=25% Similarity=0.456 Sum_probs=45.5
Q ss_pred ChhhHhHHHHHHHHhcCcccCCCCCCCCCCCccCccceecC--CCchhHH-----------HHHHHHHHHHHHHHHHh
Q psy3038 324 SHEEYNKHWWKLRHDYQGVEPPTHRSEDDFDPGAKYHIVAS--VPYIRYF-----------VSFVIQFQFHRALCEKA 388 (499)
Q Consensus 324 ~~~~~N~~WW~lr~~yqGI~pP~~R~e~dfDp~ak~Hi~~~--~pYirYf-----------l~~ilqfQf~~alC~~a 388 (499)
+.+++-+++-+|.++|.|..-|. +..-||+.- +.+. ..|...| ||.++|+|+=+|.-+..
T Consensus 6 sDe~Li~yY~~L~K~~g~~~~~~--~~~IF~~sF---v~sp~mksYA~KFYn~~l~eAA~hLs~~vkYQiA~AVT~n~ 78 (186)
T PF04700_consen 6 SDEELIEYYANLEKKYGGSDVPS--AHGIFDKSF---VISPIMKSYADKFYNRRLNEAARHLSDVVKYQIAEAVTQNK 78 (186)
T ss_pred cHHHHHHHHHHHHHHhCCCCCCc--ccccCchhh---hcCcchHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHcCC
Confidence 35789999999999999999887 456888642 3232 2455444 89999999999998874
No 30
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=36.67 E-value=16 Score=33.63 Aligned_cols=14 Identities=29% Similarity=0.437 Sum_probs=11.9
Q ss_pred hhHHHHHHHHHHHH
Q psy3038 190 EDLFTIHHEMGHVE 203 (499)
Q Consensus 190 ~d~~t~hHEmGHi~ 203 (499)
.-+.++.||+||+.
T Consensus 106 ~~~~~~~HEiGHaL 119 (157)
T cd04278 106 DLFSVAAHEIGHAL 119 (157)
T ss_pred hHHHHHHHHhcccc
Confidence 45779999999995
No 31
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=35.32 E-value=23 Score=33.44 Aligned_cols=16 Identities=38% Similarity=0.362 Sum_probs=14.0
Q ss_pred hhhHHHHHHHHHHHHH
Q psy3038 189 EEDLFTIHHEMGHVEY 204 (499)
Q Consensus 189 ~~d~~t~hHEmGHi~Y 204 (499)
...+.|+.||+||+.-
T Consensus 111 ~~~~~t~~HEiGHaLG 126 (186)
T cd04277 111 SYGYQTIIHEIGHALG 126 (186)
T ss_pred hhhHHHHHHHHHHHhc
Confidence 4679999999999984
No 32
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=34.22 E-value=25 Score=34.13 Aligned_cols=22 Identities=27% Similarity=0.217 Sum_probs=16.5
Q ss_pred hhhHHHHHHHHHHHHHHHHhcC
Q psy3038 189 EEDLFTIHHEMGHVEYFIQYKD 210 (499)
Q Consensus 189 ~~d~~t~hHEmGHi~Y~~~y~~ 210 (499)
..-+.|+.|||||+.--+-.-.
T Consensus 20 ~~~l~t~~HE~gHal~a~l~G~ 41 (200)
T PF13398_consen 20 FRLLVTFVHELGHALAALLTGG 41 (200)
T ss_pred HHHHHHHHHHHHHHHHHHHhCC
Confidence 3457799999999998655443
No 33
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=32.88 E-value=22 Score=30.89 Aligned_cols=12 Identities=42% Similarity=0.548 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHH
Q psy3038 192 LFTIHHEMGHVE 203 (499)
Q Consensus 192 ~~t~hHEmGHi~ 203 (499)
..++.||+||..
T Consensus 108 ~~~~~HEiGH~l 119 (124)
T PF13582_consen 108 VDTFAHEIGHNL 119 (124)
T ss_dssp TTHHHHHHHHHT
T ss_pred ceEeeehhhHhc
Confidence 489999999974
No 34
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=32.24 E-value=23 Score=31.59 Aligned_cols=26 Identities=23% Similarity=0.280 Sum_probs=16.7
Q ss_pred CCcceeeecCCCChhhHHHHHHHHHHHH
Q psy3038 176 GKDFRIKMCTRVNEEDLFTIHHEMGHVE 203 (499)
Q Consensus 176 ~~D~RIk~Ct~~~~~d~~t~hHEmGHi~ 203 (499)
+++.+|.... ....+-++.||+||+.
T Consensus 73 ~g~~~~~~~~--~~~~~~~~~HEigHaL 98 (140)
T smart00235 73 GGDQHFSLGN--GCINTGVAAHELGHAL 98 (140)
T ss_pred CCceEEEccC--CcCCcccHHHHHHHHh
Confidence 4566665522 1224559999999997
No 35
>PRK04897 heat shock protein HtpX; Provisional
Probab=31.22 E-value=94 Score=32.11 Aligned_cols=67 Identities=19% Similarity=0.187 Sum_probs=39.7
Q ss_pred HHHHHHHHHHHHHcCCCCCChHHhhhcccccCCCCcccccCcccccCCCC-cceeeecC--CCC-hhhHHHHHHHHHHHH
Q psy3038 128 HKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQGREVVCHASAWDFHDGK-DFRIKMCT--RVN-EEDLFTIHHEMGHVE 203 (499)
Q Consensus 128 ~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~~~-D~RIk~Ct--~~~-~~d~~t~hHEmGHi~ 203 (499)
.++.+..++.-...|++ +|+-+.-. +...-|+|+.++..+ -+-+++.. ..+ .|=--.+.||+||+.
T Consensus 80 p~L~~~v~~la~~~gip-~p~v~v~~---------~~~~NAfa~G~~~~~~~v~vt~gLl~~l~~~El~aVlAHElgHi~ 149 (298)
T PRK04897 80 PELWHIVEDMAMVAQIP-MPRVFIID---------DPSPNAFATGSSPKNAAVAVTTGLLAIMNREELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHcCCC-CCcEEEec---------CCCCceEEeccCCCCcEEEeehHHHhhCCHHHHHHHHHHHHHHHh
Confidence 36889999999999997 55433221 113455666665322 23333211 124 444556899999987
Q ss_pred H
Q psy3038 204 Y 204 (499)
Q Consensus 204 Y 204 (499)
.
T Consensus 150 ~ 150 (298)
T PRK04897 150 N 150 (298)
T ss_pred c
Confidence 4
No 36
>PF14891 Peptidase_M91: Effector protein
Probab=30.73 E-value=35 Score=32.32 Aligned_cols=19 Identities=16% Similarity=0.329 Sum_probs=16.5
Q ss_pred hhhHHHHHHHHHHHHHHHH
Q psy3038 189 EEDLFTIHHEMGHVEYFIQ 207 (499)
Q Consensus 189 ~~d~~t~hHEmGHi~Y~~~ 207 (499)
..-+..+.|||+|++.++.
T Consensus 101 ~~p~v~L~HEL~HA~~~~~ 119 (174)
T PF14891_consen 101 RPPFVVLYHELIHAYDYMN 119 (174)
T ss_pred HHHHHHHHHHHHHHHHHHC
Confidence 4558999999999999876
No 37
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=30.65 E-value=34 Score=32.37 Aligned_cols=13 Identities=38% Similarity=0.560 Sum_probs=11.7
Q ss_pred hHHHHHHHHHHHH
Q psy3038 191 DLFTIHHEMGHVE 203 (499)
Q Consensus 191 d~~t~hHEmGHi~ 203 (499)
+..++.||+||..
T Consensus 111 ~~~~~aHElGH~l 123 (173)
T PF13574_consen 111 GIDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHHH
T ss_pred eeeeehhhhHhhc
Confidence 7788999999986
No 38
>PRK03982 heat shock protein HtpX; Provisional
Probab=30.49 E-value=86 Score=32.06 Aligned_cols=64 Identities=20% Similarity=0.242 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHHcCCCCCChHHhhhcccccCCCCcccccCcccccCCCCccee-------eecCCCChhhHHHHHHHHH
Q psy3038 128 HKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQGREVVCHASAWDFHDGKDFRI-------KMCTRVNEEDLFTIHHEMG 200 (499)
Q Consensus 128 ~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~~~D~RI-------k~Ct~~~~~d~~t~hHEmG 200 (499)
.++.+..++.-+..|++. |+ +..-+. ...-|+++.++ +++-+| ++++ ..|=.-.+.||+|
T Consensus 68 p~L~~~v~~la~~~g~~~-p~------v~v~~~---~~~NAfa~G~~-~~~~~V~vt~gLl~~l~--~~El~AVlAHElg 134 (288)
T PRK03982 68 PELYRIVERLAERANIPK-PK------VAIVPT---QTPNAFATGRD-PKHAVVAVTEGILNLLN--EDELEGVIAHELT 134 (288)
T ss_pred HHHHHHHHHHHHHcCCCC-Ce------EEEEeC---CCcceEEeccC-CCCeEEEeehHHHhhCC--HHHHHHHHHHHHH
Confidence 357888889999999974 42 221121 12455666555 223333 2221 2444566899999
Q ss_pred HHHH
Q psy3038 201 HVEY 204 (499)
Q Consensus 201 Hi~Y 204 (499)
|+..
T Consensus 135 Hi~~ 138 (288)
T PRK03982 135 HIKN 138 (288)
T ss_pred HHHc
Confidence 9975
No 39
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=29.43 E-value=49 Score=33.02 Aligned_cols=27 Identities=30% Similarity=0.379 Sum_probs=20.5
Q ss_pred CcceeeecCCCChhhHHHHHHHHHHHHH
Q psy3038 177 KDFRIKMCTRVNEEDLFTIHHEMGHVEY 204 (499)
Q Consensus 177 ~D~RIk~Ct~~~~~d~~t~hHEmGHi~Y 204 (499)
.++=|-.|- ......+|+.||+||+.-
T Consensus 59 ~~I~iN~n~-~~~r~rFtlAHELGH~ll 85 (213)
T COG2856 59 PVIYINANN-SLERKRFTLAHELGHALL 85 (213)
T ss_pred ceEEEeCCC-CHHHHHHHHHHHHhHHHh
Confidence 455566665 457899999999999874
No 40
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=28.88 E-value=36 Score=31.33 Aligned_cols=30 Identities=20% Similarity=0.409 Sum_probs=20.7
Q ss_pred cceeeecCCCC----hhhHH-HHHHHHHHHHHHHH
Q psy3038 178 DFRIKMCTRVN----EEDLF-TIHHEMGHVEYFIQ 207 (499)
Q Consensus 178 D~RIk~Ct~~~----~~d~~-t~hHEmGHi~Y~~~ 207 (499)
+-+|.....+- .+.+. |+.|||.|+..++.
T Consensus 41 ~~~I~ln~~l~~~~~~~~l~~~l~HEm~H~~~~~~ 75 (146)
T smart00731 41 SAEIRLNPKLLTENGRDRLRETLLHELCHAALYLF 75 (146)
T ss_pred CCEEEeCHHHHhhccHHHHHhhHHHHHHHHHHHHh
Confidence 45566665543 24444 89999999998873
No 41
>PRK05457 heat shock protein HtpX; Provisional
Probab=28.79 E-value=1.1e+02 Score=31.57 Aligned_cols=65 Identities=15% Similarity=0.128 Sum_probs=38.2
Q ss_pred HHHHHHHHHHHHcCCCCCChHHhhhcccccCCCCcccccCcccccCCCCcceeeecC----CCC-hhhHHHHHHHHHHHH
Q psy3038 129 KMFRTAEEFFTSINMSAMPPEFWERSMLEKPQGREVVCHASAWDFHDGKDFRIKMCT----RVN-EEDLFTIHHEMGHVE 203 (499)
Q Consensus 129 ~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~~~D~RIk~Ct----~~~-~~d~~t~hHEmGHi~ 203 (499)
.+++..++.-...|++ +|+=+. . + ....-|+|+..+ +++-+|-..+ ..+ .|=--.+.||+||+.
T Consensus 78 ~L~~~v~~la~~~g~p-~p~v~v----~--~---~~~~NAfa~G~~-~~~~~V~vt~gLl~~L~~~El~aVlAHElgHi~ 146 (284)
T PRK05457 78 WLVETVARQARQAGIG-MPEVAI----Y--H---SPEINAFATGAS-KNNSLVAVSTGLLQNMSRDEVEAVLAHEISHIA 146 (284)
T ss_pred HHHHHHHHHHHhCCCC-CCCEEE----E--e---CCCceEEEecCC-CCCeEEEeehHHhhhCCHHHHHHHHHHHHHHHH
Confidence 4788899999999998 554321 1 1 112445555544 3343443333 134 444556899999986
Q ss_pred H
Q psy3038 204 Y 204 (499)
Q Consensus 204 Y 204 (499)
.
T Consensus 147 ~ 147 (284)
T PRK05457 147 N 147 (284)
T ss_pred c
Confidence 4
No 42
>PF09345 DUF1987: Domain of unknown function (DUF1987); InterPro: IPR018530 This family of proteins are functionally uncharacterised.
Probab=26.24 E-value=61 Score=28.59 Aligned_cols=23 Identities=26% Similarity=0.681 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHhhcCCccC
Q psy3038 447 LQYFEPLYKWLEEENARTGEHIG 469 (499)
Q Consensus 447 l~YF~PL~~WL~~~N~~~~~~iG 469 (499)
.++|+|+.+||++--+..+..|-
T Consensus 24 ~~Fy~Pi~~wl~~Yl~~~~~~i~ 46 (99)
T PF09345_consen 24 FAFYQPILDWLEAYLAEPNKPIT 46 (99)
T ss_pred HHHHHHHHHHHHHHHhCCCCcEE
Confidence 37999999999998775444443
No 43
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=25.83 E-value=21 Score=35.50 Aligned_cols=14 Identities=29% Similarity=0.508 Sum_probs=11.8
Q ss_pred hhHHHHHHHHHHHH
Q psy3038 190 EDLFTIHHEMGHVE 203 (499)
Q Consensus 190 ~d~~t~hHEmGHi~ 203 (499)
..+.|+.|||||..
T Consensus 144 ~~~~t~AHElGHnL 157 (228)
T cd04271 144 NEWQVFAHEIGHTF 157 (228)
T ss_pred ccceehhhhhhhhc
Confidence 35589999999987
No 44
>PRK03001 M48 family peptidase; Provisional
Probab=25.82 E-value=1.2e+02 Score=30.80 Aligned_cols=65 Identities=15% Similarity=0.135 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHcCCCCCChHHhhhcccccCCCCcccccCcccccCCCCcceee--ecC--CCC-hhhHHHHHHHHHHHH
Q psy3038 129 KMFRTAEEFFTSINMSAMPPEFWERSMLEKPQGREVVCHASAWDFHDGKDFRIK--MCT--RVN-EEDLFTIHHEMGHVE 203 (499)
Q Consensus 129 ~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~~~D~RIk--~Ct--~~~-~~d~~t~hHEmGHi~ 203 (499)
.+.+..++.-...|++. |+ +..-+. ...-|++..++ +..-||- .=. ..+ .|=-..+.||+||+.
T Consensus 68 ~L~~~v~~la~~~g~~~-p~------v~v~~~---~~~NAfa~G~~-~~~~~Ivvt~gLl~~l~~~El~aVlAHElgHi~ 136 (283)
T PRK03001 68 QFYRMVRELAQRAGLPM-PK------VYLINE---DQPNAFATGRN-PEHAAVAATTGILRVLSEREIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHcCCCC-Ce------EEEecC---CCcceEEecCC-CCCeEEEecHHHHhhCCHHHHHHHHHHHHHHHh
Confidence 57788899999999985 42 221111 23445555443 2223342 100 113 444566899999998
Q ss_pred H
Q psy3038 204 Y 204 (499)
Q Consensus 204 Y 204 (499)
.
T Consensus 137 ~ 137 (283)
T PRK03001 137 H 137 (283)
T ss_pred C
Confidence 5
No 45
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=25.03 E-value=51 Score=30.80 Aligned_cols=24 Identities=25% Similarity=0.247 Sum_probs=15.4
Q ss_pred eeeecCCCC--hhhHHHHHHHHHHHH
Q psy3038 180 RIKMCTRVN--EEDLFTIHHEMGHVE 203 (499)
Q Consensus 180 RIk~Ct~~~--~~d~~t~hHEmGHi~ 203 (499)
+|-+-+..+ .+....+.||+|||.
T Consensus 64 ~I~l~~~~~qgy~~~RIaaHE~GHiL 89 (132)
T PF02031_consen 64 YIFLDYQQNQGYNSTRIAAHELGHIL 89 (132)
T ss_dssp EEEEEHHHHHHS-HHHHHHHHHHHHH
T ss_pred EEEechHHhhCCccceeeeehhcccc
Confidence 444433333 346778999999997
No 46
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=24.92 E-value=46 Score=31.28 Aligned_cols=16 Identities=31% Similarity=0.439 Sum_probs=14.0
Q ss_pred ChhhHHHHHHHHHHHH
Q psy3038 188 NEEDLFTIHHEMGHVE 203 (499)
Q Consensus 188 ~~~d~~t~hHEmGHi~ 203 (499)
+.....++.||+||..
T Consensus 139 ~~~~~~~~AHEiGH~l 154 (196)
T PF13688_consen 139 TYNGAITFAHEIGHNL 154 (196)
T ss_dssp HHHHHHHHHHHHHHHT
T ss_pred CCceehhhHHhHHHhc
Confidence 3778899999999986
No 47
>PF10216 ChpXY: CO2 hydration protein (ChpXY); InterPro: IPR010220 This small family of proteins includes paralogs ChpX and ChpY in Synechococcus sp. (strain PCC 7942) (Anacystis nidulans R2) and other cyanobacteria, associated with distinct NAD(P)H dehydrogenase complexes. These proteins collectively enable light-dependent CO2 hydration and CO2 uptake; loss of both blocks growth at low CO2 concentrations.
Probab=24.87 E-value=40 Score=35.71 Aligned_cols=63 Identities=17% Similarity=0.287 Sum_probs=39.3
Q ss_pred CCcCccccccccccccCCCchhhhHHHHHHHh--ccccccccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcC
Q psy3038 265 VPFPLVINRKFGIRTKINSDKTIFMVEAIFLF--KIDTFLINTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQ 340 (499)
Q Consensus 265 tP~hL~~~~~igLl~~~~~d~e~f~~~IN~L~--aL~~i~~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yq 340 (499)
||.||. +-||++.+.----+ +| ..|-++ .-.-+.+.|| .+-|+|+++...+-.+||.=|-.|.
T Consensus 15 sP~nll--eVVgilkSYgvvlD-aY-srnL~yiae~qFL~~fPf---------FKYFnGe~~~~kl~~H~whDRIN~E 79 (353)
T PF10216_consen 15 SPENLL--EVVGILKSYGVVLD-AY-SRNLIYIAERQFLNPFPF---------FKYFNGEISLPKLLRHLWHDRINFE 79 (353)
T ss_pred ChHHHH--HHHHHHHHhcchHH-HH-HHHHHHHHHhhhcccchH---------HHHhCCCCCHHHHHHHHhhhhhhhH
Confidence 888883 44798885421111 11 234444 3333445666 2458999999999999998776553
No 48
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=24.70 E-value=38 Score=33.89 Aligned_cols=12 Identities=33% Similarity=0.432 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHH
Q psy3038 192 LFTIHHEMGHVE 203 (499)
Q Consensus 192 ~~t~hHEmGHi~ 203 (499)
-.|+.|||||..
T Consensus 168 a~t~AHElGHnl 179 (244)
T cd04270 168 DLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHhc
Confidence 389999999986
No 49
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=24.35 E-value=46 Score=32.01 Aligned_cols=15 Identities=33% Similarity=0.270 Sum_probs=12.0
Q ss_pred hhHHHHHHHHHHHHH
Q psy3038 190 EDLFTIHHEMGHVEY 204 (499)
Q Consensus 190 ~d~~t~hHEmGHi~Y 204 (499)
..+-++.||+||+.=
T Consensus 91 ~~~~~i~HElgHaLG 105 (198)
T cd04327 91 EFSRVVLHEFGHALG 105 (198)
T ss_pred hHHHHHHHHHHHHhc
Confidence 345799999999973
No 50
>PRK03072 heat shock protein HtpX; Provisional
Probab=21.64 E-value=1.6e+02 Score=30.39 Aligned_cols=37 Identities=11% Similarity=-0.086 Sum_probs=26.7
Q ss_pred ccchhHHHHHHHHHhhcCCCCChhhHhHHHHHHHHhcCcc
Q psy3038 303 INTFVNIMMDLWRWGVFKKDISHEEYNKHWWKLRHDYQGV 342 (499)
Q Consensus 303 ~LPf~~~~~D~wRw~vf~G~i~~~~~N~~WW~lr~~yqGI 342 (499)
+-|.-+|.+|..=.++- | +++.+-+++.++....+..
T Consensus 201 ~SR~rE~~AD~~A~~l~-~--~p~~La~AL~KL~~~~~~~ 237 (288)
T PRK03072 201 ISRSREYQADESGAELT-G--DPLALASALRKISGGVQAA 237 (288)
T ss_pred HHhHHHHHHhHHHHHHh-C--CHHHHHHHHHHHHhccccC
Confidence 34545888888877772 3 5788999999998766543
No 51
>KOG3728|consensus
Probab=20.85 E-value=2e+02 Score=29.73 Aligned_cols=85 Identities=18% Similarity=0.271 Sum_probs=58.1
Q ss_pred CCCHHHHHHHHHHHHHHcCCCCCChHHhhhcccccCCCCcccccCc-ccccCCCCcceeeecCCCC--hhhHHHHHHHHH
Q psy3038 124 GYTVHKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQGREVVCHAS-AWDFHDGKDFRIKMCTRVN--EEDLFTIHHEMG 200 (499)
Q Consensus 124 ~yt~~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~~~CH~s-awdf~~~~D~RIk~Ct~~~--~~d~~t~hHEmG 200 (499)
|=|+.+|-.+|+-+-..+|++. |.+....|--+ -.-||. +-=-.|.... ..+|..+.|||=
T Consensus 58 GGtp~Rmk~~a~~~~~el~~~~-------------~~~~~di~a~sdRyamYK---vGPVl~vsHGmGtpS~SImlhEli 121 (308)
T KOG3728|consen 58 GGTPSRMKQFALYLRDELGVSC-------------SGDPVDICARSDRYAMYK---VGPVLCVSHGMGTPSFSIMLHELI 121 (308)
T ss_pred CCCHHHHHHHHHHHHHHhCCCC-------------CCCCcchhcccceeEEEe---ecceEEEecCCCCccHHHHHHHHH
Confidence 5599999999999999999997 54332234332 122221 1112355544 579999999999
Q ss_pred HHHHHHHhcCCCCcccCCCCCCCCc
Q psy3038 201 HVEYFIQYKDQPMAFREGANPGKNT 225 (499)
Q Consensus 201 Hi~Y~~~y~~qP~~fr~gAn~gfhE 225 (499)
...|+-.. .-|+++|-|.+-|+-=
T Consensus 122 KLl~~Arc-kdp~~iRiGT~GGiGv 145 (308)
T KOG3728|consen 122 KLLYYARC-KDPVFIRIGTCGGIGV 145 (308)
T ss_pred HHHHHccC-CCceEEEEeccCccCC
Confidence 99886543 4599999888887744
No 52
>PRK02870 heat shock protein HtpX; Provisional
Probab=20.59 E-value=1.7e+02 Score=31.18 Aligned_cols=68 Identities=16% Similarity=0.300 Sum_probs=37.9
Q ss_pred HHHHHHHHHHHHHHcCCCCCChHHhhhcccccCCCCcccccCcccccCCCCcceee--ecC--CCC-hhhHHHHHHHHHH
Q psy3038 127 VHKMFRTAEEFFTSINMSAMPPEFWERSMLEKPQGREVVCHASAWDFHDGKDFRIK--MCT--RVN-EEDLFTIHHEMGH 201 (499)
Q Consensus 127 ~~~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~~~D~RIk--~Ct--~~~-~~d~~t~hHEmGH 201 (499)
..++.+..++.-...|++.+|.=|. + +. ..--|+|+.++ +.+-+|- .=. ..+ .|=--.+.||+||
T Consensus 114 ~~~L~~~ve~La~~ag~p~~p~V~v----i--~~---~~~NAFA~G~~-~~~~~Ivvt~GLL~~L~~dEL~aVlAHELgH 183 (336)
T PRK02870 114 ERQLYNVVEELLVAAGLRFMPKVYI----I--DA---PYMNAFASGYS-EKSAMVAITTGLLEKLDRDELQAVMAHELSH 183 (336)
T ss_pred hHHHHHHHHHHHHHcCCCCCCeEEE----E--cC---CCCceEEecCC-CCCcEEEEehHHhhhCCHHHHHHHHHHHHHH
Confidence 3467788888888899986663321 1 11 12345555554 2333332 111 123 3344567899999
Q ss_pred HHH
Q psy3038 202 VEY 204 (499)
Q Consensus 202 i~Y 204 (499)
+..
T Consensus 184 ik~ 186 (336)
T PRK02870 184 IRH 186 (336)
T ss_pred HHc
Confidence 953
No 53
>PRK02391 heat shock protein HtpX; Provisional
Probab=20.51 E-value=1.8e+02 Score=30.11 Aligned_cols=64 Identities=19% Similarity=0.186 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHcCCCCCChHHhhhcccccCCCCcccccCcccccCCCCcceeeecCC-----CC-hhhHHHHHHHHHHH
Q psy3038 129 KMFRTAEEFFTSINMSAMPPEFWERSMLEKPQGREVVCHASAWDFHDGKDFRIKMCTR-----VN-EEDLFTIHHEMGHV 202 (499)
Q Consensus 129 ~mf~~ae~ff~slGl~~mp~~FW~~S~~~kp~~r~~~CH~sawdf~~~~D~RIk~Ct~-----~~-~~d~~t~hHEmGHi 202 (499)
++.+..++.-...|++ +|+-|.-. +..--|+++.++. .. |+-..|+ .+ .|=-..+.||+||+
T Consensus 77 ~L~~~v~~la~~~~~~-~p~v~v~~---------~~~~NAfa~G~~~-~~-~~V~vt~gLl~~L~~~El~aVlaHElgHi 144 (296)
T PRK02391 77 ELHAMVERLCALADLP-KPRVAVAD---------SDVPNAFATGRSP-KN-AVVCVTTGLMRRLDPDELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHcCCC-CCcEEEEe---------CCCCceEEecCCC-CC-cEEEecHHHHhhCCHHHHHHHHHHHHHHH
Confidence 6778888889999998 45433211 1122334444432 22 2222221 24 44556689999998
Q ss_pred HH
Q psy3038 203 EY 204 (499)
Q Consensus 203 ~Y 204 (499)
..
T Consensus 145 ~~ 146 (296)
T PRK02391 145 KN 146 (296)
T ss_pred Hc
Confidence 64
Done!