Query 038652
Match_columns 174
No_of_seqs 132 out of 335
Neff 6.0
Searched_HMMs 46136
Date Fri Mar 29 13:15:25 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/038652.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/038652hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF08284 RVP_2: Retroviral asp 99.9 2E-23 4.4E-28 161.4 9.2 92 41-134 12-131 (135)
2 cd05479 RP_DDI RP_DDI; retrope 99.7 2.2E-17 4.8E-22 125.2 7.9 85 47-132 13-124 (124)
3 PF13975 gag-asp_proteas: gag- 99.6 1E-14 2.3E-19 101.1 8.3 67 44-110 2-69 (72)
4 cd05484 retropepsin_like_LTR_2 99.3 5.9E-12 1.3E-16 90.0 7.0 58 52-109 2-59 (91)
5 TIGR02281 clan_AA_DTGA clan AA 99.1 1.6E-09 3.6E-14 82.1 9.7 60 46-105 7-67 (121)
6 PF13650 Asp_protease_2: Aspar 98.8 2.3E-08 5E-13 69.5 7.5 55 53-107 1-56 (90)
7 TIGR03698 clan_AA_DTGF clan AA 98.8 1.9E-08 4.2E-13 74.7 6.1 69 59-130 14-107 (107)
8 cd05483 retropepsin_like_bacte 98.7 7.5E-08 1.6E-12 67.5 7.5 60 49-108 1-60 (96)
9 PF09668 Asp_protease: Asparty 98.5 3.5E-07 7.5E-12 70.2 5.6 41 47-87 21-61 (124)
10 PF00077 RVP: Retroviral aspar 98.3 1.4E-06 3E-11 62.6 5.2 54 52-108 7-60 (100)
11 cd00303 retropepsin_like Retro 98.2 5.9E-06 1.3E-10 54.0 6.8 55 54-108 2-57 (92)
12 PF12384 Peptidase_A2B: Ty3 tr 98.1 1.1E-05 2.3E-10 64.8 7.4 54 45-98 29-82 (177)
13 COG3577 Predicted aspartyl pro 98.0 2.8E-05 6.1E-10 64.4 7.1 66 34-103 93-159 (215)
14 cd06095 RP_RTVL_H_like Retrope 97.9 3.1E-05 6.7E-10 54.9 5.3 46 54-102 2-47 (86)
15 cd05480 NRIP_C NRIP_C; putativ 97.7 4E-05 8.7E-10 56.9 4.2 33 54-86 2-34 (103)
16 cd05481 retropepsin_like_LTR_1 97.7 0.0001 2.2E-09 53.5 5.7 53 56-108 4-60 (93)
17 PF02160 Peptidase_A3: Caulifl 97.6 0.00022 4.8E-09 58.9 6.6 91 48-141 2-125 (201)
18 PF05585 DUF1758: Putative pep 97.5 0.00019 4.1E-09 56.4 4.7 39 59-97 10-48 (164)
19 KOG0012 DNA damage inducible p 97.2 0.00041 8.8E-09 61.7 4.2 90 46-137 231-348 (380)
20 cd06094 RP_Saci_like RP_Saci_l 96.3 0.0033 7.2E-08 45.7 2.7 44 61-108 9-52 (89)
21 COG5550 Predicted aspartyl pro 94.1 0.1 2.2E-06 40.2 4.7 79 50-131 12-118 (125)
22 cd05482 HIV_retropepsin_like R 89.3 0.4 8.6E-06 34.5 2.9 24 54-77 2-25 (87)
23 cd05476 pepsin_A_like_plant Ch 85.3 4 8.6E-05 33.9 7.1 18 63-80 178-195 (265)
24 cd06096 Plasmepsin_5 Plasmepsi 73.1 9.8 0.00021 32.6 5.8 41 61-104 231-271 (326)
25 PF12382 Peptidase_A2E: Retrot 69.7 2.9 6.3E-05 31.5 1.5 28 62-89 48-75 (137)
26 COG0282 ackA Acetate kinase [E 59.2 5.4 0.00012 36.3 1.5 83 69-165 179-270 (396)
27 cd05470 pepsin_retropepsin_lik 56.7 12 0.00025 26.4 2.7 26 55-80 3-30 (109)
28 PF00455 DeoRC: DeoR C termina 54.2 6.8 0.00015 30.6 1.2 49 63-111 22-81 (161)
29 cd05477 gastricsin Gastricsins 49.0 31 0.00068 29.2 4.6 21 63-83 203-223 (318)
30 cd05474 SAP_like SAPs, pepsin- 48.6 40 0.00087 27.9 5.1 35 53-102 5-41 (295)
31 PF00026 Asp: Eukaryotic aspar 48.6 22 0.00048 29.5 3.5 25 53-77 4-30 (317)
32 cd05472 cnd41_like Chloroplast 48.2 27 0.00058 29.3 4.0 22 63-84 173-194 (299)
33 cd05486 Cathespin_E Cathepsin 48.0 26 0.00057 29.7 4.0 28 56-83 187-220 (316)
34 cd05485 Cathepsin_D_like Cathe 46.2 35 0.00076 29.3 4.5 28 56-83 200-232 (329)
35 cd05478 pepsin_A Pepsin A, asp 42.7 43 0.00093 28.4 4.5 34 51-84 189-229 (317)
36 cd06097 Aspergillopepsin_like 42.6 17 0.00038 30.2 2.0 23 61-83 198-220 (278)
37 KOG0318 WD40 repeat stress pro 40.2 54 0.0012 31.3 4.9 82 17-99 78-167 (603)
38 cd05473 beta_secretase_like Be 38.2 54 0.0012 28.5 4.5 21 63-83 213-233 (364)
39 PRK10681 DNA-binding transcrip 38.0 14 0.0003 31.0 0.7 40 63-102 95-145 (252)
40 cd06097 Aspergillopepsin_like 36.6 38 0.00082 28.1 3.1 25 54-78 4-30 (278)
41 cd05476 pepsin_A_like_plant Ch 33.0 69 0.0015 26.4 4.1 24 54-77 5-30 (265)
42 cd06098 phytepsin Phytepsin, a 32.8 55 0.0012 27.9 3.6 29 56-84 198-233 (317)
43 cd05487 renin_like Renin stimu 31.5 52 0.0011 28.1 3.2 25 53-77 11-37 (326)
44 cd05471 pepsin_like Pepsin-lik 31.0 36 0.00078 27.5 2.1 25 60-84 201-225 (283)
45 cd05490 Cathepsin_D2 Cathepsin 28.7 38 0.00083 28.8 1.9 23 62-84 207-229 (325)
46 cd06098 phytepsin Phytepsin, a 27.5 68 0.0015 27.3 3.3 27 52-78 12-40 (317)
47 cd05475 nucellin_like Nucellin 26.4 70 0.0015 26.6 3.1 24 53-76 5-30 (273)
48 cd05488 Proteinase_A_fungi Fun 26.3 45 0.00097 28.4 1.9 23 62-84 206-228 (320)
49 cd05490 Cathepsin_D2 Cathepsin 25.9 75 0.0016 27.0 3.2 25 53-77 9-35 (325)
50 cd05477 gastricsin Gastricsins 24.7 82 0.0018 26.6 3.2 25 54-78 7-33 (318)
51 PRK10906 DNA-binding transcrip 24.4 34 0.00074 28.7 0.8 39 63-101 94-143 (252)
52 PF06004 DUF903: Bacterial pro 24.2 2.1E+02 0.0045 18.5 5.1 41 92-144 1-42 (50)
53 cd06096 Plasmepsin_5 Plasmepsi 23.8 94 0.002 26.5 3.5 26 53-78 6-33 (326)
54 cd05478 pepsin_A Pepsin A, asp 23.8 85 0.0018 26.6 3.2 26 53-78 13-40 (317)
55 cd05474 SAP_like SAPs, pepsin- 23.0 58 0.0012 26.9 1.9 24 61-84 178-201 (295)
56 PTZ00147 plasmepsin-1; Provisi 22.8 97 0.0021 28.5 3.5 26 53-78 142-169 (453)
57 cd05488 Proteinase_A_fungi Fun 22.6 93 0.002 26.4 3.2 28 51-78 11-40 (320)
58 PF07894 DUF1669: Protein of u 22.4 1E+02 0.0022 26.9 3.4 48 57-106 172-219 (284)
59 cd05473 beta_secretase_like Be 22.0 95 0.0021 27.0 3.2 26 53-78 6-33 (364)
60 PTZ00013 plasmepsin 4 (PM4); P 21.8 98 0.0021 28.5 3.3 26 54-79 142-169 (450)
61 PLN03146 aspartyl protease fam 21.8 1.2E+02 0.0027 27.3 4.0 20 63-82 309-328 (431)
62 cd05472 cnd41_like Chloroplast 21.6 83 0.0018 26.3 2.6 23 54-76 5-29 (299)
63 cd05487 renin_like Renin stimu 21.5 60 0.0013 27.7 1.8 21 63-83 209-229 (326)
64 COG2383 Uncharacterized conser 20.6 30 0.00065 25.9 -0.2 17 112-128 54-70 (109)
65 PTZ00147 plasmepsin-1; Provisi 20.1 1.5E+02 0.0032 27.3 4.1 32 52-83 320-354 (453)
No 1
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=99.89 E-value=2e-23 Score=161.36 Aligned_cols=92 Identities=27% Similarity=0.466 Sum_probs=81.8
Q ss_pred eccCCCCCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccc-eeEEEecCCCeeeeeeeEeC--------
Q 038652 41 AVMGLTLPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAE-AYGVILGTGKSKKTQGVCRN-------- 111 (174)
Q Consensus 41 A~~G~~~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~-p~~V~vanG~~l~~~~~c~~-------- 111 (174)
+-.+.+.+.+|...+.|+++++.+||||||||||||+++|.++++++.+++ |+.|. ++|+.+.|.+.|++
T Consensus 12 ~~~~~~~~~vi~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~V~-~~g~~~~~~~~~~~~~~~i~g~ 90 (135)
T PF08284_consen 12 AEEAEESPDVITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIVVS-APGGSINCEGVCPDVPLSIQGH 90 (135)
T ss_pred hhcccCCCCeEEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeEEe-cccccccccceeeeEEEEECCe
Confidence 444557899999999999999999999999999999999999999999874 66665 56777888888866
Q ss_pred -------------------eeeecccCceEEEeeeeEEEEEe
Q 038652 112 -------------------YKWLETLGVTHINWKTRVMKFTF 134 (174)
Q Consensus 112 -------------------mdWL~~~gp~~iDw~~~t~~f~~ 134 (174)
||||++|+| .|||.+|+|+|..
T Consensus 91 ~~~~dl~vl~l~~~DvILGm~WL~~~~~-~IDw~~k~v~f~~ 131 (135)
T PF08284_consen 91 EFVVDLLVLDLGGYDVILGMDWLKKHNP-VIDWATKTVTFNS 131 (135)
T ss_pred EEEeeeEEecccceeeEeccchHHhCCC-EEEccCCEEEEeC
Confidence 999999999 8999999999976
No 2
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.71 E-value=2.2e-17 Score=125.22 Aligned_cols=85 Identities=16% Similarity=0.112 Sum_probs=72.6
Q ss_pred CCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeeeeeeeEeC---------------
Q 038652 47 LPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGVCRN--------------- 111 (174)
Q Consensus 47 ~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~c~~--------------- 111 (174)
....|++.+.|+|+++.+|||||||||||++++|+|++++.....++.+.+++++...+.+.|..
T Consensus 13 ~~~~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~i~~~~~~~~~~ 92 (124)
T cd05479 13 KVPMLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVKIGNLFLPCSFT 92 (124)
T ss_pred eeeEEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEEECCEEeeeEEE
Confidence 45679999999999999999999999999999999999998766677777776555566665533
Q ss_pred ------------eeeecccCceEEEeeeeEEEE
Q 038652 112 ------------YKWLETLGVTHINWKTRVMKF 132 (174)
Q Consensus 112 ------------mdWL~~~gp~~iDw~~~t~~f 132 (174)
||||++++. .|||++++|+|
T Consensus 93 Vl~~~~~d~ILG~d~L~~~~~-~ID~~~~~i~~ 124 (124)
T cd05479 93 VLEDDDVDFLIGLDMLKRHQC-VIDLKENVLRI 124 (124)
T ss_pred EECCCCcCEEecHHHHHhCCe-EEECCCCEEEC
Confidence 999999995 99999999874
No 3
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=99.58 E-value=1e-14 Score=101.10 Aligned_cols=67 Identities=28% Similarity=0.411 Sum_probs=61.9
Q ss_pred CCCCCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccc-eeEEEecCCCeeeeeeeEe
Q 038652 44 GLTLPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAE-AYGVILGTGKSKKTQGVCR 110 (174)
Q Consensus 44 G~~~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~-p~~V~vanG~~l~~~~~c~ 110 (174)
....+++|++.+.|+|+++.+|||||||||||+.++|+||+++....+ +..|++|||....+.+...
T Consensus 2 ~~~~~g~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~~~~~v~~a~g~~~~~~g~~~ 69 (72)
T PF13975_consen 2 KTPDPGLMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPPSPIRVKLANGSVIEIRGVAE 69 (72)
T ss_pred CcccCCEEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCCCCEEEEECCCCccccceEEE
Confidence 356789999999999999999999999999999999999999999987 9999999999998887754
No 4
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.31 E-value=5.9e-12 Score=90.04 Aligned_cols=58 Identities=19% Similarity=0.209 Sum_probs=53.6
Q ss_pred EEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeeeeeeeE
Q 038652 52 NVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGVC 109 (174)
Q Consensus 52 rl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~c 109 (174)
.+++.|+|+++.+|||+||++|||+++.+.+++++........+..+||..+.+.+.|
T Consensus 2 ~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G~~~~~~G~~ 59 (91)
T cd05484 2 TVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATGTKLSVLGQI 59 (91)
T ss_pred EEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCCCEeeEeEEE
Confidence 5789999999999999999999999999999999985556799999999999999887
No 5
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.05 E-value=1.6e-09 Score=82.08 Aligned_cols=60 Identities=22% Similarity=0.295 Sum_probs=52.4
Q ss_pred CCCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCccccc-ceeEEEecCCCeeee
Q 038652 46 TLPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEA-EAYGVILGTGKSKKT 105 (174)
Q Consensus 46 ~~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~-~p~~V~vanG~~l~~ 105 (174)
...+++.+.+.|+|+++.+|||+|||++.|++++|+++++..... .+..+..|||.....
T Consensus 7 ~~~g~~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~~~~~~~ta~G~~~~~ 67 (121)
T TIGR02281 7 DGDGHFYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLGYTVTVSTANGQIKAA 67 (121)
T ss_pred cCCCeEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCCceEEEEeCCCcEEEE
Confidence 356789999999999999999999999999999999999987663 478888999976533
No 6
>PF13650 Asp_protease_2: Aspartyl protease
Probab=98.81 E-value=2.3e-08 Score=69.46 Aligned_cols=55 Identities=27% Similarity=0.238 Sum_probs=47.4
Q ss_pred EEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccc-eeEEEecCCCeeeeee
Q 038652 53 VLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAE-AYGVILGTGKSKKTQG 107 (174)
Q Consensus 53 l~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~-p~~V~vanG~~l~~~~ 107 (174)
+.+.|+|+++.+||||||++++|+++++++++++..+.. +..+..++|.......
T Consensus 1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~ 56 (90)
T PF13650_consen 1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKSVPISVSGAGGSVTVYRG 56 (90)
T ss_pred CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCceeEEEEeCCCCEEEEEE
Confidence 367899999999999999999999999999999988775 7888999998544333
No 7
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=98.75 E-value=1.9e-08 Score=74.73 Aligned_cols=69 Identities=25% Similarity=0.365 Sum_probs=54.0
Q ss_pred CEEEEEEeecCCCCcc-ccHHHHhhcCCcccccceeEEEecCCCeeeeeee------------------------EeCee
Q 038652 59 EHKVTIVIDSGASPNF-ISAEQVDELKLPISEAEAYGVILGTGKSKKTQGV------------------------CRNYK 113 (174)
Q Consensus 59 ~~~v~~LIDSGSThnF-Is~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~------------------------c~~md 113 (174)
..++.+|||||||+.. |++++|++++++... ...+..|||........ ..+|.
T Consensus 14 ~~~v~~LVDTGat~~~~l~~~~a~~lgl~~~~--~~~~~tA~G~~~~~~v~~~~v~igg~~~~~~v~~~~~~~~~LLG~~ 91 (107)
T TIGR03698 14 FMEVRALVDTGFSGFLLVPPDIVNKLGLPELD--QRRVYLADGREVLTDVAKASIIINGLEIDAFVESLGYVDEPLLGTE 91 (107)
T ss_pred ceEEEEEEECCCCeEEecCHHHHHHcCCCccc--CcEEEecCCcEEEEEEEEEEEEECCEEEEEEEEecCCCCccEecHH
Confidence 3489999999999999 999999999998754 56889999964322211 11299
Q ss_pred eecccCceEEEeeeeEE
Q 038652 114 WLETLGVTHINWKTRVM 130 (174)
Q Consensus 114 WL~~~gp~~iDw~~~t~ 130 (174)
||++++. .+||+++++
T Consensus 92 ~L~~l~l-~id~~~~~~ 107 (107)
T TIGR03698 92 LLEGLGI-VIDYRNQGL 107 (107)
T ss_pred HHhhCCE-EEehhhCcC
Confidence 9999985 899998754
No 8
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=98.70 E-value=7.5e-08 Score=67.49 Aligned_cols=60 Identities=25% Similarity=0.347 Sum_probs=51.4
Q ss_pred CeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeeeeeee
Q 038652 49 KTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGV 108 (174)
Q Consensus 49 ~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~ 108 (174)
+.+.+.+.|+++++.+|||+||++++|+.+++++++++.....+..+..++|........
T Consensus 1 ~~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~~~~~~~G~~~~~~~~ 60 (96)
T cd05483 1 GHFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKVTVQTANGRVRAARVR 60 (96)
T ss_pred CcEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcEEEEecCCCccceEEE
Confidence 357899999999999999999999999999999999855555778899999987665443
No 9
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=98.45 E-value=3.5e-07 Score=70.20 Aligned_cols=41 Identities=15% Similarity=0.138 Sum_probs=32.7
Q ss_pred CCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcc
Q 038652 47 LPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPI 87 (174)
Q Consensus 47 ~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~ 87 (174)
....+.+...|||+++.++|||||-+|-||.++|+|+||.-
T Consensus 21 ~v~mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~ 61 (124)
T PF09668_consen 21 QVSMLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMR 61 (124)
T ss_dssp -----EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGG
T ss_pred CcceEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChh
Confidence 45678999999999999999999999999999999999963
No 10
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=98.29 E-value=1.4e-06 Score=62.56 Aligned_cols=54 Identities=22% Similarity=0.282 Sum_probs=44.5
Q ss_pred EEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeeeeeee
Q 038652 52 NVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGV 108 (174)
Q Consensus 52 rl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~ 108 (174)
++...|+|+++.+|||+||+.++|+.+.+.+...+ ...+..+..++|.. ...+.
T Consensus 7 ~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~--~~~~~~v~~~~g~~-~~~~~ 60 (100)
T PF00077_consen 7 YITVKINGKKIKALLDTGADVSIISEKDWKKLGPP--PKTSITVRGAGGSS-SILGS 60 (100)
T ss_dssp EEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSE--EEEEEEEEETTEEE-EEEEE
T ss_pred eEEEeECCEEEEEEEecCCCcceeccccccccccc--ccCCceeccCCCcc-eeeeE
Confidence 56788999999999999999999999999888755 55667788888876 55443
No 11
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=98.22 E-value=5.9e-06 Score=53.98 Aligned_cols=55 Identities=38% Similarity=0.414 Sum_probs=45.3
Q ss_pred EEEECCEEEEEEeecCCCCccccHHHHhhcCC-cccccceeEEEecCCCeeeeeee
Q 038652 54 LGFVGEHKVTIVIDSGASPNFISAEQVDELKL-PISEAEAYGVILGTGKSKKTQGV 108 (174)
Q Consensus 54 ~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l-~~~~~~p~~V~vanG~~l~~~~~ 108 (174)
.+.++++++.+|+|+||++++++..++.++++ +.....+..+..++|......+.
T Consensus 2 ~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 57 (92)
T cd00303 2 KGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLKVKGANGSSVKTLGV 57 (92)
T ss_pred EEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceEEEecCCCEeccCcE
Confidence 45688999999999999999999999999998 55556788888888876555443
No 12
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=98.12 E-value=1.1e-05 Score=64.84 Aligned_cols=54 Identities=24% Similarity=0.298 Sum_probs=49.8
Q ss_pred CCCCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEec
Q 038652 45 LTLPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILG 98 (174)
Q Consensus 45 ~~~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~va 98 (174)
-+-.+|....-.++|.++.+|+||||--|||+.+++++|+|+.-..+|++++-+
T Consensus 29 Pevg~T~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~ 82 (177)
T PF12384_consen 29 PEVGKTAIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGF 82 (177)
T ss_pred cccCcEEEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeee
Confidence 445678899999999999999999999999999999999999999999999854
No 13
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=97.95 E-value=2.8e-05 Score=64.41 Aligned_cols=66 Identities=21% Similarity=0.334 Sum_probs=57.7
Q ss_pred ceeEeeeeccCCCCCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCccccc-ceeEEEecCCCee
Q 038652 34 IAKVLLNAVMGLTLPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEA-EAYGVILGTGKSK 103 (174)
Q Consensus 34 ~~~iSl~A~~G~~~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~-~p~~V~vanG~~l 103 (174)
-.++++++ ...+++...|.|||+++..|||+|||--=++.+-|+|+++..-.+ -++.|..|||+..
T Consensus 93 ~~~v~Lak----~~~GHF~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG~~~ 159 (215)
T COG3577 93 YQEVSLAK----SRDGHFEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANGRAR 159 (215)
T ss_pred ceEEEEEe----cCCCcEEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCCccc
Confidence 34566654 455899999999999999999999999999999999999999987 4899999999764
No 14
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=97.87 E-value=3.1e-05 Score=54.89 Aligned_cols=46 Identities=13% Similarity=0.102 Sum_probs=40.2
Q ss_pred EEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCe
Q 038652 54 LGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKS 102 (174)
Q Consensus 54 ~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~ 102 (174)
...|||+++.+|+|+||+++-|+++.++++ .....+..+..++|..
T Consensus 2 ~v~InG~~~~fLvDTGA~~tii~~~~a~~~---~~~~~~~~v~gagG~~ 47 (86)
T cd06095 2 TITVEGVPIVFLVDTGATHSVLKSDLGPKQ---ELSTTSVLIRGVSGQS 47 (86)
T ss_pred EEEECCEEEEEEEECCCCeEEECHHHhhhc---cCCCCcEEEEeCCCcc
Confidence 578999999999999999999999999997 2334788899999975
No 15
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=97.74 E-value=4e-05 Score=56.90 Aligned_cols=33 Identities=27% Similarity=0.366 Sum_probs=31.5
Q ss_pred EEEECCEEEEEEeecCCCCccccHHHHhhcCCc
Q 038652 54 LGFVGEHKVTIVIDSGASPNFISAEQVDELKLP 86 (174)
Q Consensus 54 ~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~ 86 (174)
.+++||+|+.++|||||-+|.||+..|+|+||.
T Consensus 2 nCk~nG~~vkAfVDsGaQ~timS~~caercgL~ 34 (103)
T cd05480 2 SCQCAGKELRALVDTGCQYNLISAACLDRLGLK 34 (103)
T ss_pred ceeECCEEEEEEEecCCchhhcCHHHHHHcChH
Confidence 468999999999999999999999999999997
No 16
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=97.70 E-value=0.0001 Score=53.47 Aligned_cols=53 Identities=17% Similarity=0.213 Sum_probs=46.1
Q ss_pred EECC-EEEEEEeecCCCCccccHHHHhhcC---CcccccceeEEEecCCCeeeeeee
Q 038652 56 FVGE-HKVTIVIDSGASPNFISAEQVDELK---LPISEAEAYGVILGTGKSKKTQGV 108 (174)
Q Consensus 56 ~I~~-~~v~~LIDSGSThnFIs~~~v~~l~---l~~~~~~p~~V~vanG~~l~~~~~ 108 (174)
.|++ +++.+++|+||+.|.|+.+..++++ .+.....++.+..+||..+...|.
T Consensus 4 ~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~~~G~ 60 (93)
T cd05481 4 KINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIPVEGG 60 (93)
T ss_pred EeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEeeeeEE
Confidence 5777 9999999999999999999999998 555555679999999999888775
No 17
>PF02160 Peptidase_A3: Cauliflower mosaic virus peptidase (A3); InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=97.56 E-value=0.00022 Score=58.91 Aligned_cols=91 Identities=13% Similarity=0.138 Sum_probs=66.4
Q ss_pred CCeEEEEEEE--CC---EEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeeeeeeeEeC-----------
Q 038652 48 PKTFNVLGFV--GE---HKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGVCRN----------- 111 (174)
Q Consensus 48 ~~tmrl~g~I--~~---~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~c~~----------- 111 (174)
|+.+.+++.+ .| ..+.++||+|||-+.++...+-..-.. ....|+.|+.|||.......+|++
T Consensus 2 pNsiyI~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~we-~~~~~i~v~~an~~~~~i~~~~~~~~i~I~~~~F~ 80 (201)
T PF02160_consen 2 PNSIYIKVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEYWE-KSKKPIKVKGANGSIIQINKKAKNGKIQIADKIFR 80 (201)
T ss_pred CccEEEEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHHHH-hCCCcEEEEEecCCceEEEEEecCceEEEccEEEe
Confidence 5555555554 44 457889999999999987765332211 113578999999998888887776
Q ss_pred ----------------eeeecccCceEEEeeeeEEEEEeCC-eEEEE
Q 038652 112 ----------------YKWLETLGVTHINWKTRVMKFTFGK-SQIVL 141 (174)
Q Consensus 112 ----------------mdWL~~~gp~~iDw~~~t~~f~~~g-~~V~L 141 (174)
++.++.+.| -+.|. ..+.|+..+ +.+..
T Consensus 81 IP~iYq~~~g~d~IlG~NF~r~y~P-fiq~~-~~I~f~~~~~~~~~~ 125 (201)
T PF02160_consen 81 IPTIYQQESGIDIILGNNFLRLYEP-FIQTE-DRIQFHKKGLQKVEI 125 (201)
T ss_pred ccEEEEecCCCCEEecchHHHhcCC-cEEEc-cEEEEEeCCcceeeh
Confidence 899999999 59997 579999877 55443
No 18
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=97.46 E-value=0.00019 Score=56.38 Aligned_cols=39 Identities=21% Similarity=0.352 Sum_probs=32.6
Q ss_pred CEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEe
Q 038652 59 EHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVIL 97 (174)
Q Consensus 59 ~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~v 97 (174)
-.++.+|+||||.-|||++++|++|+|+......+.+.+
T Consensus 10 ~~~~~~LlDsGSq~SfIt~~la~~L~L~~~~~~~~~~~~ 48 (164)
T PF05585_consen 10 QVEARALLDSGSQRSFITESLANKLNLPGTGEKILVIGT 48 (164)
T ss_pred EEEEEEEEecCCchhHHhHHHHHHhCCCCCCceEEEEec
Confidence 356889999999999999999999999988776544443
No 19
>KOG0012 consensus DNA damage inducible protein [Replication, recombination and repair]
Probab=97.19 E-value=0.00041 Score=61.66 Aligned_cols=90 Identities=16% Similarity=0.090 Sum_probs=65.2
Q ss_pred CCCCeEEEEEEECCEEEEEEeecCCCCccccHHHHhhcCCcccccc-------------------eeEEEecCCCeeeee
Q 038652 46 TLPKTFNVLGFVGEHKVTIVIDSGASPNFISAEQVDELKLPISEAE-------------------AYGVILGTGKSKKTQ 106 (174)
Q Consensus 46 ~~~~tmrl~g~I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~-------------------p~~V~vanG~~l~~~ 106 (174)
+.-..+.+.+.|||+||.++|||||--|-||..+|+|+|+.---.+ ...|++. ...+.|.
T Consensus 231 ~~v~ML~iN~~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~-~~~l~c~ 309 (380)
T KOG0012|consen 231 TQVTMLYINCEINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIE-DLYLPCS 309 (380)
T ss_pred ccceEEEEEEEECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEec-cEeeccc
Confidence 4556688999999999999999999999999999999998733211 1222222 1233444
Q ss_pred eeEeC---------eeeecccCceEEEeeeeEEEEEeCCe
Q 038652 107 GVCRN---------YKWLETLGVTHINWKTRVMKFTFGKS 137 (174)
Q Consensus 107 ~~c~~---------mdWL~~~gp~~iDw~~~t~~f~~~g~ 137 (174)
..+-+ .|=|+.|.. .||.++..+.|...+.
T Consensus 310 ftV~d~~~~d~llGLd~Lrr~~c-cIdL~~~~L~ig~~~t 348 (380)
T KOG0012|consen 310 FTVLDRRDMDLLLGLDMLRRHQC-CIDLKTNVLRIGNTET 348 (380)
T ss_pred eEEecCCCcchhhhHHHHHhccc-eeecccCeEEecCCCc
Confidence 33322 677889987 8999999998876555
No 20
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=96.30 E-value=0.0033 Score=45.71 Aligned_cols=44 Identities=16% Similarity=0.289 Sum_probs=35.9
Q ss_pred EEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeeeeeee
Q 038652 61 KVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGV 108 (174)
Q Consensus 61 ~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~ 108 (174)
.+..|||+||.+|-|.+...++. ....++.+..|||.++..-|.
T Consensus 9 ~~~fLVDTGA~vSviP~~~~~~~----~~~~~~~l~AANgt~I~tyG~ 52 (89)
T cd06094 9 GLRFLVDTGAAVSVLPASSTKKS----LKPSPLTLQAANGTPIATYGT 52 (89)
T ss_pred CcEEEEeCCCceEeecccccccc----ccCCceEEEeCCCCeEeeeee
Confidence 36789999999999998877763 345678999999999887664
No 21
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=94.13 E-value=0.1 Score=40.20 Aligned_cols=79 Identities=24% Similarity=0.294 Sum_probs=50.9
Q ss_pred eEEEEEEE---CCEEEEEEeecCCC-CccccHHHHhhcCCcccccceeEEEecCCCeeeeeeeE-----e----------
Q 038652 50 TFNVLGFV---GEHKVTIVIDSGAS-PNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGVC-----R---------- 110 (174)
Q Consensus 50 tmrl~g~I---~~~~v~~LIDSGST-hnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~c-----~---------- 110 (174)
|+.+.... +......|||+|.| ---++++++++++.+.....+ +..++|+++.+.-.. .
T Consensus 12 ~v~~~f~~~~~Gd~~~~~LiDTGFtg~lvlp~~vaek~~~~~~~~~~--~~~a~~~~v~t~V~~~~iki~g~e~~~~Vl~ 89 (125)
T COG5550 12 TVPVTFRLPGQGDFVYDELIDTGFTGYLVLPPQVAEKLGLPLFSTIR--IVLADGGVVKTSVALATIKIDGVEKVAFVLA 89 (125)
T ss_pred eEEEEEEecCCCcEEeeeEEecCCceeEEeCHHHHHhcCCCccCChh--hhhhcCCEEEEEEEEEEEEECCEEEEEEEEc
Confidence 44444444 33334459999999 555799999999999876543 345666666543211 0
Q ss_pred ---------CeeeecccCceEEEeeeeEEE
Q 038652 111 ---------NYKWLETLGVTHINWKTRVMK 131 (174)
Q Consensus 111 ---------~mdWL~~~gp~~iDw~~~t~~ 131 (174)
+++||+.++- .+|.+...++
T Consensus 90 s~~~~~~liG~~~lk~l~~-~vn~~~g~LE 118 (125)
T COG5550 90 SDNLPEPLIGVNLLKLLGL-VVNPKTGKLE 118 (125)
T ss_pred cCCCcccchhhhhhhhccE-EEcCCcceEe
Confidence 1888888875 7777655443
No 22
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=89.29 E-value=0.4 Score=34.53 Aligned_cols=24 Identities=17% Similarity=0.302 Sum_probs=21.8
Q ss_pred EEEECCEEEEEEeecCCCCccccH
Q 038652 54 LGFVGEHKVTIVIDSGASPNFISA 77 (174)
Q Consensus 54 ~g~I~~~~v~~LIDSGSThnFIs~ 77 (174)
...|+|+++.+|+|+||-++.|+.
T Consensus 2 ~~~i~g~~~~~llDTGAd~Tvi~~ 25 (87)
T cd05482 2 TLYINGKLFEGLLDTGADVSIIAE 25 (87)
T ss_pred EEEECCEEEEEEEccCCCCeEEcc
Confidence 457899999999999999999986
No 23
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=85.30 E-value=4 Score=33.87 Aligned_cols=18 Identities=22% Similarity=0.471 Sum_probs=16.0
Q ss_pred EEEeecCCCCccccHHHH
Q 038652 63 TIVIDSGASPNFISAEQV 80 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v 80 (174)
.++||||+|-.++.+++.
T Consensus 178 ~ai~DTGTs~~~lp~~~~ 195 (265)
T cd05476 178 GTIIDSGTTLTYLPDPAY 195 (265)
T ss_pred cEEEeCCCcceEcCcccc
Confidence 389999999999988876
No 24
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=73.11 E-value=9.8 Score=32.61 Aligned_cols=41 Identities=10% Similarity=0.230 Sum_probs=30.9
Q ss_pred EEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeee
Q 038652 61 KVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKK 104 (174)
Q Consensus 61 ~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~ 104 (174)
...++||||.|-.|+.+++.+++.-.+ |++.+...+|..+.
T Consensus 231 ~~~aivDSGTs~~~lp~~~~~~l~~~~---P~i~~~f~~g~~~~ 271 (326)
T cd06096 231 GLGMLVDSGSTLSHFPEDLYNKINNFF---PTITIIFENNLKID 271 (326)
T ss_pred CCCEEEeCCCCcccCCHHHHHHHHhhc---CcEEEEEcCCcEEE
Confidence 345899999999999999998865333 66777777665553
No 25
>PF12382 Peptidase_A2E: Retrotransposon peptidase; InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=69.72 E-value=2.9 Score=31.49 Aligned_cols=28 Identities=39% Similarity=0.597 Sum_probs=25.4
Q ss_pred EEEEeecCCCCccccHHHHhhcCCcccc
Q 038652 62 VTIVIDSGASPNFISAEQVDELKLPISE 89 (174)
Q Consensus 62 v~~LIDSGSThnFIs~~~v~~l~l~~~~ 89 (174)
+--|||.||-.|-|..+.|+.+++|..+
T Consensus 48 ipclidtgaq~niiteetvrahklptrp 75 (137)
T PF12382_consen 48 IPCLIDTGAQVNIITEETVRAHKLPTRP 75 (137)
T ss_pred ceeEEccCceeeeeehhhhhhccCCCCc
Confidence 4569999999999999999999998775
No 26
>COG0282 ackA Acetate kinase [Energy production and conversion]
Probab=59.17 E-value=5.4 Score=36.28 Aligned_cols=83 Identities=20% Similarity=0.231 Sum_probs=54.5
Q ss_pred CCCCccccHHHHhhcCCcccccceeEEEecCCCeeeeeeeEeCeeeecccCceEEEeeeeEEEEEe-CCeEE-EEecCCC
Q 038652 69 GASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQGVCRNYKWLETLGVTHINWKTRVMKFTF-GKSQI-VLQGDPS 146 (174)
Q Consensus 69 GSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~~~c~~mdWL~~~gp~~iDw~~~t~~f~~-~g~~V-~L~G~~~ 146 (174)
|-||.|++.++++.|+-|+....-+...++||..+..-. .|- ++|- +|-|+. .|-.. |=.|+-+
T Consensus 179 GtSh~YVs~~aa~~L~k~~~~l~~I~~HLGNGASicAik----------nGk-SvDT---SMGfTPLeGl~MGTRsGdiD 244 (396)
T COG0282 179 GTSHKYVSQRAAEILGKPLEDLNLITCHLGNGASICAIK----------NGK-SVDT---SMGFTPLEGLMMGTRSGDID 244 (396)
T ss_pred ccchHHHHHHHHHHhCCCccccCEEEEEecCchhhhhhh----------CCe-eecc---CCCCCcccceeccCCCCCCC
Confidence 679999999999999999999999999999998875422 222 3442 345543 33211 1133322
Q ss_pred c------cc-ccccHHHHHHHhcccc
Q 038652 147 I------QK-MLISLNAIVRAIRHEG 165 (174)
Q Consensus 147 ~------~~-~~iS~~ql~~l~~~~~ 165 (174)
+ .+ ...|++++..++.+.+
T Consensus 245 P~ii~~l~~~~~~s~~~i~~~LNkkS 270 (396)
T COG0282 245 PGIILYLMEQEGMSAEEIDTLLNKKS 270 (396)
T ss_pred hHHHHHHHHhcCCCHHHHHHHHhhhc
Confidence 2 11 2468888888876654
No 27
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=56.65 E-value=12 Score=26.41 Aligned_cols=26 Identities=15% Similarity=0.372 Sum_probs=20.7
Q ss_pred EEECC--EEEEEEeecCCCCccccHHHH
Q 038652 55 GFVGE--HKVTIVIDSGASPNFISAEQV 80 (174)
Q Consensus 55 g~I~~--~~v~~LIDSGSThnFIs~~~v 80 (174)
..|+. +++.+++|+||+.-++..+-.
T Consensus 3 i~vGtP~q~~~~~~DTGSs~~Wv~~~~c 30 (109)
T cd05470 3 IGIGTPPQTFNVLLDTGSSNLWVPSVDC 30 (109)
T ss_pred EEeCCCCceEEEEEeCCCCCEEEeCCCC
Confidence 45665 889999999999999876543
No 28
>PF00455 DeoRC: DeoR C terminal sensor domain; InterPro: IPR014036 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after Escherichia coli deoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerization domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].
Probab=54.19 E-value=6.8 Score=30.64 Aligned_cols=49 Identities=22% Similarity=0.200 Sum_probs=36.0
Q ss_pred EEEeecCCCCccccHHHHhhcCCcccc-----------cceeEEEecCCCeeeeeeeEeC
Q 038652 63 TIVIDSGASPNFISAEQVDELKLPISE-----------AEAYGVILGTGKSKKTQGVCRN 111 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~l~l~~~~-----------~~p~~V~vanG~~l~~~~~c~~ 111 (174)
.+.+|+|+|.-.+.+.+..+.++.+.. .+...|.+..|..-.-...+.+
T Consensus 22 ~Ifld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~vi~~GG~~~~~~~~~~G 81 (161)
T PF00455_consen 22 TIFLDSGTTTLELAKYLPDKKNLTVVTNSLPIANELSENPNIEVILLGGEVNPKSLSFVG 81 (161)
T ss_pred EEEEECchHHHHHHHHhhcCCceEEEECCHHHHHHHHhcCceEEEEeCCEEEcCCCcEEC
Confidence 478999999999999999997776654 2567888886755444444444
No 29
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=49.01 E-value=31 Score=29.22 Aligned_cols=21 Identities=19% Similarity=0.305 Sum_probs=18.8
Q ss_pred EEEeecCCCCccccHHHHhhc
Q 038652 63 TIVIDSGASPNFISAEQVDEL 83 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~l 83 (174)
.++||||.|..++..++++++
T Consensus 203 ~~iiDSGtt~~~lP~~~~~~l 223 (318)
T cd05477 203 QAIVDTGTSLLTAPQQVMSTL 223 (318)
T ss_pred eeeECCCCccEECCHHHHHHH
Confidence 589999999999999988875
No 30
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=48.63 E-value=40 Score=27.88 Aligned_cols=35 Identities=31% Similarity=0.567 Sum_probs=26.7
Q ss_pred EEEEECC--EEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCe
Q 038652 53 VLGFVGE--HKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKS 102 (174)
Q Consensus 53 l~g~I~~--~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~ 102 (174)
....|+. +++.++||+||+.-++. ++.+.-++|..
T Consensus 5 ~~i~iGtp~q~~~v~~DTgS~~~wv~---------------~~~~~Y~~g~~ 41 (295)
T cd05474 5 AELSVGTPPQKVTVLLDTGSSDLWVP---------------DFSISYGDGTS 41 (295)
T ss_pred EEEEECCCCcEEEEEEeCCCCcceee---------------eeEEEeccCCc
Confidence 3456666 88999999999999998 56666666543
No 31
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=48.59 E-value=22 Score=29.49 Aligned_cols=25 Identities=20% Similarity=0.507 Sum_probs=21.1
Q ss_pred EEEEEC--CEEEEEEeecCCCCccccH
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFISA 77 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs~ 77 (174)
....|+ .+++.++||+||+..|+..
T Consensus 4 ~~v~iGtp~q~~~~~iDTGS~~~wv~~ 30 (317)
T PF00026_consen 4 INVTIGTPPQTFRVLIDTGSSDTWVPS 30 (317)
T ss_dssp EEEEETTTTEEEEEEEETTBSSEEEEB
T ss_pred EEEEECCCCeEEEEEEecccceeeece
Confidence 455676 8999999999999999874
No 32
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=48.24 E-value=27 Score=29.29 Aligned_cols=22 Identities=23% Similarity=0.391 Sum_probs=18.9
Q ss_pred EEEeecCCCCccccHHHHhhcC
Q 038652 63 TIVIDSGASPNFISAEQVDELK 84 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~l~ 84 (174)
-++||||+|-.++.+++.+++.
T Consensus 173 ~~ivDSGTt~~~lp~~~~~~l~ 194 (299)
T cd05472 173 GVIIDSGTVITRLPPSAYAALR 194 (299)
T ss_pred CeEEeCCCcceecCHHHHHHHH
Confidence 4799999999999998887654
No 33
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=48.04 E-value=26 Score=29.71 Aligned_cols=28 Identities=21% Similarity=0.332 Sum_probs=22.3
Q ss_pred EECCEEE------EEEeecCCCCccccHHHHhhc
Q 038652 56 FVGEHKV------TIVIDSGASPNFISAEQVDEL 83 (174)
Q Consensus 56 ~I~~~~v------~~LIDSGSThnFIs~~~v~~l 83 (174)
.|+++.+ .++||||+|-.++..+.++++
T Consensus 187 ~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l 220 (316)
T cd05486 187 QVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQL 220 (316)
T ss_pred EEecceEecCCCCEEEECCCcchhhcCHHHHHHH
Confidence 4565543 589999999999999888766
No 34
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=46.18 E-value=35 Score=29.27 Aligned_cols=28 Identities=25% Similarity=0.380 Sum_probs=22.0
Q ss_pred EECCEEE-----EEEeecCCCCccccHHHHhhc
Q 038652 56 FVGEHKV-----TIVIDSGASPNFISAEQVDEL 83 (174)
Q Consensus 56 ~I~~~~v-----~~LIDSGSThnFIs~~~v~~l 83 (174)
.++++.+ .++||||+|-.++..++++.+
T Consensus 200 ~v~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l 232 (329)
T cd05485 200 SVGEGEFCSGGCQAIADTGTSLIAGPVDEIEKL 232 (329)
T ss_pred EECCeeecCCCcEEEEccCCcceeCCHHHHHHH
Confidence 3555543 699999999999999887765
No 35
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=42.74 E-value=43 Score=28.41 Aligned_cols=34 Identities=15% Similarity=0.262 Sum_probs=25.9
Q ss_pred EEEEE-EECCEEE------EEEeecCCCCccccHHHHhhcC
Q 038652 51 FNVLG-FVGEHKV------TIVIDSGASPNFISAEQVDELK 84 (174)
Q Consensus 51 mrl~g-~I~~~~v------~~LIDSGSThnFIs~~~v~~l~ 84 (174)
+.+.+ .|+++.+ .++||||++-.++......++.
T Consensus 189 v~l~~v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~ 229 (317)
T cd05478 189 ITVDSVTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQ 229 (317)
T ss_pred EEeeEEEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHH
Confidence 44443 5677665 5899999999999999887753
No 36
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=42.61 E-value=17 Score=30.18 Aligned_cols=23 Identities=13% Similarity=0.304 Sum_probs=19.3
Q ss_pred EEEEEeecCCCCccccHHHHhhc
Q 038652 61 KVTIVIDSGASPNFISAEQVDEL 83 (174)
Q Consensus 61 ~v~~LIDSGSThnFIs~~~v~~l 83 (174)
...++||||+|-.++..++++++
T Consensus 198 ~~~~iiDSGTs~~~lP~~~~~~l 220 (278)
T cd06097 198 GFSAIADTGTTLILLPDAIVEAY 220 (278)
T ss_pred CceEEeecCCchhcCCHHHHHHH
Confidence 35689999999999998877665
No 37
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=40.24 E-value=54 Score=31.27 Aligned_cols=82 Identities=15% Similarity=0.142 Sum_probs=54.9
Q ss_pred ccCCCccccCcccccccceeEeeeeccCCC-----CCCeEEEEEEECCEE---EEEEeecCCCCccccHHHHhhcCCccc
Q 038652 17 RKSGNFRRLTKDEIRAKIAKVLLNAVMGLT-----LPKTFNVLGFVGEHK---VTIVIDSGASPNFISAEQVDELKLPIS 88 (174)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~iSl~A~~G~~-----~~~tmrl~g~I~~~~---v~~LIDSGSThnFIs~~~v~~l~l~~~ 88 (174)
|.+|+|+.|+- -.++.++.=-..+++|-- ....+|+-..-.|++ ...|.|||++.-=|+-.--.-+.+..+
T Consensus 78 D~sG~vRIWdt-t~~~hiLKnef~v~aG~I~Di~Wd~ds~RI~avGEGrerfg~~F~~DSG~SvGei~GhSr~ins~~~K 156 (603)
T KOG0318|consen 78 DVSGKVRIWDT-TQKEHILKNEFQVLAGPIKDISWDFDSKRIAAVGEGRERFGHVFLWDSGNSVGEITGHSRRINSVDFK 156 (603)
T ss_pred CCcCcEEEEec-cCcceeeeeeeeecccccccceeCCCCcEEEEEecCccceeEEEEecCCCccceeeccceeEeeeecc
Confidence 77888888822 222333333345555532 345566666655555 688999999988777776667778888
Q ss_pred ccceeEEEecC
Q 038652 89 EAEAYGVILGT 99 (174)
Q Consensus 89 ~~~p~~V~van 99 (174)
+++|+++..+.
T Consensus 157 psRPfRi~T~s 167 (603)
T KOG0318|consen 157 PSRPFRIATGS 167 (603)
T ss_pred CCCceEEEecc
Confidence 99999987753
No 38
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=38.23 E-value=54 Score=28.47 Aligned_cols=21 Identities=14% Similarity=0.289 Sum_probs=18.0
Q ss_pred EEEeecCCCCccccHHHHhhc
Q 038652 63 TIVIDSGASPNFISAEQVDEL 83 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~l 83 (174)
.++||||+|-.++..++.+.+
T Consensus 213 ~~ivDSGTs~~~lp~~~~~~l 233 (364)
T cd05473 213 KAIVDSGTTNLRLPVKVFNAA 233 (364)
T ss_pred cEEEeCCCcceeCCHHHHHHH
Confidence 489999999999999887753
No 39
>PRK10681 DNA-binding transcriptional repressor DeoR; Provisional
Probab=38.00 E-value=14 Score=30.97 Aligned_cols=40 Identities=23% Similarity=0.280 Sum_probs=27.3
Q ss_pred EEEeecCCCCccccHHHHhhcCCcccc-----------cceeEEEecCCCe
Q 038652 63 TIVIDSGASPNFISAEQVDELKLPISE-----------AEAYGVILGTGKS 102 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~l~l~~~~-----------~~p~~V~vanG~~ 102 (174)
.+.+|+|+|.-++.+.+..+.++.+.. .+.+.|.+..|..
T Consensus 95 tIflD~GtT~~~la~~L~~~~~ltvvTnsl~i~~~l~~~~~~~villGG~~ 145 (252)
T PRK10681 95 TLFFDCGTTTPWIIEAIDNELPFTAVCYSLNTFLALQEKPHCRAILCGGEF 145 (252)
T ss_pred EEEEECCccHHHHHHhcCCCCCeEEEECCHHHHHHHhhCCCCEEEEECcEE
Confidence 478999999999988887654444432 2456777765544
No 40
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=36.64 E-value=38 Score=28.10 Aligned_cols=25 Identities=16% Similarity=0.329 Sum_probs=20.4
Q ss_pred EEEECC--EEEEEEeecCCCCccccHH
Q 038652 54 LGFVGE--HKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 54 ~g~I~~--~~v~~LIDSGSThnFIs~~ 78 (174)
...|+. +++.++||+||+.-|+...
T Consensus 4 ~i~vGtP~Q~~~v~~DTGS~~~wv~~~ 30 (278)
T cd06097 4 PVKIGTPPQTLNLDLDTGSSDLWVFSS 30 (278)
T ss_pred eEEECCCCcEEEEEEeCCCCceeEeeC
Confidence 445666 8899999999999998654
No 41
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=33.02 E-value=69 Score=26.40 Aligned_cols=24 Identities=13% Similarity=0.393 Sum_probs=18.9
Q ss_pred EEEEC--CEEEEEEeecCCCCccccH
Q 038652 54 LGFVG--EHKVTIVIDSGASPNFISA 77 (174)
Q Consensus 54 ~g~I~--~~~v~~LIDSGSThnFIs~ 77 (174)
...|+ .+.+.++||+||++-++..
T Consensus 5 ~i~iGtP~q~~~v~~DTGSs~~wv~~ 30 (265)
T cd05476 5 TLSIGTPPQPFSLIVDTGSDLTWTQC 30 (265)
T ss_pred EEecCCCCcceEEEecCCCCCEEEcC
Confidence 34555 5789999999999999853
No 42
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=32.84 E-value=55 Score=27.86 Aligned_cols=29 Identities=21% Similarity=0.296 Sum_probs=23.3
Q ss_pred EECCEE-------EEEEeecCCCCccccHHHHhhcC
Q 038652 56 FVGEHK-------VTIVIDSGASPNFISAEQVDELK 84 (174)
Q Consensus 56 ~I~~~~-------v~~LIDSGSThnFIs~~~v~~l~ 84 (174)
.|++.. ..++||||+|-.++.+++++++.
T Consensus 198 ~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~ 233 (317)
T cd06098 198 LIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQIN 233 (317)
T ss_pred EECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhh
Confidence 556654 35899999999999999988764
No 43
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=31.51 E-value=52 Score=28.09 Aligned_cols=25 Identities=20% Similarity=0.305 Sum_probs=20.4
Q ss_pred EEEEEC--CEEEEEEeecCCCCccccH
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFISA 77 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs~ 77 (174)
....|+ .++..++||+||+..++..
T Consensus 11 ~~i~iGtP~q~~~v~~DTGSs~~Wv~~ 37 (326)
T cd05487 11 GEIGIGTPPQTFKVVFDTGSSNLWVPS 37 (326)
T ss_pred EEEEECCCCcEEEEEEeCCccceEEcc
Confidence 345565 7889999999999999964
No 44
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=31.02 E-value=36 Score=27.54 Aligned_cols=25 Identities=20% Similarity=0.296 Sum_probs=21.5
Q ss_pred EEEEEEeecCCCCccccHHHHhhcC
Q 038652 60 HKVTIVIDSGASPNFISAEQVDELK 84 (174)
Q Consensus 60 ~~v~~LIDSGSThnFIs~~~v~~l~ 84 (174)
....++||||++-.++..++.+.+-
T Consensus 201 ~~~~~iiDsGt~~~~lp~~~~~~l~ 225 (283)
T cd05471 201 GGGGAIVDSGTSLIYLPSSVYDAIL 225 (283)
T ss_pred CCcEEEEecCCCCEeCCHHHHHHHH
Confidence 4578999999999999998888764
No 45
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=28.66 E-value=38 Score=28.76 Aligned_cols=23 Identities=26% Similarity=0.342 Sum_probs=19.8
Q ss_pred EEEEeecCCCCccccHHHHhhcC
Q 038652 62 VTIVIDSGASPNFISAEQVDELK 84 (174)
Q Consensus 62 v~~LIDSGSThnFIs~~~v~~l~ 84 (174)
..++||||+|-.++..+.++.+.
T Consensus 207 ~~aiiDSGTt~~~~p~~~~~~l~ 229 (325)
T cd05490 207 CEAIVDTGTSLITGPVEEVRALQ 229 (325)
T ss_pred CEEEECCCCccccCCHHHHHHHH
Confidence 46899999999999998887663
No 46
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=27.48 E-value=68 Score=27.26 Aligned_cols=27 Identities=22% Similarity=0.311 Sum_probs=21.2
Q ss_pred EEEEEEC--CEEEEEEeecCCCCccccHH
Q 038652 52 NVLGFVG--EHKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 52 rl~g~I~--~~~v~~LIDSGSThnFIs~~ 78 (174)
.....|+ .+++.++||+||+.-++...
T Consensus 12 ~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~ 40 (317)
T cd06098 12 FGEIGIGTPPQKFTVIFDTGSSNLWVPSS 40 (317)
T ss_pred EEEEEECCCCeEEEEEECCCccceEEecC
Confidence 3455666 68899999999999998653
No 47
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=26.37 E-value=70 Score=26.57 Aligned_cols=24 Identities=21% Similarity=0.328 Sum_probs=19.2
Q ss_pred EEEEEC--CEEEEEEeecCCCCcccc
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFIS 76 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs 76 (174)
+...|+ .+++.+++|+||++-++.
T Consensus 5 ~~i~iGtP~q~~~v~~DTGS~~~Wv~ 30 (273)
T cd05475 5 VTINIGNPPKPYFLDIDTGSDLTWLQ 30 (273)
T ss_pred EEEEcCCCCeeEEEEEccCCCceEEe
Confidence 344555 678999999999999994
No 48
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=26.29 E-value=45 Score=28.39 Aligned_cols=23 Identities=22% Similarity=0.312 Sum_probs=19.8
Q ss_pred EEEEeecCCCCccccHHHHhhcC
Q 038652 62 VTIVIDSGASPNFISAEQVDELK 84 (174)
Q Consensus 62 v~~LIDSGSThnFIs~~~v~~l~ 84 (174)
..++||||+|-.++..++++.+.
T Consensus 206 ~~~ivDSGtt~~~lp~~~~~~l~ 228 (320)
T cd05488 206 TGAAIDTGTSLIALPSDLAEMLN 228 (320)
T ss_pred CeEEEcCCcccccCCHHHHHHHH
Confidence 35899999999999999988753
No 49
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=25.95 E-value=75 Score=26.97 Aligned_cols=25 Identities=24% Similarity=0.337 Sum_probs=19.8
Q ss_pred EEEEEC--CEEEEEEeecCCCCccccH
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFISA 77 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs~ 77 (174)
....|+ .+++.+++|+||+.-++..
T Consensus 9 ~~i~iGtP~q~~~v~~DTGSs~~Wv~~ 35 (325)
T cd05490 9 GEIGIGTPPQTFTVVFDTGSSNLWVPS 35 (325)
T ss_pred EEEEECCCCcEEEEEEeCCCccEEEEc
Confidence 345565 4789999999999999954
No 50
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=24.75 E-value=82 Score=26.63 Aligned_cols=25 Identities=16% Similarity=0.325 Sum_probs=20.1
Q ss_pred EEEEC--CEEEEEEeecCCCCccccHH
Q 038652 54 LGFVG--EHKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 54 ~g~I~--~~~v~~LIDSGSThnFIs~~ 78 (174)
...|+ .+++.++||+||+.-++...
T Consensus 7 ~i~iGtP~q~~~v~~DTGS~~~wv~~~ 33 (318)
T cd05477 7 EISIGTPPQNFLVLFDTGSSNLWVPSV 33 (318)
T ss_pred EEEECCCCcEEEEEEeCCCccEEEccC
Confidence 34565 48899999999999999754
No 51
>PRK10906 DNA-binding transcriptional repressor GlpR; Provisional
Probab=24.36 E-value=34 Score=28.73 Aligned_cols=39 Identities=28% Similarity=0.543 Sum_probs=26.9
Q ss_pred EEEeecCCCCccccHHHHhhcCCcccc-----------cceeEEEecCCC
Q 038652 63 TIVIDSGASPNFISAEQVDELKLPISE-----------AEAYGVILGTGK 101 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~l~l~~~~-----------~~p~~V~vanG~ 101 (174)
.+.+|+|+|...+.+.+..+.++.+.. .+.+.|.+..|.
T Consensus 94 tIflD~GtT~~~la~~L~~~~~ltVvTNsl~ia~~l~~~~~~~villGG~ 143 (252)
T PRK10906 94 TLFIDIGTTPEAVAHALLNHSNLRIVTNNLNVANTLMAKEDFRIILAGGE 143 (252)
T ss_pred EEEEcCcHHHHHHHHHhcCCCCcEEEECcHHHHHHHhhCCCCEEEEECCE
Confidence 478999999999988887654555443 244667666453
No 52
>PF06004 DUF903: Bacterial protein of unknown function (DUF903); InterPro: IPR010305 This family consists of several small bacterial proteins several of which are classified as putative lipoproteins. The function of this family is unknown.; PDB: 2RB6_B 3BDU_F 2RA2_F 2RD1_C 3FIF_H 2JN0_A 2K57_A.
Probab=24.20 E-value=2.1e+02 Score=18.48 Aligned_cols=41 Identities=17% Similarity=0.238 Sum_probs=32.4
Q ss_pred eeEEEecCCCeeeeeeeEeCeeeecccCceEEEeeeeEEEEEe-CCeEEEEecC
Q 038652 92 AYGVILGTGKSKKTQGVCRNYKWLETLGVTHINWKTRVMKFTF-GKSQIVLQGD 144 (174)
Q Consensus 92 p~~V~vanG~~l~~~~~c~~mdWL~~~gp~~iDw~~~t~~f~~-~g~~V~L~G~ 144 (174)
++.+..-||+.+.+.+. | .+|=.+..++|.. .|+.+.+.-.
T Consensus 1 ~yvi~t~dG~~i~t~gk-----------P-~~D~~tG~~~y~D~~G~~~qIn~d 42 (50)
T PF06004_consen 1 NYVITTNDGRTIVTDGK-----------P-KYDKDTGMYSYTDADGKEQQINKD 42 (50)
T ss_dssp EEEEEETTSEEEEESSE-----------E-EEETTTTEEEEEBTTS-EEEEEGG
T ss_pred CcEEEeCCCcEEEeCCC-----------c-cccCCCCcEEEECCCCCEEEEchH
Confidence 45677889998888775 7 8999999999987 8888887754
No 53
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=23.81 E-value=94 Score=26.52 Aligned_cols=26 Identities=23% Similarity=0.471 Sum_probs=20.3
Q ss_pred EEEEEC--CEEEEEEeecCCCCccccHH
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs~~ 78 (174)
+...|+ .+++.++||+||+.-++...
T Consensus 6 ~~i~vGtP~Q~~~v~~DTGS~~~wv~~~ 33 (326)
T cd06096 6 IDIFIGNPPQKQSLILDTGSSSLSFPCS 33 (326)
T ss_pred EEEEecCCCeEEEEEEeCCCCceEEecC
Confidence 344555 58899999999999998654
No 54
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=23.75 E-value=85 Score=26.57 Aligned_cols=26 Identities=19% Similarity=0.351 Sum_probs=20.6
Q ss_pred EEEEEC--CEEEEEEeecCCCCccccHH
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs~~ 78 (174)
....|+ .+++.++||+||+.-++...
T Consensus 13 ~~i~vGtp~q~~~v~~DTGS~~~wv~~~ 40 (317)
T cd05478 13 GTISIGTPPQDFTVIFDTGSSNLWVPSV 40 (317)
T ss_pred EEEEeCCCCcEEEEEEeCCCccEEEecC
Confidence 345565 57899999999999999753
No 55
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=23.01 E-value=58 Score=26.93 Aligned_cols=24 Identities=21% Similarity=0.570 Sum_probs=20.4
Q ss_pred EEEEEeecCCCCccccHHHHhhcC
Q 038652 61 KVTIVIDSGASPNFISAEQVDELK 84 (174)
Q Consensus 61 ~v~~LIDSGSThnFIs~~~v~~l~ 84 (174)
...++||||++..++..++.+++-
T Consensus 178 ~~~~iiDSGt~~~~lP~~~~~~l~ 201 (295)
T cd05474 178 NLPALLDSGTTLTYLPSDIVDAIA 201 (295)
T ss_pred CccEEECCCCccEeCCHHHHHHHH
Confidence 357899999999999999888753
No 56
>PTZ00147 plasmepsin-1; Provisional
Probab=22.75 E-value=97 Score=28.50 Aligned_cols=26 Identities=15% Similarity=0.327 Sum_probs=21.4
Q ss_pred EEEEEC--CEEEEEEeecCCCCccccHH
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs~~ 78 (174)
....|+ .++..++||+||+.-++...
T Consensus 142 ~~I~IGTP~Q~f~Vi~DTGSsdlWVps~ 169 (453)
T PTZ00147 142 GEAKLGDNGQKFNFIFDTGSANLWVPSI 169 (453)
T ss_pred EEEEECCCCeEEEEEEeCCCCcEEEeec
Confidence 445676 78999999999999999654
No 57
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=22.63 E-value=93 Score=26.43 Aligned_cols=28 Identities=18% Similarity=0.374 Sum_probs=21.7
Q ss_pred EEEEEEEC--CEEEEEEeecCCCCccccHH
Q 038652 51 FNVLGFVG--EHKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 51 mrl~g~I~--~~~v~~LIDSGSThnFIs~~ 78 (174)
......|+ .+++.+++|+||+.-++...
T Consensus 11 Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~ 40 (320)
T cd05488 11 YFTDITLGTPPQKFKVILDTGSSNLWVPSV 40 (320)
T ss_pred EEEEEEECCCCcEEEEEEecCCcceEEEcC
Confidence 34455666 48899999999999999654
No 58
>PF07894 DUF1669: Protein of unknown function (DUF1669); InterPro: IPR012461 This family is composed of sequences derived from hypothetical eukaryotic proteins of unknown function. Some members of this family are annotated as being potential phospholipases but no literature was found to support this.
Probab=22.42 E-value=1e+02 Score=26.94 Aligned_cols=48 Identities=21% Similarity=0.269 Sum_probs=42.3
Q ss_pred ECCEEEEEEeecCCCCccccHHHHhhcCCcccccceeEEEecCCCeeeee
Q 038652 57 VGEHKVTIVIDSGASPNFISAEQVDELKLPISEAEAYGVILGTGKSKKTQ 106 (174)
Q Consensus 57 I~~~~v~~LIDSGSThnFIs~~~v~~l~l~~~~~~p~~V~vanG~~l~~~ 106 (174)
-.+.||-+|+|=-.-+.|+ ++++++++....++.++|.+..|....|.
T Consensus 172 kR~VpVYiLLD~~~~~~Fl--~Mc~~~~v~~~~~~nmrVRsv~G~~y~~r 219 (284)
T PF07894_consen 172 KRGVPVYILLDEQNLPHFL--EMCEKLGVNLQHLKNMRVRSVTGCTYYSR 219 (284)
T ss_pred hcCCcEEEEechhcChHHH--HHHHHCCCChhhcCCeEEEEecCCeeecC
Confidence 3577999999999999998 58999999999999999999999776554
No 59
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=22.01 E-value=95 Score=26.95 Aligned_cols=26 Identities=27% Similarity=0.514 Sum_probs=20.1
Q ss_pred EEEEEC--CEEEEEEeecCCCCccccHH
Q 038652 53 VLGFVG--EHKVTIVIDSGASPNFISAE 78 (174)
Q Consensus 53 l~g~I~--~~~v~~LIDSGSThnFIs~~ 78 (174)
....|+ .+++.++||+||+.-++...
T Consensus 6 ~~i~iGtP~Q~~~v~~DTGSs~lWv~~~ 33 (364)
T cd05473 6 IEMLIGTPPQKLNILVDTGSSNFAVAAA 33 (364)
T ss_pred EEEEecCCCceEEEEEecCCcceEEEcC
Confidence 344565 57899999999999988643
No 60
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=21.77 E-value=98 Score=28.46 Aligned_cols=26 Identities=23% Similarity=0.553 Sum_probs=21.0
Q ss_pred EEEEC--CEEEEEEeecCCCCccccHHH
Q 038652 54 LGFVG--EHKVTIVIDSGASPNFISAEQ 79 (174)
Q Consensus 54 ~g~I~--~~~v~~LIDSGSThnFIs~~~ 79 (174)
...|+ ++++.++||+||+.-++...-
T Consensus 142 ~i~IGTP~Q~f~vi~DTGSsdlWV~s~~ 169 (450)
T PTZ00013 142 EGEVGDNHQKFMLIFDTGSANLWVPSKK 169 (450)
T ss_pred EEEECCCCeEEEEEEeCCCCceEEeccc
Confidence 44565 789999999999999996543
No 61
>PLN03146 aspartyl protease family protein; Provisional
Probab=21.76 E-value=1.2e+02 Score=27.32 Aligned_cols=20 Identities=30% Similarity=0.581 Sum_probs=16.7
Q ss_pred EEEeecCCCCccccHHHHhh
Q 038652 63 TIVIDSGASPNFISAEQVDE 82 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~ 82 (174)
-++||||+|-.++.+.+.+.
T Consensus 309 ~~iiDSGTt~t~Lp~~~y~~ 328 (431)
T PLN03146 309 NIIIDSGTTLTLLPSDFYSE 328 (431)
T ss_pred cEEEeCCccceecCHHHHHH
Confidence 37999999999999986444
No 62
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=21.59 E-value=83 Score=26.30 Aligned_cols=23 Identities=17% Similarity=0.429 Sum_probs=18.4
Q ss_pred EEEEC--CEEEEEEeecCCCCcccc
Q 038652 54 LGFVG--EHKVTIVIDSGASPNFIS 76 (174)
Q Consensus 54 ~g~I~--~~~v~~LIDSGSThnFIs 76 (174)
...|+ .+++.+++|+||+.-++.
T Consensus 5 ~i~iGtP~q~~~v~~DTGSs~~Wv~ 29 (299)
T cd05472 5 TVGLGTPARDQTVIVDTGSDLTWVQ 29 (299)
T ss_pred EEecCCCCcceEEEecCCCCccccc
Confidence 34455 578999999999999994
No 63
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=21.53 E-value=60 Score=27.68 Aligned_cols=21 Identities=33% Similarity=0.349 Sum_probs=18.4
Q ss_pred EEEeecCCCCccccHHHHhhc
Q 038652 63 TIVIDSGASPNFISAEQVDEL 83 (174)
Q Consensus 63 ~~LIDSGSThnFIs~~~v~~l 83 (174)
.++||||++-.++..+.++++
T Consensus 209 ~aiiDSGts~~~lP~~~~~~l 229 (326)
T cd05487 209 TAVVDTGASFISGPTSSISKL 229 (326)
T ss_pred EEEECCCccchhCcHHHHHHH
Confidence 589999999999999877765
No 64
>COG2383 Uncharacterized conserved protein [Function unknown]
Probab=20.57 E-value=30 Score=25.91 Aligned_cols=17 Identities=41% Similarity=0.894 Sum_probs=15.0
Q ss_pred eeeecccCceEEEeeee
Q 038652 112 YKWLETLGVTHINWKTR 128 (174)
Q Consensus 112 mdWL~~~gp~~iDw~~~ 128 (174)
..||.++|-++|||...
T Consensus 54 l~~La~~GVItin~~al 70 (109)
T COG2383 54 LFWLAQYGVITINWEAL 70 (109)
T ss_pred HHHHHHcCeEEEcHHHH
Confidence 68999999999999753
No 65
>PTZ00147 plasmepsin-1; Provisional
Probab=20.12 E-value=1.5e+02 Score=27.31 Aligned_cols=32 Identities=22% Similarity=0.388 Sum_probs=23.3
Q ss_pred EEEEEECCE---EEEEEeecCCCCccccHHHHhhc
Q 038652 52 NVLGFVGEH---KVTIVIDSGASPNFISAEQVDEL 83 (174)
Q Consensus 52 rl~g~I~~~---~v~~LIDSGSThnFIs~~~v~~l 83 (174)
.+.+.+++. ...++||||.|-.++..+.+.++
T Consensus 320 ~l~~~vg~~~~~~~~aIiDSGTsli~lP~~~~~ai 354 (453)
T PTZ00147 320 DLDVHFGNVSSEKANVIVDSGTSVITVPTEFLNKF 354 (453)
T ss_pred EEEEEECCEecCceeEEECCCCchhcCCHHHHHHH
Confidence 344444443 35699999999999999988764
Done!