Query 040861
Match_columns 199
No_of_seqs 205 out of 1322
Neff 7.0
Searched_HMMs 46136
Date Fri Mar 29 12:34:03 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/040861.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/040861hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03146 aspartyl protease fam 100.0 1.1E-31 2.4E-36 243.3 15.8 158 13-199 21-182 (431)
2 KOG1339 Aspartyl protease [Pos 99.9 8.4E-24 1.8E-28 189.8 10.8 100 90-199 42-143 (398)
3 cd06096 Plasmepsin_5 Plasmepsi 99.9 9.1E-22 2E-26 171.8 11.6 94 94-199 3-97 (326)
4 PF14543 TAXi_N: Xylanase inhi 99.9 9.6E-22 2.1E-26 156.9 8.2 87 95-199 1-91 (164)
5 cd05490 Cathepsin_D2 Cathepsin 99.8 2E-18 4.4E-23 150.4 10.2 84 89-199 1-85 (325)
6 cd05470 pepsin_retropepsin_lik 99.7 8.8E-18 1.9E-22 124.0 9.4 74 97-199 1-76 (109)
7 cd05487 renin_like Renin stimu 99.7 9.2E-18 2E-22 146.5 9.7 84 89-199 3-87 (326)
8 PTZ00147 plasmepsin-1; Provisi 99.7 1.7E-17 3.7E-22 151.6 11.5 84 85-199 129-216 (453)
9 cd05478 pepsin_A Pepsin A, asp 99.7 1.5E-17 3.2E-22 144.5 10.5 82 89-199 5-87 (317)
10 PTZ00165 aspartyl protease; Pr 99.7 1.9E-17 4E-22 152.4 10.5 85 89-199 115-202 (482)
11 cd05477 gastricsin Gastricsins 99.7 3.2E-17 6.8E-22 142.5 10.3 76 93-199 2-80 (318)
12 cd06097 Aspergillopepsin_like 99.7 2.7E-17 5.7E-22 140.4 9.3 78 95-199 1-79 (278)
13 cd05485 Cathepsin_D_like Cathe 99.7 2.6E-17 5.7E-22 144.0 9.3 84 89-199 6-90 (329)
14 cd06098 phytepsin Phytepsin, a 99.7 4.6E-17 1E-21 141.7 10.1 80 89-199 5-88 (317)
15 cd05473 beta_secretase_like Be 99.7 3.5E-17 7.7E-22 144.9 9.4 74 93-199 2-76 (364)
16 PTZ00013 plasmepsin 4 (PM4); P 99.7 9.8E-17 2.1E-21 146.5 11.4 83 86-199 129-215 (450)
17 cd05486 Cathespin_E Cathepsin 99.7 4.1E-17 8.9E-22 141.7 8.5 74 95-199 1-77 (316)
18 cd05488 Proteinase_A_fungi Fun 99.7 8.5E-17 1.8E-21 140.1 10.1 80 89-199 5-87 (320)
19 cd05475 nucellin_like Nucellin 99.6 8.1E-15 1.7E-19 125.1 8.2 59 94-199 2-62 (273)
20 cd05471 pepsin_like Pepsin-lik 99.5 1.1E-13 2.3E-18 116.7 9.6 78 95-199 1-79 (283)
21 cd05489 xylanase_inhibitor_I_l 99.4 1.7E-13 3.8E-18 121.9 7.2 74 101-199 2-91 (362)
22 PF00026 Asp: Eukaryotic aspar 99.4 1E-13 2.2E-18 118.9 5.3 78 94-199 1-79 (317)
23 cd05472 cnd41_like Chloroplast 98.9 6.5E-10 1.4E-14 95.7 3.4 32 94-125 1-33 (299)
24 cd05476 pepsin_A_like_plant Ch 98.7 9.5E-09 2.1E-13 87.2 3.8 30 94-123 1-30 (265)
25 cd05474 SAP_like SAPs, pepsin- 98.4 2.3E-07 4.9E-12 79.3 4.7 29 94-122 2-30 (295)
26 cd05483 retropepsin_like_bacte 95.4 0.024 5.1E-07 39.7 4.1 29 94-124 2-30 (96)
27 cd05484 retropepsin_like_LTR_2 92.4 0.18 3.8E-06 35.8 3.5 28 95-124 1-28 (91)
28 TIGR02281 clan_AA_DTGA clan AA 84.9 2 4.2E-05 32.4 4.6 31 92-124 9-39 (121)
29 PF00077 RVP: Retroviral aspar 84.4 1.7 3.6E-05 30.9 3.9 27 96-124 7-33 (100)
30 PF13975 gag-asp_proteas: gag- 81.9 3 6.5E-05 28.3 4.2 30 93-124 7-36 (72)
31 PF13650 Asp_protease_2: Aspar 75.5 3.8 8.3E-05 27.8 3.3 25 98-124 2-26 (90)
32 cd05479 RP_DDI RP_DDI; retrope 74.7 6 0.00013 29.7 4.4 29 94-124 16-44 (124)
33 cd05482 HIV_retropepsin_like R 70.9 6.4 0.00014 28.2 3.5 23 98-122 2-24 (87)
34 cd06095 RP_RTVL_H_like Retrope 64.0 9.8 0.00021 26.5 3.3 24 99-124 3-26 (86)
35 PF09668 Asp_protease: Asparty 44.5 20 0.00044 27.4 2.4 34 94-129 24-58 (124)
36 PF12384 Peptidase_A2B: Ty3 tr 29.2 60 0.0013 26.3 2.9 27 97-123 35-61 (177)
37 cd05475 nucellin_like Nucellin 27.4 82 0.0018 26.4 3.7 26 99-124 166-194 (273)
38 cd00303 retropepsin_like Retro 26.0 76 0.0016 19.7 2.6 21 99-121 3-23 (92)
39 PF07496 zf-CW: CW-type Zinc F 25.6 34 0.00074 21.6 0.8 8 119-126 2-10 (50)
40 TIGR03698 clan_AA_DTGF clan AA 25.1 95 0.0021 22.6 3.3 27 97-123 2-33 (107)
41 PF11645 PDDEXK_5: PD-(D/E)XK 22.8 65 0.0014 25.4 2.0 29 95-124 22-50 (149)
42 PF05585 DUF1758: Putative pep 21.6 47 0.001 25.8 1.1 20 105-124 10-29 (164)
43 COG3577 Predicted aspartyl pro 21.3 2.3E+02 0.005 23.7 5.1 37 85-123 95-132 (215)
No 1
>PLN03146 aspartyl protease family protein; Provisional
Probab=99.98 E-value=1.1e-31 Score=243.30 Aligned_cols=158 Identities=28% Similarity=0.440 Sum_probs=124.1
Q ss_pred CCceeEEEEEeecCcccccccccCCCCCCCCCCCCCcHHHHHHHHHhhHHHhcccccCCCCCCcccccCCCceeeeccCC
Q 040861 13 GGRIFSFEMHHRYSDQVKNWSISSGKLSHSDWPDKGSFDYYALLAHRDQILRGRHLSDTDTNSPLIFSDGNSTLRISSLG 92 (199)
Q Consensus 13 ~~~~~~l~l~Hr~s~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~d~~rr~~~l~~~~~~~~~~~~~g~~~~~~~~~~ 92 (199)
+.++++++|+||++++. |+++ +..+..++++++++||++ |.+++..... ...++..+. ...+
T Consensus 21 ~~~~~~~~l~h~~~~~s-------p~~~----~~~~~~~~~~~~~~~~~~-r~~~~~~~~~-~~~~~~~~~-----~~~~ 82 (431)
T PLN03146 21 PKGGFTVDLIHRDSPKS-------PFYN----PSETPSQRLRNAFRRSIS-RVNHFRPTDA-SPNDPQSDL-----ISNG 82 (431)
T ss_pred cCCceEEEEEeCCCCCC-------CCCC----CCCChhHHHHHHHHHHHH-HHHHHhhccc-cCCccccCc-----ccCC
Confidence 46789999999999962 2222 445567899999999998 7666643221 111222221 1234
Q ss_pred ceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC---C
Q 040861 93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN---R 168 (199)
Q Consensus 93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~---~ 168 (199)
.+|+++|.||||||++.|+|||||+++||||. |..|..+. ++.|||++|+||+.++|+++.|..+. .
T Consensus 83 ~~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~C~~C~~C~~~~---------~~~fdps~SST~~~~~C~s~~C~~~~~~~~ 153 (431)
T PLN03146 83 GEYLMNISIGTPPVPILAIADTGSDLIWTQCKPCDDCYKQV---------SPLFDPKKSSTYKDVSCDSSQCQALGNQAS 153 (431)
T ss_pred ccEEEEEEcCCCCceEEEEECCCCCcceEcCCCCcccccCC---------CCcccCCCCCCCcccCCCCcccccCCCCCC
Confidence 58999999999999999999999999999999 99998654 36999999999999999999998763 3
Q ss_pred CCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 169 CPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 169 C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
|.. ++.|.|.+.| +||+.+.|++++|+|+
T Consensus 154 c~~-~~~c~y~i~Y-gdgs~~~G~l~~Dtlt 182 (431)
T PLN03146 154 CSD-ENTCTYSYSY-GDGSFTKGNLAVETLT 182 (431)
T ss_pred CCC-CCCCeeEEEe-CCCCceeeEEEEEEEE
Confidence 754 3579999999 9999999999999985
No 2
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.90 E-value=8.4e-24 Score=189.76 Aligned_cols=100 Identities=41% Similarity=0.721 Sum_probs=88.3
Q ss_pred cCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC-CCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861 90 SLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS-KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN 167 (199)
Q Consensus 90 ~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~-~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~ 167 (199)
....+||++|.||||||.|+|+|||||+++||||. |. .|..+.. +.|||++||||+.+.|.++.|....
T Consensus 42 ~~~~~Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~~~~~---------~~f~p~~SSt~~~~~c~~~~c~~~~ 112 (398)
T KOG1339|consen 42 YSSGEYYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACYSQHN---------PIFDPSASSTYKSVGCSSPRCKSLP 112 (398)
T ss_pred ccccccEEEEecCCCCeeeEEEEeCCCCceeeccccccccccccCC---------CccCccccccccccCCCCccccccc
Confidence 33458999999999999999999999999999999 99 7976432 3599999999999999999999986
Q ss_pred CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
.+..+++.|.|.+.| +|++.++|+|++|+|+
T Consensus 113 ~~~~~~~~C~y~i~Y-gd~~~~~G~l~~Dtv~ 143 (398)
T KOG1339|consen 113 QSCSPNSSCPYSIQY-GDGSSTSGYLATDTVT 143 (398)
T ss_pred cCcccCCcCceEEEe-CCCCceeEEEEEEEEE
Confidence 544446899999999 9999999999999985
No 3
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=99.87 E-value=9.1e-22 Score=171.84 Aligned_cols=94 Identities=32% Similarity=0.559 Sum_probs=84.0
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCC
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGT 172 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~ 172 (199)
+||++|.||||+|++.|+|||||+++||+|. |..|..+. ++.|+|++|+|++.+.|++..|.....|..
T Consensus 3 ~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~---------~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~- 72 (326)
T cd06096 3 YYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHM---------EPPYNLNNSITSSILYCDCNKCCYCLSCLN- 72 (326)
T ss_pred eEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCC---------CCCcCcccccccccccCCCccccccCcCCC-
Confidence 7999999999999999999999999999999 99997543 368999999999999999999966555643
Q ss_pred CCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 173 FSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 173 ~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
+.|.|.+.| +||+.+.|.+++|+|+
T Consensus 73 -~~~~~~i~Y-~~gs~~~G~~~~D~v~ 97 (326)
T cd06096 73 -NKCEYSISY-SEGSSISGFYFSDFVS 97 (326)
T ss_pred -CcCcEEEEE-CCCCceeeEEEEEEEE
Confidence 689999999 9998899999999974
No 4
>PF14543 TAXi_N: Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=99.86 E-value=9.6e-22 Score=156.89 Aligned_cols=87 Identities=39% Similarity=0.709 Sum_probs=73.0
Q ss_pred EEEEEEeCCCCceEEEEEeCCCCceeEeccCCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC----CCC
Q 040861 95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCECSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN----RCP 170 (199)
Q Consensus 95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~----~C~ 170 (199)
||++|+||||+|++.|+|||||+++|+||. .+.|+|++|+||+.|+|+++.|.... .|.
T Consensus 1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~~C~-----------------~~~f~~~~Sst~~~v~C~s~~C~~~~~~~~~~~ 63 (164)
T PF14543_consen 1 YYVSVSIGTPPQPFSLVVDTGSDLTWVQCP-----------------DPPFDPSKSSTYRPVPCSSPQCSSAPSFCPCCC 63 (164)
T ss_dssp EEEEEECTCTTEEEEEEEETT-SSEEEET---------------------STT-TTSSBEC-BTTSHHHHHCTSSBTCCT
T ss_pred CEEEEEeCCCCceEEEEEECCCCceEEcCC-----------------CcccCCccCCcccccCCCCcchhhcccccccCC
Confidence 889999999999999999999999999982 26899999999999999999999653 344
Q ss_pred CCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 171 GTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 171 ~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..++.|.|.+.| +|++.+.|+|++|+|+
T Consensus 64 ~~~~~C~y~~~y-~~~s~~~G~l~~D~~~ 91 (164)
T PF14543_consen 64 CSNNSCPYSQSY-GDGSSSSGFLASDTLT 91 (164)
T ss_dssp CESSEEEEEEEE-TTTEEEEEEEEEEEEE
T ss_pred CCcCcccceeec-CCCccccCceEEEEEE
Confidence 445789999999 8999999999999984
No 5
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=99.76 E-value=2e-18 Score=150.37 Aligned_cols=84 Identities=30% Similarity=0.469 Sum_probs=69.7
Q ss_pred ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861 89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN 167 (199)
Q Consensus 89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~ 167 (199)
++.+.+||++|.||||+|++.|+|||||+++||+|. |..|... |..++.|+|++|+||+.
T Consensus 1 ~~~~~~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~-------C~~~~~y~~~~SsT~~~------------ 61 (325)
T cd05490 1 NYMDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIA-------CWLHHKYNSSKSSTYVK------------ 61 (325)
T ss_pred CCcCCEEEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCcc-------ccCcCcCCcccCcceee------------
Confidence 356789999999999999999999999999999999 8632100 12246899999999986
Q ss_pred CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| ++|+ +.|.+++|+|+
T Consensus 62 ------~~~~~~i~Y-g~G~-~~G~~~~D~v~ 85 (325)
T cd05490 62 ------NGTEFAIQY-GSGS-LSGYLSQDTVS 85 (325)
T ss_pred ------CCcEEEEEE-CCcE-EEEEEeeeEEE
Confidence 258999999 8875 78999999974
No 6
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=99.74 E-value=8.8e-18 Score=124.04 Aligned_cols=74 Identities=36% Similarity=0.609 Sum_probs=62.3
Q ss_pred EEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccC-CCCCCCCccccccccccccCCCCCCCCCC
Q 040861 97 TTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIY-NPEVSSTSKKVTCNNLLCAHRNRCPGTFS 174 (199)
Q Consensus 97 ~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~f-dp~~SsT~~~v~C~s~~C~~~~~C~~~~~ 174 (199)
++|.||||+|++.|+|||||+++||+|. |..|..+. ++.| +|++|++++. .
T Consensus 1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~---------~~~~~~~~~sst~~~------------------~ 53 (109)
T cd05470 1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYS---------HSSYDDPSASSTYSD------------------N 53 (109)
T ss_pred CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCccccc---------ccccCCcCCCCCCCC------------------C
Confidence 3789999999999999999999999999 98775432 2355 9999999886 4
Q ss_pred CCceEEEEccCCceEEEEEEEEeeC
Q 040861 175 NCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 175 ~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
.|.|.+.| ++|+ ..|.+++|+|+
T Consensus 54 ~~~~~~~Y-~~g~-~~g~~~~D~v~ 76 (109)
T cd05470 54 GCTFSITY-GTGS-LSGGLSTDTVS 76 (109)
T ss_pred CcEEEEEe-CCCe-EEEEEEEEEEE
Confidence 69999999 8874 67999999874
No 7
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=99.73 E-value=9.2e-18 Score=146.50 Aligned_cols=84 Identities=31% Similarity=0.536 Sum_probs=70.8
Q ss_pred ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861 89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN 167 (199)
Q Consensus 89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~ 167 (199)
++.+.+||++|+||||+|++.|+|||||+++||+|. |..|..+ +..++.|||++|+||+.
T Consensus 3 ~~~~~~y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~-------c~~~~~y~~~~SsT~~~------------ 63 (326)
T cd05487 3 NYLDTQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTA-------CVTHNLYDASDSSTYKE------------ 63 (326)
T ss_pred ccCCCeEEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchh-------hcccCcCCCCCCeeeeE------------
Confidence 456789999999999999999999999999999998 8653211 22357999999999986
Q ss_pred CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| ++|+ +.|.+++|+|+
T Consensus 64 ------~~~~~~~~Y-g~g~-~~G~~~~D~v~ 87 (326)
T cd05487 64 ------NGTEFTIHY-ASGT-VKGFLSQDIVT 87 (326)
T ss_pred ------CCEEEEEEe-CCce-EEEEEeeeEEE
Confidence 368999999 8875 89999999974
No 8
>PTZ00147 plasmepsin-1; Provisional
Probab=99.73 E-value=1.7e-17 Score=151.59 Aligned_cols=84 Identities=29% Similarity=0.563 Sum_probs=72.3
Q ss_pred eeee-ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCcccccccc
Q 040861 85 TLRI-SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNN 160 (199)
Q Consensus 85 ~~~~-~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s 160 (199)
.+++ +..+.+||++|.||||||++.|+|||||+++||+|. |. .|.. ++.|||++|+||+.
T Consensus 129 ~v~L~n~~n~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~-----------~~~yd~s~SsT~~~----- 192 (453)
T PTZ00147 129 NVELKDLANVMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCET-----------KNLYDSSKSKTYEK----- 192 (453)
T ss_pred eeeccccCCCEEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccC-----------CCccCCccCcceEE-----
Confidence 3444 677789999999999999999999999999999999 97 3542 46899999999987
Q ss_pred ccccCCCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 161 LLCAHRNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 161 ~~C~~~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| ++|+ +.|.+++|+|+
T Consensus 193 -------------~~~~f~i~Y-g~Gs-vsG~~~~DtVt 216 (453)
T PTZ00147 193 -------------DGTKVEMNY-VSGT-VSGFFSKDLVT 216 (453)
T ss_pred -------------CCCEEEEEe-CCCC-EEEEEEEEEEE
Confidence 358999999 8874 89999999984
No 9
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=99.73 E-value=1.5e-17 Score=144.55 Aligned_cols=82 Identities=35% Similarity=0.559 Sum_probs=70.3
Q ss_pred ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861 89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN 167 (199)
Q Consensus 89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~ 167 (199)
|+.+.+||++|.||||+|++.|+|||||+++||+|. |..|..+ .++.|||++|+||+.
T Consensus 5 n~~~~~Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~---------~~~~f~~~~Sst~~~------------ 63 (317)
T cd05478 5 NYLDMEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACS---------NHNRFNPRQSSTYQS------------ 63 (317)
T ss_pred cccCCEEEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCccccc---------ccCcCCCCCCcceee------------
Confidence 566789999999999999999999999999999999 9743211 246999999999987
Q ss_pred CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| ++|+ +.|.++.|+|+
T Consensus 64 ------~~~~~~~~y-g~gs-~~G~~~~D~v~ 87 (317)
T cd05478 64 ------TGQPLSIQY-GTGS-MTGILGYDTVQ 87 (317)
T ss_pred ------CCcEEEEEE-CCce-EEEEEeeeEEE
Confidence 357999999 8876 78999999974
No 10
>PTZ00165 aspartyl protease; Provisional
Probab=99.72 E-value=1.9e-17 Score=152.37 Aligned_cols=85 Identities=26% Similarity=0.457 Sum_probs=70.2
Q ss_pred ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCC--CCCCCCCCccCCCccccCCCCCCCCccccccccccccC
Q 040861 89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSK--CAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAH 165 (199)
Q Consensus 89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~--C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~ 165 (199)
|+.+.+||++|.||||||+|.|+|||||+++||+|. |.. |. .++.|||++||||+.+.++.
T Consensus 115 n~~d~~Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C~-----------~~~~yd~s~SSTy~~~~~~~----- 178 (482)
T PTZ00165 115 NFHNSQYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGCA-----------PHRKFDPKKSSTYTKLKLGD----- 178 (482)
T ss_pred cccCCeEEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCccccc-----------ccCCCCccccCCcEecCCCC-----
Confidence 788999999999999999999999999999999999 964 54 25799999999999853111
Q ss_pred CCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 166 RNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 166 ~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
....+.++| ++|+ ..|.+++|+++
T Consensus 179 --------~~~~~~i~Y-GsGs-~~G~l~~DtV~ 202 (482)
T PTZ00165 179 --------ESAETYIQY-GTGE-CVLALGKDTVK 202 (482)
T ss_pred --------ccceEEEEe-CCCc-EEEEEEEEEEE
Confidence 122467999 8765 66999999984
No 11
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=99.72 E-value=3.2e-17 Score=142.45 Aligned_cols=76 Identities=33% Similarity=0.614 Sum_probs=67.4
Q ss_pred ceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCC
Q 040861 93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRC 169 (199)
Q Consensus 93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C 169 (199)
.+|+++|.||||||++.|+|||||+++||+|. |. .|.. ++.|||++|+||+.
T Consensus 2 ~~y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C~~-----------~~~f~~~~SsT~~~-------------- 56 (318)
T cd05477 2 MSYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQACTN-----------HTKFNPSQSSTYST-------------- 56 (318)
T ss_pred cEEEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccccc-----------cCCCCcccCCCceE--------------
Confidence 57999999999999999999999999999999 97 4543 46999999999986
Q ss_pred CCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 170 PGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 170 ~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| +||+ ..|.+++|+|+
T Consensus 57 ----~~~~~~~~Y-g~Gs-~~G~~~~D~i~ 80 (318)
T cd05477 57 ----NGETFSLQY-GSGS-LTGIFGYDTVT 80 (318)
T ss_pred ----CCcEEEEEE-CCcE-EEEEEEeeEEE
Confidence 468999999 8876 68999999874
No 12
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=99.71 E-value=2.7e-17 Score=140.42 Aligned_cols=78 Identities=32% Similarity=0.494 Sum_probs=68.8
Q ss_pred EEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCCC
Q 040861 95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGTF 173 (199)
Q Consensus 95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~~ 173 (199)
|+++|.||||+|++.|+|||||+++||+|. |..|..+. +..|+|++|+|++.++
T Consensus 1 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~---------~~~y~~~~Sst~~~~~---------------- 55 (278)
T cd06097 1 YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGG---------HKLYDPSKSSTAKLLP---------------- 55 (278)
T ss_pred CeeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhcc---------CCcCCCccCccceecC----------------
Confidence 789999999999999999999999999999 98775332 3579999999998743
Q ss_pred CCCceEEEEccCCceEEEEEEEEeeC
Q 040861 174 SNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 174 ~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
.|.|.+.| ++|+.+.|.+++|+|+
T Consensus 56 -~~~~~i~Y-~~G~~~~G~~~~D~v~ 79 (278)
T cd06097 56 -GATWSISY-GDGSSASGIVYTDTVS 79 (278)
T ss_pred -CcEEEEEe-CCCCeEEEEEEEEEEE
Confidence 68999999 9988899999999974
No 13
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=99.71 E-value=2.6e-17 Score=144.03 Aligned_cols=84 Identities=30% Similarity=0.490 Sum_probs=70.7
Q ss_pred ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861 89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN 167 (199)
Q Consensus 89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~ 167 (199)
|+.+.+|+++|.||||+|++.|+|||||+++||+|. |..|... +..++.|+|++|+|++.
T Consensus 6 n~~~~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~-------c~~~~~y~~~~Sst~~~------------ 66 (329)
T cd05485 6 NYMDAQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIA-------CLLHNKYDSTKSSTYKK------------ 66 (329)
T ss_pred eccCCeEEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCcc-------ccCCCeECCcCCCCeEE------------
Confidence 677889999999999999999999999999999999 8732110 11246899999999987
Q ss_pred CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| ++|+ +.|.+++|+++
T Consensus 67 ------~~~~~~i~Y-~~g~-~~G~~~~D~v~ 90 (329)
T cd05485 67 ------NGTEFAIQY-GSGS-LSGFLSTDTVS 90 (329)
T ss_pred ------CCeEEEEEE-CCce-EEEEEecCcEE
Confidence 368999999 8876 78999999874
No 14
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=99.71 E-value=4.6e-17 Score=141.71 Aligned_cols=80 Identities=35% Similarity=0.540 Sum_probs=69.5
Q ss_pred ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC---CCCCCCCCCccCCCccccCCCCCCCCcccccccccccc
Q 040861 89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS---KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCA 164 (199)
Q Consensus 89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~---~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~ 164 (199)
|+.+.+||++|.||||+|++.|+|||||+++||+|. |. .|.. ++.|||++|+||+.
T Consensus 5 n~~~~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C~~-----------~~~y~~~~SsT~~~--------- 64 (317)
T cd06098 5 NYLDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYF-----------HSKYKSSKSSTYKK--------- 64 (317)
T ss_pred ccCCCEEEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccccc-----------cCcCCcccCCCccc---------
Confidence 566789999999999999999999999999999999 95 5653 46899999999987
Q ss_pred CCCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 165 HRNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 165 ~~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
....+.+.| ++|+ +.|.+++|+|+
T Consensus 65 ---------~~~~~~i~Y-g~G~-~~G~~~~D~v~ 88 (317)
T cd06098 65 ---------NGTSASIQY-GTGS-ISGFFSQDSVT 88 (317)
T ss_pred ---------CCCEEEEEc-CCce-EEEEEEeeEEE
Confidence 246889999 8775 68999999974
No 15
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=99.71 E-value=3.5e-17 Score=144.92 Aligned_cols=74 Identities=26% Similarity=0.293 Sum_probs=65.1
Q ss_pred ceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCC
Q 040861 93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPG 171 (199)
Q Consensus 93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~ 171 (199)
..||++|.||||+|+|.|+|||||+++||+|. |..| ++.|||++|+||+.
T Consensus 2 ~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~~~-------------~~~f~~~~SsT~~~---------------- 52 (364)
T cd05473 2 QGYYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHPFI-------------HTYFHRELSSTYRD---------------- 52 (364)
T ss_pred CceEEEEEecCCCceEEEEEecCCcceEEEcCCCccc-------------cccCCchhCcCccc----------------
Confidence 47999999999999999999999999999998 7322 35899999999987
Q ss_pred CCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 172 TFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 172 ~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|++.| ++|+ +.|.+++|+|+
T Consensus 53 --~~~~~~i~Y-g~Gs-~~G~~~~D~v~ 76 (364)
T cd05473 53 --LGKGVTVPY-TQGS-WEGELGTDLVS 76 (364)
T ss_pred --CCceEEEEE-Ccce-EEEEEEEEEEE
Confidence 358999999 8875 68999999974
No 16
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=99.70 E-value=9.8e-17 Score=146.47 Aligned_cols=83 Identities=25% Similarity=0.567 Sum_probs=72.0
Q ss_pred eee-ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccc
Q 040861 86 LRI-SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNL 161 (199)
Q Consensus 86 ~~~-~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~ 161 (199)
+++ ++.+.+||++|.||||+|++.|+|||||+++||+|. |. .|.. ++.|||++|+|++.
T Consensus 129 ~~l~d~~n~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~-----------~~~yd~s~SsT~~~------ 191 (450)
T PTZ00013 129 IELDDVANIMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSI-----------KNLYDSSKSKSYEK------ 191 (450)
T ss_pred eeeeccCCCEEEEEEEECCCCeEEEEEEeCCCCceEEecccCCcccccc-----------CCCccCccCccccc------
Confidence 444 666789999999999999999999999999999999 96 4643 46899999999987
Q ss_pred cccCCCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 162 LCAHRNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 162 ~C~~~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| ++|+ +.|.+++|+|+
T Consensus 192 ------------~~~~~~i~Y-G~Gs-v~G~~~~Dtv~ 215 (450)
T PTZ00013 192 ------------DGTKVDITY-GSGT-VKGFFSKDLVT 215 (450)
T ss_pred ------------CCcEEEEEE-CCce-EEEEEEEEEEE
Confidence 358999999 8875 89999999974
No 17
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=99.70 E-value=4.1e-17 Score=141.74 Aligned_cols=74 Identities=31% Similarity=0.590 Sum_probs=65.6
Q ss_pred EEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCC
Q 040861 95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPG 171 (199)
Q Consensus 95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~ 171 (199)
||++|.||||+|++.|+|||||+++||+|. |. .|.. ++.|||++|+||+.
T Consensus 1 Y~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C~~-----------~~~y~~~~SsT~~~---------------- 53 (316)
T cd05486 1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQACTK-----------HNRFQPSESSTYVS---------------- 53 (316)
T ss_pred CeEEEEECCCCcEEEEEEcCCCccEEEecCCCCCcccCc-----------cceECCCCCccccc----------------
Confidence 789999999999999999999999999999 86 4643 46899999999987
Q ss_pred CCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 172 TFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 172 ~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| ++|+ +.|.+++|+|+
T Consensus 54 --~~~~~~i~Y-g~g~-~~G~~~~D~v~ 77 (316)
T cd05486 54 --NGEAFSIQY-GTGS-LTGIIGIDQVT 77 (316)
T ss_pred --CCcEEEEEe-CCcE-EEEEeeecEEE
Confidence 368999999 8875 79999999974
No 18
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=99.69 E-value=8.5e-17 Score=140.11 Aligned_cols=80 Identities=34% Similarity=0.590 Sum_probs=70.1
Q ss_pred ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccccccC
Q 040861 89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAH 165 (199)
Q Consensus 89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~ 165 (199)
++.+.+||++|.||||+|++.|+|||||+++||+|. |. .|.. ++.|+|++|+|++.
T Consensus 5 n~~~~~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C~~-----------~~~y~~~~Sst~~~---------- 63 (320)
T cd05488 5 NYLNAQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIACFL-----------HSKYDSSASSTYKA---------- 63 (320)
T ss_pred ccCCCEEEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCcccCC-----------cceECCCCCcceee----------
Confidence 556789999999999999999999999999999999 96 4643 35899999999986
Q ss_pred CCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 166 RNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 166 ~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
+.|.|.+.| ++|+ +.|.+++|+|+
T Consensus 64 --------~~~~~~~~y-~~g~-~~G~~~~D~v~ 87 (320)
T cd05488 64 --------NGTEFKIQY-GSGS-LEGFVSQDTLS 87 (320)
T ss_pred --------CCCEEEEEE-CCce-EEEEEEEeEEE
Confidence 468999999 8876 79999999874
No 19
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=99.56 E-value=8.1e-15 Score=125.13 Aligned_cols=59 Identities=37% Similarity=0.947 Sum_probs=52.3
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEecc--CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCC
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE--CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPG 171 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~--C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~ 171 (199)
+||++|.||||||++.|+|||||+++||+|. |..|
T Consensus 2 ~Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c------------------------------------------- 38 (273)
T cd05475 2 YYYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGC------------------------------------------- 38 (273)
T ss_pred ceEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCC-------------------------------------------
Confidence 6899999999999999999999999999984 4322
Q ss_pred CCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 172 TFSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 172 ~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
.|.|.+.| +|++.+.|.+++|+|+
T Consensus 39 ---~c~~~i~Y-gd~~~~~G~~~~D~v~ 62 (273)
T cd05475 39 ---QCDYEIEY-ADGGSSMGVLVTDIFS 62 (273)
T ss_pred ---cCccEeEe-CCCCceEEEEEEEEEE
Confidence 37899999 9889999999999974
No 20
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=99.50 E-value=1.1e-13 Score=116.67 Aligned_cols=78 Identities=37% Similarity=0.665 Sum_probs=62.5
Q ss_pred EEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCCC
Q 040861 95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGTF 173 (199)
Q Consensus 95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~~ 173 (199)
|+++|.||||+|++.|+|||||+++||+|. |..|..+... ...|++..|+++..
T Consensus 1 Y~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~-------~~~~~~~~s~~~~~------------------ 55 (283)
T cd05471 1 YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHP-------RFKYDSSKSSTYKD------------------ 55 (283)
T ss_pred CEEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCC-------CCccCccCCceeec------------------
Confidence 679999999999999999999999999999 9877543211 11377777766654
Q ss_pred CCCceEEEEccCCceEEEEEEEEeeC
Q 040861 174 SNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 174 ~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
..|.|.+.| +++ ...|.+++|+|+
T Consensus 56 ~~~~~~~~Y-~~g-~~~g~~~~D~v~ 79 (283)
T cd05471 56 TGCTFSITY-GDG-SVTGGLGTDTVT 79 (283)
T ss_pred CCCEEEEEE-CCC-eEEEEEEEeEEE
Confidence 579999999 874 678999999874
No 21
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=99.44 E-value=1.7e-13 Score=121.93 Aligned_cols=74 Identities=24% Similarity=0.415 Sum_probs=61.1
Q ss_pred eCCCCce-EEEEEeCCCCceeEeccCCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC------------
Q 040861 101 LGTPGMK-FMVALDTGSDLFWVPCECSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN------------ 167 (199)
Q Consensus 101 iGTP~q~-~~v~~DTGS~~~Wv~C~C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~------------ 167 (199)
+|||-.+ +.|+|||||+++||||. |.+|+||+.++|+++.|+.+.
T Consensus 2 ~~~~~~~~~~~~~DTGS~l~WvqC~----------------------~~~sst~~~~~C~s~~C~~~~~~~~~~~~~~~~ 59 (362)
T cd05489 2 TITPLKGAVPLVLDLAGPLLWSTCD----------------------AGHSSTYQTVPCSSSVCSLANRYHCPGTCGGAP 59 (362)
T ss_pred cccCccCCeeEEEECCCCceeeeCC----------------------CCCcCCCCccCcCChhhccccccCCCccccCCC
Confidence 5788878 99999999999999996 136789999999999998641
Q ss_pred --CCCCCCCCCceEEE-EccCCceEEEEEEEEeeC
Q 040861 168 --RCPGTFSNCPYSVS-YVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 168 --~C~~~~~~C~y~~~-Y~gdgs~~~G~l~~Dtlt 199 (199)
.|. ++.|.|... | ++|+.+.|+|++|+|+
T Consensus 60 ~~~c~--~~~C~y~~~~y-~~gs~t~G~l~~Dtl~ 91 (362)
T cd05489 60 GPGCG--NNTCTAHPYNP-VTGECATGDLTQDVLS 91 (362)
T ss_pred CCCCC--CCcCeeEcccc-ccCcEeeEEEEEEEEE
Confidence 343 246989765 7 7889999999999985
No 22
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=99.44 E-value=1e-13 Score=118.93 Aligned_cols=78 Identities=38% Similarity=0.630 Sum_probs=64.9
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCC
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGT 172 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~ 172 (199)
+|+++|.||||+|++.|++||||+.+||++. |..|.. +.....|++.+|+|++..
T Consensus 1 ~Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~--------~~~~~~y~~~~S~t~~~~---------------- 56 (317)
T PF00026_consen 1 QYYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSS--------CASSGFYNPSKSSTFSNQ---------------- 56 (317)
T ss_dssp EEEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTH--------HCTSC-BBGGGSTTEEEE----------------
T ss_pred CeEEEEEECCCCeEEEEEEecccceeeeceeccccccc--------cccccccccccccccccc----------------
Confidence 5899999999999999999999999999998 876510 112368999999999873
Q ss_pred CCCCceEEEEccCCceEEEEEEEEeeC
Q 040861 173 FSNCPYSVSYVSAQTSTSGILVEMFFT 199 (199)
Q Consensus 173 ~~~C~y~~~Y~gdgs~~~G~l~~Dtlt 199 (199)
.+.|.+.| ++|+ ..|.+++|+|+
T Consensus 57 --~~~~~~~y-~~g~-~~G~~~~D~v~ 79 (317)
T PF00026_consen 57 --GKPFSISY-GDGS-VSGNLVSDTVS 79 (317)
T ss_dssp --EEEEEEEE-TTEE-EEEEEEEEEEE
T ss_pred --eeeeeeec-cCcc-cccccccceEe
Confidence 47899999 8877 99999999974
No 23
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=98.92 E-value=6.5e-10 Score=95.70 Aligned_cols=32 Identities=53% Similarity=0.881 Sum_probs=30.9
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEecc-C
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-C 125 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C 125 (199)
+|+++|.||||||++.|+|||||+++||+|. |
T Consensus 1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~c~~c 33 (299)
T cd05472 1 EYVVTVGLGTPARDQTVIVDTGSDLTWVQCQPC 33 (299)
T ss_pred CeEEEEecCCCCcceEEEecCCCCcccccCCCC
Confidence 4899999999999999999999999999998 7
No 24
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=98.72 E-value=9.5e-09 Score=87.18 Aligned_cols=30 Identities=43% Similarity=0.802 Sum_probs=28.9
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEec
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPC 123 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C 123 (199)
+|+++|.||||+|++.|+|||||+.+||+|
T Consensus 1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~ 30 (265)
T cd05476 1 EYLVTLSIGTPPQPFSLIVDTGSDLTWTQC 30 (265)
T ss_pred CeEEEEecCCCCcceEEEecCCCCCEEEcC
Confidence 489999999999999999999999999998
No 25
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=98.43 E-value=2.3e-07 Score=79.27 Aligned_cols=29 Identities=45% Similarity=0.723 Sum_probs=28.1
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEe
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVP 122 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~ 122 (199)
.|+++|.||||+|++.|+|||||+++||+
T Consensus 2 ~Y~~~i~iGtp~q~~~v~~DTgS~~~wv~ 30 (295)
T cd05474 2 YYSAELSVGTPPQKVTVLLDTGSSDLWVP 30 (295)
T ss_pred eEEEEEEECCCCcEEEEEEeCCCCcceee
Confidence 68899999999999999999999999998
No 26
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=95.44 E-value=0.024 Score=39.73 Aligned_cols=29 Identities=24% Similarity=0.269 Sum_probs=26.1
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
.|++++.|+ .+++.+++|||++.+|+.-.
T Consensus 2 ~~~v~v~i~--~~~~~~llDTGa~~s~i~~~ 30 (96)
T cd05483 2 HFVVPVTIN--GQPVRFLLDTGASTTVISEE 30 (96)
T ss_pred cEEEEEEEC--CEEEEEEEECCCCcEEcCHH
Confidence 478999999 79999999999999999765
No 27
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=92.36 E-value=0.18 Score=35.76 Aligned_cols=28 Identities=25% Similarity=0.336 Sum_probs=24.4
Q ss_pred EEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
+|+++.|+ .+++.+.+||||+..++.-+
T Consensus 1 ~~~~~~In--g~~i~~lvDTGA~~svis~~ 28 (91)
T cd05484 1 KTVTLLVN--GKPLKFQLDTGSAITVISEK 28 (91)
T ss_pred CEEEEEEC--CEEEEEEEcCCcceEEeCHH
Confidence 46888898 79999999999999998665
No 28
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=84.86 E-value=2 Score=32.40 Aligned_cols=31 Identities=23% Similarity=0.215 Sum_probs=26.0
Q ss_pred CceEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 92 GFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 92 ~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
+..|++++.|. .+++.+++|||++.+-+.-.
T Consensus 9 ~g~~~v~~~In--G~~~~flVDTGAs~t~is~~ 39 (121)
T TIGR02281 9 DGHFYATGRVN--GRNVRFLVDTGATSVALNEE 39 (121)
T ss_pred CCeEEEEEEEC--CEEEEEEEECCCCcEEcCHH
Confidence 45788999997 68999999999998877654
No 29
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=84.36 E-value=1.7 Score=30.88 Aligned_cols=27 Identities=37% Similarity=0.583 Sum_probs=22.2
Q ss_pred EEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 96 YTTVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 96 ~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
+.+|.+. .+++.+++|||++..-++-.
T Consensus 7 ~i~v~i~--g~~i~~LlDTGA~vsiI~~~ 33 (100)
T PF00077_consen 7 YITVKIN--GKKIKALLDTGADVSIISEK 33 (100)
T ss_dssp EEEEEET--TEEEEEEEETTBSSEEESSG
T ss_pred eEEEeEC--CEEEEEEEecCCCcceeccc
Confidence 4577777 67999999999999888654
No 30
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=81.88 E-value=3 Score=28.30 Aligned_cols=30 Identities=27% Similarity=0.464 Sum_probs=26.0
Q ss_pred ceEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
..+++.+.|| .+.+.+.+|||++..-++-+
T Consensus 7 g~~~v~~~I~--g~~~~alvDtGat~~fis~~ 36 (72)
T PF13975_consen 7 GLMYVPVSIG--GVQVKALVDTGATHNFISES 36 (72)
T ss_pred CEEEEEEEEC--CEEEEEEEeCCCcceecCHH
Confidence 4788999999 69999999999998877665
No 31
>PF13650 Asp_protease_2: Aspartyl protease
Probab=75.50 E-value=3.8 Score=27.84 Aligned_cols=25 Identities=20% Similarity=0.351 Sum_probs=19.6
Q ss_pred EEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 98 TVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 98 ~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
++.|+ .+++.+++|||++...+.-.
T Consensus 2 ~v~vn--g~~~~~liDTGa~~~~i~~~ 26 (90)
T PF13650_consen 2 PVKVN--GKPVRFLIDTGASISVISRS 26 (90)
T ss_pred EEEEC--CEEEEEEEcCCCCcEEECHH
Confidence 45666 68999999999998777544
No 32
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=74.71 E-value=6 Score=29.67 Aligned_cols=29 Identities=17% Similarity=0.410 Sum_probs=25.3
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
.+|+++.|+ .+++.+++|||+....+.-.
T Consensus 16 ~~~v~~~In--g~~~~~LvDTGAs~s~Is~~ 44 (124)
T cd05479 16 MLYINVEIN--GVPVKAFVDSGAQMTIMSKA 44 (124)
T ss_pred EEEEEEEEC--CEEEEEEEeCCCceEEeCHH
Confidence 567899998 78999999999999988765
No 33
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=70.94 E-value=6.4 Score=28.16 Aligned_cols=23 Identities=35% Similarity=0.488 Sum_probs=19.1
Q ss_pred EEEeCCCCceEEEEEeCCCCceeEe
Q 040861 98 TVQLGTPGMKFMVALDTGSDLFWVP 122 (199)
Q Consensus 98 ~v~iGTP~q~~~v~~DTGS~~~Wv~ 122 (199)
++.|+ .|.+.+.+|||.|.+-++
T Consensus 2 ~~~i~--g~~~~~llDTGAd~Tvi~ 24 (87)
T cd05482 2 TLYIN--GKLFEGLLDTGADVSIIA 24 (87)
T ss_pred EEEEC--CEEEEEEEccCCCCeEEc
Confidence 45566 799999999999998774
No 34
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=64.02 E-value=9.8 Score=26.51 Aligned_cols=24 Identities=17% Similarity=0.371 Sum_probs=19.0
Q ss_pred EEeCCCCceEEEEEeCCCCceeEecc
Q 040861 99 VQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 99 v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
+.|. .+++.+++|||.+.+-+.-.
T Consensus 3 v~In--G~~~~fLvDTGA~~tii~~~ 26 (86)
T cd06095 3 ITVE--GVPIVFLVDTGATHSVLKSD 26 (86)
T ss_pred EEEC--CEEEEEEEECCCCeEEECHH
Confidence 4454 68899999999999888554
No 35
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=44.55 E-value=20 Score=27.36 Aligned_cols=34 Identities=18% Similarity=0.430 Sum_probs=24.3
Q ss_pred eEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCC
Q 040861 94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCA 129 (199)
Q Consensus 94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~ 129 (199)
..|+++.|+ .+++.+.+|||...+-+.-+ +..|.
T Consensus 24 mLyI~~~in--g~~vkA~VDtGAQ~tims~~~a~r~g 58 (124)
T PF09668_consen 24 MLYINCKIN--GVPVKAFVDTGAQSTIMSKSCAERCG 58 (124)
T ss_dssp --EEEEEET--TEEEEEEEETT-SS-EEEHHHHHHTT
T ss_pred eEEEEEEEC--CEEEEEEEeCCCCccccCHHHHHHcC
Confidence 577899999 79999999999988877655 34553
No 36
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=29.16 E-value=60 Score=26.28 Aligned_cols=27 Identities=30% Similarity=0.429 Sum_probs=20.6
Q ss_pred EEEEeCCCCceEEEEEeCCCCceeEec
Q 040861 97 TTVQLGTPGMKFMVALDTGSDLFWVPC 123 (199)
Q Consensus 97 ~~v~iGTP~q~~~v~~DTGS~~~Wv~C 123 (199)
..+.+++-..++.+.|||||-...+.-
T Consensus 35 ~~v~l~~~~t~i~vLfDSGSPTSfIr~ 61 (177)
T PF12384_consen 35 AIVQLNCKGTPIKVLFDSGSPTSFIRS 61 (177)
T ss_pred EEEEEeecCcEEEEEEeCCCccceeeh
Confidence 355566668899999999998776644
No 37
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=27.44 E-value=82 Score=26.43 Aligned_cols=26 Identities=19% Similarity=0.364 Sum_probs=19.4
Q ss_pred EEeCC---CCceEEEEEeCCCCceeEecc
Q 040861 99 VQLGT---PGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 99 v~iGT---P~q~~~v~~DTGS~~~Wv~C~ 124 (199)
|+||- +.....+++|||+.++.+|-.
T Consensus 166 i~vg~~~~~~~~~~~ivDTGTt~t~lp~~ 194 (273)
T cd05475 166 LLFNGQPTGGKGLEVVFDSGSSYTYFNAQ 194 (273)
T ss_pred EEECCEECcCCCceEEEECCCceEEcCCc
Confidence 67762 234467899999999999755
No 38
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=25.97 E-value=76 Score=19.70 Aligned_cols=21 Identities=14% Similarity=0.444 Sum_probs=14.9
Q ss_pred EEeCCCCceEEEEEeCCCCceeE
Q 040861 99 VQLGTPGMKFMVALDTGSDLFWV 121 (199)
Q Consensus 99 v~iGTP~q~~~v~~DTGS~~~Wv 121 (199)
+.++ ...+.+.+|||+...-+
T Consensus 3 ~~~~--~~~~~~liDtgs~~~~~ 23 (92)
T cd00303 3 GKIN--GVPVRALVDSGASVNFI 23 (92)
T ss_pred EEEC--CEEEEEEEcCCCccccc
Confidence 3444 37889999999986543
No 39
>PF07496 zf-CW: CW-type Zinc Finger; InterPro: IPR011124 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a CW-type zinc finger motif, named for its conserved cysteine and tryptophan residues. It is predicted to be a highly specialised mononuclear four-cysteine (C4) zinc finger that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including chromatin methylation status and early embryonic development. Weak homology to members of IPR001965 from INTERPRO further evidences these predictions. The domain is found exclusively in vertebrates, vertebrate-infecting parasites and higher plants []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2RR4_A 2E61_A 2L7P_A.
Probab=25.63 E-value=34 Score=21.59 Aligned_cols=8 Identities=50% Similarity=1.917 Sum_probs=5.2
Q ss_pred eeEecc-CC
Q 040861 119 FWVPCE-CS 126 (199)
Q Consensus 119 ~Wv~C~-C~ 126 (199)
.||||. |.
T Consensus 2 ~WVQCd~C~ 10 (50)
T PF07496_consen 2 YWVQCDSCL 10 (50)
T ss_dssp EEEE-TTT-
T ss_pred eEEECCCCC
Confidence 599999 84
No 40
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=25.10 E-value=95 Score=22.58 Aligned_cols=27 Identities=37% Similarity=0.458 Sum_probs=18.6
Q ss_pred EEEEeCCC----CceEEEEEeCCCCcee-Eec
Q 040861 97 TTVQLGTP----GMKFMVALDTGSDLFW-VPC 123 (199)
Q Consensus 97 ~~v~iGTP----~q~~~v~~DTGS~~~W-v~C 123 (199)
+++.|..| ..++.+.+|||.+..- ++.
T Consensus 2 ~~v~~~~p~~~~~~~v~~LVDTGat~~~~l~~ 33 (107)
T TIGR03698 2 LDVELSNPKNPEFMEVRALVDTGFSGFLLVPP 33 (107)
T ss_pred EEEEEeCCCCCCceEEEEEEECCCCeEEecCH
Confidence 46677776 2367889999998653 444
No 41
>PF11645 PDDEXK_5: PD-(D/E)XK endonuclease; InterPro: IPR021671 This family are putative endonuclease proteins which are restricted to Synechocystis. ; PDB: 2OST_D.
Probab=22.85 E-value=65 Score=25.42 Aligned_cols=29 Identities=14% Similarity=0.369 Sum_probs=22.4
Q ss_pred EEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861 95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~ 124 (199)
|-+-+-+|- .+++.|++|++..++=|||+
T Consensus 22 Y~V~~P~gD-n~~YDLV~d~eg~L~RIQvK 50 (149)
T PF11645_consen 22 YSVSIPFGD-NLKYDLVFDKEGILWRIQVK 50 (149)
T ss_dssp -EEEEESST-TSS-SEEEEETTEEEEEEEE
T ss_pred cEEEeecCC-CCCcCEEEecCCcEEEEEEe
Confidence 445566665 68999999999999999998
No 42
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=21.59 E-value=47 Score=25.82 Aligned_cols=20 Identities=20% Similarity=0.262 Sum_probs=16.1
Q ss_pred CceEEEEEeCCCCceeEecc
Q 040861 105 GMKFMVALDTGSDLFWVPCE 124 (199)
Q Consensus 105 ~q~~~v~~DTGS~~~Wv~C~ 124 (199)
.+...++||+||+..++--.
T Consensus 10 ~~~~~~LlDsGSq~SfIt~~ 29 (164)
T PF05585_consen 10 QVEARALLDSGSQRSFITES 29 (164)
T ss_pred EEEEEEEEecCCchhHHhHH
Confidence 35678899999998887655
No 43
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=21.33 E-value=2.3e+02 Score=23.72 Aligned_cols=37 Identities=16% Similarity=0.088 Sum_probs=26.9
Q ss_pred eeee-ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEec
Q 040861 85 TLRI-SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPC 123 (199)
Q Consensus 85 ~~~~-~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C 123 (199)
.+.+ ...++-|+++..|- .+++...+|||-..+-++-
T Consensus 95 ~v~Lak~~~GHF~a~~~VN--Gk~v~fLVDTGATsVal~~ 132 (215)
T COG3577 95 EVSLAKSRDGHFEANGRVN--GKKVDFLVDTGATSVALNE 132 (215)
T ss_pred EEEEEecCCCcEEEEEEEC--CEEEEEEEecCcceeecCH
Confidence 4444 23344566888887 8999999999998877643
Done!