Query         040861
Match_columns 199
No_of_seqs    205 out of 1322
Neff          7.0 
Searched_HMMs 46136
Date          Fri Mar 29 12:34:03 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/040861.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/040861hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PLN03146 aspartyl protease fam 100.0 1.1E-31 2.4E-36  243.3  15.8  158   13-199    21-182 (431)
  2 KOG1339 Aspartyl protease [Pos  99.9 8.4E-24 1.8E-28  189.8  10.8  100   90-199    42-143 (398)
  3 cd06096 Plasmepsin_5 Plasmepsi  99.9 9.1E-22   2E-26  171.8  11.6   94   94-199     3-97  (326)
  4 PF14543 TAXi_N:  Xylanase inhi  99.9 9.6E-22 2.1E-26  156.9   8.2   87   95-199     1-91  (164)
  5 cd05490 Cathepsin_D2 Cathepsin  99.8   2E-18 4.4E-23  150.4  10.2   84   89-199     1-85  (325)
  6 cd05470 pepsin_retropepsin_lik  99.7 8.8E-18 1.9E-22  124.0   9.4   74   97-199     1-76  (109)
  7 cd05487 renin_like Renin stimu  99.7 9.2E-18   2E-22  146.5   9.7   84   89-199     3-87  (326)
  8 PTZ00147 plasmepsin-1; Provisi  99.7 1.7E-17 3.7E-22  151.6  11.5   84   85-199   129-216 (453)
  9 cd05478 pepsin_A Pepsin A, asp  99.7 1.5E-17 3.2E-22  144.5  10.5   82   89-199     5-87  (317)
 10 PTZ00165 aspartyl protease; Pr  99.7 1.9E-17   4E-22  152.4  10.5   85   89-199   115-202 (482)
 11 cd05477 gastricsin Gastricsins  99.7 3.2E-17 6.8E-22  142.5  10.3   76   93-199     2-80  (318)
 12 cd06097 Aspergillopepsin_like   99.7 2.7E-17 5.7E-22  140.4   9.3   78   95-199     1-79  (278)
 13 cd05485 Cathepsin_D_like Cathe  99.7 2.6E-17 5.7E-22  144.0   9.3   84   89-199     6-90  (329)
 14 cd06098 phytepsin Phytepsin, a  99.7 4.6E-17   1E-21  141.7  10.1   80   89-199     5-88  (317)
 15 cd05473 beta_secretase_like Be  99.7 3.5E-17 7.7E-22  144.9   9.4   74   93-199     2-76  (364)
 16 PTZ00013 plasmepsin 4 (PM4); P  99.7 9.8E-17 2.1E-21  146.5  11.4   83   86-199   129-215 (450)
 17 cd05486 Cathespin_E Cathepsin   99.7 4.1E-17 8.9E-22  141.7   8.5   74   95-199     1-77  (316)
 18 cd05488 Proteinase_A_fungi Fun  99.7 8.5E-17 1.8E-21  140.1  10.1   80   89-199     5-87  (320)
 19 cd05475 nucellin_like Nucellin  99.6 8.1E-15 1.7E-19  125.1   8.2   59   94-199     2-62  (273)
 20 cd05471 pepsin_like Pepsin-lik  99.5 1.1E-13 2.3E-18  116.7   9.6   78   95-199     1-79  (283)
 21 cd05489 xylanase_inhibitor_I_l  99.4 1.7E-13 3.8E-18  121.9   7.2   74  101-199     2-91  (362)
 22 PF00026 Asp:  Eukaryotic aspar  99.4   1E-13 2.2E-18  118.9   5.3   78   94-199     1-79  (317)
 23 cd05472 cnd41_like Chloroplast  98.9 6.5E-10 1.4E-14   95.7   3.4   32   94-125     1-33  (299)
 24 cd05476 pepsin_A_like_plant Ch  98.7 9.5E-09 2.1E-13   87.2   3.8   30   94-123     1-30  (265)
 25 cd05474 SAP_like SAPs, pepsin-  98.4 2.3E-07 4.9E-12   79.3   4.7   29   94-122     2-30  (295)
 26 cd05483 retropepsin_like_bacte  95.4   0.024 5.1E-07   39.7   4.1   29   94-124     2-30  (96)
 27 cd05484 retropepsin_like_LTR_2  92.4    0.18 3.8E-06   35.8   3.5   28   95-124     1-28  (91)
 28 TIGR02281 clan_AA_DTGA clan AA  84.9       2 4.2E-05   32.4   4.6   31   92-124     9-39  (121)
 29 PF00077 RVP:  Retroviral aspar  84.4     1.7 3.6E-05   30.9   3.9   27   96-124     7-33  (100)
 30 PF13975 gag-asp_proteas:  gag-  81.9       3 6.5E-05   28.3   4.2   30   93-124     7-36  (72)
 31 PF13650 Asp_protease_2:  Aspar  75.5     3.8 8.3E-05   27.8   3.3   25   98-124     2-26  (90)
 32 cd05479 RP_DDI RP_DDI; retrope  74.7       6 0.00013   29.7   4.4   29   94-124    16-44  (124)
 33 cd05482 HIV_retropepsin_like R  70.9     6.4 0.00014   28.2   3.5   23   98-122     2-24  (87)
 34 cd06095 RP_RTVL_H_like Retrope  64.0     9.8 0.00021   26.5   3.3   24   99-124     3-26  (86)
 35 PF09668 Asp_protease:  Asparty  44.5      20 0.00044   27.4   2.4   34   94-129    24-58  (124)
 36 PF12384 Peptidase_A2B:  Ty3 tr  29.2      60  0.0013   26.3   2.9   27   97-123    35-61  (177)
 37 cd05475 nucellin_like Nucellin  27.4      82  0.0018   26.4   3.7   26   99-124   166-194 (273)
 38 cd00303 retropepsin_like Retro  26.0      76  0.0016   19.7   2.6   21   99-121     3-23  (92)
 39 PF07496 zf-CW:  CW-type Zinc F  25.6      34 0.00074   21.6   0.8    8  119-126     2-10  (50)
 40 TIGR03698 clan_AA_DTGF clan AA  25.1      95  0.0021   22.6   3.3   27   97-123     2-33  (107)
 41 PF11645 PDDEXK_5:  PD-(D/E)XK   22.8      65  0.0014   25.4   2.0   29   95-124    22-50  (149)
 42 PF05585 DUF1758:  Putative pep  21.6      47   0.001   25.8   1.1   20  105-124    10-29  (164)
 43 COG3577 Predicted aspartyl pro  21.3 2.3E+02   0.005   23.7   5.1   37   85-123    95-132 (215)

No 1  
>PLN03146 aspartyl protease family protein; Provisional
Probab=99.98  E-value=1.1e-31  Score=243.30  Aligned_cols=158  Identities=28%  Similarity=0.440  Sum_probs=124.1

Q ss_pred             CCceeEEEEEeecCcccccccccCCCCCCCCCCCCCcHHHHHHHHHhhHHHhcccccCCCCCCcccccCCCceeeeccCC
Q 040861           13 GGRIFSFEMHHRYSDQVKNWSISSGKLSHSDWPDKGSFDYYALLAHRDQILRGRHLSDTDTNSPLIFSDGNSTLRISSLG   92 (199)
Q Consensus        13 ~~~~~~l~l~Hr~s~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~d~~rr~~~l~~~~~~~~~~~~~g~~~~~~~~~~   92 (199)
                      +.++++++|+||++++.       |+++    +..+..++++++++||++ |.+++..... ...++..+.     ...+
T Consensus        21 ~~~~~~~~l~h~~~~~s-------p~~~----~~~~~~~~~~~~~~~~~~-r~~~~~~~~~-~~~~~~~~~-----~~~~   82 (431)
T PLN03146         21 PKGGFTVDLIHRDSPKS-------PFYN----PSETPSQRLRNAFRRSIS-RVNHFRPTDA-SPNDPQSDL-----ISNG   82 (431)
T ss_pred             cCCceEEEEEeCCCCCC-------CCCC----CCCChhHHHHHHHHHHHH-HHHHHhhccc-cCCccccCc-----ccCC
Confidence            46789999999999962       2222    445567899999999998 7666643221 111222221     1234


Q ss_pred             ceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC---C
Q 040861           93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN---R  168 (199)
Q Consensus        93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~---~  168 (199)
                      .+|+++|.||||||++.|+|||||+++||||. |..|..+.         ++.|||++|+||+.++|+++.|..+.   .
T Consensus        83 ~~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~C~~C~~C~~~~---------~~~fdps~SST~~~~~C~s~~C~~~~~~~~  153 (431)
T PLN03146         83 GEYLMNISIGTPPVPILAIADTGSDLIWTQCKPCDDCYKQV---------SPLFDPKKSSTYKDVSCDSSQCQALGNQAS  153 (431)
T ss_pred             ccEEEEEEcCCCCceEEEEECCCCCcceEcCCCCcccccCC---------CCcccCCCCCCCcccCCCCcccccCCCCCC
Confidence            58999999999999999999999999999999 99998654         36999999999999999999998763   3


Q ss_pred             CCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          169 CPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       169 C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                      |.. ++.|.|.+.| +||+.+.|++++|+|+
T Consensus       154 c~~-~~~c~y~i~Y-gdgs~~~G~l~~Dtlt  182 (431)
T PLN03146        154 CSD-ENTCTYSYSY-GDGSFTKGNLAVETLT  182 (431)
T ss_pred             CCC-CCCCeeEEEe-CCCCceeeEEEEEEEE
Confidence            754 3579999999 9999999999999985


No 2  
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.90  E-value=8.4e-24  Score=189.76  Aligned_cols=100  Identities=41%  Similarity=0.721  Sum_probs=88.3

Q ss_pred             cCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC-CCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861           90 SLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS-KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN  167 (199)
Q Consensus        90 ~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~-~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~  167 (199)
                      ....+||++|.||||||.|+|+|||||+++||||. |. .|..+..         +.|||++||||+.+.|.++.|....
T Consensus        42 ~~~~~Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~~~~~---------~~f~p~~SSt~~~~~c~~~~c~~~~  112 (398)
T KOG1339|consen   42 YSSGEYYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACYSQHN---------PIFDPSASSTYKSVGCSSPRCKSLP  112 (398)
T ss_pred             ccccccEEEEecCCCCeeeEEEEeCCCCceeeccccccccccccCC---------CccCccccccccccCCCCccccccc
Confidence            33458999999999999999999999999999999 99 7976432         3599999999999999999999986


Q ss_pred             CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                      .+..+++.|.|.+.| +|++.++|+|++|+|+
T Consensus       113 ~~~~~~~~C~y~i~Y-gd~~~~~G~l~~Dtv~  143 (398)
T KOG1339|consen  113 QSCSPNSSCPYSIQY-GDGSSTSGYLATDTVT  143 (398)
T ss_pred             cCcccCCcCceEEEe-CCCCceeEEEEEEEEE
Confidence            544446899999999 9999999999999985


No 3  
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=99.87  E-value=9.1e-22  Score=171.84  Aligned_cols=94  Identities=32%  Similarity=0.559  Sum_probs=84.0

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCC
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGT  172 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~  172 (199)
                      +||++|.||||+|++.|+|||||+++||+|. |..|..+.         ++.|+|++|+|++.+.|++..|.....|.. 
T Consensus         3 ~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~---------~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~-   72 (326)
T cd06096           3 YYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHM---------EPPYNLNNSITSSILYCDCNKCCYCLSCLN-   72 (326)
T ss_pred             eEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCC---------CCCcCcccccccccccCCCccccccCcCCC-
Confidence            7999999999999999999999999999999 99997543         368999999999999999999966555643 


Q ss_pred             CCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          173 FSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       173 ~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                       +.|.|.+.| +||+.+.|.+++|+|+
T Consensus        73 -~~~~~~i~Y-~~gs~~~G~~~~D~v~   97 (326)
T cd06096          73 -NKCEYSISY-SEGSSISGFYFSDFVS   97 (326)
T ss_pred             -CcCcEEEEE-CCCCceeeEEEEEEEE
Confidence             689999999 9998899999999974


No 4  
>PF14543 TAXi_N:  Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=99.86  E-value=9.6e-22  Score=156.89  Aligned_cols=87  Identities=39%  Similarity=0.709  Sum_probs=73.0

Q ss_pred             EEEEEEeCCCCceEEEEEeCCCCceeEeccCCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC----CCC
Q 040861           95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCECSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN----RCP  170 (199)
Q Consensus        95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~----~C~  170 (199)
                      ||++|+||||+|++.|+|||||+++|+||.                 .+.|+|++|+||+.|+|+++.|....    .|.
T Consensus         1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~~C~-----------------~~~f~~~~Sst~~~v~C~s~~C~~~~~~~~~~~   63 (164)
T PF14543_consen    1 YYVSVSIGTPPQPFSLVVDTGSDLTWVQCP-----------------DPPFDPSKSSTYRPVPCSSPQCSSAPSFCPCCC   63 (164)
T ss_dssp             EEEEEECTCTTEEEEEEEETT-SSEEEET---------------------STT-TTSSBEC-BTTSHHHHHCTSSBTCCT
T ss_pred             CEEEEEeCCCCceEEEEEECCCCceEEcCC-----------------CcccCCccCCcccccCCCCcchhhcccccccCC
Confidence            889999999999999999999999999982                 26899999999999999999999653    344


Q ss_pred             CCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          171 GTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       171 ~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                      ..++.|.|.+.| +|++.+.|+|++|+|+
T Consensus        64 ~~~~~C~y~~~y-~~~s~~~G~l~~D~~~   91 (164)
T PF14543_consen   64 CSNNSCPYSQSY-GDGSSSSGFLASDTLT   91 (164)
T ss_dssp             CESSEEEEEEEE-TTTEEEEEEEEEEEEE
T ss_pred             CCcCcccceeec-CCCccccCceEEEEEE
Confidence            445789999999 8999999999999984


No 5  
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=99.76  E-value=2e-18  Score=150.37  Aligned_cols=84  Identities=30%  Similarity=0.469  Sum_probs=69.7

Q ss_pred             ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861           89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN  167 (199)
Q Consensus        89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~  167 (199)
                      ++.+.+||++|.||||+|++.|+|||||+++||+|. |..|...       |..++.|+|++|+||+.            
T Consensus         1 ~~~~~~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~-------C~~~~~y~~~~SsT~~~------------   61 (325)
T cd05490           1 NYMDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIA-------CWLHHKYNSSKSSTYVK------------   61 (325)
T ss_pred             CCcCCEEEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCcc-------ccCcCcCCcccCcceee------------
Confidence            356789999999999999999999999999999999 8632100       12246899999999986            


Q ss_pred             CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                            ..|.|.+.| ++|+ +.|.+++|+|+
T Consensus        62 ------~~~~~~i~Y-g~G~-~~G~~~~D~v~   85 (325)
T cd05490          62 ------NGTEFAIQY-GSGS-LSGYLSQDTVS   85 (325)
T ss_pred             ------CCcEEEEEE-CCcE-EEEEEeeeEEE
Confidence                  258999999 8875 78999999974


No 6  
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site 
Probab=99.74  E-value=8.8e-18  Score=124.04  Aligned_cols=74  Identities=36%  Similarity=0.609  Sum_probs=62.3

Q ss_pred             EEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccC-CCCCCCCccccccccccccCCCCCCCCCC
Q 040861           97 TTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIY-NPEVSSTSKKVTCNNLLCAHRNRCPGTFS  174 (199)
Q Consensus        97 ~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~f-dp~~SsT~~~v~C~s~~C~~~~~C~~~~~  174 (199)
                      ++|.||||+|++.|+|||||+++||+|. |..|..+.         ++.| +|++|++++.                  .
T Consensus         1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~---------~~~~~~~~~sst~~~------------------~   53 (109)
T cd05470           1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYS---------HSSYDDPSASSTYSD------------------N   53 (109)
T ss_pred             CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCccccc---------ccccCCcCCCCCCCC------------------C
Confidence            3789999999999999999999999999 98775432         2355 9999999886                  4


Q ss_pred             CCceEEEEccCCceEEEEEEEEeeC
Q 040861          175 NCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       175 ~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                      .|.|.+.| ++|+ ..|.+++|+|+
T Consensus        54 ~~~~~~~Y-~~g~-~~g~~~~D~v~   76 (109)
T cd05470          54 GCTFSITY-GTGS-LSGGLSTDTVS   76 (109)
T ss_pred             CcEEEEEe-CCCe-EEEEEEEEEEE
Confidence            69999999 8874 67999999874


No 7  
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=99.73  E-value=9.2e-18  Score=146.50  Aligned_cols=84  Identities=31%  Similarity=0.536  Sum_probs=70.8

Q ss_pred             ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861           89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN  167 (199)
Q Consensus        89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~  167 (199)
                      ++.+.+||++|+||||+|++.|+|||||+++||+|. |..|..+       +..++.|||++|+||+.            
T Consensus         3 ~~~~~~y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~-------c~~~~~y~~~~SsT~~~------------   63 (326)
T cd05487           3 NYLDTQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTA-------CVTHNLYDASDSSTYKE------------   63 (326)
T ss_pred             ccCCCeEEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchh-------hcccCcCCCCCCeeeeE------------
Confidence            456789999999999999999999999999999998 8653211       22357999999999986            


Q ss_pred             CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                            ..|.|.+.| ++|+ +.|.+++|+|+
T Consensus        64 ------~~~~~~~~Y-g~g~-~~G~~~~D~v~   87 (326)
T cd05487          64 ------NGTEFTIHY-ASGT-VKGFLSQDIVT   87 (326)
T ss_pred             ------CCEEEEEEe-CCce-EEEEEeeeEEE
Confidence                  368999999 8875 89999999974


No 8  
>PTZ00147 plasmepsin-1; Provisional
Probab=99.73  E-value=1.7e-17  Score=151.59  Aligned_cols=84  Identities=29%  Similarity=0.563  Sum_probs=72.3

Q ss_pred             eeee-ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCcccccccc
Q 040861           85 TLRI-SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNN  160 (199)
Q Consensus        85 ~~~~-~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s  160 (199)
                      .+++ +..+.+||++|.||||||++.|+|||||+++||+|. |.  .|..           ++.|||++|+||+.     
T Consensus       129 ~v~L~n~~n~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~-----------~~~yd~s~SsT~~~-----  192 (453)
T PTZ00147        129 NVELKDLANVMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCET-----------KNLYDSSKSKTYEK-----  192 (453)
T ss_pred             eeeccccCCCEEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccC-----------CCccCCccCcceEE-----
Confidence            3444 677789999999999999999999999999999999 97  3542           46899999999987     


Q ss_pred             ccccCCCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          161 LLCAHRNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       161 ~~C~~~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                                   ..|.|.+.| ++|+ +.|.+++|+|+
T Consensus       193 -------------~~~~f~i~Y-g~Gs-vsG~~~~DtVt  216 (453)
T PTZ00147        193 -------------DGTKVEMNY-VSGT-VSGFFSKDLVT  216 (453)
T ss_pred             -------------CCCEEEEEe-CCCC-EEEEEEEEEEE
Confidence                         358999999 8874 89999999984


No 9  
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=99.73  E-value=1.5e-17  Score=144.55  Aligned_cols=82  Identities=35%  Similarity=0.559  Sum_probs=70.3

Q ss_pred             ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861           89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN  167 (199)
Q Consensus        89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~  167 (199)
                      |+.+.+||++|.||||+|++.|+|||||+++||+|. |..|..+         .++.|||++|+||+.            
T Consensus         5 n~~~~~Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~---------~~~~f~~~~Sst~~~------------   63 (317)
T cd05478           5 NYLDMEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACS---------NHNRFNPRQSSTYQS------------   63 (317)
T ss_pred             cccCCEEEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCccccc---------ccCcCCCCCCcceee------------
Confidence            566789999999999999999999999999999999 9743211         246999999999987            


Q ss_pred             CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                            ..|.|.+.| ++|+ +.|.++.|+|+
T Consensus        64 ------~~~~~~~~y-g~gs-~~G~~~~D~v~   87 (317)
T cd05478          64 ------TGQPLSIQY-GTGS-MTGILGYDTVQ   87 (317)
T ss_pred             ------CCcEEEEEE-CCce-EEEEEeeeEEE
Confidence                  357999999 8876 78999999974


No 10 
>PTZ00165 aspartyl protease; Provisional
Probab=99.72  E-value=1.9e-17  Score=152.37  Aligned_cols=85  Identities=26%  Similarity=0.457  Sum_probs=70.2

Q ss_pred             ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCC--CCCCCCCCccCCCccccCCCCCCCCccccccccccccC
Q 040861           89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSK--CAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAH  165 (199)
Q Consensus        89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~--C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~  165 (199)
                      |+.+.+||++|.||||||+|.|+|||||+++||+|. |..  |.           .++.|||++||||+.+.++.     
T Consensus       115 n~~d~~Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C~-----------~~~~yd~s~SSTy~~~~~~~-----  178 (482)
T PTZ00165        115 NFHNSQYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGCA-----------PHRKFDPKKSSTYTKLKLGD-----  178 (482)
T ss_pred             cccCCeEEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCccccc-----------ccCCCCccccCCcEecCCCC-----
Confidence            788999999999999999999999999999999999 964  54           25799999999999853111     


Q ss_pred             CCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          166 RNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       166 ~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                              ....+.++| ++|+ ..|.+++|+++
T Consensus       179 --------~~~~~~i~Y-GsGs-~~G~l~~DtV~  202 (482)
T PTZ00165        179 --------ESAETYIQY-GTGE-CVLALGKDTVK  202 (482)
T ss_pred             --------ccceEEEEe-CCCc-EEEEEEEEEEE
Confidence                    122467999 8765 66999999984


No 11 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=99.72  E-value=3.2e-17  Score=142.45  Aligned_cols=76  Identities=33%  Similarity=0.614  Sum_probs=67.4

Q ss_pred             ceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCC
Q 040861           93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRC  169 (199)
Q Consensus        93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C  169 (199)
                      .+|+++|.||||||++.|+|||||+++||+|. |.  .|..           ++.|||++|+||+.              
T Consensus         2 ~~y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C~~-----------~~~f~~~~SsT~~~--------------   56 (318)
T cd05477           2 MSYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQACTN-----------HTKFNPSQSSTYST--------------   56 (318)
T ss_pred             cEEEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccccc-----------cCCCCcccCCCceE--------------
Confidence            57999999999999999999999999999999 97  4543           46999999999986              


Q ss_pred             CCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          170 PGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       170 ~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                          ..|.|.+.| +||+ ..|.+++|+|+
T Consensus        57 ----~~~~~~~~Y-g~Gs-~~G~~~~D~i~   80 (318)
T cd05477          57 ----NGETFSLQY-GSGS-LTGIFGYDTVT   80 (318)
T ss_pred             ----CCcEEEEEE-CCcE-EEEEEEeeEEE
Confidence                468999999 8876 68999999874


No 12 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=99.71  E-value=2.7e-17  Score=140.42  Aligned_cols=78  Identities=32%  Similarity=0.494  Sum_probs=68.8

Q ss_pred             EEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCCC
Q 040861           95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGTF  173 (199)
Q Consensus        95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~~  173 (199)
                      |+++|.||||+|++.|+|||||+++||+|. |..|..+.         +..|+|++|+|++.++                
T Consensus         1 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~---------~~~y~~~~Sst~~~~~----------------   55 (278)
T cd06097           1 YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGG---------HKLYDPSKSSTAKLLP----------------   55 (278)
T ss_pred             CeeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhcc---------CCcCCCccCccceecC----------------
Confidence            789999999999999999999999999999 98775332         3579999999998743                


Q ss_pred             CCCceEEEEccCCceEEEEEEEEeeC
Q 040861          174 SNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       174 ~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                       .|.|.+.| ++|+.+.|.+++|+|+
T Consensus        56 -~~~~~i~Y-~~G~~~~G~~~~D~v~   79 (278)
T cd06097          56 -GATWSISY-GDGSSASGIVYTDTVS   79 (278)
T ss_pred             -CcEEEEEe-CCCCeEEEEEEEEEEE
Confidence             68999999 9988899999999974


No 13 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=99.71  E-value=2.6e-17  Score=144.03  Aligned_cols=84  Identities=30%  Similarity=0.490  Sum_probs=70.7

Q ss_pred             ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC
Q 040861           89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN  167 (199)
Q Consensus        89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~  167 (199)
                      |+.+.+|+++|.||||+|++.|+|||||+++||+|. |..|...       +..++.|+|++|+|++.            
T Consensus         6 n~~~~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~-------c~~~~~y~~~~Sst~~~------------   66 (329)
T cd05485           6 NYMDAQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIA-------CLLHNKYDSTKSSTYKK------------   66 (329)
T ss_pred             eccCCeEEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCcc-------ccCCCeECCcCCCCeEE------------
Confidence            677889999999999999999999999999999999 8732110       11246899999999987            


Q ss_pred             CCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          168 RCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       168 ~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                            ..|.|.+.| ++|+ +.|.+++|+++
T Consensus        67 ------~~~~~~i~Y-~~g~-~~G~~~~D~v~   90 (329)
T cd05485          67 ------NGTEFAIQY-GSGS-LSGFLSTDTVS   90 (329)
T ss_pred             ------CCeEEEEEE-CCce-EEEEEecCcEE
Confidence                  368999999 8876 78999999874


No 14 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=99.71  E-value=4.6e-17  Score=141.71  Aligned_cols=80  Identities=35%  Similarity=0.540  Sum_probs=69.5

Q ss_pred             ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC---CCCCCCCCCccCCCccccCCCCCCCCcccccccccccc
Q 040861           89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS---KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCA  164 (199)
Q Consensus        89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~---~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~  164 (199)
                      |+.+.+||++|.||||+|++.|+|||||+++||+|. |.   .|..           ++.|||++|+||+.         
T Consensus         5 n~~~~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C~~-----------~~~y~~~~SsT~~~---------   64 (317)
T cd06098           5 NYLDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYF-----------HSKYKSSKSSTYKK---------   64 (317)
T ss_pred             ccCCCEEEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccccc-----------cCcCCcccCCCccc---------
Confidence            566789999999999999999999999999999999 95   5653           46899999999987         


Q ss_pred             CCCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          165 HRNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       165 ~~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                               ....+.+.| ++|+ +.|.+++|+|+
T Consensus        65 ---------~~~~~~i~Y-g~G~-~~G~~~~D~v~   88 (317)
T cd06098          65 ---------NGTSASIQY-GTGS-ISGFFSQDSVT   88 (317)
T ss_pred             ---------CCCEEEEEc-CCce-EEEEEEeeEEE
Confidence                     246889999 8775 68999999974


No 15 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=99.71  E-value=3.5e-17  Score=144.92  Aligned_cols=74  Identities=26%  Similarity=0.293  Sum_probs=65.1

Q ss_pred             ceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCC
Q 040861           93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPG  171 (199)
Q Consensus        93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~  171 (199)
                      ..||++|.||||+|+|.|+|||||+++||+|. |..|             ++.|||++|+||+.                
T Consensus         2 ~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~~~-------------~~~f~~~~SsT~~~----------------   52 (364)
T cd05473           2 QGYYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHPFI-------------HTYFHRELSSTYRD----------------   52 (364)
T ss_pred             CceEEEEEecCCCceEEEEEecCCcceEEEcCCCccc-------------cccCCchhCcCccc----------------
Confidence            47999999999999999999999999999998 7322             35899999999987                


Q ss_pred             CCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          172 TFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       172 ~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                        ..|.|++.| ++|+ +.|.+++|+|+
T Consensus        53 --~~~~~~i~Y-g~Gs-~~G~~~~D~v~   76 (364)
T cd05473          53 --LGKGVTVPY-TQGS-WEGELGTDLVS   76 (364)
T ss_pred             --CCceEEEEE-Ccce-EEEEEEEEEEE
Confidence              358999999 8875 68999999974


No 16 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=99.70  E-value=9.8e-17  Score=146.47  Aligned_cols=83  Identities=25%  Similarity=0.567  Sum_probs=72.0

Q ss_pred             eee-ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccc
Q 040861           86 LRI-SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNL  161 (199)
Q Consensus        86 ~~~-~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~  161 (199)
                      +++ ++.+.+||++|.||||+|++.|+|||||+++||+|. |.  .|..           ++.|||++|+|++.      
T Consensus       129 ~~l~d~~n~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~-----------~~~yd~s~SsT~~~------  191 (450)
T PTZ00013        129 IELDDVANIMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSI-----------KNLYDSSKSKSYEK------  191 (450)
T ss_pred             eeeeccCCCEEEEEEEECCCCeEEEEEEeCCCCceEEecccCCcccccc-----------CCCccCccCccccc------
Confidence            444 666789999999999999999999999999999999 96  4643           46899999999987      


Q ss_pred             cccCCCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          162 LCAHRNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       162 ~C~~~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                                  ..|.|.+.| ++|+ +.|.+++|+|+
T Consensus       192 ------------~~~~~~i~Y-G~Gs-v~G~~~~Dtv~  215 (450)
T PTZ00013        192 ------------DGTKVDITY-GSGT-VKGFFSKDLVT  215 (450)
T ss_pred             ------------CCcEEEEEE-CCce-EEEEEEEEEEE
Confidence                        358999999 8875 89999999974


No 17 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=99.70  E-value=4.1e-17  Score=141.74  Aligned_cols=74  Identities=31%  Similarity=0.590  Sum_probs=65.6

Q ss_pred             EEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCC
Q 040861           95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPG  171 (199)
Q Consensus        95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~  171 (199)
                      ||++|.||||+|++.|+|||||+++||+|. |.  .|..           ++.|||++|+||+.                
T Consensus         1 Y~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C~~-----------~~~y~~~~SsT~~~----------------   53 (316)
T cd05486           1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQACTK-----------HNRFQPSESSTYVS----------------   53 (316)
T ss_pred             CeEEEEECCCCcEEEEEEcCCCccEEEecCCCCCcccCc-----------cceECCCCCccccc----------------
Confidence            789999999999999999999999999999 86  4643           46899999999987                


Q ss_pred             CCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          172 TFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       172 ~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                        ..|.|.+.| ++|+ +.|.+++|+|+
T Consensus        54 --~~~~~~i~Y-g~g~-~~G~~~~D~v~   77 (316)
T cd05486          54 --NGEAFSIQY-GTGS-LTGIIGIDQVT   77 (316)
T ss_pred             --CCcEEEEEe-CCcE-EEEEeeecEEE
Confidence              368999999 8875 79999999974


No 18 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=99.69  E-value=8.5e-17  Score=140.11  Aligned_cols=80  Identities=34%  Similarity=0.590  Sum_probs=70.1

Q ss_pred             ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEecc-CC--CCCCCCCCCccCCCccccCCCCCCCCccccccccccccC
Q 040861           89 SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CS--KCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAH  165 (199)
Q Consensus        89 ~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~--~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~  165 (199)
                      ++.+.+||++|.||||+|++.|+|||||+++||+|. |.  .|..           ++.|+|++|+|++.          
T Consensus         5 n~~~~~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C~~-----------~~~y~~~~Sst~~~----------   63 (320)
T cd05488           5 NYLNAQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIACFL-----------HSKYDSSASSTYKA----------   63 (320)
T ss_pred             ccCCCEEEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCcccCC-----------cceECCCCCcceee----------
Confidence            556789999999999999999999999999999999 96  4643           35899999999986          


Q ss_pred             CCCCCCCCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          166 RNRCPGTFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       166 ~~~C~~~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                              +.|.|.+.| ++|+ +.|.+++|+|+
T Consensus        64 --------~~~~~~~~y-~~g~-~~G~~~~D~v~   87 (320)
T cd05488          64 --------NGTEFKIQY-GSGS-LEGFVSQDTLS   87 (320)
T ss_pred             --------CCCEEEEEE-CCce-EEEEEEEeEEE
Confidence                    468999999 8876 79999999874


No 19 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=99.56  E-value=8.1e-15  Score=125.13  Aligned_cols=59  Identities=37%  Similarity=0.947  Sum_probs=52.3

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEecc--CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCC
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE--CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPG  171 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~--C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~  171 (199)
                      +||++|.||||||++.|+|||||+++||+|.  |..|                                           
T Consensus         2 ~Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c-------------------------------------------   38 (273)
T cd05475           2 YYYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGC-------------------------------------------   38 (273)
T ss_pred             ceEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCC-------------------------------------------
Confidence            6899999999999999999999999999984  4322                                           


Q ss_pred             CCCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          172 TFSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       172 ~~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                         .|.|.+.| +|++.+.|.+++|+|+
T Consensus        39 ---~c~~~i~Y-gd~~~~~G~~~~D~v~   62 (273)
T cd05475          39 ---QCDYEIEY-ADGGSSMGVLVTDIFS   62 (273)
T ss_pred             ---cCccEeEe-CCCCceEEEEEEEEEE
Confidence               37899999 9889999999999974


No 20 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=99.50  E-value=1.1e-13  Score=116.67  Aligned_cols=78  Identities=37%  Similarity=0.665  Sum_probs=62.5

Q ss_pred             EEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCCC
Q 040861           95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGTF  173 (199)
Q Consensus        95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~~  173 (199)
                      |+++|.||||+|++.|+|||||+++||+|. |..|..+...       ...|++..|+++..                  
T Consensus         1 Y~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~-------~~~~~~~~s~~~~~------------------   55 (283)
T cd05471           1 YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHP-------RFKYDSSKSSTYKD------------------   55 (283)
T ss_pred             CEEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCC-------CCccCccCCceeec------------------
Confidence            679999999999999999999999999999 9877543211       11377777766654                  


Q ss_pred             CCCceEEEEccCCceEEEEEEEEeeC
Q 040861          174 SNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       174 ~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                      ..|.|.+.| +++ ...|.+++|+|+
T Consensus        56 ~~~~~~~~Y-~~g-~~~g~~~~D~v~   79 (283)
T cd05471          56 TGCTFSITY-GDG-SVTGGLGTDTVT   79 (283)
T ss_pred             CCCEEEEEE-CCC-eEEEEEEEeEEE
Confidence            579999999 874 678999999874


No 21 
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability 
Probab=99.44  E-value=1.7e-13  Score=121.93  Aligned_cols=74  Identities=24%  Similarity=0.415  Sum_probs=61.1

Q ss_pred             eCCCCce-EEEEEeCCCCceeEeccCCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCC------------
Q 040861          101 LGTPGMK-FMVALDTGSDLFWVPCECSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRN------------  167 (199)
Q Consensus       101 iGTP~q~-~~v~~DTGS~~~Wv~C~C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~------------  167 (199)
                      +|||-.+ +.|+|||||+++||||.                      |.+|+||+.++|+++.|+.+.            
T Consensus         2 ~~~~~~~~~~~~~DTGS~l~WvqC~----------------------~~~sst~~~~~C~s~~C~~~~~~~~~~~~~~~~   59 (362)
T cd05489           2 TITPLKGAVPLVLDLAGPLLWSTCD----------------------AGHSSTYQTVPCSSSVCSLANRYHCPGTCGGAP   59 (362)
T ss_pred             cccCccCCeeEEEECCCCceeeeCC----------------------CCCcCCCCccCcCChhhccccccCCCccccCCC
Confidence            5788878 99999999999999996                      136789999999999998641            


Q ss_pred             --CCCCCCCCCceEEE-EccCCceEEEEEEEEeeC
Q 040861          168 --RCPGTFSNCPYSVS-YVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       168 --~C~~~~~~C~y~~~-Y~gdgs~~~G~l~~Dtlt  199 (199)
                        .|.  ++.|.|... | ++|+.+.|+|++|+|+
T Consensus        60 ~~~c~--~~~C~y~~~~y-~~gs~t~G~l~~Dtl~   91 (362)
T cd05489          60 GPGCG--NNTCTAHPYNP-VTGECATGDLTQDVLS   91 (362)
T ss_pred             CCCCC--CCcCeeEcccc-ccCcEeeEEEEEEEEE
Confidence              343  246989765 7 7889999999999985


No 22 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=99.44  E-value=1e-13  Score=118.93  Aligned_cols=78  Identities=38%  Similarity=0.630  Sum_probs=64.9

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCCCCCCCCccCCCccccCCCCCCCCccccccccccccCCCCCCCC
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCAPTQGSAYASDFELSIYNPEVSSTSKKVTCNNLLCAHRNRCPGT  172 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~~~~~~~~~~~~~~~~fdp~~SsT~~~v~C~s~~C~~~~~C~~~  172 (199)
                      +|+++|.||||+|++.|++||||+.+||++. |..|..        +.....|++.+|+|++..                
T Consensus         1 ~Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~--------~~~~~~y~~~~S~t~~~~----------------   56 (317)
T PF00026_consen    1 QYYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSS--------CASSGFYNPSKSSTFSNQ----------------   56 (317)
T ss_dssp             EEEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTH--------HCTSC-BBGGGSTTEEEE----------------
T ss_pred             CeEEEEEECCCCeEEEEEEecccceeeeceeccccccc--------cccccccccccccccccc----------------
Confidence            5899999999999999999999999999998 876510        112368999999999873                


Q ss_pred             CCCCceEEEEccCCceEEEEEEEEeeC
Q 040861          173 FSNCPYSVSYVSAQTSTSGILVEMFFT  199 (199)
Q Consensus       173 ~~~C~y~~~Y~gdgs~~~G~l~~Dtlt  199 (199)
                        .+.|.+.| ++|+ ..|.+++|+|+
T Consensus        57 --~~~~~~~y-~~g~-~~G~~~~D~v~   79 (317)
T PF00026_consen   57 --GKPFSISY-GDGS-VSGNLVSDTVS   79 (317)
T ss_dssp             --EEEEEEEE-TTEE-EEEEEEEEEEE
T ss_pred             --eeeeeeec-cCcc-cccccccceEe
Confidence              47899999 8877 99999999974


No 23 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=98.92  E-value=6.5e-10  Score=95.70  Aligned_cols=32  Identities=53%  Similarity=0.881  Sum_probs=30.9

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEecc-C
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-C  125 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C  125 (199)
                      +|+++|.||||||++.|+|||||+++||+|. |
T Consensus         1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~c~~c   33 (299)
T cd05472           1 EYVVTVGLGTPARDQTVIVDTGSDLTWVQCQPC   33 (299)
T ss_pred             CeEEEEecCCCCcceEEEecCCCCcccccCCCC
Confidence            4899999999999999999999999999998 7


No 24 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=98.72  E-value=9.5e-09  Score=87.18  Aligned_cols=30  Identities=43%  Similarity=0.802  Sum_probs=28.9

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEec
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPC  123 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C  123 (199)
                      +|+++|.||||+|++.|+|||||+.+||+|
T Consensus         1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~   30 (265)
T cd05476           1 EYLVTLSIGTPPQPFSLIVDTGSDLTWTQC   30 (265)
T ss_pred             CeEEEEecCCCCcceEEEecCCCCCEEEcC
Confidence            489999999999999999999999999998


No 25 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=98.43  E-value=2.3e-07  Score=79.27  Aligned_cols=29  Identities=45%  Similarity=0.723  Sum_probs=28.1

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEe
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVP  122 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~  122 (199)
                      .|+++|.||||+|++.|+|||||+++||+
T Consensus         2 ~Y~~~i~iGtp~q~~~v~~DTgS~~~wv~   30 (295)
T cd05474           2 YYSAELSVGTPPQKVTVLLDTGSSDLWVP   30 (295)
T ss_pred             eEEEEEEECCCCcEEEEEEeCCCCcceee
Confidence            68899999999999999999999999998


No 26 
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=95.44  E-value=0.024  Score=39.73  Aligned_cols=29  Identities=24%  Similarity=0.269  Sum_probs=26.1

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      .|++++.|+  .+++.+++|||++.+|+.-.
T Consensus         2 ~~~v~v~i~--~~~~~~llDTGa~~s~i~~~   30 (96)
T cd05483           2 HFVVPVTIN--GQPVRFLLDTGASTTVISEE   30 (96)
T ss_pred             cEEEEEEEC--CEEEEEEEECCCCcEEcCHH
Confidence            478999999  79999999999999999765


No 27 
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=92.36  E-value=0.18  Score=35.76  Aligned_cols=28  Identities=25%  Similarity=0.336  Sum_probs=24.4

Q ss_pred             EEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      +|+++.|+  .+++.+.+||||+..++.-+
T Consensus         1 ~~~~~~In--g~~i~~lvDTGA~~svis~~   28 (91)
T cd05484           1 KTVTLLVN--GKPLKFQLDTGSAITVISEK   28 (91)
T ss_pred             CEEEEEEC--CEEEEEEEcCCcceEEeCHH
Confidence            46888898  79999999999999998665


No 28 
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=84.86  E-value=2  Score=32.40  Aligned_cols=31  Identities=23%  Similarity=0.215  Sum_probs=26.0

Q ss_pred             CceEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           92 GFLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        92 ~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      +..|++++.|.  .+++.+++|||++.+-+.-.
T Consensus         9 ~g~~~v~~~In--G~~~~flVDTGAs~t~is~~   39 (121)
T TIGR02281         9 DGHFYATGRVN--GRNVRFLVDTGATSVALNEE   39 (121)
T ss_pred             CCeEEEEEEEC--CEEEEEEEECCCCcEEcCHH
Confidence            45788999997  68999999999998877654


No 29 
>PF00077 RVP:  Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026;  InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=84.36  E-value=1.7  Score=30.88  Aligned_cols=27  Identities=37%  Similarity=0.583  Sum_probs=22.2

Q ss_pred             EEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           96 YTTVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        96 ~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      +.+|.+.  .+++.+++|||++..-++-.
T Consensus         7 ~i~v~i~--g~~i~~LlDTGA~vsiI~~~   33 (100)
T PF00077_consen    7 YITVKIN--GKKIKALLDTGADVSIISEK   33 (100)
T ss_dssp             EEEEEET--TEEEEEEEETTBSSEEESSG
T ss_pred             eEEEeEC--CEEEEEEEecCCCcceeccc
Confidence            4577777  67999999999999888654


No 30 
>PF13975 gag-asp_proteas:  gag-polyprotein putative aspartyl protease
Probab=81.88  E-value=3  Score=28.30  Aligned_cols=30  Identities=27%  Similarity=0.464  Sum_probs=26.0

Q ss_pred             ceEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           93 FLHYTTVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        93 ~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      ..+++.+.||  .+.+.+.+|||++..-++-+
T Consensus         7 g~~~v~~~I~--g~~~~alvDtGat~~fis~~   36 (72)
T PF13975_consen    7 GLMYVPVSIG--GVQVKALVDTGATHNFISES   36 (72)
T ss_pred             CEEEEEEEEC--CEEEEEEEeCCCcceecCHH
Confidence            4788999999  69999999999998877665


No 31 
>PF13650 Asp_protease_2:  Aspartyl protease
Probab=75.50  E-value=3.8  Score=27.84  Aligned_cols=25  Identities=20%  Similarity=0.351  Sum_probs=19.6

Q ss_pred             EEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           98 TVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        98 ~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      ++.|+  .+++.+++|||++...+.-.
T Consensus         2 ~v~vn--g~~~~~liDTGa~~~~i~~~   26 (90)
T PF13650_consen    2 PVKVN--GKPVRFLIDTGASISVISRS   26 (90)
T ss_pred             EEEEC--CEEEEEEEcCCCCcEEECHH
Confidence            45666  68999999999998777544


No 32 
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=74.71  E-value=6  Score=29.67  Aligned_cols=29  Identities=17%  Similarity=0.410  Sum_probs=25.3

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      .+|+++.|+  .+++.+++|||+....+.-.
T Consensus        16 ~~~v~~~In--g~~~~~LvDTGAs~s~Is~~   44 (124)
T cd05479          16 MLYINVEIN--GVPVKAFVDSGAQMTIMSKA   44 (124)
T ss_pred             EEEEEEEEC--CEEEEEEEeCCCceEEeCHH
Confidence            567899998  78999999999999988765


No 33 
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=70.94  E-value=6.4  Score=28.16  Aligned_cols=23  Identities=35%  Similarity=0.488  Sum_probs=19.1

Q ss_pred             EEEeCCCCceEEEEEeCCCCceeEe
Q 040861           98 TVQLGTPGMKFMVALDTGSDLFWVP  122 (199)
Q Consensus        98 ~v~iGTP~q~~~v~~DTGS~~~Wv~  122 (199)
                      ++.|+  .|.+.+.+|||.|.+-++
T Consensus         2 ~~~i~--g~~~~~llDTGAd~Tvi~   24 (87)
T cd05482           2 TLYIN--GKLFEGLLDTGADVSIIA   24 (87)
T ss_pred             EEEEC--CEEEEEEEccCCCCeEEc
Confidence            45566  799999999999998774


No 34 
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where 
Probab=64.02  E-value=9.8  Score=26.51  Aligned_cols=24  Identities=17%  Similarity=0.371  Sum_probs=19.0

Q ss_pred             EEeCCCCceEEEEEeCCCCceeEecc
Q 040861           99 VQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        99 v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      +.|.  .+++.+++|||.+.+-+.-.
T Consensus         3 v~In--G~~~~fLvDTGA~~tii~~~   26 (86)
T cd06095           3 ITVE--GVPIVFLVDTGATHSVLKSD   26 (86)
T ss_pred             EEEC--CEEEEEEEECCCCeEEECHH
Confidence            4454  68899999999999888554


No 35 
>PF09668 Asp_protease:  Aspartyl protease;  InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure.  This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=44.55  E-value=20  Score=27.36  Aligned_cols=34  Identities=18%  Similarity=0.430  Sum_probs=24.3

Q ss_pred             eEEEEEEeCCCCceEEEEEeCCCCceeEecc-CCCCC
Q 040861           94 LHYTTVQLGTPGMKFMVALDTGSDLFWVPCE-CSKCA  129 (199)
Q Consensus        94 ~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~-C~~C~  129 (199)
                      ..|+++.|+  .+++.+.+|||...+-+.-+ +..|.
T Consensus        24 mLyI~~~in--g~~vkA~VDtGAQ~tims~~~a~r~g   58 (124)
T PF09668_consen   24 MLYINCKIN--GVPVKAFVDTGAQSTIMSKSCAERCG   58 (124)
T ss_dssp             --EEEEEET--TEEEEEEEETT-SS-EEEHHHHHHTT
T ss_pred             eEEEEEEEC--CEEEEEEEeCCCCccccCHHHHHHcC
Confidence            577899999  79999999999988877655 34553


No 36 
>PF12384 Peptidase_A2B:  Ty3 transposon peptidase;  InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=29.16  E-value=60  Score=26.28  Aligned_cols=27  Identities=30%  Similarity=0.429  Sum_probs=20.6

Q ss_pred             EEEEeCCCCceEEEEEeCCCCceeEec
Q 040861           97 TTVQLGTPGMKFMVALDTGSDLFWVPC  123 (199)
Q Consensus        97 ~~v~iGTP~q~~~v~~DTGS~~~Wv~C  123 (199)
                      ..+.+++-..++.+.|||||-...+.-
T Consensus        35 ~~v~l~~~~t~i~vLfDSGSPTSfIr~   61 (177)
T PF12384_consen   35 AIVQLNCKGTPIKVLFDSGSPTSFIRS   61 (177)
T ss_pred             EEEEEeecCcEEEEEEeCCCccceeeh
Confidence            355566668899999999998776644


No 37 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=27.44  E-value=82  Score=26.43  Aligned_cols=26  Identities=19%  Similarity=0.364  Sum_probs=19.4

Q ss_pred             EEeCC---CCceEEEEEeCCCCceeEecc
Q 040861           99 VQLGT---PGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        99 v~iGT---P~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      |+||-   +.....+++|||+.++.+|-.
T Consensus       166 i~vg~~~~~~~~~~~ivDTGTt~t~lp~~  194 (273)
T cd05475         166 LLFNGQPTGGKGLEVVFDSGSSYTYFNAQ  194 (273)
T ss_pred             EEECCEECcCCCceEEEECCCceEEcCCc
Confidence            67762   234467899999999999755


No 38 
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=25.97  E-value=76  Score=19.70  Aligned_cols=21  Identities=14%  Similarity=0.444  Sum_probs=14.9

Q ss_pred             EEeCCCCceEEEEEeCCCCceeE
Q 040861           99 VQLGTPGMKFMVALDTGSDLFWV  121 (199)
Q Consensus        99 v~iGTP~q~~~v~~DTGS~~~Wv  121 (199)
                      +.++  ...+.+.+|||+...-+
T Consensus         3 ~~~~--~~~~~~liDtgs~~~~~   23 (92)
T cd00303           3 GKIN--GVPVRALVDSGASVNFI   23 (92)
T ss_pred             EEEC--CEEEEEEEcCCCccccc
Confidence            3444  37889999999986543


No 39 
>PF07496 zf-CW:  CW-type Zinc Finger;  InterPro: IPR011124 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a CW-type zinc finger motif, named for its conserved cysteine and tryptophan residues. It is predicted to be a highly specialised mononuclear four-cysteine (C4) zinc finger that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including chromatin methylation status and early embryonic development. Weak homology to members of IPR001965 from INTERPRO further evidences these predictions. The domain is found exclusively in vertebrates, vertebrate-infecting parasites and higher plants [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2RR4_A 2E61_A 2L7P_A.
Probab=25.63  E-value=34  Score=21.59  Aligned_cols=8  Identities=50%  Similarity=1.917  Sum_probs=5.2

Q ss_pred             eeEecc-CC
Q 040861          119 FWVPCE-CS  126 (199)
Q Consensus       119 ~Wv~C~-C~  126 (199)
                      .||||. |.
T Consensus         2 ~WVQCd~C~   10 (50)
T PF07496_consen    2 YWVQCDSCL   10 (50)
T ss_dssp             EEEE-TTT-
T ss_pred             eEEECCCCC
Confidence            599999 84


No 40 
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=25.10  E-value=95  Score=22.58  Aligned_cols=27  Identities=37%  Similarity=0.458  Sum_probs=18.6

Q ss_pred             EEEEeCCC----CceEEEEEeCCCCcee-Eec
Q 040861           97 TTVQLGTP----GMKFMVALDTGSDLFW-VPC  123 (199)
Q Consensus        97 ~~v~iGTP----~q~~~v~~DTGS~~~W-v~C  123 (199)
                      +++.|..|    ..++.+.+|||.+..- ++.
T Consensus         2 ~~v~~~~p~~~~~~~v~~LVDTGat~~~~l~~   33 (107)
T TIGR03698         2 LDVELSNPKNPEFMEVRALVDTGFSGFLLVPP   33 (107)
T ss_pred             EEEEEeCCCCCCceEEEEEEECCCCeEEecCH
Confidence            46677776    2367889999998653 444


No 41 
>PF11645 PDDEXK_5:  PD-(D/E)XK endonuclease;  InterPro: IPR021671  This family are putative endonuclease proteins which are restricted to Synechocystis. ; PDB: 2OST_D.
Probab=22.85  E-value=65  Score=25.42  Aligned_cols=29  Identities=14%  Similarity=0.369  Sum_probs=22.4

Q ss_pred             EEEEEEeCCCCceEEEEEeCCCCceeEecc
Q 040861           95 HYTTVQLGTPGMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus        95 y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      |-+-+-+|- .+++.|++|++..++=|||+
T Consensus        22 Y~V~~P~gD-n~~YDLV~d~eg~L~RIQvK   50 (149)
T PF11645_consen   22 YSVSIPFGD-NLKYDLVFDKEGILWRIQVK   50 (149)
T ss_dssp             -EEEEESST-TSS-SEEEEETTEEEEEEEE
T ss_pred             cEEEeecCC-CCCcCEEEecCCcEEEEEEe
Confidence            445566665 68999999999999999998


No 42 
>PF05585 DUF1758:  Putative peptidase (DUF1758);  InterPro: IPR008737  This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases. 
Probab=21.59  E-value=47  Score=25.82  Aligned_cols=20  Identities=20%  Similarity=0.262  Sum_probs=16.1

Q ss_pred             CceEEEEEeCCCCceeEecc
Q 040861          105 GMKFMVALDTGSDLFWVPCE  124 (199)
Q Consensus       105 ~q~~~v~~DTGS~~~Wv~C~  124 (199)
                      .+...++||+||+..++--.
T Consensus        10 ~~~~~~LlDsGSq~SfIt~~   29 (164)
T PF05585_consen   10 QVEARALLDSGSQRSFITES   29 (164)
T ss_pred             EEEEEEEEecCCchhHHhHH
Confidence            35678899999998887655


No 43 
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=21.33  E-value=2.3e+02  Score=23.72  Aligned_cols=37  Identities=16%  Similarity=0.088  Sum_probs=26.9

Q ss_pred             eeee-ccCCceEEEEEEeCCCCceEEEEEeCCCCceeEec
Q 040861           85 TLRI-SSLGFLHYTTVQLGTPGMKFMVALDTGSDLFWVPC  123 (199)
Q Consensus        85 ~~~~-~~~~~~y~~~v~iGTP~q~~~v~~DTGS~~~Wv~C  123 (199)
                      .+.+ ...++-|+++..|-  .+++...+|||-..+-++-
T Consensus        95 ~v~Lak~~~GHF~a~~~VN--Gk~v~fLVDTGATsVal~~  132 (215)
T COG3577          95 EVSLAKSRDGHFEANGRVN--GKKVDFLVDTGATSVALNE  132 (215)
T ss_pred             EEEEEecCCCcEEEEEEEC--CEEEEEEEecCcceeecCH
Confidence            4444 23344566888887  8999999999998877643


Done!