Query         037565
Match_columns 218
No_of_seqs    194 out of 1625
Neff          7.7 
Searched_HMMs 46136
Date          Fri Mar 29 13:34:32 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/037565.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/037565hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PLN03146 aspartyl protease fam 100.0 7.6E-42 1.7E-46  310.0  20.6  205    9-216     4-208 (431)
  2 KOG1339 Aspartyl protease [Pos 100.0 6.6E-28 1.4E-32  216.9  13.9  126   84-215    40-166 (398)
  3 PF14543 TAXi_N:  Xylanase inhi  99.9 7.3E-27 1.6E-31  186.3  11.9  117   91-217     1-118 (164)
  4 cd06096 Plasmepsin_5 Plasmepsi  99.9 5.7E-26 1.2E-30  199.1  13.6  123   89-216     2-125 (326)
  5 PTZ00165 aspartyl protease; Pr  99.9 1.7E-24 3.7E-29  198.6  16.0  111   79-214   109-221 (482)
  6 cd05470 pepsin_retropepsin_lik  99.9 5.2E-24 1.1E-28  157.9  12.3   95   93-215     1-96  (109)
  7 cd05490 Cathepsin_D2 Cathepsin  99.9 4.6E-23   1E-27  180.4  13.1   99   86-214     2-104 (325)
  8 cd05478 pepsin_A Pepsin A, asp  99.9 1.5E-22 3.2E-27  176.7  13.9  105   83-215     3-107 (317)
  9 cd05477 gastricsin Gastricsins  99.9 5.2E-22 1.1E-26  173.2  13.1   97   88-214     1-99  (318)
 10 cd06097 Aspergillopepsin_like   99.9 5.7E-22 1.2E-26  169.9  12.6   98   91-214     1-98  (278)
 11 cd06098 phytepsin Phytepsin, a  99.9 9.8E-22 2.1E-26  171.7  13.3  102   83-214     3-107 (317)
 12 cd05485 Cathepsin_D_like Cathe  99.9 7.9E-22 1.7E-26  173.2  12.6  103   82-214     3-109 (329)
 13 cd05486 Cathespin_E Cathepsin   99.9 1.1E-21 2.3E-26  171.2  11.4   94   91-214     1-96  (316)
 14 cd05488 Proteinase_A_fungi Fun  99.9 4.5E-21 9.8E-26  167.6  13.4  101   84-214     4-106 (320)
 15 PTZ00147 plasmepsin-1; Provisi  99.9 9.3E-21   2E-25  172.8  15.8  102   81-213   130-233 (453)
 16 PTZ00013 plasmepsin 4 (PM4); P  99.8 5.3E-20 1.2E-24  167.7  15.3  102   80-212   128-231 (450)
 17 cd05487 renin_like Renin stimu  99.8 3.1E-20 6.7E-25  162.8  12.7   98   85-213     3-104 (326)
 18 cd05475 nucellin_like Nucellin  99.8 3.7E-20 8.1E-25  158.6  11.5   84   89-214     1-85  (273)
 19 cd05473 beta_secretase_like Be  99.8 3.9E-19 8.5E-24  158.0  12.1   78   89-192     2-79  (364)
 20 cd05471 pepsin_like Pepsin-lik  99.8 1.5E-18 3.1E-23  147.6  12.7   96   91-214     1-98  (283)
 21 cd05472 cnd41_like Chloroplast  99.8 4.6E-19   1E-23  153.3   9.5   78   90-216     1-78  (299)
 22 cd05489 xylanase_inhibitor_I_l  99.7 4.9E-18 1.1E-22  151.3  10.4  101   97-213     2-117 (362)
 23 PF00026 Asp:  Eukaryotic aspar  99.7 2.6E-17 5.6E-22  142.3   8.5   96   90-213     1-97  (317)
 24 cd05476 pepsin_A_like_plant Ch  99.7 1.8E-16 3.8E-21  135.1   9.3   74   90-214     1-74  (265)
 25 cd05474 SAP_like SAPs, pepsin-  99.5 5.7E-14 1.2E-18  120.8  10.3   68   90-212     2-69  (295)
 26 cd05483 retropepsin_like_bacte  97.0  0.0025 5.4E-08   45.0   6.5   79   89-209     1-79  (96)
 27 TIGR02281 clan_AA_DTGA clan AA  92.8     1.1 2.3E-05   33.7   8.5   84   87-211     8-91  (121)
 28 cd05484 retropepsin_like_LTR_2  91.5    0.26 5.6E-06   34.8   3.5   27   91-119     1-27  (91)
 29 PF13650 Asp_protease_2:  Aspar  88.0     3.8 8.2E-05   27.9   7.2   23   94-118     2-24  (90)
 30 PF13975 gag-asp_proteas:  gag-  86.9     1.4   3E-05   29.9   4.3   32   87-120     5-36  (72)
 31 PF00077 RVP:  Retroviral aspar  80.6     2.9 6.2E-05   29.6   3.9   27   92-120     7-33  (100)
 32 cd05479 RP_DDI RP_DDI; retrope  77.8     4.3 9.4E-05   30.4   4.3   31   87-119    13-43  (124)
 33 COG3577 Predicted aspartyl pro  74.4      13 0.00028   30.8   6.4   72   86-192   101-172 (215)
 34 cd05482 HIV_retropepsin_like R  71.5     6.2 0.00014   28.0   3.5   24   94-119     2-25  (87)
 35 cd06095 RP_RTVL_H_like Retrope  65.1     9.1  0.0002   26.6   3.3   23   95-119     3-25  (86)
 36 PF12384 Peptidase_A2B:  Ty3 tr  48.5      26 0.00056   28.1   3.6   28   90-119    34-61  (177)
 37 PF09668 Asp_protease:  Asparty  35.3      78  0.0017   24.0   4.3   30   88-119    22-51  (124)
 38 cd05475 nucellin_like Nucellin  34.6      43 0.00093   28.2   3.2   32   89-120   157-194 (273)
 39 TIGR03698 clan_AA_DTGF clan AA  33.6      58  0.0013   23.6   3.3   24   93-116     2-29  (107)
 40 PF05585 DUF1758:  Putative pep  29.5      31 0.00068   26.8   1.4   19  101-119    10-28  (164)
 41 cd06097 Aspergillopepsin_like   28.1      44 0.00096   28.2   2.2   33   88-120   176-215 (278)
 42 PF11645 PDDEXK_5:  PD-(D/E)XK   27.1      41 0.00089   26.2   1.6   31   90-121    21-51  (149)
 43 cd00303 retropepsin_like Retro  27.0      73  0.0016   19.9   2.8   20   95-116     3-22  (92)
 44 PF07172 GRP:  Glycine rich pro  26.4      33 0.00072   24.7   0.9   21    1-22      1-21  (95)
 45 cd06098 phytepsin Phytepsin, a  26.3      54  0.0012   28.3   2.4   31   89-119   188-226 (317)
 46 cd06096 Plasmepsin_5 Plasmepsi  25.9      53  0.0012   28.5   2.4   31   89-119   208-247 (326)
 47 cd05472 cnd41_like Chloroplast  24.6      56  0.0012   27.8   2.2   31   89-119   146-187 (299)
 48 cd05471 pepsin_like Pepsin-lik  24.6      47   0.001   27.5   1.7   34   88-121   179-220 (283)
 49 PLN03146 aspartyl protease fam  23.6      90  0.0019   28.5   3.5   16  105-120   309-324 (431)
 50 PF03419 Peptidase_U4:  Sporula  23.1 1.2E+02  0.0025   26.2   3.9   28   87-114   154-181 (293)
 51 cd06094 RP_Saci_like RP_Saci_l  22.5 2.9E+02  0.0064   19.6   6.2   20  102-121     8-27  (89)
 52 TIGR02854 spore_II_GA sigma-E   20.6 1.3E+02  0.0029   25.9   3.8   28   87-114   155-182 (288)
 53 PF07496 zf-CW:  CW-type Zinc F  20.5      54  0.0012   20.4   1.0    9  115-123     2-10  (50)
 54 PF00026 Asp:  Eukaryotic aspar  20.1      96  0.0021   26.2   2.8   28   93-120   185-216 (317)
 55 PF01705 CX:  CX module;  Inter  20.1      84  0.0018   20.5   1.9   36  165-212    15-50  (61)

No 1  
>PLN03146 aspartyl protease family protein; Provisional
Probab=100.00  E-value=7.6e-42  Score=309.97  Aligned_cols=205  Identities=51%  Similarity=0.929  Sum_probs=172.7

Q ss_pred             HHHHHHHHhccccceecCCceeEEEEecCCCCCCCCCCCCCCchHHHHHHHhhhhhhhccccCCCcCCCCCcccceecCC
Q 037565            9 AICITATLQFVFPFVESNNGCVTADLIHRDSPLSPLYSPSETNSERFHKAYLRSISRLNNHFNPKAASPEALQAPMVSSG   88 (218)
Q Consensus         9 ~~~~~~~~~~~~~~~~~~~~~~~l~L~h~~s~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~   88 (218)
                      ++++++.-++.++..+++.++++++|+|++++++|++.++....++++++++++.+|.+ ++.++......+..++...+
T Consensus         4 ~~~~~~~~~~~~~~~~~~~~~~~~~l~h~~~~~sp~~~~~~~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~~~~~~~~   82 (431)
T PLN03146          4 LLALCLFSFSELSAAEAPKGGFTVDLIHRDSPKSPFYNPSETPSQRLRNAFRRSISRVN-HFRPTDASPNDPQSDLISNG   82 (431)
T ss_pred             hHHHHHHHHhhhhhccccCCceEEEEEeCCCCCCCCCCCCCChhHHHHHHHHHHHHHHH-HHhhccccCCccccCcccCC
Confidence            44455544565666677888999999999999999988877778999999999999988 77543222234455566778


Q ss_pred             ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565           89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ  168 (218)
Q Consensus        89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~  168 (218)
                      ++|+++|.||||||++.|++||||+++||||.+|..|+.|.++.|||++|+||+.++|.++.|..++....|... +.|.
T Consensus        83 ~~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~C~~C~~C~~~~~~~fdps~SST~~~~~C~s~~C~~~~~~~~c~~~-~~c~  161 (431)
T PLN03146         83 GEYLMNISIGTPPVPILAIADTGSDLIWTQCKPCDDCYKQVSPLFDPKKSSTYKDVSCDSSQCQALGNQASCSDE-NTCT  161 (431)
T ss_pred             ccEEEEEEcCCCCceEEEEECCCCCcceEcCCCCcccccCCCCcccCCCCCCCcccCCCCcccccCCCCCCCCCC-CCCe
Confidence            999999999999999999999999999999999999998889999999999999999999999887766567654 4699


Q ss_pred             eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCCC
Q 037565          169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGTF  216 (218)
Q Consensus       169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~f  216 (218)
                      |.+.|+||+.+.|++++|+|+|++..+. .+.++++.|||++.+.|.|
T Consensus       162 y~i~Ygdgs~~~G~l~~Dtltlg~~~~~-~~~v~~~~FGc~~~~~g~f  208 (431)
T PLN03146        162 YSYSYGDGSFTKGNLAVETLTIGSTSGR-PVSFPGIVFGCGHNNGGTF  208 (431)
T ss_pred             eEEEeCCCCceeeEEEEEEEEeccCCCC-cceeCCEEEeCCCCCCCCc
Confidence            9999999998889999999999875332 2578999999999887766


No 2  
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.95  E-value=6.6e-28  Score=216.87  Aligned_cols=126  Identities=43%  Similarity=0.855  Sum_probs=110.9

Q ss_pred             eecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC-CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCC
Q 037565           84 MVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY-KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDT  162 (218)
Q Consensus        84 ~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~-~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~  162 (218)
                      ..+.+++|+++|.||||||.|.|++||||+++||+|.+|. .|+.+.++.|||++|+||+.+.|.++.|......  |..
T Consensus        40 ~~~~~~~Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~~~~~~~f~p~~SSt~~~~~c~~~~c~~~~~~--~~~  117 (398)
T KOG1339|consen   40 SSYSSGEYYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACYSQHNPIFDPSASSTYKSVGCSSPRCKSLPQS--CSP  117 (398)
T ss_pred             ccccccccEEEEecCCCCeeeEEEEeCCCCceeeccccccccccccCCCccCccccccccccCCCCccccccccC--ccc
Confidence            3457889999999999999999999999999999999999 8987666679999999999999999999988754  443


Q ss_pred             CCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCC
Q 037565          163 SRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGT  215 (218)
Q Consensus       163 ~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~  215 (218)
                      . +.|.|.+.|+||+.++|.|++|+|+|++.+   .+.+++++|||+..+.|.
T Consensus       118 ~-~~C~y~i~Ygd~~~~~G~l~~Dtv~~~~~~---~~~~~~~~FGc~~~~~g~  166 (398)
T KOG1339|consen  118 N-SSCPYSIQYGDGSSTSGYLATDTVTFGGTT---SLPVPNQTFGCGTNNPGS  166 (398)
T ss_pred             C-CcCceEEEeCCCCceeEEEEEEEEEEcccc---ccccccEEEEeeecCccc
Confidence            3 689999999998888899999999999743   157788999999998763


No 3  
>PF14543 TAXi_N:  Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=99.94  E-value=7.3e-27  Score=186.34  Aligned_cols=117  Identities=37%  Similarity=0.755  Sum_probs=95.0

Q ss_pred             EEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCC-CCCCCCCCccee
Q 037565           91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD-RRCDTSRNTCQF  169 (218)
Q Consensus        91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~-~~C~~~~~~c~~  169 (218)
                      |+++|.||||+|++.|+|||||+++|++|         ..+.|+|.+|+||+.++|.++.|...++. ..|...++.|.|
T Consensus         1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~~C---------~~~~f~~~~Sst~~~v~C~s~~C~~~~~~~~~~~~~~~~C~y   71 (164)
T PF14543_consen    1 YYVSVSIGTPPQPFSLVVDTGSDLTWVQC---------PDPPFDPSKSSTYRPVPCSSPQCSSAPSFCPCCCCSNNSCPY   71 (164)
T ss_dssp             EEEEEECTCTTEEEEEEEETT-SSEEEET-------------STT-TTSSBEC-BTTSHHHHHCTSSBTCCTCESSEEEE
T ss_pred             CEEEEEeCCCCceEEEEEECCCCceEEcC---------CCcccCCccCCcccccCCCCcchhhcccccccCCCCcCcccc
Confidence            89999999999999999999999999999         36889999999999999999999988753 233332368999


Q ss_pred             EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCCCC
Q 037565          170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGTFD  217 (218)
Q Consensus       170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~f~  217 (218)
                      .+.|+|++.++|++++|+|+++..++. ...+++++|||++.+.|.|.
T Consensus        72 ~~~y~~~s~~~G~l~~D~~~~~~~~~~-~~~~~~~~FGC~~~~~g~~~  118 (164)
T PF14543_consen   72 SQSYGDGSSSSGFLASDTLTFGSSSGG-SNSVPDFIFGCATSNSGLFY  118 (164)
T ss_dssp             EEEETTTEEEEEEEEEEEEEEEEESSS-SEEEEEEEEEEE-GGGTSST
T ss_pred             eeecCCCccccCceEEEEEEecCCCCC-CceeeeEEEEeeeccccCCc
Confidence            999999999999999999999876432 25789999999999887663


No 4  
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=99.94  E-value=5.7e-26  Score=199.12  Aligned_cols=123  Identities=27%  Similarity=0.478  Sum_probs=102.9

Q ss_pred             ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565           89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ  168 (218)
Q Consensus        89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~  168 (218)
                      ++|+++|.||||+|++.|++||||+++||+|.+|..|..+.++.|||++|+|++.+.|.+..|..   ...|.+  +.|.
T Consensus         2 ~~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~---~~~~~~--~~~~   76 (326)
T cd06096           2 AYYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCY---CLSCLN--NKCE   76 (326)
T ss_pred             ceEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccc---cCcCCC--CcCc
Confidence            58999999999999999999999999999999999998777899999999999999999999943   224554  5799


Q ss_pred             eEEEcCCCceEEEEEEEEEEEecCCCCC-CCcccCCeEEEeEEcCCCCC
Q 037565          169 FYYSYADGSYTTGNVGTEVFVISSKTNP-DQGTFPNIVYGCGNNNTGTF  216 (218)
Q Consensus       169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~-~~~~~~~~~FGc~~~~~g~f  216 (218)
                      |.+.|++|+.+.|.+++|+|+|++.... .+....++.|||+..+.+.|
T Consensus        77 ~~i~Y~~gs~~~G~~~~D~v~lg~~~~~~~~~~~~~~~fg~~~~~~~~~  125 (326)
T cd06096          77 YSISYSEGSSISGFYFSDFVSFESYLNSNSEKESFKKIFGCHTHETNLF  125 (326)
T ss_pred             EEEEECCCCceeeEEEEEEEEeccCCCCccccccccEEeccCccccCcc
Confidence            9999999998889999999999874210 00112368899998776644


No 5  
>PTZ00165 aspartyl protease; Provisional
Probab=99.92  E-value=1.7e-24  Score=198.61  Aligned_cols=111  Identities=25%  Similarity=0.405  Sum_probs=94.7

Q ss_pred             CcccceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCC
Q 037565           79 ALQAPMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEK  156 (218)
Q Consensus        79 ~~~~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~  156 (218)
                      ..++..++.+.+|+++|.||||||+|.|++||||+++||+|..|.  .|  +.++.|||++|+||+.+.+..        
T Consensus       109 ~~~~l~n~~d~~Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C--~~~~~yd~s~SSTy~~~~~~~--------  178 (482)
T PTZ00165        109 LQQDLLNFHNSQYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGC--APHRKFDPKKSSTYTKLKLGD--------  178 (482)
T ss_pred             cceecccccCCeEEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCcccc--cccCCCCccccCCcEecCCCC--------
Confidence            345667899999999999999999999999999999999999997  47  667899999999999852110        


Q ss_pred             CCCCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          157 DRRCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       157 ~~~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                              ....+.+.|++|+.. |.+++|+|+|++      +.++++.|||++.+.+
T Consensus       179 --------~~~~~~i~YGsGs~~-G~l~~DtV~ig~------l~i~~q~FG~a~~~s~  221 (482)
T PTZ00165        179 --------ESAETYIQYGTGECV-LALGKDTVKIGG------LKVKHQSIGLAIEESL  221 (482)
T ss_pred             --------ccceEEEEeCCCcEE-EEEEEEEEEECC------EEEccEEEEEEEeccc
Confidence                    122567999999876 999999999987      6899999999998654


No 6  
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site 
Probab=99.91  E-value=5.2e-24  Score=157.91  Aligned_cols=95  Identities=28%  Similarity=0.543  Sum_probs=84.3

Q ss_pred             EEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCcc-CCCCCCcceeccCCCccccCCCCCCCCCCCCCcceeEE
Q 037565           93 VNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIF-DPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQFYY  171 (218)
Q Consensus        93 ~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f-~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~~~  171 (218)
                      ++|.||||||++.|+|||||+++||+|..|..|..+.++.| +|.+|++++.                     ..|.|.+
T Consensus         1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~~~~~~~~~~sst~~~---------------------~~~~~~~   59 (109)
T cd05470           1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYSHSSYDDPSASSTYSD---------------------NGCTFSI   59 (109)
T ss_pred             CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCcccccccccCCcCCCCCCCC---------------------CCcEEEE
Confidence            47999999999999999999999999999987765666777 9999999876                     3699999


Q ss_pred             EcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCC
Q 037565          172 SYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGT  215 (218)
Q Consensus       172 ~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~  215 (218)
                      .|++|+.. |.++.|+|+|++      ..++++.|||++.+.+.
T Consensus        60 ~Y~~g~~~-g~~~~D~v~ig~------~~~~~~~fg~~~~~~~~   96 (109)
T cd05470          60 TYGTGSLS-GGLSTDTVSIGD------IEVVGQAFGCATDEPGA   96 (109)
T ss_pred             EeCCCeEE-EEEEEEEEEECC------EEECCEEEEEEEecCCc
Confidence            99999765 999999999987      57999999999988664


No 7  
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=99.90  E-value=4.6e-23  Score=180.38  Aligned_cols=99  Identities=24%  Similarity=0.430  Sum_probs=88.7

Q ss_pred             cCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC----CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCC
Q 037565           86 SSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY----KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCD  161 (218)
Q Consensus        86 ~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~----~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~  161 (218)
                      +.+.+|+++|.||||||++.|++||||+++||+|..|.    .|  ..++.|||++|+||+.                  
T Consensus         2 ~~~~~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~C--~~~~~y~~~~SsT~~~------------------   61 (325)
T cd05490           2 YMDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIAC--WLHHKYNSSKSSTYVK------------------   61 (325)
T ss_pred             CcCCEEEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCccc--cCcCcCCcccCcceee------------------
Confidence            56889999999999999999999999999999999997    46  4567899999999987                  


Q ss_pred             CCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          162 TSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       162 ~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                         ..|.|.+.|++|+. .|.+++|+|+|++      ..++++.|||++.+.|
T Consensus        62 ---~~~~~~i~Yg~G~~-~G~~~~D~v~~g~------~~~~~~~Fg~~~~~~~  104 (325)
T cd05490          62 ---NGTEFAIQYGSGSL-SGYLSQDTVSIGG------LQVEGQLFGEAVKQPG  104 (325)
T ss_pred             ---CCcEEEEEECCcEE-EEEEeeeEEEECC------EEEcCEEEEEEeeccC
Confidence               35899999999985 6999999999987      5899999999987654


No 8  
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=99.89  E-value=1.5e-22  Score=176.71  Aligned_cols=105  Identities=28%  Similarity=0.401  Sum_probs=92.0

Q ss_pred             ceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCC
Q 037565           83 PMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDT  162 (218)
Q Consensus        83 ~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~  162 (218)
                      ..++.+.+|+++|.||||+|++.|++||||+++||+|..|..|.-+.++.|||++|+|++..                  
T Consensus         3 l~n~~~~~Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~~~~~f~~~~Sst~~~~------------------   64 (317)
T cd05478           3 LTNYLDMEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACSNHNRFNPRQSSTYQST------------------   64 (317)
T ss_pred             cccccCCEEEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCcccccccCcCCCCCCcceeeC------------------
Confidence            34667899999999999999999999999999999999998432256789999999999874                  


Q ss_pred             CCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCC
Q 037565          163 SRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGT  215 (218)
Q Consensus       163 ~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~  215 (218)
                         .|.|.+.|++|+. .|.++.|+|+|++      +.++++.|||++.+.+.
T Consensus        65 ---~~~~~~~yg~gs~-~G~~~~D~v~ig~------~~i~~~~fg~~~~~~~~  107 (317)
T cd05478          65 ---GQPLSIQYGTGSM-TGILGYDTVQVGG------ISDTNQIFGLSETEPGS  107 (317)
T ss_pred             ---CcEEEEEECCceE-EEEEeeeEEEECC------EEECCEEEEEEEecCcc
Confidence               5789999999985 6999999999987      68999999999877654


No 9  
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=99.88  E-value=5.2e-22  Score=173.23  Aligned_cols=97  Identities=31%  Similarity=0.535  Sum_probs=87.5

Q ss_pred             CceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCC
Q 037565           88 GGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRN  165 (218)
Q Consensus        88 ~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~  165 (218)
                      |.+|+++|.||||||++.|++||||+++||+|..|.  .|  ..++.|||++|+||+.                     .
T Consensus         1 ~~~y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C--~~~~~f~~~~SsT~~~---------------------~   57 (318)
T cd05477           1 DMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQAC--TNHTKFNPSQSSTYST---------------------N   57 (318)
T ss_pred             CcEEEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccc--cccCCCCcccCCCceE---------------------C
Confidence            468999999999999999999999999999999998  57  4578999999999987                     3


Q ss_pred             cceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          166 TCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       166 ~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                      .|.|.+.|++|+. .|.+++|+|+|++      +.++++.|||++...+
T Consensus        58 ~~~~~~~Yg~Gs~-~G~~~~D~i~~g~------~~i~~~~Fg~~~~~~~   99 (318)
T cd05477          58 GETFSLQYGSGSL-TGIFGYDTVTVQG------IIITNQEFGLSETEPG   99 (318)
T ss_pred             CcEEEEEECCcEE-EEEEEeeEEEECC------EEEcCEEEEEEEeccc
Confidence            6899999999986 5999999999987      6889999999988654


No 10 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=99.87  E-value=5.7e-22  Score=169.93  Aligned_cols=98  Identities=28%  Similarity=0.368  Sum_probs=89.0

Q ss_pred             EEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcceeE
Q 037565           91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQFY  170 (218)
Q Consensus        91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~~  170 (218)
                      |+++|.||||||++.|++||||+++||+|..|..|..+.++.||+++|+|++..                    ..|.|.
T Consensus         1 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~--------------------~~~~~~   60 (278)
T cd06097           1 YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLL--------------------PGATWS   60 (278)
T ss_pred             CeeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceec--------------------CCcEEE
Confidence            899999999999999999999999999999999887667788999999999875                    368999


Q ss_pred             EEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          171 YSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       171 ~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                      +.|++|+...|.+++|+|+|++      ..++++.|||++...+
T Consensus        61 i~Y~~G~~~~G~~~~D~v~ig~------~~~~~~~fg~~~~~~~   98 (278)
T cd06097          61 ISYGDGSSASGIVYTDTVSIGG------VEVPNQAIELATAVSA   98 (278)
T ss_pred             EEeCCCCeEEEEEEEEEEEECC------EEECCeEEEEEeecCc
Confidence            9999998778999999999987      5789999999987654


No 11 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=99.87  E-value=9.8e-22  Score=171.70  Aligned_cols=102  Identities=25%  Similarity=0.382  Sum_probs=90.0

Q ss_pred             ceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC---CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCC
Q 037565           83 PMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY---KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRR  159 (218)
Q Consensus        83 ~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~---~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~  159 (218)
                      ..++.+.+|+++|.||||||++.|++||||+++||+|..|.   .|  ..++.|||++|+||+..               
T Consensus         3 l~n~~~~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C--~~~~~y~~~~SsT~~~~---------------   65 (317)
T cd06098           3 LKNYLDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIAC--YFHSKYKSSKSSTYKKN---------------   65 (317)
T ss_pred             ccccCCCEEEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccc--cccCcCCcccCCCcccC---------------
Confidence            34678899999999999999999999999999999999996   68  45688999999999873               


Q ss_pred             CCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          160 CDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       160 C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                            ...+.+.|++|+. .|.+++|+|+|++      ..++++.|||++.+.+
T Consensus        66 ------~~~~~i~Yg~G~~-~G~~~~D~v~ig~------~~v~~~~f~~~~~~~~  107 (317)
T cd06098          66 ------GTSASIQYGTGSI-SGFFSQDSVTVGD------LVVKNQVFIEATKEPG  107 (317)
T ss_pred             ------CCEEEEEcCCceE-EEEEEeeEEEECC------EEECCEEEEEEEecCC
Confidence                  4678999999986 5999999999987      5899999999987543


No 12 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=99.87  E-value=7.9e-22  Score=173.16  Aligned_cols=103  Identities=26%  Similarity=0.421  Sum_probs=91.2

Q ss_pred             cceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC----CCCCCCCCccCCCCCCcceeccCCCccccCCCCC
Q 037565           82 APMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY----KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD  157 (218)
Q Consensus        82 ~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~----~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~  157 (218)
                      ...++.+.+|+++|.||||+|++.|++||||+++||+|..|.    .|  ..++.|||++|+|++.              
T Consensus         3 ~~~n~~~~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c--~~~~~y~~~~Sst~~~--------------   66 (329)
T cd05485           3 PLSNYMDAQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIAC--LLHNKYDSTKSSTYKK--------------   66 (329)
T ss_pred             cceeccCCeEEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCccc--cCCCeECCcCCCCeEE--------------
Confidence            345788999999999999999999999999999999999997    46  3467899999999987              


Q ss_pred             CCCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          158 RRCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       158 ~~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                             ..|.|.+.|++|+. .|.+++|+|+|++      +.++++.|||+.++.|
T Consensus        67 -------~~~~~~i~Y~~g~~-~G~~~~D~v~ig~------~~~~~~~fg~~~~~~~  109 (329)
T cd05485          67 -------NGTEFAIQYGSGSL-SGFLSTDTVSVGG------VSVKGQTFAEAINEPG  109 (329)
T ss_pred             -------CCeEEEEEECCceE-EEEEecCcEEECC------EEECCEEEEEEEecCC
Confidence                   36899999999985 6999999999987      5789999999987654


No 13 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=99.86  E-value=1.1e-21  Score=171.19  Aligned_cols=94  Identities=34%  Similarity=0.516  Sum_probs=84.8

Q ss_pred             EEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565           91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ  168 (218)
Q Consensus        91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~  168 (218)
                      |+++|.||||||++.|++||||+++||+|..|.  .|  ..++.|||++|+||+.                     ..|.
T Consensus         1 Y~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C--~~~~~y~~~~SsT~~~---------------------~~~~   57 (316)
T cd05486           1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQAC--TKHNRFQPSESSTYVS---------------------NGEA   57 (316)
T ss_pred             CeEEEEECCCCcEEEEEEcCCCccEEEecCCCCCccc--CccceECCCCCccccc---------------------CCcE
Confidence            899999999999999999999999999999997  57  5678999999999987                     3689


Q ss_pred             eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                      |.+.|++|+. .|.+++|+|+|++      +.++++.|||+..+.+
T Consensus        58 ~~i~Yg~g~~-~G~~~~D~v~ig~------~~~~~~~fg~~~~~~~   96 (316)
T cd05486          58 FSIQYGTGSL-TGIIGIDQVTVEG------ITVQNQQFAESVSEPG   96 (316)
T ss_pred             EEEEeCCcEE-EEEeeecEEEECC------EEEcCEEEEEeeccCc
Confidence            9999999975 6999999999987      6899999999977654


No 14 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=99.86  E-value=4.5e-21  Score=167.63  Aligned_cols=101  Identities=28%  Similarity=0.442  Sum_probs=90.0

Q ss_pred             eecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCC
Q 037565           84 MVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCD  161 (218)
Q Consensus        84 ~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~  161 (218)
                      .++.+.+|+++|.||||+|++.|++||||+++||+|..|.  .|  ..++.|+|++|+|++.                  
T Consensus         4 ~n~~~~~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C--~~~~~y~~~~Sst~~~------------------   63 (320)
T cd05488           4 TNYLNAQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIAC--FLHSKYDSSASSTYKA------------------   63 (320)
T ss_pred             cccCCCEEEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCccc--CCcceECCCCCcceee------------------
Confidence            3567889999999999999999999999999999999997  57  4567999999999986                  


Q ss_pred             CCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          162 TSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       162 ~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                         +.|.|.+.|++|+. .|.+++|+|+|++      +.++++.|||++...|
T Consensus        64 ---~~~~~~~~y~~g~~-~G~~~~D~v~ig~------~~~~~~~f~~a~~~~g  106 (320)
T cd05488          64 ---NGTEFKIQYGSGSL-EGFVSQDTLSIGD------LTIKKQDFAEATSEPG  106 (320)
T ss_pred             ---CCCEEEEEECCceE-EEEEEEeEEEECC------EEECCEEEEEEecCCC
Confidence               46899999999985 6999999999987      6889999999987655


No 15 
>PTZ00147 plasmepsin-1; Provisional
Probab=99.86  E-value=9.3e-21  Score=172.82  Aligned_cols=102  Identities=16%  Similarity=0.244  Sum_probs=88.6

Q ss_pred             ccceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCCC
Q 037565           81 QAPMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDR  158 (218)
Q Consensus        81 ~~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~  158 (218)
                      .+..++.+.+|+++|.||||||++.|++||||+++||+|..|.  .|  +.++.|||++|+||+..              
T Consensus       130 v~L~n~~n~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C--~~~~~yd~s~SsT~~~~--------------  193 (453)
T PTZ00147        130 VELKDLANVMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGC--ETKNLYDSSKSKTYEKD--------------  193 (453)
T ss_pred             eeccccCCCEEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccc--cCCCccCCccCcceEEC--------------
Confidence            3444778899999999999999999999999999999999997  56  56789999999999873              


Q ss_pred             CCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCC
Q 037565          159 RCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNT  213 (218)
Q Consensus       159 ~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~  213 (218)
                             .+.|.+.|++|+. .|.+++|+|+|++      +.++ ..|||+....
T Consensus       194 -------~~~f~i~Yg~Gsv-sG~~~~DtVtiG~------~~v~-~qF~~~~~~~  233 (453)
T PTZ00147        194 -------GTKVEMNYVSGTV-SGFFSKDLVTIGN------LSVP-YKFIEVTDTN  233 (453)
T ss_pred             -------CCEEEEEeCCCCE-EEEEEEEEEEECC------EEEE-EEEEEEEecc
Confidence                   6899999999975 5999999999987      5676 5799987643


No 16 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=99.84  E-value=5.3e-20  Score=167.66  Aligned_cols=102  Identities=15%  Similarity=0.268  Sum_probs=88.2

Q ss_pred             cccceecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCC--CCCCCCCCccCCCCCCcceeccCCCccccCCCCC
Q 037565           80 LQAPMVSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCY--KCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD  157 (218)
Q Consensus        80 ~~~~~~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~--~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~  157 (218)
                      ..+..++.+.+|+++|.||||+|++.|++||||+++||+|..|.  .|  +.++.|||++|+|++..             
T Consensus       128 ~~~l~d~~n~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C--~~~~~yd~s~SsT~~~~-------------  192 (450)
T PTZ00013        128 VIELDDVANIMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGC--SIKNLYDSSKSKSYEKD-------------  192 (450)
T ss_pred             ceeeeccCCCEEEEEEEECCCCeEEEEEEeCCCCceEEecccCCcccc--ccCCCccCccCcccccC-------------
Confidence            34445778889999999999999999999999999999999997  57  56789999999999873             


Q ss_pred             CCCCCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcC
Q 037565          158 RRCDTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNN  212 (218)
Q Consensus       158 ~~C~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~  212 (218)
                              .|.|.+.|++|+. .|.+++|+|+|++      +.++ ..|||+...
T Consensus       193 --------~~~~~i~YG~Gsv-~G~~~~Dtv~iG~------~~~~-~~f~~~~~~  231 (450)
T PTZ00013        193 --------GTKVDITYGSGTV-KGFFSKDLVTLGH------LSMP-YKFIEVTDT  231 (450)
T ss_pred             --------CcEEEEEECCceE-EEEEEEEEEEECC------EEEc-cEEEEEEec
Confidence                    6899999999984 6999999999987      5666 578888654


No 17 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=99.83  E-value=3.1e-20  Score=162.75  Aligned_cols=98  Identities=24%  Similarity=0.447  Sum_probs=85.5

Q ss_pred             ecCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCC----CCCCCCCccCCCCCCcceeccCCCccccCCCCCCCC
Q 037565           85 VSSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYK----CYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRC  160 (218)
Q Consensus        85 ~~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~----C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C  160 (218)
                      ++.+.+|+++|.||||+|++.|++||||+++||+|..|..    |  ..++.|||++|+||+.                 
T Consensus         3 ~~~~~~y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c--~~~~~y~~~~SsT~~~-----------------   63 (326)
T cd05487           3 NYLDTQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTAC--VTHNLYDASDSSTYKE-----------------   63 (326)
T ss_pred             ccCCCeEEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchhh--cccCcCCCCCCeeeeE-----------------
Confidence            5678899999999999999999999999999999888864    6  4567899999999987                 


Q ss_pred             CCCCCcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCC
Q 037565          161 DTSRNTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNT  213 (218)
Q Consensus       161 ~~~~~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~  213 (218)
                          ..|.|.+.|++|+ +.|.+++|+|+|++      +.+ .+.|||..+..
T Consensus        64 ----~~~~~~~~Yg~g~-~~G~~~~D~v~~g~------~~~-~~~fg~~~~~~  104 (326)
T cd05487          64 ----NGTEFTIHYASGT-VKGFLSQDIVTVGG------IPV-TQMFGEVTALP  104 (326)
T ss_pred             ----CCEEEEEEeCCce-EEEEEeeeEEEECC------EEe-eEEEEEEEecc
Confidence                3689999999998 56999999999986      455 47899998753


No 18 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=99.83  E-value=3.7e-20  Score=158.56  Aligned_cols=84  Identities=39%  Similarity=0.850  Sum_probs=72.4

Q ss_pred             ceEEEEEEeCCCceeEEEEEeCCCCceeEecC-CCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcc
Q 037565           89 GEYFVNISIGTPRVELIGIADTASDLTWIQCK-PCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTC  167 (218)
Q Consensus        89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~-~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c  167 (218)
                      ++|+++|.||||||++.|++||||+++||+|. +|..|                                        .|
T Consensus         1 ~~Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c----------------------------------------~c   40 (273)
T cd05475           1 GYYYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGC----------------------------------------QC   40 (273)
T ss_pred             CceEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCC----------------------------------------cC
Confidence            57999999999999999999999999999994 66555                                        27


Q ss_pred             eeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          168 QFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       168 ~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                      .|.+.|+||+.++|.+++|+|+|+..++.  ..++++.|||+..+.+
T Consensus        41 ~~~i~Ygd~~~~~G~~~~D~v~~~~~~~~--~~~~~~~Fgc~~~~~~   85 (273)
T cd05475          41 DYEIEYADGGSSMGVLVTDIFSLKLTNGS--RAKPRIAFGCGYDQQG   85 (273)
T ss_pred             ccEeEeCCCCceEEEEEEEEEEEeecCCC--cccCCEEEEeeeccCC
Confidence            89999998888889999999999765442  5788999999987654


No 19 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=99.80  E-value=3.9e-19  Score=158.04  Aligned_cols=78  Identities=27%  Similarity=0.403  Sum_probs=70.7

Q ss_pred             ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565           89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ  168 (218)
Q Consensus        89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~  168 (218)
                      .+|+++|.||||+|++.|+|||||+++||+|.+|.    +.++.|||++|+||+..                     .|.
T Consensus         2 ~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~----~~~~~f~~~~SsT~~~~---------------------~~~   56 (364)
T cd05473           2 QGYYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHP----FIHTYFHRELSSTYRDL---------------------GKG   56 (364)
T ss_pred             CceEEEEEecCCCceEEEEEecCCcceEEEcCCCc----cccccCCchhCcCcccC---------------------Cce
Confidence            46999999999999999999999999999998874    34678999999999884                     689


Q ss_pred             eEEEcCCCceEEEEEEEEEEEecC
Q 037565          169 FYYSYADGSYTTGNVGTEVFVISS  192 (218)
Q Consensus       169 ~~~~Y~dgs~~~G~l~~D~~~~~~  192 (218)
                      |.+.|++|+. .|.+++|+|+|++
T Consensus        57 ~~i~Yg~Gs~-~G~~~~D~v~ig~   79 (364)
T cd05473          57 VTVPYTQGSW-EGELGTDLVSIPK   79 (364)
T ss_pred             EEEEECcceE-EEEEEEEEEEECC
Confidence            9999999986 5999999999985


No 20 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=99.79  E-value=1.5e-18  Score=147.58  Aligned_cols=96  Identities=40%  Similarity=0.756  Sum_probs=82.0

Q ss_pred             EEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCc--cCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565           91 YFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPI--FDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ  168 (218)
Q Consensus        91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~--f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~  168 (218)
                      |+++|.||+|+|++.|++||||+++||+|..|..|..+....  |++..|+++..                     ..|.
T Consensus         1 Y~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~~~~~~~~~s~~~~~---------------------~~~~   59 (283)
T cd05471           1 YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHPRFKYDSSKSSTYKD---------------------TGCT   59 (283)
T ss_pred             CEEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCCCCccCccCCceeec---------------------CCCE
Confidence            789999999999999999999999999999999875554444  67777766544                     4799


Q ss_pred             eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                      |.+.|++|+.. |.+++|+|+|++      ..++++.|||++...+
T Consensus        60 ~~~~Y~~g~~~-g~~~~D~v~~~~------~~~~~~~fg~~~~~~~   98 (283)
T cd05471          60 FSITYGDGSVT-GGLGTDTVTIGG------LTIPNQTFGCATSESG   98 (283)
T ss_pred             EEEEECCCeEE-EEEEEeEEEECC------EEEeceEEEEEeccCC
Confidence            99999998765 999999999997      4689999999998653


No 21 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=99.78  E-value=4.6e-19  Score=153.33  Aligned_cols=78  Identities=47%  Similarity=1.032  Sum_probs=69.1

Q ss_pred             eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCccee
Q 037565           90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQF  169 (218)
Q Consensus        90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~  169 (218)
                      +|+++|.||||||++.|++||||+++||+|.+|                                            |.|
T Consensus         1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~c~~c--------------------------------------------~~~   36 (299)
T cd05472           1 EYVVTVGLGTPARDQTVIVDTGSDLTWVQCQPC--------------------------------------------CLY   36 (299)
T ss_pred             CeEEEEecCCCCcceEEEecCCCCcccccCCCC--------------------------------------------Cee
Confidence            699999999999999999999999999988654                                            368


Q ss_pred             EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCCCC
Q 037565          170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTGTF  216 (218)
Q Consensus       170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g~f  216 (218)
                      .+.|++|+.++|.+++|+|+|++.     ..++++.|||+..+.+.|
T Consensus        37 ~i~Yg~Gs~~~G~~~~D~v~ig~~-----~~~~~~~Fg~~~~~~~~~   78 (299)
T cd05472          37 QVSYGDGSYTTGDLATDTLTLGSS-----DVVPGFAFGCGHDNEGLF   78 (299)
T ss_pred             eeEeCCCceEEEEEEEEEEEeCCC-----CccCCEEEECCccCCCcc
Confidence            999999998889999999999872     268999999999876644


No 22 
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability 
Probab=99.75  E-value=4.9e-18  Score=151.25  Aligned_cols=101  Identities=24%  Similarity=0.395  Sum_probs=81.7

Q ss_pred             eCCCcee-EEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCC-----------CCCCCCC
Q 037565           97 IGTPRVE-LIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKD-----------RRCDTSR  164 (218)
Q Consensus        97 iGtP~q~-~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~-----------~~C~~~~  164 (218)
                      +|||-.+ +.|++||||+++||||.+              .+|+||+.++|.++.|+..+.+           ..|.+  
T Consensus         2 ~~~~~~~~~~~~~DTGS~l~WvqC~~--------------~~sst~~~~~C~s~~C~~~~~~~~~~~~~~~~~~~c~~--   65 (362)
T cd05489           2 TITPLKGAVPLVLDLAGPLLWSTCDA--------------GHSSTYQTVPCSSSVCSLANRYHCPGTCGGAPGPGCGN--   65 (362)
T ss_pred             cccCccCCeeEEEECCCCceeeeCCC--------------CCcCCCCccCcCChhhccccccCCCccccCCCCCCCCC--
Confidence            5888888 999999999999999975              4688999999999999876532           24654  


Q ss_pred             CcceeEEE-cCCCceEEEEEEEEEEEecCCCCCC--CcccCCeEEEeEEcCC
Q 037565          165 NTCQFYYS-YADGSYTTGNVGTEVFVISSKTNPD--QGTFPNIVYGCGNNNT  213 (218)
Q Consensus       165 ~~c~~~~~-Y~dgs~~~G~l~~D~~~~~~~~~~~--~~~~~~~~FGc~~~~~  213 (218)
                      +.|.|... |++|+.++|+|++|+|+|+..++..  ..+++++.|||++++.
T Consensus        66 ~~C~y~~~~y~~gs~t~G~l~~Dtl~~~~~~g~~~~~~~~~~~~FGC~~~~~  117 (362)
T cd05489          66 NTCTAHPYNPVTGECATGDLTQDVLSANTTDGSNPLLVVIFNFVFSCAPSLL  117 (362)
T ss_pred             CcCeeEccccccCcEeeEEEEEEEEEecccCCCCcccceeCCEEEEcCCccc
Confidence            46999765 8899899999999999998654421  1368999999998753


No 23 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=99.71  E-value=2.6e-17  Score=142.33  Aligned_cols=96  Identities=32%  Similarity=0.601  Sum_probs=83.8

Q ss_pred             eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCC-CCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565           90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKC-YKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ  168 (218)
Q Consensus        90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C-~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~  168 (218)
                      +|+++|.||||+|++.|++||||+.+||++..|..| .-.....|++.+|+|++..                     .+.
T Consensus         1 ~Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~---------------------~~~   59 (317)
T PF00026_consen    1 QYYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQ---------------------GKP   59 (317)
T ss_dssp             EEEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEE---------------------EEE
T ss_pred             CeEEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccc---------------------eee
Confidence            699999999999999999999999999998888765 2245678999999999884                     578


Q ss_pred             eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCC
Q 037565          169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNT  213 (218)
Q Consensus       169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~  213 (218)
                      +.+.|++|+ .+|.+++|+|.|++      +.++++.||.+....
T Consensus        60 ~~~~y~~g~-~~G~~~~D~v~ig~------~~~~~~~f~~~~~~~   97 (317)
T PF00026_consen   60 FSISYGDGS-VSGNLVSDTVSIGG------LTIPNQTFGLADSYS   97 (317)
T ss_dssp             EEEEETTEE-EEEEEEEEEEEETT------EEEEEEEEEEEEEEE
T ss_pred             eeeeccCcc-cccccccceEeeee------ccccccceecccccc
Confidence            999999999 66999999999987      678899999998843


No 24 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=99.67  E-value=1.8e-16  Score=135.14  Aligned_cols=74  Identities=50%  Similarity=0.972  Sum_probs=65.4

Q ss_pred             eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCccee
Q 037565           90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQF  169 (218)
Q Consensus        90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~  169 (218)
                      +|+++|.||||||++.|++||||+++||+|                                               |.|
T Consensus         1 ~Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~-----------------------------------------------~~~   33 (265)
T cd05476           1 EYLVTLSIGTPPQPFSLIVDTGSDLTWTQC-----------------------------------------------CSY   33 (265)
T ss_pred             CeEEEEecCCCCcceEEEecCCCCCEEEcC-----------------------------------------------Cce
Confidence            699999999999999999999999999986                                               257


Q ss_pred             EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcCCC
Q 037565          170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNNTG  214 (218)
Q Consensus       170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~~g  214 (218)
                      .+.|+||+..+|.+++|+|+|++..    ..++++.|||+.++.+
T Consensus        34 ~~~Y~dg~~~~G~~~~D~v~~g~~~----~~~~~~~Fg~~~~~~~   74 (265)
T cd05476          34 EYSYGDGSSTSGVLATETFTFGDSS----VSVPNVAFGCGTDNEG   74 (265)
T ss_pred             EeEeCCCceeeeeEEEEEEEecCCC----CccCCEEEEecccccC
Confidence            8899999888999999999999721    1689999999998753


No 25 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=99.52  E-value=5.7e-14  Score=120.82  Aligned_cols=68  Identities=31%  Similarity=0.572  Sum_probs=61.5

Q ss_pred             eEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCccee
Q 037565           90 EYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQF  169 (218)
Q Consensus        90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~~  169 (218)
                      .|+++|.||||+|++.|++||||+++||+                                                 .|
T Consensus         2 ~Y~~~i~iGtp~q~~~v~~DTgS~~~wv~-------------------------------------------------~~   32 (295)
T cd05474           2 YYSAELSVGTPPQKVTVLLDTGSSDLWVP-------------------------------------------------DF   32 (295)
T ss_pred             eEEEEEEECCCCcEEEEEEeCCCCcceee-------------------------------------------------ee
Confidence            59999999999999999999999999996                                                 25


Q ss_pred             EEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcC
Q 037565          170 YYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNN  212 (218)
Q Consensus       170 ~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~  212 (218)
                      .+.|++|+.+.|.+++|+|+|++      ..++++.|||+++.
T Consensus        33 ~~~Y~~g~~~~G~~~~D~v~~g~------~~~~~~~fg~~~~~   69 (295)
T cd05474          33 SISYGDGTSASGTWGTDTVSIGG------ATVKNLQFAVANST   69 (295)
T ss_pred             EEEeccCCcEEEEEEEEEEEECC------eEecceEEEEEecC
Confidence            77899988788999999999987      47889999999863


No 26 
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=97.05  E-value=0.0025  Score=44.98  Aligned_cols=79  Identities=18%  Similarity=0.148  Sum_probs=52.0

Q ss_pred             ceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCcce
Q 037565           89 GEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNTCQ  168 (218)
Q Consensus        89 ~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~c~  168 (218)
                      +.|++++.|+  .+++.+++|||++.+|+.-.-...+     ..  +     ..                      ....
T Consensus         1 ~~~~v~v~i~--~~~~~~llDTGa~~s~i~~~~~~~l-----~~--~-----~~----------------------~~~~   44 (96)
T cd05483           1 GHFVVPVTIN--GQPVRFLLDTGASTTVISEELAERL-----GL--P-----LT----------------------LGGK   44 (96)
T ss_pred             CcEEEEEEEC--CEEEEEEEECCCCcEEcCHHHHHHc-----CC--C-----cc----------------------CCCc
Confidence            3589999999  6999999999999999864211111     00  0     00                      1234


Q ss_pred             eEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeE
Q 037565          169 FYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCG  209 (218)
Q Consensus       169 ~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~  209 (218)
                      ..+...+|.........+.|++++      ..+.++.+.-.
T Consensus        45 ~~~~~~~G~~~~~~~~~~~i~ig~------~~~~~~~~~v~   79 (96)
T cd05483          45 VTVQTANGRVRAARVRLDSLQIGG------ITLRNVPAVVL   79 (96)
T ss_pred             EEEEecCCCccceEEEcceEEECC------cEEeccEEEEe
Confidence            566777777666666688899987      45666555444


No 27 
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=92.82  E-value=1.1  Score=33.70  Aligned_cols=84  Identities=11%  Similarity=0.090  Sum_probs=52.9

Q ss_pred             CCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCCc
Q 037565           87 SGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRNT  166 (218)
Q Consensus        87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~~  166 (218)
                      .+|.|++++.|.  .+++.+++|||++.+-+.-.--..-      ..++..                           ..
T Consensus         8 ~~g~~~v~~~In--G~~~~flVDTGAs~t~is~~~A~~L------gl~~~~---------------------------~~   52 (121)
T TIGR02281         8 GDGHFYATGRVN--GRNVRFLVDTGATSVALNEEDAQRL------GLDLNR---------------------------LG   52 (121)
T ss_pred             CCCeEEEEEEEC--CEEEEEEEECCCCcEEcCHHHHHHc------CCCccc---------------------------CC
Confidence            578899999997  5799999999999987753210000      011110                           01


Q ss_pred             ceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEc
Q 037565          167 CQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNN  211 (218)
Q Consensus       167 c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~  211 (218)
                      -...+.=..|........-|.+.+|+      ..+.++.+.....
T Consensus        53 ~~~~~~ta~G~~~~~~~~l~~l~iG~------~~~~nv~~~v~~~   91 (121)
T TIGR02281        53 YTVTVSTANGQIKAARVTLDRVAIGG------IVVNDVDAMVAEG   91 (121)
T ss_pred             ceEEEEeCCCcEEEEEEEeCEEEECC------EEEeCcEEEEeCC
Confidence            12233334555554566888999987      5777887776643


No 28 
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=91.46  E-value=0.26  Score=34.81  Aligned_cols=27  Identities=26%  Similarity=0.097  Sum_probs=23.5

Q ss_pred             EEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565           91 YFVNISIGTPRVELIGIADTASDLTWIQC  119 (218)
Q Consensus        91 Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c  119 (218)
                      |++++.|+  .+++.+++||||+..++.-
T Consensus         1 ~~~~~~In--g~~i~~lvDTGA~~svis~   27 (91)
T cd05484           1 KTVTLLVN--GKPLKFQLDTGSAITVISE   27 (91)
T ss_pred             CEEEEEEC--CEEEEEEEcCCcceEEeCH
Confidence            57888888  5899999999999999864


No 29 
>PF13650 Asp_protease_2:  Aspartyl protease
Probab=88.00  E-value=3.8  Score=27.93  Aligned_cols=23  Identities=13%  Similarity=0.205  Sum_probs=18.7

Q ss_pred             EEEeCCCceeEEEEEeCCCCceeEe
Q 037565           94 NISIGTPRVELIGIADTASDLTWIQ  118 (218)
Q Consensus        94 ~i~iGtP~q~~~~~~DTGS~~~Wv~  118 (218)
                      ++.|+  .+++.+++|||++...+.
T Consensus         2 ~v~vn--g~~~~~liDTGa~~~~i~   24 (90)
T PF13650_consen    2 PVKVN--GKPVRFLIDTGASISVIS   24 (90)
T ss_pred             EEEEC--CEEEEEEEcCCCCcEEEC
Confidence            56666  589999999999987764


No 30 
>PF13975 gag-asp_proteas:  gag-polyprotein putative aspartyl protease
Probab=86.94  E-value=1.4  Score=29.85  Aligned_cols=32  Identities=28%  Similarity=0.474  Sum_probs=27.4

Q ss_pred             CCceEEEEEEeCCCceeEEEEEeCCCCceeEecC
Q 037565           87 SGGEYFVNISIGTPRVELIGIADTASDLTWIQCK  120 (218)
Q Consensus        87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~  120 (218)
                      ..+.+++.+.||.  +.+.+++|||++...+.-.
T Consensus         5 ~~g~~~v~~~I~g--~~~~alvDtGat~~fis~~   36 (72)
T PF13975_consen    5 DPGLMYVPVSIGG--VQVKALVDTGATHNFISES   36 (72)
T ss_pred             cCCEEEEEEEECC--EEEEEEEeCCCcceecCHH
Confidence            4578999999995  9999999999999887654


No 31 
>PF00077 RVP:  Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026;  InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=80.56  E-value=2.9  Score=29.65  Aligned_cols=27  Identities=22%  Similarity=0.458  Sum_probs=21.3

Q ss_pred             EEEEEeCCCceeEEEEEeCCCCceeEecC
Q 037565           92 FVNISIGTPRVELIGIADTASDLTWIQCK  120 (218)
Q Consensus        92 ~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~  120 (218)
                      +.+|.+.  .+++.+++|||++..-++-.
T Consensus         7 ~i~v~i~--g~~i~~LlDTGA~vsiI~~~   33 (100)
T PF00077_consen    7 YITVKIN--GKKIKALLDTGADVSIISEK   33 (100)
T ss_dssp             EEEEEET--TEEEEEEEETTBSSEEESSG
T ss_pred             eEEEeEC--CEEEEEEEecCCCcceeccc
Confidence            4566666  47999999999999888643


No 32 
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=77.78  E-value=4.3  Score=30.37  Aligned_cols=31  Identities=16%  Similarity=0.305  Sum_probs=26.1

Q ss_pred             CCceEEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565           87 SGGEYFVNISIGTPRVELIGIADTASDLTWIQC  119 (218)
Q Consensus        87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c  119 (218)
                      ....+++++.|+  .+++.+++|||++..++.-
T Consensus        13 ~~~~~~v~~~In--g~~~~~LvDTGAs~s~Is~   43 (124)
T cd05479          13 KVPMLYINVEIN--GVPVKAFVDSGAQMTIMSK   43 (124)
T ss_pred             eeeEEEEEEEEC--CEEEEEEEeCCCceEEeCH
Confidence            345689999998  5889999999999998854


No 33 
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=74.43  E-value=13  Score=30.79  Aligned_cols=72  Identities=15%  Similarity=0.083  Sum_probs=50.2

Q ss_pred             cCCceEEEEEEeCCCceeEEEEEeCCCCceeEecCCCCCCCCCCCCccCCCCCCcceeccCCCccccCCCCCCCCCCCCC
Q 037565           86 SSGGEYFVNISIGTPRVELIGIADTASDLTWIQCKPCYKCYKQNPPIFDPKESSTSKNITCQSESCLRLEKDRRCDTSRN  165 (218)
Q Consensus        86 ~~~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~C~~C~~~~~~~f~p~~Sst~~~~~C~s~~C~~~~~~~~C~~~~~  165 (218)
                      ..+|-|.++..|-  .|++..++|||-+.+-++-..-      ..--||...                           .
T Consensus       101 ~~~GHF~a~~~VN--Gk~v~fLVDTGATsVal~~~dA------~RlGid~~~---------------------------l  145 (215)
T COG3577         101 SRDGHFEANGRVN--GKKVDFLVDTGATSVALNEEDA------RRLGIDLNS---------------------------L  145 (215)
T ss_pred             cCCCcEEEEEEEC--CEEEEEEEecCcceeecCHHHH------HHhCCCccc---------------------------c
Confidence            4688899999987  6999999999999887753210      011233332                           1


Q ss_pred             cceeEEEcCCCceEEEEEEEEEEEecC
Q 037565          166 TCQFYYSYADGSYTTGNVGTEVFVISS  192 (218)
Q Consensus       166 ~c~~~~~Y~dgs~~~G~l~~D~~~~~~  192 (218)
                      .-++.+.-.+|....-.+--|.|.||+
T Consensus       146 ~y~~~v~TANG~~~AA~V~Ld~v~IG~  172 (215)
T COG3577         146 DYTITVSTANGRARAAPVTLDRVQIGG  172 (215)
T ss_pred             CCceEEEccCCccccceEEeeeEEEcc
Confidence            235556667787766778889999987


No 34 
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=71.55  E-value=6.2  Score=27.98  Aligned_cols=24  Identities=25%  Similarity=0.329  Sum_probs=19.7

Q ss_pred             EEEeCCCceeEEEEEeCCCCceeEec
Q 037565           94 NISIGTPRVELIGIADTASDLTWIQC  119 (218)
Q Consensus        94 ~i~iGtP~q~~~~~~DTGS~~~Wv~c  119 (218)
                      ++.|+  .|.+.+++|||.+++-+.-
T Consensus         2 ~~~i~--g~~~~~llDTGAd~Tvi~~   25 (87)
T cd05482           2 TLYIN--GKLFEGLLDTGADVSIIAE   25 (87)
T ss_pred             EEEEC--CEEEEEEEccCCCCeEEcc
Confidence            35566  6999999999999998863


No 35 
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where 
Probab=65.08  E-value=9.1  Score=26.57  Aligned_cols=23  Identities=17%  Similarity=0.342  Sum_probs=18.4

Q ss_pred             EEeCCCceeEEEEEeCCCCceeEec
Q 037565           95 ISIGTPRVELIGIADTASDLTWIQC  119 (218)
Q Consensus        95 i~iGtP~q~~~~~~DTGS~~~Wv~c  119 (218)
                      +.|.  .+++.+++|||.+.+-+.-
T Consensus         3 v~In--G~~~~fLvDTGA~~tii~~   25 (86)
T cd06095           3 ITVE--GVPIVFLVDTGATHSVLKS   25 (86)
T ss_pred             EEEC--CEEEEEEEECCCCeEEECH
Confidence            4444  5899999999999998853


No 36 
>PF12384 Peptidase_A2B:  Ty3 transposon peptidase;  InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=48.50  E-value=26  Score=28.09  Aligned_cols=28  Identities=14%  Similarity=0.233  Sum_probs=20.9

Q ss_pred             eEEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565           90 EYFVNISIGTPRVELIGIADTASDLTWIQC  119 (218)
Q Consensus        90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c  119 (218)
                      .+.+.+.  .-..++.++|||||....+..
T Consensus        34 T~~v~l~--~~~t~i~vLfDSGSPTSfIr~   61 (177)
T PF12384_consen   34 TAIVQLN--CKGTPIKVLFDSGSPTSFIRS   61 (177)
T ss_pred             EEEEEEe--ecCcEEEEEEeCCCccceeeh
Confidence            3444444  446899999999999988764


No 37 
>PF09668 Asp_protease:  Aspartyl protease;  InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure.  This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=35.25  E-value=78  Score=23.96  Aligned_cols=30  Identities=20%  Similarity=0.245  Sum_probs=21.1

Q ss_pred             CceEEEEEEeCCCceeEEEEEeCCCCceeEec
Q 037565           88 GGEYFVNISIGTPRVELIGIADTASDLTWIQC  119 (218)
Q Consensus        88 ~~~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c  119 (218)
                      ...+|+++.|.  .+++.+.+|||...+-+.-
T Consensus        22 v~mLyI~~~in--g~~vkA~VDtGAQ~tims~   51 (124)
T PF09668_consen   22 VSMLYINCKIN--GVPVKAFVDTGAQSTIMSK   51 (124)
T ss_dssp             ----EEEEEET--TEEEEEEEETT-SS-EEEH
T ss_pred             cceEEEEEEEC--CEEEEEEEeCCCCccccCH
Confidence            44689999999  5999999999998877753


No 38 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=34.61  E-value=43  Score=28.24  Aligned_cols=32  Identities=16%  Similarity=0.182  Sum_probs=22.8

Q ss_pred             ceEEEE---EEeCC---CceeEEEEEeCCCCceeEecC
Q 037565           89 GEYFVN---ISIGT---PRVELIGIADTASDLTWIQCK  120 (218)
Q Consensus        89 ~~Y~~~---i~iGt---P~q~~~~~~DTGS~~~Wv~c~  120 (218)
                      ..|.++   |.||.   +.....+++|||+.++.+|-.
T Consensus       157 ~~y~v~l~~i~vg~~~~~~~~~~~ivDTGTt~t~lp~~  194 (273)
T cd05475         157 KHYSPGPASLLFNGQPTGGKGLEVVFDSGSSYTYFNAQ  194 (273)
T ss_pred             CeEEEeEeEEEECCEECcCCCceEEEECCCceEEcCCc
Confidence            466655   57763   233467999999999999854


No 39 
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=33.63  E-value=58  Score=23.65  Aligned_cols=24  Identities=17%  Similarity=0.382  Sum_probs=18.2

Q ss_pred             EEEEeCCCc----eeEEEEEeCCCCcee
Q 037565           93 VNISIGTPR----VELIGIADTASDLTW  116 (218)
Q Consensus        93 ~~i~iGtP~----q~~~~~~DTGS~~~W  116 (218)
                      +++.|..|.    -++.+++|||.+..-
T Consensus         2 ~~v~~~~p~~~~~~~v~~LVDTGat~~~   29 (107)
T TIGR03698         2 LDVELSNPKNPEFMEVRALVDTGFSGFL   29 (107)
T ss_pred             EEEEEeCCCCCCceEEEEEEECCCCeEE
Confidence            567787772    368999999998653


No 40 
>PF05585 DUF1758:  Putative peptidase (DUF1758);  InterPro: IPR008737  This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases. 
Probab=29.54  E-value=31  Score=26.84  Aligned_cols=19  Identities=26%  Similarity=0.364  Sum_probs=15.8

Q ss_pred             ceeEEEEEeCCCCceeEec
Q 037565          101 RVELIGIADTASDLTWIQC  119 (218)
Q Consensus       101 ~q~~~~~~DTGS~~~Wv~c  119 (218)
                      .++..++||+||+..++.-
T Consensus        10 ~~~~~~LlDsGSq~SfIt~   28 (164)
T PF05585_consen   10 QVEARALLDSGSQRSFITE   28 (164)
T ss_pred             EEEEEEEEecCCchhHHhH
Confidence            4677899999999988854


No 41 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=28.14  E-value=44  Score=28.16  Aligned_cols=33  Identities=24%  Similarity=0.364  Sum_probs=22.7

Q ss_pred             CceEEEE---EEeCC----CceeEEEEEeCCCCceeEecC
Q 037565           88 GGEYFVN---ISIGT----PRVELIGIADTASDLTWIQCK  120 (218)
Q Consensus        88 ~~~Y~~~---i~iGt----P~q~~~~~~DTGS~~~Wv~c~  120 (218)
                      ...|.++   |.||.    ......+++|||+.++++|-.
T Consensus       176 ~~~w~v~l~~i~v~~~~~~~~~~~~~iiDSGTs~~~lP~~  215 (278)
T cd06097         176 SGFWQFTSTSYTVGGDAPWSRSGFSAIADTGTTLILLPDA  215 (278)
T ss_pred             CcEEEEEEeeEEECCcceeecCCceEEeecCCchhcCCHH
Confidence            3455544   35653    235678999999999999753


No 42 
>PF11645 PDDEXK_5:  PD-(D/E)XK endonuclease;  InterPro: IPR021671  This family are putative endonuclease proteins which are restricted to Synechocystis. ; PDB: 2OST_D.
Probab=27.14  E-value=41  Score=26.24  Aligned_cols=31  Identities=29%  Similarity=0.289  Sum_probs=25.2

Q ss_pred             eEEEEEEeCCCceeEEEEEeCCCCceeEecCC
Q 037565           90 EYFVNISIGTPRVELIGIADTASDLTWIQCKP  121 (218)
Q Consensus        90 ~Y~~~i~iGtP~q~~~~~~DTGS~~~Wv~c~~  121 (218)
                      .|.+.+.+|- .+++.|++|++..+.-|||+.
T Consensus        21 GY~V~~P~gD-n~~YDLV~d~eg~L~RIQvKT   51 (149)
T PF11645_consen   21 GYSVSIPFGD-NLKYDLVFDKEGILWRIQVKT   51 (149)
T ss_dssp             T-EEEEESST-TSS-SEEEEETTEEEEEEEEE
T ss_pred             CcEEEeecCC-CCCcCEEEecCCcEEEEEEee
Confidence            4888888887 679999999999999999974


No 43 
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=27.02  E-value=73  Score=19.92  Aligned_cols=20  Identities=15%  Similarity=0.315  Sum_probs=14.6

Q ss_pred             EEeCCCceeEEEEEeCCCCcee
Q 037565           95 ISIGTPRVELIGIADTASDLTW  116 (218)
Q Consensus        95 i~iGtP~q~~~~~~DTGS~~~W  116 (218)
                      +.++.  ..+.+++|||+...-
T Consensus         3 ~~~~~--~~~~~liDtgs~~~~   22 (92)
T cd00303           3 GKING--VPVRALVDSGASVNF   22 (92)
T ss_pred             EEECC--EEEEEEEcCCCcccc
Confidence            44443  788999999987643


No 44 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=26.40  E-value=33  Score=24.72  Aligned_cols=21  Identities=14%  Similarity=0.207  Sum_probs=9.9

Q ss_pred             ChhhHHHHHHHHHHHHhccccc
Q 037565            1 MAALISLLAICITATLQFVFPF   22 (218)
Q Consensus         1 M~~~~~~~~~~~~~~~~~~~~~   22 (218)
                      |++= .+|+++++|+.+++++.
T Consensus         1 MaSK-~~llL~l~LA~lLlisS   21 (95)
T PF07172_consen    1 MASK-AFLLLGLLLAALLLISS   21 (95)
T ss_pred             Cchh-HHHHHHHHHHHHHHHHh
Confidence            5522 34555555554444443


No 45 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=26.29  E-value=54  Score=28.33  Aligned_cols=31  Identities=23%  Similarity=0.204  Sum_probs=21.4

Q ss_pred             ceEEEE---EEeCCC-----ceeEEEEEeCCCCceeEec
Q 037565           89 GEYFVN---ISIGTP-----RVELIGIADTASDLTWIQC  119 (218)
Q Consensus        89 ~~Y~~~---i~iGtP-----~q~~~~~~DTGS~~~Wv~c  119 (218)
                      +.|.++   |.||..     .+...+++|||++++++|-
T Consensus       188 ~~w~v~l~~i~v~g~~~~~~~~~~~aivDTGTs~~~lP~  226 (317)
T cd06098         188 GYWQFEMGDVLIGGKSTGFCAGGCAAIADSGTSLLAGPT  226 (317)
T ss_pred             cEEEEEeCeEEECCEEeeecCCCcEEEEecCCcceeCCH
Confidence            445554   567642     2346799999999999874


No 46 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=25.86  E-value=53  Score=28.47  Aligned_cols=31  Identities=23%  Similarity=0.229  Sum_probs=22.0

Q ss_pred             ceEEEE---EEeCCC------ceeEEEEEeCCCCceeEec
Q 037565           89 GEYFVN---ISIGTP------RVELIGIADTASDLTWIQC  119 (218)
Q Consensus        89 ~~Y~~~---i~iGtP------~q~~~~~~DTGS~~~Wv~c  119 (218)
                      ..|.++   |.||..      .....+++|||+.+++++-
T Consensus       208 ~~y~v~l~~i~vg~~~~~~~~~~~~~aivDSGTs~~~lp~  247 (326)
T cd06096         208 YYYYVKLEGLSVYGTTSNSGNTKGLGMLVDSGSTLSHFPE  247 (326)
T ss_pred             ceEEEEEEEEEEcccccceecccCCCEEEeCCCCcccCCH
Confidence            456555   467743      2445789999999999975


No 47 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=24.64  E-value=56  Score=27.81  Aligned_cols=31  Identities=29%  Similarity=0.287  Sum_probs=21.2

Q ss_pred             ceEEEE---EEeCCCc--------eeEEEEEeCCCCceeEec
Q 037565           89 GEYFVN---ISIGTPR--------VELIGIADTASDLTWIQC  119 (218)
Q Consensus        89 ~~Y~~~---i~iGtP~--------q~~~~~~DTGS~~~Wv~c  119 (218)
                      ..|.++   |.||.-.        ....+++|||+.+++++-
T Consensus       146 ~~y~v~l~~i~vg~~~~~~~~~~~~~~~~ivDSGTt~~~lp~  187 (299)
T cd05472         146 TFYYVGLTGISVGGRRLPIPPASFGAGGVIIDSGTVITRLPP  187 (299)
T ss_pred             CeEEEeeEEEEECCEECCCCccccCCCCeEEeCCCcceecCH
Confidence            456655   4676421        223689999999999974


No 48 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=24.59  E-value=47  Score=27.50  Aligned_cols=34  Identities=24%  Similarity=0.297  Sum_probs=24.3

Q ss_pred             CceEEEE---EEeCC-----CceeEEEEEeCCCCceeEecCC
Q 037565           88 GGEYFVN---ISIGT-----PRVELIGIADTASDLTWIQCKP  121 (218)
Q Consensus        88 ~~~Y~~~---i~iGt-----P~q~~~~~~DTGS~~~Wv~c~~  121 (218)
                      ...|.+.   |.+|.     ......+++|||+..+|++-.-
T Consensus       179 ~~~~~v~l~~i~v~~~~~~~~~~~~~~iiDsGt~~~~lp~~~  220 (283)
T cd05471         179 PGYWQVPLDGISVGGKSVISSSGGGGAIVDSGTSLIYLPSSV  220 (283)
T ss_pred             CCEEEEEeCeEEECCceeeecCCCcEEEEecCCCCEeCCHHH
Confidence            4556554   45664     2456899999999999998643


No 49 
>PLN03146 aspartyl protease family protein; Provisional
Probab=23.61  E-value=90  Score=28.54  Aligned_cols=16  Identities=25%  Similarity=0.275  Sum_probs=13.9

Q ss_pred             EEEEeCCCCceeEecC
Q 037565          105 IGIADTASDLTWIQCK  120 (218)
Q Consensus       105 ~~~~DTGS~~~Wv~c~  120 (218)
                      .++||||+.+++++-.
T Consensus       309 ~~iiDSGTt~t~Lp~~  324 (431)
T PLN03146        309 NIIIDSGTTLTLLPSD  324 (431)
T ss_pred             cEEEeCCccceecCHH
Confidence            6899999999999753


No 50 
>PF03419 Peptidase_U4:  Sporulation factor SpoIIGA  This family belongs to family U4 of the peptidase classification.;  InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-).  Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=23.07  E-value=1.2e+02  Score=26.16  Aligned_cols=28  Identities=36%  Similarity=0.493  Sum_probs=24.0

Q ss_pred             CCceEEEEEEeCCCceeEEEEEeCCCCc
Q 037565           87 SGGEYFVNISIGTPRVELIGIADTASDL  114 (218)
Q Consensus        87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~  114 (218)
                      .+..|-+++.+|.-..+++..+|||..+
T Consensus       154 ~~~~~~v~i~~~~~~~~~~allDTGN~L  181 (293)
T PF03419_consen  154 KDYLYPVTIEIGGKKIELKALLDTGNQL  181 (293)
T ss_pred             CcEEEEEEEEECCEEEEEEEEEECCCcc
Confidence            3556899999998888899999999877


No 51 
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=22.51  E-value=2.9e+02  Score=19.64  Aligned_cols=20  Identities=15%  Similarity=0.174  Sum_probs=15.8

Q ss_pred             eeEEEEEeCCCCceeEecCC
Q 037565          102 VELIGIADTASDLTWIQCKP  121 (218)
Q Consensus       102 q~~~~~~DTGS~~~Wv~c~~  121 (218)
                      .....++|||+...-+|...
T Consensus         8 s~~~fLVDTGA~vSviP~~~   27 (89)
T cd06094           8 SGLRFLVDTGAAVSVLPASS   27 (89)
T ss_pred             CCcEEEEeCCCceEeecccc
Confidence            35688999999999887543


No 52 
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=20.57  E-value=1.3e+02  Score=25.92  Aligned_cols=28  Identities=25%  Similarity=0.408  Sum_probs=24.3

Q ss_pred             CCceEEEEEEeCCCceeEEEEEeCCCCc
Q 037565           87 SGGEYFVNISIGTPRVELIGIADTASDL  114 (218)
Q Consensus        87 ~~~~Y~~~i~iGtP~q~~~~~~DTGS~~  114 (218)
                      .+..|-+++.++.-..+++..+|||..+
T Consensus       155 ~~~~~~v~i~~~g~~~~~~alvDTGN~L  182 (288)
T TIGR02854       155 DKQIYELEICLDGKKVTIKGFLDTGNQL  182 (288)
T ss_pred             hceEEEEEEEECCEEEEEEEEEecCCcc
Confidence            3567999999998888899999999877


No 53 
>PF07496 zf-CW:  CW-type Zinc Finger;  InterPro: IPR011124 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a CW-type zinc finger motif, named for its conserved cysteine and tryptophan residues. It is predicted to be a highly specialised mononuclear four-cysteine (C4) zinc finger that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including chromatin methylation status and early embryonic development. Weak homology to members of IPR001965 from INTERPRO further evidences these predictions. The domain is found exclusively in vertebrates, vertebrate-infecting parasites and higher plants [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2RR4_A 2E61_A 2L7P_A.
Probab=20.46  E-value=54  Score=20.43  Aligned_cols=9  Identities=44%  Similarity=1.553  Sum_probs=5.9

Q ss_pred             eeEecCCCC
Q 037565          115 TWIQCKPCY  123 (218)
Q Consensus       115 ~Wv~c~~C~  123 (218)
                      .||||..|.
T Consensus         2 ~WVQCd~C~   10 (50)
T PF07496_consen    2 YWVQCDSCL   10 (50)
T ss_dssp             EEEE-TTT-
T ss_pred             eEEECCCCC
Confidence            599998775


No 54 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=20.11  E-value=96  Score=26.18  Aligned_cols=28  Identities=25%  Similarity=0.296  Sum_probs=20.2

Q ss_pred             EEEEeCCC----ceeEEEEEeCCCCceeEecC
Q 037565           93 VNISIGTP----RVELIGIADTASDLTWIQCK  120 (218)
Q Consensus        93 ~~i~iGtP----~q~~~~~~DTGS~~~Wv~c~  120 (218)
                      ..|.+|.=    .....+++|||+..+|++-.
T Consensus       185 ~~i~i~~~~~~~~~~~~~~~Dtgt~~i~lp~~  216 (317)
T PF00026_consen  185 DSISIGGESVFSSSGQQAILDTGTSYIYLPRS  216 (317)
T ss_dssp             EEEEETTEEEEEEEEEEEEEETTBSSEEEEHH
T ss_pred             ccccccccccccccceeeecccccccccccch
Confidence            33566642    23468999999999999864


No 55 
>PF01705 CX:  CX module;  InterPro: IPR002619 This domain has no known function. It is found in several Caenorhabditis elegans proteins. The domain contains 6 conserved cysteines that probably form three disulphide bridges.
Probab=20.08  E-value=84  Score=20.52  Aligned_cols=36  Identities=19%  Similarity=0.358  Sum_probs=24.1

Q ss_pred             CcceeEEEcCCCceEEEEEEEEEEEecCCCCCCCcccCCeEEEeEEcC
Q 037565          165 NTCQFYYSYADGSYTTGNVGTEVFVISSKTNPDQGTFPNIVYGCGNNN  212 (218)
Q Consensus       165 ~~c~~~~~Y~dgs~~~G~l~~D~~~~~~~~~~~~~~~~~~~FGc~~~~  212 (218)
                      ..|.|.+...|+...       .+.|.+.     ...+.+.|||...+
T Consensus        15 ~~C~y~i~~~d~el~-------~v~F~nG-----t~p~si~fgC~~~~   50 (61)
T PF01705_consen   15 NICEYQISEDDWELG-------NVTFPNG-----TRPKSIFFGCPSYE   50 (61)
T ss_pred             eEEEEECCCCCcccc-------ceEccCC-----CcceeEEEeCCCCC
Confidence            469999987665432       3455442     36789999998653


Done!