Query         043220
Match_columns 121
No_of_seqs    132 out of 586
Neff          6.8 
Searched_HMMs 46136
Date          Fri Mar 29 02:41:50 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/043220.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/043220hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF08284 RVP_2:  Retroviral asp  99.9 4.7E-26   1E-30  161.3   8.2   86   33-120    13-126 (135)
  2 cd05479 RP_DDI RP_DDI; retrope  99.9 3.3E-21 7.2E-26  134.2   8.7   82   38-120    13-121 (124)
  3 cd05484 retropepsin_like_LTR_2  99.6 6.8E-16 1.5E-20  101.6   6.6   65   43-107     2-91  (91)
  4 PF13975 gag-asp_proteas:  gag-  99.6 1.2E-14 2.6E-19   92.6   7.9   61   36-96      3-64  (72)
  5 TIGR02281 clan_AA_DTGA clan AA  99.6 2.1E-14 4.5E-19   99.8   9.3   83   36-118     6-116 (121)
  6 PF13650 Asp_protease_2:  Aspar  99.4 4.1E-13 8.8E-18   86.4   7.1   49   44-92      1-50  (90)
  7 cd05483 retropepsin_like_bacte  99.4 2.4E-12 5.2E-17   83.5   8.1   68   40-107     1-96  (96)
  8 TIGR03698 clan_AA_DTGF clan AA  99.4 2.8E-12 6.1E-17   87.4   7.1   67   51-120    15-106 (107)
  9 COG3577 Predicted aspartyl pro  99.2 4.9E-11 1.1E-15   89.7   9.1   87   25-115    93-207 (215)
 10 PF09668 Asp_protease:  Asparty  99.2 1.3E-11 2.7E-16   86.6   5.1   77   38-115    21-124 (124)
 11 PF00077 RVP:  Retroviral aspar  99.2 4.6E-11 9.9E-16   79.2   4.4   68   41-112     5-98  (100)
 12 cd06095 RP_RTVL_H_like Retrope  99.1 1.1E-10 2.5E-15   76.2   6.1   61   44-107     1-86  (86)
 13 cd00303 retropepsin_like Retro  99.1 7.5E-10 1.6E-14   67.6   7.0   63   45-107     2-92  (92)
 14 PF12384 Peptidase_A2B:  Ty3 tr  99.0 2.1E-09 4.5E-14   78.6   7.3   87   25-111    14-131 (177)
 15 cd05480 NRIP_C NRIP_C; putativ  98.8   6E-09 1.3E-13   70.5   4.7   71   45-117     2-102 (103)
 16 PF02160 Peptidase_A3:  Caulifl  98.5 2.9E-07 6.3E-12   69.3   6.2   75   42-118     7-111 (201)
 17 cd06094 RP_Saci_like RP_Saci_l  98.5 1.2E-07 2.6E-12   62.9   3.6   55   52-110     9-88  (89)
 18 KOG0012 DNA damage inducible p  98.4 2.5E-07 5.3E-12   74.7   3.8   82   37-119   231-339 (380)
 19 PF05585 DUF1758:  Putative pep  98.4 5.6E-07 1.2E-11   64.9   5.2   32   51-82     11-42  (164)
 20 cd05481 retropepsin_like_LTR_1  98.0 1.1E-05 2.4E-10   53.5   5.1   50   46-95      3-56  (93)
 21 COG5550 Predicted aspartyl pro  97.3 0.00077 1.7E-08   47.3   5.7   76   40-118    11-114 (125)
 22 cd05482 HIV_retropepsin_like R  94.6   0.038 8.2E-07   36.4   2.9   25   45-69      2-26  (87)
 23 cd05476 pepsin_A_like_plant Ch  94.4   0.087 1.9E-06   40.2   4.9   56   54-120   178-257 (265)
 24 cd06096 Plasmepsin_5 Plasmepsi  94.1    0.08 1.7E-06   41.7   4.3   65   52-120   231-317 (326)
 25 cd06097 Aspergillopepsin_like   91.9    0.46   1E-05   36.4   5.6   65   52-120   198-273 (278)
 26 cd05471 pepsin_like Pepsin-lik  91.9    0.32 6.9E-06   36.5   4.5   66   51-120   201-278 (283)
 27 PF12382 Peptidase_A2E:  Retrot  91.8    0.18 3.9E-06   34.6   2.8   40   42-81     35-76  (137)
 28 cd06098 phytepsin Phytepsin, a  89.7     1.8 3.9E-05   33.9   7.1   74   46-120   197-312 (317)
 29 cd05485 Cathepsin_D_like Cathe  86.5     3.3 7.1E-05   32.6   6.8   22   98-120   303-324 (329)
 30 PF00026 Asp:  Eukaryotic aspar  84.3    0.35 7.6E-06   37.0   0.3   67   52-120   199-311 (317)
 31 PF00455 DeoRC:  DeoR C termina  81.2     1.1 2.3E-05   32.2   1.8   63   54-116    22-95  (161)
 32 cd05470 pepsin_retropepsin_lik  81.0     1.7 3.6E-05   28.3   2.6   27   45-71      2-30  (109)
 33 cd05474 SAP_like SAPs, pepsin-  80.6     4.2 9.1E-05   31.0   5.1   25   43-67      4-30  (295)
 34 PTZ00013 plasmepsin 4 (PM4); P  75.4     6.1 0.00013   33.1   4.9   22   53-74    332-353 (450)
 35 PF00026 Asp:  Eukaryotic aspar  74.4     4.9 0.00011   30.6   3.9   26   43-68      3-30  (317)
 36 COG2383 Uncharacterized conser  73.6     0.6 1.3E-05   31.8  -1.2   21   99-119    50-70  (109)
 37 COG0282 ackA Acetate kinase [E  73.1     2.2 4.8E-05   35.4   1.7   37   60-96    179-215 (396)
 38 PTZ00147 plasmepsin-1; Provisi  69.8      23 0.00051   29.7   7.1   22   52-73    332-353 (453)
 39 cd06097 Aspergillopepsin_like   66.0     7.6 0.00016   29.6   3.3   26   44-69      3-30  (278)
 40 PRK10681 DNA-binding transcrip  64.6     3.1 6.7E-05   31.9   0.9   51   54-104    95-156 (252)
 41 cd06098 phytepsin Phytepsin, a  60.0      14  0.0003   28.9   3.8   28   42-69     11-40  (317)
 42 COG4067 Uncharacterized protei  59.0       8 0.00017   28.3   2.1   23   51-73     38-60  (162)
 43 cd05487 renin_like Renin stimu  58.3      14 0.00031   28.9   3.6   27   42-68      9-37  (326)
 44 cd05472 cnd41_like Chloroplast  57.6      11 0.00024   28.9   2.9   20   54-73    173-192 (299)
 45 cd05486 Cathespin_E Cathepsin   57.2      11 0.00025   29.2   2.9   29   46-74    186-220 (316)
 46 cd05478 pepsin_A Pepsin A, asp  57.1      14 0.00031   28.7   3.4   73   46-120   194-312 (317)
 47 cd05477 gastricsin Gastricsins  55.8      16 0.00035   28.4   3.5   72   47-120   189-312 (318)
 48 cd05475 nucellin_like Nucellin  54.7      17 0.00037   27.7   3.5   25   43-67      4-30  (273)
 49 cd05490 Cathepsin_D2 Cathepsin  54.4      18 0.00039   28.2   3.6   27   42-68      7-35  (325)
 50 cd05476 pepsin_A_like_plant Ch  53.8      15 0.00032   27.8   3.0   25   44-68      4-30  (265)
 51 PRK09802 DNA-binding transcrip  52.6     6.6 0.00014   30.5   0.9   53   54-106   109-172 (269)
 52 cd05478 pepsin_A Pepsin A, asp  52.0      22 0.00048   27.6   3.8   28   42-69     11-40  (317)
 53 cd06096 Plasmepsin_5 Plasmepsi  51.8      26 0.00055   27.5   4.1   28   42-69      4-33  (326)
 54 cd05477 gastricsin Gastricsins  51.0      24 0.00053   27.3   3.9   27   43-69      5-33  (318)
 55 PRK10411 DNA-binding transcrip  50.8       7 0.00015   29.9   0.7   55   54-109    96-161 (240)
 56 PRK10906 DNA-binding transcrip  49.6     7.8 0.00017   29.8   0.8   49   54-102    94-153 (252)
 57 cd05472 cnd41_like Chloroplast  49.5      19 0.00041   27.6   3.0   24   44-67      4-29  (299)
 58 cd05488 Proteinase_A_fungi Fun  49.5      24 0.00051   27.5   3.6   28   42-69     11-40  (320)
 59 cd05473 beta_secretase_like Be  49.1      23 0.00049   28.2   3.5   27   43-69      5-33  (364)
 60 PTZ00013 plasmepsin 4 (PM4); P  48.9      25 0.00053   29.6   3.8   29   43-71    140-170 (450)
 61 cd05473 beta_secretase_like Be  48.6      40 0.00087   26.8   4.8   20   54-73    213-232 (364)
 62 cd05486 Cathespin_E Cathepsin   48.4      23 0.00049   27.6   3.3   26   44-69      3-30  (316)
 63 PTZ00147 plasmepsin-1; Provisi  48.1      27 0.00058   29.3   3.9   28   42-69    140-169 (453)
 64 cd05485 Cathepsin_D_like Cathe  47.2      27 0.00059   27.4   3.6   29   41-69     11-41  (329)
 65 PF03419 Peptidase_U4:  Sporula  46.7      35 0.00076   26.7   4.1   35   40-74    156-201 (293)
 66 TIGR02854 spore_II_GA sigma-E   45.6      35 0.00077   26.8   4.0   25   38-62    155-181 (288)
 67 PF04930 FUN14:  FUN14 family;   42.5     7.8 0.00017   25.7  -0.1   20  100-119    31-50  (100)
 68 cd06396 PB1_NBR1 The PB1 domai  42.4      59  0.0013   21.1   4.0   44   42-90      1-46  (81)
 69 cd05490 Cathepsin_D2 Cathepsin  42.4      19 0.00042   28.0   2.1   22   53-74    207-228 (325)
 70 PRK10434 srlR DNA-bindng trans  39.7      15 0.00033   28.2   1.1   50   54-103    94-155 (256)
 71 cd05471 pepsin_like Pepsin-lik  39.2      35 0.00075   25.3   3.0   28   45-72      4-33  (283)
 72 cd05488 Proteinase_A_fungi Fun  38.7      22 0.00047   27.7   1.9   22   53-74    206-227 (320)
 73 PF14543 TAXi_N:  Xylanase inhi  35.3      60  0.0013   23.0   3.6   23   44-66      3-27  (164)
 74 PTZ00165 aspartyl protease; Pr  35.1      58  0.0013   27.6   3.9   31   42-72    121-153 (482)
 75 cd05487 renin_like Renin stimu  34.6      29 0.00063   27.1   2.0   22   98-120   299-320 (326)
 76 cd05474 SAP_like SAPs, pepsin-  34.5      29 0.00062   26.3   1.9   22   53-74    179-200 (295)
 77 PF04126 Cyclophil_like:  Cyclo  31.3      33  0.0007   23.5   1.6   30   42-76      1-31  (120)
 78 PRK13509 transcriptional repre  29.9      26 0.00055   26.9   0.9   49   54-103    96-155 (251)
 79 COG1349 GlpR Transcriptional r  28.8      35 0.00076   26.2   1.5   51   54-104    94-155 (253)
 80 PLN03146 aspartyl protease fam  28.8      59  0.0013   26.9   2.9   17   54-70    309-325 (431)
 81 PLN03146 aspartyl protease fam  27.6      78  0.0017   26.2   3.5   27   41-67     84-112 (431)
 82 PF08922 DUF1905:  Domain of un  26.9 1.1E+02  0.0024   19.3   3.4   45   40-86     31-78  (80)
 83 PF05618 Zn_protease:  Putative  26.8      50  0.0011   23.3   1.9   23   50-72     14-36  (138)
 84 COG2861 Uncharacterized protei  26.4      55  0.0012   25.7   2.2   28   52-79    150-177 (250)
 85 cd05489 xylanase_inhibitor_I_l  25.7      70  0.0015   25.7   2.8   20   54-73    231-250 (362)
 86 PRK12440 acetate kinase; Revie  25.6      31 0.00068   28.7   0.8   44   60-103   180-229 (397)
 87 cd01813 UBP_N UBP ubiquitin pr  24.5 1.4E+02  0.0029   18.4   3.4   38   42-79      1-38  (74)
 88 smart00295 B41 Band 4.1 homolo  24.0 2.1E+02  0.0046   20.1   4.8   30   49-78     12-41  (207)
 89 PF14541 TAXi_C:  Xylanase inhi  23.7      50  0.0011   23.1   1.4   19   54-72     31-49  (161)
 90 cd06407 PB1_NLP A PB1 domain i  23.6 1.3E+02  0.0027   19.3   3.2   46   42-89      1-46  (82)
 91 PF04746 DUF575:  Protein of un  23.3      38 0.00082   22.8   0.7   13   97-109    27-39  (101)
 92 PF00564 PB1:  PB1 domain;  Int  23.2 1.8E+02   0.004   17.6   4.5   46   41-89      1-47  (84)
 93 PRK12379 propionate/acetate ki  22.5      37 0.00081   28.3   0.7   45   60-104   176-226 (396)
 94 PF03539 Spuma_A9PTase:  Spumav  22.1      83  0.0018   23.0   2.3   26   48-73      1-26  (163)
 95 cd01812 BAG1_N Ubiquitin-like   20.7 1.8E+02   0.004   16.9   3.4   38   42-79      1-38  (71)
 96 PRK12397 propionate kinase; Re  20.6      38 0.00083   28.3   0.4   45   60-104   180-230 (404)
 97 COG3865 Uncharacterized protei  20.4 2.4E+02  0.0053   20.4   4.4   25   40-65     49-73  (151)

No 1  
>PF08284 RVP_2:  Retroviral aspartyl protease;  InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases. 
Probab=99.93  E-value=4.7e-26  Score=161.33  Aligned_cols=86  Identities=24%  Similarity=0.422  Sum_probs=73.5

Q ss_pred             eecccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCC-CCCeEEEecCCceecc---------------
Q 043220           33 VKCGQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMP-TKPFSVRIANGAHLKS---------------   96 (121)
Q Consensus        33 l~g~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~-~~~~~V~va~G~~l~~---------------   96 (121)
                      -.+...|..|+.++.|+++++.+|||||||||||++++|++++++..+ ..++.|..++| .+.|               
T Consensus        13 ~~~~~~~~vi~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~V~~~g~-~~~~~~~~~~~~~~i~g~~   91 (135)
T PF08284_consen   13 EEAEESPDVITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIVVSAPGG-SINCEGVCPDVPLSIQGHE   91 (135)
T ss_pred             hcccCCCCeEEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeEEecccc-cccccceeeeEEEEECCeE
Confidence            344567889999999999999999999999999999999999999885 46777775544 3332               


Q ss_pred             ------------ceEeecchhhhhcCCeeeecccCC
Q 043220           97 ------------PDVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        97 ------------~dvILG~~WL~~~~p~~idw~~~~  120 (121)
                                  ||+|||||||++|+| .|||.+++
T Consensus        92 ~~~dl~vl~l~~~DvILGm~WL~~~~~-~IDw~~k~  126 (135)
T PF08284_consen   92 FVVDLLVLDLGGYDVILGMDWLKKHNP-VIDWATKT  126 (135)
T ss_pred             EEeeeEEecccceeeEeccchHHhCCC-EEEccCCE
Confidence                        999999999999999 59999875


No 2  
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.85  E-value=3.3e-21  Score=134.20  Aligned_cols=82  Identities=15%  Similarity=0.083  Sum_probs=68.8

Q ss_pred             ccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc---------------------
Q 043220           38 LRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS---------------------   96 (121)
Q Consensus        38 ~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~---------------------   96 (121)
                      ....|++.+.|||+++.+|||||||||||++++|+|+|++.....++.+.++++....+                     
T Consensus        13 ~~~~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~i~~~~~~~~~~   92 (124)
T cd05479          13 KVPMLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVKIGNLFLPCSFT   92 (124)
T ss_pred             eeeEEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEEECCEEeeeEEE
Confidence            45678999999999999999999999999999999999987666666666665332211                     


Q ss_pred             ------ceEeecchhhhhcCCeeeecccCC
Q 043220           97 ------PDVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        97 ------~dvILG~~WL~~~~p~~idw~~~~  120 (121)
                            +|+|||||||++++. .|||++++
T Consensus        93 Vl~~~~~d~ILG~d~L~~~~~-~ID~~~~~  121 (124)
T cd05479          93 VLEDDDVDFLIGLDMLKRHQC-VIDLKENV  121 (124)
T ss_pred             EECCCCcCEEecHHHHHhCCe-EEECCCCE
Confidence                  899999999999996 69999874


No 3  
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.63  E-value=6.8e-16  Score=101.59  Aligned_cols=65  Identities=20%  Similarity=0.254  Sum_probs=58.7

Q ss_pred             EEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-------------------------c
Q 043220           43 VVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS-------------------------P   97 (121)
Q Consensus        43 ~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~-------------------------~   97 (121)
                      .+++.|||+++.+||||||++|||+++.+++++++.......++..|+|..+.+                         +
T Consensus         2 ~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G~~~~~~G~~~~~v~~~~~~~~~~~~v~~~~~   81 (91)
T cd05484           2 TVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATGTKLSVLGQILVTVKYGGKTKVLTLYVVKNEG   81 (91)
T ss_pred             EEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCCCEeeEeEEEEEEEEECCEEEEEEEEEEECCC
Confidence            578999999999999999999999999999999987666789999999987643                         8


Q ss_pred             eEeecchhhh
Q 043220           98 DVALDVKWSE  107 (121)
Q Consensus        98 dvILG~~WL~  107 (121)
                      +.|||++||.
T Consensus        82 ~~lLG~~wl~   91 (91)
T cd05484          82 LNLLGRDWLD   91 (91)
T ss_pred             CCccChhhcC
Confidence            9999999984


No 4  
>PF13975 gag-asp_proteas:  gag-polyprotein putative aspartyl protease
Probab=99.58  E-value=1.2e-14  Score=92.56  Aligned_cols=61  Identities=30%  Similarity=0.433  Sum_probs=56.3

Q ss_pred             ccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCC-CeEEEecCCceecc
Q 043220           36 GQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTK-PFSVRIANGAHLKS   96 (121)
Q Consensus        36 ~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~-~~~V~va~G~~l~~   96 (121)
                      ...++++++.+.|+|+.+.+|||||||||||+.++|+||+++..+.. +.+|++|||....+
T Consensus         3 ~~~~g~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~~~~~v~~a~g~~~~~   64 (72)
T PF13975_consen    3 TPDPGLMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPPSPIRVKLANGSVIEI   64 (72)
T ss_pred             cccCCEEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCCCCEEEEECCCCcccc
Confidence            45788999999999999999999999999999999999999988766 99999999987765


No 5  
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.57  E-value=2.1e-14  Score=99.84  Aligned_cols=83  Identities=16%  Similarity=0.165  Sum_probs=70.5

Q ss_pred             ccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCC-CCCeEEEecCCceecc------------------
Q 043220           36 GQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMP-TKPFSVRIANGAHLKS------------------   96 (121)
Q Consensus        36 ~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~-~~~~~V~va~G~~l~~------------------   96 (121)
                      ....+++.+.++|||+++.+|||||||+++|++++|+++++.... ..+..+..|+|.....                  
T Consensus         6 ~~~~g~~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~~~~~~~ta~G~~~~~~~~l~~l~iG~~~~~nv~   85 (121)
T TIGR02281         6 KDGDGHFYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLGYTVTVSTANGQIKAARVTLDRVAIGGIVVNDVD   85 (121)
T ss_pred             EcCCCeEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCCceEEEEeCCCcEEEEEEEeCEEEECCEEEeCcE
Confidence            356789999999999999999999999999999999999987654 4578889999974321                  


Q ss_pred             ---------ceEeecchhhhhcCCeeeeccc
Q 043220           97 ---------PDVALDVKWSEQIGSIMRGWKK  118 (121)
Q Consensus        97 ---------~dvILG~~WL~~~~p~~idw~~  118 (121)
                               .+.+|||+||.++.++.+|-.+
T Consensus        86 ~~v~~~~~~~~~LLGm~fL~~~~~~~~~~~~  116 (121)
T TIGR02281        86 AMVAEGGALSESLLGMSFLNRLSRFTVRGGK  116 (121)
T ss_pred             EEEeCCCcCCceEcCHHHHhccccEEEECCE
Confidence                     4799999999999988777543


No 6  
>PF13650 Asp_protease_2:  Aspartyl protease
Probab=99.44  E-value=4.1e-13  Score=86.37  Aligned_cols=49  Identities=33%  Similarity=0.432  Sum_probs=43.7

Q ss_pred             EEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCC-CeEEEecCCc
Q 043220           44 VATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTK-PFSVRIANGA   92 (121)
Q Consensus        44 v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~-~~~V~va~G~   92 (121)
                      |+++|||+++.+||||||+.++|+++++++++++..... +..+..++|.
T Consensus         1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~g~   50 (90)
T PF13650_consen    1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKSVPISVSGAGGS   50 (90)
T ss_pred             CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCceeEEEEeCCCC
Confidence            468999999999999999999999999999999877553 6788888887


No 7  
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.39  E-value=2.4e-12  Score=83.52  Aligned_cols=68  Identities=25%  Similarity=0.344  Sum_probs=57.1

Q ss_pred             CeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-----------------------
Q 043220           40 KFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS-----------------------   96 (121)
Q Consensus        40 ~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~-----------------------   96 (121)
                      +.+.+++.||++++.+||||||++++|+.+++++++++.....+..+..++|....+                       
T Consensus         1 ~~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~~~~~~~G~~~~~~~~~~~i~ig~~~~~~~~~~v~d   80 (96)
T cd05483           1 GHFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKVTVQTANGRVRAARVRLDSLQIGGITLRNVPAVVLP   80 (96)
T ss_pred             CcEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcEEEEecCCCccceEEEcceEEECCcEEeccEEEEeC
Confidence            357899999999999999999999999999999998855555677888888864322                       


Q ss_pred             -----ceEeecchhhh
Q 043220           97 -----PDVALDVKWSE  107 (121)
Q Consensus        97 -----~dvILG~~WL~  107 (121)
                           .|.|||++||+
T Consensus        81 ~~~~~~~gIlG~d~l~   96 (96)
T cd05483          81 GDALGVDGLLGMDFLR   96 (96)
T ss_pred             CcccCCceEeChHHhC
Confidence                 67999999985


No 8  
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=99.35  E-value=2.8e-12  Score=87.44  Aligned_cols=67  Identities=21%  Similarity=0.340  Sum_probs=56.8

Q ss_pred             EEEEEEecCCCcccc-cCHHHHhhcCCCCCCCCCeEEEecCCceecc------------------------ceEeecchh
Q 043220           51 HEVVILISSGSTANF-VSDKVAALLPLPMMPTKPFSVRIANGAHLKS------------------------PDVALDVKW  105 (121)
Q Consensus        51 ~~v~~LIDSGAT~nF-I~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------------------------~dvILG~~W  105 (121)
                      -++.+|||||||+.+ |++++|+++|++..  ....+.+|||.....                        .+++|||.|
T Consensus        15 ~~v~~LVDTGat~~~~l~~~~a~~lgl~~~--~~~~~~tA~G~~~~~~v~~~~v~igg~~~~~~v~~~~~~~~~LLG~~~   92 (107)
T TIGR03698        15 MEVRALVDTGFSGFLLVPPDIVNKLGLPEL--DQRRVYLADGREVLTDVAKASIIINGLEIDAFVESLGYVDEPLLGTEL   92 (107)
T ss_pred             eEEEEEEECCCCeEEecCHHHHHHcCCCcc--cCcEEEecCCcEEEEEEEEEEEEECCEEEEEEEEecCCCCccEecHHH
Confidence            489999999999998 99999999999764  367899999963321                        578999999


Q ss_pred             hhhcCCeeeecccCC
Q 043220          106 SEQIGSIMRGWKKDE  120 (121)
Q Consensus       106 L~~~~p~~idw~~~~  120 (121)
                      |++++. .+||++++
T Consensus        93 L~~l~l-~id~~~~~  106 (107)
T TIGR03698        93 LEGLGI-VIDYRNQG  106 (107)
T ss_pred             HhhCCE-EEehhhCc
Confidence            999986 59999875


No 9  
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.25  E-value=4.9e-11  Score=89.67  Aligned_cols=87  Identities=15%  Similarity=0.172  Sum_probs=75.3

Q ss_pred             cceEEEeeeecccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC-CCCCeEEEecCCceecc-------
Q 043220           25 ENSRSLFQVKCGQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM-PTKPFSVRIANGAHLKS-------   96 (121)
Q Consensus        25 ~~~isl~Al~g~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~-~~~~~~V~va~G~~l~~-------   96 (121)
                      .-+++|++    ...+|+...+.|||+++.+|||||||.-.+++.-|+|+|+... ...++.|.+|||+..-.       
T Consensus        93 ~~~v~Lak----~~~GHF~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG~~~AA~V~Ld~v  168 (215)
T COG3577          93 YQEVSLAK----SRDGHFEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANGRARAAPVTLDRV  168 (215)
T ss_pred             ceEEEEEe----cCCCcEEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCCccccceEEeeeE
Confidence            45788877    8889999999999999999999999999999999999999876 56899999999974321       


Q ss_pred             --------------------ceEeecchhhhhcCCeeee
Q 043220           97 --------------------PDVALDVKWSEQIGSIMRG  115 (121)
Q Consensus        97 --------------------~dvILG~~WL~~~~p~~id  115 (121)
                                          -...|||+||.+++-...+
T Consensus       169 ~IG~I~~~nV~A~V~~~g~L~~sLLGMSfL~rL~~fq~~  207 (215)
T COG3577         169 QIGGIRVKNVDAMVAEDGALDESLLGMSFLNRLSGFQVD  207 (215)
T ss_pred             EEccEEEcCchhheecCCccchhhhhHHHHhhccceEec
Confidence                                5678999999999876543


No 10 
>PF09668 Asp_protease:  Aspartyl protease;  InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure.  This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=99.23  E-value=1.3e-11  Score=86.60  Aligned_cols=77  Identities=17%  Similarity=0.160  Sum_probs=50.3

Q ss_pred             ccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeE-EEecCC-----------------ceecc---
Q 043220           38 LRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFS-VRIANG-----------------AHLKS---   96 (121)
Q Consensus        38 ~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~-V~va~G-----------------~~l~~---   96 (121)
                      ....+.+..+|||++++|+|||||.+|.|+.++|+|+|+.....+.+. +..+-|                 ..+.|   
T Consensus        21 ~v~mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~G~Ih~~~l~ig~~~~~~s~~  100 (124)
T PF09668_consen   21 QVSMLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKILGRIHSVQLKIGGLFFPCSFT  100 (124)
T ss_dssp             -----EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEEEEEEEEEEEETTEEEEEEEE
T ss_pred             CcceEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCceeEEEEEEEEEECCEEEEEEEE
Confidence            345678999999999999999999999999999999998632222221 111112                 12222   


Q ss_pred             ------ceEeecchhhhhcCCeeee
Q 043220           97 ------PDVALDVKWSEQIGSIMRG  115 (121)
Q Consensus        97 ------~dvILG~~WL~~~~p~~id  115 (121)
                            .|++||.|||++|..+ ||
T Consensus       101 Vle~~~~d~llGld~L~~~~c~-ID  124 (124)
T PF09668_consen  101 VLEDQDVDLLLGLDMLKRHKCC-ID  124 (124)
T ss_dssp             EETTSSSSEEEEHHHHHHTT-E-EE
T ss_pred             EeCCCCcceeeeHHHHHHhCcc-cC
Confidence                  8999999999999884 65


No 11 
>PF00077 RVP:  Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026;  InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=99.15  E-value=4.6e-11  Score=79.16  Aligned_cols=68  Identities=21%  Similarity=0.322  Sum_probs=55.8

Q ss_pred             eEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------------------------
Q 043220           41 FCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------------------------   96 (121)
Q Consensus        41 ~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------------------------   96 (121)
                      .-++..+++|+++.+||||||+.++|+++...++..+  ...+..+..++|.. .+                        
T Consensus         5 rp~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~--~~~~~~v~~~~g~~-~~~~~~~~~v~~~~~~~~~~~~v~~~   81 (100)
T PF00077_consen    5 RPYITVKINGKKIKALLDTGADVSIISEKDWKKLGPP--PKTSITVRGAGGSS-SILGSTTVEVKIGGKEFNHTFLVVPD   81 (100)
T ss_dssp             SSEEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSE--EEEEEEEEETTEEE-EEEEEEEEEEEETTEEEEEEEEESST
T ss_pred             CceEEEeECCEEEEEEEecCCCcceeccccccccccc--ccCCceeccCCCcc-eeeeEEEEEEEEECccceEEEEecCC
Confidence            3467889999999999999999999999998877654  45567788888865 21                        


Q ss_pred             --ceEeecchhhhhcCCe
Q 043220           97 --PDVALDVKWSEQIGSI  112 (121)
Q Consensus        97 --~dvILG~~WL~~~~p~  112 (121)
                        .| |||.|||++++..
T Consensus        82 ~~~~-ILG~D~L~~~~~~   98 (100)
T PF00077_consen   82 LPMN-ILGRDFLKKLNAV   98 (100)
T ss_dssp             CSSE-EEEHHHHTTTTCE
T ss_pred             CCCC-EeChhHHHHcCCE
Confidence              67 9999999999873


No 12 
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where 
Probab=99.14  E-value=1.1e-10  Score=76.19  Aligned_cols=61  Identities=18%  Similarity=0.294  Sum_probs=49.6

Q ss_pred             EEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCce---ecc----------------------ce
Q 043220           44 VATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAH---LKS----------------------PD   98 (121)
Q Consensus        44 v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~---l~~----------------------~d   98 (121)
                      +++.|||+++.+|+||||+++.|++..++++   .....+..+..++|..   +..                      .+
T Consensus         1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~---~~~~~~~~v~gagG~~~~~v~~~~~~v~vg~~~~~~~~~v~~~~~~   77 (86)
T cd06095           1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQ---ELSTTSVLIRGVSGQSQQPVTTYRTLVDLGGHTVSHSFLVVPNCPD   77 (86)
T ss_pred             CEEEECCEEEEEEEECCCCeEEECHHHhhhc---cCCCCcEEEEeCCCcccccEEEeeeEEEECCEEEEEEEEEEcCCCC
Confidence            4678999999999999999999999999997   2334677888888863   210                      78


Q ss_pred             Eeecchhhh
Q 043220           99 VALDVKWSE  107 (121)
Q Consensus        99 vILG~~WL~  107 (121)
                      .|||||||.
T Consensus        78 ~lLG~dfL~   86 (86)
T cd06095          78 PLLGRDLLS   86 (86)
T ss_pred             cEechhhcC
Confidence            899999984


No 13 
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=99.07  E-value=7.5e-10  Score=67.60  Aligned_cols=63  Identities=40%  Similarity=0.557  Sum_probs=48.6

Q ss_pred             EEEECCEEEEEEecCCCcccccCHHHHhhcCC-CCCCCCCeEEEecCCceec---------------------------c
Q 043220           45 ATKIRFHEVVILISSGSTANFVSDKVAALLPL-PMMPTKPFSVRIANGAHLK---------------------------S   96 (121)
Q Consensus        45 ~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l-~~~~~~~~~V~va~G~~l~---------------------------~   96 (121)
                      .+.++++++.+|+|+||++++++..++.++++ ......+..+..++|....                           .
T Consensus         2 ~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~   81 (92)
T cd00303           2 KGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLKVKGANGSSVKTLGVILPVTIGIGGKTFTVDFYVLDLLS   81 (92)
T ss_pred             EEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceEEEecCCCEeccCcEEEEEEEEeCCEEEEEEEEEEcCCC
Confidence            45678899999999999999999999999987 4444455666666553211                           1


Q ss_pred             ceEeecchhhh
Q 043220           97 PDVALDVKWSE  107 (121)
Q Consensus        97 ~dvILG~~WL~  107 (121)
                      +|+|||++||.
T Consensus        82 ~~~ilG~~~l~   92 (92)
T cd00303          82 YDVILGRPWLE   92 (92)
T ss_pred             cCEEecccccC
Confidence            88999999984


No 14 
>PF12384 Peptidase_A2B:  Ty3 transposon peptidase;  InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=98.98  E-value=2.1e-09  Score=78.58  Aligned_cols=87  Identities=17%  Similarity=0.252  Sum_probs=70.0

Q ss_pred             cceEEEeeeec-c---cccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCce-ecc---
Q 043220           25 ENSRSLFQVKC-G---QLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAH-LKS---   96 (121)
Q Consensus        25 ~~~isl~Al~g-~---~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~-l~~---   96 (121)
                      .+.+.+.-+.- .   ...++..+...++|.++.+|+||||..+||+..++++|+++.....|++++.+.+.. ..|   
T Consensus        14 ~~~~~~~PIV~~~~~Pevg~T~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~vs~~~~~tsEA   93 (177)
T PF12384_consen   14 KSSITTCPIVHYIAIPEVGKTAIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGFVSGESATTSEA   93 (177)
T ss_pred             cceeeeeeeEEEeeccccCcEEEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeeeccCCceEEEEe
Confidence            45555555443 2   344688999999999999999999999999999999999999999998887664431 112   


Q ss_pred             -----------------------ceEeecchhhhhcCC
Q 043220           97 -----------------------PDVALDVKWSEQIGS  111 (121)
Q Consensus        97 -----------------------~dvILG~~WL~~~~p  111 (121)
                                             +|+|+|.+.|+++..
T Consensus        94 v~ld~~i~n~~i~i~aYV~d~m~~dlIIGnPiL~ryp~  131 (177)
T PF12384_consen   94 VTLDFYIDNKLIDIAAYVTDNMDHDLIIGNPILDRYPT  131 (177)
T ss_pred             EEEEEEECCeEEEEEEEEeccCCcceEeccHHHhhhHH
Confidence                                   999999999999854


No 15 
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The  C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=98.82  E-value=6e-09  Score=70.49  Aligned_cols=71  Identities=11%  Similarity=0.148  Sum_probs=53.3

Q ss_pred             EEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC-CCCCe--------------------EEEecCCceecc-------
Q 043220           45 ATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM-PTKPF--------------------SVRIANGAHLKS-------   96 (121)
Q Consensus        45 ~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~-~~~~~--------------------~V~va~G~~l~~-------   96 (121)
                      ..++||++++|+|||||-+|.||+.+|+|.|+.-. ...++                    .+++++ ..+.|       
T Consensus         2 nCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~-~~~~CSftVld~   80 (103)
T cd05480           2 SCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQ-LTVECSAQVVDD   80 (103)
T ss_pred             ceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCC-EEeeEEEEEEcC
Confidence            46899999999999999999999999999998632 11111                    111211 23334       


Q ss_pred             --ceEeecchhhhhcCCeeeecc
Q 043220           97 --PDVALDVKWSEQIGSIMRGWK  117 (121)
Q Consensus        97 --~dvILG~~WL~~~~p~~idw~  117 (121)
                        .|++||-|-|++|.. .||.+
T Consensus        81 ~~~d~llGLdmLkrhqc-~IdL~  102 (103)
T cd05480          81 NEKNFSLGLQTLKSLKC-VINLE  102 (103)
T ss_pred             CCcceEeeHHHHhhcce-eeecc
Confidence              999999999999998 48865


No 16 
>PF02160 Peptidase_A3:  Cauliflower mosaic virus peptidase (A3);  InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=98.50  E-value=2.9e-07  Score=69.29  Aligned_cols=75  Identities=17%  Similarity=0.145  Sum_probs=53.6

Q ss_pred             EEEEEEECC---EEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceec----c------------------
Q 043220           42 CVVATKIRF---HEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLK----S------------------   96 (121)
Q Consensus        42 ~~v~g~i~~---~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~----~------------------   96 (121)
                      ++++..+.|   ..+.++||||||.+.++....=.-.. .....++.|+.||+....    |                  
T Consensus         7 I~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~w-e~~~~~i~v~~an~~~~~i~~~~~~~~i~I~~~~F~IP~iY   85 (201)
T PF02160_consen    7 IKVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEYW-EKSKKPIKVKGANGSIIQINKKAKNGKIQIADKIFRIPTIY   85 (201)
T ss_pred             EEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHHH-HhCCCcEEEEEecCCceEEEEEecCceEEEccEEEeccEEE
Confidence            344445555   45788999999999988766421110 123457899999987432    1                  


Q ss_pred             -----ceEeecchhhhhcCCeeeeccc
Q 043220           97 -----PDVALDVKWSEQIGSIMRGWKK  118 (121)
Q Consensus        97 -----~dvILG~~WL~~~~p~~idw~~  118 (121)
                           .|+|||++|++.+.|. +.|..
T Consensus        86 q~~~g~d~IlG~NF~r~y~Pf-iq~~~  111 (201)
T PF02160_consen   86 QQESGIDIILGNNFLRLYEPF-IQTED  111 (201)
T ss_pred             EecCCCCEEecchHHHhcCCc-EEEcc
Confidence                 9999999999999997 88864


No 17 
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=98.50  E-value=1.2e-07  Score=62.89  Aligned_cols=55  Identities=16%  Similarity=0.414  Sum_probs=45.8

Q ss_pred             EEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-------------------------ceEeecchhh
Q 043220           52 EVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS-------------------------PDVALDVKWS  106 (121)
Q Consensus        52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~-------------------------~dvILG~~WL  106 (121)
                      .+.+||||||.+|.|.+...++.    ....++.++.|||+.+.+                         ...|||.|+|
T Consensus         9 ~~~fLVDTGA~vSviP~~~~~~~----~~~~~~~l~AANgt~I~tyG~~~l~ldlGlrr~~~w~FvvAdv~~pIlGaDfL   84 (89)
T cd06094           9 GLRFLVDTGAAVSVLPASSTKKS----LKPSPLTLQAANGTPIATYGTRSLTLDLGLRRPFAWNFVVADVPHPILGADFL   84 (89)
T ss_pred             CcEEEEeCCCceEeecccccccc----ccCCceEEEeCCCCeEeeeeeEEEEEEcCCCcEEeEEEEEcCCCcceecHHHH
Confidence            46799999999999998887763    345678999999998754                         5569999999


Q ss_pred             hhcC
Q 043220          107 EQIG  110 (121)
Q Consensus       107 ~~~~  110 (121)
                      ++|+
T Consensus        85 ~~~~   88 (89)
T cd06094          85 QHYG   88 (89)
T ss_pred             HHcC
Confidence            9986


No 18 
>KOG0012 consensus DNA damage inducible protein [Replication, recombination and repair]
Probab=98.40  E-value=2.5e-07  Score=74.65  Aligned_cols=82  Identities=17%  Similarity=0.133  Sum_probs=60.9

Q ss_pred             cccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCe-EEEecCC-----------------ceecc--
Q 043220           37 QLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPF-SVRIANG-----------------AHLKS--   96 (121)
Q Consensus        37 ~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~-~V~va~G-----------------~~l~~--   96 (121)
                      .....+.|..+|||++|+|+|||||-.|-||..+|+|.|+.-...+.+ -+.-+.|                 ..+.|  
T Consensus       231 ~~v~ML~iN~~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~c~f  310 (380)
T KOG0012|consen  231 TQVTMLYINCEINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLPCSF  310 (380)
T ss_pred             ccceEEEEEEEECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeeccce
Confidence            344456789999999999999999999999999999999864322111 1111111                 12234  


Q ss_pred             -------ceEeecchhhhhcCCeeeecccC
Q 043220           97 -------PDVALDVKWSEQIGSIMRGWKKD  119 (121)
Q Consensus        97 -------~dvILG~~WL~~~~p~~idw~~~  119 (121)
                             .|+.||-+-|++|.. .||.++.
T Consensus       311 tV~d~~~~d~llGLd~Lrr~~c-cIdL~~~  339 (380)
T KOG0012|consen  311 TVLDRRDMDLLLGLDMLRRHQC-CIDLKTN  339 (380)
T ss_pred             EEecCCCcchhhhHHHHHhccc-eeecccC
Confidence                   899999999999999 4998764


No 19 
>PF05585 DUF1758:  Putative peptidase (DUF1758);  InterPro: IPR008737  This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases. 
Probab=98.39  E-value=5.6e-07  Score=64.88  Aligned_cols=32  Identities=34%  Similarity=0.478  Sum_probs=28.1

Q ss_pred             EEEEEEecCCCcccccCHHHHhhcCCCCCCCC
Q 043220           51 HEVVILISSGSTANFVSDKVAALLPLPMMPTK   82 (121)
Q Consensus        51 ~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~   82 (121)
                      +.+.+|+||||..|||.+++|++|+|+.....
T Consensus        11 ~~~~~LlDsGSq~SfIt~~la~~L~L~~~~~~   42 (164)
T PF05585_consen   11 VEARALLDSGSQRSFITESLANKLNLPGTGEK   42 (164)
T ss_pred             EEEEEEEecCCchhHHhHHHHHHhCCCCCCce
Confidence            56789999999999999999999999876543


No 20 
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=98.03  E-value=1.1e-05  Score=53.54  Aligned_cols=50  Identities=20%  Similarity=0.282  Sum_probs=42.9

Q ss_pred             EEECC-EEEEEEecCCCcccccCHHHHhhcC---CCCCCCCCeEEEecCCceec
Q 043220           46 TKIRF-HEVVILISSGSTANFVSDKVAALLP---LPMMPTKPFSVRIANGAHLK   95 (121)
Q Consensus        46 g~i~~-~~v~~LIDSGAT~nFI~~~~a~rl~---l~~~~~~~~~V~va~G~~l~   95 (121)
                      ..+++ +++.+++|+||+.|.|+.+..++++   .+.-.+.++.+..+||..+.
T Consensus         3 ~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~   56 (93)
T cd05481           3 MKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIP   56 (93)
T ss_pred             eEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEee
Confidence            46788 9999999999999999999999998   45555667999999998764


No 21 
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.30  E-value=0.00077  Score=47.26  Aligned_cols=76  Identities=24%  Similarity=0.291  Sum_probs=50.2

Q ss_pred             CeEEEEEEE--CC-EEEEEEecCCCc-ccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-------------------
Q 043220           40 KFCVVATKI--RF-HEVVILISSGST-ANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS-------------------   96 (121)
Q Consensus        40 ~~~~v~g~i--~~-~~v~~LIDSGAT-~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~-------------------   96 (121)
                      .++.++...  +| +.-..|||||.| -..+++++|.+++++...+  .++..++|+++++                   
T Consensus        11 ~~v~~~f~~~~~Gd~~~~~LiDTGFtg~lvlp~~vaek~~~~~~~~--~~~~~a~~~~v~t~V~~~~iki~g~e~~~~Vl   88 (125)
T COG5550          11 VTVPVTFRLPGQGDFVYDELIDTGFTGYLVLPPQVAEKLGLPLFST--IRIVLADGGVVKTSVALATIKIDGVEKVAFVL   88 (125)
T ss_pred             eeEEEEEEecCCCcEEeeeEEecCCceeEEeCHHHHHhcCCCccCC--hhhhhhcCCEEEEEEEEEEEEECCEEEEEEEE
Confidence            455555555  33 333449999999 7889999999999876533  2333444443322                   


Q ss_pred             -----ceEeecchhhhhcCCeeeeccc
Q 043220           97 -----PDVALDVKWSEQIGSIMRGWKK  118 (121)
Q Consensus        97 -----~dvILG~~WL~~~~p~~idw~~  118 (121)
                           .-.++|++||+.++- .+|.++
T Consensus        89 ~s~~~~~~liG~~~lk~l~~-~vn~~~  114 (125)
T COG5550          89 ASDNLPEPLIGVNLLKLLGL-VVNPKT  114 (125)
T ss_pred             ccCCCcccchhhhhhhhccE-EEcCCc
Confidence                 223999999999876 477653


No 22 
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=94.63  E-value=0.038  Score=36.41  Aligned_cols=25  Identities=12%  Similarity=0.189  Sum_probs=22.6

Q ss_pred             EEEECCEEEEEEecCCCcccccCHH
Q 043220           45 ATKIRFHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        45 ~g~i~~~~v~~LIDSGAT~nFI~~~   69 (121)
                      +.+|+|+.+.+|+||||-++.|...
T Consensus         2 ~~~i~g~~~~~llDTGAd~Tvi~~~   26 (87)
T cd05482           2 TLYINGKLFEGLLDTGADVSIIAEN   26 (87)
T ss_pred             EEEECCEEEEEEEccCCCCeEEccc
Confidence            4678999999999999999999973


No 23 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=94.39  E-value=0.087  Score=40.19  Aligned_cols=56  Identities=14%  Similarity=0.157  Sum_probs=38.2

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCc---------------eecc---------ceEeecchhhhhc
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGA---------------HLKS---------PDVALDVKWSEQI  109 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~---------------~l~~---------~dvILG~~WL~~~  109 (121)
                      .++||||+|..++.+.+.-          ++.+..++|.               ...|         .-.|||-.||+.+
T Consensus       178 ~ai~DTGTs~~~lp~~~~P----------~i~~~f~~~~~~~i~~~~y~~~~~~~~~C~~~~~~~~~~~~ilG~~fl~~~  247 (265)
T cd05476         178 GTIIDSGTTLTYLPDPAYP----------DLTLHFDGGADLELPPENYFVDVGEGVVCLAILSSSSGGVSILGNIQQQNF  247 (265)
T ss_pred             cEEEeCCCcceEcCccccC----------CEEEEECCCCEEEeCcccEEEECCCCCEEEEEecCCCCCcEEEChhhcccE
Confidence            4899999999999987762          2223232221               1123         3479999999999


Q ss_pred             CCeeeecccCC
Q 043220          110 GSIMRGWKKDE  120 (121)
Q Consensus       110 ~p~~idw~~~~  120 (121)
                      -.+ .|+.++|
T Consensus       248 ~~v-FD~~~~~  257 (265)
T cd05476         248 LVE-YDLENSR  257 (265)
T ss_pred             EEE-EECCCCE
Confidence            885 8887764


No 24 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=94.11  E-value=0.08  Score=41.69  Aligned_cols=65  Identities=12%  Similarity=0.168  Sum_probs=47.5

Q ss_pred             EEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceec--------------c--------ceEeecchhhhhc
Q 043220           52 EVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLK--------------S--------PDVALDVKWSEQI  109 (121)
Q Consensus        52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~--------------~--------~dvILG~~WL~~~  109 (121)
                      ...++||||.|..++.+.+.+++.-.+   +.+.+...+|..+.              |        --.|||-.||+.+
T Consensus       231 ~~~aivDSGTs~~~lp~~~~~~l~~~~---P~i~~~f~~g~~~~i~p~~y~~~~~~~~c~~~~~~~~~~~ILG~~flr~~  307 (326)
T cd06096         231 GLGMLVDSGSTLSHFPEDLYNKINNFF---PTITIIFENNLKIDWKPSSYLYKKESFWCKGGEKSVSNKPILGASFFKNK  307 (326)
T ss_pred             CCCEEEeCCCCcccCCHHHHHHHHhhc---CcEEEEEcCCcEEEECHHHhccccCCceEEEEEecCCCceEEChHHhcCc
Confidence            345899999999999999998864322   35666666554432              1        2369999999999


Q ss_pred             CCeeeecccCC
Q 043220          110 GSIMRGWKKDE  120 (121)
Q Consensus       110 ~p~~idw~~~~  120 (121)
                      -.+ .|+.++|
T Consensus       308 y~v-FD~~~~r  317 (326)
T cd06096         308 QII-FDLDNNR  317 (326)
T ss_pred             EEE-EECcCCE
Confidence            875 8887764


No 25 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=91.94  E-value=0.46  Score=36.36  Aligned_cols=65  Identities=8%  Similarity=-0.027  Sum_probs=41.5

Q ss_pred             EEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCc-eecc----------ceEeecchhhhhcCCeeeecccCC
Q 043220           52 EVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGA-HLKS----------PDVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~-~l~~----------~dvILG~~WL~~~~p~~idw~~~~  120 (121)
                      ...++||||++..++...+++++.-.+...   ......|. .+.|          ...|||-.+|+++-.+ .||.++|
T Consensus       198 ~~~~iiDSGTs~~~lP~~~~~~l~~~l~g~---~~~~~~~~~~~~C~~~~P~i~f~~~~ilGd~fl~~~y~v-fD~~~~~  273 (278)
T cd06097         198 GFSAIADTGTTLILLPDAIVEAYYSQVPGA---YYDSEYGGWVFPCDTTLPDLSFAVFSILGDVFLKAQYVV-FDVGGPK  273 (278)
T ss_pred             CceEEeecCCchhcCCHHHHHHHHHhCcCC---cccCCCCEEEEECCCCCCCEEEEEEEEEcchhhCceeEE-EcCCCce
Confidence            356899999999999988776652111000   00001111 2333          2569999999999885 9998765


No 26 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=91.85  E-value=0.32  Score=36.51  Aligned_cols=66  Identities=8%  Similarity=0.011  Sum_probs=42.2

Q ss_pred             EEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------------ceEeecchhhhhcCCeeeeccc
Q 043220           51 HEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------------PDVALDVKWSEQIGSIMRGWKK  118 (121)
Q Consensus        51 ~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------------~dvILG~~WL~~~~p~~idw~~  118 (121)
                      ....++||||++..++...+...+--.+......   .-......|            +..|||..||+.+-.+ .|+.+
T Consensus       201 ~~~~~iiDsGt~~~~lp~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~p~i~f~f~~ilG~~fl~~~y~v-fD~~~  276 (283)
T cd05471         201 GGGGAIVDSGTSLIYLPSSVYDAILKALGAAVSS---SDGGYGVDCSPCDTLPDITFTFLWILGDVFLRNYYTV-FDLDN  276 (283)
T ss_pred             CCcEEEEecCCCCEeCCHHHHHHHHHHhCCcccc---cCCcEEEeCcccCcCCCEEEEEEEEccHhhhhheEEE-EeCCC
Confidence            3568999999999999999888763222111000   000000111            5799999999999885 78776


Q ss_pred             CC
Q 043220          119 DE  120 (121)
Q Consensus       119 ~~  120 (121)
                      ++
T Consensus       277 ~~  278 (283)
T cd05471         277 NR  278 (283)
T ss_pred             CE
Confidence            53


No 27 
>PF12382 Peptidase_A2E:  Retrotransposon peptidase;  InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=91.80  E-value=0.18  Score=34.58  Aligned_cols=40  Identities=23%  Similarity=0.384  Sum_probs=31.4

Q ss_pred             EEEEEEECCE--EEEEEecCCCcccccCHHHHhhcCCCCCCC
Q 043220           42 CVVATKIRFH--EVVILISSGSTANFVSDKVAALLPLPMMPT   81 (121)
Q Consensus        42 ~~v~g~i~~~--~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~   81 (121)
                      |.+.+.+-..  .+..|||+||..|.|.++.++.+.+|.++.
T Consensus        35 mvlqa~lp~fkcsipclidtgaq~niiteetvrahklptrpw   76 (137)
T PF12382_consen   35 MVLQAKLPDFKCSIPCLIDTGAQVNIITEETVRAHKLPTRPW   76 (137)
T ss_pred             hhhhhhCCCccccceeEEccCceeeeeehhhhhhccCCCCcc
Confidence            4444444332  467899999999999999999999998764


No 28 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=89.66  E-value=1.8  Score=33.87  Aligned_cols=74  Identities=12%  Similarity=0.063  Sum_probs=47.8

Q ss_pred             EEECCEE-------EEEEecCCCcccccCHHHHhhcCC--CCC---CCCCeEEEecC----------------Cceecc-
Q 043220           46 TKIRFHE-------VVILISSGSTANFVSDKVAALLPL--PMM---PTKPFSVRIAN----------------GAHLKS-   96 (121)
Q Consensus        46 g~i~~~~-------v~~LIDSGAT~nFI~~~~a~rl~l--~~~---~~~~~~V~va~----------------G~~l~~-   96 (121)
                      ..|+++.       ..++||||++..++.+.+++++..  +..   ..+.+....++                +....| 
T Consensus       197 i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~~~~~C~~~~~~P~i~f~f~g~~~~l~~~~yi~~~~~~~~~~C~  276 (317)
T cd06098         197 VLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQINSAVDCNSLSSMPNVSFTIGGKTFELTPEQYILKVGEGAAAQCI  276 (317)
T ss_pred             EEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhhccCCccccccCCcEEEEECCEEEEEChHHeEEeecCCCCCEEe
Confidence            4666654       358999999999999999987642  221   11233333321                111134 


Q ss_pred             -------------ceEeecchhhhhcCCeeeecccCC
Q 043220           97 -------------PDVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        97 -------------~dvILG~~WL~~~~p~~idw~~~~  120 (121)
                                   ...|||-.||+.+-.+ .|+.++|
T Consensus       277 ~~~~~~~~~~~~~~~~IlGd~Flr~~y~V-fD~~~~~  312 (317)
T cd06098         277 SGFTALDVPPPRGPLWILGDVFMGAYHTV-FDYGNLR  312 (317)
T ss_pred             ceEEECCCCCCCCCeEEechHHhcccEEE-EeCCCCE
Confidence                         1359999999999885 8888764


No 29 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=86.51  E-value=3.3  Score=32.64  Aligned_cols=22  Identities=5%  Similarity=-0.191  Sum_probs=18.6

Q ss_pred             eEeecchhhhhcCCeeeecccCC
Q 043220           98 DVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        98 dvILG~~WL~~~~p~~idw~~~~  120 (121)
                      ..|||..||+.+-.+ .||+++|
T Consensus       303 ~~IlG~~fl~~~y~v-FD~~~~~  324 (329)
T cd05485         303 LWILGDVFIGKYYTE-FDLGNNR  324 (329)
T ss_pred             eEEEchHHhccceEE-EeCCCCE
Confidence            379999999999885 8998765


No 30 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=84.32  E-value=0.35  Score=37.03  Aligned_cols=67  Identities=7%  Similarity=0.167  Sum_probs=44.0

Q ss_pred             EEEEEecCCCcccccCHHHHhhc----CC---------CCCC---CCCeEEEecCCcee-----------------cc--
Q 043220           52 EVVILISSGSTANFVSDKVAALL----PL---------PMMP---TKPFSVRIANGAHL-----------------KS--   96 (121)
Q Consensus        52 ~v~~LIDSGAT~nFI~~~~a~rl----~l---------~~~~---~~~~~V~va~G~~l-----------------~~--   96 (121)
                      ...++||||++..++...+..++    +.         +...   ..++..... |..+                 .|  
T Consensus       199 ~~~~~~Dtgt~~i~lp~~~~~~i~~~l~~~~~~~~~~~~c~~~~~~p~l~f~~~-~~~~~i~~~~~~~~~~~~~~~~C~~  277 (317)
T PF00026_consen  199 GQQAILDTGTSYIYLPRSIFDAIIKALGGSYSDGVYSVPCNSTDSLPDLTFTFG-GVTFTIPPSDYIFKIEDGNGGYCYL  277 (317)
T ss_dssp             EEEEEEETTBSSEEEEHHHHHHHHHHHTTEEECSEEEEETTGGGGSEEEEEEET-TEEEEEEHHHHEEEESSTTSSEEEE
T ss_pred             ceeeecccccccccccchhhHHHHhhhcccccceeEEEecccccccceEEEeeC-CEEEEecchHhcccccccccceeEe
Confidence            35799999999999999987754    11         1111   122333333 2211                 13  


Q ss_pred             -----------ceEeecchhhhhcCCeeeecccCC
Q 043220           97 -----------PDVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        97 -----------~dvILG~~WL~~~~p~~idw~~~~  120 (121)
                                 ...|||++||+.+-.+ +|+.++|
T Consensus       278 ~i~~~~~~~~~~~~iLG~~fl~~~y~v-fD~~~~~  311 (317)
T PF00026_consen  278 GIQPMDSSDDSDDWILGSPFLRNYYVV-FDYENNR  311 (317)
T ss_dssp             SEEEESSTTSSSEEEEEHHHHTTEEEE-EETTTTE
T ss_pred             eeecccccccCCceEecHHHhhceEEE-EeCCCCE
Confidence                       5889999999999875 8887764


No 31 
>PF00455 DeoRC:  DeoR C terminal sensor domain;  InterPro: IPR014036 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after Escherichia coli deoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerization domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].
Probab=81.16  E-value=1.1  Score=32.20  Aligned_cols=63  Identities=13%  Similarity=0.050  Sum_probs=44.8

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecchhhhhcCCeeeec
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVKWSEQIGSIMRGW  116 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~WL~~~~p~~idw  116 (121)
                      .+.||+|+|.-.+.+.+..+.++.+..           .+...|.+.+|..........|..-+.....+.+|+
T Consensus        22 ~Ifld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~vi~~GG~~~~~~~~~~G~~a~~~l~~~~~d~   95 (161)
T PF00455_consen   22 TIFLDSGTTTLELAKYLPDKKNLTVVTNSLPIANELSENPNIEVILLGGEVNPKSLSFVGPIALEALRQFRFDK   95 (161)
T ss_pred             EEEEECchHHHHHHHHhhcCCceEEEECCHHHHHHHHhcCceEEEEeCCEEEcCCCcEECchHHHHHHhhccce
Confidence            378999999999999999997766543           236788889998776666666655555444444443


No 32 
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site 
Probab=81.00  E-value=1.7  Score=28.25  Aligned_cols=27  Identities=19%  Similarity=0.258  Sum_probs=21.5

Q ss_pred             EEEECC--EEEEEEecCCCcccccCHHHH
Q 043220           45 ATKIRF--HEVVILISSGSTANFVSDKVA   71 (121)
Q Consensus        45 ~g~i~~--~~v~~LIDSGAT~nFI~~~~a   71 (121)
                      ...|+.  +++.+++|+||+..++...-+
T Consensus         2 ~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c   30 (109)
T cd05470           2 EIGIGTPPQTFNVLLDTGSSNLWVPSVDC   30 (109)
T ss_pred             EEEeCCCCceEEEEEeCCCCCEEEeCCCC
Confidence            355665  889999999999999887644


No 33 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=80.64  E-value=4.2  Score=30.97  Aligned_cols=25  Identities=20%  Similarity=0.400  Sum_probs=21.4

Q ss_pred             EEEEEECC--EEEEEEecCCCcccccC
Q 043220           43 VVATKIRF--HEVVILISSGSTANFVS   67 (121)
Q Consensus        43 ~v~g~i~~--~~v~~LIDSGAT~nFI~   67 (121)
                      .+...|+.  +++.+++||||+..++.
T Consensus         4 ~~~i~iGtp~q~~~v~~DTgS~~~wv~   30 (295)
T cd05474           4 SAELSVGTPPQKVTVLLDTGSSDLWVP   30 (295)
T ss_pred             EEEEEECCCCcEEEEEEeCCCCcceee
Confidence            45667776  88999999999999998


No 34 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=75.40  E-value=6.1  Score=33.14  Aligned_cols=22  Identities=14%  Similarity=0.252  Sum_probs=18.4

Q ss_pred             EEEEecCCCcccccCHHHHhhc
Q 043220           53 VVILISSGSTANFVSDKVAALL   74 (121)
Q Consensus        53 v~~LIDSGAT~nFI~~~~a~rl   74 (121)
                      ..++||||+|..++.+..++++
T Consensus       332 ~~aIlDSGTSli~lP~~~~~~i  353 (450)
T PTZ00013        332 ANVIVDSGTTTITAPSEFLNKF  353 (450)
T ss_pred             cceEECCCCccccCCHHHHHHH
Confidence            4589999999999998876653


No 35 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=74.43  E-value=4.9  Score=30.64  Aligned_cols=26  Identities=23%  Similarity=0.408  Sum_probs=22.0

Q ss_pred             EEEEEEC--CEEEEEEecCCCcccccCH
Q 043220           43 VVATKIR--FHEVVILISSGSTANFVSD   68 (121)
Q Consensus        43 ~v~g~i~--~~~v~~LIDSGAT~nFI~~   68 (121)
                      .+...|+  ++++.++|||||+..++..
T Consensus         3 ~~~v~iGtp~q~~~~~iDTGS~~~wv~~   30 (317)
T PF00026_consen    3 YINVTIGTPPQTFRVLIDTGSSDTWVPS   30 (317)
T ss_dssp             EEEEEETTTTEEEEEEEETTBSSEEEEB
T ss_pred             EEEEEECCCCeEEEEEEecccceeeece
Confidence            4566777  7999999999999999874


No 36 
>COG2383 Uncharacterized conserved protein [Function unknown]
Probab=73.59  E-value=0.6  Score=31.78  Aligned_cols=21  Identities=29%  Similarity=0.528  Sum_probs=18.3

Q ss_pred             EeecchhhhhcCCeeeecccC
Q 043220           99 VALDVKWSEQIGSIMRGWKKD  119 (121)
Q Consensus        99 vILG~~WL~~~~p~~idw~~~  119 (121)
                      .||+.-||.++|.+++||.+.
T Consensus        50 yilsl~~La~~GVItin~~al   70 (109)
T COG2383          50 YILSLFWLAQYGVITINWEAL   70 (109)
T ss_pred             HHHHHHHHHHcCeEEEcHHHH
Confidence            378889999999999999754


No 37 
>COG0282 ackA Acetate kinase [Energy production and conversion]
Probab=73.11  E-value=2.2  Score=35.39  Aligned_cols=37  Identities=24%  Similarity=0.492  Sum_probs=33.7

Q ss_pred             CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc
Q 043220           60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS   96 (121)
Q Consensus        60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~   96 (121)
                      |-||.|++.++++.|+.|+...+-+.+.++||..+..
T Consensus       179 GtSh~YVs~~aa~~L~k~~~~l~~I~~HLGNGASicA  215 (396)
T COG0282         179 GTSHKYVSQRAAEILGKPLEDLNLITCHLGNGASICA  215 (396)
T ss_pred             ccchHHHHHHHHHHhCCCccccCEEEEEecCchhhhh
Confidence            7899999999999999999888899999999987753


No 38 
>PTZ00147 plasmepsin-1; Provisional
Probab=69.77  E-value=23  Score=29.68  Aligned_cols=22  Identities=14%  Similarity=0.259  Sum_probs=18.9

Q ss_pred             EEEEEecCCCcccccCHHHHhh
Q 043220           52 EVVILISSGSTANFVSDKVAAL   73 (121)
Q Consensus        52 ~v~~LIDSGAT~nFI~~~~a~r   73 (121)
                      ...++||||.|..++....+.+
T Consensus       332 ~~~aIiDSGTsli~lP~~~~~a  353 (453)
T PTZ00147        332 KANVIVDSGTSVITVPTEFLNK  353 (453)
T ss_pred             ceeEEECCCCchhcCCHHHHHH
Confidence            3569999999999999998764


No 39 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=66.02  E-value=7.6  Score=29.61  Aligned_cols=26  Identities=19%  Similarity=0.263  Sum_probs=21.4

Q ss_pred             EEEEECC--EEEEEEecCCCcccccCHH
Q 043220           44 VATKIRF--HEVVILISSGSTANFVSDK   69 (121)
Q Consensus        44 v~g~i~~--~~v~~LIDSGAT~nFI~~~   69 (121)
                      +...|+.  +++.+++||||+..++...
T Consensus         3 ~~i~vGtP~Q~~~v~~DTGS~~~wv~~~   30 (278)
T cd06097           3 TPVKIGTPPQTLNLDLDTGSSDLWVFSS   30 (278)
T ss_pred             eeEEECCCCcEEEEEEeCCCCceeEeeC
Confidence            4566776  8899999999999999754


No 40 
>PRK10681 DNA-binding transcriptional repressor DeoR; Provisional
Probab=64.57  E-value=3.1  Score=31.93  Aligned_cols=51  Identities=12%  Similarity=0.259  Sum_probs=33.0

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecch
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVK  104 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~  104 (121)
                      .++||+|+|.-++.+.+....++.+..           .+.+.|.+.+|........+.|..
T Consensus        95 tIflD~GtT~~~la~~L~~~~~ltvvTnsl~i~~~l~~~~~~~villGG~~~~~~~~~~G~~  156 (252)
T PRK10681         95 TLFFDCGTTTPWIIEAIDNELPFTAVCYSLNTFLALQEKPHCRAILCGGEFHASNAIFKPLD  156 (252)
T ss_pred             EEEEECCccHHHHHHhcCCCCCeEEEECCHHHHHHHhhCCCCEEEEECcEEecCcceeeCHH
Confidence            478999999999888887643333321           234677777887654444455543


No 41 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=60.03  E-value=14  Score=28.87  Aligned_cols=28  Identities=14%  Similarity=0.242  Sum_probs=22.7

Q ss_pred             EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           42 CVVATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      ..+...|+  .+++.++|||||+..++...
T Consensus        11 Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~   40 (317)
T cd06098          11 YFGEIGIGTPPQKFTVIFDTGSSNLWVPSS   40 (317)
T ss_pred             EEEEEEECCCCeEEEEEECCCccceEEecC
Confidence            45566776  68899999999999998754


No 42 
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=59.01  E-value=8  Score=28.31  Aligned_cols=23  Identities=13%  Similarity=0.104  Sum_probs=17.6

Q ss_pred             EEEEEEecCCCcccccCHHHHhh
Q 043220           51 HEVVILISSGSTANFVSDKVAAL   73 (121)
Q Consensus        51 ~~v~~LIDSGAT~nFI~~~~a~r   73 (121)
                      ..+.|=||+||..+-++..=..+
T Consensus        38 ~~~kAkiDTGA~TSsL~A~dI~~   60 (162)
T COG4067          38 IQLKAKIDTGAVTSSLSASDIER   60 (162)
T ss_pred             ceeeeeecccceeeeEEeeccee
Confidence            45788999999999888765543


No 43 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=58.34  E-value=14  Score=28.87  Aligned_cols=27  Identities=15%  Similarity=0.186  Sum_probs=22.0

Q ss_pred             EEEEEEEC--CEEEEEEecCCCcccccCH
Q 043220           42 CVVATKIR--FHEVVILISSGSTANFVSD   68 (121)
Q Consensus        42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~   68 (121)
                      ..+...|+  .+++.++|||||+..++..
T Consensus         9 y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~   37 (326)
T cd05487           9 YYGEIGIGTPPQTFKVVFDTGSSNLWVPS   37 (326)
T ss_pred             EEEEEEECCCCcEEEEEEeCCccceEEcc
Confidence            34566776  6889999999999999964


No 44 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=57.63  E-value=11  Score=28.93  Aligned_cols=20  Identities=20%  Similarity=0.245  Sum_probs=17.2

Q ss_pred             EEEecCCCcccccCHHHHhh
Q 043220           54 VILISSGSTANFVSDKVAAL   73 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~r   73 (121)
                      .++||||+|..++.+...++
T Consensus       173 ~~ivDSGTt~~~lp~~~~~~  192 (299)
T cd05472         173 GVIIDSGTVITRLPPSAYAA  192 (299)
T ss_pred             CeEEeCCCcceecCHHHHHH
Confidence            47999999999999877664


No 45 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=57.24  E-value=11  Score=29.21  Aligned_cols=29  Identities=7%  Similarity=0.031  Sum_probs=21.7

Q ss_pred             EEECCEEE------EEEecCCCcccccCHHHHhhc
Q 043220           46 TKIRFHEV------VILISSGSTANFVSDKVAALL   74 (121)
Q Consensus        46 g~i~~~~v------~~LIDSGAT~nFI~~~~a~rl   74 (121)
                      ..|+++.+      .++||||++..++.....+++
T Consensus       186 i~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l  220 (316)
T cd05486         186 IQVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQL  220 (316)
T ss_pred             EEEecceEecCCCCEEEECCCcchhhcCHHHHHHH
Confidence            35565543      589999999999998876543


No 46 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=57.14  E-value=14  Score=28.70  Aligned_cols=73  Identities=14%  Similarity=0.058  Sum_probs=48.2

Q ss_pred             EEECCEEE------EEEecCCCcccccCHHHHhhc----CC----------CCC---CCCCeEEEecCCceec-------
Q 043220           46 TKIRFHEV------VILISSGSTANFVSDKVAALL----PL----------PMM---PTKPFSVRIANGAHLK-------   95 (121)
Q Consensus        46 g~i~~~~v------~~LIDSGAT~nFI~~~~a~rl----~l----------~~~---~~~~~~V~va~G~~l~-------   95 (121)
                      ..|+++.+      .++||||++..++......++    +.          +..   ..+.+.+.. +|..+.       
T Consensus       194 v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~P~~~f~f-~g~~~~i~~~~y~  272 (317)
T cd05478         194 VTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSDIGASQNQNGEMVVNCSSISSMPDVVFTI-NGVQYPLPPSAYI  272 (317)
T ss_pred             EEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHHhCCccccCCcEEeCCcCcccCCcEEEEE-CCEEEEECHHHhe
Confidence            46777655      589999999999998877653    21          111   112344555 343221       


Q ss_pred             ------c----------ceEeecchhhhhcCCeeeecccCC
Q 043220           96 ------S----------PDVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        96 ------~----------~dvILG~~WL~~~~p~~idw~~~~  120 (121)
                            |          ...|||-.+|+.+-.+ .||.++|
T Consensus       273 ~~~~~~C~~~~~~~~~~~~~IlG~~fl~~~y~v-fD~~~~~  312 (317)
T cd05478         273 LQDQGSCTSGFQSMGLGELWILGDVFIRQYYSV-FDRANNK  312 (317)
T ss_pred             ecCCCEEeEEEEeCCCCCeEEechHHhcceEEE-EeCCCCE
Confidence                  3          2479999999999885 8988764


No 47 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=55.83  E-value=16  Score=28.36  Aligned_cols=72  Identities=14%  Similarity=0.145  Sum_probs=45.7

Q ss_pred             EECCEEE-------EEEecCCCcccccCHHHHhhc----CCCC----------CC---CCCeEEEecCCcee--------
Q 043220           47 KIRFHEV-------VILISSGSTANFVSDKVAALL----PLPM----------MP---TKPFSVRIANGAHL--------   94 (121)
Q Consensus        47 ~i~~~~v-------~~LIDSGAT~nFI~~~~a~rl----~l~~----------~~---~~~~~V~va~G~~l--------   94 (121)
                      .|++...       .++||||.|..++.....+++    +...          ..   .+.+....+ |..+        
T Consensus       189 ~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~p~l~~~f~-g~~~~v~~~~y~  267 (318)
T cd05477         189 QINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTLMQSIGAQQDQYGQYVVNCNNIQNLPTLTFTIN-GVSFPLPPSAYI  267 (318)
T ss_pred             EECCEEecccCCCceeeECCCCccEECCHHHHHHHHHHhCCccccCCCEEEeCCccccCCcEEEEEC-CEEEEECHHHeE
Confidence            5666543       589999999999999887764    1111          10   123444442 2221        


Q ss_pred             -----cc---------------ceEeecchhhhhcCCeeeecccCC
Q 043220           95 -----KS---------------PDVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        95 -----~~---------------~dvILG~~WL~~~~p~~idw~~~~  120 (121)
                           .|               ...|||..+|+.+-.+ .|+.+.|
T Consensus       268 ~~~~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~v-fD~~~~~  312 (318)
T cd05477         268 LQNNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSV-YDLGNNQ  312 (318)
T ss_pred             ecCCCeEEEEEEecccCCCCCCceEEEcHHHhhheEEE-EeCCCCE
Confidence                 23               1379999999999885 8888764


No 48 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=54.73  E-value=17  Score=27.68  Aligned_cols=25  Identities=20%  Similarity=0.267  Sum_probs=20.6

Q ss_pred             EEEEEEC--CEEEEEEecCCCcccccC
Q 043220           43 VVATKIR--FHEVVILISSGSTANFVS   67 (121)
Q Consensus        43 ~v~g~i~--~~~v~~LIDSGAT~nFI~   67 (121)
                      .+...|+  .+.+.+++|+||+..++.
T Consensus         4 ~~~i~iGtP~q~~~v~~DTGS~~~Wv~   30 (273)
T cd05475           4 YVTINIGNPPKPYFLDIDTGSDLTWLQ   30 (273)
T ss_pred             EEEEEcCCCCeeEEEEEccCCCceEEe
Confidence            4556666  688999999999999995


No 49 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=54.38  E-value=18  Score=28.15  Aligned_cols=27  Identities=15%  Similarity=0.226  Sum_probs=21.5

Q ss_pred             EEEEEEECC--EEEEEEecCCCcccccCH
Q 043220           42 CVVATKIRF--HEVVILISSGSTANFVSD   68 (121)
Q Consensus        42 ~~v~g~i~~--~~v~~LIDSGAT~nFI~~   68 (121)
                      ..+...|+.  +++.+++||||+..++..
T Consensus         7 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~   35 (325)
T cd05490           7 YYGEIGIGTPPQTFTVVFDTGSSNLWVPS   35 (325)
T ss_pred             EEEEEEECCCCcEEEEEEeCCCccEEEEc
Confidence            345667764  889999999999999954


No 50 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=53.75  E-value=15  Score=27.84  Aligned_cols=25  Identities=16%  Similarity=0.284  Sum_probs=20.1

Q ss_pred             EEEEEC--CEEEEEEecCCCcccccCH
Q 043220           44 VATKIR--FHEVVILISSGSTANFVSD   68 (121)
Q Consensus        44 v~g~i~--~~~v~~LIDSGAT~nFI~~   68 (121)
                      +...|+  .+.+.+++||||++.++..
T Consensus         4 ~~i~iGtP~q~~~v~~DTGSs~~wv~~   30 (265)
T cd05476           4 VTLSIGTPPQPFSLIVDTGSDLTWTQC   30 (265)
T ss_pred             EEEecCCCCcceEEEecCCCCCEEEcC
Confidence            455666  5789999999999998863


No 51 
>PRK09802 DNA-binding transcriptional regulator AgaR; Provisional
Probab=52.57  E-value=6.6  Score=30.51  Aligned_cols=53  Identities=9%  Similarity=0.106  Sum_probs=34.0

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecchhh
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVKWS  106 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~WL  106 (121)
                      .+.||+|+|..++-+.+..+-++.+..           .+.+.|.+.+|........+.|..-+
T Consensus       109 ~Ifld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~v~llGG~~~~~~~~~~G~~a~  172 (269)
T PRK09802        109 RVILDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAE  172 (269)
T ss_pred             EEEECCchHHHHHHHhcCcCCCeEEEeCCHHHHHHHHhCCCCEEEEECCEEecCCCceECHHHH
Confidence            478999999988888876543333322           13467777888766555555555433


No 52 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=52.02  E-value=22  Score=27.62  Aligned_cols=28  Identities=14%  Similarity=0.246  Sum_probs=22.1

Q ss_pred             EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           42 CVVATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      ..+...|+  .+++.++|||||+..++...
T Consensus        11 Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~   40 (317)
T cd05478          11 YYGTISIGTPPQDFTVIFDTGSSNLWVPSV   40 (317)
T ss_pred             EEEEEEeCCCCcEEEEEEeCCCccEEEecC
Confidence            34566666  57899999999999999754


No 53 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=51.80  E-value=26  Score=27.46  Aligned_cols=28  Identities=11%  Similarity=0.133  Sum_probs=22.1

Q ss_pred             EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           42 CVVATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      ..+...|+  .+++.+++||||+..++...
T Consensus         4 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~   33 (326)
T cd06096           4 YFIDIFIGNPPQKQSLILDTGSSSLSFPCS   33 (326)
T ss_pred             EEEEEEecCCCeEEEEEEeCCCCceEEecC
Confidence            34566776  58899999999999998653


No 54 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=51.04  E-value=24  Score=27.33  Aligned_cols=27  Identities=19%  Similarity=0.288  Sum_probs=21.6

Q ss_pred             EEEEEECC--EEEEEEecCCCcccccCHH
Q 043220           43 VVATKIRF--HEVVILISSGSTANFVSDK   69 (121)
Q Consensus        43 ~v~g~i~~--~~v~~LIDSGAT~nFI~~~   69 (121)
                      .....|+.  +++.++|||||+..++...
T Consensus         5 ~~~i~iGtP~q~~~v~~DTGS~~~wv~~~   33 (318)
T cd05477           5 YGEISIGTPPQNFLVLFDTGSSNLWVPSV   33 (318)
T ss_pred             EEEEEECCCCcEEEEEEeCCCccEEEccC
Confidence            34566664  8899999999999999754


No 55 
>PRK10411 DNA-binding transcriptional activator FucR; Provisional
Probab=50.76  E-value=7  Score=29.86  Aligned_cols=55  Identities=11%  Similarity=0.228  Sum_probs=34.6

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecchhhhhc
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVKWSEQI  109 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~WL~~~  109 (121)
                      .+.||+|+|..++.+.+..+ ++.+..           .+.+.+.+.+|..-..+..+.|..-+...
T Consensus        96 ~Ifld~GtT~~~l~~~L~~~-~ltVvTNs~~ia~~l~~~~~~~vil~GG~~~~~~~~~~G~~a~~~l  161 (240)
T PRK10411         96 VIALDASSTCWYLARQLPDI-NIQVFTNSHPICQELGKRERIQLISSGGTLERKYGCYVNPSLISQL  161 (240)
T ss_pred             EEEEcCcHHHHHHHHhhCCC-CeEEEeCCHHHHHHHhcCCCCEEEEECCEEeCCCCceECHHHHHHH
Confidence            47899999999888888765 444322           13456777777655544555555433333


No 56 
>PRK10906 DNA-binding transcriptional repressor GlpR; Provisional
Probab=49.61  E-value=7.8  Score=29.83  Aligned_cols=49  Identities=16%  Similarity=0.264  Sum_probs=31.1

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeec
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALD  102 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG  102 (121)
                      .+.||+|+|...+.+.+..+.++.+..           .+.+.|.+.+|..-.......|
T Consensus        94 tIflD~GtT~~~la~~L~~~~~ltVvTNsl~ia~~l~~~~~~~villGG~~~~~~~~~~G  153 (252)
T PRK10906         94 TLFIDIGTTPEAVAHALLNHSNLRIVTNNLNVANTLMAKEDFRIILAGGELRSRDGGIIG  153 (252)
T ss_pred             EEEEcCcHHHHHHHHHhcCCCCcEEEECcHHHHHHHhhCCCCEEEEECCEEecCCCccCC
Confidence            478999999999888887654444322           1346777777764433333333


No 57 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=49.47  E-value=19  Score=27.64  Aligned_cols=24  Identities=17%  Similarity=0.425  Sum_probs=19.4

Q ss_pred             EEEEEC--CEEEEEEecCCCcccccC
Q 043220           44 VATKIR--FHEVVILISSGSTANFVS   67 (121)
Q Consensus        44 v~g~i~--~~~v~~LIDSGAT~nFI~   67 (121)
                      +...|+  .+++.+++||||+..++.
T Consensus         4 ~~i~iGtP~q~~~v~~DTGSs~~Wv~   29 (299)
T cd05472           4 VTVGLGTPARDQTVIVDTGSDLTWVQ   29 (299)
T ss_pred             EEEecCCCCcceEEEecCCCCccccc
Confidence            455665  478999999999999984


No 58 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=49.47  E-value=24  Score=27.54  Aligned_cols=28  Identities=11%  Similarity=0.258  Sum_probs=22.9

Q ss_pred             EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           42 CVVATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      ..+...|+  .+++.+++||||+..++...
T Consensus        11 Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~   40 (320)
T cd05488          11 YFTDITLGTPPQKFKVILDTGSSNLWVPSV   40 (320)
T ss_pred             EEEEEEECCCCcEEEEEEecCCcceEEEcC
Confidence            45667776  48999999999999999754


No 59 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=49.12  E-value=23  Score=28.21  Aligned_cols=27  Identities=22%  Similarity=0.273  Sum_probs=21.4

Q ss_pred             EEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           43 VVATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        43 ~v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      .+...|+  ++++.++|||||+..++...
T Consensus         5 ~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~   33 (364)
T cd05473           5 YIEMLIGTPPQKLNILVDTGSSNFAVAAA   33 (364)
T ss_pred             EEEEEecCCCceEEEEEecCCcceEEEcC
Confidence            4556666  58899999999999998653


No 60 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=48.93  E-value=25  Score=29.56  Aligned_cols=29  Identities=14%  Similarity=0.305  Sum_probs=22.7

Q ss_pred             EEEEEEC--CEEEEEEecCCCcccccCHHHH
Q 043220           43 VVATKIR--FHEVVILISSGSTANFVSDKVA   71 (121)
Q Consensus        43 ~v~g~i~--~~~v~~LIDSGAT~nFI~~~~a   71 (121)
                      .....|+  ++++.+++||||+..++...-+
T Consensus       140 y~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C  170 (450)
T PTZ00013        140 YGEGEVGDNHQKFMLIFDTGSANLWVPSKKC  170 (450)
T ss_pred             EEEEEECCCCeEEEEEEeCCCCceEEecccC
Confidence            3455665  6899999999999999975543


No 61 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=48.59  E-value=40  Score=26.80  Aligned_cols=20  Identities=25%  Similarity=0.207  Sum_probs=17.2

Q ss_pred             EEEecCCCcccccCHHHHhh
Q 043220           54 VILISSGSTANFVSDKVAAL   73 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~r   73 (121)
                      .++||||+|..++.....+.
T Consensus       213 ~~ivDSGTs~~~lp~~~~~~  232 (364)
T cd05473         213 KAIVDSGTTNLRLPVKVFNA  232 (364)
T ss_pred             cEEEeCCCcceeCCHHHHHH
Confidence            48999999999999887664


No 62 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=48.37  E-value=23  Score=27.55  Aligned_cols=26  Identities=15%  Similarity=0.247  Sum_probs=20.3

Q ss_pred             EEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           44 VATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        44 v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      ....|+  .+++.++|||||+..++...
T Consensus         3 ~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~   30 (316)
T cd05486           3 GQISIGTPPQNFTVIFDTGSSNLWVPSI   30 (316)
T ss_pred             EEEEECCCCcEEEEEEcCCCccEEEecC
Confidence            345555  58899999999999999643


No 63 
>PTZ00147 plasmepsin-1; Provisional
Probab=48.11  E-value=27  Score=29.34  Aligned_cols=28  Identities=14%  Similarity=0.182  Sum_probs=23.0

Q ss_pred             EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           42 CVVATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      ......|+  ++++.+++||||+..++...
T Consensus       140 Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~  169 (453)
T PTZ00147        140 SYGEAKLGDNGQKFNFIFDTGSANLWVPSI  169 (453)
T ss_pred             EEEEEEECCCCeEEEEEEeCCCCcEEEeec
Confidence            34567777  78999999999999999754


No 64 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=47.20  E-value=27  Score=27.40  Aligned_cols=29  Identities=17%  Similarity=0.198  Sum_probs=23.4

Q ss_pred             eEEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220           41 FCVVATKIR--FHEVVILISSGSTANFVSDK   69 (121)
Q Consensus        41 ~~~v~g~i~--~~~v~~LIDSGAT~nFI~~~   69 (121)
                      ...+...|+  ++++.+++||||+..++...
T Consensus        11 ~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~   41 (329)
T cd05485          11 QYYGVITIGTPPQSFKVVFDTGSSNLWVPSK   41 (329)
T ss_pred             eEEEEEEECCCCcEEEEEEcCCCccEEEecC
Confidence            345677777  58899999999999999754


No 65 
>PF03419 Peptidase_U4:  Sporulation factor SpoIIGA  This family belongs to family U4 of the peptidase classification.;  InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-).  Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=46.65  E-value=35  Score=26.65  Aligned_cols=35  Identities=17%  Similarity=0.157  Sum_probs=24.4

Q ss_pred             CeEEEEEEECCEE--EEEEecCCCcc---------cccCHHHHhhc
Q 043220           40 KFCVVATKIRFHE--VVILISSGSTA---------NFVSDKVAALL   74 (121)
Q Consensus        40 ~~~~v~g~i~~~~--v~~LIDSGAT~---------nFI~~~~a~rl   74 (121)
                      ....++..++++.  +.+|+|||...         ..++.+.++++
T Consensus       156 ~~~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~  201 (293)
T PF03419_consen  156 YLYPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL  201 (293)
T ss_pred             EEEEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence            4567888888876  57899999854         35555555554


No 66 
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=45.61  E-value=35  Score=26.83  Aligned_cols=25  Identities=4%  Similarity=0.102  Sum_probs=19.4

Q ss_pred             ccCeEEEEEEECCEE--EEEEecCCCc
Q 043220           38 LRKFCVVATKIRFHE--VVILISSGST   62 (121)
Q Consensus        38 ~~~~~~v~g~i~~~~--v~~LIDSGAT   62 (121)
                      ....+.+...++|+.  +.+|+|||..
T Consensus       155 ~~~~~~v~i~~~g~~~~~~alvDTGN~  181 (288)
T TIGR02854       155 DKQIYELEICLDGKKVTIKGFLDTGNQ  181 (288)
T ss_pred             hceEEEEEEEECCEEEEEEEEEecCCc
Confidence            345677888888876  6789999975


No 67 
>PF04930 FUN14:  FUN14 family;  InterPro: IPR007014 This is a family of short proteins found in eukaryotes and some archaea. Although the function of these proteins is not known they may contain transmembrane helices.
Probab=42.49  E-value=7.8  Score=25.66  Aligned_cols=20  Identities=25%  Similarity=0.346  Sum_probs=17.2

Q ss_pred             eecchhhhhcCCeeeecccC
Q 043220          100 ALDVKWSEQIGSIMRGWKKD  119 (121)
Q Consensus       100 ILG~~WL~~~~p~~idw~~~  119 (121)
                      ++.+.||.+.|=+.+||++.
T Consensus        31 ~l~lq~l~~~G~i~Vnw~kl   50 (100)
T PF04930_consen   31 FLLLQYLASKGYIKVNWDKL   50 (100)
T ss_pred             HHHHHHHHHCCeEEECHHHH
Confidence            56678999999999999864


No 68 
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=42.44  E-value=59  Score=21.06  Aligned_cols=44  Identities=11%  Similarity=0.120  Sum_probs=32.8

Q ss_pred             EEEEEEECCEEEEEEecCCCcc--cccCHHHHhhcCCCCCCCCCeEEEecC
Q 043220           42 CVVATKIRFHEVVILISSGSTA--NFVSDKVAALLPLPMMPTKPFSVRIAN   90 (121)
Q Consensus        42 ~~v~g~i~~~~v~~LIDSGAT~--nFI~~~~a~rl~l~~~~~~~~~V~va~   90 (121)
                      +++++..+|..+.+.++..++.  .=+...+++++++.     ++.++-.+
T Consensus         1 V~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~-----~f~lKYlD   46 (81)
T cd06396           1 VNLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN-----DIQIKYVD   46 (81)
T ss_pred             CEEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC-----cceeEEEc
Confidence            4788999999999999996644  44667788888877     45555443


No 69 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=42.40  E-value=19  Score=27.98  Aligned_cols=22  Identities=9%  Similarity=0.069  Sum_probs=18.8

Q ss_pred             EEEEecCCCcccccCHHHHhhc
Q 043220           53 VVILISSGSTANFVSDKVAALL   74 (121)
Q Consensus        53 v~~LIDSGAT~nFI~~~~a~rl   74 (121)
                      ..++||||+|..++....++.+
T Consensus       207 ~~aiiDSGTt~~~~p~~~~~~l  228 (325)
T cd05490         207 CEAIVDTGTSLITGPVEEVRAL  228 (325)
T ss_pred             CEEEECCCCccccCCHHHHHHH
Confidence            4689999999999998877654


No 70 
>PRK10434 srlR DNA-bindng transcriptional repressor SrlR; Provisional
Probab=39.75  E-value=15  Score=28.23  Aligned_cols=50  Identities=14%  Similarity=0.176  Sum_probs=31.4

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCC-eEEEecCCceeccceEeecc
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKP-FSVRIANGAHLKSPDVALDV  103 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~-~~V~va~G~~l~~~dvILG~  103 (121)
                      .++||+|+|...+.+.+....++.+..           ..+ +.|.+.+|........+.|.
T Consensus        94 tIfld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~~v~l~GG~~~~~~~~~~G~  155 (256)
T PRK10434         94 SIILDAGSTVLQMVPLLSRFNNITVMTNSLHIVNALSELDNEQTILMPGGTFRKKSASFHGQ  155 (256)
T ss_pred             EEEEcCcHHHHHHHHHhccCCCeEEEECCHHHHHHHhhCCCCCEEEEECCEEeCCCCeEECH
Confidence            478999999998888887653333221           123 57888888755433444443


No 71 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=39.19  E-value=35  Score=25.33  Aligned_cols=28  Identities=14%  Similarity=0.284  Sum_probs=21.0

Q ss_pred             EEEECC--EEEEEEecCCCcccccCHHHHh
Q 043220           45 ATKIRF--HEVVILISSGSTANFVSDKVAA   72 (121)
Q Consensus        45 ~g~i~~--~~v~~LIDSGAT~nFI~~~~a~   72 (121)
                      ...|+.  +++.++||+||+..++...-..
T Consensus         4 ~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~   33 (283)
T cd05471           4 EITIGTPPQKFSVIFDTGSSLLWVPSSNCT   33 (283)
T ss_pred             EEEECCCCcEEEEEEeCCCCCEEEecCCCC
Confidence            445543  6889999999999998766443


No 72 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=38.73  E-value=22  Score=27.72  Aligned_cols=22  Identities=18%  Similarity=0.277  Sum_probs=18.7

Q ss_pred             EEEEecCCCcccccCHHHHhhc
Q 043220           53 VVILISSGSTANFVSDKVAALL   74 (121)
Q Consensus        53 v~~LIDSGAT~nFI~~~~a~rl   74 (121)
                      ..++||||+|..++...+++.+
T Consensus       206 ~~~ivDSGtt~~~lp~~~~~~l  227 (320)
T cd05488         206 TGAAIDTGTSLIALPSDLAEML  227 (320)
T ss_pred             CeEEEcCCcccccCCHHHHHHH
Confidence            3589999999999999987753


No 73 
>PF14543 TAXi_N:  Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=35.31  E-value=60  Score=22.97  Aligned_cols=23  Identities=22%  Similarity=0.436  Sum_probs=17.3

Q ss_pred             EEEEECC--EEEEEEecCCCccccc
Q 043220           44 VATKIRF--HEVVILISSGSTANFV   66 (121)
Q Consensus        44 v~g~i~~--~~v~~LIDSGAT~nFI   66 (121)
                      +...|+.  +++.+.||+|+...++
T Consensus         3 ~~~~iGtP~~~~~lvvDtgs~l~W~   27 (164)
T PF14543_consen    3 VSVSIGTPPQPFSLVVDTGSDLTWV   27 (164)
T ss_dssp             EEEECTCTTEEEEEEEETT-SSEEE
T ss_pred             EEEEeCCCCceEEEEEECCCCceEE
Confidence            4555655  7889999999998886


No 74 
>PTZ00165 aspartyl protease; Provisional
Probab=35.13  E-value=58  Score=27.59  Aligned_cols=31  Identities=13%  Similarity=0.274  Sum_probs=24.4

Q ss_pred             EEEEEEECC--EEEEEEecCCCcccccCHHHHh
Q 043220           42 CVVATKIRF--HEVVILISSGSTANFVSDKVAA   72 (121)
Q Consensus        42 ~~v~g~i~~--~~v~~LIDSGAT~nFI~~~~a~   72 (121)
                      ......|+.  +++.+++||||+..++...-+.
T Consensus       121 Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~  153 (482)
T PTZ00165        121 YFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECK  153 (482)
T ss_pred             EEEEEEeCCCCceEEEEEeCCCCCEEEEchhcC
Confidence            445667776  8999999999999999765443


No 75 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=34.59  E-value=29  Score=27.08  Aligned_cols=22  Identities=5%  Similarity=-0.035  Sum_probs=18.2

Q ss_pred             eEeecchhhhhcCCeeeecccCC
Q 043220           98 DVALDVKWSEQIGSIMRGWKKDE  120 (121)
Q Consensus        98 dvILG~~WL~~~~p~~idw~~~~  120 (121)
                      ..|||-.+|+.+-.+ .||.++|
T Consensus       299 ~~ilG~~flr~~y~v-fD~~~~~  320 (326)
T cd05487         299 LWVLGATFIRKFYTE-FDRQNNR  320 (326)
T ss_pred             eEEEehHHhhccEEE-EeCCCCE
Confidence            369999999999875 8888765


No 76 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=34.53  E-value=29  Score=26.32  Aligned_cols=22  Identities=18%  Similarity=0.477  Sum_probs=19.4

Q ss_pred             EEEEecCCCcccccCHHHHhhc
Q 043220           53 VVILISSGSTANFVSDKVAALL   74 (121)
Q Consensus        53 v~~LIDSGAT~nFI~~~~a~rl   74 (121)
                      ..++||||++..++...+.+++
T Consensus       179 ~~~iiDSGt~~~~lP~~~~~~l  200 (295)
T cd05474         179 LPALLDSGTTLTYLPSDIVDAI  200 (295)
T ss_pred             ccEEECCCCccEeCCHHHHHHH
Confidence            5789999999999999988764


No 77 
>PF04126 Cyclophil_like:  Cyclophilin-like;  InterPro: IPR007256 Proteins of this family have no known function.; PDB: 2KA0_A 1ZX8_C 2NNZ_A.
Probab=31.34  E-value=33  Score=23.51  Aligned_cols=30  Identities=30%  Similarity=0.270  Sum_probs=21.3

Q ss_pred             EEEEEEECCEEEEE-EecCCCcccccCHHHHhhcCC
Q 043220           42 CVVATKIRFHEVVI-LISSGSTANFVSDKVAALLPL   76 (121)
Q Consensus        42 ~~v~g~i~~~~v~~-LIDSGAT~nFI~~~~a~rl~l   76 (121)
                      |+++..++++.+.+ |.|+-     -.++++++|.+
T Consensus         1 mkI~i~i~~~~~~a~L~d~~-----ta~~~~~~LPl   31 (120)
T PF04126_consen    1 MKIKITIGGQEIEAELNDSP-----TARAFAAQLPL   31 (120)
T ss_dssp             EEEEEEETTEEEEEEEETTH-----HHHHHHHC-SE
T ss_pred             CeEEEEECCEEEEEEECCCH-----HHHHHHHhCCe
Confidence            78999999999887 88883     34555666543


No 78 
>PRK13509 transcriptional repressor UlaR; Provisional
Probab=29.90  E-value=26  Score=26.86  Aligned_cols=49  Identities=27%  Similarity=0.378  Sum_probs=29.4

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecc
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDV  103 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~  103 (121)
                      .+.||+|+|.-.+.+.+..+ ++.+..           .+.+.+.+.+|..-.....+.|.
T Consensus        96 ~Ifld~GsT~~~la~~L~~~-~ltVvTnsl~ia~~l~~~~~~~v~l~GG~~~~~~~~~~G~  155 (251)
T PRK13509         96 SVVINCGSTAFLLGRELCGK-PVQIITNYLPLANYLIDQEHDSVIIMGGQYNKSQSITLSP  155 (251)
T ss_pred             EEEECCcHHHHHHHHHhCCC-CeEEEeCCHHHHHHHHhCCCCEEEEECCeEcCCcceeECH
Confidence            57899999998777766543 333222           13456777777654434444444


No 79 
>COG1349 GlpR Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]
Probab=28.81  E-value=35  Score=26.20  Aligned_cols=51  Identities=14%  Similarity=0.165  Sum_probs=33.8

Q ss_pred             EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecch
Q 043220           54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVK  104 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~  104 (121)
                      .+.+|+|+|.-++-..+....++.+..           .+...+.+.+|..-.....+.|..
T Consensus        94 ~ifld~GTT~~~la~~L~~~~~ltviTNsl~ia~~l~~~~~~~vi~~GG~~~~~~~~~~G~~  155 (253)
T COG1349          94 TIFLDAGTTTLALARALPDDNNLTVITNSLNIAAALLEKPNIEVILLGGTVRKKSGSFVGPL  155 (253)
T ss_pred             EEEECCCcHHHHHHHHhCcCCCeEEEeCCHHHHHHHHhCCCCeEEEeCcEEEcCCCeEEcHH
Confidence            468999999999888888665544322           134567677776655555555553


No 80 
>PLN03146 aspartyl protease family protein; Provisional
Probab=28.79  E-value=59  Score=26.89  Aligned_cols=17  Identities=29%  Similarity=0.520  Sum_probs=15.2

Q ss_pred             EEEecCCCcccccCHHH
Q 043220           54 VILISSGSTANFVSDKV   70 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~   70 (121)
                      .++||||+|..++.+..
T Consensus       309 ~~iiDSGTt~t~Lp~~~  325 (431)
T PLN03146        309 NIIIDSGTTLTLLPSDF  325 (431)
T ss_pred             cEEEeCCccceecCHHH
Confidence            47999999999999985


No 81 
>PLN03146 aspartyl protease family protein; Provisional
Probab=27.60  E-value=78  Score=26.18  Aligned_cols=27  Identities=11%  Similarity=0.145  Sum_probs=22.4

Q ss_pred             eEEEEEEEC--CEEEEEEecCCCcccccC
Q 043220           41 FCVVATKIR--FHEVVILISSGSTANFVS   67 (121)
Q Consensus        41 ~~~v~g~i~--~~~v~~LIDSGAT~nFI~   67 (121)
                      ...+...|+  .+++.+++||||...++.
T Consensus        84 ~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~  112 (431)
T PLN03146         84 EYLMNISIGTPPVPILAIADTGSDLIWTQ  112 (431)
T ss_pred             cEEEEEEcCCCCceEEEEECCCCCcceEc
Confidence            566777887  478899999999999985


No 82 
>PF08922 DUF1905:  Domain of unknown function (DUF1905);  InterPro: IPR015018 This family consist of hypothetical bacterial proteins. ; PDB: 2D9R_A.
Probab=26.93  E-value=1.1e+02  Score=19.26  Aligned_cols=45  Identities=18%  Similarity=0.007  Sum_probs=24.8

Q ss_pred             CeEEEEEEECCEEEEE-EecC--CCcccccCHHHHhhcCCCCCCCCCeEE
Q 043220           40 KFCVVATKIRFHEVVI-LISS--GSTANFVSDKVAALLPLPMMPTKPFSV   86 (121)
Q Consensus        40 ~~~~v~g~i~~~~v~~-LIDS--GAT~nFI~~~~a~rl~l~~~~~~~~~V   86 (121)
                      +.++|.+.|++++... |+=.  |.-.-.++..+.+..++  .....+.|
T Consensus        31 g~v~V~~tI~g~~~~~sl~p~g~G~~~Lpv~~~vRk~~g~--~~Gd~V~v   78 (80)
T PF08922_consen   31 GRVPVRGTIDGHPWRTSLFPMGNGGYILPVKAAVRKAIGK--EAGDTVEV   78 (80)
T ss_dssp             S-EEEEEEETTEEEEEEEEESSTT-EEEEE-HHHHHHHT----TTSEEEE
T ss_pred             CceEEEEEECCEEEEEEEEECCCCCEEEEEcHHHHHHcCC--CCCCEEEE
Confidence            7899999999988653 3333  44444555666665554  33444444


No 83 
>PF05618 Zn_protease:  Putative ATP-dependant zinc protease;  InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=26.82  E-value=50  Score=23.28  Aligned_cols=23  Identities=9%  Similarity=0.024  Sum_probs=15.4

Q ss_pred             CEEEEEEecCCCcccccCHHHHh
Q 043220           50 FHEVVILISSGSTANFVSDKVAA   72 (121)
Q Consensus        50 ~~~v~~LIDSGAT~nFI~~~~a~   72 (121)
                      +..+.|=|||||..|-|+..=.+
T Consensus        14 ~~~~~aKiDTGA~tSSLhA~~I~   36 (138)
T PF05618_consen   14 GLTIKAKIDTGAKTSSLHATDIE   36 (138)
T ss_dssp             TEEEEEEE-TT-SSEEEE-EEEE
T ss_pred             CCEEEEEEcCCCcccceeecceE
Confidence            44578899999999999765544


No 84 
>COG2861 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.41  E-value=55  Score=25.68  Aligned_cols=28  Identities=25%  Similarity=0.475  Sum_probs=24.9

Q ss_pred             EEEEEecCCCcccccCHHHHhhcCCCCC
Q 043220           52 EVVILISSGSTANFVSDKVAALLPLPMM   79 (121)
Q Consensus        52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~   79 (121)
                      .-..+||||.+.+=+..++|+++++|..
T Consensus       150 r~l~flDs~T~a~S~a~~iAk~~gVp~~  177 (250)
T COG2861         150 RGLYFLDSGTIANSLAGKIAKEIGVPVI  177 (250)
T ss_pred             CCeEEEcccccccchhhhhHhhcCCcee
Confidence            4568999999999999999999998864


No 85 
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability 
Probab=25.74  E-value=70  Score=25.74  Aligned_cols=20  Identities=5%  Similarity=0.004  Sum_probs=15.7

Q ss_pred             EEEecCCCcccccCHHHHhh
Q 043220           54 VILISSGSTANFVSDKVAAL   73 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~r   73 (121)
                      .++||||.+..++...+.+.
T Consensus       231 g~iiDSGTs~t~lp~~~y~~  250 (362)
T cd05489         231 GVKLSTVVPYTVLRSDIYRA  250 (362)
T ss_pred             cEEEecCCceEEECHHHHHH
Confidence            48999999999977765443


No 86 
>PRK12440 acetate kinase; Reviewed
Probab=25.57  E-value=31  Score=28.75  Aligned_cols=44  Identities=30%  Similarity=0.474  Sum_probs=35.0

Q ss_pred             CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------ceEeecc
Q 043220           60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------PDVALDV  103 (121)
Q Consensus        60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------~dvILG~  103 (121)
                      |-+|-|+..++++.++.+....+-+.+.++||..+..      .|.-.|+
T Consensus       180 GlS~~~v~~~~a~~lg~~~~~~~~Iv~HLG~G~Si~Ai~~GksvDtsmG~  229 (397)
T PRK12440        180 GTSHYFVSREAAKMLNKPIEESSFISVHLGNGASVCAIKNGQSVDTSMGF  229 (397)
T ss_pred             HHhHHHHHHHHHHHhCCChHHcCEEEEEeCCCcEeeeeeCCEEEEcCCCC
Confidence            5688889999999888777777778899999988754      6666665


No 87 
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates.  This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP).   This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=24.47  E-value=1.4e+02  Score=18.44  Aligned_cols=38  Identities=16%  Similarity=-0.052  Sum_probs=32.8

Q ss_pred             EEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC
Q 043220           42 CVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM   79 (121)
Q Consensus        42 ~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~   79 (121)
                      |++..+.+|+...+=++..+|..=+-+.+..+.++|..
T Consensus         1 ~~i~vk~~g~~~~v~v~~~~Tv~~lK~~i~~~tgvp~~   38 (74)
T cd01813           1 VPVIVKWGGQEYSVTTLSEDTVLDLKQFIKTLTGVLPE   38 (74)
T ss_pred             CEEEEEECCEEEEEEECCCCCHHHHHHHHHHHHCCCHH
Confidence            46788889999988999999999999999999887764


No 88 
>smart00295 B41 Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Probab=23.98  E-value=2.1e+02  Score=20.13  Aligned_cols=30  Identities=17%  Similarity=0.089  Sum_probs=27.0

Q ss_pred             CCEEEEEEecCCCcccccCHHHHhhcCCCC
Q 043220           49 RFHEVVILISSGSTANFVSDKVAALLPLPM   78 (121)
Q Consensus        49 ~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~   78 (121)
                      +|....+-||+-.|..-+-..+++++|++.
T Consensus        12 dg~~~~~~~~~~~t~~ev~~~v~~~~~l~~   41 (207)
T smart00295       12 DGTTLEFEVDSSTTAEELLETVCRKLGIRE   41 (207)
T ss_pred             CCCEEEEEECCCCCHHHHHHHHHHHhCCCc
Confidence            677888999999999999999999999954


No 89 
>PF14541 TAXi_C:  Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=23.69  E-value=50  Score=23.11  Aligned_cols=19  Identities=26%  Similarity=0.377  Sum_probs=14.3

Q ss_pred             EEEecCCCcccccCHHHHh
Q 043220           54 VILISSGSTANFVSDKVAA   72 (121)
Q Consensus        54 ~~LIDSGAT~nFI~~~~a~   72 (121)
                      .+.||||++..++.+.+=.
T Consensus        31 ~~iiDSGT~~T~L~~~~y~   49 (161)
T PF14541_consen   31 GTIIDSGTTYTYLPPPVYD   49 (161)
T ss_dssp             SEEE-SSSSSEEEEHHHHH
T ss_pred             CEEEECCCCccCCcHHHHH
Confidence            3679999999999877643


No 90 
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster.  Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=23.64  E-value=1.3e+02  Score=19.27  Aligned_cols=46  Identities=9%  Similarity=0.091  Sum_probs=35.7

Q ss_pred             EEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEec
Q 043220           42 CVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIA   89 (121)
Q Consensus        42 ~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va   89 (121)
                      |++++..|+--+.+-+.+..+..=+-.++++++++.  ...++.++--
T Consensus         1 ~~vK~~~~~d~~r~~l~~~~~~~~L~~~i~~r~~~~--~~~~f~LkY~   46 (82)
T cd06407           1 VRVKATYGEEKIRFRLPPSWGFTELKQEIAKRFKLD--DMSAFDLKYL   46 (82)
T ss_pred             CEEEEEeCCeEEEEEcCCCCCHHHHHHHHHHHhCCC--CCCeeEEEEE
Confidence            578888999989999999988888899999999864  1235555543


No 91 
>PF04746 DUF575:  Protein of unknown function (DUF575);  InterPro: IPR006835 This represents a conserved region found in a number of Chlamydophila pneumoniae proteins.
Probab=23.30  E-value=38  Score=22.76  Aligned_cols=13  Identities=8%  Similarity=0.442  Sum_probs=10.5

Q ss_pred             ceEeecchhhhhc
Q 043220           97 PDVALDVKWSEQI  109 (121)
Q Consensus        97 ~dvILG~~WL~~~  109 (121)
                      -.++.|++||-.+
T Consensus        27 Ghiv~GieWLvS~   39 (101)
T PF04746_consen   27 GHIVMGIEWLVSR   39 (101)
T ss_pred             ceEEeehHHHHHH
Confidence            4678999999765


No 92 
>PF00564 PB1:  PB1 domain;  InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=23.22  E-value=1.8e+02  Score=17.56  Aligned_cols=46  Identities=13%  Similarity=0.078  Sum_probs=35.4

Q ss_pred             eEEEEEEECCEEEE-EEecCCCcccccCHHHHhhcCCCCCCCCCeEEEec
Q 043220           41 FCVVATKIRFHEVV-ILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIA   89 (121)
Q Consensus        41 ~~~v~g~i~~~~v~-~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va   89 (121)
                      |++++...++.... +-+.++.+..-+-..++.+++++   ...+.++-.
T Consensus         1 t~~vK~~~~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~---~~~~~l~Y~   47 (84)
T PF00564_consen    1 TVRVKVRYGGDIRRIISLPSDVSFDDLRSKIREKFGLL---DEDFQLKYK   47 (84)
T ss_dssp             SEEEEEEETTEEEEEEEECSTSHHHHHHHHHHHHHTTS---TSSEEEEEE
T ss_pred             CEEEEEEECCeeEEEEEcCCCCCHHHHHHHHHHHhCCC---CccEEEEee
Confidence            57888889888777 78888888888889999999887   344554443


No 93 
>PRK12379 propionate/acetate kinase; Provisional
Probab=22.53  E-value=37  Score=28.29  Aligned_cols=45  Identities=22%  Similarity=0.386  Sum_probs=33.9

Q ss_pred             CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------ceEeecch
Q 043220           60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------PDVALDVK  104 (121)
Q Consensus        60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------~dvILG~~  104 (121)
                      |-+|-||..++++.++.+....+-+.+.+++|..+..      .|.-.|+.
T Consensus       176 GlS~~~va~~~a~~lg~~~~~~~lIv~HLG~G~Si~Ai~~GksvDtsmG~t  226 (396)
T PRK12379        176 GTSHRYVSQRAHSLLNLDEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMT  226 (396)
T ss_pred             HHHHHHHHHHHHHHhCCChhHCCEEEEEeCCCcchheeeCCEEEEeCCCCC
Confidence            5577788888888887776666778889999987754      77766664


No 94 
>PF03539 Spuma_A9PTase:  Spumavirus aspartic protease (A9);  InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=22.12  E-value=83  Score=22.99  Aligned_cols=26  Identities=12%  Similarity=0.079  Sum_probs=20.4

Q ss_pred             ECCEEEEEEecCCCcccccCHHHHhh
Q 043220           48 IRFHEVVILISSGSTANFVSDKVAAL   73 (121)
Q Consensus        48 i~~~~v~~LIDSGAT~nFI~~~~a~r   73 (121)
                      |.|..+.+--||||+...|.+.+...
T Consensus         1 ikg~~l~~~wDsga~ITCiP~~fl~~   26 (163)
T PF03539_consen    1 IKGTKLKGHWDSGAQITCIPESFLEE   26 (163)
T ss_dssp             ETTEEEEEEE-TT-SSEEEEGGGTTT
T ss_pred             CCCceeeEEecCCCeEEEccHHHhCc
Confidence            46788899999999999999988664


No 95 
>cd01812 BAG1_N Ubiquitin-like domain of BAG1. BAG1_N  N-terminal ubiquitin-like (Ubl) domain of the BAG1 protein.  This domain occurs together with the BAG domain and is closely related to the Ubl domain of a family of deubiquitinases that includes Rpn11, UBP6 (USP14), USP7 (HAUSP).
Probab=20.67  E-value=1.8e+02  Score=16.95  Aligned_cols=38  Identities=18%  Similarity=-0.023  Sum_probs=30.1

Q ss_pred             EEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC
Q 043220           42 CVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM   79 (121)
Q Consensus        42 ~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~   79 (121)
                      |++..+-+|+...+-++..+|..=+-.+++.+.|++..
T Consensus         1 i~i~vk~~g~~~~i~v~~~~tv~~lK~~i~~~~gi~~~   38 (71)
T cd01812           1 IRVRVKHGGESHDLSISSQATFGDLKKMLAPVTGVEPR   38 (71)
T ss_pred             CEEEEEECCEEEEEEECCCCcHHHHHHHHHHhhCCChH
Confidence            35666667888888899999999888999888887653


No 96 
>PRK12397 propionate kinase; Reviewed
Probab=20.62  E-value=38  Score=28.31  Aligned_cols=45  Identities=18%  Similarity=0.415  Sum_probs=33.1

Q ss_pred             CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------ceEeecch
Q 043220           60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------PDVALDVK  104 (121)
Q Consensus        60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------~dvILG~~  104 (121)
                      |-+|-|+..++++.++.+....+-+.+.++||..+..      .|.-.|+.
T Consensus       180 GlS~~yva~~~a~~lg~~~~~~~lIv~HLG~GaSi~Ai~~GksvDtsmG~t  230 (404)
T PRK12397        180 GTSHKYVSGVLAEKLGVPLSALRVICCHLGNGSSICAIKNGRSVNTSMGFT  230 (404)
T ss_pred             HHHHHHHHHHHHHHhCCChhHCCEEEEEeCCCcchheeeCCEEEEcCCCCC
Confidence            4567777777887777666666678889999987754      77777664


No 97 
>COG3865 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.38  E-value=2.4e+02  Score=20.43  Aligned_cols=25  Identities=16%  Similarity=0.309  Sum_probs=20.3

Q ss_pred             CeEEEEEEECCEEEEEEecCCCcccc
Q 043220           40 KFCVVATKIRFHEVVILISSGSTANF   65 (121)
Q Consensus        40 ~~~~v~g~i~~~~v~~LIDSGAT~nF   65 (121)
                      +.+....+++|+.+.+ +|+|..++|
T Consensus        49 ~Vl~a~F~l~g~~f~~-ld~g~~~~f   73 (151)
T COG3865          49 KVLVAEFTLNGQSFMA-LDGGPNTSF   73 (151)
T ss_pred             cEEEEEEEECCeEEEE-EcCCCCcCC
Confidence            4577899999999866 599988776


Done!