Query         045784
Match_columns 90
No_of_seqs    108 out of 747
Neff          8.0 
Searched_HMMs 46136
Date          Fri Mar 29 05:28:52 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/045784.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/045784hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF08284 RVP_2:  Retroviral asp 100.0 9.6E-29 2.1E-33  156.6   9.8   89    1-89     21-119 (135)
  2 cd05479 RP_DDI RP_DDI; retrope  99.9 1.2E-25 2.5E-30  140.4  10.5   88    1-88     16-113 (124)
  3 cd05484 retropepsin_like_LTR_2  99.9 3.6E-22 7.7E-27  118.4   9.3   81    2-84      1-91  (91)
  4 PF09668 Asp_protease:  Asparty  99.8 2.4E-20 5.3E-25  116.5   9.8   88    1-88     24-121 (124)
  5 cd05480 NRIP_C NRIP_C; putativ  99.8 1.8E-18   4E-23  103.8   8.5   85    4-88      1-97  (103)
  6 PF13650 Asp_protease_2:  Aspar  99.8 1.9E-17   4E-22   96.5  10.3   77    4-82      1-90  (90)
  7 TIGR02281 clan_AA_DTGA clan AA  99.7 2.6E-17 5.6E-22  102.5   9.9   86    2-89     12-110 (121)
  8 PF00077 RVP:  Retroviral aspar  99.7 2.7E-17 5.8E-22   98.4   9.4   83    3-88      7-97  (100)
  9 cd05483 retropepsin_like_bacte  99.7 4.5E-16 9.7E-21   91.4   9.6   82    1-84      2-96  (96)
 10 cd06095 RP_RTVL_H_like Retrope  99.7 5.8E-16 1.3E-20   91.0   8.7   79    4-84      1-86  (86)
 11 cd00303 retropepsin_like Retro  99.6 8.8E-15 1.9E-19   82.0   9.8   81    4-84      1-92  (92)
 12 KOG0012 DNA damage inducible p  99.5 2.7E-14 5.8E-19  101.7   6.4   88    1-88    235-332 (380)
 13 PF13975 gag-asp_proteas:  gag-  99.5 6.3E-14 1.4E-18   80.0   6.0   51    1-51      8-69  (72)
 14 PF02160 Peptidase_A3:  Caulifl  99.5   2E-13 4.3E-18   91.2   8.9   79   11-89     19-106 (201)
 15 TIGR03698 clan_AA_DTGF clan AA  99.5 6.8E-13 1.5E-17   81.0   9.7   75   11-88     15-98  (107)
 16 PF12384 Peptidase_A2B:  Ty3 tr  99.5   4E-13 8.7E-18   87.1   9.0   87    1-88     34-131 (177)
 17 cd06094 RP_Saci_like RP_Saci_l  99.3 5.3E-12 1.2E-16   74.6   7.0   74   11-87      8-88  (89)
 18 COG3577 Predicted aspartyl pro  99.3 3.8E-12 8.3E-17   85.1   7.0   85    2-88    106-203 (215)
 19 cd05481 retropepsin_like_LTR_1  99.3   4E-11 8.7E-16   71.5   8.7   74    6-81      3-90  (93)
 20 cd05482 HIV_retropepsin_like R  98.7 1.7E-07 3.6E-12   55.4   8.5   78    5-84      2-87  (87)
 21 COG5550 Predicted aspartyl pro  98.4 1.7E-06 3.6E-11   54.0   6.2   72   13-87     27-107 (125)
 22 PF05585 DUF1758:  Putative pep  98.3   3E-06 6.6E-11   54.7   6.2   25   10-34     10-34  (164)
 23 PF05618 Zn_protease:  Putative  97.6 0.00043 9.4E-09   44.1   6.4   77   10-86     14-124 (138)
 24 PF12382 Peptidase_A2E:  Retrot  97.1  0.0017 3.8E-08   39.5   5.3   63   12-74     47-118 (137)
 25 COG4067 Uncharacterized protei  97.1  0.0022 4.7E-08   41.6   5.9   76   11-87     38-148 (162)
 26 cd05476 pepsin_A_like_plant Ch  96.2   0.043 9.4E-07   37.6   7.5   71   13-87    177-248 (265)
 27 cd06097 Aspergillopepsin_like   95.9  0.0093   2E-07   41.1   3.2   24   11-34    197-220 (278)
 28 PF03539 Spuma_A9PTase:  Spumav  95.4    0.27 5.8E-06   31.9   8.2   73    8-86      1-83  (163)
 29 cd05477 gastricsin Gastricsins  95.4    0.13 2.9E-06   36.1   7.5   75   14-88    203-304 (318)
 30 PF00026 Asp:  Eukaryotic aspar  95.2   0.063 1.4E-06   37.1   5.4   76   12-87    199-302 (317)
 31 cd05478 pepsin_A Pepsin A, asp  95.2    0.16 3.4E-06   35.7   7.4   83    6-88    194-304 (317)
 32 cd06096 Plasmepsin_5 Plasmepsi  95.1   0.044 9.5E-07   38.7   4.4   72   11-87    230-308 (326)
 33 cd05474 SAP_like SAPs, pepsin-  94.9    0.12 2.5E-06   35.7   6.0   77   11-87    177-280 (295)
 34 cd05474 SAP_like SAPs, pepsin-  94.7     0.6 1.3E-05   32.1   9.2   73    3-82      4-80  (295)
 35 cd05486 Cathespin_E Cathepsin   94.6    0.21 4.5E-06   35.1   6.8   75   14-88    200-303 (316)
 36 PF00026 Asp:  Eukaryotic aspar  94.6    0.32   7E-06   33.5   7.7   78    3-83      3-115 (317)
 37 PTZ00147 plasmepsin-1; Provisi  94.5    0.34 7.3E-06   36.3   8.0   77   12-88    332-435 (453)
 38 PTZ00013 plasmepsin 4 (PM4); P  94.4    0.49 1.1E-05   35.5   8.7   76    4-82    141-251 (450)
 39 cd05485 Cathepsin_D_like Cathe  94.4    0.36 7.8E-06   34.2   7.7   76   13-88    211-316 (329)
 40 PTZ00147 plasmepsin-1; Provisi  94.4    0.56 1.2E-05   35.2   8.9   76    4-82    142-252 (453)
 41 cd06098 phytepsin Phytepsin, a  93.8    0.44 9.5E-06   33.5   7.0   22   13-34    211-232 (317)
 42 cd05471 pepsin_like Pepsin-lik  93.3   0.091   2E-06   35.6   2.9   25   10-34    200-224 (283)
 43 cd05488 Proteinase_A_fungi Fun  93.3    0.36 7.7E-06   34.0   6.0   75   13-87    206-306 (320)
 44 cd06097 Aspergillopepsin_like   92.9     1.3 2.7E-05   30.5   8.1   27    3-29      2-30  (278)
 45 cd05487 renin_like Renin stimu  92.5     2.4 5.1E-05   29.9   9.2   26    3-28     10-37  (326)
 46 cd05490 Cathepsin_D2 Cathepsin  92.0    0.67 1.5E-05   32.6   6.0   76   13-88    207-312 (325)
 47 PTZ00013 plasmepsin 4 (PM4); P  91.5     0.8 1.7E-05   34.4   6.1   76   12-87    331-433 (450)
 48 cd05477 gastricsin Gastricsins  91.1       3 6.5E-05   29.2   8.5   26    3-28      5-32  (318)
 49 cd05476 pepsin_A_like_plant Ch  91.0       2 4.2E-05   29.4   7.3   75    3-82      3-88  (265)
 50 cd05473 beta_secretase_like Be  90.8     3.1 6.7E-05   29.8   8.5   27    3-29      5-33  (364)
 51 cd05470 pepsin_retropepsin_lik  90.1    0.31 6.6E-06   28.7   2.4   26    5-30      2-29  (109)
 52 cd05472 cnd41_like Chloroplast  89.6     2.9 6.2E-05   29.0   7.3   78    3-82      3-89  (299)
 53 cd05488 Proteinase_A_fungi Fun  89.4     5.7 0.00012   27.9   8.7   26    3-28     12-39  (320)
 54 cd05486 Cathespin_E Cathepsin   89.4     3.7   8E-05   28.8   7.7   25    4-28      3-29  (316)
 55 cd05490 Cathepsin_D2 Cathepsin  89.2       5 0.00011   28.1   8.4   25    3-27      8-34  (325)
 56 cd06098 phytepsin Phytepsin, a  89.2     5.1 0.00011   28.1   8.4   26    3-28     12-39  (317)
 57 cd05475 nucellin_like Nucellin  88.7     5.9 0.00013   27.2   8.3   41    3-43      4-51  (273)
 58 PTZ00165 aspartyl protease; Pr  88.1     2.8   6E-05   31.8   6.7   23   12-34    327-349 (482)
 59 cd05478 pepsin_A Pepsin A, asp  88.0     5.9 0.00013   27.7   8.0   26    3-28     12-39  (317)
 60 cd05487 renin_like Renin stimu  79.3     8.3 0.00018   27.2   5.8   75   13-87    208-311 (326)
 61 cd05472 cnd41_like Chloroplast  78.3     2.2 4.8E-05   29.5   2.6   21   14-34    173-193 (299)
 62 KOG3048 Molecular chaperone Pr  74.2       2 4.3E-05   27.8   1.3   27    1-29     72-98  (153)
 63 cd05473 beta_secretase_like Be  72.3     4.3 9.2E-05   29.1   2.8   21   14-34    213-233 (364)
 64 PF14543 TAXi_N:  Xylanase inhi  70.8     7.6 0.00017   24.8   3.5   24    3-26      2-27  (164)
 65 cd05485 Cathepsin_D_like Cathe  67.6     9.4  0.0002   27.0   3.7   27    3-29     13-41  (329)
 66 PF13856 Gifsy-2:  ATP-binding   66.1      24 0.00052   20.6   6.7   21    4-24     22-42  (95)
 67 cd06096 Plasmepsin_5 Plasmepsi  66.1      11 0.00024   26.5   3.9   27    3-29      5-33  (326)
 68 PF03419 Peptidase_U4:  Sporula  64.9      12 0.00025   26.4   3.7   33    2-34    158-201 (293)
 69 cd05471 pepsin_like Pepsin-lik  61.7      11 0.00024   25.3   3.1   27    4-30      3-31  (283)
 70 TIGR02854 spore_II_GA sigma-E   61.4      17 0.00036   25.7   4.0   33    2-34    159-202 (288)
 71 PLN03146 aspartyl protease fam  52.5      22 0.00047   26.4   3.5   25    3-27     86-112 (431)
 72 PTZ00165 aspartyl protease; Pr  52.5      22 0.00048   27.0   3.6   27    3-29    122-150 (482)
 73 KOG0679 Actin-related protein   52.3     7.3 0.00016   29.1   1.0   21    8-28    150-170 (426)
 74 PF14541 TAXi_C:  Xylanase inhi  49.2      11 0.00024   23.8   1.4   21   14-34     31-51  (161)
 75 cd05489 xylanase_inhibitor_I_l  48.4      17 0.00036   26.3   2.3   21   14-34    231-251 (362)
 76 KOG4584 Uncharacterized conser  46.9      15 0.00033   26.7   1.9   22   59-80    196-217 (348)
 77 COG0052 RpsB Ribosomal protein  35.1      25 0.00054   24.7   1.4   19   10-28    179-197 (252)
 78 KOG1339 Aspartyl protease [Pos  32.7      35 0.00076   24.9   2.0   22   10-31     57-78  (398)
 79 cd06407 PB1_NLP A PB1 domain i  31.8      66  0.0014   18.4   2.7   34    2-35      1-34  (82)
 80 KOG3101 Esterase D [General fu  31.5      21 0.00046   25.0   0.6   27    8-34    211-237 (283)
 81 PRK05449 aspartate alpha-decar  30.2      76  0.0016   20.0   2.9   39    4-47      6-58  (126)
 82 PF11925 DUF3443:  Protein of u  29.3      66  0.0014   23.9   2.9   26    4-29    246-272 (370)
 83 cd06919 Asp_decarbox Aspartate  28.9      85  0.0018   19.4   2.9   39    4-47      5-57  (111)
 84 TIGR03711 acc_sec_asp3 accesso  28.8      94   0.002   19.8   3.2   51   37-87     33-92  (135)
 85 PF04126 Cyclophil_like:  Cyclo  28.3 1.2E+02  0.0026   18.5   3.6   28    2-34      1-29  (120)
 86 cd06396 PB1_NBR1 The PB1 domai  28.3 1.2E+02  0.0027   17.5   3.4   33    2-34      1-35  (81)
 87 PF00564 PB1:  PB1 domain;  Int  27.9   1E+02  0.0022   16.8   3.1   34    1-34      1-35  (84)
 88 PF04746 DUF575:  Protein of un  27.8      27 0.00058   20.9   0.5   12   75-86     28-39  (101)
 89 PF14612 Ino80_Iec3:  IEC3 subu  27.3      26 0.00055   24.4   0.4    8   81-88    217-224 (232)
 90 PF07341 DUF1473:  Protein of u  25.3      46   0.001   21.6   1.3   29   53-81      5-33  (163)
 91 TIGR00223 panD L-aspartate-alp  25.1      96  0.0021   19.6   2.7   39    4-47      6-58  (126)
 92 PF02261 Asp_decarbox:  Asparta  23.7      89  0.0019   19.4   2.3   39    4-47      6-58  (116)
 93 PF00455 DeoRC:  DeoR C termina  23.4      47   0.001   21.2   1.1   21   14-34     22-42  (161)
 94 KOG3283 40S ribosomal protein   23.1   1E+02  0.0022   20.6   2.6   58    4-62     46-109 (200)
 95 PF08922 DUF1905:  Domain of un  22.6 1.4E+02   0.003   16.8   2.9   13    2-14     33-45  (80)
 96 CHL00067 rps2 ribosomal protei  22.5      57  0.0012   22.4   1.5   18   10-27    184-201 (230)
 97 PRK05299 rpsB 30S ribosomal pr  22.3      57  0.0012   22.8   1.4   19   10-28    180-198 (258)
 98 PRK15423 hypoxanthine phosphor  21.9      46 0.00099   21.8   0.9   26    8-33     90-118 (178)
 99 PF12264 Waikav_capsid_1:  Waik  20.6 1.6E+02  0.0034   19.5   3.1   25   16-40    141-165 (197)

No 1  
>PF08284 RVP_2:  Retroviral aspartyl protease;  InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases. 
Probab=99.96  E-value=9.6e-29  Score=156.56  Aligned_cols=89  Identities=33%  Similarity=0.567  Sum_probs=79.4

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHHhhhee---------eEEe-cCCCeeeeeceeeeeEEEEeceeEEeeEEEcC
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKL---------NVMV-ASGERFSSLGTCLQTLIKLQGVPLIVDFYLLP   70 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l---------~v~~-a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~   70 (90)
                      +|+....|+++|+.+||||||||||||+++|+++         .+.+ ++|+...+.+.|..+++.++++.+..+|.|++
T Consensus        21 vi~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~V~~~g~~~~~~~~~~~~~~~i~g~~~~~dl~vl~  100 (135)
T PF08284_consen   21 VITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIVVSAPGGSINCEGVCPDVPLSIQGHEFVVDLLVLD  100 (135)
T ss_pred             eEEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeEEecccccccccceeeeEEEEECCeEEEeeeEEec
Confidence            3677889999999999999999999999999998         2222 45666777888999999999999999999999


Q ss_pred             CCCceEEeCcchhhhcCCC
Q 045784           71 FEGYDVVLDTQWLSTFGLI   89 (90)
Q Consensus        71 ~~~~dvILG~~wL~~~~p~   89 (90)
                      +.++|+|||||||++|+|+
T Consensus       101 l~~~DvILGm~WL~~~~~~  119 (135)
T PF08284_consen  101 LGGYDVILGMDWLKKHNPV  119 (135)
T ss_pred             ccceeeEeccchHHhCCCE
Confidence            9999999999999999984


No 2  
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.93  E-value=1.2e-25  Score=140.39  Aligned_cols=88  Identities=16%  Similarity=0.144  Sum_probs=77.8

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHHhhheee----------EEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcC
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKLN----------VMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLP   70 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l~----------v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~   70 (90)
                      .|+++++|||+++.+|+||||++|||++++|+++.          +.+++++.....+.+..+++++++..+..+|.|+|
T Consensus        16 ~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~i~~~~~~~~~~Vl~   95 (124)
T cd05479          16 MLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVKIGNLFLPCSFTVLE   95 (124)
T ss_pred             EEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEEECCEEeeeEEEEEC
Confidence            37899999999999999999999999999999982          34455344566778888899999999999999999


Q ss_pred             CCCceEEeCcchhhhcCC
Q 045784           71 FEGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        71 ~~~~dvILG~~wL~~~~p   88 (90)
                      ..++|+|||||||++++.
T Consensus        96 ~~~~d~ILG~d~L~~~~~  113 (124)
T cd05479          96 DDDVDFLIGLDMLKRHQC  113 (124)
T ss_pred             CCCcCEEecHHHHHhCCe
Confidence            999999999999999985


No 3  
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.88  E-value=3.6e-22  Score=118.36  Aligned_cols=81  Identities=21%  Similarity=0.296  Sum_probs=75.8

Q ss_pred             EEEEEEECCEEEEEEEcCCCCcccccHHhhhee----------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCC
Q 045784            2 IRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKL----------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPF   71 (90)
Q Consensus         2 ~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l----------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~   71 (90)
                      +++++.|||+++.+|+||||++|+|+++.++++          .++.|||+.+.+.|.+ .+++.+++..+..+|+|++.
T Consensus         1 ~~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G~~~~~~G~~-~~~v~~~~~~~~~~~~v~~~   79 (91)
T cd05484           1 KTVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATGTKLSVLGQI-LVTVKYGGKTKVLTLYVVKN   79 (91)
T ss_pred             CEEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCCCEeeEeEEE-EEEEEECCEEEEEEEEEEEC
Confidence            368999999999999999999999999999877          7889999999999988 78999999999999999999


Q ss_pred             CCceEEeCcchhh
Q 045784           72 EGYDVVLDTQWLS   84 (90)
Q Consensus        72 ~~~dvILG~~wL~   84 (90)
                      . ++.|||++||.
T Consensus        80 ~-~~~lLG~~wl~   91 (91)
T cd05484          80 E-GLNLLGRDWLD   91 (91)
T ss_pred             C-CCCccChhhcC
Confidence            8 99999999985


No 4  
>PF09668 Asp_protease:  Aspartyl protease;  InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure.  This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=99.84  E-value=2.4e-20  Score=116.51  Aligned_cols=88  Identities=18%  Similarity=0.240  Sum_probs=70.8

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHHhhheeeE---------EecCC-CeeeeeceeeeeEEEEeceeEEeeEEEcC
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKLNV---------MVASG-ERFSSLGTCLQTLIKLQGVPLIVDFYLLP   70 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l~v---------~~a~G-~~~~~~~~~~~~~i~i~~~~~~~~~~v~~   70 (90)
                      +++|++++||++++|++||||.+|.||+++|+|+++         ..+-| +.....|+++.+++.+++..+++.|.|++
T Consensus        24 mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~G~Ih~~~l~ig~~~~~~s~~Vle  103 (124)
T PF09668_consen   24 MLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKILGRIHSVQLKIGGLFFPCSFTVLE  103 (124)
T ss_dssp             --EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEEEEEEEEEEEETTEEEEEEEEEET
T ss_pred             eEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCceeEEEEEEEEEECCEEEEEEEEEeC
Confidence            368999999999999999999999999999999811         11222 44567789999999999999999999999


Q ss_pred             CCCceEEeCcchhhhcCC
Q 045784           71 FEGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        71 ~~~~dvILG~~wL~~~~p   88 (90)
                      -...|++||.|||++|+-
T Consensus       104 ~~~~d~llGld~L~~~~c  121 (124)
T PF09668_consen  104 DQDVDLLLGLDMLKRHKC  121 (124)
T ss_dssp             TSSSSEEEEHHHHHHTT-
T ss_pred             CCCcceeeeHHHHHHhCc
Confidence            889999999999999974


No 5  
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The  C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.78  E-value=1.8e-18  Score=103.77  Aligned_cols=85  Identities=14%  Similarity=0.135  Sum_probs=75.0

Q ss_pred             EEEEECCEEEEEEEcCCCCcccccHHhhheeeEE----------ecC--CCeeeeeceeeeeEEEEeceeEEeeEEEcCC
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNFVRQELAVKLNVM----------VAS--GERFSSLGTCLQTLIKLQGVPLIVDFYLLPF   71 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l~v~----------~a~--G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~   71 (90)
                      +.+++||+|++|++||||.++.||+++|+|+.+.          .|-  |++..+.|+++.+++.+++..+++.|.|++.
T Consensus         1 vnCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~~~~~CSftVld~   80 (103)
T cd05480           1 VSCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQLTVECSAQVVDD   80 (103)
T ss_pred             CceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCCEEeeEEEEEEcC
Confidence            4689999999999999999999999999999221          122  4456788999999999999999999999999


Q ss_pred             CCceEEeCcchhhhcCC
Q 045784           72 EGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        72 ~~~dvILG~~wL~~~~p   88 (90)
                      ...|++||.|-|++|+-
T Consensus        81 ~~~d~llGLdmLkrhqc   97 (103)
T cd05480          81 NEKNFSLGLQTLKSLKC   97 (103)
T ss_pred             CCcceEeeHHHHhhcce
Confidence            99999999999999973


No 6  
>PF13650 Asp_protease_2:  Aspartyl protease
Probab=99.76  E-value=1.9e-17  Score=96.46  Aligned_cols=77  Identities=27%  Similarity=0.304  Sum_probs=64.4

Q ss_pred             EEEEECCEEEEEEEcCCCCcccccHHhhhee-----------eEEecCCCeeeeeceeeeeEEEEeceeE-EeeEEEcC-
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNFVRQELAVKL-----------NVMVASGERFSSLGTCLQTLIKLQGVPL-IVDFYLLP-   70 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l-----------~v~~a~G~~~~~~~~~~~~~i~i~~~~~-~~~~~v~~-   70 (90)
                      |++++||+++.+|+||||+.++|++++++++           .+..++|+.....+...  .+++++..+ ..++.+.+ 
T Consensus         1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~--~i~ig~~~~~~~~~~v~~~   78 (90)
T PF13650_consen    1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKSVPISVSGAGGSVTVYRGRVD--SITIGGITLKNVPFLVVDL   78 (90)
T ss_pred             CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCceeEEEEeCCCCEEEEEEEEE--EEEECCEEEEeEEEEEECC
Confidence            5789999999999999999999999999998           34567787444344443  688999887 57999999 


Q ss_pred             CCCceEEeCcch
Q 045784           71 FEGYDVVLDTQW   82 (90)
Q Consensus        71 ~~~~dvILG~~w   82 (90)
                      ...+|.|||+||
T Consensus        79 ~~~~~~iLG~df   90 (90)
T PF13650_consen   79 GDPIDGILGMDF   90 (90)
T ss_pred             CCCCEEEeCCcC
Confidence            667999999998


No 7  
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.74  E-value=2.6e-17  Score=102.48  Aligned_cols=86  Identities=20%  Similarity=0.243  Sum_probs=70.1

Q ss_pred             EEEEEEECCEEEEEEEcCCCCcccccHHhhhee-----------eEEecCCCeeeeeceeeeeEEEEeceeEE-eeEEEc
Q 045784            2 IRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKL-----------NVMVASGERFSSLGTCLQTLIKLQGVPLI-VDFYLL   69 (90)
Q Consensus         2 ~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l-----------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~-~~~~v~   69 (90)
                      ++++++|||+++.+|+||||++++|++++|+++           .+..|||......  ..--.+++++..+. .++.|+
T Consensus        12 ~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~~~~~~~ta~G~~~~~~--~~l~~l~iG~~~~~nv~~~v~   89 (121)
T TIGR02281        12 FYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLGYTVTVSTANGQIKAAR--VTLDRVAIGGIVVNDVDAMVA   89 (121)
T ss_pred             EEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCCceEEEEeCCCcEEEEE--EEeCEEEECCEEEeCcEEEEe
Confidence            789999999999999999999999999999999           3456677643222  22235889999877 789999


Q ss_pred             CCCC-ceEEeCcchhhhcCCC
Q 045784           70 PFEG-YDVVLDTQWLSTFGLI   89 (90)
Q Consensus        70 ~~~~-~dvILG~~wL~~~~p~   89 (90)
                      |... .+.+|||+||+++.++
T Consensus        90 ~~~~~~~~LLGm~fL~~~~~~  110 (121)
T TIGR02281        90 EGGALSESLLGMSFLNRLSRF  110 (121)
T ss_pred             CCCcCCceEcCHHHHhccccE
Confidence            9874 5899999999998754


No 8  
>PF00077 RVP:  Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026;  InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=99.74  E-value=2.7e-17  Score=98.38  Aligned_cols=83  Identities=20%  Similarity=0.301  Sum_probs=72.4

Q ss_pred             EEEEEECCEEEEEEEcCCCCcccccHHhhhee--------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCc
Q 045784            3 RVIGRLVHKVVTVLIDSRRTHNFVRQELAVKL--------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGY   74 (90)
Q Consensus         3 ~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l--------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~   74 (90)
                      +++.++||+++.|||||||+.|+|+++.+.+.        .+..++|.. ...+.. .+++.+++..+...|+|.+....
T Consensus         7 ~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~~~~~~~v~~~~g~~-~~~~~~-~~~v~~~~~~~~~~~~v~~~~~~   84 (100)
T PF00077_consen    7 YITVKINGKKIKALLDTGADVSIISEKDWKKLGPPPKTSITVRGAGGSS-SILGST-TVEVKIGGKEFNHTFLVVPDLPM   84 (100)
T ss_dssp             EEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSEEEEEEEEEETTEEE-EEEEEE-EEEEEETTEEEEEEEEESSTCSS
T ss_pred             eEEEeECCEEEEEEEecCCCcceecccccccccccccCCceeccCCCcc-eeeeEE-EEEEEEECccceEEEEecCCCCC
Confidence            57899999999999999999999999999877        566677776 555554 68899999999999999998778


Q ss_pred             eEEeCcchhhhcCC
Q 045784           75 DVVLDTQWLSTFGL   88 (90)
Q Consensus        75 dvILG~~wL~~~~p   88 (90)
                      | |||.|||++++.
T Consensus        85 ~-ILG~D~L~~~~~   97 (100)
T PF00077_consen   85 N-ILGRDFLKKLNA   97 (100)
T ss_dssp             E-EEEHHHHTTTTC
T ss_pred             C-EeChhHHHHcCC
Confidence            8 999999999875


No 9  
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.69  E-value=4.5e-16  Score=91.43  Aligned_cols=82  Identities=23%  Similarity=0.297  Sum_probs=68.2

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHHhhhee----------eEEecCCCeeeeeceeeeeEEEEeceeEE-eeEEEc
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKL----------NVMVASGERFSSLGTCLQTLIKLQGVPLI-VDFYLL   69 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l----------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~-~~~~v~   69 (90)
                      .+.+++++|++++.+|+||||+.++|++++++++          .+..++|........  --++.+++..+. ..+.++
T Consensus         2 ~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~~~~~~~G~~~~~~~~--~~~i~ig~~~~~~~~~~v~   79 (96)
T cd05483           2 HFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKVTVQTANGRVRAARVR--LDSLQIGGITLRNVPAVVL   79 (96)
T ss_pred             cEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcEEEEecCCCccceEEE--cceEEECCcEEeccEEEEe
Confidence            3789999999999999999999999999999976          455566765544433  346889999876 689999


Q ss_pred             CCCC--ceEEeCcchhh
Q 045784           70 PFEG--YDVVLDTQWLS   84 (90)
Q Consensus        70 ~~~~--~dvILG~~wL~   84 (90)
                      +...  .|.|||+|||+
T Consensus        80 d~~~~~~~gIlG~d~l~   96 (96)
T cd05483          80 PGDALGVDGLLGMDFLR   96 (96)
T ss_pred             CCcccCCceEeChHHhC
Confidence            9987  99999999985


No 10 
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where 
Probab=99.67  E-value=5.8e-16  Score=91.02  Aligned_cols=79  Identities=24%  Similarity=0.305  Sum_probs=62.5

Q ss_pred             EEEEECCEEEEEEEcCCCCcccccHHhhhee-------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCceE
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNFVRQELAVKL-------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGYDV   76 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l-------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~dv   76 (90)
                      +.++|||+|+.+|+||||+.+.|+++.++++       .+..++|..........+ .+.+++++....+.+.+- ..+.
T Consensus         1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~~~~~~~~~v~gagG~~~~~v~~~~~-~v~vg~~~~~~~~~v~~~-~~~~   78 (86)
T cd06095           1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQELSTTSVLIRGVSGQSQQPVTTYRT-LVDLGGHTVSHSFLVVPN-CPDP   78 (86)
T ss_pred             CEEEECCEEEEEEEECCCCeEEECHHHhhhccCCCCcEEEEeCCCcccccEEEeee-EEEECCEEEEEEEEEEcC-CCCc
Confidence            4689999999999999999999999999984       677788865111111111 588999999988988884 3699


Q ss_pred             EeCcchhh
Q 045784           77 VLDTQWLS   84 (90)
Q Consensus        77 ILG~~wL~   84 (90)
                      |||||||+
T Consensus        79 lLG~dfL~   86 (86)
T cd06095          79 LLGRDLLS   86 (86)
T ss_pred             EechhhcC
Confidence            99999985


No 11 
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=99.63  E-value=8.8e-15  Score=81.96  Aligned_cols=81  Identities=37%  Similarity=0.619  Sum_probs=67.1

Q ss_pred             EEEEECCEEEEEEEcCCCCcccccHHhhhee-----------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCC
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNFVRQELAVKL-----------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFE   72 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l-----------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~   72 (90)
                      +.+.++|+++.+|+|+||++++++.++++++           .+..++|......+.+..+.+.+++..+...|.+.+..
T Consensus         1 ~~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~   80 (92)
T cd00303           1 LKGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLKVKGANGSSVKTLGVILPVTIGIGGKTFTVDFYVLDLL   80 (92)
T ss_pred             CEEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceEEEecCCCEeccCcEEEEEEEEeCCEEEEEEEEEEcCC
Confidence            3577899999999999999999999999875           33455565555444445678888899999999999999


Q ss_pred             CceEEeCcchhh
Q 045784           73 GYDVVLDTQWLS   84 (90)
Q Consensus        73 ~~dvILG~~wL~   84 (90)
                      .+|+|||++||+
T Consensus        81 ~~~~ilG~~~l~   92 (92)
T cd00303          81 SYDVILGRPWLE   92 (92)
T ss_pred             CcCEEecccccC
Confidence            999999999985


No 12 
>KOG0012 consensus DNA damage inducible protein [Replication, recombination and repair]
Probab=99.52  E-value=2.7e-14  Score=101.67  Aligned_cols=88  Identities=18%  Similarity=0.177  Sum_probs=77.4

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHHhhheeeEE---------ecC-CCeeeeeceeeeeEEEEeceeEEeeEEEcC
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKLNVM---------VAS-GERFSSLGTCLQTLIKLQGVPLIVDFYLLP   70 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l~v~---------~a~-G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~   70 (90)
                      +++|.++|||+|++|++||||..+.||.+||+|+.+.         .|- .....+.|+++.+.+++++..+.+.|.|++
T Consensus       235 ML~iN~~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~c~ftV~d  314 (380)
T KOG0012|consen  235 MLYINCEINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLPCSFTVLD  314 (380)
T ss_pred             EEEEEEEECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeeccceEEec
Confidence            3689999999999999999999999999999999221         122 235677889999999999999999999999


Q ss_pred             CCCceEEeCcchhhhcCC
Q 045784           71 FEGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        71 ~~~~dvILG~~wL~~~~p   88 (90)
                      -...|+.||.+-|++|+-
T Consensus       315 ~~~~d~llGLd~Lrr~~c  332 (380)
T KOG0012|consen  315 RRDMDLLLGLDMLRRHQC  332 (380)
T ss_pred             CCCcchhhhHHHHHhccc
Confidence            999999999999999973


No 13 
>PF13975 gag-asp_proteas:  gag-polyprotein putative aspartyl protease
Probab=99.50  E-value=6.3e-14  Score=79.99  Aligned_cols=51  Identities=29%  Similarity=0.395  Sum_probs=45.7

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHHhhhee-----------eEEecCCCeeeeeceee
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKL-----------NVMVASGERFSSLGTCL   51 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l-----------~v~~a~G~~~~~~~~~~   51 (90)
                      +|++++.|+|+++.+|+||||++|||++++|+++           .+++|||+...+.+...
T Consensus         8 ~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~~~~~v~~a~g~~~~~~g~~~   69 (72)
T PF13975_consen    8 LMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPPSPIRVKLANGSVIEIRGVAE   69 (72)
T ss_pred             EEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCCCCEEEEECCCCccccceEEE
Confidence            4889999999999999999999999999999998           67889998887766554


No 14 
>PF02160 Peptidase_A3:  Cauliflower mosaic virus peptidase (A3);  InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=99.50  E-value=2e-13  Score=91.24  Aligned_cols=79  Identities=13%  Similarity=0.119  Sum_probs=68.5

Q ss_pred             EEEEEEEcCCCCcccccHHhhhe-----e----eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCceEEeCcc
Q 045784           11 KVVTVLIDSRRTHNFVRQELAVK-----L----NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGYDVVLDTQ   81 (90)
Q Consensus        11 ~~~~aLiDsGA~~~~i~~~~a~~-----l----~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~dvILG~~   81 (90)
                      ..+.+++||||+.+.+++...-.     .    .++.||++......++.+..+.++++.|..++.-.--.+.|+|||++
T Consensus        19 ~~~~~~vDTGAt~C~~~~~iiP~e~we~~~~~i~v~~an~~~~~i~~~~~~~~i~I~~~~F~IP~iYq~~~g~d~IlG~N   98 (201)
T PF02160_consen   19 FNYHCYVDTGATICCASKKIIPEEYWEKSKKPIKVKGANGSIIQINKKAKNGKIQIADKIFRIPTIYQQESGIDIILGNN   98 (201)
T ss_pred             EEEEEEEeCCCceEEecCCcCCHHHHHhCCCcEEEEEecCCceEEEEEecCceEEEccEEEeccEEEEecCCCCEEecch
Confidence            66789999999999988776633     2    78899999888889999999999999999887666668999999999


Q ss_pred             hhhhcCCC
Q 045784           82 WLSTFGLI   89 (90)
Q Consensus        82 wL~~~~p~   89 (90)
                      |++.++|.
T Consensus        99 F~r~y~Pf  106 (201)
T PF02160_consen   99 FLRLYEPF  106 (201)
T ss_pred             HHHhcCCc
Confidence            99999984


No 15 
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=99.48  E-value=6.8e-13  Score=81.00  Aligned_cols=75  Identities=21%  Similarity=0.274  Sum_probs=60.9

Q ss_pred             EEEEEEEcCCCCccc-ccHHhhhee--------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCceEEeCcc
Q 045784           11 KVVTVLIDSRRTHNF-VRQELAVKL--------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGYDVVLDTQ   81 (90)
Q Consensus        11 ~~~~aLiDsGA~~~~-i~~~~a~~l--------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~dvILG~~   81 (90)
                      .++.+|+||||+..+ ++++.|+++        .+.+|||+.....-  ....+.+++.+....+.+.+..+ +.+|||.
T Consensus        15 ~~v~~LVDTGat~~~~l~~~~a~~lgl~~~~~~~~~tA~G~~~~~~v--~~~~v~igg~~~~~~v~~~~~~~-~~LLG~~   91 (107)
T TIGR03698        15 MEVRALVDTGFSGFLLVPPDIVNKLGLPELDQRRVYLADGREVLTDV--AKASIIINGLEIDAFVESLGYVD-EPLLGTE   91 (107)
T ss_pred             eEEEEEEECCCCeEEecCHHHHHHcCCCcccCcEEEecCCcEEEEEE--EEEEEEECCEEEEEEEEecCCCC-ccEecHH
Confidence            489999999999998 999999999        77889997655442  34578889988766666666655 8999999


Q ss_pred             hhhhcCC
Q 045784           82 WLSTFGL   88 (90)
Q Consensus        82 wL~~~~p   88 (90)
                      ||++++.
T Consensus        92 ~L~~l~l   98 (107)
T TIGR03698        92 LLEGLGI   98 (107)
T ss_pred             HHhhCCE
Confidence            9999874


No 16 
>PF12384 Peptidase_A2B:  Ty3 transposon peptidase;  InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=99.48  E-value=4e-13  Score=87.14  Aligned_cols=87  Identities=16%  Similarity=0.197  Sum_probs=71.4

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHHhhheeeEEec----------C-CCeeeeeceeeeeEEEEeceeEEeeEEEc
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKLNVMVA----------S-GERFSSLGTCLQTLIKLQGVPLIVDFYLL   69 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l~v~~a----------~-G~~~~~~~~~~~~~i~i~~~~~~~~~~v~   69 (90)
                      |..+.-.++|.++.+|+||||-.|||+.+.+++|.+...          . +....+.. ...+++.+++..+...++|+
T Consensus        34 T~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~vs~~~~~tsE-Av~ld~~i~n~~i~i~aYV~  112 (177)
T PF12384_consen   34 TAIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGFVSGESATTSE-AVTLDFYIDNKLIDIAAYVT  112 (177)
T ss_pred             EEEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeeeccCCceEEEE-eEEEEEEECCeEEEEEEEEe
Confidence            346778899999999999999999999999999933322          2 23333333 34688999999999999999


Q ss_pred             CCCCceEEeCcchhhhcCC
Q 045784           70 PFEGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        70 ~~~~~dvILG~~wL~~~~p   88 (90)
                      +..++|+|+|.|.|++|..
T Consensus       113 d~m~~dlIIGnPiL~ryp~  131 (177)
T PF12384_consen  113 DNMDHDLIIGNPILDRYPT  131 (177)
T ss_pred             ccCCcceEeccHHHhhhHH
Confidence            9999999999999999864


No 17 
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=99.35  E-value=5.3e-12  Score=74.64  Aligned_cols=74  Identities=18%  Similarity=0.304  Sum_probs=63.8

Q ss_pred             EEEEEEEcCCCCcccccHHhhhee------eEEecCCCeeeeeceeeeeEEEEece-eEEeeEEEcCCCCceEEeCcchh
Q 045784           11 KVVTVLIDSRRTHNFVRQELAVKL------NVMVASGERFSSLGTCLQTLIKLQGV-PLIVDFYLLPFEGYDVVLDTQWL   83 (90)
Q Consensus        11 ~~~~aLiDsGA~~~~i~~~~a~~l------~v~~a~G~~~~~~~~~~~~~i~i~~~-~~~~~~~v~~~~~~dvILG~~wL   83 (90)
                      -...+|+||||.+|.|.+...++.      .++.|||+.+.+.|.. .+.+.+|.. ++.-.|.|.+..  ..|||.|||
T Consensus         8 s~~~fLVDTGA~vSviP~~~~~~~~~~~~~~l~AANgt~I~tyG~~-~l~ldlGlrr~~~w~FvvAdv~--~pIlGaDfL   84 (89)
T cd06094           8 SGLRFLVDTGAAVSVLPASSTKKSLKPSPLTLQAANGTPIATYGTR-SLTLDLGLRRPFAWNFVVADVP--HPILGADFL   84 (89)
T ss_pred             CCcEEEEeCCCceEeeccccccccccCCceEEEeCCCCeEeeeeeE-EEEEEcCCCcEEeEEEEEcCCC--cceecHHHH
Confidence            346899999999999999998883      7889999999999865 678888875 799999998875  479999999


Q ss_pred             hhcC
Q 045784           84 STFG   87 (90)
Q Consensus        84 ~~~~   87 (90)
                      ++|+
T Consensus        85 ~~~~   88 (89)
T cd06094          85 QHYG   88 (89)
T ss_pred             HHcC
Confidence            9986


No 18 
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.34  E-value=3.8e-12  Score=85.12  Aligned_cols=85  Identities=22%  Similarity=0.253  Sum_probs=70.3

Q ss_pred             EEEEEEECCEEEEEEEcCCCCcccccHHhhhee-----------eEEecCCCeeeeeceeeeeEEEEeceeEE-eeEEEc
Q 045784            2 IRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKL-----------NVMVASGERFSSLGTCLQTLIKLQGVPLI-VDFYLL   69 (90)
Q Consensus         2 ~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l-----------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~-~~~~v~   69 (90)
                      |...++|||+++.+|+|||||.-.++++-|+|+           .+.+|||+.....-..  -.+.|++.+.. .+..|.
T Consensus       106 F~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG~~~AA~V~L--d~v~IG~I~~~nV~A~V~  183 (215)
T COG3577         106 FEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANGRARAAPVTL--DRVQIGGIRVKNVDAMVA  183 (215)
T ss_pred             EEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCCccccceEEe--eeEEEccEEEcCchhhee
Confidence            568899999999999999999999999999999           6788999765443333  25788888876 688999


Q ss_pred             CCCC-ceEEeCcchhhhcCC
Q 045784           70 PFEG-YDVVLDTQWLSTFGL   88 (90)
Q Consensus        70 ~~~~-~dvILG~~wL~~~~p   88 (90)
                      +.+. -...|||+||++.+-
T Consensus       184 ~~g~L~~sLLGMSfL~rL~~  203 (215)
T COG3577         184 EDGALDESLLGMSFLNRLSG  203 (215)
T ss_pred             cCCccchhhhhHHHHhhccc
Confidence            7664 669999999998763


No 19 
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=99.29  E-value=4e-11  Score=71.54  Aligned_cols=74  Identities=15%  Similarity=0.178  Sum_probs=63.7

Q ss_pred             EEECC-EEEEEEEcCCCCcccccHHhhhee-------------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCC
Q 045784            6 GRLVH-KVVTVLIDSRRTHNFVRQELAVKL-------------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPF   71 (90)
Q Consensus         6 ~~i~~-~~~~aLiDsGA~~~~i~~~~a~~l-------------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~   71 (90)
                      -.+|+ +++++++||||+.|.|+.+..+++             .++.+||+.+...|.+ .+++.+++..+..+|+|++.
T Consensus         3 ~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~~~G~~-~~~v~~~~~~~~~~f~Vvd~   81 (93)
T cd05481           3 MKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIPVEGGV-KLKCRYRNPKYNLTFQVVKE   81 (93)
T ss_pred             eEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEeeeeEEE-EEEEEECCcEEEEEEEEECC
Confidence            46788 999999999999999999999875             5888999999999884 78999999999999999997


Q ss_pred             CCceEEeCcc
Q 045784           72 EGYDVVLDTQ   81 (90)
Q Consensus        72 ~~~dvILG~~   81 (90)
                      .+. -|||.+
T Consensus        82 ~~~-~lLG~~   90 (93)
T cd05481          82 EGP-PLLGAK   90 (93)
T ss_pred             CCC-ceEccc
Confidence            744 445653


No 20 
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=98.74  E-value=1.7e-07  Score=55.42  Aligned_cols=78  Identities=13%  Similarity=0.062  Sum_probs=57.6

Q ss_pred             EEEECCEEEEEEEcCCCCcccccHHhhhee--------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCceE
Q 045784            5 IGRLVHKVVTVLIDSRRTHNFVRQELAVKL--------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGYDV   76 (90)
Q Consensus         5 ~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l--------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~dv   76 (90)
                      ..+++|++..+|+||||..++|++....+.        .+.-..| ..... ....+.+++.++.....+.+.+.....-
T Consensus         2 ~~~i~g~~~~~llDTGAd~Tvi~~~~~p~~w~~~~~~~~i~GIGG-~~~~~-~~~~v~i~i~~~~~~g~vlv~~~~~P~n   79 (87)
T cd05482           2 TLYINGKLFEGLLDTGADVSIIAENDWPKNWPIQPAPSNLTGIGG-AITPS-QSSVLLLEIDGEGHLGTILVYVLSLPVN   79 (87)
T ss_pred             EEEECCEEEEEEEccCCCCeEEcccccCCCCccCCCCeEEEeccc-eEEEE-EEeeEEEEEcCCeEEEEEEEccCCCccc
Confidence            467899999999999999999998766543        2333333 22222 2336789999998889999998745567


Q ss_pred             EeCcchhh
Q 045784           77 VLDTQWLS   84 (90)
Q Consensus        77 ILG~~wL~   84 (90)
                      |+|.|.|+
T Consensus        80 llGRd~L~   87 (87)
T cd05482          80 LWGRDILS   87 (87)
T ss_pred             EEccccCC
Confidence            89999874


No 21 
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.38  E-value=1.7e-06  Score=53.99  Aligned_cols=72  Identities=21%  Similarity=0.206  Sum_probs=55.9

Q ss_pred             EEEEEcCCCC-cccccHHhhhee--------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCceEEeCcchh
Q 045784           13 VTVLIDSRRT-HNFVRQELAVKL--------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGYDVVLDTQWL   83 (90)
Q Consensus        13 ~~aLiDsGA~-~~~i~~~~a~~l--------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~dvILG~~wL   83 (90)
                      ..+|||||++ --.++++.|+++        ++..++|+...+.  .....++++|.+...-..+.+....+ ++|++||
T Consensus        27 ~~~LiDTGFtg~lvlp~~vaek~~~~~~~~~~~~~a~~~~v~t~--V~~~~iki~g~e~~~~Vl~s~~~~~~-liG~~~l  103 (125)
T COG5550          27 YDELIDTGFTGYLVLPPQVAEKLGLPLFSTIRIVLADGGVVKTS--VALATIKIDGVEKVAFVLASDNLPEP-LIGVNLL  103 (125)
T ss_pred             eeeEEecCCceeEEeCHHHHHhcCCCccCChhhhhhcCCEEEEE--EEEEEEEECCEEEEEEEEccCCCccc-chhhhhh
Confidence            3359999999 788999999999        4556776655442  44567889998877777777777777 9999999


Q ss_pred             hhcC
Q 045784           84 STFG   87 (90)
Q Consensus        84 ~~~~   87 (90)
                      +..+
T Consensus       104 k~l~  107 (125)
T COG5550         104 KLLG  107 (125)
T ss_pred             hhcc
Confidence            9765


No 22 
>PF05585 DUF1758:  Putative peptidase (DUF1758);  InterPro: IPR008737  This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases. 
Probab=98.29  E-value=3e-06  Score=54.74  Aligned_cols=25  Identities=32%  Similarity=0.465  Sum_probs=23.2

Q ss_pred             CEEEEEEEcCCCCcccccHHhhhee
Q 045784           10 HKVVTVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        10 ~~~~~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      -+++.+|+||||+.|||++++|+++
T Consensus        10 ~~~~~~LlDsGSq~SfIt~~la~~L   34 (164)
T PF05585_consen   10 QVEARALLDSGSQRSFITESLANKL   34 (164)
T ss_pred             EEEEEEEEecCCchhHHhHHHHHHh
Confidence            3678899999999999999999999


No 23 
>PF05618 Zn_protease:  Putative ATP-dependant zinc protease;  InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=97.56  E-value=0.00043  Score=44.06  Aligned_cols=77  Identities=16%  Similarity=0.141  Sum_probs=48.3

Q ss_pred             CEEEEEEEcCCCCcccccHHhhhee------eEEe-----c--CCCeeeee------------------ceeeeeEEEEe
Q 045784           10 HKVVTVLIDSRRTHNFVRQELAVKL------NVMV-----A--SGERFSSL------------------GTCLQTLIKLQ   58 (90)
Q Consensus        10 ~~~~~aLiDsGA~~~~i~~~~a~~l------~v~~-----a--~G~~~~~~------------------~~~~~~~i~i~   58 (90)
                      |..+.|-+||||..|-|+..=.+.+      .|+.     .  .+....+.                  ..+-+..+.++
T Consensus        14 ~~~~~aKiDTGA~tSSLhA~~I~~ferdg~~~VrF~~~~~~~~~~~~~~~e~p~~~~~~Ik~s~g~~e~R~VV~~~~~lg   93 (138)
T PF05618_consen   14 GLTIKAKIDTGAKTSSLHATDIEEFERDGEKWVRFTVHDPDNEEKEGVTFEAPVVRRRKIKSSNGESERRPVVETTLCLG   93 (138)
T ss_dssp             TEEEEEEE-TT-SSEEEE-EEEEEEEETTEEEEEE----EEEETTEEEEEEEEEECEEE---------CCEEEEEEEEET
T ss_pred             CCEEEEEEcCCCcccceeecceEEeeeCCceEEEEecccccCCCcceEEEEEEeEeEEEEEcCCCceeEeeEEEEEEEEC
Confidence            5568999999999999988777765      2222     1  11111111                  12235678899


Q ss_pred             ceeEEeeEEEcCC--CCceEEeC-cchhhhc
Q 045784           59 GVPLIVDFYLLPF--EGYDVVLD-TQWLSTF   86 (90)
Q Consensus        59 ~~~~~~~~~v~~~--~~~dvILG-~~wL~~~   86 (90)
                      +..+..+|.+.+=  ..|+++|| ..||+..
T Consensus        94 ~~~~~~e~tL~dR~~m~yp~LlGrR~~l~~~  124 (138)
T PF05618_consen   94 GKTWKIEFTLTDRSNMKYPMLLGRRNFLRGR  124 (138)
T ss_dssp             TEEEEEEEEEE-S--SS-SEEE-HHHHHHTT
T ss_pred             CEEEEEEEEEcCCCcCcCCEEEEehHHhcCC
Confidence            9999999999985  45999999 9998753


No 24 
>PF12382 Peptidase_A2E:  Retrotransposon peptidase;  InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=97.13  E-value=0.0017  Score=39.53  Aligned_cols=63  Identities=16%  Similarity=0.195  Sum_probs=42.2

Q ss_pred             EEEEEEcCCCCcccccHHhhheeeEEe--------cCC-CeeeeeceeeeeEEEEeceeEEeeEEEcCCCCc
Q 045784           12 VVTVLIDSRRTHNFVRQELAVKLNVMV--------ASG-ERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGY   74 (90)
Q Consensus        12 ~~~aLiDsGA~~~~i~~~~a~~l~v~~--------a~G-~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~   74 (90)
                      .+..|||+||..|.|.++.++.+.+..        .+| -+-.+..+...+.+.++|.....+|+|..--..
T Consensus        47 sipclidtgaq~niiteetvrahklptrpw~~sviyggvyp~kinrkt~kl~i~lngisikteflvvkkfsh  118 (137)
T PF12382_consen   47 SIPCLIDTGAQVNIITEETVRAHKLPTRPWSQSVIYGGVYPNKINRKTIKLNINLNGISIKTEFLVVKKFSH  118 (137)
T ss_pred             cceeEEccCceeeeeehhhhhhccCCCCcchhheEeccccccccccceEEEEEEecceEEEEEEEEEEeccC
Confidence            356899999999999999998872221        111 111222233456678889999999998864433


No 25 
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=97.12  E-value=0.0022  Score=41.60  Aligned_cols=76  Identities=22%  Similarity=0.276  Sum_probs=54.9

Q ss_pred             EEEEEEEcCCCCcccccHHhhhee---------------------------------eEEecCCCeeeeeceeeeeEEEE
Q 045784           11 KVVTVLIDSRRTHNFVRQELAVKL---------------------------------NVMVASGERFSSLGTCLQTLIKL   57 (90)
Q Consensus        11 ~~~~aLiDsGA~~~~i~~~~a~~l---------------------------------~v~~a~G~~~~~~~~~~~~~i~i   57 (90)
                      ....|-+||||..|-++..=...+                                 .++-++|...+. ..+-...+.+
T Consensus        38 ~~~kAkiDTGA~TSsL~A~dI~~fkRdGe~WVRF~~~~~~~~~~~~~~~e~pvi~~ikvR~s~~~~~e~-RpVV~~~l~l  116 (162)
T COG4067          38 IQLKAKIDTGAVTSSLSASDIERFKRDGERWVRFRLADTDNLDQRSEECEAPVIRKIKVRSSSGSRAER-RPVVRLTLCL  116 (162)
T ss_pred             ceeeeeecccceeeeEEeecceeeeeCCceEEEEEeecccCccccceeeccceEEEEEEecCCCCcccc-ccEEEEEEee
Confidence            557899999999999998777666                                 112233332221 1233467889


Q ss_pred             eceeEEeeEEEcCCC--CceEEeCcchhhhcC
Q 045784           58 QGVPLIVDFYLLPFE--GYDVVLDTQWLSTFG   87 (90)
Q Consensus        58 ~~~~~~~~~~v~~~~--~~dvILG~~wL~~~~   87 (90)
                      |+...+.+|...+-.  .|+++||.-+|++..
T Consensus       117 G~~~~~~E~tLtDR~~m~Yp~LlGrk~l~~~~  148 (162)
T COG4067         117 GGRILPIEFTLTDRSNMRYPVLLGRKALRHFG  148 (162)
T ss_pred             CCeeeeEEEEeecccccccceEecHHHHhhCC
Confidence            999999999999854  599999999999843


No 26 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=96.15  E-value=0.043  Score=37.57  Aligned_cols=71  Identities=15%  Similarity=0.181  Sum_probs=43.3

Q ss_pred             EEEEEcCCCCcccccHHhhheeeEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCC-CCceEEeCcchhhhcC
Q 045784           13 VTVLIDSRRTHNFVRQELAVKLNVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPF-EGYDVVLDTQWLSTFG   87 (90)
Q Consensus        13 ~~aLiDsGA~~~~i~~~~a~~l~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~-~~~dvILG~~wL~~~~   87 (90)
                      ..+++|||++..++.++....+.+...+|........-  .-+.. +....+ +-.++. .....|||.+||+.+.
T Consensus       177 ~~ai~DTGTs~~~lp~~~~P~i~~~f~~~~~~~i~~~~--y~~~~-~~~~~C-~~~~~~~~~~~~ilG~~fl~~~~  248 (265)
T cd05476         177 GGTIIDSGTTLTYLPDPAYPDLTLHFDGGADLELPPEN--YFVDV-GEGVVC-LAILSSSSGGVSILGNIQQQNFL  248 (265)
T ss_pred             CcEEEeCCCcceEcCccccCCEEEEECCCCEEEeCccc--EEEEC-CCCCEE-EEEecCCCCCcEEEChhhcccEE
Confidence            34899999999999999887677777766544433211  11111 111122 122232 3456899999999864


No 27 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=95.90  E-value=0.0093  Score=41.11  Aligned_cols=24  Identities=8%  Similarity=0.109  Sum_probs=20.9

Q ss_pred             EEEEEEEcCCCCcccccHHhhhee
Q 045784           11 KVVTVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        11 ~~~~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      ....++||||++..++.++.++++
T Consensus       197 ~~~~~iiDSGTs~~~lP~~~~~~l  220 (278)
T cd06097         197 SGFSAIADTGTTLILLPDAIVEAY  220 (278)
T ss_pred             CCceEEeecCCchhcCCHHHHHHH
Confidence            456799999999999999888877


No 28 
>PF03539 Spuma_A9PTase:  Spumavirus aspartic protease (A9);  InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=95.38  E-value=0.27  Score=31.90  Aligned_cols=73  Identities=15%  Similarity=0.220  Sum_probs=46.3

Q ss_pred             ECCEEEEEEEcCCCCcccccHHhhhee------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCceEEe---
Q 045784            8 LVHKVVTVLIDSRRTHNFVRQELAVKL------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGYDVVL---   78 (90)
Q Consensus         8 i~~~~~~aLiDsGA~~~~i~~~~a~~l------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~dvIL---   78 (90)
                      +.|..+.+.-||||+...|.+.|...-      .+..-.|++..   .+.-+++.++|+...++..--++   |.+|   
T Consensus         1 ikg~~l~~~wDsga~ITCiP~~fl~~E~Pi~~~~i~Tihg~~~~---~vYYl~fKi~grkv~aEVi~s~~---dy~li~p   74 (163)
T PF03539_consen    1 IKGTKLKGHWDSGAQITCIPESFLEEEQPIGKTLIKTIHGEKEQ---DVYYLTFKINGRKVEAEVIASPY---DYILISP   74 (163)
T ss_dssp             ETTEEEEEEE-TT-SSEEEEGGGTTT---SEEEEEE-SS-EEEE---EEEEEEEEESS-EEEEEEEEESS---SSEEE-T
T ss_pred             CCCceeeEEecCCCeEEEccHHHhCccccccceEEEEecCceec---cEEEEEEEEcCeEEEEEEecCcc---ceEEEcc
Confidence            457788999999999999999998876      45555665443   33457899999988888776653   3222   


Q ss_pred             -Ccchhhhc
Q 045784           79 -DTQWLSTF   86 (90)
Q Consensus        79 -G~~wL~~~   86 (90)
                       -.||+++.
T Consensus        75 ~diPw~~~~   83 (163)
T PF03539_consen   75 SDIPWYKKK   83 (163)
T ss_dssp             TT-HHHHS-
T ss_pred             cccccccCC
Confidence             34677654


No 29 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=95.38  E-value=0.13  Score=36.05  Aligned_cols=75  Identities=12%  Similarity=0.116  Sum_probs=45.4

Q ss_pred             EEEEcCCCCcccccHHhhhee----eEEe-cCCC-eeeeec--eeeeeEEEEeceeEEee-------------EEEcCC-
Q 045784           14 TVLIDSRRTHNFVRQELAVKL----NVMV-ASGE-RFSSLG--TCLQTLIKLQGVPLIVD-------------FYLLPF-   71 (90)
Q Consensus        14 ~aLiDsGA~~~~i~~~~a~~l----~v~~-a~G~-~~~~~~--~~~~~~i~i~~~~~~~~-------------~~v~~~-   71 (90)
                      .+++|||++..++.++..+++    .... ..|. ...|..  ..+.+.+.+++.++...             +-+.+. 
T Consensus       203 ~~iiDSGtt~~~lP~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~p~l~~~f~g~~~~v~~~~y~~~~~~~C~~~i~~~~  282 (318)
T cd05477         203 QAIVDTGTSLLTAPQQVMSTLMQSIGAQQDQYGQYVVNCNNIQNLPTLTFTINGVSFPLPPSAYILQNNGYCTVGIEPTY  282 (318)
T ss_pred             eeeECCCCccEECCHHHHHHHHHHhCCccccCCCEEEeCCccccCCcEEEEECCEEEEECHHHeEecCCCeEEEEEEecc
Confidence            589999999999999999988    1111 1122 223321  12456677777655432             112111 


Q ss_pred             -----CCceEEeCcchhhhcCC
Q 045784           72 -----EGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        72 -----~~~dvILG~~wL~~~~p   88 (90)
                           +....|||.+||+.+..
T Consensus       283 ~~~~~~~~~~ilG~~fl~~~y~  304 (318)
T cd05477         283 LPSQNGQPLWILGDVFLRQYYS  304 (318)
T ss_pred             cCCCCCCceEEEcHHHhhheEE
Confidence                 12458999999998753


No 30 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=95.20  E-value=0.063  Score=37.06  Aligned_cols=76  Identities=11%  Similarity=0.119  Sum_probs=45.1

Q ss_pred             EEEEEEcCCCCcccccHHhhhee----eEE-ecCCCeeeeece--eeeeEEEEeceeEE-----------------eeEE
Q 045784           12 VVTVLIDSRRTHNFVRQELAVKL----NVM-VASGERFSSLGT--CLQTLIKLQGVPLI-----------------VDFY   67 (90)
Q Consensus        12 ~~~aLiDsGA~~~~i~~~~a~~l----~v~-~a~G~~~~~~~~--~~~~~i~i~~~~~~-----------------~~~~   67 (90)
                      +..|++|||++...+..+..+++    ... ..+.-.+.|...  .+.+.+.+++..+.                 +-+.
T Consensus       199 ~~~~~~Dtgt~~i~lp~~~~~~i~~~l~~~~~~~~~~~~c~~~~~~p~l~f~~~~~~~~i~~~~~~~~~~~~~~~~C~~~  278 (317)
T PF00026_consen  199 GQQAILDTGTSYIYLPRSIFDAIIKALGGSYSDGVYSVPCNSTDSLPDLTFTFGGVTFTIPPSDYIFKIEDGNGGYCYLG  278 (317)
T ss_dssp             EEEEEEETTBSSEEEEHHHHHHHHHHHTTEEECSEEEEETTGGGGSEEEEEEETTEEEEEEHHHHEEEESSTTSSEEEES
T ss_pred             ceeeecccccccccccchhhHHHHhhhcccccceeEEEecccccccceEEEeeCCEEEEecchHhcccccccccceeEee
Confidence            46899999999999999999988    111 111112232211  23444455443321                 2222


Q ss_pred             EcC----CCCceEEeCcchhhhcC
Q 045784           68 LLP----FEGYDVVLDTQWLSTFG   87 (90)
Q Consensus        68 v~~----~~~~dvILG~~wL~~~~   87 (90)
                      +.+    ......|||++||+++-
T Consensus       279 i~~~~~~~~~~~~iLG~~fl~~~y  302 (317)
T PF00026_consen  279 IQPMDSSDDSDDWILGSPFLRNYY  302 (317)
T ss_dssp             EEEESSTTSSSEEEEEHHHHTTEE
T ss_pred             eecccccccCCceEecHHHhhceE
Confidence            222    34578999999999864


No 31 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=95.17  E-value=0.16  Score=35.72  Aligned_cols=83  Identities=11%  Similarity=0.121  Sum_probs=51.1

Q ss_pred             EEECCEEE------EEEEcCCCCcccccHHhhhee----eEEe-cCCC-eeeeec--eeeeeEEEEeceeEEee------
Q 045784            6 GRLVHKVV------TVLIDSRRTHNFVRQELAVKL----NVMV-ASGE-RFSSLG--TCLQTLIKLQGVPLIVD------   65 (90)
Q Consensus         6 ~~i~~~~~------~aLiDsGA~~~~i~~~~a~~l----~v~~-a~G~-~~~~~~--~~~~~~i~i~~~~~~~~------   65 (90)
                      ..+|++.+      .+++|||++..++.++..+++    .... .+|. .+.|..  ..+.+.+.+++..+..+      
T Consensus       194 v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~P~~~f~f~g~~~~i~~~~y~~  273 (317)
T cd05478         194 VTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSDIGASQNQNGEMVVNCSSISSMPDVVFTINGVQYPLPPSAYIL  273 (317)
T ss_pred             EEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHHhCCccccCCcEEeCCcCcccCCcEEEEECCEEEEECHHHhee
Confidence            46777655      589999999999999998887    1111 1222 233332  23456677777655422      


Q ss_pred             -------EEEcCCC-CceEEeCcchhhhcCC
Q 045784           66 -------FYLLPFE-GYDVVLDTQWLSTFGL   88 (90)
Q Consensus        66 -------~~v~~~~-~~dvILG~~wL~~~~p   88 (90)
                             ..+.+.. ....|||-.||+.+..
T Consensus       274 ~~~~~C~~~~~~~~~~~~~IlG~~fl~~~y~  304 (317)
T cd05478         274 QDQGSCTSGFQSMGLGELWILGDVFIRQYYS  304 (317)
T ss_pred             cCCCEEeEEEEeCCCCCeEEechHHhcceEE
Confidence                   1222322 2458999999998653


No 32 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=95.09  E-value=0.044  Score=38.74  Aligned_cols=72  Identities=14%  Similarity=0.080  Sum_probs=40.9

Q ss_pred             EEEEEEEcCCCCcccccHHhhhee-------eEEecCCCeeeeeceeeeeEEEEeceeEEeeEEEcCCCCceEEeCcchh
Q 045784           11 KVVTVLIDSRRTHNFVRQELAVKL-------NVMVASGERFSSLGTCLQTLIKLQGVPLIVDFYLLPFEGYDVVLDTQWL   83 (90)
Q Consensus        11 ~~~~aLiDsGA~~~~i~~~~a~~l-------~v~~a~G~~~~~~~~~~~~~i~i~~~~~~~~~~v~~~~~~dvILG~~wL   83 (90)
                      ....+++|||.+..++.++..+++       .+...+|..+.....  +.-+...+..  + +.......--.|||-+||
T Consensus       230 ~~~~aivDSGTs~~~lp~~~~~~l~~~~P~i~~~f~~g~~~~i~p~--~y~~~~~~~~--c-~~~~~~~~~~~ILG~~fl  304 (326)
T cd06096         230 KGLGMLVDSGSTLSHFPEDLYNKINNFFPTITIIFENNLKIDWKPS--SYLYKKESFW--C-KGGEKSVSNKPILGASFF  304 (326)
T ss_pred             cCCCEEEeCCCCcccCCHHHHHHHHhhcCcEEEEEcCCcEEEECHH--HhccccCCce--E-EEEEecCCCceEEChHHh
Confidence            345689999999999999999987       334444444432211  1111111111  1 111122223579999999


Q ss_pred             hhcC
Q 045784           84 STFG   87 (90)
Q Consensus        84 ~~~~   87 (90)
                      +.+-
T Consensus       305 r~~y  308 (326)
T cd06096         305 KNKQ  308 (326)
T ss_pred             cCcE
Confidence            9864


No 33 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=94.91  E-value=0.12  Score=35.66  Aligned_cols=77  Identities=16%  Similarity=0.164  Sum_probs=45.5

Q ss_pred             EEEEEEEcCCCCcccccHHhhhee----eEEecC--C-Ceeeeecee-eeeEEEEeceeEEee-----------------
Q 045784           11 KVVTVLIDSRRTHNFVRQELAVKL----NVMVAS--G-ERFSSLGTC-LQTLIKLQGVPLIVD-----------------   65 (90)
Q Consensus        11 ~~~~aLiDsGA~~~~i~~~~a~~l----~v~~a~--G-~~~~~~~~~-~~~~i~i~~~~~~~~-----------------   65 (90)
                      ....+++|||++..++.++..+++    ......  + ....|.... +.+.+.+++..+..+                 
T Consensus       177 ~~~~~iiDSGt~~~~lP~~~~~~l~~~~~~~~~~~~~~~~~~C~~~~~p~i~f~f~g~~~~i~~~~~~~~~~~~~~~~~~  256 (295)
T cd05474         177 KNLPALLDSGTTLTYLPSDIVDAIAKQLGATYDSDEGLYVVDCDAKDDGSLTFNFGGATISVPLSDLVLPASTDDGGDGA  256 (295)
T ss_pred             CCccEEECCCCccEeCCHHHHHHHHHHhCCEEcCCCcEEEEeCCCCCCCEEEEEECCeEEEEEHHHhEeccccCCCCCCC
Confidence            446899999999999999999988    111111  1 112221111 356666666444321                 


Q ss_pred             --EEEcCCCCceEEeCcchhhhcC
Q 045784           66 --FYLLPFEGYDVVLDTQWLSTFG   87 (90)
Q Consensus        66 --~~v~~~~~~dvILG~~wL~~~~   87 (90)
                        +.+.+......|||..||+.+.
T Consensus       257 C~~~i~~~~~~~~iLG~~fl~~~y  280 (295)
T cd05474         257 CYLGIQPSTSDYNILGDTFLRSAY  280 (295)
T ss_pred             eEEEEEeCCCCcEEeChHHhhcEE
Confidence              2222333245899999999874


No 34 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=94.70  E-value=0.6  Score=32.10  Aligned_cols=73  Identities=18%  Similarity=0.181  Sum_probs=49.5

Q ss_pred             EEEEEECC--EEEEEEEcCCCCcccccHHhhheeeEEecCCCeeeeeceeeeeEEEEeceeEE-eeEEEcC-CCCceEEe
Q 045784            3 RVIGRLVH--KVVTVLIDSRRTHNFVRQELAVKLNVMVASGERFSSLGTCLQTLIKLQGVPLI-VDFYLLP-FEGYDVVL   78 (90)
Q Consensus         3 ~v~~~i~~--~~~~aLiDsGA~~~~i~~~~a~~l~v~~a~G~~~~~~~~~~~~~i~i~~~~~~-~~~~v~~-~~~~dvIL   78 (90)
                      .++..|..  ++..+++||||+..++.     ...+..++|...  .|....=.+.+++.... ..|-+.. ....|-||
T Consensus         4 ~~~i~iGtp~q~~~v~~DTgS~~~wv~-----~~~~~Y~~g~~~--~G~~~~D~v~~g~~~~~~~~fg~~~~~~~~~Gil   76 (295)
T cd05474           4 SAELSVGTPPQKVTVLLDTGSSDLWVP-----DFSISYGDGTSA--SGTWGTDTVSIGGATVKNLQFAVANSTSSDVGVL   76 (295)
T ss_pred             EEEEEECCCCcEEEEEEeCCCCcceee-----eeEEEeccCCcE--EEEEEEEEEEECCeEecceEEEEEecCCCCccee
Confidence            46677776  99999999999999999     457777776433  23333335667766543 3454443 34588888


Q ss_pred             Ccch
Q 045784           79 DTQW   82 (90)
Q Consensus        79 G~~w   82 (90)
                      |..+
T Consensus        77 GLg~   80 (295)
T cd05474          77 GIGL   80 (295)
T ss_pred             eECC
Confidence            8765


No 35 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=94.63  E-value=0.21  Score=35.08  Aligned_cols=75  Identities=12%  Similarity=0.172  Sum_probs=45.1

Q ss_pred             EEEEcCCCCcccccHHhhhee----eEEecCCC-eeeeec--eeeeeEEEEeceeEEee---EEE--------------c
Q 045784           14 TVLIDSRRTHNFVRQELAVKL----NVMVASGE-RFSSLG--TCLQTLIKLQGVPLIVD---FYL--------------L   69 (90)
Q Consensus        14 ~aLiDsGA~~~~i~~~~a~~l----~v~~a~G~-~~~~~~--~~~~~~i~i~~~~~~~~---~~v--------------~   69 (90)
                      .+++|||++..++.++..+++    .....+|. ...|..  ..+.+.+.+++..+...   +.+              .
T Consensus       200 ~aiiDTGTs~~~lP~~~~~~l~~~~~~~~~~~~~~~~C~~~~~~p~i~f~f~g~~~~l~~~~y~~~~~~~~~~~C~~~~~  279 (316)
T cd05486         200 QAIVDTGTSLITGPSGDIKQLQNYIGATATDGEYGVDCSTLSLMPSVTFTINGIPYSLSPQAYTLEDQSDGGGYCSSGFQ  279 (316)
T ss_pred             EEEECCCcchhhcCHHHHHHHHHHhCCcccCCcEEEeccccccCCCEEEEECCEEEEeCHHHeEEecccCCCCEEeeEEE
Confidence            699999999999999988777    22222332 223332  23456777777654421   111              1


Q ss_pred             CC-----CCceEEeCcchhhhcCC
Q 045784           70 PF-----EGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        70 ~~-----~~~dvILG~~wL~~~~p   88 (90)
                      .+     .+...|||-.||+++-.
T Consensus       280 ~~~~~~~~~~~~ILGd~flr~~y~  303 (316)
T cd05486         280 GLDIPPPAGPLWILGDVFIRQYYS  303 (316)
T ss_pred             ECCCCCCCCCeEEEchHHhcceEE
Confidence            11     12247999999998753


No 36 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=94.63  E-value=0.32  Score=33.51  Aligned_cols=78  Identities=17%  Similarity=0.214  Sum_probs=47.8

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccHHh-------hhee-----------------eEEecCCCeeeeeceeeeeEEE
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQEL-------AVKL-----------------NVMVASGERFSSLGTCLQTLIK   56 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~~~-------a~~l-----------------~v~~a~G~~~~~~~~~~~~~i~   56 (90)
                      .++..|.  .++..+++|||++..++...-       ..+.                 .+..++|.   ..|....=.+.
T Consensus         3 ~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~~~~~~~~y~~g~---~~G~~~~D~v~   79 (317)
T PF00026_consen    3 YINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQGKPFSISYGDGS---VSGNLVSDTVS   79 (317)
T ss_dssp             EEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEEEEEEEEEETTEE---EEEEEEEEEEE
T ss_pred             EEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccceeeeeeeccCcc---cccccccceEe
Confidence            5677787  899999999999998887421       1111                 23334443   33444444677


Q ss_pred             EeceeEE-eeEEEcCC--------CCceEEeCcchh
Q 045784           57 LQGVPLI-VDFYLLPF--------EGYDVVLDTQWL   83 (90)
Q Consensus        57 i~~~~~~-~~~~v~~~--------~~~dvILG~~wL   83 (90)
                      +++.... ..|.....        ..+|-|||+.+-
T Consensus        80 ig~~~~~~~~f~~~~~~~~~~~~~~~~~GilGLg~~  115 (317)
T PF00026_consen   80 IGGLTIPNQTFGLADSYSGDPFSPIPFDGILGLGFP  115 (317)
T ss_dssp             ETTEEEEEEEEEEEEEEESHHHHHSSSSEEEE-SSG
T ss_pred             eeeccccccceeccccccccccccccccccccccCC
Confidence            7877655 45544433        347888888753


No 37 
>PTZ00147 plasmepsin-1; Provisional
Probab=94.52  E-value=0.34  Score=36.34  Aligned_cols=77  Identities=13%  Similarity=0.134  Sum_probs=45.3

Q ss_pred             EEEEEEcCCCCcccccHHhhhee--eE---Eec-CCC-eeeee-ceeeeeEEEEeceeEEee---------------E--
Q 045784           12 VVTVLIDSRRTHNFVRQELAVKL--NV---MVA-SGE-RFSSL-GTCLQTLIKLQGVPLIVD---------------F--   66 (90)
Q Consensus        12 ~~~aLiDsGA~~~~i~~~~a~~l--~v---~~a-~G~-~~~~~-~~~~~~~i~i~~~~~~~~---------------~--   66 (90)
                      ...|++|||.+..++.++.++++  .+   ... .+. ...|. ...+.+.+.+++..+..+               +  
T Consensus       332 ~~~aIiDSGTsli~lP~~~~~ai~~~l~~~~~~~~~~y~~~C~~~~lP~~~f~f~g~~~~L~p~~yi~~~~~~~~~~C~~  411 (453)
T PTZ00147        332 KANVIVDSGTSVITVPTEFLNKFVESLDVFKVPFLPLYVTTCNNTKLPTLEFRSPNKVYTLEPEYYLQPIEDIGSALCML  411 (453)
T ss_pred             ceeEEECCCCchhcCCHHHHHHHHHHhCCeecCCCCeEEEeCCCCCCCeEEEEECCEEEEECHHHheeccccCCCcEEEE
Confidence            46799999999999999998877  11   111 111 12332 223456666666544211               1  


Q ss_pred             EEcC--CCCceEEeCcchhhhcCC
Q 045784           67 YLLP--FEGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        67 ~v~~--~~~~dvILG~~wL~~~~p   88 (90)
                      -+.+  ......|||..||+++-.
T Consensus       412 ~i~~~~~~~~~~ILGd~FLr~~Yt  435 (453)
T PTZ00147        412 NIIPIDLEKNTFILGDPFMRKYFT  435 (453)
T ss_pred             EEEECCCCCCCEEECHHHhccEEE
Confidence            1222  222357999999998753


No 38 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=94.45  E-value=0.49  Score=35.46  Aligned_cols=76  Identities=17%  Similarity=0.297  Sum_probs=45.6

Q ss_pred             EEEEEC--CEEEEEEEcCCCCcccccHHhhhe--------e---------------eEEecCCCeeeeeceeeeeEEEEe
Q 045784            4 VIGRLV--HKVVTVLIDSRRTHNFVRQELAVK--------L---------------NVMVASGERFSSLGTCLQTLIKLQ   58 (90)
Q Consensus         4 v~~~i~--~~~~~aLiDsGA~~~~i~~~~a~~--------l---------------~v~~a~G~~~~~~~~~~~~~i~i~   58 (90)
                      .+..|.  .++..+++||||+..++...-...        .               .+..++|.   ..|....=.+.++
T Consensus       141 ~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~~~~yd~s~SsT~~~~~~~~~i~YG~Gs---v~G~~~~Dtv~iG  217 (450)
T PTZ00013        141 GEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSIKNLYDSSKSKSYEKDGTKVDITYGSGT---VKGFFSKDLVTLG  217 (450)
T ss_pred             EEEEECCCCeEEEEEEeCCCCceEEecccCCccccccCCCccCccCcccccCCcEEEEEECCce---EEEEEEEEEEEEC
Confidence            345555  799999999999999986432211        0               23344443   2344333356777


Q ss_pred             ceeEEeeEEEc-C---------CCCceEEeCcch
Q 045784           59 GVPLIVDFYLL-P---------FEGYDVVLDTQW   82 (90)
Q Consensus        59 ~~~~~~~~~v~-~---------~~~~dvILG~~w   82 (90)
                      +......|... +         ...+|-|||+.|
T Consensus       218 ~~~~~~~f~~~~~~~~~~~~~~~~~~dGIlGLg~  251 (450)
T PTZ00013        218 HLSMPYKFIEVTDTDDLEPIYSSSEFDGILGLGW  251 (450)
T ss_pred             CEEEccEEEEEEeccccccceecccccceecccC
Confidence            76655444332 1         124789999976


No 39 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=94.44  E-value=0.36  Score=34.17  Aligned_cols=76  Identities=12%  Similarity=0.117  Sum_probs=45.0

Q ss_pred             EEEEEcCCCCcccccHHhhhee----e-EEecCCC-eeeeec--eeeeeEEEEeceeEEee---EE--------------
Q 045784           13 VTVLIDSRRTHNFVRQELAVKL----N-VMVASGE-RFSSLG--TCLQTLIKLQGVPLIVD---FY--------------   67 (90)
Q Consensus        13 ~~aLiDsGA~~~~i~~~~a~~l----~-v~~a~G~-~~~~~~--~~~~~~i~i~~~~~~~~---~~--------------   67 (90)
                      ..+++|||++..++.++.++.+    . ....++. .+.|..  ..+.+.+.++++.+..+   +.              
T Consensus       211 ~~~iiDSGtt~~~lP~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~p~i~f~fgg~~~~i~~~~yi~~~~~~~~~~C~~~  290 (329)
T cd05485         211 CQAIADTGTSLIAGPVDEIEKLNNAIGAKPIIGGEYMVNCSAIPSLPDITFVLGGKSFSLTGKDYVLKVTQMGQTICLSG  290 (329)
T ss_pred             cEEEEccCCcceeCCHHHHHHHHHHhCCccccCCcEEEeccccccCCcEEEEECCEEeEEChHHeEEEecCCCCCEEeee
Confidence            3699999999999999988877    1 1112221 123321  22456677777655422   11              


Q ss_pred             EcC-----CCCceEEeCcchhhhcCC
Q 045784           68 LLP-----FEGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        68 v~~-----~~~~dvILG~~wL~~~~p   88 (90)
                      +.+     ..+...|||..||+.+-.
T Consensus       291 ~~~~~~~~~~~~~~IlG~~fl~~~y~  316 (329)
T cd05485         291 FMGIDIPPPAGPLWILGDVFIGKYYT  316 (329)
T ss_pred             EEECcCCCCCCCeEEEchHHhccceE
Confidence            111     112358999999998753


No 40 
>PTZ00147 plasmepsin-1; Provisional
Probab=94.39  E-value=0.56  Score=35.19  Aligned_cols=76  Identities=14%  Similarity=0.236  Sum_probs=46.4

Q ss_pred             EEEEEC--CEEEEEEEcCCCCcccccHHhhh------e--e---------------eEEecCCCeeeeeceeeeeEEEEe
Q 045784            4 VIGRLV--HKVVTVLIDSRRTHNFVRQELAV------K--L---------------NVMVASGERFSSLGTCLQTLIKLQ   58 (90)
Q Consensus         4 v~~~i~--~~~~~aLiDsGA~~~~i~~~~a~------~--l---------------~v~~a~G~~~~~~~~~~~~~i~i~   58 (90)
                      .+..|.  .++..+++||||+..++...-..      .  .               .+...+|.   ..|....=.+.++
T Consensus       142 ~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~~~~yd~s~SsT~~~~~~~f~i~Yg~Gs---vsG~~~~DtVtiG  218 (453)
T PTZ00147        142 GEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCETKNLYDSSKSKTYEKDGTKVEMNYVSGT---VSGFFSKDLVTIG  218 (453)
T ss_pred             EEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccCCCccCCccCcceEECCCEEEEEeCCCC---EEEEEEEEEEEEC
Confidence            455665  78999999999999998643211      0  0               34445553   2344433356777


Q ss_pred             ceeEEeeEEEcC----------CCCceEEeCcch
Q 045784           59 GVPLIVDFYLLP----------FEGYDVVLDTQW   82 (90)
Q Consensus        59 ~~~~~~~~~v~~----------~~~~dvILG~~w   82 (90)
                      +...+..|....          ....|-|||+.|
T Consensus       219 ~~~v~~qF~~~~~~~~f~~~~~~~~~DGILGLG~  252 (453)
T PTZ00147        219 NLSVPYKFIEVTDTNGFEPFYTESDFDGIFGLGW  252 (453)
T ss_pred             CEEEEEEEEEEEeccCcccccccccccceecccC
Confidence            766554443321          124789999987


No 41 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=93.76  E-value=0.44  Score=33.53  Aligned_cols=22  Identities=9%  Similarity=0.087  Sum_probs=19.8

Q ss_pred             EEEEEcCCCCcccccHHhhhee
Q 045784           13 VTVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        13 ~~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      ..|++|||++..+..+++++++
T Consensus       211 ~~aivDTGTs~~~lP~~~~~~i  232 (317)
T cd06098         211 CAAIADSGTSLLAGPTTIVTQI  232 (317)
T ss_pred             cEEEEecCCcceeCCHHHHHhh
Confidence            4699999999999999998876


No 42 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=93.33  E-value=0.091  Score=35.57  Aligned_cols=25  Identities=8%  Similarity=0.069  Sum_probs=22.5

Q ss_pred             CEEEEEEEcCCCCcccccHHhhhee
Q 045784           10 HKVVTVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        10 ~~~~~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      .....++||||++..++.++..+++
T Consensus       200 ~~~~~~iiDsGt~~~~lp~~~~~~l  224 (283)
T cd05471         200 SGGGGAIVDSGTSLIYLPSSVYDAI  224 (283)
T ss_pred             CCCcEEEEecCCCCEeCCHHHHHHH
Confidence            3467899999999999999999988


No 43 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=93.32  E-value=0.36  Score=33.98  Aligned_cols=75  Identities=13%  Similarity=0.191  Sum_probs=44.9

Q ss_pred             EEEEEcCCCCcccccHHhhhee--eEE--e-cCCC-eeeeec--eeeeeEEEEeceeEEee---E----------EEcC-
Q 045784           13 VTVLIDSRRTHNFVRQELAVKL--NVM--V-ASGE-RFSSLG--TCLQTLIKLQGVPLIVD---F----------YLLP-   70 (90)
Q Consensus        13 ~~aLiDsGA~~~~i~~~~a~~l--~v~--~-a~G~-~~~~~~--~~~~~~i~i~~~~~~~~---~----------~v~~-   70 (90)
                      ..+++|||++..++.++.++.+  .+.  . .++. .+.|..  ..+.+.+.+++..+...   +          .+.+ 
T Consensus       206 ~~~ivDSGtt~~~lp~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~P~i~f~f~g~~~~i~~~~y~~~~~g~C~~~~~~~  285 (320)
T cd05488         206 TGAAIDTGTSLIALPSDLAEMLNAEIGAKKSWNGQYTVDCSKVDSLPDLTFNFDGYNFTLGPFDYTLEVSGSCISAFTGM  285 (320)
T ss_pred             CeEEEcCCcccccCCHHHHHHHHHHhCCccccCCcEEeeccccccCCCEEEEECCEEEEECHHHheecCCCeEEEEEEEC
Confidence            4689999999999999998887  111  1 1221 123321  13456677777655422   1          1111 


Q ss_pred             -C---CCceEEeCcchhhhcC
Q 045784           71 -F---EGYDVVLDTQWLSTFG   87 (90)
Q Consensus        71 -~---~~~dvILG~~wL~~~~   87 (90)
                       .   .+...|||..||+.+-
T Consensus       286 ~~~~~~~~~~ilG~~fl~~~y  306 (320)
T cd05488         286 DFPEPVGPLAIVGDAFLRKYY  306 (320)
T ss_pred             cCCCCCCCeEEEchHHhhheE
Confidence             1   1235899999998764


No 44 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=92.90  E-value=1.3  Score=30.49  Aligned_cols=27  Identities=7%  Similarity=0.080  Sum_probs=22.1

Q ss_pred             EEEEEECC--EEEEEEEcCCCCcccccHH
Q 045784            3 RVIGRLVH--KVVTVLIDSRRTHNFVRQE   29 (90)
Q Consensus         3 ~v~~~i~~--~~~~aLiDsGA~~~~i~~~   29 (90)
                      .++..|++  ++..+++||||+..++..+
T Consensus         2 ~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~   30 (278)
T cd06097           2 LTPVKIGTPPQTLNLDLDTGSSDLWVFSS   30 (278)
T ss_pred             eeeEEECCCCcEEEEEEeCCCCceeEeeC
Confidence            35667777  9999999999999988644


No 45 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=92.47  E-value=2.4  Score=29.91  Aligned_cols=26  Identities=12%  Similarity=0.054  Sum_probs=21.4

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~   28 (90)
                      .++..|.  .++..+++|||++..++..
T Consensus        10 ~~~i~iGtP~q~~~v~~DTGSs~~Wv~~   37 (326)
T cd05487          10 YGEIGIGTPPQTFKVVFDTGSSNLWVPS   37 (326)
T ss_pred             EEEEEECCCCcEEEEEEeCCccceEEcc
Confidence            4556666  7999999999999999963


No 46 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=92.03  E-value=0.67  Score=32.57  Aligned_cols=76  Identities=14%  Similarity=0.133  Sum_probs=44.7

Q ss_pred             EEEEEcCCCCcccccHHhhhee-----eEEecCCC-eeeeec--eeeeeEEEEeceeEEee---EE--------------
Q 045784           13 VTVLIDSRRTHNFVRQELAVKL-----NVMVASGE-RFSSLG--TCLQTLIKLQGVPLIVD---FY--------------   67 (90)
Q Consensus        13 ~~aLiDsGA~~~~i~~~~a~~l-----~v~~a~G~-~~~~~~--~~~~~~i~i~~~~~~~~---~~--------------   67 (90)
                      ..|+||||++..++.++.++++     .....+|. .+.|..  ..+.+.+.+++..+..+   ++              
T Consensus       207 ~~aiiDSGTt~~~~p~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~P~i~f~fgg~~~~l~~~~y~~~~~~~~~~~C~~~  286 (325)
T cd05490         207 CEAIVDTGTSLITGPVEEVRALQKAIGAVPLIQGEYMIDCEKIPTLPVISFSLGGKVYPLTGEDYILKVSQRGTTICLSG  286 (325)
T ss_pred             CEEEECCCCccccCCHHHHHHHHHHhCCccccCCCEEecccccccCCCEEEEECCEEEEEChHHeEEeccCCCCCEEeeE
Confidence            4799999999999999988877     11111221 233321  22446666776644321   11              


Q ss_pred             Ec--CC---CCceEEeCcchhhhcCC
Q 045784           68 LL--PF---EGYDVVLDTQWLSTFGL   88 (90)
Q Consensus        68 v~--~~---~~~dvILG~~wL~~~~p   88 (90)
                      +.  +.   .+...|||..||+++-.
T Consensus       287 ~~~~~~~~~~~~~~ilGd~flr~~y~  312 (325)
T cd05490         287 FMGLDIPPPAGPLWILGDVFIGRYYT  312 (325)
T ss_pred             EEECCCCCCCCceEEEChHhheeeEE
Confidence            11  11   12357999999998743


No 47 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=91.49  E-value=0.8  Score=34.36  Aligned_cols=76  Identities=16%  Similarity=0.118  Sum_probs=44.0

Q ss_pred             EEEEEEcCCCCcccccHHhhhee--eEEe----cCCC-eeeee-ceeeeeEEEEeceeEEe-----------------eE
Q 045784           12 VVTVLIDSRRTHNFVRQELAVKL--NVMV----ASGE-RFSSL-GTCLQTLIKLQGVPLIV-----------------DF   66 (90)
Q Consensus        12 ~~~aLiDsGA~~~~i~~~~a~~l--~v~~----a~G~-~~~~~-~~~~~~~i~i~~~~~~~-----------------~~   66 (90)
                      ...|++|||++..++.++.++++  .+..    ..+. ...|. ...+.+.+.+++..+..                 -+
T Consensus       331 ~~~aIlDSGTSli~lP~~~~~~i~~~l~~~~~~~~~~y~~~C~~~~lP~i~F~~~g~~~~L~p~~Yi~~~~~~~~~~C~~  410 (450)
T PTZ00013        331 KANVIVDSGTTTITAPSEFLNKFFANLNVIKVPFLPFYVTTCDNKEMPTLEFKSANNTYTLEPEYYMNPLLDVDDTLCMI  410 (450)
T ss_pred             ccceEECCCCccccCCHHHHHHHHHHhCCeecCCCCeEEeecCCCCCCeEEEEECCEEEEECHHHheehhccCCCCeeEE
Confidence            35699999999999999988877  2211    1111 12232 12344556666544321                 12


Q ss_pred             EEcCC--CCceEEeCcchhhhcC
Q 045784           67 YLLPF--EGYDVVLDTQWLSTFG   87 (90)
Q Consensus        67 ~v~~~--~~~dvILG~~wL~~~~   87 (90)
                      .+.+.  .+...|||-.||+++-
T Consensus       411 ~i~~~~~~~~~~ILGd~FLr~~Y  433 (450)
T PTZ00013        411 TMLPVDIDDNTFILGDPFMRKYF  433 (450)
T ss_pred             EEEECCCCCCCEEECHHHhccEE
Confidence            22222  2235899999999864


No 48 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=91.12  E-value=3  Score=29.18  Aligned_cols=26  Identities=15%  Similarity=0.046  Sum_probs=21.2

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~   28 (90)
                      ..+..|.  .+++.+++||||+..++..
T Consensus         5 ~~~i~iGtP~q~~~v~~DTGS~~~wv~~   32 (318)
T cd05477           5 YGEISIGTPPQNFLVLFDTGSSNLWVPS   32 (318)
T ss_pred             EEEEEECCCCcEEEEEEeCCCccEEEcc
Confidence            4556666  4899999999999999874


No 49 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=90.97  E-value=2  Score=29.38  Aligned_cols=75  Identities=12%  Similarity=0.081  Sum_probs=46.5

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccHHhhheeeEEecCCCeeeeeceeeeeEEEEece--eEE-eeEEEcCC------
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQELAVKLNVMVASGERFSSLGTCLQTLIKLQGV--PLI-VDFYLLPF------   71 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~~~a~~l~v~~a~G~~~~~~~~~~~~~i~i~~~--~~~-~~~~v~~~------   71 (90)
                      .++..|.  .++..+++||||+..++..   ....+...+|+..  .|....=.+.+++.  ... ..|-+...      
T Consensus         3 ~~~i~iGtP~q~~~v~~DTGSs~~wv~~---~~~~~~Y~dg~~~--~G~~~~D~v~~g~~~~~~~~~~Fg~~~~~~~~~~   77 (265)
T cd05476           3 LVTLSIGTPPQPFSLIVDTGSDLTWTQC---CSYEYSYGDGSST--SGVLATETFTFGDSSVSVPNVAFGCGTDNEGGSF   77 (265)
T ss_pred             EEEEecCCCCcceEEEecCCCCCEEEcC---CceEeEeCCCcee--eeeEEEEEEEecCCCCccCCEEEEecccccCCcc
Confidence            4556666  6899999999999999864   3345566655433  33333335666665  222 33444432      


Q ss_pred             CCceEEeCcch
Q 045784           72 EGYDVVLDTQW   82 (90)
Q Consensus        72 ~~~dvILG~~w   82 (90)
                      ...|-|||+.+
T Consensus        78 ~~~~GIlGLg~   88 (265)
T cd05476          78 GGADGILGLGR   88 (265)
T ss_pred             CCCCEEEECCC
Confidence            24889999875


No 50 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=90.84  E-value=3.1  Score=29.80  Aligned_cols=27  Identities=11%  Similarity=0.107  Sum_probs=21.5

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccHH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQE   29 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~~   29 (90)
                      .++..|.  .++..+++||||+..++..+
T Consensus         5 ~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~   33 (364)
T cd05473           5 YIEMLIGTPPQKLNILVDTGSSNFAVAAA   33 (364)
T ss_pred             EEEEEecCCCceEEEEEecCCcceEEEcC
Confidence            4556666  58999999999999988654


No 51 
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site 
Probab=90.13  E-value=0.31  Score=28.65  Aligned_cols=26  Identities=15%  Similarity=0.110  Sum_probs=21.0

Q ss_pred             EEEECC--EEEEEEEcCCCCcccccHHh
Q 045784            5 IGRLVH--KVVTVLIDSRRTHNFVRQEL   30 (90)
Q Consensus         5 ~~~i~~--~~~~aLiDsGA~~~~i~~~~   30 (90)
                      +..|..  +++.+++||||+..++..+-
T Consensus         2 ~i~vGtP~q~~~~~~DTGSs~~Wv~~~~   29 (109)
T cd05470           2 EIGIGTPPQTFNVLLDTGSSNLWVPSVD   29 (109)
T ss_pred             EEEeCCCCceEEEEEeCCCCCEEEeCCC
Confidence            456665  99999999999998887653


No 52 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=89.62  E-value=2.9  Score=28.99  Aligned_cols=78  Identities=17%  Similarity=0.074  Sum_probs=45.8

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccHHhhheeeEEecCCCeeeeeceeeeeEEEEece-eEE-eeEEEcCC-----CC
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQELAVKLNVMVASGERFSSLGTCLQTLIKLQGV-PLI-VDFYLLPF-----EG   73 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~~~a~~l~v~~a~G~~~~~~~~~~~~~i~i~~~-~~~-~~~~v~~~-----~~   73 (90)
                      .++..|.  .++..+++||||+..++..+-.....+...+|...  .|....=++.+++. ..+ ..|-+...     ..
T Consensus         3 ~~~i~iGtP~q~~~v~~DTGSs~~Wv~c~~c~~~~i~Yg~Gs~~--~G~~~~D~v~ig~~~~~~~~~Fg~~~~~~~~~~~   80 (299)
T cd05472           3 VVTVGLGTPARDQTVIVDTGSDLTWVQCQPCCLYQVSYGDGSYT--TGDLATDTLTLGSSDVVPGFAFGCGHDNEGLFGG   80 (299)
T ss_pred             EEEEecCCCCcceEEEecCCCCcccccCCCCCeeeeEeCCCceE--EEEEEEEEEEeCCCCccCCEEEECCccCCCccCC
Confidence            4556666  58999999999999998432112336677777643  23332224555654 322 23333321     14


Q ss_pred             ceEEeCcch
Q 045784           74 YDVVLDTQW   82 (90)
Q Consensus        74 ~dvILG~~w   82 (90)
                      .|-|||+.+
T Consensus        81 ~~GilGLg~   89 (299)
T cd05472          81 AAGLLGLGR   89 (299)
T ss_pred             CCEEEECCC
Confidence            788999875


No 53 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=89.36  E-value=5.7  Score=27.91  Aligned_cols=26  Identities=15%  Similarity=0.103  Sum_probs=21.1

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~   28 (90)
                      .++..|+  .++..+++|||++..++..
T Consensus        12 ~~~i~iGtp~q~~~v~~DTGSs~~wv~~   39 (320)
T cd05488          12 FTDITLGTPPQKFKVILDTGSSNLWVPS   39 (320)
T ss_pred             EEEEEECCCCcEEEEEEecCCcceEEEc
Confidence            4566666  5999999999999998864


No 54 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=89.36  E-value=3.7  Score=28.76  Aligned_cols=25  Identities=16%  Similarity=0.117  Sum_probs=20.0

Q ss_pred             EEEEEC--CEEEEEEEcCCCCcccccH
Q 045784            4 VIGRLV--HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus         4 v~~~i~--~~~~~aLiDsGA~~~~i~~   28 (90)
                      .+..|.  .++..+++||||+..++..
T Consensus         3 ~~i~iGtP~Q~~~v~~DTGSs~~Wv~s   29 (316)
T cd05486           3 GQISIGTPPQNFTVIFDTGSSNLWVPS   29 (316)
T ss_pred             EEEEECCCCcEEEEEEcCCCccEEEec
Confidence            455565  6889999999999998864


No 55 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=89.24  E-value=5  Score=28.15  Aligned_cols=25  Identities=16%  Similarity=0.085  Sum_probs=20.6

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCccccc
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVR   27 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~   27 (90)
                      .++..|.  .++..+++||||+..++.
T Consensus         8 ~~~i~iGtP~q~~~v~~DTGSs~~Wv~   34 (325)
T cd05490           8 YGEIGIGTPPQTFTVVFDTGSSNLWVP   34 (325)
T ss_pred             EEEEEECCCCcEEEEEEeCCCccEEEE
Confidence            4566776  489999999999999985


No 56 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=89.20  E-value=5.1  Score=28.13  Aligned_cols=26  Identities=15%  Similarity=0.057  Sum_probs=21.2

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~   28 (90)
                      .++..|.  .++..+++||||+..++..
T Consensus        12 ~~~i~iGtP~Q~~~v~~DTGSs~lWv~~   39 (317)
T cd06098          12 FGEIGIGTPPQKFTVIFDTGSSNLWVPS   39 (317)
T ss_pred             EEEEEECCCCeEEEEEECCCccceEEec
Confidence            4566676  6899999999999988865


No 57 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=88.73  E-value=5.9  Score=27.21  Aligned_cols=41  Identities=15%  Similarity=0.117  Sum_probs=27.6

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccHH-----hhheeeEEecCCCe
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQE-----LAVKLNVMVASGER   43 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~~-----~a~~l~v~~a~G~~   43 (90)
                      .++..|.  .++..+++||||+..++.-+     +.....+..++|+.
T Consensus         4 ~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c~c~~~i~Ygd~~~   51 (273)
T cd05475           4 YVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGCQCDYEIEYADGGS   51 (273)
T ss_pred             EEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCCcCccEeEeCCCCc
Confidence            4556666  78999999999999999531     11223666665543


No 58 
>PTZ00165 aspartyl protease; Provisional
Probab=88.09  E-value=2.8  Score=31.77  Aligned_cols=23  Identities=13%  Similarity=-0.008  Sum_probs=20.3

Q ss_pred             EEEEEEcCCCCcccccHHhhhee
Q 045784           12 VVTVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        12 ~~~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      ...|++|||++.....++.++++
T Consensus       327 ~~~aIiDTGTSli~lP~~~~~~i  349 (482)
T PTZ00165        327 KCKAAIDTGSSLITGPSSVINPL  349 (482)
T ss_pred             ceEEEEcCCCccEeCCHHHHHHH
Confidence            34699999999999999998877


No 59 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=88.04  E-value=5.9  Score=27.74  Aligned_cols=26  Identities=15%  Similarity=0.097  Sum_probs=21.0

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~   28 (90)
                      .++..|.  .++..+++||||+..++..
T Consensus        12 ~~~i~vGtp~q~~~v~~DTGS~~~wv~~   39 (317)
T cd05478          12 YGTISIGTPPQDFTVIFDTGSSNLWVPS   39 (317)
T ss_pred             EEEEEeCCCCcEEEEEEeCCCccEEEec
Confidence            3456666  5899999999999999864


No 60 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=79.27  E-value=8.3  Score=27.15  Aligned_cols=75  Identities=16%  Similarity=0.119  Sum_probs=43.4

Q ss_pred             EEEEEcCCCCcccccHHhhhee----eEEecCCC-eeeeec--eeeeeEEEEeceeEEee---------------E--EE
Q 045784           13 VTVLIDSRRTHNFVRQELAVKL----NVMVASGE-RFSSLG--TCLQTLIKLQGVPLIVD---------------F--YL   68 (90)
Q Consensus        13 ~~aLiDsGA~~~~i~~~~a~~l----~v~~a~G~-~~~~~~--~~~~~~i~i~~~~~~~~---------------~--~v   68 (90)
                      ..+++|||++..+..++..+++    ......+. ...|..  ..+.+.+.+++..+...               +  .+
T Consensus       208 ~~aiiDSGts~~~lP~~~~~~l~~~~~~~~~~~~y~~~C~~~~~~P~i~f~fgg~~~~v~~~~yi~~~~~~~~~~C~~~~  287 (326)
T cd05487         208 CTAVVDTGASFISGPTSSISKLMEALGAKERLGDYVVKCNEVPTLPDISFHLGGKEYTLSSSDYVLQDSDFSDKLCTVAF  287 (326)
T ss_pred             CEEEECCCccchhCcHHHHHHHHHHhCCcccCCCEEEeccccCCCCCEEEEECCEEEEeCHHHhEEeccCCCCCEEEEEE
Confidence            3589999999999999988877    11111121 233332  12445666666544321               1  11


Q ss_pred             cCC-----CCceEEeCcchhhhcC
Q 045784           69 LPF-----EGYDVVLDTQWLSTFG   87 (90)
Q Consensus        69 ~~~-----~~~dvILG~~wL~~~~   87 (90)
                      ...     .+...|||..||+.+-
T Consensus       288 ~~~~~~~~~~~~~ilG~~flr~~y  311 (326)
T cd05487         288 HAMDIPPPTGPLWVLGATFIRKFY  311 (326)
T ss_pred             EeCCCCCCCCCeEEEehHHhhccE
Confidence            111     1124799999999864


No 61 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=78.29  E-value=2.2  Score=29.54  Aligned_cols=21  Identities=24%  Similarity=0.178  Sum_probs=19.4

Q ss_pred             EEEEcCCCCcccccHHhhhee
Q 045784           14 TVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        14 ~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      .+++|||++..++.++..+++
T Consensus       173 ~~ivDSGTt~~~lp~~~~~~l  193 (299)
T cd05472         173 GVIIDSGTVITRLPPSAYAAL  193 (299)
T ss_pred             CeEEeCCCcceecCHHHHHHH
Confidence            589999999999999998887


No 62 
>KOG3048 consensus Molecular chaperone Prefoldin, subunit 5 [Posttranslational modification, protein turnover, chaperones]
Probab=74.18  E-value=2  Score=27.76  Aligned_cols=27  Identities=26%  Similarity=0.395  Sum_probs=22.5

Q ss_pred             CEEEEEEECCEEEEEEEcCCCCcccccHH
Q 045784            1 MIRVIGRLVHKVVTVLIDSRRTHNFVRQE   29 (90)
Q Consensus         1 ~~~v~~~i~~~~~~aLiDsGA~~~~i~~~   29 (90)
                      .|+|+|+++. +-++|||.| |-.||..+
T Consensus        72 SlYVPGkl~d-~~k~lVDIG-TGYyVEK~   98 (153)
T KOG3048|consen   72 SLYVPGKLSD-NSKFLVDIG-TGYYVEKD   98 (153)
T ss_pred             ceeccceecc-ccceeEecc-CceEEeec
Confidence            4789999988 889999999 77777544


No 63 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=72.27  E-value=4.3  Score=29.10  Aligned_cols=21  Identities=14%  Similarity=0.091  Sum_probs=19.4

Q ss_pred             EEEEcCCCCcccccHHhhhee
Q 045784           14 TVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        14 ~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      .+++|||++..++.++..+++
T Consensus       213 ~~ivDSGTs~~~lp~~~~~~l  233 (364)
T cd05473         213 KAIVDSGTTNLRLPVKVFNAA  233 (364)
T ss_pred             cEEEeCCCcceeCCHHHHHHH
Confidence            489999999999999998887


No 64 
>PF14543 TAXi_N:  Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=70.75  E-value=7.6  Score=24.82  Aligned_cols=24  Identities=13%  Similarity=0.198  Sum_probs=18.6

Q ss_pred             EEEEEECC--EEEEEEEcCCCCcccc
Q 045784            3 RVIGRLVH--KVVTVLIDSRRTHNFV   26 (90)
Q Consensus         3 ~v~~~i~~--~~~~aLiDsGA~~~~i   26 (90)
                      .++..|..  ++..+.+|||++..++
T Consensus         2 ~~~~~iGtP~~~~~lvvDtgs~l~W~   27 (164)
T PF14543_consen    2 YVSVSIGTPPQPFSLVVDTGSDLTWV   27 (164)
T ss_dssp             EEEEECTCTTEEEEEEEETT-SSEEE
T ss_pred             EEEEEeCCCCceEEEEEECCCCceEE
Confidence            45666665  7999999999999886


No 65 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=67.59  E-value=9.4  Score=26.98  Aligned_cols=27  Identities=11%  Similarity=0.135  Sum_probs=21.9

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccHH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQE   29 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~~   29 (90)
                      .++..|.  .++..+++||||+..++...
T Consensus        13 ~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~   41 (329)
T cd05485          13 YGVITIGTPPQSFKVVFDTGSSNLWVPSK   41 (329)
T ss_pred             EEEEEECCCCcEEEEEEcCCCccEEEecC
Confidence            4567777  58999999999999888753


No 66 
>PF13856 Gifsy-2:  ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A.
Probab=66.11  E-value=24  Score=20.55  Aligned_cols=21  Identities=10%  Similarity=0.009  Sum_probs=15.5

Q ss_pred             EEEEECCEEEEEEEcCCCCcc
Q 045784            4 VIGRLVHKVVTVLIDSRRTHN   24 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~   24 (90)
                      -...|||+++.+.+|......
T Consensus        22 ~~~~InG~~~~~v~d~~~~~~   42 (95)
T PF13856_consen   22 EEHTINGKPYRAVFDEPELEE   42 (95)
T ss_dssp             EEEEETTEEEEEEEGGGT---
T ss_pred             CeEEECCEEEEEEECCchhhh
Confidence            467899999999999874444


No 67 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=66.08  E-value=11  Score=26.50  Aligned_cols=27  Identities=4%  Similarity=-0.102  Sum_probs=21.3

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCcccccHH
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVRQE   29 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~~~   29 (90)
                      .++..|.  .++..+++||||+..++...
T Consensus         5 ~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~   33 (326)
T cd06096           5 FIDIFIGNPPQKQSLILDTGSSSLSFPCS   33 (326)
T ss_pred             EEEEEecCCCeEEEEEEeCCCCceEEecC
Confidence            4556666  58999999999999987653


No 68 
>PF03419 Peptidase_U4:  Sporulation factor SpoIIGA  This family belongs to family U4 of the peptidase classification.;  InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-).  Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=64.89  E-value=12  Score=26.37  Aligned_cols=33  Identities=24%  Similarity=0.154  Sum_probs=23.0

Q ss_pred             EEEEEEECCEE--EEEEEcCCCCc---------ccccHHhhhee
Q 045784            2 IRVIGRLVHKV--VTVLIDSRRTH---------NFVRQELAVKL   34 (90)
Q Consensus         2 ~~v~~~i~~~~--~~aLiDsGA~~---------~~i~~~~a~~l   34 (90)
                      ..++..++|+.  ++||+|||.+.         .+++.+.++++
T Consensus       158 ~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~  201 (293)
T PF03419_consen  158 YPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL  201 (293)
T ss_pred             EEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence            45677778765  56999999764         36666666655


No 69 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=61.72  E-value=11  Score=25.29  Aligned_cols=27  Identities=11%  Similarity=0.030  Sum_probs=20.6

Q ss_pred             EEEEECC--EEEEEEEcCCCCcccccHHh
Q 045784            4 VIGRLVH--KVVTVLIDSRRTHNFVRQEL   30 (90)
Q Consensus         4 v~~~i~~--~~~~aLiDsGA~~~~i~~~~   30 (90)
                      ++..|..  ++..+++|||++..++..+-
T Consensus         3 ~~i~iGtp~q~~~l~~DTGS~~~wv~~~~   31 (283)
T cd05471           3 GEITIGTPPQKFSVIFDTGSSLLWVPSSN   31 (283)
T ss_pred             EEEEECCCCcEEEEEEeCCCCCEEEecCC
Confidence            4555654  58999999999998886653


No 70 
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=61.40  E-value=17  Score=25.75  Aligned_cols=33  Identities=12%  Similarity=0.019  Sum_probs=22.0

Q ss_pred             EEEEEEECCEE--EEEEEcCCCCc---------ccccHHhhhee
Q 045784            2 IRVIGRLVHKV--VTVLIDSRRTH---------NFVRQELAVKL   34 (90)
Q Consensus         2 ~~v~~~i~~~~--~~aLiDsGA~~---------~~i~~~~a~~l   34 (90)
                      ..++..++|+.  +++|+|||.+.         ..++.+.++++
T Consensus       159 ~~v~i~~~g~~~~~~alvDTGN~L~DPlT~~PV~Ive~~~~~~~  202 (288)
T TIGR02854       159 YELEICLDGKKVTIKGFLDTGNQLRDPLTKLPVIVVEYDSLKSI  202 (288)
T ss_pred             EEEEEEECCEEEEEEEEEecCCcccCCCCCCCEEEEEHHHhhhh
Confidence            35667778755  57999999763         45555555544


No 71 
>PLN03146 aspartyl protease family protein; Provisional
Probab=52.48  E-value=22  Score=26.43  Aligned_cols=25  Identities=4%  Similarity=-0.049  Sum_probs=19.8

Q ss_pred             EEEEEEC--CEEEEEEEcCCCCccccc
Q 045784            3 RVIGRLV--HKVVTVLIDSRRTHNFVR   27 (90)
Q Consensus         3 ~v~~~i~--~~~~~aLiDsGA~~~~i~   27 (90)
                      .++..|.  .++..+++|||++..++.
T Consensus        86 ~v~i~iGTPpq~~~vi~DTGS~l~Wv~  112 (431)
T PLN03146         86 LMNISIGTPPVPILAIADTGSDLIWTQ  112 (431)
T ss_pred             EEEEEcCCCCceEEEEECCCCCcceEc
Confidence            4555665  478889999999999985


No 72 
>PTZ00165 aspartyl protease; Provisional
Probab=52.47  E-value=22  Score=26.98  Aligned_cols=27  Identities=11%  Similarity=0.081  Sum_probs=21.4

Q ss_pred             EEEEEECC--EEEEEEEcCCCCcccccHH
Q 045784            3 RVIGRLVH--KVVTVLIDSRRTHNFVRQE   29 (90)
Q Consensus         3 ~v~~~i~~--~~~~aLiDsGA~~~~i~~~   29 (90)
                      ..+..|..  ++..+++||||+.-++...
T Consensus       122 ~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~  150 (482)
T PTZ00165        122 FGEIQVGTPPKSFVVVFDTGSSNLWIPSK  150 (482)
T ss_pred             EEEEEeCCCCceEEEEEeCCCCCEEEEch
Confidence            34566665  8999999999999988653


No 73 
>KOG0679 consensus Actin-related protein - Arp4p/Act3p [Cytoskeleton]
Probab=52.25  E-value=7.3  Score=29.12  Aligned_cols=21  Identities=19%  Similarity=0.235  Sum_probs=17.3

Q ss_pred             ECCEEEEEEEcCCCCcccccH
Q 045784            8 LVHKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus         8 i~~~~~~aLiDsGA~~~~i~~   28 (90)
                      =+|+....++|+||+++-+++
T Consensus       150 A~GrstalVvDiGa~~~svsP  170 (426)
T KOG0679|consen  150 ANGRSTALVVDIGATHTSVSP  170 (426)
T ss_pred             hcCCCceEEEEecCCCceeee
Confidence            368899999999999986654


No 74 
>PF14541 TAXi_C:  Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=49.22  E-value=11  Score=23.79  Aligned_cols=21  Identities=24%  Similarity=0.284  Sum_probs=17.7

Q ss_pred             EEEEcCCCCcccccHHhhhee
Q 045784           14 TVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        14 ~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      .++||||++..++.+..-+.+
T Consensus        31 ~~iiDSGT~~T~L~~~~y~~l   51 (161)
T PF14541_consen   31 GTIIDSGTTYTYLPPPVYDAL   51 (161)
T ss_dssp             SEEE-SSSSSEEEEHHHHHHH
T ss_pred             CEEEECCCCccCCcHHHHHHH
Confidence            367999999999999988888


No 75 
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability 
Probab=48.38  E-value=17  Score=26.35  Aligned_cols=21  Identities=10%  Similarity=0.146  Sum_probs=18.8

Q ss_pred             EEEEcCCCCcccccHHhhhee
Q 045784           14 TVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        14 ~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      .++||||.+..++.++..+.+
T Consensus       231 g~iiDSGTs~t~lp~~~y~~l  251 (362)
T cd05489         231 GVKLSTVVPYTVLRSDIYRAF  251 (362)
T ss_pred             cEEEecCCceEEECHHHHHHH
Confidence            489999999999999888877


No 76 
>KOG4584 consensus Uncharacterized conserved protein [General function prediction only]
Probab=46.87  E-value=15  Score=26.65  Aligned_cols=22  Identities=27%  Similarity=0.548  Sum_probs=17.5

Q ss_pred             ceeEEeeEEEcCCCCceEEeCc
Q 045784           59 GVPLIVDFYLLPFEGYDVVLDT   80 (90)
Q Consensus        59 ~~~~~~~~~v~~~~~~dvILG~   80 (90)
                      +.++.+-..-.+-.|+|+|||+
T Consensus       196 ~~p~K~~lif~DNSG~DvILGi  217 (348)
T KOG4584|consen  196 GKPHKCALIFVDNSGFDVILGI  217 (348)
T ss_pred             CCCcceEEEEecCCCcceeeee
Confidence            4566666777788999999997


No 77 
>COG0052 RpsB Ribosomal protein S2 [Translation, ribosomal structure and biogenesis]
Probab=35.10  E-value=25  Score=24.71  Aligned_cols=19  Identities=21%  Similarity=0.123  Sum_probs=16.9

Q ss_pred             CEEEEEEEcCCCCcccccH
Q 045784           10 HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus        10 ~~~~~aLiDsGA~~~~i~~   28 (90)
                      |.|+.|++||-+.-..||-
T Consensus       179 gIPVvAlvDTn~dpd~VD~  197 (252)
T COG0052         179 GIPVVALVDTNCDPDGVDY  197 (252)
T ss_pred             CCCEEEEecCCCCCccCce
Confidence            6799999999999888874


No 78 
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=32.66  E-value=35  Score=24.89  Aligned_cols=22  Identities=18%  Similarity=0.184  Sum_probs=18.0

Q ss_pred             CEEEEEEEcCCCCcccccHHhh
Q 045784           10 HKVVTVLIDSRRTHNFVRQELA   31 (90)
Q Consensus        10 ~~~~~aLiDsGA~~~~i~~~~a   31 (90)
                      .++..+++|||++..++.-...
T Consensus        57 pq~f~v~~DTGS~~lWV~c~~c   78 (398)
T KOG1339|consen   57 PQSFTVVLDTGSDLLWVPCAPC   78 (398)
T ss_pred             CeeeEEEEeCCCCceeeccccc
Confidence            4779999999999998877444


No 79 
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster.  Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=31.84  E-value=66  Score=18.41  Aligned_cols=34  Identities=15%  Similarity=0.224  Sum_probs=29.8

Q ss_pred             EEEEEEECCEEEEEEEcCCCCcccccHHhhheee
Q 045784            2 IRVIGRLVHKVVTVLIDSRRTHNFVRQELAVKLN   35 (90)
Q Consensus         2 ~~v~~~i~~~~~~aLiDsGA~~~~i~~~~a~~l~   35 (90)
                      |+|++..|+.-.++-+.+..+..=+-++++++++
T Consensus         1 ~~vK~~~~~d~~r~~l~~~~~~~~L~~~i~~r~~   34 (82)
T cd06407           1 VRVKATYGEEKIRFRLPPSWGFTELKQEIAKRFK   34 (82)
T ss_pred             CEEEEEeCCeEEEEEcCCCCCHHHHHHHHHHHhC
Confidence            6899999999999999999888888888888774


No 80 
>KOG3101 consensus Esterase D [General function prediction only]
Probab=31.54  E-value=21  Score=24.98  Aligned_cols=27  Identities=22%  Similarity=0.233  Sum_probs=20.4

Q ss_pred             ECCEEEEEEEcCCCCcccccHHhhhee
Q 045784            8 LVHKVVTVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus         8 i~~~~~~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      .++.+..+|||.|+..+|..+++.-..
T Consensus       211 y~~~~~~ilIdqG~~D~Fl~~qLlPe~  237 (283)
T KOG3101|consen  211 YRGVGDDILIDQGAADNFLAEQLLPEN  237 (283)
T ss_pred             cCCCCccEEEecCccchhhhhhcChHH
Confidence            345566799999999999986665544


No 81 
>PRK05449 aspartate alpha-decarboxylase; Provisional
Probab=30.22  E-value=76  Score=20.00  Aligned_cols=39  Identities=21%  Similarity=0.372  Sum_probs=26.8

Q ss_pred             EEEEECCEEEEEEEcCCCCccc-----ccHHhhhee---------eEEecCCCeeeee
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNF-----VRQELAVKL---------NVMVASGERFSSL   47 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~-----i~~~~a~~l---------~v~~a~G~~~~~~   47 (90)
                      +++|+....+     |+|..+|     ||+++.++-         -+.+.||.+.++.
T Consensus         6 LksKiHratV-----T~a~L~Y~GSitID~~Ll~aagi~p~E~V~V~Nv~NG~Rf~TY   58 (126)
T PRK05449          6 LKSKIHRATV-----TEADLNYEGSITIDEDLLDAAGILENEKVQIVNVNNGARFETY   58 (126)
T ss_pred             hhhcccceEE-----eccccccceeEEECHHHHHhcCCCCCCEEEEEECCCCcEEEEE
Confidence            4566666666     7887776     678888777         3455678887663


No 82 
>PF11925 DUF3443:  Protein of unknown function (DUF3443);  InterPro: IPR021847  This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 400 to 434 amino acids in length. This protein has two conserved sequence motifs: NPV and DNNG. 
Probab=29.25  E-value=66  Score=23.93  Aligned_cols=26  Identities=12%  Similarity=0.160  Sum_probs=21.1

Q ss_pred             EEEEECCEEEEE-EEcCCCCcccccHH
Q 045784            4 VIGRLVHKVVTV-LIDSRRTHNFVRQE   29 (90)
Q Consensus         4 v~~~i~~~~~~a-LiDsGA~~~~i~~~   29 (90)
                      +....||+.... .||||+...|....
T Consensus       246 ~tt~~~G~t~~~sf~DSGSNg~fF~d~  272 (370)
T PF11925_consen  246 FTTTFNGQTYSASFFDSGSNGYFFPDS  272 (370)
T ss_pred             EEEEecCceeeeeeEecCCceeeccCC
Confidence            456778888887 99999999998843


No 83 
>cd06919 Asp_decarbox Aspartate alpha-decarboxylase or L-aspartate 1-decarboxylase, a pyruvoyl group-dependent  decarboxylase in beta-alanine production. Decarboxylation of aspartate is  the major route of beta-alanine production in bacteria, and is catalyzed  by the enzyme L-aspartate decarboxylase (ADC), EC:4.1.1.11 which  requires a pyruvoyl group for its activity. The pyruvoyl cofactor is  covalently bound to the enzyme. The protein is synthesized as a  proenzyme and cleaved via self-processing at Gly23-Ser24 to yield an  alpha chain (C-terminal fragment) and beta chain (N-terminal fragment),  and the pyruvoyl group. Beta-alanine is required for the biosynthesis of  pantothenate, in which the enzyme plays a critical regulatory role. The  active site of the tetrameric enzyme is located at the interface of two  subunits, with a Lysine and a Histidine from the beta chain of one  subunit forming the active site with residues from the alpha chain of  the adjacent subunit. This alignment 
Probab=28.92  E-value=85  Score=19.36  Aligned_cols=39  Identities=21%  Similarity=0.386  Sum_probs=26.5

Q ss_pred             EEEEECCEEEEEEEcCCCCccc-----ccHHhhhee---------eEEecCCCeeeee
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNF-----VRQELAVKL---------NVMVASGERFSSL   47 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~-----i~~~~a~~l---------~v~~a~G~~~~~~   47 (90)
                      +++|+....+     |+|..+|     ||+++.+.-         -+.+.||.+.++.
T Consensus         5 LksKiHratV-----T~a~L~YeGSitID~~Ll~aagi~~~E~V~I~Nv~NG~Rf~TY   57 (111)
T cd06919           5 LKSKIHRATV-----TEADLNYEGSITIDEDLLEAAGILPYEKVLVVNVNNGARFETY   57 (111)
T ss_pred             hhhcccceEE-----eccccccceeEEECHHHHHhcCCCCCCEEEEEECCCCcEEEEE
Confidence            4556666666     7777776     678888776         3455678887663


No 84 
>TIGR03711 acc_sec_asp3 accessory Sec system protein Asp3. This protein is designated Asp3 because, along with SecY2, SecA2, and other proteins it is part of the accessory Sec system. The system is involved in the export of serine-rich glycoproteins important for virulence in a number of Gram-positive species, including Streptococcus gordonii and Staphylococcus aureus. This protein family is assigned to transport rather than glycosylation function, but the specific molecular role is unknown.
Probab=28.83  E-value=94  Score=19.78  Aligned_cols=51  Identities=16%  Similarity=0.168  Sum_probs=29.7

Q ss_pred             EecCCCeeeeec---------eeeeeEEEEeceeEEeeEEEcCCCCceEEeCcchhhhcC
Q 045784           37 MVASGERFSSLG---------TCLQTLIKLQGVPLIVDFYLLPFEGYDVVLDTQWLSTFG   87 (90)
Q Consensus        37 ~~a~G~~~~~~~---------~~~~~~i~i~~~~~~~~~~v~~~~~~dvILG~~wL~~~~   87 (90)
                      .++.|+.+....         ..+++|+--+|+.+...+.+--.+.--+.|-+.|+.+++
T Consensus        33 lMpsG~~I~~W~S~~nyq~~r~~P~LPlLk~g~~Y~i~~n~~~~P~~s~~~ki~F~dr~~   92 (135)
T TIGR03711        33 LMPSGETIHTWSSLTNYQAARTVPSLPLLKRGQTYKLSLNADASPEGSVYLKITFFDRQG   92 (135)
T ss_pred             CCCCCCEEEEEEeeecccccCCCCCCcEEcCCCEEEEEEeeeeCCCceEEEEEEEeccCC
Confidence            456677665432         235677777787776555443333445666666666654


No 85 
>PF04126 Cyclophil_like:  Cyclophilin-like;  InterPro: IPR007256 Proteins of this family have no known function.; PDB: 2KA0_A 1ZX8_C 2NNZ_A.
Probab=28.25  E-value=1.2e+02  Score=18.48  Aligned_cols=28  Identities=14%  Similarity=0.129  Sum_probs=21.0

Q ss_pred             EEEEEEECCEEEEEE-EcCCCCcccccHHhhhee
Q 045784            2 IRVIGRLVHKVVTVL-IDSRRTHNFVRQELAVKL   34 (90)
Q Consensus         2 ~~v~~~i~~~~~~aL-iDsGA~~~~i~~~~a~~l   34 (90)
                      |+|+..++++.+.|- .|+     --..+++++|
T Consensus         1 mkI~i~i~~~~~~a~L~d~-----~ta~~~~~~L   29 (120)
T PF04126_consen    1 MKIKITIGGQEIEAELNDS-----PTARAFAAQL   29 (120)
T ss_dssp             EEEEEEETTEEEEEEEETT-----HHHHHHHHC-
T ss_pred             CeEEEEECCEEEEEEECCC-----HHHHHHHHhC
Confidence            799999999999864 455     4556777777


No 86 
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=28.25  E-value=1.2e+02  Score=17.50  Aligned_cols=33  Identities=9%  Similarity=0.105  Sum_probs=25.8

Q ss_pred             EEEEEEECCEEEEEEEcCCCCc--ccccHHhhhee
Q 045784            2 IRVIGRLVHKVVTVLIDSRRTH--NFVRQELAVKL   34 (90)
Q Consensus         2 ~~v~~~i~~~~~~aLiDsGA~~--~~i~~~~a~~l   34 (90)
                      +++++..+|..+.+.++..+..  .=+.+++++++
T Consensus         1 V~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf   35 (81)
T cd06396           1 VNLKVTYNGESQSFLVSDSENTTWASVEAMVKVSF   35 (81)
T ss_pred             CEEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHh
Confidence            4799999999999999996644  44556677766


No 87 
>PF00564 PB1:  PB1 domain;  InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=27.92  E-value=1e+02  Score=16.81  Aligned_cols=34  Identities=18%  Similarity=0.218  Sum_probs=26.7

Q ss_pred             CEEEEEEECCEEEE-EEEcCCCCcccccHHhhhee
Q 045784            1 MIRVIGRLVHKVVT-VLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus         1 ~~~v~~~i~~~~~~-aLiDsGA~~~~i~~~~a~~l   34 (90)
                      |++++...++.... +-+..+.+..-+-..+++++
T Consensus         1 t~~vK~~~~~~~~~~~~~~~~~s~~~L~~~i~~~~   35 (84)
T PF00564_consen    1 TVRVKVRYGGDIRRIISLPSDVSFDDLRSKIREKF   35 (84)
T ss_dssp             SEEEEEEETTEEEEEEEECSTSHHHHHHHHHHHHH
T ss_pred             CEEEEEEECCeeEEEEEcCCCCCHHHHHHHHHHHh
Confidence            78999999998888 77777776666666666665


No 88 
>PF04746 DUF575:  Protein of unknown function (DUF575);  InterPro: IPR006835 This represents a conserved region found in a number of Chlamydophila pneumoniae proteins.
Probab=27.82  E-value=27  Score=20.95  Aligned_cols=12  Identities=25%  Similarity=0.733  Sum_probs=9.2

Q ss_pred             eEEeCcchhhhc
Q 045784           75 DVVLDTQWLSTF   86 (90)
Q Consensus        75 dvILG~~wL~~~   86 (90)
                      .++.|++||-..
T Consensus        28 hiv~GieWLvS~   39 (101)
T PF04746_consen   28 HIVMGIEWLVSR   39 (101)
T ss_pred             eEEeehHHHHHH
Confidence            367899999654


No 89 
>PF14612 Ino80_Iec3:  IEC3 subunit of the Ino80 complex, chromatin re-modelling
Probab=27.29  E-value=26  Score=24.37  Aligned_cols=8  Identities=25%  Similarity=0.642  Sum_probs=7.3

Q ss_pred             chhhhcCC
Q 045784           81 QWLSTFGL   88 (90)
Q Consensus        81 ~wL~~~~p   88 (90)
                      +||++|.|
T Consensus       217 NWLRKh~P  224 (232)
T PF14612_consen  217 NWLRKHQP  224 (232)
T ss_pred             HHHHhcCC
Confidence            79999987


No 90 
>PF07341 DUF1473:  Protein of unknown function (DUF1473);  InterPro: IPR009941 This entry represents a family of hypothetical proteins of around 150 residues in length found in Borrelia species. The function of this family is unknown.
Probab=25.33  E-value=46  Score=21.56  Aligned_cols=29  Identities=17%  Similarity=0.198  Sum_probs=23.7

Q ss_pred             eEEEEeceeEEeeEEEcCCCCceEEeCcc
Q 045784           53 TLIKLQGVPLIVDFYLLPFEGYDVVLDTQ   81 (90)
Q Consensus        53 ~~i~i~~~~~~~~~~v~~~~~~dvILG~~   81 (90)
                      +++--.++++.-.+.++|+=.+|-|||.|
T Consensus         5 lkILTk~Kt~~y~lk~ipvY~WDsiLGFd   33 (163)
T PF07341_consen    5 LKILTKNKTYTYDLKDIPVYEWDSILGFD   33 (163)
T ss_pred             eeeeecCcceeeeccccchhhhhhhcccc
Confidence            44555677888889999999999999984


No 91 
>TIGR00223 panD L-aspartate-alpha-decarboxylase. Members of this family are aspartate 1-decarboxylase, the enzyme that makes beta-alanine and C02 from aspartate. Beta-alanine is then used to make the vitamin pantothenate, from which coenzyme A is made. Aspartate 1-decarboxylase is synthesized as a proenzyme, then cleaved to an alpha (C-terminal) and beta (N-terminal) subunit with a pyruvoyl group.
Probab=25.13  E-value=96  Score=19.56  Aligned_cols=39  Identities=28%  Similarity=0.470  Sum_probs=26.1

Q ss_pred             EEEEECCEEEEEEEcCCCCccc-----ccHHhhhee---------eEEecCCCeeeee
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNF-----VRQELAVKL---------NVMVASGERFSSL   47 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~-----i~~~~a~~l---------~v~~a~G~~~~~~   47 (90)
                      +++|+....+     |+|..+|     ||+++.+.-         -+.+.||.+.++.
T Consensus         6 LksKIHratV-----T~a~L~Y~GSItID~~Lm~aagi~p~E~V~V~Nv~NG~Rf~TY   58 (126)
T TIGR00223         6 LQGKLHRATV-----THANLNYEGSITIDEDLLDAAGILENEKVDIVNVNNGKRFSTY   58 (126)
T ss_pred             hhhhhcceEE-----eccccccceeEEECHHHHHhcCCCCCCEEEEEECCCCcEEEEE
Confidence            4556666666     7777776     678887776         3455678887663


No 92 
>PF02261 Asp_decarbox:  Aspartate decarboxylase;  InterPro: IPR003190 Decarboxylation of aspartate is the major route of alanine production in bacteria, and is catalysed by the enzyme aspartate decarboxylase. The enzyme is translated as an inactive proenzyme of two chains, A and B. This family contains both chains of aspartate decarboxylase.; GO: 0004068 aspartate 1-decarboxylase activity, 0006523 alanine biosynthetic process; PDB: 1PYU_C 1AW8_A 1PYQ_B 3TM7_C 1PT1_A 1PQH_A 1PPY_B 1PT0_B 1PQF_A 1PQE_A ....
Probab=23.66  E-value=89  Score=19.41  Aligned_cols=39  Identities=23%  Similarity=0.433  Sum_probs=23.1

Q ss_pred             EEEEECCEEEEEEEcCCCCccc-----ccHHhhhee------e---EEecCCCeeeee
Q 045784            4 VIGRLVHKVVTVLIDSRRTHNF-----VRQELAVKL------N---VMVASGERFSSL   47 (90)
Q Consensus         4 v~~~i~~~~~~aLiDsGA~~~~-----i~~~~a~~l------~---v~~a~G~~~~~~   47 (90)
                      +++|+....+     |+|..+|     ||+++.+.-      .   +.+.||.+.++.
T Consensus         6 LksKiHratV-----T~a~L~Y~GSitID~~Ll~aagi~p~E~V~V~Nv~nG~Rf~TY   58 (116)
T PF02261_consen    6 LKSKIHRATV-----TEADLNYEGSITIDEDLLDAAGILPYEQVQVVNVNNGERFETY   58 (116)
T ss_dssp             EEEEEEEEE-------EEETTSTSCEEEEHHHHHHCT--TTBEEEEEETTT--EEEEE
T ss_pred             hhhhhcceEE-----eccccccceeeEECHHHHHHcCCCcCCEEEEEECCCCcEEEEE
Confidence            5667766666     7777666     678887776      3   345668887663


No 93 
>PF00455 DeoRC:  DeoR C terminal sensor domain;  InterPro: IPR014036 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after Escherichia coli deoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerization domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].
Probab=23.42  E-value=47  Score=21.19  Aligned_cols=21  Identities=29%  Similarity=0.222  Sum_probs=18.9

Q ss_pred             EEEEcCCCCcccccHHhhhee
Q 045784           14 TVLIDSRRTHNFVRQELAVKL   34 (90)
Q Consensus        14 ~aLiDsGA~~~~i~~~~a~~l   34 (90)
                      .+.+|+|++...+-+.+.++.
T Consensus        22 ~Ifld~GtT~~~la~~L~~~~   42 (161)
T PF00455_consen   22 TIFLDSGTTTLELAKYLPDKK   42 (161)
T ss_pred             EEEEECchHHHHHHHHhhcCC
Confidence            478999999999999999985


No 94 
>KOG3283 consensus 40S ribosomal protein S8 [Translation, ribosomal structure and biogenesis]
Probab=23.12  E-value=1e+02  Score=20.60  Aligned_cols=58  Identities=16%  Similarity=0.176  Sum_probs=32.8

Q ss_pred             EEEEECCEEEEEE-EcCCCCcccccHHhhheee---EEe--cCCCeeeeeceeeeeEEEEeceeE
Q 045784            4 VIGRLVHKVVTVL-IDSRRTHNFVRQELAVKLN---VMV--ASGERFSSLGTCLQTLIKLQGVPL   62 (90)
Q Consensus         4 v~~~i~~~~~~aL-iDsGA~~~~i~~~~a~~l~---v~~--a~G~~~~~~~~~~~~~i~i~~~~~   62 (90)
                      |+++-.|+...|| +|+| ..|+.|+.+.++-.   |..  .|.+-+.+..-...+-+.++..+|
T Consensus        46 VRvRGGN~KyrALRlD~G-nfsw~Se~~trktri~dvvY~~snneLVRT~TL~k~aiV~ida~Pf  109 (200)
T KOG3283|consen   46 VRVRGGNKKYRALRLDMG-NFSWGSEGTTRKTRILDVVYHPSNNELVRTNTLTKSAIVQIDAAPF  109 (200)
T ss_pred             EEecCCchhhheeeeccC-cccccccceeeeeeeeeeEecCCccceeeecchhhceEEEeccchH
Confidence            4445455556654 8999 78899998888772   221  234444443323333455555544


No 95 
>PF08922 DUF1905:  Domain of unknown function (DUF1905);  InterPro: IPR015018 This family consist of hypothetical bacterial proteins. ; PDB: 2D9R_A.
Probab=22.60  E-value=1.4e+02  Score=16.85  Aligned_cols=13  Identities=15%  Similarity=0.107  Sum_probs=11.2

Q ss_pred             EEEEEEECCEEEE
Q 045784            2 IRVIGRLVHKVVT   14 (90)
Q Consensus         2 ~~v~~~i~~~~~~   14 (90)
                      ++|+|+||+++..
T Consensus        33 v~V~~tI~g~~~~   45 (80)
T PF08922_consen   33 VPVRGTIDGHPWR   45 (80)
T ss_dssp             EEEEEEETTEEEE
T ss_pred             eEEEEEECCEEEE
Confidence            6899999998876


No 96 
>CHL00067 rps2 ribosomal protein S2
Probab=22.53  E-value=57  Score=22.35  Aligned_cols=18  Identities=6%  Similarity=0.111  Sum_probs=15.6

Q ss_pred             CEEEEEEEcCCCCccccc
Q 045784           10 HKVVTVLIDSRRTHNFVR   27 (90)
Q Consensus        10 ~~~~~aLiDsGA~~~~i~   27 (90)
                      |.|+.+++||-+.-++|+
T Consensus       184 ~IPvIaivDTn~~p~~id  201 (230)
T CHL00067        184 GIPTISILDTNCDPDLAD  201 (230)
T ss_pred             CCCEEEEEeCCCCccccc
Confidence            679999999999888774


No 97 
>PRK05299 rpsB 30S ribosomal protein S2; Provisional
Probab=22.29  E-value=57  Score=22.80  Aligned_cols=19  Identities=16%  Similarity=0.102  Sum_probs=16.1

Q ss_pred             CEEEEEEEcCCCCcccccH
Q 045784           10 HKVVTVLIDSRRTHNFVRQ   28 (90)
Q Consensus        10 ~~~~~aLiDsGA~~~~i~~   28 (90)
                      |.|+.+++||-+.-++|+-
T Consensus       180 ~IPiIaivDTn~dp~~Idy  198 (258)
T PRK05299        180 GIPVVAIVDTNCDPDGVDY  198 (258)
T ss_pred             CCCEEEEeeCCCCCcccce
Confidence            6799999999998887763


No 98 
>PRK15423 hypoxanthine phosphoribosyltransferase; Provisional
Probab=21.89  E-value=46  Score=21.84  Aligned_cols=26  Identities=35%  Similarity=0.422  Sum_probs=18.7

Q ss_pred             ECCEEEEE---EEcCCCCcccccHHhhhe
Q 045784            8 LVHKVVTV---LIDSRRTHNFVRQELAVK   33 (90)
Q Consensus         8 i~~~~~~a---LiDsGA~~~~i~~~~a~~   33 (90)
                      ++|+.+..   ++|||.|...+.+.+-.+
T Consensus        90 v~gk~VLlVDDIiDTG~TL~~l~~~l~~~  118 (178)
T PRK15423         90 IRGKDVLIVEDIIDSGNTLSKVREILSLR  118 (178)
T ss_pred             CCCCEEEEEeeecCchHHHHHHHHHHHhC
Confidence            55666553   689999999888776553


No 99 
>PF12264 Waikav_capsid_1:  Waikavirus capsid protein 1;  InterPro: IPR024379 This entry represents capsid protein 1 of Rice tungro spherical virus []. 
Probab=20.59  E-value=1.6e+02  Score=19.55  Aligned_cols=25  Identities=20%  Similarity=0.126  Sum_probs=14.0

Q ss_pred             EEcCCCCcccccHHhhheeeEEecC
Q 045784           16 LIDSRRTHNFVRQELAVKLNVMVAS   40 (90)
Q Consensus        16 LiDsGA~~~~i~~~~a~~l~v~~a~   40 (90)
                      +-=||++|||-.-...+-..+.+.|
T Consensus       141 iC~SGSE~S~~~~G~L~I~~~NvL~  165 (197)
T PF12264_consen  141 ICLSGSEHSFGRMGILKICCLNVLN  165 (197)
T ss_pred             eeecCCcccccccceEEEEEeeecc
Confidence            3447888887655544444444444


Done!