Query         psy18197
Match_columns 144
No_of_seqs    125 out of 760
Neff          7.3 
Searched_HMMs 46136
Date          Sat Aug 17 01:17:57 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy18197.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18197hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0012|consensus              100.0 6.7E-32 1.5E-36  217.7   8.7  135    1-135   235-369 (380)
  2 PF09668 Asp_protease:  Asparty 100.0 7.9E-31 1.7E-35  186.4  12.1  101    1-101    24-124 (124)
  3 cd05479 RP_DDI RP_DDI; retrope 100.0 7.2E-29 1.6E-33  176.5  14.5  109    1-109    16-124 (124)
  4 cd05480 NRIP_C NRIP_C; putativ 100.0 5.5E-29 1.2E-33  169.2  10.3  101    4-104     1-103 (103)
  5 PF08284 RVP_2:  Retroviral asp  99.9 3.4E-23 7.5E-28  149.5  13.5  109    2-111    22-131 (135)
  6 TIGR02281 clan_AA_DTGA clan AA  99.9 2.6E-20 5.7E-25  132.1  13.3  105    2-109    12-119 (121)
  7 cd05484 retropepsin_like_LTR_2  99.8   8E-20 1.7E-24  123.0  11.9   90    2-94      1-91  (91)
  8 PF13650 Asp_protease_2:  Aspar  99.8 4.6E-18   1E-22  112.4  11.9   88    4-92      1-90  (90)
  9 PF00077 RVP:  Retroviral aspar  99.8 5.7E-18 1.2E-22  115.2  11.4   94    2-101     6-100 (100)
 10 cd05483 retropepsin_like_bacte  99.7 3.2E-17   7E-22  109.4  11.0   92    1-94      2-96  (96)
 11 TIGR03698 clan_AA_DTGF clan AA  99.7 1.2E-16 2.6E-21  111.1  11.1  100    3-107     1-107 (107)
 12 cd06095 RP_RTVL_H_like Retrope  99.7 1.1E-16 2.4E-21  106.9  10.0   83    4-94      1-86  (86)
 13 COG3577 Predicted aspartyl pro  99.6 4.1E-15 8.8E-20  113.1  10.0  104    2-109   106-213 (215)
 14 cd00303 retropepsin_like Retro  99.5 1.1E-12 2.4E-17   83.0  10.9   90    4-94      1-92  (92)
 15 cd05481 retropepsin_like_LTR_1  99.4 1.7E-12 3.7E-17   88.1  10.3   86    5-92      2-91  (93)
 16 PF12384 Peptidase_A2B:  Ty3 tr  99.3 4.6E-11 9.9E-16   88.3  11.7   93    3-97     36-130 (177)
 17 PF13975 gag-asp_proteas:  gag-  99.3 3.1E-11 6.6E-16   78.1   7.5   41    2-42      9-49  (72)
 18 COG5550 Predicted aspartyl pro  99.1 6.7E-10 1.4E-14   78.5   9.9   94   13-111    27-121 (125)
 19 cd06094 RP_Saci_like RP_Saci_l  99.1 6.8E-10 1.5E-14   74.5   6.9   79   12-97      9-88  (89)
 20 cd05482 HIV_retropepsin_like R  99.0 2.3E-09   5E-14   72.0   8.8   86    5-94      2-87  (87)
 21 PF02160 Peptidase_A3:  Caulifl  98.9 1.3E-08 2.9E-13   77.8  10.3  103    2-113     5-119 (201)
 22 PF05585 DUF1758:  Putative pep  98.7 9.6E-08 2.1E-12   70.5   8.0   29   11-39     11-39  (164)
 23 PF12382 Peptidase_A2E:  Retrot  97.7 0.00034 7.3E-09   48.2   7.6   85   13-100    48-134 (137)
 24 PF00026 Asp:  Eukaryotic aspar  97.3  0.0037   8E-08   49.5  10.4   87    2-91      2-113 (317)
 25 PTZ00013 plasmepsin 4 (PM4); P  97.2  0.0031 6.7E-08   53.9   9.8   89    1-92    138-251 (450)
 26 cd05470 pepsin_retropepsin_lik  97.1  0.0061 1.3E-07   41.1   8.3   84    4-90      1-109 (109)
 27 PTZ00147 plasmepsin-1; Provisi  97.1  0.0063 1.4E-07   52.0  10.2   89    1-92    139-252 (453)
 28 cd05476 pepsin_A_like_plant Ch  96.9  0.0072 1.6E-07   47.4   8.5   83   14-111   178-262 (265)
 29 COG4067 Uncharacterized protei  96.9  0.0037 8.1E-08   46.0   6.0  100    6-106    31-157 (162)
 30 cd05474 SAP_like SAPs, pepsin-  96.8   0.018 3.9E-07   45.4  10.3   73    2-91      3-79  (295)
 31 cd06097 Aspergillopepsin_like   96.8   0.014 3.1E-07   45.9   9.2   89    2-92      1-114 (278)
 32 cd05478 pepsin_A Pepsin A, asp  96.5   0.038 8.3E-07   44.4  10.0   88    2-92     11-122 (317)
 33 cd05476 pepsin_A_like_plant Ch  96.4   0.027 5.8E-07   44.2   8.4   76    2-92      2-88  (265)
 34 cd05487 renin_like Renin stimu  96.2   0.089 1.9E-06   42.5  10.8   88    2-92      9-121 (326)
 35 PF05618 Zn_protease:  Putative  96.2   0.018 3.8E-07   41.8   5.9   45   62-107    87-134 (138)
 36 cd05477 gastricsin Gastricsins  96.0   0.083 1.8E-06   42.4   9.9   88    2-92      4-115 (318)
 37 cd05488 Proteinase_A_fungi Fun  96.0     0.1 2.2E-06   42.0  10.1   88    2-92     11-122 (320)
 38 cd05477 gastricsin Gastricsins  95.9   0.066 1.4E-06   43.0   8.9  102    6-111   188-317 (318)
 39 PTZ00165 aspartyl protease; Pr  95.9     0.1 2.2E-06   45.0  10.3   89    2-92    121-237 (482)
 40 cd05478 pepsin_A Pepsin A, asp  95.9    0.17 3.8E-06   40.6  11.0  101    6-110   194-316 (317)
 41 cd06096 Plasmepsin_5 Plasmepsi  95.7   0.046   1E-06   44.2   7.1   89   13-111   232-322 (326)
 42 cd05472 cnd41_like Chloroplast  95.7    0.06 1.3E-06   42.8   7.6   79    2-92      2-89  (299)
 43 cd05486 Cathespin_E Cathepsin   95.7    0.11 2.4E-06   41.7   9.1   88    2-92      1-112 (316)
 44 cd06098 phytepsin Phytepsin, a  95.7   0.084 1.8E-06   42.5   8.4   89    2-92     11-123 (317)
 45 cd05474 SAP_like SAPs, pepsin-  95.5    0.11 2.5E-06   40.8   8.5   96   13-111   179-294 (295)
 46 cd06098 phytepsin Phytepsin, a  95.4    0.16 3.4E-06   40.9   9.3   90    6-109   197-315 (317)
 47 cd05490 Cathepsin_D2 Cathepsin  95.4    0.24 5.2E-06   39.9  10.1   88    2-92      7-120 (325)
 48 cd05485 Cathepsin_D_like Cathe  95.3    0.31 6.7E-06   39.5  10.8  101    6-110   199-328 (329)
 49 cd05485 Cathepsin_D_like Cathe  95.2    0.21 4.6E-06   40.4   9.4   89    2-92     12-125 (329)
 50 cd05473 beta_secretase_like Be  95.1    0.26 5.6E-06   40.5   9.7   88    2-92      4-113 (364)
 51 cd05471 pepsin_like Pepsin-lik  95.1   0.049 1.1E-06   42.2   5.2   79   12-110   202-282 (283)
 52 cd06096 Plasmepsin_5 Plasmepsi  94.8    0.18   4E-06   40.7   8.1   91    1-93      3-139 (326)
 53 cd05475 nucellin_like Nucellin  94.8    0.22 4.8E-06   39.2   8.3   88    6-111   166-270 (273)
 54 cd05488 Proteinase_A_fungi Fun  94.7    0.31 6.7E-06   39.2   9.1  101    6-110   194-319 (320)
 55 PF03539 Spuma_A9PTase:  Spumav  94.6    0.38 8.3E-06   35.4   8.3   80    8-97      1-84  (163)
 56 cd06097 Aspergillopepsin_like   94.5   0.069 1.5E-06   42.1   4.8   85    6-109   186-276 (278)
 57 cd05472 cnd41_like Chloroplast  94.3    0.21 4.6E-06   39.6   7.1   27   85-111   270-296 (299)
 58 cd05475 nucellin_like Nucellin  93.7    0.45 9.8E-06   37.4   8.0   83    2-92      3-102 (273)
 59 cd05471 pepsin_like Pepsin-lik  93.7    0.66 1.4E-05   35.8   8.8   91    2-94      1-115 (283)
 60 PF00026 Asp:  Eukaryotic aspar  93.6    0.18   4E-06   39.7   5.6   98    6-110   187-315 (317)
 61 cd05486 Cathespin_E Cathepsin   93.2    0.54 1.2E-05   37.7   7.8   99    6-109   186-314 (316)
 62 PLN03146 aspartyl protease fam  92.5    0.48   1E-05   40.2   6.8   27   85-111   399-425 (431)
 63 cd05487 renin_like Renin stimu  91.6     3.4 7.4E-05   33.2  10.6   98    6-110   195-324 (326)
 64 cd05490 Cathepsin_D2 Cathepsin  90.5     1.5 3.2E-05   35.3   7.4   93   13-109   207-323 (325)
 65 PF03419 Peptidase_U4:  Sporula  90.1     6.6 0.00014   31.5  10.8   33    2-34    158-201 (293)
 66 PTZ00165 aspartyl protease; Pr  89.9     4.1   9E-05   35.2  10.0   27   85-111   419-445 (482)
 67 PF14541 TAXi_C:  Xylanase inhi  89.7     1.3 2.8E-05   32.2   5.9   28   83-110   133-160 (161)
 68 cd05489 xylanase_inhibitor_I_l  89.5     4.1   9E-05   33.7   9.5   26   86-111   335-360 (362)
 69 PTZ00147 plasmepsin-1; Provisi  87.5     2.8   6E-05   36.0   7.3   93   12-111   332-448 (453)
 70 PTZ00013 plasmepsin 4 (PM4); P  86.3     6.4 0.00014   33.8   8.9   27   85-111   421-447 (450)
 71 KOG1339|consensus               83.6      11 0.00023   31.5   8.9   89    2-92     47-181 (398)
 72 cd05473 beta_secretase_like Be  78.9     2.9 6.3E-05   34.2   3.9   26   86-111   319-344 (364)
 73 PF11925 DUF3443:  Protein of u  77.8     6.4 0.00014   33.0   5.6   92    1-93     23-149 (370)
 74 PF14543 TAXi_N:  Xylanase inhi  76.5     4.3 9.2E-05   29.6   3.9   25    2-26      1-27  (164)
 75 TIGR02854 spore_II_GA sigma-E   68.3     7.5 0.00016   31.3   3.8   34    2-35    159-203 (288)
 76 PLN03146 aspartyl protease fam  58.9      12 0.00026   31.7   3.5   27    2-28     85-113 (431)
 77 PF13856 Gifsy-2:  ATP-binding   51.6      18 0.00038   24.0   2.8   19    4-22     22-40  (95)
 78 cd06396 PB1_NBR1 The PB1 domai  44.8      38 0.00082   22.3   3.4   35    3-37      2-38  (81)
 79 PRK08364 sulfur carrier protei  36.6      59  0.0013   20.2   3.3   33    1-37      2-36  (70)
 80 cd06407 PB1_NLP A PB1 domain i  33.0      66  0.0014   20.9   3.2   35    3-37      2-36  (82)
 81 PF13954 PapC_N:  PapC N-termin  27.7 1.6E+02  0.0035   20.9   4.8   37    2-38     32-74  (146)
 82 KOG3048|consensus               27.2      47   0.001   24.4   1.8   23    2-25     73-95  (153)
 83 cd01813 UBP_N UBP ubiquitin pr  27.1   1E+02  0.0022   19.3   3.3   36    3-38      2-37  (74)
 84 KOG0679|consensus               24.9      37 0.00079   29.0   1.0   20    9-28    151-170 (426)
 85 PF09379 FERM_N:  FERM N-termin  24.7      49  0.0011   20.6   1.4   31    8-38      4-34  (80)
 86 PF13510 Fer2_4:  2Fe-2S iron-s  24.6      88  0.0019   20.1   2.7   28    3-36      3-30  (82)
 87 PRK06083 sulfur carrier protei  24.3      82  0.0018   20.6   2.5   30    1-36     16-45  (84)
 88 KOG1339|consensus               22.8      69  0.0015   26.6   2.3   27   85-111   364-391 (398)
 89 COG3617 Prophage antirepressor  22.2 1.2E+02  0.0026   22.7   3.3   33    5-38     11-43  (176)

No 1  
>KOG0012|consensus
Probab=99.97  E-value=6.7e-32  Score=217.69  Aligned_cols=135  Identities=63%  Similarity=1.037  Sum_probs=129.9

Q ss_pred             CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      |+||+|+|||+|++|.|||||+.|+||.++|+|||+.+.+++++.+++.|+|..++.|+++.++++||+..++|+|.|++
T Consensus       235 ML~iN~~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~c~ftV~d  314 (380)
T KOG0012|consen  235 MLYINCEINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLPCSFTVLD  314 (380)
T ss_pred             EEEEEEEECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeeccceEEec
Confidence            89999999999999999999999999999999999999999999999999998899999999999999999999999999


Q ss_pred             CCCCceEeeechhheecEEEEcCCCEEEEccCCeEEEeecCCCcCceeeccCCCC
Q psy18197         81 EQPMDMLLGLDMLRRHQCCIDLKRNVLRIGTTGTETRFLSESELPVCARLTNGVD  135 (144)
Q Consensus        81 ~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~  135 (144)
                      ..+.|++||+|.|++|+|+||+++++|++++++.++||+...+.|...+++.--+
T Consensus       315 ~~~~d~llGLd~Lrr~~ccIdL~~~~L~ig~~~teiPfl~~~~lp~~~~~~~~~~  369 (380)
T KOG0012|consen  315 RRDMDLLLGLDMLRRHQCCIDLKTNVLRIGNTETEIPFLPSNELPSHNKETGASP  369 (380)
T ss_pred             CCCcchhhhHHHHHhccceeecccCeEEecCCCccccccccccCCccccccccCC
Confidence            9999999999999999999999999999999989999999999999998775443


No 2  
>PF09668 Asp_protease:  Aspartyl protease;  InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure.  This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=99.97  E-value=7.9e-31  Score=186.42  Aligned_cols=101  Identities=68%  Similarity=1.188  Sum_probs=86.2

Q ss_pred             CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      |+||+|+|||++++|+|||||+.|+||.++|+|||+++..++++.+.+.|+|..++.|+++.+++++|+.+++++|.|++
T Consensus        24 mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~G~Ih~~~l~ig~~~~~~s~~Vle  103 (124)
T PF09668_consen   24 MLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKILGRIHSVQLKIGGLFFPCSFTVLE  103 (124)
T ss_dssp             --EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEEEEEEEEEEEETTEEEEEEEEEET
T ss_pred             eEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCceeEEEEEEEEEECCEEEEEEEEEeC
Confidence            79999999999999999999999999999999999999999999889999998999999999999999999999999999


Q ss_pred             CCCCceEeeechhheecEEEE
Q psy18197         81 EQPMDMLLGLDMLRRHQCCID  101 (144)
Q Consensus        81 ~~~~d~ILG~d~L~~~~~~iD  101 (144)
                      +.+.|.|||+|||++++|+||
T Consensus       104 ~~~~d~llGld~L~~~~c~ID  124 (124)
T PF09668_consen  104 DQDVDLLLGLDMLKRHKCCID  124 (124)
T ss_dssp             TSSSSEEEEHHHHHHTT-EEE
T ss_pred             CCCcceeeeHHHHHHhCcccC
Confidence            889999999999999999998


No 3  
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.96  E-value=7.2e-29  Score=176.52  Aligned_cols=109  Identities=66%  Similarity=1.105  Sum_probs=98.1

Q ss_pred             CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      ++|++++|||++++|||||||++|+||+++|+++|++...+.+....+.|.|+....|.++.+++++|+++++++|.|++
T Consensus        16 ~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~i~~~~~~~~~~Vl~   95 (124)
T cd05479          16 MLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVKIGNLFLPCSFTVLE   95 (124)
T ss_pred             EEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEEECCEEeeeEEEEEC
Confidence            47999999999999999999999999999999999975444445446777777778899988999999999999999999


Q ss_pred             CCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197         81 EQPMDMLLGLDMLRRHQCCIDLKRNVLRI  109 (144)
Q Consensus        81 ~~~~d~ILG~d~L~~~~~~iD~~~~~l~~  109 (144)
                      ...+|+|||||||+++++.|||++++|++
T Consensus        96 ~~~~d~ILG~d~L~~~~~~ID~~~~~i~~  124 (124)
T cd05479          96 DDDVDFLIGLDMLKRHQCVIDLKENVLRI  124 (124)
T ss_pred             CCCcCEEecHHHHHhCCeEEECCCCEEEC
Confidence            88999999999999999999999999985


No 4  
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The  C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.96  E-value=5.5e-29  Score=169.18  Aligned_cols=101  Identities=23%  Similarity=0.478  Sum_probs=94.8

Q ss_pred             EEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecc-eeeeEeeccce-eEEEEEEEEEEEEEcCEEEeeeEEEecC
Q psy18197          4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDT-RWAGVAKGVGI-QRIIGRIHMVQVVIEKDFLTTSLSILEE   81 (144)
Q Consensus         4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~-~~~~~a~g~g~-~~~~g~i~~~~i~iG~~~~~~~~~V~~~   81 (144)
                      |++++||+|++|+|||||++|+||+.+|+|+||.+..+. ++.+.|.|+|+ .++.|+++.++++||+.+++++|.|++.
T Consensus         1 vnCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~~~~~CSftVld~   80 (103)
T cd05480           1 VSCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQLTVECSAQVVDD   80 (103)
T ss_pred             CceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCCEEeeEEEEEEcC
Confidence            479999999999999999999999999999999877665 77789999997 6999999999999999999999999999


Q ss_pred             CCCceEeeechhheecEEEEcCC
Q psy18197         82 QPMDMLLGLDMLRRHQCCIDLKR  104 (144)
Q Consensus        82 ~~~d~ILG~d~L~~~~~~iD~~~  104 (144)
                      .+.|.+||+|.|++|+|.||+++
T Consensus        81 ~~~d~llGLdmLkrhqc~IdL~k  103 (103)
T cd05480          81 NEKNFSLGLQTLKSLKCVINLEK  103 (103)
T ss_pred             CCcceEeeHHHHhhcceeeeccC
Confidence            89999999999999999999874


No 5  
>PF08284 RVP_2:  Retroviral aspartyl protease;  InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases. 
Probab=99.91  E-value=3.4e-23  Score=149.54  Aligned_cols=109  Identities=21%  Similarity=0.305  Sum_probs=91.0

Q ss_pred             eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeee-cceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLI-DTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~-~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      +...++|+++++.+|+||||++|+||+++|++++++... +.+.. +....+.....+.+..+.++++++.+..+|.|++
T Consensus        22 i~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~-V~~~g~~~~~~~~~~~~~~~i~g~~~~~dl~vl~  100 (135)
T PF08284_consen   22 ITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIV-VSAPGGSINCEGVCPDVPLSIQGHEFVVDLLVLD  100 (135)
T ss_pred             EEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeE-EecccccccccceeeeEEEEECCeEEEeeeEEec
Confidence            345689999999999999999999999999999995432 23332 3222233455666778999999999999999999


Q ss_pred             CCCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         81 EQPMDMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        81 ~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      ..++|+|||||||+++++.|||.+++++|..
T Consensus       101 l~~~DvILGm~WL~~~~~~IDw~~k~v~f~~  131 (135)
T PF08284_consen  101 LGGYDVILGMDWLKKHNPVIDWATKTVTFNS  131 (135)
T ss_pred             ccceeeEeccchHHhCCCEEEccCCEEEEeC
Confidence            9899999999999999999999999999876


No 6  
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.85  E-value=2.6e-20  Score=132.12  Aligned_cols=105  Identities=21%  Similarity=0.228  Sum_probs=80.0

Q ss_pred             eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe-eeEEEec
Q psy18197          2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT-TSLSILE   80 (144)
Q Consensus         2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~-~~~~V~~   80 (144)
                      ++++++|||++++|||||||+.++||+++|++||++..... ......+.++......+...++++|+..+. +++.|++
T Consensus        12 ~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~-~~~~~~ta~G~~~~~~~~l~~l~iG~~~~~nv~~~v~~   90 (121)
T TIGR02281        12 FYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLG-YTVTVSTANGQIKAARVTLDRVAIGGIVVNDVDAMVAE   90 (121)
T ss_pred             EEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCC-ceEEEEeCCCcEEEEEEEeCEEEECCEEEeCcEEEEeC
Confidence            68999999999999999999999999999999999653211 111233333333345566789999998874 8999998


Q ss_pred             CC-CCceEeeechhheec-EEEEcCCCEEEE
Q psy18197         81 EQ-PMDMLLGLDMLRRHQ-CCIDLKRNVLRI  109 (144)
Q Consensus        81 ~~-~~d~ILG~d~L~~~~-~~iD~~~~~l~~  109 (144)
                      .. ..++|||||||++++ +.+|-  ++|.+
T Consensus        91 ~~~~~~~LLGm~fL~~~~~~~~~~--~~l~l  119 (121)
T TIGR02281        91 GGALSESLLGMSFLNRLSRFTVRG--GKLIL  119 (121)
T ss_pred             CCcCCceEcCHHHHhccccEEEEC--CEEEE
Confidence            65 358999999999997 66654  55555


No 7  
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.83  E-value=8e-20  Score=123.00  Aligned_cols=90  Identities=26%  Similarity=0.290  Sum_probs=74.7

Q ss_pred             eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeee-EeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAG-VAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~-~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      .|++++|||++++|||||||++|+||++.++++|+++..+....- .+.| ...+..|.+ .+++++|+..+..+|.|++
T Consensus         1 ~~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G-~~~~~~G~~-~~~v~~~~~~~~~~~~v~~   78 (91)
T cd05484           1 KTVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATG-TKLSVLGQI-LVTVKYGGKTKVLTLYVVK   78 (91)
T ss_pred             CEEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCC-CEeeEeEEE-EEEEEECCEEEEEEEEEEE
Confidence            489999999999999999999999999999999986533332221 2333 346788887 6899999999999999999


Q ss_pred             CCCCceEeeechhh
Q psy18197         81 EQPMDMLLGLDMLR   94 (144)
Q Consensus        81 ~~~~d~ILG~d~L~   94 (144)
                      .. ++.|||+|||.
T Consensus        79 ~~-~~~lLG~~wl~   91 (91)
T cd05484          79 NE-GLNLLGRDWLD   91 (91)
T ss_pred             CC-CCCccChhhcC
Confidence            76 99999999984


No 8  
>PF13650 Asp_protease_2:  Aspartyl protease
Probab=99.78  E-value=4.6e-18  Score=112.42  Aligned_cols=88  Identities=23%  Similarity=0.366  Sum_probs=68.5

Q ss_pred             EEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEE-eeeEEEec-C
Q psy18197          4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFL-TTSLSILE-E   81 (144)
Q Consensus         4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~-~~~~~V~~-~   81 (144)
                      |+++|||++++||+||||+.++|++++|+++|++..... ......+.++..........++++|+..+ ++++.+.+ .
T Consensus         1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~i~ig~~~~~~~~~~v~~~~   79 (90)
T PF13650_consen    1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKS-VPISVSGAGGSVTVYRGRVDSITIGGITLKNVPFLVVDLG   79 (90)
T ss_pred             CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCc-eeEEEEeCCCCEEEEEEEEEEEEECCEEEEeEEEEEECCC
Confidence            589999999999999999999999999999999643222 11134455443333444556999999887 78899998 5


Q ss_pred             CCCceEeeech
Q psy18197         82 QPMDMLLGLDM   92 (144)
Q Consensus        82 ~~~d~ILG~d~   92 (144)
                      ...|+|||+||
T Consensus        80 ~~~~~iLG~df   90 (90)
T PF13650_consen   80 DPIDGILGMDF   90 (90)
T ss_pred             CCCEEEeCCcC
Confidence            68999999998


No 9  
>PF00077 RVP:  Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026;  InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=99.77  E-value=5.7e-18  Score=115.22  Aligned_cols=94  Identities=27%  Similarity=0.468  Sum_probs=77.7

Q ss_pred             eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccce-eEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGI-QRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~-~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      ++++++|||++++|||||||++|+|+++.+.+.+...  ....  ...|+|+ ....|.. ..++++++..+...|.|++
T Consensus         6 p~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~~--~~~~--~v~~~~g~~~~~~~~-~~~v~~~~~~~~~~~~v~~   80 (100)
T PF00077_consen    6 PYITVKINGKKIKALLDTGADVSIISEKDWKKLGPPP--KTSI--TVRGAGGSSSILGST-TVEVKIGGKEFNHTFLVVP   80 (100)
T ss_dssp             SEEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSEE--EEEE--EEEETTEEEEEEEEE-EEEEEETTEEEEEEEEESS
T ss_pred             ceEEEeECCEEEEEEEecCCCcceecccccccccccc--cCCc--eeccCCCcceeeeEE-EEEEEEECccceEEEEecC
Confidence            5899999999999999999999999999998887642  1111  4456654 3566655 5799999999999999999


Q ss_pred             CCCCceEeeechhheecEEEE
Q psy18197         81 EQPMDMLLGLDMLRRHQCCID  101 (144)
Q Consensus        81 ~~~~d~ILG~d~L~~~~~~iD  101 (144)
                      +.+.+ |||+|||+++++.|+
T Consensus        81 ~~~~~-ILG~D~L~~~~~~i~  100 (100)
T PF00077_consen   81 DLPMN-ILGRDFLKKLNAVIN  100 (100)
T ss_dssp             TCSSE-EEEHHHHTTTTCEEE
T ss_pred             CCCCC-EeChhHHHHcCCEEC
Confidence            87778 999999999999875


No 10 
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.74  E-value=3.2e-17  Score=109.42  Aligned_cols=92  Identities=27%  Similarity=0.405  Sum_probs=69.8

Q ss_pred             CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEE-eeeEEEe
Q psy18197          1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFL-TTSLSIL   79 (144)
Q Consensus         1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~-~~~~~V~   79 (144)
                      +++++++|||++++|++||||+.++|++++|+++++........  ...+.++..........++++|+..+ +.++.++
T Consensus         2 ~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~--~~~~~~G~~~~~~~~~~~i~ig~~~~~~~~~~v~   79 (96)
T cd05483           2 HFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKV--TVQTANGRVRAARVRLDSLQIGGITLRNVPAVVL   79 (96)
T ss_pred             cEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcE--EEEecCCCccceEEEcceEEECCcEEeccEEEEe
Confidence            47999999999999999999999999999999999722112222  23333332223333467999999876 4789999


Q ss_pred             cCCC--CceEeeechhh
Q psy18197         80 EEQP--MDMLLGLDMLR   94 (144)
Q Consensus        80 ~~~~--~d~ILG~d~L~   94 (144)
                      +...  .|+|||+|||+
T Consensus        80 d~~~~~~~gIlG~d~l~   96 (96)
T cd05483          80 PGDALGVDGLLGMDFLR   96 (96)
T ss_pred             CCcccCCceEeChHHhC
Confidence            9765  99999999985


No 11 
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=99.72  E-value=1.2e-16  Score=111.07  Aligned_cols=100  Identities=18%  Similarity=0.283  Sum_probs=75.0

Q ss_pred             EEEEEECC------EEEEEEEecCCCcee-cCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeee
Q psy18197          3 FINCCVNG------VPVKAFVDSGAQTTI-MSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTS   75 (144)
Q Consensus         3 ~v~v~InG------~~v~~LvDTGA~~s~-Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~   75 (144)
                      ++++.+.+      .+++|||||||+..+ |++++|+++|+.+....+.. .|+|.   ...-.+...++++|++.....
T Consensus         1 ~~~v~~~~p~~~~~~~v~~LVDTGat~~~~l~~~~a~~lgl~~~~~~~~~-tA~G~---~~~~~v~~~~v~igg~~~~~~   76 (107)
T TIGR03698         1 TLDVELSNPKNPEFMEVRALVDTGFSGFLLVPPDIVNKLGLPELDQRRVY-LADGR---EVLTDVAKASIIINGLEIDAF   76 (107)
T ss_pred             CEEEEEeCCCCCCceEEEEEEECCCCeEEecCHHHHHHcCCCcccCcEEE-ecCCc---EEEEEEEEEEEEECCEEEEEE
Confidence            35667744      389999999999997 99999999999764333322 55552   223345578999999887555


Q ss_pred             EEEecCCCCceEeeechhheecEEEEcCCCEE
Q psy18197         76 LSILEEQPMDMLLGLDMLRRHQCCIDLKRNVL  107 (144)
Q Consensus        76 ~~V~~~~~~d~ILG~d~L~~~~~~iD~~~~~l  107 (144)
                      +.+.+..+ +.||||.||++++..|||++++|
T Consensus        77 v~~~~~~~-~~LLG~~~L~~l~l~id~~~~~~  107 (107)
T TIGR03698        77 VESLGYVD-EPLLGTELLEGLGIVIDYRNQGL  107 (107)
T ss_pred             EEecCCCC-ccEecHHHHhhCCEEEehhhCcC
Confidence            45445434 89999999999999999999864


No 12 
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where 
Probab=99.71  E-value=1.1e-16  Score=106.88  Aligned_cols=83  Identities=24%  Similarity=0.381  Sum_probs=63.3

Q ss_pred             EEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeE--EEEEEEEE-EEEEcCEEEeeeEEEec
Q psy18197          4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQR--IIGRIHMV-QVVIEKDFLTTSLSILE   80 (144)
Q Consensus         4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~--~~g~i~~~-~i~iG~~~~~~~~~V~~   80 (144)
                      ++++|||+|++|||||||+.|+|+++.|+++..    ..... ...|+|+..  ....  .. .+++|+++....+.+.+
T Consensus         1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~~~----~~~~~-~v~gagG~~~~~v~~--~~~~v~vg~~~~~~~~~v~~   73 (86)
T cd06095           1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQEL----STTSV-LIRGVSGQSQQPVTT--YRTLVDLGGHTVSHSFLVVP   73 (86)
T ss_pred             CEEEECCEEEEEEEECCCCeEEECHHHhhhccC----CCCcE-EEEeCCCcccccEEE--eeeEEEECCEEEEEEEEEEc
Confidence            478999999999999999999999999999822    11222 456666543  2222  23 59999999888888887


Q ss_pred             CCCCceEeeechhh
Q psy18197         81 EQPMDMLLGLDMLR   94 (144)
Q Consensus        81 ~~~~d~ILG~d~L~   94 (144)
                      .. .+.|||||||.
T Consensus        74 ~~-~~~lLG~dfL~   86 (86)
T cd06095          74 NC-PDPLLGRDLLS   86 (86)
T ss_pred             CC-CCcEechhhcC
Confidence            53 58999999985


No 13 
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.61  E-value=4.1e-15  Score=113.08  Aligned_cols=104  Identities=24%  Similarity=0.291  Sum_probs=78.8

Q ss_pred             eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceee-ecceee-eEeeccceeEEEEEEEEEEEEEcCEEE-eeeEEE
Q psy18197          2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRL-IDTRWA-GVAKGVGIQRIIGRIHMVQVVIEKDFL-TTSLSI   78 (144)
Q Consensus         2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~-~~~~~~-~~a~g~g~~~~~g~i~~~~i~iG~~~~-~~~~~V   78 (144)
                      ++++.+|||++++|||||||+...++++.|+|+|+... .+.++. .+|+|   ......++..+++||+.++ ++++.|
T Consensus       106 F~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG---~~~AA~V~Ld~v~IG~I~~~nV~A~V  182 (215)
T COG3577         106 FEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANG---RARAAPVTLDRVQIGGIRVKNVDAMV  182 (215)
T ss_pred             EEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCC---ccccceEEeeeEEEccEEEcCchhhe
Confidence            57889999999999999999999999999999999542 122221 24555   2334467789999999765 689999


Q ss_pred             ecCC-CCceEeeechhheecEEEEcCCCEEEE
Q psy18197         79 LEEQ-PMDMLLGLDMLRRHQCCIDLKRNVLRI  109 (144)
Q Consensus        79 ~~~~-~~d~ILG~d~L~~~~~~iD~~~~~l~~  109 (144)
                      +++. -...||||+||++++. +..++..|.+
T Consensus       183 ~~~g~L~~sLLGMSfL~rL~~-fq~~g~~LiL  213 (215)
T COG3577         183 AEDGALDESLLGMSFLNRLSG-FQVDGDLLIL  213 (215)
T ss_pred             ecCCccchhhhhHHHHhhccc-eEecCcEEEe
Confidence            9765 4569999999999973 3345555554


No 14 
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=99.48  E-value=1.1e-12  Score=83.00  Aligned_cols=90  Identities=31%  Similarity=0.445  Sum_probs=66.7

Q ss_pred             EEEEECCEEEEEEEecCCCceecCHHHHHHcCc-eeeecceee-eEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecC
Q psy18197          4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAI-MRLIDTRWA-GVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEE   81 (144)
Q Consensus         4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl-~~~~~~~~~-~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~   81 (144)
                      +++.+||+++++|+||||+.++++.+.+++++. ....+.... ..+.|. .....+......+.+++..+...+.+.+.
T Consensus         1 ~~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~   79 (92)
T cd00303           1 LKGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLKVKGANGS-SVKTLGVILPVTIGIGGKTFTVDFYVLDL   79 (92)
T ss_pred             CEEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceEEEecCCC-EeccCcEEEEEEEEeCCEEEEEEEEEEcC
Confidence            478899999999999999999999999999987 222222221 122221 12233333457888999888999999998


Q ss_pred             CCCceEeeechhh
Q psy18197         82 QPMDMLLGLDMLR   94 (144)
Q Consensus        82 ~~~d~ILG~d~L~   94 (144)
                      ..+|.|||+|||+
T Consensus        80 ~~~~~ilG~~~l~   92 (92)
T cd00303          80 LSYDVILGRPWLE   92 (92)
T ss_pred             CCcCEEecccccC
Confidence            8999999999984


No 15 
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=99.44  E-value=1.7e-12  Score=88.11  Aligned_cols=86  Identities=17%  Similarity=0.180  Sum_probs=68.1

Q ss_pred             EEEECC-EEEEEEEecCCCceecCHHHHHHcC---ceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          5 NCCVNG-VPVKAFVDSGAQTTIMSAECAERCA---IMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         5 ~v~InG-~~v~~LvDTGA~~s~Is~~~a~~lg---l~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      +.+||| ++++|++||||+.|+||.+.+++++   ..+..++...-.+.+-+.....|.+ .+.+++++..++++|.|++
T Consensus         2 ~~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~~~G~~-~~~v~~~~~~~~~~f~Vvd   80 (93)
T cd05481           2 DMKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIPVEGGV-KLKCRYRNPKYNLTFQVVK   80 (93)
T ss_pred             ceEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEeeeeEEE-EEEEEECCcEEEEEEEEEC
Confidence            568999 9999999999999999999999998   4444445543344443346788874 5899999999999999999


Q ss_pred             CCCCceEeeech
Q psy18197         81 EQPMDMLLGLDM   92 (144)
Q Consensus        81 ~~~~d~ILG~d~   92 (144)
                      . +...|||.+.
T Consensus        81 ~-~~~~lLG~~~   91 (93)
T cd05481          81 E-EGPPLLGAKA   91 (93)
T ss_pred             C-CCCceEcccc
Confidence            5 3568899864


No 16 
>PF12384 Peptidase_A2B:  Ty3 transposon peptidase;  InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=99.31  E-value=4.6e-11  Score=88.25  Aligned_cols=93  Identities=15%  Similarity=0.309  Sum_probs=74.1

Q ss_pred             EEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccce--eEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197          3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGI--QRIIGRIHMVQVVIEKDFLTTSLSILE   80 (144)
Q Consensus         3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~--~~~~g~i~~~~i~iG~~~~~~~~~V~~   80 (144)
                      .++..++|.+++.|+||||-.|+|+++++++|+|+......+  ...|+.+  ....-....+++.+++..+...++|.+
T Consensus        36 ~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~--~fRG~vs~~~~~tsEAv~ld~~i~n~~i~i~aYV~d  113 (177)
T PF12384_consen   36 IVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPF--RFRGFVSGESATTSEAVTLDFYIDNKLIDIAAYVTD  113 (177)
T ss_pred             EEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCE--EEeeeccCCceEEEEeEEEEEEECCeEEEEEEEEec
Confidence            356788999999999999999999999999999976533332  3334321  222223445789999999999999999


Q ss_pred             CCCCceEeeechhheec
Q psy18197         81 EQPMDMLLGLDMLRRHQ   97 (144)
Q Consensus        81 ~~~~d~ILG~d~L~~~~   97 (144)
                      ..+.|+|+|.++|+++.
T Consensus       114 ~m~~dlIIGnPiL~ryp  130 (177)
T PF12384_consen  114 NMDHDLIIGNPILDRYP  130 (177)
T ss_pred             cCCcceEeccHHHhhhH
Confidence            99999999999999885


No 17 
>PF13975 gag-asp_proteas:  gag-polyprotein putative aspartyl protease
Probab=99.26  E-value=3.1e-11  Score=78.11  Aligned_cols=41  Identities=29%  Similarity=0.470  Sum_probs=37.1

Q ss_pred             eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecc
Q psy18197          2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDT   42 (144)
Q Consensus         2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~   42 (144)
                      +++++.|||+++.+|+||||++||||.++|+|||++.....
T Consensus         9 ~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~   49 (72)
T PF13975_consen    9 MYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPP   49 (72)
T ss_pred             EEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCC
Confidence            68999999999999999999999999999999999654433


No 18 
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.13  E-value=6.7e-10  Score=78.48  Aligned_cols=94  Identities=15%  Similarity=0.228  Sum_probs=72.3

Q ss_pred             EEEEEecCCC-ceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCCceEeeec
Q psy18197         13 VKAFVDSGAQ-TTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPMDMLLGLD   91 (144)
Q Consensus        13 v~~LvDTGA~-~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~ILG~d   91 (144)
                      ..+|||||++ ..++++++|+++|+++....+.. .+.|  + .....++...+++++.+......+.+..+.+ +||++
T Consensus        27 ~~~LiDTGFtg~lvlp~~vaek~~~~~~~~~~~~-~a~~--~-~v~t~V~~~~iki~g~e~~~~Vl~s~~~~~~-liG~~  101 (125)
T COG5550          27 YDELIDTGFTGYLVLPPQVAEKLGLPLFSTIRIV-LADG--G-VVKTSVALATIKIDGVEKVAFVLASDNLPEP-LIGVN  101 (125)
T ss_pred             eeeEEecCCceeEEeCHHHHHhcCCCccCChhhh-hhcC--C-EEEEEEEEEEEEECCEEEEEEEEccCCCccc-chhhh
Confidence            4559999999 89999999999999765433433 2222  2 3344567789999998877666666666666 99999


Q ss_pred             hhheecEEEEcCCCEEEEcc
Q psy18197         92 MLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        92 ~L~~~~~~iD~~~~~l~~~~  111 (144)
                      ||+.++..+|+.+++|+-+.
T Consensus       102 ~lk~l~~~vn~~~g~LEk~~  121 (125)
T COG5550         102 LLKLLGLVVNPKTGKLEKPP  121 (125)
T ss_pred             hhhhccEEEcCCcceEeccc
Confidence            99999999999999998543


No 19 
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=99.06  E-value=6.8e-10  Score=74.55  Aligned_cols=79  Identities=14%  Similarity=0.182  Sum_probs=55.9

Q ss_pred             EEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE-EEeeeEEEecCCCCceEeee
Q psy18197         12 PVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD-FLTTSLSILEEQPMDMLLGL   90 (144)
Q Consensus        12 ~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~-~~~~~~~V~~~~~~d~ILG~   90 (144)
                      .++|||||||++|+|+....++. .+   +....-.|..-...++.|.. ...+.+|.. .+.-+|.|.+-  ...|||.
T Consensus         9 ~~~fLVDTGA~vSviP~~~~~~~-~~---~~~~~l~AANgt~I~tyG~~-~l~ldlGlrr~~~w~FvvAdv--~~pIlGa   81 (89)
T cd06094           9 GLRFLVDTGAAVSVLPASSTKKS-LK---PSPLTLQAANGTPIATYGTR-SLTLDLGLRRPFAWNFVVADV--PHPILGA   81 (89)
T ss_pred             CcEEEEeCCCceEeecccccccc-cc---CCceEEEeCCCCeEeeeeeE-EEEEEcCCCcEEeEEEEEcCC--CcceecH
Confidence            47999999999999999987763 21   22222122221235788854 578999985 67778888663  3489999


Q ss_pred             chhheec
Q psy18197         91 DMLRRHQ   97 (144)
Q Consensus        91 d~L~~~~   97 (144)
                      |||++++
T Consensus        82 DfL~~~~   88 (89)
T cd06094          82 DFLQHYG   88 (89)
T ss_pred             HHHHHcC
Confidence            9999886


No 20 
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=99.04  E-value=2.3e-09  Score=71.99  Aligned_cols=86  Identities=15%  Similarity=0.142  Sum_probs=60.1

Q ss_pred             EEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCC
Q psy18197          5 NCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPM   84 (144)
Q Consensus         5 ~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~   84 (144)
                      .++|+|+++++|+||||+.|+|+...+.+---.  ...+.  ...|+|+....-+.....+++.+......+.|.+..-.
T Consensus         2 ~~~i~g~~~~~llDTGAd~Tvi~~~~~p~~w~~--~~~~~--~i~GIGG~~~~~~~~~v~i~i~~~~~~g~vlv~~~~~P   77 (87)
T cd05482           2 TLYINGKLFEGLLDTGADVSIIAENDWPKNWPI--QPAPS--NLTGIGGAITPSQSSVLLLEIDGEGHLGTILVYVLSLP   77 (87)
T ss_pred             EEEECCEEEEEEEccCCCCeEEcccccCCCCcc--CCCCe--EEEeccceEEEEEEeeEEEEEcCCeEEEEEEEccCCCc
Confidence            689999999999999999999998665443211  11111  45677654333334467899998887778888775234


Q ss_pred             ceEeeechhh
Q psy18197         85 DMLLGLDMLR   94 (144)
Q Consensus        85 d~ILG~d~L~   94 (144)
                      .-|||.|.|.
T Consensus        78 ~nllGRd~L~   87 (87)
T cd05482          78 VNLWGRDILS   87 (87)
T ss_pred             ccEEccccCC
Confidence            4799999874


No 21 
>PF02160 Peptidase_A3:  Cauliflower mosaic virus peptidase (A3);  InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=98.92  E-value=1.3e-08  Score=77.75  Aligned_cols=103  Identities=19%  Similarity=0.302  Sum_probs=71.5

Q ss_pred             eEEEEEE--CC---EEEEEEEecCCCceecCHHH-----HHHcCceeeecceeeeEeeccc-e-eEEEEEEEEEEEEEcC
Q psy18197          2 LFINCCV--NG---VPVKAFVDSGAQTTIMSAEC-----AERCAIMRLIDTRWAGVAKGVG-I-QRIIGRIHMVQVVIEK   69 (144)
Q Consensus         2 ~~v~v~I--nG---~~v~~LvDTGA~~s~Is~~~-----a~~lgl~~~~~~~~~~~a~g~g-~-~~~~g~i~~~~i~iG~   69 (144)
                      .|+.+++  .|   ..+.++|||||+..+++...     |+++.-      +.  ...|+. . ......+....+.|++
T Consensus         5 iyI~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~we~~~~------~i--~v~~an~~~~~i~~~~~~~~i~I~~   76 (201)
T PF02160_consen    5 IYIKVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEYWEKSKK------PI--KVKGANGSIIQINKKAKNGKIQIAD   76 (201)
T ss_pred             EEEEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHHHHhCCC------cE--EEEEecCCceEEEEEecCceEEEcc
Confidence            4566544  55   24799999999998876544     444332      11  223332 2 4555567778999999


Q ss_pred             EEEeeeEEEecCCCCceEeeechhheecEEEEcCCCEEEEccCC
Q psy18197         70 DFLTTSLSILEEQPMDMLLGLDMLRRHQCCIDLKRNVLRIGTTG  113 (144)
Q Consensus        70 ~~~~~~~~V~~~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~~~  113 (144)
                      ..|..+..-.-..+.|+|||++|++.++.-+.+. .++.|+..+
T Consensus        77 ~~F~IP~iYq~~~g~d~IlG~NF~r~y~Pfiq~~-~~I~f~~~~  119 (201)
T PF02160_consen   77 KIFRIPTIYQQESGIDIILGNNFLRLYEPFIQTE-DRIQFHKKG  119 (201)
T ss_pred             EEEeccEEEEecCCCCEEecchHHHhcCCcEEEc-cEEEEEeCC
Confidence            9888775555446899999999999999888886 467776644


No 22 
>PF05585 DUF1758:  Putative peptidase (DUF1758);  InterPro: IPR008737  This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases. 
Probab=98.69  E-value=9.6e-08  Score=70.50  Aligned_cols=29  Identities=24%  Similarity=0.421  Sum_probs=26.3

Q ss_pred             EEEEEEEecCCCceecCHHHHHHcCceee
Q psy18197         11 VPVKAFVDSGAQTTIMSAECAERCAIMRL   39 (144)
Q Consensus        11 ~~v~~LvDTGA~~s~Is~~~a~~lgl~~~   39 (144)
                      +++++|+||||+.|+|++++|++|+|+..
T Consensus        11 ~~~~~LlDsGSq~SfIt~~la~~L~L~~~   39 (164)
T PF05585_consen   11 VEARALLDSGSQRSFITESLANKLNLPGT   39 (164)
T ss_pred             EEEEEEEecCCchhHHhHHHHHHhCCCCC
Confidence            46899999999999999999999999653


No 23 
>PF12382 Peptidase_A2E:  Retrotransposon peptidase;  InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=97.67  E-value=0.00034  Score=48.24  Aligned_cols=85  Identities=16%  Similarity=0.272  Sum_probs=58.7

Q ss_pred             EEEEEecCCCceecCHHHHHHcCceeeecceee-eEeec-cceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCCceEeee
Q psy18197         13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWA-GVAKG-VGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPMDMLLGL   90 (144)
Q Consensus        13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~-~~a~g-~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~ILG~   90 (144)
                      +-+|+||||+.++|..+.++...++..   ++. .+.+| +=..++.-......|.+.|..+...|.|+...+.+.-+-.
T Consensus        48 ipclidtgaq~niiteetvrahklptr---pw~~sviyggvyp~kinrkt~kl~i~lngisikteflvvkkfshpaaisf  124 (137)
T PF12382_consen   48 IPCLIDTGAQVNIITEETVRAHKLPTR---PWSQSVIYGGVYPNKINRKTIKLNINLNGISIKTEFLVVKKFSHPAAISF  124 (137)
T ss_pred             ceeEEccCceeeeeehhhhhhccCCCC---cchhheEeccccccccccceEEEEEEecceEEEEEEEEEEeccCcceEEE
Confidence            568999999999999999999888432   111 13333 2123333334456888999999999999987666676666


Q ss_pred             chhheecEEE
Q psy18197         91 DMLRRHQCCI  100 (144)
Q Consensus        91 d~L~~~~~~i  100 (144)
                      ..|...++.|
T Consensus       125 ttlydnniei  134 (137)
T PF12382_consen  125 TTLYDNNIEI  134 (137)
T ss_pred             EEEeeCcEEe
Confidence            6666655544


No 24 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=97.29  E-value=0.0037  Score=49.48  Aligned_cols=87  Identities=20%  Similarity=0.396  Sum_probs=56.8

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHc---C---c-eee-------ecceeeeEeeccceeEEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERC---A---I-MRL-------IDTRWAGVAKGVGIQRIIGRIHMVQV   65 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~l---g---l-~~~-------~~~~~~~~a~g~g~~~~~g~i~~~~i   65 (144)
                      +++++.|+  ++++++++|||++...+....+...   .   . ...       ...... ...|.| . ..|......+
T Consensus         2 Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~~~~~~-~~y~~g-~-~~G~~~~D~v   78 (317)
T PF00026_consen    2 YYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQGKPFS-ISYGDG-S-VSGNLVSDTV   78 (317)
T ss_dssp             EEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEEEEEEE-EEETTE-E-EEEEEEEEEE
T ss_pred             eEEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccceeeee-eeccCc-c-cccccccceE
Confidence            57899997  8999999999999998875433332   1   1 000       011111 233332 2 7788888999


Q ss_pred             EEcCEEEe-eeEEEecC--------CCCceEeeec
Q psy18197         66 VIEKDFLT-TSLSILEE--------QPMDMLLGLD   91 (144)
Q Consensus        66 ~iG~~~~~-~~~~V~~~--------~~~d~ILG~d   91 (144)
                      ++|+.... ..|..+..        ...|++||+-
T Consensus        79 ~ig~~~~~~~~f~~~~~~~~~~~~~~~~~GilGLg  113 (317)
T PF00026_consen   79 SIGGLTIPNQTFGLADSYSGDPFSPIPFDGILGLG  113 (317)
T ss_dssp             EETTEEEEEEEEEEEEEEESHHHHHSSSSEEEE-S
T ss_pred             eeeeccccccceecccccccccccccccccccccc
Confidence            99998765 56666653        2579999998


No 25 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=97.23  E-value=0.0031  Score=53.89  Aligned_cols=89  Identities=16%  Similarity=0.195  Sum_probs=58.7

Q ss_pred             CeEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCcee--ee-----------cceeeeEeeccceeEEEEEEEEEEE
Q psy18197          1 MLFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMR--LI-----------DTRWAGVAKGVGIQRIIGRIHMVQV   65 (144)
Q Consensus         1 ~~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~--~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i   65 (144)
                      ++|.+++|+  ++++++++||||+..-++..-|...++..  .-           ...+. ...|.|  ...|.+....+
T Consensus       138 ~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~~~~yd~s~SsT~~~~~~~~~-i~YG~G--sv~G~~~~Dtv  214 (450)
T PTZ00013        138 MFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSIKNLYDSSKSKSYEKDGTKVD-ITYGSG--TVKGFFSKDLV  214 (450)
T ss_pred             EEEEEEEECCCCeEEEEEEeCCCCceEEecccCCccccccCCCccCccCcccccCCcEEE-EEECCc--eEEEEEEEEEE
Confidence            367888996  79999999999999999765444322210  00           01122 445544  36788888999


Q ss_pred             EEcCEEEeeeEEEecC----------CCCceEeeech
Q psy18197         66 VIEKDFLTTSLSILEE----------QPMDMLLGLDM   92 (144)
Q Consensus        66 ~iG~~~~~~~~~V~~~----------~~~d~ILG~d~   92 (144)
                      .+|+...+..|..+..          ...|+|||+.+
T Consensus       215 ~iG~~~~~~~f~~~~~~~~~~~~~~~~~~dGIlGLg~  251 (450)
T PTZ00013        215 TLGHLSMPYKFIEVTDTDDLEPIYSSSEFDGILGLGW  251 (450)
T ss_pred             EECCEEEccEEEEEEeccccccceecccccceecccC
Confidence            9999876655544421          24699999975


No 26 
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site 
Probab=97.07  E-value=0.0061  Score=41.14  Aligned_cols=84  Identities=17%  Similarity=0.256  Sum_probs=51.5

Q ss_pred             EEEEECC--EEEEEEEecCCCceecCHHHHHHcCcee---e-----------ecceeeeEeeccceeEEEEEEEEEEEEE
Q psy18197          4 INCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMR---L-----------IDTRWAGVAKGVGIQRIIGRIHMVQVVI   67 (144)
Q Consensus         4 v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~---~-----------~~~~~~~~a~g~g~~~~~g~i~~~~i~i   67 (144)
                      +++.|+.  +++.+++|||++...+..+-+..++...   .           ....+. ..++-|.  ..|.+....+.+
T Consensus         1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~~~~~~~~~~sst~~~~~~~~~-~~Y~~g~--~~g~~~~D~v~i   77 (109)
T cd05470           1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYSHSSYDDPSASSTYSDNGCTFS-ITYGTGS--LSGGLSTDTVSI   77 (109)
T ss_pred             CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCcccccccccCCcCCCCCCCCCCcEEE-EEeCCCe--EEEEEEEEEEEE
Confidence            4678876  8999999999999988776554443210   0           011111 3344332  345566789999


Q ss_pred             cCEEE-eeeEEEecC--------CCCceEeee
Q psy18197         68 EKDFL-TTSLSILEE--------QPMDMLLGL   90 (144)
Q Consensus        68 G~~~~-~~~~~V~~~--------~~~d~ILG~   90 (144)
                      |+..+ ...|.++..        ...|+|||+
T Consensus        78 g~~~~~~~~fg~~~~~~~~~~~~~~~~GilGL  109 (109)
T cd05470          78 GDIEVVGQAFGCATDEPGATFLPALFDGILGL  109 (109)
T ss_pred             CCEEECCEEEEEEEecCCccccccccccccCC
Confidence            98764 345555542        246788885


No 27 
>PTZ00147 plasmepsin-1; Provisional
Probab=97.07  E-value=0.0063  Score=52.02  Aligned_cols=89  Identities=15%  Similarity=0.143  Sum_probs=58.4

Q ss_pred             CeEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCce--eee-----------cceeeeEeeccceeEEEEEEEEEEE
Q psy18197          1 MLFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIM--RLI-----------DTRWAGVAKGVGIQRIIGRIHMVQV   65 (144)
Q Consensus         1 ~~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~--~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i   65 (144)
                      ++|.+++|+  ++++.+++|||++..-++..-|...++.  ..-           ...+. ...|.|  ...|.+..+.+
T Consensus       139 ~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~~~~yd~s~SsT~~~~~~~f~-i~Yg~G--svsG~~~~DtV  215 (453)
T PTZ00147        139 MSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCETKNLYDSSKSKTYEKDGTKVE-MNYVSG--TVSGFFSKDLV  215 (453)
T ss_pred             EEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccCCCccCCccCcceEECCCEEE-EEeCCC--CEEEEEEEEEE
Confidence            367889997  7999999999999999875544322221  000           11121 345544  35788888999


Q ss_pred             EEcCEEEeeeEEEecC----------CCCceEeeech
Q psy18197         66 VIEKDFLTTSLSILEE----------QPMDMLLGLDM   92 (144)
Q Consensus        66 ~iG~~~~~~~~~V~~~----------~~~d~ILG~d~   92 (144)
                      ++|+..++..|..+..          ...|+|||+-|
T Consensus       216 tiG~~~v~~qF~~~~~~~~f~~~~~~~~~DGILGLG~  252 (453)
T PTZ00147        216 TIGNLSVPYKFIEVTDTNGFEPFYTESDFDGIFGLGW  252 (453)
T ss_pred             EECCEEEEEEEEEEEeccCcccccccccccceecccC
Confidence            9999876655543321          24799999976


No 28 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=96.90  E-value=0.0072  Score=47.39  Aligned_cols=83  Identities=18%  Similarity=0.193  Sum_probs=47.7

Q ss_pred             EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEE-cCEEEeeeEEEecC-CCCceEeeec
Q psy18197         14 KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVI-EKDFLTTSLSILEE-QPMDMLLGLD   91 (144)
Q Consensus        14 ~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~i-G~~~~~~~~~V~~~-~~~d~ILG~d   91 (144)
                      .+++|||++.+.++++..      +  +-.+  .-.+ +..  . .+.....-+ ......| +...+. .....|||..
T Consensus       178 ~ai~DTGTs~~~lp~~~~------P--~i~~--~f~~-~~~--~-~i~~~~y~~~~~~~~~C-~~~~~~~~~~~~ilG~~  242 (265)
T cd05476         178 GTIIDSGTTLTYLPDPAY------P--DLTL--HFDG-GAD--L-ELPPENYFVDVGEGVVC-LAILSSSSGGVSILGNI  242 (265)
T ss_pred             cEEEeCCCcceEcCcccc------C--CEEE--EECC-CCE--E-EeCcccEEEECCCCCEE-EEEecCCCCCcEEEChh
Confidence            489999999999999887      1  1111  1111 110  0 000001101 1111122 122332 3445899999


Q ss_pred             hhheecEEEEcCCCEEEEcc
Q psy18197         92 MLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        92 ~L~~~~~~iD~~~~~l~~~~  111 (144)
                      ||+.+-...|++++++-+..
T Consensus       243 fl~~~~~vFD~~~~~iGfa~  262 (265)
T cd05476         243 QQQNFLVEYDLENSRLGFAP  262 (265)
T ss_pred             hcccEEEEEECCCCEEeeec
Confidence            99999999999999998755


No 29 
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=96.86  E-value=0.0037  Score=45.99  Aligned_cols=100  Identities=22%  Similarity=0.286  Sum_probs=65.8

Q ss_pred             EEECCE--EEEEEEecCCCceecCHHHHHHcCc---ee----ee------------cceee---eEeeccce-eEEEEEE
Q psy18197          6 CCVNGV--PVKAFVDSGAQTTIMSAECAERCAI---MR----LI------------DTRWA---GVAKGVGI-QRIIGRI   60 (144)
Q Consensus         6 v~InG~--~v~~LvDTGA~~s~Is~~~a~~lgl---~~----~~------------~~~~~---~~a~g~g~-~~~~g~i   60 (144)
                      +.++|-  .++|=+||||..|.++....++..-   ..    ..            +.++.   .+-.+-+. ..... +
T Consensus        31 ~~l~~l~~~~kAkiDTGA~TSsL~A~dI~~fkRdGe~WVRF~~~~~~~~~~~~~~~e~pvi~~ikvR~s~~~~~e~Rp-V  109 (162)
T COG4067          31 VSLPGLKIQLKAKIDTGAVTSSLSASDIERFKRDGERWVRFRLADTDNLDQRSEECEAPVIRKIKVRSSSGSRAERRP-V  109 (162)
T ss_pred             EEcCcccceeeeeecccceeeeEEeecceeeeeCCceEEEEEeecccCccccceeeccceEEEEEEecCCCCcccccc-E
Confidence            556664  5899999999999998766554422   10    00            00000   01111111 12222 3


Q ss_pred             EEEEEEEcCEEEeeeEEEecC--CCCceEeeechhheecEEEEcCCCE
Q psy18197         61 HMVQVVIEKDFLTTSLSILEE--QPMDMLLGLDMLRRHQCCIDLKRNV  106 (144)
Q Consensus        61 ~~~~i~iG~~~~~~~~~V~~~--~~~d~ILG~d~L~~~~~~iD~~~~~  106 (144)
                      ...++++|+...+..|...+.  ..|++|||..+|+++.+.+|++..-
T Consensus       110 V~~~l~lG~~~~~~E~tLtDR~~m~Yp~LlGrk~l~~~~~~VDpSr~y  157 (162)
T COG4067         110 VRLTLCLGGRILPIEFTLTDRSNMRYPVLLGRKALRHFGAVVDPSRKY  157 (162)
T ss_pred             EEEEEeeCCeeeeEEEEeecccccccceEecHHHHhhCCeEECchhhc
Confidence            347899999999999999985  3699999999999999999987653


No 30 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=96.83  E-value=0.018  Score=45.39  Aligned_cols=73  Identities=15%  Similarity=0.212  Sum_probs=53.2

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe-eeEEE
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT-TSLSI   78 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~-~~~~V   78 (144)
                      +++++.|+.  +++.+++|||++...+.               .+. ...+-| ....|.+....+.+|+..++ ..|..
T Consensus         3 Y~~~i~iGtp~q~~~v~~DTgS~~~wv~---------------~~~-~~Y~~g-~~~~G~~~~D~v~~g~~~~~~~~fg~   65 (295)
T cd05474           3 YSAELSVGTPPQKVTVLLDTGSSDLWVP---------------DFS-ISYGDG-TSASGTWGTDTVSIGGATVKNLQFAV   65 (295)
T ss_pred             EEEEEEECCCCcEEEEEEeCCCCcceee---------------eeE-EEeccC-CcEEEEEEEEEEEECCeEecceEEEE
Confidence            678899988  88999999999999988               122 233322 35677777889999987653 45666


Q ss_pred             ecC-CCCceEeeec
Q psy18197         79 LEE-QPMDMLLGLD   91 (144)
Q Consensus        79 ~~~-~~~d~ILG~d   91 (144)
                      +.. ...|+|||+-
T Consensus        66 ~~~~~~~~GilGLg   79 (295)
T cd05474          66 ANSTSSDVGVLGIG   79 (295)
T ss_pred             EecCCCCcceeeEC
Confidence            653 4678999966


No 31 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=96.75  E-value=0.014  Score=45.93  Aligned_cols=89  Identities=15%  Similarity=0.176  Sum_probs=56.9

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCce------eee-------c-ceeeeEeeccceeEEEEEEEEEEE
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIM------RLI-------D-TRWAGVAKGVGIQRIIGRIHMVQV   65 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~------~~~-------~-~~~~~~a~g~g~~~~~g~i~~~~i   65 (144)
                      ++++++|+.  +++.+++|||++...+...-+..+...      +..       + ..+. ...|.| ....|.+....+
T Consensus         1 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~~~~~~~-i~Y~~G-~~~~G~~~~D~v   78 (278)
T cd06097           1 YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLLPGATWS-ISYGDG-SSASGIVYTDTV   78 (278)
T ss_pred             CeeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceecCCcEEE-EEeCCC-CeEEEEEEEEEE
Confidence            478899988  899999999999999876544433221      000       0 1111 334433 236777778899


Q ss_pred             EEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197         66 VIEKDFLT-TSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        66 ~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      .+|+..++ ..|.++..        ...|+|||+-+
T Consensus        79 ~ig~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~  114 (278)
T cd06097          79 SIGGVEVPNQAIELATAVSASFFSDTASDGLLGLAF  114 (278)
T ss_pred             EECCEEECCeEEEEEeecCccccccccccceeeecc
Confidence            99997653 34544432        35799999964


No 32 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=96.46  E-value=0.038  Score=44.39  Aligned_cols=88  Identities=16%  Similarity=0.205  Sum_probs=55.4

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCcee--ee-----------cceeeeEeeccceeEEEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMR--LI-----------DTRWAGVAKGVGIQRIIGRIHMVQVV   66 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~--~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i~   66 (144)
                      +|+++.|+  .+++.+++|||++...+...-+.......  ..           ...+. ...|.|.  ..|.+....++
T Consensus        11 Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~~~~~f~~~~Sst~~~~~~~~~-~~yg~gs--~~G~~~~D~v~   87 (317)
T cd05478          11 YYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACSNHNRFNPRQSSTYQSTGQPLS-IQYGTGS--MTGILGYDTVQ   87 (317)
T ss_pred             EEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCcccccccCcCCCCCCcceeeCCcEEE-EEECCce--EEEEEeeeEEE
Confidence            57889997  58899999999999999754433222210  00           11111 3344333  46777788999


Q ss_pred             EcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197         67 IEKDFLT-TSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        67 iG~~~~~-~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      +|+...+ ..|.....        ...|+|||+.+
T Consensus        88 ig~~~i~~~~fg~~~~~~~~~~~~~~~dGilGLg~  122 (317)
T cd05478          88 VGGISDTNQIFGLSETEPGSFFYYAPFDGILGLAY  122 (317)
T ss_pred             ECCEEECCEEEEEEEecCccccccccccceeeecc
Confidence            9997653 34544432        13689999864


No 33 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=96.36  E-value=0.027  Score=44.16  Aligned_cols=76  Identities=14%  Similarity=0.214  Sum_probs=53.4

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE--EE-eeeE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD--FL-TTSL   76 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~--~~-~~~~   76 (144)
                      ++++++|+  .+++.+++|||++..-+..       .      .+. ...+- +....|.+....+.+++.  .. +..|
T Consensus         2 Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~-------~------~~~-~~Y~d-g~~~~G~~~~D~v~~g~~~~~~~~~~F   66 (265)
T cd05476           2 YLVTLSIGTPPQPFSLIVDTGSDLTWTQC-------C------SYE-YSYGD-GSSTSGVLATETFTFGDSSVSVPNVAF   66 (265)
T ss_pred             eEEEEecCCCCcceEEEecCCCCCEEEcC-------C------ceE-eEeCC-CceeeeeEEEEEEEecCCCCccCCEEE
Confidence            57889998  6889999999999988864       1      121 23332 235677777789999987  44 3556


Q ss_pred             EEecCC------CCceEeeech
Q psy18197         77 SILEEQ------PMDMLLGLDM   92 (144)
Q Consensus        77 ~V~~~~------~~d~ILG~d~   92 (144)
                      .++...      ..|+|||+.+
T Consensus        67 g~~~~~~~~~~~~~~GIlGLg~   88 (265)
T cd05476          67 GCGTDNEGGSFGGADGILGLGR   88 (265)
T ss_pred             EecccccCCccCCCCEEEECCC
Confidence            666532      4899999875


No 34 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=96.19  E-value=0.089  Score=42.47  Aligned_cols=88  Identities=15%  Similarity=0.231  Sum_probs=55.1

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHHH----cCceeee-----------cceeeeEeeccceeEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAER----CAIMRLI-----------DTRWAGVAKGVGIQRIIGRIHMVQ   64 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~----lgl~~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~   64 (144)
                      ++++++|+  ++++++++|||++..-+...-+..    |.....-           ...+. ...|.|  ...|.+....
T Consensus         9 y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~SsT~~~~~~~~~-~~Yg~g--~~~G~~~~D~   85 (326)
T cd05487           9 YYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTACVTHNLYDASDSSTYKENGTEFT-IHYASG--TVKGFLSQDI   85 (326)
T ss_pred             EEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchhhcccCcCCCCCCeeeeECCEEEE-EEeCCc--eEEEEEeeeE
Confidence            57889997  789999999999999996533321    2110000           11111 344433  3678888889


Q ss_pred             EEEcCEEEeeeEEEecC--------CCCceEeeech
Q psy18197         65 VVIEKDFLTTSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        65 i~iG~~~~~~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      +.+|+......|..+..        ...|+|||+-+
T Consensus        86 v~~g~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~  121 (326)
T cd05487          86 VTVGGIPVTQMFGEVTALPAIPFMLAKFDGVLGMGY  121 (326)
T ss_pred             EEECCEEeeEEEEEEEeccCCccceeecceEEecCC
Confidence            99998755433333321        24689999865


No 35 
>PF05618 Zn_protease:  Putative ATP-dependant zinc protease;  InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=96.17  E-value=0.018  Score=41.76  Aligned_cols=45  Identities=20%  Similarity=0.372  Sum_probs=31.2

Q ss_pred             EEEEEEcCEEEeeeEEEecC--CCCceEee-echhheecEEEEcCCCEE
Q psy18197         62 MVQVVIEKDFLTTSLSILEE--QPMDMLLG-LDMLRRHQCCIDLKRNVL  107 (144)
Q Consensus        62 ~~~i~iG~~~~~~~~~V~~~--~~~d~ILG-~d~L~~~~~~iD~~~~~l  107 (144)
                      ...+++|+......|...+.  ..+++||| ..|| +.++.+|.++.-+
T Consensus        87 ~~~~~lg~~~~~~e~tL~dR~~m~yp~LlGrR~~l-~~~~lVD~s~~~~  134 (138)
T PF05618_consen   87 ETTLCLGGKTWKIEFTLTDRSNMKYPMLLGRRNFL-RGRFLVDVSRSFL  134 (138)
T ss_dssp             EEEEEETTEEEEEEEEEE-S--SS-SEEE-HHHHH-HTTEEEETT---T
T ss_pred             EEEEEECCEEEEEEEEEcCCCcCcCCEEEEehHHh-cCCEEECCChhhc
Confidence            46899999999999999985  36899999 5555 4568888876543


No 36 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=96.04  E-value=0.083  Score=42.39  Aligned_cols=88  Identities=15%  Similarity=0.288  Sum_probs=56.5

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceecCHHHHHH--cCceeee-----------cceeeeEeeccceeEEEEEEEEEEEE
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAER--CAIMRLI-----------DTRWAGVAKGVGIQRIIGRIHMVQVV   66 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~--lgl~~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i~   66 (144)
                      ++.++.|+.  +++.+++|||++..-+...-+..  |+-....           ...+. ...|.|.  ..|.+....+.
T Consensus         4 y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C~~~~~f~~~~SsT~~~~~~~~~-~~Yg~Gs--~~G~~~~D~i~   80 (318)
T cd05477           4 YYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQACTNHTKFNPSQSSTYSTNGETFS-LQYGSGS--LTGIFGYDTVT   80 (318)
T ss_pred             EEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccccccCCCCcccCCCceECCcEEE-EEECCcE--EEEEEEeeEEE
Confidence            678899975  89999999999999887543331  2110000           11122 3445443  46778888999


Q ss_pred             EcCEEEe-eeEEEecCC--------CCceEeeech
Q psy18197         67 IEKDFLT-TSLSILEEQ--------PMDMLLGLDM   92 (144)
Q Consensus        67 iG~~~~~-~~~~V~~~~--------~~d~ILG~d~   92 (144)
                      +|+..++ ..|.++...        ..|+|||+-+
T Consensus        81 ~g~~~i~~~~Fg~~~~~~~~~~~~~~~~GilGLg~  115 (318)
T cd05477          81 VQGIIITNQEFGLSETEPGTNFVYAQFDGILGLAY  115 (318)
T ss_pred             ECCEEEcCEEEEEEEecccccccccceeeEeecCc
Confidence            9998764 456655431        3589999974


No 37 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=95.95  E-value=0.1  Score=42.03  Aligned_cols=88  Identities=16%  Similarity=0.257  Sum_probs=55.3

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHHH--cCceee-----------ecceeeeEeeccceeEEEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAER--CAIMRL-----------IDTRWAGVAKGVGIQRIIGRIHMVQVV   66 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~--lgl~~~-----------~~~~~~~~a~g~g~~~~~g~i~~~~i~   66 (144)
                      ++++++|+  ++++.+++|||++...+...-+..  |.....           ....+. ...|.|  ...|.+....+.
T Consensus        11 Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C~~~~~y~~~~Sst~~~~~~~~~-~~y~~g--~~~G~~~~D~v~   87 (320)
T cd05488          11 YFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIACFLHSKYDSSASSTYKANGTEFK-IQYGSG--SLEGFVSQDTLS   87 (320)
T ss_pred             EEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCcccCCcceECCCCCcceeeCCCEEE-EEECCc--eEEEEEEEeEEE
Confidence            67889997  489999999999999886543332  211100           011121 334433  256877788999


Q ss_pred             EcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197         67 IEKDFLT-TSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        67 iG~~~~~-~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      +|+..++ ..|..+..        ...|+|||+.+
T Consensus        88 ig~~~~~~~~f~~a~~~~g~~~~~~~~dGilGLg~  122 (320)
T cd05488          88 IGDLTIKKQDFAEATSEPGLAFAFGKFDGILGLAY  122 (320)
T ss_pred             ECCEEECCEEEEEEecCCCcceeeeeeceEEecCC
Confidence            9987653 34544421        13589999975


No 38 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=95.94  E-value=0.066  Score=43.00  Aligned_cols=102  Identities=19%  Similarity=0.183  Sum_probs=60.2

Q ss_pred             EEECCEEE-------EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEee--
Q psy18197          6 CCVNGVPV-------KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTT--   74 (144)
Q Consensus         6 v~InG~~v-------~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~--   74 (144)
                      ++|||++.       .++||||.+.+.++++.++++--.  ......  ..+.-...+  ........+++++..+.+  
T Consensus       188 i~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~--~~~~~~--~~~~~~~~C~~~~~~p~l~~~f~g~~~~v~~  263 (318)
T cd05477         188 FQINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTLMQS--IGAQQD--QYGQYVVNCNNIQNLPTLTFTINGVSFPLPP  263 (318)
T ss_pred             EEECCEEecccCCCceeeECCCCccEECCHHHHHHHHHH--hCCccc--cCCCEEEeCCccccCCcEEEEECCEEEEECH
Confidence            57788653       589999999999999998876321  011100  001000000  001123455566654432  


Q ss_pred             -----------eEEEecC------CCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         75 -----------SLSILEE------QPMDMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        75 -----------~~~V~~~------~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                                 -+.+.+.      .....|||..||+..-...|+.++++-+..
T Consensus       264 ~~y~~~~~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~ig~a~  317 (318)
T cd05477         264 SAYILQNNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSVYDLGNNQVGFAT  317 (318)
T ss_pred             HHeEecCCCeEEEEEEecccCCCCCCceEEEcHHHhhheEEEEeCCCCEEeeee
Confidence                       2233221      113489999999999999999999987643


No 39 
>PTZ00165 aspartyl protease; Provisional
Probab=95.89  E-value=0.1  Score=45.03  Aligned_cols=89  Identities=17%  Similarity=0.194  Sum_probs=55.9

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCceee--e----cceee-----------eEeeccceeEEEEEEEE
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMRL--I----DTRWA-----------GVAKGVGIQRIIGRIHM   62 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~~--~----~~~~~-----------~~a~g~g~~~~~g~i~~   62 (144)
                      +|.++.|+.  +++++++|||++..-++..-+...++..+  -    .+.+.           ...+|.|.  ..|.+..
T Consensus       121 Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C~~~~~yd~s~SSTy~~~~~~~~~~~~~i~YGsGs--~~G~l~~  198 (482)
T PTZ00165        121 YFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGCAPHRKFDPKKSSTYTKLKLGDESAETYIQYGTGE--CVLALGK  198 (482)
T ss_pred             EEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCcccccccCCCCccccCCcEecCCCCccceEEEEeCCCc--EEEEEEE
Confidence            578899987  89999999999999887655443222110  0    01110           12345443  3466777


Q ss_pred             EEEEEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197         63 VQVVIEKDFLT-TSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        63 ~~i~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      +.+++|+..++ ..|.++..        ...|+|||+-|
T Consensus       199 DtV~ig~l~i~~q~FG~a~~~s~~~f~~~~~DGILGLg~  237 (482)
T PTZ00165        199 DTVKIGGLKVKHQSIGLAIEESLHPFADLPFDGLVGLGF  237 (482)
T ss_pred             EEEEECCEEEccEEEEEEEeccccccccccccceeecCC
Confidence            89999997664 34444321        24799999986


No 40 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=95.85  E-value=0.17  Score=40.58  Aligned_cols=101  Identities=16%  Similarity=0.126  Sum_probs=60.7

Q ss_pred             EEECCEEE------EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEeee--
Q psy18197          6 CCVNGVPV------KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTTS--   75 (144)
Q Consensus         6 v~InG~~v------~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~~--   75 (144)
                      ++|||+.+      .+++|||++.+.++++..+++--. . .....  ..+.-...+  ........+++++..+.++  
T Consensus       194 v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~-~-~~~~~--~~~~~~~~C~~~~~~P~~~f~f~g~~~~i~~~  269 (317)
T cd05478         194 VTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSD-I-GASQN--QNGEMVVNCSSISSMPDVVFTINGVQYPLPPS  269 (317)
T ss_pred             EEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHH-h-CCccc--cCCcEEeCCcCcccCCcEEEEECCEEEEECHH
Confidence            67888865      589999999999999998876321 0 00000  001000000  0112234556666554322  


Q ss_pred             -----------EEEecC-CCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197         76 -----------LSILEE-QPMDMLLGLDMLRRHQCCIDLKRNVLRIG  110 (144)
Q Consensus        76 -----------~~V~~~-~~~d~ILG~d~L~~~~~~iD~~~~~l~~~  110 (144)
                                 ..+.+. .....|||..||+..-.+.|++++++-+-
T Consensus       270 ~y~~~~~~~C~~~~~~~~~~~~~IlG~~fl~~~y~vfD~~~~~iG~A  316 (317)
T cd05478         270 AYILQDQGSCTSGFQSMGLGELWILGDVFIRQYYSVFDRANNKVGLA  316 (317)
T ss_pred             HheecCCCEEeEEEEeCCCCCeEEechHHhcceEEEEeCCCCEEeec
Confidence                       222222 12458999999999999999999998763


No 41 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=95.69  E-value=0.046  Score=44.20  Aligned_cols=89  Identities=13%  Similarity=0.091  Sum_probs=50.5

Q ss_pred             EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEE--EEEcCEEEeeeEEEecCCCCceEeee
Q psy18197         13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQ--VVIEKDFLTTSLSILEEQPMDMLLGL   90 (144)
Q Consensus        13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~--i~iG~~~~~~~~~V~~~~~~d~ILG~   90 (144)
                      ..++||||.+.++++++.++++--.-   +.+.-...+ |. +..  +...+  ++..+.  .+-+.... ...-.|||-
T Consensus       232 ~~aivDSGTs~~~lp~~~~~~l~~~~---P~i~~~f~~-g~-~~~--i~p~~y~~~~~~~--~c~~~~~~-~~~~~ILG~  301 (326)
T cd06096         232 LGMLVDSGSTLSHFPEDLYNKINNFF---PTITIIFEN-NL-KID--WKPSSYLYKKESF--WCKGGEKS-VSNKPILGA  301 (326)
T ss_pred             CCEEEeCCCCcccCCHHHHHHHHhhc---CcEEEEEcC-Cc-EEE--ECHHHhccccCCc--eEEEEEec-CCCceEECh
Confidence            45899999999999999999874211   111101111 10 000  00000  001111  01112222 223489999


Q ss_pred             chhheecEEEEcCCCEEEEcc
Q psy18197         91 DMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        91 d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      .||+.+-.++|++++++-+..
T Consensus       302 ~flr~~y~vFD~~~~riGfa~  322 (326)
T cd06096         302 SFFKNKQIIFDLDNNRIGFVE  322 (326)
T ss_pred             HHhcCcEEEEECcCCEEeeEc
Confidence            999999999999999998754


No 42 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=95.68  E-value=0.06  Score=42.81  Aligned_cols=79  Identities=16%  Similarity=0.277  Sum_probs=53.1

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE-EE-eeeEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD-FL-TTSLS   77 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~-~~-~~~~~   77 (144)
                      +++++.|+  .+++.+++|||++..-+...   .+       ..+. ..+|-| ....|......+++|+. .. +..|.
T Consensus         2 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~c~---~c-------~~~~-i~Yg~G-s~~~G~~~~D~v~ig~~~~~~~~~Fg   69 (299)
T cd05472           2 YVVTVGLGTPARDQTVIVDTGSDLTWVQCQ---PC-------CLYQ-VSYGDG-SYTTGDLATDTLTLGSSDVVPGFAFG   69 (299)
T ss_pred             eEEEEecCCCCcceEEEecCCCCcccccCC---CC-------Ceee-eEeCCC-ceEEEEEEEEEEEeCCCCccCCEEEE
Confidence            68899998  58899999999998887432   11       1222 344433 34567777889999986 44 34555


Q ss_pred             EecCC-----CCceEeeech
Q psy18197         78 ILEEQ-----PMDMLLGLDM   92 (144)
Q Consensus        78 V~~~~-----~~d~ILG~d~   92 (144)
                      +....     ..|+|||+-+
T Consensus        70 ~~~~~~~~~~~~~GilGLg~   89 (299)
T cd05472          70 CGHDNEGLFGGAAGLLGLGR   89 (299)
T ss_pred             CCccCCCccCCCCEEEECCC
Confidence            55421     4789999964


No 43 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=95.66  E-value=0.11  Score=41.70  Aligned_cols=88  Identities=17%  Similarity=0.285  Sum_probs=54.5

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHH--HcCceeee-----------cceeeeEeeccceeEEEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE--RCAIMRLI-----------DTRWAGVAKGVGIQRIIGRIHMVQVV   66 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~--~lgl~~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i~   66 (144)
                      +|.+++|+  ++++++++|||++..-+...-+.  .|.-...-           ...+. ...|-|  ...|.+....++
T Consensus         1 Y~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C~~~~~y~~~~SsT~~~~~~~~~-i~Yg~g--~~~G~~~~D~v~   77 (316)
T cd05486           1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQACTKHNRFQPSESSTYVSNGEAFS-IQYGTG--SLTGIIGIDQVT   77 (316)
T ss_pred             CeEEEEECCCCcEEEEEEcCCCccEEEecCCCCCcccCccceECCCCCcccccCCcEEE-EEeCCc--EEEEEeeecEEE
Confidence            47889997  68999999999999888653332  12111000           01111 344544  356778788999


Q ss_pred             EcCEEEe-eeEEEec--------CCCCceEeeech
Q psy18197         67 IEKDFLT-TSLSILE--------EQPMDMLLGLDM   92 (144)
Q Consensus        67 iG~~~~~-~~~~V~~--------~~~~d~ILG~d~   92 (144)
                      +|+..++ ..|.+..        ....|+|||+-+
T Consensus        78 ig~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~  112 (316)
T cd05486          78 VEGITVQNQQFAESVSEPGSTFQDSEFDGILGLAY  112 (316)
T ss_pred             ECCEEEcCEEEEEeeccCcccccccccceEeccCc
Confidence            9986553 3444332        125699999964


No 44 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=95.65  E-value=0.084  Score=42.51  Aligned_cols=89  Identities=16%  Similarity=0.206  Sum_probs=55.2

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHH---HcCceeee----cce------eeeEeeccceeEEEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE---RCAIMRLI----DTR------WAGVAKGVGIQRIIGRIHMVQVV   66 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~---~lgl~~~~----~~~------~~~~a~g~g~~~~~g~i~~~~i~   66 (144)
                      ++++++|+  ++++++++|||++...+...-+.   .|.....-    .+.      ......|.|.  ..|.+....+.
T Consensus        11 Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~--~~G~~~~D~v~   88 (317)
T cd06098          11 YFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYKSSKSSTYKKNGTSASIQYGTGS--ISGFFSQDSVT   88 (317)
T ss_pred             EEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccccccCcCCcccCCCcccCCCEEEEEcCCce--EEEEEEeeEEE
Confidence            57889997  68999999999998888654332   22211100    011      1113444443  46877788999


Q ss_pred             EcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197         67 IEKDFLT-TSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        67 iG~~~~~-~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      +|+..++ ..|.++..        ...|+|||+-+
T Consensus        89 ig~~~v~~~~f~~~~~~~~~~~~~~~~dGilGLg~  123 (317)
T cd06098          89 VGDLVVKNQVFIEATKEPGLTFLLAKFDGILGLGF  123 (317)
T ss_pred             ECCEEECCEEEEEEEecCCccccccccceeccccc
Confidence            9997654 34444321        24689999975


No 45 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=95.49  E-value=0.11  Score=40.82  Aligned_cols=96  Identities=17%  Similarity=0.232  Sum_probs=56.0

Q ss_pred             EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEE-EEEEEEEcCEEEeee----------------
Q psy18197         13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRI-HMVQVVIEKDFLTTS----------------   75 (144)
Q Consensus        13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i-~~~~i~iG~~~~~~~----------------   75 (144)
                      ..++||||++...++.+.++++--.  ...... ...+.....+.-.. ....+++++..+.++                
T Consensus       179 ~~~iiDSGt~~~~lP~~~~~~l~~~--~~~~~~-~~~~~~~~~C~~~~~p~i~f~f~g~~~~i~~~~~~~~~~~~~~~~~  255 (295)
T cd05474         179 LPALLDSGTTLTYLPSDIVDAIAKQ--LGATYD-SDEGLYVVDCDAKDDGSLTFNFGGATISVPLSDLVLPASTDDGGDG  255 (295)
T ss_pred             ccEEECCCCccEeCCHHHHHHHHHH--hCCEEc-CCCcEEEEeCCCCCCCEEEEEECCeEEEEEHHHhEeccccCCCCCC
Confidence            5789999999999999999876321  000000 00010000000000 234556666444221                


Q ss_pred             ---EEEecCCCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         76 ---LSILEEQPMDMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        76 ---~~V~~~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                         +.+.+......|||..||+.+-...|.+++++-+.+
T Consensus       256 ~C~~~i~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~a~  294 (295)
T cd05474         256 ACYLGIQPSTSDYNILGDTFLRSAYVVYDLDNNEISLAQ  294 (295)
T ss_pred             CeEEEEEeCCCCcEEeChHHhhcEEEEEECCCCEEEeec
Confidence               222222224589999999999999999999997743


No 46 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=95.44  E-value=0.16  Score=40.91  Aligned_cols=90  Identities=12%  Similarity=0.120  Sum_probs=54.0

Q ss_pred             EEECCEE-------EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEE------
Q psy18197          6 CCVNGVP-------VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFL------   72 (144)
Q Consensus         6 v~InG~~-------v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~------   72 (144)
                      ++|||+.       ..+++|||.+...+++++++++...  .+   +   ....      ......+++++..+      
T Consensus       197 i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~~~--~~---C---~~~~------~~P~i~f~f~g~~~~l~~~~  262 (317)
T cd06098         197 VLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQINSA--VD---C---NSLS------SMPNVSFTIGGKTFELTPEQ  262 (317)
T ss_pred             EEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhhcc--CC---c---cccc------cCCcEEEEECCEEEEEChHH
Confidence            5788875       3589999999999999998877521  00   0   0000      00111222222211      


Q ss_pred             -----------eeeEEEe--c---CCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197         73 -----------TTSLSIL--E---EQPMDMLLGLDMLRRHQCCIDLKRNVLRI  109 (144)
Q Consensus        73 -----------~~~~~V~--~---~~~~d~ILG~d~L~~~~~~iD~~~~~l~~  109 (144)
                                 .|-+.+.  +   ......|||..||+.+-.+.|++++++-+
T Consensus       263 yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlGd~Flr~~y~VfD~~~~~iGf  315 (317)
T cd06098         263 YILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNLRVGF  315 (317)
T ss_pred             eEEeecCCCCCEEeceEEECCCCCCCCCeEEechHHhcccEEEEeCCCCEEee
Confidence                       1222221  1   11224799999999999999999998865


No 47 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=95.37  E-value=0.24  Score=39.87  Aligned_cols=88  Identities=15%  Similarity=0.208  Sum_probs=54.7

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHH----HcCcee----ee-------cceeeeEeeccceeEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE----RCAIMR----LI-------DTRWAGVAKGVGIQRIIGRIHMVQ   64 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~----~lgl~~----~~-------~~~~~~~a~g~g~~~~~g~i~~~~   64 (144)
                      +++++.|+  ++++.+++|||++..-+...-+.    .|....    ..       ...+. ...|.|.  ..|.+....
T Consensus         7 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~C~~~~~y~~~~SsT~~~~~~~~~-i~Yg~G~--~~G~~~~D~   83 (325)
T cd05490           7 YYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIACWLHHKYNSSKSSTYVKNGTEFA-IQYGSGS--LSGYLSQDT   83 (325)
T ss_pred             EEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCccccCcCcCCcccCcceeeCCcEEE-EEECCcE--EEEEEeeeE
Confidence            57889997  48899999999999888543332    222110    00       01111 3455443  568888899


Q ss_pred             EEEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197         65 VVIEKDFLT-TSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        65 i~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      +++|+..++ ..|..+..        ...|+|||+-+
T Consensus        84 v~~g~~~~~~~~Fg~~~~~~~~~~~~~~~dGilGLg~  120 (325)
T cd05490          84 VSIGGLQVEGQLFGEAVKQPGITFIAAKFDGILGMAY  120 (325)
T ss_pred             EEECCEEEcCEEEEEEeeccCCcccceeeeEEEecCC
Confidence            999997654 34444321        13689999854


No 48 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=95.34  E-value=0.31  Score=39.48  Aligned_cols=101  Identities=14%  Similarity=0.122  Sum_probs=57.5

Q ss_pred             EEECCEEE-----EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEeee---
Q psy18197          6 CCVNGVPV-----KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTTS---   75 (144)
Q Consensus         6 v~InG~~v-----~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~~---   75 (144)
                      ++++++..     .++||||++.+.++++.++.+.-.  .....  ...+.-...+  ........+++|+..+.++   
T Consensus       199 i~v~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~--~~~~~--~~~~~~~~~C~~~~~~p~i~f~fgg~~~~i~~~~  274 (329)
T cd05485         199 VSVGEGEFCSGGCQAIADTGTSLIAGPVDEIEKLNNA--IGAKP--IIGGEYMVNCSAIPSLPDITFVLGGKSFSLTGKD  274 (329)
T ss_pred             EEECCeeecCCCcEEEEccCCcceeCCHHHHHHHHHH--hCCcc--ccCCcEEEeccccccCCcEEEEECCEEeEEChHH
Confidence            46676654     699999999999999988776321  01000  0001000000  0012234555565443221   


Q ss_pred             --------------EEEec-----CCCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197         76 --------------LSILE-----EQPMDMLLGLDMLRRHQCCIDLKRNVLRIG  110 (144)
Q Consensus        76 --------------~~V~~-----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~  110 (144)
                                    ..+..     ......|||..||+.+-.+.|++++++-+-
T Consensus       275 yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlG~~fl~~~y~vFD~~~~~ig~a  328 (329)
T cd05485         275 YVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYYTEFDLGNNRVGFA  328 (329)
T ss_pred             eEEEecCCCCCEEeeeEEECcCCCCCCCeEEEchHHhccceEEEeCCCCEEeec
Confidence                          11121     112247999999999999999999998763


No 49 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=95.20  E-value=0.21  Score=40.40  Aligned_cols=89  Identities=12%  Similarity=0.231  Sum_probs=55.2

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHH----HcCceeee----cceee------eEeeccceeEEEEEEEEEEE
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE----RCAIMRLI----DTRWA------GVAKGVGIQRIIGRIHMVQV   65 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~----~lgl~~~~----~~~~~------~~a~g~g~~~~~g~i~~~~i   65 (144)
                      +++++.|+  ++++.+++|||++..-+...-+.    .++.....    .+...      ....|.|.  ..|.+....+
T Consensus        12 Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~Sst~~~~~~~~~i~Y~~g~--~~G~~~~D~v   89 (329)
T cd05485          12 YYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIACLLHNKYDSTKSSTYKKNGTEFAIQYGSGS--LSGFLSTDTV   89 (329)
T ss_pred             EEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCccccCCCeECCcCCCCeEECCeEEEEEECCce--EEEEEecCcE
Confidence            67899998  58999999999999888654332    22221111    01110      12344443  5687878899


Q ss_pred             EEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197         66 VIEKDFLT-TSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        66 ~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      .+|+..++ ..|.++..        ...|+|||+-+
T Consensus        90 ~ig~~~~~~~~fg~~~~~~~~~~~~~~~~GilGLg~  125 (329)
T cd05485          90 SVGGVSVKGQTFAEAINEPGLTFVAAKFDGILGMGY  125 (329)
T ss_pred             EECCEEECCEEEEEEEecCCccccccccceEEEcCC
Confidence            99987653 34544421        23589999875


No 50 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=95.09  E-value=0.26  Score=40.47  Aligned_cols=88  Identities=17%  Similarity=0.190  Sum_probs=52.0

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcC-c-ee-------eecceeeeEeeccceeEEEEEEEEEEEEEcC-
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCA-I-MR-------LIDTRWAGVAKGVGIQRIIGRIHMVQVVIEK-   69 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lg-l-~~-------~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~-   69 (144)
                      +++++.|+  ++++.+++|||++..-+...-+.... . .+       .....+. ...|.|.  ..|.+..+.+++|+ 
T Consensus         4 Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~~~~~~f~~~~SsT~~~~~~~~~-i~Yg~Gs--~~G~~~~D~v~ig~~   80 (364)
T cd05473           4 YYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHPFIHTYFHRELSSTYRDLGKGVT-VPYTQGS--WEGELGTDLVSIPKG   80 (364)
T ss_pred             eEEEEEecCCCceEEEEEecCCcceEEEcCCCccccccCCchhCcCcccCCceEE-EEECcce--EEEEEEEEEEEECCC
Confidence            57889997  68999999999998887654221111 0 00       0011122 3455443  46888889999985 


Q ss_pred             --EEEeeeEEEec--------CCCCceEeeech
Q psy18197         70 --DFLTTSLSILE--------EQPMDMLLGLDM   92 (144)
Q Consensus        70 --~~~~~~~~V~~--------~~~~d~ILG~d~   92 (144)
                        ..+...+....        ....|+|||+-+
T Consensus        81 ~~~~~~~~~~~~~~~~~~~~~~~~~dGIlGLg~  113 (364)
T cd05473          81 PNVTFRANIAAITESENFFLNGSNWEGILGLAY  113 (364)
T ss_pred             CccceEEeeEEEeccccceecccccceeeeecc
Confidence              22332222221        124699999975


No 51 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=95.07  E-value=0.049  Score=42.16  Aligned_cols=79  Identities=16%  Similarity=0.126  Sum_probs=48.5

Q ss_pred             EEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeecccee--EEEEEEEEEEEEEcCEEEeeeEEEecCCCCceEee
Q psy18197         12 PVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQ--RIIGRIHMVQVVIEKDFLTTSLSILEEQPMDMLLG   89 (144)
Q Consensus        12 ~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~--~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~ILG   89 (144)
                      ...+++|||++.+.+++++++.+--.  ...... ........  ..........+++                 ..|||
T Consensus       202 ~~~~iiDsGt~~~~lp~~~~~~l~~~--~~~~~~-~~~~~~~~~~~~~~~~p~i~f~f-----------------~~ilG  261 (283)
T cd05471         202 GGGAIVDSGTSLIYLPSSVYDAILKA--LGAAVS-SSDGGYGVDCSPCDTLPDITFTF-----------------LWILG  261 (283)
T ss_pred             CcEEEEecCCCCEeCCHHHHHHHHHH--hCCccc-ccCCcEEEeCcccCcCCCEEEEE-----------------EEEcc
Confidence            46899999999999999999886431  111110 00000000  0000011111211                 79999


Q ss_pred             echhheecEEEEcCCCEEEEc
Q psy18197         90 LDMLRRHQCCIDLKRNVLRIG  110 (144)
Q Consensus        90 ~d~L~~~~~~iD~~~~~l~~~  110 (144)
                      ..||+..-...|.+++++-+.
T Consensus       262 ~~fl~~~y~vfD~~~~~igfa  282 (283)
T cd05471         262 DVFLRNYYTVFDLDNNRIGFA  282 (283)
T ss_pred             HhhhhheEEEEeCCCCEEeec
Confidence            999999999999999988763


No 52 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=94.83  E-value=0.18  Score=40.69  Aligned_cols=91  Identities=16%  Similarity=0.247  Sum_probs=56.5

Q ss_pred             CeEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCcee--ee----cc-----------------------eeeeEee
Q psy18197          1 MLFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMR--LI----DT-----------------------RWAGVAK   49 (144)
Q Consensus         1 ~~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~--~~----~~-----------------------~~~~~a~   49 (144)
                      ++++++.|+  .+++.+++|||++..-++..-+..++...  .-    .+                       .+. ...
T Consensus         3 ~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~~~~~~~-i~Y   81 (326)
T cd06096           3 YYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCYCLSCLNNKCEYS-ISY   81 (326)
T ss_pred             eEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccccCcCCCCcCcEE-EEE
Confidence            478899998  58999999999999988766555554310  00    00                       111 233


Q ss_pred             ccceeEEEEEEEEEEEEEcCEEEe--------eeEEEecC-------CCCceEeeechh
Q psy18197         50 GVGIQRIIGRIHMVQVVIEKDFLT--------TSLSILEE-------QPMDMLLGLDML   93 (144)
Q Consensus        50 g~g~~~~~g~i~~~~i~iG~~~~~--------~~~~V~~~-------~~~d~ILG~d~L   93 (144)
                      |.| +...|....+.+.+|+...+        ..|.....       ...|+|||+.+=
T Consensus        82 ~~g-s~~~G~~~~D~v~lg~~~~~~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~~  139 (326)
T cd06096          82 SEG-SSISGFYFSDFVSFESYLNSNSEKESFKKIFGCHTHETNLFLTQQATGILGLSLT  139 (326)
T ss_pred             CCC-CceeeEEEEEEEEeccCCCCccccccccEEeccCccccCcccccccceEEEccCC
Confidence            433 34677777889999875432        12222211       246899999863


No 53 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=94.83  E-value=0.22  Score=39.20  Aligned_cols=88  Identities=16%  Similarity=0.082  Sum_probs=51.7

Q ss_pred             EEECCEE-----EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcC--EE------E
Q psy18197          6 CCVNGVP-----VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEK--DF------L   72 (144)
Q Consensus         6 v~InG~~-----v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~--~~------~   72 (144)
                      ++|||++     ..++||||++.+.++++..    +++   -.+  ...+...        ..++++-.  ..      .
T Consensus       166 i~vg~~~~~~~~~~~ivDTGTt~t~lp~~~y----~p~---i~~--~f~~~~~--------~~~~~l~~~~y~~~~~~~~  228 (273)
T cd05475         166 LLFNGQPTGGKGLEVVFDSGSSYTYFNAQAY----FKP---LTL--KFGKGWR--------TRLLEIPPENYLIISEKGN  228 (273)
T ss_pred             EEECCEECcCCCceEEEECCCceEEcCCccc----ccc---EEE--EECCCCc--------eeEEEeCCCceEEEcCCCC
Confidence            5778774     4689999999999998865    211   111  1111000        01122211  10      1


Q ss_pred             eeeEEEecCC----CCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         73 TTSLSILEEQ----PMDMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        73 ~~~~~V~~~~----~~d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      .| +..++..    ....|||..||+.+-...|++++++-+..
T Consensus       229 ~C-l~~~~~~~~~~~~~~ilG~~~l~~~~~vfD~~~~riGfa~  270 (273)
T cd05475         229 VC-LGILNGSEIGLGNTNIIGDISMQGLMVIYDNEKQQIGWVR  270 (273)
T ss_pred             EE-EEEecCCCcCCCceEEECceEEEeeEEEEECcCCEeCccc
Confidence            23 2223221    12479999999999999999999887654


No 54 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=94.75  E-value=0.31  Score=39.22  Aligned_cols=101  Identities=14%  Similarity=0.106  Sum_probs=58.1

Q ss_pred             EEECCEE-----EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEee----
Q psy18197          6 CCVNGVP-----VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTT----   74 (144)
Q Consensus         6 v~InG~~-----v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~----   74 (144)
                      ++|+++.     ..+++|||++.+.+++++++.+.-.  ......  ..+.-...+  ........+++++..+.+    
T Consensus       194 i~vg~~~~~~~~~~~ivDSGtt~~~lp~~~~~~l~~~--~~~~~~--~~~~~~~~C~~~~~~P~i~f~f~g~~~~i~~~~  269 (320)
T cd05488         194 IGLGDEELELENTGAAIDTGTSLIALPSDLAEMLNAE--IGAKKS--WNGQYTVDCSKVDSLPDLTFNFDGYNFTLGPFD  269 (320)
T ss_pred             EEECCEEeccCCCeEEEcCCcccccCCHHHHHHHHHH--hCCccc--cCCcEEeeccccccCCCEEEEECCEEEEECHHH
Confidence            4666654     3689999999999999998875311  000000  001000000  011223455666654432    


Q ss_pred             ---------eEEEecC-----CCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197         75 ---------SLSILEE-----QPMDMLLGLDMLRRHQCCIDLKRNVLRIG  110 (144)
Q Consensus        75 ---------~~~V~~~-----~~~d~ILG~d~L~~~~~~iD~~~~~l~~~  110 (144)
                               -..+...     .....|||..||+..-.+.|+.+++|-+-
T Consensus       270 y~~~~~g~C~~~~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~iG~a  319 (320)
T cd05488         270 YTLEVSGSCISAFTGMDFPEPVGPLAIVGDAFLRKYYSVYDLGNNAVGLA  319 (320)
T ss_pred             heecCCCeEEEEEEECcCCCCCCCeEEEchHHhhheEEEEeCCCCEEeec
Confidence                     2222221     12358999999999999999999988763


No 55 
>PF03539 Spuma_A9PTase:  Spumavirus aspartic protease (A9);  InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=94.61  E-value=0.38  Score=35.37  Aligned_cols=80  Identities=18%  Similarity=0.230  Sum_probs=42.4

Q ss_pred             ECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCCceE
Q psy18197          8 VNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPMDML   87 (144)
Q Consensus         8 InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~I   87 (144)
                      |-|.++++--||||++++|+..+.+.   +..+.........|    .....+.-+.++|.|+.+.+.+.-.+   +|-+
T Consensus         1 ikg~~l~~~wDsga~ITCiP~~fl~~---E~Pi~~~~i~Tihg----~~~~~vYYl~fKi~grkv~aEVi~s~---~dy~   70 (163)
T PF03539_consen    1 IKGTKLKGHWDSGAQITCIPESFLEE---EQPIGKTLIKTIHG----EKEQDVYYLTFKINGRKVEAEVIASP---YDYI   70 (163)
T ss_dssp             ETTEEEEEEE-TT-SSEEEEGGGTTT------SEEEEEE-SS-----EEEEEEEEEEEEESS-EEEEEEEEES---SSSE
T ss_pred             CCCceeeEEecCCCeEEEccHHHhCc---cccccceEEEEecC----ceeccEEEEEEEEcCeEEEEEEecCc---cceE
Confidence            45888999999999999999987432   11112222212333    22233445789999987766544443   3322


Q ss_pred             e----eechhheec
Q psy18197         88 L----GLDMLRRHQ   97 (144)
Q Consensus        88 L----G~d~L~~~~   97 (144)
                      |    -.+|+.+..
T Consensus        71 li~p~diPw~~~~p   84 (163)
T PF03539_consen   71 LISPSDIPWYKKKP   84 (163)
T ss_dssp             EE-TTT-HHHHS--
T ss_pred             EEcccccccccCCC
Confidence            2    257888654


No 56 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=94.53  E-value=0.069  Score=42.06  Aligned_cols=85  Identities=15%  Similarity=0.153  Sum_probs=50.0

Q ss_pred             EEECCE------EEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEe
Q psy18197          6 CCVNGV------PVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSIL   79 (144)
Q Consensus         6 v~InG~------~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~   79 (144)
                      ++|+|+      ...++||||++...++++.++++--. ....... ...|  .    ..+ .|.-.+-..    .|.+ 
T Consensus       186 i~v~~~~~~~~~~~~~iiDSGTs~~~lP~~~~~~l~~~-l~g~~~~-~~~~--~----~~~-~C~~~~P~i----~f~~-  251 (278)
T cd06097         186 YTVGGDAPWSRSGFSAIADTGTTLILLPDAIVEAYYSQ-VPGAYYD-SEYG--G----WVF-PCDTTLPDL----SFAV-  251 (278)
T ss_pred             EEECCcceeecCCceEEeecCCchhcCCHHHHHHHHHh-CcCCccc-CCCC--E----EEE-ECCCCCCCE----EEEE-
Confidence            456665      35789999999999999888776310 0000000 0001  0    001 111111111    1111 


Q ss_pred             cCCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197         80 EEQPMDMLLGLDMLRRHQCCIDLKRNVLRI  109 (144)
Q Consensus        80 ~~~~~d~ILG~d~L~~~~~~iD~~~~~l~~  109 (144)
                           ..|||-.||+.+-.+.|+.++++-+
T Consensus       252 -----~~ilGd~fl~~~y~vfD~~~~~ig~  276 (278)
T cd06097         252 -----FSILGDVFLKAQYVVFDVGGPKLGF  276 (278)
T ss_pred             -----EEEEcchhhCceeEEEcCCCceeee
Confidence                 5799999999999999999998765


No 57 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=94.26  E-value=0.21  Score=39.64  Aligned_cols=27  Identities=7%  Similarity=0.091  Sum_probs=24.2

Q ss_pred             ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         85 DMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        85 d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      ..|||..||+.+-.+.|++++++-+..
T Consensus       270 ~~ilG~~fl~~~~vvfD~~~~~igfa~  296 (299)
T cd05472         270 LSIIGNVQQQTFRVVYDVAGGRIGFAP  296 (299)
T ss_pred             CEEEchHHccceEEEEECCCCEEeEec
Confidence            379999999999999999999998754


No 58 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=93.75  E-value=0.45  Score=37.42  Aligned_cols=83  Identities=14%  Similarity=0.228  Sum_probs=50.6

Q ss_pred             eEEEEEEC--CEEEEEEEecCCCceecCHH-HHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE----EE-e
Q psy18197          2 LFINCCVN--GVPVKAFVDSGAQTTIMSAE-CAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD----FL-T   73 (144)
Q Consensus         2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~-~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~----~~-~   73 (144)
                      +++++.|+  .+++.+++|||++..-+.-+ .+..++.      .+. ..+|-+ ....|.+..+.++++..    .. +
T Consensus         3 Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c~c------~~~-i~Ygd~-~~~~G~~~~D~v~~~~~~~~~~~~~   74 (273)
T cd05475           3 YYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGCQC------DYE-IEYADG-GSSMGVLVTDIFSLKLTNGSRAKPR   74 (273)
T ss_pred             eEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCCcC------ccE-eEeCCC-CceEEEEEEEEEEEeecCCCcccCC
Confidence            68899998  78899999999999988531 1222222      222 334422 24567777778888532    22 2


Q ss_pred             eeEEEec---------CCCCceEeeech
Q psy18197         74 TSLSILE---------EQPMDMLLGLDM   92 (144)
Q Consensus        74 ~~~~V~~---------~~~~d~ILG~d~   92 (144)
                      ..|-...         ....|+|||+-+
T Consensus        75 ~~Fgc~~~~~~~~~~~~~~~dGIlGLg~  102 (273)
T cd05475          75 IAFGCGYDQQGPLLNPPPPTDGILGLGR  102 (273)
T ss_pred             EEEEeeeccCCcccCCCccCCEEEECCC
Confidence            3333331         125789999953


No 59 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=93.72  E-value=0.66  Score=35.78  Aligned_cols=91  Identities=13%  Similarity=0.192  Sum_probs=56.0

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCceee--------------ecceeeeEeeccceeEEEEEEEEEEE
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMRL--------------IDTRWAGVAKGVGIQRIIGRIHMVQV   65 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~~--------------~~~~~~~~a~g~g~~~~~g~i~~~~i   65 (144)
                      +++++.|+.  +++.+++|||++.+.+...-+..+.....              ..........+.|  ...|.+....+
T Consensus         1 Y~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~Y~~g--~~~g~~~~D~v   78 (283)
T cd05471           1 YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHPRFKYDSSKSSTYKDTGCTFSITYGDG--SVTGGLGTDTV   78 (283)
T ss_pred             CEEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCCCCccCccCCceeecCCCEEEEEECCC--eEEEEEEEeEE
Confidence            467888876  58999999999999987766555532110              0011111333332  34555557899


Q ss_pred             EEcCEEE-eeeEEEecC-------CCCceEeeechhh
Q psy18197         66 VIEKDFL-TTSLSILEE-------QPMDMLLGLDMLR   94 (144)
Q Consensus        66 ~iG~~~~-~~~~~V~~~-------~~~d~ILG~d~L~   94 (144)
                      ++++... ...|..+..       ...++|||+..=.
T Consensus        79 ~~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~~~  115 (283)
T cd05471          79 TIGGLTIPNQTFGCATSESGDFSSSGFDGILGLGFPS  115 (283)
T ss_pred             EECCEEEeceEEEEEeccCCcccccccceEeecCCcc
Confidence            9998754 344444442       3578999986643


No 60 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=93.65  E-value=0.18  Score=39.74  Aligned_cols=98  Identities=14%  Similarity=0.149  Sum_probs=59.3

Q ss_pred             EEECCE------EEEEEEecCCCceecCHHHHHHcC----ceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe--
Q psy18197          6 CCVNGV------PVKAFVDSGAQTTIMSAECAERCA----IMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT--   73 (144)
Q Consensus         6 v~InG~------~v~~LvDTGA~~s~Is~~~a~~lg----l~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~--   73 (144)
                      +.+++.      ...+++|||++...++++..+++-    ..... ........-   .   .......+++++..+.  
T Consensus       187 i~i~~~~~~~~~~~~~~~Dtgt~~i~lp~~~~~~i~~~l~~~~~~-~~~~~~c~~---~---~~~p~l~f~~~~~~~~i~  259 (317)
T PF00026_consen  187 ISIGGESVFSSSGQQAILDTGTSYIYLPRSIFDAIIKALGGSYSD-GVYSVPCNS---T---DSLPDLTFTFGGVTFTIP  259 (317)
T ss_dssp             EEETTEEEEEEEEEEEEEETTBSSEEEEHHHHHHHHHHHTTEEEC-SEEEEETTG---G---GGSEEEEEEETTEEEEEE
T ss_pred             ccccccccccccceeeecccccccccccchhhHHHHhhhcccccc-eeEEEeccc---c---cccceEEEeeCCEEEEec
Confidence            566665      368999999999999999887762    21110 000000000   0   0012234455554332  


Q ss_pred             ---------------eeEEEec----CCCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197         74 ---------------TSLSILE----EQPMDMLLGLDMLRRHQCCIDLKRNVLRIG  110 (144)
Q Consensus        74 ---------------~~~~V~~----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~  110 (144)
                                     |...+..    ......+||+.||+.+=..+|++++++-+-
T Consensus       260 ~~~~~~~~~~~~~~~C~~~i~~~~~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~A  315 (317)
T PF00026_consen  260 PSDYIFKIEDGNGGYCYLGIQPMDSSDDSDDWILGSPFLRNYYVVFDYENNRIGFA  315 (317)
T ss_dssp             HHHHEEEESSTTSSEEEESEEEESSTTSSSEEEEEHHHHTTEEEEEETTTTEEEEE
T ss_pred             chHhcccccccccceeEeeeecccccccCCceEecHHHhhceEEEEeCCCCEEEEe
Confidence                           2222332    235679999999999999999999998764


No 61 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=93.24  E-value=0.54  Score=37.68  Aligned_cols=99  Identities=16%  Similarity=0.060  Sum_probs=55.4

Q ss_pred             EEECCEEE------EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEE--EEEEEEEEEEcCEEEee---
Q psy18197          6 CCVNGVPV------KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRII--GRIHMVQVVIEKDFLTT---   74 (144)
Q Consensus         6 v~InG~~v------~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~--g~i~~~~i~iG~~~~~~---   74 (144)
                      ++|||+.+      .++||||++.+.++++.++++--.  .....   ..|.-...+-  .......+.+++..+.+   
T Consensus       186 i~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l~~~--~~~~~---~~~~~~~~C~~~~~~p~i~f~f~g~~~~l~~~  260 (316)
T cd05486         186 IQVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQLQNY--IGATA---TDGEYGVDCSTLSLMPSVTFTINGIPYSLSPQ  260 (316)
T ss_pred             EEEecceEecCCCCEEEECCCcchhhcCHHHHHHHHHH--hCCcc---cCCcEEEeccccccCCCEEEEECCEEEEeCHH
Confidence            57777643      699999999999999887766210  00000   0000000000  01122344555543321   


Q ss_pred             --------------eEEEecC-----CCCceEeeechhheecEEEEcCCCEEEE
Q psy18197         75 --------------SLSILEE-----QPMDMLLGLDMLRRHQCCIDLKRNVLRI  109 (144)
Q Consensus        75 --------------~~~V~~~-----~~~d~ILG~d~L~~~~~~iD~~~~~l~~  109 (144)
                                    -+.+...     .....|||-.||+.+-.+.|+.++++-+
T Consensus       261 ~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ILGd~flr~~y~vfD~~~~~IGf  314 (316)
T cd05486         261 AYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYYSVFDRGNNRVGF  314 (316)
T ss_pred             HeEEecccCCCCEEeeEEEECCCCCCCCCeEEEchHHhcceEEEEeCCCCEeec
Confidence                          1111111     1123799999999999999999998765


No 62 
>PLN03146 aspartyl protease family protein; Provisional
Probab=92.50  E-value=0.48  Score=40.17  Aligned_cols=27  Identities=11%  Similarity=0.051  Sum_probs=24.3

Q ss_pred             ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         85 DMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        85 d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      ..|||..+++.+.+..|.+++++-|..
T Consensus       399 ~~IlG~~~q~~~~vvyDl~~~~igFa~  425 (431)
T PLN03146        399 IAIFGNLAQMNFLVGYDLESKTVSFKP  425 (431)
T ss_pred             ceEECeeeEeeEEEEEECCCCEEeeec
Confidence            379999999999999999999998755


No 63 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=91.58  E-value=3.4  Score=33.23  Aligned_cols=98  Identities=12%  Similarity=0.108  Sum_probs=55.9

Q ss_pred             EEECCEEE------EEEEecCCCceecCHHHHHHcCc----eeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe--
Q psy18197          6 CCVNGVPV------KAFVDSGAQTTIMSAECAERCAI----MRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT--   73 (144)
Q Consensus         6 v~InG~~v------~~LvDTGA~~s~Is~~~a~~lgl----~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~--   73 (144)
                      ++|+++.+      .++||||++.+.++++.++++--    ... ...+.-...-      ......+.+++|+..+.  
T Consensus       195 i~vg~~~~~~~~~~~aiiDSGts~~~lP~~~~~~l~~~~~~~~~-~~~y~~~C~~------~~~~P~i~f~fgg~~~~v~  267 (326)
T cd05487         195 VSVGSSTLLCEDGCTAVVDTGASFISGPTSSISKLMEALGAKER-LGDYVVKCNE------VPTLPDISFHLGGKEYTLS  267 (326)
T ss_pred             EEECCEEEecCCCCEEEECCCccchhCcHHHHHHHHHHhCCccc-CCCEEEeccc------cCCCCCEEEEECCEEEEeC
Confidence            56676643      58999999999999998776521    100 0000000000      00112234455554332  


Q ss_pred             ---------------eeEEEec-----CCCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197         74 ---------------TSLSILE-----EQPMDMLLGLDMLRRHQCCIDLKRNVLRIG  110 (144)
Q Consensus        74 ---------------~~~~V~~-----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~  110 (144)
                                     |-..+..     ......|||..||+.+-.+.|+.++++-+-
T Consensus       268 ~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilG~~flr~~y~vfD~~~~~IGfA  324 (326)
T cd05487         268 SSDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFYTEFDRQNNRIGFA  324 (326)
T ss_pred             HHHhEEeccCCCCCEEEEEEEeCCCCCCCCCeEEEehHHhhccEEEEeCCCCEEeee
Confidence                           2122221     111247999999999999999999988663


No 64 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=90.46  E-value=1.5  Score=35.28  Aligned_cols=93  Identities=11%  Similarity=0.074  Sum_probs=52.2

Q ss_pred             EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEE--EEEEEEEEEEcCEEEe-----------------
Q psy18197         13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRII--GRIHMVQVVIEKDFLT-----------------   73 (144)
Q Consensus        13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~--g~i~~~~i~iG~~~~~-----------------   73 (144)
                      ..++||||.+.+.++++.++.+.-. .....   ...+.-...+.  .......+.+|+..+.                 
T Consensus       207 ~~aiiDSGTt~~~~p~~~~~~l~~~-~~~~~---~~~~~~~~~C~~~~~~P~i~f~fgg~~~~l~~~~y~~~~~~~~~~~  282 (325)
T cd05490         207 CEAIVDTGTSLITGPVEEVRALQKA-IGAVP---LIQGEYMIDCEKIPTLPVISFSLGGKVYPLTGEDYILKVSQRGTTI  282 (325)
T ss_pred             CEEEECCCCccccCCHHHHHHHHHH-hCCcc---ccCCCEEecccccccCCCEEEEECCEEEEEChHHeEEeccCCCCCE
Confidence            4799999999999999988876321 00000   00110000000  0112234555554332                 


Q ss_pred             eeEEEec-----CCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197         74 TSLSILE-----EQPMDMLLGLDMLRRHQCCIDLKRNVLRI  109 (144)
Q Consensus        74 ~~~~V~~-----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~  109 (144)
                      |-+.+..     ......|||..||+.+-.+.|++++++-+
T Consensus       283 C~~~~~~~~~~~~~~~~~ilGd~flr~~y~vfD~~~~~IGf  323 (325)
T cd05490         283 CLSGFMGLDIPPPAGPLWILGDVFIGRYYTVFDRDNDRVGF  323 (325)
T ss_pred             EeeEEEECCCCCCCCceEEEChHhheeeEEEEEcCCcEeec
Confidence            2111211     01224799999999999999999998865


No 65 
>PF03419 Peptidase_U4:  Sporulation factor SpoIIGA  This family belongs to family U4 of the peptidase classification.;  InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-).  Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=90.09  E-value=6.6  Score=31.49  Aligned_cols=33  Identities=24%  Similarity=0.389  Sum_probs=25.9

Q ss_pred             eEEEEEECCEE--EEEEEecCCCc---------eecCHHHHHHc
Q psy18197          2 LFINCCVNGVP--VKAFVDSGAQT---------TIMSAECAERC   34 (144)
Q Consensus         2 ~~v~v~InG~~--v~~LvDTGA~~---------s~Is~~~a~~l   34 (144)
                      +-+++.++|++  +++++|||-+.         .+++.+.++++
T Consensus       158 ~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~  201 (293)
T PF03419_consen  158 YPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL  201 (293)
T ss_pred             EEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence            34677889985  79999999764         57788888877


No 66 
>PTZ00165 aspartyl protease; Provisional
Probab=89.87  E-value=4.1  Score=35.22  Aligned_cols=27  Identities=15%  Similarity=0.255  Sum_probs=24.7

Q ss_pred             ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         85 DMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        85 d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      ..|||-.||+++-.+.|..++++-+-+
T Consensus       419 ~~ILGd~Flr~yy~VFD~~n~rIGfA~  445 (482)
T PTZ00165        419 LFVLGNNFIRKYYSIFDRDHMMVGLVP  445 (482)
T ss_pred             eEEEchhhheeEEEEEeCCCCEEEEEe
Confidence            479999999999999999999998865


No 67 
>PF14541 TAXi_C:  Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=89.65  E-value=1.3  Score=32.15  Aligned_cols=28  Identities=11%  Similarity=0.077  Sum_probs=24.1

Q ss_pred             CCceEeeechhheecEEEEcCCCEEEEc
Q psy18197         83 PMDMLLGLDMLRRHQCCIDLKRNVLRIG  110 (144)
Q Consensus        83 ~~d~ILG~d~L~~~~~~iD~~~~~l~~~  110 (144)
                      ..-.+||.-.++.+.+..|.+++++-+.
T Consensus       133 ~~~~viG~~~~~~~~v~fDl~~~~igF~  160 (161)
T PF14541_consen  133 DGVSVIGNFQQQNYHVVFDLENGRIGFA  160 (161)
T ss_dssp             SSSEEE-HHHCCTEEEEEETTTTEEEEE
T ss_pred             CCcEEECHHHhcCcEEEEECCCCEEEEe
Confidence            4569999999999999999999999874


No 68 
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability 
Probab=89.53  E-value=4.1  Score=33.67  Aligned_cols=26  Identities=15%  Similarity=0.350  Sum_probs=23.2

Q ss_pred             eEeeechhheecEEEEcCCCEEEEcc
Q psy18197         86 MLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        86 ~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      .|||.-+|+.+....|.+++++-+..
T Consensus       335 ~IlG~~~~~~~~vvyD~~~~riGfa~  360 (362)
T cd05489         335 VVIGGHQMEDNLLVFDLEKSRLGFSS  360 (362)
T ss_pred             EEEeeheecceEEEEECCCCEeeccc
Confidence            58999999999999999999987643


No 69 
>PTZ00147 plasmepsin-1; Provisional
Probab=87.48  E-value=2.8  Score=35.99  Aligned_cols=93  Identities=13%  Similarity=0.222  Sum_probs=53.6

Q ss_pred             EEEEEEecCCCceecCHHHHHHcCc----eeee-cceeeeEeeccceeEEEEEEEEEEEEEcCEEEe-------------
Q psy18197         12 PVKAFVDSGAQTTIMSAECAERCAI----MRLI-DTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT-------------   73 (144)
Q Consensus        12 ~v~~LvDTGA~~s~Is~~~a~~lgl----~~~~-~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~-------------   73 (144)
                      ...+++|||.+.+.++++.++++--    .... ..........       .......+.+++..+.             
T Consensus       332 ~~~aIiDSGTsli~lP~~~~~ai~~~l~~~~~~~~~~y~~~C~~-------~~lP~~~f~f~g~~~~L~p~~yi~~~~~~  404 (453)
T PTZ00147        332 KANVIVDSGTSVITVPTEFLNKFVESLDVFKVPFLPLYVTTCNN-------TKLPTLEFRSPNKVYTLEPEYYLQPIEDI  404 (453)
T ss_pred             ceeEEECCCCchhcCCHHHHHHHHHHhCCeecCCCCeEEEeCCC-------CCCCeEEEEECCEEEEECHHHheeccccC
Confidence            3679999999999999998876421    1000 0000000000       0111233334443221             


Q ss_pred             ----eeEEEecC--CCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         74 ----TSLSILEE--QPMDMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        74 ----~~~~V~~~--~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                          |-+.+.+.  .....|||..||+.+-.+.|+.++++-+-.
T Consensus       405 ~~~~C~~~i~~~~~~~~~~ILGd~FLr~~YtVFD~~n~rIGfA~  448 (453)
T PTZ00147        405 GSALCMLNIIPIDLEKNTFILGDPFMRKYFTVFDYDNHTVGFAL  448 (453)
T ss_pred             CCcEEEEEEEECCCCCCCEEECHHHhccEEEEEECCCCEEEEEE
Confidence                22223321  123479999999999999999999998765


No 70 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=86.31  E-value=6.4  Score=33.77  Aligned_cols=27  Identities=15%  Similarity=0.338  Sum_probs=23.9

Q ss_pred             ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197         85 DMLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        85 d~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      ..|||-.||+.+-.+.|+.++++-+-.
T Consensus       421 ~~ILGd~FLr~~Y~VFD~~n~rIGfA~  447 (450)
T PTZ00013        421 TFILGDPFMRKYFTVFDYDKESVGFAI  447 (450)
T ss_pred             CEEECHHHhccEEEEEECCCCEEEEEE
Confidence            479999999999999999999987643


No 71 
>KOG1339|consensus
Probab=83.56  E-value=11  Score=31.52  Aligned_cols=89  Identities=15%  Similarity=0.234  Sum_probs=53.9

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceecCHHHHH-HcCc-eee-ec------------ce-----------------eeeE
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAE-RCAI-MRL-ID------------TR-----------------WAGV   47 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~-~lgl-~~~-~~------------~~-----------------~~~~   47 (144)
                      +|+++.|+-  +++.+++|||++..-++-.-+. .+.- ... .+            ..                 +. .
T Consensus        47 Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~~~~~~~f~p~~SSt~~~~~c~~~~c~~~~~~~~~~~~C~y~-i  125 (398)
T KOG1339|consen   47 YYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACYSQHNPIFDPSASSTYKSVGCSSPRCKSLPQSCSPNSSCPYS-I  125 (398)
T ss_pred             cEEEEecCCCCeeeEEEEeCCCCceeeccccccccccccCCCccCccccccccccCCCCccccccccCcccCCcCceE-E
Confidence            577788865  6699999999999999876555 3322 100 00            01                 22 3


Q ss_pred             eeccceeEEEEEEEEEEEEEcCE---EE-eeeEEEecC--------CCCceEeeech
Q psy18197         48 AKGVGIQRIIGRIHMVQVVIEKD---FL-TTSLSILEE--------QPMDMLLGLDM   92 (144)
Q Consensus        48 a~g~g~~~~~g~i~~~~i~iG~~---~~-~~~~~V~~~--------~~~d~ILG~d~   92 (144)
                      .+|-|+ ...|......+++++.   .. ...|.....        ...|+|||+-+
T Consensus       126 ~Ygd~~-~~~G~l~~Dtv~~~~~~~~~~~~~~FGc~~~~~g~~~~~~~~dGIlGLg~  181 (398)
T KOG1339|consen  126 QYGDGS-STSGYLATDTVTFGGTTSLPVPNQTFGCGTNNPGSFGLFAAFDGILGLGR  181 (398)
T ss_pred             EeCCCC-ceeEEEEEEEEEEccccccccccEEEEeeecCccccccccccceEeecCC
Confidence            455433 4677777788999873   21 133332211        35789999874


No 72 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=78.87  E-value=2.9  Score=34.24  Aligned_cols=26  Identities=8%  Similarity=0.180  Sum_probs=24.4

Q ss_pred             eEeeechhheecEEEEcCCCEEEEcc
Q psy18197         86 MLLGLDMLRRHQCCIDLKRNVLRIGT  111 (144)
Q Consensus        86 ~ILG~d~L~~~~~~iD~~~~~l~~~~  111 (144)
                      .|||..||+.+-.+.|..++++-+..
T Consensus       319 ~ILG~~flr~~yvvfD~~~~rIGfa~  344 (364)
T cd05473         319 TVIGAVIMEGFYVVFDRANKRVGFAV  344 (364)
T ss_pred             eEEeeeeEcceEEEEECCCCEEeeEe
Confidence            79999999999999999999998866


No 73 
>PF11925 DUF3443:  Protein of unknown function (DUF3443);  InterPro: IPR021847  This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 400 to 434 amino acids in length. This protein has two conserved sequence motifs: NPV and DNNG. 
Probab=77.82  E-value=6.4  Score=33.03  Aligned_cols=92  Identities=12%  Similarity=0.218  Sum_probs=50.5

Q ss_pred             CeEEEEEECC------EEE-EEEEecCCCc-eecCHHHHHHc--Cceeeec--ceeeeEeeccceeEEEEEEEEEEEEEc
Q psy18197          1 MLFINCCVNG------VPV-KAFVDSGAQT-TIMSAECAERC--AIMRLID--TRWAGVAKGVGIQRIIGRIHMVQVVIE   68 (144)
Q Consensus         1 ~~~v~v~InG------~~v-~~LvDTGA~~-s~Is~~~a~~l--gl~~~~~--~~~~~~a~g~g~~~~~g~i~~~~i~iG   68 (144)
                      |+|+.|+|.-      +.+ .+|||||+.- .++++.+-.-|  .|+....  .++. +.........-|.++...++||
T Consensus        23 ~p~VsVtVC~PGts~CqTIdnvlVDTGS~GLRi~~sAl~~~l~~~Lp~~t~~g~~la-EC~~F~sgytWGsVr~AdV~ig  101 (370)
T PF11925_consen   23 IPTVSVTVCAPGTSNCQTIDNVLVDTGSYGLRIFASALPSSLAGSLPQQTGGGAPLA-ECAQFASGYTWGSVRTADVTIG  101 (370)
T ss_pred             ceeeEEEEeCCCCCCceeeCcEEEeccchhhhHHHhhhchhhhccCCcccCCCcchh-hhhhccCcccccceEEEEEEEc
Confidence            5677777732      333 7899999964 33333231222  2221110  0000 1111112356778889999999


Q ss_pred             CEE-EeeeEEEecCC----------------------CCceEeeechh
Q psy18197         69 KDF-LTTSLSILEEQ----------------------PMDMLLGLDML   93 (144)
Q Consensus        69 ~~~-~~~~~~V~~~~----------------------~~d~ILG~d~L   93 (144)
                      ++. ..+++.|+.+.                      ...+|||..-+
T Consensus       102 ge~A~~iPiQvI~D~~~~~~P~sC~~~g~~~~t~~~lgaNGILGIg~~  149 (370)
T PF11925_consen  102 GETASSIPIQVIGDSAAPSVPSSCSNSGASMNTVADLGANGILGIGPF  149 (370)
T ss_pred             CeeccccCEEEEcCCCCCCCCchhhcCCCCCCCcccccCceEEeecCC
Confidence            975 35667777431                      35799999554


No 74 
>PF14543 TAXi_N:  Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=76.54  E-value=4.3  Score=29.65  Aligned_cols=25  Identities=20%  Similarity=0.481  Sum_probs=20.7

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceec
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIM   26 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~I   26 (144)
                      +++++.|+.  +++.+++|||++.+-+
T Consensus         1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~   27 (164)
T PF14543_consen    1 YYVSVSIGTPPQPFSLVVDTGSDLTWV   27 (164)
T ss_dssp             EEEEEECTCTTEEEEEEEETT-SSEEE
T ss_pred             CEEEEEeCCCCceEEEEEECCCCceEE
Confidence            578888877  7899999999988876


No 75 
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=68.34  E-value=7.5  Score=31.33  Aligned_cols=34  Identities=21%  Similarity=0.508  Sum_probs=26.1

Q ss_pred             eEEEEEECCEE--EEEEEecCCC---------ceecCHHHHHHcC
Q psy18197          2 LFINCCVNGVP--VKAFVDSGAQ---------TTIMSAECAERCA   35 (144)
Q Consensus         2 ~~v~v~InG~~--v~~LvDTGA~---------~s~Is~~~a~~lg   35 (144)
                      +-++++++|++  +++++|||-+         +.+++.+.++++-
T Consensus       159 ~~v~i~~~g~~~~~~alvDTGN~L~DPlT~~PV~Ive~~~~~~~l  203 (288)
T TIGR02854       159 YELEICLDGKKVTIKGFLDTGNQLRDPLTKLPVIVVEYDSLKSIL  203 (288)
T ss_pred             EEEEEEECCEEEEEEEEEecCCcccCCCCCCCEEEEEHHHhhhhC
Confidence            34678889985  7999999976         4677888777763


No 76 
>PLN03146 aspartyl protease family protein; Provisional
Probab=58.87  E-value=12  Score=31.71  Aligned_cols=27  Identities=22%  Similarity=0.450  Sum_probs=22.5

Q ss_pred             eEEEEEECC--EEEEEEEecCCCceecCH
Q psy18197          2 LFINCCVNG--VPVKAFVDSGAQTTIMSA   28 (144)
Q Consensus         2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~   28 (144)
                      +++++.|+.  +++.+++|||++.+-+.-
T Consensus        85 Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~C  113 (431)
T PLN03146         85 YLMNISIGTPPVPILAIADTGSDLIWTQC  113 (431)
T ss_pred             EEEEEEcCCCCceEEEEECCCCCcceEcC
Confidence            578888974  678999999999998854


No 77 
>PF13856 Gifsy-2:  ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A.
Probab=51.56  E-value=18  Score=24.04  Aligned_cols=19  Identities=26%  Similarity=0.382  Sum_probs=15.5

Q ss_pred             EEEEECCEEEEEEEecCCC
Q psy18197          4 INCCVNGVPVKAFVDSGAQ   22 (144)
Q Consensus         4 v~v~InG~~v~~LvDTGA~   22 (144)
                      -++.|||+++++++|..-.
T Consensus        22 ~~~~InG~~~~~v~d~~~~   40 (95)
T PF13856_consen   22 EEHTINGKPYRAVFDEPEL   40 (95)
T ss_dssp             EEEEETTEEEEEEEGGGT-
T ss_pred             CeEEECCEEEEEEECCchh
Confidence            3689999999999997654


No 78 
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=44.75  E-value=38  Score=22.29  Aligned_cols=35  Identities=11%  Similarity=0.062  Sum_probs=29.5

Q ss_pred             EEEEEECCEEEEEEEecCC--CceecCHHHHHHcCce
Q psy18197          3 FINCCVNGVPVKAFVDSGA--QTTIMSAECAERCAIM   37 (144)
Q Consensus         3 ~v~v~InG~~v~~LvDTGA--~~s~Is~~~a~~lgl~   37 (144)
                      .++++.||..++|.++..+  +..-+...+++++++.
T Consensus         2 ~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~   38 (81)
T cd06396           2 NLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN   38 (81)
T ss_pred             EEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC
Confidence            5788999999999999966  5566788899999985


No 79 
>PRK08364 sulfur carrier protein ThiS; Provisional
Probab=36.59  E-value=59  Score=20.22  Aligned_cols=33  Identities=21%  Similarity=0.178  Sum_probs=23.3

Q ss_pred             CeEEEEEECCE--EEEEEEecCCCceecCHHHHHHcCce
Q psy18197          1 MLFINCCVNGV--PVKAFVDSGAQTTIMSAECAERCAIM   37 (144)
Q Consensus         1 ~~~v~v~InG~--~v~~LvDTGA~~s~Is~~~a~~lgl~   37 (144)
                      |..+++++||+  +...=++.|++.    .++.+.|++.
T Consensus         2 ~~mm~v~vng~~~~~~~~~~~~~tv----~~ll~~l~~~   36 (70)
T PRK08364          2 MLMIRVKVIGRGIEKEIEWRKGMKV----ADILRAVGFN   36 (70)
T ss_pred             ceEEEEEEeccccceEEEcCCCCcH----HHHHHHcCCC
Confidence            56789999999  444445766553    3677888874


No 80 
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster.  Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=33.01  E-value=66  Score=20.93  Aligned_cols=35  Identities=11%  Similarity=0.040  Sum_probs=32.0

Q ss_pred             EEEEEECCEEEEEEEecCCCceecCHHHHHHcCce
Q psy18197          3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIM   37 (144)
Q Consensus         3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~   37 (144)
                      -|++..||..+++=+.+..+..-+-.++++++++.
T Consensus         2 ~vK~~~~~d~~r~~l~~~~~~~~L~~~i~~r~~~~   36 (82)
T cd06407           2 RVKATYGEEKIRFRLPPSWGFTELKQEIAKRFKLD   36 (82)
T ss_pred             EEEEEeCCeEEEEEcCCCCCHHHHHHHHHHHhCCC
Confidence            57888999999999999999999999999999984


No 81 
>PF13954 PapC_N:  PapC N-terminal domain; PDB: 2VQI_B 3FIP_A 3RFZ_E 3OHN_A 1ZDV_A 1ZE3_D 3BWU_D 1ZDX_A.
Probab=27.67  E-value=1.6e+02  Score=20.86  Aligned_cols=37  Identities=14%  Similarity=0.181  Sum_probs=25.7

Q ss_pred             eEEEEEECCEE-----EEEEEecCC-CceecCHHHHHHcCcee
Q psy18197          2 LFINCCVNGVP-----VKAFVDSGA-QTTIMSAECAERCAIMR   38 (144)
Q Consensus         2 ~~v~v~InG~~-----v~~LvDTGA-~~s~Is~~~a~~lgl~~   38 (144)
                      +.+.+.|||+.     ++|.-+... ..-++++++.+++|++.
T Consensus        32 Y~vdv~vN~~~~~~~~i~f~~~~~~~~~pClt~~~L~~~Gi~~   74 (146)
T PF13954_consen   32 YSVDVYVNGKFIGRYDIEFINNDDGKLQPCLTPELLRQLGIKP   74 (146)
T ss_dssp             EEEEEEETTEEEEEEEEEEEESSSTSEEEE-BHHHHHHTTB-G
T ss_pred             EEEEEEECCeeeeeEEEEEEeCCCcceeEEeCHHHHHHcCCCc
Confidence            46889999974     455444333 45799999999999964


No 82 
>KOG3048|consensus
Probab=27.16  E-value=47  Score=24.37  Aligned_cols=23  Identities=22%  Similarity=0.322  Sum_probs=19.6

Q ss_pred             eEEEEEECCEEEEEEEecCCCcee
Q psy18197          2 LFINCCVNGVPVKAFVDSGAQTTI   25 (144)
Q Consensus         2 ~~v~v~InG~~v~~LvDTGA~~s~   25 (144)
                      +|||++++. +-++|||-|-..-+
T Consensus        73 lYVPGkl~d-~~k~lVDIGTGYyV   95 (153)
T KOG3048|consen   73 LYVPGKLSD-NSKFLVDIGTGYYV   95 (153)
T ss_pred             eeccceecc-ccceeEeccCceEE
Confidence            799999987 78999999987655


No 83 
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates.  This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP).   This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=27.12  E-value=1e+02  Score=19.34  Aligned_cols=36  Identities=6%  Similarity=-0.136  Sum_probs=31.6

Q ss_pred             EEEEEECCEEEEEEEecCCCceecCHHHHHHcCcee
Q psy18197          3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMR   38 (144)
Q Consensus         3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~   38 (144)
                      -+.++.+|+.+..=+|..+++.-+-..++++.|+++
T Consensus         2 ~i~vk~~g~~~~v~v~~~~Tv~~lK~~i~~~tgvp~   37 (74)
T cd01813           2 PVIVKWGGQEYSVTTLSEDTVLDLKQFIKTLTGVLP   37 (74)
T ss_pred             EEEEEECCEEEEEEECCCCCHHHHHHHHHHHHCCCH
Confidence            478999999999889999999999999999988753


No 84 
>KOG0679|consensus
Probab=24.93  E-value=37  Score=28.96  Aligned_cols=20  Identities=40%  Similarity=0.453  Sum_probs=17.4

Q ss_pred             CCEEEEEEEecCCCceecCH
Q psy18197          9 NGVPVKAFVDSGAQTTIMSA   28 (144)
Q Consensus         9 nG~~v~~LvDTGA~~s~Is~   28 (144)
                      ||++--++||.||+++-+++
T Consensus       151 ~GrstalVvDiGa~~~svsP  170 (426)
T KOG0679|consen  151 NGRSTALVVDIGATHTSVSP  170 (426)
T ss_pred             cCCCceEEEEecCCCceeee
Confidence            88999999999999886654


No 85 
>PF09379 FERM_N:  FERM N-terminal domain ;  InterPro: IPR018979  This domain is the N-terminal ubiquitin-like structural domain of the FERM domain.  The FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane []. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus of the majority of FERM-containing proteins [, ], which includes:    Band 4.1, which links the spectrin-actin cytoskeleton of erythrocytes to the plasma membrane. Ezrin, a component of the undercoat of the microvilli plasma membrane. Moesin, which is probably involved in binding major cytoskeletal structures to the plasma membrane. Radixin, which is involved in the binding of the barbed end of actin filaments to the plasma membrane in the undercoat of the cell- to-cell adherens junction. Talin, a cytoskeletal protein concentrated in regions of cell-substratum contact and, in lymphocytes, of cell-cell contacts. Filopodin, a slime mold protein that binds actin and which is involved in the control of cell motility and chemotaxis. Merlin (or schwannomin). Protein NBL4. Unconventional myosins X, VIIa and XV, which are mutated in congenital deafness. Focal-adhesion kinases (FAKs), cytoplasmic protein tyrosine kinases involved in signalling through integrins. Janus tyrosine kinases (JAKs), cytoplasmic tyrosine kinases that are non-covalently associated with the cytoplasmic tails of receptors for cytokines or polypeptidic hormones. Non-receptor tyrosine-protein kinase TYK2. Protein-tyrosine phosphatases PTPN3 and PTPN4, enzyme that appear to act at junctions between the membrane and the cytoskeleton. Protein-tyrosine phosphatases PTPN14 and PTP-D1, PTP-RL10 and PTP2E.  Caenorhabditis elegans protein phosphatase ptp-1.   Ezrin, moesin, and radixin are highly related proteins (ERM protein family), but the other proteins in which the FERM domain is found do not share any region of similarity outside of this domain. ERM proteins are made of three domains, the FERM domain, a central helical domain and a C-terminal tail domain, which binds F-actin. The amino-acid sequence of the FERM domain is highly conserved among ERM proteins and is responsible for membrane association by direct binding to the cytoplasmic domain or tail of integral membrane proteins. ERM proteins are regulated by an intramolecular association of the FERM and C-terminal tail domains that masks their binding sites for other molecules. For cytoskeleton-membrane cross-linking, the dormant molecules becomes activated and the FERM domain attaches to the membrane by binding specific membrane proteins, while the last 34 residues of the tail bind actin filaments. Aside from binding to membranes, the activated FERM domain of ERM proteins can also bind the guanine nucleotide dissociation inhibitor of Rho GTPase (RhoDGI), which suggests that in addition to functioning as a cross-linker, ERM proteins may influence Rho signalling pathways. The crystal structure of the FERM domain reveals that it is composed of three structural modules (F1, F2, and F3) that together form a compact clover-shaped structure []. The FERM domain has also been called the amino-terminal domain, the 30kDa domain, 4.1N30, the membrane-cytoskeletal-linking domain, the ERM-like domain, the ezrin-like domain of the band 4.1 superfamily, the conserved N-terminal region, and the membrane attachment domain [].; PDB: 1EF1_B 1SGH_A 1E5W_A 2KC2_A 2KMA_A 3IVF_A 1H4R_B 3U8Z_A 1ISN_A 3BIN_A ....
Probab=24.67  E-value=49  Score=20.57  Aligned_cols=31  Identities=10%  Similarity=0.164  Sum_probs=28.0

Q ss_pred             ECCEEEEEEEecCCCceecCHHHHHHcCcee
Q psy18197          8 VNGVPVKAFVDSGAQTTIMSAECAERCAIMR   38 (144)
Q Consensus         8 InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~   38 (144)
                      .||....+-+|..|+..-+=..+++++|+.+
T Consensus         4 lD~~~~~~~v~~~~t~~~l~~~v~~~l~l~e   34 (80)
T PF09379_consen    4 LDGTTKTFEVDPKTTGQDLLEQVCDKLGLKE   34 (80)
T ss_dssp             SSEEEEEEEEETTSBHHHHHHHHHHHHTTSS
T ss_pred             cCCCcEEEEEcCCCcHHHHHHHHHHHcCCCC
Confidence            4788899999999999999999999999953


No 86 
>PF13510 Fer2_4:  2Fe-2S iron-sulfur cluster binding domain; PDB: 1Y56_A 3ADA_A 1VRQ_A 1X31_A 3AD9_A 3AD8_A 3AD7_A 2GAG_A 2GAH_A.
Probab=24.57  E-value=88  Score=20.08  Aligned_cols=28  Identities=29%  Similarity=0.514  Sum_probs=18.2

Q ss_pred             EEEEEECCEEEEEEEecCCCceecCHHHHHHcCc
Q psy18197          3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAI   36 (144)
Q Consensus         3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl   36 (144)
                      -|++.|||+++.+-  .|.  |++  +.+.+.|+
T Consensus         3 ~v~i~idG~~v~~~--~G~--til--~al~~~gi   30 (82)
T PF13510_consen    3 MVTITIDGKPVEVP--PGE--TIL--EALLAAGI   30 (82)
T ss_dssp             EEEEEETTEEEEEE--ET---BHH--HHHHHTT-
T ss_pred             EEEEEECCEEEEEc--CCC--HHH--HHHHHCCC
Confidence            37899999999885  344  454  34566666


No 87 
>PRK06083 sulfur carrier protein ThiS; Provisional
Probab=24.35  E-value=82  Score=20.65  Aligned_cols=30  Identities=17%  Similarity=0.352  Sum_probs=20.1

Q ss_pred             CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCc
Q psy18197          1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAI   36 (144)
Q Consensus         1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl   36 (144)
                      |..++++|||++..+  +.+.++    .++.+.+++
T Consensus        16 ~~~m~I~VNG~~~~~--~~~~tl----~~LL~~l~~   45 (84)
T PRK06083         16 MVLITISINDQSIQV--DISSSL----AQIIAQLSL   45 (84)
T ss_pred             CceEEEEECCeEEEc--CCCCcH----HHHHHHcCC
Confidence            457889999998876  444432    245566776


No 88 
>KOG1339|consensus
Probab=22.82  E-value=69  Score=26.63  Aligned_cols=27  Identities=15%  Similarity=0.122  Sum_probs=23.0

Q ss_pred             ceEeeechhheecEEEEcC-CCEEEEcc
Q psy18197         85 DMLLGLDMLRRHQCCIDLK-RNVLRIGT  111 (144)
Q Consensus        85 d~ILG~d~L~~~~~~iD~~-~~~l~~~~  111 (144)
                      ..|||--+++.+....|.. ++++-+..
T Consensus       364 ~~ilG~~~~~~~~~~~D~~~~~riGfa~  391 (398)
T KOG1339|consen  364 LWILGDVFQQNYLVVFDLGENSRVGFAP  391 (398)
T ss_pred             eEEEchHHhCCEEEEEeCCCCCEEEecc
Confidence            5899999999999999998 77776654


No 89 
>COG3617 Prophage antirepressor [Transcription]
Probab=22.21  E-value=1.2e+02  Score=22.74  Aligned_cols=33  Identities=15%  Similarity=0.248  Sum_probs=29.3

Q ss_pred             EEEECCEEEEEEEecCCCceecCHHHHHHcCcee
Q psy18197          5 NCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMR   38 (144)
Q Consensus         5 ~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~   38 (144)
                      +....++.+++++|-| +.-++-.++|+-||+..
T Consensus        11 ~f~F~~~~vRvv~~d~-epwFva~Dva~~Lg~~n   43 (176)
T COG3617          11 PFTFEDHAVRVVLDDG-EPWFVAADVARALGYTN   43 (176)
T ss_pred             cEeecCceEEEEEeCC-ceeEeHHHHHHHHCCCC
Confidence            3566788999999999 99999999999999964


Done!