Query psy18197
Match_columns 144
No_of_seqs 125 out of 760
Neff 7.3
Searched_HMMs 46136
Date Sat Aug 17 01:17:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18197.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18197hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0012|consensus 100.0 6.7E-32 1.5E-36 217.7 8.7 135 1-135 235-369 (380)
2 PF09668 Asp_protease: Asparty 100.0 7.9E-31 1.7E-35 186.4 12.1 101 1-101 24-124 (124)
3 cd05479 RP_DDI RP_DDI; retrope 100.0 7.2E-29 1.6E-33 176.5 14.5 109 1-109 16-124 (124)
4 cd05480 NRIP_C NRIP_C; putativ 100.0 5.5E-29 1.2E-33 169.2 10.3 101 4-104 1-103 (103)
5 PF08284 RVP_2: Retroviral asp 99.9 3.4E-23 7.5E-28 149.5 13.5 109 2-111 22-131 (135)
6 TIGR02281 clan_AA_DTGA clan AA 99.9 2.6E-20 5.7E-25 132.1 13.3 105 2-109 12-119 (121)
7 cd05484 retropepsin_like_LTR_2 99.8 8E-20 1.7E-24 123.0 11.9 90 2-94 1-91 (91)
8 PF13650 Asp_protease_2: Aspar 99.8 4.6E-18 1E-22 112.4 11.9 88 4-92 1-90 (90)
9 PF00077 RVP: Retroviral aspar 99.8 5.7E-18 1.2E-22 115.2 11.4 94 2-101 6-100 (100)
10 cd05483 retropepsin_like_bacte 99.7 3.2E-17 7E-22 109.4 11.0 92 1-94 2-96 (96)
11 TIGR03698 clan_AA_DTGF clan AA 99.7 1.2E-16 2.6E-21 111.1 11.1 100 3-107 1-107 (107)
12 cd06095 RP_RTVL_H_like Retrope 99.7 1.1E-16 2.4E-21 106.9 10.0 83 4-94 1-86 (86)
13 COG3577 Predicted aspartyl pro 99.6 4.1E-15 8.8E-20 113.1 10.0 104 2-109 106-213 (215)
14 cd00303 retropepsin_like Retro 99.5 1.1E-12 2.4E-17 83.0 10.9 90 4-94 1-92 (92)
15 cd05481 retropepsin_like_LTR_1 99.4 1.7E-12 3.7E-17 88.1 10.3 86 5-92 2-91 (93)
16 PF12384 Peptidase_A2B: Ty3 tr 99.3 4.6E-11 9.9E-16 88.3 11.7 93 3-97 36-130 (177)
17 PF13975 gag-asp_proteas: gag- 99.3 3.1E-11 6.6E-16 78.1 7.5 41 2-42 9-49 (72)
18 COG5550 Predicted aspartyl pro 99.1 6.7E-10 1.4E-14 78.5 9.9 94 13-111 27-121 (125)
19 cd06094 RP_Saci_like RP_Saci_l 99.1 6.8E-10 1.5E-14 74.5 6.9 79 12-97 9-88 (89)
20 cd05482 HIV_retropepsin_like R 99.0 2.3E-09 5E-14 72.0 8.8 86 5-94 2-87 (87)
21 PF02160 Peptidase_A3: Caulifl 98.9 1.3E-08 2.9E-13 77.8 10.3 103 2-113 5-119 (201)
22 PF05585 DUF1758: Putative pep 98.7 9.6E-08 2.1E-12 70.5 8.0 29 11-39 11-39 (164)
23 PF12382 Peptidase_A2E: Retrot 97.7 0.00034 7.3E-09 48.2 7.6 85 13-100 48-134 (137)
24 PF00026 Asp: Eukaryotic aspar 97.3 0.0037 8E-08 49.5 10.4 87 2-91 2-113 (317)
25 PTZ00013 plasmepsin 4 (PM4); P 97.2 0.0031 6.7E-08 53.9 9.8 89 1-92 138-251 (450)
26 cd05470 pepsin_retropepsin_lik 97.1 0.0061 1.3E-07 41.1 8.3 84 4-90 1-109 (109)
27 PTZ00147 plasmepsin-1; Provisi 97.1 0.0063 1.4E-07 52.0 10.2 89 1-92 139-252 (453)
28 cd05476 pepsin_A_like_plant Ch 96.9 0.0072 1.6E-07 47.4 8.5 83 14-111 178-262 (265)
29 COG4067 Uncharacterized protei 96.9 0.0037 8.1E-08 46.0 6.0 100 6-106 31-157 (162)
30 cd05474 SAP_like SAPs, pepsin- 96.8 0.018 3.9E-07 45.4 10.3 73 2-91 3-79 (295)
31 cd06097 Aspergillopepsin_like 96.8 0.014 3.1E-07 45.9 9.2 89 2-92 1-114 (278)
32 cd05478 pepsin_A Pepsin A, asp 96.5 0.038 8.3E-07 44.4 10.0 88 2-92 11-122 (317)
33 cd05476 pepsin_A_like_plant Ch 96.4 0.027 5.8E-07 44.2 8.4 76 2-92 2-88 (265)
34 cd05487 renin_like Renin stimu 96.2 0.089 1.9E-06 42.5 10.8 88 2-92 9-121 (326)
35 PF05618 Zn_protease: Putative 96.2 0.018 3.8E-07 41.8 5.9 45 62-107 87-134 (138)
36 cd05477 gastricsin Gastricsins 96.0 0.083 1.8E-06 42.4 9.9 88 2-92 4-115 (318)
37 cd05488 Proteinase_A_fungi Fun 96.0 0.1 2.2E-06 42.0 10.1 88 2-92 11-122 (320)
38 cd05477 gastricsin Gastricsins 95.9 0.066 1.4E-06 43.0 8.9 102 6-111 188-317 (318)
39 PTZ00165 aspartyl protease; Pr 95.9 0.1 2.2E-06 45.0 10.3 89 2-92 121-237 (482)
40 cd05478 pepsin_A Pepsin A, asp 95.9 0.17 3.8E-06 40.6 11.0 101 6-110 194-316 (317)
41 cd06096 Plasmepsin_5 Plasmepsi 95.7 0.046 1E-06 44.2 7.1 89 13-111 232-322 (326)
42 cd05472 cnd41_like Chloroplast 95.7 0.06 1.3E-06 42.8 7.6 79 2-92 2-89 (299)
43 cd05486 Cathespin_E Cathepsin 95.7 0.11 2.4E-06 41.7 9.1 88 2-92 1-112 (316)
44 cd06098 phytepsin Phytepsin, a 95.7 0.084 1.8E-06 42.5 8.4 89 2-92 11-123 (317)
45 cd05474 SAP_like SAPs, pepsin- 95.5 0.11 2.5E-06 40.8 8.5 96 13-111 179-294 (295)
46 cd06098 phytepsin Phytepsin, a 95.4 0.16 3.4E-06 40.9 9.3 90 6-109 197-315 (317)
47 cd05490 Cathepsin_D2 Cathepsin 95.4 0.24 5.2E-06 39.9 10.1 88 2-92 7-120 (325)
48 cd05485 Cathepsin_D_like Cathe 95.3 0.31 6.7E-06 39.5 10.8 101 6-110 199-328 (329)
49 cd05485 Cathepsin_D_like Cathe 95.2 0.21 4.6E-06 40.4 9.4 89 2-92 12-125 (329)
50 cd05473 beta_secretase_like Be 95.1 0.26 5.6E-06 40.5 9.7 88 2-92 4-113 (364)
51 cd05471 pepsin_like Pepsin-lik 95.1 0.049 1.1E-06 42.2 5.2 79 12-110 202-282 (283)
52 cd06096 Plasmepsin_5 Plasmepsi 94.8 0.18 4E-06 40.7 8.1 91 1-93 3-139 (326)
53 cd05475 nucellin_like Nucellin 94.8 0.22 4.8E-06 39.2 8.3 88 6-111 166-270 (273)
54 cd05488 Proteinase_A_fungi Fun 94.7 0.31 6.7E-06 39.2 9.1 101 6-110 194-319 (320)
55 PF03539 Spuma_A9PTase: Spumav 94.6 0.38 8.3E-06 35.4 8.3 80 8-97 1-84 (163)
56 cd06097 Aspergillopepsin_like 94.5 0.069 1.5E-06 42.1 4.8 85 6-109 186-276 (278)
57 cd05472 cnd41_like Chloroplast 94.3 0.21 4.6E-06 39.6 7.1 27 85-111 270-296 (299)
58 cd05475 nucellin_like Nucellin 93.7 0.45 9.8E-06 37.4 8.0 83 2-92 3-102 (273)
59 cd05471 pepsin_like Pepsin-lik 93.7 0.66 1.4E-05 35.8 8.8 91 2-94 1-115 (283)
60 PF00026 Asp: Eukaryotic aspar 93.6 0.18 4E-06 39.7 5.6 98 6-110 187-315 (317)
61 cd05486 Cathespin_E Cathepsin 93.2 0.54 1.2E-05 37.7 7.8 99 6-109 186-314 (316)
62 PLN03146 aspartyl protease fam 92.5 0.48 1E-05 40.2 6.8 27 85-111 399-425 (431)
63 cd05487 renin_like Renin stimu 91.6 3.4 7.4E-05 33.2 10.6 98 6-110 195-324 (326)
64 cd05490 Cathepsin_D2 Cathepsin 90.5 1.5 3.2E-05 35.3 7.4 93 13-109 207-323 (325)
65 PF03419 Peptidase_U4: Sporula 90.1 6.6 0.00014 31.5 10.8 33 2-34 158-201 (293)
66 PTZ00165 aspartyl protease; Pr 89.9 4.1 9E-05 35.2 10.0 27 85-111 419-445 (482)
67 PF14541 TAXi_C: Xylanase inhi 89.7 1.3 2.8E-05 32.2 5.9 28 83-110 133-160 (161)
68 cd05489 xylanase_inhibitor_I_l 89.5 4.1 9E-05 33.7 9.5 26 86-111 335-360 (362)
69 PTZ00147 plasmepsin-1; Provisi 87.5 2.8 6E-05 36.0 7.3 93 12-111 332-448 (453)
70 PTZ00013 plasmepsin 4 (PM4); P 86.3 6.4 0.00014 33.8 8.9 27 85-111 421-447 (450)
71 KOG1339|consensus 83.6 11 0.00023 31.5 8.9 89 2-92 47-181 (398)
72 cd05473 beta_secretase_like Be 78.9 2.9 6.3E-05 34.2 3.9 26 86-111 319-344 (364)
73 PF11925 DUF3443: Protein of u 77.8 6.4 0.00014 33.0 5.6 92 1-93 23-149 (370)
74 PF14543 TAXi_N: Xylanase inhi 76.5 4.3 9.2E-05 29.6 3.9 25 2-26 1-27 (164)
75 TIGR02854 spore_II_GA sigma-E 68.3 7.5 0.00016 31.3 3.8 34 2-35 159-203 (288)
76 PLN03146 aspartyl protease fam 58.9 12 0.00026 31.7 3.5 27 2-28 85-113 (431)
77 PF13856 Gifsy-2: ATP-binding 51.6 18 0.00038 24.0 2.8 19 4-22 22-40 (95)
78 cd06396 PB1_NBR1 The PB1 domai 44.8 38 0.00082 22.3 3.4 35 3-37 2-38 (81)
79 PRK08364 sulfur carrier protei 36.6 59 0.0013 20.2 3.3 33 1-37 2-36 (70)
80 cd06407 PB1_NLP A PB1 domain i 33.0 66 0.0014 20.9 3.2 35 3-37 2-36 (82)
81 PF13954 PapC_N: PapC N-termin 27.7 1.6E+02 0.0035 20.9 4.8 37 2-38 32-74 (146)
82 KOG3048|consensus 27.2 47 0.001 24.4 1.8 23 2-25 73-95 (153)
83 cd01813 UBP_N UBP ubiquitin pr 27.1 1E+02 0.0022 19.3 3.3 36 3-38 2-37 (74)
84 KOG0679|consensus 24.9 37 0.00079 29.0 1.0 20 9-28 151-170 (426)
85 PF09379 FERM_N: FERM N-termin 24.7 49 0.0011 20.6 1.4 31 8-38 4-34 (80)
86 PF13510 Fer2_4: 2Fe-2S iron-s 24.6 88 0.0019 20.1 2.7 28 3-36 3-30 (82)
87 PRK06083 sulfur carrier protei 24.3 82 0.0018 20.6 2.5 30 1-36 16-45 (84)
88 KOG1339|consensus 22.8 69 0.0015 26.6 2.3 27 85-111 364-391 (398)
89 COG3617 Prophage antirepressor 22.2 1.2E+02 0.0026 22.7 3.3 33 5-38 11-43 (176)
No 1
>KOG0012|consensus
Probab=99.97 E-value=6.7e-32 Score=217.69 Aligned_cols=135 Identities=63% Similarity=1.037 Sum_probs=129.9
Q ss_pred CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
|+||+|+|||+|++|.|||||+.|+||.++|+|||+.+.+++++.+++.|+|..++.|+++.++++||+..++|+|.|++
T Consensus 235 ML~iN~~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~c~ftV~d 314 (380)
T KOG0012|consen 235 MLYINCEINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLPCSFTVLD 314 (380)
T ss_pred EEEEEEEECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeeccceEEec
Confidence 89999999999999999999999999999999999999999999999999998899999999999999999999999999
Q ss_pred CCCCceEeeechhheecEEEEcCCCEEEEccCCeEEEeecCCCcCceeeccCCCC
Q psy18197 81 EQPMDMLLGLDMLRRHQCCIDLKRNVLRIGTTGTETRFLSESELPVCARLTNGVD 135 (144)
Q Consensus 81 ~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (144)
..+.|++||+|.|++|+|+||+++++|++++++.++||+...+.|...+++.--+
T Consensus 315 ~~~~d~llGLd~Lrr~~ccIdL~~~~L~ig~~~teiPfl~~~~lp~~~~~~~~~~ 369 (380)
T KOG0012|consen 315 RRDMDLLLGLDMLRRHQCCIDLKTNVLRIGNTETEIPFLPSNELPSHNKETGASP 369 (380)
T ss_pred CCCcchhhhHHHHHhccceeecccCeEEecCCCccccccccccCCccccccccCC
Confidence 9999999999999999999999999999999989999999999999998775443
No 2
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=99.97 E-value=7.9e-31 Score=186.42 Aligned_cols=101 Identities=68% Similarity=1.188 Sum_probs=86.2
Q ss_pred CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
|+||+|+|||++++|+|||||+.|+||.++|+|||+++..++++.+.+.|+|..++.|+++.+++++|+.+++++|.|++
T Consensus 24 mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~G~Ih~~~l~ig~~~~~~s~~Vle 103 (124)
T PF09668_consen 24 MLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKILGRIHSVQLKIGGLFFPCSFTVLE 103 (124)
T ss_dssp --EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEEEEEEEEEEEETTEEEEEEEEEET
T ss_pred eEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCceeEEEEEEEEEECCEEEEEEEEEeC
Confidence 79999999999999999999999999999999999999999999889999998999999999999999999999999999
Q ss_pred CCCCceEeeechhheecEEEE
Q psy18197 81 EQPMDMLLGLDMLRRHQCCID 101 (144)
Q Consensus 81 ~~~~d~ILG~d~L~~~~~~iD 101 (144)
+.+.|.|||+|||++++|+||
T Consensus 104 ~~~~d~llGld~L~~~~c~ID 124 (124)
T PF09668_consen 104 DQDVDLLLGLDMLKRHKCCID 124 (124)
T ss_dssp TSSSSEEEEHHHHHHTT-EEE
T ss_pred CCCcceeeeHHHHHHhCcccC
Confidence 889999999999999999998
No 3
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.96 E-value=7.2e-29 Score=176.52 Aligned_cols=109 Identities=66% Similarity=1.105 Sum_probs=98.1
Q ss_pred CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
++|++++|||++++|||||||++|+||+++|+++|++...+.+....+.|.|+....|.++.+++++|+++++++|.|++
T Consensus 16 ~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~i~~~~~~~~~~Vl~ 95 (124)
T cd05479 16 MLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVKIGNLFLPCSFTVLE 95 (124)
T ss_pred EEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEEECCEEeeeEEEEEC
Confidence 47999999999999999999999999999999999975444445446777777778899988999999999999999999
Q ss_pred CCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197 81 EQPMDMLLGLDMLRRHQCCIDLKRNVLRI 109 (144)
Q Consensus 81 ~~~~d~ILG~d~L~~~~~~iD~~~~~l~~ 109 (144)
...+|+|||||||+++++.|||++++|++
T Consensus 96 ~~~~d~ILG~d~L~~~~~~ID~~~~~i~~ 124 (124)
T cd05479 96 DDDVDFLIGLDMLKRHQCVIDLKENVLRI 124 (124)
T ss_pred CCCcCEEecHHHHHhCCeEEECCCCEEEC
Confidence 88999999999999999999999999985
No 4
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.96 E-value=5.5e-29 Score=169.18 Aligned_cols=101 Identities=23% Similarity=0.478 Sum_probs=94.8
Q ss_pred EEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecc-eeeeEeeccce-eEEEEEEEEEEEEEcCEEEeeeEEEecC
Q psy18197 4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDT-RWAGVAKGVGI-QRIIGRIHMVQVVIEKDFLTTSLSILEE 81 (144)
Q Consensus 4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~-~~~~~a~g~g~-~~~~g~i~~~~i~iG~~~~~~~~~V~~~ 81 (144)
|++++||+|++|+|||||++|+||+.+|+|+||.+..+. ++.+.|.|+|+ .++.|+++.++++||+.+++++|.|++.
T Consensus 1 vnCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~~~~~CSftVld~ 80 (103)
T cd05480 1 VSCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQLTVECSAQVVDD 80 (103)
T ss_pred CceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCCEEeeEEEEEEcC
Confidence 479999999999999999999999999999999877665 77789999997 6999999999999999999999999999
Q ss_pred CCCceEeeechhheecEEEEcCC
Q psy18197 82 QPMDMLLGLDMLRRHQCCIDLKR 104 (144)
Q Consensus 82 ~~~d~ILG~d~L~~~~~~iD~~~ 104 (144)
.+.|.+||+|.|++|+|.||+++
T Consensus 81 ~~~d~llGLdmLkrhqc~IdL~k 103 (103)
T cd05480 81 NEKNFSLGLQTLKSLKCVINLEK 103 (103)
T ss_pred CCcceEeeHHHHhhcceeeeccC
Confidence 89999999999999999999874
No 5
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=99.91 E-value=3.4e-23 Score=149.54 Aligned_cols=109 Identities=21% Similarity=0.305 Sum_probs=91.0
Q ss_pred eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeee-cceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLI-DTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~-~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
+...++|+++++.+|+||||++|+||+++|++++++... +.+.. +....+.....+.+..+.++++++.+..+|.|++
T Consensus 22 i~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~-V~~~g~~~~~~~~~~~~~~~i~g~~~~~dl~vl~ 100 (135)
T PF08284_consen 22 ITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIV-VSAPGGSINCEGVCPDVPLSIQGHEFVVDLLVLD 100 (135)
T ss_pred EEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeE-EecccccccccceeeeEEEEECCeEEEeeeEEec
Confidence 345689999999999999999999999999999995432 23332 3222233455666778999999999999999999
Q ss_pred CCCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 81 EQPMDMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 81 ~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
..++|+|||||||+++++.|||.+++++|..
T Consensus 101 l~~~DvILGm~WL~~~~~~IDw~~k~v~f~~ 131 (135)
T PF08284_consen 101 LGGYDVILGMDWLKKHNPVIDWATKTVTFNS 131 (135)
T ss_pred ccceeeEeccchHHhCCCEEEccCCEEEEeC
Confidence 9899999999999999999999999999876
No 6
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.85 E-value=2.6e-20 Score=132.12 Aligned_cols=105 Identities=21% Similarity=0.228 Sum_probs=80.0
Q ss_pred eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe-eeEEEec
Q psy18197 2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT-TSLSILE 80 (144)
Q Consensus 2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~-~~~~V~~ 80 (144)
++++++|||++++|||||||+.++||+++|++||++..... ......+.++......+...++++|+..+. +++.|++
T Consensus 12 ~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~-~~~~~~ta~G~~~~~~~~l~~l~iG~~~~~nv~~~v~~ 90 (121)
T TIGR02281 12 FYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLG-YTVTVSTANGQIKAARVTLDRVAIGGIVVNDVDAMVAE 90 (121)
T ss_pred EEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCC-ceEEEEeCCCcEEEEEEEeCEEEECCEEEeCcEEEEeC
Confidence 68999999999999999999999999999999999653211 111233333333345566789999998874 8999998
Q ss_pred CC-CCceEeeechhheec-EEEEcCCCEEEE
Q psy18197 81 EQ-PMDMLLGLDMLRRHQ-CCIDLKRNVLRI 109 (144)
Q Consensus 81 ~~-~~d~ILG~d~L~~~~-~~iD~~~~~l~~ 109 (144)
.. ..++|||||||++++ +.+|- ++|.+
T Consensus 91 ~~~~~~~LLGm~fL~~~~~~~~~~--~~l~l 119 (121)
T TIGR02281 91 GGALSESLLGMSFLNRLSRFTVRG--GKLIL 119 (121)
T ss_pred CCcCCceEcCHHHHhccccEEEEC--CEEEE
Confidence 65 358999999999997 66654 55555
No 7
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.83 E-value=8e-20 Score=123.00 Aligned_cols=90 Identities=26% Similarity=0.290 Sum_probs=74.7
Q ss_pred eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeee-EeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAG-VAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~-~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
.|++++|||++++|||||||++|+||++.++++|+++..+....- .+.| ...+..|.+ .+++++|+..+..+|.|++
T Consensus 1 ~~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G-~~~~~~G~~-~~~v~~~~~~~~~~~~v~~ 78 (91)
T cd05484 1 KTVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATG-TKLSVLGQI-LVTVKYGGKTKVLTLYVVK 78 (91)
T ss_pred CEEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCC-CEeeEeEEE-EEEEEECCEEEEEEEEEEE
Confidence 489999999999999999999999999999999986533332221 2333 346788887 6899999999999999999
Q ss_pred CCCCceEeeechhh
Q psy18197 81 EQPMDMLLGLDMLR 94 (144)
Q Consensus 81 ~~~~d~ILG~d~L~ 94 (144)
.. ++.|||+|||.
T Consensus 79 ~~-~~~lLG~~wl~ 91 (91)
T cd05484 79 NE-GLNLLGRDWLD 91 (91)
T ss_pred CC-CCCccChhhcC
Confidence 76 99999999984
No 8
>PF13650 Asp_protease_2: Aspartyl protease
Probab=99.78 E-value=4.6e-18 Score=112.42 Aligned_cols=88 Identities=23% Similarity=0.366 Sum_probs=68.5
Q ss_pred EEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEE-eeeEEEec-C
Q psy18197 4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFL-TTSLSILE-E 81 (144)
Q Consensus 4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~-~~~~~V~~-~ 81 (144)
|+++|||++++||+||||+.++|++++|+++|++..... ......+.++..........++++|+..+ ++++.+.+ .
T Consensus 1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~i~ig~~~~~~~~~~v~~~~ 79 (90)
T PF13650_consen 1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKS-VPISVSGAGGSVTVYRGRVDSITIGGITLKNVPFLVVDLG 79 (90)
T ss_pred CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCc-eeEEEEeCCCCEEEEEEEEEEEEECCEEEEeEEEEEECCC
Confidence 589999999999999999999999999999999643222 11134455443333444556999999887 78899998 5
Q ss_pred CCCceEeeech
Q psy18197 82 QPMDMLLGLDM 92 (144)
Q Consensus 82 ~~~d~ILG~d~ 92 (144)
...|+|||+||
T Consensus 80 ~~~~~iLG~df 90 (90)
T PF13650_consen 80 DPIDGILGMDF 90 (90)
T ss_pred CCCEEEeCCcC
Confidence 68999999998
No 9
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=99.77 E-value=5.7e-18 Score=115.22 Aligned_cols=94 Identities=27% Similarity=0.468 Sum_probs=77.7
Q ss_pred eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccce-eEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGI-QRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~-~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
++++++|||++++|||||||++|+|+++.+.+.+... .... ...|+|+ ....|.. ..++++++..+...|.|++
T Consensus 6 p~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~~--~~~~--~v~~~~g~~~~~~~~-~~~v~~~~~~~~~~~~v~~ 80 (100)
T PF00077_consen 6 PYITVKINGKKIKALLDTGADVSIISEKDWKKLGPPP--KTSI--TVRGAGGSSSILGST-TVEVKIGGKEFNHTFLVVP 80 (100)
T ss_dssp SEEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSEE--EEEE--EEEETTEEEEEEEEE-EEEEEETTEEEEEEEEESS
T ss_pred ceEEEeECCEEEEEEEecCCCcceecccccccccccc--cCCc--eeccCCCcceeeeEE-EEEEEEECccceEEEEecC
Confidence 5899999999999999999999999999998887642 1111 4456654 3566655 5799999999999999999
Q ss_pred CCCCceEeeechhheecEEEE
Q psy18197 81 EQPMDMLLGLDMLRRHQCCID 101 (144)
Q Consensus 81 ~~~~d~ILG~d~L~~~~~~iD 101 (144)
+.+.+ |||+|||+++++.|+
T Consensus 81 ~~~~~-ILG~D~L~~~~~~i~ 100 (100)
T PF00077_consen 81 DLPMN-ILGRDFLKKLNAVIN 100 (100)
T ss_dssp TCSSE-EEEHHHHTTTTCEEE
T ss_pred CCCCC-EeChhHHHHcCCEEC
Confidence 87778 999999999999875
No 10
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.74 E-value=3.2e-17 Score=109.42 Aligned_cols=92 Identities=27% Similarity=0.405 Sum_probs=69.8
Q ss_pred CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEE-eeeEEEe
Q psy18197 1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFL-TTSLSIL 79 (144)
Q Consensus 1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~-~~~~~V~ 79 (144)
+++++++|||++++|++||||+.++|++++|+++++........ ...+.++..........++++|+..+ +.++.++
T Consensus 2 ~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~--~~~~~~G~~~~~~~~~~~i~ig~~~~~~~~~~v~ 79 (96)
T cd05483 2 HFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKV--TVQTANGRVRAARVRLDSLQIGGITLRNVPAVVL 79 (96)
T ss_pred cEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcE--EEEecCCCccceEEEcceEEECCcEEeccEEEEe
Confidence 47999999999999999999999999999999999722112222 23333332223333467999999876 4789999
Q ss_pred cCCC--CceEeeechhh
Q psy18197 80 EEQP--MDMLLGLDMLR 94 (144)
Q Consensus 80 ~~~~--~d~ILG~d~L~ 94 (144)
+... .|+|||+|||+
T Consensus 80 d~~~~~~~gIlG~d~l~ 96 (96)
T cd05483 80 PGDALGVDGLLGMDFLR 96 (96)
T ss_pred CCcccCCceEeChHHhC
Confidence 9765 99999999985
No 11
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=99.72 E-value=1.2e-16 Score=111.07 Aligned_cols=100 Identities=18% Similarity=0.283 Sum_probs=75.0
Q ss_pred EEEEEECC------EEEEEEEecCCCcee-cCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeee
Q psy18197 3 FINCCVNG------VPVKAFVDSGAQTTI-MSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTS 75 (144)
Q Consensus 3 ~v~v~InG------~~v~~LvDTGA~~s~-Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~ 75 (144)
++++.+.+ .+++|||||||+..+ |++++|+++|+.+....+.. .|+|. ...-.+...++++|++.....
T Consensus 1 ~~~v~~~~p~~~~~~~v~~LVDTGat~~~~l~~~~a~~lgl~~~~~~~~~-tA~G~---~~~~~v~~~~v~igg~~~~~~ 76 (107)
T TIGR03698 1 TLDVELSNPKNPEFMEVRALVDTGFSGFLLVPPDIVNKLGLPELDQRRVY-LADGR---EVLTDVAKASIIINGLEIDAF 76 (107)
T ss_pred CEEEEEeCCCCCCceEEEEEEECCCCeEEecCHHHHHHcCCCcccCcEEE-ecCCc---EEEEEEEEEEEEECCEEEEEE
Confidence 35667744 389999999999997 99999999999764333322 55552 223345578999999887555
Q ss_pred EEEecCCCCceEeeechhheecEEEEcCCCEE
Q psy18197 76 LSILEEQPMDMLLGLDMLRRHQCCIDLKRNVL 107 (144)
Q Consensus 76 ~~V~~~~~~d~ILG~d~L~~~~~~iD~~~~~l 107 (144)
+.+.+..+ +.||||.||++++..|||++++|
T Consensus 77 v~~~~~~~-~~LLG~~~L~~l~l~id~~~~~~ 107 (107)
T TIGR03698 77 VESLGYVD-EPLLGTELLEGLGIVIDYRNQGL 107 (107)
T ss_pred EEecCCCC-ccEecHHHHhhCCEEEehhhCcC
Confidence 45445434 89999999999999999999864
No 12
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=99.71 E-value=1.1e-16 Score=106.88 Aligned_cols=83 Identities=24% Similarity=0.381 Sum_probs=63.3
Q ss_pred EEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeE--EEEEEEEE-EEEEcCEEEeeeEEEec
Q psy18197 4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQR--IIGRIHMV-QVVIEKDFLTTSLSILE 80 (144)
Q Consensus 4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~--~~g~i~~~-~i~iG~~~~~~~~~V~~ 80 (144)
++++|||+|++|||||||+.|+|+++.|+++.. ..... ...|+|+.. .... .. .+++|+++....+.+.+
T Consensus 1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~~~----~~~~~-~v~gagG~~~~~v~~--~~~~v~vg~~~~~~~~~v~~ 73 (86)
T cd06095 1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQEL----STTSV-LIRGVSGQSQQPVTT--YRTLVDLGGHTVSHSFLVVP 73 (86)
T ss_pred CEEEECCEEEEEEEECCCCeEEECHHHhhhccC----CCCcE-EEEeCCCcccccEEE--eeeEEEECCEEEEEEEEEEc
Confidence 478999999999999999999999999999822 11222 456666543 2222 23 59999999888888887
Q ss_pred CCCCceEeeechhh
Q psy18197 81 EQPMDMLLGLDMLR 94 (144)
Q Consensus 81 ~~~~d~ILG~d~L~ 94 (144)
.. .+.|||||||.
T Consensus 74 ~~-~~~lLG~dfL~ 86 (86)
T cd06095 74 NC-PDPLLGRDLLS 86 (86)
T ss_pred CC-CCcEechhhcC
Confidence 53 58999999985
No 13
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.61 E-value=4.1e-15 Score=113.08 Aligned_cols=104 Identities=24% Similarity=0.291 Sum_probs=78.8
Q ss_pred eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceee-ecceee-eEeeccceeEEEEEEEEEEEEEcCEEE-eeeEEE
Q psy18197 2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRL-IDTRWA-GVAKGVGIQRIIGRIHMVQVVIEKDFL-TTSLSI 78 (144)
Q Consensus 2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~-~~~~~~-~~a~g~g~~~~~g~i~~~~i~iG~~~~-~~~~~V 78 (144)
++++.+|||++++|||||||+...++++.|+|+|+... .+.++. .+|+| ......++..+++||+.++ ++++.|
T Consensus 106 F~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG---~~~AA~V~Ld~v~IG~I~~~nV~A~V 182 (215)
T COG3577 106 FEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANG---RARAAPVTLDRVQIGGIRVKNVDAMV 182 (215)
T ss_pred EEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCC---ccccceEEeeeEEEccEEEcCchhhe
Confidence 57889999999999999999999999999999999542 122221 24555 2334467789999999765 689999
Q ss_pred ecCC-CCceEeeechhheecEEEEcCCCEEEE
Q psy18197 79 LEEQ-PMDMLLGLDMLRRHQCCIDLKRNVLRI 109 (144)
Q Consensus 79 ~~~~-~~d~ILG~d~L~~~~~~iD~~~~~l~~ 109 (144)
+++. -...||||+||++++. +..++..|.+
T Consensus 183 ~~~g~L~~sLLGMSfL~rL~~-fq~~g~~LiL 213 (215)
T COG3577 183 AEDGALDESLLGMSFLNRLSG-FQVDGDLLIL 213 (215)
T ss_pred ecCCccchhhhhHHHHhhccc-eEecCcEEEe
Confidence 9765 4569999999999973 3345555554
No 14
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=99.48 E-value=1.1e-12 Score=83.00 Aligned_cols=90 Identities=31% Similarity=0.445 Sum_probs=66.7
Q ss_pred EEEEECCEEEEEEEecCCCceecCHHHHHHcCc-eeeecceee-eEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecC
Q psy18197 4 INCCVNGVPVKAFVDSGAQTTIMSAECAERCAI-MRLIDTRWA-GVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEE 81 (144)
Q Consensus 4 v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl-~~~~~~~~~-~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~ 81 (144)
+++.+||+++++|+||||+.++++.+.+++++. ....+.... ..+.|. .....+......+.+++..+...+.+.+.
T Consensus 1 ~~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 79 (92)
T cd00303 1 LKGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLKVKGANGS-SVKTLGVILPVTIGIGGKTFTVDFYVLDL 79 (92)
T ss_pred CEEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceEEEecCCC-EeccCcEEEEEEEEeCCEEEEEEEEEEcC
Confidence 478899999999999999999999999999987 222222221 122221 12233333457888999888999999998
Q ss_pred CCCceEeeechhh
Q psy18197 82 QPMDMLLGLDMLR 94 (144)
Q Consensus 82 ~~~d~ILG~d~L~ 94 (144)
..+|.|||+|||+
T Consensus 80 ~~~~~ilG~~~l~ 92 (92)
T cd00303 80 LSYDVILGRPWLE 92 (92)
T ss_pred CCcCEEecccccC
Confidence 8999999999984
No 15
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=99.44 E-value=1.7e-12 Score=88.11 Aligned_cols=86 Identities=17% Similarity=0.180 Sum_probs=68.1
Q ss_pred EEEECC-EEEEEEEecCCCceecCHHHHHHcC---ceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 5 NCCVNG-VPVKAFVDSGAQTTIMSAECAERCA---IMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 5 ~v~InG-~~v~~LvDTGA~~s~Is~~~a~~lg---l~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
+.+||| ++++|++||||+.|+||.+.+++++ ..+..++...-.+.+-+.....|.+ .+.+++++..++++|.|++
T Consensus 2 ~~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~~~G~~-~~~v~~~~~~~~~~f~Vvd 80 (93)
T cd05481 2 DMKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIPVEGGV-KLKCRYRNPKYNLTFQVVK 80 (93)
T ss_pred ceEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEeeeeEEE-EEEEEECCcEEEEEEEEEC
Confidence 568999 9999999999999999999999998 4444445543344443346788874 5899999999999999999
Q ss_pred CCCCceEeeech
Q psy18197 81 EQPMDMLLGLDM 92 (144)
Q Consensus 81 ~~~~d~ILG~d~ 92 (144)
. +...|||.+.
T Consensus 81 ~-~~~~lLG~~~ 91 (93)
T cd05481 81 E-EGPPLLGAKA 91 (93)
T ss_pred C-CCCceEcccc
Confidence 5 3568899864
No 16
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=99.31 E-value=4.6e-11 Score=88.25 Aligned_cols=93 Identities=15% Similarity=0.309 Sum_probs=74.1
Q ss_pred EEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccce--eEEEEEEEEEEEEEcCEEEeeeEEEec
Q psy18197 3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGI--QRIIGRIHMVQVVIEKDFLTTSLSILE 80 (144)
Q Consensus 3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~--~~~~g~i~~~~i~iG~~~~~~~~~V~~ 80 (144)
.++..++|.+++.|+||||-.|+|+++++++|+|+......+ ...|+.+ ....-....+++.+++..+...++|.+
T Consensus 36 ~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~--~fRG~vs~~~~~tsEAv~ld~~i~n~~i~i~aYV~d 113 (177)
T PF12384_consen 36 IVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPF--RFRGFVSGESATTSEAVTLDFYIDNKLIDIAAYVTD 113 (177)
T ss_pred EEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCE--EEeeeccCCceEEEEeEEEEEEECCeEEEEEEEEec
Confidence 356788999999999999999999999999999976533332 3334321 222223445789999999999999999
Q ss_pred CCCCceEeeechhheec
Q psy18197 81 EQPMDMLLGLDMLRRHQ 97 (144)
Q Consensus 81 ~~~~d~ILG~d~L~~~~ 97 (144)
..+.|+|+|.++|+++.
T Consensus 114 ~m~~dlIIGnPiL~ryp 130 (177)
T PF12384_consen 114 NMDHDLIIGNPILDRYP 130 (177)
T ss_pred cCCcceEeccHHHhhhH
Confidence 99999999999999885
No 17
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=99.26 E-value=3.1e-11 Score=78.11 Aligned_cols=41 Identities=29% Similarity=0.470 Sum_probs=37.1
Q ss_pred eEEEEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecc
Q psy18197 2 LFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDT 42 (144)
Q Consensus 2 ~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~ 42 (144)
+++++.|||+++.+|+||||++||||.++|+|||++.....
T Consensus 9 ~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~ 49 (72)
T PF13975_consen 9 MYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPP 49 (72)
T ss_pred EEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCC
Confidence 68999999999999999999999999999999999654433
No 18
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.13 E-value=6.7e-10 Score=78.48 Aligned_cols=94 Identities=15% Similarity=0.228 Sum_probs=72.3
Q ss_pred EEEEEecCCC-ceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCCceEeeec
Q psy18197 13 VKAFVDSGAQ-TTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPMDMLLGLD 91 (144)
Q Consensus 13 v~~LvDTGA~-~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~ILG~d 91 (144)
..+|||||++ ..++++++|+++|+++....+.. .+.| + .....++...+++++.+......+.+..+.+ +||++
T Consensus 27 ~~~LiDTGFtg~lvlp~~vaek~~~~~~~~~~~~-~a~~--~-~v~t~V~~~~iki~g~e~~~~Vl~s~~~~~~-liG~~ 101 (125)
T COG5550 27 YDELIDTGFTGYLVLPPQVAEKLGLPLFSTIRIV-LADG--G-VVKTSVALATIKIDGVEKVAFVLASDNLPEP-LIGVN 101 (125)
T ss_pred eeeEEecCCceeEEeCHHHHHhcCCCccCChhhh-hhcC--C-EEEEEEEEEEEEECCEEEEEEEEccCCCccc-chhhh
Confidence 4559999999 89999999999999765433433 2222 2 3344567789999998877666666666666 99999
Q ss_pred hhheecEEEEcCCCEEEEcc
Q psy18197 92 MLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 92 ~L~~~~~~iD~~~~~l~~~~ 111 (144)
||+.++..+|+.+++|+-+.
T Consensus 102 ~lk~l~~~vn~~~g~LEk~~ 121 (125)
T COG5550 102 LLKLLGLVVNPKTGKLEKPP 121 (125)
T ss_pred hhhhccEEEcCCcceEeccc
Confidence 99999999999999998543
No 19
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=99.06 E-value=6.8e-10 Score=74.55 Aligned_cols=79 Identities=14% Similarity=0.182 Sum_probs=55.9
Q ss_pred EEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE-EEeeeEEEecCCCCceEeee
Q psy18197 12 PVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD-FLTTSLSILEEQPMDMLLGL 90 (144)
Q Consensus 12 ~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~-~~~~~~~V~~~~~~d~ILG~ 90 (144)
.++|||||||++|+|+....++. .+ +....-.|..-...++.|.. ...+.+|.. .+.-+|.|.+- ...|||.
T Consensus 9 ~~~fLVDTGA~vSviP~~~~~~~-~~---~~~~~l~AANgt~I~tyG~~-~l~ldlGlrr~~~w~FvvAdv--~~pIlGa 81 (89)
T cd06094 9 GLRFLVDTGAAVSVLPASSTKKS-LK---PSPLTLQAANGTPIATYGTR-SLTLDLGLRRPFAWNFVVADV--PHPILGA 81 (89)
T ss_pred CcEEEEeCCCceEeecccccccc-cc---CCceEEEeCCCCeEeeeeeE-EEEEEcCCCcEEeEEEEEcCC--CcceecH
Confidence 47999999999999999987763 21 22222122221235788854 578999985 67778888663 3489999
Q ss_pred chhheec
Q psy18197 91 DMLRRHQ 97 (144)
Q Consensus 91 d~L~~~~ 97 (144)
|||++++
T Consensus 82 DfL~~~~ 88 (89)
T cd06094 82 DFLQHYG 88 (89)
T ss_pred HHHHHcC
Confidence 9999886
No 20
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=99.04 E-value=2.3e-09 Score=71.99 Aligned_cols=86 Identities=15% Similarity=0.142 Sum_probs=60.1
Q ss_pred EEEECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCC
Q psy18197 5 NCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPM 84 (144)
Q Consensus 5 ~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~ 84 (144)
.++|+|+++++|+||||+.|+|+...+.+---. ...+. ...|+|+....-+.....+++.+......+.|.+..-.
T Consensus 2 ~~~i~g~~~~~llDTGAd~Tvi~~~~~p~~w~~--~~~~~--~i~GIGG~~~~~~~~~v~i~i~~~~~~g~vlv~~~~~P 77 (87)
T cd05482 2 TLYINGKLFEGLLDTGADVSIIAENDWPKNWPI--QPAPS--NLTGIGGAITPSQSSVLLLEIDGEGHLGTILVYVLSLP 77 (87)
T ss_pred EEEECCEEEEEEEccCCCCeEEcccccCCCCcc--CCCCe--EEEeccceEEEEEEeeEEEEEcCCeEEEEEEEccCCCc
Confidence 689999999999999999999998665443211 11111 45677654333334467899998887778888775234
Q ss_pred ceEeeechhh
Q psy18197 85 DMLLGLDMLR 94 (144)
Q Consensus 85 d~ILG~d~L~ 94 (144)
.-|||.|.|.
T Consensus 78 ~nllGRd~L~ 87 (87)
T cd05482 78 VNLWGRDILS 87 (87)
T ss_pred ccEEccccCC
Confidence 4799999874
No 21
>PF02160 Peptidase_A3: Cauliflower mosaic virus peptidase (A3); InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=98.92 E-value=1.3e-08 Score=77.75 Aligned_cols=103 Identities=19% Similarity=0.302 Sum_probs=71.5
Q ss_pred eEEEEEE--CC---EEEEEEEecCCCceecCHHH-----HHHcCceeeecceeeeEeeccc-e-eEEEEEEEEEEEEEcC
Q psy18197 2 LFINCCV--NG---VPVKAFVDSGAQTTIMSAEC-----AERCAIMRLIDTRWAGVAKGVG-I-QRIIGRIHMVQVVIEK 69 (144)
Q Consensus 2 ~~v~v~I--nG---~~v~~LvDTGA~~s~Is~~~-----a~~lgl~~~~~~~~~~~a~g~g-~-~~~~g~i~~~~i~iG~ 69 (144)
.|+.+++ .| ..+.++|||||+..+++... |+++.- +. ...|+. . ......+....+.|++
T Consensus 5 iyI~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~we~~~~------~i--~v~~an~~~~~i~~~~~~~~i~I~~ 76 (201)
T PF02160_consen 5 IYIKVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEYWEKSKK------PI--KVKGANGSIIQINKKAKNGKIQIAD 76 (201)
T ss_pred EEEEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHHHHhCCC------cE--EEEEecCCceEEEEEecCceEEEcc
Confidence 4566544 55 24799999999998876544 444332 11 223332 2 4555567778999999
Q ss_pred EEEeeeEEEecCCCCceEeeechhheecEEEEcCCCEEEEccCC
Q psy18197 70 DFLTTSLSILEEQPMDMLLGLDMLRRHQCCIDLKRNVLRIGTTG 113 (144)
Q Consensus 70 ~~~~~~~~V~~~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~~~ 113 (144)
..|..+..-.-..+.|+|||++|++.++.-+.+. .++.|+..+
T Consensus 77 ~~F~IP~iYq~~~g~d~IlG~NF~r~y~Pfiq~~-~~I~f~~~~ 119 (201)
T PF02160_consen 77 KIFRIPTIYQQESGIDIILGNNFLRLYEPFIQTE-DRIQFHKKG 119 (201)
T ss_pred EEEeccEEEEecCCCCEEecchHHHhcCCcEEEc-cEEEEEeCC
Confidence 9888775555446899999999999999888886 467776644
No 22
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=98.69 E-value=9.6e-08 Score=70.50 Aligned_cols=29 Identities=24% Similarity=0.421 Sum_probs=26.3
Q ss_pred EEEEEEEecCCCceecCHHHHHHcCceee
Q psy18197 11 VPVKAFVDSGAQTTIMSAECAERCAIMRL 39 (144)
Q Consensus 11 ~~v~~LvDTGA~~s~Is~~~a~~lgl~~~ 39 (144)
+++++|+||||+.|+|++++|++|+|+..
T Consensus 11 ~~~~~LlDsGSq~SfIt~~la~~L~L~~~ 39 (164)
T PF05585_consen 11 VEARALLDSGSQRSFITESLANKLNLPGT 39 (164)
T ss_pred EEEEEEEecCCchhHHhHHHHHHhCCCCC
Confidence 46899999999999999999999999653
No 23
>PF12382 Peptidase_A2E: Retrotransposon peptidase; InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=97.67 E-value=0.00034 Score=48.24 Aligned_cols=85 Identities=16% Similarity=0.272 Sum_probs=58.7
Q ss_pred EEEEEecCCCceecCHHHHHHcCceeeecceee-eEeec-cceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCCceEeee
Q psy18197 13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWA-GVAKG-VGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPMDMLLGL 90 (144)
Q Consensus 13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~-~~a~g-~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~ILG~ 90 (144)
+-+|+||||+.++|..+.++...++.. ++. .+.+| +=..++.-......|.+.|..+...|.|+...+.+.-+-.
T Consensus 48 ipclidtgaq~niiteetvrahklptr---pw~~sviyggvyp~kinrkt~kl~i~lngisikteflvvkkfshpaaisf 124 (137)
T PF12382_consen 48 IPCLIDTGAQVNIITEETVRAHKLPTR---PWSQSVIYGGVYPNKINRKTIKLNINLNGISIKTEFLVVKKFSHPAAISF 124 (137)
T ss_pred ceeEEccCceeeeeehhhhhhccCCCC---cchhheEeccccccccccceEEEEEEecceEEEEEEEEEEeccCcceEEE
Confidence 568999999999999999999888432 111 13333 2123333334456888999999999999987666676666
Q ss_pred chhheecEEE
Q psy18197 91 DMLRRHQCCI 100 (144)
Q Consensus 91 d~L~~~~~~i 100 (144)
..|...++.|
T Consensus 125 ttlydnniei 134 (137)
T PF12382_consen 125 TTLYDNNIEI 134 (137)
T ss_pred EEEeeCcEEe
Confidence 6666655544
No 24
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=97.29 E-value=0.0037 Score=49.48 Aligned_cols=87 Identities=20% Similarity=0.396 Sum_probs=56.8
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHc---C---c-eee-------ecceeeeEeeccceeEEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERC---A---I-MRL-------IDTRWAGVAKGVGIQRIIGRIHMVQV 65 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~l---g---l-~~~-------~~~~~~~~a~g~g~~~~~g~i~~~~i 65 (144)
+++++.|+ ++++++++|||++...+....+... . . ... ...... ...|.| . ..|......+
T Consensus 2 Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~~~~~~-~~y~~g-~-~~G~~~~D~v 78 (317)
T PF00026_consen 2 YYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQGKPFS-ISYGDG-S-VSGNLVSDTV 78 (317)
T ss_dssp EEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEEEEEEE-EEETTE-E-EEEEEEEEEE
T ss_pred eEEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccceeeee-eeccCc-c-cccccccceE
Confidence 57899997 8999999999999998875433332 1 1 000 011111 233332 2 7788888999
Q ss_pred EEcCEEEe-eeEEEecC--------CCCceEeeec
Q psy18197 66 VIEKDFLT-TSLSILEE--------QPMDMLLGLD 91 (144)
Q Consensus 66 ~iG~~~~~-~~~~V~~~--------~~~d~ILG~d 91 (144)
++|+.... ..|..+.. ...|++||+-
T Consensus 79 ~ig~~~~~~~~f~~~~~~~~~~~~~~~~~GilGLg 113 (317)
T PF00026_consen 79 SIGGLTIPNQTFGLADSYSGDPFSPIPFDGILGLG 113 (317)
T ss_dssp EETTEEEEEEEEEEEEEEESHHHHHSSSSEEEE-S
T ss_pred eeeeccccccceecccccccccccccccccccccc
Confidence 99998765 56666653 2579999998
No 25
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=97.23 E-value=0.0031 Score=53.89 Aligned_cols=89 Identities=16% Similarity=0.195 Sum_probs=58.7
Q ss_pred CeEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCcee--ee-----------cceeeeEeeccceeEEEEEEEEEEE
Q psy18197 1 MLFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMR--LI-----------DTRWAGVAKGVGIQRIIGRIHMVQV 65 (144)
Q Consensus 1 ~~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~--~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i 65 (144)
++|.+++|+ ++++++++||||+..-++..-|...++.. .- ...+. ...|.| ...|.+....+
T Consensus 138 ~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~~~~yd~s~SsT~~~~~~~~~-i~YG~G--sv~G~~~~Dtv 214 (450)
T PTZ00013 138 MFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSIKNLYDSSKSKSYEKDGTKVD-ITYGSG--TVKGFFSKDLV 214 (450)
T ss_pred EEEEEEEECCCCeEEEEEEeCCCCceEEecccCCccccccCCCccCccCcccccCCcEEE-EEECCc--eEEEEEEEEEE
Confidence 367888996 79999999999999999765444322210 00 01122 445544 36788888999
Q ss_pred EEcCEEEeeeEEEecC----------CCCceEeeech
Q psy18197 66 VIEKDFLTTSLSILEE----------QPMDMLLGLDM 92 (144)
Q Consensus 66 ~iG~~~~~~~~~V~~~----------~~~d~ILG~d~ 92 (144)
.+|+...+..|..+.. ...|+|||+.+
T Consensus 215 ~iG~~~~~~~f~~~~~~~~~~~~~~~~~~dGIlGLg~ 251 (450)
T PTZ00013 215 TLGHLSMPYKFIEVTDTDDLEPIYSSSEFDGILGLGW 251 (450)
T ss_pred EECCEEEccEEEEEEeccccccceecccccceecccC
Confidence 9999876655544421 24699999975
No 26
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=97.07 E-value=0.0061 Score=41.14 Aligned_cols=84 Identities=17% Similarity=0.256 Sum_probs=51.5
Q ss_pred EEEEECC--EEEEEEEecCCCceecCHHHHHHcCcee---e-----------ecceeeeEeeccceeEEEEEEEEEEEEE
Q psy18197 4 INCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMR---L-----------IDTRWAGVAKGVGIQRIIGRIHMVQVVI 67 (144)
Q Consensus 4 v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~---~-----------~~~~~~~~a~g~g~~~~~g~i~~~~i~i 67 (144)
+++.|+. +++.+++|||++...+..+-+..++... . ....+. ..++-|. ..|.+....+.+
T Consensus 1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~~~~~~~~~~sst~~~~~~~~~-~~Y~~g~--~~g~~~~D~v~i 77 (109)
T cd05470 1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYSHSSYDDPSASSTYSDNGCTFS-ITYGTGS--LSGGLSTDTVSI 77 (109)
T ss_pred CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCcccccccccCCcCCCCCCCCCCcEEE-EEeCCCe--EEEEEEEEEEEE
Confidence 4678876 8999999999999988776554443210 0 011111 3344332 345566789999
Q ss_pred cCEEE-eeeEEEecC--------CCCceEeee
Q psy18197 68 EKDFL-TTSLSILEE--------QPMDMLLGL 90 (144)
Q Consensus 68 G~~~~-~~~~~V~~~--------~~~d~ILG~ 90 (144)
|+..+ ...|.++.. ...|+|||+
T Consensus 78 g~~~~~~~~fg~~~~~~~~~~~~~~~~GilGL 109 (109)
T cd05470 78 GDIEVVGQAFGCATDEPGATFLPALFDGILGL 109 (109)
T ss_pred CCEEECCEEEEEEEecCCccccccccccccCC
Confidence 98764 345555542 246788885
No 27
>PTZ00147 plasmepsin-1; Provisional
Probab=97.07 E-value=0.0063 Score=52.02 Aligned_cols=89 Identities=15% Similarity=0.143 Sum_probs=58.4
Q ss_pred CeEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCce--eee-----------cceeeeEeeccceeEEEEEEEEEEE
Q psy18197 1 MLFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIM--RLI-----------DTRWAGVAKGVGIQRIIGRIHMVQV 65 (144)
Q Consensus 1 ~~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~--~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i 65 (144)
++|.+++|+ ++++.+++|||++..-++..-|...++. ..- ...+. ...|.| ...|.+..+.+
T Consensus 139 ~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~~~~yd~s~SsT~~~~~~~f~-i~Yg~G--svsG~~~~DtV 215 (453)
T PTZ00147 139 MSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCETKNLYDSSKSKTYEKDGTKVE-MNYVSG--TVSGFFSKDLV 215 (453)
T ss_pred EEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccCCCccCCccCcceEECCCEEE-EEeCCC--CEEEEEEEEEE
Confidence 367889997 7999999999999999875544322221 000 11121 345544 35788888999
Q ss_pred EEcCEEEeeeEEEecC----------CCCceEeeech
Q psy18197 66 VIEKDFLTTSLSILEE----------QPMDMLLGLDM 92 (144)
Q Consensus 66 ~iG~~~~~~~~~V~~~----------~~~d~ILG~d~ 92 (144)
++|+..++..|..+.. ...|+|||+-|
T Consensus 216 tiG~~~v~~qF~~~~~~~~f~~~~~~~~~DGILGLG~ 252 (453)
T PTZ00147 216 TIGNLSVPYKFIEVTDTNGFEPFYTESDFDGIFGLGW 252 (453)
T ss_pred EECCEEEEEEEEEEEeccCcccccccccccceecccC
Confidence 9999876655543321 24799999976
No 28
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=96.90 E-value=0.0072 Score=47.39 Aligned_cols=83 Identities=18% Similarity=0.193 Sum_probs=47.7
Q ss_pred EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEE-cCEEEeeeEEEecC-CCCceEeeec
Q psy18197 14 KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVI-EKDFLTTSLSILEE-QPMDMLLGLD 91 (144)
Q Consensus 14 ~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~i-G~~~~~~~~~V~~~-~~~d~ILG~d 91 (144)
.+++|||++.+.++++.. + +-.+ .-.+ +.. . .+.....-+ ......| +...+. .....|||..
T Consensus 178 ~ai~DTGTs~~~lp~~~~------P--~i~~--~f~~-~~~--~-~i~~~~y~~~~~~~~~C-~~~~~~~~~~~~ilG~~ 242 (265)
T cd05476 178 GTIIDSGTTLTYLPDPAY------P--DLTL--HFDG-GAD--L-ELPPENYFVDVGEGVVC-LAILSSSSGGVSILGNI 242 (265)
T ss_pred cEEEeCCCcceEcCcccc------C--CEEE--EECC-CCE--E-EeCcccEEEECCCCCEE-EEEecCCCCCcEEEChh
Confidence 489999999999999887 1 1111 1111 110 0 000001101 1111122 122332 3445899999
Q ss_pred hhheecEEEEcCCCEEEEcc
Q psy18197 92 MLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 92 ~L~~~~~~iD~~~~~l~~~~ 111 (144)
||+.+-...|++++++-+..
T Consensus 243 fl~~~~~vFD~~~~~iGfa~ 262 (265)
T cd05476 243 QQQNFLVEYDLENSRLGFAP 262 (265)
T ss_pred hcccEEEEEECCCCEEeeec
Confidence 99999999999999998755
No 29
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=96.86 E-value=0.0037 Score=45.99 Aligned_cols=100 Identities=22% Similarity=0.286 Sum_probs=65.8
Q ss_pred EEECCE--EEEEEEecCCCceecCHHHHHHcCc---ee----ee------------cceee---eEeeccce-eEEEEEE
Q psy18197 6 CCVNGV--PVKAFVDSGAQTTIMSAECAERCAI---MR----LI------------DTRWA---GVAKGVGI-QRIIGRI 60 (144)
Q Consensus 6 v~InG~--~v~~LvDTGA~~s~Is~~~a~~lgl---~~----~~------------~~~~~---~~a~g~g~-~~~~g~i 60 (144)
+.++|- .++|=+||||..|.++....++..- .. .. +.++. .+-.+-+. ..... +
T Consensus 31 ~~l~~l~~~~kAkiDTGA~TSsL~A~dI~~fkRdGe~WVRF~~~~~~~~~~~~~~~e~pvi~~ikvR~s~~~~~e~Rp-V 109 (162)
T COG4067 31 VSLPGLKIQLKAKIDTGAVTSSLSASDIERFKRDGERWVRFRLADTDNLDQRSEECEAPVIRKIKVRSSSGSRAERRP-V 109 (162)
T ss_pred EEcCcccceeeeeecccceeeeEEeecceeeeeCCceEEEEEeecccCccccceeeccceEEEEEEecCCCCcccccc-E
Confidence 556664 5899999999999998766554422 10 00 00000 01111111 12222 3
Q ss_pred EEEEEEEcCEEEeeeEEEecC--CCCceEeeechhheecEEEEcCCCE
Q psy18197 61 HMVQVVIEKDFLTTSLSILEE--QPMDMLLGLDMLRRHQCCIDLKRNV 106 (144)
Q Consensus 61 ~~~~i~iG~~~~~~~~~V~~~--~~~d~ILG~d~L~~~~~~iD~~~~~ 106 (144)
...++++|+...+..|...+. ..|++|||..+|+++.+.+|++..-
T Consensus 110 V~~~l~lG~~~~~~E~tLtDR~~m~Yp~LlGrk~l~~~~~~VDpSr~y 157 (162)
T COG4067 110 VRLTLCLGGRILPIEFTLTDRSNMRYPVLLGRKALRHFGAVVDPSRKY 157 (162)
T ss_pred EEEEEeeCCeeeeEEEEeecccccccceEecHHHHhhCCeEECchhhc
Confidence 347899999999999999985 3699999999999999999987653
No 30
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=96.83 E-value=0.018 Score=45.39 Aligned_cols=73 Identities=15% Similarity=0.212 Sum_probs=53.2
Q ss_pred eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe-eeEEE
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT-TSLSI 78 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~-~~~~V 78 (144)
+++++.|+. +++.+++|||++...+. .+. ...+-| ....|.+....+.+|+..++ ..|..
T Consensus 3 Y~~~i~iGtp~q~~~v~~DTgS~~~wv~---------------~~~-~~Y~~g-~~~~G~~~~D~v~~g~~~~~~~~fg~ 65 (295)
T cd05474 3 YSAELSVGTPPQKVTVLLDTGSSDLWVP---------------DFS-ISYGDG-TSASGTWGTDTVSIGGATVKNLQFAV 65 (295)
T ss_pred EEEEEEECCCCcEEEEEEeCCCCcceee---------------eeE-EEeccC-CcEEEEEEEEEEEECCeEecceEEEE
Confidence 678899988 88999999999999988 122 233322 35677777889999987653 45666
Q ss_pred ecC-CCCceEeeec
Q psy18197 79 LEE-QPMDMLLGLD 91 (144)
Q Consensus 79 ~~~-~~~d~ILG~d 91 (144)
+.. ...|+|||+-
T Consensus 66 ~~~~~~~~GilGLg 79 (295)
T cd05474 66 ANSTSSDVGVLGIG 79 (295)
T ss_pred EecCCCCcceeeEC
Confidence 653 4678999966
No 31
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=96.75 E-value=0.014 Score=45.93 Aligned_cols=89 Identities=15% Similarity=0.176 Sum_probs=56.9
Q ss_pred eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCce------eee-------c-ceeeeEeeccceeEEEEEEEEEEE
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIM------RLI-------D-TRWAGVAKGVGIQRIIGRIHMVQV 65 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~------~~~-------~-~~~~~~a~g~g~~~~~g~i~~~~i 65 (144)
++++++|+. +++.+++|||++...+...-+..+... +.. + ..+. ...|.| ....|.+....+
T Consensus 1 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~~~~~~~-i~Y~~G-~~~~G~~~~D~v 78 (278)
T cd06097 1 YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLLPGATWS-ISYGDG-SSASGIVYTDTV 78 (278)
T ss_pred CeeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceecCCcEEE-EEeCCC-CeEEEEEEEEEE
Confidence 478899988 899999999999999876544433221 000 0 1111 334433 236777778899
Q ss_pred EEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197 66 VIEKDFLT-TSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 66 ~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~ 92 (144)
.+|+..++ ..|.++.. ...|+|||+-+
T Consensus 79 ~ig~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 114 (278)
T cd06097 79 SIGGVEVPNQAIELATAVSASFFSDTASDGLLGLAF 114 (278)
T ss_pred EECCEEECCeEEEEEeecCccccccccccceeeecc
Confidence 99997653 34544432 35799999964
No 32
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=96.46 E-value=0.038 Score=44.39 Aligned_cols=88 Identities=16% Similarity=0.205 Sum_probs=55.4
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCcee--ee-----------cceeeeEeeccceeEEEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMR--LI-----------DTRWAGVAKGVGIQRIIGRIHMVQVV 66 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~--~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i~ 66 (144)
+|+++.|+ .+++.+++|||++...+...-+....... .. ...+. ...|.|. ..|.+....++
T Consensus 11 Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~~~~~f~~~~Sst~~~~~~~~~-~~yg~gs--~~G~~~~D~v~ 87 (317)
T cd05478 11 YYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACSNHNRFNPRQSSTYQSTGQPLS-IQYGTGS--MTGILGYDTVQ 87 (317)
T ss_pred EEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCcccccccCcCCCCCCcceeeCCcEEE-EEECCce--EEEEEeeeEEE
Confidence 57889997 58899999999999999754433222210 00 11111 3344333 46777788999
Q ss_pred EcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197 67 IEKDFLT-TSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 67 iG~~~~~-~~~~V~~~--------~~~d~ILG~d~ 92 (144)
+|+...+ ..|..... ...|+|||+.+
T Consensus 88 ig~~~i~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 122 (317)
T cd05478 88 VGGISDTNQIFGLSETEPGSFFYYAPFDGILGLAY 122 (317)
T ss_pred ECCEEECCEEEEEEEecCccccccccccceeeecc
Confidence 9997653 34544432 13689999864
No 33
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=96.36 E-value=0.027 Score=44.16 Aligned_cols=76 Identities=14% Similarity=0.214 Sum_probs=53.4
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE--EE-eeeE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD--FL-TTSL 76 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~--~~-~~~~ 76 (144)
++++++|+ .+++.+++|||++..-+.. . .+. ...+- +....|.+....+.+++. .. +..|
T Consensus 2 Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~-------~------~~~-~~Y~d-g~~~~G~~~~D~v~~g~~~~~~~~~~F 66 (265)
T cd05476 2 YLVTLSIGTPPQPFSLIVDTGSDLTWTQC-------C------SYE-YSYGD-GSSTSGVLATETFTFGDSSVSVPNVAF 66 (265)
T ss_pred eEEEEecCCCCcceEEEecCCCCCEEEcC-------C------ceE-eEeCC-CceeeeeEEEEEEEecCCCCccCCEEE
Confidence 57889998 6889999999999988864 1 121 23332 235677777789999987 44 3556
Q ss_pred EEecCC------CCceEeeech
Q psy18197 77 SILEEQ------PMDMLLGLDM 92 (144)
Q Consensus 77 ~V~~~~------~~d~ILG~d~ 92 (144)
.++... ..|+|||+.+
T Consensus 67 g~~~~~~~~~~~~~~GIlGLg~ 88 (265)
T cd05476 67 GCGTDNEGGSFGGADGILGLGR 88 (265)
T ss_pred EecccccCCccCCCCEEEECCC
Confidence 666532 4899999875
No 34
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=96.19 E-value=0.089 Score=42.47 Aligned_cols=88 Identities=15% Similarity=0.231 Sum_probs=55.1
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHHH----cCceeee-----------cceeeeEeeccceeEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAER----CAIMRLI-----------DTRWAGVAKGVGIQRIIGRIHMVQ 64 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~----lgl~~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~ 64 (144)
++++++|+ ++++++++|||++..-+...-+.. |.....- ...+. ...|.| ...|.+....
T Consensus 9 y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~SsT~~~~~~~~~-~~Yg~g--~~~G~~~~D~ 85 (326)
T cd05487 9 YYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTACVTHNLYDASDSSTYKENGTEFT-IHYASG--TVKGFLSQDI 85 (326)
T ss_pred EEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchhhcccCcCCCCCCeeeeECCEEEE-EEeCCc--eEEEEEeeeE
Confidence 57889997 789999999999999996533321 2110000 11111 344433 3678888889
Q ss_pred EEEcCEEEeeeEEEecC--------CCCceEeeech
Q psy18197 65 VVIEKDFLTTSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 65 i~iG~~~~~~~~~V~~~--------~~~d~ILG~d~ 92 (144)
+.+|+......|..+.. ...|+|||+-+
T Consensus 86 v~~g~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 121 (326)
T cd05487 86 VTVGGIPVTQMFGEVTALPAIPFMLAKFDGVLGMGY 121 (326)
T ss_pred EEECCEEeeEEEEEEEeccCCccceeecceEEecCC
Confidence 99998755433333321 24689999865
No 35
>PF05618 Zn_protease: Putative ATP-dependant zinc protease; InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=96.17 E-value=0.018 Score=41.76 Aligned_cols=45 Identities=20% Similarity=0.372 Sum_probs=31.2
Q ss_pred EEEEEEcCEEEeeeEEEecC--CCCceEee-echhheecEEEEcCCCEE
Q psy18197 62 MVQVVIEKDFLTTSLSILEE--QPMDMLLG-LDMLRRHQCCIDLKRNVL 107 (144)
Q Consensus 62 ~~~i~iG~~~~~~~~~V~~~--~~~d~ILG-~d~L~~~~~~iD~~~~~l 107 (144)
...+++|+......|...+. ..+++||| ..|| +.++.+|.++.-+
T Consensus 87 ~~~~~lg~~~~~~e~tL~dR~~m~yp~LlGrR~~l-~~~~lVD~s~~~~ 134 (138)
T PF05618_consen 87 ETTLCLGGKTWKIEFTLTDRSNMKYPMLLGRRNFL-RGRFLVDVSRSFL 134 (138)
T ss_dssp EEEEEETTEEEEEEEEEE-S--SS-SEEE-HHHHH-HTTEEEETT---T
T ss_pred EEEEEECCEEEEEEEEEcCCCcCcCCEEEEehHHh-cCCEEECCChhhc
Confidence 46899999999999999985 36899999 5555 4568888876543
No 36
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=96.04 E-value=0.083 Score=42.39 Aligned_cols=88 Identities=15% Similarity=0.288 Sum_probs=56.5
Q ss_pred eEEEEEECC--EEEEEEEecCCCceecCHHHHHH--cCceeee-----------cceeeeEeeccceeEEEEEEEEEEEE
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAER--CAIMRLI-----------DTRWAGVAKGVGIQRIIGRIHMVQVV 66 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~--lgl~~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i~ 66 (144)
++.++.|+. +++.+++|||++..-+...-+.. |+-.... ...+. ...|.|. ..|.+....+.
T Consensus 4 y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C~~~~~f~~~~SsT~~~~~~~~~-~~Yg~Gs--~~G~~~~D~i~ 80 (318)
T cd05477 4 YYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQACTNHTKFNPSQSSTYSTNGETFS-LQYGSGS--LTGIFGYDTVT 80 (318)
T ss_pred EEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccccccCCCCcccCCCceECCcEEE-EEECCcE--EEEEEEeeEEE
Confidence 678899975 89999999999999887543331 2110000 11122 3445443 46778888999
Q ss_pred EcCEEEe-eeEEEecCC--------CCceEeeech
Q psy18197 67 IEKDFLT-TSLSILEEQ--------PMDMLLGLDM 92 (144)
Q Consensus 67 iG~~~~~-~~~~V~~~~--------~~d~ILG~d~ 92 (144)
+|+..++ ..|.++... ..|+|||+-+
T Consensus 81 ~g~~~i~~~~Fg~~~~~~~~~~~~~~~~GilGLg~ 115 (318)
T cd05477 81 VQGIIITNQEFGLSETEPGTNFVYAQFDGILGLAY 115 (318)
T ss_pred ECCEEEcCEEEEEEEecccccccccceeeEeecCc
Confidence 9998764 456655431 3589999974
No 37
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=95.95 E-value=0.1 Score=42.03 Aligned_cols=88 Identities=16% Similarity=0.257 Sum_probs=55.3
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHHH--cCceee-----------ecceeeeEeeccceeEEEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAER--CAIMRL-----------IDTRWAGVAKGVGIQRIIGRIHMVQVV 66 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~--lgl~~~-----------~~~~~~~~a~g~g~~~~~g~i~~~~i~ 66 (144)
++++++|+ ++++.+++|||++...+...-+.. |..... ....+. ...|.| ...|.+....+.
T Consensus 11 Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C~~~~~y~~~~Sst~~~~~~~~~-~~y~~g--~~~G~~~~D~v~ 87 (320)
T cd05488 11 YFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIACFLHSKYDSSASSTYKANGTEFK-IQYGSG--SLEGFVSQDTLS 87 (320)
T ss_pred EEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCcccCCcceECCCCCcceeeCCCEEE-EEECCc--eEEEEEEEeEEE
Confidence 67889997 489999999999999886543332 211100 011121 334433 256877788999
Q ss_pred EcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197 67 IEKDFLT-TSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 67 iG~~~~~-~~~~V~~~--------~~~d~ILG~d~ 92 (144)
+|+..++ ..|..+.. ...|+|||+.+
T Consensus 88 ig~~~~~~~~f~~a~~~~g~~~~~~~~dGilGLg~ 122 (320)
T cd05488 88 IGDLTIKKQDFAEATSEPGLAFAFGKFDGILGLAY 122 (320)
T ss_pred ECCEEECCEEEEEEecCCCcceeeeeeceEEecCC
Confidence 9987653 34544421 13589999975
No 38
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=95.94 E-value=0.066 Score=43.00 Aligned_cols=102 Identities=19% Similarity=0.183 Sum_probs=60.2
Q ss_pred EEECCEEE-------EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEee--
Q psy18197 6 CCVNGVPV-------KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTT-- 74 (144)
Q Consensus 6 v~InG~~v-------~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~-- 74 (144)
++|||++. .++||||.+.+.++++.++++--. ...... ..+.-...+ ........+++++..+.+
T Consensus 188 i~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~--~~~~~~--~~~~~~~~C~~~~~~p~l~~~f~g~~~~v~~ 263 (318)
T cd05477 188 FQINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTLMQS--IGAQQD--QYGQYVVNCNNIQNLPTLTFTINGVSFPLPP 263 (318)
T ss_pred EEECCEEecccCCCceeeECCCCccEECCHHHHHHHHHH--hCCccc--cCCCEEEeCCccccCCcEEEEECCEEEEECH
Confidence 57788653 589999999999999998876321 011100 001000000 001123455566654432
Q ss_pred -----------eEEEecC------CCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 75 -----------SLSILEE------QPMDMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 75 -----------~~~V~~~------~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
-+.+.+. .....|||..||+..-...|+.++++-+..
T Consensus 264 ~~y~~~~~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~ig~a~ 317 (318)
T cd05477 264 SAYILQNNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSVYDLGNNQVGFAT 317 (318)
T ss_pred HHeEecCCCeEEEEEEecccCCCCCCceEEEcHHHhhheEEEEeCCCCEEeeee
Confidence 2233221 113489999999999999999999987643
No 39
>PTZ00165 aspartyl protease; Provisional
Probab=95.89 E-value=0.1 Score=45.03 Aligned_cols=89 Identities=17% Similarity=0.194 Sum_probs=55.9
Q ss_pred eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCceee--e----cceee-----------eEeeccceeEEEEEEEE
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMRL--I----DTRWA-----------GVAKGVGIQRIIGRIHM 62 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~~--~----~~~~~-----------~~a~g~g~~~~~g~i~~ 62 (144)
+|.++.|+. +++++++|||++..-++..-+...++..+ - .+.+. ...+|.|. ..|.+..
T Consensus 121 Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C~~~~~yd~s~SSTy~~~~~~~~~~~~~i~YGsGs--~~G~l~~ 198 (482)
T PTZ00165 121 YFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGCAPHRKFDPKKSSTYTKLKLGDESAETYIQYGTGE--CVLALGK 198 (482)
T ss_pred EEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCcccccccCCCCccccCCcEecCCCCccceEEEEeCCCc--EEEEEEE
Confidence 578899987 89999999999999887655443222110 0 01110 12345443 3466777
Q ss_pred EEEEEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197 63 VQVVIEKDFLT-TSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 63 ~~i~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~ 92 (144)
+.+++|+..++ ..|.++.. ...|+|||+-|
T Consensus 199 DtV~ig~l~i~~q~FG~a~~~s~~~f~~~~~DGILGLg~ 237 (482)
T PTZ00165 199 DTVKIGGLKVKHQSIGLAIEESLHPFADLPFDGLVGLGF 237 (482)
T ss_pred EEEEECCEEEccEEEEEEEeccccccccccccceeecCC
Confidence 89999997664 34444321 24799999986
No 40
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=95.85 E-value=0.17 Score=40.58 Aligned_cols=101 Identities=16% Similarity=0.126 Sum_probs=60.7
Q ss_pred EEECCEEE------EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEeee--
Q psy18197 6 CCVNGVPV------KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTTS-- 75 (144)
Q Consensus 6 v~InG~~v------~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~~-- 75 (144)
++|||+.+ .+++|||++.+.++++..+++--. . ..... ..+.-...+ ........+++++..+.++
T Consensus 194 v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~-~-~~~~~--~~~~~~~~C~~~~~~P~~~f~f~g~~~~i~~~ 269 (317)
T cd05478 194 VTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSD-I-GASQN--QNGEMVVNCSSISSMPDVVFTINGVQYPLPPS 269 (317)
T ss_pred EEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHH-h-CCccc--cCCcEEeCCcCcccCCcEEEEECCEEEEECHH
Confidence 67888865 589999999999999998876321 0 00000 001000000 0112234556666554322
Q ss_pred -----------EEEecC-CCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197 76 -----------LSILEE-QPMDMLLGLDMLRRHQCCIDLKRNVLRIG 110 (144)
Q Consensus 76 -----------~~V~~~-~~~d~ILG~d~L~~~~~~iD~~~~~l~~~ 110 (144)
..+.+. .....|||..||+..-.+.|++++++-+-
T Consensus 270 ~y~~~~~~~C~~~~~~~~~~~~~IlG~~fl~~~y~vfD~~~~~iG~A 316 (317)
T cd05478 270 AYILQDQGSCTSGFQSMGLGELWILGDVFIRQYYSVFDRANNKVGLA 316 (317)
T ss_pred HheecCCCEEeEEEEeCCCCCeEEechHHhcceEEEEeCCCCEEeec
Confidence 222222 12458999999999999999999998763
No 41
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=95.69 E-value=0.046 Score=44.20 Aligned_cols=89 Identities=13% Similarity=0.091 Sum_probs=50.5
Q ss_pred EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEE--EEEcCEEEeeeEEEecCCCCceEeee
Q psy18197 13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQ--VVIEKDFLTTSLSILEEQPMDMLLGL 90 (144)
Q Consensus 13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~--i~iG~~~~~~~~~V~~~~~~d~ILG~ 90 (144)
..++||||.+.++++++.++++--.- +.+.-...+ |. +.. +...+ ++..+. .+-+.... ...-.|||-
T Consensus 232 ~~aivDSGTs~~~lp~~~~~~l~~~~---P~i~~~f~~-g~-~~~--i~p~~y~~~~~~~--~c~~~~~~-~~~~~ILG~ 301 (326)
T cd06096 232 LGMLVDSGSTLSHFPEDLYNKINNFF---PTITIIFEN-NL-KID--WKPSSYLYKKESF--WCKGGEKS-VSNKPILGA 301 (326)
T ss_pred CCEEEeCCCCcccCCHHHHHHHHhhc---CcEEEEEcC-Cc-EEE--ECHHHhccccCCc--eEEEEEec-CCCceEECh
Confidence 45899999999999999999874211 111101111 10 000 00000 001111 01112222 223489999
Q ss_pred chhheecEEEEcCCCEEEEcc
Q psy18197 91 DMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 91 d~L~~~~~~iD~~~~~l~~~~ 111 (144)
.||+.+-.++|++++++-+..
T Consensus 302 ~flr~~y~vFD~~~~riGfa~ 322 (326)
T cd06096 302 SFFKNKQIIFDLDNNRIGFVE 322 (326)
T ss_pred HHhcCcEEEEECcCCEEeeEc
Confidence 999999999999999998754
No 42
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=95.68 E-value=0.06 Score=42.81 Aligned_cols=79 Identities=16% Similarity=0.277 Sum_probs=53.1
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE-EE-eeeEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD-FL-TTSLS 77 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~-~~-~~~~~ 77 (144)
+++++.|+ .+++.+++|||++..-+... .+ ..+. ..+|-| ....|......+++|+. .. +..|.
T Consensus 2 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~c~---~c-------~~~~-i~Yg~G-s~~~G~~~~D~v~ig~~~~~~~~~Fg 69 (299)
T cd05472 2 YVVTVGLGTPARDQTVIVDTGSDLTWVQCQ---PC-------CLYQ-VSYGDG-SYTTGDLATDTLTLGSSDVVPGFAFG 69 (299)
T ss_pred eEEEEecCCCCcceEEEecCCCCcccccCC---CC-------Ceee-eEeCCC-ceEEEEEEEEEEEeCCCCccCCEEEE
Confidence 68899998 58899999999998887432 11 1222 344433 34567777889999986 44 34555
Q ss_pred EecCC-----CCceEeeech
Q psy18197 78 ILEEQ-----PMDMLLGLDM 92 (144)
Q Consensus 78 V~~~~-----~~d~ILG~d~ 92 (144)
+.... ..|+|||+-+
T Consensus 70 ~~~~~~~~~~~~~GilGLg~ 89 (299)
T cd05472 70 CGHDNEGLFGGAAGLLGLGR 89 (299)
T ss_pred CCccCCCccCCCCEEEECCC
Confidence 55421 4789999964
No 43
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=95.66 E-value=0.11 Score=41.70 Aligned_cols=88 Identities=17% Similarity=0.285 Sum_probs=54.5
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHH--HcCceeee-----------cceeeeEeeccceeEEEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE--RCAIMRLI-----------DTRWAGVAKGVGIQRIIGRIHMVQVV 66 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~--~lgl~~~~-----------~~~~~~~a~g~g~~~~~g~i~~~~i~ 66 (144)
+|.+++|+ ++++++++|||++..-+...-+. .|.-...- ...+. ...|-| ...|.+....++
T Consensus 1 Y~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C~~~~~y~~~~SsT~~~~~~~~~-i~Yg~g--~~~G~~~~D~v~ 77 (316)
T cd05486 1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQACTKHNRFQPSESSTYVSNGEAFS-IQYGTG--SLTGIIGIDQVT 77 (316)
T ss_pred CeEEEEECCCCcEEEEEEcCCCccEEEecCCCCCcccCccceECCCCCcccccCCcEEE-EEeCCc--EEEEEeeecEEE
Confidence 47889997 68999999999999888653332 12111000 01111 344544 356778788999
Q ss_pred EcCEEEe-eeEEEec--------CCCCceEeeech
Q psy18197 67 IEKDFLT-TSLSILE--------EQPMDMLLGLDM 92 (144)
Q Consensus 67 iG~~~~~-~~~~V~~--------~~~~d~ILG~d~ 92 (144)
+|+..++ ..|.+.. ....|+|||+-+
T Consensus 78 ig~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 112 (316)
T cd05486 78 VEGITVQNQQFAESVSEPGSTFQDSEFDGILGLAY 112 (316)
T ss_pred ECCEEEcCEEEEEeeccCcccccccccceEeccCc
Confidence 9986553 3444332 125699999964
No 44
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=95.65 E-value=0.084 Score=42.51 Aligned_cols=89 Identities=16% Similarity=0.206 Sum_probs=55.2
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHH---HcCceeee----cce------eeeEeeccceeEEEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE---RCAIMRLI----DTR------WAGVAKGVGIQRIIGRIHMVQVV 66 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~---~lgl~~~~----~~~------~~~~a~g~g~~~~~g~i~~~~i~ 66 (144)
++++++|+ ++++++++|||++...+...-+. .|.....- .+. ......|.|. ..|.+....+.
T Consensus 11 Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~--~~G~~~~D~v~ 88 (317)
T cd06098 11 YFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYKSSKSSTYKKNGTSASIQYGTGS--ISGFFSQDSVT 88 (317)
T ss_pred EEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccccccCcCCcccCCCcccCCCEEEEEcCCce--EEEEEEeeEEE
Confidence 57889997 68999999999998888654332 22211100 011 1113444443 46877788999
Q ss_pred EcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197 67 IEKDFLT-TSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 67 iG~~~~~-~~~~V~~~--------~~~d~ILG~d~ 92 (144)
+|+..++ ..|.++.. ...|+|||+-+
T Consensus 89 ig~~~v~~~~f~~~~~~~~~~~~~~~~dGilGLg~ 123 (317)
T cd06098 89 VGDLVVKNQVFIEATKEPGLTFLLAKFDGILGLGF 123 (317)
T ss_pred ECCEEECCEEEEEEEecCCccccccccceeccccc
Confidence 9997654 34444321 24689999975
No 45
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=95.49 E-value=0.11 Score=40.82 Aligned_cols=96 Identities=17% Similarity=0.232 Sum_probs=56.0
Q ss_pred EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEE-EEEEEEEcCEEEeee----------------
Q psy18197 13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRI-HMVQVVIEKDFLTTS---------------- 75 (144)
Q Consensus 13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i-~~~~i~iG~~~~~~~---------------- 75 (144)
..++||||++...++.+.++++--. ...... ...+.....+.-.. ....+++++..+.++
T Consensus 179 ~~~iiDSGt~~~~lP~~~~~~l~~~--~~~~~~-~~~~~~~~~C~~~~~p~i~f~f~g~~~~i~~~~~~~~~~~~~~~~~ 255 (295)
T cd05474 179 LPALLDSGTTLTYLPSDIVDAIAKQ--LGATYD-SDEGLYVVDCDAKDDGSLTFNFGGATISVPLSDLVLPASTDDGGDG 255 (295)
T ss_pred ccEEECCCCccEeCCHHHHHHHHHH--hCCEEc-CCCcEEEEeCCCCCCCEEEEEECCeEEEEEHHHhEeccccCCCCCC
Confidence 5789999999999999999876321 000000 00010000000000 234556666444221
Q ss_pred ---EEEecCCCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 76 ---LSILEEQPMDMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 76 ---~~V~~~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
+.+.+......|||..||+.+-...|.+++++-+.+
T Consensus 256 ~C~~~i~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~a~ 294 (295)
T cd05474 256 ACYLGIQPSTSDYNILGDTFLRSAYVVYDLDNNEISLAQ 294 (295)
T ss_pred CeEEEEEeCCCCcEEeChHHhhcEEEEEECCCCEEEeec
Confidence 222222224589999999999999999999997743
No 46
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=95.44 E-value=0.16 Score=40.91 Aligned_cols=90 Identities=12% Similarity=0.120 Sum_probs=54.0
Q ss_pred EEECCEE-------EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEE------
Q psy18197 6 CCVNGVP-------VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFL------ 72 (144)
Q Consensus 6 v~InG~~-------v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~------ 72 (144)
++|||+. ..+++|||.+...+++++++++... .+ + .... ......+++++..+
T Consensus 197 i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~~~--~~---C---~~~~------~~P~i~f~f~g~~~~l~~~~ 262 (317)
T cd06098 197 VLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQINSA--VD---C---NSLS------SMPNVSFTIGGKTFELTPEQ 262 (317)
T ss_pred EEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhhcc--CC---c---cccc------cCCcEEEEECCEEEEEChHH
Confidence 5788875 3589999999999999998877521 00 0 0000 00111222222211
Q ss_pred -----------eeeEEEe--c---CCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197 73 -----------TTSLSIL--E---EQPMDMLLGLDMLRRHQCCIDLKRNVLRI 109 (144)
Q Consensus 73 -----------~~~~~V~--~---~~~~d~ILG~d~L~~~~~~iD~~~~~l~~ 109 (144)
.|-+.+. + ......|||..||+.+-.+.|++++++-+
T Consensus 263 yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlGd~Flr~~y~VfD~~~~~iGf 315 (317)
T cd06098 263 YILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNLRVGF 315 (317)
T ss_pred eEEeecCCCCCEEeceEEECCCCCCCCCeEEechHHhcccEEEEeCCCCEEee
Confidence 1222221 1 11224799999999999999999998865
No 47
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=95.37 E-value=0.24 Score=39.87 Aligned_cols=88 Identities=15% Similarity=0.208 Sum_probs=54.7
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHH----HcCcee----ee-------cceeeeEeeccceeEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE----RCAIMR----LI-------DTRWAGVAKGVGIQRIIGRIHMVQ 64 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~----~lgl~~----~~-------~~~~~~~a~g~g~~~~~g~i~~~~ 64 (144)
+++++.|+ ++++.+++|||++..-+...-+. .|.... .. ...+. ...|.|. ..|.+....
T Consensus 7 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~C~~~~~y~~~~SsT~~~~~~~~~-i~Yg~G~--~~G~~~~D~ 83 (325)
T cd05490 7 YYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIACWLHHKYNSSKSSTYVKNGTEFA-IQYGSGS--LSGYLSQDT 83 (325)
T ss_pred EEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCccccCcCcCCcccCcceeeCCcEEE-EEECCcE--EEEEEeeeE
Confidence 57889997 48899999999999888543332 222110 00 01111 3455443 568888899
Q ss_pred EEEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197 65 VVIEKDFLT-TSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 65 i~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~ 92 (144)
+++|+..++ ..|..+.. ...|+|||+-+
T Consensus 84 v~~g~~~~~~~~Fg~~~~~~~~~~~~~~~dGilGLg~ 120 (325)
T cd05490 84 VSIGGLQVEGQLFGEAVKQPGITFIAAKFDGILGMAY 120 (325)
T ss_pred EEECCEEEcCEEEEEEeeccCCcccceeeeEEEecCC
Confidence 999997654 34444321 13689999854
No 48
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=95.34 E-value=0.31 Score=39.48 Aligned_cols=101 Identities=14% Similarity=0.122 Sum_probs=57.5
Q ss_pred EEECCEEE-----EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEeee---
Q psy18197 6 CCVNGVPV-----KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTTS--- 75 (144)
Q Consensus 6 v~InG~~v-----~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~~--- 75 (144)
++++++.. .++||||++.+.++++.++.+.-. ..... ...+.-...+ ........+++|+..+.++
T Consensus 199 i~v~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~--~~~~~--~~~~~~~~~C~~~~~~p~i~f~fgg~~~~i~~~~ 274 (329)
T cd05485 199 VSVGEGEFCSGGCQAIADTGTSLIAGPVDEIEKLNNA--IGAKP--IIGGEYMVNCSAIPSLPDITFVLGGKSFSLTGKD 274 (329)
T ss_pred EEECCeeecCCCcEEEEccCCcceeCCHHHHHHHHHH--hCCcc--ccCCcEEEeccccccCCcEEEEECCEEeEEChHH
Confidence 46676654 699999999999999988776321 01000 0001000000 0012234555565443221
Q ss_pred --------------EEEec-----CCCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197 76 --------------LSILE-----EQPMDMLLGLDMLRRHQCCIDLKRNVLRIG 110 (144)
Q Consensus 76 --------------~~V~~-----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~ 110 (144)
..+.. ......|||..||+.+-.+.|++++++-+-
T Consensus 275 yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlG~~fl~~~y~vFD~~~~~ig~a 328 (329)
T cd05485 275 YVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYYTEFDLGNNRVGFA 328 (329)
T ss_pred eEEEecCCCCCEEeeeEEECcCCCCCCCeEEEchHHhccceEEEeCCCCEEeec
Confidence 11121 112247999999999999999999998763
No 49
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=95.20 E-value=0.21 Score=40.40 Aligned_cols=89 Identities=12% Similarity=0.231 Sum_probs=55.2
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHH----HcCceeee----cceee------eEeeccceeEEEEEEEEEEE
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAE----RCAIMRLI----DTRWA------GVAKGVGIQRIIGRIHMVQV 65 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~----~lgl~~~~----~~~~~------~~a~g~g~~~~~g~i~~~~i 65 (144)
+++++.|+ ++++.+++|||++..-+...-+. .++..... .+... ....|.|. ..|.+....+
T Consensus 12 Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~Sst~~~~~~~~~i~Y~~g~--~~G~~~~D~v 89 (329)
T cd05485 12 YYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIACLLHNKYDSTKSSTYKKNGTEFAIQYGSGS--LSGFLSTDTV 89 (329)
T ss_pred EEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCccccCCCeECCcCCCCeEECCeEEEEEECCce--EEEEEecCcE
Confidence 67899998 58999999999999888654332 22221111 01110 12344443 5687878899
Q ss_pred EEcCEEEe-eeEEEecC--------CCCceEeeech
Q psy18197 66 VIEKDFLT-TSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 66 ~iG~~~~~-~~~~V~~~--------~~~d~ILG~d~ 92 (144)
.+|+..++ ..|.++.. ...|+|||+-+
T Consensus 90 ~ig~~~~~~~~fg~~~~~~~~~~~~~~~~GilGLg~ 125 (329)
T cd05485 90 SVGGVSVKGQTFAEAINEPGLTFVAAKFDGILGMGY 125 (329)
T ss_pred EECCEEECCEEEEEEEecCCccccccccceEEEcCC
Confidence 99987653 34544421 23589999875
No 50
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=95.09 E-value=0.26 Score=40.47 Aligned_cols=88 Identities=17% Similarity=0.190 Sum_probs=52.0
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcC-c-ee-------eecceeeeEeeccceeEEEEEEEEEEEEEcC-
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCA-I-MR-------LIDTRWAGVAKGVGIQRIIGRIHMVQVVIEK- 69 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lg-l-~~-------~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~- 69 (144)
+++++.|+ ++++.+++|||++..-+...-+.... . .+ .....+. ...|.|. ..|.+..+.+++|+
T Consensus 4 Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~~~~~~f~~~~SsT~~~~~~~~~-i~Yg~Gs--~~G~~~~D~v~ig~~ 80 (364)
T cd05473 4 YYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHPFIHTYFHRELSSTYRDLGKGVT-VPYTQGS--WEGELGTDLVSIPKG 80 (364)
T ss_pred eEEEEEecCCCceEEEEEecCCcceEEEcCCCccccccCCchhCcCcccCCceEE-EEECcce--EEEEEEEEEEEECCC
Confidence 57889997 68999999999998887654221111 0 00 0011122 3455443 46888889999985
Q ss_pred --EEEeeeEEEec--------CCCCceEeeech
Q psy18197 70 --DFLTTSLSILE--------EQPMDMLLGLDM 92 (144)
Q Consensus 70 --~~~~~~~~V~~--------~~~~d~ILG~d~ 92 (144)
..+...+.... ....|+|||+-+
T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~dGIlGLg~ 113 (364)
T cd05473 81 PNVTFRANIAAITESENFFLNGSNWEGILGLAY 113 (364)
T ss_pred CccceEEeeEEEeccccceecccccceeeeecc
Confidence 22332222221 124699999975
No 51
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=95.07 E-value=0.049 Score=42.16 Aligned_cols=79 Identities=16% Similarity=0.126 Sum_probs=48.5
Q ss_pred EEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeecccee--EEEEEEEEEEEEEcCEEEeeeEEEecCCCCceEee
Q psy18197 12 PVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQ--RIIGRIHMVQVVIEKDFLTTSLSILEEQPMDMLLG 89 (144)
Q Consensus 12 ~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~--~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~ILG 89 (144)
...+++|||++.+.+++++++.+--. ...... ........ ..........+++ ..|||
T Consensus 202 ~~~~iiDsGt~~~~lp~~~~~~l~~~--~~~~~~-~~~~~~~~~~~~~~~~p~i~f~f-----------------~~ilG 261 (283)
T cd05471 202 GGGAIVDSGTSLIYLPSSVYDAILKA--LGAAVS-SSDGGYGVDCSPCDTLPDITFTF-----------------LWILG 261 (283)
T ss_pred CcEEEEecCCCCEeCCHHHHHHHHHH--hCCccc-ccCCcEEEeCcccCcCCCEEEEE-----------------EEEcc
Confidence 46899999999999999999886431 111110 00000000 0000011111211 79999
Q ss_pred echhheecEEEEcCCCEEEEc
Q psy18197 90 LDMLRRHQCCIDLKRNVLRIG 110 (144)
Q Consensus 90 ~d~L~~~~~~iD~~~~~l~~~ 110 (144)
..||+..-...|.+++++-+.
T Consensus 262 ~~fl~~~y~vfD~~~~~igfa 282 (283)
T cd05471 262 DVFLRNYYTVFDLDNNRIGFA 282 (283)
T ss_pred HhhhhheEEEEeCCCCEEeec
Confidence 999999999999999988763
No 52
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=94.83 E-value=0.18 Score=40.69 Aligned_cols=91 Identities=16% Similarity=0.247 Sum_probs=56.5
Q ss_pred CeEEEEEEC--CEEEEEEEecCCCceecCHHHHHHcCcee--ee----cc-----------------------eeeeEee
Q psy18197 1 MLFINCCVN--GVPVKAFVDSGAQTTIMSAECAERCAIMR--LI----DT-----------------------RWAGVAK 49 (144)
Q Consensus 1 ~~~v~v~In--G~~v~~LvDTGA~~s~Is~~~a~~lgl~~--~~----~~-----------------------~~~~~a~ 49 (144)
++++++.|+ .+++.+++|||++..-++..-+..++... .- .+ .+. ...
T Consensus 3 ~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~~~~~~~-i~Y 81 (326)
T cd06096 3 YYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCYCLSCLNNKCEYS-ISY 81 (326)
T ss_pred eEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccccCcCCCCcCcEE-EEE
Confidence 478899998 58999999999999988766555554310 00 00 111 233
Q ss_pred ccceeEEEEEEEEEEEEEcCEEEe--------eeEEEecC-------CCCceEeeechh
Q psy18197 50 GVGIQRIIGRIHMVQVVIEKDFLT--------TSLSILEE-------QPMDMLLGLDML 93 (144)
Q Consensus 50 g~g~~~~~g~i~~~~i~iG~~~~~--------~~~~V~~~-------~~~d~ILG~d~L 93 (144)
|.| +...|....+.+.+|+...+ ..|..... ...|+|||+.+=
T Consensus 82 ~~g-s~~~G~~~~D~v~lg~~~~~~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~~ 139 (326)
T cd06096 82 SEG-SSISGFYFSDFVSFESYLNSNSEKESFKKIFGCHTHETNLFLTQQATGILGLSLT 139 (326)
T ss_pred CCC-CceeeEEEEEEEEeccCCCCccccccccEEeccCccccCcccccccceEEEccCC
Confidence 433 34677777889999875432 12222211 246899999863
No 53
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=94.83 E-value=0.22 Score=39.20 Aligned_cols=88 Identities=16% Similarity=0.082 Sum_probs=51.7
Q ss_pred EEECCEE-----EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcC--EE------E
Q psy18197 6 CCVNGVP-----VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEK--DF------L 72 (144)
Q Consensus 6 v~InG~~-----v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~--~~------~ 72 (144)
++|||++ ..++||||++.+.++++.. +++ -.+ ...+... ..++++-. .. .
T Consensus 166 i~vg~~~~~~~~~~~ivDTGTt~t~lp~~~y----~p~---i~~--~f~~~~~--------~~~~~l~~~~y~~~~~~~~ 228 (273)
T cd05475 166 LLFNGQPTGGKGLEVVFDSGSSYTYFNAQAY----FKP---LTL--KFGKGWR--------TRLLEIPPENYLIISEKGN 228 (273)
T ss_pred EEECCEECcCCCceEEEECCCceEEcCCccc----ccc---EEE--EECCCCc--------eeEEEeCCCceEEEcCCCC
Confidence 5778774 4689999999999998865 211 111 1111000 01122211 10 1
Q ss_pred eeeEEEecCC----CCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 73 TTSLSILEEQ----PMDMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 73 ~~~~~V~~~~----~~d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
.| +..++.. ....|||..||+.+-...|++++++-+..
T Consensus 229 ~C-l~~~~~~~~~~~~~~ilG~~~l~~~~~vfD~~~~riGfa~ 270 (273)
T cd05475 229 VC-LGILNGSEIGLGNTNIIGDISMQGLMVIYDNEKQQIGWVR 270 (273)
T ss_pred EE-EEEecCCCcCCCceEEECceEEEeeEEEEECcCCEeCccc
Confidence 23 2223221 12479999999999999999999887654
No 54
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=94.75 E-value=0.31 Score=39.22 Aligned_cols=101 Identities=14% Similarity=0.106 Sum_probs=58.1
Q ss_pred EEECCEE-----EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEE--EEEEEEEEEEEcCEEEee----
Q psy18197 6 CCVNGVP-----VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRI--IGRIHMVQVVIEKDFLTT---- 74 (144)
Q Consensus 6 v~InG~~-----v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~--~g~i~~~~i~iG~~~~~~---- 74 (144)
++|+++. ..+++|||++.+.+++++++.+.-. ...... ..+.-...+ ........+++++..+.+
T Consensus 194 i~vg~~~~~~~~~~~ivDSGtt~~~lp~~~~~~l~~~--~~~~~~--~~~~~~~~C~~~~~~P~i~f~f~g~~~~i~~~~ 269 (320)
T cd05488 194 IGLGDEELELENTGAAIDTGTSLIALPSDLAEMLNAE--IGAKKS--WNGQYTVDCSKVDSLPDLTFNFDGYNFTLGPFD 269 (320)
T ss_pred EEECCEEeccCCCeEEEcCCcccccCCHHHHHHHHHH--hCCccc--cCCcEEeeccccccCCCEEEEECCEEEEECHHH
Confidence 4666654 3689999999999999998875311 000000 001000000 011223455666654432
Q ss_pred ---------eEEEecC-----CCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197 75 ---------SLSILEE-----QPMDMLLGLDMLRRHQCCIDLKRNVLRIG 110 (144)
Q Consensus 75 ---------~~~V~~~-----~~~d~ILG~d~L~~~~~~iD~~~~~l~~~ 110 (144)
-..+... .....|||..||+..-.+.|+.+++|-+-
T Consensus 270 y~~~~~g~C~~~~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~iG~a 319 (320)
T cd05488 270 YTLEVSGSCISAFTGMDFPEPVGPLAIVGDAFLRKYYSVYDLGNNAVGLA 319 (320)
T ss_pred heecCCCeEEEEEEECcCCCCCCCeEEEchHHhhheEEEEeCCCCEEeec
Confidence 2222221 12358999999999999999999988763
No 55
>PF03539 Spuma_A9PTase: Spumavirus aspartic protease (A9); InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=94.61 E-value=0.38 Score=35.37 Aligned_cols=80 Identities=18% Similarity=0.230 Sum_probs=42.4
Q ss_pred ECCEEEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEecCCCCceE
Q psy18197 8 VNGVPVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSILEEQPMDML 87 (144)
Q Consensus 8 InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~~~~~~d~I 87 (144)
|-|.++++--||||++++|+..+.+. +..+.........| .....+.-+.++|.|+.+.+.+.-.+ +|-+
T Consensus 1 ikg~~l~~~wDsga~ITCiP~~fl~~---E~Pi~~~~i~Tihg----~~~~~vYYl~fKi~grkv~aEVi~s~---~dy~ 70 (163)
T PF03539_consen 1 IKGTKLKGHWDSGAQITCIPESFLEE---EQPIGKTLIKTIHG----EKEQDVYYLTFKINGRKVEAEVIASP---YDYI 70 (163)
T ss_dssp ETTEEEEEEE-TT-SSEEEEGGGTTT------SEEEEEE-SS-----EEEEEEEEEEEEESS-EEEEEEEEES---SSSE
T ss_pred CCCceeeEEecCCCeEEEccHHHhCc---cccccceEEEEecC----ceeccEEEEEEEEcCeEEEEEEecCc---cceE
Confidence 45888999999999999999987432 11112222212333 22233445789999987766544443 3322
Q ss_pred e----eechhheec
Q psy18197 88 L----GLDMLRRHQ 97 (144)
Q Consensus 88 L----G~d~L~~~~ 97 (144)
| -.+|+.+..
T Consensus 71 li~p~diPw~~~~p 84 (163)
T PF03539_consen 71 LISPSDIPWYKKKP 84 (163)
T ss_dssp EE-TTT-HHHHS--
T ss_pred EEcccccccccCCC
Confidence 2 257888654
No 56
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=94.53 E-value=0.069 Score=42.06 Aligned_cols=85 Identities=15% Similarity=0.153 Sum_probs=50.0
Q ss_pred EEECCE------EEEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEeeeEEEe
Q psy18197 6 CCVNGV------PVKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLTTSLSIL 79 (144)
Q Consensus 6 v~InG~------~v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~~~~~V~ 79 (144)
++|+|+ ...++||||++...++++.++++--. ....... ...| . ..+ .|.-.+-.. .|.+
T Consensus 186 i~v~~~~~~~~~~~~~iiDSGTs~~~lP~~~~~~l~~~-l~g~~~~-~~~~--~----~~~-~C~~~~P~i----~f~~- 251 (278)
T cd06097 186 YTVGGDAPWSRSGFSAIADTGTTLILLPDAIVEAYYSQ-VPGAYYD-SEYG--G----WVF-PCDTTLPDL----SFAV- 251 (278)
T ss_pred EEECCcceeecCCceEEeecCCchhcCCHHHHHHHHHh-CcCCccc-CCCC--E----EEE-ECCCCCCCE----EEEE-
Confidence 456665 35789999999999999888776310 0000000 0001 0 001 111111111 1111
Q ss_pred cCCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197 80 EEQPMDMLLGLDMLRRHQCCIDLKRNVLRI 109 (144)
Q Consensus 80 ~~~~~d~ILG~d~L~~~~~~iD~~~~~l~~ 109 (144)
..|||-.||+.+-.+.|+.++++-+
T Consensus 252 -----~~ilGd~fl~~~y~vfD~~~~~ig~ 276 (278)
T cd06097 252 -----FSILGDVFLKAQYVVFDVGGPKLGF 276 (278)
T ss_pred -----EEEEcchhhCceeEEEcCCCceeee
Confidence 5799999999999999999998765
No 57
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=94.26 E-value=0.21 Score=39.64 Aligned_cols=27 Identities=7% Similarity=0.091 Sum_probs=24.2
Q ss_pred ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 85 DMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 85 d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
..|||..||+.+-.+.|++++++-+..
T Consensus 270 ~~ilG~~fl~~~~vvfD~~~~~igfa~ 296 (299)
T cd05472 270 LSIIGNVQQQTFRVVYDVAGGRIGFAP 296 (299)
T ss_pred CEEEchHHccceEEEEECCCCEEeEec
Confidence 379999999999999999999998754
No 58
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=93.75 E-value=0.45 Score=37.42 Aligned_cols=83 Identities=14% Similarity=0.228 Sum_probs=50.6
Q ss_pred eEEEEEEC--CEEEEEEEecCCCceecCHH-HHHHcCceeeecceeeeEeeccceeEEEEEEEEEEEEEcCE----EE-e
Q psy18197 2 LFINCCVN--GVPVKAFVDSGAQTTIMSAE-CAERCAIMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKD----FL-T 73 (144)
Q Consensus 2 ~~v~v~In--G~~v~~LvDTGA~~s~Is~~-~a~~lgl~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~----~~-~ 73 (144)
+++++.|+ .+++.+++|||++..-+.-+ .+..++. .+. ..+|-+ ....|.+..+.++++.. .. +
T Consensus 3 Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c~c------~~~-i~Ygd~-~~~~G~~~~D~v~~~~~~~~~~~~~ 74 (273)
T cd05475 3 YYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGCQC------DYE-IEYADG-GSSMGVLVTDIFSLKLTNGSRAKPR 74 (273)
T ss_pred eEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCCcC------ccE-eEeCCC-CceEEEEEEEEEEEeecCCCcccCC
Confidence 68899998 78899999999999988531 1222222 222 334422 24567777778888532 22 2
Q ss_pred eeEEEec---------CCCCceEeeech
Q psy18197 74 TSLSILE---------EQPMDMLLGLDM 92 (144)
Q Consensus 74 ~~~~V~~---------~~~~d~ILG~d~ 92 (144)
..|-... ....|+|||+-+
T Consensus 75 ~~Fgc~~~~~~~~~~~~~~~dGIlGLg~ 102 (273)
T cd05475 75 IAFGCGYDQQGPLLNPPPPTDGILGLGR 102 (273)
T ss_pred EEEEeeeccCCcccCCCccCCEEEECCC
Confidence 3333331 125789999953
No 59
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=93.72 E-value=0.66 Score=35.78 Aligned_cols=91 Identities=13% Similarity=0.192 Sum_probs=56.0
Q ss_pred eEEEEEECC--EEEEEEEecCCCceecCHHHHHHcCceee--------------ecceeeeEeeccceeEEEEEEEEEEE
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAERCAIMRL--------------IDTRWAGVAKGVGIQRIIGRIHMVQV 65 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~~lgl~~~--------------~~~~~~~~a~g~g~~~~~g~i~~~~i 65 (144)
+++++.|+. +++.+++|||++.+.+...-+..+..... ..........+.| ...|.+....+
T Consensus 1 Y~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~Y~~g--~~~g~~~~D~v 78 (283)
T cd05471 1 YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHPRFKYDSSKSSTYKDTGCTFSITYGDG--SVTGGLGTDTV 78 (283)
T ss_pred CEEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCCCCccCccCCceeecCCCEEEEEECCC--eEEEEEEEeEE
Confidence 467888876 58999999999999987766555532110 0011111333332 34555557899
Q ss_pred EEcCEEE-eeeEEEecC-------CCCceEeeechhh
Q psy18197 66 VIEKDFL-TTSLSILEE-------QPMDMLLGLDMLR 94 (144)
Q Consensus 66 ~iG~~~~-~~~~~V~~~-------~~~d~ILG~d~L~ 94 (144)
++++... ...|..+.. ...++|||+..=.
T Consensus 79 ~~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~~~ 115 (283)
T cd05471 79 TIGGLTIPNQTFGCATSESGDFSSSGFDGILGLGFPS 115 (283)
T ss_pred EECCEEEeceEEEEEeccCCcccccccceEeecCCcc
Confidence 9998754 344444442 3578999986643
No 60
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=93.65 E-value=0.18 Score=39.74 Aligned_cols=98 Identities=14% Similarity=0.149 Sum_probs=59.3
Q ss_pred EEECCE------EEEEEEecCCCceecCHHHHHHcC----ceeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe--
Q psy18197 6 CCVNGV------PVKAFVDSGAQTTIMSAECAERCA----IMRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT-- 73 (144)
Q Consensus 6 v~InG~------~v~~LvDTGA~~s~Is~~~a~~lg----l~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~-- 73 (144)
+.+++. ...+++|||++...++++..+++- ..... ........- . .......+++++..+.
T Consensus 187 i~i~~~~~~~~~~~~~~~Dtgt~~i~lp~~~~~~i~~~l~~~~~~-~~~~~~c~~---~---~~~p~l~f~~~~~~~~i~ 259 (317)
T PF00026_consen 187 ISIGGESVFSSSGQQAILDTGTSYIYLPRSIFDAIIKALGGSYSD-GVYSVPCNS---T---DSLPDLTFTFGGVTFTIP 259 (317)
T ss_dssp EEETTEEEEEEEEEEEEEETTBSSEEEEHHHHHHHHHHHTTEEEC-SEEEEETTG---G---GGSEEEEEEETTEEEEEE
T ss_pred ccccccccccccceeeecccccccccccchhhHHHHhhhcccccc-eeEEEeccc---c---cccceEEEeeCCEEEEec
Confidence 566665 368999999999999999887762 21110 000000000 0 0012234455554332
Q ss_pred ---------------eeEEEec----CCCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197 74 ---------------TSLSILE----EQPMDMLLGLDMLRRHQCCIDLKRNVLRIG 110 (144)
Q Consensus 74 ---------------~~~~V~~----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~ 110 (144)
|...+.. ......+||+.||+.+=..+|++++++-+-
T Consensus 260 ~~~~~~~~~~~~~~~C~~~i~~~~~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~A 315 (317)
T PF00026_consen 260 PSDYIFKIEDGNGGYCYLGIQPMDSSDDSDDWILGSPFLRNYYVVFDYENNRIGFA 315 (317)
T ss_dssp HHHHEEEESSTTSSEEEESEEEESSTTSSSEEEEEHHHHTTEEEEEETTTTEEEEE
T ss_pred chHhcccccccccceeEeeeecccccccCCceEecHHHhhceEEEEeCCCCEEEEe
Confidence 2222332 235679999999999999999999998764
No 61
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=93.24 E-value=0.54 Score=37.68 Aligned_cols=99 Identities=16% Similarity=0.060 Sum_probs=55.4
Q ss_pred EEECCEEE------EEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEE--EEEEEEEEEEcCEEEee---
Q psy18197 6 CCVNGVPV------KAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRII--GRIHMVQVVIEKDFLTT--- 74 (144)
Q Consensus 6 v~InG~~v------~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~--g~i~~~~i~iG~~~~~~--- 74 (144)
++|||+.+ .++||||++.+.++++.++++--. ..... ..|.-...+- .......+.+++..+.+
T Consensus 186 i~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l~~~--~~~~~---~~~~~~~~C~~~~~~p~i~f~f~g~~~~l~~~ 260 (316)
T cd05486 186 IQVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQLQNY--IGATA---TDGEYGVDCSTLSLMPSVTFTINGIPYSLSPQ 260 (316)
T ss_pred EEEecceEecCCCCEEEECCCcchhhcCHHHHHHHHHH--hCCcc---cCCcEEEeccccccCCCEEEEECCEEEEeCHH
Confidence 57777643 699999999999999887766210 00000 0000000000 01122344555543321
Q ss_pred --------------eEEEecC-----CCCceEeeechhheecEEEEcCCCEEEE
Q psy18197 75 --------------SLSILEE-----QPMDMLLGLDMLRRHQCCIDLKRNVLRI 109 (144)
Q Consensus 75 --------------~~~V~~~-----~~~d~ILG~d~L~~~~~~iD~~~~~l~~ 109 (144)
-+.+... .....|||-.||+.+-.+.|+.++++-+
T Consensus 261 ~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ILGd~flr~~y~vfD~~~~~IGf 314 (316)
T cd05486 261 AYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYYSVFDRGNNRVGF 314 (316)
T ss_pred HeEEecccCCCCEEeeEEEECCCCCCCCCeEEEchHHhcceEEEEeCCCCEeec
Confidence 1111111 1123799999999999999999998765
No 62
>PLN03146 aspartyl protease family protein; Provisional
Probab=92.50 E-value=0.48 Score=40.17 Aligned_cols=27 Identities=11% Similarity=0.051 Sum_probs=24.3
Q ss_pred ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 85 DMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 85 d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
..|||..+++.+.+..|.+++++-|..
T Consensus 399 ~~IlG~~~q~~~~vvyDl~~~~igFa~ 425 (431)
T PLN03146 399 IAIFGNLAQMNFLVGYDLESKTVSFKP 425 (431)
T ss_pred ceEECeeeEeeEEEEEECCCCEEeeec
Confidence 379999999999999999999998755
No 63
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=91.58 E-value=3.4 Score=33.23 Aligned_cols=98 Identities=12% Similarity=0.108 Sum_probs=55.9
Q ss_pred EEECCEEE------EEEEecCCCceecCHHHHHHcCc----eeeecceeeeEeeccceeEEEEEEEEEEEEEcCEEEe--
Q psy18197 6 CCVNGVPV------KAFVDSGAQTTIMSAECAERCAI----MRLIDTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT-- 73 (144)
Q Consensus 6 v~InG~~v------~~LvDTGA~~s~Is~~~a~~lgl----~~~~~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~-- 73 (144)
++|+++.+ .++||||++.+.++++.++++-- ... ...+.-...- ......+.+++|+..+.
T Consensus 195 i~vg~~~~~~~~~~~aiiDSGts~~~lP~~~~~~l~~~~~~~~~-~~~y~~~C~~------~~~~P~i~f~fgg~~~~v~ 267 (326)
T cd05487 195 VSVGSSTLLCEDGCTAVVDTGASFISGPTSSISKLMEALGAKER-LGDYVVKCNE------VPTLPDISFHLGGKEYTLS 267 (326)
T ss_pred EEECCEEEecCCCCEEEECCCccchhCcHHHHHHHHHHhCCccc-CCCEEEeccc------cCCCCCEEEEECCEEEEeC
Confidence 56676643 58999999999999998776521 100 0000000000 00112234455554332
Q ss_pred ---------------eeEEEec-----CCCCceEeeechhheecEEEEcCCCEEEEc
Q psy18197 74 ---------------TSLSILE-----EQPMDMLLGLDMLRRHQCCIDLKRNVLRIG 110 (144)
Q Consensus 74 ---------------~~~~V~~-----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~~ 110 (144)
|-..+.. ......|||..||+.+-.+.|+.++++-+-
T Consensus 268 ~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilG~~flr~~y~vfD~~~~~IGfA 324 (326)
T cd05487 268 SSDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFYTEFDRQNNRIGFA 324 (326)
T ss_pred HHHhEEeccCCCCCEEEEEEEeCCCCCCCCCeEEEehHHhhccEEEEeCCCCEEeee
Confidence 2122221 111247999999999999999999988663
No 64
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=90.46 E-value=1.5 Score=35.28 Aligned_cols=93 Identities=11% Similarity=0.074 Sum_probs=52.2
Q ss_pred EEEEEecCCCceecCHHHHHHcCceeeecceeeeEeeccceeEEE--EEEEEEEEEEcCEEEe-----------------
Q psy18197 13 VKAFVDSGAQTTIMSAECAERCAIMRLIDTRWAGVAKGVGIQRII--GRIHMVQVVIEKDFLT----------------- 73 (144)
Q Consensus 13 v~~LvDTGA~~s~Is~~~a~~lgl~~~~~~~~~~~a~g~g~~~~~--g~i~~~~i~iG~~~~~----------------- 73 (144)
..++||||.+.+.++++.++.+.-. ..... ...+.-...+. .......+.+|+..+.
T Consensus 207 ~~aiiDSGTt~~~~p~~~~~~l~~~-~~~~~---~~~~~~~~~C~~~~~~P~i~f~fgg~~~~l~~~~y~~~~~~~~~~~ 282 (325)
T cd05490 207 CEAIVDTGTSLITGPVEEVRALQKA-IGAVP---LIQGEYMIDCEKIPTLPVISFSLGGKVYPLTGEDYILKVSQRGTTI 282 (325)
T ss_pred CEEEECCCCccccCCHHHHHHHHHH-hCCcc---ccCCCEEecccccccCCCEEEEECCEEEEEChHHeEEeccCCCCCE
Confidence 4799999999999999988876321 00000 00110000000 0112234555554332
Q ss_pred eeEEEec-----CCCCceEeeechhheecEEEEcCCCEEEE
Q psy18197 74 TSLSILE-----EQPMDMLLGLDMLRRHQCCIDLKRNVLRI 109 (144)
Q Consensus 74 ~~~~V~~-----~~~~d~ILG~d~L~~~~~~iD~~~~~l~~ 109 (144)
|-+.+.. ......|||..||+.+-.+.|++++++-+
T Consensus 283 C~~~~~~~~~~~~~~~~~ilGd~flr~~y~vfD~~~~~IGf 323 (325)
T cd05490 283 CLSGFMGLDIPPPAGPLWILGDVFIGRYYTVFDRDNDRVGF 323 (325)
T ss_pred EeeEEEECCCCCCCCceEEEChHhheeeEEEEEcCCcEeec
Confidence 2111211 01224799999999999999999998865
No 65
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=90.09 E-value=6.6 Score=31.49 Aligned_cols=33 Identities=24% Similarity=0.389 Sum_probs=25.9
Q ss_pred eEEEEEECCEE--EEEEEecCCCc---------eecCHHHHHHc
Q psy18197 2 LFINCCVNGVP--VKAFVDSGAQT---------TIMSAECAERC 34 (144)
Q Consensus 2 ~~v~v~InG~~--v~~LvDTGA~~---------s~Is~~~a~~l 34 (144)
+-+++.++|++ +++++|||-+. .+++.+.++++
T Consensus 158 ~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~ 201 (293)
T PF03419_consen 158 YPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL 201 (293)
T ss_pred EEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence 34677889985 79999999764 57788888877
No 66
>PTZ00165 aspartyl protease; Provisional
Probab=89.87 E-value=4.1 Score=35.22 Aligned_cols=27 Identities=15% Similarity=0.255 Sum_probs=24.7
Q ss_pred ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 85 DMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 85 d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
..|||-.||+++-.+.|..++++-+-+
T Consensus 419 ~~ILGd~Flr~yy~VFD~~n~rIGfA~ 445 (482)
T PTZ00165 419 LFVLGNNFIRKYYSIFDRDHMMVGLVP 445 (482)
T ss_pred eEEEchhhheeEEEEEeCCCCEEEEEe
Confidence 479999999999999999999998865
No 67
>PF14541 TAXi_C: Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=89.65 E-value=1.3 Score=32.15 Aligned_cols=28 Identities=11% Similarity=0.077 Sum_probs=24.1
Q ss_pred CCceEeeechhheecEEEEcCCCEEEEc
Q psy18197 83 PMDMLLGLDMLRRHQCCIDLKRNVLRIG 110 (144)
Q Consensus 83 ~~d~ILG~d~L~~~~~~iD~~~~~l~~~ 110 (144)
..-.+||.-.++.+.+..|.+++++-+.
T Consensus 133 ~~~~viG~~~~~~~~v~fDl~~~~igF~ 160 (161)
T PF14541_consen 133 DGVSVIGNFQQQNYHVVFDLENGRIGFA 160 (161)
T ss_dssp SSSEEE-HHHCCTEEEEEETTTTEEEEE
T ss_pred CCcEEECHHHhcCcEEEEECCCCEEEEe
Confidence 4569999999999999999999999874
No 68
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=89.53 E-value=4.1 Score=33.67 Aligned_cols=26 Identities=15% Similarity=0.350 Sum_probs=23.2
Q ss_pred eEeeechhheecEEEEcCCCEEEEcc
Q psy18197 86 MLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 86 ~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
.|||.-+|+.+....|.+++++-+..
T Consensus 335 ~IlG~~~~~~~~vvyD~~~~riGfa~ 360 (362)
T cd05489 335 VVIGGHQMEDNLLVFDLEKSRLGFSS 360 (362)
T ss_pred EEEeeheecceEEEEECCCCEeeccc
Confidence 58999999999999999999987643
No 69
>PTZ00147 plasmepsin-1; Provisional
Probab=87.48 E-value=2.8 Score=35.99 Aligned_cols=93 Identities=13% Similarity=0.222 Sum_probs=53.6
Q ss_pred EEEEEEecCCCceecCHHHHHHcCc----eeee-cceeeeEeeccceeEEEEEEEEEEEEEcCEEEe-------------
Q psy18197 12 PVKAFVDSGAQTTIMSAECAERCAI----MRLI-DTRWAGVAKGVGIQRIIGRIHMVQVVIEKDFLT------------- 73 (144)
Q Consensus 12 ~v~~LvDTGA~~s~Is~~~a~~lgl----~~~~-~~~~~~~a~g~g~~~~~g~i~~~~i~iG~~~~~------------- 73 (144)
...+++|||.+.+.++++.++++-- .... .......... .......+.+++..+.
T Consensus 332 ~~~aIiDSGTsli~lP~~~~~ai~~~l~~~~~~~~~~y~~~C~~-------~~lP~~~f~f~g~~~~L~p~~yi~~~~~~ 404 (453)
T PTZ00147 332 KANVIVDSGTSVITVPTEFLNKFVESLDVFKVPFLPLYVTTCNN-------TKLPTLEFRSPNKVYTLEPEYYLQPIEDI 404 (453)
T ss_pred ceeEEECCCCchhcCCHHHHHHHHHHhCCeecCCCCeEEEeCCC-------CCCCeEEEEECCEEEEECHHHheeccccC
Confidence 3679999999999999998876421 1000 0000000000 0111233334443221
Q ss_pred ----eeEEEecC--CCCceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 74 ----TSLSILEE--QPMDMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 74 ----~~~~V~~~--~~~d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
|-+.+.+. .....|||..||+.+-.+.|+.++++-+-.
T Consensus 405 ~~~~C~~~i~~~~~~~~~~ILGd~FLr~~YtVFD~~n~rIGfA~ 448 (453)
T PTZ00147 405 GSALCMLNIIPIDLEKNTFILGDPFMRKYFTVFDYDNHTVGFAL 448 (453)
T ss_pred CCcEEEEEEEECCCCCCCEEECHHHhccEEEEEECCCCEEEEEE
Confidence 22223321 123479999999999999999999998765
No 70
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=86.31 E-value=6.4 Score=33.77 Aligned_cols=27 Identities=15% Similarity=0.338 Sum_probs=23.9
Q ss_pred ceEeeechhheecEEEEcCCCEEEEcc
Q psy18197 85 DMLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 85 d~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
..|||-.||+.+-.+.|+.++++-+-.
T Consensus 421 ~~ILGd~FLr~~Y~VFD~~n~rIGfA~ 447 (450)
T PTZ00013 421 TFILGDPFMRKYFTVFDYDKESVGFAI 447 (450)
T ss_pred CEEECHHHhccEEEEEECCCCEEEEEE
Confidence 479999999999999999999987643
No 71
>KOG1339|consensus
Probab=83.56 E-value=11 Score=31.52 Aligned_cols=89 Identities=15% Similarity=0.234 Sum_probs=53.9
Q ss_pred eEEEEEECC--EEEEEEEecCCCceecCHHHHH-HcCc-eee-ec------------ce-----------------eeeE
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIMSAECAE-RCAI-MRL-ID------------TR-----------------WAGV 47 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~~~a~-~lgl-~~~-~~------------~~-----------------~~~~ 47 (144)
+|+++.|+- +++.+++|||++..-++-.-+. .+.- ... .+ .. +. .
T Consensus 47 Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~~~~~~~f~p~~SSt~~~~~c~~~~c~~~~~~~~~~~~C~y~-i 125 (398)
T KOG1339|consen 47 YYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACYSQHNPIFDPSASSTYKSVGCSSPRCKSLPQSCSPNSSCPYS-I 125 (398)
T ss_pred cEEEEecCCCCeeeEEEEeCCCCceeeccccccccccccCCCccCccccccccccCCCCccccccccCcccCCcCceE-E
Confidence 577788865 6699999999999999876555 3322 100 00 01 22 3
Q ss_pred eeccceeEEEEEEEEEEEEEcCE---EE-eeeEEEecC--------CCCceEeeech
Q psy18197 48 AKGVGIQRIIGRIHMVQVVIEKD---FL-TTSLSILEE--------QPMDMLLGLDM 92 (144)
Q Consensus 48 a~g~g~~~~~g~i~~~~i~iG~~---~~-~~~~~V~~~--------~~~d~ILG~d~ 92 (144)
.+|-|+ ...|......+++++. .. ...|..... ...|+|||+-+
T Consensus 126 ~Ygd~~-~~~G~l~~Dtv~~~~~~~~~~~~~~FGc~~~~~g~~~~~~~~dGIlGLg~ 181 (398)
T KOG1339|consen 126 QYGDGS-STSGYLATDTVTFGGTTSLPVPNQTFGCGTNNPGSFGLFAAFDGILGLGR 181 (398)
T ss_pred EeCCCC-ceeEEEEEEEEEEccccccccccEEEEeeecCccccccccccceEeecCC
Confidence 455433 4677777788999873 21 133332211 35789999874
No 72
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=78.87 E-value=2.9 Score=34.24 Aligned_cols=26 Identities=8% Similarity=0.180 Sum_probs=24.4
Q ss_pred eEeeechhheecEEEEcCCCEEEEcc
Q psy18197 86 MLLGLDMLRRHQCCIDLKRNVLRIGT 111 (144)
Q Consensus 86 ~ILG~d~L~~~~~~iD~~~~~l~~~~ 111 (144)
.|||..||+.+-.+.|..++++-+..
T Consensus 319 ~ILG~~flr~~yvvfD~~~~rIGfa~ 344 (364)
T cd05473 319 TVIGAVIMEGFYVVFDRANKRVGFAV 344 (364)
T ss_pred eEEeeeeEcceEEEEECCCCEEeeEe
Confidence 79999999999999999999998866
No 73
>PF11925 DUF3443: Protein of unknown function (DUF3443); InterPro: IPR021847 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 400 to 434 amino acids in length. This protein has two conserved sequence motifs: NPV and DNNG.
Probab=77.82 E-value=6.4 Score=33.03 Aligned_cols=92 Identities=12% Similarity=0.218 Sum_probs=50.5
Q ss_pred CeEEEEEECC------EEE-EEEEecCCCc-eecCHHHHHHc--Cceeeec--ceeeeEeeccceeEEEEEEEEEEEEEc
Q psy18197 1 MLFINCCVNG------VPV-KAFVDSGAQT-TIMSAECAERC--AIMRLID--TRWAGVAKGVGIQRIIGRIHMVQVVIE 68 (144)
Q Consensus 1 ~~~v~v~InG------~~v-~~LvDTGA~~-s~Is~~~a~~l--gl~~~~~--~~~~~~a~g~g~~~~~g~i~~~~i~iG 68 (144)
|+|+.|+|.- +.+ .+|||||+.- .++++.+-.-| .|+.... .++. +.........-|.++...++||
T Consensus 23 ~p~VsVtVC~PGts~CqTIdnvlVDTGS~GLRi~~sAl~~~l~~~Lp~~t~~g~~la-EC~~F~sgytWGsVr~AdV~ig 101 (370)
T PF11925_consen 23 IPTVSVTVCAPGTSNCQTIDNVLVDTGSYGLRIFASALPSSLAGSLPQQTGGGAPLA-ECAQFASGYTWGSVRTADVTIG 101 (370)
T ss_pred ceeeEEEEeCCCCCCceeeCcEEEeccchhhhHHHhhhchhhhccCCcccCCCcchh-hhhhccCcccccceEEEEEEEc
Confidence 5677777732 333 7899999964 33333231222 2221110 0000 1111112356778889999999
Q ss_pred CEE-EeeeEEEecCC----------------------CCceEeeechh
Q psy18197 69 KDF-LTTSLSILEEQ----------------------PMDMLLGLDML 93 (144)
Q Consensus 69 ~~~-~~~~~~V~~~~----------------------~~d~ILG~d~L 93 (144)
++. ..+++.|+.+. ...+|||..-+
T Consensus 102 ge~A~~iPiQvI~D~~~~~~P~sC~~~g~~~~t~~~lgaNGILGIg~~ 149 (370)
T PF11925_consen 102 GETASSIPIQVIGDSAAPSVPSSCSNSGASMNTVADLGANGILGIGPF 149 (370)
T ss_pred CeeccccCEEEEcCCCCCCCCchhhcCCCCCCCcccccCceEEeecCC
Confidence 975 35667777431 35799999554
No 74
>PF14543 TAXi_N: Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=76.54 E-value=4.3 Score=29.65 Aligned_cols=25 Identities=20% Similarity=0.481 Sum_probs=20.7
Q ss_pred eEEEEEECC--EEEEEEEecCCCceec
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIM 26 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~I 26 (144)
+++++.|+. +++.+++|||++.+-+
T Consensus 1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~ 27 (164)
T PF14543_consen 1 YYVSVSIGTPPQPFSLVVDTGSDLTWV 27 (164)
T ss_dssp EEEEEECTCTTEEEEEEEETT-SSEEE
T ss_pred CEEEEEeCCCCceEEEEEECCCCceEE
Confidence 578888877 7899999999988876
No 75
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=68.34 E-value=7.5 Score=31.33 Aligned_cols=34 Identities=21% Similarity=0.508 Sum_probs=26.1
Q ss_pred eEEEEEECCEE--EEEEEecCCC---------ceecCHHHHHHcC
Q psy18197 2 LFINCCVNGVP--VKAFVDSGAQ---------TTIMSAECAERCA 35 (144)
Q Consensus 2 ~~v~v~InG~~--v~~LvDTGA~---------~s~Is~~~a~~lg 35 (144)
+-++++++|++ +++++|||-+ +.+++.+.++++-
T Consensus 159 ~~v~i~~~g~~~~~~alvDTGN~L~DPlT~~PV~Ive~~~~~~~l 203 (288)
T TIGR02854 159 YELEICLDGKKVTIKGFLDTGNQLRDPLTKLPVIVVEYDSLKSIL 203 (288)
T ss_pred EEEEEEECCEEEEEEEEEecCCcccCCCCCCCEEEEEHHHhhhhC
Confidence 34678889985 7999999976 4677888777763
No 76
>PLN03146 aspartyl protease family protein; Provisional
Probab=58.87 E-value=12 Score=31.71 Aligned_cols=27 Identities=22% Similarity=0.450 Sum_probs=22.5
Q ss_pred eEEEEEECC--EEEEEEEecCCCceecCH
Q psy18197 2 LFINCCVNG--VPVKAFVDSGAQTTIMSA 28 (144)
Q Consensus 2 ~~v~v~InG--~~v~~LvDTGA~~s~Is~ 28 (144)
+++++.|+. +++.+++|||++.+-+.-
T Consensus 85 Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~C 113 (431)
T PLN03146 85 YLMNISIGTPPVPILAIADTGSDLIWTQC 113 (431)
T ss_pred EEEEEEcCCCCceEEEEECCCCCcceEcC
Confidence 578888974 678999999999998854
No 77
>PF13856 Gifsy-2: ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A.
Probab=51.56 E-value=18 Score=24.04 Aligned_cols=19 Identities=26% Similarity=0.382 Sum_probs=15.5
Q ss_pred EEEEECCEEEEEEEecCCC
Q psy18197 4 INCCVNGVPVKAFVDSGAQ 22 (144)
Q Consensus 4 v~v~InG~~v~~LvDTGA~ 22 (144)
-++.|||+++++++|..-.
T Consensus 22 ~~~~InG~~~~~v~d~~~~ 40 (95)
T PF13856_consen 22 EEHTINGKPYRAVFDEPEL 40 (95)
T ss_dssp EEEEETTEEEEEEEGGGT-
T ss_pred CeEEECCEEEEEEECCchh
Confidence 3689999999999997654
No 78
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=44.75 E-value=38 Score=22.29 Aligned_cols=35 Identities=11% Similarity=0.062 Sum_probs=29.5
Q ss_pred EEEEEECCEEEEEEEecCC--CceecCHHHHHHcCce
Q psy18197 3 FINCCVNGVPVKAFVDSGA--QTTIMSAECAERCAIM 37 (144)
Q Consensus 3 ~v~v~InG~~v~~LvDTGA--~~s~Is~~~a~~lgl~ 37 (144)
.++++.||..++|.++..+ +..-+...+++++++.
T Consensus 2 ~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~ 38 (81)
T cd06396 2 NLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN 38 (81)
T ss_pred EEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC
Confidence 5788999999999999966 5566788899999985
No 79
>PRK08364 sulfur carrier protein ThiS; Provisional
Probab=36.59 E-value=59 Score=20.22 Aligned_cols=33 Identities=21% Similarity=0.178 Sum_probs=23.3
Q ss_pred CeEEEEEECCE--EEEEEEecCCCceecCHHHHHHcCce
Q psy18197 1 MLFINCCVNGV--PVKAFVDSGAQTTIMSAECAERCAIM 37 (144)
Q Consensus 1 ~~~v~v~InG~--~v~~LvDTGA~~s~Is~~~a~~lgl~ 37 (144)
|..+++++||+ +...=++.|++. .++.+.|++.
T Consensus 2 ~~mm~v~vng~~~~~~~~~~~~~tv----~~ll~~l~~~ 36 (70)
T PRK08364 2 MLMIRVKVIGRGIEKEIEWRKGMKV----ADILRAVGFN 36 (70)
T ss_pred ceEEEEEEeccccceEEEcCCCCcH----HHHHHHcCCC
Confidence 56789999999 444445766553 3677888874
No 80
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=33.01 E-value=66 Score=20.93 Aligned_cols=35 Identities=11% Similarity=0.040 Sum_probs=32.0
Q ss_pred EEEEEECCEEEEEEEecCCCceecCHHHHHHcCce
Q psy18197 3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIM 37 (144)
Q Consensus 3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~ 37 (144)
-|++..||..+++=+.+..+..-+-.++++++++.
T Consensus 2 ~vK~~~~~d~~r~~l~~~~~~~~L~~~i~~r~~~~ 36 (82)
T cd06407 2 RVKATYGEEKIRFRLPPSWGFTELKQEIAKRFKLD 36 (82)
T ss_pred EEEEEeCCeEEEEEcCCCCCHHHHHHHHHHHhCCC
Confidence 57888999999999999999999999999999984
No 81
>PF13954 PapC_N: PapC N-terminal domain; PDB: 2VQI_B 3FIP_A 3RFZ_E 3OHN_A 1ZDV_A 1ZE3_D 3BWU_D 1ZDX_A.
Probab=27.67 E-value=1.6e+02 Score=20.86 Aligned_cols=37 Identities=14% Similarity=0.181 Sum_probs=25.7
Q ss_pred eEEEEEECCEE-----EEEEEecCC-CceecCHHHHHHcCcee
Q psy18197 2 LFINCCVNGVP-----VKAFVDSGA-QTTIMSAECAERCAIMR 38 (144)
Q Consensus 2 ~~v~v~InG~~-----v~~LvDTGA-~~s~Is~~~a~~lgl~~ 38 (144)
+.+.+.|||+. ++|.-+... ..-++++++.+++|++.
T Consensus 32 Y~vdv~vN~~~~~~~~i~f~~~~~~~~~pClt~~~L~~~Gi~~ 74 (146)
T PF13954_consen 32 YSVDVYVNGKFIGRYDIEFINNDDGKLQPCLTPELLRQLGIKP 74 (146)
T ss_dssp EEEEEEETTEEEEEEEEEEEESSSTSEEEE-BHHHHHHTTB-G
T ss_pred EEEEEEECCeeeeeEEEEEEeCCCcceeEEeCHHHHHHcCCCc
Confidence 46889999974 455444333 45799999999999964
No 82
>KOG3048|consensus
Probab=27.16 E-value=47 Score=24.37 Aligned_cols=23 Identities=22% Similarity=0.322 Sum_probs=19.6
Q ss_pred eEEEEEECCEEEEEEEecCCCcee
Q psy18197 2 LFINCCVNGVPVKAFVDSGAQTTI 25 (144)
Q Consensus 2 ~~v~v~InG~~v~~LvDTGA~~s~ 25 (144)
+|||++++. +-++|||-|-..-+
T Consensus 73 lYVPGkl~d-~~k~lVDIGTGYyV 95 (153)
T KOG3048|consen 73 LYVPGKLSD-NSKFLVDIGTGYYV 95 (153)
T ss_pred eeccceecc-ccceeEeccCceEE
Confidence 799999987 78999999987655
No 83
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates. This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP). This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=27.12 E-value=1e+02 Score=19.34 Aligned_cols=36 Identities=6% Similarity=-0.136 Sum_probs=31.6
Q ss_pred EEEEEECCEEEEEEEecCCCceecCHHHHHHcCcee
Q psy18197 3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMR 38 (144)
Q Consensus 3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~ 38 (144)
-+.++.+|+.+..=+|..+++.-+-..++++.|+++
T Consensus 2 ~i~vk~~g~~~~v~v~~~~Tv~~lK~~i~~~tgvp~ 37 (74)
T cd01813 2 PVIVKWGGQEYSVTTLSEDTVLDLKQFIKTLTGVLP 37 (74)
T ss_pred EEEEEECCEEEEEEECCCCCHHHHHHHHHHHHCCCH
Confidence 478999999999889999999999999999988753
No 84
>KOG0679|consensus
Probab=24.93 E-value=37 Score=28.96 Aligned_cols=20 Identities=40% Similarity=0.453 Sum_probs=17.4
Q ss_pred CCEEEEEEEecCCCceecCH
Q psy18197 9 NGVPVKAFVDSGAQTTIMSA 28 (144)
Q Consensus 9 nG~~v~~LvDTGA~~s~Is~ 28 (144)
||++--++||.||+++-+++
T Consensus 151 ~GrstalVvDiGa~~~svsP 170 (426)
T KOG0679|consen 151 NGRSTALVVDIGATHTSVSP 170 (426)
T ss_pred cCCCceEEEEecCCCceeee
Confidence 88999999999999886654
No 85
>PF09379 FERM_N: FERM N-terminal domain ; InterPro: IPR018979 This domain is the N-terminal ubiquitin-like structural domain of the FERM domain. The FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane []. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus of the majority of FERM-containing proteins [, ], which includes: Band 4.1, which links the spectrin-actin cytoskeleton of erythrocytes to the plasma membrane. Ezrin, a component of the undercoat of the microvilli plasma membrane. Moesin, which is probably involved in binding major cytoskeletal structures to the plasma membrane. Radixin, which is involved in the binding of the barbed end of actin filaments to the plasma membrane in the undercoat of the cell- to-cell adherens junction. Talin, a cytoskeletal protein concentrated in regions of cell-substratum contact and, in lymphocytes, of cell-cell contacts. Filopodin, a slime mold protein that binds actin and which is involved in the control of cell motility and chemotaxis. Merlin (or schwannomin). Protein NBL4. Unconventional myosins X, VIIa and XV, which are mutated in congenital deafness. Focal-adhesion kinases (FAKs), cytoplasmic protein tyrosine kinases involved in signalling through integrins. Janus tyrosine kinases (JAKs), cytoplasmic tyrosine kinases that are non-covalently associated with the cytoplasmic tails of receptors for cytokines or polypeptidic hormones. Non-receptor tyrosine-protein kinase TYK2. Protein-tyrosine phosphatases PTPN3 and PTPN4, enzyme that appear to act at junctions between the membrane and the cytoskeleton. Protein-tyrosine phosphatases PTPN14 and PTP-D1, PTP-RL10 and PTP2E. Caenorhabditis elegans protein phosphatase ptp-1. Ezrin, moesin, and radixin are highly related proteins (ERM protein family), but the other proteins in which the FERM domain is found do not share any region of similarity outside of this domain. ERM proteins are made of three domains, the FERM domain, a central helical domain and a C-terminal tail domain, which binds F-actin. The amino-acid sequence of the FERM domain is highly conserved among ERM proteins and is responsible for membrane association by direct binding to the cytoplasmic domain or tail of integral membrane proteins. ERM proteins are regulated by an intramolecular association of the FERM and C-terminal tail domains that masks their binding sites for other molecules. For cytoskeleton-membrane cross-linking, the dormant molecules becomes activated and the FERM domain attaches to the membrane by binding specific membrane proteins, while the last 34 residues of the tail bind actin filaments. Aside from binding to membranes, the activated FERM domain of ERM proteins can also bind the guanine nucleotide dissociation inhibitor of Rho GTPase (RhoDGI), which suggests that in addition to functioning as a cross-linker, ERM proteins may influence Rho signalling pathways. The crystal structure of the FERM domain reveals that it is composed of three structural modules (F1, F2, and F3) that together form a compact clover-shaped structure []. The FERM domain has also been called the amino-terminal domain, the 30kDa domain, 4.1N30, the membrane-cytoskeletal-linking domain, the ERM-like domain, the ezrin-like domain of the band 4.1 superfamily, the conserved N-terminal region, and the membrane attachment domain [].; PDB: 1EF1_B 1SGH_A 1E5W_A 2KC2_A 2KMA_A 3IVF_A 1H4R_B 3U8Z_A 1ISN_A 3BIN_A ....
Probab=24.67 E-value=49 Score=20.57 Aligned_cols=31 Identities=10% Similarity=0.164 Sum_probs=28.0
Q ss_pred ECCEEEEEEEecCCCceecCHHHHHHcCcee
Q psy18197 8 VNGVPVKAFVDSGAQTTIMSAECAERCAIMR 38 (144)
Q Consensus 8 InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~ 38 (144)
.||....+-+|..|+..-+=..+++++|+.+
T Consensus 4 lD~~~~~~~v~~~~t~~~l~~~v~~~l~l~e 34 (80)
T PF09379_consen 4 LDGTTKTFEVDPKTTGQDLLEQVCDKLGLKE 34 (80)
T ss_dssp SSEEEEEEEEETTSBHHHHHHHHHHHHTTSS
T ss_pred cCCCcEEEEEcCCCcHHHHHHHHHHHcCCCC
Confidence 4788899999999999999999999999953
No 86
>PF13510 Fer2_4: 2Fe-2S iron-sulfur cluster binding domain; PDB: 1Y56_A 3ADA_A 1VRQ_A 1X31_A 3AD9_A 3AD8_A 3AD7_A 2GAG_A 2GAH_A.
Probab=24.57 E-value=88 Score=20.08 Aligned_cols=28 Identities=29% Similarity=0.514 Sum_probs=18.2
Q ss_pred EEEEEECCEEEEEEEecCCCceecCHHHHHHcCc
Q psy18197 3 FINCCVNGVPVKAFVDSGAQTTIMSAECAERCAI 36 (144)
Q Consensus 3 ~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl 36 (144)
-|++.|||+++.+- .|. |++ +.+.+.|+
T Consensus 3 ~v~i~idG~~v~~~--~G~--til--~al~~~gi 30 (82)
T PF13510_consen 3 MVTITIDGKPVEVP--PGE--TIL--EALLAAGI 30 (82)
T ss_dssp EEEEEETTEEEEEE--ET---BHH--HHHHHTT-
T ss_pred EEEEEECCEEEEEc--CCC--HHH--HHHHHCCC
Confidence 37899999999885 344 454 34566666
No 87
>PRK06083 sulfur carrier protein ThiS; Provisional
Probab=24.35 E-value=82 Score=20.65 Aligned_cols=30 Identities=17% Similarity=0.352 Sum_probs=20.1
Q ss_pred CeEEEEEECCEEEEEEEecCCCceecCHHHHHHcCc
Q psy18197 1 MLFINCCVNGVPVKAFVDSGAQTTIMSAECAERCAI 36 (144)
Q Consensus 1 ~~~v~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl 36 (144)
|..++++|||++..+ +.+.++ .++.+.+++
T Consensus 16 ~~~m~I~VNG~~~~~--~~~~tl----~~LL~~l~~ 45 (84)
T PRK06083 16 MVLITISINDQSIQV--DISSSL----AQIIAQLSL 45 (84)
T ss_pred CceEEEEECCeEEEc--CCCCcH----HHHHHHcCC
Confidence 457889999998876 444432 245566776
No 88
>KOG1339|consensus
Probab=22.82 E-value=69 Score=26.63 Aligned_cols=27 Identities=15% Similarity=0.122 Sum_probs=23.0
Q ss_pred ceEeeechhheecEEEEcC-CCEEEEcc
Q psy18197 85 DMLLGLDMLRRHQCCIDLK-RNVLRIGT 111 (144)
Q Consensus 85 d~ILG~d~L~~~~~~iD~~-~~~l~~~~ 111 (144)
..|||--+++.+....|.. ++++-+..
T Consensus 364 ~~ilG~~~~~~~~~~~D~~~~~riGfa~ 391 (398)
T KOG1339|consen 364 LWILGDVFQQNYLVVFDLGENSRVGFAP 391 (398)
T ss_pred eEEEchHHhCCEEEEEeCCCCCEEEecc
Confidence 5899999999999999998 77776654
No 89
>COG3617 Prophage antirepressor [Transcription]
Probab=22.21 E-value=1.2e+02 Score=22.74 Aligned_cols=33 Identities=15% Similarity=0.248 Sum_probs=29.3
Q ss_pred EEEECCEEEEEEEecCCCceecCHHHHHHcCcee
Q psy18197 5 NCCVNGVPVKAFVDSGAQTTIMSAECAERCAIMR 38 (144)
Q Consensus 5 ~v~InG~~v~~LvDTGA~~s~Is~~~a~~lgl~~ 38 (144)
+....++.+++++|-| +.-++-.++|+-||+..
T Consensus 11 ~f~F~~~~vRvv~~d~-epwFva~Dva~~Lg~~n 43 (176)
T COG3617 11 PFTFEDHAVRVVLDDG-EPWFVAADVARALGYTN 43 (176)
T ss_pred cEeecCceEEEEEeCC-ceeEeHHHHHHHHCCCC
Confidence 3566788999999999 99999999999999964
Done!