Query 043220
Match_columns 121
No_of_seqs 132 out of 586
Neff 6.8
Searched_HMMs 46136
Date Fri Mar 29 02:41:50 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/043220.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/043220hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF08284 RVP_2: Retroviral asp 99.9 4.7E-26 1E-30 161.3 8.2 86 33-120 13-126 (135)
2 cd05479 RP_DDI RP_DDI; retrope 99.9 3.3E-21 7.2E-26 134.2 8.7 82 38-120 13-121 (124)
3 cd05484 retropepsin_like_LTR_2 99.6 6.8E-16 1.5E-20 101.6 6.6 65 43-107 2-91 (91)
4 PF13975 gag-asp_proteas: gag- 99.6 1.2E-14 2.6E-19 92.6 7.9 61 36-96 3-64 (72)
5 TIGR02281 clan_AA_DTGA clan AA 99.6 2.1E-14 4.5E-19 99.8 9.3 83 36-118 6-116 (121)
6 PF13650 Asp_protease_2: Aspar 99.4 4.1E-13 8.8E-18 86.4 7.1 49 44-92 1-50 (90)
7 cd05483 retropepsin_like_bacte 99.4 2.4E-12 5.2E-17 83.5 8.1 68 40-107 1-96 (96)
8 TIGR03698 clan_AA_DTGF clan AA 99.4 2.8E-12 6.1E-17 87.4 7.1 67 51-120 15-106 (107)
9 COG3577 Predicted aspartyl pro 99.2 4.9E-11 1.1E-15 89.7 9.1 87 25-115 93-207 (215)
10 PF09668 Asp_protease: Asparty 99.2 1.3E-11 2.7E-16 86.6 5.1 77 38-115 21-124 (124)
11 PF00077 RVP: Retroviral aspar 99.2 4.6E-11 9.9E-16 79.2 4.4 68 41-112 5-98 (100)
12 cd06095 RP_RTVL_H_like Retrope 99.1 1.1E-10 2.5E-15 76.2 6.1 61 44-107 1-86 (86)
13 cd00303 retropepsin_like Retro 99.1 7.5E-10 1.6E-14 67.6 7.0 63 45-107 2-92 (92)
14 PF12384 Peptidase_A2B: Ty3 tr 99.0 2.1E-09 4.5E-14 78.6 7.3 87 25-111 14-131 (177)
15 cd05480 NRIP_C NRIP_C; putativ 98.8 6E-09 1.3E-13 70.5 4.7 71 45-117 2-102 (103)
16 PF02160 Peptidase_A3: Caulifl 98.5 2.9E-07 6.3E-12 69.3 6.2 75 42-118 7-111 (201)
17 cd06094 RP_Saci_like RP_Saci_l 98.5 1.2E-07 2.6E-12 62.9 3.6 55 52-110 9-88 (89)
18 KOG0012 DNA damage inducible p 98.4 2.5E-07 5.3E-12 74.7 3.8 82 37-119 231-339 (380)
19 PF05585 DUF1758: Putative pep 98.4 5.6E-07 1.2E-11 64.9 5.2 32 51-82 11-42 (164)
20 cd05481 retropepsin_like_LTR_1 98.0 1.1E-05 2.4E-10 53.5 5.1 50 46-95 3-56 (93)
21 COG5550 Predicted aspartyl pro 97.3 0.00077 1.7E-08 47.3 5.7 76 40-118 11-114 (125)
22 cd05482 HIV_retropepsin_like R 94.6 0.038 8.2E-07 36.4 2.9 25 45-69 2-26 (87)
23 cd05476 pepsin_A_like_plant Ch 94.4 0.087 1.9E-06 40.2 4.9 56 54-120 178-257 (265)
24 cd06096 Plasmepsin_5 Plasmepsi 94.1 0.08 1.7E-06 41.7 4.3 65 52-120 231-317 (326)
25 cd06097 Aspergillopepsin_like 91.9 0.46 1E-05 36.4 5.6 65 52-120 198-273 (278)
26 cd05471 pepsin_like Pepsin-lik 91.9 0.32 6.9E-06 36.5 4.5 66 51-120 201-278 (283)
27 PF12382 Peptidase_A2E: Retrot 91.8 0.18 3.9E-06 34.6 2.8 40 42-81 35-76 (137)
28 cd06098 phytepsin Phytepsin, a 89.7 1.8 3.9E-05 33.9 7.1 74 46-120 197-312 (317)
29 cd05485 Cathepsin_D_like Cathe 86.5 3.3 7.1E-05 32.6 6.8 22 98-120 303-324 (329)
30 PF00026 Asp: Eukaryotic aspar 84.3 0.35 7.6E-06 37.0 0.3 67 52-120 199-311 (317)
31 PF00455 DeoRC: DeoR C termina 81.2 1.1 2.3E-05 32.2 1.8 63 54-116 22-95 (161)
32 cd05470 pepsin_retropepsin_lik 81.0 1.7 3.6E-05 28.3 2.6 27 45-71 2-30 (109)
33 cd05474 SAP_like SAPs, pepsin- 80.6 4.2 9.1E-05 31.0 5.1 25 43-67 4-30 (295)
34 PTZ00013 plasmepsin 4 (PM4); P 75.4 6.1 0.00013 33.1 4.9 22 53-74 332-353 (450)
35 PF00026 Asp: Eukaryotic aspar 74.4 4.9 0.00011 30.6 3.9 26 43-68 3-30 (317)
36 COG2383 Uncharacterized conser 73.6 0.6 1.3E-05 31.8 -1.2 21 99-119 50-70 (109)
37 COG0282 ackA Acetate kinase [E 73.1 2.2 4.8E-05 35.4 1.7 37 60-96 179-215 (396)
38 PTZ00147 plasmepsin-1; Provisi 69.8 23 0.00051 29.7 7.1 22 52-73 332-353 (453)
39 cd06097 Aspergillopepsin_like 66.0 7.6 0.00016 29.6 3.3 26 44-69 3-30 (278)
40 PRK10681 DNA-binding transcrip 64.6 3.1 6.7E-05 31.9 0.9 51 54-104 95-156 (252)
41 cd06098 phytepsin Phytepsin, a 60.0 14 0.0003 28.9 3.8 28 42-69 11-40 (317)
42 COG4067 Uncharacterized protei 59.0 8 0.00017 28.3 2.1 23 51-73 38-60 (162)
43 cd05487 renin_like Renin stimu 58.3 14 0.00031 28.9 3.6 27 42-68 9-37 (326)
44 cd05472 cnd41_like Chloroplast 57.6 11 0.00024 28.9 2.9 20 54-73 173-192 (299)
45 cd05486 Cathespin_E Cathepsin 57.2 11 0.00025 29.2 2.9 29 46-74 186-220 (316)
46 cd05478 pepsin_A Pepsin A, asp 57.1 14 0.00031 28.7 3.4 73 46-120 194-312 (317)
47 cd05477 gastricsin Gastricsins 55.8 16 0.00035 28.4 3.5 72 47-120 189-312 (318)
48 cd05475 nucellin_like Nucellin 54.7 17 0.00037 27.7 3.5 25 43-67 4-30 (273)
49 cd05490 Cathepsin_D2 Cathepsin 54.4 18 0.00039 28.2 3.6 27 42-68 7-35 (325)
50 cd05476 pepsin_A_like_plant Ch 53.8 15 0.00032 27.8 3.0 25 44-68 4-30 (265)
51 PRK09802 DNA-binding transcrip 52.6 6.6 0.00014 30.5 0.9 53 54-106 109-172 (269)
52 cd05478 pepsin_A Pepsin A, asp 52.0 22 0.00048 27.6 3.8 28 42-69 11-40 (317)
53 cd06096 Plasmepsin_5 Plasmepsi 51.8 26 0.00055 27.5 4.1 28 42-69 4-33 (326)
54 cd05477 gastricsin Gastricsins 51.0 24 0.00053 27.3 3.9 27 43-69 5-33 (318)
55 PRK10411 DNA-binding transcrip 50.8 7 0.00015 29.9 0.7 55 54-109 96-161 (240)
56 PRK10906 DNA-binding transcrip 49.6 7.8 0.00017 29.8 0.8 49 54-102 94-153 (252)
57 cd05472 cnd41_like Chloroplast 49.5 19 0.00041 27.6 3.0 24 44-67 4-29 (299)
58 cd05488 Proteinase_A_fungi Fun 49.5 24 0.00051 27.5 3.6 28 42-69 11-40 (320)
59 cd05473 beta_secretase_like Be 49.1 23 0.00049 28.2 3.5 27 43-69 5-33 (364)
60 PTZ00013 plasmepsin 4 (PM4); P 48.9 25 0.00053 29.6 3.8 29 43-71 140-170 (450)
61 cd05473 beta_secretase_like Be 48.6 40 0.00087 26.8 4.8 20 54-73 213-232 (364)
62 cd05486 Cathespin_E Cathepsin 48.4 23 0.00049 27.6 3.3 26 44-69 3-30 (316)
63 PTZ00147 plasmepsin-1; Provisi 48.1 27 0.00058 29.3 3.9 28 42-69 140-169 (453)
64 cd05485 Cathepsin_D_like Cathe 47.2 27 0.00059 27.4 3.6 29 41-69 11-41 (329)
65 PF03419 Peptidase_U4: Sporula 46.7 35 0.00076 26.7 4.1 35 40-74 156-201 (293)
66 TIGR02854 spore_II_GA sigma-E 45.6 35 0.00077 26.8 4.0 25 38-62 155-181 (288)
67 PF04930 FUN14: FUN14 family; 42.5 7.8 0.00017 25.7 -0.1 20 100-119 31-50 (100)
68 cd06396 PB1_NBR1 The PB1 domai 42.4 59 0.0013 21.1 4.0 44 42-90 1-46 (81)
69 cd05490 Cathepsin_D2 Cathepsin 42.4 19 0.00042 28.0 2.1 22 53-74 207-228 (325)
70 PRK10434 srlR DNA-bindng trans 39.7 15 0.00033 28.2 1.1 50 54-103 94-155 (256)
71 cd05471 pepsin_like Pepsin-lik 39.2 35 0.00075 25.3 3.0 28 45-72 4-33 (283)
72 cd05488 Proteinase_A_fungi Fun 38.7 22 0.00047 27.7 1.9 22 53-74 206-227 (320)
73 PF14543 TAXi_N: Xylanase inhi 35.3 60 0.0013 23.0 3.6 23 44-66 3-27 (164)
74 PTZ00165 aspartyl protease; Pr 35.1 58 0.0013 27.6 3.9 31 42-72 121-153 (482)
75 cd05487 renin_like Renin stimu 34.6 29 0.00063 27.1 2.0 22 98-120 299-320 (326)
76 cd05474 SAP_like SAPs, pepsin- 34.5 29 0.00062 26.3 1.9 22 53-74 179-200 (295)
77 PF04126 Cyclophil_like: Cyclo 31.3 33 0.0007 23.5 1.6 30 42-76 1-31 (120)
78 PRK13509 transcriptional repre 29.9 26 0.00055 26.9 0.9 49 54-103 96-155 (251)
79 COG1349 GlpR Transcriptional r 28.8 35 0.00076 26.2 1.5 51 54-104 94-155 (253)
80 PLN03146 aspartyl protease fam 28.8 59 0.0013 26.9 2.9 17 54-70 309-325 (431)
81 PLN03146 aspartyl protease fam 27.6 78 0.0017 26.2 3.5 27 41-67 84-112 (431)
82 PF08922 DUF1905: Domain of un 26.9 1.1E+02 0.0024 19.3 3.4 45 40-86 31-78 (80)
83 PF05618 Zn_protease: Putative 26.8 50 0.0011 23.3 1.9 23 50-72 14-36 (138)
84 COG2861 Uncharacterized protei 26.4 55 0.0012 25.7 2.2 28 52-79 150-177 (250)
85 cd05489 xylanase_inhibitor_I_l 25.7 70 0.0015 25.7 2.8 20 54-73 231-250 (362)
86 PRK12440 acetate kinase; Revie 25.6 31 0.00068 28.7 0.8 44 60-103 180-229 (397)
87 cd01813 UBP_N UBP ubiquitin pr 24.5 1.4E+02 0.0029 18.4 3.4 38 42-79 1-38 (74)
88 smart00295 B41 Band 4.1 homolo 24.0 2.1E+02 0.0046 20.1 4.8 30 49-78 12-41 (207)
89 PF14541 TAXi_C: Xylanase inhi 23.7 50 0.0011 23.1 1.4 19 54-72 31-49 (161)
90 cd06407 PB1_NLP A PB1 domain i 23.6 1.3E+02 0.0027 19.3 3.2 46 42-89 1-46 (82)
91 PF04746 DUF575: Protein of un 23.3 38 0.00082 22.8 0.7 13 97-109 27-39 (101)
92 PF00564 PB1: PB1 domain; Int 23.2 1.8E+02 0.004 17.6 4.5 46 41-89 1-47 (84)
93 PRK12379 propionate/acetate ki 22.5 37 0.00081 28.3 0.7 45 60-104 176-226 (396)
94 PF03539 Spuma_A9PTase: Spumav 22.1 83 0.0018 23.0 2.3 26 48-73 1-26 (163)
95 cd01812 BAG1_N Ubiquitin-like 20.7 1.8E+02 0.004 16.9 3.4 38 42-79 1-38 (71)
96 PRK12397 propionate kinase; Re 20.6 38 0.00083 28.3 0.4 45 60-104 180-230 (404)
97 COG3865 Uncharacterized protei 20.4 2.4E+02 0.0053 20.4 4.4 25 40-65 49-73 (151)
No 1
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=99.93 E-value=4.7e-26 Score=161.33 Aligned_cols=86 Identities=24% Similarity=0.422 Sum_probs=73.5
Q ss_pred eecccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCC-CCCeEEEecCCceecc---------------
Q 043220 33 VKCGQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMP-TKPFSVRIANGAHLKS--------------- 96 (121)
Q Consensus 33 l~g~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~-~~~~~V~va~G~~l~~--------------- 96 (121)
-.+...|..|+.++.|+++++.+|||||||||||++++|++++++..+ ..++.|..++| .+.|
T Consensus 13 ~~~~~~~~vi~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~V~~~g~-~~~~~~~~~~~~~~i~g~~ 91 (135)
T PF08284_consen 13 EEAEESPDVITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIVVSAPGG-SINCEGVCPDVPLSIQGHE 91 (135)
T ss_pred hcccCCCCeEEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeEEecccc-cccccceeeeEEEEECCeE
Confidence 344567889999999999999999999999999999999999999885 46777775544 3332
Q ss_pred ------------ceEeecchhhhhcCCeeeecccCC
Q 043220 97 ------------PDVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 97 ------------~dvILG~~WL~~~~p~~idw~~~~ 120 (121)
||+|||||||++|+| .|||.+++
T Consensus 92 ~~~dl~vl~l~~~DvILGm~WL~~~~~-~IDw~~k~ 126 (135)
T PF08284_consen 92 FVVDLLVLDLGGYDVILGMDWLKKHNP-VIDWATKT 126 (135)
T ss_pred EEeeeEEecccceeeEeccchHHhCCC-EEEccCCE
Confidence 999999999999999 59999875
No 2
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.85 E-value=3.3e-21 Score=134.20 Aligned_cols=82 Identities=15% Similarity=0.083 Sum_probs=68.8
Q ss_pred ccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc---------------------
Q 043220 38 LRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS--------------------- 96 (121)
Q Consensus 38 ~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~--------------------- 96 (121)
....|++.+.|||+++.+|||||||||||++++|+|+|++.....++.+.++++....+
T Consensus 13 ~~~~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~i~~~~~~~~~~ 92 (124)
T cd05479 13 KVPMLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVKIGNLFLPCSFT 92 (124)
T ss_pred eeeEEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEEECCEEeeeEEE
Confidence 45678999999999999999999999999999999999987666666666665332211
Q ss_pred ------ceEeecchhhhhcCCeeeecccCC
Q 043220 97 ------PDVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 97 ------~dvILG~~WL~~~~p~~idw~~~~ 120 (121)
+|+|||||||++++. .|||++++
T Consensus 93 Vl~~~~~d~ILG~d~L~~~~~-~ID~~~~~ 121 (124)
T cd05479 93 VLEDDDVDFLIGLDMLKRHQC-VIDLKENV 121 (124)
T ss_pred EECCCCcCEEecHHHHHhCCe-EEECCCCE
Confidence 899999999999996 69999874
No 3
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.63 E-value=6.8e-16 Score=101.59 Aligned_cols=65 Identities=20% Similarity=0.254 Sum_probs=58.7
Q ss_pred EEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-------------------------c
Q 043220 43 VVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS-------------------------P 97 (121)
Q Consensus 43 ~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~-------------------------~ 97 (121)
.+++.|||+++.+||||||++|||+++.+++++++.......++..|+|..+.+ +
T Consensus 2 ~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G~~~~~~G~~~~~v~~~~~~~~~~~~v~~~~~ 81 (91)
T cd05484 2 TVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATGTKLSVLGQILVTVKYGGKTKVLTLYVVKNEG 81 (91)
T ss_pred EEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCCCEeeEeEEEEEEEEECCEEEEEEEEEEECCC
Confidence 578999999999999999999999999999999987666789999999987643 8
Q ss_pred eEeecchhhh
Q 043220 98 DVALDVKWSE 107 (121)
Q Consensus 98 dvILG~~WL~ 107 (121)
+.|||++||.
T Consensus 82 ~~lLG~~wl~ 91 (91)
T cd05484 82 LNLLGRDWLD 91 (91)
T ss_pred CCccChhhcC
Confidence 9999999984
No 4
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=99.58 E-value=1.2e-14 Score=92.56 Aligned_cols=61 Identities=30% Similarity=0.433 Sum_probs=56.3
Q ss_pred ccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCC-CeEEEecCCceecc
Q 043220 36 GQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTK-PFSVRIANGAHLKS 96 (121)
Q Consensus 36 ~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~-~~~V~va~G~~l~~ 96 (121)
...++++++.+.|+|+.+.+|||||||||||+.++|+||+++..+.. +.+|++|||....+
T Consensus 3 ~~~~g~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~~~~~v~~a~g~~~~~ 64 (72)
T PF13975_consen 3 TPDPGLMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPPSPIRVKLANGSVIEI 64 (72)
T ss_pred cccCCEEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCCCCEEEEECCCCcccc
Confidence 45788999999999999999999999999999999999999988766 99999999987765
No 5
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.57 E-value=2.1e-14 Score=99.84 Aligned_cols=83 Identities=16% Similarity=0.165 Sum_probs=70.5
Q ss_pred ccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCC-CCCeEEEecCCceecc------------------
Q 043220 36 GQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMP-TKPFSVRIANGAHLKS------------------ 96 (121)
Q Consensus 36 ~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~-~~~~~V~va~G~~l~~------------------ 96 (121)
....+++.+.++|||+++.+|||||||+++|++++|+++++.... ..+..+..|+|.....
T Consensus 6 ~~~~g~~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~~~~~~~ta~G~~~~~~~~l~~l~iG~~~~~nv~ 85 (121)
T TIGR02281 6 KDGDGHFYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLGYTVTVSTANGQIKAARVTLDRVAIGGIVVNDVD 85 (121)
T ss_pred EcCCCeEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCCceEEEEeCCCcEEEEEEEeCEEEECCEEEeCcE
Confidence 356789999999999999999999999999999999999987654 4578889999974321
Q ss_pred ---------ceEeecchhhhhcCCeeeeccc
Q 043220 97 ---------PDVALDVKWSEQIGSIMRGWKK 118 (121)
Q Consensus 97 ---------~dvILG~~WL~~~~p~~idw~~ 118 (121)
.+.+|||+||.++.++.+|-.+
T Consensus 86 ~~v~~~~~~~~~LLGm~fL~~~~~~~~~~~~ 116 (121)
T TIGR02281 86 AMVAEGGALSESLLGMSFLNRLSRFTVRGGK 116 (121)
T ss_pred EEEeCCCcCCceEcCHHHHhccccEEEECCE
Confidence 4799999999999988777543
No 6
>PF13650 Asp_protease_2: Aspartyl protease
Probab=99.44 E-value=4.1e-13 Score=86.37 Aligned_cols=49 Identities=33% Similarity=0.432 Sum_probs=43.7
Q ss_pred EEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCC-CeEEEecCCc
Q 043220 44 VATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTK-PFSVRIANGA 92 (121)
Q Consensus 44 v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~-~~~V~va~G~ 92 (121)
|+++|||+++.+||||||+.++|+++++++++++..... +..+..++|.
T Consensus 1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~g~ 50 (90)
T PF13650_consen 1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKSVPISVSGAGGS 50 (90)
T ss_pred CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCceeEEEEeCCCC
Confidence 468999999999999999999999999999999877553 6788888887
No 7
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.39 E-value=2.4e-12 Score=83.52 Aligned_cols=68 Identities=25% Similarity=0.344 Sum_probs=57.1
Q ss_pred CeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-----------------------
Q 043220 40 KFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS----------------------- 96 (121)
Q Consensus 40 ~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~----------------------- 96 (121)
+.+.+++.||++++.+||||||++++|+.+++++++++.....+..+..++|....+
T Consensus 1 ~~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~~~~~~~G~~~~~~~~~~~i~ig~~~~~~~~~~v~d 80 (96)
T cd05483 1 GHFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKVTVQTANGRVRAARVRLDSLQIGGITLRNVPAVVLP 80 (96)
T ss_pred CcEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcEEEEecCCCccceEEEcceEEECCcEEeccEEEEeC
Confidence 357899999999999999999999999999999998855555677888888864322
Q ss_pred -----ceEeecchhhh
Q 043220 97 -----PDVALDVKWSE 107 (121)
Q Consensus 97 -----~dvILG~~WL~ 107 (121)
.|.|||++||+
T Consensus 81 ~~~~~~~gIlG~d~l~ 96 (96)
T cd05483 81 GDALGVDGLLGMDFLR 96 (96)
T ss_pred CcccCCceEeChHHhC
Confidence 67999999985
No 8
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=99.35 E-value=2.8e-12 Score=87.44 Aligned_cols=67 Identities=21% Similarity=0.340 Sum_probs=56.8
Q ss_pred EEEEEEecCCCcccc-cCHHHHhhcCCCCCCCCCeEEEecCCceecc------------------------ceEeecchh
Q 043220 51 HEVVILISSGSTANF-VSDKVAALLPLPMMPTKPFSVRIANGAHLKS------------------------PDVALDVKW 105 (121)
Q Consensus 51 ~~v~~LIDSGAT~nF-I~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------------------------~dvILG~~W 105 (121)
-++.+|||||||+.+ |++++|+++|++.. ....+.+|||..... .+++|||.|
T Consensus 15 ~~v~~LVDTGat~~~~l~~~~a~~lgl~~~--~~~~~~tA~G~~~~~~v~~~~v~igg~~~~~~v~~~~~~~~~LLG~~~ 92 (107)
T TIGR03698 15 MEVRALVDTGFSGFLLVPPDIVNKLGLPEL--DQRRVYLADGREVLTDVAKASIIINGLEIDAFVESLGYVDEPLLGTEL 92 (107)
T ss_pred eEEEEEEECCCCeEEecCHHHHHHcCCCcc--cCcEEEecCCcEEEEEEEEEEEEECCEEEEEEEEecCCCCccEecHHH
Confidence 489999999999998 99999999999764 367899999963321 578999999
Q ss_pred hhhcCCeeeecccCC
Q 043220 106 SEQIGSIMRGWKKDE 120 (121)
Q Consensus 106 L~~~~p~~idw~~~~ 120 (121)
|++++. .+||++++
T Consensus 93 L~~l~l-~id~~~~~ 106 (107)
T TIGR03698 93 LEGLGI-VIDYRNQG 106 (107)
T ss_pred HhhCCE-EEehhhCc
Confidence 999986 59999875
No 9
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.25 E-value=4.9e-11 Score=89.67 Aligned_cols=87 Identities=15% Similarity=0.172 Sum_probs=75.3
Q ss_pred cceEEEeeeecccccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC-CCCCeEEEecCCceecc-------
Q 043220 25 ENSRSLFQVKCGQLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM-PTKPFSVRIANGAHLKS------- 96 (121)
Q Consensus 25 ~~~isl~Al~g~~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~-~~~~~~V~va~G~~l~~------- 96 (121)
.-+++|++ ...+|+...+.|||+++.+|||||||.-.+++.-|+|+|+... ...++.|.+|||+..-.
T Consensus 93 ~~~v~Lak----~~~GHF~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG~~~AA~V~Ld~v 168 (215)
T COG3577 93 YQEVSLAK----SRDGHFEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANGRARAAPVTLDRV 168 (215)
T ss_pred ceEEEEEe----cCCCcEEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCCccccceEEeeeE
Confidence 45788877 8889999999999999999999999999999999999999876 56899999999974321
Q ss_pred --------------------ceEeecchhhhhcCCeeee
Q 043220 97 --------------------PDVALDVKWSEQIGSIMRG 115 (121)
Q Consensus 97 --------------------~dvILG~~WL~~~~p~~id 115 (121)
-...|||+||.+++-...+
T Consensus 169 ~IG~I~~~nV~A~V~~~g~L~~sLLGMSfL~rL~~fq~~ 207 (215)
T COG3577 169 QIGGIRVKNVDAMVAEDGALDESLLGMSFLNRLSGFQVD 207 (215)
T ss_pred EEccEEEcCchhheecCCccchhhhhHHHHhhccceEec
Confidence 5678999999999876543
No 10
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=99.23 E-value=1.3e-11 Score=86.60 Aligned_cols=77 Identities=17% Similarity=0.160 Sum_probs=50.3
Q ss_pred ccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeE-EEecCC-----------------ceecc---
Q 043220 38 LRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFS-VRIANG-----------------AHLKS--- 96 (121)
Q Consensus 38 ~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~-V~va~G-----------------~~l~~--- 96 (121)
....+.+..+|||++++|+|||||.+|.|+.++|+|+|+.....+.+. +..+-| ..+.|
T Consensus 21 ~v~mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~G~Ih~~~l~ig~~~~~~s~~ 100 (124)
T PF09668_consen 21 QVSMLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKILGRIHSVQLKIGGLFFPCSFT 100 (124)
T ss_dssp -----EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEEEEEEEEEEEETTEEEEEEEE
T ss_pred CcceEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCceeEEEEEEEEEECCEEEEEEEE
Confidence 345678999999999999999999999999999999998632222221 111112 12222
Q ss_pred ------ceEeecchhhhhcCCeeee
Q 043220 97 ------PDVALDVKWSEQIGSIMRG 115 (121)
Q Consensus 97 ------~dvILG~~WL~~~~p~~id 115 (121)
.|++||.|||++|..+ ||
T Consensus 101 Vle~~~~d~llGld~L~~~~c~-ID 124 (124)
T PF09668_consen 101 VLEDQDVDLLLGLDMLKRHKCC-ID 124 (124)
T ss_dssp EETTSSSSEEEEHHHHHHTT-E-EE
T ss_pred EeCCCCcceeeeHHHHHHhCcc-cC
Confidence 8999999999999884 65
No 11
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=99.15 E-value=4.6e-11 Score=79.16 Aligned_cols=68 Identities=21% Similarity=0.322 Sum_probs=55.8
Q ss_pred eEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------------------------
Q 043220 41 FCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------------------------ 96 (121)
Q Consensus 41 ~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------------------------ 96 (121)
.-++..+++|+++.+||||||+.++|+++...++..+ ...+..+..++|.. .+
T Consensus 5 rp~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~--~~~~~~v~~~~g~~-~~~~~~~~~v~~~~~~~~~~~~v~~~ 81 (100)
T PF00077_consen 5 RPYITVKINGKKIKALLDTGADVSIISEKDWKKLGPP--PKTSITVRGAGGSS-SILGSTTVEVKIGGKEFNHTFLVVPD 81 (100)
T ss_dssp SSEEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSE--EEEEEEEEETTEEE-EEEEEEEEEEEETTEEEEEEEEESST
T ss_pred CceEEEeECCEEEEEEEecCCCcceeccccccccccc--ccCCceeccCCCcc-eeeeEEEEEEEEECccceEEEEecCC
Confidence 3467889999999999999999999999998877654 45567788888865 21
Q ss_pred --ceEeecchhhhhcCCe
Q 043220 97 --PDVALDVKWSEQIGSI 112 (121)
Q Consensus 97 --~dvILG~~WL~~~~p~ 112 (121)
.| |||.|||++++..
T Consensus 82 ~~~~-ILG~D~L~~~~~~ 98 (100)
T PF00077_consen 82 LPMN-ILGRDFLKKLNAV 98 (100)
T ss_dssp CSSE-EEEHHHHTTTTCE
T ss_pred CCCC-EeChhHHHHcCCE
Confidence 67 9999999999873
No 12
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=99.14 E-value=1.1e-10 Score=76.19 Aligned_cols=61 Identities=18% Similarity=0.294 Sum_probs=49.6
Q ss_pred EEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCce---ecc----------------------ce
Q 043220 44 VATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAH---LKS----------------------PD 98 (121)
Q Consensus 44 v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~---l~~----------------------~d 98 (121)
+++.|||+++.+|+||||+++.|++..++++ .....+..+..++|.. +.. .+
T Consensus 1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~---~~~~~~~~v~gagG~~~~~v~~~~~~v~vg~~~~~~~~~v~~~~~~ 77 (86)
T cd06095 1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQ---ELSTTSVLIRGVSGQSQQPVTTYRTLVDLGGHTVSHSFLVVPNCPD 77 (86)
T ss_pred CEEEECCEEEEEEEECCCCeEEECHHHhhhc---cCCCCcEEEEeCCCcccccEEEeeeEEEECCEEEEEEEEEEcCCCC
Confidence 4678999999999999999999999999997 2334677888888863 210 78
Q ss_pred Eeecchhhh
Q 043220 99 VALDVKWSE 107 (121)
Q Consensus 99 vILG~~WL~ 107 (121)
.|||||||.
T Consensus 78 ~lLG~dfL~ 86 (86)
T cd06095 78 PLLGRDLLS 86 (86)
T ss_pred cEechhhcC
Confidence 899999984
No 13
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=99.07 E-value=7.5e-10 Score=67.60 Aligned_cols=63 Identities=40% Similarity=0.557 Sum_probs=48.6
Q ss_pred EEEECCEEEEEEecCCCcccccCHHHHhhcCC-CCCCCCCeEEEecCCceec---------------------------c
Q 043220 45 ATKIRFHEVVILISSGSTANFVSDKVAALLPL-PMMPTKPFSVRIANGAHLK---------------------------S 96 (121)
Q Consensus 45 ~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l-~~~~~~~~~V~va~G~~l~---------------------------~ 96 (121)
.+.++++++.+|+|+||++++++..++.++++ ......+..+..++|.... .
T Consensus 2 ~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 81 (92)
T cd00303 2 KGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLKVKGANGSSVKTLGVILPVTIGIGGKTFTVDFYVLDLLS 81 (92)
T ss_pred EEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceEEEecCCCEeccCcEEEEEEEEeCCEEEEEEEEEEcCCC
Confidence 45678899999999999999999999999987 4444455666666553211 1
Q ss_pred ceEeecchhhh
Q 043220 97 PDVALDVKWSE 107 (121)
Q Consensus 97 ~dvILG~~WL~ 107 (121)
+|+|||++||.
T Consensus 82 ~~~ilG~~~l~ 92 (92)
T cd00303 82 YDVILGRPWLE 92 (92)
T ss_pred cCEEecccccC
Confidence 88999999984
No 14
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=98.98 E-value=2.1e-09 Score=78.58 Aligned_cols=87 Identities=17% Similarity=0.252 Sum_probs=70.0
Q ss_pred cceEEEeeeec-c---cccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCce-ecc---
Q 043220 25 ENSRSLFQVKC-G---QLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAH-LKS--- 96 (121)
Q Consensus 25 ~~~isl~Al~g-~---~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~-l~~--- 96 (121)
.+.+.+.-+.- . ...++..+...++|.++.+|+||||..+||+..++++|+++.....|++++.+.+.. ..|
T Consensus 14 ~~~~~~~PIV~~~~~Pevg~T~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~vs~~~~~tsEA 93 (177)
T PF12384_consen 14 KSSITTCPIVHYIAIPEVGKTAIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGFVSGESATTSEA 93 (177)
T ss_pred cceeeeeeeEEEeeccccCcEEEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeeeccCCceEEEEe
Confidence 45555555443 2 344688999999999999999999999999999999999999999998887664431 112
Q ss_pred -----------------------ceEeecchhhhhcCC
Q 043220 97 -----------------------PDVALDVKWSEQIGS 111 (121)
Q Consensus 97 -----------------------~dvILG~~WL~~~~p 111 (121)
+|+|+|.+.|+++..
T Consensus 94 v~ld~~i~n~~i~i~aYV~d~m~~dlIIGnPiL~ryp~ 131 (177)
T PF12384_consen 94 VTLDFYIDNKLIDIAAYVTDNMDHDLIIGNPILDRYPT 131 (177)
T ss_pred EEEEEEECCeEEEEEEEEeccCCcceEeccHHHhhhHH
Confidence 999999999999854
No 15
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=98.82 E-value=6e-09 Score=70.49 Aligned_cols=71 Identities=11% Similarity=0.148 Sum_probs=53.3
Q ss_pred EEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC-CCCCe--------------------EEEecCCceecc-------
Q 043220 45 ATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM-PTKPF--------------------SVRIANGAHLKS------- 96 (121)
Q Consensus 45 ~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~-~~~~~--------------------~V~va~G~~l~~------- 96 (121)
..++||++++|+|||||-+|.||+.+|+|.|+.-. ...++ .+++++ ..+.|
T Consensus 2 nCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~-~~~~CSftVld~ 80 (103)
T cd05480 2 SCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQ-LTVECSAQVVDD 80 (103)
T ss_pred ceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCC-EEeeEEEEEEcC
Confidence 46899999999999999999999999999998632 11111 111211 23334
Q ss_pred --ceEeecchhhhhcCCeeeecc
Q 043220 97 --PDVALDVKWSEQIGSIMRGWK 117 (121)
Q Consensus 97 --~dvILG~~WL~~~~p~~idw~ 117 (121)
.|++||-|-|++|.. .||.+
T Consensus 81 ~~~d~llGLdmLkrhqc-~IdL~ 102 (103)
T cd05480 81 NEKNFSLGLQTLKSLKC-VINLE 102 (103)
T ss_pred CCcceEeeHHHHhhcce-eeecc
Confidence 999999999999998 48865
No 16
>PF02160 Peptidase_A3: Cauliflower mosaic virus peptidase (A3); InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=98.50 E-value=2.9e-07 Score=69.29 Aligned_cols=75 Identities=17% Similarity=0.145 Sum_probs=53.6
Q ss_pred EEEEEEECC---EEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceec----c------------------
Q 043220 42 CVVATKIRF---HEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLK----S------------------ 96 (121)
Q Consensus 42 ~~v~g~i~~---~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~----~------------------ 96 (121)
++++..+.| ..+.++||||||.+.++....=.-.. .....++.|+.||+.... |
T Consensus 7 I~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~w-e~~~~~i~v~~an~~~~~i~~~~~~~~i~I~~~~F~IP~iY 85 (201)
T PF02160_consen 7 IKVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEYW-EKSKKPIKVKGANGSIIQINKKAKNGKIQIADKIFRIPTIY 85 (201)
T ss_pred EEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHHH-HhCCCcEEEEEecCCceEEEEEecCceEEEccEEEeccEEE
Confidence 344445555 45788999999999988766421110 123457899999987432 1
Q ss_pred -----ceEeecchhhhhcCCeeeeccc
Q 043220 97 -----PDVALDVKWSEQIGSIMRGWKK 118 (121)
Q Consensus 97 -----~dvILG~~WL~~~~p~~idw~~ 118 (121)
.|+|||++|++.+.|. +.|..
T Consensus 86 q~~~g~d~IlG~NF~r~y~Pf-iq~~~ 111 (201)
T PF02160_consen 86 QQESGIDIILGNNFLRLYEPF-IQTED 111 (201)
T ss_pred EecCCCCEEecchHHHhcCCc-EEEcc
Confidence 9999999999999997 88864
No 17
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=98.50 E-value=1.2e-07 Score=62.89 Aligned_cols=55 Identities=16% Similarity=0.414 Sum_probs=45.8
Q ss_pred EEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-------------------------ceEeecchhh
Q 043220 52 EVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS-------------------------PDVALDVKWS 106 (121)
Q Consensus 52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~-------------------------~dvILG~~WL 106 (121)
.+.+||||||.+|.|.+...++. ....++.++.|||+.+.+ ...|||.|+|
T Consensus 9 ~~~fLVDTGA~vSviP~~~~~~~----~~~~~~~l~AANgt~I~tyG~~~l~ldlGlrr~~~w~FvvAdv~~pIlGaDfL 84 (89)
T cd06094 9 GLRFLVDTGAAVSVLPASSTKKS----LKPSPLTLQAANGTPIATYGTRSLTLDLGLRRPFAWNFVVADVPHPILGADFL 84 (89)
T ss_pred CcEEEEeCCCceEeecccccccc----ccCCceEEEeCCCCeEeeeeeEEEEEEcCCCcEEeEEEEEcCCCcceecHHHH
Confidence 46799999999999998887763 345678999999998754 5569999999
Q ss_pred hhcC
Q 043220 107 EQIG 110 (121)
Q Consensus 107 ~~~~ 110 (121)
++|+
T Consensus 85 ~~~~ 88 (89)
T cd06094 85 QHYG 88 (89)
T ss_pred HHcC
Confidence 9986
No 18
>KOG0012 consensus DNA damage inducible protein [Replication, recombination and repair]
Probab=98.40 E-value=2.5e-07 Score=74.65 Aligned_cols=82 Identities=17% Similarity=0.133 Sum_probs=60.9
Q ss_pred cccCeEEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCe-EEEecCC-----------------ceecc--
Q 043220 37 QLRKFCVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPF-SVRIANG-----------------AHLKS-- 96 (121)
Q Consensus 37 ~~~~~~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~-~V~va~G-----------------~~l~~-- 96 (121)
.....+.|..+|||++|+|+|||||-.|-||..+|+|.|+.-...+.+ -+.-+.| ..+.|
T Consensus 231 ~~v~ML~iN~~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~c~f 310 (380)
T KOG0012|consen 231 TQVTMLYINCEINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLPCSF 310 (380)
T ss_pred ccceEEEEEEEECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeeccce
Confidence 344456789999999999999999999999999999999864322111 1111111 12234
Q ss_pred -------ceEeecchhhhhcCCeeeecccC
Q 043220 97 -------PDVALDVKWSEQIGSIMRGWKKD 119 (121)
Q Consensus 97 -------~dvILG~~WL~~~~p~~idw~~~ 119 (121)
.|+.||-+-|++|.. .||.++.
T Consensus 311 tV~d~~~~d~llGLd~Lrr~~c-cIdL~~~ 339 (380)
T KOG0012|consen 311 TVLDRRDMDLLLGLDMLRRHQC-CIDLKTN 339 (380)
T ss_pred EEecCCCcchhhhHHHHHhccc-eeecccC
Confidence 899999999999999 4998764
No 19
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=98.39 E-value=5.6e-07 Score=64.88 Aligned_cols=32 Identities=34% Similarity=0.478 Sum_probs=28.1
Q ss_pred EEEEEEecCCCcccccCHHHHhhcCCCCCCCC
Q 043220 51 HEVVILISSGSTANFVSDKVAALLPLPMMPTK 82 (121)
Q Consensus 51 ~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~ 82 (121)
+.+.+|+||||..|||.+++|++|+|+.....
T Consensus 11 ~~~~~LlDsGSq~SfIt~~la~~L~L~~~~~~ 42 (164)
T PF05585_consen 11 VEARALLDSGSQRSFITESLANKLNLPGTGEK 42 (164)
T ss_pred EEEEEEEecCCchhHHhHHHHHHhCCCCCCce
Confidence 56789999999999999999999999876543
No 20
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=98.03 E-value=1.1e-05 Score=53.54 Aligned_cols=50 Identities=20% Similarity=0.282 Sum_probs=42.9
Q ss_pred EEECC-EEEEEEecCCCcccccCHHHHhhcC---CCCCCCCCeEEEecCCceec
Q 043220 46 TKIRF-HEVVILISSGSTANFVSDKVAALLP---LPMMPTKPFSVRIANGAHLK 95 (121)
Q Consensus 46 g~i~~-~~v~~LIDSGAT~nFI~~~~a~rl~---l~~~~~~~~~V~va~G~~l~ 95 (121)
..+++ +++.+++|+||+.|.|+.+..++++ .+.-.+.++.+..+||..+.
T Consensus 3 ~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~ 56 (93)
T cd05481 3 MKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIP 56 (93)
T ss_pred eEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEee
Confidence 46788 9999999999999999999999998 45555667999999998764
No 21
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.30 E-value=0.00077 Score=47.26 Aligned_cols=76 Identities=24% Similarity=0.291 Sum_probs=50.2
Q ss_pred CeEEEEEEE--CC-EEEEEEecCCCc-ccccCHHHHhhcCCCCCCCCCeEEEecCCceecc-------------------
Q 043220 40 KFCVVATKI--RF-HEVVILISSGST-ANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------------------- 96 (121)
Q Consensus 40 ~~~~v~g~i--~~-~~v~~LIDSGAT-~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------------------- 96 (121)
.++.++... +| +.-..|||||.| -..+++++|.+++++...+ .++..++|+++++
T Consensus 11 ~~v~~~f~~~~~Gd~~~~~LiDTGFtg~lvlp~~vaek~~~~~~~~--~~~~~a~~~~v~t~V~~~~iki~g~e~~~~Vl 88 (125)
T COG5550 11 VTVPVTFRLPGQGDFVYDELIDTGFTGYLVLPPQVAEKLGLPLFST--IRIVLADGGVVKTSVALATIKIDGVEKVAFVL 88 (125)
T ss_pred eeEEEEEEecCCCcEEeeeEEecCCceeEEeCHHHHHhcCCCccCC--hhhhhhcCCEEEEEEEEEEEEECCEEEEEEEE
Confidence 455555555 33 333449999999 7889999999999876533 2333444443322
Q ss_pred -----ceEeecchhhhhcCCeeeeccc
Q 043220 97 -----PDVALDVKWSEQIGSIMRGWKK 118 (121)
Q Consensus 97 -----~dvILG~~WL~~~~p~~idw~~ 118 (121)
.-.++|++||+.++- .+|.++
T Consensus 89 ~s~~~~~~liG~~~lk~l~~-~vn~~~ 114 (125)
T COG5550 89 ASDNLPEPLIGVNLLKLLGL-VVNPKT 114 (125)
T ss_pred ccCCCcccchhhhhhhhccE-EEcCCc
Confidence 223999999999876 477653
No 22
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=94.63 E-value=0.038 Score=36.41 Aligned_cols=25 Identities=12% Similarity=0.189 Sum_probs=22.6
Q ss_pred EEEECCEEEEEEecCCCcccccCHH
Q 043220 45 ATKIRFHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 45 ~g~i~~~~v~~LIDSGAT~nFI~~~ 69 (121)
+.+|+|+.+.+|+||||-++.|...
T Consensus 2 ~~~i~g~~~~~llDTGAd~Tvi~~~ 26 (87)
T cd05482 2 TLYINGKLFEGLLDTGADVSIIAEN 26 (87)
T ss_pred EEEECCEEEEEEEccCCCCeEEccc
Confidence 4678999999999999999999973
No 23
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=94.39 E-value=0.087 Score=40.19 Aligned_cols=56 Identities=14% Similarity=0.157 Sum_probs=38.2
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCc---------------eecc---------ceEeecchhhhhc
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGA---------------HLKS---------PDVALDVKWSEQI 109 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~---------------~l~~---------~dvILG~~WL~~~ 109 (121)
.++||||+|..++.+.+.- ++.+..++|. ...| .-.|||-.||+.+
T Consensus 178 ~ai~DTGTs~~~lp~~~~P----------~i~~~f~~~~~~~i~~~~y~~~~~~~~~C~~~~~~~~~~~~ilG~~fl~~~ 247 (265)
T cd05476 178 GTIIDSGTTLTYLPDPAYP----------DLTLHFDGGADLELPPENYFVDVGEGVVCLAILSSSSGGVSILGNIQQQNF 247 (265)
T ss_pred cEEEeCCCcceEcCccccC----------CEEEEECCCCEEEeCcccEEEECCCCCEEEEEecCCCCCcEEEChhhcccE
Confidence 4899999999999987762 2223232221 1123 3479999999999
Q ss_pred CCeeeecccCC
Q 043220 110 GSIMRGWKKDE 120 (121)
Q Consensus 110 ~p~~idw~~~~ 120 (121)
-.+ .|+.++|
T Consensus 248 ~~v-FD~~~~~ 257 (265)
T cd05476 248 LVE-YDLENSR 257 (265)
T ss_pred EEE-EECCCCE
Confidence 885 8887764
No 24
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=94.11 E-value=0.08 Score=41.69 Aligned_cols=65 Identities=12% Similarity=0.168 Sum_probs=47.5
Q ss_pred EEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceec--------------c--------ceEeecchhhhhc
Q 043220 52 EVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLK--------------S--------PDVALDVKWSEQI 109 (121)
Q Consensus 52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~--------------~--------~dvILG~~WL~~~ 109 (121)
...++||||.|..++.+.+.+++.-.+ +.+.+...+|..+. | --.|||-.||+.+
T Consensus 231 ~~~aivDSGTs~~~lp~~~~~~l~~~~---P~i~~~f~~g~~~~i~p~~y~~~~~~~~c~~~~~~~~~~~ILG~~flr~~ 307 (326)
T cd06096 231 GLGMLVDSGSTLSHFPEDLYNKINNFF---PTITIIFENNLKIDWKPSSYLYKKESFWCKGGEKSVSNKPILGASFFKNK 307 (326)
T ss_pred CCCEEEeCCCCcccCCHHHHHHHHhhc---CcEEEEEcCCcEEEECHHHhccccCCceEEEEEecCCCceEEChHHhcCc
Confidence 345899999999999999998864322 35666666554432 1 2369999999999
Q ss_pred CCeeeecccCC
Q 043220 110 GSIMRGWKKDE 120 (121)
Q Consensus 110 ~p~~idw~~~~ 120 (121)
-.+ .|+.++|
T Consensus 308 y~v-FD~~~~r 317 (326)
T cd06096 308 QII-FDLDNNR 317 (326)
T ss_pred EEE-EECcCCE
Confidence 875 8887764
No 25
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=91.94 E-value=0.46 Score=36.36 Aligned_cols=65 Identities=8% Similarity=-0.027 Sum_probs=41.5
Q ss_pred EEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCc-eecc----------ceEeecchhhhhcCCeeeecccCC
Q 043220 52 EVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGA-HLKS----------PDVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~-~l~~----------~dvILG~~WL~~~~p~~idw~~~~ 120 (121)
...++||||++..++...+++++.-.+... ......|. .+.| ...|||-.+|+++-.+ .||.++|
T Consensus 198 ~~~~iiDSGTs~~~lP~~~~~~l~~~l~g~---~~~~~~~~~~~~C~~~~P~i~f~~~~ilGd~fl~~~y~v-fD~~~~~ 273 (278)
T cd06097 198 GFSAIADTGTTLILLPDAIVEAYYSQVPGA---YYDSEYGGWVFPCDTTLPDLSFAVFSILGDVFLKAQYVV-FDVGGPK 273 (278)
T ss_pred CceEEeecCCchhcCCHHHHHHHHHhCcCC---cccCCCCEEEEECCCCCCCEEEEEEEEEcchhhCceeEE-EcCCCce
Confidence 356899999999999988776652111000 00001111 2333 2569999999999885 9998765
No 26
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=91.85 E-value=0.32 Score=36.51 Aligned_cols=66 Identities=8% Similarity=0.011 Sum_probs=42.2
Q ss_pred EEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------------ceEeecchhhhhcCCeeeeccc
Q 043220 51 HEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------------PDVALDVKWSEQIGSIMRGWKK 118 (121)
Q Consensus 51 ~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------------~dvILG~~WL~~~~p~~idw~~ 118 (121)
....++||||++..++...+...+--.+...... .-......| +..|||..||+.+-.+ .|+.+
T Consensus 201 ~~~~~iiDsGt~~~~lp~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~p~i~f~f~~ilG~~fl~~~y~v-fD~~~ 276 (283)
T cd05471 201 GGGGAIVDSGTSLIYLPSSVYDAILKALGAAVSS---SDGGYGVDCSPCDTLPDITFTFLWILGDVFLRNYYTV-FDLDN 276 (283)
T ss_pred CCcEEEEecCCCCEeCCHHHHHHHHHHhCCcccc---cCCcEEEeCcccCcCCCEEEEEEEEccHhhhhheEEE-EeCCC
Confidence 3568999999999999999888763222111000 000000111 5799999999999885 78776
Q ss_pred CC
Q 043220 119 DE 120 (121)
Q Consensus 119 ~~ 120 (121)
++
T Consensus 277 ~~ 278 (283)
T cd05471 277 NR 278 (283)
T ss_pred CE
Confidence 53
No 27
>PF12382 Peptidase_A2E: Retrotransposon peptidase; InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=91.80 E-value=0.18 Score=34.58 Aligned_cols=40 Identities=23% Similarity=0.384 Sum_probs=31.4
Q ss_pred EEEEEEECCE--EEEEEecCCCcccccCHHHHhhcCCCCCCC
Q 043220 42 CVVATKIRFH--EVVILISSGSTANFVSDKVAALLPLPMMPT 81 (121)
Q Consensus 42 ~~v~g~i~~~--~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~ 81 (121)
|.+.+.+-.. .+..|||+||..|.|.++.++.+.+|.++.
T Consensus 35 mvlqa~lp~fkcsipclidtgaq~niiteetvrahklptrpw 76 (137)
T PF12382_consen 35 MVLQAKLPDFKCSIPCLIDTGAQVNIITEETVRAHKLPTRPW 76 (137)
T ss_pred hhhhhhCCCccccceeEEccCceeeeeehhhhhhccCCCCcc
Confidence 4444444332 467899999999999999999999998764
No 28
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=89.66 E-value=1.8 Score=33.87 Aligned_cols=74 Identities=12% Similarity=0.063 Sum_probs=47.8
Q ss_pred EEECCEE-------EEEEecCCCcccccCHHHHhhcCC--CCC---CCCCeEEEecC----------------Cceecc-
Q 043220 46 TKIRFHE-------VVILISSGSTANFVSDKVAALLPL--PMM---PTKPFSVRIAN----------------GAHLKS- 96 (121)
Q Consensus 46 g~i~~~~-------v~~LIDSGAT~nFI~~~~a~rl~l--~~~---~~~~~~V~va~----------------G~~l~~- 96 (121)
..|+++. ..++||||++..++.+.+++++.. +.. ..+.+....++ +....|
T Consensus 197 i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~~~~~C~~~~~~P~i~f~f~g~~~~l~~~~yi~~~~~~~~~~C~ 276 (317)
T cd06098 197 VLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQINSAVDCNSLSSMPNVSFTIGGKTFELTPEQYILKVGEGAAAQCI 276 (317)
T ss_pred EEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhhccCCccccccCCcEEEEECCEEEEEChHHeEEeecCCCCCEEe
Confidence 4666654 358999999999999999987642 221 11233333321 111134
Q ss_pred -------------ceEeecchhhhhcCCeeeecccCC
Q 043220 97 -------------PDVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 97 -------------~dvILG~~WL~~~~p~~idw~~~~ 120 (121)
...|||-.||+.+-.+ .|+.++|
T Consensus 277 ~~~~~~~~~~~~~~~~IlGd~Flr~~y~V-fD~~~~~ 312 (317)
T cd06098 277 SGFTALDVPPPRGPLWILGDVFMGAYHTV-FDYGNLR 312 (317)
T ss_pred ceEEECCCCCCCCCeEEechHHhcccEEE-EeCCCCE
Confidence 1359999999999885 8888764
No 29
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=86.51 E-value=3.3 Score=32.64 Aligned_cols=22 Identities=5% Similarity=-0.191 Sum_probs=18.6
Q ss_pred eEeecchhhhhcCCeeeecccCC
Q 043220 98 DVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 98 dvILG~~WL~~~~p~~idw~~~~ 120 (121)
..|||..||+.+-.+ .||+++|
T Consensus 303 ~~IlG~~fl~~~y~v-FD~~~~~ 324 (329)
T cd05485 303 LWILGDVFIGKYYTE-FDLGNNR 324 (329)
T ss_pred eEEEchHHhccceEE-EeCCCCE
Confidence 379999999999885 8998765
No 30
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=84.32 E-value=0.35 Score=37.03 Aligned_cols=67 Identities=7% Similarity=0.167 Sum_probs=44.0
Q ss_pred EEEEEecCCCcccccCHHHHhhc----CC---------CCCC---CCCeEEEecCCcee-----------------cc--
Q 043220 52 EVVILISSGSTANFVSDKVAALL----PL---------PMMP---TKPFSVRIANGAHL-----------------KS-- 96 (121)
Q Consensus 52 ~v~~LIDSGAT~nFI~~~~a~rl----~l---------~~~~---~~~~~V~va~G~~l-----------------~~-- 96 (121)
...++||||++..++...+..++ +. +... ..++..... |..+ .|
T Consensus 199 ~~~~~~Dtgt~~i~lp~~~~~~i~~~l~~~~~~~~~~~~c~~~~~~p~l~f~~~-~~~~~i~~~~~~~~~~~~~~~~C~~ 277 (317)
T PF00026_consen 199 GQQAILDTGTSYIYLPRSIFDAIIKALGGSYSDGVYSVPCNSTDSLPDLTFTFG-GVTFTIPPSDYIFKIEDGNGGYCYL 277 (317)
T ss_dssp EEEEEEETTBSSEEEEHHHHHHHHHHHTTEEECSEEEEETTGGGGSEEEEEEET-TEEEEEEHHHHEEEESSTTSSEEEE
T ss_pred ceeeecccccccccccchhhHHHHhhhcccccceeEEEecccccccceEEEeeC-CEEEEecchHhcccccccccceeEe
Confidence 35799999999999999987754 11 1111 122333333 2211 13
Q ss_pred -----------ceEeecchhhhhcCCeeeecccCC
Q 043220 97 -----------PDVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 97 -----------~dvILG~~WL~~~~p~~idw~~~~ 120 (121)
...|||++||+.+-.+ +|+.++|
T Consensus 278 ~i~~~~~~~~~~~~iLG~~fl~~~y~v-fD~~~~~ 311 (317)
T PF00026_consen 278 GIQPMDSSDDSDDWILGSPFLRNYYVV-FDYENNR 311 (317)
T ss_dssp SEEEESSTTSSSEEEEEHHHHTTEEEE-EETTTTE
T ss_pred eeecccccccCCceEecHHHhhceEEE-EeCCCCE
Confidence 5889999999999875 8887764
No 31
>PF00455 DeoRC: DeoR C terminal sensor domain; InterPro: IPR014036 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after Escherichia coli deoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerization domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].
Probab=81.16 E-value=1.1 Score=32.20 Aligned_cols=63 Identities=13% Similarity=0.050 Sum_probs=44.8
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecchhhhhcCCeeeec
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVKWSEQIGSIMRGW 116 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~WL~~~~p~~idw 116 (121)
.+.||+|+|.-.+.+.+..+.++.+.. .+...|.+.+|..........|..-+.....+.+|+
T Consensus 22 ~Ifld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~vi~~GG~~~~~~~~~~G~~a~~~l~~~~~d~ 95 (161)
T PF00455_consen 22 TIFLDSGTTTLELAKYLPDKKNLTVVTNSLPIANELSENPNIEVILLGGEVNPKSLSFVGPIALEALRQFRFDK 95 (161)
T ss_pred EEEEECchHHHHHHHHhhcCCceEEEECCHHHHHHHHhcCceEEEEeCCEEEcCCCcEECchHHHHHHhhccce
Confidence 378999999999999999997766543 236788889998776666666655555444444443
No 32
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=81.00 E-value=1.7 Score=28.25 Aligned_cols=27 Identities=19% Similarity=0.258 Sum_probs=21.5
Q ss_pred EEEECC--EEEEEEecCCCcccccCHHHH
Q 043220 45 ATKIRF--HEVVILISSGSTANFVSDKVA 71 (121)
Q Consensus 45 ~g~i~~--~~v~~LIDSGAT~nFI~~~~a 71 (121)
...|+. +++.+++|+||+..++...-+
T Consensus 2 ~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c 30 (109)
T cd05470 2 EIGIGTPPQTFNVLLDTGSSNLWVPSVDC 30 (109)
T ss_pred EEEeCCCCceEEEEEeCCCCCEEEeCCCC
Confidence 355665 889999999999999887644
No 33
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=80.64 E-value=4.2 Score=30.97 Aligned_cols=25 Identities=20% Similarity=0.400 Sum_probs=21.4
Q ss_pred EEEEEECC--EEEEEEecCCCcccccC
Q 043220 43 VVATKIRF--HEVVILISSGSTANFVS 67 (121)
Q Consensus 43 ~v~g~i~~--~~v~~LIDSGAT~nFI~ 67 (121)
.+...|+. +++.+++||||+..++.
T Consensus 4 ~~~i~iGtp~q~~~v~~DTgS~~~wv~ 30 (295)
T cd05474 4 SAELSVGTPPQKVTVLLDTGSSDLWVP 30 (295)
T ss_pred EEEEEECCCCcEEEEEEeCCCCcceee
Confidence 45667776 88999999999999998
No 34
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=75.40 E-value=6.1 Score=33.14 Aligned_cols=22 Identities=14% Similarity=0.252 Sum_probs=18.4
Q ss_pred EEEEecCCCcccccCHHHHhhc
Q 043220 53 VVILISSGSTANFVSDKVAALL 74 (121)
Q Consensus 53 v~~LIDSGAT~nFI~~~~a~rl 74 (121)
..++||||+|..++.+..++++
T Consensus 332 ~~aIlDSGTSli~lP~~~~~~i 353 (450)
T PTZ00013 332 ANVIVDSGTTTITAPSEFLNKF 353 (450)
T ss_pred cceEECCCCccccCCHHHHHHH
Confidence 4589999999999998876653
No 35
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=74.43 E-value=4.9 Score=30.64 Aligned_cols=26 Identities=23% Similarity=0.408 Sum_probs=22.0
Q ss_pred EEEEEEC--CEEEEEEecCCCcccccCH
Q 043220 43 VVATKIR--FHEVVILISSGSTANFVSD 68 (121)
Q Consensus 43 ~v~g~i~--~~~v~~LIDSGAT~nFI~~ 68 (121)
.+...|+ ++++.++|||||+..++..
T Consensus 3 ~~~v~iGtp~q~~~~~iDTGS~~~wv~~ 30 (317)
T PF00026_consen 3 YINVTIGTPPQTFRVLIDTGSSDTWVPS 30 (317)
T ss_dssp EEEEEETTTTEEEEEEEETTBSSEEEEB
T ss_pred EEEEEECCCCeEEEEEEecccceeeece
Confidence 4566777 7999999999999999874
No 36
>COG2383 Uncharacterized conserved protein [Function unknown]
Probab=73.59 E-value=0.6 Score=31.78 Aligned_cols=21 Identities=29% Similarity=0.528 Sum_probs=18.3
Q ss_pred EeecchhhhhcCCeeeecccC
Q 043220 99 VALDVKWSEQIGSIMRGWKKD 119 (121)
Q Consensus 99 vILG~~WL~~~~p~~idw~~~ 119 (121)
.||+.-||.++|.+++||.+.
T Consensus 50 yilsl~~La~~GVItin~~al 70 (109)
T COG2383 50 YILSLFWLAQYGVITINWEAL 70 (109)
T ss_pred HHHHHHHHHHcCeEEEcHHHH
Confidence 378889999999999999754
No 37
>COG0282 ackA Acetate kinase [Energy production and conversion]
Probab=73.11 E-value=2.2 Score=35.39 Aligned_cols=37 Identities=24% Similarity=0.492 Sum_probs=33.7
Q ss_pred CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc
Q 043220 60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS 96 (121)
Q Consensus 60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~ 96 (121)
|-||.|++.++++.|+.|+...+-+.+.++||..+..
T Consensus 179 GtSh~YVs~~aa~~L~k~~~~l~~I~~HLGNGASicA 215 (396)
T COG0282 179 GTSHKYVSQRAAEILGKPLEDLNLITCHLGNGASICA 215 (396)
T ss_pred ccchHHHHHHHHHHhCCCccccCEEEEEecCchhhhh
Confidence 7899999999999999999888899999999987753
No 38
>PTZ00147 plasmepsin-1; Provisional
Probab=69.77 E-value=23 Score=29.68 Aligned_cols=22 Identities=14% Similarity=0.259 Sum_probs=18.9
Q ss_pred EEEEEecCCCcccccCHHHHhh
Q 043220 52 EVVILISSGSTANFVSDKVAAL 73 (121)
Q Consensus 52 ~v~~LIDSGAT~nFI~~~~a~r 73 (121)
...++||||.|..++....+.+
T Consensus 332 ~~~aIiDSGTsli~lP~~~~~a 353 (453)
T PTZ00147 332 KANVIVDSGTSVITVPTEFLNK 353 (453)
T ss_pred ceeEEECCCCchhcCCHHHHHH
Confidence 3569999999999999998764
No 39
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=66.02 E-value=7.6 Score=29.61 Aligned_cols=26 Identities=19% Similarity=0.263 Sum_probs=21.4
Q ss_pred EEEEECC--EEEEEEecCCCcccccCHH
Q 043220 44 VATKIRF--HEVVILISSGSTANFVSDK 69 (121)
Q Consensus 44 v~g~i~~--~~v~~LIDSGAT~nFI~~~ 69 (121)
+...|+. +++.+++||||+..++...
T Consensus 3 ~~i~vGtP~Q~~~v~~DTGS~~~wv~~~ 30 (278)
T cd06097 3 TPVKIGTPPQTLNLDLDTGSSDLWVFSS 30 (278)
T ss_pred eeEEECCCCcEEEEEEeCCCCceeEeeC
Confidence 4566776 8899999999999999754
No 40
>PRK10681 DNA-binding transcriptional repressor DeoR; Provisional
Probab=64.57 E-value=3.1 Score=31.93 Aligned_cols=51 Identities=12% Similarity=0.259 Sum_probs=33.0
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecch
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVK 104 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~ 104 (121)
.++||+|+|.-++.+.+....++.+.. .+.+.|.+.+|........+.|..
T Consensus 95 tIflD~GtT~~~la~~L~~~~~ltvvTnsl~i~~~l~~~~~~~villGG~~~~~~~~~~G~~ 156 (252)
T PRK10681 95 TLFFDCGTTTPWIIEAIDNELPFTAVCYSLNTFLALQEKPHCRAILCGGEFHASNAIFKPLD 156 (252)
T ss_pred EEEEECCccHHHHHHhcCCCCCeEEEECCHHHHHHHhhCCCCEEEEECcEEecCcceeeCHH
Confidence 478999999999888887643333321 234677777887654444455543
No 41
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=60.03 E-value=14 Score=28.87 Aligned_cols=28 Identities=14% Similarity=0.242 Sum_probs=22.7
Q ss_pred EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 42 CVVATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
..+...|+ .+++.++|||||+..++...
T Consensus 11 Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~ 40 (317)
T cd06098 11 YFGEIGIGTPPQKFTVIFDTGSSNLWVPSS 40 (317)
T ss_pred EEEEEEECCCCeEEEEEECCCccceEEecC
Confidence 45566776 68899999999999998754
No 42
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=59.01 E-value=8 Score=28.31 Aligned_cols=23 Identities=13% Similarity=0.104 Sum_probs=17.6
Q ss_pred EEEEEEecCCCcccccCHHHHhh
Q 043220 51 HEVVILISSGSTANFVSDKVAAL 73 (121)
Q Consensus 51 ~~v~~LIDSGAT~nFI~~~~a~r 73 (121)
..+.|=||+||..+-++..=..+
T Consensus 38 ~~~kAkiDTGA~TSsL~A~dI~~ 60 (162)
T COG4067 38 IQLKAKIDTGAVTSSLSASDIER 60 (162)
T ss_pred ceeeeeecccceeeeEEeeccee
Confidence 45788999999999888765543
No 43
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=58.34 E-value=14 Score=28.87 Aligned_cols=27 Identities=15% Similarity=0.186 Sum_probs=22.0
Q ss_pred EEEEEEEC--CEEEEEEecCCCcccccCH
Q 043220 42 CVVATKIR--FHEVVILISSGSTANFVSD 68 (121)
Q Consensus 42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~ 68 (121)
..+...|+ .+++.++|||||+..++..
T Consensus 9 y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~ 37 (326)
T cd05487 9 YYGEIGIGTPPQTFKVVFDTGSSNLWVPS 37 (326)
T ss_pred EEEEEEECCCCcEEEEEEeCCccceEEcc
Confidence 34566776 6889999999999999964
No 44
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=57.63 E-value=11 Score=28.93 Aligned_cols=20 Identities=20% Similarity=0.245 Sum_probs=17.2
Q ss_pred EEEecCCCcccccCHHHHhh
Q 043220 54 VILISSGSTANFVSDKVAAL 73 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~r 73 (121)
.++||||+|..++.+...++
T Consensus 173 ~~ivDSGTt~~~lp~~~~~~ 192 (299)
T cd05472 173 GVIIDSGTVITRLPPSAYAA 192 (299)
T ss_pred CeEEeCCCcceecCHHHHHH
Confidence 47999999999999877664
No 45
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=57.24 E-value=11 Score=29.21 Aligned_cols=29 Identities=7% Similarity=0.031 Sum_probs=21.7
Q ss_pred EEECCEEE------EEEecCCCcccccCHHHHhhc
Q 043220 46 TKIRFHEV------VILISSGSTANFVSDKVAALL 74 (121)
Q Consensus 46 g~i~~~~v------~~LIDSGAT~nFI~~~~a~rl 74 (121)
..|+++.+ .++||||++..++.....+++
T Consensus 186 i~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l 220 (316)
T cd05486 186 IQVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQL 220 (316)
T ss_pred EEEecceEecCCCCEEEECCCcchhhcCHHHHHHH
Confidence 35565543 589999999999998876543
No 46
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=57.14 E-value=14 Score=28.70 Aligned_cols=73 Identities=14% Similarity=0.058 Sum_probs=48.2
Q ss_pred EEECCEEE------EEEecCCCcccccCHHHHhhc----CC----------CCC---CCCCeEEEecCCceec-------
Q 043220 46 TKIRFHEV------VILISSGSTANFVSDKVAALL----PL----------PMM---PTKPFSVRIANGAHLK------- 95 (121)
Q Consensus 46 g~i~~~~v------~~LIDSGAT~nFI~~~~a~rl----~l----------~~~---~~~~~~V~va~G~~l~------- 95 (121)
..|+++.+ .++||||++..++......++ +. +.. ..+.+.+.. +|..+.
T Consensus 194 v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~P~~~f~f-~g~~~~i~~~~y~ 272 (317)
T cd05478 194 VTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSDIGASQNQNGEMVVNCSSISSMPDVVFTI-NGVQYPLPPSAYI 272 (317)
T ss_pred EEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHHhCCccccCCcEEeCCcCcccCCcEEEEE-CCEEEEECHHHhe
Confidence 46777655 589999999999998877653 21 111 112344555 343221
Q ss_pred ------c----------ceEeecchhhhhcCCeeeecccCC
Q 043220 96 ------S----------PDVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 96 ------~----------~dvILG~~WL~~~~p~~idw~~~~ 120 (121)
| ...|||-.+|+.+-.+ .||.++|
T Consensus 273 ~~~~~~C~~~~~~~~~~~~~IlG~~fl~~~y~v-fD~~~~~ 312 (317)
T cd05478 273 LQDQGSCTSGFQSMGLGELWILGDVFIRQYYSV-FDRANNK 312 (317)
T ss_pred ecCCCEEeEEEEeCCCCCeEEechHHhcceEEE-EeCCCCE
Confidence 3 2479999999999885 8988764
No 47
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=55.83 E-value=16 Score=28.36 Aligned_cols=72 Identities=14% Similarity=0.145 Sum_probs=45.7
Q ss_pred EECCEEE-------EEEecCCCcccccCHHHHhhc----CCCC----------CC---CCCeEEEecCCcee--------
Q 043220 47 KIRFHEV-------VILISSGSTANFVSDKVAALL----PLPM----------MP---TKPFSVRIANGAHL-------- 94 (121)
Q Consensus 47 ~i~~~~v-------~~LIDSGAT~nFI~~~~a~rl----~l~~----------~~---~~~~~V~va~G~~l-------- 94 (121)
.|++... .++||||.|..++.....+++ +... .. .+.+....+ |..+
T Consensus 189 ~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~~~~~~~~~~~~~~~C~~~~~~p~l~~~f~-g~~~~v~~~~y~ 267 (318)
T cd05477 189 QINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTLMQSIGAQQDQYGQYVVNCNNIQNLPTLTFTIN-GVSFPLPPSAYI 267 (318)
T ss_pred EECCEEecccCCCceeeECCCCccEECCHHHHHHHHHHhCCccccCCCEEEeCCccccCCcEEEEEC-CEEEEECHHHeE
Confidence 5666543 589999999999999887764 1111 10 123444442 2221
Q ss_pred -----cc---------------ceEeecchhhhhcCCeeeecccCC
Q 043220 95 -----KS---------------PDVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 95 -----~~---------------~dvILG~~WL~~~~p~~idw~~~~ 120 (121)
.| ...|||..+|+.+-.+ .|+.+.|
T Consensus 268 ~~~~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~v-fD~~~~~ 312 (318)
T cd05477 268 LQNNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSV-YDLGNNQ 312 (318)
T ss_pred ecCCCeEEEEEEecccCCCCCCceEEEcHHHhhheEEE-EeCCCCE
Confidence 23 1379999999999885 8888764
No 48
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=54.73 E-value=17 Score=27.68 Aligned_cols=25 Identities=20% Similarity=0.267 Sum_probs=20.6
Q ss_pred EEEEEEC--CEEEEEEecCCCcccccC
Q 043220 43 VVATKIR--FHEVVILISSGSTANFVS 67 (121)
Q Consensus 43 ~v~g~i~--~~~v~~LIDSGAT~nFI~ 67 (121)
.+...|+ .+.+.+++|+||+..++.
T Consensus 4 ~~~i~iGtP~q~~~v~~DTGS~~~Wv~ 30 (273)
T cd05475 4 YVTINIGNPPKPYFLDIDTGSDLTWLQ 30 (273)
T ss_pred EEEEEcCCCCeeEEEEEccCCCceEEe
Confidence 4556666 688999999999999995
No 49
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=54.38 E-value=18 Score=28.15 Aligned_cols=27 Identities=15% Similarity=0.226 Sum_probs=21.5
Q ss_pred EEEEEEECC--EEEEEEecCCCcccccCH
Q 043220 42 CVVATKIRF--HEVVILISSGSTANFVSD 68 (121)
Q Consensus 42 ~~v~g~i~~--~~v~~LIDSGAT~nFI~~ 68 (121)
..+...|+. +++.+++||||+..++..
T Consensus 7 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~ 35 (325)
T cd05490 7 YYGEIGIGTPPQTFTVVFDTGSSNLWVPS 35 (325)
T ss_pred EEEEEEECCCCcEEEEEEeCCCccEEEEc
Confidence 345667764 889999999999999954
No 50
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=53.75 E-value=15 Score=27.84 Aligned_cols=25 Identities=16% Similarity=0.284 Sum_probs=20.1
Q ss_pred EEEEEC--CEEEEEEecCCCcccccCH
Q 043220 44 VATKIR--FHEVVILISSGSTANFVSD 68 (121)
Q Consensus 44 v~g~i~--~~~v~~LIDSGAT~nFI~~ 68 (121)
+...|+ .+.+.+++||||++.++..
T Consensus 4 ~~i~iGtP~q~~~v~~DTGSs~~wv~~ 30 (265)
T cd05476 4 VTLSIGTPPQPFSLIVDTGSDLTWTQC 30 (265)
T ss_pred EEEecCCCCcceEEEecCCCCCEEEcC
Confidence 455666 5789999999999998863
No 51
>PRK09802 DNA-binding transcriptional regulator AgaR; Provisional
Probab=52.57 E-value=6.6 Score=30.51 Aligned_cols=53 Identities=9% Similarity=0.106 Sum_probs=34.0
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecchhh
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVKWS 106 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~WL 106 (121)
.+.||+|+|..++-+.+..+-++.+.. .+.+.|.+.+|........+.|..-+
T Consensus 109 ~Ifld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~v~llGG~~~~~~~~~~G~~a~ 172 (269)
T PRK09802 109 RVILDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAE 172 (269)
T ss_pred EEEECCchHHHHHHHhcCcCCCeEEEeCCHHHHHHHHhCCCCEEEEECCEEecCCCceECHHHH
Confidence 478999999988888876543333322 13467777888766555555555433
No 52
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=52.02 E-value=22 Score=27.62 Aligned_cols=28 Identities=14% Similarity=0.246 Sum_probs=22.1
Q ss_pred EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 42 CVVATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
..+...|+ .+++.++|||||+..++...
T Consensus 11 Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~ 40 (317)
T cd05478 11 YYGTISIGTPPQDFTVIFDTGSSNLWVPSV 40 (317)
T ss_pred EEEEEEeCCCCcEEEEEEeCCCccEEEecC
Confidence 34566666 57899999999999999754
No 53
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=51.80 E-value=26 Score=27.46 Aligned_cols=28 Identities=11% Similarity=0.133 Sum_probs=22.1
Q ss_pred EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 42 CVVATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
..+...|+ .+++.+++||||+..++...
T Consensus 4 Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~ 33 (326)
T cd06096 4 YFIDIFIGNPPQKQSLILDTGSSSLSFPCS 33 (326)
T ss_pred EEEEEEecCCCeEEEEEEeCCCCceEEecC
Confidence 34566776 58899999999999998653
No 54
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=51.04 E-value=24 Score=27.33 Aligned_cols=27 Identities=19% Similarity=0.288 Sum_probs=21.6
Q ss_pred EEEEEECC--EEEEEEecCCCcccccCHH
Q 043220 43 VVATKIRF--HEVVILISSGSTANFVSDK 69 (121)
Q Consensus 43 ~v~g~i~~--~~v~~LIDSGAT~nFI~~~ 69 (121)
.....|+. +++.++|||||+..++...
T Consensus 5 ~~~i~iGtP~q~~~v~~DTGS~~~wv~~~ 33 (318)
T cd05477 5 YGEISIGTPPQNFLVLFDTGSSNLWVPSV 33 (318)
T ss_pred EEEEEECCCCcEEEEEEeCCCccEEEccC
Confidence 34566664 8899999999999999754
No 55
>PRK10411 DNA-binding transcriptional activator FucR; Provisional
Probab=50.76 E-value=7 Score=29.86 Aligned_cols=55 Identities=11% Similarity=0.228 Sum_probs=34.6
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecchhhhhc
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVKWSEQI 109 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~WL~~~ 109 (121)
.+.||+|+|..++.+.+..+ ++.+.. .+.+.+.+.+|..-..+..+.|..-+...
T Consensus 96 ~Ifld~GtT~~~l~~~L~~~-~ltVvTNs~~ia~~l~~~~~~~vil~GG~~~~~~~~~~G~~a~~~l 161 (240)
T PRK10411 96 VIALDASSTCWYLARQLPDI-NIQVFTNSHPICQELGKRERIQLISSGGTLERKYGCYVNPSLISQL 161 (240)
T ss_pred EEEEcCcHHHHHHHHhhCCC-CeEEEeCCHHHHHHHhcCCCCEEEEECCEEeCCCCceECHHHHHHH
Confidence 47899999999888888765 444322 13456777777655544555555433333
No 56
>PRK10906 DNA-binding transcriptional repressor GlpR; Provisional
Probab=49.61 E-value=7.8 Score=29.83 Aligned_cols=49 Identities=16% Similarity=0.264 Sum_probs=31.1
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeec
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALD 102 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG 102 (121)
.+.||+|+|...+.+.+..+.++.+.. .+.+.|.+.+|..-.......|
T Consensus 94 tIflD~GtT~~~la~~L~~~~~ltVvTNsl~ia~~l~~~~~~~villGG~~~~~~~~~~G 153 (252)
T PRK10906 94 TLFIDIGTTPEAVAHALLNHSNLRIVTNNLNVANTLMAKEDFRIILAGGELRSRDGGIIG 153 (252)
T ss_pred EEEEcCcHHHHHHHHHhcCCCCcEEEECcHHHHHHHhhCCCCEEEEECCEEecCCCccCC
Confidence 478999999999888887654444322 1346777777764433333333
No 57
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=49.47 E-value=19 Score=27.64 Aligned_cols=24 Identities=17% Similarity=0.425 Sum_probs=19.4
Q ss_pred EEEEEC--CEEEEEEecCCCcccccC
Q 043220 44 VATKIR--FHEVVILISSGSTANFVS 67 (121)
Q Consensus 44 v~g~i~--~~~v~~LIDSGAT~nFI~ 67 (121)
+...|+ .+++.+++||||+..++.
T Consensus 4 ~~i~iGtP~q~~~v~~DTGSs~~Wv~ 29 (299)
T cd05472 4 VTVGLGTPARDQTVIVDTGSDLTWVQ 29 (299)
T ss_pred EEEecCCCCcceEEEecCCCCccccc
Confidence 455665 478999999999999984
No 58
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=49.47 E-value=24 Score=27.54 Aligned_cols=28 Identities=11% Similarity=0.258 Sum_probs=22.9
Q ss_pred EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 42 CVVATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
..+...|+ .+++.+++||||+..++...
T Consensus 11 Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~ 40 (320)
T cd05488 11 YFTDITLGTPPQKFKVILDTGSSNLWVPSV 40 (320)
T ss_pred EEEEEEECCCCcEEEEEEecCCcceEEEcC
Confidence 45667776 48999999999999999754
No 59
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=49.12 E-value=23 Score=28.21 Aligned_cols=27 Identities=22% Similarity=0.273 Sum_probs=21.4
Q ss_pred EEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 43 VVATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 43 ~v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
.+...|+ ++++.++|||||+..++...
T Consensus 5 ~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~ 33 (364)
T cd05473 5 YIEMLIGTPPQKLNILVDTGSSNFAVAAA 33 (364)
T ss_pred EEEEEecCCCceEEEEEecCCcceEEEcC
Confidence 4556666 58899999999999998653
No 60
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=48.93 E-value=25 Score=29.56 Aligned_cols=29 Identities=14% Similarity=0.305 Sum_probs=22.7
Q ss_pred EEEEEEC--CEEEEEEecCCCcccccCHHHH
Q 043220 43 VVATKIR--FHEVVILISSGSTANFVSDKVA 71 (121)
Q Consensus 43 ~v~g~i~--~~~v~~LIDSGAT~nFI~~~~a 71 (121)
.....|+ ++++.+++||||+..++...-+
T Consensus 140 y~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C 170 (450)
T PTZ00013 140 YGEGEVGDNHQKFMLIFDTGSANLWVPSKKC 170 (450)
T ss_pred EEEEEECCCCeEEEEEEeCCCCceEEecccC
Confidence 3455665 6899999999999999975543
No 61
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=48.59 E-value=40 Score=26.80 Aligned_cols=20 Identities=25% Similarity=0.207 Sum_probs=17.2
Q ss_pred EEEecCCCcccccCHHHHhh
Q 043220 54 VILISSGSTANFVSDKVAAL 73 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~r 73 (121)
.++||||+|..++.....+.
T Consensus 213 ~~ivDSGTs~~~lp~~~~~~ 232 (364)
T cd05473 213 KAIVDSGTTNLRLPVKVFNA 232 (364)
T ss_pred cEEEeCCCcceeCCHHHHHH
Confidence 48999999999999887664
No 62
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=48.37 E-value=23 Score=27.55 Aligned_cols=26 Identities=15% Similarity=0.247 Sum_probs=20.3
Q ss_pred EEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 44 VATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 44 v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
....|+ .+++.++|||||+..++...
T Consensus 3 ~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~ 30 (316)
T cd05486 3 GQISIGTPPQNFTVIFDTGSSNLWVPSI 30 (316)
T ss_pred EEEEECCCCcEEEEEEcCCCccEEEecC
Confidence 345555 58899999999999999643
No 63
>PTZ00147 plasmepsin-1; Provisional
Probab=48.11 E-value=27 Score=29.34 Aligned_cols=28 Identities=14% Similarity=0.182 Sum_probs=23.0
Q ss_pred EEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 42 CVVATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 42 ~~v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
......|+ ++++.+++||||+..++...
T Consensus 140 Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~ 169 (453)
T PTZ00147 140 SYGEAKLGDNGQKFNFIFDTGSANLWVPSI 169 (453)
T ss_pred EEEEEEECCCCeEEEEEEeCCCCcEEEeec
Confidence 34567777 78999999999999999754
No 64
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=47.20 E-value=27 Score=27.40 Aligned_cols=29 Identities=17% Similarity=0.198 Sum_probs=23.4
Q ss_pred eEEEEEEEC--CEEEEEEecCCCcccccCHH
Q 043220 41 FCVVATKIR--FHEVVILISSGSTANFVSDK 69 (121)
Q Consensus 41 ~~~v~g~i~--~~~v~~LIDSGAT~nFI~~~ 69 (121)
...+...|+ ++++.+++||||+..++...
T Consensus 11 ~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~ 41 (329)
T cd05485 11 QYYGVITIGTPPQSFKVVFDTGSSNLWVPSK 41 (329)
T ss_pred eEEEEEEECCCCcEEEEEEcCCCccEEEecC
Confidence 345677777 58899999999999999754
No 65
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=46.65 E-value=35 Score=26.65 Aligned_cols=35 Identities=17% Similarity=0.157 Sum_probs=24.4
Q ss_pred CeEEEEEEECCEE--EEEEecCCCcc---------cccCHHHHhhc
Q 043220 40 KFCVVATKIRFHE--VVILISSGSTA---------NFVSDKVAALL 74 (121)
Q Consensus 40 ~~~~v~g~i~~~~--v~~LIDSGAT~---------nFI~~~~a~rl 74 (121)
....++..++++. +.+|+|||... ..++.+.++++
T Consensus 156 ~~~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~ 201 (293)
T PF03419_consen 156 YLYPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL 201 (293)
T ss_pred EEEEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence 4567888888876 57899999854 35555555554
No 66
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=45.61 E-value=35 Score=26.83 Aligned_cols=25 Identities=4% Similarity=0.102 Sum_probs=19.4
Q ss_pred ccCeEEEEEEECCEE--EEEEecCCCc
Q 043220 38 LRKFCVVATKIRFHE--VVILISSGST 62 (121)
Q Consensus 38 ~~~~~~v~g~i~~~~--v~~LIDSGAT 62 (121)
....+.+...++|+. +.+|+|||..
T Consensus 155 ~~~~~~v~i~~~g~~~~~~alvDTGN~ 181 (288)
T TIGR02854 155 DKQIYELEICLDGKKVTIKGFLDTGNQ 181 (288)
T ss_pred hceEEEEEEEECCEEEEEEEEEecCCc
Confidence 345677888888876 6789999975
No 67
>PF04930 FUN14: FUN14 family; InterPro: IPR007014 This is a family of short proteins found in eukaryotes and some archaea. Although the function of these proteins is not known they may contain transmembrane helices.
Probab=42.49 E-value=7.8 Score=25.66 Aligned_cols=20 Identities=25% Similarity=0.346 Sum_probs=17.2
Q ss_pred eecchhhhhcCCeeeecccC
Q 043220 100 ALDVKWSEQIGSIMRGWKKD 119 (121)
Q Consensus 100 ILG~~WL~~~~p~~idw~~~ 119 (121)
++.+.||.+.|=+.+||++.
T Consensus 31 ~l~lq~l~~~G~i~Vnw~kl 50 (100)
T PF04930_consen 31 FLLLQYLASKGYIKVNWDKL 50 (100)
T ss_pred HHHHHHHHHCCeEEECHHHH
Confidence 56678999999999999864
No 68
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=42.44 E-value=59 Score=21.06 Aligned_cols=44 Identities=11% Similarity=0.120 Sum_probs=32.8
Q ss_pred EEEEEEECCEEEEEEecCCCcc--cccCHHHHhhcCCCCCCCCCeEEEecC
Q 043220 42 CVVATKIRFHEVVILISSGSTA--NFVSDKVAALLPLPMMPTKPFSVRIAN 90 (121)
Q Consensus 42 ~~v~g~i~~~~v~~LIDSGAT~--nFI~~~~a~rl~l~~~~~~~~~V~va~ 90 (121)
+++++..+|..+.+.++..++. .=+...+++++++. ++.++-.+
T Consensus 1 V~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~-----~f~lKYlD 46 (81)
T cd06396 1 VNLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN-----DIQIKYVD 46 (81)
T ss_pred CEEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC-----cceeEEEc
Confidence 4788999999999999996644 44667788888877 45555443
No 69
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=42.40 E-value=19 Score=27.98 Aligned_cols=22 Identities=9% Similarity=0.069 Sum_probs=18.8
Q ss_pred EEEEecCCCcccccCHHHHhhc
Q 043220 53 VVILISSGSTANFVSDKVAALL 74 (121)
Q Consensus 53 v~~LIDSGAT~nFI~~~~a~rl 74 (121)
..++||||+|..++....++.+
T Consensus 207 ~~aiiDSGTt~~~~p~~~~~~l 228 (325)
T cd05490 207 CEAIVDTGTSLITGPVEEVRAL 228 (325)
T ss_pred CEEEECCCCccccCCHHHHHHH
Confidence 4689999999999998877654
No 70
>PRK10434 srlR DNA-bindng transcriptional repressor SrlR; Provisional
Probab=39.75 E-value=15 Score=28.23 Aligned_cols=50 Identities=14% Similarity=0.176 Sum_probs=31.4
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCC-eEEEecCCceeccceEeecc
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKP-FSVRIANGAHLKSPDVALDV 103 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~-~~V~va~G~~l~~~dvILG~ 103 (121)
.++||+|+|...+.+.+....++.+.. ..+ +.|.+.+|........+.|.
T Consensus 94 tIfld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~~v~l~GG~~~~~~~~~~G~ 155 (256)
T PRK10434 94 SIILDAGSTVLQMVPLLSRFNNITVMTNSLHIVNALSELDNEQTILMPGGTFRKKSASFHGQ 155 (256)
T ss_pred EEEEcCcHHHHHHHHHhccCCCeEEEECCHHHHHHHhhCCCCCEEEEECCEEeCCCCeEECH
Confidence 478999999998888887653333221 123 57888888755433444443
No 71
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=39.19 E-value=35 Score=25.33 Aligned_cols=28 Identities=14% Similarity=0.284 Sum_probs=21.0
Q ss_pred EEEECC--EEEEEEecCCCcccccCHHHHh
Q 043220 45 ATKIRF--HEVVILISSGSTANFVSDKVAA 72 (121)
Q Consensus 45 ~g~i~~--~~v~~LIDSGAT~nFI~~~~a~ 72 (121)
...|+. +++.++||+||+..++...-..
T Consensus 4 ~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~ 33 (283)
T cd05471 4 EITIGTPPQKFSVIFDTGSSLLWVPSSNCT 33 (283)
T ss_pred EEEECCCCcEEEEEEeCCCCCEEEecCCCC
Confidence 445543 6889999999999998766443
No 72
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=38.73 E-value=22 Score=27.72 Aligned_cols=22 Identities=18% Similarity=0.277 Sum_probs=18.7
Q ss_pred EEEEecCCCcccccCHHHHhhc
Q 043220 53 VVILISSGSTANFVSDKVAALL 74 (121)
Q Consensus 53 v~~LIDSGAT~nFI~~~~a~rl 74 (121)
..++||||+|..++...+++.+
T Consensus 206 ~~~ivDSGtt~~~lp~~~~~~l 227 (320)
T cd05488 206 TGAAIDTGTSLIALPSDLAEML 227 (320)
T ss_pred CeEEEcCCcccccCCHHHHHHH
Confidence 3589999999999999987753
No 73
>PF14543 TAXi_N: Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=35.31 E-value=60 Score=22.97 Aligned_cols=23 Identities=22% Similarity=0.436 Sum_probs=17.3
Q ss_pred EEEEECC--EEEEEEecCCCccccc
Q 043220 44 VATKIRF--HEVVILISSGSTANFV 66 (121)
Q Consensus 44 v~g~i~~--~~v~~LIDSGAT~nFI 66 (121)
+...|+. +++.+.||+|+...++
T Consensus 3 ~~~~iGtP~~~~~lvvDtgs~l~W~ 27 (164)
T PF14543_consen 3 VSVSIGTPPQPFSLVVDTGSDLTWV 27 (164)
T ss_dssp EEEECTCTTEEEEEEEETT-SSEEE
T ss_pred EEEEeCCCCceEEEEEECCCCceEE
Confidence 4555655 7889999999998886
No 74
>PTZ00165 aspartyl protease; Provisional
Probab=35.13 E-value=58 Score=27.59 Aligned_cols=31 Identities=13% Similarity=0.274 Sum_probs=24.4
Q ss_pred EEEEEEECC--EEEEEEecCCCcccccCHHHHh
Q 043220 42 CVVATKIRF--HEVVILISSGSTANFVSDKVAA 72 (121)
Q Consensus 42 ~~v~g~i~~--~~v~~LIDSGAT~nFI~~~~a~ 72 (121)
......|+. +++.+++||||+..++...-+.
T Consensus 121 Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~ 153 (482)
T PTZ00165 121 YFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECK 153 (482)
T ss_pred EEEEEEeCCCCceEEEEEeCCCCCEEEEchhcC
Confidence 445667776 8999999999999999765443
No 75
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=34.59 E-value=29 Score=27.08 Aligned_cols=22 Identities=5% Similarity=-0.035 Sum_probs=18.2
Q ss_pred eEeecchhhhhcCCeeeecccCC
Q 043220 98 DVALDVKWSEQIGSIMRGWKKDE 120 (121)
Q Consensus 98 dvILG~~WL~~~~p~~idw~~~~ 120 (121)
..|||-.+|+.+-.+ .||.++|
T Consensus 299 ~~ilG~~flr~~y~v-fD~~~~~ 320 (326)
T cd05487 299 LWVLGATFIRKFYTE-FDRQNNR 320 (326)
T ss_pred eEEEehHHhhccEEE-EeCCCCE
Confidence 369999999999875 8888765
No 76
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=34.53 E-value=29 Score=26.32 Aligned_cols=22 Identities=18% Similarity=0.477 Sum_probs=19.4
Q ss_pred EEEEecCCCcccccCHHHHhhc
Q 043220 53 VVILISSGSTANFVSDKVAALL 74 (121)
Q Consensus 53 v~~LIDSGAT~nFI~~~~a~rl 74 (121)
..++||||++..++...+.+++
T Consensus 179 ~~~iiDSGt~~~~lP~~~~~~l 200 (295)
T cd05474 179 LPALLDSGTTLTYLPSDIVDAI 200 (295)
T ss_pred ccEEECCCCccEeCCHHHHHHH
Confidence 5789999999999999988764
No 77
>PF04126 Cyclophil_like: Cyclophilin-like; InterPro: IPR007256 Proteins of this family have no known function.; PDB: 2KA0_A 1ZX8_C 2NNZ_A.
Probab=31.34 E-value=33 Score=23.51 Aligned_cols=30 Identities=30% Similarity=0.270 Sum_probs=21.3
Q ss_pred EEEEEEECCEEEEE-EecCCCcccccCHHHHhhcCC
Q 043220 42 CVVATKIRFHEVVI-LISSGSTANFVSDKVAALLPL 76 (121)
Q Consensus 42 ~~v~g~i~~~~v~~-LIDSGAT~nFI~~~~a~rl~l 76 (121)
|+++..++++.+.+ |.|+- -.++++++|.+
T Consensus 1 mkI~i~i~~~~~~a~L~d~~-----ta~~~~~~LPl 31 (120)
T PF04126_consen 1 MKIKITIGGQEIEAELNDSP-----TARAFAAQLPL 31 (120)
T ss_dssp EEEEEEETTEEEEEEEETTH-----HHHHHHHC-SE
T ss_pred CeEEEEECCEEEEEEECCCH-----HHHHHHHhCCe
Confidence 78999999999887 88883 34555666543
No 78
>PRK13509 transcriptional repressor UlaR; Provisional
Probab=29.90 E-value=26 Score=26.86 Aligned_cols=49 Identities=27% Similarity=0.378 Sum_probs=29.4
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecc
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDV 103 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~ 103 (121)
.+.||+|+|.-.+.+.+..+ ++.+.. .+.+.+.+.+|..-.....+.|.
T Consensus 96 ~Ifld~GsT~~~la~~L~~~-~ltVvTnsl~ia~~l~~~~~~~v~l~GG~~~~~~~~~~G~ 155 (251)
T PRK13509 96 SVVINCGSTAFLLGRELCGK-PVQIITNYLPLANYLIDQEHDSVIIMGGQYNKSQSITLSP 155 (251)
T ss_pred EEEECCcHHHHHHHHHhCCC-CeEEEeCCHHHHHHHHhCCCCEEEEECCeEcCCcceeECH
Confidence 57899999998777766543 333222 13456777777654434444444
No 79
>COG1349 GlpR Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]
Probab=28.81 E-value=35 Score=26.20 Aligned_cols=51 Identities=14% Similarity=0.165 Sum_probs=33.8
Q ss_pred EEEecCCCcccccCHHHHhhcCCCCCC-----------CCCeEEEecCCceeccceEeecch
Q 043220 54 VILISSGSTANFVSDKVAALLPLPMMP-----------TKPFSVRIANGAHLKSPDVALDVK 104 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~rl~l~~~~-----------~~~~~V~va~G~~l~~~dvILG~~ 104 (121)
.+.+|+|+|.-++-..+....++.+.. .+...+.+.+|..-.....+.|..
T Consensus 94 ~ifld~GTT~~~la~~L~~~~~ltviTNsl~ia~~l~~~~~~~vi~~GG~~~~~~~~~~G~~ 155 (253)
T COG1349 94 TIFLDAGTTTLALARALPDDNNLTVITNSLNIAAALLEKPNIEVILLGGTVRKKSGSFVGPL 155 (253)
T ss_pred EEEECCCcHHHHHHHHhCcCCCeEEEeCCHHHHHHHHhCCCCeEEEeCcEEEcCCCeEEcHH
Confidence 468999999999888888665544322 134567677776655555555553
No 80
>PLN03146 aspartyl protease family protein; Provisional
Probab=28.79 E-value=59 Score=26.89 Aligned_cols=17 Identities=29% Similarity=0.520 Sum_probs=15.2
Q ss_pred EEEecCCCcccccCHHH
Q 043220 54 VILISSGSTANFVSDKV 70 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~ 70 (121)
.++||||+|..++.+..
T Consensus 309 ~~iiDSGTt~t~Lp~~~ 325 (431)
T PLN03146 309 NIIIDSGTTLTLLPSDF 325 (431)
T ss_pred cEEEeCCccceecCHHH
Confidence 47999999999999985
No 81
>PLN03146 aspartyl protease family protein; Provisional
Probab=27.60 E-value=78 Score=26.18 Aligned_cols=27 Identities=11% Similarity=0.145 Sum_probs=22.4
Q ss_pred eEEEEEEEC--CEEEEEEecCCCcccccC
Q 043220 41 FCVVATKIR--FHEVVILISSGSTANFVS 67 (121)
Q Consensus 41 ~~~v~g~i~--~~~v~~LIDSGAT~nFI~ 67 (121)
...+...|+ .+++.+++||||...++.
T Consensus 84 ~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~ 112 (431)
T PLN03146 84 EYLMNISIGTPPVPILAIADTGSDLIWTQ 112 (431)
T ss_pred cEEEEEEcCCCCceEEEEECCCCCcceEc
Confidence 566777887 478899999999999985
No 82
>PF08922 DUF1905: Domain of unknown function (DUF1905); InterPro: IPR015018 This family consist of hypothetical bacterial proteins. ; PDB: 2D9R_A.
Probab=26.93 E-value=1.1e+02 Score=19.26 Aligned_cols=45 Identities=18% Similarity=0.007 Sum_probs=24.8
Q ss_pred CeEEEEEEECCEEEEE-EecC--CCcccccCHHHHhhcCCCCCCCCCeEE
Q 043220 40 KFCVVATKIRFHEVVI-LISS--GSTANFVSDKVAALLPLPMMPTKPFSV 86 (121)
Q Consensus 40 ~~~~v~g~i~~~~v~~-LIDS--GAT~nFI~~~~a~rl~l~~~~~~~~~V 86 (121)
+.++|.+.|++++... |+=. |.-.-.++..+.+..++ .....+.|
T Consensus 31 g~v~V~~tI~g~~~~~sl~p~g~G~~~Lpv~~~vRk~~g~--~~Gd~V~v 78 (80)
T PF08922_consen 31 GRVPVRGTIDGHPWRTSLFPMGNGGYILPVKAAVRKAIGK--EAGDTVEV 78 (80)
T ss_dssp S-EEEEEEETTEEEEEEEEESSTT-EEEEE-HHHHHHHT----TTSEEEE
T ss_pred CceEEEEEECCEEEEEEEEECCCCCEEEEEcHHHHHHcCC--CCCCEEEE
Confidence 7899999999988653 3333 44444555666665554 33444444
No 83
>PF05618 Zn_protease: Putative ATP-dependant zinc protease; InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=26.82 E-value=50 Score=23.28 Aligned_cols=23 Identities=9% Similarity=0.024 Sum_probs=15.4
Q ss_pred CEEEEEEecCCCcccccCHHHHh
Q 043220 50 FHEVVILISSGSTANFVSDKVAA 72 (121)
Q Consensus 50 ~~~v~~LIDSGAT~nFI~~~~a~ 72 (121)
+..+.|=|||||..|-|+..=.+
T Consensus 14 ~~~~~aKiDTGA~tSSLhA~~I~ 36 (138)
T PF05618_consen 14 GLTIKAKIDTGAKTSSLHATDIE 36 (138)
T ss_dssp TEEEEEEE-TT-SSEEEE-EEEE
T ss_pred CCEEEEEEcCCCcccceeecceE
Confidence 44578899999999999765544
No 84
>COG2861 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.41 E-value=55 Score=25.68 Aligned_cols=28 Identities=25% Similarity=0.475 Sum_probs=24.9
Q ss_pred EEEEEecCCCcccccCHHHHhhcCCCCC
Q 043220 52 EVVILISSGSTANFVSDKVAALLPLPMM 79 (121)
Q Consensus 52 ~v~~LIDSGAT~nFI~~~~a~rl~l~~~ 79 (121)
.-..+||||.+.+=+..++|+++++|..
T Consensus 150 r~l~flDs~T~a~S~a~~iAk~~gVp~~ 177 (250)
T COG2861 150 RGLYFLDSGTIANSLAGKIAKEIGVPVI 177 (250)
T ss_pred CCeEEEcccccccchhhhhHhhcCCcee
Confidence 4568999999999999999999998864
No 85
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=25.74 E-value=70 Score=25.74 Aligned_cols=20 Identities=5% Similarity=0.004 Sum_probs=15.7
Q ss_pred EEEecCCCcccccCHHHHhh
Q 043220 54 VILISSGSTANFVSDKVAAL 73 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~r 73 (121)
.++||||.+..++...+.+.
T Consensus 231 g~iiDSGTs~t~lp~~~y~~ 250 (362)
T cd05489 231 GVKLSTVVPYTVLRSDIYRA 250 (362)
T ss_pred cEEEecCCceEEECHHHHHH
Confidence 48999999999977765443
No 86
>PRK12440 acetate kinase; Reviewed
Probab=25.57 E-value=31 Score=28.75 Aligned_cols=44 Identities=30% Similarity=0.474 Sum_probs=35.0
Q ss_pred CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------ceEeecc
Q 043220 60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------PDVALDV 103 (121)
Q Consensus 60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------~dvILG~ 103 (121)
|-+|-|+..++++.++.+....+-+.+.++||..+.. .|.-.|+
T Consensus 180 GlS~~~v~~~~a~~lg~~~~~~~~Iv~HLG~G~Si~Ai~~GksvDtsmG~ 229 (397)
T PRK12440 180 GTSHYFVSREAAKMLNKPIEESSFISVHLGNGASVCAIKNGQSVDTSMGF 229 (397)
T ss_pred HHhHHHHHHHHHHHhCCChHHcCEEEEEeCCCcEeeeeeCCEEEEcCCCC
Confidence 5688889999999888777777778899999988754 6666665
No 87
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates. This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP). This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=24.47 E-value=1.4e+02 Score=18.44 Aligned_cols=38 Identities=16% Similarity=-0.052 Sum_probs=32.8
Q ss_pred EEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC
Q 043220 42 CVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM 79 (121)
Q Consensus 42 ~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~ 79 (121)
|++..+.+|+...+=++..+|..=+-+.+..+.++|..
T Consensus 1 ~~i~vk~~g~~~~v~v~~~~Tv~~lK~~i~~~tgvp~~ 38 (74)
T cd01813 1 VPVIVKWGGQEYSVTTLSEDTVLDLKQFIKTLTGVLPE 38 (74)
T ss_pred CEEEEEECCEEEEEEECCCCCHHHHHHHHHHHHCCCHH
Confidence 46788889999988999999999999999999887764
No 88
>smart00295 B41 Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Probab=23.98 E-value=2.1e+02 Score=20.13 Aligned_cols=30 Identities=17% Similarity=0.089 Sum_probs=27.0
Q ss_pred CCEEEEEEecCCCcccccCHHHHhhcCCCC
Q 043220 49 RFHEVVILISSGSTANFVSDKVAALLPLPM 78 (121)
Q Consensus 49 ~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~ 78 (121)
+|....+-||+-.|..-+-..+++++|++.
T Consensus 12 dg~~~~~~~~~~~t~~ev~~~v~~~~~l~~ 41 (207)
T smart00295 12 DGTTLEFEVDSSTTAEELLETVCRKLGIRE 41 (207)
T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHhCCCc
Confidence 677888999999999999999999999954
No 89
>PF14541 TAXi_C: Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=23.69 E-value=50 Score=23.11 Aligned_cols=19 Identities=26% Similarity=0.377 Sum_probs=14.3
Q ss_pred EEEecCCCcccccCHHHHh
Q 043220 54 VILISSGSTANFVSDKVAA 72 (121)
Q Consensus 54 ~~LIDSGAT~nFI~~~~a~ 72 (121)
.+.||||++..++.+.+=.
T Consensus 31 ~~iiDSGT~~T~L~~~~y~ 49 (161)
T PF14541_consen 31 GTIIDSGTTYTYLPPPVYD 49 (161)
T ss_dssp SEEE-SSSSSEEEEHHHHH
T ss_pred CEEEECCCCccCCcHHHHH
Confidence 3679999999999877643
No 90
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=23.64 E-value=1.3e+02 Score=19.27 Aligned_cols=46 Identities=9% Similarity=0.091 Sum_probs=35.7
Q ss_pred EEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCCCCCCeEEEec
Q 043220 42 CVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIA 89 (121)
Q Consensus 42 ~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va 89 (121)
|++++..|+--+.+-+.+..+..=+-.++++++++. ...++.++--
T Consensus 1 ~~vK~~~~~d~~r~~l~~~~~~~~L~~~i~~r~~~~--~~~~f~LkY~ 46 (82)
T cd06407 1 VRVKATYGEEKIRFRLPPSWGFTELKQEIAKRFKLD--DMSAFDLKYL 46 (82)
T ss_pred CEEEEEeCCeEEEEEcCCCCCHHHHHHHHHHHhCCC--CCCeeEEEEE
Confidence 578888999989999999988888899999999864 1235555543
No 91
>PF04746 DUF575: Protein of unknown function (DUF575); InterPro: IPR006835 This represents a conserved region found in a number of Chlamydophila pneumoniae proteins.
Probab=23.30 E-value=38 Score=22.76 Aligned_cols=13 Identities=8% Similarity=0.442 Sum_probs=10.5
Q ss_pred ceEeecchhhhhc
Q 043220 97 PDVALDVKWSEQI 109 (121)
Q Consensus 97 ~dvILG~~WL~~~ 109 (121)
-.++.|++||-.+
T Consensus 27 Ghiv~GieWLvS~ 39 (101)
T PF04746_consen 27 GHIVMGIEWLVSR 39 (101)
T ss_pred ceEEeehHHHHHH
Confidence 4678999999765
No 92
>PF00564 PB1: PB1 domain; InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=23.22 E-value=1.8e+02 Score=17.56 Aligned_cols=46 Identities=13% Similarity=0.078 Sum_probs=35.4
Q ss_pred eEEEEEEECCEEEE-EEecCCCcccccCHHHHhhcCCCCCCCCCeEEEec
Q 043220 41 FCVVATKIRFHEVV-ILISSGSTANFVSDKVAALLPLPMMPTKPFSVRIA 89 (121)
Q Consensus 41 ~~~v~g~i~~~~v~-~LIDSGAT~nFI~~~~a~rl~l~~~~~~~~~V~va 89 (121)
|++++...++.... +-+.++.+..-+-..++.+++++ ...+.++-.
T Consensus 1 t~~vK~~~~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~---~~~~~l~Y~ 47 (84)
T PF00564_consen 1 TVRVKVRYGGDIRRIISLPSDVSFDDLRSKIREKFGLL---DEDFQLKYK 47 (84)
T ss_dssp SEEEEEEETTEEEEEEEECSTSHHHHHHHHHHHHHTTS---TSSEEEEEE
T ss_pred CEEEEEEECCeeEEEEEcCCCCCHHHHHHHHHHHhCCC---CccEEEEee
Confidence 57888889888777 78888888888889999999887 344554443
No 93
>PRK12379 propionate/acetate kinase; Provisional
Probab=22.53 E-value=37 Score=28.29 Aligned_cols=45 Identities=22% Similarity=0.386 Sum_probs=33.9
Q ss_pred CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------ceEeecch
Q 043220 60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------PDVALDVK 104 (121)
Q Consensus 60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------~dvILG~~ 104 (121)
|-+|-||..++++.++.+....+-+.+.+++|..+.. .|.-.|+.
T Consensus 176 GlS~~~va~~~a~~lg~~~~~~~lIv~HLG~G~Si~Ai~~GksvDtsmG~t 226 (396)
T PRK12379 176 GTSHRYVSQRAHSLLNLDEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMT 226 (396)
T ss_pred HHHHHHHHHHHHHHhCCChhHCCEEEEEeCCCcchheeeCCEEEEeCCCCC
Confidence 5577788888888887776666778889999987754 77766664
No 94
>PF03539 Spuma_A9PTase: Spumavirus aspartic protease (A9); InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=22.12 E-value=83 Score=22.99 Aligned_cols=26 Identities=12% Similarity=0.079 Sum_probs=20.4
Q ss_pred ECCEEEEEEecCCCcccccCHHHHhh
Q 043220 48 IRFHEVVILISSGSTANFVSDKVAAL 73 (121)
Q Consensus 48 i~~~~v~~LIDSGAT~nFI~~~~a~r 73 (121)
|.|..+.+--||||+...|.+.+...
T Consensus 1 ikg~~l~~~wDsga~ITCiP~~fl~~ 26 (163)
T PF03539_consen 1 IKGTKLKGHWDSGAQITCIPESFLEE 26 (163)
T ss_dssp ETTEEEEEEE-TT-SSEEEEGGGTTT
T ss_pred CCCceeeEEecCCCeEEEccHHHhCc
Confidence 46788899999999999999988664
No 95
>cd01812 BAG1_N Ubiquitin-like domain of BAG1. BAG1_N N-terminal ubiquitin-like (Ubl) domain of the BAG1 protein. This domain occurs together with the BAG domain and is closely related to the Ubl domain of a family of deubiquitinases that includes Rpn11, UBP6 (USP14), USP7 (HAUSP).
Probab=20.67 E-value=1.8e+02 Score=16.95 Aligned_cols=38 Identities=18% Similarity=-0.023 Sum_probs=30.1
Q ss_pred EEEEEEECCEEEEEEecCCCcccccCHHHHhhcCCCCC
Q 043220 42 CVVATKIRFHEVVILISSGSTANFVSDKVAALLPLPMM 79 (121)
Q Consensus 42 ~~v~g~i~~~~v~~LIDSGAT~nFI~~~~a~rl~l~~~ 79 (121)
|++..+-+|+...+-++..+|..=+-.+++.+.|++..
T Consensus 1 i~i~vk~~g~~~~i~v~~~~tv~~lK~~i~~~~gi~~~ 38 (71)
T cd01812 1 IRVRVKHGGESHDLSISSQATFGDLKKMLAPVTGVEPR 38 (71)
T ss_pred CEEEEEECCEEEEEEECCCCcHHHHHHHHHHhhCCChH
Confidence 35666667888888899999999888999888887653
No 96
>PRK12397 propionate kinase; Reviewed
Probab=20.62 E-value=38 Score=28.31 Aligned_cols=45 Identities=18% Similarity=0.415 Sum_probs=33.1
Q ss_pred CCcccccCHHHHhhcCCCCCCCCCeEEEecCCceecc------ceEeecch
Q 043220 60 GSTANFVSDKVAALLPLPMMPTKPFSVRIANGAHLKS------PDVALDVK 104 (121)
Q Consensus 60 GAT~nFI~~~~a~rl~l~~~~~~~~~V~va~G~~l~~------~dvILG~~ 104 (121)
|-+|-|+..++++.++.+....+-+.+.++||..+.. .|.-.|+.
T Consensus 180 GlS~~yva~~~a~~lg~~~~~~~lIv~HLG~GaSi~Ai~~GksvDtsmG~t 230 (404)
T PRK12397 180 GTSHKYVSGVLAEKLGVPLSALRVICCHLGNGSSICAIKNGRSVNTSMGFT 230 (404)
T ss_pred HHHHHHHHHHHHHHhCCChhHCCEEEEEeCCCcchheeeCCEEEEcCCCCC
Confidence 4567777777887777666666678889999987754 77777664
No 97
>COG3865 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.38 E-value=2.4e+02 Score=20.43 Aligned_cols=25 Identities=16% Similarity=0.309 Sum_probs=20.3
Q ss_pred CeEEEEEEECCEEEEEEecCCCcccc
Q 043220 40 KFCVVATKIRFHEVVILISSGSTANF 65 (121)
Q Consensus 40 ~~~~v~g~i~~~~v~~LIDSGAT~nF 65 (121)
+.+....+++|+.+.+ +|+|..++|
T Consensus 49 ~Vl~a~F~l~g~~f~~-ld~g~~~~f 73 (151)
T COG3865 49 KVLVAEFTLNGQSFMA-LDGGPNTSF 73 (151)
T ss_pred cEEEEEEEECCeEEEE-EcCCCCcCC
Confidence 4577899999999866 599988776
Done!