Query psy3214
Match_columns 336
No_of_seqs 330 out of 2011
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 19:14:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3214.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3214hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PTZ00165 aspartyl protease; Pr 100.0 1.9E-33 4E-38 277.1 17.1 201 16-224 117-352 (482)
2 cd05490 Cathepsin_D2 Cathepsin 100.0 4.7E-33 1E-37 263.2 17.3 202 17-224 4-231 (325)
3 cd05486 Cathespin_E Cathepsin 100.0 1.1E-32 2.5E-37 259.7 16.8 198 20-223 1-222 (316)
4 cd06098 phytepsin Phytepsin, a 100.0 3.2E-32 6.9E-37 256.8 17.2 171 17-193 8-185 (317)
5 cd05487 renin_like Renin stimu 100.0 2.2E-31 4.8E-36 252.0 16.6 201 17-224 6-232 (326)
6 cd05478 pepsin_A Pepsin A, asp 100.0 2.9E-31 6.4E-36 250.2 16.8 200 17-224 8-231 (317)
7 cd05477 gastricsin Gastricsins 100.0 3.6E-31 7.7E-36 249.7 16.8 199 19-224 3-226 (318)
8 cd05485 Cathepsin_D_like Cathe 100.0 4E-31 8.6E-36 250.6 16.5 203 16-224 8-235 (329)
9 PTZ00147 plasmepsin-1; Provisi 100.0 8.5E-31 1.8E-35 256.4 16.9 201 15-224 135-357 (453)
10 cd05488 Proteinase_A_fungi Fun 100.0 2.4E-30 5.2E-35 244.3 17.4 200 17-224 8-230 (320)
11 PTZ00013 plasmepsin 4 (PM4); P 100.0 4.4E-30 9.5E-35 251.0 18.3 199 16-223 135-355 (450)
12 PF03723 Hemocyanin_C: Hemocya 100.0 6.5E-31 1.4E-35 237.4 7.3 101 227-336 6-110 (250)
13 cd06097 Aspergillopepsin_like 100.0 2.8E-29 6.1E-34 232.4 16.4 192 20-223 1-222 (278)
14 PF00026 Asp: Eukaryotic aspar 100.0 3.5E-28 7.5E-33 228.2 12.8 198 19-224 1-224 (317)
15 cd05473 beta_secretase_like Be 100.0 1.1E-27 2.5E-32 229.9 15.7 201 17-225 1-237 (364)
16 cd05471 pepsin_like Pepsin-lik 99.9 2.7E-25 5.8E-30 205.0 17.3 168 20-194 1-176 (283)
17 cd06096 Plasmepsin_5 Plasmepsi 99.9 1.2E-24 2.6E-29 205.9 15.2 193 19-225 3-257 (326)
18 KOG1339|consensus 99.9 3.9E-25 8.4E-30 214.7 12.0 195 18-226 45-295 (398)
19 cd05474 SAP_like SAPs, pepsin- 99.9 6.9E-24 1.5E-28 197.6 14.9 170 19-225 2-204 (295)
20 cd05476 pepsin_A_like_plant Ch 99.9 1.9E-22 4.2E-27 185.4 12.3 137 19-194 1-140 (265)
21 cd05472 cnd41_like Chloroplast 99.9 4.7E-22 1E-26 185.9 12.8 169 19-225 1-197 (299)
22 cd05475 nucellin_like Nucellin 99.8 1.2E-20 2.5E-25 174.4 13.9 143 19-193 2-151 (273)
23 PLN03146 aspartyl protease fam 99.8 1.6E-20 3.4E-25 183.9 13.5 192 17-226 82-334 (431)
24 cd05470 pepsin_retropepsin_lik 99.8 8.2E-19 1.8E-23 139.5 10.6 101 23-127 2-109 (109)
25 PF14543 TAXi_N: Xylanase inhi 99.7 4.8E-16 1E-20 132.9 12.0 136 20-177 1-164 (164)
26 cd05489 xylanase_inhibitor_I_l 99.6 1E-15 2.2E-20 146.8 11.2 145 37-193 10-192 (362)
27 cd05483 retropepsin_like_bacte 97.9 2E-05 4.4E-10 60.2 4.7 92 19-129 2-94 (96)
28 cd06098 phytepsin Phytepsin, a 97.4 0.0002 4.4E-09 67.5 5.1 61 1-61 244-304 (317)
29 cd05490 Cathepsin_D2 Cathepsin 97.4 0.00022 4.8E-09 67.4 5.1 61 1-61 252-312 (325)
30 cd05486 Cathespin_E Cathepsin 97.2 0.00037 8.1E-09 65.6 5.0 61 1-61 243-303 (316)
31 PTZ00165 aspartyl protease; Pr 97.1 0.00048 1E-08 68.7 4.8 61 1-61 365-432 (482)
32 cd05487 renin_like Renin stimu 97.1 0.00066 1.4E-08 64.3 4.8 61 1-61 252-312 (326)
33 cd05485 Cathepsin_D_like Cathe 97.0 0.00098 2.1E-08 63.2 5.0 60 1-60 256-315 (329)
34 PTZ00147 plasmepsin-1; Provisi 96.2 0.005 1.1E-07 61.0 4.2 58 1-61 378-435 (453)
35 cd05488 Proteinase_A_fungi Fun 96.1 0.006 1.3E-07 57.6 4.2 56 1-60 251-306 (320)
36 PTZ00013 plasmepsin 4 (PM4); P 95.8 0.0092 2E-07 59.1 4.1 58 1-61 377-434 (450)
37 PF00026 Asp: Eukaryotic aspar 95.7 0.0084 1.8E-07 55.9 3.2 62 1-63 244-305 (317)
38 PF13650 Asp_protease_2: Aspar 95.5 0.068 1.5E-06 39.8 7.1 78 38-128 11-89 (90)
39 cd05478 pepsin_A Pepsin A, asp 95.2 0.019 4E-07 54.1 3.7 53 1-61 252-304 (317)
40 cd05477 gastricsin Gastricsins 94.6 0.04 8.7E-07 51.8 4.2 57 1-61 247-304 (318)
41 TIGR02281 clan_AA_DTGA clan AA 93.7 0.15 3.2E-06 41.1 5.4 93 18-129 10-103 (121)
42 cd05489 xylanase_inhibitor_I_l 93.6 0.11 2.5E-06 49.9 5.3 55 1-60 290-346 (362)
43 cd05474 SAP_like SAPs, pepsin- 93.5 0.068 1.5E-06 49.5 3.5 54 2-60 225-280 (295)
44 KOG1339|consensus 92.6 0.22 4.8E-06 48.5 5.8 55 1-60 321-376 (398)
45 cd05475 nucellin_like Nucellin 91.9 0.34 7.4E-06 44.5 5.8 56 1-60 197-256 (273)
46 cd05479 RP_DDI RP_DDI; retrope 91.4 0.34 7.3E-06 39.1 4.6 91 17-129 14-107 (124)
47 cd05472 cnd41_like Chloroplast 91.2 0.35 7.6E-06 44.9 5.1 55 1-60 227-282 (299)
48 cd05476 pepsin_A_like_plant Ch 89.9 0.48 1E-05 43.2 4.7 52 2-60 196-248 (265)
49 PF11925 DUF3443: Protein of u 86.5 1.2 2.5E-05 42.6 4.9 94 30-129 36-148 (370)
50 cd06096 Plasmepsin_5 Plasmepsi 85.2 0.73 1.6E-05 43.5 3.0 50 2-60 258-308 (326)
51 cd05484 retropepsin_like_LTR_2 83.9 4.6 0.0001 30.3 6.4 63 38-109 13-78 (91)
52 cd06095 RP_RTVL_H_like Retrope 82.9 7.5 0.00016 28.9 7.2 72 38-129 11-84 (86)
53 cd05473 beta_secretase_like Be 74.8 2.7 6E-05 40.2 3.2 55 1-60 269-330 (364)
54 PLN03146 aspartyl protease fam 74.7 4.5 9.8E-05 39.9 4.7 51 1-60 361-411 (431)
55 PF09676 TraV: Type IV conjuga 63.0 7.4 0.00016 30.9 2.8 32 298-332 85-116 (119)
56 PF14541 TAXi_C: Xylanase inhi 52.2 21 0.00045 29.8 4.0 50 1-55 92-142 (161)
57 COG2967 ApaG Uncharacterized p 49.8 31 0.00067 27.6 4.2 32 284-317 29-60 (126)
58 PF00077 RVP: Retroviral aspar 41.7 57 0.0012 24.5 4.7 18 38-55 18-35 (100)
59 PF03531 SSrecog: Structure-sp 36.7 35 0.00077 30.5 3.1 31 277-315 159-189 (222)
60 TIGR03698 clan_AA_DTGF clan AA 32.8 81 0.0018 24.5 4.3 69 23-100 3-73 (107)
61 TIGR02747 TraV type IV conjuga 31.7 48 0.001 27.5 2.9 33 297-332 109-141 (144)
62 KOG0526|consensus 31.7 24 0.00053 35.4 1.3 32 277-316 224-255 (615)
63 PF08284 RVP_2: Retroviral asp 31.2 1.1E+02 0.0023 25.0 4.9 78 37-129 33-112 (135)
64 PRK05461 apaG CO2+/MG2+ efflux 30.4 59 0.0013 26.4 3.2 36 279-316 25-60 (127)
65 PF09275 Pertus-S4-tox: Pertus 28.2 36 0.00077 25.2 1.4 31 281-311 46-86 (110)
66 PF04076 BOF: Bacterial OB fol 27.1 71 0.0015 24.9 3.0 48 275-332 42-89 (103)
67 PF02747 PCNA_C: Proliferating 25.0 65 0.0014 25.8 2.6 24 281-310 105-128 (128)
68 PF04379 DUF525: Protein of un 24.6 1.3E+02 0.0029 22.7 4.0 35 280-316 9-43 (90)
69 TIGR00156 conserved hypothetic 23.2 71 0.0015 25.9 2.4 49 274-332 64-112 (126)
70 PF05164 ZapA: Cell division p 22.8 90 0.002 22.9 2.8 14 3-16 1-14 (89)
71 PRK10053 hypothetical protein; 20.4 88 0.0019 25.5 2.4 49 274-332 68-116 (130)
72 TIGR01451 B_ant_repeat conserv 20.0 1.3E+02 0.0029 20.1 2.9 25 283-307 12-37 (53)
No 1
>PTZ00165 aspartyl protease; Provisional
Probab=100.00 E-value=1.9e-33 Score=277.08 Aligned_cols=201 Identities=22% Similarity=0.382 Sum_probs=167.3
Q ss_pred CccCceeEeccCCccceecccccCcCCCCCCCcEEEcc------ccccceeeeecccCccC--CCc---EEEEEeCCceE
Q psy3214 16 TPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD------VFIGKFYTVFDMDNNQK--NGT---QFAIRYGSGAV 84 (336)
Q Consensus 16 ~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~------c~~~~~y~~~~Sst~~~--~~~---~~~i~Yg~Gs~ 84 (336)
.-.+|..++++|+|+|.. .+.+||||+++||++. |..|+.|+|++|+||++ .+. .+.|.||+|++
T Consensus 117 ~d~~Y~~~I~IGTPpQ~f----~Vv~DTGSS~lWVps~~C~~~~C~~~~~yd~s~SSTy~~~~~~~~~~~~~i~YGsGs~ 192 (482)
T PTZ00165 117 HNSQYFGEIQVGTPPKSF----VVVFDTGSSNLWIPSKECKSGGCAPHRKFDPKKSSTYTKLKLGDESAETYIQYGTGEC 192 (482)
T ss_pred cCCeEEEEEEeCCCCceE----EEEEeCCCCCEEEEchhcCcccccccCCCCccccCCcEecCCCCccceEEEEeCCCcE
Confidence 345799999999999964 7889999999999974 55688999999999998 665 68899999999
Q ss_pred EEEEeEEEEEECCeEEeceEEEEEEeeCCcccccccccccccCccccc---ccCCCCchHHHHHHhcccCCCeEEEEeec
Q psy3214 85 SGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNI---AVDGVEPPFYNIIQQKLLEKNVFGFYLNR 161 (336)
Q Consensus 85 ~G~~~~D~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~---s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~ 161 (336)
+|.+++|+|++|++++++|.||+++.+++..|...++|||||||++.+ +..+..|++++|++||+|++|+||+||++
T Consensus 193 ~G~l~~DtV~ig~l~i~~q~FG~a~~~s~~~f~~~~~DGILGLg~~~~s~~s~~~~~p~~~~l~~qgli~~~~FS~yL~~ 272 (482)
T PTZ00165 193 VLALGKDTVKIGGLKVKHQSIGLAIEESLHPFADLPFDGLVGLGFPDKDFKESKKALPIVDNIKKQNLLKRNIFSFYMSK 272 (482)
T ss_pred EEEEEEEEEEECCEEEccEEEEEEEeccccccccccccceeecCCCcccccccCCCCCHHHHHHHcCCcccceEEEEecc
Confidence 999999999999999999999999988776787788999999999987 34457899999999999999999999987
Q ss_pred CCCCCCCCceEEecCCCCCccc--ccccceeEecC-Cccc------c---------CCC---ccccchhhhhhhHHHHHH
Q psy3214 162 YNTEKLANGEVPIPSAHDPKEL--ESRIENKYQLP-EYDH------V---------GDG---CSASYTEMQQWTNNIIEA 220 (336)
Q Consensus 162 ~~~~~~~~G~l~f~GGiD~~~~--~G~l~~~~~~~-~~~~------~---------~~~---~~~s~t~l~~w~~~i~~~ 220 (336)
+. ..+|+|+| ||+|++++ .|+++|++... .+|. . ..+ ..++++++...+..++++
T Consensus 273 -~~--~~~G~l~f-GGiD~~~~~~~g~i~~~Pv~~~~yW~i~l~~i~vgg~~~~~~~~~~~aIiDTGTSli~lP~~~~~~ 348 (482)
T PTZ00165 273 -DL--NQPGSISF-GSADPKYTLEGHKIWWFPVISTDYWEIEVVDILIDGKSLGFCDRKCKAAIDTGSSLITGPSSVINP 348 (482)
T ss_pred -CC--CCCCEEEe-CCcCHHHcCCCCceEEEEccccceEEEEeCeEEECCEEeeecCCceEEEEcCCCccEeCCHHHHHH
Confidence 43 34799999 99999887 57899974332 1221 0 111 344778888888888887
Q ss_pred Hhcc
Q psy3214 221 INQG 224 (336)
Q Consensus 221 i~~g 224 (336)
+.+.
T Consensus 349 i~~~ 352 (482)
T PTZ00165 349 LLEK 352 (482)
T ss_pred HHHH
Confidence 7765
No 2
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=100.00 E-value=4.7e-33 Score=263.23 Aligned_cols=202 Identities=39% Similarity=0.660 Sum_probs=169.0
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEcc--------ccccceeeeecccCccCCCcEEEEEeCCceEEEEE
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD--------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYL 88 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~--------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~ 88 (336)
-.+|.+++.+|+|+|.. .+.+||||+++||++. |..++.|+|++|+||+..++.+.+.||+|+++|.+
T Consensus 4 ~~~Y~~~i~iGtP~q~~----~v~~DTGSs~~Wv~~~~C~~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~~~G~~ 79 (325)
T cd05490 4 DAQYYGEIGIGTPPQTF----TVVFDTGSSNLWVPSVHCSLLDIACWLHHKYNSSKSSTYVKNGTEFAIQYGSGSLSGYL 79 (325)
T ss_pred CCEEEEEEEECCCCcEE----EEEEeCCCccEEEEcCCCCCCCccccCcCcCCcccCcceeeCCcEEEEEECCcEEEEEE
Confidence 35799999999999854 7889999999999863 44578999999999999999999999999999999
Q ss_pred eEEEEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCC
Q psy3214 89 SQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLA 168 (336)
Q Consensus 89 ~~D~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~ 168 (336)
++|+|++|+.++++|.||+++..++..|....+|||||||++.++..+.+|++++|++||+|++++||+||++ +.+...
T Consensus 80 ~~D~v~~g~~~~~~~~Fg~~~~~~~~~~~~~~~dGilGLg~~~~s~~~~~~~~~~l~~~g~i~~~~FS~~L~~-~~~~~~ 158 (325)
T cd05490 80 SQDTVSIGGLQVEGQLFGEAVKQPGITFIAAKFDGILGMAYPRISVDGVTPVFDNIMAQKLVEQNVFSFYLNR-DPDAQP 158 (325)
T ss_pred eeeEEEECCEEEcCEEEEEEeeccCCcccceeeeEEEecCCccccccCCCCHHHHHHhcCCCCCCEEEEEEeC-CCCCCC
Confidence 9999999999999999999998876556667789999999998887778899999999999999999999998 544446
Q ss_pred CceEEecCCCCCcccccccceeEecC-Cccc-------cC-------C---CccccchhhhhhhHHHHHHHhcc
Q psy3214 169 NGEVPIPSAHDPKELESRIENKYQLP-EYDH-------VG-------D---GCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 169 ~G~l~f~GGiD~~~~~G~l~~~~~~~-~~~~-------~~-------~---~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
+|+|+| ||+|+++|.|+|.|++... .+|. +. . ...++++++.+.+...++++.+.
T Consensus 159 ~G~l~~-Gg~d~~~~~g~l~~~~~~~~~~w~v~l~~i~vg~~~~~~~~~~~aiiDSGTt~~~~p~~~~~~l~~~ 231 (325)
T cd05490 159 GGELML-GGTDPKYYTGDLHYVNVTRKAYWQIHMDQVDVGSGLTLCKGGCEAIVDTGTSLITGPVEEVRALQKA 231 (325)
T ss_pred CCEEEE-CccCHHHcCCceEEEEcCcceEEEEEeeEEEECCeeeecCCCCEEEECCCCccccCCHHHHHHHHHH
Confidence 899999 9999999999999864322 1111 00 1 13346788888888888777664
No 3
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=100.00 E-value=1.1e-32 Score=259.65 Aligned_cols=198 Identities=26% Similarity=0.487 Sum_probs=164.7
Q ss_pred ceeEeccCCccceecccccCcCCCCCCCcEEEcc------ccccceeeeecccCccCCCcEEEEEeCCceEEEEEeEEEE
Q psy3214 20 YVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLSQDTL 93 (336)
Q Consensus 20 y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~D~v 93 (336)
|..++.+|+|+|.+ .+.+||||+++||++. |..++.|+|++|+||+..++.+++.||+|+++|.+++|+|
T Consensus 1 Y~~~i~iGtP~Q~~----~v~~DTGSs~~Wv~s~~C~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~g~~~G~~~~D~v 76 (316)
T cd05486 1 YFGQISIGTPPQNF----TVIFDTGSSNLWVPSIYCTSQACTKHNRFQPSESSTYVSNGEAFSIQYGTGSLTGIIGIDQV 76 (316)
T ss_pred CeEEEEECCCCcEE----EEEEcCCCccEEEecCCCCCcccCccceECCCCCcccccCCcEEEEEeCCcEEEEEeeecEE
Confidence 78899999999864 7889999999999964 4467899999999999999999999999999999999999
Q ss_pred EECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCceEE
Q psy3214 94 RIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVP 173 (336)
Q Consensus 94 ~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G~l~ 173 (336)
+|+++++++|.||++...++..|....+|||||||++.++..+..|++++|++||+|++++||+||++ ++....+|+|+
T Consensus 77 ~ig~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~~~~s~~~~~p~~~~l~~qg~i~~~~FS~~L~~-~~~~~~~g~l~ 155 (316)
T cd05486 77 TVEGITVQNQQFAESVSEPGSTFQDSEFDGILGLAYPSLAVDGVTPVFDNMMAQNLVELPMFSVYMSR-NPNSADGGELV 155 (316)
T ss_pred EECCEEEcCEEEEEeeccCcccccccccceEeccCchhhccCCCCCHHHHHHhcCCCCCCEEEEEEcc-CCCCCCCcEEE
Confidence 99999999999999887766556667889999999998887778899999999999999999999998 54445689999
Q ss_pred ecCCCCCcccccccceeEecCC-ccc------c--------CC---CccccchhhhhhhHHHHHHHhc
Q psy3214 174 IPSAHDPKELESRIENKYQLPE-YDH------V--------GD---GCSASYTEMQQWTNNIIEAINQ 223 (336)
Q Consensus 174 f~GGiD~~~~~G~l~~~~~~~~-~~~------~--------~~---~~~~s~t~l~~w~~~i~~~i~~ 223 (336)
| ||+|+++|.|+|.|++.... +|. . .. ...++++++.+.+...++++.+
T Consensus 156 f-Gg~d~~~~~g~l~~~pi~~~~~w~v~l~~i~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l~~ 222 (316)
T cd05486 156 F-GGFDTSRFSGQLNWVPVTVQGYWQIQLDNIQVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQLQN 222 (316)
T ss_pred E-cccCHHHcccceEEEECCCceEEEEEeeEEEEecceEecCCCCEEEECCCcchhhcCHHHHHHHHH
Confidence 9 99999999999999753321 111 0 01 1333667777777776666644
No 4
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=100.00 E-value=3.2e-32 Score=256.78 Aligned_cols=171 Identities=36% Similarity=0.659 Sum_probs=152.8
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEcc-------ccccceeeeecccCccCCCcEEEEEeCCceEEEEEe
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD-------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLS 89 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~-------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~ 89 (336)
-..|.+++.+|+|+|- +.+.+||||+++||++. |..++.|+|++|+||+..+..+.+.||+|+++|.++
T Consensus 8 ~~~Y~~~i~iGtP~Q~----~~v~~DTGSs~lWv~~~~C~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~~~G~~~ 83 (317)
T cd06098 8 DAQYFGEIGIGTPPQK----FTVIFDTGSSNLWVPSSKCYFSIACYFHSKYKSSKSSTYKKNGTSASIQYGTGSISGFFS 83 (317)
T ss_pred CCEEEEEEEECCCCeE----EEEEECCCccceEEecCCCCCCccccccCcCCcccCCCcccCCCEEEEEcCCceEEEEEE
Confidence 4578999999999985 47889999999999973 445789999999999999999999999999999999
Q ss_pred EEEEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCC
Q psy3214 90 QDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLAN 169 (336)
Q Consensus 90 ~D~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~ 169 (336)
+|+|++|+.++++|.||+++...+..|....+|||||||++..+..+..|++++|++||+|++++||+||++ +.....+
T Consensus 84 ~D~v~ig~~~v~~~~f~~~~~~~~~~~~~~~~dGilGLg~~~~s~~~~~~~~~~l~~qg~i~~~~FS~~L~~-~~~~~~~ 162 (317)
T cd06098 84 QDSVTVGDLVVKNQVFIEATKEPGLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVEQGLVKEPVFSFWLNR-NPDEEEG 162 (317)
T ss_pred eeEEEECCEEECCEEEEEEEecCCccccccccceeccccccchhhcCCCCHHHHHHhcCCCCCCEEEEEEec-CCCCCCC
Confidence 999999999999999999998776567777899999999998877777899999999999999999999998 5444568
Q ss_pred ceEEecCCCCCcccccccceeEec
Q psy3214 170 GEVPIPSAHDPKELESRIENKYQL 193 (336)
Q Consensus 170 G~l~f~GGiD~~~~~G~l~~~~~~ 193 (336)
|+|+| ||+|+++|.|++.|++..
T Consensus 163 G~l~f-Gg~d~~~~~g~l~~~pv~ 185 (317)
T cd06098 163 GELVF-GGVDPKHFKGEHTYVPVT 185 (317)
T ss_pred cEEEE-CccChhhcccceEEEecC
Confidence 99999 999999999999997543
No 5
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=99.97 E-value=2.2e-31 Score=251.96 Aligned_cols=201 Identities=28% Similarity=0.538 Sum_probs=164.8
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEcc--------ccccceeeeecccCccCCCcEEEEEeCCceEEEEE
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD--------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYL 88 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~--------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~ 88 (336)
-..|.+++.+|+|+|-+ .+.+||||+++||++. |..++.|+|++|+||+..++.+++.||+|+++|.+
T Consensus 6 ~~~y~~~i~iGtP~q~~----~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~SsT~~~~~~~~~~~Yg~g~~~G~~ 81 (326)
T cd05487 6 DTQYYGEIGIGTPPQTF----KVVFDTGSSNLWVPSSKCSPLYTACVTHNLYDASDSSTYKENGTEFTIHYASGTVKGFL 81 (326)
T ss_pred CCeEEEEEEECCCCcEE----EEEEeCCccceEEccCCCcCcchhhcccCcCCCCCCeeeeECCEEEEEEeCCceEEEEE
Confidence 45789999999999864 7899999999999852 45678999999999999999999999999999999
Q ss_pred eEEEEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCC
Q psy3214 89 SQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLA 168 (336)
Q Consensus 89 ~~D~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~ 168 (336)
++|+|++|+..++ |.||++...+...|....+|||||||++..+..+..|++++|++||+|++++||+||++ +.+...
T Consensus 82 ~~D~v~~g~~~~~-~~fg~~~~~~~~~~~~~~~dGilGLg~~~~s~~~~~~~~~~L~~qg~i~~~~FS~~L~~-~~~~~~ 159 (326)
T cd05487 82 SQDIVTVGGIPVT-QMFGEVTALPAIPFMLAKFDGVLGMGYPKQAIGGVTPVFDNIMSQGVLKEDVFSVYYSR-DSSHSL 159 (326)
T ss_pred eeeEEEECCEEee-EEEEEEEeccCCccceeecceEEecCChhhcccCCCCHHHHHHhcCCCCCCEEEEEEeC-CCCCCC
Confidence 9999999999985 88999987754455556789999999988877778899999999999999999999998 544456
Q ss_pred CceEEecCCCCCcccccccceeEe-------cCCcccc--------CC---CccccchhhhhhhHHHHHHHhcc
Q psy3214 169 NGEVPIPSAHDPKELESRIENKYQ-------LPEYDHV--------GD---GCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 169 ~G~l~f~GGiD~~~~~G~l~~~~~-------~~~~~~~--------~~---~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
+|+|+| ||+|+++|+|+++|++. ++..... .. ...++++++..++...++++.+.
T Consensus 160 ~G~l~f-Gg~d~~~y~g~l~~~~~~~~~~w~v~l~~i~vg~~~~~~~~~~~aiiDSGts~~~lP~~~~~~l~~~ 232 (326)
T cd05487 160 GGEIVL-GGSDPQHYQGDFHYINTSKTGFWQIQMKGVSVGSSTLLCEDGCTAVVDTGASFISGPTSSISKLMEA 232 (326)
T ss_pred CcEEEE-CCcChhhccCceEEEECCcCceEEEEecEEEECCEEEecCCCCEEEECCCccchhCcHHHHHHHHHH
Confidence 899999 99999999999998632 2211110 11 13346677777777777666554
No 6
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=99.97 E-value=2.9e-31 Score=250.15 Aligned_cols=200 Identities=26% Similarity=0.419 Sum_probs=166.4
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEcc------ccccceeeeecccCccCCCcEEEEEeCCceEEEEEeE
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLSQ 90 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~ 90 (336)
...|..++.+|+|+|-. .+.+||||+++||++. |..++.|+|++|+||+..+..+.+.||+|+++|.++.
T Consensus 8 ~~~Y~~~i~vGtp~q~~----~v~~DTGS~~~wv~~~~C~~~~c~~~~~f~~~~Sst~~~~~~~~~~~yg~gs~~G~~~~ 83 (317)
T cd05478 8 DMEYYGTISIGTPPQDF----TVIFDTGSSNLWVPSVYCSSQACSNHNRFNPRQSSTYQSTGQPLSIQYGTGSMTGILGY 83 (317)
T ss_pred CCEEEEEEEeCCCCcEE----EEEEeCCCccEEEecCCCCcccccccCcCCCCCCcceeeCCcEEEEEECCceEEEEEee
Confidence 45789999999998854 7889999999999964 3357899999999999999999999999999999999
Q ss_pred EEEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCc
Q psy3214 91 DTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANG 170 (336)
Q Consensus 91 D~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G 170 (336)
|+|++|+++++++.||++...++..+...+.|||||||++.++..+..|++++|++||+|++++||+||++ + ...+|
T Consensus 84 D~v~ig~~~i~~~~fg~~~~~~~~~~~~~~~dGilGLg~~~~s~~~~~~~~~~L~~~g~i~~~~FS~~L~~-~--~~~~g 160 (317)
T cd05478 84 DTVQVGGISDTNQIFGLSETEPGSFFYYAPFDGILGLAYPSIASSGATPVFDNMMSQGLVSQDLFSVYLSS-N--GQQGS 160 (317)
T ss_pred eEEEECCEEECCEEEEEEEecCccccccccccceeeeccchhcccCCCCHHHHHHhCCCCCCCEEEEEeCC-C--CCCCe
Confidence 99999999999999999987765433345689999999998877777899999999999999999999998 4 23579
Q ss_pred eEEecCCCCCcccccccceeEecCC-ccc------c--------C---CCccccchhhhhhhHHHHHHHhcc
Q psy3214 171 EVPIPSAHDPKELESRIENKYQLPE-YDH------V--------G---DGCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 171 ~l~f~GGiD~~~~~G~l~~~~~~~~-~~~------~--------~---~~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
+|+| ||+|+++|.|+++|++.... +|. . . ....++++++.+.+...++++...
T Consensus 161 ~l~~-Gg~d~~~~~g~l~~~p~~~~~~w~v~l~~v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~ 231 (317)
T cd05478 161 VVTF-GGIDPSYYTGSLNWVPVTAETYWQITVDSVTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSD 231 (317)
T ss_pred EEEE-cccCHHHccCceEEEECCCCcEEEEEeeEEEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHH
Confidence 9999 99999999999999753321 111 0 0 113346788888888888777665
No 7
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=99.97 E-value=3.6e-31 Score=249.65 Aligned_cols=199 Identities=25% Similarity=0.418 Sum_probs=165.6
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEcc------ccccceeeeecccCccCCCcEEEEEeCCceEEEEEeEEE
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLSQDT 92 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~D~ 92 (336)
.|+.++.+|+|+|-+ .+.+||||+++||++. |..++.|+|++|+||+..++.+++.||+|+++|.++.|+
T Consensus 3 ~y~~~i~iGtP~q~~----~v~~DTGS~~~wv~~~~C~~~~C~~~~~f~~~~SsT~~~~~~~~~~~Yg~Gs~~G~~~~D~ 78 (318)
T cd05477 3 SYYGEISIGTPPQNF----LVLFDTGSSNLWVPSVLCQSQACTNHTKFNPSQSSTYSTNGETFSLQYGSGSLTGIFGYDT 78 (318)
T ss_pred EEEEEEEECCCCcEE----EEEEeCCCccEEEccCCCCCccccccCCCCcccCCCceECCcEEEEEECCcEEEEEEEeeE
Confidence 689999999999854 7889999999999974 445789999999999999999999999999999999999
Q ss_pred EEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCceE
Q psy3214 93 LRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEV 172 (336)
Q Consensus 93 v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G~l 172 (336)
|++|+.++++|.||++...++..+...+.+||||||++..+..+.++++++|++||.|++++||+||++ +. ...+|+|
T Consensus 79 i~~g~~~i~~~~Fg~~~~~~~~~~~~~~~~GilGLg~~~~s~~~~~~~~~~L~~~g~i~~~~FS~~L~~-~~-~~~~g~l 156 (318)
T cd05477 79 VTVQGIIITNQEFGLSETEPGTNFVYAQFDGILGLAYPSISAGGATTVMQGMMQQNLLQAPIFSFYLSG-QQ-GQQGGEL 156 (318)
T ss_pred EEECCEEEcCEEEEEEEecccccccccceeeEeecCcccccccCCCCHHHHHHhcCCcCCCEEEEEEcC-CC-CCCCCEE
Confidence 999999999999999998765555556789999999998887778899999999999999999999998 42 2357999
Q ss_pred EecCCCCCcccccccceeEecC-Cccc------c---------C---CCccccchhhhhhhHHHHHHHhcc
Q psy3214 173 PIPSAHDPKELESRIENKYQLP-EYDH------V---------G---DGCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 173 ~f~GGiD~~~~~G~l~~~~~~~-~~~~------~---------~---~~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
+| ||+|+++|.|++.|++... .+|. . . ....++++.+...+..+++++.+.
T Consensus 157 ~f-Gg~d~~~~~g~l~~~pv~~~~~w~v~l~~i~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~ 226 (318)
T cd05477 157 VF-GGVDNNLYTGQIYWTPVTSETYWQIGIQGFQINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTLMQS 226 (318)
T ss_pred EE-cccCHHHcCCceEEEecCCceEEEEEeeEEEECCEEecccCCCceeeECCCCccEECCHHHHHHHHHH
Confidence 99 9999999999999874321 1110 0 0 013346677777787777777664
No 8
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=99.97 E-value=4e-31 Score=250.55 Aligned_cols=203 Identities=38% Similarity=0.665 Sum_probs=169.0
Q ss_pred CccCceeEeccCCccceecccccCcCCCCCCCcEEEcc--------ccccceeeeecccCccCCCcEEEEEeCCceEEEE
Q psy3214 16 TPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD--------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGY 87 (336)
Q Consensus 16 ~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~--------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~ 87 (336)
....|.+++.+|+|+|.. .+.+||||+++||++. |..++.|+|++|+|++..++.|.+.|++|+++|.
T Consensus 8 ~~~~Y~~~i~vGtP~q~~----~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~Sst~~~~~~~~~i~Y~~g~~~G~ 83 (329)
T cd05485 8 MDAQYYGVITIGTPPQSF----KVVFDTGSSNLWVPSKKCSWTNIACLLHNKYDSTKSSTYKKNGTEFAIQYGSGSLSGF 83 (329)
T ss_pred cCCeEEEEEEECCCCcEE----EEEEcCCCccEEEecCCCCCCCccccCCCeECCcCCCCeEECCeEEEEEECCceEEEE
Confidence 346799999999998854 7889999999999974 3346789999999999999999999999999999
Q ss_pred EeEEEEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCC
Q psy3214 88 LSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKL 167 (336)
Q Consensus 88 ~~~D~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~ 167 (336)
+++|+|++|+.++++|.||++..+++..|...+.+||||||++..+.....|++.+|++||+|++++||+||.+ +....
T Consensus 84 ~~~D~v~ig~~~~~~~~fg~~~~~~~~~~~~~~~~GilGLg~~~~s~~~~~p~~~~l~~qg~i~~~~FS~~l~~-~~~~~ 162 (329)
T cd05485 84 LSTDTVSVGGVSVKGQTFAEAINEPGLTFVAAKFDGILGMGYSSISVDGVVPVFYNMVNQKLVDAPVFSFYLNR-DPSAK 162 (329)
T ss_pred EecCcEEECCEEECCEEEEEEEecCCccccccccceEEEcCCccccccCCCCHHHHHHhCCCCCCCEEEEEecC-CCCCC
Confidence 99999999999999999999988766556667789999999998876677889999999999999999999998 54444
Q ss_pred CCceEEecCCCCCcccccccceeEecCC-ccc-------cC------C---CccccchhhhhhhHHHHHHHhcc
Q psy3214 168 ANGEVPIPSAHDPKELESRIENKYQLPE-YDH-------VG------D---GCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 168 ~~G~l~f~GGiD~~~~~G~l~~~~~~~~-~~~-------~~------~---~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
.+|+|+| ||+|+++|.|++.|++.... +|. .. . ...++++++..++...++++...
T Consensus 163 ~~G~l~f-Gg~d~~~~~g~l~~~p~~~~~~~~v~~~~i~v~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~ 235 (329)
T cd05485 163 EGGELIL-GGSDPKHYTGNFTYLPVTRKGYWQFKMDSVSVGEGEFCSGGCQAIADTGTSLIAGPVDEIEKLNNA 235 (329)
T ss_pred CCcEEEE-cccCHHHcccceEEEEcCCceEEEEEeeEEEECCeeecCCCcEEEEccCCcceeCCHHHHHHHHHH
Confidence 6899999 99999999999998754321 111 00 0 13446788888888887777654
No 9
>PTZ00147 plasmepsin-1; Provisional
Probab=99.97 E-value=8.5e-31 Score=256.40 Aligned_cols=201 Identities=21% Similarity=0.375 Sum_probs=164.3
Q ss_pred eCccCceeEeccCCccceecccccCcCCCCCCCcEEEcc------ccccceeeeecccCccCCCcEEEEEeCCceEEEEE
Q psy3214 15 LTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYL 88 (336)
Q Consensus 15 i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~ 88 (336)
+.-..|..++.+|+|+|- +.+.+||||+++||++. |..++.|+|++|+||+..++.+++.||+|+++|.+
T Consensus 135 ~~n~~Y~~~I~IGTP~Q~----f~Vi~DTGSsdlWVps~~C~~~~C~~~~~yd~s~SsT~~~~~~~f~i~Yg~GsvsG~~ 210 (453)
T PTZ00147 135 LANVMSYGEAKLGDNGQK----FNFIFDTGSANLWVPSIKCTTEGCETKNLYDSSKSKTYEKDGTKVEMNYVSGTVSGFF 210 (453)
T ss_pred cCCCEEEEEEEECCCCeE----EEEEEeCCCCcEEEeecCCCcccccCCCccCCccCcceEECCCEEEEEeCCCCEEEEE
Confidence 445678999999999986 47889999999999974 34578999999999999999999999999999999
Q ss_pred eEEEEEECCeEEeceEEEEEEeeCCc--ccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCC
Q psy3214 89 SQDTLRIGDLTIKNQVFGEAIKEPGF--TFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEK 166 (336)
Q Consensus 89 ~~D~v~ig~~~v~~~~fg~~~~~~~~--~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~ 166 (336)
++|+|+||+.+++ +.|+++....+. .+....+|||||||++.++.....|++++|++|++|++++||+||++ + .
T Consensus 211 ~~DtVtiG~~~v~-~qF~~~~~~~~f~~~~~~~~~DGILGLG~~~~S~~~~~p~~~~L~~qg~I~~~vFS~~L~~-~--~ 286 (453)
T PTZ00147 211 SKDLVTIGNLSVP-YKFIEVTDTNGFEPFYTESDFDGIFGLGWKDLSIGSVDPYVVELKNQNKIEQAVFTFYLPP-E--D 286 (453)
T ss_pred EEEEEEECCEEEE-EEEEEEEeccCcccccccccccceecccCCccccccCCCHHHHHHHcCCCCccEEEEEecC-C--C
Confidence 9999999999998 568877765431 23455789999999998887778899999999999999999999987 3 2
Q ss_pred CCCceEEecCCCCCcccccccceeEecCC-ccc------c-------CCCccccchhhhhhhHHHHHHHhcc
Q psy3214 167 LANGEVPIPSAHDPKELESRIENKYQLPE-YDH------V-------GDGCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 167 ~~~G~l~f~GGiD~~~~~G~l~~~~~~~~-~~~------~-------~~~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
..+|+|+| ||+|+++|+|+|.|++.... +|. . .....++++++...+...++++.+.
T Consensus 287 ~~~G~L~f-GGiD~~ky~G~l~y~pl~~~~~W~V~l~~~vg~~~~~~~~aIiDSGTsli~lP~~~~~ai~~~ 357 (453)
T PTZ00147 287 KHKGYLTI-GGIEERFYEGPLTYEKLNHDLYWQVDLDVHFGNVSSEKANVIVDSGTSVITVPTEFLNKFVES 357 (453)
T ss_pred CCCeEEEE-CCcChhhcCCceEEEEcCCCceEEEEEEEEECCEecCceeEEECCCCchhcCCHHHHHHHHHH
Confidence 35799999 99999999999999743321 121 0 1123447788888888887777664
No 10
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=99.97 E-value=2.4e-30 Score=244.26 Aligned_cols=200 Identities=34% Similarity=0.612 Sum_probs=166.1
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEcc------ccccceeeeecccCccCCCcEEEEEeCCceEEEEEeE
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLSQ 90 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~ 90 (336)
...|.+++.+|+|.|- +.+.+||||+++||++. |..++.|+|++|+||+.+++.+.+.|++|+++|.+++
T Consensus 8 ~~~Y~~~i~iGtp~q~----~~v~~DTGSs~~wv~~~~C~~~~C~~~~~y~~~~Sst~~~~~~~~~~~y~~g~~~G~~~~ 83 (320)
T cd05488 8 NAQYFTDITLGTPPQK----FKVILDTGSSNLWVPSVKCGSIACFLHSKYDSSASSTYKANGTEFKIQYGSGSLEGFVSQ 83 (320)
T ss_pred CCEEEEEEEECCCCcE----EEEEEecCCcceEEEcCCCCCcccCCcceECCCCCcceeeCCCEEEEEECCceEEEEEEE
Confidence 4569999999999885 47889999999999974 4457899999999999999999999999999999999
Q ss_pred EEEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCc
Q psy3214 91 DTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANG 170 (336)
Q Consensus 91 D~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G 170 (336)
|+|+++++++++|.||++...++..|.....|||||||++..+..+..|.+.+|++||+|++++||+||++ + ...+|
T Consensus 84 D~v~ig~~~~~~~~f~~a~~~~g~~~~~~~~dGilGLg~~~~s~~~~~~~~~~l~~qg~i~~~~FS~~L~~-~--~~~~G 160 (320)
T cd05488 84 DTLSIGDLTIKKQDFAEATSEPGLAFAFGKFDGILGLAYDTISVNKIVPPFYNMINQGLLDEPVFSFYLGS-S--EEDGG 160 (320)
T ss_pred eEEEECCEEECCEEEEEEecCCCcceeeeeeceEEecCCccccccCCCCHHHHHHhcCCCCCCEEEEEecC-C--CCCCc
Confidence 99999999999999999987766555556789999999998876666788899999999999999999998 3 24579
Q ss_pred eEEecCCCCCcccccccceeEecC-Cccc-------cC------C---CccccchhhhhhhHHHHHHHhcc
Q psy3214 171 EVPIPSAHDPKELESRIENKYQLP-EYDH-------VG------D---GCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 171 ~l~f~GGiD~~~~~G~l~~~~~~~-~~~~-------~~------~---~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
+|+| ||+|+++|.|++.|++... .+|. .. . ...++++.+..++..+++++...
T Consensus 161 ~l~f-Gg~d~~~~~g~l~~~p~~~~~~w~v~l~~i~vg~~~~~~~~~~~ivDSGtt~~~lp~~~~~~l~~~ 230 (320)
T cd05488 161 EATF-GGIDESRFTGKITWLPVRRKAYWEVELEKIGLGDEELELENTGAAIDTGTSLIALPSDLAEMLNAE 230 (320)
T ss_pred EEEE-CCcCHHHcCCceEEEeCCcCcEEEEEeCeEEECCEEeccCCCeEEEcCCcccccCCHHHHHHHHHH
Confidence 9999 9999999999999975432 1221 00 0 12336777888888887776653
No 11
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=99.97 E-value=4.4e-30 Score=250.98 Aligned_cols=199 Identities=20% Similarity=0.377 Sum_probs=159.8
Q ss_pred CccCceeEeccCCccceecccccCcCCCCCCCcEEEcc------ccccceeeeecccCccCCCcEEEEEeCCceEEEEEe
Q psy3214 16 TPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLS 89 (336)
Q Consensus 16 ~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~ 89 (336)
.-..|..++.+|+|+|-. .+.+||||+++||++. |..++.|+|++|+||+..++.+++.||+|+++|.++
T Consensus 135 ~n~~Yy~~i~IGTP~Q~f----~vi~DTGSsdlWV~s~~C~~~~C~~~~~yd~s~SsT~~~~~~~~~i~YG~Gsv~G~~~ 210 (450)
T PTZ00013 135 ANIMFYGEGEVGDNHQKF----MLIFDTGSANLWVPSKKCDSIGCSIKNLYDSSKSKSYEKDGTKVDITYGSGTVKGFFS 210 (450)
T ss_pred CCCEEEEEEEECCCCeEE----EEEEeCCCCceEEecccCCccccccCCCccCccCcccccCCcEEEEEECCceEEEEEE
Confidence 345688899999999854 7889999999999964 445778999999999999999999999999999999
Q ss_pred EEEEEECCeEEeceEEEEEEeeCC--cccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCC
Q psy3214 90 QDTLRIGDLTIKNQVFGEAIKEPG--FTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKL 167 (336)
Q Consensus 90 ~D~v~ig~~~v~~~~fg~~~~~~~--~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~ 167 (336)
+|+|++|+++++ +.|+++..... ..+....+|||||||++.++..+..|++++|++||+|++++||+||++ + ..
T Consensus 211 ~Dtv~iG~~~~~-~~f~~~~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~p~~~~L~~qg~I~~~vFS~~L~~-~--~~ 286 (450)
T PTZ00013 211 KDLVTLGHLSMP-YKFIEVTDTDDLEPIYSSSEFDGILGLGWKDLSIGSIDPIVVELKNQNKIDNALFTFYLPV-H--DV 286 (450)
T ss_pred EEEEEECCEEEc-cEEEEEEeccccccceecccccceecccCCccccccCCCHHHHHHhccCcCCcEEEEEecC-C--CC
Confidence 999999999998 57777765532 234456789999999998877778899999999999999999999987 3 23
Q ss_pred CCceEEecCCCCCcccccccceeEecCC-ccc------c-------CCCccccchhhhhhhHHHHHHHhc
Q psy3214 168 ANGEVPIPSAHDPKELESRIENKYQLPE-YDH------V-------GDGCSASYTEMQQWTNNIIEAINQ 223 (336)
Q Consensus 168 ~~G~l~f~GGiD~~~~~G~l~~~~~~~~-~~~------~-------~~~~~~s~t~l~~w~~~i~~~i~~ 223 (336)
.+|+|+| ||+|+++|+|+|.|++.... +|. . ..+..++++++...+...++++.+
T Consensus 287 ~~G~L~f-GGiD~~~y~G~L~y~pv~~~~yW~I~l~v~~G~~~~~~~~aIlDSGTSli~lP~~~~~~i~~ 355 (450)
T PTZ00013 287 HAGYLTI-GGIEEKFYEGNITYEKLNHDLYWQIDLDVHFGKQTMQKANVIVDSGTTTITAPSEFLNKFFA 355 (450)
T ss_pred CCCEEEE-CCcCccccccceEEEEcCcCceEEEEEEEEECceeccccceEECCCCccccCCHHHHHHHHH
Confidence 5899999 99999999999999743221 111 0 012344667777777766666654
No 12
>PF03723 Hemocyanin_C: Hemocyanin, ig-like domain; InterPro: IPR005203 Haemocyanins are copper-containing oxygen transport proteins found in the haemolymph of many invertebrates. They are divided into 2 main groups, arthropodan and molluscan. These have structurally similar oxygen-binding centres, which are similar to the oxygen-binding centre of tyrosinases [], but their quaternary structures are arranged differently. The arthropodan proteins exist as hexamers comprising 3 heterogeneous subunits (a, b and c) and possess 1 oxygen-binding centre per subunit; and the molluscan proteins exist as cylindrical oligomers of 10 to 20 subunits and possess 7 or 8 oxygen-binding centres per subunit []. Although the proteins have similar amino acid compositions, the only real similarity in their primary sequences is in the region corresponding to the second copper-binding domain, which also shows similarity to the copper-binding domain of tyrosinases []. Larval storage proteins (LSP) [] are proteins from the hemolymph of insects, which may serve as a store of amino acids for synthesis of adult proteins. There are two classes of LSP's, arylphorins, which are rich in aromatic amino acids, and methionine-rich LSP's. LSP's forms hexameric complexes. LSP's are structurally related to arthropods hemocyanins.; PDB: 3HHS_A 1HCY_A 1HC1_C 1LL1_A 1NOL_A 1LLA_A 1OXY_A 3IXV_E 3IXW_L 3GWJ_F ....
Probab=99.97 E-value=6.5e-31 Score=237.44 Aligned_cols=101 Identities=34% Similarity=0.647 Sum_probs=80.3
Q ss_pred eeecCCCeEEeEEEEeccCCCCCcceeeeeeeecccccccccc-ccccccc---ccccCCCCcEEEEEEecCCCceeeEE
Q psy3214 227 VTLEYPGVELKSLRILNKQGPTDQLETFWQLIRMDEVYVRDKE-LTEEYNI---PYHIQHEDFKYEITIENNSNETKTGI 302 (336)
Q Consensus 227 ~~L~~pgv~i~~v~v~~~~gp~~~l~T~~~~~~~dl~~~~~~~-~~~~~~~---~~~L~h~pF~y~i~v~n~s~~~~~~~ 302 (336)
.+|.+|||+|++|+| ++|.|||+.+++|++|+.+.. ...++.+ ++|||||||+|+|+|+ |+++++++
T Consensus 6 ~eL~fpgV~I~~V~v-------~~L~Tyfe~~~~dlsn~~~~~~~~~~~~v~ar~~rLnHkpF~y~i~V~--s~~~~~~~ 76 (250)
T PF03723_consen 6 EELSFPGVKIENVEV-------DKLVTYFEDFDFDLSNALDFTEFEDDVSVKARQPRLNHKPFTYKINVN--SDKAQDAV 76 (250)
T ss_dssp HHH--TTEEEEEEEE-------ESEEEEEEEEEEEGGGCSTTTTTCTSS-EEEEEEEEEE--EEEEEEEE--ESSSSEEE
T ss_pred HHCcCCCEEEEEEEE-------CeEEEEEEeEEEEhhhccccccccCceeEEEEEeecCCCcEEEEEEEE--cCCCCeEE
Confidence 789999999999999 889999999999999865443 1133333 8999999999999995 45678889
Q ss_pred EEEEEceecCCCCCcccccccccceEEeeeeeeC
Q psy3214 303 MRIFLGAKNDIEGNELPLSAQRSLFIELDKFRVH 336 (336)
Q Consensus 303 vrifl~Pk~d~~g~~~~~~~~r~~~ielD~f~~~ 336 (336)
||||||||||++|++|+++++|++|||||||+++
T Consensus 77 VRiFL~Pk~D~~G~~~~l~e~r~~~iElDkF~~~ 110 (250)
T PF03723_consen 77 VRIFLGPKYDENGREISLNEQRWNFIELDKFVVK 110 (250)
T ss_dssp EEEEEEESB-TTSCB--HHHHCCC-EEEEEEEEE
T ss_pred EEEEeCcccccCCCCccHHHhcceeEEeEEEEEE
Confidence 9999999999999999999999999999999974
No 13
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=99.96 E-value=2.8e-29 Score=232.40 Aligned_cols=192 Identities=21% Similarity=0.319 Sum_probs=154.8
Q ss_pred ceeEeccCCccceecccccCcCCCCCCCcEEEcccc------ccceeeeecccCccC-CCcEEEEEeCCce-EEEEEeEE
Q psy3214 20 YVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVF------IGKFYTVFDMDNNQK-NGTQFAIRYGSGA-VSGYLSQD 91 (336)
Q Consensus 20 y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~------~~~~y~~~~Sst~~~-~~~~~~i~Yg~Gs-~~G~~~~D 91 (336)
|.+++.+|+|+|.. .+.+||||+++||++..| .++.|++++|+|++. .++++.+.|++|+ ++|.+++|
T Consensus 1 Y~~~i~vGtP~Q~~----~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~~~~~~~i~Y~~G~~~~G~~~~D 76 (278)
T cd06097 1 YLTPVKIGTPPQTL----NLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLLPGATWSISYGDGSSASGIVYTD 76 (278)
T ss_pred CeeeEEECCCCcEE----EEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceecCCcEEEEEeCCCCeEEEEEEEE
Confidence 78899999999864 688999999999997522 356799999999987 5789999999997 99999999
Q ss_pred EEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccC---CCCchHHHHHHhcccCCCeEEEEeecCCCCCCC
Q psy3214 92 TLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVD---GVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLA 168 (336)
Q Consensus 92 ~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~---~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~ 168 (336)
+|+||+.++++|.||+++..++..+....+|||||||++..+.. ...+++++|++|+. +++||+||++ + .
T Consensus 77 ~v~ig~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~~~~~~~~~~~~~~~~~~l~~~~~--~~~Fs~~l~~-~----~ 149 (278)
T cd06097 77 TVSIGGVEVPNQAIELATAVSASFFSDTASDGLLGLAFSSINTVQPPKQKTFFENALSSLD--APLFTADLRK-A----A 149 (278)
T ss_pred EEEECCEEECCeEEEEEeecCccccccccccceeeeccccccccccCCCCCHHHHHHHhcc--CceEEEEecC-C----C
Confidence 99999999999999999988754566678999999999877542 45689999999965 7999999987 2 3
Q ss_pred CceEEecCCCCCcccccccceeEecC--Ccccc-------C-------C---CccccchhhhhhhHHHHHHHhc
Q psy3214 169 NGEVPIPSAHDPKELESRIENKYQLP--EYDHV-------G-------D---GCSASYTEMQQWTNNIIEAINQ 223 (336)
Q Consensus 169 ~G~l~f~GGiD~~~~~G~l~~~~~~~--~~~~~-------~-------~---~~~~s~t~l~~w~~~i~~~i~~ 223 (336)
.|+|+| ||+|+++|.|+|.|++... .+|.+ . . ...++++++..++..+++++..
T Consensus 150 ~G~l~f-Gg~D~~~~~g~l~~~pi~~~~~~w~v~l~~i~v~~~~~~~~~~~~~iiDSGTs~~~lP~~~~~~l~~ 222 (278)
T cd06097 150 PGFYTF-GYIDESKYKGEISWTPVDNSSGFWQFTSTSYTVGGDAPWSRSGFSAIADTGTTLILLPDAIVEAYYS 222 (278)
T ss_pred CcEEEE-eccChHHcCCceEEEEccCCCcEEEEEEeeEEECCcceeecCCceEEeecCCchhcCCHHHHHHHHH
Confidence 799999 9999999999999986553 22210 0 0 1233567777777766666554
No 14
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=99.95 E-value=3.5e-28 Score=228.17 Aligned_cols=198 Identities=28% Similarity=0.449 Sum_probs=160.7
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEcc-------ccccceeeeecccCccCCCcEEEEEeCCceEEEEEeEE
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD-------VFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLSQD 91 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~-------c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~D 91 (336)
+|.+++.+|+|+|.. .+.+||||+++||.+. |..+..|++.+|+|++..+..+.+.|++|+++|.++.|
T Consensus 1 ~Y~~~v~iGtp~q~~----~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~~~~~~~~y~~g~~~G~~~~D 76 (317)
T PF00026_consen 1 QYYINVTIGTPPQTF----RVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQGKPFSISYGDGSVSGNLVSD 76 (317)
T ss_dssp EEEEEEEETTTTEEE----EEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEEEEEEEEEETTEEEEEEEEEE
T ss_pred CeEEEEEECCCCeEE----EEEEecccceeeeceeccccccccccccccccccccccccceeeeeeeccCcccccccccc
Confidence 589999999998864 7889999999999963 34567999999999999999999999999999999999
Q ss_pred EEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCC-CCchHHHHHHhcccCCCeEEEEeecCCCCCCCCc
Q psy3214 92 TLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDG-VEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANG 170 (336)
Q Consensus 92 ~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~-~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G 170 (336)
+|+|++++++++.||++...++..+....++||||||++..+... ..+++++|++||+|++++||++|++ .. ...|
T Consensus 77 ~v~ig~~~~~~~~f~~~~~~~~~~~~~~~~~GilGLg~~~~~~~~~~~~~~~~l~~~g~i~~~~fsl~l~~-~~--~~~g 153 (317)
T PF00026_consen 77 TVSIGGLTIPNQTFGLADSYSGDPFSPIPFDGILGLGFPSLSSSSTYPTFLDQLVQQGLISSNVFSLYLNP-SD--SQNG 153 (317)
T ss_dssp EEEETTEEEEEEEEEEEEEEESHHHHHSSSSEEEE-SSGGGSGGGTS-SHHHHHHHTTSSSSSEEEEEEES-TT--SSEE
T ss_pred eEeeeeccccccceeccccccccccccccccccccccCCcccccccCCcceecchhhccccccccceeeee-cc--cccc
Confidence 999999999999999999976655667888999999998876554 7899999999999999999999999 32 5689
Q ss_pred eEEecCCCCCcccccccceeEecC-Cccc------cCC-----------CccccchhhhhhhHHHHHHHhcc
Q psy3214 171 EVPIPSAHDPKELESRIENKYQLP-EYDH------VGD-----------GCSASYTEMQQWTNNIIEAINQG 224 (336)
Q Consensus 171 ~l~f~GGiD~~~~~G~l~~~~~~~-~~~~------~~~-----------~~~~s~t~l~~w~~~i~~~i~~g 224 (336)
+|+| ||+|+++|+|++.|.+... .+|. ... ...++++....++..+++.+.+.
T Consensus 154 ~l~~-Gg~d~~~~~g~~~~~~~~~~~~w~v~~~~i~i~~~~~~~~~~~~~~~Dtgt~~i~lp~~~~~~i~~~ 224 (317)
T PF00026_consen 154 SLTF-GGYDPSKYDGDLVWVPLVSSGYWSVPLDSISIGGESVFSSSGQQAILDTGTSYIYLPRSIFDAIIKA 224 (317)
T ss_dssp EEEE-SSEEGGGEESEEEEEEBSSTTTTEEEEEEEEETTEEEEEEEEEEEEEETTBSSEEEEHHHHHHHHHH
T ss_pred hhee-eccccccccCceeccCcccccccccccccccccccccccccceeeecccccccccccchhhHHHHhh
Confidence 9999 9999999999999874441 1111 000 11224566667777777666665
No 15
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=99.95 E-value=1.1e-27 Score=229.92 Aligned_cols=201 Identities=18% Similarity=0.254 Sum_probs=153.9
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEcc-c-cccceeeeecccCccCCCcEEEEEeCCceEEEEEeEEEEE
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD-V-FIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYLSQDTLR 94 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~-c-~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~D~v~ 94 (336)
+..|.+++.+|+|+|- +.+.|||||+++||++. | ..++.|+|++|+||+..++.|+|.||+|+++|.+++|+|+
T Consensus 1 ~~~Y~~~i~iGtP~Q~----~~v~~DTGSs~lWv~~~~~~~~~~~f~~~~SsT~~~~~~~~~i~Yg~Gs~~G~~~~D~v~ 76 (364)
T cd05473 1 GQGYYIEMLIGTPPQK----LNILVDTGSSNFAVAAAPHPFIHTYFHRELSSTYRDLGKGVTVPYTQGSWEGELGTDLVS 76 (364)
T ss_pred CCceEEEEEecCCCce----EEEEEecCCcceEEEcCCCccccccCCchhCcCcccCCceEEEEECcceEEEEEEEEEEE
Confidence 3579999999999885 47899999999999986 3 3577999999999999999999999999999999999999
Q ss_pred ECCeEEeceE----EEEEEeeCCcccccccccccccCccccccc--CCCCchHHHHHHhcccCCCeEEEEeecC----CC
Q psy3214 95 IGDLTIKNQV----FGEAIKEPGFTFVAAKFDGILGMGYDNIAV--DGVEPPFYNIIQQKLLEKNVFGFYLNRY----NT 164 (336)
Q Consensus 95 ig~~~v~~~~----fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~--~~~~~~~~~l~~q~~i~~~~Fs~~l~~~----~~ 164 (336)
|++. .++. |+.+....+..+....+|||||||++.++. ....|++++|++|+.+ +++||+||+.. +.
T Consensus 77 ig~~--~~~~~~~~~~~~~~~~~~~~~~~~~dGIlGLg~~~l~~~~~~~~~~~~~l~~q~~~-~~~FS~~l~~~~~~~~~ 153 (364)
T cd05473 77 IPKG--PNVTFRANIAAITESENFFLNGSNWEGILGLAYAELARPDSSVEPFFDSLVKQTGI-PDVFSLQMCGAGLPVNG 153 (364)
T ss_pred ECCC--CccceEEeeEEEeccccceecccccceeeeecccccccCCCCCCCHHHHHHhccCC-ccceEEEeccccccccc
Confidence 9863 2333 344443332112234679999999998763 3468999999999997 58999999741 11
Q ss_pred --CCCCCceEEecCCCCCcccccccceeEecCCc-------cc-------c--------CCCccccchhhhhhhHHHHHH
Q psy3214 165 --EKLANGEVPIPSAHDPKELESRIENKYQLPEY-------DH-------V--------GDGCSASYTEMQQWTNNIIEA 220 (336)
Q Consensus 165 --~~~~~G~l~f~GGiD~~~~~G~l~~~~~~~~~-------~~-------~--------~~~~~~s~t~l~~w~~~i~~~ 220 (336)
....+|+|+| ||+|+++|.|+|.|++..... +. . .....++++++..++..++++
T Consensus 154 ~~~~~~~g~l~f-Gg~D~~~~~g~l~~~p~~~~~~~~v~l~~i~vg~~~~~~~~~~~~~~~~ivDSGTs~~~lp~~~~~~ 232 (364)
T cd05473 154 SASGTVGGSMVI-GGIDPSLYKGDIWYTPIREEWYYEVIILKLEVGGQSLNLDCKEYNYDKAIVDSGTTNLRLPVKVFNA 232 (364)
T ss_pred ccccCCCcEEEe-CCcCHhhcCCCceEEecCcceeEEEEEEEEEECCEecccccccccCccEEEeCCCcceeCCHHHHHH
Confidence 2235899999 999999999999987443211 10 0 012345788899999988888
Q ss_pred Hhccc
Q psy3214 221 INQGY 225 (336)
Q Consensus 221 i~~g~ 225 (336)
+.+..
T Consensus 233 l~~~l 237 (364)
T cd05473 233 AVDAI 237 (364)
T ss_pred HHHHH
Confidence 77764
No 16
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=99.93 E-value=2.7e-25 Score=205.02 Aligned_cols=168 Identities=31% Similarity=0.547 Sum_probs=145.8
Q ss_pred ceeEeccCCccceecccccCcCCCCCCCcEEEccccc------cce--eeeecccCccCCCcEEEEEeCCceEEEEEeEE
Q psy3214 20 YVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFI------GKF--YTVFDMDNNQKNGTQFAIRYGSGAVSGYLSQD 91 (336)
Q Consensus 20 y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~------~~~--y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~D 91 (336)
|.+++.+|+|.|- +.+.+||||+++||.+..|. ... |++..|+++....+.+.+.|++|+++|.++.|
T Consensus 1 Y~~~i~iGtp~q~----~~l~~DTGS~~~wv~~~~c~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~Y~~g~~~g~~~~D 76 (283)
T cd05471 1 YYGEITIGTPPQK----FSVIFDTGSSLLWVPSSNCTSCSCQKHPRFKYDSSKSSTYKDTGCTFSITYGDGSVTGGLGTD 76 (283)
T ss_pred CEEEEEECCCCcE----EEEEEeCCCCCEEEecCCCCccccccCCCCccCccCCceeecCCCEEEEEECCCeEEEEEEEe
Confidence 7888999999875 47889999999999986321 122 78899999999999999999999999999999
Q ss_pred EEEECCeEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCce
Q psy3214 92 TLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGE 171 (336)
Q Consensus 92 ~v~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G~ 171 (336)
+|++++.+++++.||++..... .+.....+||||||++..+.....+++++|.+|++|.+++||+||.+ .......|.
T Consensus 77 ~v~~~~~~~~~~~fg~~~~~~~-~~~~~~~~GilGLg~~~~~~~~~~s~~~~l~~~~~i~~~~Fs~~l~~-~~~~~~~g~ 154 (283)
T cd05471 77 TVTIGGLTIPNQTFGCATSESG-DFSSSGFDGILGLGFPSLSVDGVPSFFDQLKSQGLISSPVFSFYLGR-DGDGGNGGE 154 (283)
T ss_pred EEEECCEEEeceEEEEEeccCC-cccccccceEeecCCcccccccCCCHHHHHHHCCCCCCCEEEEEEcC-CCCCCCCCE
Confidence 9999999999999999998874 45566789999999988776677899999999999999999999998 322357899
Q ss_pred EEecCCCCCcccccccceeEecC
Q psy3214 172 VPIPSAHDPKELESRIENKYQLP 194 (336)
Q Consensus 172 l~f~GGiD~~~~~G~l~~~~~~~ 194 (336)
|+| ||+|++++.|++.|++...
T Consensus 155 l~~-Gg~d~~~~~~~~~~~p~~~ 176 (283)
T cd05471 155 LTF-GGIDPSKYTGDLTYTPVVS 176 (283)
T ss_pred EEE-cccCccccCCceEEEecCC
Confidence 999 9999999999999976554
No 17
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=99.92 E-value=1.2e-24 Score=205.95 Aligned_cols=193 Identities=17% Similarity=0.267 Sum_probs=149.9
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEcc-c---c--ccceeeeecccCccC----------------CCcEEE
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD-V---F--IGKFYTVFDMDNNQK----------------NGTQFA 76 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~-c---~--~~~~y~~~~Sst~~~----------------~~~~~~ 76 (336)
.|.+++.+|+|+|- +.+.+||||+++||++. | . .++.|+|++|+|++. .++.+.
T Consensus 3 ~Y~~~i~vGtP~Q~----~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~~~~~~~ 78 (326)
T cd06096 3 YYFIDIFIGNPPQK----QSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCYCLSCLNNKCEYS 78 (326)
T ss_pred eEEEEEEecCCCeE----EEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccccCcCCCCcCcEE
Confidence 58999999999885 47889999999999975 2 1 246899999999986 357999
Q ss_pred EEeCCce-EEEEEeEEEEEECCeEEe-------ceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhc
Q psy3214 77 IRYGSGA-VSGYLSQDTLRIGDLTIK-------NQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQK 148 (336)
Q Consensus 77 i~Yg~Gs-~~G~~~~D~v~ig~~~v~-------~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~ 148 (336)
|.||+|+ +.|.+++|+|+|++..++ ++.||++....+ .|.....+||||||++..+. ..++...|++++
T Consensus 79 i~Y~~gs~~~G~~~~D~v~lg~~~~~~~~~~~~~~~fg~~~~~~~-~~~~~~~~GilGLg~~~~~~--~~~~~~~l~~~~ 155 (326)
T cd06096 79 ISYSEGSSISGFYFSDFVSFESYLNSNSEKESFKKIFGCHTHETN-LFLTQQATGILGLSLTKNNG--LPTPIILLFTKR 155 (326)
T ss_pred EEECCCCceeeEEEEEEEEeccCCCCccccccccEEeccCccccC-cccccccceEEEccCCcccc--cCchhHHHHHhc
Confidence 9999998 899999999999987653 467999887764 45556789999999987543 234455688887
Q ss_pred ccCC--CeEEEEeecCCCCCCCCceEEecCCCCCcccc----------cccceeEecCCc-cc-------c---------
Q psy3214 149 LLEK--NVFGFYLNRYNTEKLANGEVPIPSAHDPKELE----------SRIENKYQLPEY-DH-------V--------- 199 (336)
Q Consensus 149 ~i~~--~~Fs~~l~~~~~~~~~~G~l~f~GGiD~~~~~----------G~l~~~~~~~~~-~~-------~--------- 199 (336)
.+.. ++||+||++ + +|+|+| ||+|+++|. +++.|++..... +. .
T Consensus 156 ~~~~~~~~FS~~l~~-~-----~G~l~~-Gg~d~~~~~~~~~~~~~~~~~~~~~p~~~~~~y~v~l~~i~vg~~~~~~~~ 228 (326)
T cd06096 156 PKLKKDKIFSICLSE-D-----GGELTI-GGYDKDYTVRNSSIGNNKVSKIVWTPITRKYYYYVKLEGLSVYGTTSNSGN 228 (326)
T ss_pred ccccCCceEEEEEcC-C-----CeEEEE-CccChhhhcccccccccccCCceEEeccCCceEEEEEEEEEEcccccceec
Confidence 7765 999999998 2 699999 999999998 788887543221 10 0
Q ss_pred ---CCCccccchhhhhhhHHHHHHHhccc
Q psy3214 200 ---GDGCSASYTEMQQWTNNIIEAINQGY 225 (336)
Q Consensus 200 ---~~~~~~s~t~l~~w~~~i~~~i~~g~ 225 (336)
.....++++++...+..+++++....
T Consensus 229 ~~~~~aivDSGTs~~~lp~~~~~~l~~~~ 257 (326)
T cd06096 229 TKGLGMLVDSGSTLSHFPEDLYNKINNFF 257 (326)
T ss_pred ccCCCEEEeCCCCcccCCHHHHHHHHhhc
Confidence 00134477888888888888877654
No 18
>KOG1339|consensus
Probab=99.92 E-value=3.9e-25 Score=214.73 Aligned_cols=195 Identities=26% Similarity=0.455 Sum_probs=154.8
Q ss_pred cCceeEeccCCccceecccccCcCCCCCCCcEEEccccc------cce-eeeecccCccCCCc-E---------------
Q psy3214 18 TDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFI------GKF-YTVFDMDNNQKNGT-Q--------------- 74 (336)
Q Consensus 18 ~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~------~~~-y~~~~Sst~~~~~~-~--------------- 74 (336)
..|..++.+|+|.|.. .+.+||||+++||++.+|. +.. |+|++|+||+..++ .
T Consensus 45 ~~Y~~~i~IGTPpq~f----~v~~DTGS~~lWV~c~~c~~~C~~~~~~~f~p~~SSt~~~~~c~~~~c~~~~~~~~~~~~ 120 (398)
T KOG1339|consen 45 GEYYGNISIGTPPQSF----TVVLDTGSDLLWVPCAPCSSACYSQHNPIFDPSASSTYKSVGCSSPRCKSLPQSCSPNSS 120 (398)
T ss_pred cccEEEEecCCCCeee----EEEEeCCCCceeeccccccccccccCCCccCccccccccccCCCCccccccccCcccCCc
Confidence 3799999999999864 6889999999999985442 444 99999999999773 3
Q ss_pred --EEEEeCCce-EEEEEeEEEEEECC---eEEeceEEEEEEeeCCccccc-ccccccccCcccccccCCCCchHHHHHHh
Q psy3214 75 --FAIRYGSGA-VSGYLSQDTLRIGD---LTIKNQVFGEAIKEPGFTFVA-AKFDGILGMGYDNIAVDGVEPPFYNIIQQ 147 (336)
Q Consensus 75 --~~i~Yg~Gs-~~G~~~~D~v~ig~---~~v~~~~fg~~~~~~~~~~~~-~~~dGIlGLg~~~~s~~~~~~~~~~l~~q 147 (336)
|.|.||+|+ ++|.+++|+|++++ +.++++.||++...++. +.. .++|||||||+..++..+..+.+.++.
T Consensus 121 C~y~i~Ygd~~~~~G~l~~Dtv~~~~~~~~~~~~~~FGc~~~~~g~-~~~~~~~dGIlGLg~~~~S~~~q~~~~~~~~-- 197 (398)
T KOG1339|consen 121 CPYSIQYGDGSSTSGYLATDTVTFGGTTSLPVPNQTFGCGTNNPGS-FGLFAAFDGILGLGRGSLSVPSQLPSFYNAI-- 197 (398)
T ss_pred CceEEEeCCCCceeEEEEEEEEEEccccccccccEEEEeeecCccc-cccccccceEeecCCCCccceeecccccCCc--
Confidence 999999955 99999999999998 88888999999998875 555 678999999999988766655444443
Q ss_pred cccCCCeEEEEeecCCCCC-CCCceEEecCCCCCcccccccceeEecCCc---cc-------cC--------------C-
Q psy3214 148 KLLEKNVFGFYLNRYNTEK-LANGEVPIPSAHDPKELESRIENKYQLPEY---DH-------VG--------------D- 201 (336)
Q Consensus 148 ~~i~~~~Fs~~l~~~~~~~-~~~G~l~f~GGiD~~~~~G~l~~~~~~~~~---~~-------~~--------------~- 201 (336)
++||+||.+ +... ..+|.|+| ||+|+.++.|+++|++..... +. +. .
T Consensus 198 -----~~FS~cL~~-~~~~~~~~G~i~f-G~~d~~~~~~~l~~tPl~~~~~~~y~v~l~~I~vgg~~~~~~~~~~~~~~~ 270 (398)
T KOG1339|consen 198 -----NVFSYCLSS-NGSPSSGGGSIIF-GGVDSSHYTGSLTYTPLLSNPSTYYQVNLDGISVGGKRPIGSSLFCTDGGG 270 (398)
T ss_pred -----eeEEEEeCC-CCCCCCCCcEEEE-CCCcccCcCCceEEEeeccCCCccEEEEEeEEEECCccCCCcceEecCCCC
Confidence 299999999 4322 36899999 999999999999998654332 11 11 0
Q ss_pred CccccchhhhhhhHHHHHHHhccce
Q psy3214 202 GCSASYTEMQQWTNNIIEAINQGYA 226 (336)
Q Consensus 202 ~~~~s~t~l~~w~~~i~~~i~~g~~ 226 (336)
...++++.+.+.+..+++++.+...
T Consensus 271 ~iiDSGTs~t~lp~~~y~~i~~~~~ 295 (398)
T KOG1339|consen 271 AIIDSGTSLTYLPTSAYNALREAIG 295 (398)
T ss_pred EEEECCcceeeccHHHHHHHHHHHH
Confidence 1334778888888888888887754
No 19
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=99.91 E-value=6.9e-24 Score=197.60 Aligned_cols=170 Identities=22% Similarity=0.334 Sum_probs=137.5
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCc-eEEEEEeEEEEEECC
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSG-AVSGYLSQDTLRIGD 97 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~G-s~~G~~~~D~v~ig~ 97 (336)
-|.+++.+|+|.|- +.+.+||||+++||+ ++++.|++| +++|.+++|+|++++
T Consensus 2 ~Y~~~i~iGtp~q~----~~v~~DTgS~~~wv~----------------------~~~~~Y~~g~~~~G~~~~D~v~~g~ 55 (295)
T cd05474 2 YYSAELSVGTPPQK----VTVLLDTGSSDLWVP----------------------DFSISYGDGTSASGTWGTDTVSIGG 55 (295)
T ss_pred eEEEEEEECCCCcE----EEEEEeCCCCcceee----------------------eeEEEeccCCcEEEEEEEEEEEECC
Confidence 48999999999985 478899999999998 789999996 599999999999999
Q ss_pred eEEeceEEEEEEeeCCcccccccccccccCccccccc-----CCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCceE
Q psy3214 98 LTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAV-----DGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEV 172 (336)
Q Consensus 98 ~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~-----~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G~l 172 (336)
.+++++.||+++... ..+||||||++..+. ...++++++|++||+|++++||+||.+ . ....|.|
T Consensus 56 ~~~~~~~fg~~~~~~-------~~~GilGLg~~~~~~~~~~~~~~~s~~~~L~~~g~i~~~~Fsl~l~~-~--~~~~g~l 125 (295)
T cd05474 56 ATVKNLQFAVANSTS-------SDVGVLGIGLPGNEATYGTGYTYPNFPIALKKQGLIKKNAYSLYLND-L--DASTGSI 125 (295)
T ss_pred eEecceEEEEEecCC-------CCcceeeECCCCCcccccCCCcCCCHHHHHHHCCcccceEEEEEeCC-C--CCCceeE
Confidence 999999999998742 358999999988743 245689999999999999999999998 3 2357999
Q ss_pred EecCCCCCcccccccceeEecCC-------ccc-------c----------C---CCccccchhhhhhhHHHHHHHhccc
Q psy3214 173 PIPSAHDPKELESRIENKYQLPE-------YDH-------V----------G---DGCSASYTEMQQWTNNIIEAINQGY 225 (336)
Q Consensus 173 ~f~GGiD~~~~~G~l~~~~~~~~-------~~~-------~----------~---~~~~~s~t~l~~w~~~i~~~i~~g~ 225 (336)
+| ||+|+++|.|+++|++.... ++. . . ....++++++..++..+++++.+..
T Consensus 126 ~~-Gg~d~~~~~g~~~~~p~~~~~~~~~~~~~~v~l~~i~v~~~~~~~~~~~~~~~~iiDSGt~~~~lP~~~~~~l~~~~ 204 (295)
T cd05474 126 LF-GGVDTAKYSGDLVTLPIVNDNGGSEPSELSVTLSSISVNGSSGNTTLLSKNLPALLDSGTTLTYLPSDIVDAIAKQL 204 (295)
T ss_pred EE-eeeccceeeceeEEEeCcCcCCCCCceEEEEEEEEEEEEcCCCcccccCCCccEEECCCCccEeCCHHHHHHHHHHh
Confidence 99 99999999999998754432 110 0 0 0123366777788887877777653
No 20
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=99.88 E-value=1.9e-22 Score=185.44 Aligned_cols=137 Identities=20% Similarity=0.325 Sum_probs=116.5
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCce-EEEEEeEEEEEECC
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGA-VSGYLSQDTLRIGD 97 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs-~~G~~~~D~v~ig~ 97 (336)
+|.+++.+|+|.|- +.+.+||||+++||++ +.+.+.|++|+ ++|.+++|+|+|++
T Consensus 1 ~Y~~~i~iGtP~q~----~~v~~DTGSs~~wv~~--------------------~~~~~~Y~dg~~~~G~~~~D~v~~g~ 56 (265)
T cd05476 1 EYLVTLSIGTPPQP----FSLIVDTGSDLTWTQC--------------------CSYEYSYGDGSSTSGVLATETFTFGD 56 (265)
T ss_pred CeEEEEecCCCCcc----eEEEecCCCCCEEEcC--------------------CceEeEeCCCceeeeeEEEEEEEecC
Confidence 58999999999885 4788999999999973 57899999775 99999999999999
Q ss_pred e--EEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCceEEec
Q psy3214 98 L--TIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIP 175 (336)
Q Consensus 98 ~--~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G~l~f~ 175 (336)
. +++++.||++...++ +.....+||||||++.. ++..+|..++ ++||+||.+ +.....+|+|+|
T Consensus 57 ~~~~~~~~~Fg~~~~~~~--~~~~~~~GIlGLg~~~~------s~~~ql~~~~----~~Fs~~l~~-~~~~~~~G~l~f- 122 (265)
T cd05476 57 SSVSVPNVAFGCGTDNEG--GSFGGADGILGLGRGPL------SLVSQLGSTG----NKFSYCLVP-HDDTGGSSPLIL- 122 (265)
T ss_pred CCCccCCEEEEecccccC--CccCCCCEEEECCCCcc------cHHHHhhccc----CeeEEEccC-CCCCCCCCeEEE-
Confidence 8 899999999998874 55667899999998654 3566777665 899999998 433456899999
Q ss_pred CCCCCcccccccceeEecC
Q psy3214 176 SAHDPKELESRIENKYQLP 194 (336)
Q Consensus 176 GGiD~~~~~G~l~~~~~~~ 194 (336)
||+|++ |.|++.|++...
T Consensus 123 Gg~d~~-~~~~l~~~p~~~ 140 (265)
T cd05476 123 GDAADL-GGSGVVYTPLVK 140 (265)
T ss_pred CCcccc-cCCCceEeeccc
Confidence 999999 999999976554
No 21
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=99.87 E-value=4.7e-22 Score=185.92 Aligned_cols=169 Identities=19% Similarity=0.324 Sum_probs=129.2
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCce-EEEEEeEEEEEECC
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGA-VSGYLSQDTLRIGD 97 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs-~~G~~~~D~v~ig~ 97 (336)
+|.+++.+|+|+|. +.+.+||||+++||.+..| +.|.+.|++|+ ++|.+++|+|+|++
T Consensus 1 ~Y~~~i~iGtP~q~----~~v~~DTGSs~~Wv~c~~c-----------------~~~~i~Yg~Gs~~~G~~~~D~v~ig~ 59 (299)
T cd05472 1 EYVVTVGLGTPARD----QTVIVDTGSDLTWVQCQPC-----------------CLYQVSYGDGSYTTGDLATDTLTLGS 59 (299)
T ss_pred CeEEEEecCCCCcc----eEEEecCCCCcccccCCCC-----------------CeeeeEeCCCceEEEEEEEEEEEeCC
Confidence 58999999999985 4789999999999975322 68999999999 68999999999999
Q ss_pred e-EEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCceEEecC
Q psy3214 98 L-TIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPS 176 (336)
Q Consensus 98 ~-~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G~l~f~G 176 (336)
. .++++.||++...++ .+. ..+||||||++..+ +..+|..+ .+++||+||++ .. ...+|+|+| |
T Consensus 60 ~~~~~~~~Fg~~~~~~~-~~~--~~~GilGLg~~~~s------~~~ql~~~---~~~~FS~~L~~-~~-~~~~G~l~f-G 124 (299)
T cd05472 60 SDVVPGFAFGCGHDNEG-LFG--GAAGLLGLGRGKLS------LPSQTASS---YGGVFSYCLPD-RS-SSSSGYLSF-G 124 (299)
T ss_pred CCccCCEEEECCccCCC-ccC--CCCEEEECCCCcch------HHHHhhHh---hcCceEEEccC-CC-CCCCceEEe-C
Confidence 8 899999999987764 332 68999999987653 33445443 35899999987 32 246899999 9
Q ss_pred CCCCcccccccceeEecCC-----ccc-------cC--------------CCccccchhhhhhhHHHHHHHhccc
Q psy3214 177 AHDPKELESRIENKYQLPE-----YDH-------VG--------------DGCSASYTEMQQWTNNIIEAINQGY 225 (336)
Q Consensus 177 GiD~~~~~G~l~~~~~~~~-----~~~-------~~--------------~~~~~s~t~l~~w~~~i~~~i~~g~ 225 (336)
|+|++ +|++.|++.... ++. +. ....++++++.+.+..+++++.+..
T Consensus 125 g~d~~--~g~l~~~pv~~~~~~~~~y~v~l~~i~vg~~~~~~~~~~~~~~~~ivDSGTt~~~lp~~~~~~l~~~l 197 (299)
T cd05472 125 AAASV--PAGASFTPMLSNPRVPTFYYVGLTGISVGGRRLPIPPASFGAGGVIIDSGTVITRLPPSAYAALRDAF 197 (299)
T ss_pred Ccccc--CCCceECCCccCCCCCCeEEEeeEEEEECCEECCCCccccCCCCeEEeCCCcceecCHHHHHHHHHHH
Confidence 99999 999998754421 111 00 1133467888888887777777653
No 22
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=99.85 E-value=1.2e-20 Score=174.39 Aligned_cols=143 Identities=23% Similarity=0.319 Sum_probs=114.7
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEcc-ccccceeeeecccCccCCCcEEEEEeCCc-eEEEEEeEEEEEEC
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD-VFIGKFYTVFDMDNNQKNGTQFAIRYGSG-AVSGYLSQDTLRIG 96 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~-c~~~~~y~~~~Sst~~~~~~~~~i~Yg~G-s~~G~~~~D~v~ig 96 (336)
.|.+++.+|+|.|.+ .+.+||||+++||+++ +|.. .++.|++.||+| +++|.+++|+|+++
T Consensus 2 ~Y~~~i~iGtP~q~~----~v~~DTGS~~~Wv~c~~~c~~-------------c~c~~~i~Ygd~~~~~G~~~~D~v~~~ 64 (273)
T cd05475 2 YYYVTINIGNPPKPY----FLDIDTGSDLTWLQCDAPCTG-------------CQCDYEIEYADGGSSMGVLVTDIFSLK 64 (273)
T ss_pred ceEEEEEcCCCCeeE----EEEEccCCCceEEeCCCCCCC-------------CcCccEeEeCCCCceEEEEEEEEEEEe
Confidence 589999999999864 7889999999999752 1211 157899999965 59999999999995
Q ss_pred ----CeEEeceEEEEEEeeCCcc-cccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCCCCCCCce
Q psy3214 97 ----DLTIKNQVFGEAIKEPGFT-FVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGE 171 (336)
Q Consensus 97 ----~~~v~~~~fg~~~~~~~~~-~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~~~~~~G~ 171 (336)
+..++++.||++....+.. +...+.|||||||+... +++++|.+|+++ +++||+||++ + .+|.
T Consensus 65 ~~~~~~~~~~~~Fgc~~~~~~~~~~~~~~~dGIlGLg~~~~------s~~~ql~~~~~i-~~~Fs~~l~~-~----~~g~ 132 (273)
T cd05475 65 LTNGSRAKPRIAFGCGYDQQGPLLNPPPPTDGILGLGRGKI------SLPSQLASQGII-KNVIGHCLSS-N----GGGF 132 (273)
T ss_pred ecCCCcccCCEEEEeeeccCCcccCCCccCCEEEECCCCCC------CHHHHHHhcCCc-CceEEEEccC-C----CCeE
Confidence 3578899999998765422 23457899999998653 578899999999 8999999987 3 3689
Q ss_pred EEecCCCCCcccccccceeEec
Q psy3214 172 VPIPSAHDPKELESRIENKYQL 193 (336)
Q Consensus 172 l~f~GGiD~~~~~G~l~~~~~~ 193 (336)
|+| | |.+++.|++.|++..
T Consensus 133 l~~-G--~~~~~~g~i~ytpl~ 151 (273)
T cd05475 133 LFF-G--DDLVPSSGVTWTPMR 151 (273)
T ss_pred EEE-C--CCCCCCCCeeecccc
Confidence 999 8 667788999987544
No 23
>PLN03146 aspartyl protease family protein; Provisional
Probab=99.84 E-value=1.6e-20 Score=183.94 Aligned_cols=192 Identities=16% Similarity=0.310 Sum_probs=138.8
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEcc----cc--ccceeeeecccCccCCC------------------
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGD----VF--IGKFYTVFDMDNNQKNG------------------ 72 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~----c~--~~~~y~~~~Sst~~~~~------------------ 72 (336)
..+|++++.+|+|+|.+ .+.+||||+++||++. |. .+..|+|++|+||+...
T Consensus 82 ~~~Y~v~i~iGTPpq~~----~vi~DTGS~l~Wv~C~~C~~C~~~~~~~fdps~SST~~~~~C~s~~C~~~~~~~~c~~~ 157 (431)
T PLN03146 82 GGEYLMNISIGTPPVPI----LAIADTGSDLIWTQCKPCDDCYKQVSPLFDPKKSSTYKDVSCDSSQCQALGNQASCSDE 157 (431)
T ss_pred CccEEEEEEcCCCCceE----EEEECCCCCcceEcCCCCcccccCCCCcccCCCCCCCcccCCCCcccccCCCCCCCCCC
Confidence 45899999999998864 7889999999999974 33 35689999999998742
Q ss_pred --cEEEEEeCCce-EEEEEeEEEEEECC-----eEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHH
Q psy3214 73 --TQFAIRYGSGA-VSGYLSQDTLRIGD-----LTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNI 144 (336)
Q Consensus 73 --~~~~i~Yg~Gs-~~G~~~~D~v~ig~-----~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l 144 (336)
+.|.+.||+|+ +.|.+++|+|++++ .+++++.||++....+ .|. ...+||||||+..++ ++.+|
T Consensus 158 ~~c~y~i~Ygdgs~~~G~l~~Dtltlg~~~~~~~~v~~~~FGc~~~~~g-~f~-~~~~GilGLG~~~~S------l~sql 229 (431)
T PLN03146 158 NTCTYSYSYGDGSFTKGNLAVETLTIGSTSGRPVSFPGIVFGCGHNNGG-TFD-EKGSGIVGLGGGPLS------LISQL 229 (431)
T ss_pred CCCeeEEEeCCCCceeeEEEEEEEEeccCCCCcceeCCEEEeCCCCCCC-Ccc-CCCceeEecCCCCcc------HHHHh
Confidence 78999999998 68999999999997 5699999999987765 343 257899999987553 45555
Q ss_pred HHhcccCCCeEEEEeecCCCCCCCCceEEecCCCCCccccc-ccceeEecC----Cccc-------cC------------
Q psy3214 145 IQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDPKELES-RIENKYQLP----EYDH-------VG------------ 200 (336)
Q Consensus 145 ~~q~~i~~~~Fs~~l~~~~~~~~~~G~l~f~GGiD~~~~~G-~l~~~~~~~----~~~~-------~~------------ 200 (336)
..+ +. ++||+||.+.+.+....|.|+| ||. +++.| .+.|++.+. .++. ++
T Consensus 230 ~~~--~~-~~FSycL~~~~~~~~~~g~l~f-G~~--~~~~~~~~~~tPl~~~~~~~~y~V~L~gIsVgg~~l~~~~~~~~ 303 (431)
T PLN03146 230 GSS--IG-GKFSYCLVPLSSDSNGTSKINF-GTN--AIVSGSGVVSTPLVSKDPDTFYYLTLEAISVGSKKLPYTGSSKN 303 (431)
T ss_pred hHh--hC-CcEEEECCCCCCCCCCcceEEe-CCc--cccCCCCceEcccccCCCCCeEEEeEEEEEECCEECcCCccccc
Confidence 432 44 5999999752222235899999 984 44544 366654331 1110 00
Q ss_pred -----CCccccchhhhhhhHHHHHHHhccce
Q psy3214 201 -----DGCSASYTEMQQWTNNIIEAINQGYA 226 (336)
Q Consensus 201 -----~~~~~s~t~l~~w~~~i~~~i~~g~~ 226 (336)
....++++.+.+.+..+++++.+...
T Consensus 304 ~~~~g~~iiDSGTt~t~Lp~~~y~~l~~~~~ 334 (431)
T PLN03146 304 GVEEGNIIIDSGTTLTLLPSDFYSELESAVE 334 (431)
T ss_pred cCCCCcEEEeCCccceecCHHHHHHHHHHHH
Confidence 01234677788888887777666543
No 24
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=99.79 E-value=8.2e-19 Score=139.46 Aligned_cols=101 Identities=35% Similarity=0.595 Sum_probs=87.8
Q ss_pred EeccCCccceecccccCcCCCCCCCcEEEcccc------cccee-eeecccCccCCCcEEEEEeCCceEEEEEeEEEEEE
Q psy3214 23 KVSIASSSMCLSGFAGMDIPPPAGPIWILGDVF------IGKFY-TVFDMDNNQKNGTQFAIRYGSGAVSGYLSQDTLRI 95 (336)
Q Consensus 23 ~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~------~~~~y-~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~~~D~v~i 95 (336)
++.+|+|+|- +.+.+||||+++||.+..| .++.| +|++|++++..++.+.+.|++|++.|.++.|+|+|
T Consensus 2 ~i~vGtP~q~----~~~~~DTGSs~~Wv~~~~c~~~~~~~~~~~~~~~~sst~~~~~~~~~~~Y~~g~~~g~~~~D~v~i 77 (109)
T cd05470 2 EIGIGTPPQT----FNVLLDTGSSNLWVPSVDCQSLAIYSHSSYDDPSASSTYSDNGCTFSITYGTGSLSGGLSTDTVSI 77 (109)
T ss_pred EEEeCCCCce----EEEEEeCCCCCEEEeCCCCCCcccccccccCCcCCCCCCCCCCcEEEEEeCCCeEEEEEEEEEEEE
Confidence 5678999885 4788999999999997622 23455 99999999999999999999999999999999999
Q ss_pred CCeEEeceEEEEEEeeCCcccccccccccccC
Q psy3214 96 GDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127 (336)
Q Consensus 96 g~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGL 127 (336)
++..++++.||++....+..+.....+|||||
T Consensus 78 g~~~~~~~~fg~~~~~~~~~~~~~~~~GilGL 109 (109)
T cd05470 78 GDIEVVGQAFGCATDEPGATFLPALFDGILGL 109 (109)
T ss_pred CCEEECCEEEEEEEecCCccccccccccccCC
Confidence 99999999999999987654555678999998
No 25
>PF14543 TAXi_N: Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=99.67 E-value=4.8e-16 Score=132.93 Aligned_cols=136 Identities=22% Similarity=0.405 Sum_probs=103.3
Q ss_pred ceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeeecccCccCC----------------------CcEEEE
Q psy3214 20 YVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKN----------------------GTQFAI 77 (336)
Q Consensus 20 y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~----------------------~~~~~i 77 (336)
|.+++.+|+|.+- ..+.+||||+.+|+++ ....|+|.+|+||+.. .+.+.+
T Consensus 1 Y~~~~~iGtP~~~----~~lvvDtgs~l~W~~C---~~~~f~~~~Sst~~~v~C~s~~C~~~~~~~~~~~~~~~~C~y~~ 73 (164)
T PF14543_consen 1 YYVSVSIGTPPQP----FSLVVDTGSDLTWVQC---PDPPFDPSKSSTYRPVPCSSPQCSSAPSFCPCCCCSNNSCPYSQ 73 (164)
T ss_dssp EEEEEECTCTTEE----EEEEEETT-SSEEEET-------STT-TTSSBEC-BTTSHHHHHCTSSBTCCTCESSEEEEEE
T ss_pred CEEEEEeCCCCce----EEEEEECCCCceEEcC---CCcccCCccCCcccccCCCCcchhhcccccccCCCCcCccccee
Confidence 7899999999874 4788899999999975 6678999999999852 158999
Q ss_pred EeCCce-EEEEEeEEEEEECC-----eEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccC
Q psy3214 78 RYGSGA-VSGYLSQDTLRIGD-----LTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLE 151 (336)
Q Consensus 78 ~Yg~Gs-~~G~~~~D~v~ig~-----~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~ 151 (336)
.|++|+ ..|.+++|+|+++. ..+.+..||++....+. +. ..+||||||...+ +++.+|.++ .
T Consensus 74 ~y~~~s~~~G~l~~D~~~~~~~~~~~~~~~~~~FGC~~~~~g~-~~--~~~GilGLg~~~~------Sl~sQl~~~---~ 141 (164)
T PF14543_consen 74 SYGDGSSSSGFLASDTLTFGSSSGGSNSVPDFIFGCATSNSGL-FY--GADGILGLGRGPL------SLPSQLASS---S 141 (164)
T ss_dssp EETTTEEEEEEEEEEEEEEEEESSSSEEEEEEEEEEE-GGGTS-ST--TEEEEEE-SSSTT------SHHHHHHHH----
T ss_pred ecCCCccccCceEEEEEEecCCCCCCceeeeEEEEeeeccccC-Cc--CCCcccccCCCcc------cHHHHHHHh---c
Confidence 999888 89999999999986 56888999999988753 22 7799999996654 677888777 6
Q ss_pred CCeEEEEeecCCCCCCCCceEEecCC
Q psy3214 152 KNVFGFYLNRYNTEKLANGEVPIPSA 177 (336)
Q Consensus 152 ~~~Fs~~l~~~~~~~~~~G~l~f~GG 177 (336)
.+.||++|.+ . +....|.|+| |+
T Consensus 142 ~~~FSyCL~~-~-~~~~~g~l~f-G~ 164 (164)
T PF14543_consen 142 GNKFSYCLPS-S-SPSSSGFLSF-GD 164 (164)
T ss_dssp -SEEEEEB-S---SSSSEEEEEE-CS
T ss_pred CCeEEEECCC-C-CCCCCEEEEe-Cc
Confidence 7999999988 2 3457899999 85
No 26
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=99.64 E-value=1e-15 Score=146.85 Aligned_cols=145 Identities=15% Similarity=0.097 Sum_probs=101.3
Q ss_pred ccCcCCCCCCCcEEEcc----------------ccccceeeeec------ccCccCCCcEEEEE-eCCce-EEEEEeEEE
Q psy3214 37 AGMDIPPPAGPIWILGD----------------VFIGKFYTVFD------MDNNQKNGTQFAIR-YGSGA-VSGYLSQDT 92 (336)
Q Consensus 37 ~~~~~dtGS~~~Wv~~~----------------c~~~~~y~~~~------Sst~~~~~~~~~i~-Yg~Gs-~~G~~~~D~ 92 (336)
+.+.+||||+.+||.+. |..+..|++.+ ++......+.+... |++|+ .+|.+++|+
T Consensus 10 ~~~~~DTGS~l~WvqC~~~~sst~~~~~C~s~~C~~~~~~~~~~~~~~~~~~~c~~~~C~y~~~~y~~gs~t~G~l~~Dt 89 (362)
T cd05489 10 VPLVLDLAGPLLWSTCDAGHSSTYQTVPCSSSVCSLANRYHCPGTCGGAPGPGCGNNTCTAHPYNPVTGECATGDLTQDV 89 (362)
T ss_pred eeEEEECCCCceeeeCCCCCcCCCCccCcCChhhccccccCCCccccCCCCCCCCCCcCeeEccccccCcEeeEEEEEEE
Confidence 57889999999999543 22233343321 11222223556554 78895 899999999
Q ss_pred EEECC--------eEEeceEEEEEEeeCCcccccccccccccCcccccccCCCCchHHHHHHhcccCCCeEEEEeecCCC
Q psy3214 93 LRIGD--------LTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNT 164 (336)
Q Consensus 93 v~ig~--------~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~~~~s~~~~~~~~~~l~~q~~i~~~~Fs~~l~~~~~ 164 (336)
|+++. .+++++.||++.......+ ...+|||||||+..++. ..+|..+.. .+++||+||.+ +.
T Consensus 90 l~~~~~~g~~~~~~~~~~~~FGC~~~~~~~~~-~~~~dGIlGLg~~~lSl------~sql~~~~~-~~~~FS~CL~~-~~ 160 (362)
T cd05489 90 LSANTTDGSNPLLVVIFNFVFSCAPSLLLKGL-PPGAQGVAGLGRSPLSL------PAQLASAFG-VARKFALCLPS-SP 160 (362)
T ss_pred EEecccCCCCcccceeCCEEEEcCCcccccCC-ccccccccccCCCccch------HHHhhhhcC-CCcceEEEeCC-CC
Confidence 99973 3799999999987532122 23489999999988753 335555444 57999999998 42
Q ss_pred CCCCCceEEecCCCCCcccc------cccceeEec
Q psy3214 165 EKLANGEVPIPSAHDPKELE------SRIENKYQL 193 (336)
Q Consensus 165 ~~~~~G~l~f~GGiD~~~~~------G~l~~~~~~ 193 (336)
..+|+|+| ||.|++++. |++.|++..
T Consensus 161 --~~~g~l~f-G~~~~~~~~~~~~~~~~~~~tPl~ 192 (362)
T cd05489 161 --GGPGVAIF-GGGPYYLFPPPIDLSKSLSYTPLL 192 (362)
T ss_pred --CCCeeEEE-CCCchhcccccccccCCccccccc
Confidence 35899999 999998885 778887544
No 27
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=97.87 E-value=2e-05 Score=60.19 Aligned_cols=92 Identities=20% Similarity=0.264 Sum_probs=63.5
Q ss_pred CceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCce-EEEEEeEEEEEECC
Q psy3214 19 DYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGA-VSGYLSQDTLRIGD 97 (336)
Q Consensus 19 ~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs-~~G~~~~D~v~ig~ 97 (336)
.|.+++.+++.+ ..+.+||||+.+||...+...-.. -........+..++|. .......+.+++|+
T Consensus 2 ~~~v~v~i~~~~------~~~llDTGa~~s~i~~~~~~~l~~-------~~~~~~~~~~~~~~G~~~~~~~~~~~i~ig~ 68 (96)
T cd05483 2 HFVVPVTINGQP------VRFLLDTGASTTVISEELAERLGL-------PLTLGGKVTVQTANGRVRAARVRLDSLQIGG 68 (96)
T ss_pred cEEEEEEECCEE------EEEEEECCCCcEEcCHHHHHHcCC-------CccCCCcEEEEecCCCccceEEEcceEEECC
Confidence 356777776432 256789999999998653211111 1223456678888888 55566689999999
Q ss_pred eEEeceEEEEEEeeCCcccccccccccccCcc
Q psy3214 98 LTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY 129 (336)
Q Consensus 98 ~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~ 129 (336)
.+++++.+.++..... ..|||||+.+
T Consensus 69 ~~~~~~~~~v~d~~~~------~~~gIlG~d~ 94 (96)
T cd05483 69 ITLRNVPAVVLPGDAL------GVDGLLGMDF 94 (96)
T ss_pred cEEeccEEEEeCCccc------CCceEeChHH
Confidence 9999999988765431 5799999864
No 28
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=97.40 E-value=0.0002 Score=67.55 Aligned_cols=61 Identities=56% Similarity=1.214 Sum_probs=50.6
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|+++|+.|+|++.+|+++...+.+..|+.+++.+.+.....+.||+|+.+.+++|.
T Consensus 244 ~P~i~f~f~g~~~~l~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlGd~Flr~~y~ 304 (317)
T cd06098 244 MPNVSFTIGGKTFELTPEQYILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYHT 304 (317)
T ss_pred CCcEEEEECCEEEEEChHHeEEeecCCCCCEEeceEEECCCCCCCCCeEEechHHhcccEE
Confidence 6999999999999999999999877666678998887665544445789999999888763
No 29
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=97.38 E-value=0.00022 Score=67.42 Aligned_cols=61 Identities=62% Similarity=1.280 Sum_probs=50.0
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|.++|+.|+|++.+|+++........|+.+++.+++.......||+|+.+.++.|.
T Consensus 252 ~P~i~f~fgg~~~~l~~~~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilGd~flr~~y~ 312 (325)
T cd05490 252 LPVISFSLGGKVYPLTGEDYILKVSQRGTTICLSGFMGLDIPPPAGPLWILGDVFIGRYYT 312 (325)
T ss_pred CCCEEEEECCEEEEEChHHeEEeccCCCCCEEeeEEEECCCCCCCCceEEEChHhheeeEE
Confidence 6999999999999999999999876544567998887766544456789999999888764
No 30
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=97.25 E-value=0.00037 Score=65.64 Aligned_cols=61 Identities=49% Similarity=1.092 Sum_probs=49.4
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|.++|+.|+|++.+|++......+..|+.+++.+.+.....+.||+|+.+.++.|.
T Consensus 243 ~p~i~f~f~g~~~~l~~~~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ILGd~flr~~y~ 303 (316)
T cd05486 243 MPSVTFTINGIPYSLSPQAYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYYS 303 (316)
T ss_pred CCCEEEEECCEEEEeCHHHeEEecccCCCCEEeeEEEECCCCCCCCCeEEEchHHhcceEE
Confidence 6999999999999999999999754334567998887766544455789999999888764
No 31
>PTZ00165 aspartyl protease; Provisional
Probab=97.14 E-value=0.00048 Score=68.72 Aligned_cols=61 Identities=34% Similarity=0.866 Sum_probs=49.6
Q ss_pred CCcEEEEECCE-----EeeeCccCceeEec--cCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGK-----NFTLTPTDYVLKVS--IASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~-----~~~i~~~~y~~~~~--~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|+++|+ .|+|++.+|+++.. ...+..|+++++.+++....++.||+|+.|.+++|.
T Consensus 365 lP~itf~f~g~~g~~v~~~l~p~dYi~~~~~~~~~~~~C~~g~~~~d~~~~~g~~~ILGd~Flr~yy~ 432 (482)
T PTZ00165 365 LPRISFVLEDVNGRKIKFDMDPEDYVIEEGDSEEQEHQCVIGIIPMDVPAPRGPLFVLGNNFIRKYYS 432 (482)
T ss_pred CCceEEEECCCCCceEEEEEchHHeeeecccCCCCCCeEEEEEEECCCCCCCCceEEEchhhheeEEE
Confidence 69999999986 99999999999853 223468999998877654456899999999998874
No 32
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=97.07 E-value=0.00066 Score=64.26 Aligned_cols=61 Identities=52% Similarity=1.088 Sum_probs=50.0
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|.++|+.++|++.+|+++.....+..|.+++....+..+.++.||+|+.+.++.|.
T Consensus 252 ~P~i~f~fgg~~~~v~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilG~~flr~~y~ 312 (326)
T cd05487 252 LPDISFHLGGKEYTLSSSDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFYT 312 (326)
T ss_pred CCCEEEEECCEEEEeCHHHhEEeccCCCCCEEEEEEEeCCCCCCCCCeEEEehHHhhccEE
Confidence 5999999999999999999999876555678998888765433345689999999888764
No 33
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=96.98 E-value=0.00098 Score=63.22 Aligned_cols=60 Identities=65% Similarity=1.363 Sum_probs=48.6
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|+|+|.++|++|+|++++|+++..-.+...|+.+++...+.......||+|+.+.++.|
T Consensus 256 ~p~i~f~fgg~~~~i~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlG~~fl~~~y 315 (329)
T cd05485 256 LPDITFVLGGKSFSLTGKDYVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYY 315 (329)
T ss_pred CCcEEEEECCEEeEEChHHeEEEecCCCCCEEeeeEEECcCCCCCCCeEEEchHHhccce
Confidence 589999999999999999999987654457899888765543344568999999988866
No 34
>PTZ00147 plasmepsin-1; Provisional
Probab=96.18 E-value=0.005 Score=61.00 Aligned_cols=58 Identities=28% Similarity=0.602 Sum_probs=46.4
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|.++|+.|+|+|.+|+.+..-..+..|+.+++.... .+..||+|+.+.++.|.
T Consensus 378 lP~~~f~f~g~~~~L~p~~yi~~~~~~~~~~C~~~i~~~~~---~~~~~ILGd~FLr~~Yt 435 (453)
T PTZ00147 378 LPTLEFRSPNKVYTLEPEYYLQPIEDIGSALCMLNIIPIDL---EKNTFILGDPFMRKYFT 435 (453)
T ss_pred CCeEEEEECCEEEEECHHHheeccccCCCcEEEEEEEECCC---CCCCEEECHHHhccEEE
Confidence 69999999999999999999986544445679888776443 23579999999988773
No 35
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=96.11 E-value=0.006 Score=57.57 Aligned_cols=56 Identities=50% Similarity=1.038 Sum_probs=44.4
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|.|+|.++|+.|.|++++|+++. ...|+..+....+....++.||+|+.+.++.|
T Consensus 251 ~P~i~f~f~g~~~~i~~~~y~~~~----~g~C~~~~~~~~~~~~~~~~~ilG~~fl~~~y 306 (320)
T cd05488 251 LPDLTFNFDGYNFTLGPFDYTLEV----SGSCISAFTGMDFPEPVGPLAIVGDAFLRKYY 306 (320)
T ss_pred CCCEEEEECCEEEEECHHHheecC----CCeEEEEEEECcCCCCCCCeEEEchHHhhheE
Confidence 599999999999999999999853 24698877665543334568999999988766
No 36
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=95.80 E-value=0.0092 Score=59.08 Aligned_cols=58 Identities=28% Similarity=0.593 Sum_probs=45.4
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|.++|+.|+|+|.+|+.......+..|+.++.....+ ...||+|+.+.++.|.
T Consensus 377 lP~i~F~~~g~~~~L~p~~Yi~~~~~~~~~~C~~~i~~~~~~---~~~~ILGd~FLr~~Y~ 434 (450)
T PTZ00013 377 MPTLEFKSANNTYTLEPEYYMNPLLDVDDTLCMITMLPVDID---DNTFILGDPFMRKYFT 434 (450)
T ss_pred CCeEEEEECCEEEEECHHHheehhccCCCCeeEEEEEECCCC---CCCEEECHHHhccEEE
Confidence 699999999999999999999754322345799888665432 3589999999888774
No 37
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=95.68 E-value=0.0084 Score=55.90 Aligned_cols=62 Identities=34% Similarity=0.810 Sum_probs=52.9
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVF 63 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~ 63 (336)
+|.|+|.+++..|+|++.+|+.+...+....|.+++..+.. .+..+.||+|..++++.|---
T Consensus 244 ~p~l~f~~~~~~~~i~~~~~~~~~~~~~~~~C~~~i~~~~~-~~~~~~~iLG~~fl~~~y~vf 305 (317)
T PF00026_consen 244 LPDLTFTFGGVTFTIPPSDYIFKIEDGNGGYCYLGIQPMDS-SDDSDDWILGSPFLRNYYVVF 305 (317)
T ss_dssp SEEEEEEETTEEEEEEHHHHEEEESSTTSSEEEESEEEESS-TTSSSEEEEEHHHHTTEEEEE
T ss_pred cceEEEeeCCEEEEecchHhcccccccccceeEeeeecccc-cccCCceEecHHHhhceEEEE
Confidence 58999999999999999999999887766689999887555 567789999999999887443
No 38
>PF13650 Asp_protease_2: Aspartyl protease
Probab=95.53 E-value=0.068 Score=39.83 Aligned_cols=78 Identities=22% Similarity=0.184 Sum_probs=45.9
Q ss_pred cCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCce-EEEEEeEEEEEECCeEEeceEEEEEEeeCCccc
Q psy3214 38 GMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGA-VSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTF 116 (336)
Q Consensus 38 ~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs-~~G~~~~D~v~ig~~~v~~~~fg~~~~~~~~~~ 116 (336)
.+.+|||++.+.|..+...+-...+ ........+.-.+|. .......+.+++|+.++.+..|-++.
T Consensus 11 ~~liDTGa~~~~i~~~~~~~l~~~~------~~~~~~~~~~~~~g~~~~~~~~~~~i~ig~~~~~~~~~~v~~------- 77 (90)
T PF13650_consen 11 RFLIDTGASISVISRSLAKKLGLKP------RPKSVPISVSGAGGSVTVYRGRVDSITIGGITLKNVPFLVVD------- 77 (90)
T ss_pred EEEEcCCCCcEEECHHHHHHcCCCC------cCCceeEEEEeCCCCEEEEEEEEEEEEECCEEEEeEEEEEEC-------
Confidence 4568999998776544321111111 111112333334444 34556667899999999888877665
Q ss_pred ccccccccccCc
Q psy3214 117 VAAKFDGILGMG 128 (336)
Q Consensus 117 ~~~~~dGIlGLg 128 (336)
.....+||||+-
T Consensus 78 ~~~~~~~iLG~d 89 (90)
T PF13650_consen 78 LGDPIDGILGMD 89 (90)
T ss_pred CCCCCEEEeCCc
Confidence 134568999974
No 39
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=95.19 E-value=0.019 Score=54.09 Aligned_cols=53 Identities=43% Similarity=0.978 Sum_probs=43.0
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|.++|+.++|++++|+.+. ...|+.++.... ..+.||+|+.+.++.|.
T Consensus 252 ~P~~~f~f~g~~~~i~~~~y~~~~----~~~C~~~~~~~~----~~~~~IlG~~fl~~~y~ 304 (317)
T cd05478 252 MPDVVFTINGVQYPLPPSAYILQD----QGSCTSGFQSMG----LGELWILGDVFIRQYYS 304 (317)
T ss_pred CCcEEEEECCEEEEECHHHheecC----CCEEeEEEEeCC----CCCeEEechHHhcceEE
Confidence 699999999999999999999864 467987776532 24679999998888663
No 40
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=94.56 E-value=0.04 Score=51.82 Aligned_cols=57 Identities=33% Similarity=0.880 Sum_probs=44.2
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCC-CCCCcEEEccccccceee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPP-PAGPIWILGDVFIGKFYT 61 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dt-GS~~~Wv~~~c~~~~~y~ 61 (336)
+|+|+|.++|+.+.|++.+|+.+. ...|..++.....++ +..+.||+|+.+.++.|.
T Consensus 247 ~p~l~~~f~g~~~~v~~~~y~~~~----~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~ 304 (318)
T cd05477 247 LPTLTFTINGVSFPLPPSAYILQN----NGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYS 304 (318)
T ss_pred CCcEEEEECCEEEEECHHHeEecC----CCeEEEEEEecccCCCCCCceEEEcHHHhhheEE
Confidence 589999999999999999999863 357988876544332 223679999998887653
No 41
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=93.74 E-value=0.15 Score=41.14 Aligned_cols=93 Identities=11% Similarity=0.124 Sum_probs=56.4
Q ss_pred cCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCceEE-EEEeEEEEEEC
Q psy3214 18 TDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVS-GYLSQDTLRIG 96 (336)
Q Consensus 18 ~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~-G~~~~D~v~ig 96 (336)
-.|.++..+.+. + ..+.+|||++.+-+..+...+-...+. .......+.=+.|.+. ..+.-|.+.+|
T Consensus 10 g~~~v~~~InG~--~----~~flVDTGAs~t~is~~~A~~Lgl~~~------~~~~~~~~~ta~G~~~~~~~~l~~l~iG 77 (121)
T TIGR02281 10 GHFYATGRVNGR--N----VRFLVDTGATSVALNEEDAQRLGLDLN------RLGYTVTVSTANGQIKAARVTLDRVAIG 77 (121)
T ss_pred CeEEEEEEECCE--E----EEEEEECCCCcEEcCHHHHHHcCCCcc------cCCceEEEEeCCCcEEEEEEEeCEEEEC
Confidence 356777777543 2 256789999988665443211112111 1123344444566643 44688999999
Q ss_pred CeEEeceEEEEEEeeCCcccccccccccccCcc
Q psy3214 97 DLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY 129 (336)
Q Consensus 97 ~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~ 129 (336)
+..++|....++.... ..+|+||+.+
T Consensus 78 ~~~~~nv~~~v~~~~~-------~~~~LLGm~f 103 (121)
T TIGR02281 78 GIVVNDVDAMVAEGGA-------LSESLLGMSF 103 (121)
T ss_pred CEEEeCcEEEEeCCCc-------CCceEcCHHH
Confidence 9999999977653221 1279999975
No 42
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=93.63 E-value=0.11 Score=49.91 Aligned_cols=55 Identities=22% Similarity=0.438 Sum_probs=41.5
Q ss_pred CCcEEEEECC--EEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILGG--KNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~~--~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|+|+|.++| ..+.|++++|++++.- ...|+ ++.....+ ...+||.|+-+.+++|
T Consensus 290 ~P~it~~f~g~g~~~~l~~~ny~~~~~~--~~~Cl-~f~~~~~~--~~~~~IlG~~~~~~~~ 346 (362)
T cd05489 290 VPAIDLVLDGGGVNWTIFGANSMVQVKG--GVACL-AFVDGGSE--PRPAVVIGGHQMEDNL 346 (362)
T ss_pred cceEEEEEeCCCeEEEEcCCceEEEcCC--CcEEE-EEeeCCCC--CCceEEEeeheecceE
Confidence 5999999986 9999999999998753 34694 66544332 2468999998877654
No 43
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=93.53 E-value=0.068 Score=49.46 Aligned_cols=54 Identities=26% Similarity=0.471 Sum_probs=43.0
Q ss_pred CcEEEEECCEEeeeCccCceeEecc--CCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 2 PNVDIILGGKNFTLTPTDYVLKVSI--ASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 2 p~i~~~~~~~~~~i~~~~y~~~~~~--g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
|+|+|.++|..++|++.+|+++... +....|..++.... . ..||+|+.+.++.|
T Consensus 225 p~i~f~f~g~~~~i~~~~~~~~~~~~~~~~~~C~~~i~~~~----~-~~~iLG~~fl~~~y 280 (295)
T cd05474 225 GSLTFNFGGATISVPLSDLVLPASTDDGGDGACYLGIQPST----S-DYNILGDTFLRSAY 280 (295)
T ss_pred CEEEEEECCeEEEEEHHHhEeccccCCCCCCCeEEEEEeCC----C-CcEEeChHHhhcEE
Confidence 7999999999999999999997653 23567987776432 1 68999999988765
No 44
>KOG1339|consensus
Probab=92.65 E-value=0.22 Score=48.50 Aligned_cols=55 Identities=42% Similarity=0.920 Sum_probs=42.7
Q ss_pred CCcEEEEEC-CEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILG-GKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~-~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|+|.|.++ |+.|.+.+.+|++++..+... |...+..+..+ ..||+|+-+.++.+
T Consensus 321 ~P~i~~~f~~g~~~~l~~~~y~~~~~~~~~~-Cl~~~~~~~~~----~~~ilG~~~~~~~~ 376 (398)
T KOG1339|consen 321 LPDITFHFGGGAVFSLPPKNYLVEVSDGGGV-CLAFFNGMDSG----PLWILGDVFQQNYL 376 (398)
T ss_pred CCcEEEEECCCcEEEeCccceEEEECCCCCc-eeeEEecCCCC----ceEEEchHHhCCEE
Confidence 699999999 899999999999998765322 87666544322 78999997766554
No 45
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=91.93 E-value=0.34 Score=44.54 Aligned_cols=56 Identities=20% Similarity=0.471 Sum_probs=41.5
Q ss_pred CCcEEEEECC----EEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILGG----KNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~~----~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|+|+|.+++ +.+.|++.+|+++... ...|+..+....+ +..+.||+|+.+.++.|
T Consensus 197 ~p~i~~~f~~~~~~~~~~l~~~~y~~~~~~--~~~Cl~~~~~~~~--~~~~~~ilG~~~l~~~~ 256 (273)
T cd05475 197 FKPLTLKFGKGWRTRLLEIPPENYLIISEK--GNVCLGILNGSEI--GLGNTNIIGDISMQGLM 256 (273)
T ss_pred cccEEEEECCCCceeEEEeCCCceEEEcCC--CCEEEEEecCCCc--CCCceEEECceEEEeeE
Confidence 5899999987 7999999999987543 3469755543222 23468999999877765
No 46
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=91.42 E-value=0.34 Score=39.13 Aligned_cols=91 Identities=15% Similarity=0.151 Sum_probs=53.6
Q ss_pred ccCceeEeccCCccceecccccCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEE-EEeCCce--EEEEEeEEEE
Q psy3214 17 PTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFA-IRYGSGA--VSGYLSQDTL 93 (336)
Q Consensus 17 ~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~-i~Yg~Gs--~~G~~~~D~v 93 (336)
...+.++..+++.+. ...+|||++-++|..+|..+-...... ...+. ...|.|. ..|....+.+
T Consensus 14 ~~~~~v~~~Ing~~~------~~LvDTGAs~s~Is~~~a~~lgl~~~~-------~~~~~~~~~g~g~~~~~g~~~~~~l 80 (124)
T cd05479 14 VPMLYINVEINGVPV------KAFVDSGAQMTIMSKACAEKCGLMRLI-------DKRFQGIAKGVGTQKILGRIHLAQV 80 (124)
T ss_pred eeEEEEEEEECCEEE------EEEEeCCCceEEeCHHHHHHcCCcccc-------CcceEEEEecCCCcEEEeEEEEEEE
Confidence 344556666654322 457899999998876654321111111 11121 2233332 6787888899
Q ss_pred EECCeEEeceEEEEEEeeCCcccccccccccccCcc
Q psy3214 94 RIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY 129 (336)
Q Consensus 94 ~ig~~~v~~~~fg~~~~~~~~~~~~~~~dGIlGLg~ 129 (336)
.+++...+ ..|.+... ...|+|||+-|
T Consensus 81 ~i~~~~~~-~~~~Vl~~--------~~~d~ILG~d~ 107 (124)
T cd05479 81 KIGNLFLP-CSFTVLED--------DDVDFLIGLDM 107 (124)
T ss_pred EECCEEee-eEEEEECC--------CCcCEEecHHH
Confidence 99998875 66655421 25689999975
No 47
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=91.18 E-value=0.35 Score=44.93 Aligned_cols=55 Identities=31% Similarity=0.546 Sum_probs=39.9
Q ss_pred CCcEEEEEC-CEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILG-GKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~-~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|+|+|+++ +..+.|++.+|+..... ....|+ ++... + +....||+|+.+.++.|
T Consensus 227 ~P~i~f~f~~g~~~~l~~~~y~~~~~~-~~~~C~-~~~~~--~-~~~~~~ilG~~fl~~~~ 282 (299)
T cd05472 227 VPTVSLHFQGGADVELDASGVLYPVDD-SSQVCL-AFAGT--S-DDGGLSIIGNVQQQTFR 282 (299)
T ss_pred cCCEEEEECCCCEEEeCcccEEEEecC-CCCEEE-EEeCC--C-CCCCCEEEchHHccceE
Confidence 599999997 99999999999984332 335686 33321 1 23467999999887765
No 48
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=89.87 E-value=0.48 Score=43.24 Aligned_cols=52 Identities=25% Similarity=0.633 Sum_probs=40.3
Q ss_pred CcEEEEEC-CEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 2 PNVDIILG-GKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 2 p~i~~~~~-~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
|+|+|.++ ++.|.+++.+|+++.. ....|. ++.. ++..+.||+|+.+.++.|
T Consensus 196 P~i~~~f~~~~~~~i~~~~y~~~~~--~~~~C~-~~~~----~~~~~~~ilG~~fl~~~~ 248 (265)
T cd05476 196 PDLTLHFDGGADLELPPENYFVDVG--EGVVCL-AILS----SSSGGVSILGNIQQQNFL 248 (265)
T ss_pred CCEEEEECCCCEEEeCcccEEEECC--CCCEEE-EEec----CCCCCcEEEChhhcccEE
Confidence 89999998 9999999999998543 446796 3332 224578999999988765
No 49
>PF11925 DUF3443: Protein of unknown function (DUF3443); InterPro: IPR021847 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 400 to 434 amino acids in length. This protein has two conserved sequence motifs: NPV and DNNG.
Probab=86.48 E-value=1.2 Score=42.56 Aligned_cols=94 Identities=19% Similarity=0.201 Sum_probs=55.9
Q ss_pred cceecccccCcCCCCCCCcEEEccccccc---eeeeecccCccCCCcEEE--EEeCCceEEEEEeEEEEEECCeEEeceE
Q psy3214 30 SMCLSGFAGMDIPPPAGPIWILGDVFIGK---FYTVFDMDNNQKNGTQFA--IRYGSGAVSGYLSQDTLRIGDLTIKNQV 104 (336)
Q Consensus 30 ~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~---~y~~~~Sst~~~~~~~~~--i~Yg~Gs~~G~~~~D~v~ig~~~v~~~~ 104 (336)
.+|+.-. .+.+||||.-+=|........ ......+ .+.+.. ..|++|..-|.+.+.+|+||+.+..+..
T Consensus 36 s~CqTId-nvlVDTGS~GLRi~~sAl~~~l~~~Lp~~t~-----~g~~laEC~~F~sgytWGsVr~AdV~igge~A~~iP 109 (370)
T PF11925_consen 36 SNCQTID-NVLVDTGSYGLRIFASALPSSLAGSLPQQTG-----GGAPLAECAQFASGYTWGSVRTADVTIGGETASSIP 109 (370)
T ss_pred CCceeeC-cEEEeccchhhhHHHhhhchhhhccCCcccC-----CCcchhhhhhccCcccccceEEEEEEEcCeeccccC
Confidence 4676544 477899998776654422111 1222111 333333 7889999999999999999997544433
Q ss_pred EEEEEee-----CC---------cccccccccccccCcc
Q psy3214 105 FGEAIKE-----PG---------FTFVAAKFDGILGMGY 129 (336)
Q Consensus 105 fg~~~~~-----~~---------~~~~~~~~dGIlGLg~ 129 (336)
+-++... +. .........||||+|.
T Consensus 110 iQvI~D~~~~~~P~sC~~~g~~~~t~~~lgaNGILGIg~ 148 (370)
T PF11925_consen 110 IQVIGDSAAPSVPSSCSNSGASMNTVADLGANGILGIGP 148 (370)
T ss_pred EEEEcCCCCCCCCchhhcCCCCCCCcccccCceEEeecC
Confidence 3333221 10 0112345789999985
No 50
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=85.19 E-value=0.73 Score=43.51 Aligned_cols=50 Identities=24% Similarity=0.376 Sum_probs=38.0
Q ss_pred CcEEEEEC-CEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 2 PNVDIILG-GKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 2 p~i~~~~~-~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
|+|+|.++ |+.+.|++.+|+++... ..|..++. +++..||+|+.+.++.|
T Consensus 258 P~i~~~f~~g~~~~i~p~~y~~~~~~---~~c~~~~~------~~~~~~ILG~~flr~~y 308 (326)
T cd06096 258 PTITIIFENNLKIDWKPSSYLYKKES---FWCKGGEK------SVSNKPILGASFFKNKQ 308 (326)
T ss_pred CcEEEEEcCCcEEEECHHHhccccCC---ceEEEEEe------cCCCceEEChHHhcCcE
Confidence 89999998 89999999999987542 22544432 23468999999888765
No 51
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=83.90 E-value=4.6 Score=30.30 Aligned_cols=63 Identities=14% Similarity=0.045 Sum_probs=37.3
Q ss_pred cCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCce---EEEEEeEEEEEECCeEEeceEEEEEE
Q psy3214 38 GMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGA---VSGYLSQDTLRIGDLTIKNQVFGEAI 109 (336)
Q Consensus 38 ~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs---~~G~~~~D~v~ig~~~v~~~~fg~~~ 109 (336)
.+.+||||+.++|..+.+.+....+ ..+ ....+.=.+|. +.|.+ .+.+++++.+.. ..|-++.
T Consensus 13 ~~lvDTGA~~svis~~~~~~lg~~~-----~~~--~~~~v~~a~G~~~~~~G~~-~~~v~~~~~~~~-~~~~v~~ 78 (91)
T cd05484 13 KFQLDTGSAITVISEKTWRKLGSPP-----LKP--TKKRLRTATGTKLSVLGQI-LVTVKYGGKTKV-LTLYVVK 78 (91)
T ss_pred EEEEcCCcceEEeCHHHHHHhCCCc-----ccc--ccEEEEecCCCEeeEeEEE-EEEEEECCEEEE-EEEEEEE
Confidence 5678999999998765443322222 112 22333334444 56776 899999998744 5555444
No 52
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=82.88 E-value=7.5 Score=28.93 Aligned_cols=72 Identities=18% Similarity=0.092 Sum_probs=39.8
Q ss_pred cCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCceEEEEE-eEE-EEEECCeEEeceEEEEEEeeCCcc
Q psy3214 38 GMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGAVSGYL-SQD-TLRIGDLTIKNQVFGEAIKEPGFT 115 (336)
Q Consensus 38 ~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~~G~~-~~D-~v~ig~~~v~~~~fg~~~~~~~~~ 115 (336)
.+.+|||++.+=+......+. ........+.=..|...-.+ ..+ .+.+|+..+.+ .|......
T Consensus 11 ~fLvDTGA~~tii~~~~a~~~----------~~~~~~~~v~gagG~~~~~v~~~~~~v~vg~~~~~~-~~~v~~~~---- 75 (86)
T cd06095 11 VFLVDTGATHSVLKSDLGPKQ----------ELSTTSVLIRGVSGQSQQPVTTYRTLVDLGGHTVSH-SFLVVPNC---- 75 (86)
T ss_pred EEEEECCCCeEEECHHHhhhc----------cCCCCcEEEEeCCCcccccEEEeeeEEEECCEEEEE-EEEEEcCC----
Confidence 346799999887764432211 11234444443344431111 123 69999999985 45443221
Q ss_pred cccccccccccCcc
Q psy3214 116 FVAAKFDGILGMGY 129 (336)
Q Consensus 116 ~~~~~~dGIlGLg~ 129 (336)
.++|||+.|
T Consensus 76 -----~~~lLG~df 84 (86)
T cd06095 76 -----PDPLLGRDL 84 (86)
T ss_pred -----CCcEechhh
Confidence 279999865
No 53
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=74.80 E-value=2.7 Score=40.23 Aligned_cols=55 Identities=24% Similarity=0.422 Sum_probs=37.7
Q ss_pred CCcEEEEECC------EEeeeCccCceeEecc-CCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILGG------KNFTLTPTDYVLKVSI-ASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~~------~~~~i~~~~y~~~~~~-g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|+|.|.++| ..+.|++.+|+..... +....|+. +.+. .+...||+|+.+.++.|
T Consensus 269 ~P~i~~~f~g~~~~~~~~l~l~p~~Y~~~~~~~~~~~~C~~----~~~~-~~~~~~ILG~~flr~~y 330 (364)
T cd05473 269 FPKISIYLRDENSSQSFRITILPQLYLRPVEDHGTQLDCYK----FAIS-QSTNGTVIGAVIMEGFY 330 (364)
T ss_pred CCcEEEEEccCCCCceEEEEECHHHhhhhhccCCCcceeeE----Eeee-cCCCceEEeeeeEcceE
Confidence 6999999975 3679999999986532 22356863 1121 23357999998887765
No 54
>PLN03146 aspartyl protease family protein; Provisional
Probab=74.75 E-value=4.5 Score=39.91 Aligned_cols=51 Identities=24% Similarity=0.442 Sum_probs=37.7
Q ss_pred CCcEEEEECCEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccccccee
Q psy3214 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60 (336)
Q Consensus 1 ~p~i~~~~~~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~~~~~y 60 (336)
+|+|+|.++|..+.|++.+|+++..- ...|+ ++.. +.+.||+|+-+.++.|
T Consensus 361 ~P~i~~~F~Ga~~~l~~~~~~~~~~~--~~~Cl-~~~~------~~~~~IlG~~~q~~~~ 411 (431)
T PLN03146 361 LPIITAHFTGADVKLQPLNTFVKVSE--DLVCF-AMIP------TSSIAIFGNLAQMNFL 411 (431)
T ss_pred CCeEEEEECCCeeecCcceeEEEcCC--CcEEE-EEec------CCCceEECeeeEeeEE
Confidence 59999999999999999999997643 34686 2221 1346999997766543
No 55
>PF09676 TraV: Type IV conjugative transfer system lipoprotein (TraV); InterPro: IPR014118 This entry represents TraV, a component of a conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK [, , ]. This protein contains three conserved cysteines in the N-terminal half.
Probab=62.96 E-value=7.4 Score=30.94 Aligned_cols=32 Identities=25% Similarity=0.515 Sum_probs=27.7
Q ss_pred eeeEEEEEEEceecCCCCCcccccccccceEEeee
Q psy3214 298 TKTGIMRIFLGAKNDIEGNELPLSAQRSLFIELDK 332 (336)
Q Consensus 298 ~~~~~vrifl~Pk~d~~g~~~~~~~~r~~~ielD~ 332 (336)
....|+||.++|=.|+.|. |.+.|..++|+|.
T Consensus 85 ~~~~v~RiwiaP~~D~~g~---l~~~~~Vy~~v~~ 116 (119)
T PF09676_consen 85 TPPRVMRIWIAPWEDADGD---LHDPGYVYFVVDP 116 (119)
T ss_pred cCCceEEEEEeeeECCCCC---EeccceEEEEecC
Confidence 4557899999999999987 8889999999874
No 56
>PF14541 TAXi_C: Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=52.21 E-value=21 Score=29.81 Aligned_cols=50 Identities=24% Similarity=0.592 Sum_probs=34.2
Q ss_pred CCcEEEEEC-CEEeeeCccCceeEeccCCccceecccccCcCCCCCCCcEEEcccc
Q psy3214 1 MPNVDIILG-GKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVF 55 (336)
Q Consensus 1 ~p~i~~~~~-~~~~~i~~~~y~~~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~c~ 55 (336)
+|+|+|.+. |..+.|++.+|.++.+ ....|+ ++... +.+.....|.|.-.
T Consensus 92 ~P~i~l~F~~ga~l~l~~~~y~~~~~--~~~~Cl-a~~~~--~~~~~~~~viG~~~ 142 (161)
T PF14541_consen 92 FPTITLHFEGGADLTLPPENYFVQVS--PGVFCL-AFVPS--DADDDGVSVIGNFQ 142 (161)
T ss_dssp S--EEEEETTSEEEEE-HHHHEEEEC--TTEEEE-SEEEE--TSTTSSSEEE-HHH
T ss_pred CCeEEEEEeCCcceeeeccceeeecc--CCCEEE-EEEcc--CCCCCCcEEECHHH
Confidence 699999999 8999999999999887 336786 22222 34677888988754
No 57
>COG2967 ApaG Uncharacterized protein affecting Mg2+/Co2+ transport [Inorganic ion transport and metabolism]
Probab=49.80 E-value=31 Score=27.58 Aligned_cols=32 Identities=28% Similarity=0.346 Sum_probs=26.5
Q ss_pred CcEEEEEEecCCCceeeEEEEEEEceecCCCCCc
Q psy3214 284 DFKYEITIENNSNETKTGIMRIFLGAKNDIEGNE 317 (336)
Q Consensus 284 pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g~~ 317 (336)
-|.|+|+|.|+++...+-+-|-++ --|.+|+.
T Consensus 29 vfaYtitI~N~g~~~vqLlsR~W~--ITd~~g~v 60 (126)
T COG2967 29 VFAYTVTIRNLGEVPVQLLSRYWL--ITDGNGRV 60 (126)
T ss_pred EEEEEEEEecCCCccceeeeeEEE--EecCCCcE
Confidence 378999999999999999999988 44666663
No 58
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=41.68 E-value=57 Score=24.48 Aligned_cols=18 Identities=11% Similarity=-0.104 Sum_probs=13.2
Q ss_pred cCcCCCCCCCcEEEcccc
Q psy3214 38 GMDIPPPAGPIWILGDVF 55 (336)
Q Consensus 38 ~~~~dtGS~~~Wv~~~c~ 55 (336)
.+.+||||+.+-|...++
T Consensus 18 ~~LlDTGA~vsiI~~~~~ 35 (100)
T PF00077_consen 18 KALLDTGADVSIISEKDW 35 (100)
T ss_dssp EEEEETTBSSEEESSGGS
T ss_pred EEEEecCCCcceeccccc
Confidence 456899998887775544
No 59
>PF03531 SSrecog: Structure-specific recognition protein (SSRP1); InterPro: IPR000969 Human structure-specific recognition protein, SSRP1, [] binds specifically to DNA modified with the anti-cancer drug cisplatin. An 81kDa protein is predicted, containing several highly-charged domains and a stretch of 75 residues that share 47% identity with a portion of the high mobility group (HMG) protein HMG1. This HMG box probably constitutes the structure recognition element for cisplatin-modified DNA, the probable recognition motif being the local duplex unwinding and bending that occurs on formation of intra-strand cross-links []. SSRP1 is the human homologue of a recently identified mouse protein that binds to recombination signal sequences []. These sequences have been postulated to form stem-loop structures, further implicating local bends and unwinding in DNA as a recognition target for HMG-box proteins. A Drosophila melanogaster cDNA encoding an HMG-box-containing protein has also been isolated [, ]. This protein shares 50% sequence identity with human SSRP1. In vitro binding studies using Drosophila SSRP showed that the protein binds to single-stranded DNA and RNA, with highest affinity for nucleotides G and U. Comparison of the predicted amino acid sequences among SSRP family members reveals 48% identity, with structural conservation in the C terminus of the HMG box, as well as domains of highly charged residues. The most highly conserved regions lie in the poorly understood N terminus, suggesting that this portion of the protein is critical for its function []. This entry contains Pob3 Q04636 from SWISSPROT, which is a subunit of the heterodimeric yeast FACT complex (Spt16p (IPR013953 from INTERPRO)-Pob3p) []. The FACT complex facilitates RNA Polymerase II transcription elongation through nucleosomes by destabilising and then reassembling nucleosome structure [, ].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 3F5R_A 2GCL_B 2GCJ_D.
Probab=36.74 E-value=35 Score=30.50 Aligned_cols=31 Identities=26% Similarity=0.420 Sum_probs=25.0
Q ss_pred ccccCCCCcEEEEEEecCCCceeeEEEEEEEceecCCCC
Q psy3214 277 PYHIQHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEG 315 (336)
Q Consensus 277 ~~~L~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g 315 (336)
.-||..|-|.|+|.- + .+.|+||=||-|+..
T Consensus 159 ~lrl~GktyDykI~y--~------~I~rlflLpk~d~~~ 189 (222)
T PF03531_consen 159 FLRLHGKTYDYKIQY--S------SISRLFLLPKPDDRH 189 (222)
T ss_dssp EEEEEESSBEEEEEG--G------GEEEEEEEE-TTSSE
T ss_pred ccccccccccccccc--c------ccccccccccccccc
Confidence 479999999999977 3 479999999998653
No 60
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=32.80 E-value=81 Score=24.49 Aligned_cols=69 Identities=12% Similarity=0.028 Sum_probs=35.7
Q ss_pred EeccCCccceecccccCcCCCCCCCcEEEcc-ccccceeeeecccCccCCCcEEEEEeCCce-EEEEEeEEEEEECCeEE
Q psy3214 23 KVSIASSSMCLSGFAGMDIPPPAGPIWILGD-VFIGKFYTVFDMDNNQKNGTQFAIRYGSGA-VSGYLSQDTLRIGDLTI 100 (336)
Q Consensus 23 ~~~~g~~~~c~~~~~~~~~dtGS~~~Wv~~~-c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs-~~G~~~~D~v~ig~~~v 100 (336)
++.+.+|.+.-+......+|||.+..-+... -..+-...+.. ...+.=++|+ +.-....++|.++|...
T Consensus 3 ~v~~~~p~~~~~~~v~~LVDTGat~~~~l~~~~a~~lgl~~~~---------~~~~~tA~G~~~~~~v~~~~v~igg~~~ 73 (107)
T TIGR03698 3 DVELSNPKNPEFMEVRALVDTGFSGFLLVPPDIVNKLGLPELD---------QRRVYLADGREVLTDVAKASIIINGLEI 73 (107)
T ss_pred EEEEeCCCCCCceEEEEEEECCCCeEEecCHHHHHHcCCCccc---------CcEEEecCCcEEEEEEEEEEEEECCEEE
Confidence 3444444333333456778999974432322 11111121111 1233335665 55667899999999765
No 61
>TIGR02747 TraV type IV conjugative transfer system lipoprotein TraV. The TraV protein is a component of conjugative type IV secretion systems. TraV is an outer membrane lipoprotein and is believed to interact with the secretin TraK. The alignment contains three conserved cysteines in the N-terminal half.
Probab=31.74 E-value=48 Score=27.53 Aligned_cols=33 Identities=15% Similarity=0.340 Sum_probs=27.6
Q ss_pred ceeeEEEEEEEceecCCCCCcccccccccceEEeee
Q psy3214 297 ETKTGIMRIFLGAKNDIEGNELPLSAQRSLFIELDK 332 (336)
Q Consensus 297 ~~~~~~vrifl~Pk~d~~g~~~~~~~~r~~~ielD~ 332 (336)
..+..|.||.++|-.|..|. +.+.+.-++++|.
T Consensus 109 R~~~~V~RiwIaPy~D~dg~---lh~~~~VY~vv~~ 141 (144)
T TIGR02747 109 RTQERVAKLWIAPYIDPEGN---YHDPGDVYSIIKP 141 (144)
T ss_pred eccccEEEEEEEeeECCCCC---EeccceEEEEecC
Confidence 36678899999999999887 7788888888874
No 62
>KOG0526|consensus
Probab=31.74 E-value=24 Score=35.36 Aligned_cols=32 Identities=25% Similarity=0.406 Sum_probs=27.0
Q ss_pred ccccCCCCcEEEEEEecCCCceeeEEEEEEEceecCCCCC
Q psy3214 277 PYHIQHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGN 316 (336)
Q Consensus 277 ~~~L~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g~ 316 (336)
.-||..+-|.|+|.. +.+.|+||-||-|.+.+
T Consensus 224 ~lrL~GkTyDyKI~y--------~SI~rLflLPk~d~rh~ 255 (615)
T KOG0526|consen 224 FLRLHGKTYDYKIPY--------KSINRLFLLPKKDQRHV 255 (615)
T ss_pred hhhhcccccceecch--------hheeeeEeccCCCCceE
Confidence 479999999999976 35799999999997643
No 63
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=31.22 E-value=1.1e+02 Score=24.99 Aligned_cols=78 Identities=17% Similarity=0.085 Sum_probs=40.9
Q ss_pred ccCcCCCCCCCcEEEccccccceeeeecccCccCCCcEEEEEeCCceE--EEEEeEEEEEECCeEEeceEEEEEEeeCCc
Q psy3214 37 AGMDIPPPAGPIWILGDVFIGKFYTVFDMDNNQKNGTQFAIRYGSGAV--SGYLSQDTLRIGDLTIKNQVFGEAIKEPGF 114 (336)
Q Consensus 37 ~~~~~dtGS~~~Wv~~~c~~~~~y~~~~Sst~~~~~~~~~i~Yg~Gs~--~G~~~~D~v~ig~~~v~~~~fg~~~~~~~~ 114 (336)
..+.+|||++...|-.++..+...... +...++.+.-..|.. .+....=.+.++|.... ..|-+..
T Consensus 33 ~~vLiDSGAThsFIs~~~a~~~~l~~~------~l~~~~~V~~~g~~~~~~~~~~~~~~~i~g~~~~-~dl~vl~----- 100 (135)
T PF08284_consen 33 ASVLIDSGATHSFISSSFAKKLGLPLE------PLPRPIVVSAPGGSINCEGVCPDVPLSIQGHEFV-VDLLVLD----- 100 (135)
T ss_pred EEEEEecCCCcEEccHHHHHhcCCEEE------EccCeeEEecccccccccceeeeEEEEECCeEEE-eeeEEec-----
Confidence 346789999998887655333222221 123355554333332 22233334566665543 3332221
Q ss_pred ccccccccccccCcc
Q psy3214 115 TFVAAKFDGILGMGY 129 (336)
Q Consensus 115 ~~~~~~~dGIlGLg~ 129 (336)
...+|-|||+-|
T Consensus 101 ---l~~~DvILGm~W 112 (135)
T PF08284_consen 101 ---LGGYDVILGMDW 112 (135)
T ss_pred ---ccceeeEeccch
Confidence 234689999975
No 64
>PRK05461 apaG CO2+/MG2+ efflux protein ApaG; Reviewed
Probab=30.43 E-value=59 Score=26.39 Aligned_cols=36 Identities=31% Similarity=0.328 Sum_probs=26.6
Q ss_pred ccCCCCcEEEEEEecCCCceeeEEEEEEEceecCCCCC
Q psy3214 279 HIQHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGN 316 (336)
Q Consensus 279 ~L~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g~ 316 (336)
..+|--|.|+|+|+|.++...+-.=|=++ -.|..|+
T Consensus 25 ~~~~y~f~Y~ItI~N~~~~~vQL~~R~W~--I~d~~g~ 60 (127)
T PRK05461 25 EEGRYVFAYTITIENLGRVPVQLLSRHWL--ITDANGR 60 (127)
T ss_pred cCCEEEEEEEEEEEECCCCCEEEEeeeEE--EEECCCC
Confidence 45566799999999999887777777666 3355554
No 65
>PF09275 Pertus-S4-tox: Pertussis toxin S4 subunit; InterPro: IPR015355 Members of this family of Bordetella pertussis toxins adopt a structure consisting of an OB fold, with a closed or partly opened beta-barrel in a Greek-key topology []. ; PDB: 1PTO_D 1PRT_K 1BCP_D.
Probab=28.23 E-value=36 Score=25.19 Aligned_cols=31 Identities=23% Similarity=0.470 Sum_probs=18.8
Q ss_pred CCCCcEEEEEEecCCCc----------eeeEEEEEEEceec
Q psy3214 281 QHEDFKYEITIENNSNE----------TKTGIMRIFLGAKN 311 (336)
Q Consensus 281 ~h~pF~y~i~v~n~s~~----------~~~~~vrifl~Pk~ 311 (336)
.|-||-|--+....+.. -+..-+|+|||||.
T Consensus 46 ahvpfcfgkdlkrpgsspmevmlravfmqqrplrmflgpkq 86 (110)
T PF09275_consen 46 AHVPFCFGKDLKRPGSSPMEVMLRAVFMQQRPLRMFLGPKQ 86 (110)
T ss_dssp GSEEEEEEEETTS-S--HHHHHHHHHHHHT-EEEEEEEEEE
T ss_pred CcccceeccccCCCCCCcHHHHHHHHHHhhcchhhhcCcce
Confidence 47778777666332221 13456899999996
No 66
>PF04076 BOF: Bacterial OB fold (BOF) protein; InterPro: IPR005220 Proteins in this entry have an OB-fold fold (oligonucleotide/oligosaccharide binding motif). Analysis of the predicted nucleotide-binding site of the OB-fold suggests that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localise to the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesised for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the OB-fold ligand-binding site may be associated with the presence of members in mobile genetic elements and their potential role in bacterial pathogenicity [].; PDB: 1NNX_A.
Probab=27.07 E-value=71 Score=24.89 Aligned_cols=48 Identities=17% Similarity=0.291 Sum_probs=32.0
Q ss_pred ccccccCCCCcEEEEEEecCCCceeeEEEEEEEceecCCCCCcccccccccceEEeee
Q psy3214 275 NIPYHIQHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGNELPLSAQRSLFIELDK 332 (336)
Q Consensus 275 ~~~~~L~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g~~~~~~~~r~~~ielD~ 332 (336)
.|..||.++-|.|+ ...+.++|-+-++. ++|+.++.++.=++.=|+||
T Consensus 42 ~Iv~~l~~d~Y~F~---------D~TG~I~VeId~~~-w~g~~vt~~~~Vri~GeVDk 89 (103)
T PF04076_consen 42 NIVKQLGDDKYLFR---------DATGEIEVEIDDDV-WRGQTVTPDDKVRISGEVDK 89 (103)
T ss_dssp EEEEEEETTEEEEE---------ETTEEEEEE--GGG-STT----TTSEEEEEEEEEE
T ss_pred EEEEEecCCEEEEE---------CCCCcEEEEEChhh-cCCcccCCCCEEEEEEEEeC
Confidence 35789999977776 23468999997774 78999999887777778885
No 67
>PF02747 PCNA_C: Proliferating cell nuclear antigen, C-terminal domain; InterPro: IPR022649 Proliferating cell nuclear antigen (PCNA), or cyclin, is a non-histone acidic nuclear protein [] that plays a key role in the control of eukaryotic DNA replication []. It acts as a co-factor for DNA polymerase delta, which is responsible for leading strand DNA replication []. The sequence of PCNA is well conserved between plants and animals, indicating a strong selective pressure for structure conservation, and suggesting that this type of DNA replication mechanism is conserved throughout eukaryotes []. In Saccharomyces cerevisiae (Baker's yeast), POL30, is associated with polymerase III, the yeast analog of polymerase delta. Homologues of PCNA have also been identified in the archaea (Euryarchaeota and Crenarchaeota) and in Paramecium bursaria Chlorella virus 1 (PBCV-1) and in nuclear polyhedrosis viruses. ; GO: 0003677 DNA binding, 0030337 DNA polymerase processivity factor activity, 0006275 regulation of DNA replication, 0043626 PCNA complex; PDB: 1IZ5_A 1IZ4_A 1GE8_A 1ISQ_A 3A2F_B 1RWZ_A 3P83_A 1RXM_A 1RXZ_A 1SXJ_F ....
Probab=25.03 E-value=65 Score=25.85 Aligned_cols=24 Identities=21% Similarity=0.493 Sum_probs=16.8
Q ss_pred CCCCcEEEEEEecCCCceeeEEEEEEEcee
Q psy3214 281 QHEDFKYEITIENNSNETKTGIMRIFLGAK 310 (336)
Q Consensus 281 ~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk 310 (336)
+..|-..+.++ .+ .|.++.|||||
T Consensus 105 ~~~Pl~l~f~~--~~----~g~l~f~LAPK 128 (128)
T PF02747_consen 105 EDMPLKLEFEL--AD----GGSLKFYLAPK 128 (128)
T ss_dssp TTSEEEEEEEE--TT----TEEEEEEE-BB
T ss_pred CCCCEEEEEEe--CC----CeEEEEEEcCC
Confidence 45677777777 32 37899999998
No 68
>PF04379 DUF525: Protein of unknown function (DUF525); InterPro: IPR007474 This domain is found in the bacterial protein ApaG and at the C termini of some F-box proteins (IPR001810 from INTERPRO). F-box proteins contain a carboxy-terminal domain that interacts with protein substrates []. The ApaG domain is ~125 amino acids in length, and is named after the bacterial ApaG protein, of which it forms the core. The Salmonella typhimurium ApaG domain protein, CorD, is involved in Co(2+) resistance and Mg(2+) efflux. Tertiary structures from different ApaG proteins show a fold of several beta-sheets. The ApaG domain may be involved in protein-protein interactions which could be implicated in substrate-specificity [, , ].; PDB: 2F1E_A 1XVS_A 1TZA_A 1XQ4_D.
Probab=24.64 E-value=1.3e+02 Score=22.71 Aligned_cols=35 Identities=31% Similarity=0.491 Sum_probs=20.7
Q ss_pred cCCCCcEEEEEEecCCCceeeEEEEEEEceecCCCCC
Q psy3214 280 IQHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGN 316 (336)
Q Consensus 280 L~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g~ 316 (336)
-++.=|.|+|+|+|.+....+-.=|=++ -.|..|+
T Consensus 9 ~~~y~f~Y~I~I~N~~~~~vqL~sR~W~--I~d~~g~ 43 (90)
T PF04379_consen 9 QNRYVFAYRIRIENHSDESVQLLSRHWI--ITDADGH 43 (90)
T ss_dssp GTBEEEEEEEEEEE-SSS-EEEEEEEEE--EEETTS-
T ss_pred CCeEEEEEEEEEEECCCCCEEEEccEEE--EEeCCCC
Confidence 3445589999999988776555555554 2344443
No 69
>TIGR00156 conserved hypothetical protein TIGR00156. As of the last revision, this family consists only of two proteins from Escherichia coli and one from the related species Haemophilus influenzae.
Probab=23.17 E-value=71 Score=25.91 Aligned_cols=49 Identities=24% Similarity=0.335 Sum_probs=35.5
Q ss_pred cccccccCCCCcEEEEEEecCCCceeeEEEEEEEceecCCCCCcccccccccceEEeee
Q psy3214 274 YNIPYHIQHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGNELPLSAQRSLFIELDK 332 (336)
Q Consensus 274 ~~~~~~L~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g~~~~~~~~r~~~ielD~ 332 (336)
..|.+||.|+-|.|+ ...+.++|=+-++ .++|++++.++.=+..=|+||
T Consensus 64 G~Iv~~l~~d~Y~F~---------D~TG~I~VeId~~-~w~G~~v~p~d~V~I~GeVDk 112 (126)
T TIGR00156 64 GNIISHIGDDRYVFR---------DKSGEINVVIPAA-VWNGREVQPKDMVNISGSLDK 112 (126)
T ss_pred EEEEEEeCCceEEEE---------CCCCCEEEEECHH-HcCCCcCCCCCEEEEEEEECC
Confidence 345799999977776 2234577777444 479999998888777778776
No 70
>PF05164 ZapA: Cell division protein ZapA; InterPro: IPR007838 This entry a structural domain found in the cell division protein ZapA, as well as in related proteins. This domain has a core structure consisting of two layers alpha/beta, and has a long C-terminal helix that forms dimeric parallel and tetrameric antiparallel coiled coils []. ZapA interacts with FtsZ, where FtsZ is part of a mid-cell cytokinetic structure termed the Z-ring that recruits a hierarchy of fission related proteins early in the bacterial cell cycle. ZapA drives the polymerisation and filament bundling of FtsZ, thereby contributing to the spatio-temporal tuning of the Z-ring.; PDB: 1T3U_B 1W2E_B 3HNW_A.
Probab=22.81 E-value=90 Score=22.88 Aligned_cols=14 Identities=14% Similarity=0.560 Sum_probs=11.9
Q ss_pred cEEEEECCEEeeeC
Q psy3214 3 NVDIILGGKNFTLT 16 (336)
Q Consensus 3 ~i~~~~~~~~~~i~ 16 (336)
.|+++|+|++|+|.
T Consensus 1 ~V~v~I~G~~y~i~ 14 (89)
T PF05164_consen 1 QVKVTILGREYRIK 14 (89)
T ss_dssp -EEEEETTEEEEEC
T ss_pred CeEEEECCEEEEee
Confidence 37899999999998
No 71
>PRK10053 hypothetical protein; Provisional
Probab=20.38 E-value=88 Score=25.53 Aligned_cols=49 Identities=18% Similarity=0.325 Sum_probs=36.0
Q ss_pred cccccccCCCCcEEEEEEecCCCceeeEEEEEEEceecCCCCCcccccccccceEEeee
Q psy3214 274 YNIPYHIQHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGNELPLSAQRSLFIELDK 332 (336)
Q Consensus 274 ~~~~~~L~h~pF~y~i~v~n~s~~~~~~~vrifl~Pk~d~~g~~~~~~~~r~~~ielD~ 332 (336)
..|.+||.|+-|.|+ + ..+.++|=+ +.-.++|++++.+++=+++=|+||
T Consensus 68 G~Iv~~lg~d~Y~F~-----D----~tG~I~VeI-D~~~w~G~~v~p~~kV~I~GevDk 116 (130)
T PRK10053 68 GNLIDHKGDDRYVFR-----D----KSGEINVII-PAAVFDGREVQPDQMININGSLDK 116 (130)
T ss_pred EEEEEEeCCceEEEE-----C----CCCcEEEEe-CHHHcCCCcCCCCCEEEEEEEECC
Confidence 345799999987776 1 233466656 555689999999888888888886
No 72
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=20.01 E-value=1.3e+02 Score=20.06 Aligned_cols=25 Identities=28% Similarity=0.454 Sum_probs=16.7
Q ss_pred CCcEEEEEEecCCCce-eeEEEEEEE
Q psy3214 283 EDFKYEITIENNSNET-KTGIMRIFL 307 (336)
Q Consensus 283 ~pF~y~i~v~n~s~~~-~~~~vrifl 307 (336)
+..+|+|+|.|.+... ...+|+=-|
T Consensus 12 d~v~Yti~v~N~g~~~a~~v~v~D~l 37 (53)
T TIGR01451 12 DTITYTITVTNNGNVPATNVVVTDIL 37 (53)
T ss_pred CEEEEEEEEEECCCCceEeEEEEEcC
Confidence 3458999999998653 444555444
Done!